Hypothesis

6,880 Matching Annotations

Aug 2023
www.biorxiv.org

New submission 24/04/2023, 10:12:02

1. Public_Reviews 31 Aug 2023
  
  in eLife
  
  Author Response:
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  In their study, Aman et al. utilized single cell transcriptome analysis to investigate wild-type and mutant zebrafish skin tissues during the post-embryonic growth period. They identified new epidermal cell types, such as ameloblasts, and shed light on the effects of TH on skin morphogenesis. Additionally, they revealed the important role of the hypodermis in supporting pigment cells and adult stripe formation. Overall, I find their figures to be of high quality, their analyses to be appropriate and compelling, and their major claims to be well supported by additional experiments. Therefore, this study will be an important contribution to the field of vertebrate skin research. Although I have no major concerns, I would like to offer a few minor comments for the authors to consider.
  
  1) The discovery of ameloblasts in the zebrafish skin is a fascinating finding that could potentially provide a new research model for understanding the development and regeneration of vertebrate teeth. It would be beneficial if the authors could provide further elaboration on this aspect and discuss how the zebrafish scale model could be utilized by researchers to better understand the morphogenesis of vertebrate teeth and/or hair.
  
  We have provided additional discussion points regarding epidermal EMP+ cells with ameloblast-like transcriptional profiles. We believe that further studies of scale matrix composition and the material properties endowed by various collagenous and non-collagen matrix proteins will be useful for understanding fundamental mechanisms of biomineralization. This section of the discussion now reads:
  
  “We systematically assessed the expression of genes encoding non-collagen calcified matrix proteins throughout the skin during squamation, leading to the discovery of a transcriptionally distinct population of basal epidermal cells that express EMP transcripts, likely corresponding to epidermal secretory cells proposed to participate in scale matrix formation based on ultrastructure (Sire et al., 1997). These cells also express dlx3a, dlx4a, runx2b and msx2a but not sp7, a transcription factor suite that is shared with ameloblasts that form tooth enamel. While these transcription factors are not exclusive to ameloblasts and have been reported in osteoblasts and odontoblasts, in addition to cell types that do not produce calcified matrix, such as neurons, their co-expression along with EMP-encoding transcripts in basal epidermal cells is consistent with a common origin of ameloblasts and the EMP+ epidermal cells reported here. One alternative hypothesis is that co-expression of these gene products arose convergently and can be explained by mechanistic linkages among them. Future work aimed at functionally dissecting the regulatory mechanisms that govern EMP gene expression in a variety of organisms may clarify these issues either by providing evidence of additional commonalities, supporting a shared ancestor, or by revealing diverse, lineage-specific regulatory architectures, supporting convergent evolution of superficial enamel deposition in teeth and fish skin appendages.”
  
  2) While the overexpression-rescue experiments (i.e., fgf20a and pdgfaa) provide crucial evidence to support the authors' conclusions, it is important to note that overexpression driven by the heat-shock promoter is not spatially regulated. Therefore, it should be acknowledged that the rescue effects may not be cell-autonomous, as suggested in the current version.
  
  The reviewer is correct that the hsp70l promoter is not spatially regulated and that F0 transgenics have random mosaic expression. Importantly, since we were testing specific hypotheses regarding signaling interactions between basal epidermal cells and dermal cells, we applied stringent selection and only analyzed individuals with transgene expression in basal epidermal cells. This approach enabled us to assay the results of basal cell expression of signaling ligands in eda mutant and hypothyroid backgrounds. The original manuscript omitted this crucial aspect of our experimental design, and we thank the reviewer for noticing this omission. We have revised the following parts of the results section.
  
  “Indeed, heatshock-driven expression in F0 mosaics stringently selected for basal epidermal expression of Fgf20a in the skin of Eda mutants led to localized rescue of scales where transgene expression was detectable (Figure 5D).”
  
  “When we forced expression of Pdgfaa in basal cells of epidermis by heatshock induction and stringent selection of basal epidermal expression in F0 mosaics, we found, as predicted, a recruitment of dermal cells in hypoTH skin, leading to a locally stratified dermis (Figure 6E) similar to that of the wild-type (Figure 4C).”
  
  We additionally revised the legends for Figure 5 and Figure 6 to mention stringent selection of basal epidermal expression of fgf20a and pdgfaa, respectively.
  
  3) Figure 7D. The authors used the ET37:EGFP lines to visualize hypodermis. Based on the absence of EGFP signal in the deep dermis of bnc2 mutants, the authors concluded that the hypodermis may be missing, suggesting the importance of the hypodermis in pigment cell formation. However, since the EGFP evidence is indirect, it is crucial to confirm the absence of the hypodermis structure with histology.
  
  It is indeed conceivable that hypodermal cells physically persist in bnc2 mutants yet have sufficiently altered gene expression that they neither cluster with wild-type hypodermal cells in single cell RNA-seq analyses nor initiate or maintain the broadly expressed dermal reporter ET37:GFP that we used to assess the presence or absence of such cells in a defined anatomical position. Though we believe this to be somewhat unlikely (hence our original interpretation), we have added a caveat referencing this formal possibility in the revised manuscript:
  
  “It is possible that hypodermal cells physically persist in bnc2 mutants but have sufficiently altered transcriptional profiles such that they no longer cluster together with wild-type hypodermal cells or express the ET37:EGFP transgene. Nevertheless, these analyses suggest that ET37:EGFP+ hypodermal cells likely play a role in pigment pattern development.”
  
  We believe this issue raises interesting philosophical questions about the definition of a “cell-type.” If cells constituting the deep surface of the dermis physically persist, but have a profoundly altered transcriptional profile and no longer perform the biological functions of their wild-type counterparts, are they still the original cell type, or was the wild-type cell type lost? As researchers continue to discover new cell types and deepen our understanding of cell-state plasticity in normal and pathological conditions, the community will need to articulate new rubrics of categorization to ensure that “cell-type” remains a rigorous and useful concept (if, indeed, it has been one).
  
  4) As the dataset is expected to be a valuable asset to the field, please provide Excel tables summarizing the key genes and their corresponding expression levels for each major cluster that has been identified.
  
  This table has been provided in the revised manuscript (Supplementary file 2 – Table 5.)
  
  Reviewer #2 (Public Review):
  
  The authors used single cell transcriptome analysis of zebrafish skin cells and characterized various types of cells that are involved in scale formation and stripe patterning. The methods employed in this study are highly powerful for providing mechanistic explanations of these fundamental biological issues and will be a good example for many researchers studying other biological issues. Furthermore, the results characterizing differences in gene expression patterns among various types of cells will be informative for other researchers in the field.
  
  For scale formation, it is known that mineralized tissues may significantly differ in rayfins and lobefins since sox9, col2a1, and col10a1 are all expressed in osteoblasts, in addition to chondrocytes, in zebrafish and gar (Eames et al., 2012, BMC Evol. Biol.). Furthermore, in mammals, Col10 is expressed in chondrocytes in mature cartilage that undergoes ossification. Thus, unlike the authors argue, col10a1 expression is not apparently relevant to the elasticity of scales.
  
  The authors also state that the expression of dlx4a, msx2a, and runx2b characterize cells homologous to mammalian ameloblasts. However, dlx4, runx2, and msx2 are all duplicated in zebrafish, and the function of duplicated genes in teleost fishes may differ from that of the single ancestral gene. Moreover, none of Dlx4, Msx2, and Runx2 is expressed specifically by ameloblasts in mammals. Indeed, both Msx2 and Runx2 are expressed in osteoblasts, while the expression of Dlx4 in ameloblasts is not reported. These results, together with the expression of an enamel gene, enam, in dermal cells (SFC), do not appear to support the homology of the surface tissue of mammalian teeth and zebrafish scales.
  
  We appreciate the reviewer’s comments and have provided caveats to our interpretation in the revised manuscript (see our response to Reviewer #1, item 1, above). In the revised manuscript, we also display results for an additional Dlx gene, dlx3b, that is coexpressed in EMP+ basal epidermal cells (Figure 3C), although dlx4 has been reported in mammalian tooth germs and elasmobranch tooth and odontode epithelia (Pemberton et al., 2007; Debiais-Thibaud et al., 2011; Woodruff et al., 2022).
  
  More generally, expression of specific genes can be useful characters for testing hypotheses of homology. The operant inference depends on a parsimony assumption: if a transcriptional profile is shared between cell types in disparate organisms, one explanation is that this transcriptional profile was inherited from a common ancestor. This inference is not impacted by the teleost whole genome duplication. If the common ancestor had one ortholog and a subset of modern animals have two, the homology hypothesis predicts that at least one ortholog will be expressed in common in the tissue that descended from the common ancestor. This interpretation is entirely compatible with our understanding of the mechanisms that underlie retention of duplicated genes in animal genomes. Additionally, exclusivity is not necessarily predicted by homology hypotheses. Indeed, all the transcription factors used here as characters for evaluating homology have pleiotropic roles in many cell types.
  
  In this specific case, we found two EMP genes, ambn and enam, co-expressed with a complement of transcription factors that is also co-expressed in ameloblasts. These findings are consistent with a model in which both ameloblasts and EMP+ epidermal cells associated with zebrafish scales inherited this transcriptional profile from a common ancestral cell type. Given the temporal and phylogenetic continuity of superficial enameling in the fossil record of skin appendages, and the dual origin of mineralized matrices in extant skin appendages and teeth, we continue to favor the model where these traits are shared and conserved among vertebrates. Nevertheless, we have acknowledged in the revised manuscript the limitations of homology testing by analyses of gene expression and the possibility that these traits might have evolved convergently; we suggest additional research avenues for testing this hypothesis further (response to Reviewer #1, item 1, above).
  
  Reviewer #3 (Public Review):
  
  This work describes transcriptome profiling of dissected skin of zebrafish at post-embryonic stages, at a time when adult structures and patterns are forming. The authors have used the state-of-the-art combinatorial indexing RNA-seq approach to generate single cell (nucleus) resolution. The data appears robust and is coherent across the four different genotypes used by the authors.
  
  The authors present the data in a logical and accessible manner, with appropriate reference to the anatomy. They include helpful images of the biology and schematics to illustrate their interpretations.
  
  The datasets are then interrogated to define cell and signalling relationships between skin compartments in six diverse contexts. The hypotheses generated from the datasets are then tested experimentally. Overall, the experiments are appropriate and rigorously performed. They ask very interesting questions of interactions in the skin and identify novel and specific mechanisms. They validate these well.
  
  The authors use their datasets to define lineage relationships in the dermal scales and also in the epidermis. They show that circumferential pre-scale forming cells are precursors of focal scale forming cells while there appeared a more discontinuous relationship between lineages in the epidermis.
  
  The authors present transcriptome evidence for enamel deposition function in epidermal subdomains. This is convincingly confirmed with an ameloblastin in situ. They further demonstrate distinct expression of SCPP and collagen genes in the SFC regions.
  
  The authors then demonstrate that Eda and TH signalling to the basal epidermal cells generates FGF and PDGF ligands to signal to surrounding mesenchyme, regulating SFC differentiation and dermal stratification respectively.
  
  Finally they exploit RNA-seq data performed in parallel in the bnc2 mutants to identify the hypodermal cells as critical regulators of pigment patterning and define the signalling systems used.
  
  Whilst these six interactions in the skin are disparate, the stories are unified by use of the sci-RNA-seq data to define interactions. Overall, it's an assembly of work which identifies novel and interesting cell interactions and cross-talk mechanisms. There are some aspects that require clarification:
  
  With respect to the discontinuous relationship noted in Figure 2I in the epidermis, the authors did not make mention of the fact that there are in fact two independent sources of periderm in the zebrafish. The first periderm derives from the EVL, is segregated at gastrulation, and gradually replaced from the basal epidermis during post-embryonic stages. Could this residual EVL-derived periderm have reduced sensitivity of the trajectory mapping from basal to periderm? The authors should comment whether their transcriptome dataset likely had residual EVL-derived periderm and if this might have impacted their trajectory continuity interpretation.
  
  While dual origin of periderm may impact the single cell analysis, this should not be an issue for suprabasal cells, which also show no continuity with their basal cell progenitors in UMAP space. We thank the reviewer for bringing this issue up and comment on the dual origin of periderm in the revised manuscript.
  
  “During this stage of development, basal epidermal cells are the stem cell population that differentiate into both suprabasal and periderm cells, and each of the three major epidermal cell types are well represented in our dataset (Figure 2H,I; Figure 1—figure supplement 3)(Guzman et al., 2013; Lee et al., 2014). While periderm cells at the sampled stage are likely of dual origin, representing a mixture of early embryonic and stem cell derived cells, suprabasal cells are entirely derived from basal cells (Kimmel et al., 1990; Guzman et al., 2013; Lee et al., 2014).”
  
  The authors ask if dermal SFCs express proteins associated with cartilage formation and use Col10a1 orthologues as markers (Fig 3B, I). I wonder if these are the best transcripts to answer this question as this has also been described to label osteoblasts in certain contexts in the fish and the authors might want to refer to Li et al 2009 or Avaron et al 2005. Were other markers of cartilage formation present such as collagen2 genes? These may be more definitive. The authors might want to reinterrogate their datasets for true cartilage markers or reframe their question.
  
  In the revised manuscript, we have clarified and moderated inferences from col10a1 ortholog expression. Col2 genes were not detected robustly in our dataset. This section now reads:
  
  “Scale elasmoidin is a flexible, collagenous ECM, material properties that are similar to cartilage (Quan et al., 2020). We therefore wondered whether dermal SFCs express matrix proteins associated with cartilage formation. Col10a1 is a major structural molecule in cartilage, although its expression has also been documented in osteoblasts (Gu et al., 2014; Yang et al., 2014; Kawasaki et al., 2021). The zebrafish genome harbors genes encoding two Col10a1 orthologs (col10a1a and col10a1b) and we found both transcripts in SFCs representing distinct steps of maturation (Figure 3B,I; Figure 2—figure supplement 1F,I).”
  
  Finally, of interest, were there any clear clusters on the UMAP plots (Fig 1 Supp3A) of unassigned identity? Even comment on these and how they were dealt with would be of significant interest to the field, as it is highly unlikely all cell types in the skin have been defined. This dataset promises to be a critical reference for defining these in the future.
  
  Thanks for raising this issue. We provide a new figure (Figure 1 – supplement 4) displaying the unsupervised clustering of the wild-type dataset and a new table (Supplementary file 2 – table 5) with gene expression information for these clusters.
  
  Minor clarification:
  
  Fig 2E top. The authors interpret that fate-mapped SFCs at the posterior margin are progressively displaced towards the scale focus. This is confusing as the margin SFC in Fig 2E seems to show them staying largely at the margin. Please clarify if this is what you meant.
  
  In Figure 2E, a new row of newly differentiated, non-photoconverted SFC were added, displacing the existing row of cells towards the scale focus. Since these cells are all very thin, the net displacement was not as dramatic as the displacement found for sub-marginal SFCs. This point has been clarified in the figure legend in the revised manuscript. This figure legend now reads:
  
  “Figure 2. Postembryonic skin cell lineage relationships are not reflected in UMAP space. (A) UMAP visualization showing distribution of differentiated SFC expressing sp7 and pre-SFC progenitors expressing runx2b. (B) In-situ hybridization of sp7 and runx2b shows that a halo of pre-SFC progenitors surround the growing scale (arrows). (C) sp7:nEOS expressing differentiated SFC (magenta), were labelled by photoconversion on Day 1. Over the following two days, newly differentiated, un-photoconverted SFC appeared at the scale margin (arrows; n = 5 fish). (D) Schematic representation of differentiated SFC (purple) and the associated halo of pre-SFC (blue). (E) Photoconversion of small groups of SFC in the scale margin and sub-margin; and single-cell photoconversion of focus SFCs (arrows) showed that SFC are progressively displaced toward the scale focus and that SFC in all these regions are capable of cell division (arrows, n ≥ 4 fish for each region tested). Margin SFCs were displaced towards the posterior by newly differentiated, un-photoconverted SFCs (arrowheads). (F) SFCs in UMAP space colored by “pseudotime” rooted in the SFCs. (G) SFCs in UMAP space colored by the ratio of a mesenchymal (migratory) signature to an epithelial signature (Supplementary file 2—Table 3). (H) Schematic representation of epidermis with major substrata. (I) UMAP visualization of wild-type epidermis, subclustered independently of other cell types and displaying expression of the epidermal basal cell marker tp63 (blue) and the periderm marker krt4 (red). Scale bars, 50 μm (B,C,E); 25 μm, (C, lower). (J) The fraction of cells from panel H that pass a minimum threshold for expression of tp63, krt4 or both genes.”
  
  References
  
  Debiais-Thibaud M, Oulion S, Bourrat F, Laurenti P, Casane D, Borday-Birraux V. 2011. The homology of odontodes in gnathostomes: insights from Dlx gene expression in the dogfish, Scyliorhinus canicula. BMC Evolutionary Biology 11:307. doi: 10.1186/1471-2148-11-307
  
  Pemberton TJ, Li FY, Oka S, Mendoza-Fandino GA, Hsu YH, Bringas P, Jr., Chai Y, Snead ML, Mehrian-Shai R, Patel PI. 2007. Identification of novel genes expressed during mouse tooth development by microarray gene expression analysis. Dev Dyn 236:2245-57. doi: 10.1002/dvdy.21226, PMID: 17626284
  
  Woodruff ED, Kircher BK, Armfield BA, Levy JK, Bloch JI, Cohn MJ. 2022. Domestic cat embryos reveal unique transcriptomes of developing incisor, canine, and premolar teeth. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution 338:516-31. doi: 10.1002/jez.b.23168
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2021.05.12.443782v3

New submission 31/08/2023, 09:06:38

1. Public_Reviews 31 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 eLife assessment:
 
 This important study used a battery of cutting-edge technologies including whole-exome sequencing, knockout/knockdown animal models and comparative proteomics to define the physiological roles of ZMYND12 in the regulation of sperm flagellar development and male fertility. The data supporting the conclusion are solid, although inclusion of more patients and ultrastructural studies would have further strengthened the study. This work will be of interest to clinicians and researchers who work on male fertility, but also those working on organs/systems containing motile cilia (e.g., trachea, oviduct, ventricular ependymal cells).
 
 We thank the eLife editorial board for these very positive comments.
 
 The MMAF sperm phenotype is rare and, as for all rare diseases, the number of affected patients remains low. Moreover, the most prevalent genes have already been identified. In such a case, the identification of four unrelated patients with pathogenic mutations in the same new gene is significant, especially compared with most studies of the same phenotype. We agree that ultrastructural studies could provide valuable information. However, the number of sperm cells available did not allow us to consider such experiments at this time. The production and study of the Trypanosoma model enabled us to overcome these limitations.
 
 Reviewer #1 (Public Review):
 
 The goal of the authors is to use whole-exome sequencing to identify genomic factors contributing to asthenoteratozoospermia and male infertility. Using whole-exome sequencing, they discovered homozygous ZMYND12 variants in four unrelated patients. They examined the localization of key sperm tail components in sperm from the patients. To validate the findings, they knocked down the ortholog in Trypanosoma brucei. They further dissected the complex using coimmunoprecipitation and comparative proteomics with samples from Trypanosoma and Ttc29 KO mice. They concluded that ZMYND12 is a new asthenoteratozoospermia-associated gene, biallelic variants of which cause severe flagellum malformations and primary male infertility.
 
 The major strengths are that the authors used the cutting-edge technique, whole-exome sequencing, to identify genes associated with male infertility, and used a new model organism, Trypanosoma brucei to validate the findings; together with other high-throughput tools, including comparative proteomics to dissect the protein complex essential for normal sperm formation/function. The major weakness is that limited samples could be collected from the patients for further characterization by other approaches, including western blotting and TEM. In general, the authors achieved their goal and the conclusion is supported by their results. The findings not only provide another genetic marker for the diagnosis of asthenoteratozoospermia but also enrich the knowledge in cilia/flagella.
 
 We thank the reviewer for these positive comments that are helpful for improving our paper. Concerning the remark about the low amount of sperm cells available, most patients allowed us to use excess sperm samples not used for ART treatment but are generally reluctant to perform a new sperm collection. Therefore, we often have to prioritize the most relevant and suitable experiments with the amount of sperm cells available.
 
 Reviewer #2 (Public Review):
 
 The manuscript "Novel axonemal protein ZMYND12 interacts with TTC29 and DNAH1, and is required for male fertility and flagellum function" by Dacheux et al. interestingly reported homozygous deleterious variants of ZMYND12 in four unrelated men with asthenoteratozoospermia. Based on the immunofluorescence assays in human sperm cells, it was shown that ZMYND12 deficiency altered the localization of DNAH1, DNALI1, WDR66 and TTC29 (four of the known key proteins involved in sperm flagellar formation). Trypanosoma brucei and mouse models were further employed for mechanistic studies, which revealed that ZMYND12 is part of the same axonemal complex as TTC29 and DNAH1. Their findings are solid, and this manuscript will be very informative for clinicians and basic researchers in the field of human infertility.
 
 We thank the reviewer for these positive comments that are helpful for improving our paper.
 
 Reviewer #3 (Public Review):
 
 In this study, the authors identified homozygous ZMYND12 variants in four unrelated patients. In sperm cells from these individuals, immunofluorescence revealed altered localization of DNAH1, DNALI1, WDR66, and TTC29. Axonemal localization of ZMYND12 ortholog TbTAX-1 was confirmed using the Trypanosoma brucei model. RNAi knock-down of TbTAX-1 dramatically affected flagellar motility, with a phenotype similar to ZMYND12-variant-bearing human sperm. Co-immunoprecipitation and ultrastructure expansion microscopy in T. brucei revealed TbTAX-1 to form a complex with TTC29. Comparative proteomics with samples from Trypanosoma and Ttc29 KO mice identified a third member of this complex: DNAH1. The data presented revealed that ZMYND12 is part of the same axonemal complex as TTC29 and DNAH1, which is critical for flagellum function and assembly in humans, and Trypanosoma. The manuscript is informative for the clinical and basic research in the field of spermatogenesis and male infertility.
 
 We thank the reviewer for these positive comments that are helpful for improving our paper.
 
 Reviewer #1 (Recommendations For The Authors):
 
 The manuscript was very well written, and very easy to follow. Most data were presented in high quality. I only have a few minor issues with some figures.
 
 The signals in some IF images (Fig. 1E, Fig. 2B) are too weak.
 
 The figures were improved and modified accordingly.
 
 In some IF images, strong dot-like signals are observed (Fig. 1B, Fig. 2D, Fig. 2F). Are they specific signals or non-specific? Please specify. If they are non-specific, please replace these images.
 
 These figures were improved and modified accordingly. Indeed, the dot-like signals were non-specific.
 
 Reviewer #2 (Recommendations For The Authors):
 
 Here further revisions are suggested.
 
 1) Description of ZMYND12 genotypes of the patients and the sperm cell samples: In the title of Table 1, it is suggested to mention "homozygous" for ZMYND12 variants in the patients, since the heterozygous carriers should be unaffected.
 
 This was done as suggested.
 
 In the Abstract ("with a phenotype similar to ZMYND12-variant-bearing human sperm"), it is suggested to use "with a phenotype similar to the sperm from men bearing homozygous ZMYND12 variants", since the sperm phenotypes are dependent on the biallelic genotypes of human individuals (not the monoallelic genotype of the sperm cells). Please check the whole manuscript and revise the similar points.
 
 This was done as suggested.
 
 2) The database accession number for ZMYND12: There are three different numbers (NM_032257.5 vs NM_032257 vs ENSTxxxx) on Page 5 and Figure 1B. Please use NM_032257.5 for consistency.
 
 This was done as suggested.
 
 3) For the exonic deletion variant, is it possible to predict the coding consequence of ZMYND12 protein?
 
 No reliable in silico prediction could be performed because the exact breakpoints of the exon deletion are unknown. mRNA (or WB) studies could clarify this point; however, no additional sperm samples from this patient were available.
 
 4) Please italicize the gene symbols. For example, TTC29 on Page 8 and Figure S4, Ttc29-/- KO on Page 13.
 
 This was done as suggested.
 
 5) In Figure 2, there are too many panels that cannot be merged into one page. Some of the data can be shown as supplemental data.
 
 We modified Figure 2 as suggested. The new Figure 2 now includes only four panels (A, B, C and D), and we added a new Figure S4 with the two remaining panels. We modified the text, figure legends and numbering accordingly.
 
 6) Some of the references are duplicated. Please delete one of them. For example, there are two Broadhead et al., two Coutton et al. (Nat Commun), and two Dacheux et al.
 
 We apologize for the duplicates; they have been corrected.
 
 7) The information on some references is incomplete (missing volume and/or page numbers). For example, Touré et al and Wang et al. (2010).
 
 This has been corrected.
 
 Reviewer #3 (Recommendations For The Authors):
 
 However, I have several points, as follows:
 
 The sperm concentrations of ZMYND12_3 in patient 3 and patient 4 are significantly different from the other two patients. Do you think it is just due to phenotype heterogeneity?
 
 We have no formal explanation for these observations, but we think that such differences in sperm concentration are most likely due to patient heterogeneity.
 
 There is no record of detailed semen parameters for ZMYND12_4, and readers cannot see that the proportion of short flagella in Table 1 is 70%. Please provide complete semen routine information for this case.
 
 Unfortunately, no additional information about the semen parameters of this patient is available at this time.
 
 In this study, no immunostaining for DNAH1, DNALI1, or WDR66 was detected in sperm from individual ZMYND12_3, and subsequent validation found that TTC29 interacted with ZMYND12 in Trypanosoma brucei. DNAH1 and DNALI1 both interact with TTC29 in mice. The author concluded that ZMYND12 is part of the same axonemal complex as TTC29 and DNAH1 and plays a critical role in flagellum function and assembly. If it is possible, the author can add an experiment on the interaction between ZMYND12 and DNAH1 to make this theory more complete.
 
 Our study focuses on characterizing protein-protein interactions using IPs (Immunoprecipitations). We were able to demonstrate that the protein ZMYND12, along with TTC29, DNAH1, and DNALI1, belongs to the same complex, IAD-4. However, this technique does not allow us to draw conclusions about direct interactions for any of the identified proteins.
 
 Our Co-IP results in T. brucei indicate that the orthologue of DNAH1 (Tb927.11.8160 orthologs) and TTC29 co-immunoprecipitate with TAX-1 (ZMYND12), thereby complementing the study conducted in Chlamydomonas by Yamamoto et al., 2008. As suggested by reviewer 3, direct interactions between each protein could provide valuable insights into the organization of the intracomplex protein interactome. This aspect will be addressed in a separate study, as it requires the use of direct interaction techniques such as Y2H (Yeast Two-Hybrid) or DuoLink.
 
 Please check the reference section. Some references have duplication, and the content of the literature also needs to be standardized. For example,
 
 Broadhead R., Dawe HR, Farr H, Griffiths S, Hart SR, Portman N, Shaw MK, Ginger ML, Gaskell SJ, McKean PG, Gull K. 2006. Flagellar motility is required for the viability of the bloodstream trypanosome. Nature 440:224-7.
 
 Broadhead Richard, Dawe HR, Farr H, Griffiths S, Hart SR, Portman N, Shaw MK, Ginger ML, Gaskell SJ, McKean PG, Gull K. 2006. Flagellar motility is required for the viability of the bloodstream trypanosome. Nature 440:224-227. doi:10.1038/nature04541
 
 Ersfeld K, Gull K. 2001a. Targeting of cytoskeletal proteins to the flagellum of Trypanosoma brucei. J Cell Sci 114:141-148.
 
 Ersfeld K, Gull K. 2001b. Targeting of cytoskeletal proteins to the flagellum of Trypanosoma brucei. J Cell Sci 114:141-148. doi:10.1242/jcs.114.1.141
 
 We apologize for the duplicates; they have been corrected.
 

URL

biorxiv.org/content/10.1101/2023.03.07.531490v3
www.biorxiv.org

New submission 31/08/2023, 09:03:02

1
1. Public_Reviews 31 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 Combined Public Review:
 
 It has been shown previously that maternal aging in mice is associated with an increase in accumulation of damaged mitochondria and activation of parkin-mediated autophagy (see DOI: 10.1080/15548627.2021.1946739). It has also been shown that C-natriuretic peptide (CNP) regulates oocyte meiotic arrest and that its use during in vitro oocyte maturation can improve parameters associated with decreased oocyte quality. Here the authors tested whether use of CNP treatment in vivo could improve oocyte quality and fertility of aged mice, for which they provided convincing evidence. They also attempted to determine how CNP improves oocyte developmental competence. They showed a correlation between CNP use in vivo and the appearance (and some functional qualities) of cytoplasmic organelles more closely approximating those of oocytes from young mice. However, this correlation could not be interpreted to imply causation. Additional experiments performed using CNP during in vitro maturation were not properly controlled and so are not possible to interpret.
 
 A strength of the manuscript is that the authors use an in vivo treatment to improve oocyte quality rather than just using CNP during oocyte maturation in vitro as has been done previously. This strategy provides more potential for improving oocyte quality - over the course of oocyte growth and maturation - rather than just the final few hours of maturation alone. This strategy also has the potential to be translated into a more generally useful clinical therapeutic method than using CNP during in vitro maturation. However, it is difficult to glean information regarding how CNP might have its effects in vivo. A range of models are used in the manuscript with a mix of in vivo studies with in vitro experiments, which results in some disconnect between systemic CNP and its reported intrafollicular action as well as in the short-term versus longer-term actions of CNP on oocyte quality. Specifically, CNP was shown to be reduced in the plasma of aged mice, but this was not shown in the granulosa cells, which are the reported source of CNP that acts on oocytes. Whether the ovarian source of CNP is reduced in aged females was not demonstrated, and CNP is not known to act on oocytes through an endocrine effect. In vivo treatments with CNP by i.p. injection were performed, but the dose (120 μg/kg) and time (14 days) of treatment were not validated by any prior experiments to give them physiological relevance.
 
 Thank you for the summary and for highlighting our manuscript’s strengths and weaknesses.
 
 Weaknesses:
 
 There are errors in the manuscript writing that make the Results difficult to follow. Reference to the Figures in the Results section does not match what is shown in the Figure panels. For example, the Results text reports differences in CNP levels in aged and young mice shown in Figure 1C, but the relevant panel is actually shown in Figure 1F. Other Figures have the same problem.
 
 Thanks for the valuable suggestion. All the mistakes have been corrected in the revised manuscript.
 
 The Results section is not always clear regarding what CNP treatment was done - in vivo injections or in vitro maturation. For example, what is the difference, if any, between Figures 2C-D and Figures S2A-B?
 
 Thank you for pointing out the potential confusion regarding the experimental procedures in Figures 2C-D and Figures S2A-B. In the revised manuscript, we have included additional explanations to clarify that Figures 2C-D represent in vivo injections, while Figures S2A-B depict in vitro maturation. In brief, the results presented in the Supplementary Material (Figures S1-S7) are derived from in vitro CNP treatment.
 
 Immature oocytes from aged females (~1 year) were treated with a two-step culture system with a pre-IVM step with CNP. Controls included oocytes from young (6-8 weeks) females or oocytes from aged females treated by conventional IVM. The description of these methods suggests that control oocytes did not receive an equivalent pre-IVM culture, hence the relevance of comparisons of CNP-treated versus control oocyte is questionable. It was observed that aged oocytes pre-cultured in CNP improved polar body extrusion rates and meiotic spindle morphology compared to oocytes in conventional IVM, as has been well established. The description of statistical methods does not make clear whether the PBE rate in CNP-treated old oocytes remained significantly lower than young controls.
 
 Statistical analyses were performed using GraphPad Prism 8.00 software (GraphPad, CA, United States). Differences between two groups were assessed using the t-test. Indeed, CNP is unlikely to fully restore the PB1 rate in aged mice to the same level as in the young group. PB1 rate in CNP-treated aged oocytes remained significantly lower than young controls (P<0.05).
 
 The main effect of the CNP 2-week treatment appears to be increasing the number of follicles that grow into secondary and antral stages, but there is no attempt made to discover the mechanism by which this occurs and therefore to understand why there might be an increase in the number of ovulated eggs, quality of the eggs, and litter size. It is also not clear how an intraperitoneal injection can guarantee its effectiveness because the half-life of CNP is very short, only a few minutes.
 
 The 2-week treatment of CNP had a significant impact, leading to an increase in the number of follicles progressing to secondary and antral stages, as well as an increase in the number of ovulated eggs, improved egg quality, and enhanced litter size. Previous studies (references: 10.1530/REP-18-0470; 10.1210/me.2012-1027) have demonstrated the crucial role of CNP as an upstream regulator in stimulating preantral follicle growth and promoting the ovulation rate. These studies have also identified the influence of CNP on the expression of key ovarian genes involved in cell growth and steroidogenic enzymes. Consistent with these findings, our study provides further evidence supporting CNP as a critical regulator of preantral follicle growth and oocyte quality. Furthermore, it is important to note that oocyte-derived paracrine factors play essential roles in follicular development. CNP may regulate the communication between oocytes and somatic cells, contributing to folliculogenesis and follicular development. We are considering this aspect for further investigation in another ongoing study.
 
 To ensure the effectiveness of CNP, given its short half-life (a few minutes), aged mice (58 weeks old) received daily intraperitoneal injections of CNP (120 μg/kg body weight; Cat#B5441, ApexBio) for a duration of 14 days.
 
 Meiotic spindle morphology, as well as a number of putative markers of cytoplasmic maturation are also suggested to be improved after pre-culture with CNP. In each case a subjective interpretation of "normal" morphology of these markers is derived from observations of the young controls and the proportions of oocytes with normal or abnormal appearance is evaluated. However, parameters that define abnormal patterns of these markers appear to be subjective judgements, and whether these morphological patterns can be mechanistically attributed to the differences in developmental potential cannot be concluded.
 
 Oocyte cytoplasmic maturation involves a remarkable reorganization of the oocyte cytoplasm, encompassing the movement of vesicles, mitochondria, Golgi apparatus, and endoplasmic reticulum. This dynamic process occurs during the transitions from the germinal vesicle breakdown (GVBD) stage to the metaphase I (MI), polar body extrusion (PBE), and metaphase II (MII) stages (reference: 10.1093/humupd/dmx040). In our study, we observed that CNP treatment partially rescued cytoplasmic maturation events in aged oocytes by maintaining normal distribution patterns of cortical granules (CG), endoplasmic reticulum (ER), and Golgi apparatus. However, further experiments are needed to investigate the specific action of CNP on the function of CG, ER, and Golgi apparatus. These experiments are beyond the scope of this manuscript, but we acknowledge the importance of this aspect and will consider it for future research. In this study, our main focus was to examine the effects of CNP on mitochondria distribution and function. Therefore, we analyzed the localization patterns of mitochondria, mitochondrial membrane potential, oocyte ATP content, and ROS levels. These experiments were aimed at elucidating the impact of CNP on mitochondrial dynamics and metabolism, which are crucial for oocyte quality and development.
 
 In addition to the localization patterns of mitochondria, the mitochondrial membrane potential, oocyte ATP content and ROS levels were assessed through more objective quantitative methods. These are well known to be defective in oocytes of aged females and CNP treatment improved these measures. Mitochondrial dysfunction is the most obvious link between oocyte apoptosis, autophagy, cytoplasmic organelle mislocalization and aberrant spindle morphology. Among the most intriguing results is the finding that CNP mediated a cAMP-dependent protein kinase (PKA) dependent reduction in mitochondrial autophagy mediators PINK and Parkin and reduced the recruitment of Parkin to mitochondria in oocytes. However, it may not be possible to directly link this observation to the improvements in IVM oocyte quality, since PINK/Parkin assessments were performed in oocytes from cultured follicles treated with CNP for 6 days.
 
 The beneficial effects of CNP on oocyte quality have been extensively demonstrated through in vivo experiments (Figures 1 and 4) and “two-step” in vitro culture experiments (Figures S1 and S7). In this study, our primary focus is to analyze the signaling pathway and mechanism by which CNP inhibits mitophagy in oocytes. Previous studies have highlighted the significant role of cAMP-PKA activity in reducing mitochondrial recruitment of Parkin and mitophagy (reference: 10.1038/s42003-020-01311-7). Consistent with these findings, our study revealed that aged oocytes exhibited lower concentrations of cAMP compared to young oocytes. However, upon administration of CNP, we observed a substantial increase in intraoocyte cAMP levels. To investigate the involvement of PKA in CNP-mediated oocyte mitophagy, we conducted further experiments. We isolated preantral follicles (80-100 µm diameter) from the ovaries of aged mice and subjected them to in vitro culture with either 100 nM CNP or a combination of 100 nM CNP and 10 µM H89, a PKA inhibitor. Monitoring the growth dynamics of the follicles revealed that treatment with 100 nM CNP significantly increased follicle diameter, while H89 treatment inhibited the promotive effect of CNP on preantral follicle growth (Figure 6 K and L). Western blot analysis demonstrated that CNP supplementation led to a significant decrease in PINK1 and Parkin expression levels, which were abrogated by H89 treatment (Figure 6 M-O). It is well-established that the cAMP-PKA pathway plays a crucial role in inhibiting Parkin recruitment to damaged mitochondria (Akabane et al., 2016). Therefore, we aimed to investigate whether PKA inhibition regulates Parkin recruitment. To assess the effects of CNP on mitochondria, we performed double staining for Parkin and translocase of outer mitochondrial membrane 20 (TOMM20). The results clearly demonstrated that CNP inhibited the mitochondrial localization of Parkin, while PKA inhibition with H89 led to Parkin translocation to mitochondria, as indicated by the overlap of the two staining signals (Figure 6 P and Q). Collectively, our data suggest that the suppression of Parkin recruitment through the cAMP-PKA axis represents an important mechanism underlying the protective effect of CNP against oxidative injury in maternally aged mouse oocytes.
 
 The gold standard assay for oocyte quality is embryo transfer and live birth. The authors assessed the impact of maturing oocytes in vitro in the presence of CNP on oocyte quality by less robust assays (e.g., preimplantation embryo development in vitro), so the impact on oocyte quality is less certain.
 
 We appreciate the Reviewer’s suggestion to assay live birth rates by transferring embryos obtained from IVM oocytes. However, we decided not to pursue this option for this revision because current technical challenges make it difficult to obtain precise live birth rates from IVM oocytes. Thank you for this valuable suggestion; it has highlighted a shortcoming of our current work, and we will follow it in our future work.
 
 The terminology used to describe many of the Results exaggerates the findings. For example, the authors claim that many of their immunofluorescent markers of the various organelles have a pattern that is "restored" by CNP. However, in most cases the pattern is "improved" toward the control condition but is not fully restored.
 
 We acknowledge the confusion caused by the wording of the mechanism of action of CNP in the original version. In the resubmission, we have made significant improvements by providing critical information that clarifies the action of CNP. We believe that these revisions will enhance the understanding of the mechanism of CNP and its implications. Thank you for pointing out this issue, and we appreciate your feedback in helping us improve the clarity of our work.
 
 The numbers of embryos should have been corrected for the number of eggs fertilized as a starting point so that the percentage that developed to each stage could be expressed as a percentage of successfully fertilized eggs rather than overall percentages. As currently shown in the Figures and described in the Legend, there is no information regarding what the percentage on the y-axis means. For example, does Figure 4B show the number of 2C embryos divided by the number of eggs inseminated? Or is it divided by the number of successfully fertilized eggs, and if so, how was that assessed?
 
 The embryonic development rates (Figure 4 B-F) were calculated based on the total number of oocytes, and the percentages of oocytes that developed to each stage were expressed as overall percentages.
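 The reviewer's question comes down to which denominator is used. A minimal sketch of the two options follows; the counts are hypothetical placeholders rather than data from the study, and the function name `stage_rate` is ours.

```python
def stage_rate(n_at_stage: int, n_denominator: int) -> float:
    """Return the percentage of n_denominator that reached a given stage."""
    if n_denominator <= 0:
        raise ValueError("denominator must be positive")
    if not 0 <= n_at_stage <= n_denominator:
        raise ValueError("stage count must lie between 0 and the denominator")
    return 100.0 * n_at_stage / n_denominator


# Hypothetical counts for one group (illustrative only, not from the study).
total_oocytes = 50   # all oocytes inseminated
fertilized = 40      # oocytes scored as successfully fertilized
two_cell = 30        # embryos that reached the 2-cell stage

# Option 1: overall percentage, as the authors describe for Figure 4 B-F.
overall_pct = stage_rate(two_cell, total_oocytes)

# Option 2: percentage of fertilized eggs, as the reviewer suggests.
per_fertilized_pct = stage_rate(two_cell, fertilized)

print(f"2-cell rate per oocyte: {overall_pct:.1f}%")
print(f"2-cell rate per fertilized egg: {per_fertilized_pct:.1f}%")
```

 With these placeholder counts the same 30 embryos give 60.0% per oocyte and 75.0% per fertilized egg, which is why the y-axis definition needs to be stated.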
 
 When fewer eggs are fertilized, the numbers of embryos per group are lower and so the impact of culturing multiple embryos together is lost. As a result, it is possible that culture conditions rather than oocyte quality drove the differences in the numbers of embryos that achieved each stage of development.
 
 The embryonic development rate was calculated based on the total number of oocytes. Each group included a minimum of 50 oocytes with three replicates (Young: 51, aged: 53, CNP+aged: 50). The embryo culture conditions were consistent across all groups.
 
 Not all claims in the Discussion are supported by the evidence provided. For example, "In addition, the findings demonstrated that CNP improved cytoplasmic maturation events by maintaining normal CG, ER and Golgi apparatus distribution and function in aged oocytes" but it was never demonstrated that the altered distribution had any functional impact.
 
 Oocyte cytoplasmic maturation involves a remarkable reorganization of the oocyte cytoplasm, including the movement of vesicles, mitochondria, Golgi apparatus, and endoplasmic reticulum. Extensive remodeling and repositioning of intracellular organelles occur during the transitions from GVBD to MI, PBE, and MII stages (10.1093/humupd/dmx040). Our findings indicate that CNP partially rescued cytoplasmic maturation events in aged oocytes by preserving normal distribution of CG, ER, and Golgi apparatus, as well as maintaining mitochondrial function. We acknowledge the importance of considering the impact of CNP on the function of CG, ER, and Golgi apparatus for future research. In summary, these findings demonstrate that CNP improves cytoplasmic maturation events in aged oocytes by facilitating the reorganization of CG, ER, and Golgi apparatus.
 
 Incompleteness and errors in the Methods section reduce confidence in many of the results reported.
 
 We will enhance the readability of the entire Methods section for the resubmission.
 
 The methods used for Statistical Analysis are never explained in either the Methods or the Figure legends. It is unclear whether appropriate analyses were done, and it is frequently unclear what was the sample size and how many times a particular experiment was repeated. These weaknesses detract from confidence in the data.
 
 Statistical analyses were performed using GraphPad Prism 8.00 software (GraphPad, CA, United States). Differences between two groups were assessed using the t-test. Data were reported as means ± SEM. Statistically significant differences were denoted by asterisks (P < 0.05 denoted by *, P < 0.01 denoted by **, P < 0.001 denoted by ***, and P < 0.0001 denoted by ****).
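 The reporting convention described here (mean ± SEM, with asterisks for significance thresholds) can be sketched as below. This is an illustration only, not the authors' analysis: the one-to-four-asterisk mapping is assumed from the thresholds listed, and the helper names `mean_sem` and `p_to_stars` are ours.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for a sample."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations to estimate SEM")
    # SEM = sample standard deviation / sqrt(n)
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def p_to_stars(p):
    """Map a P value to an asterisk label (assumed four-level convention)."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("P value must lie in [0, 1]")
    for threshold, stars in ((0.0001, "****"), (0.001, "***"),
                             (0.01, "**"), (0.05, "*")):
        if p < threshold:
            return stars
    return "ns"  # not significant at the 0.05 level
```

 Checking the strictest threshold first means each P value receives the label of the smallest threshold it falls below.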
 
 Recommendations for the authors: please note that you control which revisions to undertake from the public reviews and recommendations for the authors
 
 The introduction does not provide critical information regarding what is already known about the mechanism of action of CNP, what other tissues are impacted by CNP treatment, and how it might affect oocyte growth. Providing this information would make it much easier to understand what is novel about the current manuscript.
 
 We acknowledge that the mechanism of action of CNP was unclear in the original version. We have now included essential information to clarify the action of CNP.
 
 Comparison of the RNAseq dataset to robust datasets from young vs aged mice would strengthen the analysis (e.g., the dataset in DOI: 10.1111/acel.13482).
 
 Thank you for this suggestion. We will compare our RNA-seq dataset with robust datasets from young versus aged mice in our future work.
 
 Please explain what is "Dr. Tom" that was used for RNA sequencing analysis, in the Methods.
 
 Dr. Tom is a web-based solution that offers convenient analysis, visualization, and interpretation of various types of RNA data, including mRNA, miRNA, and lncRNA. It also supports the interpretation of single-cell RNA-seq data and WGBS data. Developed by a team of expert scientists and bioinformaticians at BGI, who have extensive experience in numerous research projects, Dr. Tom provides a wide range of intuitive and interactive data visualization tools tailored to save time in conducting differential expression or pathway analysis research. Moreover, its powerful analysis tools and advanced algorithms enable users to extract new insights and derive additional value from their data beyond what is available through standard RNA analysis services. The integration of data from leading databases worldwide allows users to reference and cross-check their results and findings. Dr. Tom is already trusted by tens of thousands of scientists and researchers, serving as a valuable and essential tool alongside their own internal data curation and analysis efforts. To learn more, please visit: Dr. Tom website https://www.bgi.com/global/service/dr-tom.
 
 The Results state that single-cell transcriptomics was performed, but the Methods state that 5 oocytes were collected from each mouse. The actual Method used should be clarified.
 
 Single-cell RNA-seq is a powerful technique that enables digital transcriptome analysis at the single-cell level using deep-sequencing methods. With this approach, even a single cell can be isolated and processed through various steps to generate sequencing libraries. Given the limited availability of oocyte samples, we employed a single-cell RNA-seq library construction protocol, allowing us to analyze the transcriptomes of individual oocytes. As a result, we collected and analyzed five oocytes from each mouse in our study.
 
 The raw RNAseq data should be deposited into a publicly accessible database and reported by an accession number. It is not sufficient to state that the data is included in the manuscript and supporting information.
 
 The RNA-seq data has been submitted as supporting information and is now accessible to all readers.
 
 The image in Figure 1G is not very clear.
 
 Thank you for bringing this to our attention. We will enhance the readability of all our figures for the resubmission.
 

URL

biorxiv.org/content/10.1101/2023.05.08.538611v2
www.biorxiv.org

New submission 31/08/2023, 08:50:08

1
1. Public_Reviews 31 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  The authors report a study, where they have sequenced whole genomes of four individuals of an extinct species of butterfly from western North America (Glaucopsyche xerces), along with seven genomes of a closely related species (Glaucopsyche lygdamus), mainly from museum specimens, several to many decades old. They then compare these fragmented genomes to a high-quality, chromosome-level assembly of a genome of a European species in the same genus (Glaucopsyche alexis). They find that the extinct species shows clear signs of declining population sizes since the last glacial period and an increase in inbreeding, perhaps exacerbating the low viability of the populations and contributing to the extinction of the species.
  
  The study really highlights how museum specimens can be used to understand the genetic variability of populations and species in the past, up to a century or more ago. This is an incredibly valuable tool, and can potentially help us to quickly identify whether current populations of rare and declining species are in danger due to inbreeding, or whether at least their genetic integrity is in good condition and other factors need to be prioritised in their conservation. In the case of extinct species, sequencing museum specimens is really our only window into the dynamics of genomic variability prior to extinction, and such information can help us understand how genetic variation is related to extinction.
  
  I think the authors have achieved their goal admirably, they have used a careful approach to mapping their genomic reads to a related species with a high-quality genome assembly. They might miss out on some interesting genetic information in the unmapped reads, but by and large, they have captured the essential information on genetic variability within their mapped reads. Their conclusions on the lower genetic variability in the extinct species are sound, and they convincingly show that Glaucopsyche xerces is a separate species to Glaucopsyche lygdamus (this has been debated in the past).
  
  We thank the reviewer for his/her positive assessment and we hope to have contributed to both the knowledge of this iconic extinct species and also the possibility of applying our observations to other, endangered insects.
  
  Reviewer #2 (Public Review):
  
  The Xerces Blue is an iconic species, now extinct, that is a symbol for invertebrate conservation. Using genomic sequencing of century-old specimens of the Xerces Blue and its closest living relatives, the authors hypothesize about possible genetic indicators of the species' demise. Although the limited range and habitat destruction are the most likely culprits, it is possible that some natural reasons have been brewing to bring this species closer to extinction.
  
  The importance of this study is in its generality and applicability to any other invertebrate species. The authors find that low effective population size, high inbreeding (for tens of thousands of years), and higher fraction of deleterious alleles characterize the Xerces colonies prior to extinction. These signatures can be captured from comparative genomic analysis of any target species to evaluate its population health.
  
  It should be noted that it remains unclear if these genomic signatures are indeed predictive of extinction, or populations can bounce back given certain conditions and increase their genetic diversity somehow.
  
  Methods are detailed and explained well, and the study could be replicated. I think this is a solid piece of work. Interested researchers can apply these methods to their chosen species and eventually, we will assemble datasets to study extinction process in many species to learn some general rules.
  
  We thank the reviewer for his/her observations and suggestions for improvement and we agree that endangered species show conflicting signals sometimes associated with decreasing genetic diversity (some species are very low in numbers and yet they keep reasonably high diversity levels as compared to others); however, this aspect remains to be explored in detail in insects that have demographic dynamics to a large extent impossible to compare to those observed in vertebrates. We agree there is a full range of cases and circumstances in declining insects to be explored in the future.
  
  Several small questions/suggestions:
  
  1) The authors reference a study concluding that Shijimiaeoides is Glaucopsyche. Their tree shows the same, confirming previous publications. And yet they still use Shijimiaeoides, which is confusing. Why not use Glaucopsyche for all these blues?
  
  We have decided, for the sake of clarity, to change it to Glaucopsyche divina in Figure 1, as suggested by the reviewer.
  
  2) Plebejus argus is a species much more distant from P. melissa than Plebejus anna (anna and melissa are really very close to each other), and yet their tree shows the opposite. What is the problem? Misidentification? Errors in phylogenetic analyses?
  
  The reviewer is right and we think there is a mixture of potential problems here that deserve a more in-depth analysis of this genus. We used MN974526 as a proxy for P. argus and we suspect now this is probably a case of misidentification (but we cannot verify it without a morphological examination of the original specimen and likely additional genomic data). MN974526 shows a 99.33% identity to the sequence by Vila et al. (2011) code NGK02C411, defined as P. melissa; as the true status of this mitogenome cannot be totally clarified (it is likely that it is in fact P. idas), we have decided to attribute it to “Plebejus sp.” in Figure 1 and explained this in the text.
  
  3) Wouldn't it be nicer to show the underside of butterfly pictures that reveals the differences between xerces and others? Now, they all look blue and like one species, no real difference.
  
  This is a good suggestion, and we have now included the underside of different species, including Xerces Blue.
  
  4) The authors stated that one of five xerces specimens failed to sequence, and yet they show 5 specimens in the tree. Was the extra specimen taken from GenBank?
  
  Yes, the extra specimen is the one reported in Grewe et al. 2021; we have marked in Figure 1 with an * this specific mitogenome (and mentioned in the legend), which clusters nicely within the set of Xerces Blue mtDNA diversity we have generated.
  
  Reviewer #1 (Recommendations For The Authors):
  
  I am curious why the authors did not attempt to do a de novo assembly of the extinct species' genomes. In our work on museum specimen genomes, we have successfully used a de novo approach to extract protein coding genes from such highly fragmented genomes. We used SPAdes to assemble the museum genomes and then assessed BUSCO completeness, finding anything from 50% to 90% BUSCO completeness. The genome assemblies themselves are pretty poor with N50s around a few thousand bp at best, but the information we can extract from such highly fragmented genomes is very useful, especially with regard to protein coding gene exons. Perhaps worth trying?
  
  Thanks for the comment. In our approach, and considering the expected low quality from some museum specimens in the lower part of the conservation spectrum, we used the standard approach based on the variant calling of short read data mapped to a close assembly. This method has been shown to be precise enough in cross species mapping (Kuderna et al. Science 2023). Local assemblies of exons and genes, while potentially informative, particularly for structural preservation, were not the priority in our objectives, where only the base pair mutations were explored. Nevertheless, we are planning to generate in the near future an assembly for the closest living relative of Xerces, Glaucopsyche lygdamus, and once we get it, we will consider the possibility of undertaking the suggested approach with this new reference to explore the genomic architecture of Xerces Blue in more detail.
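  For readers unfamiliar with the N50 statistic the reviewer cites when describing fragmented museum assemblies, a minimal sketch of how it is commonly computed is given below. The function name `n50` and the contig lengths in the usage lines are illustrative assumptions, not values from either group's data.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly.

    N50 is the length L such that contigs of length >= L together
    contain at least half of the total assembled bases.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths or lengths[-1] <= 0:
        raise ValueError("need a non-empty list of positive contig lengths")
    total = sum(lengths)
    running = 0
    for length in lengths:
        running += length
        # Stop at the first contig that brings the running sum to half the total.
        if 2 * running >= total:
            return length


# Hypothetical contig lengths in base pairs (illustrative only).
fragmented = [1000, 2000, 3000, 4000]
print(n50(fragmented))
```

  An N50 of a few thousand bp, as the reviewer describes, means that half of the assembled bases sit in contigs no longer than a typical gene, which is why such assemblies are mainly useful for recovering individual exons.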
  

URL

biorxiv.org/content/10.1101/2021.11.08.467457v3
www.biorxiv.org

New submission 07/08/2023, 09:38:19

2
1. Public_Reviews 31 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the previous reviews.
  
  Reviewer #1 Public Review
  
  “First, I agree with the authors of this manuscript that conformational changes in the XFEL structures with 2.8 A resolution are not reliable enough for demonstrating the subtle changes in the electron transfer events in this bacterial photosynthesis system. Actually, the data statistics in the paper by Dods et al. showed that the high-resolution range of some of the XFEL datasets may include pretty high noise (low CC1/2 and high Rsplit) so the comparison of the subtle conformational changes of the structures is problematic.
  
  The manuscript by Gai Nishikawa investigated time-dependent changes in the energetics of the electron transfer pathway based on the structures by Dods et al. by calculating redox potential of the active and inactive branches in the structures and found no clear link between the time-dependent structural changes and the electron transfer events in the XFEL structures published by Dods, R. et al. (2021). This study provided validation for the interpretation of the structures of those electron-transferring proteins.
  
  The paper was well prepared.”
  
  Thank you very much for your positive and insightful comment. We greatly appreciate your suggestion regarding the high noise levels of the XFEL structures. Including this information in the Introduction section will draw readers’ attention to the concerns about the reliability of these XFEL structures. We have incorporated it into the Introduction section.
  
  Reviewer #2 Public Review
  
  “The manuscript by Nishikawa et al. addresses time-dependent changes in the electron transfer energetics in the photosynthetic reaction center from Blastochloris viridis, whose time-dependent structural changes upon light illumination were recently demonstrated by time-resolved serial femtosecond crystallography (SFX) using X-ray free-electron laser (XFEL) (Dods et al., Nature, 2021). Based on the redox potential Em values of bacteriopheophytin in the electron transfer active branch (BL) by solving the linear Poisson-Boltzmann equation, the authors found that Em(HL) values in the charge-separated 5-ps structure obtained by XFEL are not clearly changed, suggesting that the P+HL- state is not stabilized owing to protein reorganization. Furthermore, chlorin ring deformation upon HL- formation, which was expected from their QM/MM calculation, is not recognized in the 5-ps XFEL structure. Then the authors concluded that the structural changes in the XFEL structures are not related to the actual time course of charge separation. They argued that their calculated changes in Em and chlorin ring deformations using the XFEL structures may reflect the experimental errors rather than the real structural changes; they mentioned this problem is due to the fact that the XFEL structures were obtained at not high resolutions (mostly at 2.8 Å). I consider that their systematic calculations may suggest a useful theoretical interpretation of the XFEL study. However, the present manuscript insists as a whole negatively that the experimental errors may hamper to provide the actual structural changes relevant to the electron transfer events.”
  
  Thank you for your feedback on our manuscript. We appreciate your positive assessment of our systematic calculations and theoretical interpretation of the XFEL study. We have carefully considered your comments and made the necessary revisions to address your concerns.
  
  Reviewer #2 Recommendations for the authors
  
  “The authors have satisfied my concerns mostly, in particular by providing the Em(QA) changes, which seem to be more attractive in the present form. However, the Em(QA) value(s), at least in the dark structure, should be provided, and the procedure of the calculation for the Em(QA) value(s) should be described in METHODS "Calculation of Em".”
  
  The calculated Em(QA) values for dataset a and dataset b in the dark structure are –223 mV and –209 mV, respectively, using the reference Em value of –256 mV versus NHE for menaquinone-2 in water [Photosynth. Res. 134 (2017) 193]. These calculated values are comparable to experimentally measured values of –150 mV for PbRC from Blastochloris viridis (naphthoquinone) [Biochim. Biophys. Acta 440 (1976) 622] and –180 mV for PbRC from Rhodobacter sphaeroides (ubiquinone) [Arch. Biochem. Biophys. 172 (1976) 329].
  
  We have now provided this information in the Method (“Calculation of Em”) and Results and Discussion (“Relevance of structural changes observed in XFEL structures”) sections.
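  The Em(QA) figures quoted in this response can be restated as shifts relative to the reference value in water. The short sketch below only repeats that arithmetic; it assumes that each calculated Em(QA) can be read as the reference Em plus a protein-induced shift, and the variable and function names are illustrative rather than taken from the manuscript.

```python
# All values are in mV versus NHE and are copied from the author response.
EM_REFERENCE_MV = -256.0  # menaquinone-2 in water (reference value)

# Calculated Em(QA) in the dark structures.
calculated_mv = {"dataset a": -223.0, "dataset b": -209.0}

# Experimentally measured value quoted for PbRC from Blastochloris viridis.
EXPERIMENTAL_BV_MV = -150.0


def shift_from(reference_mv: float, value_mv: float) -> float:
    """Return value minus reference, i.e. the shift in mV (hypothetical helper)."""
    return value_mv - reference_mv


# Assumed interpretation: shift of the calculated value away from water.
protein_shift = {k: shift_from(EM_REFERENCE_MV, v) for k, v in calculated_mv.items()}

# Gap between each calculated value and the quoted measurement.
gap_to_experiment = {k: shift_from(EXPERIMENTAL_BV_MV, v) for k, v in calculated_mv.items()}

for name in calculated_mv:
    print(f"{name}: shift {protein_shift[name]:+.0f} mV, "
          f"calculated minus measured {gap_to_experiment[name]:+.0f} mV")
```

  Under that reading, the calculated values sit 33 mV and 47 mV above the reference in water, and 73 mV and 59 mV below the value measured for Blastochloris viridis.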
  
  AuthorResponse
2. Public_Reviews 07 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  “First, I agree with the authors of this manuscript that conformational changes in the XFEL structures with 2.8 Å resolution are not reliable enough for demonstrating the subtle changes in the electron transfer events in this bacterial photosynthesis system. Actually, the data statistics in the paper by Dods et al. showed that the high-resolution range of some of the XFEL datasets may include pretty high noise (low CC1/2 and high Rsplit) so the comparison of the subtle conformational changes of the structures is problematic.
  
  The manuscript by Gai Nishikawa investigated time-dependent changes in the energetics of the electron transfer pathway based on the structures by Dods et al. by calculating redox potential of the active and inactive branches in the structures and found no clear link between the time-dependent structural changes and the electron transfer events in the XFEL structures published by Dods, R. et al. (2021). This study provided validation for the interpretation of the structures of those electron-transferring proteins.
  
  The paper was well prepared.”
  
  Thank you very much for your positive and insightful comment. We greatly appreciate your suggestion regarding the high noise levels of the XFEL structures, as indicated by the low CC1/2 and high Rsplit values reported by Dods et al. Including this information in the Introduction section will draw readers’ attention to the concerns about the reliability of these XFEL structures. We have incorporated the following sentences into the Introduction section:
  
  “Furthermore, the data statistics provided by Dods et al. indicate that the high-resolution range of some XFEL datasets exhibit high levels of noise, as evidenced by low CC1/2 and high Rsplit values. These observations raise concerns about the reliable comparison of subtle conformational changes among these structures. Hence, caution must be exercised when interpreting these XFEL structures in terms of their ability to accurately capture relevant conformational changes.”
  
  The following sentences have also been added to the Conclusions section:
  
  “Hence, it is crucial to exercise caution when interpreting time-dependent XFEL structures, especially in the absence of comprehensive evaluations of the energetics and accompanying structural changes. This cautionary note should serve as a counterargument in the future, highlighting the potential pitfalls associated with presenting time-dependent XFEL structures of insufficient quality and drawing conclusive interpretations of protein structural changes that may not be distinguishable from significant experimental errors.”
  
  Recommendations for the authors
  
  “Figure 1 needs clear labels or detailed notes in the figure legend for the labels such as M, L, Pm, Pl, etc.”
  
  In Figure 1, we have increased the size of the labels to improve visibility. Additionally, we have expanded the figure legend to include detailed explanations of the abbreviations used, such as M, L, PM, PL, etc. We believe that these modifications have significantly improved the clarity and comprehensibility of Figure 1.
  
  Reviewer #2 (Public Review):
  
  “The manuscript by Nishikawa et al. addresses time-dependent changes in the electron transfer energetics in the photosynthetic reaction center from Blastochloris viridis, whose time-dependent structural changes upon light illumination were recently demonstrated by time-resolved serial femtosecond crystallography (SFX) using X-ray free-electron laser (XFEL) (Dods et al., Nature, 2021). Based on the redox potential Em values of bacteriopheophytin in the electron transfer active branch (BL) by solving the linear Poisson-Boltzmann equation, the authors found that Em(HL) values in the charge-separated 5-ps structure obtained by XFEL are not clearly changed, suggesting that the P+HL- state is not stabilized owing to protein reorganization. Furthermore, chlorin ring deformation upon HL- formation, which was expected from their QM/MM calculation, is not recognized in the 5-ps XFEL structure. Then the authors concluded that the structural changes in the XFEL structures are not related to the actual time course of charge separation. They argued that their calculated changes in Em and chlorin ring deformations using the XFEL structures may reflect the experimental errors rather than the real structural changes; they mentioned this problem is due to the fact that the XFEL structures were obtained at not high resolutions (mostly at 2.8 Å). I consider that their systematic calculations may suggest a useful theoretical interpretation of the XFEL study. However, the present manuscript insists as a whole negatively that the experimental errors may hamper to provide the actual structural changes relevant to the electron transfer events. My concerns are the following two points:
  
  Is the premise of the authors for the electron transfer energetics obviously valid?
  
  Could the authors find any positive aspect(s) in the XFEL study?
  
  The authors' argument is certainly due to their premise "Em(HL) is expected to be exclusively higher in the 5-ps and 20-ps structures than in the other XFEL structures due to the stabilization of the [PLPM]•+HL•- state by protein reorganization" as noted in the Results and Discussion (p. 12, lines 180-182); however, it is unknown whether this premise can be applied to the ps-timescale electron transfer events. The above premise is surely based on the Marcus theory, as the authors also noted in the Introduction "The anionic state formation induces not only reorganization of the protein environment (ref. 5: Marcus and Sutin, 1985) but also out-of-plane distortion of the chlorin ring (ref. 6: two of the authors, Saito and Ishikita, co-authored, 2012)"; however, it is unknown whether protein reorganization can follow the ps-timescale electron transfer events. Indeed, Dods et al. mentioned in the Nature paper (2021) "The primary electron-transfer step from SP (special pair PLPM) to BPhL (HL) occurs in 2.8 ± 0.2 ps across a distance of 10 Å by means of a two-step hopping mechanism via the monomeric BChL molecule and is more rapid than conventional Marcus theory". It was also mentioned, "By contrast, the 9 Å electron-transfer step from BPhL to QA has a single exponential decay time of 230 ± 30 ps, which is consistent with conventional Marcus theory". As for the primary electron-transfer step from PLPM to HL, Wang et al. (2007, Science 316, 747; cited as ref. 8 in the Nature paper 2021) reported, by monitoring tryptophan absorbance changes in various reaction centers in which the driving forces (namely, the Em gaps between PLPM and HL) are different, that the protein relaxation kinetics is independent of the charge separation kinetics on the picosecond timescale. On the other hand, in the EPR study cited by the authors as ref. 7 (Muh et al. (1998) Biochemistry 37, 13066), although the authors described "two distinct conformations of HL- were reported in spectroscopic studies" (p. 3, lines 44-45), it should be noted that conformation of HL- was formed by 1 or 45 s illumination prior to freezing, and hence the second-order reorganized conformations may differ from picosecond-order conformations observed by the XFEL study (Nature, 2021) and/or the transient absorption spectroscopy (Science, 2007).
  
  Therefore, I consider there is a possibility that the authors' findings may reflect not experimental errors but the actual ps-timescale phenomena presented by the first-time XFEL study on the timescale of the primary charge-separation reactions of photosynthesis. Thus I would like to suggest that the authors reconsider the premise for the electron transfer energetics on the picosecond timescale.
  
  In any case, to discuss the experimental errors in the XFEL study, it is better to calculate the Em(QA) changes in the 300-ps and 8-μs XFEL structures, which showed distinctive structural changes even at the 2.8 Å resolution as discussed by Dods et al. Then, if the Em(QA) values are changed as expected from theoretical calculations, such calculated results may suggest a useful theoretical interpretation of the XFEL study as a positive aspect. If the Em(QA) values are not higher in the 300-ps and 8-μs structures than in the other structures, it may be argued that the experimental errors would be so large that the XFEL structures are irrelevant to the electron transfer events expected from theoretical calculations.”
  
  We appreciate the reviewer's constructive suggestions, which significantly contributed to the improvement of our manuscript. We have performed additional calculations to address the reviewer's suggestion. We calculated the changes in Em(QA) in the XFEL structures. The Em(QA) values in the 300-ps and 8-μs structures were not significantly higher than those in the other structures (Figure 8).
  
  These findings align with the scenario proposed by the reviewer, suggesting that the experimental errors are substantial, rendering the XFEL structures irrelevant to the electron transfer events. The results further reinforce our argument that the observed structural changes in the XFEL structures are not directly linked to the expected changes in electron transfer events.
  
  We have incorporated these important points into the revised version as follows:
  
  “One might argue that the loss of the link between the formation of the charge-separated state and the Em(HL) change (Figure 5) is not due to experimental errors but rather represents the actual ps-timescale phenomena during the primary charge-separation reactions (e.g., Dods et al. noted that “the primary electron-transfer step to HL is more rapid than conventional Marcus theory” 8). However, even if this were the case, this hypothesis regarding the relevance of the XFEL structures to the electron-transfer events can be further explored by examining the changes in Em(QA) among the XFEL structures, considering the relatively slow electron-transfer step to QA that allows sufficient protein relaxation to occur (e.g., Dods et al. stated that “the electron-transfer step to QA has a single exponential decay time of 230 ± 30 ps, consistent with conventional Marcus theory” 8). That is, if the Em(QA) values are not higher in the 300-ps and 8-μs structures than in the other structures, it suggests that significant experimental errors exist, rendering the XFEL structures irrelevant to the electron transfer events. Consistent with this perspective, the present results demonstrate that the Em(QA) values in the 300-ps and 8-μs structures are not significantly higher than those in the other structures, including the dark state structure (Figure 8). Consequently, the lack of a clear relationship between the charge separated state and the changes in Em(QA) at 300 ps and 8-μs further strengthens the argument that the XFEL structures are irrelevant to the electron transfer events.”
  
  Recommendations for the authors
  
  “In addition to my main concerns, the following points should also be taken into consideration:
  
  The authors presented from QM/MM calculations out-plane distortion of HL (and HM) induced upon the reduction using the dark structure for dataset a (Table 5). However, to compare with the XFEL structures corresponding to the charge-separated state [PLPM]+HL-, positive charge should be located at the special pair (or, either PL or PM). In the present work, it is noted that counter ions were added to neutralize the entire system (in Methods: p. 6, lines 104-105), but the location(s) of the positive charge is unclear.”
  
  We appreciate the valuable suggestion provided by the reviewer. To address this concern, we have calculated out-of-plane distortion of HL•– in the presence of PL•+. The results have been included in Table 5. Note that the results obtained in the presence of PL•+ are substantially the same as those obtained in PL0 (Table 5).
  
  For clarity, we have rephrased the sentence referring to counter ions as follows:
  
  “To neutralize the entire system, counter ions were added randomly around the protein using the Autoionize plugin in VMD 22.”
  
  “In relation to the calculations, the authors showed the induced out-plane distortion of HM for dataset a; however, the results for HM seem not to be mentioned anywhere. Instead, the calculations for HL of the dark structure for dataset b should be useful, especially for comparing with the time-dependent changes in the dataset b XFEL structures as shown in Figure 7.”
  
  We have made Table 6 to present the results for dataset b. The results are consistent with those for dataset a (Table 5).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.31.543167v3
www.biorxiv.org www.biorxiv.org

New submission 28/06/2023, 14:48:18

1
1. Public_Reviews 30 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  The cerebral cortex, or surface of the brain, is where humans do most of their conscious thinking. In humans, the grooves (sulci) and bumps (convolutions) have a particular pattern in a region of the frontal lobe called Broca's area, which is important for language. Specialists study features imprinted on the internal surfaces of braincases in early hominins by casting their interiors, which produces so-called endocasts. A major question about hominin brain evolution concerns when, where, and in which fossils a humanlike Broca's area first emerged, the answer to which may have implications for the emergence of language. The researchers used advanced imaging technology to study the endocast of a hominin (KNM-ER 3732) that lived about 1.9 million years ago (Ma) in Kenya to test a recently published hypothesis that Broca's remained primitive (apelike) prior to around 1.5 Ma. The results are consistent with the hypothesis and raise new questions about whether endocasts can be used to identify the genus and/or species of fossils.
  
  We would like to thank Rev. 1 for their comments on our paper.
  
  Reviewer #2 (Public Review):
  
  The authors tried to support the hypothesis that early Homo still had a primitive condition of Broca's cap (the region in fossil endocasts corresponding to Broca's area in the brain), being more similar to the condition in chimpanzees than in humans. The evidence from the described individual points to this direction but there are some flaws in the argumentation.
  
  We are grateful to Rev. 2 for their comments, although we only partially agree with some of them.
  
  First, we would like to rectify the statement of Rev. 2 that we “tried to support the hypothesis that early Homo still had a primitive condition of Broca's cap”. Our aim was to test this hypothesis, not to validate it.
  
  First, only one human and one chimpanzee were used for comparison, although we know that patterns of brain convolutions (and in addition how they leave imprints in the endocranial bones) are very variable.
  
  We understand the point raised by Rev. 2 about the variation of brain convolutions in humans and chimpanzees. We used atlases published by Connolly (1950), Falk et al. (2018) and de Jager et al. (2019, 2022) to analyse the endocast of KNM-ER 3732 and compare it to the extant human and chimpanzee cerebral conditions. However, in Figure 2, for the sake of clarity, only two specimens (one Homo and one Pan) were used to illustrate the comparison (as has been done in other published papers, e.g., Carlson et al., 2011 Science; Gunz et al., 2020 Sci Adv). In the revised version, we modified the manuscript to explain our approach further (line 156): “We used brain and endocast atlases published in Connolly (1950), Falk et al. (2018) and de Jager et al. (2019, 2022; see also www.endomap.org) for comparing the pattern identified in KNM-ER 3732 to those described in extant humans and chimpanzees. To the best of our knowledge, these atlases are the most extensive atlases of extant human and chimpanzee brains/endocasts available to date and are widely used in the literature to explore variability in sulcal patterns. In Figure 2, the extant human and chimpanzee conditions are illustrated by one extant human (adult female) and one extant chimpanzee (adult female) specimens from the Pretoria Bone Collection at the University of Pretoria (South Africa) and in the Royal Museum for Central Africa in Tervuren (Belgium), respectively (Beaudet et al., 2018).”.
  
  Second, the evidence from this fossil specimen adds to the evidence of previously described individuals but does not yet fully prove the hypothesis.
  
  We tempered our discussion by concluding that (line 116) “Overall, the present study not only demonstrates that Ponce de León et al.’s (2021) hypothesis of a primitive brain of early Homo cannot be rejected, but also adds information […]”.
  
  Third, there is a vicious circle in using primitive and derived features to define a fossil species and then using (the same or different) features to argue that one feature is primitive or derived in a given species. In this case, we expect members of early Homo to be derived compared to their predecessors of the genus Australopithecus and that's why it seems intriguing and/or surprising to argue that early Homo has primitive features. However, we should expect that there is some kind of continuum or mosaic in a time in which a genus "evolves into" another genus. This discussion requires far more discussions about the concepts we use, maybe less discussion about what is different between the two groups but more discussion about the evolutionary processes behind them.
  
  We fully agree with Rev. 2 on this aspect. We believe that identifying these differences/similarities between fossil and extant hominids constitutes the first step towards a better understanding of the evolutionary mechanisms. Our work indeed suggests a certain continuity between genera and raises questions on the genus concept and how to interpret the specimens currently attributed to early Homo. In the revised version of the manuscript we included a reference to this possible scenario (line 134): “[…] or to the absence of a definite threshold between the two genera based on the morphoarchitecture of their endocasts (Wood and Collard, 1999).”.
  
  Fourth, the data of convolutional imprints presented are rather subjective when identifying which impressions represent which brain convolutions. Not seeing an impression does not necessarily mean that the corresponding brain feature did not exist. Interestingly, the manuscript does not mention and discuss at all the frontoorbital sulcus. This is a sulcus that usually runs from the orbital surface of the frontal lobe up to divide the inferior frontal gyrus in chimpanzees, a condition totally different than in humans who do not have a frontoorbital sulcus. Could such a sulcus be identified, this would provide a far more convincing argument for a primitive condition in this specimen. In Australopithecus sediba, e.g., the condition in this region seems to be a mosaic in which some aspects of the morphology seem to be more modern while one of the sulcal impressions can well be interpreted as a short frontoorbital sulcus. For this specimen, by the way, I would come back to my third point above: some experts in the field might argue that this specimen could belong to Homo rather than Australopithecus...
  
  We agree that the presence of a fronto-orbital sulcus would be more conclusive. However, this sulcus has not been identified in KNM-ER3732 and the region in which we would expect to find it is not preserved. As demonstrated by Ponce de León et al. (2021), because of the topographic relationships between sulci (and cranial structures), it is possible to interpret imprints on endocasts and the evolutionary polarity of some traits even in the absence of landmarks such as the fronto-orbital sulcus. In Australopithecus sediba the main derived feature of the endocast corresponds to the ventrolateral bulge in the left inferior frontal gyrus, and not to the sulcal pattern itself (Carlson et al., 2011 Science). However, the discussion around the taxonomic status of this taxon confirms the urgent need for reconsidering specimens from that time period and clarifying the mosaic-like or concerted evolution of the derived Homo-like traits within our lineage. Regarding the subjective nature of this approach, we invite readers to examine the specimen on MorphoSource (https://www.morphosource.org/concern/media/000497752?locale=en) and to request access from the National Museums of Kenya to the physical or virtual specimen to falsify our hypothesis.
  
  According to my arguments above, I think that this manuscript might revive interesting discussions about this topic but it is not likely to settle them because the data presented are not strong enough to fully support the hypothesis.
  
  We would be more than happy to consider new/other specimens with similar chronological and geographical contexts and investigate further this hypothesis in the future.
  
  Reviewer #3 (Public Review):
  
  The authors provide a detailed analysis of the sulcal and sutural imprints preserved on the natural endocast and associated cranial vault fragments of the KNM-ER3732 early Homo specimen. The analyses indicate a primitive ape-like organization of this specimen's frontal cortex. Given the geological age of around 1.9 million years, this is the earliest well-documented evidence of a primitive brain organization in African Homo.
  
  In the discussion, the authors re-assess one of the central questions regarding the evolution of early Homo: was there species diversity, and if yes, how can we ascertain it? The specimen KNM-ER1470 has assumed a central role in this debate because it purportedly shows a more advanced organization of the frontal cortex compared to other largely coeval specimens (Falk, 1983). However, as outlined in Ponce de León et al. 2021 (Supplementary Materials), the imprints on the ER1470 endocranium are unlikely to represent sulcal structures and are more likely to reflect taphonomic fracturing and distortion. Dean Falk, the author of the 1983 study, basically shares this view (personal communication). Overall, I agree with the authors that the hypothesis to be tested is the following: did early Homo populations with primitive versus derived frontal lobe organizations coexist in Africa, and did they represent distinct species?
  
  I greatly appreciate that the authors make available the 3D surface data of this interesting endocast.
  
  We are grateful to Rev. 3 for their comments and for contextualizing our finding. We would also like to point out that, although the 3D surface can be viewed on MorphoSource, permission from the National Museums of Kenya has to be requested for studying the specimen and getting access to the physical specimen and/or the 3D model.
  
  Reviewer #1 (Recommendations For The Authors):
  
  Holloway, Broadfield & Yuan (2004) estimate ER 3732 as having a cranial capacity of 750 cc, which is larger than chimps and australopiths and similar to ER 1470 (752 cc, same reference). (That for Dmanisi 2282 is somewhat smaller at around 650 cc.) Cranial capacities should be mentioned along with added discussion about possible allometric scaling of (increased) numbers of sulci with increasing brain size as well as possible shifts in locations of sulci relative to cranial sutures in larger-brained individuals/species (including due to ontogenetic maturation). Could these variables (especially brain size) be relevant for your discussion/conclusions?
  
  We thank Rev. 1 for their suggestion. We included the estimate by Holloway et al. (2004) (line 95): “Holloway et al. (2004) estimated the endocranial volume as about 750-800 cc but insisted on the low reliability of their estimate.”. Additionally, we raised the possibility of potential allometric effect (line 149): “In parallel, the possibility of allometric scaling and influence of brain size on sulcal patterns in early Homo has to be further explored.” for future discussion.
  
  From the two figures, it appears that the authors produced a virtual endocast from the cranial remains of ER 3732 and compared its features with those seen on a virtual reproduction of the corresponding natural endocast. If so, this needs to be clarified in the text, not just the figures.
  
  We thank Rev. 1 for their suggestions that were integrated.
  
  Reviewer #3 (Recommendations For The Authors):
  
  While the sulcal imprints on the left hemisphere can be interpreted unambiguously, the anatomical assignment of those on the right side may need to be reconsidered, as they are more ambiguous. For example, the postcentral sulcus (pt) almost touches the middle frontal sulcus, which is an unlikely natural configuration.
  
  We agree that the configuration on the right hemisphere is intriguing, especially when compared to the extant human and chimpanzee atlases. As such, we decided to change the label for what we think could be the inferior frontal sulcus and leave a question mark instead.
  
  I encourage the authors to include:
  
  a posterior view in Figure 1, and mark the lambdoid suture, parts of which seem to be preserved especially on the left side. This will help the readership to better understand which parts of the endocranial morphology are preserved.
  
  a scale bar would be of great utility to appreciate the small size of this specimen. The distance from bregma to the Broca cap seems to be short, indicating an endocranial volume much smaller than the published estimate of 750 ccm. Perhaps the authors can provide a new estimate, which would provide further support for the arguments proposed in the discussion section, especially the question of any presence of Australopithecus at Koobi Fora.
  
  We included a posterior view of the specimen and a scale bar in Figure 1 and modified the legend accordingly. Unfortunately, we were not able to identify with certainty the feature that could correspond to the lambdoid suture. We might see the impression where the parietal bone meets the occipital bone, but there is a risk of misidentification (which is an issue frequently raised in the literature, see for example Gunz et al. 2020 Sci Adv). Concerning the endocranial volume, in the revised version of the manuscript we included the estimate by Holloway et al. (2004). Because the specimen only preserves the superior part, we are reluctant to provide an estimate of the total volume. However, we agree that this would be an interesting feature to integrate in the interpretation of this specimen.
  
  Minor points
  
  This sentence needs to be clarified: «The superior temporal sulcus nearly intersects the lateral fissure on the right hemisphere».
  
  The terms «Broca's region» and «orbital cap» need some more context. Do the authors mean «Broca's cap» in either instance?
  
  We clarified/modified when needed, thank you very much.
  
  We included minor corrections in addition to those recommended by the reviewers:
  
  -Lines 50, 74, 142, 149: “Broca’s area” instead of “Broca’s cap”
  
  -Line 73: “in the pre-1.5 Ma Homo specimen” instead of “in pre-1.5 Ma Homo specimen”
  
  -Line 100: we specified “in human brains and endocasts”
  
  -Line 120: “sulcal pattern” instead of “sulcal patterns”
  
  -Line 144: “behaviors” (plural)
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.05.543693v2
www.biorxiv.org www.biorxiv.org

New submission 29/08/2023, 10:19:48

1
1. Public_Reviews 29 Aug 2023
  
  in eLife
  
  Author Response:
  
  Reviewer #1 (Public Review):
  
  [...] Strengths:
  
  The manuscript is well written and the experimental work well executed. It shows that major features of the classical two-component HipAB TA system have somehow been rerouted in the case of the tripartite HipBST. This includes the N-terminal domain of the HipA toxin, which now functions as bona fide antitoxin, and the partly relegated HipB antitoxin, which could only function as a transcription regulator. In addition, this work shows a new mode of inhibition of a kinase toxin and highlights the impact of the phosphorylation state of key toxin residues in controlling the activity of the antitoxin.
  
  Weaknesses:
  
  A major weakness of this work is the lack of data concerning the role of HipB, which likely does not act as an antitoxin. Does it act as a transcriptional regulator of the hipBST operon and to what extent both HipS and HipT contribute to such regulation? These are still open questions.
  
  We thank the reviewer for their feedback and will include a supplementary figure (Figure 1 supplement 2) and accompanying text that shows the transcriptional role of HipB, and how HipS and HipT influence this regulatory effect.
  
  In addition, there is no in-depth structural comparison between the structure of the HipBST solved in the work and the two recent structures of HipBST from Legionella. This is also a major weakness of this work.
  
  A structural comparison to the recent structures from Legionella will be included in the discussion, including Figure 6 supplement 1.
  
URL

biorxiv.org/content/10.1101/2022.01.28.478185v3
www.biorxiv.org www.biorxiv.org

New submission 29/08/2023, 10:22:40

1
1. Public_Reviews 29 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 We thank the reviewers for the constructive and detailed feedback provided. We have adopted the proposed changes to improve the manuscript clarity and accessibility. The following revisions are included in the revised manuscript:
 
 Reviewer #1 (Public Review):
 
 The analytical framework is not sufficiently explained in the main text.
 
 We think the reviewer is referring to the conceptual framework mentioned in the introduction. In the previously submitted manuscript, we did not provide details because the framework is published elsewhere. However, we agree with the reviewer that a short explanation may be helpful, which we have included in the resubmitted manuscript.
 
 The significance of findings in relation to functional changes is not clear. What are the consequences of enrichment of RNA transport or ribosome biogenesis pathways between pesticides and recovery stages, for example?
 
 We thank the reviewer for this suggestion. In the previously submitted manuscript, we included an explanation of the central functions these pathways can alter (e.g. metabolism and infection response). These functions are self-explanatory. However, we have elaborated on the consequence that the disruption of these pathways can cause in the resubmitted manuscript.
 
 The impact of individual biocides and climate variables, and their additive effects, are assessed but there is no information offered on non-additive interactions (e.g., synergistic, antagonistic).
 
 This was a misunderstanding based on our use of the term synergistic in this context. The approach by which we define a synergistic or joint effect of two environmental variables on a taxonomic group is explained in the methods section. This analysis is based on climate variables and biocide types contributing the largest covariances in the correlation analysis explained in Supplementary Fig. 5; Step 4. The combined effect of two environmental variables on a taxon was considered to be significant if the biocide type and the climate variable were each significantly correlated with the taxon over the same time window, and their average Pearson correlation was > 0.5 with padj < 0.05 (SWC analysis with 10,000 permutations). The biocide type and the climate variable were interpreted to have a joint effect on a given taxon if the linear combination of the biocide type and the climate variable had a larger Pearson correlation coefficient than each of the correlations between the family and the biocide type and the family and the climate variable individually, in the same time interval with padj < 0.05 (with 10,000 permutations in the SWC analysis). We realise that the use of synergistic or additive was not correct in this context and have replaced the term synergistic with joint effect throughout the manuscript.
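
The joint-effect criterion described above can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the equal-weight combination of z-scored variables and the toy series are all assumptions made for illustration, and the permutation-based significance step (padj < 0.05) is left out here.

```python
import numpy as np

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    return float(np.corrcoef(np.asarray(x, float), np.asarray(y, float))[0, 1])

def zscore(x):
    """Centre a series on zero and scale it to unit standard deviation."""
    x = np.asarray(x, float)
    return (x - x.mean()) / x.std()

def joint_effect(taxon, biocide, climate, min_mean_r=0.5):
    """Compare the individual correlations with that of a linear combination."""
    r_biocide = pearson(taxon, biocide)
    r_climate = pearson(taxon, climate)
    # Assumption: the "linear combination" is an equal-weight sum of the
    # standardised variables; the source does not state the weights used.
    combined = zscore(biocide) + zscore(climate)
    r_combined = pearson(taxon, combined)
    return {
        "r_biocide": r_biocide,
        "r_climate": r_climate,
        "r_combined": r_combined,
        # Combined effect: average of the two correlations exceeds 0.5
        "combined_effect": (r_biocide + r_climate) / 2 > min_mean_r,
        # Joint effect: the combination correlates better than either alone
        "joint_effect": r_combined > max(r_biocide, r_climate),
    }

# Hypothetical toy series: two uncorrelated drivers and a taxon that follows both
t = 2 * np.pi * np.arange(12) / 12
biocide = np.sin(t)
climate = np.cos(t)
taxon = biocide + climate
result = joint_effect(taxon, biocide, climate)
```

In this toy case the taxon depends equally on both drivers, so the combination tracks it more closely than either driver alone. With real data each correlation would still need its own permutation test and multiple-testing adjustment before being interpreted.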
 
 The level of confidence associated with results is not made explicit. The reader is given no information on the amount of variability involved in the observations, or the level of uncertainty associated with model estimates.
 
 As we didn’t use traditional statistical approaches, confidence level estimation in the traditional sense is not possible. Instead, we used permutation tests and adjusted P-values to identify significant correlations in the data. These approaches are more robust than traditional statistics for integrating and discovering complex, group-wise patterns among high-dimensional datasets. While most forms of machine learning require large sample sizes, sCCA uses fewer observations to identify the most correlated components among data matrices and captures the multivariate variability of the most important features.
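
As a rough illustration of how permutation tests and adjusted P-values can stand in for parametric confidence estimates, the sketch below computes a permutation P-value for a Pearson correlation and then adjusts a set of P-values. The function names are hypothetical, and the Benjamini-Hochberg procedure is an assumption: the response does not say which adjustment method was used.

```python
import numpy as np

def permutation_pvalue(x, y, n_perm=10_000, seed=0):
    """Two-sided permutation P-value for the Pearson correlation of x and y."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, float)
    y = np.asarray(y, float)
    observed = abs(np.corrcoef(x, y)[0, 1])
    hits = 0
    for _ in range(n_perm):
        # Shuffling y breaks any real association with x
        permuted = abs(np.corrcoef(x, rng.permutation(y))[0, 1])
        if permuted >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero
    return (hits + 1) / (n_perm + 1)

def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted P-values, returned in the input order."""
    p = np.asarray(pvals, float)
    n = len(p)
    order = np.argsort(p)
    scaled = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working back from the largest P-value
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.clip(scaled, 0.0, 1.0)
    return adjusted
```

The smallest P-value this estimator can return is 1 / (n_perm + 1), so the number of permutations sets the resolution of the test.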
 
 The major implications of the findings for regulatory ecological assessment are missed. Regulators may not be primarily interested in identifying past "ecosystem shifts". What they need are approaches which give greater confidence in monitoring outcomes by better reflecting the ecological impact of contemporary environmental change and ecosystem management. The real value of the work in this regard is that: (1) it shows that current approaches are inappropriate due to the relatively stable nature of the indicators used by regulators, despite large changes in pollutant inputs; (2) it presents some better alternatives, including both taxonomic and functional indicators; and (3) it provides a new reference (or baseline) for regulators by characterizing "semi-pristine" conditions.
 
 We thank the reviewer for this suggestion, which we have included in the main text (L451-461).
 
 Reviewer #2 (Public Review):
 
 Results - They are brief and should expand some more. Particularly, there are no results regarding metabarcoding data (number of reads, filtering etc.). These details are important to know the quality of the data which represents the bulk of the analyses. Even the supplementary material gives little information on the metabarcoding results (e.g. number of ASVs - whether every ASV of each family were pooled etc.).
 
 We thank the reviewer for this suggestion. We have included a paragraph in the results reporting read numbers and other statistics. The filtering criteria and handling of samples can be found in the methods (L658-661; L670-675). As explained in the methods, taxonomy was assigned using qiime feature-classifier classify-sklearn and used at family level where possible. When classification was not possible at family level because of incomplete or missing information in the online database or a poor match to the reference database, the lowest classification possible was used.
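
The family-level fallback rule can be sketched in a few lines of Python. The sketch assumes semicolon-separated taxonomy strings with SILVA-style rank prefixes (d__, p__, c__, o__, f__), which is one common output format but is not confirmed by the response. The function name and the example strings are hypothetical.

```python
# Ranks at or above family, from most to least specific
FALLBACK_ORDER = ["f__", "o__", "c__", "p__", "d__"]

def family_or_lowest(taxonomy):
    """Return (rank_prefix, name) at family level, else the lowest assigned rank."""
    assigned = {}
    for part in taxonomy.split(";"):
        part = part.strip()
        for prefix in FALLBACK_ORDER:
            # A bare prefix such as "f__" means the rank was left unassigned
            if part.startswith(prefix) and len(part) > len(prefix):
                assigned[prefix] = part[len(prefix):]
    for prefix in FALLBACK_ORDER:
        if prefix in assigned:
            return prefix, assigned[prefix]
    return None, "Unassigned"

# Hypothetical classifier outputs
full = "d__Bacteria; p__Nitrospirota; c__Nitrospiria; o__Nitrospirales; f__Nitrospiraceae"
partial = "d__Bacteria; p__Proteobacteria; c__Gammaproteobacteria; o__; f__"
```

Here `family_or_lowest(full)` resolves to the family, while `family_or_lowest(partial)` falls back to the class because neither order nor family was assigned.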
 
 The drivers of biodiversity change section could be restructured and include main text tables showing the families positively or negatively correlated with the different variables (akin to table S2 but simplified).
 
 As there are over 180 unique families/taxonomic units correlated with at least one biocide or environmental variable, a simplified version of this table would be too large to include in the main text. Therefore, we prefer to keep this information in supplementary table 2 complete with correlation statistics.
 
 We thank the reviewers for providing detailed feedback on the manuscript and respond to their suggestions as follows:
 
 Reviewer #1 (Recommendations For The Authors):
 
 Thank you for the opportunity to review your manuscript, which I found interesting and enjoyable to read. Here are some suggestions for improving it.
 
 Remove spaces before citations in text.
 
 Lines 51-53: "Community-level biodiversity reliably explained freshwater ecosystem shifts whereas traditional quality indices (e.g. Trophic Diatom Index) and physicochemical parameters proved to be poor metrics for these shifts." Seems to be the wrong way around / not clear???
 
 Rephrased to clarify.
 
 Line 54: Should be "...advocates the use of..." or "...demonstrates the advantages of..."
 
 Done, thanks for the suggestion.
 
 Line 62: Spell out numbers <10, i.e. "sixth mass extinction"
 
 Done, thank you.
 
 Lines 66-72: These sentences lack clarity. It's not clear that "experimental manipulation of biodiversity" hasn't involved investigation of "multi-trophic changes". By the third of these four sentences it is not clear what "they" is referring to. And in the fourth sentence, "these holistic studies" are not defined. Perhaps it would suffice to say that experiments have so far focused primarily on a single trophic level and largely neglected freshwater systems.
 
 We have rephrased to improve clarity.
 
 Line 81: Delete unnecessary bracket
 
 Done, thank you.
 
 Line 82: "a minority of freshwater ecosystems" sounds as if you're saying that few freshwater ecosystems are represented in BioTIME, which seems obvious and would also apply to terrestrial and marine systems. Do you mean that freshwater ecosystems are not well represented in the data?
 
 We have clarified the sentence, thanks.
 
 Line 106: Resolve issue with citation in text at the end of the sentence (repeated at line 109 and possibly other lines).
 
 Done, thank you.
 
 Line 116: By ">1999s" do you mean 1990s?
 
 This was a typo; it was supposed to be >1999.
 
 Line 120: The reader would benefit greatly from a brief explanation of explainable network models and multimodal learning in the introduction. Why are these the right tools to use? How do they work in this context? Figure 1 helps to some extent but needs more commentary in the text.
 
 We have included an explanation of the explainable network models and multimodal learning and how their use can be beneficial to the study of diverse data types.
 
 Line 144: Here and throughout the text the language could be much more efficient and readable. "Alpha diversity" does not require a definite article. Furthermore, when referring to significance it is convention to state the p-value, test statistic and test used.
 
 As there are different p-values for each barcode, we have included them in the legend to Supplementary Fig. 1 to avoid crowding the main text. We prefer to leave the text unchanged for this reason.
 
 Line 155: "The primary producer's composition" is grammatically awkward and less suitable than "the composition of primary producers". This kind of awkwardness occurs again at line 285 ("diatom's") and possibly in other parts of the manuscript.
 
 Thanks, corrected.
 
 Line 169: The statement that this family was "relatively more abundant" needs a little more explanation. What is it relative to - other groups or to previous stages?
 
 More abundant than in the other phases – the sentence has been modified.
 
 Line 179: Nested brackets are unnecessary and affect readability. This could simply be a new sentence, i.e. "For example, Nitrospiraceae (nitrite oxidizers)..."
 
 Done, thanks.
 
 Line 215: "Functional biodiversity", which implies that some biodiversity is functional and some not, does not seem an appropriate term to describe the results you present in this section. Simply "functioning of the prokaryotic community" would suffice.
 
 Thanks, done.
 
 Line 214-233: This section may be inaccessible for many readers. For example, what are Kegg Orthologs and what role do they play in the functioning of a lake ecosystem? The explanation comes later in the paragraph but there needs to be a gentler introduction before diving into specific technical concepts.
 
 We appreciate this comment and have included a short explanation of what KEGG and KO terms mean.
 
 Supplementary Figure 3: It would be helpful to superimpose the lake stages here, as done in Figure 2.
 
 The figure has been updated with coloured data points corresponding to each phase, as in supplementary figure 1.
 
 Line 265: Should be "19 of which were identified..."
 
 Done, thanks.
 
 Line 284: "Predominantly" rather than "prominently"?
 
 Done
 
 Line 242-316: This section is good in that it identifies and ranks individual biocides and climate variables but there is no information on non-additive interactions (e.g., synergistic, antagonistic). Could the authors at least comment on why this was not done or not necessary, and what uncertainties this omission could introduce into the results?
 
 This was a misunderstanding based on our use of the term synergistic in this context. The approach by which we define a synergistic or joint effect of two environmental variables on a taxonomic group is explained in the methods section. This analysis is based on climate variables and biocide types contributing the largest covariances in the correlation analysis explained in Supplementary Fig. 5; Step 4. The combined effect of two environmental variables on a taxon was considered to be significant if the biocide type and the climate variable were each significantly correlated with the taxon over the same time window, and their average Pearson correlation was > 0.5 with padj < 0.05 (SWC analysis with 10,000 permutations) – this is shown in Supplementary Fig. 5; Step 6. The biocide type and the climate variable were interpreted to have an additive effect on a given taxon if the linear combination of the biocide type and the climate variable had a larger Pearson correlation coefficient than each of the correlations between the family and the biocide type and the family and the climate variable individually, in the same time interval with padj < 0.05 (with 10,000 permutations in the SWC analysis). We have replaced synergistic with joint effect to avoid confusion.
 
 Figure 4: These 3-D plots are very hard to read. Without additional features (e.g. shadows on each plane, or lines connecting points to planes) it is impossible for the viewer to tell where the points are located on each axis.
 
 We have created interactive 3D plots here: https://environmental-omicsgroup.github.io/Biodiversity_Monitoring/.
 
 Figure 5: Legend entry should be "summer precipitation" not "precipitations". "Additive effect" rather than "joint effect" would be more consistent with the main text.
 
 “Precipitations” has been updated to “precipitation” where relevant throughout. We left ‘joint effect’ and unified the main text, responding to a previous comment of this reviewer on the meaning of synergistic effects in our study.
 
 Line 348: Doesn't your approach also require specialist skills? I often feel that the "traditional" versus "molecular" monitoring debate misses this point. Some comment on the training and development needs for those interested in applying the sedaDNA approach would be welcome. Otherwise it is an unfair comparison.
 
 Whereas the application of high throughput sequencing technologies requires training, these technologies are well established with publicly available standard operating procedures. As compared to direct observations, high throughput sequencing provides replicable results regardless of the operator. Moreover, the application of metabarcoding to sedaDNA or more generally eDNA can be outsourced to established environmental services, removing the need for training if it is a limiting factor. The above has been included in discussion.
 
 Line 391: "Significantly did" what? "Did significantly change over time" would be better.
 
 Done, thanks.
 
 Line 407: Should be "an indicator of..." and "did not significantly change over time..."
 
 Done, thanks.
 
 Line 408-410: Regulators are not necessarily interested in identifying past "ecosystem shifts", so this does not seem to be the best way to contrast the capabilities of the sedaDNA approach with those of LTDI2. The real value of this work, in my opinion, is threefold. First, it shows that the reliance on diatoms as indicators of ecological status is inappropriate due to the relatively stable nature of diatom communities in the face of large environmental changes. Second, it presents some better alternatives, including both taxonomic and functional indicators. And third, it provides a new reference point for regulators by characterising "semi-pristine" conditions.
 
 Thanks for the insightful suggestion. We agree with the reviewer on the advantages and have spelled them out in the resubmitted manuscript.
 
 Line 445: What are "housekeeping functions"? I checked the Cuenca-Cambronero paper cited but did not find the term there.
 
 Housekeeping functions are essential basic cellular functions that are evolutionarily conserved. They are more commonly present in public databases because they have been characterised in a number of model species (e.g. Drosophila, C. elegans and Mus musculus). Our reference is not to the Cuenca-Cambronero paper, but to Mi et al., describing the reference database PANTHER. We included the definition of housekeeping functions in the main text.
 
 Line 449: Briefly state the main functional changes found here.
 
 Examples have been included.
 
 Lines 451-452: Whilst this statement may be found in the cited source, most readers I suspect would not identify with it. Indeed, one could argue that most of freshwater ecology has been dedicated to this very task (documenting chemical impacts on biodiversity)! A more balanced view is needed here.
 
 The sentence the reviewer refers to also includes a reference to climate change. Climate change and chemical pollution are the two most common causes of biodiversity loss, and not only in freshwater ecosystems.
 
 Lines 463-466: These examples both point to non-additive (synergistic) effects, which were not assessed in the current study.
 
 Please refer to our explanation above about the inappropriate use of synergistic and, here, additive. We have altered the text throughout to use joint effects as we do not investigate synergistic, antagonistic and additive effects as traditionally described in ecology.
 
 Lines 472-474: This sentence is unclear. Do you mean that this approach surpasses others in terms of reliability? If so, I don't believe this has been demonstrated in the paper.
 
 We apologise. The word ‘reliability’ should not have been in the text. We have improved the clarity of this sentence.
 
 Lines 474-482: In these sentences it is unclear whether or not you are talking about your method or contrasting it with another method(s). If the latter, which method or methods are you referring to?
 
 We have fixed this sentence to better reflect that our algorithm provides a high degree of confidence that surpasses state-of-the-art analyses, which predominantly identify patterns of co-occurrence of taxa within communities (e.g. Correlation-Centric Network).
 
 Line 631: Should be "Physico-chemical variables". I have not extensively checked the rest of the methods for such errors.
 
 Thank you, the text has been changed where present.
 
 Reviewer #2 (Recommendations For The Authors):
 
 Introduction Line 80 remove extra ')'
 
 Done, thank you.
 
 Line 81 rephrase e.g includes few freshwater ecosystems
 
 We also modified this sentence in response to Reviewer #1.
 
 Line 83 although, instead of whereas?
 
 Done, thanks.
 
 Line 106 formatting reference issue
 
 Line 109 same as above
 
 Thank you, noted.
 
 Results
 
 Line 141 - 144 how was the sampling of the sediment performed over the 100 year core? Every year? Every 5 years? Or were they pooled to represent the (as of yet unlisted) phases?
 
 The reviewer is correct that details are not provided here. They are in methods. We have added some text to explain the basic concepts of how the core was obtained and sliced and refer the reader to the method section for more details.
 
 Line 154 the authors have not yet explicitly listed the lake phases, so it is difficult to refer to them now.
 
 Noted, the addition of a short explanation at the beginning of the results section should take care of this issue.
 
 Line 216 - may be worth briefly explaining KEGG orthologs and how these relate to functional biodiversity.
 
 We thank the reviewer. Also responding to a similar comment from Reviewer #1, we included a description of KO terms and their links to functional biodiversity.
 
 Lines 249 - 260 instead of a supplementary table, it could remain in the main text
 
 Supplementary table 2 is a multi-tab table including information for each region amplified here. It is not possible to include this table in the main text.
 
 Materials and Methods Due to the formatting of the manuscript (results & discussion before materials and methods), many of the results are not clearly understood without having to visit the M&M section. Particularly, how the biocide types were obtained (Historic records plus persistence of DDT in sediments). This could be resolved by including a few sentences on how the data was gathered in the results section. Overall, materials and methods are sufficient, however, it is not clear how many of the 37 metabarcoding samples correspond to which of the lake phases. Finally, I suggest a better organization of M&Ms by having subheadings for each section. For example, under Biodiversity fingerprinting across 100 years, one subheading could be DNA extraction and sequencing, another subheading could be bioinformatics.
 
 We thank the reviewer for the suggestion. To alleviate the issues linked to the methods section coming after the results section, we have introduced a short explanation of the sediment core and the lake phases at the beginning of the results section. A description of the climate and chemical data has been included at the beginning of the section ‘Drivers of biodiversity change’ in the results. Subheadings were introduced in the methods as suggested.
 
URL

biorxiv.org/content/10.1101/2023.02.26.530075v2
www.biorxiv.org www.biorxiv.org

New submission 29/08/2023, 10:16:21

1
1. Public_Reviews 29 Aug 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 In the best genetically and biochemically understood model of eukaryotic DNA replication, the budding yeast, Saccharomyces cerevisiae, the genomic locations at which DNA replication initiates are determined by a specific sequence motif. These motifs, or ARS elements, are bound by the origin recognition complex (ORC). ORC is required for loading of the initially inactive MCM helicase during origin licensing in G1. In human cells, ORC does not have a specific sequence binding domain and origin specification is not specified by a defined motif. There have thus been great efforts over many years to try to understand the determinants of DNA replication initiation in human cells using a variety of approaches, which have gradually become more refined over time.
 
 In this manuscript Tian et al. combine data from multiple previous studies using a range of techniques for identifying sites of replication initiation to identify conserved features of replication origins and to examine the relationship between origins and sites of ORC binding in the human genome. The authors identify a) conserved features of replication origins e.g. association with GC-rich sequences, open chromatin, promoters and CTCF binding sites. These associations have already been described in multiple earlier studies. They also examine the relationship of their determined origins and ORC binding sites and conclude that there is no relationship between sites of ORC binding and DNA replication initiation. While the conclusions concerning genomic features of origins are not novel, if true, a clear lack of colocalization of ORC and origins would be a striking finding.
 
 Thank you. That is where the novelty of the paper lies.
 
 However, the majority of the datasets used do not report replication origins, but rather broad zones in which replication origins fire. Rather than refining the localisation of origins, the approach of combining diverse methods that monitor different objects related to DNA replication leads to a base dataset that is highly flawed and cannot support the conclusions that are drawn, as explained in more detail below.
 
 We are using the narrowly defined SNS-seq peaks as the gold standard origins and making sure to focus in on those that fall within the initiation zones defined by other methods. The objective is to make a list of the most reproducible origins. Unlike what the reviewer states, this actually refines the dataset to focus on the SNS origins that have also been reproduced by the other methods in multiple cell lines. We will change the last box of Fig. 1A to say: Identify reproducible SNS-seq origins that are contained in IZs defined by Repli-seq, OK-seq and Bubble-seq. These are the “shared origins”. This and the Fig. 2B (as it is) will make our strategy clearer.
 
 Methods to determine sites at which DNA replication is initiated can be divided into two groups based on the genomic resolution at which they operate. Techniques such as bubble-seq, ok-seq can localise zones of replication initiation in the range ~50kb. Such zones may contain many replication origins. Conversely, techniques such as SNS-seq and ini-seq can localise replication origins down to less than 1kb. Indeed, the application of these different approaches has led to a degree of controversy in the field about whether human replication does indeed initiate at discrete sites (origins), or whether it initiates randomly in large zones with no recurrent sites being used. However, more recent work has shown that elements of both models are correct i.e. there are recurrent and efficient sites of replication initiation in the human genome, but these tend to be clustered and correspond to the demonstrated initiation zones (Guilbaud et al., 2022).
 
 These different scales and methodologies are important when considering the approach of Tian et al. The premise that combining all available data from five techniques will increase accuracy and confidence in identifying the most important origins is flawed for two principal reasons. First, as noted above, of the different techniques combined in this manuscript, only SNS-seq can actually identify origins rather than initiation zones. It is the former that matters when comparing sites of ORC binding with replication origin sites if a conclusion is to be drawn that the two do not co-localise.
 
 Exactly. So the reviewer should agree that our method of finding SNS-seq peaks that fall within initiation zones actually refines the origins to find the most reproducible origins. We are not losing the spatial precision of the SNS-seq peaks.
 
 Second, the authors give equal weight to all datasets. Certainly, in the case of SNS-seq, this is not appropriate. The technique has evolved over the years and some earlier versions have significantly different technical designs that may impact the reliability and/or resolution of the results. For example, in Foulk et al. (2015), lambda exonuclease was added to single-stranded DNA from a total genomic preparation rather than to purified nascent strands, which may lead to significantly different digestion patterns (i.e. underdigestion). Curiously, the authors do not make the best use of the largest SNS-seq dataset (Akerman et al., 2020) by ignoring these authors' separation of core and stochastic origins. By blending all data together any separation of signal and noise is lost. Further, I am surprised that the authors have chosen not to use data and analysis from a recent study that provides subsets of the most highly used and efficient origins in the human genome, at high resolution (Guilbaud et al., 2022).
 
 1) We are using the data from Akerman et al., 2020: Dataset GSE128477 in Supplemental Table 1. We can examine the core origins defined by the authors to check its overlap with ORC binding.
 
 2) To take into account the refinement of the SNS-seq methods through the years, we actually included in our study only those SNS-seq studies after 2018, well after the lambda exonuclease method was introduced. Indeed, all 66 of SNS-seq datasets we used were obtained after the lambda exonuclease digestion step. To reiterate, we recognize that there may be many false positives in the individual origin mapping datasets. Our focus is on the True positives, the SNS-seq peaks that have some support from multiple SNS-seq studies AND fall within the initiation zones defined by the independent means of origin mapping (described in Fig. 1A and 2B). These True positives are most likely to be real and reproducible origins and should be expected to be near ORC binding sites.
 
 We will change the last box of Fig. 1A to say: Identify reproducible SNS-seq origins that are contained in IZs defined by Repli-seq, OK-seq and Bubble-seq. These are the “Shared origins”.
 
 Ini-seq by Torsten Krude and co-workers (Guilbaud et al., 2022) does NOT use Lambda exonuclease digestion. So using Ini-seq defined origins is at odds with the suggestion above that we focus only on SNS-seq datasets that use Lambda exonuclease. However, Ini-seq identifies a much smaller subset of SNS-seq origins, so we will do the analysis with just that smaller set in the revision of the paper.
 
 References:
 
 Akerman I, Kasaai B, Bazarova A, Sang PB, Peiffer I, Artufel M, Derelle R, Smith G, Rodriguez-Martinez M, Romano M, Kinet S, Tino P, Theillet C, Taylor N, Ballester B, Méchali M (2020) A predictable conserved DNA base composition signature defines human core DNA replication origins. Nat Commun, 11: 4826
 
 Foulk MS, Urban JM, Casella C, Gerbi SA (2015) Characterizing and controlling intrinsic biases of lambda exonuclease in nascent strand sequencing reveals phasing between nucleosomes and G-quadruplex motifs around a subset of human replication origins. Genome Res, 25: 725-735
 
 Guilbaud G, Murat P, Wilkes HS, Lerner LK, Sale JE, Krude T (2022) Determination of human DNA replication origin position and efficiency reveals principles of initiation zone organisation. Nucleic Acids Res, 50: 7436-7450
 
 Reviewer #2 (Public Review):
 
 Tian et al. perform a meta-analysis of 113 genome-wide origin profile datasets in humans to assess the reproducibility of experimental techniques and shared genomics features of origins. Techniques to map DNA replication sites have quickly evolved over the last decade, yet little is known about how these methods fare against each other (pros and cons), nor how consistent their maps are. The authors show that high-confidence origins recapitulate several known features of origins (e.g., correspondence with open chromatin, overlap with transcriptional promoters, CTCF binding sites). However, surprisingly, they find little overlap between ORC/MCM binding sites and origin locations.
 
 Overall, this meta-analysis provides the field with a good assessment of the current state of experimental techniques and their reproducibility, but I am worried about: (a) whether we've learned any new biology from this analysis; (b) how binding sites and origin locations can be so mismatched, in light of numerous studies that suggest otherwise; and (c) some methodological details described below.
 
 Major comments:
 
 Line 26: "0.27% were reproducibly detected by four techniques" -- what does this mean? Does the fragment need to be detected by ALL FOUR techniques to be deemed reproducible?
 
 If the reproducible SNS-seq peaks are included in the reproducible initiation zones found by the other methods, then we consider it reproducible across datasets. The strategy is to focus our analysis on the most reproducible SNS-seq peaks that happen to be in reproducible initiation zones. It is the best way to confidently identify a very small set of true positive origins.
 
 And what if the technique detected the fragment in only 1 of N experiments conducted; does that count as "detected"?
 
 A reproducible SNS-seq origin has been reproduced above a statistical threshold of 20 reproductions. A threshold of reproduction in 20 datasets out of 66 SNS-seq datasets gives an FDR of <0.1. This is explained in Fig. 2a and Supplementary Fig. S2. For the initiation zones, we considered a Zone even if it appears in only 1 of N experiments, because N is usually small. This relaxed method for selecting the initiation zones gives the best chance of finding SNS-seq peaks that are reproduced by the other methods.
 
 Later in Methods, the authors (line 512) say, "shared origins ... occur in sufficient number of samples" but what does sufficient mean?
 
 Sufficient means that an SNS-seq origin was reproducibly detected in ≥ 20 datasets and was included in any initiation zone defined by the three other techniques.
 
 Then on line 522, they use a threshold of "20" samples, which seems arbitrary to me. How are these parameters set, and how robust are the conclusions to these settings? An alternative to setting these (arbitrary) thresholds and discretizing the data is to analyze the data continuously; i.e., associate with each fragment a continuous confidence score.
 
 We explained Fig. 2a and Supplementary Fig. S2 in the text as follows: The occupancy score of each origin defined by SNS-seq (Supplementary Fig. 2a) counts the frequency at which a given origin is detected in the datasets under consideration. For the random background, we assumed that the number of origins confirmed by increasing occupancy scores decreases exponentially (see Methods and Supplementary Table 2). Plotting the number of origins with various occupancy scores when all SNS-seq datasets published after 2018 are considered together (the union origins) shows that the experimental curve deviates from the random background at a given occupancy score (Fig. 2a). The threshold occupancy score of 20 is the point where the observed number of origins deviates from the expected background number (with an FDR < 0.1) (Fig. 2a). In the Methods, we add: in other words, the number of observed origins with occupancy score greater than 20 is 10 times more than expected in the background model. This approach is statistically sound and described by us in (Fang et al. 2020).
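
 The thresholding logic described in this response can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: it assumes a binary origin-by-dataset matrix, fits the exponential background to the low-occupancy counts only (the `fit_max` cutoff is a hypothetical choice), and reports the smallest occupancy score at which the expected-to-observed ratio in the tail drops below the FDR. The function names are invented for this sketch; the actual procedure is the one given in the paper's Methods and in Fang et al. 2020.

```python
import numpy as np

def occupancy_scores(presence):
    """presence: (n_origins, n_datasets) 0/1 array; returns detections per origin."""
    return np.asarray(presence).sum(axis=1)

def occupancy_threshold(scores, n_datasets, fit_max=5, fdr=0.1):
    """Smallest occupancy score k where expected/observed origins with score >= k is below fdr."""
    # Observed number of origins at each occupancy score 1..n_datasets.
    observed = np.bincount(scores, minlength=n_datasets + 1)[1:].astype(float)
    k = np.arange(1, n_datasets + 1)
    # Assumed background: counts decay exponentially with occupancy score.
    # Fit log(count) = a + b*k on the low-occupancy range only (fit_max is hypothetical).
    low = (k <= fit_max) & (observed > 0)
    b, a = np.polyfit(k[low], np.log(observed[low]), 1)
    expected = np.exp(a + b * k)
    # Tail sums: origins with occupancy >= k, observed vs expected under the background.
    obs_tail = observed[::-1].cumsum()[::-1]
    exp_tail = expected[::-1].cumsum()[::-1]
    ok = (obs_tail > 0) & (exp_tail / np.maximum(obs_tail, 1) < fdr)
    return int(k[ok][0]) if ok.any() else None
```

 With an FDR of 0.1, the returned score is the first one at which observed origins outnumber the background expectation by more than ten to one, which matches the "10 times more than expected" wording in the response.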
 
 Line 20: "50,000 origins" vs "7.5M 300bp chromosomal fragments" -- how do these two numbers relate? How many 300bp fragments would be expected given that there are ~50,000 origins? (i.e., how many fragments are there per origin, on average)? This is an important number to report because it gives some sense of how many of these fragments are likely nonsense/noise. The authors might consider eliminating those fragments significantly above the expected number, since their inclusion may muddle biological interpretation.
 
 I think we confused the reviewer by the way we wrote the abstract. The 50,000 origins mentioned in the abstract is the hypothetical number of origins expected to fire to replicate the whole 6 × 10^9 base diploid genome, based on the average inter-origin distance of 10^5 bases (as determined by molecular combing). The 7.5M 300 bp fragments are the genomic regions where the 7.5M union SNS-seq-defined origins are located. Clearly, that is a lot of noise, some of it technical and some due to the fact that origins fire stochastically. That is why our paper focuses on a smaller number of reproducible origins, the 20,250 shared origins. Our analysis is on the 20,250 shared origins, and not on all 7.5M union origins. Thus, we are not including the excess of non-reproducible (stochastic?) origins in our analysis.
 
 The revised abstract in the revised paper will say: “Based on experimentally determined average inter-origin distances of ~100 kb, DNA replication initiates from ~50,000 origins on human chromosomes in each cell-cycle. The origins are believed to be specified by binding of factors like the Origin Recognition Complex (ORC) or CTCF or other features like G-quadruplexes. We have performed an integrative analysis of 113 genome-wide human origin profiles (from five different techniques) and 5 ORC-binding site datasets to critically evaluate whether the most reproducible origins are specified by these features. Out of ~7.5 million union origins identified by 66 SNS-seq datasets, only 0.27% were reproducibly contained in initiation zones identified by three other techniques (20,250 shared origins), suggesting extensive variability in origin usage and identification in different circumstances.”
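
 The rounded figures quoted in this exchange can be checked with simple arithmetic. The short Python sketch below uses only numbers stated in the response and the revised abstract; the variable names are illustrative. It also computes the ratio the reviewer asked about, union fragments per expected origin, which the response does not state explicitly.

```python
# Rounded figures quoted in the response and the revised abstract.
union_origins = 7_500_000    # union SNS-seq origins (300 bp fragments)
expected_origins = 50_000    # origins expected to fire per cell cycle
shared_per_10k = 27          # 0.27% expressed as parts per 10,000

# Integer arithmetic avoids floating-point rounding in the percentage.
shared_origins = union_origins * shared_per_10k // 10_000
fragments_per_origin = union_origins // expected_origins

print(shared_origins)        # 20250
print(fragments_per_origin)  # 150
```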
 
 Line 143: I'm not terribly convinced by the PCA clustering analysis, since the variance explained by the first 2 PCs is only ~25%. A more robust analysis of whether origins cluster by cell type, year etc is to simply compute the distribution of pairwise correlations of origin profiles within the same group (cell type, year) vs the correlation distribution between groups. Relatedly, the authors should explain what an "origin profile" is (line 141). Is the matrix (to which PCA is applied) of size 7.5M x 113, with a "1" in the (i,j) position if the ith fragment was detected in the jth dataset?
 
 The reviewer is correct about how we did the PCA, and we have now included the description in the Methods. We will also do the pairwise correlations the way the reviewer suggests (a) by techniques, (b) by cell types (SNS-seq), (c) by year of publication (SNS-seq).
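
 The comparison the reviewer proposes, within-group versus between-group correlations of origin profiles, could look roughly like the sketch below. It assumes the binary fragment-by-dataset matrix the reviewer describes and uses Pearson correlation between dataset columns; the function name and the choice of correlation measure are assumptions made for illustration, not details taken from the manuscript.

```python
import numpy as np
from itertools import combinations

def within_between_correlations(profiles, groups):
    """profiles: (n_fragments, n_datasets) 0/1 matrix; groups: one label per dataset.

    Returns two arrays of pairwise Pearson correlations between dataset columns:
    pairs sharing a group label, and pairs with different labels.
    """
    corr = np.corrcoef(np.asarray(profiles, dtype=float).T)  # dataset-by-dataset
    within, between = [], []
    for i, j in combinations(range(len(groups)), 2):
        (within if groups[i] == groups[j] else between).append(corr[i, j])
    return np.array(within), np.array(between)
```

 If datasets cluster by technique, cell type, or year, the within-group distribution should sit clearly above the between-group one. Note that a dataset column with no variation (all 0 or all 1) has an undefined correlation and would need to be excluded first.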
 
 It's not clear to me what new biology (genomic features) has been learned from this meta-analysis. All the major genomic features analyzed have already been found to be associated with origin sites. For example, the correspondence with TSS has been reported before:
 
 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320713/
 
 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6547456/
 
 So what new biology has been discovered from this meta-analysis?
 
 The new biology can be summarized as: (a) We can identify a set of reproducible (in multiple datasets and in multiple cell lines) SNS-seq origins that also fall within initiation zones identified by completely independent methods. These may be the best origins to study in the midst of the noise created by stochastic origin firing. (b) The overlap of these True Positive origins with known ORC binding sites is tenuous. So either all the origin mapping data, or all the ORC binding data has to be discarded, or this is the new biological reality in mammalian cancer cells: on a genome-wide scale the most reproduced origins are not in close proximity to ORC binding sites, in contrast to the situation in yeast. (c) All the features that have been reported to define origins (CTCF binding sites, G quadruplexes etc.) could simply follow from the fact that those features also define transcription start sites (TSS), and origins prefer to be near TSS because of the favorable chromatin state.
 
 Line 250: The most surprising finding is that there is little overlap between ORC/MCM binding sites and origin locations. The authors speculate that the overlap between ORC1 and ORC2 could be low because they come from different cell types. Equally concerning is the lack of overlap with MCM. If true, these are potentially major discoveries that butt heads with numerous other studies that have suggested otherwise. More needs to be done to convince the reader that such a mismatch is true. Some ideas are below:
 
 Idea 1) One explanation given is that the ORC1 and ORC2 data come from different cell types. But there must be a dataset where both are mapped in the same cell type. Can the authors check the overlap here? In Fig S4A, I would expect the circles to not only strongly overlap but to also be of roughly the same size, since both ORC's are required in the complex. So something seems off here.
 
 We agree with the reviewer that there is something “off here”. Either the techniques that report these sites are all wrong, or the biology does not fit into the prevailing hypothesis. One secret in the ORC ChIP field that our lab has struggled with for quite some time is that the various ORC subunits do not necessarily ChIP-seq to the same sites. The poor overlap between the binding sites of subunits of the same complex suggests either that the subunits do not always bind to the chromatin as a six-subunit complex or that all the ChIP-seq data in the literature is suspect. We provide in Supplementary Fig. S4A examples of true positive complexes (SMARCA4/ARID1A, SMC1A/SMC3, EZH2/SUZ12), whose subunits ChIP-seq to a large fraction of common sites. As shown in Supplementary Fig. S4C, we do not have ORC1 and ORC2 ChIP-seq data from the same cell type. We have ORC1 ChIP-seq and SNS-seq data from HeLa cells and ORC2 ChIP-seq and origins from K562 cells, and so will add the proximity/overlap of the binding sites to the origins in the same cell type in the revision.
 
 Idea 2) Another explanation given is that origins fire stochastically. One way to quantify the role of stochasticity is to quantify the overlap of origin locations performed by the same lab, in the same year, in the same experiment, in the same cell type -- i.e., across replicates -- and then compute the overlap of mapped origins. This would quantify how much mis-match is truly due to stochasticity, and how much may be due to other factors.
 
 A given lab may have superior reproducibility compared to the entire field. But the notion of stochasticity is well accepted in the field because of this observation: the average inter-origin distance measured by single molecule techniques like molecular combing is ~100 kb, but the average inter-origin distance measured on a population of cells (same cell line) is ~30 kb. The only explanation is that in a population of cells many origins can fire, but in a given cell on a given allele, only one-third of those possible origins fire. This is why we did not worry about the lack of reproducibility between cell lines, labs etc., but instead focused on those SNS-seq origins that are reproducible over multiple techniques and cell lines.
 
 Idea 3) A third explanation is that MCMs are loaded further from origin sites in human than in yeast. Is there any evidence of this? How far away does the evidence suggest, and what if this distance is used to define proximity?
 
 MCMs, of course, have to be loaded at an origin at the time the origin fires because MCMs provide the core of the helicase that starts unwinding the DNA at the origin. Thus, the lack of proximity of MCM binding sites with origins can be because the most detected MCM sites (where MCM spends the most time in a cell population) do not correspond to where it is first active to initiate origin firing. This has been discussed. MCMs may be loaded far from an origin site, but because of their ability to move along the chromatin, they have to move to the origin site at some point to fire the origin.
 
 Idea 4) How many individual datasets (i.e., those collected and published together) also demonstrate the feature that ORC/MCM binding locations do not correlate with origins? If there are few, then indeed, the integrative analysis performed here is consistent. But if there are many, then why would individual datasets reveal one thing, but integrative analysis reveal something else?
 
 We apologize for this oversight. In the revised manuscript we will discuss PMC3530669, PMC7993996, PMC5389698, PMC10366126. None of them have addressed what we are addressing, which is whether the small subset of the most reproducible origins is proximal to ORC or MCM binding sites, but the discussion is essential.
 
 Idea 5) What if you were much more restrictive when defining "high-confidence" origins / binding sites. Does the overlap between origins and binding sites go up with increasing restriction?
 
 We will make origins more restrictive by selecting those reproduced by 30-60 datasets. The number of origins will of course fall, but we will measure whether the proximity to ORC or MCM-binding sites increases/decreases in a statistically rigorous way.
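
 One way to picture the planned analysis is to recompute, at each occupancy threshold, the fraction of retained origins that lie near a binding site. The Python sketch below does this for midpoints on a single chromosome. The 1 kb proximity window, the function names, and the input layout are all hypothetical choices for illustration, and the sketch reports raw fractions only; the statistical test the authors mention is not shown.

```python
import bisect

def fraction_near_sites(origins, sites, max_dist=1000):
    """Fraction of origin midpoints within max_dist bp of any site midpoint (one chromosome)."""
    sites = sorted(sites)
    if not origins or not sites:
        return 0.0
    near = 0
    for pos in origins:
        i = bisect.bisect_left(sites, pos)
        # The nearest site is either the one just before or just after pos.
        candidates = sites[max(i - 1, 0):i + 1]
        if min(abs(pos - s) for s in candidates) <= max_dist:
            near += 1
    return near / len(origins)

def proximity_by_threshold(origin_scores, sites, thresholds=(20, 30, 40, 50, 60), max_dist=1000):
    """origin_scores: {midpoint: occupancy score}. Returns {threshold: (n_origins, fraction_near)}."""
    out = {}
    for t in thresholds:
        kept = [p for p, s in origin_scores.items() if s >= t]
        out[t] = (len(kept), fraction_near_sites(kept, sites, max_dist))
    return out
```

 If proximity to ORC or MCM sites reflected origin quality, the fraction would be expected to rise as the threshold increases while the number of retained origins falls.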
 
 Overall, I have the sense that these experimental techniques may be producing a lot of junk. If true, this would be useful for the field to know! But if not, and there are indeed "unexplored mechanisms of origin specification" that would be exciting. But I'm not convinced yet.
 
 It would be nice in the Discussion for the authors to comment about the trade-offs of different techniques; what are their pros and cons, which should be used when, which should be avoided altogether, and why? This would be a valuable prescription for the field.
 
 Thanks for the suggestion. We will do what the reviewer suggests: use cell type-specific data wherever origins have been defined by at least two methods in the same cell type, specifically reporting the percent of shared origins amongst the datasets to compare whether some methods correlate better with each other. ORC ChIP-seq and MCM ChIP-seq data do not define origins: they define the binding sites of these proteins. Thus we will discuss why the ChIP-seq sites of these protein complexes should not be used to define origins.
 
 Reviewer #3 (Public Review):
 
 Summary: The authors present a thought-provoking and comprehensive re-analysis of previously published human cell genomics data that seeks to understand the relationship between the sites where the Origin Recognition Complex (ORC) binds chromatin, where the replicative helicase (Mcm2-7) is situated on chromatin, and where DNA replication actually begins (origins). The view that these should coincide is influenced by studies in yeast where ORC binds site-specifically to dedicated nucleosome-free origins where Mcm2-7 can be loaded and remains stably positioned for subsequent replication initiation. However, this is most certainly not the case in metazoans where it has already been reported that chromatin binding sites of ORC, Mcm2-7, and origins do not necessarily overlap, likely because ORC loads the helicase in transcriptionally active regions of the genome and, since Mcm2-7 retains linear mobility (i.e., it can slide), it is displaced from its original position by other chromatin-contextualized processes (for example, see Gros et al., 2015 Mol Cell, Powell et al., 2015 EMBO J, Miotto et al., 2016 PNAS, and Prioleau et al., 2016 G&D amongst others). This study reaches a very similar conclusion: in short, they find a high degree of discordance between ORC, Mcm2-7, and origin positions in human cells.
 
 Strengths: The strength of this work is its comprehensive and unbiased analysis of all relevant genomics datasets. To my knowledge, this is the first attempt to integrate these observations and the analyses employed were suited for the questions under consideration.
 
 Thank you for recognizing the comprehensive and unbiased nature of our analysis. That the comprehensive view fails to move the field forward, which the reviewer names as the major weakness, is actually a strength. It should be viewed in the light that we cannot even find evidence to support the primary hypothesis: that the most reproducible origins must be near ORC and MCM binding sites. This finding will prevent the unwise adoption of ORC or MCM binding sites as surrogate markers of origins and may perhaps stimulate the field to try and improve methods of identifying ORC or MCM binding until the binding sites are found to be proximal to the most reproducible origins. The last possibility is that there are ORC- or MCM-independent modes of defining origins, but we have no evidence of that.
 
 Weaknesses: The major weakness of this paper is that this comprehensive view failed to move the field forward from what was already known. Further, a substantial body of relevant prior genomics literature on the subject was neither cited nor discussed. This omission is important given that this group reaches very similar conclusions as studies published a number of years ago. Further, their study seems to present a unique opportunity to evaluate and shape our confidence in the different genomics techniques compared in this study. This, however, was also not discussed.
 
 We will do what the reviewer suggests: use cell type-specific data wherever origins have been defined by at least two methods in the same cell type, specifically reporting the percent of shared origins amongst the datasets to compare whether some methods correlate better with each other. Thanks for the suggestion. ORC ChIP-seq and MCM ChIP-seq data do not define origins: they define the binding sites of these proteins. Thus, we will discuss why the ChIP-seq sites of these protein complexes should not be used to define origins.
 
 We do not cite the SNS-seq data before 2018 because of the concerns discussed above about the earlier techniques needing improvement. We will discuss other genomics data that we failed to discuss.
 
 We will cite the papers the reviewer names:
 
 Gros, Mol Cell 2015 and Powell, EMBO J. 2015 discuss the movement of MCM2-7 away from ORC in yeast and flies and will be cited. MCM2-7 binding to sites away from ORC and being loaded in vast excess of ORC was reported earlier on Xenopus chromatin in PMC193934, and will also be cited.
 
 Miotto, PNAS, 2016: publishes ORC2 ChIP-seq sites in HeLa (data we have used in our analysis), but does not measure ORC1 ChIP-seq sites. They say: “ORC1 and ORC2 recognize similar chromatin states and hence are likely to have similar binding profiles.” This is a conclusion based on the fact that the ChIP-seq sites in the two studies are in areas with open chromatin; it is not a direct comparison of binding sites of the two proteins.
 
 Prioleau, G&D, 2016: This is a review that compared different techniques of origin identification but has no primary data to say that ORC and MCM binding sites overlap with the most reproducible origins.
 
URL

biorxiv.org/content/10.1101/2023.07.25.550556v4
www.biorxiv.org

New submission 29/08/2023, 10:03:56

1
1. Public_Reviews 29 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  This study investigates the context-specificity of facial expressions in three species of macaques to test predictions for the 'social complexity hypothesis for communicative complexity'. This hypothesis has garnered much attention in recent years. A proper test of this hypothesis requires clear definitions of 'communicative complexity' and 'social complexity'. Importantly, these two facets of a society must not be derived from the same data because otherwise, any link between the two would be trivial. For instance, if social complexity is derived from the types of interactions individuals have, and different types of signals accompany these interactions, we would not learn anything from a correlation between social and communicative complexity, as both stem from the same data.
  
  The authors of the present paper make a big step forward in operationalising communicative complexity. They used the Facial Action Coding System to code a large number of facial expressions in macaques. This system allows decomposing facial expressions into different action units, such as 'upper lid raiser', 'upper lip raiser' etc.; these units are closely linked to activating specific muscles or muscle groups. Based on these data, the authors calculated three measures derived from information theory: entropy, specificity and prediction error. These parts of the analysis will be useful for future studies.
  
  The three species of macaque varied in these three dimensions. In terms of entropy, there were differences with regard to context (and if there are these context-specific differences, then why pool the data?). Barbary and Tonkean macaques showed lower specificity than rhesus macaques. Regarding predicting context from the facial signals, a random forest classifier yielded the highest prediction values for rhesus monkeys. These results align with an earlier study by Preuschoft and van Schaik (2000), who found that less despotic species have greater variability in facial expressions and usage.
  
  Crucially, the three species under study are also known to vary in terms of their social tolerance. According to the highly influential framework proposed by Bernard Thierry, the members of the genus Macaca fall along a graded continuum from despotic (grade 1) to highly tolerant (grade 4). The three species chosen for the present study represent grade 1 (rhesus monkeys), grade 3 (Barbary macaques), and grade 4 (Tonkean macaques).
  
  The authors of the present paper define social complexity as equivalent to social tolerance - but how is social tolerance defined? Thierry used aggression and conflict resolution patterns to classify the different macaque species, with the steepness of the rank hierarchy and the degree of nepotism (kin bias) being essential. However, aggression and conflict resolution are accompanied by facial gestures. Thus, the authors are looking at two sides of the same coin when investigating the link between social complexity (as defined by the authors) and communicative complexity. Therefore, I am not convinced that this study makes a significant advance in testing the social complexity for communicative complexity hypothesis. A further weakness is that - despite the careful analysis - only three species were considered; thus, the effective sample size is very small.
  
  Social tolerance in macaques is defined by various covarying traits, among which rates of counter-aggression and conflict resolution are only two of many included (see Thierry 2021 for a recent discussion and review). We do not deviate from Thierry’s definition of social tolerance. We simply highlight that the constellation of behavioral traits in the most tolerant macaque species results in a social environment where the outcome of social interactions is more uncertain (see introduction lines 102-114). As we argue throughout the paper, higher uncertainty can be used as a proxy for higher complexity and thus we conclude that the most tolerant macaque species have the highest social complexity. While most social behavior in macaques is accompanied by some facial behavior, we were careful to define social contexts only from the body language/behavior (e.g., lunge for aggression, grooming for affiliation) of the individuals involved and ignored the facial behavior used (see method lines 371-381). Therefore, the facial behavior of macaques (communication signals) was not used in defining either social tolerance (and by extension complexity) or the social context in which it was used. We feel like this appropriately minimizes any elements of circularity in the analysis of social and communicative complexity.
  
  Regarding the effective sample size of three species, we agree that it is small, and it is a limitation of this study. However, the methodology we used is applicable to any species for which FACS is available (including other non-human primates, dogs, and horses), and therefore, we hope that other datasets will complement ours in the future. Nevertheless, we now acknowledge this limitation in the discussion (lines 314-317).
  
  Reviewer #2 (Public Review):
  
  This is a well-written manuscript about a strong comparative study of diversity of facial movements in three macaque species to test arguments about social complexity influencing communicative complexity. My major criticism has to do with the lack of any reporting of inter-observer reliability statistics - see comment below. Reporting high levels of inter-observer reliability is crucial for making clear the authors have minimized chances of possible observer biases in a study like this, where it is not possible to code the data blind with regard to comparison group. My other comments and questions follow by line number:
  
  We agree that inter-observer coding reliability is an important piece of information. We now report in more detail the inter-observer reliability tests that we conducted on lines 384-392.
  
  38-40. Whereas I am an advocate of this hypothesis and have tested it myself, the authors should probably comment here, or later in the discussion, about the reverse argument - greater communicative complexity (driven by other selection pressures) could make more complicated social structures possible. This latter view was the one advocated by McComb & Semple in their foundational 2005 Biology Letters comparative study of relationships between vocal repertoire size and typical group size in non-human primate species.
  
  It is true that an increase in communicative complexity could allow/drive an increase in social complexity. Unfortunately our data is correlational in nature and we cannot determine the direction of causality. We added such a statement to the discussion (lines 311-314).
  
  72-84 and 95-96. In the paragraph here, the authors outline an argument about increasing uncertainty / entropy mapping on to increasing complexity in a system (social or communicative). In lines 95-96, though, they fall back on the standard argument about complex systems having intermediate levels of uncertainty (complete uncertainty roughly = random and complete certainty roughly = simple). Various authors have put forward what I think are useful ways of thinking about complexity in groups - from the perspective of an insider (i.e., a group member, where greater randomness is, in fact, greater complexity) vs from the perspective of an outsider (i.e., a researcher trying to quantify the complexity of the system where it is relatively easy to explain a completely predictable or completely random system but harder to do so for an intermediately ordered or random system). This sort of argument (Andrew Whiten had an early paper that made this argument) might be worth raising here or later in the discussion? (I'm also curious where the authors' sentiments lie for this question - they seem to touch on it in lines 285-287, but I think it's worth unpacking a little more here!)
  
  In this study we used three measures of uncertainty (entropy, context specificity, and prediction error) to approximate complexity. However, maximum entropy or uncertainty would be achieved in a system that is completely random (and thus be considered simple). Therefore, the species with the highest entropy values, or unpredictability, could be interpreted as having a simpler communication system than a species with a moderately high entropy/unpredictability value. Our argument is that animal communication systems cannot possibly be random, otherwise they would not have evolved as signals. In systems where we know the highest entropy (or unpredictability) will not be due to randomness, as is the case with animal social interactions and communication, we can conclude that the system with the highest uncertainty is the most complex. We have now expanded upon this point in the discussion (lines 286-294). See also response to reviewer 1 below.
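
  The point about maximum entropy can be made concrete with a small Python sketch. It computes the Shannon entropy of a sequence of observed facial-movement combinations and normalizes it by the maximum possible for the number of distinct combinations seen. This is an illustration of the general idea only: the function names are invented here, and the normalization is an assumption that may differ from the entropy measure actually used in the paper.

```python
from collections import Counter
from math import log2

def shannon_entropy(observations):
    """Shannon entropy (bits) of the empirical distribution of observed categories."""
    counts = Counter(observations)
    n = sum(counts.values())
    return -sum((c / n) * log2(c / n) for c in counts.values())

def normalized_entropy(observations):
    """Entropy divided by its maximum, log2(number of distinct categories); 0.0 if fewer than 2."""
    k = len(set(observations))
    if k < 2:
        return 0.0
    return shannon_entropy(observations) / log2(k)
```

  A repertoire in which every combination is used equally often reaches the maximum (a normalized value of 1), which is also what purely random usage would produce. That is why the argument in the response depends on the separate premise that evolved signals are not used randomly.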
  
  115-129. See also:
  
  Maestripieri, D. (2005). "Gestural communication in three species of macaques (Macaca mulatta, M. nemestrina, M. arctoides): use of signals in relation to dominance and social context." Gesture 5: 57-73.
  
  Maestripieri, D. and K. Wallen (1997). "Affiliative and submissive communication in rhesus macaques." Primates 38(2): 127-138.
  
  On that note, it is probably worth discussing in this paragraph and probably later in the discussion exactly how this study differs from these earlier studies of Maestripieri. I think the fact that machine learning approaches had the most difficulty assigning crested data to context is an important methodological advance for addressing these sorts of questions - there are probably other important differences between the authors' study here and these older publications that are worth bringing up.
  
  Our study differs from these two studies in that they classified facial behavior into discrete categories (e.g., bared-teeth, lip-smack), whereas we adopted a bottom-up approach and made no a priori assumptions about which movements are relevant. We broke facial behavior down into its individual muscle movements (i.e., Action Units). Measuring facial behavior at the level of individual muscle movements allows for a more detailed and objective description of the complexity of facial behavior. This is a general point in advancing the study of facial behavior that is discussed in the introduction (lines 60-71) and discussion (lines 206-208). The reason we don’t draw a direct comparison with the studies above is that they had a slightly different focus. Our study was more focused on complexity of the (facial) communication system in general rather than comparing whether the different species use the same facial behavior in the same/different social contexts.
  
  220-222. What is known about visual perception in these species? Recent arguments suggest that more socially complex species should have more sensitive perceptual processing abilities for other individuals' signals and cues (see Freeberg et al. 2019 Animal Behaviour). Are there any published empirical data to this effect, ideally from the visual domain but perhaps from any domain?
  
  This is an interesting point. We are not aware of any studies showing differences in visual perception within the macaque genus. Both crested macaques and rhesus macaques are able to discriminate between individuals and facial expressions in match-to-sample tasks with comparable performances (Micheletta et al., 2015a, 2015b; Parr et al. 2008; Parr & Heinz, 2009). Similarly, several macaque species are sensitive to gaze shifts from conspecifics (Tomasello et al. 1998; Teufel et al. 2010; Micheletta & Waller, 2012).
  
  274-277. I am not sure I follow this - could not different social and non-social contexts produce variation in different affective states such that "emotion"-based signals could be as flexible / uncertain as seemingly volitional / information-based / referential-like signals? This issue is probably too far away from the main points of this paper, but I suspect the authors' argument in this sentence is too simplified or overstated with regard to more affect-based signals.
  
  Emotion-based signals could, in theory, also produce flexible signals and it is possible that some facial expressions reflect an emotional state. However, some previous studies have suggested that facial expressions are only used as a display of emotion, rather than such signals having evolved for a different function such as announcing future intentions. In our study we found that macaques used, in some cases, the same facial expressions (i.e. combination of Action Units) in at least two different social contexts that, presumably, differed in their emotional valence. Thus, it is unlikely that particular facial expressions are bound to a single emotion. We think that this is an important point to make even though it is slightly beyond the scope of our paper.
  
  288 on. Given there are only three species in this study, the chances of one of the species being the 'most complex' in any measure is 0.33. Although I do not believe this argument I am making here, can the authors rule out the possibility that their findings related to crested macaques are all related to chance, statistically speaking?
  
  We are not aware of a way to rule out this possibility. However, we believe that we are appropriately cautious throughout the paper and acknowledge that having only investigated three species is a limitation of this study in the discussion (lines 314-317, see also our response to reviewer 1 above).
  
  329-330. The fact that only one male rhesus macaque was assessed here seems problematic, given the balance of sexes in the other two species. Can the authors comment more on this - are the gestures they are studying here identical across the sexes?
  
  We agree it would have been preferable to collect data on more than one male rhesus macaque, but that was unfortunately not possible. We are not aware of any studies showing differences in the use of facial behavior between male and female rhesus macaques. If differences exist, most likely these would occur in a sexual/mating context. However, in our study we only considered affiliative (non-sexual), submissive, and aggressive contexts, where we have no a priori reason to believe that there are sex differences.
  
  354-371. Inter-observer reliability statistics are required here - one of the authors who did not code the original data set, or a trained observer who is not an author, could easily code a subset of the video files to obtain inter-observer reliability data. This is important for ruling out potential unconscious observer biases in coding the data.
  
  We agree this is an important piece of information. We now report in more detail the inter-observer reliability tests that we conducted on lines 384-392:
  
  “An agreement rating of >0.7 was considered good [Ekman et al 2002] and was necessary for obtaining certification. To obtain a MaqFACS coding certification, AVR, CP, and PRC coded 23 video clips of rhesus macaques and the MaqFACS codes were compared to the data of other certified coders (https://animalfacs.com).
  
  The mean agreement ratings obtained were 0.85, 0.73, 0.83 for AVR, CP, and PRC, respectively. In addition, AVR and CP coded 7 videos of Barbary macaques with a mean agreement rating of 0.79. AVR and PRC coded 10 videos of crested macaques with a mean agreement rating of 0.74.”
  
  Reviewer #1 (Recommendations For The Authors):
  
  Given the long debate on the concept of information exchange in animal communication, I would also recommend being more careful with the term 'exchanges of information' (line 271). Perhaps it's better to be agnostic in the context of this paper.
  
  As suggested, we have now changed the phrasing to focus on the behavior of the animals, rather than suggesting that information is being exchanged (lines 270-273).
  
  Line 281: "This result confirms the assumption that facial behaviour in macaques is not used randomly": the authors are knocking down a straw man. Nobody who has ever studied animal communication would consider that signals occur randomly. Otherwise, they would not have evolved as signals.
  
  Indeed, nobody claims that animal communication signals are used randomly. Although it may be taken for granted, we feel it is worthwhile to reiterate this point, given that we used relative entropy and prediction error as measures of complexity. For instance, maximum entropy or unpredictability would be achieved in a system that is completely random (and thus be considered simple). Therefore, the species with the highest entropy values, or lowest predictability, could be interpreted as having a simpler communication system than a species with a moderately high entropy value. But if we are working under the assumption that animal communication systems cannot possibly be random, then we can conclude that the species whose communication system has the highest entropy is in fact the most complex. We tried to make this justification clearer in the discussion (lines 285-294).
  
  I did not follow why there is a higher reliance on facial signals when predation pressure is higher. Apart from the fact that the authors cannot address this question, they may want to reconsider this idea altogether.
  
  We now expand on the logic of why predation pressure might affect the use of facial signals (see lines 308-309): “When predation pressure is higher, reliance on facial signals could be higher than, for example vocal signals, such as to not draw attention of predators to the signaller.”
  
  Technical comments:
  
  One methodological issue that requires clarification is what the units of analysis are. The authors write that each row in their analysis denoted an observation time of 500 ms. How many rows did the authors assemble? The authors mention a sample size of > 3000 social interactions in the abstract. How did they define social interactions? And how many 'time windows' of 500 ms were obtained? Did they take one window per interaction or several? If several, then how was this move accounted for in the analysis? The reporting needs to be more accurate here. Most likely, the bootstrapping took care of biases in the data, but still, this information needs to be provided.
  
  We have now added some additional information to the method section. Social interactions for each context had the following definitions: “Social context was labeled from the point of view of the signaler based on their general behavior and body language (but not the facial behavior itself), during or immediately following the facial behavior. An aggressive context was considered when the signaler lunged or leaned forward with the body or head, charged, chased, or physically hit the interaction partner. A submissive context was considered when the signaler leaned back with the body or head, moved away, or fled from the interaction partner. An affiliative context was considered when the signaler approached another individual without aggression (as defined previously) and remained in proximity, in relaxed body contact, or groomed either during or immediately after the facial behavior. In cases where the behavior of the signaler did not match our context definitions, or displayed behaviors belonging to multiple contexts, we labeled the social context as unclear. Social context was determined from the video itself and/or from the matching focal behavioral data, if available.” (lines 371-382). The total duration of all social interactions per social context, and thus the number of 500ms windows/rows, have been added to Table 1 (lines 395-397). There were several 500ms windows per social interaction. All 500ms time blocks per interaction were used in the statistical analyses in order to retain all the variation and complexity of the facial behavior (Action Unit combinations) used by the macaques (lines 403-405). Indeed the bootstrapping procedure was used to account for any biases in the data.
  
  Overall, I would recommend providing more information on the actual behaviour of the animals. The paper is strong in handling highly derived indices representing the behaviour, but the reader learns little about the animals' behaviour. Thus, it would be great if statements about the entropy ratio were translated into what these measures represent in real life. For context specificity, this is clear, but for entropy, not so much.
  
  A high entropy ratio essentially suggests that a species uses a high variety of unique facial behavior/signals and all signals in the repertoire are used roughly equally often (rather than one facial behavior being used 90% of the time and others rarely used). We have tried our best to better explain this point in the introduction (lines 75-81) and discussion (lines 215-222). Discussing exactly what these signals are and what they mean was beyond the scope of this paper.
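
The intuition above can be made concrete with a small sketch. It assumes the entropy ratio is the Shannon entropy of the observed signal frequencies divided by the maximum entropy attainable with the same number of signal types; the exact definition used in the paper may differ, and the signal labels and counts below are hypothetical.

```python
import math
from collections import Counter

def entropy_ratio(signals):
    """Shannon entropy of the signal frequencies divided by the maximum
    possible entropy (all observed signal types equally frequent).
    Assumed definition for illustration only."""
    counts = Counter(signals)
    total = sum(counts.values())
    if len(counts) < 2:
        return 0.0  # a single signal type carries no uncertainty
    h = -sum((c / total) * math.log2(c / total) for c in counts.values())
    return h / math.log2(len(counts))

# Hypothetical repertoires of four facial signals, 100 observations each.
even = ["A", "B", "C", "D"] * 25                          # all used equally often
skewed = ["A"] * 90 + ["B"] * 4 + ["C"] * 3 + ["D"] * 3   # one signal dominates

print(entropy_ratio(even))    # ratio at its maximum
print(entropy_ratio(skewed))  # much lower ratio
```

Under this assumed definition, the evenly used repertoire reaches the maximum ratio, while the repertoire dominated by one signal scores far lower, which matches the verbal description of a high versus low entropy ratio.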
  
  Line 106: nepotism, not kinship
  
  Changed as suggested (line 106).
  
  Line 113: I would avoid statements about how a monkey society is perceived by its members.
  
  We think that noting how individuals may perceive their social environment is worthwhile when defining social complexity, so have retained this point but changed the phrasing to be more speculative (lines 112-113).
  
  Line 329: I was very surprised that only one male was represented in the data for rhesus monkeys. The authors try to wriggle their way out of this issue in the supplementary material ("Therefore, we have no a priori reason to expect an overall difference in the diversity and complexity of facial behaviour between the sexes"), but I think this is a major shortcoming of the analysis. They should ascertain whether there are no sex differences in the other two species regarding their variables of interest. They could then make a very cautious case for there being no sex differences in rhesus either. But of course, they would not know for sure.
  
  As with our response to reviewer 2 above, we agree that it would have been preferable to collect data on more than one male rhesus macaque, but that was unfortunately not possible. We are not aware of any studies showing differences in the use of facial behavior between male and female rhesus macaques. If differences exist, most likely these would occur in a sexual/mating context. However, in our study we only considered affiliative (non-sexual), submissive, and aggressive contexts, where we have no a priori reason to believe that there are sex differences. Looking at sex differences in the use of facial behavior would be a worthwhile study on its own, but it is outside the scope of this paper.
  
  This paper would make a stronger contribution if it focussed on the comparative analysis of facial expressions and removed the attempt of testing the social complexity for communicative complexity hypothesis.
  
  A comparative analysis of the contextual use of specific facial movements is important. But this paper is focused on making a more general comparison of the communication style and complexity across species. The social complexity hypothesis for communicative complexity is one of the key theoretical frameworks for such an investigation and allows us to frame our study in a broader context. We contribute important data on 3 species with methods that can be replicated and extended to other species. Therefore, we believe that it is a worthy contribution to investigations of the evolution of complex communication.
  
  REFERENCES
  
  Micheletta, J., J. Whitehouse, L.A. Parr, and B.M. Waller. ‘Facial Expression Recognition in Crested Macaques (Macaca nigra)’. Animal Cognition 18 (2015): 985–90. https://doi.org/10/f7fvnh.
  
  Micheletta, Jérôme, Jamie Whitehouse, Lisa A. Parr, Paul Marshman, Antje Engelhardt, and Bridget M. Waller. ‘Familiar and Unfamiliar Face Recognition in Crested Macaques (Macaca nigra)’. Royal Society Open Science 2 (2015): 150109. https://doi.org/10/ggx9k9.
  
  Parr, L. A., and M. Heintz. ‘Facial Expression Recognition in Rhesus Monkeys, Macaca mulatta’. Animal Behaviour 77 (2009): 1507–13. https://doi.org/10/bbsp5n.
  
  Parr, L.A., M. Heintz, and G. Pradhan. ‘Rhesus Monkeys (Macaca mulatta) Lack Expertise in Face Processing’. Journal of Comparative Psychology 122 (2008): 390–402. https://doi.org/10/d7w6bv.
  
  Micheletta, J., and B.M. Waller. ‘Friendship Affects Gaze Following in a Tolerant Species of Macaque, Macaca nigra’. Animal Behaviour 83 (2012): 459–67. https://doi.org/10/c4f8n2.
  
  Thierry B. Where do we stand with the covariation framework in primate societies? Am. J. Biol. Anthropol. 128 (2021): 5–25. https://doi.org/10.1002/ajpa.24441
  
  Tomasello, M., J. Call, and B. Hare. ‘Five Primate Species Follow the Visual Gaze of Conspecifics’. Animal Behaviour 55 (1998): 1063–69. https://doi.org/10/bmq7xh.
  
  Teufel, C., A. Gutmann, R. Pirow, and J. Fischer. ‘Facial Expressions Modulate the Ontogenetic Trajectory of Gaze-Following among Monkeys’. Developmental Science 13 (2010): 913–22. https://doi.org/10/b6j5r7.
  
URL

biorxiv.org/content/10.1101/2022.12.07.519469v4
www.biorxiv.org www.biorxiv.org

New submission 29/08/2023, 10:00:11

1
1. Public_Reviews 29 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  We are grateful for the helpful comments of both reviewers and have revised our manuscript with them in mind.
  
  One of the main issues raised was that readers may by default assume that our models are correct. We in fact made it very clear in our discussion that the models are merely hypotheses that will need testing by “wet” experiments and we do not therefore agree that even readers unfamiliar with AF would assume that the models must be correct. It was also suggested that readers could be reassured by including extensive confidence estimates such as PAE plots. As it happens, every single model described in the manuscript had reasonably high PAE scores and more crucially the entire collection of output files, including PAE data, are readily accessible on Figshare at https://doi.org/10.6084/m9.figshare.22567318.v2, a fact that the reviewers appear to have overlooked. The Figshare link is mentioned three times in the manuscript. Embedding these data within the manuscript itself would in our view add even more details and we have therefore not included them in our revised manuscript. Likewise, it is rather simple for any reader to work out which part of a PAE matrix corresponds to an interaction observed in the corresponding pdb prediction. Besides which, it is our view that the biological plausibility and explanatory power of models is just as important as AF metrics in judging whether they may be correct, as is indeed also the case for most experimental work.
  
  Another important point was that the manuscript was too long and not readable. Yes, it is long and it could well be argued that we could have written a different type of manuscript, focusing entirely on what is possibly the simplest and most important finding, namely that our AF models suggest that in animal cells Wapl appears to form a quaternary complex with SA, Pds5, and Scc1 in a manner suggesting that a key function of Wapl’s conserved CTD is to sequester Scc1’s N-terminal domain after it has dissociated from Smc3. For right or for wrong, we decided that this story could not be presented on its own but also required 1) an explanation for how Scc1 is induced to dissociate from Smc3 in the first place and 2) an explanation for why the quaternary complex predicted for animal cells was not initially predicted for fungi such as yeast. The yeast situation was an exception that clearly needed explaining if the theory was to have any generality and it turned out that delving into the intricate details of the genetics of releasing activity in yeast was eventually required and yielded valuable new insights. We also believe that our work on the recruitment of Eco/Esco acetyl transferases to cohesin and the finding that sororin binds to the Smc3/Scc1 interface also provided important insight into how releasing activity is regulated. We acknowledge that the paper is indeed long but do not think that it is badly written. It is above all a long and complex story that in our view reveals numerous novel insights into how cohesin’s association with chromosomes is regulated, and we have endeavoured to eliminate any excessive speculation. We feel it is not our fault that cohesin uses complex mechanisms.
  
  Notwithstanding these considerations, we have in fact simplified a few sections and removed one or two others but acknowledge that we have not made substantial cuts.
  
  It was pointed out that a key feature of our modelling, namely the predicted association of Wapl’s C-terminal domain with SA/Scc3’s CES is inconsistent with published biochemical data. The AF predictions for this interface are universally robust in all eukaryotic lineages and crucially fully consistent with published and unimpeachable genetic data. We note that any model that explains all findings is bound to be wrong for the very simple reason that some of these findings will prove to be incorrect. There is therefore an art in Science of judging which data must be explained and accommodated and which should be ignored. In this particular case, we chose to ignore the biochemistry. Time will tell whether our judgement proves correct.
  
  Last but not least, it was suggested that we might provide some experimental support for our proposed SA/Scc3-Pds5-Scc1-WaplC quaternary complex. We are in fact working on this by introducing cysteine pairs (that can be crosslinked in cells) into the proposed interfaces but decided that such studies should be the topic of a subsequent publication. It would be impossible with the resources available to our labs to follow up all of the potential interactions and we therefore decided to exclude all such experiments.
  
  We are grateful for the detailed comments provided by both reviewers, many of which were very helpful, and in many but not all cases have amended the manuscript accordingly.
  
  With regard to the more specific comments:
  
  Reviewer #1 (Recommendations For The Authors):
  
  1) One concern is that observed interfaces/complexes arise because AF-multimer will aim to pack exposed, conserved and hydrophobic surfaces or regions that contain charge complementarity. The risk is that pairwise interaction screens can result in false positive & non-physiological interactions. It is therefore important to report the level of model confidence obtained for such AF calculations:
  
  A) The authors should color the key models according to pLDDT scores obtained as reported by AF. This would allow the reader to judge the estimated accuracy of the backbone and side chain rotamers obtained. At least for the key models and interactions it would be important to know if the pLDDT score is >90 (Correct backbone and most rotamers) or >70 (only backbone is correct).
  
  B) It would also be important to report the PAE plots to allow estimation of the expected position error for most of the important interactions. pLDDT coloring and PAE plots can be shown side-by-side as shown in other published data (e.g. https://pubmed.ncbi.nlm.nih.gov/35679397/ (Supplementary data)).
  
  C) The authors should include a Table showing the confidence of template modeling scores for the predicted protein interfaces as ipTM, ipTM+pTM as reported by AlphaFold-multimer. Ideally, they would also include DockQ scores but this may not be essential. Addition of such scores would help classification into Incorrect, Acceptable or of high quality. For example, line 1073 et seq the authors show a model of a SCC1-SA and ESCO1 complex (Fig. 37). Are the modeling scores for these interfaces high? It does not help that the authors show cartoons without side chains. Can the authors provide a close-up view of the two interfaces? Are the amino acids indeed packed in a manner expected for a protein interface? Can we exclude the possibility that the prediction is obtained merely because the sequence segments (e.g. in ESCO1 & ESCO2) are hydrophobic and conserved?
  
  We do not agree that including this level of detail to the text/figures of the manuscript would be suitable. All the relevant data for those who may be sceptical about the models are readily available at https://doi.org/10.6084/m9.figshare.22567318.v2. In our view, the cartoon versions of the models are easier for a reader to navigate. Anyone interested in the molecular details can look at the models directly.
  
  Importantly, no amount of statistical analysis can completely validate these models. What is required are further experiments, which will be the topic of further work from our and, I dare say, from other laboratories.
  
  D) When they predict an interaction between the SA2:SCC1 complex and Sororin's FGF motif, they find that only 1/5 models show an interaction and that the interaction is dissimilar to that seen of CTCF. Again, it would be helpful to know about modeling scores. Can they show a close-up view of the SORORIN FGF binding interface to see if a realistic binding mode is obtained? Can they indicate the relevant region on the PAE plot?
  
  Given that AF greatly favours other interactions of Sororin’s FGF motif over its interaction with SA2-Scc1, we do not agree that dwelling on the latter would serve any purpose.
  
  2) Line 996: AF predicts with high confidence an interaction between Eco1 & SMC3hd. What are the ipTM (& DockQ if available) scores. Would the interface score High, Medium or Acceptable?
  
  As mentioned, see https://doi.org/10.6084/m9.figshare.22567318.v2.
  
  3) Line 1034 et seq: Eco1/ESCO1/ESCO2 interaction with PDS5. Interface scores need to be shown to determine that the models shown are indeed likely to occur. If these interactions have low model confidence, Fig. 36 and discussion around potential relevance to PDS5-Eco1 orientation relative to the SMC3 head remains highly speculative and could be expunged.
  
  See https://doi.org/10.6084/m9.figshare.22567318.v2. It should be clear that the predictions are very similar in fungi and animals. Crucially, we know that Pds5 is essential for acetylation in vivo, so the models appear plausible from a biological point of view.
  
  4) Considering the relatively large interface between ECO1 and SMC3, would the author consider the possibility that in addition to acetylating SMC3's ATPase domain, ECO1 remains bound to cohesin-DNA complex, as proposed for ESCO1 by Rahman et al (10.1073/pnas.1505323112)?
  
  This is certainly possible but we would not want to indulge in such speculation.
  
  5) E.g. Line 875 but also throughout the text: As there is no labeling of the N- and C-termini in the Figures, it is frequently unclear what the authors are referring to when they mention that AF models orient chains in a certain manner.
  
  Good point. This has been amended. However, the positions of the N- and C-termini are all available at https://doi.org/10.6084/m9.figshare.22567318.v2.
  
  6) Fig19B: PAE plots: authors should indicate which chains correspond to A, B, C. Which segment corresponds to the TYxxxR[T/S]L motif? Can they highlight this section on the PAE plot?
  
  Good point and amended in the revised manuscript.
  
  Minor comments:
  
  1) Line 440: the WAPL YSR motif is not shown in Fig. 14A
  
  2) Line 691: Scc3 spelling error.
  
  3) Line 931: Sentence ending '... SCC3 (SCC3N).' requires citation.
  
  4) Line 1008: Figure reference seems wrong. It should read: Fig. 34A left and right. Fig. 34B does not contain SCC1.
  
  Many thanks for spotting these. Hopefully, all corrected.
  
  5) Fig. 41 can be removed as it shows the absence of the interaction of Sororin with SMC1:SCC1. Sufficient to mention in the text that Sororin does not appear to interact with SMC1:SCC1.
  
  This is possible but we decided to leave this as is.
  
  Reviewer #2 (Recommendations For The Authors):
  
  Minor points
  
  (1) Are there any predicted models in which one of the two dimer interfaces of the hinge is open when the coiled coils are folded back, as seen in the cryo-EM structure of human cohesin-NIPBL complex in the clamped state?
  
  No AF runs ever predicted half opened hinges. It is possible that the introduction of mutations in one of the two interfaces might reveal a half-opened state and we ought to try this. However, it would not be appropriate for this manuscript, we believe.
  
  (2) Structures of the SA-Scc1 CES bound to [Y/F]xF motifs from Sgo1 and CTCF have been reported, suggesting that a similar motif could interact with SA/Scc3. Surprisingly, AF did not predict an interaction between Scc3/SA and Wapl FGF motifs, which only bind to the Pds5 WEST region. On the other hand, AF predicted interactions of the Sororin FGF motif with both Pds5 WEST and SA CES. Can the authors comment on this Wapl FGF binding specificity? What will happen if a Wapl fragment lacking the CTD is used in the prediction?
  
  This seems to be an academic point as the CTD is always present.
  
URL

biorxiv.org/content/10.1101/2023.04.14.536858v4
www.biorxiv.org www.biorxiv.org

New submission 10/07/2023, 10:28:37

1
1. Public_Reviews 29 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Recommendations For The Authors):
 
 1) The authors need to validate that RAP1-HA still retains its essential function. As indicated above, if RAP1-HA still retains its essential functions, cells carrying one RAP1-HA allele and one deleted allele are expected to grow the same as WT cells. These cells should also have the WT VSG expression pattern, and RAP1-HA should still interact with TRF.
 
 We demonstrated that C-terminally HA-tagged RAP1 co-localizes with telomeres by a combination of immunofluorescence and fluorescence in situ hybridization (Cestari and Stuart, 2015, PNAS), and co-immunoprecipitates telomeric and 70 bp repeats (Cestari et al. 2019, Mol Cell Biol). We also showed by immunoprecipitation and mass spectrometry that HA-tagged RAP1 interacts with nuclear and telomeric proteins, including PIP5Pase (Cestari et al. 2019). Others have also tagged T. brucei RAP1 with HA without disrupting its nuclear localization (Yang et al. 2009, Cell), all of which indicate that the HA-tag does not affect protein function. As for the suggested experiment, there is no guarantee that cells lacking one allele of RAP1 will behave as wildtype, i.e., normal growth and repression of VSG genes. Also, less than 90% of T. brucei TRF was reported to interact with RAP1 (Yang et al. 2009, Cell), which might be indirect via their binding to telomeric repeats rather than direct protein-protein interactions.
 
 2) The authors need to remove the His6 tag from the recombinant RAP1 fragments before the EMSA analysis. This is essential to avoid any artifacts generated by the His6-tagged proteins.
 
 Our controls show that the His-tag is not interfering with RAP1-DNA binding. We show in Fig 3C-G by EMSA and in Fig S5 by EMSA and microscale thermophoresis that His-tagged full-length rRAP1 does not bind to scrambled telomeric dsDNA sequences, which demonstrates that His-tagged rRAP1 does not bind nonspecifically to DNA. Moreover, in Fig 3G and Fig S5, we show that His-tagged rRAP1(1-300) also does not bind to 70 bp or telomeric repeats. In contrast, the full-length His-tagged rRAP1, rRAP1(301-560), or rRAP1(561-855) bind to 70 bp or telomeric repeats (Fig 3C-G). Since all proteins were His-tagged, the His tag cannot be responsible for the DNA binding. We have worked with many different His-tagged proteins for nucleic acid binding and enzymatic assays without any interference from the tag (Cestari and Stuart, 2013, JBC; Cestari et al. 2013, Mol Cell Biol; Cestari and Stuart, 2015, PNAS; Cestari et al. 2016, Cell Chem Biol; Cestari et al. 2019, Mol Cell Biol).
 
 3) More details need to be provided for ChIPseq and RNAseq analysis regarding the read numbers per sample, mapping quality, etc.
 
 Table S3 includes information on sequencing throughput and read length. Mapping quality was included in the Methods section “Computational analysis of RNA-seq and ChIP-seq”, starting at line 499. In summary, we filtered reads to keep primary alignments (eliminating supplementary and secondary alignments). We also analyzed ChIP-seq with MAPQ ≥20 (99% probability of correct alignment) to distinguish RAP1 binding to specific ESs, including silent vs active ES. We included Fig S4 to show the effect of filtering alignments on the active vs silent ESs. We used MAPQ ≥30 to analyze RNA-seq mapping to VSG genes, including those in subtelomeric regions. Our scripts are available at https://github.com/cestari-lab/lab_scripts. We also included in the Methods, lines 522-524: “Scripts used for ChIP-seq, RNA-seq, and VSG-seq analysis are available at https://github.com/cestari-lab/lab_scripts. A specific pipeline was developed for clonal VSG-seq analysis, available at https://github.com/cestari-lab/VSG-Bar-seq.”
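
The MAPQ thresholds quoted above follow from the Phred-scaled definition of mapping quality, MAPQ = -10 log10(probability that the alignment is wrong). The sketch below only illustrates that conversion and the kind of filter described; it is not the authors' pipeline, the function names are hypothetical, and how well MAPQ is calibrated depends on the aligner.

```python
def mapq_to_correct_probability(mapq):
    """Convert a Phred-scaled mapping quality into the nominal probability
    that the alignment is correct: 1 - 10^(-MAPQ/10)."""
    return 1.0 - 10.0 ** (-mapq / 10.0)

def keep_alignment(mapq, is_secondary, is_supplementary, min_mapq=20):
    """Hypothetical filter: keep primary alignments only, and require a
    minimum mapping quality."""
    return (not is_secondary) and (not is_supplementary) and mapq >= min_mapq

for q in (10, 20, 30):
    print(q, mapq_to_correct_probability(q))
```

A threshold of 20 therefore corresponds to a nominal 1% chance of a wrong placement, and 30 to 0.1%, which is why the stricter cutoff is the more conservative choice for highly similar sequences.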
 
 4) The authors should revise the Discussion section to clearly state the authors' speculations and their working models (the latter of which need solid supporting evidence). Specifically, statements in lines 218 - 219 and lines 224-226 need to be revised.
 
 The statement “likely due to RAP1 conformational changes” in line 228 discusses how binding of PI(3,4,5)P3 could affect RAP1 Myb and MybL domains binding to DNA. We did not make a strong statement but discussed a possibility. We believe that it is beneficial to the reader to have the data discussed, and we do not feel this point is overly speculative. For lines 224-226 (now 234-235), the statement refers to the finding of RAP1 binding to centromeric regions by ChIP-seq, which is a new finding but not the focus of this work. To make it clear that it does not refer to telomeric ESs, we edited: “The finding of RAP1 binding to subtelomeric regions other than ESs, including centromeres, requires further validation.” Since RAP1 binding to centromeres is not the focus of the work, future studies are necessary to follow up, and we believe it is appropriate in the Discussion to be upfront and highlight this point to the readers.
 
 Our model is based on the data presented here but also on scientific literature. We have reviewed the Discussion to prevent broad speculations. When discussing a model, we stated (line 245): “The scenario suggests a model in which …”, to state that this is a working model. Similarly, in Results (line 201) we included: “Our data suggest a model in which…”.
 
 5) The authors should revise the title to reflect a more reasonable conclusion of the study.
 
 We agree that the title should be changed to imply a direct role of PI(3,4,5)P3 regulation of RAP1, which is not captured in the original title. This will provide more specific information to the readers, especially those broadly interested in telomeric gene regulation and RAP1. The new title is: PI(3,4,5)P3 allosteric regulation of repressor activator protein 1 controls antigenic variation in trypanosomes
 
 6) The authors are recommended to provide an estimation of the expression level of the V5-tagged PIP5pase from the tubulin array in reference to the endogenous protein level.
 
 The relative mRNA levels of the exclusive expression of PIP5Pase mutant compared to the wildtype are available in Data S1, RNA-seq. The Mut PIP5Pase allele’s relative expression level is 0.85-fold that of the WT allele (both from tubulin loci). We also showed by Western blot the WT and Mut PIP5Pase protein expression (Cestari et al. 2019, Mol Cell Biol). Concerning PIP5Pase endogenous alleles, we compared normalized RNA-seq counts per million from the conditional null PIP5Pase cells exclusively expressing WT or the Mut PIP5Pase alleles (Data S1, this work) to our previous RNA-seq of single-marker 427 strain (Cestari et al. 2019, Mol Cell Biol). We used the single-marker 427 because the conditional null cells were generated in this strain background. The PIP5Pase WT and Mut mRNAs expressed from tubulin loci are 1.6 and 1.3-fold the endogenous PIP5Pase levels in single-marker 427, respectively. We included a statement in the Methods, lines 275-278: “The WT or Mut PIP5Pase mRNAs exclusively expressed from tubulin loci are 1.6 and 1.3-fold the WT PIP5Pase mRNA levels expressed from endogenous alleles in the single marker 427 strain. The fold-changes were calculated from RNA-seq counts per million from this work (WT and Mut PIP5Pase, Data S1) and our previous RNA-seq from single marker 427 strain (24).”
 
 7) The authors are recommended to provide more detailed EMSA conditions such as protein and substrate concentrations. Better quality EMSA gels are preferred.
 
 All concentrations were already provided in the Methods section. See line 356, in topic Electrophoretic mobility shift assays: “100 nM of annealed DNA were mixed with 1 μg of recombinant protein…”. For microscale thermophoresis, also see lines 375-376 in topic Microscale thermophoresis binding kinetics: “1 μM rRAP1 was diluted in 16 two-fold serial dilutions in 250 mM HEPES pH 7.4, 25 mM MgCl2, 500 mM NaCl, and 0.25% (v/v) NP-40 and incubated with 20 nM telomeric or 70 bp repeats…”. Note that two different biochemical approaches, EMSA and microscale thermophoresis, were used to assess rRAP1-His binding to DNA. Both give concordant results (Fig 3 and 5, and Fig S5; microscale thermophoresis shows the binding kinetics, with data available in Table 1). The EMSA images clearly show the binding of RAP1 to 70 bp or telomeric repeats but not to scrambled telomeric repeat DNA.
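
For readers who want the titration range implied by the quoted thermophoresis conditions, the arithmetic can be sketched as follows. The sketch assumes that the first of the 16 points is the undiluted 1 μM stock and that each subsequent point is exactly half the previous one; the source does not state whether the series starts at 1 μM or at the first dilution, and the function name is hypothetical.

```python
def two_fold_series(start_conc, n_points):
    """Concentrations of a two-fold serial dilution, highest first.
    Assumes point 0 is the undiluted starting concentration."""
    return [start_conc / (2 ** i) for i in range(n_points)]

# 1 uM expressed in nM, 16 points as quoted from the Methods.
series_nM = two_fold_series(1000.0, 16)

print(series_nM[0])    # highest protein concentration (nM)
print(series_nM[-1])   # lowest protein concentration (nM)
```

Under these assumptions the series spans a 2^15-fold range of protein concentration against a fixed 20 nM of DNA.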
 
 Reviewer #2 (Recommendations For The Authors):
 
 Major comments:
 
 Figures
 
 All figures should have their axes properly labeled and units should be indicated. For many of the ChIPseq datasets it is not clear whether the authors show a fold enrichment or RPM and whether they used all reads or only uniquely mapping reads. Especially the latter is a very important piece of information when analyzing expression sites and should always be reported. The authors write, that all RNA-seq and ChIP-seq experiments were performed in triplicate. What is shown in the figures, one of the replicates? Or the average?
 
 ChIP-seq is shown as fold enrichment; we clarified this in the figures by labeling the y-axis RAP1-HA ChIP/Input (log 2). We included in figure legends, see line 710: “Data show fold-change comparing ChIP vs Input.”. For quantitative graphs (Fig 2B, D, and E, and Fig 5F and G), data are shown as the mean of biological replicates. Graphs generated in the Integrative Genomics Viewer (IGV; qualitative graphs) show representative data (Fig 2A, C, and F, and Fig 5D-E). All statistical analyses were calculated from the three biological replicates. Uniquely mapped reads were used. We also included ChIP-seq analysis with MAPQ ≥10 and 20 (90% and 99% probability of correct alignment, respectively) to distinguish RAP1 binding to ESs. Fig S4 shows the various mapping stringencies and demonstrates the enrichment of RAP1-HA at silent vs active ES.
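
 The correspondence between the MAPQ thresholds and the stated probabilities follows from the Phred-style definition of mapping quality, MAPQ = -10 log10(P(alignment is wrong)). A minimal sketch of the conversion is shown below; the function name is illustrative, and aligners differ in how they estimate MAPQ, so the probabilities are nominal.

```python
def mapq_to_p_correct(mapq):
    """Nominal probability that an alignment is correct for a given MAPQ."""
    return 1.0 - 10.0 ** (-mapq / 10.0)


for threshold in (10, 20):
    print(f"MAPQ >= {threshold}: P(correct) >= {mapq_to_p_correct(threshold):.2f}")
```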
 
 Figure 1 is very important for the main argument of the manuscript, but very difficult (impossible for me) to fully understand. It would be great if the author could make an effort to clarify the figure and improve the labels. Panel Fig 1E. Here it is impossible to read the names of the genes that are activated and therefore it is impossible to verify the statements made about the activation of VSGs and the switching.
 
 We have edited Fig 1E to include the most abundant VSGs, which decreased the amount of information in the graph and increased the label font. We also re-labeled each VSG with chromosome or ES name and common VSG name when known (e.g., VSG2). We included Table S1 in the supplementary information with the data used to generate Fig 1E. In Table S1, the reader will be able to check the VSG gene IDs and evaluate the data in detail. We included in the legend, line 700: “See Table S1 for data and gene IDs of VSGs.”
 
 Figure 1F: This panel is important and should be shown in more detail as it distinguishes VSG switching from a general VSG de-repression phenotype. VSG-seq is performed in a clonal manner here after PIP5Pase KD and re-expression. To show that proper switching has occurred place in the different clones, instead of a persistent VSG de-repression, the expression level of more VSGs should be shown (e.g. as in panel E) to show that there is really only one VSG detected per clone. For example, it is not clear what the authors 'called' the dominant VSG gene.
 
 We showed in supplementary information Fig S1 B-C examples of reads mapping to the VSGs. We have now included a graph (Fig S1 D) that quantifies reads mapped to the VSG selected as expressed compared to other VSG genes (considered not expressed). The data show an average of several clones analyzed. Other VSGs (not selected) are at the noise level (about 4 normalized counts) compared to >250 normalized counts for the VSGs selected as expressed.
 
 As mentioned in the public comments, I don't see how the data from Fig 1E and 1F fit together. Based on Fig 1E VSG2 is the dominant VSG, based on Fig 1F VSG2 is almost never the dominant VSG, but the VSG from BES 12.
 
 In Fig 1E, VSG2 predominates in cells expressing WT PIP5Pase; however, in cells expressing Mut PIP5Pase, this is no longer the case. Many other VSGs are detected, and other VSG mRNAs are more abundant than VSG2 (see color intensity in the heat map). The Mut cells may also have remaining VSG2 mRNAs (from before switching) rather than continuous VSG2 expression. This is the reason we performed the clonal analysis shown in Fig 1F, to be certain about the switching. While Fig 1E shows potential switchers in the population, Fig 1F confirms VSG switching in clones.
 
 Many potential switchers were detected in the VSG-seq (Fig 1E, the whole cell population is over 10^7 parasites), but not all potential switchers were detected in the clonal analysis (Fig 1F) because we analyzed 212 clones in total, a fraction of the over 10^7 cells analyzed by VSG-seq. Also, it is possible that not all potential switchers are viable. A preference for switching to specific ESs has been observed in T. brucei (Morrison et al. 2005, Int J Parasitol; Cestari and Stuart, 2015, PNAS), which may explain several clones switching to BES12.
 
 Note that in Fig 1F, tet + cells did not switch VSGs at all; all 118 clones expressed VSG2. We relabeled Fig 1F for clarity and included the VSG names. We added gene IDs in the Figure legends, see line 702: “BES1_VSG2 (Tb427_000016000), BES12_VSG (Tb427_000008000)…”
 
 Statements in Introduction / Discussion
 
 The statement in lines 82/83 is very strong and gives the impression that the PIP5Pase-Rap1 circuit has been proven to regulate antigenic variation in the host. However, I don't think this is the case. The paper shows that the pathway can indeed turn expression sites on and off, but there is no evidence (yet) that this is what happens in the host and regulates antigenic variation during infection. The same goes for lines 214/215 in the discussion.
 
 We agree with the reviewer, and we edited these statements. We changed the statement in lines 82-83 from “The data provide a molecular mechanism…” to “The data indicates a molecular mechanism…”, and the statement in lines 224-225 from “and provides a mechanism to control…” to “and indicates a mechanism to control…”. We also included in lines 261-262: “It is unknown if a signaling system regulates antigenic variation in vivo.” We also edited lines 262-263: “…the data indicate that trypanosomes may have evolved a sophisticated mechanism to regulate antigenic variation...”.
 
 New vs old data
 
 In general, for Figures 1 - 4, it was a bit difficult to understand which panels showed new findings, and which panels confirmed previous findings (see below for specific examples). In the text and in the figure design, the new results should be clearly highlighted.
 
 Authors: All data presented is new, detailed below.
 
 Figure 1: A similar RNA-seq after PIP5Pase deletion was performed in citation 24. Perhaps the focus of this figure should be more on the (clone-specific) VSG-seq experiment after PIP5Pase re-introduction.
 
 This is the first time we show RNA-seq of T. brucei expressing catalytic inactive PIP5Pase, which establishes that the regulation of VSG expression and switching, and repression of subtelomeric regions, is dependent on PIP5Pase enzyme catalysis, i.e., PI(3,4,5)P3 dephosphorylation. Hence, the relevance and difference of the RNA-seq here vs the previous RNA-seq of PIP5Pase knockdown.
 
 Figure 2: A similar ChIP-seq of RAP1 was performed in citation 24, with and without PIP5Pase deletion. Could new findings be highlighted more clearly?
 
 Our and others’ previous work showed ChIP-qPCR, which analyses specific loci. Here we performed ChIP-seq, which shows genome-wide binding sites of RAP1, and new findings are shown here, including binding sites in the BES, MESs, and other genome loci such as centromeres. We also identified DNA sequence bias defining RAP1 binding sites (Fig 2A). We also show by ChIP-seq how RAP1-binding to these loci changes upon expression of catalytic inactive PIP5Pase. To improve clarity in the manuscript, we edited lines 129-130: “We showed that RAP1 binds telomeric or 70 bp repeats (24), but it is unknown if it binds to other ES sequences or genomic loci.”
 
 Figure 4: Binding of Rap1 to PI(3,4,5)P3, but not to other similar molecules, was previously shown in citation 24. Could new findings be highlighted more clearly?
 
 We published in reference 24 (Cestari et al. Mol Cell Biol) that RAP1-HA can bind agarose bead-conjugated synthetic PI(3,4,5)P3. Here, we were able to measure T. brucei endogenous PI(3,4,5)P3 associated with RAP1-HA (Fig 4F). Moreover, we showed that the binding of endogenous RAP1-HA to PI(3,4,5)P3 is about 100-fold higher in cells expressing catalytically inactive PIP5Pase than in cells expressing WT PIP5Pase. The data establish that in vivo endogenous PI(3,4,5)P3 binds to RAP1-HA and how the binding changes in cells expressing mutant PIP5Pase; these data are new and relevant to our conclusions. To clarify, we edited the manuscript in lines 180-182: “To determine if RAP1 binds to PI(3,4,5)P3 in vivo, we in-situ HA-tagged RAP1 in cells that express the WT or Mut PIP5Pase and analyzed endogenous PI(3,4,5)P3 levels associated with immunoprecipitated RAP1-HA”.
 
 Sequencing. I really appreciate the amount of detail the authors provide in the methods section. The authors do an excellent job of describing how different experiments were performed. However, it would be important that the authors also provide the basic statistics on the sequencing data. How many sequencing reads were generated per run (each replicate of the ChIP-seq and RNA-seq assays)? How long were the reads? How many reads could be aligned?
 
 The sequencing metrics for RNA-seq and ChIP-seq for all biological replicates were included in Table S3 (supplementary information). The details of the analysis and sequencing quality were described in the Methods section “Computational analysis of RNA-seq and ChIP-seq”. To be clearer about the analysis, we also included in Methods, lines 522-524: “Scripts used for ChIP-seq, RNA-seq, and VSG-seq analysis are available at https://github.com/cestari-lab/lab_scripts. A specific pipeline was developed for clonal VSG-seq analysis, available at https://github.com/cestari-lab/VSG-Bar-seq.”.
 
 Minor comments:
 
 Figure 1B: I would recommend highlighting the non-ES VSGs and housekeeping genes with two more colors in the volcano plot, to show that it is mostly the antigen repertoire that is deregulated, and not the Pol II transcribed housekeeping genes. This is not entirely clear from the panel as it is right now.
 
 The suggestion was incorporated in Fig 1B. We color-coded the figure to include BES VSGs, MES VSGs, ESAGs, subtelomeric genes, core genes (typically Pol II and Pol III transcribed genes), and Unitig genes, those genes not assembled in the 427-2018 reference genome.
 
 Were the reads in Figure 2a filtered in the same way as those in Figure 2C? To support the statements, only unique reads should be used.
 
 Yes. We also added Fig S4 to make the comparison between read mapping to silent vs active ES clearer.
 
 It would be good if the authors could add a supplementary figure showing the RAP1 ChIP-seq (WT and cells lacking a functional PIP5Pase) for all silent expression sites.
 
 We already had RAP1 ChIP-seq from cells expressing WT PIP5Pase. We have modified it to include data from the Mutant PIP5Pase. See Fig S3 and S5.
 
 In Figure 5D, after depletion of PIP5Pase, RAP1 binding appears to decrease across ESAGs, but ESAG expression appears to increase. How can this be explained with the model of RAP1 repressing transcription?
 
 We included in the Results, lines 208-212: “The increased level of VSG and ESAG mRNAs detected in cells expressing Mut PIP5Pase (Fig 5D) may reflect increased Pol I transcription. It is possible that the low levels of RAP1-HA at the 50 bp repeats affect Pol I accessibility to the BES promoter; alternatively, RAP1 association to telomeric or 70 bp repeats may affect chromatin compaction or folding impairing VSG and ESAG genes transcription.”.
 
 Reviewer #3 (Recommendations For The Authors):
 
 Line 114 - typo? Procyclic instead of procyclics:
 
 Fixed, thanks.
 
 Line 233 - the phrasing here is confusing, may want to replace "whose" with "which" (if I am interpreting correctly):
 
 Thanks, no changes were needed. I have had the sentence reviewed by a Ph.D.-level scientific writer.
 
 Methods - there is no description of VSG-seq analysis in the methods. Is it done the same way as the RNA-seq analysis? Is the code for analysis/generating figures available online?
 
 The procedure is similar. We included an explanation in Methods, lines 503-504: “RNA-seq and VSG-seq (including clonal VSG-seq) mapped reads were quantified…”. Also, in lines 522-524: “Scripts used for ChIP-seq, RNA-seq, and VSG-seq analysis are available at https://github.com/cestari-lab/lab_scripts. A specific pipeline was developed for clonal VSG-seq analysis, available at https://github.com/cestari-lab/VSG-Bar-seq.”.
 
 Fig 1H - Is this from RNA-seq or VSG-seq analysis of procyclics?
 
 The VSG expression analysis in procyclic forms was done by real-time PCR. To clarify this, we included in the legend: “Expression analysis of ES VSG genes after knockdown of PIP5Pase in procyclic forms by real-time PCR”. We also amended the Methods, under the topic RNA-seq and real-time PCR, lines 402-407: “For procyclic forms, total RNAs were extracted from 5.0x10^8 T. brucei CN PIP5Pase growing in Tet + (0.5 µg/mL, no knockdown) or Tet – (knockdown) at 5h, 11h, 24h, 48h, and 72h using TRIzol (Thermo Fisher Scientific) according to manufacturer's instructions. The isolated mRNA samples were used to synthesize cDNA using ProtoScript II Reverse Transcriptase (New England Biolabs) according to the manufacturer's instructions. Real-time PCRs were performed using VSG primers as previously described (23).”
 
 Fig 2 A - Where it says "downstream VSG genes" I assume "downstream of VSG genes" is meant? the regions described in this figure might be more clearly laid out in the text or the legend
 
 Fixed, thanks. We included in the text in Results, line 140: “… and Ts and G/Ts rich sequences downstream of VSG genes”.
 
 Fig 2E - what does "Flanking VSGs" mean in this context?
 
 We added to line 705, figure legends: “Flanking VSGs, DNA sequences upstream or downstream of VSG genes in MESs.”
 
 Fig 2H - Why is the PIP5Pase Mutant excluded from the Chr_1 core visualization?
 
 We did not notice it. We included it now; thanks.
 

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.11.540368v3
www.biorxiv.org

New submission 29/08/2023, 09:23:49

1
1. Public_Reviews 29 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their rigorous and insightful comments, as well as their positive feedback on the manuscript. We agree with reviewer #1 that substantial additional work is needed for a complete mechanistic understanding of how NI circuitry works and we expect that the transgenic tools we generated will be valuable for such experiments. It is noteworthy that specific driver lines do not currently exist for IPN neurons, which limited our ability to perform optogenetic experiments activating the IPN to NI pathway. Reviewer #2 asks for additional clarification and analysis on various experiments, which we intend to address in a revised manuscript. We concur with reviewer #3 that, with the existing data, it is not possible to conclude with certainty that the IPN projections from gsc2 and rln3 NI neurons are solely axonal in nature. Additional experiments with axon- and dendrite-specific markers will be used to resolve this point in future work.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.04.07.487414v3
www.biorxiv.org

New submission 02/05/2023, 11:02:31

1
1. Public_Reviews 29 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  eLife assessment
  
  This important study was designed to examine the bypass of Ras/Erk signaling defects that enable limited regeneration in a mouse model of hepatic regeneration. This hepatocyte proliferation is associated with the expression by groups of cells of mRNA-loaded CD133+ intracellular vesicles that mediate an intercellular signaling pathway that supports proliferation. These are new observations, supported by convincing data, that have broad significance to the fields of regeneration and cancer.
  
  First of all, we greatly appreciate the very positive take of this work by eLife editors and also thank the two reviewers for their constructive comments. We have provided point-by-point responses as follows.
  
  Reviewer #1 (Public Review):
  
  This study was designed to examine the bypass of Ras/Erk signaling defects that enable limited regeneration in a mouse model of hepatic regeneration. The authors show that this hepatocyte proliferation is marked by expression of CD133 by groups of cells. The CD133 appears to be located on intracellular vesicles associated with microtubules. These vesicles are loaded with mRNA. The authors conclude that the CD133 vesicles mediate an intercellular signaling pathway that supports cell proliferation. These are new observations that have broad significance to the fields of regeneration and cancer.
  
  The primary observation is that the limited regeneration observed in livers with Ras/Erk signaling defects is associated with CD133 expression by groups of cells. The functional significance of CD133 was tested using Prom1 KO mice - the data presented are convincing.
  
  The major weakness of the study is that some molecular mechanistic details are unclear - this is, in part, due to the extensive new biology that is described. Nevertheless, the data used to support some key points in this study are unclear:
  
  We fully agree that some details of the molecular mechanisms are yet to be elucidated for the CD133+ vesicles (which we named intercellsomes). This is the first report of a new direct cell-cell communication mechanism provoked as a stress response to proliferative signal deficit.
  
  Remarkably, many questions remain open for the molecular mechanisms for formation and functions of relatively well-characterized structures such as exosomes/EVs, despite a huge body of literature since their discoveries.
  
  a) What is the evidence that the observed CD133 groups of cells are not due to clonal growth? Is this conclusion based on the time course (the groups appear more rapidly than proliferation) or is this based on the GFP clonal analysis?
  
  This is indeed a very critical point for this study. Our initial thought and efforts were indeed on finding evidence that supports clonal expansion of progenitor cells. However, the experiments showed that the CD133+ cells were negative for all other stem/progenitor cell markers and that they are mature hepatocytes. CD133 expression was upregulated dramatically in regenerating livers and disappeared upon completion of liver regeneration. Furthermore, suppression of Ras-Erk signaling by Shp2 and Mek inhibitors robustly induced CD133 expression in a variety of cancer cell lines in culture in vitro.
  
  At 2 days after PHx, we already observed big colonies, which were unlikely derived from a single initiating cell (Figure 1). The GFP clonal analysis unambiguously demonstrated the heterogenous origin of the clustered cells (Figure 3). We detected mixed GFP-positive and -negative cells within each colony, without a single colony consisting entirely of GFP-positive cells. The original colony sizes were estimated to be 10 cells or more (Figures 3G and Figure 3–figure supplement 1B). Thus, both the sizes and compositions in the GFP clonal analyses support the assertion that CD133+ cell clusters originated from multiple mature hepatocytes.
  
  b) What is the evidence that the CD133 vesicles mediate intercellular communication? This is an exciting hypothesis, but what is the evidence that this happens? Is this inferred from IEG mRNA diversity, or from some other data? Is there direct evidence of transfer - for example, does the GFP clonal analysis show transfer of GFP that is not mediated by clonal proliferation? Moreover, since the hepatocytes are isogenic, what distinguishes the donor and recipient cells? Increased clarity concerning what is hypothesis and what is directly supported by data would improve the presentation of this study.
  
  Per the reviewer’s advice, we have clarified these points in the revised version. Our proposal that CD133 vesicles mediate intercellular communication was supported by these experimental results.
  
  A). Data in Fig. 5 suggest direct trafficking of the vesicles, as CD133 existed on the filaments that bridge the tightly contacting cells. This was confirmed by two different CD133 antibodies in mouse and human. Of note, CD133+ vesicles are negative for CD9, CD63 or CD81, markers for exosomes/EVs. We could only isolate CD133+ vesicles from cell lysates in vitro and mouse tissue lysates, but not from cell supernatants from which exosomes/EVs are isolated.
  
  B). More direct evidence of the transfer was presented in Fig. 6H, showing Myc-tagged CD133 molecules transferred from one cell to another. In response to reviewers’ comments, we have now conducted correlative light and electron microscopy to characterize the exchange event around the cell-cell border at EM level (new Figure 6-figure supplement 2).
  
  C). Further experimental evidence was provided in the single and double gene KO experiments in Fig. 8E-G, suggesting the functional significance of CD133 in intercellular communication.
  
  D). In addition to the data above, the IEG mRNA diversity analyses based on scRNA-seq support the mRNA exchange model. The isogenic CD133+ SKO hepatocytes were found to lack different IEG transcripts randomly. This is why we propose a mutually sharing model, rather than a donor and recipient model. Importantly, the mRNA diversity (entropy) model also illustrates the association of CD133 and “stemness", as described in the discussion.
  
  In sum, we believe that the most reasonable interpretation of the current data set is a model of direct cell-cell communication via CD133+ vesicles. We take the reviewer’s point and have made changes to the text to better distinguish conclusion from hypothesis; the latter will be validated in future studies.
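
  The notion of mRNA diversity (entropy) mentioned in point D can be illustrated with a generic Shannon entropy calculation over per-cell transcript counts. This is only a sketch of the general idea and is not the analysis used in the manuscript; the function name and the toy count vectors are hypothetical.

```python
import math


def shannon_entropy(counts):
    """Shannon entropy (bits) of a vector of non-negative transcript counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    entropy = 0.0
    for count in counts:
        if count > 0:
            p = count / total
            entropy -= p * math.log2(p)
    return entropy


# Toy cells with counts for four hypothetical IEG transcripts.
cell_even = [5, 5, 5, 5]     # all four transcripts present equally
cell_missing = [10, 10, 0, 0]  # two transcripts absent

print(shannon_entropy(cell_even))
print(shannon_entropy(cell_missing))
```

  In this toy setting, a cell lacking some transcripts has lower entropy than a cell carrying all of them, which is the sense in which sharing transcripts between cells would raise per-cell diversity.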
  
  Reviewer #2 (Public Review):
  
  The manuscript by Kaneko set out to understand the mechanisms underlying cell proliferation in hepatocytes lacking Shp2 signals. To do this, the authors focused on CD133 as the proliferating clusters of cells in the Shp2 knockout (SKO) livers are CD133 expressing. After excluding the contribution of progenitors that are CD133 to this cell population, the authors focused on the intrinsic regulation of CD133 by Met/Shp2 regulated Ras/Erk pathway and showed upregulation of CD133 to be a compensatory signal to overcome loss of Ras/Erk signal and suggested Wnt10a in the regulation of CD133 signal. The study then focused on the observed filament localization of CD133 in the CD133+ cluster of cells. The study went on to identify the CD133+ vesicles that contain primarily mRNA vs. microRNA like other EVs. Specifically, the authors identified several mRNA species that encode IEGs, indicating a potential role for these CD133+ vesicles in cell proliferation signal transmission to neighboring cells via delivery of the IEG mRNAs as cargos. Finally, they showed that the induction of CD133 (and by derivative, the CD133+ vesicles) are necessary for maintaining cell proliferation in the cell cluster with high proliferation capacities in the SKO livers; and in intestinal crypt organoids treated with Met inhibitors to block Ras/Erk signal.
  
  1) The identification of CD133+ vesicles is largely based on staining and costainings. Though the experiments are very well done with many controls and approaches, the authors may want to perform one or two key experiments with EM to definitively demonstrate the colocalization. For example, the mCherry experiment in Fig 6H and the colocalization experiments for CD133 and HuR in Fig 7.
  
  Many thanks for the suggestion. We have now completed the two suggested key experiments, with new results added to the revised manuscript. For the mCherry experiment, we conducted correlative light and electron microscopy to characterize the exchange event between cells that stably express CD133-GFP fusion protein and mCherry+ cells (new Figure 6-figure supplement 2). The CD133-GFP was clearly found in the mCherry+ cells around the border, demonstrating the intercellular traffic. For the colocalization of CD133 and HuR, we performed double immunogold staining on the isolated vesicles, with the new results presented in the revised Figure 7-figure supplement 1D.
  
  2) Since CD133+ marks the 50 nm intercellsome defined by the authors, it is unclear what the CD133- vesicles used as controls are. Are they regular EVs that are larger in size? This needs better clarification as they are used as a control for many experiments such as Fig 7A.
  
  Per the advice, we added more explanation to the revised text. We used regular EVs as the control, since they are the well-studied intercellular communication vesicles. Since the EVs are highly heterogenous, we did not choose to select a specific subpopulation of EVs. We used the well-established polymer-based precipitation method to isolate the EV fraction from cell culture supernatant for RNA-seq analysis. We did detect the enrichment of micro-RNAs in the isolated EVs, consistent with reports in the literature. Strikingly, the CD133 vesicles isolated from cell lysates showed a completely distinct RNA profile, relative to the EVs.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.05.16.492226v3
www.biorxiv.org

New submission 23/08/2023, 09:37:27

1
1. Public_Reviews 25 Aug 2023
  
  in eLife
  
  Author Response:
  
  We thank the reviewers for their constructive comments. Below we include a point by point response.
  
  Reviewer #1 (Public Review):
  
  [...] Elaborate on the Methodology: Provide an in-depth explanation of the two active learning batch selection methods, including algorithmic details, implementation considerations, and any specific assumptions made. This will enable readers to better comprehend and evaluate the proposed techniques.
  
  We thank the reviewer for this suggestion. Following this comment, we will extend the text in Methods (in Section: Batch selection via determinant maximization and Section: Approximation of the posterior distribution) and in Supporting Methods (Section: Toy example). We will also include the pseudocode for the Batch optimization method.
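
  To give readers a sense of what batch selection via determinant maximization can look like, here is a minimal greedy sketch. It is not the authors' pseudocode or the ALIEN implementation: it assumes a precomputed covariance matrix of model predictions over candidate samples and greedily adds the candidate that most increases the determinant of the selected submatrix. All names and the toy covariance values are hypothetical.

```python
def determinant(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def greedy_max_det_batch(cov, batch_size):
    """Greedily pick indices whose covariance submatrix has a large determinant."""
    selected = []
    remaining = list(range(len(cov)))
    for _ in range(batch_size):
        def score(i):
            idx = selected + [i]
            sub = [[cov[r][c] for c in idx] for r in idx]
            return determinant(sub)
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy covariance: candidates 0 and 1 are uncertain but highly correlated,
# candidate 2 is slightly less uncertain but independent of both.
toy_cov = [
    [1.0, 0.95, 0.0],
    [0.95, 1.0, 0.0],
    [0.0, 0.0, 0.8],
]
print(greedy_max_det_batch(toy_cov, batch_size=2))
```

  The determinant rewards batches that are both uncertain and mutually uncorrelated, so the second pick here is the independent candidate rather than the near-duplicate of the first. Greedy selection is an approximation; it does not guarantee the globally optimal batch.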
  
  Clarify Evaluation Metrics: Clearly specify the evaluation metrics employed in the study to measure the performance of the active learning methods. Additionally, conduct statistical tests to establish the significance of the improvements observed over existing batch selection methods.
  
  Following this comment we will add to Table 1 details about the way we computed the cutoff times for the different methods. We will also provide more details on the statistics we performed to determine the significance of these differences.
  
  Enhance Reproducibility: To facilitate the reproducibility of the study, consider sharing the code, data, and resources necessary for readers to replicate the experiments. This will allow researchers in the field to validate and build upon your work more effectively.
  
  This is something we already included with the original submission. The code is publicly available. In fact, we provide a Python library, ALIEN (Active Learning in data Exploration), which is published on the Sanofi GitHub (https://github.com/Sanofi-Public/Alien). We also provide details on the public data used and expect to provide the internal data as well. We included a small paragraph on code and data availability.
  
  Reviewer #2 (Public Review):
  
  [...] I would expect to see a comparison regarding other regression metrics and considering the applicability domain of models which are two essential topics for the drug design modelers community.
  
  We want to thank the reviewer for these comments. We will provide a detailed response to their specific comments when we resubmit.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.07.26.550653v3
www.biorxiv.org

New submission 25/08/2023, 09:35:06

1
1. Public_Reviews 25 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Recommendations For The Authors):
  
  Introduction: "In plants, IP7 and IP8 decrease upon Pi starvation and mutants lacking either of the enzymes necessary for their synthesis induce the phosphate starvation response, leaving it open whether IP7, IP8, or both, control the Pi starvation program (10, 13, 14)" In plants, ITPK enzymes catalyze the formation of 5-InsP7 from InsP6 https://pubmed.ncbi.nlm.nih.gov/34274522/ and VIH enzymes the formation of 1,5-InsP8 from 5-InsP7 https://pubmed.ncbi.nlm.nih.gov/25901085/ . Loss-of-function of both ITPK1 https://pubmed.ncbi.nlm.nih.gov/34274522/ and of VIH1/2 https://pubmed.ncbi.nlm.nih.gov/31436531/ https://pubmed.ncbi.nlm.nih.gov/31419530/ results in constitutive Pi starvation responses. vih1/vih2 mutants lack 1,5-InsP8 and hyperaccumulate 5-InsP7 https://pubmed.ncbi.nlm.nih.gov/31436531/ and act directly upstream of the transcription factors PHR1/PHL1 https://pubmed.ncbi.nlm.nih.gov/31436531/ https://pubmed.ncbi.nlm.nih.gov/33452263/ https://pubmed.ncbi.nlm.nih.gov/34857773/ , the case for 1,5-InsP8 is settled in plants. I would suggest revising this statement and citing all relevant literature.
  
  We don't see the case for 1,5-IP8 as settled in plants, and none of the papers mentioned above draws this strong conclusion. This may be due to several limitations in the available data. The mentioned studies do not allow the effects of 1-IP7 and 1,5-IP8 to be differentiated and, where binding or competition experiments have been performed, e.g. on the transcription factors, the differences in the Kd values for IP7 and IP8 were minor. Furthermore, 1,5-IP8 levels and Pi starvation response do not always correlate. ITPK1 mutants, for example, show Pi overaccumulation, and low 5-IP7, but normal 1,5-IP8 (Riemer et al., 2021). Finally, plants are complex organisms with multiple tissue types that serve for accumulating, exporting, transporting or finally consuming Pi. Therefore, correlating inositol pyrophosphate levels from whole-plant extracts with a Pi starvation response is problematic, unless both sets of data can be obtained from the same cell types or at least the same tissues.
  
  The comment of the reviewer made us recognize that the complex situation in plants deserves a more detailed coverage and we have therefore adjusted the introduction accordingly.
  
  Results: "We determined the corresponding lysines in Pho81 (Fig. S3), created a point mutation in the genomic PHO81 locus that substitutes one of them, K154, by alanine, and investigated the impact on the PHO pathway."
  
  In my opinion, it would be important to test here in a quantitative in vitro binding assay if (i) the SPX domain of Pho81 can bind PP-InsPs including 1,5-InsP8, (ii) if the dissociation constant is in agreement with the cellular levels of 1,5-InsP8 in yeast (compare Fig. 2) and (iii) if the K154A mutation blocks or reduces the binding of 1,5-InsP8. Without such experimentation, I find the statement "this result underlines the efficiency of the K154A substitution in preventing PP-IP binding to the Pho81 SPX domain." to be overly speculative, as no binding experiment has been conducted.
  
  We agree with the comment of the reviewer concerning the overstatement in the phrase. It has been deleted.
  
  As mentioned already in our previous work (Wild et al., 2016), Pho81SPX counts among the SPX domains that we could not express recombinantly. Likewise, full-length Pho81, which would be the relevant object for correlating in vitro binding studies with the cellular concentrations, has not been accessible. Expression in yeast did not provide sufficient material for ITC or other quantitative techniques. Therefore, we refrained from pursuing binding studies. Nevertheless, given the high conservation of the positively charged patch on SPX domains and the fact that, in every case where it has been tested so far, SPX domains showed inositol polyphosphate binding activity, we find it a conservative assumption that the Pho81SPX binds them as well. This is supported by the effects of the binding site mutant, which mimics the effect of ablating IP8 synthesis.
  
  Results: "Inositol pyrophosphate binding to the SPX domain labilizes the Pho81-Pho80 interaction." Again, in the absence of any protein-protein interaction assay I find this statement not to be supported by the experiments outlined in the manuscript. The best way to address this point would be to perform either co-IP or in vitro pull-down experiments between Pho81-SPX and Pho81-85, in the presence and absence of 1,5-InsP8 and/or using the Pho81 point-mutants described in the text.
  
  Since Pho81 could not be produced recombinantly, neither by us nor by others who worked on this protein previously, quantitative in vitro binding assays are not accessible for now. A simple IP suffers from the problem that Pho81 interacts with Pho85-Pho80 not only through the SPX domain but also through the minimum domain. The latter interaction may be constitutive. Since the main point of the manuscript is not to dissect the exact mechanisms of Pho85-Pho80 regulation, but only to address why the postulated inactivation of this kinase by a 1-IP7/minimum domain complex makes no sense, we prefer not to show a profound (and more complex) analysis of how the different Pho81 domains contribute to binding.
  
  To test the potential of the SPX domain for binding Pho85/Pho80 in vivo, we have created a GFP-fusion of the SPX domain of Pho81. This fusion protein localizes mainly to the cytosol when cells are on high-Pi. Upon Pi starvation, it concentrates in the nucleus. This concentration is not observed in a pho80 mutant background (New Fig. S7).
  
  In line with this, I would suggest to move the molecular modelling/docking studies from the discussion into the results section and to use these models to design some interface mutations that could be tested in coIP and/or pull-down assays. Alternatively, the authors may choose to omit the discussion section starting with: "Even though the minimum domain is unlikely to function as a receptor for PP-IPs this does not ... and ending with . In sum, multiple lines of evidence support the view that the SPX domain exerts dominant, 1,5-IP8 mediated control over Pho81 activity in response to Pi availability."
  
  We have now moved the modelling data to the Results section. The structure prediction of the interface is experimentally validated. Data on the effect of interface substitutions are already published, although these substitutions had not been recognized as affecting a common interface at the time. Substituting the interface residues either on the side of Pho80 or of Pho81 constitutively activates Pho85-Pho80 kinase and destabilizes its interaction with Pho81. This was shown by Co-IP experiments from cell extracts by Huang et al. We mention the respective substitutions in the manuscript and cite the paper in which their effect on PHO pathway activation had been described.
  
  Reviewer #2 (Recommendations For The Authors):
  
  Some points need additional attention by the authors:
  
  In general, it would be helpful to introduce abbreviations more thoroughly (certain enzyme names, PA, MD, ...)
  
  We paid more attention to this.
  
  Also in general, the authors may want to think about the nomenclature of inositol pyrophosphates. Given the expansion of PP-IPs that are being detected in different organisms these days it may be a good time to convert to a more precise nomenclature, i.e. 5PP-IP5 instead of 5-IP7; and 1,5(PP)2-IP4, instead of 1,5-IP8. The latter could just be stated once, and then be abbreviated as IP8.
  
  To our understanding the field has not yet come up with a unified nomenclature. Therefore, we prefer to stick with the more practical nomenclature that we have chosen, which also corresponds to what is commonly used in presentations and discussions among colleagues. We have now introduced a sentence making the link to the nomenclature that the reviewer has proposed.
  
  p. 1, Abstract: "negative bioenergetic impacts" - the phrasing seems really vague
  
  Agreed, but we find it difficult to be more explicit and precise in the abstract while remaining concise and not distracting from the main message. This aspect is better explained in the introduction.
  
  p. 3, Significance statement: "... unified model across all eukaryotic kingdoms" While the intended meaning of this wording is better explained in the text later, the phrasing here suggests a more all-encompassing study at hand, instead of a conclusion that fits more closely with established reports from other organisms. Please rephrase.
  
  We have adapted the phrase to avoid this impression.
  
  p. 4: "IPTKs" - are the ITPKs meant here?
  
  Yes, that was a typo.
  
  p. 7, the introduction ends abruptly and could use a concluding sentence.
  
  Done
  
  p.7, "enzymes diphosphorylation either the..."; I understand what the authors are trying to say with diphosphorylating, but the enzymes are phosphorylating a phosphorylated substrate.
  
  Yes. We changed the phrase to "....adding phosphate groups at the 1- or 5-positions....".
  
  p. 7, subtitle "...concentrations and kinetics of..."; kinetics of what? Synthesis/turnover?
  
  We corrected this subtitle
  
  p. 8, with regards to the recovery experiment: Was this recovery determined elsewhere (please cite)? Otherwise it would be beneficial to include an extra figure to illustrate these recoveries in the supplementary information. And do the authors suspect some hydrolysis of IP8 given the lower recovery?
  
  We have now added the experiment testing recovery of IPPs as the new Fig. S1.
  
  p. 9: It is appreciated that the authors point out the concentration of IP6 in S. cerevisiae. I found that concentration rather low, and the authors could highlight this a bit more, given their ability to carry out absolute quantification.
  
  This was a leftover from a previous version of the paper. Since the paper does not treat IP6 or lower inositol polyphosphates, we have deleted this phrase.
  
  p. 9, Fig 2: The exponential decay of 5-IP7 is very nicely shown in Figure 2c. But one of the most important discussion points is IP8 being the key controller of the PHO pathway - it would therefore be beneficial for the argument to also show the same kind of graph for IP8 and if possible, fit a function to the data points to better quantify and compare the decay processes (e.g. via "half-life time" of PP-IPs during starvation, in addition to the suggested "critical concentration" which was only discussed for 5-IP7 thus far).
  
  Kinetic resolution is an issue here. The approach shown in Figs. 2 and 5 is not apt to determine a critical concentration of IP8 because the decline upon transfer to starvation conditions is too fast and difficult to relate to the equally rapid induction of the PHO pathway. We shall address this point in a more appropriate setup in a future study.
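  
  The half-life comparison suggested by the reviewer can be sketched as a log-linear least-squares fit of a single exponential decay. This is only an illustration under stated assumptions: the time points and concentrations below are hypothetical, noise-free values and not data from the manuscript, and the function name fit_exponential_decay is ours. With a decline as fast as described above, few time points fall on the decaying part of the curve, which limits how well such a fit is constrained.

```python
import math

def fit_exponential_decay(times, concentrations):
    """Fit c(t) = c0 * exp(-k * t) by least squares on ln(c).

    Returns (c0, k, half_life). All concentrations must be > 0.
    """
    logs = [math.log(c) for c in concentrations]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    # Ordinary least-squares slope and intercept of ln(c) against t.
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    slope = sxy / sxx
    k = -slope                                # first-order decay constant
    c0 = math.exp(mean_y - slope * mean_t)    # concentration at t = 0
    return c0, k, math.log(2) / k

# Hypothetical, noise-free example values (minutes, micromolar).
times_min = [0, 5, 10, 20, 30]
conc_um = [0.70 * math.exp(-0.10 * t) for t in times_min]

c0, k, t_half = fit_exponential_decay(times_min, conc_um)
print(f"c0 = {c0:.2f} uM, k = {k:.3f} per min, half-life = {t_half:.1f} min")
```

  Fitting on the logarithm keeps the sketch dependency-free, but it weights low concentrations heavily; for real measurements near the detection limit, a nonlinear fit on the untransformed values may be preferable.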
  
  p.9, Fig 2a: Where does the 5-IP7 come from in the kcs1Δ strain? In the text the authors state that 5-IP7 in kcs1Δ was not detected, but the figure suggests otherwise. Please explain.
  
  Currently, we do not know where these residual signals stem from. One possibility is that they represent other isomers that exist in minor concentrations and that are not resolved from 5-IP7 in CE. We added a sentence to the figure legend to indicate this.
  
  p. 10: "IP8 was undetectable in kcs1Δ and decreased by 75% in vip1Δ. kcs1Δ mutants also showed a 2 to 3-fold decrease in 1-IP7, suggesting that the synthesis of 1-IP7 depends on 5-IP7. This might be explained by assuming that a significant source of 1-IP7 is synthesis of 1,5-IP8 through successive action of Kcs1 and Vip1, followed by dephosphorylation to 1-IP7." - Please specify this statement. Do the authors mean that 1,5-IP8 is only produced transiently below the detection capabilities of the method but that there still is a (reduced) flux from 5-IP7 to 1,5-IP8 to 1-IP7? Otherwise it would seem paradoxical to have a dependency on a non-existing metabolite in that cell line.
  
  This was not clearly expressed. The revised version now says: " ... a 2 to 3-fold decrease in 1-IP7, suggesting that the synthesis of 1-IP7 depends on 5-IP7. This might be explained by assuming that, in the wildtype, most 1-IP7 stems from the conversion of 5-IP7 to 1,5-IP8, followed by dephosphorylation of 1,5-IP8 to 1-IP7.". We hope that this clarifies the matter.
  
  p. 10: "pulse-labeling approaches are not available for PP-IPs." While this statement is correct, a recent paper co-authored by Qui and Jessen showed nice pulse-labeling data for the lower IPs and could be cited here (PMID: 36589890).
  
  Yes, indeed, we should have been more precise here. What we wanted to express was that rapid pulse-labeling methods for following phosphate group turnover were lacking, with a temporal resolution of minutes rather than hours. Existing pulse labeling approaches, including the study mentioned by the reviewer, do not provide that. We have changed the phrase accordingly.
  
  p. 10: continuation of caption of Fig 2: "were extracted [and] analyzed"
  
  Corrected. Thank you.
  
  p. 12: How is 1-IP7 made in the vip1 kcs1 double mutant?
  
  As explained above, we suspect that these may be side products of IPMKs, which accumulate in the absence of vip1 phosphatase.
  
  p. 13, caption to Figure 3: "XXX cells were analyzed" please replace the place holder XXX.
  
  Done. Thank you.
  
  p. 13, Fig 3B, C, D and p. 50, Fig. S4: On screen the contrast between the different shades of grey of the bars are just visible enough, but not on paper, I suggest using a higher contrast/ different colouring scheme.
  
  We enhanced the contrast.
  
  p. 24, 25, Fig 7.: I could not really appreciate the AlphaFold part, and found it unnecessary. No docking or molecular dynamics simulations were carried out here, and it was not clear to me what information should be gleaned from this part.
  
  Following this comment, we have modified the respective part of the text. This part refers to a publication from the O'Shea lab (Nat. Chem Biol. 4,25) proposing the model that 1-IP7 and the Pho81 minimum domain bind competitively to the active site of Pho85 to inhibit its kinase activity. Modeling of complexes between Pho81, Pho80 and Pho85, which we present in the manuscript, rather suggests binding of the minimum domain to a groove in Pho80. This is important because it provides a viable alternative model for the action of the minimum domain. It suggests the minimum domain as a constitutive linker that attaches Pho80 to Pho85. Importantly, this model accounts perfectly for the results of previous random mutagenesis studies on Pho80 and on the minimum domain, which had independently identified both the Pho80 groove and the minimum domain residues that bind it in the prediction as critical residues for inhibition of Pho85, and for integrity of the Pho85/Pho80/Pho81 complex. We find this alternative explanation for Pho85-Pho80 regulation by Pho81, which we can derive by combining the predictions with already published experimental data, an important element to re-evaluate the relevance of 1-IP7 in PHO pathway regulation and resolve one of the existing discrepancies.
  
  p. 28: No experiments were carried out with plants or mammals. The relevance for plants or mammalian systems therefore seems to be overstated at this point in time.
  
  We are not quite sure how to interpret this remark. We do not claim that our data support a role for IP8 in mammals and plants. But we refer to and cite studies providing the strongest evidence in favor of it in these systems. The relevance of our current study lies in refuting seemingly strong evidence from yeast, which had been diametrically opposed to the data obtained in plants and mammals. The revision of the situation in yeast now paves the way to drawing a coherent concept for fungi, plants and mammals. We feel that this is important and should be underlined.
  
  p. 31: "300 mL of 3% ammonium" - 300 µL?
  
  Yes. Thank you.
  
  p. 45, CE-ESI-MS parameters: "1IP8"
  
  Corrected.
  
  p. 47: Figure S1: Please include more experimental details in the caption and/or methods section. Was a similar analysis software used as e.g. Figure S2 (NIS Elements Software)? Please also include all the analysis software in the Methods section under "fluorescence microscopy". Unless these additional experimental details already clarify the following point: Can the authors briefly comment on why the morphological determination in S1 requires trypan blue staining while in later experiments the yeast cells are readily recognized by the software in "simple" brightfield images?
  
  Trypan blue staining is not strictly required for this. It is just a simple method to fluorescently stain the cell wall. There are many other ways of delineating the cells. It could also have been done in a brightfield image.
  
  We updated the figure legend to better describe how these measurements were done and deposited the script and training file on figshare.
  
  p. 48: "can be downloaded from **" please insert the link once the script is available online.
  
  It has been deposited at Figshare under DOI 10.6084/m9.figshare.c.6700281
  
  Reviewer #3 (Recommendations For The Authors):
  
  1) Italicize the scientific names of the organisms; this was inconsistent throughout the manuscript. Also, gene names should be italicized; this was also inconsistent (e.g., p.12 "... did not induce the PHO84 and PHO5 [sic] promoters...).
  
  Done
  
  2) Summary of the Figure 2A data in the text (p.9) probably has swapped the determined concentrations for 1-IP7 and IP8 (0.3 µM or 0.5 µM) as compared with the data figure.
  
  Yes, indeed. We have corrected this.
  
  3) Figure 2A: which of the mutant PP-IP levels are significantly different from the WT control?
  
  We have now added asterisks to indicate the significance for every mutant.
  
  4) In the discussion on the data (Fig. 2A), I was tripped up by the verb tense in this phrase "5-IP7 has not been detected in the kcs1Δ mutant and 1-IP7 has been strongly reduced..."; I think you want to use the past tense "was" in both cases [as is used in the next sentence]. It made me wonder if there was a difference in the detection of 5-IP7 and IP8 in the kcs1Δ mutant, you could detect 5-IP7 but not IP8; if so, where did the 5-IP7 come from?
  
  We have corrected the tense. Thank you for highlighting this. As for the residual inositol pyrophosphate signal in kcs1Δ, we do not know its origin. One possibility, which we now mention in the text, is that it stems from IPMK side activity. It should be underlined that all signals disappear upon Pi starvation.
  
  5) Figure 2C, include the data points that the lines are built from (suggestion).
  
  We refrained from that for the line graphs. For reasons of consistency, we should do this for every line graph. If we did that, Fig. 4B would become quite hard to read.
  
  6) Figure 3B-D, please check that the stipples or hatches are in the figure - the printed copy lacked them although I could see them in the electronic version; this was also true for Figures 5 and 6 (I do not know if it is a printer issue, but other hatches were visible: e.g., not seen in S4 but seen in S5).
  
  They are visible in our copies, also after printing. They may have been lost during file conversion at the journal.
  
  7) The text description of the Pho4-yEGFP, Pho5-yEGFP and Pho84-yEGFP says that the kcs1Δ mutant "showed Pho4-yEGFP constitutively in the nucleus already ... and PHO5 and PHO84 were activated". However, the data is more complex than that: whereas the localization of Pho4-yEGFP is constitutively nuclear, there is a higher basal (repressed) expression of both Pho5 and Pho84 as well as increased expression of both proteins under -Pi conditions. What accounts for the increased expression when Pho4 is already nuclear? This is also seen in the vip1Δ kcs1Δ mutant.
  
  We agree with the reviewer, but we cannot explain this effect with certainty. One possibility could be a wider dysregulation of Pi metabolism in kcs1 mutants. To name a few possibilities: Wildtype cells have polyphosphate reserves that are gradually mobilized during the first hours of P-starvation. kcs1 mutants don't have those and might fall into a "deeper" state of starvation faster. It should be kept in mind that the starvation response is also regulated at the level of chromatin structure, and by antisense transcripts. The influence of kcs1 on these processes is unclear.
  
  8) Figure 9 legend: please add a definition of the MP region (in red) and include it more explicitly in the described model.
  
  We now mention the relevant region also in the legend and have labeled the relevant regions in the images (Huang et al., 2001).
  
  9) Figure S2 legend: information is missing (downloading link).
  
  It has been deposited at Figshare under DOI 10.6084/m9.figshare.c.6700281
  
  10) Figure S4 and S5, missing statistics.
  
  They have been added to the new Fig. S6, which interprets differences between strains and conditions. Fig. S4 (now S3) shows timecourses of IPPs down to zero. Adding statistics for all pairwise differences between the timepoints would be overkill.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.14.528555v3
www.biorxiv.org www.biorxiv.org

New submission 24/08/2023, 09:31:14

1
1. Public_Reviews 24 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  eLife assessment
  
  It is very important to find practical and efficient means in order to increase agricultural productivity. Drawing on data from variable field environments, this study provides a useful theoretical framework to identify new factors that could increase agricultural production. There is solid evidence to support the authors' claims, though following the fate of candidate species after introduction into rice fields would have strengthened the study. Plant biologists and ecologists working in nature and fields will find the work interesting.
  
  Thank you so much for your careful evaluation of our manuscript. We are very pleased to hear that you found our framework useful. We have revised our manuscript according to the "Recommendations for the Authors" to improve our manuscript.
  
  Public Review
  
  Reviewer #1 (Public Review):
  
  This manuscript describes the identification of organisms that influence rice growth and an attempt at validation. The analysis of eDNA from rice pots and a mimic field identifies organisms that promote rice growth. This approach is novel for the plant ecology field. However, the current results do not fully support the eDNA analysis-based detection of influential organisms.
  
  Thank you so much for evaluating our manuscript. We have carefully read and responded to your comments. We hope our responses resolve your concerns on our study.
  
  The strength of this manuscript is its attempt to apply eDNA analysis-based plant growth differentiation. The weakness is that the data and experimental set-up are too preliminary to support any conclusion. The experiments the authors attempted are ideal in principle. However, the data analysis did not reach the required standard. For example, eDNA analysis at different time points across rice growth stages yielded two organisms influential for rice growth. The authors then cultivated the two species and applied them to rice seedlings. Without an understanding of fitness and robustness, how can we know the effect of the two species on rice growth?
  
  We agree with your comments that we did not have the fitness data of the two species and/or rice seedlings. Thus, it is still difficult to obtain deep understanding of the mechanisms of our findings that the species introduced in the system would influence rice growth. Nonetheless, our study demonstrated the effectiveness of our research framework as we found evidence that the species that were discovered by the eDNA monitoring and time series analysis indeed cause changes in the system. We believe that the first step is to show that the framework is workable and that detailed understanding of the mechanisms or genetic pathway was not a focus of our study. To avoid misunderstanding, we have added several explanations regarding this point in L426–431 and L447. For example, in L426, we have added the following statement: "... the detailed dynamics of the two introduced species was unclear (i.e., the fate of the introduced species). This is particularly important for understanding how the introduced organisms affected rice performance...".
  
  The authors did not check the fate of two species after introducing into rice. If this is true, it is difficult to link between the rice gene expression after treatments and the effectiveness of two species. I think the validation experiment in 2019 needs to be re-conducted.
  
  We did not check the fate of the two species (except measuring the eDNA concentrations of the species), and it is true that we cannot show evidence of "how" these two species influence the rice gene expression. Understanding molecular mechanisms of the phenomenon that we found is important (especially from the viewpoint of molecular biology), but our primary objective was to demonstrate that our "eDNA x time series analysis" framework is feasible for detecting previously overlooked but influential organisms. To this end, we believe that we achieved our objective and repeating the validation experiment should be for a different purpose (i.e., for understanding molecular mechanisms). We have clarified these points in L426–431 and L447 as explained above.
  
  Reviewer #2 (Public Review):
  
  The manuscript "Detecting and validating influential organisms for rice growth: An ecological network approach" explores the influence of biotic and abiotic entities that are often neglected on rice growth. The study has a straightforward experimental design, and well thought hypothesis for explorations. Monitoring data is collected to infer relationships between species and the environment empirically. It is analyzed with an up-to-date statistical method. This allowed the manuscript to hypothesize and test the effects most influential entities in a controlled experiment.
  
  Thank you so much for your careful evaluations. We are pleased to see that you evaluated our manuscript positively. We have further revised our manuscript according to your comments and hope the revision has resolved your concerns.
  
  The manuscript is interesting and sets up a nice framework for future studies. In general, the manuscript can be improved significantly, when this workflow is smoothly connected and communicated how they follow each other more than the sequence and dates provided. It is valuable philosophical thinking, and the research community can benefit from this framework.
  
  Thank you for your suggestions. In order to improve the logical flow and readability of our manuscript, we have revised the descriptions of the workflow and clarified how the experimental and statistical steps were connected to each other. To do so, we have added brief explanations of what we did and how in the first sentence of each Results subsection (some of these explanations were only in Materials and Methods in the original manuscript). Also, we have moved all of the Supplementary Materials and Methods to the main text. We have thoroughly revised the manuscript, and we hope that all the parts of our manuscript have been connected more smoothly than in the original manuscript.
  
  I understand the length and format of the manuscript make it difficult to add more details, but I am sure it can refer to/clear some concepts/methods that might be new for the audience. How/why variables are selected as important parts of the system, a tiny bit of information about the nonlinear time series analysis in the early manuscript, and the biological reasoning behind these statistically driven decisions are some examples.
  
  We have explained how/why variables are selected (in L125), added more information about the nonlinear time series analysis (in L129 and L175), and added the biological reasoning behind the statistical decisions (L195).
  
  Reviewer #3 (Public Review):
  
  Most farming is done by subtracting or adding what people want based in nature. However, in nature, crops interact with various objects, and mostly we are unaware of their effects. In order to increase agricultural productivity, finding useful objects is very important. However, in an uncontrolled environment, it coexists with so many biological objects that it is very inefficient to verify them all experimentally. It is therefore necessary to develop an effective screening method to identify external environmental factors that can increase crop productivity. This study identified factors presumed to be important to crop growth based on metabarcoding analysis, field sampling, and non-linear analysis/information theory, and conducted a mesocosm experiment to verify them experimentally. In conclusion, the object proposed by the author did not increase rice yield, but rather rice growth rate.
  
  Thank you so much for your evaluation of our manuscript. We have revised our manuscript based on your comments, and hope it has been improved compared with the original version.
  
  Strength
  
  In actual field data, since many variables are involved in a specific phenomenon, it is necessary to effectively eliminate false positives. Based on the metabarcoding technique, various variables that may affect rice growth were quantitatively measured, although not perfectly, and the causal relationship between these variables and rice growth was analyzed by using information transfer analysis. Using this method, two new players capable of manipulating rice growth were verified, despite their unknown functions until now. I found this process to be very logical, and I think it will be valuable in subsequent ecological studies.
  
  We are very pleased to see that you found our framework is very logical and potentially beneficial for future ecological studies.
  
  Weaknesses
  
  CK treatment's effectiveness remains questionable. Rice's growth was clearly altered by CK treatment. The validation of the CK treatment itself is not clear compared to the GN treatment, and the transcriptome data analysis results do not show that DEG is not present. The possibility of a side effect caused by a variable that the author cannot control remains a possibility in this case. Even though this part is mentioned in Discussion, it is necessary to discuss various possibilities in more detail.
  
  We agree that the effectiveness of the CK treatment was questionable. We have added some more discussion about this point in L376: "The unclear effects of the CK treatment relative to those of the GN treatment could be due to the relatively unstable removal method (i.e., C. kiiensis larvae were manually removed by a hand net) or incomplete removal of the larvae (some larvae might have remained after the removal treatment)."
  
  Reviewer #1 (Recommendations For The Authors):
  
  Comment #1-1 This manuscript describes the identification of organisms that influence rice growth and an attempt at validation. The analysis of eDNA from rice pots and a mimic field identifies organisms that promote rice growth. This approach is novel for the plant ecology field. However, the current results do not fully support the eDNA analysis-based detection of influential organisms.
  
  Thank you for your careful evaluations of our manuscript. We are pleased to see you found that our approach is novel. We have revised our manuscript in accordance with your comments, and we hope that the revision and responses resolved your concerns.
  
  Comment #1-2 1. Experimental setting: The authors set up a small-scale pot system in 2017 and then expanded it into a manipulative experiment. I do not understand how two influential organism sequences were identified from a single treatment at different time points. How can they be convinced that the two organisms, rather than other biological and environmental factors, affect rice growth?
  
  In 2017, we performed intensive monitoring of the experimental rice plots and obtained a large time series data set (122 consecutive days of monitoring x 5 plots = 610 data points). The time series data were analyzed using an information-theoretic causal analysis. This analysis differs critically from correlational analyses and is designed to identify causal relationships among variables. Although we understand that field manipulation experiments are a common and straightforward approach to identify causal relationships among organisms, we chose the "field-monitoring + time-series-based causal analysis" approach. This is because, as explained in the main text, there are numerous factors that could influence rice performance, and it is practically impossible to perform manipulative experiments for all the potential factors that could influence rice growth. On the other hand, our "field-monitoring + time-series-based causal analysis" approach has the potential to identify multiple factors under field conditions, even with a single experimental treatment.
  
  Nonetheless, we must admit that our time-series-based approach still has a chance to misidentify causal factors. Our framework relies on statistics, so the chance of false-positive detection of causality cannot be zero. This was exactly the reason why we performed the "validation" experiment in 2019. To complement the statistical results of the 2017 experiments, we performed another experiment in 2019.
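  
  To illustrate how an information-theoretic measure differs from a correlation, the following minimal sketch computes a plug-in transfer entropy between two discrete time series. It is a simplified stand-in and not the causal analysis used in the study; the function name, the history length of one step, and the synthetic binary series are all assumptions made for illustration.

```python
import math
import random
from collections import Counter

def transfer_entropy(source, target):
    """Plug-in transfer entropy (in bits) from source to target.

    Uses a history length of one time step and assumes both series are
    discrete and of equal length.
    """
    # Each triple is (target at t+1, target at t, source at t).
    triples = list(zip(target[1:], target[:-1], source[:-1]))
    n = len(triples)
    c_full = Counter(triples)
    c_past = Counter((y0, x0) for _, y0, x0 in triples)
    c_pair = Counter((y1, y0) for y1, y0, _ in triples)
    c_self = Counter(y0 for _, y0, _ in triples)
    te = 0.0
    for (y1, y0, x0), count in c_full.items():
        p_joint = count / n                          # p(y1, y0, x0)
        p_given_both = count / c_past[(y0, x0)]      # p(y1 | y0, x0)
        p_given_self = c_pair[(y1, y0)] / c_self[y0] # p(y1 | y0)
        te += p_joint * math.log2(p_given_both / p_given_self)
    return te

# Synthetic example: y copies x with a lag of one step, so information
# flows from x to y but not from y to x.
random.seed(0)
x = [random.randint(0, 1) for _ in range(2000)]
y = [0] + x[:-1]

te_xy = transfer_entropy(x, y)
te_yx = transfer_entropy(y, x)
print(f"TE x->y = {te_xy:.3f} bits, TE y->x = {te_yx:.3f} bits")
```

  The measure is asymmetric, which is what allows a direction of influence to be inferred. Because the plug-in estimate is positively biased on finite samples, a real analysis would compare it against surrogate data, which is one reason statistical false positives cannot be excluded entirely.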
  
  Comment #1-3 2. eDNA technology: The eDNA analysis based on four universal primers 16s rRNA, 18s rRNA, ITS, and COI regions must not be enough to identify specific species. The resolution of species classification may not meet to confirm exact species. Thus, the accuracy of two species that they selected for further experiment is difficult to be confirmed. Authors also referred to "putative Globisporangium".
  
  Your point is correct. The DNA barcoding regions we selected are short and it is often difficult to identify species. However, this limitation could not have been overcome even if we had chosen a different genetic marker. Long-read sequencing technology could partially solve the issue, but the number of sequence reads generated by the long-read technique is smaller than that generated by short-read sequencing technology, and comprehensive detection of all species in an ecological community would still be challenging. Our approach struck a balance among the identification resolution, comprehensiveness of the analysis, and sequencing costs. In addition, even though we could not identify most ASVs at the species level, some ASVs could be identified at the species level (52 ASVs among the 718 ASVs which had causal influences on rice growth), and we selected the two species (G. nunn and C. kiiensis) from the 52 species.
  
  Further, the taxonomic assignment algorithm we used here (i.e., Claident; Tanabe & Toju 2012 PLoS ONE 10.1371/journal.pone.0076910) adopts conservative criteria for species identification and has a low false-positive probability.
  
  More importantly, this is also the reason why we performed the "validation" experiment in 2019. The species identified in the 2017 experiment are still "potential" organisms that influence rice growth (i.e., the hypothesis-generating phase), and we tested the hypothesis in 2019.
  
  Nonetheless, we must admit that clear description of potential limitations is important. Thus, we have discussed this in L418: "As for the second issue, short-read sequencing has dominated current eDNA studies, but it is often not sufficient for lower-level taxonomic identification. Using long-read sequencing techniques (e.g., Oxford Nanopore MinION) for eDNA studies is a promising approach to overcome the second issue".
  
  Comment #1-4 3. Biological relevance 1: Authors identify two organisms as influencing organism for rice growth. As conducting the first experiment in 2017, the 2019 experiment was different from natural condition. The two experiments in 2017 and 2019 were conducted under different conditions. How do they compare the experiments? At least, the eDNA analyses in 2017 and 2019 should be very similar. I cannot find such data.
  
  The experimental conditions were different between 2017 and 2019 because the experiments were conducted in different years. Theoretically, it is ideal if the experimental conditions in 2019 are covered by the range of experimental conditions in 2017 (e.g., rice variety, air temperature, rainfall, and solar radiation). If this condition were satisfied, the attractor (i.e., rice growth trajectory delineated in the state space) in 2019 would be within that in 2017, and our model built from the 2017 data could be used to predict the dynamics in 2019 accurately. To fulfill the conditions, we made as much effort as possible: we used the same rice variety and soils in 2019 as those used in 2017, and started our experiment at the same time of year in 2019 as in 2017.
  
  Although natural ecological dynamics cannot be precisely controlled, our monitoring revealed that the ecological dynamics in 2019 were qualitatively similar to those in 2017. To demonstrate that the experimental conditions and eDNA community data were similar between the two experiments, we have presented the climate and eDNA data in an inset figure in Figure 3a, Figure 1–figure supplement 2, and Figure 3–figure supplement 2. We must admit that these dynamics are not identical, but we hope that this resolves your concern.
  
  Comment #1-5 4. Lack of detail description: In the Materials and Methods, there are many parts which lack on detail description. For instance, authors must described the two species cultivation, application concentrations, and application methods.
  
  We have moved Supplementary Materials and Methods to the main text and added more detailed descriptions in Materials and Methods. Also, to improve the logical flow and readability of our manuscript, we have added brief explanations about what/how we did at the first sentence of Results subsections (some of these explanations were only in Materials and Methods in the original manuscript). We have added the reference for how to cultivate G. nunn in L608 (Kobayashi et al., 2010; Tojo et al., 1993) (C. kiiensis was not cultivated but removed from the system as in Materials and Methods), and application concentrations. Application methods were described in Materials and Methods, the section Field manipulation experiments in 2019 in L596.
  
  Comment #1-6 5. Validation: Application of one species clearly resulted to promote rice growth. They must include appropriate control treatment. If they pick same genus but different species that identified no specific effect on rice growth through eDNA analysis, no effect on growth can be provided. Generally application of large population of certain non-harmful organism confer plant growth promotion. It is not surprising result. Authors need to prove effectiveness of eDNA analysis. In addition, the field experiments required at least two years of consistent data for publication because environmental factors are so dynamic.
  
  Thank you for pointing this out. We agree with your comment that species that were predicted to have no effect should not promote rice growth in a validation experiment. It was also one of our initial experimental plans to include such species in our manipulation experiment in 2019, but we could not include them because of the limitation of time, labor, and money. More extensive validation of the statistical results of the 2017 data, including multi-year experiments, would further validate the effectiveness of our approach, which should be done in future studies. To clarify this point, we have added statements in the paragraph starting at L396.
  
  Comment #1-7 In conclusion, I suggest that authors need more large data analysis and validate with more accurate and meaningful protocol.
  
  As we explained in the revised manuscript and the Response to Comments #1-2 to #1-7, our study demonstrated a novel research framework to detect previously overlooked influential organisms under field conditions. We agree that larger data analysis would be ideal to further validate our approach, but whether and how to collect larger data is constrained by time, money, and labor. We believe that our study was designed carefully and could provide meaningful avenues for developing novel, ecological-network-based, and environmentally friendly agricultural solutions.
  
  Reviewer #2 (Recommendations For The Authors):
  
  Comment #2-1 Lines 97-110: This is so cool. Modeling with empirical data is very powerful. But a rice field is an open system consisting of metacommunity dynamics. Maybe a tiny bit of biological and biogeochemical background here would be good.
  
  Thank you for your comments. We have added a few examples of how and in which systems these methods were used to evaluate community dynamics and detect biological interactions in L109-L118.
  
  Comment #2-2 Lines 111-126: I like the summary of the study here. I think the influential species concept can be a little more elevated. Paine's famous keystone species work has been cited but a couple more pieces of literature can help to enhance the ecological importance of this work.
  
  We have explained the work by Paine (1966) a bit more and added one more paper that showed the effect of multiple predator species on the system dynamics at L88. We have also added a relevant sentence at L137 to emphasize the ecological/agricultural significance of our work.
  
  Comment #2-3 Experimental design/Figure 1:
  
  Is there any rationale behind choosing red individuals to measure the growth?
  
  Is there any competition between the individuals in the pots?
  
  Figure 1e: It is nice to show the ASVs in time. I wonder how the plot would look like when normalized by biomass/DNA content/coverage/rarefaction because of the seasonality.
  
  As for the first question, we chose the four individuals to minimize the edge effects (i.e., effects of microclimates and neighboring rice would be different between the four rice individuals and those planted in the edge regions). We have mentioned this in the legend of Figure 1.
  
  As for the second question, there might be competition among the individuals in the pot. However, we did not measure the effect of competition (e.g., by comparing the growth with/without other rice individuals).
  
  As for the third question, we published detailed dynamics of ecological community in the Supplementary Figures in Ushio (2022) Proceedings B https://doi.org/10.6084/m9.figshare.c.5842766.v1. In addition, we have uploaded a video showing the temporal dynamics of some top (= most abundant) ASVs in https://doi.org/10.6084/m9.figshare.23514150.v2.
  
  We have mentioned the supporting information in L153.
  
  Comment #2-4 Line 146-147: Is this damage influence the inferences? Maybe it is better to justify.
  
  While we occasionally observed physical damages, it is unlikely that they affected our causal inference because the changes in the rice heights due to the damages were smaller and less frequent than those due to growth. We have noted this at L151.
  
  Comment #2-5 Line 161-162: Maybe refer readers to the methods section where you explain UIC analysis. It'd be easier to interpret the figures.
  
  Mentioned.
  
  Comment #2-6 Line 175-176: I believe very brief information in the intro about the organisms might help explain the hypothesis and interpret the results better.
  
  We have included brief information of the two species at L197.
  
  Comment #2-7 Figure 2: Species interaction strength: Are these proxies to the Jacobians? Is there a threshold for the influence we can consider strong/weak? For example, influential species compared to diagonal elements of the Jacobians (intraspecies interactions) could be shown as a mean vertical line in Figure 2b.
  
  "Influences to rice growth" in Figure 2b is transfer entropy (TE) from a target ASV to rice growth. They are not proxies of the Jacobians, but they might positively correlate with the absolute value of the Jacobians. We have clarified this point in the legend (L953). More direct estimations of the Jacobian can be done using the MDR S-map method (Chang et al. 2021 DOI:10.1111/ele.13897), but we did not perform the MDR S-map in the present manuscript (see Ushio et al. 2023 https://doi.org/10.7554/eLife.85795 for the application of the MDR S-map). As for TE, there is no clear threshold to distinguish strong/weak interactions.
  
  Comment #2-8 Figure 2: Looking at panels c and d, it looks like there is a negative frequency selection between two influential species. Is it a reasonable observation?
  
  This is an interesting point. In this manuscript, we have not carefully examined the interspecific relationship between these two particular species. However, the interspecific interactions were examined in detail and reported in Ushio (2022) (Proceedings of the Royal Society B, DOI:10.1098/rspb.2021.2690). We re-checked the result in Ushio (2022); although there is a negative correlation between them, we did not find any (statistical) causal relationship between them.
  
  Comment #2-9 Line 209: What is t-SNE analysis? Because of the manuscript's format, maybe methods should be shortly referred to in the relevant section or explained in brackets.
  
  We have spelled out t-SNE.
  
  Comment #2-10 Line 212-214: Maybe briefly explain what the hypotheses are for the alternative analysis, and what is the contribution of the results to the study.
  
  We have added a brief explanation at L241: "Alternative statistical modeling that included the treatments (the control versus GN or CK treatments) and manipulation timing (i.e., before or after the manipulation), which simultaneously took the temporal changes of all the treatments into account, also showed qualitatively similar results (Supplementary file 4), further supporting the results."
  
  Comment #2-11 Figure 3b/c: Maybe species names as panel titles could be helpful. d: Treatment names with initials in the legend could be also helpful to read the plots.
  
  We have added species name as panel titles of Figure 3b,c. Treatment names were included in the legend of Figure 3.
  
  Comment #2-12 Line 233: Maybe mention why the manuscript uses the word "clear".
  
  We have mentioned this in L185.
  
  Comment #2-13 Line 234-236: I think that these alternative tests should be explained somewhere.
  
  We have revised the sentence so that it includes some explanations (L241). Also, we have referred to Materials and Methods.
  
  Comment #2-14 Figure 4: The title says ecological community compositions, and panels show the growth rates and cumulative growth.
  
  Thank you for pointing this out. This was a typo and we have corrected it.
  
  Comment #2-15 Lines 246-269: Can these expression patterns be transient and relevant to the time point that the sample is taken?
  
  Yes, these expression patterns were transient. We collected rice leaf samples for RNA-seq 1 day before the first manipulation and 1, 14, and 38 days after the third manipulation (see Supplementary file 3 for the sampling design). When we merged the pot locations, we observed no difference in the gene expression for samples 1 day before the first manipulation and 14 and 38 days after the third manipulation (except for two genes in samples 38 days after the manipulation), and thus, we consider the DEGs that appeared only in the short period after the manipulation. We have mentioned this in L278 and L383: "We found almost no DEGs for leaf samples taken one day before and 14 and 38 days after the third manipulation (the leaf sampling event 1, 3, and 4), suggesting that the influences of the treatments on the gene expression patterns were transient." (L278) and "These changes were observed relatively quickly and transient." (L383)
  
  Comment #2-16 I wonder if a conceptual framework figure would help to generalize the workflow that can be used for other studies.
  
  Thank you for your suggestion. Although we agree with your comment that such a figure would be helpful to generalize the workflow, we believe that our framework is clear and decided not to include it in the present manuscript. We might consider including such a figure (like Figure 1a in Ushio 2022) if we have an opportunity to write a review paper regarding this topic.
  
  Comment #2-17 Lines 329-335: I feel this information is unclear in the early manuscript. Maybe it's necessary to clearly communicate in the beginning.
  
  We have explained that we could not find any relevant information at least at the time we detected the ASVs in L189.
  
  Comment #2-18 Lines 336-337: Can these species be identified in the previous data set from the ASV sequences?
  
  Yes, these species were identified in the DNA data set obtained in 2017.
  
  Comment #2-19 Lines 387-397: Are there any measurements such as total biomass, and statistical methods to help with the eDNA bias and data compositionality?
  
  We have confirmed that our quantitative eDNA metabarcoding generates comparable results with the fluorescence-based method and quantitative PCR (e.g., see Supplementary Figures in Ushio 2022) (mentioned in L310 in the revised manuscript). However, at least in this study, we could not perform a direct comparison of the eDNA data with species abundance and/or biomass. This is partly because the number of our target species was too large (> 1,000 species). The accurate estimation of species abundance and/or biomass is one of our next goals.
  
  Comment #2-20 Line 472: Maybe mention transfer entropy somewhere in the early manuscript.
  
  We have mentioned this in L175.
  
  Comment #2-21 Lines 494-503: Maybe a summary of this reasoning should be mentioned somewhere in the early manuscript too.
  
  We have described a brief summary of the reasoning in L195.
  
  Comment #2-22 Lines 29-33 If this sentence is simplified it might be easier to follow.
  
  The sentence has been divided into two sentences in L28. Also, each sentence has been simplified.
  
  Comment #2-23 Line 38 Maybe "macrobes" can be explicitly mentioned. Fungi, protozoa, etc.
  
  Mentioned.
  
  Comment #2-24 Line 139: I am not sure if the date should be in the title.
  
  Similar monitoring was done in 2017 and 2019. Thus, we think the date is necessary in the section title.
  
  Comment #2-25 Figure 1: There are 4 red individuals in the design but 5 measurements in the plots.
  
  Heights and SPAD of the four individuals were measured for each plot and the averaged values were used as representative values for each plot. Therefore, 20 measurements (= 4 rice individuals × 5 plots) were done every day, but each plot has one rice height for each day. We have clarified this in the legend of Figure 1: "the average values of the four individuals were regarded as representative values for each plot."
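The averaging step described above can be illustrated with a minimal sketch. The plot names and height values below are hypothetical and chosen only for illustration; the only thing taken from the response is the design itself (4 individuals in each of 5 plots, averaged to one value per plot per day).

```python
from statistics import mean

# Hypothetical rice heights (cm) for a single day: 5 plots x 4 individuals.
# These numbers are made up for illustration, not data from the study.
heights_cm = {
    "plot1": [30.0, 31.0, 29.0, 30.0],
    "plot2": [32.0, 33.0, 31.0, 32.0],
    "plot3": [28.0, 29.0, 27.0, 28.0],
    "plot4": [35.0, 36.0, 34.0, 35.0],
    "plot5": [31.0, 30.0, 32.0, 31.0],
}


def plot_means(measurements):
    """Collapse the per-individual values into one representative value per plot."""
    return {plot: mean(values) for plot, values in measurements.items()}


daily_plot_heights = plot_means(heights_cm)
n_measurements = sum(len(values) for values in heights_cm.values())
print(n_measurements, len(daily_plot_heights))
```

Under these assumptions, 20 raw measurements reduce to 5 plotted values per day, which is why the figure shows five series rather than four.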
  
  Comment #2-26 Figure 1b: Maybe use the same axis length for the temperature as the other plots?
  
  Corrected.
  
  Comment #2-27 Lines 259-261: Are there the names of the genes in databases?
  
  Yes, these are gene names used in the rice databases (e.g., The Rice Annotation Project Database; https://rapdb.dna.affrc.go.jp/index.html).
  
  Reviewer #3 (Recommendations For The Authors):
  
  Comment #3-1 Additionally, RGR is not statistically significant, but statistical significance is observed only in cumulative growth because data presentation does not reflect plant characteristics. RGR changes according to the developmental stage of the plant. Therefore, if RGR data are shown separately according to the rice growing season, the cumulative growth pattern and the pattern will appear similar.
  
  RGRs were calculated daily (i.e., cm/day) and they changed depending on the developmental stage of the rice (Figure 1 and Figure 4–figure supplement 1). Therefore, we might find similar RGR patterns if we focus on a specific period of the growing season. Unfortunately, however, we performed the intensive (i.e., daily) monitoring in 2019 only during the field manipulation period (mid-June to mid-July 2019), and we cannot investigate the changes in cumulative growth throughout the growing season (this depends on how many days we add up RGR to calculate the cumulative growth, though). We agree that, if we had investigated the detailed pattern of RGR throughout the growing season in 2019, we could have found similar patterns between RGR and cumulative growth rate at a certain period in the growing season. In Figure 4, the cumulative growths were calculated based on the RGRs before the third manipulation or during 10 days after the third manipulation. We clarified this in the legend of Figure 4.
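The relationship between daily growth rate and cumulative growth can be sketched as follows. This is an illustration only: it assumes that the daily rate is the height difference between consecutive days and that cumulative growth is the sum of daily rates over a chosen window, which is one reading of the description above. The function names and height values are hypothetical.

```python
def daily_growth_rates(heights_cm):
    """Daily growth rate (cm/day), assumed here to be the difference
    between heights measured on consecutive days."""
    return [later - earlier for earlier, later in zip(heights_cm, heights_cm[1:])]


def cumulative_growth(rates, start, n_days):
    """Sum the daily rates over a window of n_days beginning at index start."""
    return sum(rates[start:start + n_days])


# Hypothetical daily heights (cm) for one plot; not data from the study.
heights = [40.0, 41.5, 43.0, 44.0, 46.0]
rates = daily_growth_rates(heights)
window_total = cumulative_growth(rates, start=1, n_days=3)
print(rates, window_total)
```

The sketch makes the caveat in the response concrete: the cumulative value depends on which days are included in the window, so a difference that is not significant for any single daily rate can still accumulate into a significant difference over several days.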
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.19.529115v2
www.biorxiv.org www.biorxiv.org

New submission 07/07/2023, 08:43:27

1
1. Public_Reviews 24 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the previous reviews
  
  Reviewer # 1 (Public Review)
  
  Specific comments
  
  1) For all cell-based assays using shRNA to knock down CRB3, it would be desirable to perform rescue experiments to ensure that the observed phenotype of CRB3 depleted cells is specific and not due to off-target effects of the shRNA.
  
  Thank you for your comments. Based on your suggestions, we performed the rescue experiments to observe any alterations in the primary cilia of CRB3-depleted MCF10A cells with overexpressed CRB3. The revised parts can be found in lines 186-188 and the new Supplementary Figure 3A-C has been added.
  
  2) Figure 3G: it is very difficult to see that the red stained structures are primary cilia.
  
  Yes, the stained primary cilia in the mammary ductal lumen are less clear than those in individual cells and in the renal tubule in Figure 3G. We used the established markers acetylated tubulin and γ-tubulin to stain the primary cilia, which were clearly labeled in individual cells. However, the labeled primary cilia in the renal tubule were longer and had a more pronounced structure than those in the mammary ductal lumen. In the mammary ductal lumen of the 10 mice we analyzed, the primary cilia were shorter and less distinctly stained than the others shown in Figure 3G. This difference may be due to the distinct characteristics of primary cilia in different tissues.
  
  3) Figure 5A: it is unfortunate the authors chose not to show the original dataset (Excel file) used for generating this figure; this makes it difficult to interpret the data. It is general policy of the journal to make source data accessible to the scientific community.
  
  In accordance with the journal policy, we have provided the original dataset (Excel file) for Figure 5A, as detailed in “Figure 5–Source Data 1”.
  
  4) The authors have a tendency to overinterpret their data, and not all claims put forth by the authors are fully supported by the data provided.
  
  We have carefully read through the whole text and have revised the overinterpretation parts. These parts can be found in lines 48-50, lines 93-95, and lines 260-261.
  
  Reviewer # 2 (Public Review)
  
  Thank you for recognizing and supporting our research for this manuscript.
  
  Reviewer # 1 (Recommendations For The Authors)
  
  1) Abstract line 48-51: data overinterpretation. The authors cannot claim this based on the data they are presenting. Please modify the statement/temper the claims.
  
  Thanks for your comments. We have revised this sentence in the abstract; see lines 48-50 for details.
  
  2) There are several grammatical errors throughout the manuscript. In particular, the following sentences/statements are either wrong, confusing or non-sensical: lines 55-56; lines 87-90; lines 93-95; lines 385-387; lines 409-410.
  
  Thanks for your positive comments. We have modified lines 55-56 to become new lines 54-55. These sentences in lines 87-90 and lines 93-95 are difficult to understand and logically problematic, so we have carefully revised this paragraph (new lines 85-90). Lines 385-387 have been deleted as they are non-sensical. Lines 409-410 contain misrepresentations. We have revised them in new lines 408-409.
  
  3) Lines 257-259: this is data over-interpretation. It is not correct to state CRB3 is highly dynamic without having done any live cell imaging.
  
  Thank you for your comments. We have revised this sentence, see revised lines 260-261 for details.
  
  4) Figure 8E: if cells do not make cilia when CRB3 is lost (Figure 3), how is it possible to analyze SMO localization to cilia in these cells?
  
  Thank you for your comments. We used immunofluorescence techniques, with acetylated tubulin and SMO co-staining, to analyze the localization of SMO to cilia. The results of immunofluorescent staining of primary cilium and statistical analysis in Figure 3 showed that the proportion of cells with primary cilium was significantly lower in the CRB3 knockdown group, but cells with primary cilium were still present. We used laser confocal microscopy micrographs to identify cells with primary cilium by staining acetylated tubulin, then analyzed the co-localization under the SMO channel, and finally analyzed the proportion of SMO-positive cilia. Several publications (J Cell Biol. 2020;219(6):e201904107; Science. 2008;320(5884):1777-81; Proc Natl Acad Sci U S A. 2012;109(34):13644-9.) have demonstrated that knocking down genes can affect primary cilium formation, and this method has also been used to examine the localization of SMO-related signaling pathway molecules on primary cilium.
  
  5) Lines 366-366: based on the relative low magnification of the images in Figure 8H it is difficult to assess the subcellular localization of GLI1 and whether there is a difference between wild type and the Crb3 mutant cells. For example, it is not clear if GLI1 is localizing to the centrosome-cilium axis. Please modify the text accordingly.
  
  Thank you for your good suggestions. As you mentioned, IHC cannot observe the subcellular localization of GLI1 on the centrosome-cilium axis. However, since GLI1 is a transcriptional effector at the terminal end of the Hh signaling pathway, we may not have made it clear that what we observed in the IHC results was the localization of GLI1 in the nucleus. Therefore, we have revised the description accordingly, as described in line 368 and lines 520-521.
  
  6) Figure 7D, E: the zoomed-in images look pixelated.
  
  Thank you for your positive comments. We have replaced these images in the new Figure 7D and E.
  
  7) Figure 8B: Acetylacte-tub is misspelled.
  
  Thank you for your comments. We have revised and standardized the acetylated tubulin stain to "Ace-tubulin" in all immunofluorescent images throughout the manuscript.
  
  Reviewer # 2 (Recommendations For The Authors)
  
  1) 1) CRB3 is present in mammals as 2 isoforms, A and B, originating from an alternative splicing. In this study, the authors never mention this fact and when using approaches to KO or KD CRB3A/B they are likely to deplete both isoforms which have been shown to have different C-terminal domains and functions (Fan et al., 2007). This is also important for the CRB3 antibodies used in the study since according to the material and methods section they are either against the extracellular domain common to both isoforms or the intracellular domain which is only similar in the domain close to transmembrane between the 2 isoforms. Since the antibodies used in each figure are not detailed it is impossible to know if the authors are detecting CRB3A or B or both. Please provide the information and correct for the actual isoform detected in the data and conclusions.
  
  From the revised version we know now that CRB3B is used for exogenous expression. It has been shown that each isoform has a different role and localization in cells so why focus only on CRB3B for this study?
  
  Thank you for your positive comments. First, previous literature has reported that CRB3b localizes in the primary cilium of MDCK cells. We have corrected the Introduction to specify CRB3b (line 81). Secondly, in the methodology section, we show that the CDS sequence of CRB3b was PCR-amplified from RNA extracted from MCF10A cells. We also designed primers specific to CRB3a but were unable to amplify it, indicating that CRB3b is significantly more expressed in epithelial cells than CRB3a. Finally, according to the company recommended by the GeneCards website for purchasing CRB3 cloning products, the only CRB3 sequence available in the CRB3 cDNA ORF Clone in Cloning Vector, Human (Cat: HG14324-G) from Sino Biological is CRB3b.
  
  2) 3) The authors use GFP-CRB3A/B, it is not stated which isoform, over-expression to localize CRB3A/B in MCF10A cells (figure 4A). The levels of expression appear to be very high in the GFP panel and it is likely that the secretory pathway of the cells is clogged with GFP-CRB3A/B in transit from the ER to the plasma membrane. Thus, the colocalization with pericentrin might be due to the accumulation of ER and Golgi around the centrosome. This colocalization should be done with the endogenous CRB3A/B and with a better resolution.
  
  The authors do not answer about the potential mislocalization of overexpressed exogenous protein.
  
  We acknowledge the reviewer's perspective. The large amount of exogenous protein overexpression in the cell could potentially obstruct the protein secretion pathway, resulting in the accumulation of the exogenous protein at the ER and Golgi. Such accumulation could create the false impression of co-localization between CRB3b and the centrosome. To provide additional details (lines 215-217 and lines 426-433), we re-expressed the results exogenously and subsequently used staining of endogenous CRB3 and γ-tubulin in Fig. 4C to confirm the co-localization of CRB3 and the centrosome.
  
  3) 4) The staining for CRB3A/B in Figure 4C (red) is striking with a very strong accumulation in an undefined intracellular structure and the authors do not provide any explanation for such a difference with the GFP-CRB3A/B just above.
  
  The authors explain that two different photonic techniques are used (classical versus confocal) but in a cell biology manuscript confocal microscopy is now the standard technique.
  
  Thank you for your comments. We have included a discussion on the partial concordance between CRB3's endogenous staining and exogenous expression results in the "Discussion" section, specifically in lines 420-435.
  
  4) 7) In addition, the authors claim (Line 251/252) that Rab11 is necessary for the transport of CRB3A/B but they should KD Rab11 to show this.
  
  The author's answer is that blocking endocytosis with dynasore is as good as knocking down Rab11 to show its interaction and role in CRB3A/B transport which is not the case.
  
  Thank you for your comments. As requested by the reviewers, we have conducted experiments to knock down Rab11 and detect CRB3 intracellular trafficking, as shown in the new Supplementary Figure 5B and the added lines 258-260. These results provide additional support for our conclusions.
  
  5) 8) The domain of CRB3A/B that is necessary for the interaction with Rab11 is the N-terminal part of the extracellular domain. This domain is thus inside the transport vesicles and not accessible from the cytoplasm. Given that Rab11 is a cytoplasmic protein, how the 2 proteins could interact across the membrane? The authors do not even discuss this essential point for their hypothesis. Comment on the revised version: the authors still do not understand the basic of cell biology since they claim that the extracellular domain of CRB3 can be in contact with Rab11 after endocytosis. Even after endocytosis the extracellular domain of CRB3A/B is inside the lumen of the endosome and not in contact with the cytosol where Rab11 is located. Lines 420-421 of the revised manuscript still claim this interaction between the two proteins without providing the link between the cytosol where Rab11 is and the endosome lumen where the extracellular domain of CRB3A/B is. Please correct.
  
  Thank you for your positive comments. After carefully studying the relevant knowledge, we strongly agree with the reviewer's point of view. We have toned down our claim and removed the description regarding the binding of Rab11 endosomes to specific structural domains of intracellular CRB3 that we were unable to confirm (see lines 443-444 and lines 465-466).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.15.528649v4
www.biorxiv.org www.biorxiv.org

New submission 07/08/2023, 10:10:16

1
1. Public_Reviews 23 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  In this manuscript, the authors explored the benefits of intermittent fasting on the cardiac physiology through a multi-omics approach and compared different fasting times (IF12; IF 16 and EOD) for a duration of 6 months. Combining the RNA-sequencing, proteomics and phosphor-proteomics analysis, the authors have made an interesting observation that different fasting times would lead to different changes that could be important for the cardiac physiology. Moreover, the changes observed at transcriptional level are different from protein level, suggesting a post-transcriptional regulation mechanism. Using western blot, the authors have confirmed the key signaling pathways, including AMPK, IRS pathway to be significantly altered upon intermittent fasting for 16hrs. Lastly, as a proof of concept for better cardiac function, the animals were challenged with dobutamine and echocardiography was performed to show the mice subjected to intermittent fasting have better cardiac systolic function.
  
  The impact of intermittent fasting on cardiovascular health has been well characterized in several studies. This report appears to be the first one utilizing a multi-omics approach and provided an interesting dataset at transcriptome, proteome and phosphor-proteome levels, and would serve as a valuable data resource for the field. I have the following concerns:
  
  Major concerns:
  
  1) The rationale for choosing the intermittent fasting pattern and timing While the 16:8 intermittent fasting is relatively standard, what is the rationale to test IF 12 hours? As a 4-hour fasting difference might not cause dramatic changes in transcriptome and proteome. Also, what is the rationale to perform 6 months study? The dobutamine stress test is not a terminal procedure, have the authors examined the cardiac function prior to 6 months to see whether there is a difference?
  
  We sincerely thank the reviewer for providing insightful comments and feedback on our study. The aim of our research is to gain a comprehensive understanding of molecular reprogramming in the heart during intermittent fasting using multi-omics techniques. We acknowledge the reviewer's concern regarding the selection of three different time points for intermittent fasting. Our rationale for choosing these time points was to align with the practices commonly used by researchers in the field. By doing so, we intended to explore and compare the effects of different intermittent fasting regimens on the heart. Through our study, we found that a longer fasting period resulted in the most significant changes in proteome abundance. Although we agree that a 4-hour difference in fasting may not significantly alter transcriptome and proteome expression, remarkable changes in post-translational modifications such as phosphorylation can occur over shorter time periods, as is evident from the analyses of the modulated phosphoproteins. Hence, we also included the 12-hour time point in our analysis. In fact, we would like to emphasize that all three fasting regimens had notable effects on pathways regulating cellular carbohydrate, lipid, and protein metabolism, cell-cell interactions, and myocardial cell contractility. Regarding the duration of our study, we opted for a 6-month duration of intermittent fasting to investigate the impact of chronic intermittent fasting on heart transcriptome and proteome changes. While shorter-term (2-3 months) intermittent fasting studies in animals have also shown beneficial effects, we wanted to delve deeper into the molecular alterations induced by long-term intermittent fasting. We acknowledge the reviewer's observation about the dobutamine stress test not being a terminal procedure.
  In our manuscript, we aimed to present extensive resource data offering molecular insights into intermittent fasting-induced structural and signaling changes in the heart, focusing on various intermittent fasting time intervals. Additionally, we included an assessment of cardiac function in response to intermittent fasting, specifically examining the intermittent fasting 16 hours (IF16) group, and highlighted key pathway modulations at this time point as supporting evidence. We appreciate the reviewer’s concern about examining cardiac function prior to 6 months. Although we did not perform this analysis in the current study, we fully agree that such a comparison is required for a better understanding of the temporal effects of molecular pathways in relation to heart function during the course of intermittent fasting.
  
  2) Lack of validation study. One interesting observation from this study is that the changes in the transcriptome do not reflect all the changes at the protein level and that there is a differential gene expression pattern in IF12, IF16 and EOD. If this is the case, the authors should select a few important targets and provide both mRNA and protein level analysis as a proof of concept for the accuracy of the bioinformatics analysis.
  
  We appreciate the reviewer's attention to the comparison of proteome and transcriptome data across different intermittent fasting regimens, as well as their interest in understanding any specific deviations in dietary regimens or sets of proteins. Indeed, it is well-established that post-transcriptional regulation can lead to discrepancies between mRNA and protein levels, primarily due to translational control or protein degradation mechanisms. Post-transcriptional buffering of proteins, particularly enzymes and kinases, is a plausible explanation, given their regulation through post-translational modifications such as phosphorylation, or through allosteric regulation. Although we observed only a modest correlation between the proteome and transcriptome data, which is generally common, we did identify certain enzymes, such as HMGC2, PDK4ACOT, CLPX, and RNASE4, with a high level of concordance between protein and mRNA abundances. These instances of agreement between the two data types suggest a coordinated regulation of these enzymes at the transcriptional and translational levels during intermittent fasting. To facilitate a clearer understanding of the correlation between proteome and transcriptome data, we have included correlation levels next to the scatter plots in our manuscript. These annotations aim to provide additional insights and aid readers in assessing the relationship between the two datasets.
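
  To make the "correlation levels" mentioned above concrete, the following is a minimal sketch of how a Pearson correlation between per-gene protein and mRNA fold changes could be computed. The response does not state which correlation coefficient or software the authors used, so the choice of Pearson's r, the function name `pearson_r`, and the example values are all assumptions made purely for illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers.

    Hypothetical helper for illustration; not the authors' pipeline.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares around the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Made-up log2 fold changes (fasting vs. ad libitum) for five genes.
protein_log2fc = [1.2, 0.8, -0.3, 0.1, -0.9]
mrna_log2fc = [1.0, 0.2, -0.1, 0.4, -1.1]
r = pearson_r(protein_log2fc, mrna_log2fc)
```

  A value of r near 1 would indicate that protein and transcript changes move together; values near 0 correspond to the modest agreement described in the response.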
  
  3) Poor western blot image quality. The quality of the western blot has several issues: a. the change of pAMPK/AMPK appears to be a decrease of total AMPK instead of change at p-AMPK level. Same with GSK3a/b. There appears to be an increase of total GSK3a/b. The AKT should also be blotted and quantified at phosphorylation level. The western blot should be clearly labeled, for the ones with double bands, including GSK3a/b, the author should clearly label which is GSK3a and which is GSK3b. For the IRS with non-specific band, the author should point out IRS-1 band itself.
  
  We appreciate the reviewer's careful evaluation of our study and acknowledge the concerns raised regarding the quality of the western blot images. Despite revising these experiments multiple times, we acknowledge that the immunoblot images may not meet the highest quality standards. We have included the original immunoblots in the supplementary section to ensure transparency and provide additional data for reference.
  
  Reviewer #2 (Public Review):
  
  This study provides an unbiased characterization of the cardiac proteome in the setting of intermittent fasting. The findings constitute a resource of quantitative proteomic data that sheds light on changes in cardiac function due to diet and that may be used in the future by other investigators. There are a number of key missing details that limit interpretation or present opportunities to strengthen the study.
  
  1) For example, the authors find that apolipoproteins are altered with fasting but it is not clear whether this is a contribution of myocardial tissue changes or systemic effects spilling into blood in cardiac tissues.
  
  We appreciate the reviewer's consideration of the potential effect of spilling blood on our study results. While we agree that such an effect is possible, we would like to emphasize that the observed overall changes in the proteome profile, particularly in pathways regulating metabolism and other cardiac remodeling-associated processes, suggest that the alterations we observed are more likely attributed to changes within the myocardial tissues themselves. We would like to highlight that blood microparticles or extracellular proteins were not enriched in our proteome data and hence the impact of blood spilling is not a concern. In fact, the biological processes we observed were mainly associated with ECM receptor interaction, focal adhesion and signaling pathways, which are not typical of a secreted or extracellular proteome arising from blood leakage.
  
  2) Some statements in the text like "Approximately one-third of the differentially expressed proteins in IF groups compared to AL were enzymes with catalytic activity involved in energy homeostasis pathways" do not appear to be supported by data.
  
  The enzymes among all the differentially expressed proteins in the intermittent fasting (IF) groups compared to the ad libitum (AL) control group are indicated in Supplementary Table S2. This constitutes one-third of the total number of differentially expressed proteins and several of these are involved in metabolic and energy homeostasis pathways.
  
  3) It is not clear how the list of Kinases were generated for Figure 1B.
  
  For the kinases indicated in Figure 1B, we first identified all kinases among the proteins that were differentially expressed in the different dietary regimens compared to the control ad libitum (AL) group (listed in Supplementary Table S2). We then performed enrichment analysis (FDR ≤ 0.05) of the identified kinases across KEGG pathways obtained from the DAVID bioinformatics resources.
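
  As a rough illustration of what a pathway enrichment test with an FDR cutoff involves, the sketch below computes a one-sided hypergeometric p-value per pathway and applies the Benjamini-Hochberg adjustment. This is not DAVID's implementation (DAVID uses its own modified Fisher exact statistic), and the function names and example numbers are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(k, n, K, N):
    """P(X >= k): chance of seeing at least k pathway members among n
    selected proteins, when K of the N background proteins are in the pathway."""
    upper = min(n, K)
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Made-up counts: (kinases in pathway, kinases selected, pathway size, background).
pathways = {"pathway_A": (6, 20, 40, 2000), "pathway_B": (1, 20, 150, 2000)}
raw_p = [hypergeom_enrichment_p(*counts) for counts in pathways.values()]
fdr = benjamini_hochberg(raw_p)
enriched = [name for name, q in zip(pathways, fdr) if q <= 0.05]
```

  Pathways whose adjusted value falls at or below 0.05 would be reported as enriched under this scheme.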
  
  4) Changes in chromatin or gene expression are not measured so the conclusion that EOD led to 'epigenetic changes' relative to IF16 is not well supported.
  
  We appreciate the reviewer's feedback. Our statement in the manuscript referred specifically to the changes observed in Figure 2, where we presented increased proteomic abundance in pathways related to chromatin remodeling, chromatin organization, gene expression regulation, and histone modification in the EOD (Every Other Day Fasting) group compared to the IF 16 (Intermittent Fasting for 16 hours) group based on functional process and pathway enrichment analysis. Our comprehensive bioinformatics analysis, depicted in Figure 2, provides intriguing insights into these pathways. We acknowledge that further validation and in-depth studies through additional experiments and functional assays are essential to strengthen the conclusion from such observations, which is beyond the scope of the current study. We thank the reviewer for such valuable suggestions that are very useful for our ongoing studies, where we aim to obtain a more robust and thorough understanding of the impact of intermittent fasting on chromatin-related processes.
  
  5) There are also a number of areas where the text is vague. For example, it is not clear what is meant by 'trend shift' when discussing EOD results, and Figure 3 generally could use additional information to help readers better understand the figure.
  
  We would like to clarify that the term 'trend shift' refers to the change in the direction of protein and transcript level alterations. Based on the 2-D enrichment analyses that revealed correlated and non-correlated functional processes at the proteome and transcriptome levels, it was evident that during the early intermittent fasting 12 hours (IF12) regimen, the abundance changes of the proteins and transcripts involved in these processes were altered in the same direction (Supplementary Fig. 4b). Nevertheless, with increased fasting hours, mainly in the Every Other Day Fasting (EOD) group, we observed that the levels of proteins and transcripts involved in several of the functional processes appeared to be non-correlated as compared to the IF12 group (Fig. 2d). In Figure 3, we summarize the overall altered protein networks associated with the different intermittent fasting regimens, highlighting densely connected clusters of proteins along with their associated biological processes and pathways. Additionally, we unravel the impact of intermittent fasting on transcriptional rewiring and highlight regimen-specific alterations of specific transcriptional factors, several of which were found to have metabolic implications.
  
  6) An interesting finding is that the IF16 groups showed cardiac hypertrophy (SFig 11b). This is potentially a novel finding and the text should elaborate more on this phenomenon.
  
  We sincerely thank the reviewer for bringing attention to this intriguing aspect of our study. The data you have highlighted warrants further investigation, and we are committed to delving deeper into this area in our future research.
  

URL

biorxiv.org/content/10.1101/2021.03.04.433999v2
www.biorxiv.org www.biorxiv.org

New submission 23/08/2023, 09:49:31

1
1. Public_Reviews 23 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  The manuscript focused on the roles of a key fatty-acid synthesis enzyme, acetyl-coA-carboxylase 1 (ACC1), in the metabolism, gene regulation and homeostasis of invariant natural killer T (iNKT) cells and the impact on these T cells' roles during asthma pathogenesis. The authors presented data showing that ACC1 regulates the expression of PPARg and, in turn, the function of iNKT cells, including the secretion of Th2-type cytokines, to impact asthma pathogenesis. The results are clear-cut and the data were logically presented.
  
  Major concerns:
  
  1) This study relied heavily on the CD4-CreACC1fl/fl mice. While the use of α-GalCer stimulation and Jα18 KO mice mitigated the concern, it is still a major concern that at least some of the phenotypes were due to the effect on conventional CD4 T cells. For example, the deletion of the ACC1 gene also seems to have decreased the numbers of conventional CD4 T cells (Fig. 2D, Fig. S1D). Previous reports have shown that the ACC1 gene in conventional CD4 T cells also plays a role in lung inflammation (Nakajima et al., J. Exp. Med. 218, 2021). If the authors believe the phenotype observed was mainly due to iNKT cells, rather than conventional CD4 T cells, a compare/contrast of the two studies should be discussed to explain or reconcile the results.
  
  As the reviewer pointed out, although we have experimentally demonstrated the critical role of ACC1 in iNKT cells in the regulation of allergic asthma, use of Cd4-CreAcc1fl/fl mice inevitably brings the role of conventional CD4+ T-cells in question.
  
  The study conducted by Nakajima et al., which reported that the absence of ACC1 in CD4+ T cells resulted in reduced numbers and functional impairment of memory CD4+ T cells, leading to less airway inflammation, further suggests the possibility that conventional CD4+ T cells are involved in the regulation of allergic asthma. A direct comparison of the two studies is difficult, since Nakajima et al. focused on the role of ACC1 in memory CD4+ T cells while we focused on iNKT cells.
  
  However, based on our experimental results, we believe that iNKT cells contribute more to the regulation of allergic asthma, for the following reasons: (i) while the number of iNKT cells was significantly reduced in Cd4-CreAcc1fl/fl mice, the number of conventional CD4+ T cells was only slightly reduced; (ii) Cd4-CreAcc1fl/fl mice showed dramatically decreased AHR in the α-GalCer-induced, iNKT cell-dependent allergic asthma model; and (iii) Jα18 KO mice, which lack iNKT cells, almost completely restored their AHR when adoptively transferred with WT iNKT cells but not with ACC1-deficient iNKT cells. These results indicate that ACC1-mediated regulation of AHR depends significantly on iNKT cells, which might contribute to AHR in the study conducted by Nakajima et al. as well. From these results, we believe that while ACC1 is a critical regulator of both conventional CD4+ T cells and iNKT cells in the regulation of allergic asthma, iNKT cells may contribute more than conventional CD4+ T cells. We have summarized these points in LINES 421-441, with the reference you mentioned:
  
  "It should be noted that Cd4-CreAcc1fl/fl mice lack ACC1 expression in both conventional CD4+ T cells and iNKT cells. While the use of iNKT cell- specific Cre system would demonstrate critical role of ACC1 in iNKT cells regarding allergic asthma, there is no iNKT cell-specific Cre system available yet. In addition, the study conducted by Nakajima et al, which reported that the absence of ACC1 in CD4+ T cells resulted in reduced numbers and functional impairment of memory CD4+ T cells, leading to less airway inflammation further suggests possibility of involvement of conventional CD4+ T cells in regulation of allergic asthma. However, based on our experimental results, we believe that iNKT cells more contribute to the regulation of allergic asthma for the following reasons - (i) while the number of iNKT cells were significantly reduced in Cd4-CreAcc1fl/fl mice, the number of conventional CD4+ T cells were only slightly reduced, (ii) Cd4-CreAcc1fl/fl mice were dramatically decreased in their AHR in α-GalCer induced allergic asthma model, and (iii) Jα18 KO mice that lack iNKT cells almost completely restore their AHR when adoptively transferred with WT iNKT cells but not ACC1-deficient iNKT cells. These results indicate that ACC1-mediated regulation of AHR is significantly dependent on iNKT cells, which might contribute to AHR in the study conducted by Nakajima et al. as well. From these, we believe that while ACC1 is a critical regulator of both conventional CD4+ T cells and iNKT cells in regulation of allergic asthma, iNKT cells may contribute more to regulation of allergic asthma compared to CD4+ T cells."
  
  2) The overall significance of the manuscript is related to the potential clinical suppression of ACC1 in human asthma patients. However, the authors only showed the elevated ACC1 genes in these patients, not even in vitro data demonstrating that suppression of ACC1 genes in the iNKT cells from patients could have potential therapeutic effect or suppression of the relevant cytokines.
  
  We appreciate the reviewer’s critical comment. Due to the paucity of iNKT cells in human PBMCs, it is extremely difficult to experimentally manipulate the expression level of ACC1 in human iNKT cells. As an alternative, to address the reviewer’s comment, we compared the cytokine expression of ACC1high iNKT cells from human allergic asthma patients with that of ACC1low iNKT cells from healthy individuals or non-allergic asthma patients. Our results show that iNKT cells from allergic asthma patients express higher levels of IL4 and IL13 than those from healthy individuals or non-allergic asthma patients, suggesting that the level of ACC1 is most likely involved in the functionality of human iNKT cells as well. The results are newly shown in supplementary Fig. 5C, with explanation in LINES 376-378 and 382-384:
  
  LINES 376-378: Lastly, the expression levels of IL4 and IL13 were significantly higher in iNKT cells from the allergic asthma patients compared to those from healthy controls and nonallergic asthma patients (Fig. S5C).
  
  LINES 382-384: Thus, iNKT cells from allergic asthma patients express higher ACC1, FASN and PPARG levels and lower levels of a glycolysis which is accompanied with higher levels of IL4 and IL13 than iNKT cells from healthy controls and nonallergic asthma patients.
  
  3) The authors report that a-GalCer administration can induce the AHR, however, in the cited paper (Hachem et al., Eur J. Immunol. 35, 2793, 2005), iNKT cell activation seems to have the opposite effect to inhibit AHR. Did the authors mean to cite different papers?
  
  We apologize for the confusion. We have replaced the inaccurate reference with the reference below in LINES 863-865:
  
  Glycolipid activation of invariant T cell receptor+ iNKT cells is sufficient to induce airway hyperreactivity independent of conventional CD4+ T cells. Proc Natl Acad Sci USA 103, pp. 2782-2787 (2006).
  
  Reviewer #2 (Public Review):
  
  In this study the authors sought to investigate how the metabolic state of iNKT cells impacts their potential pathological role in allergic asthma. The authors used two mouse models, OVA and HDM-induced asthma, and assessed genes in glycolysis, TCA, β-oxidation and FAS. They found that acetyl-coA-carboxylase 1 (ACC1) was highly expressed by lung iNKT cells and that ACC1 deficient mice failed to develop OVA-induced and HDM-induced asthma. Importantly, when they performed bone marrow chimera studies, when mice that lacked iNKT cells were given ACC1 deficient iNKT cells, the mice did not develop asthma, in contrast to mice given wildtype NKT cells. In addition, these observed effects were specific to NKT cells, not classic CD4 T cells. Mechanistically, iNKT cells that lack ACC1 had decreased expression of fatty acid-binding proteins (FABPs) and peroxisome proliferator-activated receptor (PPAR)γ, but increased glycolytic capacity and increased cell death. Moreover, the authors were able to reverse the phenotype with the addition of a PPARg agonist. When the authors examined iNKT cells in patient samples, they observed higher levels of ACC1 and PPARG, compared to healthy donors and non-allergic-asthma patients.
  
  We are very grateful for your kind appreciation of our work.
  
  Reviewer #1 (Recommendations For The Authors):
  
  1) Related to major concern I, an iNKT cell-specific knockout of ACC1 in iNKT cells is highly desirable and should be used to directly address the question.
  
  As the reviewer suggested, iNKT cell-specific deletion of ACC1 would provide invaluable information for our study. Unfortunately, a Cre-loxP system that specifically targets iNKT cells has not been developed. Thus, we opted to use the CD4-Cre system, which is the gold standard Cre system for the study of iNKT cells. In addition, to highlight the role of ACC1 in iNKT cells in relation to the regulation of allergic asthma, we used iNKT cell-dependent experimental models and conducted adoptive transfer of iNKT cells into iNKT cell-deficient mice (Jα18 KO). These points are discussed in the Discussion section in LINES 421-441:
  
  "It should be noted that Cd4-CreAcc1fl/fl mice lack ACC1 expression in both conventional CD4+ T cells and iNKT cells. While the use of iNKT cell- specific Cre system would demonstrate critical role of ACC1 in iNKT cells regarding allergic asthma, there is no iNKT cell-specific Cre system available yet. In addition, the study conducted by Nakajima et al, which reported that the absence of ACC1 in CD4+ T cells resulted in reduced numbers and functional impairment of memory CD4+ T cells, leading to less airway inflammation further suggests possibility of involvement of conventional CD4+ T cells in regulation of allergic asthma. However, based on our experimental results, we believe that iNKT cells more contribute to the regulation of allergic asthma for the following reasons - (i) while the number of iNKT cells were significantly reduced in Cd4-CreAcc1fl/fl mice, the number of conventional CD4+ T cells were only slightly reduced, (ii) Cd4-CreAcc1fl/fl mice were dramatically decreased in their AHR in α-GalCer induced allergic asthma model, and (iii) Jα18 KO mice that lack iNKT cells almost completely restore their AHR when adoptively transferred with WT iNKT cells but not ACC1-deficient iNKT cells. These results indicate that ACC1-mediated regulation of AHR is significantly dependent on iNKT cells, which might contribute to AHR in the study conducted by Nakajima et al. as well. From these, we believe that while ACC1 is a critical regulator of both conventional CD4+ T cells and iNKT cells in regulation of allergic asthma, iNKT cells may contribute more to regulation of allergic asthma compared to CD4+ T cells."
  
  2) For Fig. 5A, RT-PCR verification of PPARg gene expression level change is needed.
  
  As suggested, we have verified the level of Pparg expression of ACC1-deficient iNKT cells through real time PCR and have added the results to Figure 5A.
  
  3) Verifying at least the cytokine secretion can be regulated by manipulating ACC1 expression in human asthma patient samples will make the paper much stronger.
  
  We appreciate the reviewer’s critical comment. Due to the paucity of iNKT cells in human PBMCs, it is extremely difficult to experimentally manipulate the expression level of ACC1 in human iNKT cells. As an alternative, to address the reviewer’s comment, we compared the cytokine expression of ACC1high iNKT cells from human allergic asthma patients with that of ACC1low iNKT cells from healthy individuals or non-allergic asthma patients. Our results show that iNKT cells from allergic asthma patients express higher levels of IL4 and IL13 than those from healthy individuals or non-allergic asthma patients, suggesting that the level of ACC1 is most likely involved in the functionality of human iNKT cells as well. The results are newly shown in supplementary Fig. 5C, with explanation in LINES 376-378 and 382-384:
  
  LINES 376-378: Lastly, the expression levels of IL4 and IL13 were significantly higher in iNKT cells from the allergic asthma patients compared to those from healthy controls and nonallergic asthma patients (Fig. S5C).
  
  Minor points:
  
  1) What are the cells being stained in Fig. S2C? Are they iNKT cells? If yes, why there is a tetramer-negative population?
  
  The density plot on the left panel of Fig. S2C represents magnetically enriched thymic iNKT cells. Due to their scarcity, thymic iNKT cells were enriched using CD1d tetramer via magnetic activated cell sorting (MACS)-based enrichment technique. After enrichment, we re-stained enriched cells with CD1d tetramers and gated out CD3 and CD1d tetramer double positive cells via flow cytometry to specifically identify iNKT cells. Due to the imperfect purity of magnetic cell separation technique, a small proportion of CD1d tetramer-negative population is seen in the left panel of Fig. S2C.
  
  A brief mention of this methodology has been added to the “Preparation and activation of murine T and iNKT cells” section under Materials and Methods in LINES 560-566:
  
  "Alternatively, thymic and liver mononuclear cells were labeled with APC-conjugated ɑ-GalCer/CD1d tetramers, bound to anti-APC magnetic beads, and enriched on a MACS separator (Miltenyi Biotec, Auburn, CA, USA; purity 89%). To analyze the development of thymic iNKTs cells, we re-stained enriched cells with CD1d tetramer and gated out CD3 and CD1d tetramer double positive cells via flow cytometry to identify thymic iNKT cells, which were used for further analysis."
  
  2) Where are the adoptive transferred iNKT cells purified/sorted from? Are they from lungs of Acc1fl/fl or CD4-cre/Acc1fl/fl mice, asthma-induced already? As there are very few iNKT cells in healthy and untreated mice. There is little described or explained in Methods and Materials.
  
  The adoptively transferred iNKT cells were purified and pooled from the lungs of at least 10 mice per group. Briefly, mouse lungs were finely chopped into small pieces using razor blades and enzymatically digested using type IV collagenase. iNKT cells from the lungs were sorted via FACS using CD1d tetramers. Approximately 6.0 × 10^5 iNKT cells were obtained from the lungs of at least 10 mice. A brief mention of this methodology was added to the “Adoptive transfer of iNKT cells in allergic asthma models” section in Materials and Methods in LINES 568-574: "iNKT cells were obtained from the lungs of at least 10 Acc1fl/fl or Cd4-CreAcc1fl/fl mice. Mouse lungs were finely chopped into small pieces using razor blades and were enzymatically digested using type IV collagenase. iNKT cells from the lungs were sorted via FACS using CD1d tetramers. Approximately 6.0 × 10^5 iNKT cells were obtained from at least 10 mice and were adoptively transferred into individual recipient mice via the intratracheal route."
  
  3) The use of 2-NBDG was not explained in multiple locations, particularly in Fig.5H. How is its fluorescence used to track iNKT cells? No description in Materials and methods.
  
  2-NBDG, a fluorescence-tagged glucose analog, is an indicator used to measure glucose uptake in cells. The fluorescence intensity of 2-NBDG-treated cells represents the degree of glucose uptake, which can be measured using flow cytometry. Thus, in the experiments where we treated cells with 2-NBDG, we described the results as "glucose uptake". A brief explanation of this methodology was added to the main text in LINES 253-254. In addition, we have described the use of 2-NBDG in detail in ‘Measurement of glucose uptake capacity’ under Materials and Methods in LINES 599-607: Measurement of glucose uptake capacity using 2-NBDG assay. After treatment with 2-NBDG, the fluorescence intensity of the cells was measured using flow cytometry and represented the degree of glucose uptake in cells.
  
  4) Fig. 3A legends: it should be "Ja18 KO"?
  
  We appreciate your pointing out this mistake. We have corrected it in the legend of Figure 3A.
  
  5) There are two different mechanisms for explaining the less severe asthma/AHR phenotype in ACC1-KO iNKT cells. One is lower number of iNKT cells due to cell death, the other decreased cytokine secretions. It is not clear to the reviewer, what are the relationship between two mechanisms. Are they both contributing to the asthma phenotype or cooperative?
  
  As you mentioned, ACC1-deficient iNKT cells showed increase in intrinsic pathway of apoptosis as well as decrease in their cytokine secretion simultaneously. Thus, we believe that increase in cell death and decrease in cytokine expression of ACC1-deficient iNKT cells cooperatively contributed to the asthma phenotype. The above-mentioned point was discussed in LINES 453-458: Furthermore, the apoptotic tendency of the ACC1-deficient iNKT cells was accompanied by their functional impairment. The ACC1-deficient iNKT cells exhibited impaired viability and functionality. Treatment of glycolysis inhibitor in ACC1-deficient iNKT cells not only restored cellular survival but also their functionalities. From these results, we speculate that ACC1-mediated regulation of both cellular homeostasis and cytokine production cooperatively contributed to the asthma phenotype.
  
  Reviewer #2 (Recommendations For The Authors):
  
  Overall, this is a very strong study with few concerns.
  
  1) Are there tissue specific differences in the iNKT cell populations? The authors examined lung iNKT cells in the Figs 1-3, and used liver NKT cells for the mechanistic studies in Fig 4-5. The studies shown in Fig S2 suggest that ACC1 deficient iNKT cells have developmental defects and impaired homeostatic proliferative capacity. Does ACC1 impact lung and liver iNKT cells similarly and is the lack of allergic asthma in ACC1 deficient iNKT cells due to defective iNKT cell trafficking to the lungs or a failure to survive after transfer (Fig 3)?
  
  In the absence of ACC1, the number of iNKT cells in both lungs and livers decreased, and the cells showed consistent features (e.g., metabolic parameters), suggesting that ACC1 has no tissue-specific role in iNKT cells.
  
  In the adoptive transfer experiments, we transferred equal numbers of WT and ACC1-deficient iNKT cells directly into mouse lungs via the intratracheal route. Thus, the decreased numbers of adoptively transferred ACC1-deficient iNKT cells are more likely due to their intrinsically impaired homeostatic proliferative capacity than to defective trafficking to the lungs.
  
  2) Similarly, are chemokine receptor expression patterns similar between WT and ACC1 deficient iNKTs (Fig 4)?
  
  We compared chemokine receptor expression of WT and ACC1-deficient iNKT cells using our RNA-seq and verified their expression levels via real time q-PCR. The expression levels of these chemokine receptors were comparable between the two groups of iNKT cells. The results are newly shown in supplementary Fig. 4I with explanation in LINES 351-357:
  
  Meanwhile, chemokine receptor signaling is also implicated in regulating homeostasis of iNKT cell in the periphery. In particular, Meyer et al. suggested that iNKT cells require CCR4 to localize to the airways and to induce AHR. Thus, we examined the expression of several chemokine receptors, including CCR4. We found that WT and ACC1-deficient iNKT cells did not differ in their chemokine receptor expressions, suggesting that the chemokine signaling may not be critical for ACC1-mediated regulation in AHR.
  
  3) The authors' data suggest that Tregs are not playing a major role in the regulation of asthma induction in their ACC1 deficient mice, based on FoxP3 expression. Did the authors perform suppressor assays to show that the Tregs function similarly in WT and ACC1 deficient mice?
  
  We appreciate the reviewer’s reasonable comment. However, we did not experimentally compare the suppressive capacity of WT and ACC1-deficient Tregs under asthmatic conditions, because the differences in their Foxp3 expression were minimal (Foxp3 expression is a critical determinant of the suppressive function of Tregs; Immunity. 2019 Feb 19;50(2):302-316; Nat Immunol 2003; 4: 330–336; Cell Mol Immunol. 2015 Sep;12(5):558-65). Thus, we speculate that the suppressive capacity of WT and ACC1-deficient Tregs might be similar. Nevertheless, since the suppressive capacity of Tregs can also be regulated by other soluble factors and surface molecules, we cannot completely rule out the possibility that ACC1-deficient Tregs might differ from WT Tregs in their suppressive capacity in asthma. In short, while there are clear limitations to our interpretation here, we believe it is unlikely that Tregs from WT and ACC1 deficient mice differ in their suppressive capacity during asthma. We have included the above-mentioned points in the Discussion section in LINES 415-419: In this regard, Tregs may also play a major role in asthma. However, the expression level of Foxp3 was comparable between WT and ACC1-deficient Tregs. The level of Foxp3 to some extent, serves as a critical determinant of suppressive function of Tregs. Thus, we speculate that they might not critically contribute to the development of asthma, although we cannot completely rule out the contribution of Tregs to our studies.
  

URL

biorxiv.org/content/10.1101/2023.02.15.528598v3
www.biorxiv.org www.biorxiv.org

New submission 23/08/2023, 09:46:20

1
1. Public_Reviews 23 Aug 2023
  
  in eLife
  
  Author Response
  
  We would like to thank the reviewers for their positive and constructive comments on the manuscript.
  
  We are planning the following revisions to both DGRPool and the corresponding manuscript to address the reviewers’ comments:
  
  1) We agree with reviewer #1 that normalizing the data could potentially improve the GWAS results. Thus, we plan to explore the implementation of this option and assess its impact on the overall results. We will also investigate replacing the ANOVA test with a Kruskal-Wallis test. Instead of upfront data normalization, we will consider using the PLINK --pheno-quantile-normalize option. Both options will be compared on a set of phenotypes where we can analyze the output (i.e., for phenotypes where we expect to find specific variants), to determine whether these strategies enhance the detection power.
  
  2) We also agree with both reviewers that gene expression information is of interest. However, we recognize that incorporating such information would entail substantial work (as elaborated in our response to comments below). We feel that this extensive work is beyond the current scope of this paper, which primarily focuses on phenotypes and genotype-phenotype associations. Nonetheless, we are committed to enhancing user experience by including more gene-level outlinks to Flybase. Additionally, we will link variants and gene results to Flybase's online genome browser, JBrowse. By following the reviewers' suggestions, we aim to guide DGRPool users to potentially informative genes.
  
  3) In agreement with reviewer #2, we acknowledge that additional tools could enhance DGRPool's functionality and facilitate meta-analyses for users. Therefore, we are in the process of developing a gene-centric tool that will allow users to query the database based on gene names. Moreover, we intend to integrate ortholog databases into the GWAS results. This feature will enable users to extend Drosophila gene associations to other species if necessary.
  
  4) Finally, we also concur with both reviewers about making minor edits to the manuscript to address their feedback.
  
  Reviewer #1 (Public Review):
  
  This is a technically sound paper focused on a useful resource around the DGRP phenotypes, which the authors have curated and pooled and for which they have provided a user-friendly website. It is intended to become a crowd-sourced resource in the future.
  
  The authors should make sure they coordinate as well as possible with the NC datasets and community and broader fly community. It looks reasonable to me but I am not from that community.
  
  We thank the reviewer for the positive comments. We are relatively well-connected to the D. melanogaster community and aim to leverage this connection to render the resource as valuable as possible. DGRPool in fact already reflects the input of many potential users and was also inspired by key tools on the DGRP2 website. This also explains why we often bridge our results with other resources, for example by linking out to Flybase, which is the main resource for the Drosophila community at large.
  
  I have only one major concern, which in a more traditional review setting I would flag to the editor and insist the authors address on resubmission. I also have some scene-setting and coordination suggestions and some minor textual / analysis considerations.
  
  The major concern is that the authors do not comment on the distribution of the phenotypes; it is assumed it is a continuous metric and well-behaved - broad gaussian. This is likely to be more true of means and medians per line than individual measurements, but not guaranteed, and there could easily be categorical data in the future. The application of ANOVA tests (of the "covariates") is for example fragile for this.
  
  The simplest recommendation is in the interface to ensure there is an inverse normalisation (rank and then project on a gaussian) function, and also to comment on this for the existing phenotypes in the analysis (presumably the authors are happy). An alternative is to offer a kruskal test (almost the same thing) on covariates, but note PLINK will also work most robustly on a normalised dataset.
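  The rank-based inverse normal transformation recommended above ("rank and then project on a gaussian") can be sketched in a few lines. This is an illustrative, standard-library-only Python sketch, not DGRPool's or PLINK's implementation; the function names and the (rank - 0.5) / n offset are assumptions made for the example (other offsets are also in common use).

  ```python
  from statistics import NormalDist


  def average_ranks(values):
      """Return 1-based ranks; tied values share the mean of their ranks."""
      order = sorted(range(len(values)), key=lambda i: values[i])
      ranks = [0.0] * len(values)
      start = 0
      while start < len(order):
          end = start
          # Extend the run while the next sorted value is tied with this one.
          while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
              end += 1
          mean_rank = (start + end) / 2 + 1
          for k in range(start, end + 1):
              ranks[order[k]] = mean_rank
          start = end + 1
      return ranks


  def inverse_normal_transform(values):
      """Map values to standard-normal quantiles of their ranks.

      Assumption: ranks are converted to probabilities with (rank - 0.5) / n.
      """
      n = len(values)
      std_normal = NormalDist()  # mean 0, standard deviation 1
      return [std_normal.inv_cdf((r - 0.5) / n) for r in average_ranks(values)]


  # Hypothetical per-line phenotype means with one extreme value.
  line_means = [3.1, 100.0, 0.2, 5.0, 4.0]
  transformed = inverse_normal_transform(line_means)
  ```

  The transformed values keep the ordering of the original line means, but the extreme value no longer dominates the scale, which is the property that makes downstream linear models less fragile.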
  
  We thank the reviewer for raising this interesting point. Indeed, we did not comment on the distribution of individual phenotypes due to the underlying variability from one phenotype to another, as suggested by the reviewer. Some distributions appear normal, while others are clearly not normally distributed. This information is 'visible' to users by clicking on any phenotype; DGRPool automatically displays its global distribution if the values are continuous/quantitative. We acknowledge the reviewer's concerns regarding the use of ANOVA tests. However, we consider it acceptable to perform linear regression (including ANOVA tests) on non-normally distributed data, as only the prediction errors need to follow a normal distribution.
  
  Furthermore, the ANOVA test is solely conducted to assess whether any of the potential covariates (such as well-established inversions and symbiont infection status) are associated with the phenotype of interest. PLINK2 automatically corrects for the effects of these covariates during GWAS by considering them as part of the regression model.
  
  Nevertheless, we concur with the reviewer that normalizing the data could potentially enhance GWAS results. Consequently, we commit to exploring the impact of data normalization on the overall outcomes. Additionally, we will consider replacing the ANOVA test with a Kruskal-Wallis test, and using the PLINK --pheno-quantile-normalize option. We intend to compare both approaches using a set of phenotypes where we can compare the output (i.e., where specific variants are expected to be identified). This comparison will help us determine if either method enhances the detection power.
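  For reference, the Kruskal test mentioned above (the Kruskal-Wallis H test) works on pooled ranks rather than raw values, which is why it does not require normally distributed phenotypes. The following standard-library Python sketch computes the H statistic with the usual tie correction. It is not the DGRPool code; the function name and the example grouping of line means by a covariate are hypothetical.

  ```python
  from itertools import chain


  def kruskal_wallis_h(groups):
      """Kruskal-Wallis H statistic (tie-corrected) for a list of groups."""
      pooled = list(chain.from_iterable(groups))
      n_total = len(pooled)
      order = sorted(range(n_total), key=lambda i: pooled[i])
      ranks = [0.0] * n_total
      tie_term = 0
      start = 0
      while start < n_total:
          end = start
          while end + 1 < n_total and pooled[order[end + 1]] == pooled[order[start]]:
              end += 1
          mean_rank = (start + end) / 2 + 1  # average rank for a run of ties
          for k in range(start, end + 1):
              ranks[order[k]] = mean_rank
          run_length = end - start + 1
          tie_term += run_length ** 3 - run_length
          start = end + 1

      # Sum of (rank sum squared / group size) over groups.
      weighted = 0.0
      offset = 0
      for group in groups:
          rank_sum = sum(ranks[offset:offset + len(group)])
          weighted += rank_sum ** 2 / len(group)
          offset += len(group)

      h = 12.0 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)
      correction = 1 - tie_term / (n_total ** 3 - n_total)
      if correction == 0:
          raise ValueError("all values are identical; H is undefined")
      return h / correction


  # Hypothetical phenotype means for lines without / with a given inversion.
  h_statistic = kruskal_wallis_h([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
  ```

  Under the null hypothesis, H is approximately chi-square distributed with (number of groups - 1) degrees of freedom; the p-value computation is left out of this sketch.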
  
  Minor points:
  
  On the introduction, I think the authors would find the extensive set of human GWAS/PheWAS resources useful; widely used examples include the GWAS Catalog, Open Targets PheWAS, MR-base, and the FinnGen portal. The GWAS Catalog also has summary statistics submission guidelines, and I think where possible meta-data harmonisation should be similar (not a big thing). Of course, DGRP has a very different structure (line and individuals) and of course, raw data can be freely shown, so this is not a one-to-one mapping.
  
  Thank you for the suggestion. We will cite these resources in the Introduction and check the GWAS catalog submission guidelines to compare to the ones we are proposing in this paper.
  
  For some authors coming from a human genetics background, they will be interpreting correlations of phenotypes more in the genetic variant space (eg LD score regression), rather than a more straightforward correlation between DGRP lines of different individuals. I would encourage explaining this difference somewhere.
  
  We appreciate this potential issue and we will make this distinction clearer in the manuscript to avoid any confusion.
  
  This leads to an interesting point that the inbred nature of the DGRP allows for both traditional genetic approaches and leveraging the inbred replication; there is something about looking at phenotype correlations through both these lenses, but this is for another paper, one that I suspect this harmonised pool of data can help with.
  
  We agree with the reviewer and hope that more meta-analyses will be made possible by leveraging the harmonized data that are made available through DGRPool.
  
  I was surprised the authors did not crunch the number of transcript/gene expression phenotypes and have them in. Is this because this was better done in other datasets? Or too big and annoying on normalisation? I'd explain the rationale to leave these out.
  
  This is a very good point raised by the reviewer, and this is in fact something that we initially wanted to do. However, to render the analysis fair and robust, it would require processing all datasets in the same way. This implies cataloging all existing datasets and processing them through the same pipeline. Then, it also requires adding a “cell type” or “tissue” layer, because gene expression data from whole flies is obviously not directly comparable to gene expression data from specific tissues or even specific conditions. This would be key information as phenotypes are often tissue-dependent. So, as implied by the reviewer, we deemed this too big of a challenge beyond the scope of the current paper. Nevertheless, we plan to continue investigating this avenue, especially given the strong transcriptomics background of our lab, in a potential follow-up paper.
  
  I think 25% FDR is dangerously close to "random chance of being wrong". I'd just redo this section at a higher FDR, even if it makes the results less 'exciting'. This is not the point of the paper anyway.
  
  We agree with the reviewer that this threshold implies a higher risk of false positive results. However, this is not an uncommonly used threshold (Li et al., PLoS Biology, 2008; Bevers et al., Nature Metabolism, 2019; Hwangbo et al., eLife, 2023), and one that seems robust enough in our analysis since similar phenotypes are significant in different studies. Nevertheless, we will revisit these results and explore how a more stringent threshold may impact the results.
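  To make the effect of the threshold concrete, the sketch below applies a Benjamini-Hochberg step-up procedure at two FDR levels. This is an assumption for illustration only: the text above does not state which FDR procedure was used, and the function name and p-values are hypothetical.

  ```python
  def benjamini_hochberg(p_values, fdr):
      """Return a list of booleans marking hypotheses rejected at the given FDR."""
      m = len(p_values)
      order = sorted(range(m), key=lambda i: p_values[i])
      # Find the largest rank k whose p-value satisfies p_(k) <= (k / m) * fdr.
      cutoff_rank = 0
      for rank, idx in enumerate(order, start=1):
          if p_values[idx] <= rank / m * fdr:
              cutoff_rank = rank
      rejected = [False] * m
      for idx in order[:cutoff_rank]:
          rejected[idx] = True
      return rejected


  # Hypothetical p-values for five phenotype associations.
  p_values = [0.001, 0.04, 0.03, 0.21, 0.6]
  strict = benjamini_hochberg(p_values, fdr=0.05)   # [True, False, False, False, False]
  lenient = benjamini_hochberg(p_values, fdr=0.25)  # [True, True, True, False, False]
  ```

  With these made-up values, moving from a 5% to a 25% FDR admits two additional associations, which illustrates the trade-off between detection power and expected false positives discussed above.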
  
  I didn't buy the extreme line piece as being informative. Something has to be on the top and bottom of the ranks; the phenotypes are an opportunity for collection and probably have known (as you show) and cryptic correlations. I think you don't need this section at all for the paper and worry it gives an idea of "super normals" or "true wild types" which ... I just don't think is helpful.
  
  This section of the paper was intended to investigate anecdotal evidence suggesting that certain DGRP lines consistently rank at the top or bottom when examining fitness-related traits. If accurate, this observation could imply that inbreeding might have made these lines generally weaker, potentially introducing bias into studies aimed at uncovering the genetic basis of complex traits. However, as per the analyses presented, we did not discover support for this phenomenon. Nevertheless, we consider this message important to convey. In response to the reviewer's feedback, we intend to provide a clearer explanation of the reasoning behind this section of the paper and its main conclusion.
  
  I'd say "well-established inversion genotypes and symbiont levels" rather than generic covariates. Covariates could mean anything. You have specific "covariates" which might actually be the causal thing.
  
  Thank you. We will update the manuscript accordingly.
  
  I wouldn't use the adjective tedious about curation. It's a bit of a value judgement and probably places the role of curation in the wrong way. Time-consuming due to lack of standards and best practice?
  
  Thank you. We will update the manuscript accordingly.
  
  Reviewer #2 (Public Review):
  
  Summary:
  
  In the present study, Gardeux et al. provide a web-based tool for curated association mapping results from DGRP studies. The tool lets users view association results for phenotypes and compare mean phenotype ~ phenotype correlations between studies. In the manuscript, the authors provide several example utilities associated with this new resource, including pan-study summary statistics for sex, traits, and loci. They highlight cross-trait correlations by comparing studies focused on longevity with phenotypes such as oxphos and activity.
  
  Strengths:
  
  -Considerable efforts were dedicated toward curating the many DGRP studies provided.
  
  -Available tools to query large DGRP studies are sparse, so new tools are appealing.
  
  Weaknesses:
  
  The creation of a tool to query these studies for a more detailed understanding of physiologic outcomes seems underdeveloped. This could be improved by enabling uses such as more comprehensive queries of meta-analyses, molecular information to investigate given genes or pathways, and links to other information such as mouse, rat, or human associations.
  
  We appreciate the reviewer's kind comments.
  
  Regarding the tools, we concur with the reviewer that incorporating additional tools could enhance DGRPool and facilitate users in conducting meta-analyses. Therefore, we intend to introduce a gene-centric tool that enables users to query the database based on gene names. Additionally, we will establish links to ortholog databases within the GWAS results, thereby allowing users to extend fly gene associations to other species, if required.
  
  Furthermore, we have plans to link out to a 'genome browser-like' view (Flybase’s JBrowse tool) of the GWAS results centered around the affected variants/genes. We are considering integrating this feature into the new gene-centric tool as well.
  
  Another potential downstream analysis we are considering is gene-set enrichment. This analysis would involve assessing the enrichment of genes in Gene Ontology or other pathway databases directly from the GWAS results page.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.01.543194v1
www.biorxiv.org www.biorxiv.org

New submission 23/08/2023, 09:40:47

1
1. Public_Reviews 23 Aug 2023
  
  in eLife
  
  Author Response
  
  We would like to thank the reviewers and editors for their thoughtful and constructive review of our manuscript. Below we have provided responses to specific points in the reviewers’ comments and eLife assessment, highlighting areas of the manuscript that will be edited for clarity and where efforts will be made to provide data to address reviewer concerns upon a future resubmission.
  
  eLife assessment:
  
  The authors report that Dbp5 functions in parallel with Los1 in tRNA export, in a manner dependent on Gle1 and requiring the ATPase cycle of Dbp5, but independent of Mex67, Dbp5's partner in mRNA export. The evidence for this conclusion is still incomplete, as is the biochemical evidence that Dbp5 interacts directly with tRNA in vitro with Gle1 and co-factor InsP6 triggering Dbp5 ATPase activity in the Dbp5-tRNA complex. The evidence that Dbp5 interacts with tRNA in cells independently of Los1, Msn5 and Mex67 is, however, solid.
  
  We intend to edit the text to make clear our conclusions and accommodate clarifications on a few details of this assessment.
  
  (1) We would clarify that our data support a model in which Dbp5 recruitment to tRNA is independent of Mex67 as an adapter in cells; however, this does not mean that Mex67 and Dbp5 do not still co-function in tRNA export. For example, it is possible Dbp5 and Mex67 could still co-function in the same pathway, but instead of Dbp5 working downstream of Mex67, Dbp5 may in fact work upstream as an adapter for Mex67. Edits to the text will be made to ensure this distinction is clear and highlight the possibility for future investigation to elucidate this relationship.
  
  (2) We would like to highlight that based on structural and biochemical data detailing synergistic activation of the Dbp5 ATPase cycle by Gle1/InsP6 and single stranded RNA, it is difficult to imagine a scenario where the apparent synergistic activation of the Dbp5 ATPase cycle by tRNA and Gle1/InsP6 (Figure 5) is achieved independent of direct RNA binding. For this reason, we still support the claim that the observed synergistic activation, in combination with other in-vivo and in-vitro data provided in the manuscript, supports a model where Dbp5 directly binds tRNA. However, we intend to edit the text to highlight this nuance and potential alternative conclusions based on reviewer feedback.
  
  Reviewer #1 (Public Review):
  
  “At least one result suggests that the idea of these pathways in parallel may be too simplistic as deletion of the LOS1 gene, which is not essential decreases the interaction of tRNA export substrate with Dbp5 (Figure 2A). If the two pathways were working in parallel, one might have expected removing one pathway to lead to an increase in the use of the other pathway and hence the interaction with a receptor in that pathway…. The obvious missing experiment here with respect to genetics is the test of whether deletion of the MSN5 gene in the cells, which combines deletion of LOS1 and the dbp5_R423A allele, shown in Figure 1D would be lethal…. The authors provide evidence of a model where the helicase Dbp5 plays a role in tRNA export from the nucleus. Further evidence is required to determine whether Dbp5 could function in the same pathway as the previously defined tRNA export receptors, Los1 and Msn5. There are genetic tests that could be performed to explore this question. Some of the biochemistry presented would show when Los1 is absent that the interaction of Dbp5 with tRNA decreases, which could support a model where Dbp5 plays a role in coordination with Los1”
  
  We agree that this is an important point that should be made clear and discussed in the text. We also agree that further experiments would be needed to confirm that Dbp5 functions broadly in tRNA export in parallel to both Msn5 and Los1. We will aim to address these points in resubmission and discuss possible alternative conclusions of the presented results.
  
  Reviewer #1 (Public Review):
  
  “While some of the binding assays show rather modest band shifts (Figure 4B for example), the data in Figure 4A showing that there is no binding detected unless a non-hydrolyzable ATP analogue is employed, argues for specificity in nucleic acid binding. The question that does arise is whether the binding is specific for tRNA.”
  
  The specificity of the in-vitro interactions of Dbp5 is an important point of discussion. We will work to expand the topic of specificity of the in-vitro experiments during resubmission.
  
  Reviewer #1 (Public Review):
  
  “With the exception of the binding studies, which also employ a mixture of yeast tRNAs, this study relies primarily on a single tRNA species to come to the conclusions drawn. Many other studies have used multiple tRNAs to explore whether pathways characterized are generalizable to other tRNAs.”
  
  It was previously shown that Dbp5 functions to support the export of multiple tRNA species (https://doi.org/10.7554/eLife.48410). As such, we agree that additional tRNAs should be tested to explore whether phenotypes reported here are also generalizable to other tRNAs. We will add data targeting additional tRNAs during resubmission.
  
  Reviewer #2 (Public Review):
  
  “there are some pieces of data that are misinterpreted. (Figure 1A and B look the same; in Fig 1E, the DAPI staining is abnormal; in Fig 4 the bands can't be seen.)”
  
  Figure 1A and B represent separate experiments, showing that deletion of Los1 does not alter Dbp5 localization and conversely loss of Dbp5 does not alter Los1 localization. As such, localization patterns under loss-of-function conditions look the same as wild-type localization for each protein respectively, as noted. We believe that we have come to the same conclusion as the reviewer on Figure 1A and B (and these data are not misinterpreted), but also understand this panel will need to be adjusted for clarity and readability. We will make efforts to edit this figure and accompanying text to make the data and conclusions clearer, including addressing the EMSAs in figure 4 and associated text for clarity.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.29.547072v4
pmc.ncbi.nlm.nih.gov pmc.ncbi.nlm.nih.gov

Hunger- and thirst-sensing neurons modulate a neuroendocrine network to coordinate sugar and water ingestion

1
1. Public_Reviews 23 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  We greatly appreciate the positive feedback of the reviewers and have modified the manuscript to address their comments, including changes to the text, figures, and methods. We believe that these revisions have strengthened and improved the manuscript. Reviewers’ comments in blue and detailed responses in black are below.
  
  Reviewer #1 Weaknesses:
  
  Is "function" of the ISNs to balance "nutrient need" or osmolarity? Balancing hemolymph osmolarity for physiological homeostasis is conceptually different from balancing thirst and hunger.
  
  We have added the following text to the introduction to address this: “Thus, the ISNs sense both AKH and hemolymph osmolality, arguing that they balance internal osmolality fluctuations and nutrient need (Jourjine, Mullaney et al., 2016).” (ln 80-82).
  
  The final schematic nicely sums up how the different peptidergic pathways might work together, but it is unclear which connections are empirically-validated or speculative. It would be informative to show which parts of the model are speculative versus validated. For example, does FAFB volume synapse = functional connectivity and not just anatomical proximity? A bulk of the current manuscript relies on "synapses of relatively high confidence" (according to Materials and methods: line 522). I recommend distinguishing empirically tested & predicted connections in the final schematic, and maybe reword/clarify throughout the manuscript as "predicted synaptic partners"
  
  We modified the schematic to clarify EM based connections versus functionally validated connections. We also clarified the EM predicted synaptic partners, using “predicted synaptic partners” throughout the manuscript.
  
  Reviewer #2 Areas for further development:
  
  • Does BIT inhibit all of the IPCs or some of them? I think it is critical to indicate the ROIs used for each neuron in the methods. Which part of the neuron is used for imaging experiments? Dendrites, cell bodies, or synaptic terminals?
  
  ROIs used for quantification are described in the figure legends: “ArcLight response of BiT soma…” (Fig 2, Fig S2), “Calcium responses of CCHa2R-RA neurites in SEZ…” (Fig 4), “Calcium response of CCHa2R-RA SEZ neurites…” (Fig S4), “Calcium response of CCAP neurites…” (Fig 5, Fig S5), “Calcium response of all IPC somas…” (Fig S3). We have added ROIs used for quantification to the ‘In vivo calcium imaging’ and the ‘In vivo voltage imaging’ methods sections (ln 493-494).
  
  • The discussion section does not give a big-picture explanation of how these neurons work together to regulate sugar and water ingestion. Silencing and activation experiments are good, but without showing the innate activity of these neural groups during ingestion, it is not clear what their functions are in terms of regulating fly behavior.
  
  We agree that how these peptidergic neurons coordinately regulate feeding is unclear. As peptide signals may act at a distance and may cause long-lasting neural activity state changes, studying their integration over space and time is challenging. Acute imaging during feeding would only in part address this challenge, as cumulative changes in nutrient need signals may impart circuit changes that are not apparent by monitoring the acute activity of peptidergic neurons. We modified a paragraph in the discussion to address this (ln 434-443).
  
  “Overall, our work sheds light on neural circuit mechanisms that translate internal nutrient abundance cues into the coordinated regulation of sugar and water ingestion. We show that the hunger and thirst signals detected by the ISNs influence a network of peptidergic neurons that act in concert to prioritize ingestion of specific nutrients based on internal needs. We hypothesize that multiple internal state signals are integrated in higher brain regions such that combinations of peptides and their actions signify specific needs to drive ingestion of appropriate nutrients. As peptide signals may act at a distance and may cause long-lasting neural activity state changes, studying their integration over space and time is a future challenge to further illuminate homeostatic feeding regulation.”
  
  Reviewer #1 (Recommendations For The Authors):
  
  For the final schematic figure, it may be informative to include nanchung and AKHR in the schematic.
  
  We now include this (Fig 6).
  
  For the ingestion duration with optogenetic activation, I don't think the right way to represent the data is by normalizing them to the no LED control. I think it should show raw ingestion time. I understand that the normalized data make the figure "cleaner" (no need to show +/- LED separately) but I think visualization of the raw data is important.
  
  We now include this in a new Supplemental Figure (Fig S6).
  
  Methods for ingestion with optogenetic activation should be detailed in the Methods section.
  
  We expanded upon this in the ‘Temporal consumption assay (TCA)’ methods section (ln 461-466).
  
  Reviewer #2 (Recommendations For The Authors):
  
  1) I think the authors are not following the recommendations of the Flywire community which recommends that people who contributed to the tracing of neurons are offered authorship in the published papers. I see the authors are thanking other lab members who have done tracing for the neurons described in this study, but I would like them to clarify whether they are following the guidelines provided by Flywire.
  
  We followed the Flywire guidelines and contacted all Flywire users contributing more than 10% to neuron edits for permission to publish with acknowledgements (see Flywire guidelines https://docs.google.com/document/d/1bUkOB5JnT3u__JDvAoVDHJ3zr5NXQtV_63yx2w6Tcc/edit).
  
  2) The method section for voltage imaging is missing.
  
  We now include a section on voltage imaging (ln 496-498).
  
  3) ROIs for imaging are not indicated in the methods or in the figures. It is hard to judge what is the origin of neural activity plotted in the figures; are they imaging cell bodies, dendrites, or axons?
  
  ROIs used for quantification are described in the figure legends: “ArcLight response of BiT soma…” (Fig 2, Fig S2), “Calcium responses of CCHa2R-RA neurites in SEZ…” (Fig 4), “Calcium response of CCHa2R-RA SEZ neurites…” (Fig S4), “Calcium response of CCAP neurites…” (Fig 5, Fig S5), “Calcium response of all IPC somas…” (Fig S3). We have added ROIs used for quantification to the ‘In vivo calcium imaging’ and the ‘In vivo voltage imaging’ methods sections (ln 493-494).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

pmc.ncbi.nlm.nih.gov/articles/PMC10104137/
www.biorxiv.org www.biorxiv.org

New submission 22/08/2023, 11:36:11

1
1. Public_Reviews 23 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 We would first like to thank the reviewers and the editor for their insightful comments and suggestions. We are particularly glad to read that our software package constitutes a set of “well-written analysis routines” which have “the potential to become very valuable and foundational tools for the analysis of neurophysiological data”. We have updated the manuscript to address their remarks where appropriate.
 
 Additionally, we would like to stress that this kind of tool is in continual development. As such, the manuscript offered a snapshot of the package at one point during this process, which in this case was several months ago at initial submission. Since then, several improvements were implemented. The manuscript has been further updated to reflect these more recent changes.
 
 From the Reviewing Editor:
 
 The reviewers identified a number of fundamental weaknesses in the paper.
 
 1) For a paper demonstrating a toolbox, it seems that some example analyses showing the value of the approach (and potentially the advantage in simplification, etc. over previous or other approaches) are really important to demonstrate.
 
 As noted by the first reviewer, the online repository (i.e. GitHub page) conveys a better sense of the toolbox’s contribution to the field than the present manuscript. This is a fair remark but at the same time, it is unclear how to illustrate this in a journal article without dedicating a great deal of page space to presenting raw code, while online tools offer an easier and clearer way to do this. As a work-around, our strategy was to illustrate some examples of data analysis in Figures 4&5 by comparing each illustrated processing step to the corresponding command line used by the Pynapple package. Each step requires a single line of code, meaning that one only needs to write three lines of code to decode a feature from population activity using a Bayesian decoder (Fig. 4a), compute the cross-correlogram of two neurons during a specific stimulus presentation (Fig. 4b) or compute the average firing rate of two neurons around a specific time of the experimental task (Fig. 4c). We believe that these visual aids make it unnecessary to add code in the main text of this manuscript. However, to aid reader understanding, we now provide clear references to online Jupyter notebooks which show how each figure was generated in figure legends as well as in the “Code Availability” section.
 
 https://github.com/pynapple-org/pynapple-paper-2023
 
 Furthermore, we have opted in to the “Executable Research Articles” feature at eLife, which will make it possible to include live scripts and figures in the manuscript once it is accepted for publication. We do not know at this stage what it entails exactly, but we hope that Figures 4&5 will become live with this feature. The readers will have the possibility to see and edit the code directly within the online version of the manuscript.
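 For readers unfamiliar with the analyses named above, the following plain-Python sketch shows what a cross-correlogram of two spike trains computes. It is deliberately not written against the Pynapple API and is not the package's implementation; the function name, arguments, and spike times are hypothetical and chosen only for illustration.

 ```python
 def cross_correlogram_counts(reference, target, bin_size, window):
     """Histogram of time lags (target - reference) within +/- window.

     All arguments are in the same time unit (e.g. seconds). Bins are
     half-open intervals [start, start + bin_size) covering [-window, window).
     """
     n_bins = int(round(2 * window / bin_size))
     counts = [0] * n_bins
     for t_ref in reference:
         for t in target:
             lag = t - t_ref
             if -window <= lag < window:
                 # Clamp guards against floating-point rounding at the last edge.
                 index = min(int((lag + window) // bin_size), n_bins - 1)
                 counts[index] += 1
     return counts


 # Hypothetical spike times (seconds) for two neurons.
 neuron_a = [1.0, 2.0]
 neuron_b = [1.5, 2.5]
 counts = cross_correlogram_counts(neuron_a, neuron_b, bin_size=0.5, window=1.0)
 # counts == [0, 1, 0, 2]: neuron_b tends to fire 0.5 s after neuron_a.
 ```

 The nested loop is quadratic in the number of spikes and is kept only for readability; a dedicated package would typically restrict the computation to an epoch of interest and normalise the counts to a firing rate.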
 
 2) The manuscript's claims about not having dependencies seem confusing.
 
 We agree that this claim was somewhat unfounded. There are virtually no Python packages that do not have dependencies. Our intention was to say that the package had no dependencies outside the most common ones, which are Numpy, Scipy, and Pandas. Too many packages in the field tend to have long lists of dependencies, making long-term back-compatibility quite challenging. By keeping dependencies minimal, we hope to maximise the package’s long-term back-compatibility. We have rephrased this statement in the manuscript in the following sections:
 
 Figure 1, legend.
 
 “These methods depend only on a few, commonly used, external packages.”
 
 Section Foundational data processing: “they are for the most part built-in and only depend on a few widely-used external packages. This ensures that the package can be used in a near stand-alone fashion, without relying on packages that are at risk of not being maintained or of not being compatible in the near future.”
 
 3) Given its significant relevance, it seems important to cite the FMATool and describe connections between it (or analyses based on it) and the presented work.
 
 Indeed, although we had already cited other toolboxes (including a review covering the topic comprehensively), we should have included this one in the original manuscript. Unfortunately, to the best of our knowledge, this toolbox is not citable (there is no companion paper). We have added a reference to it in plain text.
 
 4) Some discussion of integration between Pynapple and the rest of a full experimental data pipeline should be discussed with regard to reproducibility.
 
 This is an interesting point, and the third paragraph of the discussion somewhat broached this issue. Pynapple was not originally designed to pre-process data. However, it can, in theory, load any type of data stream after the necessary pre-processing steps. Overall, modularity is a key aspect of the Pynapple framework, and this is also the case for the integration with data pre-processing pipelines, for example spike sorting in electrophysiology and detection of regions of interest in calcium imaging. We do not think there should be a single integrated solution to the problem; instead, the goal is to make it possible for any piece of code to be used on data irrespective of their origin. This is why we focused on making data loading straightforward and easy to adapt to any particular situation. To expand on this point and make it clear that Pynapple is not meant to pre-process data but can, in theory, load any type of data stream after the necessary pre-processing steps, we have added the following sentences to the aforementioned paragraph:
 
 “Data in neuroscience vary widely in their structure, size, and need for pre-processing. Pynapple is built around the idea that raw data have already been pre-processed (for example, spike sorting and detection of ROIs).”
 
 5) Relatedly, a description of how data are stored after processing would be helpful (i.e., how precisely processed data are stored in NWB format).
 
 We agree that this is a critical issue. NWB is not necessarily the best option as it is not possible to overwrite data in an NWB ﬁle. This would require the creation of a new NWB ﬁle each time, which is computationally expensive and time consuming. It also further increases the odds of write errors. Theoretically, users who need to store intermediate results in a ﬂexible way could use any method they prefer, writing their own data ﬁles and wrappers to reload these data into Pynapple objects. Indeed, it is not easy to properly store data in an object-speciﬁc manner. This is a long-standing issue and one we are currently working to resolve.
 
 To do so, we are developing I/O methods for each Pynapple core object. We aim to provide an output format that is simple to read and backward compatible in future Pynapple releases. This feature will be available in the coming weeks. To note, while NWB may not be the central data format of Pynapple in future releases, it has become a central node in the neuroscience software ecosystem. Therefore, we aim to facilitate users’ reading and writing of this format by developing a set of simple standalone functions.
 
 Reviewer #1 (Public Review):
 
 A typical path from preprocessed data to ﬁndings in systems neuroscience often includes a set of analyses that share common components. For example, an investigator might want to generate plots that relate one time series (e.g., a set of spike times) to another (measurements of a behavioral parameter such as pupil diameter or running speed). In most cases, each individual scientist writes their own code to carry out these analyses, and thus the same basic analysis is coded repeatedly. This is problematic for several reasons, including the waste of time, the potential for errors, and the greater diﬃculty inherent in sharing highly customized code.
 
 This paper presents Pynapple, a python package that aims to address those problems.
 
 Strengths:
 
 The authors have identiﬁed a key need in the community - well-written analysis routines that carry out a core set of functions and can import data from multiple formats. In addition, they recognized that there are some common elements of many analyses, particularly those involving timeseries, and their object-oriented architecture takes advantage of those commonalities to simplify the overall analysis process.
 
 The package is separated into a core set of applications and another with more advanced applications, with the goal of both providing a streamlined base for analyses and allowing for implementations/inclusion of more experimental approaches.
 
 Weaknesses:
 
 There are two main weaknesses of the paper in its present form.
 
 First, the claims relating to the value of the library in everyday use are not demonstrated clearly. There are no comparisons of, for example, the number of lines of code required to carry out a speciﬁc analysis with and without Pynapple or Pynacollada. Similarly, the paper does not give the reader a good sense of how analyses are carried out and how the object-oriented architecture provides a simpliﬁed user interaction experience. This contrasts with their GitHub page and associated notebooks which do a better job of showing the package in action.
 
 As noted in the response to the Reviewing Editor and response to the reviewer’s recommendation to the authors below, we have now included links to Jupyter notebooks that highlight how panels of Figures 4 and 5 were generated (https://github.com/pynapple-org/pynapple-paper-2023). However, we believe that including more code in the manuscript than what is currently shown (i.e., abbreviated calls to methods on top of panels in Figs 4 and 5) would decrease the readability of the manuscript.
 
 Second, the paper makes several claims about the values of object-oriented programming and the overall design strategy that are not entirely accurate. For example, object-oriented programming does not inherently reduce coding errors, although it can be part of good software engineering. Similarly, there is a claim that the design strategy "ensures stability" when it would be much more accurate to say that these strategies make it easier to maintain the stability of the code. And the authors state that the package has no dependencies, which is not true in the codebase. These and other claims are made without a clear deﬁnition of the properties that good scientiﬁc analysis software should have (e.g., stability, extensibility, testing infrastructure, etc.).
 
 Following the reviewer’s comment, we have rephrased and clariﬁed these claims. We provide a detailed response to these remarks in the recommendations to authors below.
 
 There is also a minor issue - these packages address an important need for high-level analysis tools but do not provide associated tools for preprocessing (e.g., spike sorting) or for creating reproducible pipelines for these analyses. This is entirely reasonable, in that no one package can be expected to do everything, but a bit deeper account of the process that takes raw data and produces scientiﬁc results would be helpful. In addition, some discussion of how this package could be combined with other tools (e.g., DataJoint, Code Ocean) would help provide context for where Pynapple and Pynacollada could ﬁt into a robust and reliable data analysis ecosystem.
 
 We agree that better explaining how Pynapple is integrated within data preprocessing pipelines is essential. We have clariﬁed this aspect in the manuscript and provide more details below.
 
 Reviewer #1 (Recommendations For The Authors):
 
 Page 1
 
 Title
 
 The authors should note that the application name- "Pynapple" could be confused with something from Apple. Users may search for "Pyapple" as many python applications contain "py" like "Numpy". "Pyapple" indeed is a Python Apple that works with Apple products. They could consider "NeuroFrame", "NeuroSeries" or "NeuroPandas" to help users realize this is not an apple product.
 
 We thank the referee for this interesting comment. However, we are not willing to make such a change at this point. The community of users has been growing in the last year and it seems too late to change the name. To note, this is the ﬁrst time such a comment has been made to us, and users and collaborators do not seem to confuse Pynapple with any Apple product.
 
 Abstract
 
 The authors mentioned that the Pynapple is "fully open source". It may be better to simply say it is "open source".
 
 We agree, corrected.
 
 Assuming the authors keep the name, it would be helpful if the full meaning of Pynapple - Python Neural Analysis Package was presented as early as possible.
 
 Corrected in the abstract.
 
 Highlight
 
 An application being lightweight and standalone does not imply nor ensure backward compatibility. In general, it would be useful if the authors identiﬁed a set of desirable code characteristics, deﬁned them clearly in the introduction, and then described their software in terms of those characteristics.
 
 Thank you for your comment. We agree that being lightweight and standalone does not necessarily imply backward compatibility. Our intention was to emphasize that Pynapple is designed to be as simple and ﬂexible as possible, with a focus on providing a consistent interface for users across diﬀerent versions. However, we understand that this may not be enough to ensure long-term stability, which is why we are committed to regular updates and maintenance to ensure that the code remains functional as the underlying code base (Python versions, etc.) changes.
 
 Regarding your suggestion to identify a set of desirable code characteristics, we believe this is an excellent idea. In the introduction, we brieﬂy touch upon some of the core principles that guided our development of Pynapple: a lightweight, stable, and simple package. However, we acknowledge that providing a more detailed discussion of these characteristics and how they relate to the design of our software would be useful for readers. We have added this paragraph in the discussion:
 
 “Pynapple was developed to be lightweight, stable, and simple. As simplicity does not necessarily imply backward compatibility (i.e. long-term stability of the code), Pynapple main objects and their properties will remain the same for the foreseeable future, even if the code in the backend may eventually change (e.g. not relying on Pandas in future version). The small number of external dependencies also decrease the need to adapt the code to new versions of external packages. This approach favors long-term backward compatibility.”
 
 Page 2
 
 The authors wrote -

 "Despite this rapid progress, data analysis often relies on custom-made, lab-speciﬁc code, which is susceptible to error and can be diﬃcult to compare across research groups."
 
 It would be helpful to add that custom-made, lab-speciﬁc code can lead to a violation of FAIR principles (https://en.wikipedia.org/wiki/FAIR_datadata). More generally, any package can have errors, so it would be helpful to explain any testing regimens or other approaches the authors have taken to ensure that their code is error-free.
 
 We understand the importance of the FAIR principles for data sharing. However, Pynapple was not designed to handle data through their pre-processing. The only aspect that is somehow covered by the FAIR principles is the interoperability, but again, it is a requirement for the data to interoperate with diﬀerent storage and analysis pipelines, not of the analysis framework itself. Unlike custom-made code, Pynapple will make interoperability easier, as, in theory, once the required data loaders are available, any analysis could be run on any dataset. We have added the following sentence to the discussion:
 
 “Data in neuroscience vary widely in their structure, size, and need for pre-processing. Pynapple is built around the idea that raw data has already been pre-processed (for example, spike sorting and ROI detection). According to the FAIR principles, pre-processed data should interoperate across diﬀerent analysis pipelines. Pynapple makes this interoperability possible as, once the data are loaded in the Pynapple framework, the same code can be used to analyze diﬀerent datasets”
 
 The authors wrote -

 "While several toolboxes are available to perform neuronal data analysis ti–11,2ti (see ref. 29 for review), most of these programs focus on producing high-level analysis from speciﬁed types of data and do not oﬀer the versatility required for rapidly-changing analytical methods and experimental methods."
 
 Here it would be helpful if the authors could give a more speciﬁc example or explain why this is problematic enough to be a concern. Users may not see a problem with high-level analysis or using speciﬁc data types.
 
 Again, we apologize for not fully elaborating upon our goals here. Our intention was to point out that toolboxes often focus on one particular case of high-level analysis. In many cases, such packages lack low-level analysis features or the ﬂexibility to derive new analysis pipelines quickly and eﬀortlessly. Users can decide to use low-level packages such as Pandas, but in that case, the learning curve can be steep for users with low, if any, computational background. The simplicity of Pynapple, and the set of examples and notebooks, make it possible for individuals who are starting to code to quickly analyze their data.
 
 As we do not want to be too speciﬁc at this point of the manuscript (second paragraph of the intro) and as we have clariﬁed many of the aspects of the toolbox in the new revised version, we have only added the following sentence to the paragraph:
 
 “Users can decide to use low-level data manipulation packages such as Pandas, but in that case, the learning curve can be steep for users with low, if any, computational background.”
 
 The authors wrote -
 
 "To meet these needs, a general toolbox for data analysis must be designed with a few principles in mind"
 
 Toolboxes based on many diﬀerent principles can solve problems. It is likely more accurate to say that the authors designed their toolbox with a particular set of principles in mind. A clear description of those principles (as mentioned in the comment above) would help the reader understand why the speciﬁc choices made are beneﬁcial.
 
 We agree that these are not “universal” principles but rather the principles we had in mind when we designed the package. We have clariﬁed these principles and made clear that they reﬂect our personal points of view.
 
 We have rephrased the following paragraph:
 
 “To meet these needs, we designed Pynapple, a general toolbox for data analysis in systems Neuroscience with a few principles in mind.”
 
 The authors wrote -

 "The ﬁrst property of such a toolbox is that it should be object-oriented, organizing software around data."
 
 What facts make this true? For example, React is a web development library. A common approach to using this library is to use Hooks (essentially a collection of functions). This is becoming more popular than the previous approach of using Components (a collection of classes). This is an example of how object-oriented programming is not always the best solution. In some cases, for example, object-oriented coding can cause problems (e.g. it can be hard to ﬁnd the place where a given function is deﬁned and to ﬁgure out which version is being used given complex inheritance structures).
 
 In general, key selling points of object-oriented programming are extension, inheritance, and encapsulation. If the authors want to retain this text (which would be entirely reasonable), it would be helpful if they explained clearly how an object-oriented approach enables these functions and why they are critical for this application in particular.
 
 The referee makes a particularly important point. We are aware of the limits of OOP, especially when objects become overly complex and inheritance becomes unclear.
 
 We have clariﬁed our goal here. We believe that in our case, OOP is powerful and, overall, is less error-prone than a collection of functions. The reasons are the following:
 
 An object-oriented approach facilitates better interactions between objects. By encapsulating data and behavior within objects, object-oriented programming promotes clear and well-deﬁned interfaces between objects. This results in more structured and manageable code, as objects communicate with each other through these well-deﬁned interfaces. Such improved interactions lead to increased code reliability.
 
 Inheritance, a key concept in object-oriented programming, allows properties to be passed on from one object to another. One important example of how inheritance is crucial in the Pynapple framework is the time support of Pynapple objects. It determines the valid epoch on which the object is deﬁned. This property needs to be carried over during diﬀerent manipulations of the object. Without OOP, this property could easily be forgotten, resulting in erroneous conclusions for many types of analysis. The simplest case is the average rate of a TS object: the rate must be computed on the time support (a property of TS objects), not from the beginning to the end of the recording (or of a speciﬁc epoch, independent of the TS). Finally, it is easier to access and manipulate the meta information of a Pynapple object than it would be without using objects.
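
The role of the time support in the rate computation can be illustrated with a minimal sketch. The `SimpleTs` class below is hypothetical and written only for this illustration; it is not the Pynapple API, and it assumes that epochs are non-overlapping `(start, end)` pairs lying within the current support.

```python
from bisect import bisect_left, bisect_right


class SimpleTs:
    """Hypothetical timestamp container (illustration only, NOT the Pynapple API)."""

    def __init__(self, times, support):
        self.times = sorted(times)
        # The time support is stored with the data, so it cannot be "forgotten".
        self.support = list(support)

    def restrict(self, epochs):
        """Keep timestamps inside `epochs`; the result carries `epochs` as its support."""
        kept = []
        for start, end in epochs:
            lo = bisect_left(self.times, start)
            hi = bisect_right(self.times, end)
            kept.extend(self.times[lo:hi])
        return SimpleTs(kept, support=epochs)

    @property
    def rate(self):
        """Average rate computed on the time support, not on first-to-last timestamp."""
        total = sum(end - start for start, end in self.support)
        return len(self.times) / total


# Six events recorded over a 100 s session.
spikes = SimpleTs([1.0, 2.0, 3.0, 11.0, 12.0, 50.0], support=[(0.0, 100.0)])

# Restrict to two 5 s epochs: five events remain, over 10 s of valid time.
sub = spikes.restrict([(0.0, 5.0), (10.0, 15.0)])
support_rate = sub.rate

# A naive estimate that ignores the support and uses first-to-last timestamp.
naive_rate = len(sub.times) / (sub.times[-1] - sub.times[0])
```

In this toy case the support-based rate is 5 events over 10 s, whereas the naive estimate divides by the 11 s between the first and last retained events, which is the kind of silent discrepancy the response describes.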
 
 The authors wrote -
 
 "drastically diminishing the odds of a coding error"
 
 This seems a bit strong here. Perhaps "reducing the odds" would be more accurate.
 
 We agree. Now changed.
 
 Page 3
 
 The authors wrote -
 
 ". Another property of an eﬃcient toolbox is that as much data as possible should be captured by only a small number of objects This ensures that the same code can be used for various datasets and eliminates the need of adapting the structure"
 
 It may be better to write something like - "Objects have a collection of preset variables/values that are well suited for general use and are very ﬂexible." Capturing "as much data as possible" may be confusing, because it's not the amount that this helps with but rather the variety.
 
 We thank the referee for this remark. We have rephrased this sentence as follows:
 
 “Another property of an eﬃcient toolbox is that a small number of objects could virtually represents all possible data streams in neuroscience, instead of objects made for speciﬁc physiological processes (e.g. spike trains).”
 
 The authors wrote -
 
 "The properties listed above ensure the long-term stability of a toolbox, a crucial aspect for maintaining the code repository. Toolboxes built around these principles will be maximally ﬂexible and will have the most general application"
 
 There are two issues with this statement. First, ensuring long-term stability is only possible with a long-term commitment of time and resources to ensure that the code remains functional as the underlying code base (python versions, etc.) changes. If that is something you are committing to, it would be great to make that clear. If not, these statements need to be less ﬁrm.
 
 Second, it is not clear how these properties were arrived at in the ﬁrst place. There are things like the FAIR Principles which could provide an organizing framework, ideally when combined with good software engineering practices, and if some more systematic discussion of these properties and their justiﬁcation could be added, it would help the ﬁeld think about this issue more clearly.
 
 The referee makes a valid point that ensuring long-term stability requires a long-term commitment of time and resources to maintain the code as the underlying technology evolves. While we cannot make guarantees about the future of Pynapple, we believe that one of the best ways to ensure long-term stability is by fostering a strong community of users and contributors who can provide ongoing support and development. By promoting open-source collaboration and encouraging community involvement, we hope to create a sustainable ecosystem around Pynapple that can adapt to changes in technology and scientiﬁc practices over time. Ultimately, the longevity of any scientiﬁc tool depends on its adoption and use by the research community, and we hope that Pynapple can provide value to neuroscience researchers and continue to evolve and improve as the ﬁeld progresses.
 
 It is noteworthy that the ﬁrst author, and main developer of the package, has now been hired as a data scientist at the Center for Computational Neuroscience, Flatiron Institute, to explicitly continue the development of the tool and build a community of users and contributors.
 
 The authors wrote -
 
 "each with a limited number of methods..."
 
 This may give the impression that the functionality is limited, so rephrasing may be helpful.
 
 Indeed! We have now rephrased this sentence:
 
 “The core of Pynapple is ﬁve versatile timeseries objects, whose methods make it possible to intuitively manipulate and analyze the data.”
 
 The authors wrote that object-oriented coding
 
 "limits the chances of coding error"
 
 This is not always the case, but if it is the case here, it would be helpful if the authors explain exactly how it helps to use object-oriented approaches for this package.
 
 We agree with the referee that it is not always the case. As we explained above, we believe it is less error-prone than a collection of functions. Quite often, it also makes debugging easier. We have replaced this sentence with the following one:
 
 “Because objects are designed to be self-contained and interact with each other through well-deﬁned methods, users are less likely to make errors when using them. This is because objects can enforce their own internal consistency, reducing the chances of data inconsistencies or unexpected behavior. Overall, OOP is a powerful tool for managing complexity and reducing errors in scientiﬁc programming.”
 
 Fig 1
 
 In object-oriented programming, a class is a blueprint for the classes that inherit it. Instantiating that class creates an object. An object contains any or all of these - data, methods, and events. The ﬁgure could be improved if it maintained these organizational principles as ﬁgure properties.
 
 We agree with the referee’s remark regarding the logic of object instantiation, but it is unclear how this could be incorporated in Fig. 1 without making it too complex. Here, objects are instantiated from the ﬁrst to the second column. We have not provided details about the parent objects, as we believe these details are not important for reader comprehension. In its present form, the objects inherit from Pandas objects, but it is possible that a future version will be based on something else. For the users, this will be transparent as the toolbox is designed in such a way that only the methods that are speciﬁc to Pynapple are needed to do most computations, while only expert programmers may be interested in using Pandas functionalities.
 
 The authors wrote that Pynapple does -
 
 "not depend on any external package"
 
 As mentioned above, this is not true. It depends on Numpy and likely other packages, and this should be explained. It is perfectly reasonable to say that it depends on only a few other packages.
 
 As said above, we have now clariﬁed this claim.
 
 Page 5.
 
 The authors wrote -
 
 "represent arrays of Ts and Tsd"
 
 For a knowledgeable reader's reference, it would be helpful to refer to these either as Numpy arrays (at least at ﬁrst when they are deﬁned) or as lists if they are native python objects.
 
 Indeed, using the word “arrays” here could be confusing because of Numpy arrays. We have replaced this term with “groups”.
 
 The authors wrote -
 
 "Pynapple is built with objects from the Pandas library ... Pynapple objects inherit the computational stability and ﬂexibility"
 
 Here a deﬁnition of stability would be useful. Is it the case that by stability you mean "does not change often"? Or is some other meaning of stability implied?
 
 Yes, this is exactly what we meant when referring to the stability of Pandas. We have added the following clariﬁcation:
 
 “As such, Pynapple objects inherit the long-term consistency of the code and the computational ﬂexibility from this widely used package.”
 
 Page 6
 
 Fig 2
 
 In Fig 2 A and B, the illustrations are good. It would also be very helpful to use toy code examples to illustrate how Pynapple would be used to carry out a sample analysis problem so that potential users can see what would need to be done.
 
 We appreciate the kind words. Regarding the toy code, this is what we tried to do in Fig. 4. Instead of including the code directly in the paper, which does not seem to be a modern way of presenting it, we now refer to the online notebooks that reproduce all panels of Figure 4.
 
 The authors wrote -
 
 "While these objects and methods are relatively few"
 
 In object-oriented programming, objects contain methods. If a method is not in an object, it is not technically a method but a function. It would be helpful if the authors made sure their terminology is accurate, perhaps by saying something like "While there are relatively few objects, and while each object has relatively few methods ... "
 
 We agree with the referee, we have changed the sentence accordingly.
 
 The authors wrote -
 
 "if not implemented correctly, they can be both computationally intensive and highly susceptible to user error"
 
 Here the authors are using "correctly" to refer to two things - "accuracy" - getting the right answer, and "eﬃciency" - getting to that answer with relatively less computation. It would be clearer if they split out those two concepts in the phrasing.
 
 Indeed, we used the term to cover both aspects of the problem, leading to the two possible issues cited in the second part of the sentence. We have changed the sentence following the referee’s advice:
 
 “While there are relatively few objects, and while each object has relatively few methods, they are the foundation of almost any analysis in systems neuroscience. However, if not implemented eﬃciently, they can be computationally intensive and if not implemented accurately, they are highly susceptible to user error.”
 
 In the next sentence the authors wrote -
 
 "Pynapple addresses this concern."
 
 This statement would beneﬁt from just additional text explaining how the concern is addressed.
 
 We thank the referee for the suggestion. We have changed the sentence to this one: “The implementation of core features in Pynapple addresses the concerns of eﬃciency and accuracy”
 
 Page 9
 
 The authors wrote -
 
 "This is implemented via a set of specialized object subclasses of the BaseLoader class. To avoid code redundancy, these I/O classes inherit the properties of the BaseLoader class."
 
 From a programming perspective, the point of a base class is to avoid redundancy, so it might be better to just mention that this avoids the need to redeﬁne I/O operations in each class.
 
 We have rephrased the sentence as follows:
 
 “This is implemented via a set of specialized object subclasses of the BaseLoader class, avoiding the need to redeﬁne I/O operations in each subclass"
 
 The authors wrote -
 
 "classes are unique and independent from each other, ensuring stability"
 
 How do classes being unique and independent ensure stability? Perhaps here again the misunderstanding is due to the lack of a deﬁnition of stability.
 
 We thank the referee for the remark. We ﬁrst changed “stability” for “long-term backward compatibility”. We further added the following sentence to clarify this claim. “For instance, if the spike sorting tool Phy changes its output in the future, this would not aﬀect the “Neurosuite” IO class as they are independent of each other. This allows each tool to be updated or modiﬁed independently, without requiring changes to the other tool or the overall data format.”
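
The loader design discussed in the two responses above can be sketched as follows. This is a hypothetical illustration: only the name `BaseLoader` comes from the manuscript, while its methods and the `CsvLoader` and `WhitespaceLoader` subclasses are assumptions made for this example and do not reproduce Pynapple's actual I/O classes.

```python
import csv
import io


class BaseLoader:
    """Hypothetical base class: shared I/O logic is written once here."""

    def __init__(self, stream):
        self.stream = stream

    def load(self):
        # Shared step for every format: parse, then return sorted timestamps.
        return sorted(self.parse(self.stream))

    def parse(self, stream):
        raise NotImplementedError("subclasses define format-specific parsing")


class CsvLoader(BaseLoader):
    """Reads one timestamp per CSV row (first column)."""

    def parse(self, stream):
        return [float(row[0]) for row in csv.reader(stream) if row]


class WhitespaceLoader(BaseLoader):
    """Reads whitespace-separated timestamps."""

    def parse(self, stream):
        return [float(token) for token in stream.read().split()]


# Two independent formats, one shared load() implementation.
csv_times = CsvLoader(io.StringIO("0.5\n0.1\n0.9\n")).load()
txt_times = WhitespaceLoader(io.StringIO("0.9 0.1 0.5")).load()
```

Because each subclass only overrides `parse`, a change in one upstream format would require editing a single subclass, leaving the other loader and the shared `load` logic untouched.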
 
 The authors wrote -
 
 "Using preexisting code to load data in a speciﬁc manner instead of rewriting already existing functions avoids preprocessing errors"
 
 Here it might be helpful to use the lingo of Object-oriented programming. (e.g. inheritance and polymorphism). Deﬁning these terms for a neuroscience audience would be useful as well.
 
 We do not think it is necessary to use too many technical terms in this manuscript. However, this sentence was indeed confusing. We have now simpliﬁed it:
 
 “[…], users can develop their own custom I/O using available template classes. Pynapple already includes several of such templates and we expect this collection to grow in the future.”
 
 Page 10
 
 The authors wrote -
 
 "These analyses are powerful because they are able to describe the relationships between time series objects while requiring the fewest number of parameters to be set by the user."
 
 It is not clear that this makes for a powerful analysis as opposed to an easy-to-use analysis.
 
 We have replaced “powerful” with “easy to use”.
 
 Page 12
 
 "they are built-in and thus do not have any external dependencies"
 
 If the authors want to retain this, it would be helpful to explain (perhaps in the introduction) why having fewer external dependencies is useful. And is it true that these functions use only base python classes?
 
 We have rephrased this sentence as follows:
 
 “they are for the most part built-in and only depend on a few common external packages, ensuring that they can be used stand-alone without relying on packages that are at risk of not being maintained or of not being compatible in the near future.”
 
 Other comments:
 
 It would be helpful, as mentioned in the public review, to frame this work in the broader context of what is needed to go from data to scientiﬁc results so that people understand what this package does and does not provide.
 
 We have added the following sentence to the discussion to make sure readers understand:
 
 “The path from data collection to reliable results involves a number of critical steps: exploratory data analysis, development of an analysis pipeline that can involve custom-made developed processing steps, and ideally the use of that pipeline and others to replicate the results. Pynapple provides a platform for these steps.”
 
 It would also be helpful to describe the Pynapple software ecosystem as something that readers could contribute to. Note here that GNU may not be a good license. Technically, GNU requires any changes users make to Pynapple for their internal needs to be oﬀered back to the Pynapple team. Some labs may ﬁnd that burdensome or unacceptable. A workaround would be to have GNU and MIT licenses.
 
 The main restriction of the GPL license is that if the code is changed by others and released, a similar license should be used, so that it cannot become proprietary. We therefore stick to this choice of license.
 
 We would be more than happy to receive contributions from the community. To note, several users outside the lab have already contributed. We have added the following sentence in the introduction:
 
 “As all users are also invited to contribute to the Pynapple ecosystem, this framework also provides a foundation upon which novel analyses can be shared and collectively built by the neuroscience community.”
 
 This software shares some similarities with the nelpy package, and some mention of that package would be appropriate.
 
 While we acknowledge the reviewer's observation that Nelpy is a similar package to Pynapple, there are several important diﬀerences between the two.
 
 First, Nelpy includes predeﬁned objects such as SpikeTrain, BinnedSpikeTrain, and AnalogSignal, whereas Pynapple would use only Ts and Tsd for those. This design choice was made to provide greater ﬂexibility and allow users to deﬁne their own data structures as needed.
 
 Second, Nelpy is primarily focused on electrophysiology data, whereas Pynapple is designed to handle a wider range of data types, including calcium imaging and behavioral data. This reﬂects our belief that the NWB format should be able to accommodate diverse experimental paradigms and modalities.
 
 Finally, while Nelpy oﬀers visualization and high-level analysis tools tailored to electrophysiology, Pynapple takes a more general-purpose approach. We believe that users should be free to choose their own visualization and analysis tools based on their speciﬁc needs and preferences.
 
 The package has now been cited.
 
 Reviewer #2 (Public Review):
 
 Pynapple and Pynacollada have the potential to become very valuable and foundational tools for the analysis of neurophysiological data. NWB still has a steep learning curve and Pynapple oﬀers a user-friendly toolset that can also serve as a wrapper for NWB.
 
 The scope of the manuscript is not clear to me, and the authors could help clarify if Pynacollada and other toolsets in the making become a future aspect of this paper (and Pynapple), or are the authors planning on building these as separate publications.
 
 The author writes that Pynapple can be used without the I/O layer, but the author should clarify how or if Pynapple may work outside NWB.
 
 Absolutely. Pynapple can be used for generic data analysis, with no requirement of speciﬁc inputs nor NWB data. For example, the lab is currently using it for a computational project in which the data are loaded from simple ﬁles (and not from full I/O functions as provided in the toolbox) for further analysis and ﬁgure generation.
 
 This was already noted in the manuscript, in the last paragraph of the section “Importing data from common and custom pipelines”:
 
 “Third, users can still use Pynapple without using the I/O layer of Pynapple.”
 
 We have added the following sentence to the discussion:
 
 “To note, Pynapple can be used without the I/O layer and independent of NWB for generic, on-the-fly analysis of data.”
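As a rough illustration of what analysis without the I/O layer can look like, the sketch below reads time-stamped values from a simple text file and restricts them to an epoch. It uses only the Python standard library. The helper names `load_simple_file` and `restrict`, and the file content, are hypothetical choices made for this example; they are not Pynapple functions, although Pynapple's own time series objects support a comparable restriction to time intervals.

```python
import csv
import io
from bisect import bisect_left, bisect_right


def load_simple_file(handle):
    """Read (time, value) rows from a CSV-like file into two time-sorted lists."""
    rows = sorted((float(t), float(v)) for t, v in csv.reader(handle))
    times = [t for t, _ in rows]
    values = [v for _, v in rows]
    return times, values


def restrict(times, values, start, end):
    """Keep only the samples whose timestamps fall within [start, end]."""
    lo = bisect_left(times, start)   # first index with time >= start
    hi = bisect_right(times, end)    # first index with time > end
    return times[lo:hi], values[lo:hi]


# Hypothetical file content: time in seconds, then a measured value.
raw = io.StringIO("0.0,1.5\n0.5,2.0\n1.0,2.5\n1.5,3.0\n2.0,3.5\n")
times, values = load_simple_file(raw)
epoch_times, epoch_values = restrict(times, values, start=0.5, end=1.5)
print(epoch_times, epoch_values)
```

The point of the sketch is only that the analysis step needs time stamps and values, not a particular file format; any loader that yields those two sequences would do.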
 
 This brings us to an important fundamental question. What are the advantages of the current approach, where data is imported into the Ts objects, compared to doing the data import into NWB files directly, and then making Pynapple secondary objects loaded from the NWB file? Does NWB natively have the ability to store the 5 object types or are they initialized on every load call?
 
 NWB and Pynapple are complementary but not interdependent. NWB is meant to ensure long-term storage of data and as such contains as much information as possible to describe the experiment. Pynapple does not use NWB to directly store the objects; however, it can read from NWB to organize the data in Pynapple objects. Since the original version of this manuscript was submitted, new methods address this. Specifically, in the current beta version, each object now has a “save” method. We are also developing functions to load these objects. This does not depend on NWB but on npz, a NumPy-specific file format. However, we believe it is premature to include these recent developments in the manuscript and prefer not to discuss this for now.
 
 Many of these functions and objects have a long history in MATLAB - which documents their usefulness, and I believe it would be fitting to put further stress on this aspect - what aspects already existed in MATLAB and what is completely novel. A widely used MATLAB toolset, the FMA toolbox (the Freely moving animal toolbox) has not been cited, which I believe is a mistake.
 
 We agree that the FMA toolbox should have been cited. This has now been corrected.
 
 Pynapple was first developed in Matlab (it was then called TSToolbox). The first advantage is of course that Python is more accessible than Matlab. It has also been adopted by a large community of developers in data analysis and signal processing, which has become without a doubt much larger than the Matlab community, making it possible to find solutions online for virtually any problem one can have. Furthermore, in our experience, trainees are now unwilling to get training in Matlab.
 
 Yet, Python has drawbacks, which we are fully aware of. Matlab can be very computationally efficient, and old code can usually run without any change, even many years later.
 
 A limitation in using NWB files is its standardization with limited built-in options for derived data and additional metadata. How are derived data stored in the NWB files?
 
 NWB has predetermined a certain number of data containers, which are most common in systems neuroscience. It is theoretically possible to store any kind of data and associated metadata in NWB but this is difficult for a non-expert user. In addition, NWB does not allow data replacement, making it necessary to rewrite a whole new NWB file each time derived data are changed and stored. Therefore, we are currently addressing this issue as described above. Derived data and metadata will soon be easy to store and read.
 
 How is Pynapple handling an existing NWB dataset, where spikes, behavioral traces, and other data types have already been imported?
 
 This is an interesting point. In theory, Pynapple should be able to open an NWB file automatically, without providing much information. In practice, it is challenging to open an NWB file without knowing exactly what to look for and how the data were preprocessed. This would require adapting an I/O function for a specific NWB file. Unfortunately, we do not believe there is a universal solution to this problem. There are solutions being developed by others, for example NWB Widgets. We will keep an eye on this and see whether this could be adapted to create a universal NWB loader for Pynapple.
 
 Reviewer #2 (Recommendations For The Authors):
 
 Other tools and solutions are being developed by the NWB community. How will you make sure that these tools can take advantage of Pynapple and vice versa?
 
 We recognize the importance of collaboration within the NWB community and are committed to making sure that our tools can integrate seamlessly with other tools and solutions developed by the community.
 
 Regarding Pynapple specifically, we are designing it to be modular and flexible, with clear APIs and documentation, so that other tools can easily interface with it. One important thing is that we want to make sure Pynapple is not too dependent on another package or file format such as NWB. Ideally, Pynapple should be designed so that it is independent of the underlying data storage pipeline.
 
 Most of the tools that have been developed in the NWB community so far were designed for data visualisation and data conversion, something that Pynapple does not currently address. Multiple packages for behavioral analysis and exploration of electro/optophysiological datasets are compatible with the NWB format but do not provide additional solutions per se. They are complementary to Pynapple.
 
URL

biorxiv.org/content/10.1101/2022.12.06.519376v2
www.biorxiv.org

New submission 22/08/2023, 10:51:29

1
1. Public_Reviews 22 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 We would like to thank you for your thorough review of the manuscript. We have taken all comments into account in the revised version of the manuscript. Please find below our detailed responses to your comments.
 
 eLife assessment
 
 This study reports useful information on the limits of the organotypic culture of neonatal mouse testes, which has been regarded as an experimental strategy that can be extended to humans in the clinical setting for the conservation and subsequent re-use of testicular tissue. The evidence that the culture of testicular fragments of 6.5-day-old mouse testes does not allow optimal differentiation of steroidogenic cells is compelling and would be useful to the scientific community in the field for further optimizations.
 
 Thank you for this assessment. We have carefully considered all comments and made the requested revisions to improve the manuscript.
 
 Reviewer #1 (Public Review):
 
 In this manuscript, the authors aimed to compare, from testis tissues at different ages from mice in vivo and after culture, multiple aspects of Leydig cells. These aspects included mRNA levels, proliferation, apoptosis, steroid levels, protein levels, etc. A lot of work was put into this manuscript in terms of experiments, systems, and approaches. However, as written the manuscript is incredibly difficult to follow. The Introduction and Results sections contain rather loosely organized lists of information that were altogether confusing. At the end of reading these sections, it was unclear what advance was provided by this work. The technical aspects of this work may be of interest to labs working on the specific topics of in vitro spermatogenesis for fertility preservation but fail to appeal to a broader readership. This may be best exemplified by the statements at the end of both the Abstract and Discussion which state that more work needs to be done to improve this system.
 
 As suggested, we have reworked the manuscript to make it clearer, more meaningful and more precise. We believe that this work may be of interest to a broader readership. Indeed, the development of a model of in vitro spermatogenesis could be of interest for labs working on the specific period of puberty initiation, on germ and somatic cell maturation and on steroidogenesis under physiological and pathological conditions, and could also be useful for testing the toxicity of cancer therapies, drugs, chemicals and environmental agents (e.g. endocrine disruptors) on the developing testis.
 
 There is a crucial unmet need to optimize the culture conditions for in vitro spermatogenesis. It is important to identify the deregulated molecular mechanisms leading to a decreased in vitro spermatogenic yield. Such results will be of great help to improve organotypic culture conditions. In the present study, we not only uncovered for the first time a failure in adult Leydig cell development, but also an alteration in the expression of several steroidogenic and steroid-metabolizing genes, which could explain the accumulation of progesterone and estradiol and the deficiency of androstenedione in cultured tissues. This hyperestrogenic and hypoandrogenic environment could explain, at least in part, the low efficiency of in vitro spermatogenesis. Furthermore, we show that the addition of hCG (LH homolog) is not sufficient to facilitate Leydig cell differentiation, restore steroidogenesis and improve sperm yield. These data provide valuable information for improving culture conditions. More fundamentally, this culture system could be a useful tool for identifying factors that are essential for the differentiation and functionality of adult Leydig cells during puberty initiation.
 
 Recommendations For The Authors:
 
 This reviewer appreciates that a lot of work was put into this manuscript in terms of experiments, systems, and approaches. However, the manuscript needs significant revision, and in this reviewer's opinion is not appropriate for a broader readership journal. The results seem rather incremental, and the topic is too specialized in its current format.
 
 The manuscript was significantly revised taking into account the reviewer’s comments. In addition, as mentioned above, the development of a model of in vitro spermatogenesis could have wider applications and be of interest to a broader audience.
 
 Comments for improvement, roughly in order of appearance:
 
 1) Abstract - would recommend condensing to hit the main points of the manuscript.
 
 The abstract has been condensed as suggested.
 
 2) Introduction, overall - this is a rather loosely organized list of information that is not synthesized or communicated in a meaningful way. It contains overstatements and lumps together findings from both mice and primates and thus several statements for the actions of these steroid hormones are inaccurate. The authors rely much too heavily upon reviews and need to replace those with a more scholarly approach of carefully reading and citing primary literature.
 
 The Introduction has been reorganized to make it clearer, more concise, more meaningful and more accurate. Only findings from rodents are presented. We carefully read the literature and replaced most reviews with primary literature.
 
 3) Results - this section was extremely difficult to read and comprehend, as it's essentially a laundry list of measurements of mRNAs, steroids, cholesterols, and proteins that go up or down or don't change at multiple ages, both in vitro and in vivo. The section would be improved greatly by an organization with rationale and concluding statements to prepare the reader for the factoid-style data that are presented.
 
 As suggested, the Results section has been reorganized, with a rationale and concluding statements added to make it easier to read and comprehend.
 
 4) 47 - is this approach going to both "preserve and restore"? Sounds more like it will allow for the production of offspring, but the other goals are not going to happen from the approach listed in the latter part of that sentence - so not really "fertility restoration" but more of an insurance program that sperm can be produced for ART
 
 Freezing of prepubertal testicular tissue, which contains spermatogonia, is a fertility preservation option proposed to prepubertal boys with cancer prior to highly gonadotoxic treatments. Several fertility restoration strategies, which aim to allow the production of spermatozoa from cryopreserved spermatogonia, are being developed, including in vitro spermatogenesis. This sentence has been rewritten.
 
 5) 62 - specify whether this "decreased expression" is mRNA or protein, and is this because of a loss of Sertoli cells?
 
 “Decreased expression” was replaced by “decreased mRNA levels”. The results we obtained in the cited study (Rondanino et al., 2017) suggest that the decrease in Rhox5 mRNA levels is not the consequence of a change in the proportion of Sertoli cells but reflects an alteration in Rhox5 gene expression. In Figure 6U of the present study, we show indeed that there is no loss of Sertoli cells in organotypic cultures.
 
 6) 66 - what is "the first wave of mouse in vitro spermatogenesis"? Are these cultures from the first wave of mouse in vivo spermatogenesis, or is there a second wave of in vitro spermatogenesis? Please specify
 
 In the mouse, the first entry into meiosis occurs around 8-10 dpp and the first spermatozoa are produced at around 35 dpp: this is the first wave of spermatogenesis which takes place at the onset of puberty. By culturing 6 dpp-old testes for 30 days, our aim is to reproduce in vitro all the stages of this first wave of spermatogenesis, i.e. entry into meiosis, completion of meiosis and spermiogenesis.
 
 In the cited study (Pence et al., 2019), the authors cultured 5 dpp testes for 35 to 49 days and observed a decline in intratesticular testosterone levels in the cultured tissues, i.e. after the end of the first spermatogenic wave, compared to in vivo controls. Our sentence has been rewritten to make it clearer.
 
 7) 78 - is there a difference in T production by Fetal vs Adult LCs? It is this reviewer's understanding that the levels of T around birth in mice (and then a few months after birth in humans) are quite high, similar to adults. So, what are the authors suggesting here by providing the list of expressed genes in these two LC populations?
 
 As mentioned in the Introduction section, 17β-HSD3 – the enzyme responsible for the conversion of androstenedione to T – is not expressed in fetal Leydig cells but is expressed in adult Leydig cells. Therefore, unlike adult Leydig cells, fetal Leydig cells are not capable of synthesizing T.
 
 In the present study, we investigated steroidogenesis but also wondered which types of Leydig cells could be detected under in vitro conditions. It is therefore important to explain to the reader which steroidogenic proteins are expressed by the different Leydig cell populations.
 
 As described in O’Shaughnessy et al., 2002, levels of intratesticular T decline after birth, being very low between 10 and 20 dpp. Then, T levels increase. At 25 dpp, T levels are close to those observed at 1 dpp. T levels increase more than 16-fold between 25 and 30 dpp and then double between 30 dpp and adulthood. Therefore, intratesticular T levels around birth in mice are not as high as in adults, but are about 36-fold lower after birth than in adulthood. It has been shown that in the fetal testis, the conversion of androstenedione produced by fetal Leydig cells is achieved by the adjacent fetal Sertoli cells that express 17β-HSD3 (O’Shaughnessy et al., 2000; Shima et al., 2013). During postnatal development however, Sertoli cells lose the expression of 17β-HSD3 (O’Shaughnessy et al., 2000).
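The fold changes quoted above can be chained as a quick consistency check. This is a sketch only: it treats the 25 dpp level as equal to the 1 dpp level (the response says "close to") and uses 16 as a lower bound for the increase between 25 and 30 dpp. The variable names are illustrative.

```python
# Fold changes in intratesticular T as quoted in the response.
fold_1_to_25_dpp = 1.0    # assumption: "close to" the 1 dpp level is treated as equal
fold_25_to_30_dpp = 16.0  # stated as "more than 16-fold", so this is a lower bound
fold_30_to_adult = 2.0    # stated as "double"

# Chaining the steps gives a lower bound on the adult / 1 dpp ratio.
lower_bound = fold_1_to_25_dpp * fold_25_to_30_dpp * fold_30_to_adult

# The quoted overall figure (about 36-fold) implies this rise between 25 and 30 dpp.
implied_25_to_30 = 36.0 / (fold_1_to_25_dpp * fold_30_to_adult)

print(lower_bound, implied_25_to_30)
```

The product, 32, is a lower bound; the quoted figure of about 36-fold would correspond to roughly an 18-fold rise between 25 and 30 dpp, which is compatible with "more than 16-fold".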
 
 8) 79 -99 - can the authors revise this long list of information to provide a summary of what they are trying to communicate to the reader? What is the intention of this information?
 
 This paragraph has been modified to make it clearer and more concise. As different Leydig cell markers are presented in the Results section, it is important to introduce the reader to the different types of Leydig cells, the proteins expressed by these cells and the factors involved in their proliferation and differentiation.
 
 9) 101-2 - replace "involved in" with a more meaningful word - and it is this reviewer's understanding that T has not been shown convincingly to have much of a role in spermatogonial development, at least in mice - that statement is likely true in primates, but not mice; provide primary literature citations to be more precise, rather than a broad review that covers multiple species
 
 “involved in” was replaced by “is essential for many aspects of spermatogenesis, including”. Moreover, we removed “spermatogonial proliferation and differentiation” and provided primary literature citations to be more precise.
 
 10) 105-7 - similar concern for E as for T, above - KO mouse models for ERalpha and beta did not show defects in spermatogenesis as described - not sure what evidence the authors are specifically referring to here - cite primary literature rather than a review on Vitamin D + estrogen
 
 We agree that the question of whether estrogens play a direct role in spermatogenesis was unanswered by the ER null mice. However, estrogens have been shown to be important for the long-term maintenance of spermatogenesis in the ArKO mouse (Robertson et al., 1999) and for the progression of normal germ cell development in the ENERKI mouse (Sinkevicius et al., 2009). This sentence has been reworded and primary literature is cited to be more precise.
 
 11) 113-4 - there is no convincing evidence this reviewer is aware of that the AR is expressed in male germ cells, and therefore T actions on germ cells are indirect, through Sertoli cells and perhaps PTMs; if there is some, this sentence needs a citation showing that
 
 We agree that there is no evidence that AR is expressed in male germ cells and that T acts indirectly on germ cells. This sentence has been rewritten.
 
 12) 114-6 - this is untrue - nowhere in that paper was testosterone or androgen even mentioned!
 
 This reference has been removed. We apologize for this mistake.
 
 13) 116-7 - again, E actions through the ERs are thought to be indirect in the testis, not acting on germ cells; if this is incorrect, please add supportive citations and explain; replace "involved" with a more meaningful word; Rhox5 has a very minor role in spermatogenesis
 
 In contrast to androgen receptors, which are localized in somatic cells, estrogen receptors have been found in most testicular cells, including germ cells. The studies reporting the expression of estrogen receptors in germ cells are cited in the Introduction section. The word “involved” was replaced by “promotes”.
 
 Rhox5 (also known as Pem) does not have a very minor role in spermatogenesis. On the contrary, its expression is crucial for normal spermatogenesis and sperm maturation, as loss of Rhox5 in male mice leads to reduced fertility, increased germ cell apoptosis, decreased sperm count and decreased sperm motility (MacLean et al., 2005).
 
 14) 117 - Ref 29 does not support the statement about Rhox5's role in spermatogenesis
 
 The reference (MacLean et al., 2005), supporting the statement about Rhox5’s role in spermatogenesis, was added in the manuscript.
 
 15) 120 - Does FAAH have a protective role in that it is anti-apoptotic? Or just required for some other Sertoli cell function? Should re-word to be more specific.
 
 FAAH (fatty acid amide hydrolase), whose expression is stimulated by estrogens, has been shown to have a crucial role in promoting survival of Sertoli cells by degrading anandamide (N-arachidonoylethanolamine), an endocannabinoid which has a pro-apoptotic activity (Rossi et al., 2007).
 
 The sentence has been reworded to be more specific.
 
 16) 127 - should complete the Introduction with a sentence summarizing what was done and found, for reader clarity
 
 The Introduction has been completed for reader clarity.
 
 17) 136 - misspelled the procedure
 
 Orchidectomy was replaced by orchiectomy.
 
 18) Mice - why use half-day nomenclature for postpartum mice? This is not standard in the literature.
 
 Half-day nomenclature was used due to the uncertainty of the time of birth, which mostly takes place during the night. Since this is not standard in the literature, half-day nomenclature was removed in the entire manuscript.
 
 19) 172-3 - the half-life of RA is very short (<1 hr), and it is light-sensitive. This addition every 8 days means that retinoids are present for a very minimal window of time - are the authors sure retinoids have no requirement elsewhere during spermatogenesis? And in the literature, the measured pulse of RA in the mouse lasts >40 hours (stages VII-IX)...
 
 RA is mandatory for proper spermatogenesis and is needed many times during spermatogenesis (for review, see Schleif et al., 2022): RA is involved in spermatogonial differentiation, pre-meiotic activation and meiotic completion, establishment of the blood-testis barrier and spermiation. In our study, we did not add RA in the culture medium but retinol, the precursor of RA. Indeed, our previous studies have shown beneficial effects of retinol on in vitro spermatogenesis, including an increased production of spermatids with fewer nuclear alterations and less DNA damage (Arkoun et al., 2015; Dumont et al., 2016).
 
 The reason we added retinol (and not RA, which has a very short half-life) in this study and in our previous studies is that it can be oxidized into RA but also be stored in Sertoli cells in the form of retinyl esters for later use. As retinol is photosensitive, handling and storage were performed in tubes covered with aluminum foil, which protects from direct light exposure.
 
 20) 362 - Start the Results section with a broader statement(s) that prepares the readers rather than jumping into specific experiments; it would be helpful for readers to have concluding sentences included as well for readers to navigate the Results section.
 
 As suggested, the Results section has been reorganized, with a rationale and concluding sentences added to facilitate reading.
 
 21) 364 - KI67 is a marker of.
 
 Ki67 is widely used as a cell proliferation marker.
 
 22) 367 - replace "involved".
 
 “involved” was replaced by “necessary for”.
 
 23) What intensity thresholds were used to define a cell as positive or negative for a given marker? And there seemed to be no mention of controls - especially no primary antibody controls. This is a significant oversight if these were not done in parallel with every single immunostaining experiment.
 
 We did not apply intensity thresholds. Cells presenting detectable labeling were defined as positive, while unlabeled cells were defined as negative.
 
 Negative controls, performed by omitting the primary antibodies, were of course done in parallel to each immunostaining and are presented in Figure 1A, Figure 2J and Figure 5C. The mention of negative controls has been added in the Materials and methods section.
 
 24) 388 - INSL3 - is this referring to mRNA or protein? Protein nomenclature is used...
 
 INSL3 is here referring to the protein, whose concentrations were measured by radioimmunoassay.
 
 25) 402 - typo.
 
 “expect” was replaced by “except”.
 
 26) 409 - do mRNA levels really "determine the testicular steroidogenic potential"??
 
 This sentence has been reworded: “determine the testicular steroidogenic potential” was replaced by “highlight a potential deregulation of their expression”.
 
 27) 410 - western should not be capitalized.
 
 Western Blot was replaced by western blot in the entire manuscript.
 
 28) 405-28 - this reviewer is underwhelmed by qRT-PCR results for a handful of markers - what is the purpose? The results do not prove anything about the function of the system.
 
 As the differentiation of Leydig cells is not fully completed in organotypic cultures, we wanted to know which actors of the steroidogenic pathway show deregulated expression in vitro in comparison to physiological conditions, and thus which steps of the steroid hormone biosynthesis pathway may be impaired. We found that the expression of several genes encoding steroidogenic enzymes was decreased in vitro, notably that of Cyp17a1, necessary for the conversion of progesterone to androstenedione. Transcript levels of Hsd17b2, encoding an enzyme that converts estradiol to estrone and testosterone to androstenedione, were also decreased at D30.
 
 Our data therefore show that the expression of several steroidogenic genes and steroid metabolizing genes is deregulated in organotypic cultures but we agree that these results do not prove anything about the function of the system.
 
 We then found an accumulation of estradiol and progesterone, a decrease in androstenedione and unchanged testosterone levels in cultured tissues. The elevation in progesterone and the reduction in androstenedione in in vitro matured tissues could arise from the reduced expression of Cyp17a1. In addition, reduced Hsd17b2 transcript levels may explain why estradiol levels remain elevated in cultures while testosterone levels are similar to controls and androstenedione levels are low.
 
 29) How do the authors interpret data gleaned from tissues containing a variably-sized necrotic core?
 
 In the present study, the central necrotic area was consistent between all samples and variables: it represented on average 16-27% of the explants.
 
 As in our previous publications and recent RNA-seq analyses (Rondanino et al., 2017; Oblette et al., 2019; Dumont et al., 2023), the central necrotic area was removed so that transcript and protein levels in the healthy part of the samples (i.e. where in vitro spermatogenesis occurs) could be measured and compared with in vivo controls. In order to be able to compare the healthy part of the in vitro matured tissues with in vivo controls, transcript levels were normalized to housekeeping genes (Gapdh and Actb) or to the Leydig cell-specific gene Hsd3b1 while protein levels were normalized to ACTB or to 3β-HSD.
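To make the normalization step concrete, the sketch below shows one common way of expressing a transcript level relative to reference genes. It assumes the widely used 2^-ΔCt calculation with the mean Ct of two reference genes; the response does not state which calculation was used, so this is an assumption. The function name `relative_level` and all Ct values are hypothetical and are not data from the study.

```python
from statistics import mean


def relative_level(ct_target, ct_references):
    """Return 2^-(Ct_target - mean reference Ct), a relative transcript level."""
    delta_ct = ct_target - mean(ct_references)
    return 2.0 ** (-delta_ct)


# Hypothetical Ct values for illustration only (reference genes: Gapdh, Actb).
in_vivo = relative_level(ct_target=24.0, ct_references=[18.0, 20.0])
cultured = relative_level(ct_target=26.0, ct_references=[18.0, 20.0])

# Ratio of the normalized level in cultured tissue to the in vivo control.
ratio = cultured / in_vivo
print(in_vivo, cultured, ratio)
```

With these made-up values, a target Ct that is two cycles higher at identical reference Ct values corresponds to a four-fold lower normalized level. Swapping the reference list for a single Leydig cell-specific gene such as Hsd3b1 would express the level per Leydig cell content rather than per total tissue.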
 
 30) 520 - after reading to this point, this reviewer was left confused and wondering why any of this is important to the reader unless that reader specifically works on this topic. The way the data were presented makes it nearly impossible for the reader to keep any of the data in their mind as they read. It's a seemingly endless list of ups and downs of many things under many conditions. What is the point of all of this? How will it advance our understanding of spermatogenesis? Or improve in vitro culture? Or help prepubertal cancer patients? Presumably, that will be explained in the Discussion, but at this point, this reviewer honestly has no idea what this all means. Why is this important??
 
 We have modified the Results section by including rationale and concluding statements to make it easier to read and follow for all readers, not only for those working on this topic.
 
 As mentioned above, the identification of the molecular mechanisms that are deregulated in vitro will give us important insights for the optimization of the culture system. The development of an optimized model of in vitro spermatogenesis could lead to several applications, including improving our knowledge of the regulation of spermatogenesis during pubertal development.
 
 In this study, our main findings are that the differentiation of the adult Leydig cell lineage, steroid biosynthesis, metabolism and signaling are altered in organotypic cultures, leading to a hyperestrogenic and hypoandrogenic environment. In addition, we show that the presence of an LH homolog, known to be critical to adult Leydig cell differentiation and to stimulate steroidogenesis, does not rescue the expression of adult Leydig cell markers and of several steroidogenic genes, steroid metabolizing genes and steroid target genes. Other factors required for Leydig cell maturation and functionality will have to be tested in the future on cultured testicular tissues. Improvements to this in vitro maturation procedure in animal models may be useful for future cultures of human testicular biopsies, although we are aware that more work needs to be done before prepubertal cancer patients can benefit from this in vitro maturation approach.
 
 31) 619-20 - this sort of summarizes this reviewer's overall opinion of the manuscript. Not much seems to have been learned here that would justify publication in a broad readership journal like eLife. More work needs to be done to provide that sort of meaningful advance. The current work, with considerable re-writing to improve accuracy and clarity, is much better suited to a specialty journal where others who are working on this specific topic will appreciate its value.
 
 We have carefully considered the reviewer’s comments and modified the manuscript to improve accuracy and clarity. We understand the reviewer’s point of view, but we believe that this work may be of interest not only to labs working on fertility preservation and restoration, but also to those working on puberty initiation, germ and somatic cell maturation, steroidogenesis under physiological and pathological conditions, and on the effect of cancer therapies, drugs, chemicals and environmental agents (e.g. endocrine disruptors) on the developing testis.
 
 As mentioned above, we not only uncovered for the first time a failure in adult Leydig cell development, but also an alteration in the expression of several steroidogenic and steroid-metabolizing genes, which could explain the accumulation of progesterone and estradiol and the deficiency of androstenedione in cultured tissues. This hyperestrogenic and hypoandrogenic environment could explain, at least in part, the low efficiency of in vitro spermatogenesis. Furthermore, we show that the addition of hCG (LH homolog) is not sufficient to facilitate Leydig cell differentiation, restore steroidogenesis and improve sperm yield. These data provide valuable information for improving culture conditions. More fundamentally, this culture system could be a useful tool for identifying factors that are essential for the differentiation and functionality of adult Leydig cells during puberty initiation.
 
 32) Why are the figures repeated at the end of the manuscript?
 
 During the submission process, our bioRxiv preprint (which contains the figures) was merged with higher-quality versions of the same figures.
 
 Reviewer #2 (Public Review):
 
 Preserving and restoring the fertility of prepubertal patients undergoing gonadotoxic treatments involves freezing testicular fragments and waking them up in a culture in the context of medically assisted procreation. This implies that spermatogenesis must be fully reproduced ex vivo. The parameters of this type of culture must be validated using non-human models. In this article, the authors make an extensive study of the quality of the organotypic culture of neonatal mouse testes, paying particular attention to the differentiation and endocrine function of Leydig cells. They show that fetal Leydig cells present at the start of culture fail to complete the differentiation process into adult Leydig cells, which has an impact on the nature of the steroids produced and even on the signaling of these hormones.
 
 The authors make an extensive study of the different populations of Leydig cells which are supposed to succeed each other during the first month of life of the mouse to end up with a population of adult and fully functional cells. The authors combine quantitative in situ studies with more global analyses (RT-qPCR, western blot, hormonal assays), which range from gene to hormone. This study is well written and illustrated, the description of the methods is honest, the analyses are systematic, and they are accompanied by multiple relevant control conditions.
 
 Since the aim of the study was to study Leydig cell differentiation in neonatal mouse testis cultures, the study is well conceived, the results answer the initial question and are not over-interpreted.
 
 My main concern is to understand why the authors have undertaken so much work when they mention, for RNA extractions and western blot, that the necrotic central part had to be carefully removed. There is no information on how this parameter was considered for immunohistochemistry and steroid measurements. The authors describe the initial material as a quarter testis, but they don't mention the resulting size of the fragment. A brief review of the literature shows that although the culture medium is often crucial for the quality of the culture (and in particular the supplementations as discussed by the authors here), the size of the fragments is also a determining factor, especially for long cultures. The main limitation of the study is therefore that the authors cannot exclude that central necrosis can have harmful effects on the survival and/or the growth and/or the differentiation of the testis in culture. In this sense, the general interpretation that the authors make of their work is correct, the culture conditions are not optimized.
 
 When using the organotypic culture system at a gas-liquid interface, the central part of the testicular tissue becomes necrotic. As previously reported (Komeya et al., 2016), the central region receives insufficient nutrients and oxygen. In vitro spermatogenesis therefore only occurs in the seminiferous tubules present in the peripheral region. As in our previous publications and recent RNA-seq analyses (Rondanino et al., 2017; Oblette et al., 2019; Dumont et al., 2023), the central necrotic area was removed so that transcript and protein levels in the healthy part of the samples (i.e. where in vitro spermatogenesis occurs) could be measured and compared with in vivo controls. For histological and immunohistochemical analyses, only seminiferous tubules located at the periphery of the cultured fragments (outside of the necrotic region) were analyzed. Steroid measurements were performed on the entire fragments.
 
 The initial material was indeed a quarter testis, which represents approximately 0.75 mm³. No growth of the fragments was observed during the organotypic culture period (Figure 8-figure supplement 1). We agree with the reviewer that the composition of the culture medium is not the only parameter to be considered for the quality of the culture and that the size of the fragments is also a determining factor. We previously determined that 0.75 mm³ was the most appropriate size for mouse in vitro spermatogenesis (Dumont et al., 2016). We do not exclude at all that central necrosis can have harmful effects on the survival and/or the growth and/or the differentiation of the testis in culture. Optimization of the culture medium and culture design (so that the tissue center receives sufficient nutrients and oxygen) will be necessary to increase the yield of in vitro spermatogenesis.
 
 Organotypic culture is currently trying to cross the doors of academic research laboratories to become a clinical tool, but it requires many adjustments and many quality controls. This study shows a perfect example of the pitfall often associated with this approach. The road is still long, but every piece of information is useful.
 
 Reviewer #3 (Public Review):
 
 Moutard, Laura, et al. investigated the gene expression and functional aspects of Leydig cells in a cryopreservation/long-term culture system. The authors found that critical genetic markers for Leydig cells were diminished when compared to the in-vivo testis. The testis also showed less androgen production and androgen responsiveness. Although they did not produce normal testosterone concentrations in basal media conditions, the cultured testis still remained highly responsive to gonadotrophin exposure, exhibiting a large increase in androgen production. Even after the hCG-dependent increase in testosterone, genetic markers of Leydig cells remained low, which means there is still a missing factor in the culture media that facilitates proper Leydig cell differentiation. Optimizing this testis culture protocol to help maintain proper Leydig cell differentiation could be useful for future human testis biopsy cultures, which will help preserve fertility in childhood cancer patients.
 
 Methods: In line 226, there is mention that the central necrotic area was carefully removed before RNA extraction. This is particularly problematic for the inference of these results, especially for the RT-qPCR data. Was the central necrotic area consistent between all samples and variables (16 and 30FT)? How big was the area? This makes the in-vivo testis not a proper control for all comparisons. Leydig cells are not evenly distributed throughout the testis. A lot of Leydig cells can be found toward the center of the gonad, so the results might be driven by the loss of this region of the testis.
 
 When using the organotypic culture system at a gas-liquid interface, the central part of the testicular tissue becomes necrotic. As previously reported (Komeya et al., 2016), the central region receives insufficient nutrients and oxygen. In vitro spermatogenesis therefore only occurs in the seminiferous tubules present in the peripheral region. As in our previous publications and recent RNA-seq analyses (Rondanino et al., 2017; Oblette et al., 2019; Dumont et al., 2023), the central necrotic area was removed so that transcript levels in the healthy part of the samples (i.e. where in vitro spermatogenesis occurs) could be measured and compared with in vivo controls. In order to be able to compare the healthy part of the in vitro matured tissues with in vivo controls, transcript levels of the selected genes were normalized to housekeeping genes (Gapdh and Actb) or to the Leydig cell-specific gene Hsd3b1.
 
 The central necrotic area was consistent between all samples and variables: it represents on average 16-27% of the explants.
 
 Moreover, we would like to point out that the gonads were cut into four fragments before in vitro cultures. It is therefore the central part of the cultured explants that was removed and not the central part of the gonads. The central part of the gonads was thus included in our analyses.
 
 What did the morphology of the testis look like after culturing for 16 and 30 days? These images will help confirm that the culturing method is like the Nature paper Sato et al. 2011 and also give a sense of how big the necrotic region was and how it varied with culturing time.
 
 Images showing mouse testicular tissues cultured for 16 and 30 days are presented in Figure 8-figure supplement 1. The cultured tissues resemble those shown by Sato et al., 2011. As mentioned above, the central necrotic area represents on average 16-27% of the explants. No significant difference in the area of the necrotic region was found between the two culture time points.
 
 There are multiple comparisons being made. Bonferroni corrections on p-value should be done.
 
 Bonferroni corrections are used when multiple comparisons are conducted. As mentioned in the Materials and methods section, multiple comparisons were not made in this study. Indeed, the non-parametric Mann-Whitney test was used to compare two conditions: in vitro vs in vivo (D16 FT vs 22 dpp, D16 CSF vs 22 dpp, D30 FT vs 36 dpp, D30 CSF vs 36 dpp, D30 FT + hCG vs 36 dpp, D30 CSF + hCG vs 36 dpp), cultures of fresh vs frozen tissues (6 dpp vs 6 dpp CSF, D16 FT vs D16 CSF, D30 FT vs D30 CSF, D30 FT + hCG vs D30 CSF + hCG) and cultures with vs without hCG (D30 FT + hCG vs D30 FT, D30 CSF + hCG vs D30 CSF). These comparisons were added in the Materials and methods section.
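 
 As a hedged illustration of the two procedures discussed in this exchange (not the authors' code, and using made-up values), the Python sketch below computes an exact two-sided Mann-Whitney p-value for two small groups and shows what a Bonferroni adjustment does to a hypothetical family of p-values. The function names, the sample values and the family size are all assumptions made for illustration.
 
```python
from itertools import combinations


def mann_whitney_u(x, y):
    """U statistic for x: pairs (xi, yj) with xi > yj count 1, ties count 0.5."""
    return sum(
        1.0 if xi > yj else 0.5 if xi == yj else 0.0
        for xi in x
        for yj in y
    )


def exact_two_sided_p(x, y):
    """Exact two-sided p-value from enumerating every relabelling of the pooled data.

    Only practical for small groups, since the number of relabellings
    grows as C(len(x) + len(y), len(x)).
    """
    pooled = list(x) + list(y)
    n, total_n = len(x), len(x) + len(y)
    centre = n * len(y) / 2  # expected U under the null hypothesis
    observed = abs(mann_whitney_u(x, y) - centre)
    extreme = total = 0
    for idx in combinations(range(total_n), n):
        chosen = set(idx)
        group_x = [pooled[i] for i in idx]
        group_y = [pooled[i] for i in range(total_n) if i not in chosen]
        total += 1
        # Count relabellings at least as far from the centre as the observed one.
        if abs(mann_whitney_u(group_x, group_y) - centre) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def bonferroni(p_values):
    """Multiply each p-value by the family size and cap the result at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Hypothetical measurements (arbitrary units), not data from the study.
in_vitro = [1.0, 2.0, 3.0, 4.0]
in_vivo = [5.0, 6.0, 7.0, 8.0]
p = exact_two_sided_p(in_vitro, in_vivo)

# Hypothetical family of three raw p-values.
adjusted = bonferroni([0.01, 0.04, 0.5])
```
 
 The sketch only shows the mechanics: each raw p-value is multiplied by the number of tests treated as one family and capped at 1. Which tests belong to the same family is the judgment call debated in this exchange.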
 
 Results: In the discussion, it is mentioned that IGF1 may be a missing factor in the media that could help Leydig cell differentiation. Have the authors tried this experiment? Improving this existing culturing method will be highly valuable.
 
 The decreased Igf1 mRNA levels found in the present study are in line with the RNA-seq data of Yao et al., 2017. As mentioned in the Discussion section, the addition of IGF1 in the culture medium led to a modest increase in the percentages of round and elongated spermatids in cultured mouse testicular fragments (Yao et al., 2017). However, the effect of IGF1 supplementation on Leydig cell differentiation was not investigated. The supplementation of organotypic culture medium with IGF1 is currently being tested in our research team.
 
 Add p-values and SEM for qPCR data. This was done for hormones, should be the same way for other results.
 
 p-values and SEM are shown for both qPCR and hormone data.
 
 Regarding all RT-qPCR data: there is a switch between 3bHSD and Actb/Gapdh as housekeeping genes. There does not seem to be consistency, as some have 3bHSD and others do not. Why do Igf1 and Dhh not use 3bHSD for housekeeping? If this is the method to be used, then 3bHSD should be used as housekeeping for the protein data, instead of ACTB. Also, based on Figure 1B and Figure 2A (Hsd3b1) there does not seem to be a strong correlation between Leydig cell number and the gene expression of Hsd3b1. If Hsd3b1 is to be used as a housekeeper and a proxy for Leydig cell number, a correlation between these two measurements is necessary. If there is no correlation, a housekeeping gene that is stable among all samples should be used. Sorting Leydig cells and then conducting qPCR would be optimal for these experiments.
 
 Hsd3b1 was used as a housekeeping gene only to normalize the mRNA levels of Leydig cell-specific genes. Therefore, Igf1 and Dhh transcript levels were not normalized with Hsd3b1 since Igf1 is expressed by several cell types in the testis (Leydig cells, Sertoli cells, peritubular myoid cells) and Dhh is expressed by Sertoli cells.
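 
 The normalization choice described here can be sketched in Python, assuming the widely used 2^-ΔCt calculation. The response does not state which formula was applied, and the Ct values and the function name below are hypothetical.
 
```python
def relative_expression(ct_target, ct_references):
    """Return 2 ** -(Ct_target - mean reference Ct).

    Averaging reference Ct values is equivalent to taking the geometric
    mean of the reference quantities.
    """
    if not ct_references:
        raise ValueError("at least one reference Ct value is required")
    mean_reference_ct = sum(ct_references) / len(ct_references)
    return 2.0 ** -(ct_target - mean_reference_ct)


# Hypothetical Ct values, not measurements from the study.
# A Leydig cell-specific target normalized to Hsd3b1 only.
cyp17a1_vs_hsd3b1 = relative_expression(25.0, [23.0])
# A broadly expressed target normalized to Gapdh and Actb.
igf1_vs_housekeeping = relative_expression(25.0, [20.0, 22.0])
```
 
 Under this assumption, dividing by a Leydig cell-specific reference expresses a transcript per unit of Leydig cell signal, whereas general housekeeping genes express it per unit of total tissue RNA.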
 
 Regarding western blots, the expression of AR, CYP19 and FAAH could not be normalized with 3β-HSD since AR is expressed by Leydig cells, Sertoli cells and peritubular myoid cells, CYP19 is expressed by Leydig cells and germ cells and FAAH is expressed by Sertoli cells. For CYP17A1 however, 3β-HSD was used as housekeeping instead of ACTB (Figure 2G).
 
 No correlation was found between the number of Leydig cells per cm² of testicular tissue shown in Figure 1 and Hsd3b1 mRNA levels presented in Figure 2. However, this result was expected since on the one hand the number of Leydig cells per cm² was determined in the peripheral region of one tissue section whereas on the other hand Hsd3b1 transcript levels were measured in the entire peripheral region of the cultured fragments. The correction factor used for the analysis of genes expressed in Leydig cells present in the healthy part of the cultured tissues was therefore the Leydig cell selective marker Hsd3b1, as previously described (Cacciola et al., 2013).
 
 Figure 2A (CYP17a1): It is surprising that the CYP17a1 gene and protein expression is very different between D30FT and 36.5dpp, however, the immunostaining looks identical between all groups. Why is this? A lower magnification image of the testis might make it easier to see the differences in Cyp17a1 expression. Leydig cells commonly have autofluorescence and need a background quencher (TrueBlack) to visualize the true signal in Leydig cells. This might reveal the true differences in Cyp17a1.
 
 RT-qPCR and western blot analyses show that both Cyp17a1 mRNA levels and CYP17A1 protein levels are decreased in organotypic cultures at D30. However, we agree that such a decrease is not visible in immunostaining. No autofluorescence of Leydig cells could be observed in the negative controls (Figure 2J).
 
 Figure 3D: there are large differences in estradiol concentration in the testis. Could it be that the testis is becoming more female-like? Leydig and Sertoli cells with more granulosa and theca cell features? Were any female markers investigated?
 
 We show in the present study that the expression levels of the Sertoli cell-specific gene Dhh are not reduced in organotypic cultures. We also previously found that the expression levels of the Sertoli cell-specific gene Amh were not reduced in in vitro matured testicular tissues (Rondanino et al., 2017). Moreover, we have recently shown that Sox9, encoding a testis-specific transcription factor, is expressed in organotypic cultures (Dumont et al., 2023). Our recent transcriptomic analysis also revealed that the transcript levels of the pro-male sexual differentiation marker Sry and of the Sertoli cell-specific gene Dmrt1 remained unchanged in organotypic cultures compared to in vivo controls (Dumont et al., 2023). In addition, no increase in the mRNA levels of the female sex-determining genes Foxl2 and Rspo1 was found in vitro (Dumont et al., 2023). However, we cannot rule out that in vitro cultured testes are becoming more female-like as the expression of Hsd17b3, encoding an androgenic enzyme, is reduced (this study) while the expression of the feminizing gene Wnt4 is upregulated (Dumont et al., 2023).
 
 Figure 3D and Figure 5A: It is hard to imagine that intratesticular estradiol is maintained for 16-30 days without sufficient CYP19 activity or substrate (testosterone). 6.5 dpp was the last day with abundant CYP19 expression, so is most of the estrogen synthesized on this first day and it sticks around? Are there differences in estradiol metabolizing enzymes? Is there an alternative mechanism for E production?
 
 In the present study, abundant CYP19 expression was indeed found at 6 dpp. However, the expression of this enzyme was not measured between 6 dpp and D16. Therefore, we cannot be sure that 6 dpp is the last day with abundant CYP19 expression. We assume that the estradiol synthesized before D16 may then accumulate within the cultured tissues. In our study, we quantified the transcript levels of Sult1e1, encoding an estradiol metabolizing enzyme. SULT1E1 is thought to play a physiological role in protecting Leydig cells from estrogen-induced biochemical lesions (Tong et al., 2004). A reduction in Sult1e1 mRNA levels was found at D30 in comparison to in vivo controls, but this may occur earlier during organotypic culture. In addition, decreased transcript levels of Hsd17b2, which encodes an estrogen metabolizing enzyme that converts estradiol to estrone, were found at D30 in this study. We suggest in the Discussion section that elevated estradiol levels in cultured tissues could be a consequence of low Sult1e1 and Hsd17b2 expression. Our recent transcriptomic analyses show that the levels of Cyp1a1, Cyp1b1 and Comt, encoding other estrogen metabolizing enzymes, are unchanged in organotypic cultures (Dumont et al., 2023). To our knowledge, there is no alternative mechanism for estradiol production.
 
 Recommendations For The Authors:
 
 1) The acronyms, PLC, SLC, ILC, ALC, and FLC, become hard to follow. It is recommended to spell out the names.
 
 PLC was replaced by progenitor Leydig cells, SLC by stem Leydig cells, ILC by immature Leydig cells, ALC by adult Leydig cells and FLC by fetal Leydig cells in the entire manuscript.
 
 2) All Figures: Use letters for each bar graph. Difficult to make a connection from text to figure.
 
 A letter was added to each bar graph.
 
 3) Supplemental figure 1: Change "Changement du milieu" to English.
 
 These words were replaced by “Medium change”.
 
 4) Catalog numbers for antibodies are necessary.
 
 The catalog numbers of the antibodies used in this study are presented in Supplementary Table 1.
 
 AuthorResponse
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.11.18.517042v3
www.biorxiv.org www.biorxiv.org

New submission 20/08/2023, 18:08:06

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review)
  
  The authors present a scRNAseq study describing the transcriptomes of the tendon enthesis during postnatal development. This is an important topic that has major implication for the care of common clinical problems such as rotator cuff repair. The results are a valuable addition to the literature, providing a descriptive data set reinforcing other, more comprehensive studies. There are weaknesses, however, in the scRNAseq analyses.
  
  1) The authors should provide additional rationale for the PCA analysis shown in Fig 1d. It is uncommon to use PCA for histomorphologic parameters. These results do not convincingly demonstrate that P7 is a critical developmental timepoint.
  
  2) According to the methods, it appears that the entire humeral head-supraspinatus tendon was used for cell isolation for scRNAseq. This results in the inclusion of cells from a variety of tissues, including bone, growth plate, enthesis and tendon. As such, only a very small percentage of cells in the analysis came from the enthesis. Inclusion of such a wide range of cells makes interpretation of enthesis cells difficult.
  
  3) The differentiation/pseudotime analysis described in Fig 3 is difficult to follow. This map includes cell transcriptomes from vastly different tissues. Presumably, embedded in these maps are trajectories for osteoblast differentiation, chondrocyte differentiation, tenocyte differentiation, etc. With so many layers of overlapping information, it is difficult to (algorithmically) deduce a differentiation path of a particular cell type.
  
  4) The authors use the term function throughout the paper (e.g., functional definition of fibrocartilage subpopulations). However, this is a descriptive scRNAseq study, and function can therefore only theoretically be inferred from the algorithms used to analyze the data. A functional role for any of the identified pathways or processes can only be defined with gain- and/or loss-of-function studies.
  
  5) C2 highly expressed biomineralization-related genes (Clec3a, Tnn, Acan). The three example genes are not related to biomineralization.
  
  6) The functional characterization of the three enthesis cell clusters is not convincing. For example, activation of metabolism-related processes can mean a lot of things (including changes in differentiation), yet the authors interpret it very specifically as role in postnatal fibrochondrocyte formation and growth.
  
  7) The pseudotime analysis of the enthesis cell clusters is not convincing. The three clusters are quite close and overlapping on the UMAP. Furthermore, the authors focus on Tnn as a novel and unique gene, yet the expression pattern shown in Fig 5g implies even expression of this gene across all three clusters.
  
  8) The TC1 markers (Ly6a, Dlk3, Clec3b) imply a non-tendon-specific cell population. Perhaps a tendon progenitor pool or an endothelial cell phenotype is more appropriate.
  
  9) Pseudotime analyses assume that your data set includes cells from progenitor through mature cell populations. It is unclear that the timepoints studied here included cells from early progenitor states.
  
  10) The CellChat analysis is difficult to follow, as the authors included 18 cell types. The number of possible interactions among so many cell types is enormous, and deducing valid connections between any two cell types in this case should be justified. Is the algorithm robust to so many possible interactions?
  
  Thank you very much for your comments and suggestions. Following your suggestions, we carefully revised the paper. We integrated our dataset with the open-source GSE182997 datasets and re-performed the downstream analysis. In addition, we added immunofluorescence tests to validate the results obtained from the single-cell datasets. We hope that all the issues raised about the prior version are now well addressed.
  
  Reviewer #2 (Public Review)
  
  To reveal cellular and molecular heterogeneity in the enthesis, the authors established a single-cell temporal atlas during development. This study provides a transcriptional resource for further investigation of fibrocartilage development.
  
  Thank you very much for your kind suggestions. Following your suggestions, we integrated our dataset with the open-source GSE182997 datasets and re-performed the downstream analysis. In addition, we added immunofluorescence tests to validate the results obtained from the single-cell datasets. We hope that the issues raised about the prior version are now well addressed.
  
  AuthorResponse
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.02.526768v1
www.biorxiv.org www.biorxiv.org

New submission 20/08/2023, 17:49:05

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  The authors present findings on a designed peptide, PITCR, and its role in inhibiting TCR activation through an extensive series of experiments. These include the measurement of phosphorylation in the TCR zeta chain and a number of associated signaling proteins such as Zap70, LAT, PLCg1, and SLP76. In addition, the authors measure the impact of PITCR on the TCR intracellular calcium response and examine the peptide-induced inhibition of TCR activation by antigen-presenting cells. They also present data indicating that the fluorescently labeled PITCR co-localizes with TCR in Jurkat cells and with ligand-bound TCR in primary murine cells. Overall the experiments provide useful insights into the mechanism of T cell activation and generally support an allosteric model of activation, while not necessarily excluding alternative models.
  
  However, some aspects of the study do need clarification.
  
  1) The authors do not provide a clear structural basis for their peptide design, which makes it difficult to understand the rationale for choosing this particular peptide. The use of a structural model based on the TCR zeta domain, for example, and how it becomes modified to generate PITCR would provide some clarity on what types of putative interactions are being engineered.
  
  We thank the reviewer for giving us a chance to elaborate. We have expanded the results section to provide more information on the peptide design, where we now point out that the acidic residues in the TCR TM allow peptide design. We have also applied the artificial intelligence program AlphaFold-Multimer (AlFoM) to generate a structural model of the docking site of PITCR in the TCR (Figure 9), which informs on new mechanistic insights, as we describe in the updated results section and discuss below.
  
  2) The inhibitory effects of PITCR are not large. Measurement of dose dependence might improve confidence in the results.
  
  As the reviewer points out, we have performed an extensive set of experiments to assess the inhibitory effect of PITCR. We have demonstrated that PITCR inhibits TCR phosphorylation. We have also tested all proximal signaling proteins: Zap70, LAT, SLP76, and PLC gamma. Critically, in all cases a statistically significant inhibition is observed. Furthermore, inhibition was additionally seen when TCR was activated by peptide presentation in antigen-presenting cells. Interaction between PITCR and the receptor is supported by co-localization, co-IP and the new AlphaFold-Multimer prediction. We are therefore confident in the results presented and that the inhibitory effect indeed exists. As we responded to reviewer 1 above, we discuss that inconsistent results were obtained with lower PITCR concentrations, suggesting that the use of a high peptide concentration is required for robust inhibition.
  
  3) Use of control peptides is not uniform. Control peptides similar to PITCR in Figure 1 and Figure 2 studies, for example, could strengthen the authors' arguments.
  
  The original version of the manuscript contained two negative control peptides, the G41P mutant of PITCR, and pHLIP, another pH-responsive peptide which behaves as a conditional transmembrane peptide. However, for feasibility reasons we did not use all the negative controls in all different experiments, as we were satisfied when a negative control peptide acted as such in an experiment. However, because we agree that increased use of negative control peptides will strengthen the manuscript, we have expanded the use of negative control peptides. Specifically, the updated version of the manuscript contains a new section where AlFoM is used to predict the binding pose of PITCR and the structural consequences of interaction (see Figure 9 and the four new supplementary figures). AlFoM showed that PITCR binds with a large interaction interface, and peptide binding causes a large rearrangement of the two zeta chains in TCR. Importantly, neither of the two original negative control peptides (PITCRG41P or pHLIP) impacts the zeta chains. When we used a new negative control, the conditional transmembrane peptide TYPE7 developed by us, AlFoM did not predict it to bind to TCR, as expected, strengthening our argument.
  
  Reviewer #3 (Public Review):
  
  The use of pH-responsive TM-targeting peptides, which the authors previously developed, is a novel aspect of this study. Those peptides can be quite powerful for understanding molecular mechanisms of receptor signaling, such as the allosteric activation model as tested in this study. The manuscript contains several interesting approaches and observations, but there are concerns about the experimental design and interpretation of the results. More importantly, the authors' primary conclusion that the allosteric changes in the TM bundles determine TCR activation is not fully supported by the data presented. For example:
  
  1) The authors provided confocal fluorescence images showing the colocalization of fluorescently labeled peptides and TCR subunits. Based on the data, they concluded that "PITCR is able to bind to TCR". This is misleading, because given the spatial resolution of the imaging technique, "colocalization" does not indicate binding or interaction between molecules. Because the peptide binding to the TM region is the pillar of the primary finding of this study, direct evidence supporting the peptide-TM binding or interaction is essential.
  
  We have to disagree that our statement is misleading: the section of the manuscript that the reviewer referred to said “suggesting that PITCR is able to bind to TCR before it is activated by OKT3”. Therefore, we were not drawing a conclusion, merely making a suggestion that we consider justified, particularly as it is supported by data presented later. Nevertheless, we certainly agree with the reviewer that co-localization experiments fundamentally cannot indicate binding. We have modified the results (page 11) to follow the suggestion of the reviewer and indicate that co-localization data are not proof of interaction. In addition, we provide new AlphaFold-Multimer data, which support that transmembrane binding indeed occurs.
  
  2) In calcium response experiments, the authors compared calcium influx (indicated by Indo-1 ratio) under different cell activation conditions (Figure 2). There are some concerns about how the authors interpreted the data: (1) The calcium plots from OKT3 activation in A-C panels are inconsistent. The plot in (A) showed a calcium peak after activation, which is not present in the plots shown in (B) and (C). There is no explanation or discussion on this inconsistency. (2) What is more concerning is that this prominent calcium peak in (A) was used to draw the conclusion that the designer peptide inhibitor effectively reduces calcium response. However, inconsistent with that conclusion, the calcium plots are indistinguishable for the three conditions: with PITCR (peptide inhibitor), with PITCRG41P (negative control that should not affect TCR activation), or no peptide. All three plots have similar magnitude and fluctuations. This does not support the authors' conclusion that the PITCR (peptide inhibitor) reduces calcium response in T cells.
  
  We thank the reviewer for this comment. We have updated figure 3, which now contains a different replicate of the calcium assay that we think is more straightforward to analyze and that more clearly shows the calcium inhibition, as quantified in panel D of the figure.
  
  3) Different types of T cells were used for separate measurements: E6-1 Jurkat T cells were used for calcium influx experiments, J. OT.hCD8+ Jurkat cells were used for CD69 measurements, and primary murine CD4+ T cells were used for colocalization imaging experiments. Rationales for the choices of cells in different measurements are also unclear. This is different from the common practice where different cell types are used in repeated experiments to test the generality of a finding. Here, they were used for different experiments, and findings were lumped together as "T cells", without further evidence/discussion on how translatable the findings from different cell types are.
  
  As the reviewer suggests, we have updated the manuscript to include discussion on the particularities of the use of the different T cells on pages 18 and 19. We envisioned this work as a proof of principle for the design of a peptide that can eventually be modified to be used for pre-clinical applications, and this paper is a first step. With this idea in mind, we wanted to test if this peptide can work in different types of TCR since: (1) TCR populations are diverse; and (2) our design is based on the transmembrane domain of the CD3zeta chain, which is largely conserved among species. Using different types of T cells met this goal since they have different types of TCR, but the transmembrane domain of CD3zeta is conserved. In our paper, we used human Jurkat-TCR, OT1-TCR coupled with hCD8, and murine CD4-TCR. In addition, we used not one but three activation markers to test the peptide’s inhibitory effect: phosphorylation, calcium influx, and CD69 activation. For the co-localization experiment, we used not only murine CD4 T cells but also Jurkat T cells with/without OKT3 stimulation.
  
  We selected these T cells because they were particularly suited for the breadth of different measurements that this manuscript contains, based on published reports. In our opinion this approach broadens the relevance of the work.
  
  4) The authors set out to test the model that TCR activation by pMHC occurs through allosteric changes in the TM region, but in most experiments, they activated Jurkat T cells by anti-CD3 antibody, not by antigen peptides. The anti-CD3 antibody activates TCR signaling through clustering. It is unclear whether TCR activation by anti-CD3 leads to the same allosteric changes in the TM region as activation by pMHC. As such, the main claim of the paper, namely that the designer peptide affects TCR signaling by disrupting the allosteric changes in the TM region, remains insufficiently supported by the data presented.
  
  Figure 8 shows that the levels of co-IP in the presence of detergent are altered by OKT3 activation of TCR. It has recently been established (PMID: 34260912) that this assay allows the investigation of allosteric changes that contribute to activation of TCR. This evidence is supportive of allosterism in TCR activation. Additionally, the TCR proximal signaling is conserved between the Jurkat T cells activated by OKT3 and TCR activated by pMHC. We can reasonably argue that the peptide acts similarly in both conditions, since the peptide also exerts an inhibitory effect in T cells activated by antigen-presenting cells (Figure 4). The newly presented AlFoM model (Figure 9) predicts that PITCR binding displaces a zeta chain in TCR. This new result provides a plausible molecular rationale for the results in Figure 8, where we observe that PITCR changes transmembrane compactness, which has been linked to allosteric activation (Lanz et al., 2021; Prakaash et al., 2021).
  
  AuthorResponse
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.08.19.503518v2
www.biorxiv.org www.biorxiv.org

New submission 20/08/2023, 17:20:59

1
1. Public_Reviews 21 Aug 2023
 
 in eLife
 
 Author Response
 
 eLife assessment
 
 This useful paper examines changes (or lack thereof) in birds' fear response to humans as a result of COVID-19 lockdowns. The evidence supporting the primary conclusion is currently inadequate, because the model used does not properly account for many potentially confounding factors that could influence the study's outcomes. If the analytic approach were improved, the findings would be of interest to urban ecologists, behavioral biologists and ecologists, and researchers interested in understanding the effects of COVID-19 lockdowns on animals.
 
 Many thanks for these supportive words. We did our best to improve our manuscript according to the reviewers’ and editor’s comments. Importantly, we regret being unclear in the Methods, as our models already controlled for most of the confounds (see below) discussed by the reviewers.
 
 For example, given that a single observer collected the data at most sites, site as a random intercept in the models controls also for the observer effects (which is one of the reasons why site is in the model). We added details to Methods (L352-356, see also “Statistical analyses” in the main text).
 
 The first reviewer asked us to use “some measure of urbanity (e.g. Human Footprint Index) that varies across the cities included here”. Our main results are now based on country-specific models and hence, the use of a single value predictor for each city is not appropriate. Please, see also below.
 
 The second reviewer is concerned about multicollinearity in our models because of the 0.95 correlation between Period and Stringency Index. However, these are key predictor variables of interest that were never used as predictors within the same model. We now clearly explain this in the Methods (L458-538, 548-550) and in the legend of Figure S2.
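
The size of the problem that would arise if both predictors did share a model can be illustrated with the variance inflation factor (VIF). For exactly two predictors, VIF = 1 / (1 - r^2). The sketch below is a minimal illustration, not part of the authors' analysis; the function name is hypothetical, and only the 0.95 correlation comes from the response.

```python
# Variance inflation factor (VIF) for a model with exactly two predictors.
# With two predictors, the R^2 from regressing one on the other equals r^2,
# so both predictors share the same VIF = 1 / (1 - r^2).

def vif_two_predictors(r: float) -> float:
    """Return the VIF shared by two predictors whose correlation is r."""
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    return 1.0 / (1.0 - r * r)

# Correlation between Period and Stringency Index quoted in the response.
r_period_stringency = 0.95
vif = vif_two_predictors(r_period_stringency)
print(round(vif, 2))  # prints 10.26
```

A VIF of this size exceeds the commonly cited rule-of-thumb thresholds of 5 or 10, which is why fitting the two predictors in separate models sidesteps the concern.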
 
 The third reviewer suggested that our models would benefit from controlling for day in the species-specific breeding cycle. Although we don’t have precise city-specific information on the timing of breeding stages in the sampled populations of birds, we partly control for these effects by including a random intercept of day within each year and species. This random factor explained most of the variance (see Table S1-S2) – something that could have been expected. In other words, we do control for what the third reviewer asked for. Similarly, we account for habitat features that may influence escape distance by including site in the models. Site usually refers to a specific park (we assume that within-park heterogeneity is lower than between park variation) and hence partly addresses the reviewer’s concern. Again, we highlight this within the Methods (L466-476).
 
 Reviewer #1 (Public Review):
 
 This paper uses a series of flight initiation "challenges" conducted both prior to and during COVID-19-related restrictions on human movement to estimate the degree to which avian escape responses to humans changed during the "anthropause". This technique is suitable for understanding avian behavioral responses with a high degree of repeatability. The study collects an impressive dataset over multiple years across five cities on two continents. Overall the study finds no effect of lockdown on avian escape distance (the distance at which the "target" individual flees the approaching observer). The study considers the variable of interest as both binary (during lockdown or prior to lockdown) and continuous, using the Oxford Stringency Index (with neither apparently affecting escape distance). Overall this paper presents interesting results which may suggest that behavioral responses to humans are rather inflexible over "short" (~2 year) timespans. The anthropause represents a unique opportunity to disentangle the mechanistic drivers of myriad hypothesized impacts humans have on the behavior, distribution, and abundance of animals. Indeed, this finding would provide important context to the larger body of literature aimed at these ends.
 
 Thank you very much for your positive feedback.
 
 However, the paper could do more to carefully fit this finding into the broader literature and, in so doing, be a bit more careful about the conclusions they are able to draw given the study design and the measures used. Taking some of these points (in no particular order):
 
 Thank you. We did our best in addressing your comments (see below and updated Methods, Results and Discussion sections).
 
 1) Oxford Stringency Index is a useful measure of governmental responses to the pandemic and it's true that in some scenarios (including the (Geng et al. 2021) study cited by this paper) it can correlate with human mobility. However, it is far from a direct measure of human mobility (even in the Geng study, to my reading, the index only explained a minority of the variation). Moreover, particular sub-components of the index are wholly unrelated to human mobility (e.g. would changes to a country's public information campaign lead to concomitant changes in urban human mobility?). Finally, compliance with government restrictions can vary geographically and over time (i.e. we might expect lower compliance in 2021 than in 2020) and the index is calculated at the scale of entire countries and may not be very reflective of local conditions. Overall this paper could do more to address the potential shortcomings of the Oxford Stringency Index as a measure of human mobility including attempting to validate the effect on human mobility using other datasets (e.g. the Google dataset and/or those discussed in Noi et al. 2022). This is of critical importance since the fundamental logic of the experimental design relies on the assumption that stringency ~ mobility.
 
 Thank you for this comment. First, the Oxford Stringency Index seemed to us the best available index for our purposes, i.e. to estimate people's mobility during the shutdown, because restrictions surely influenced the possibility that people would be outside, and because the index is a country-specific estimate. In addition, we have now checked all indices mentioned in Noi et al. 2022 and found only the Google Mobility Reports useful. We now use them because they (a) are publicly available, (b) are also available for territories outside the US, and (c) provide data for each city included in our dataset as well as for urban parks, where most of our data were collected. Note that some platforms are no longer providing their mobility data (e.g. Apple).
 
 However, Google Mobility provides day-to-day variation in human mobility, whereas we are interested in overall increase/decrease in human mobility. Nevertheless, we correlated the Google mobility index with the Stringency index and found that human mobility generally decreases with the strength of the anti-pandemic measures adopted in sampled countries (albeit the effect for some countries, e.g. Poland, is small; Fig. 5).
 
 Moreover, we also added an analysis using the # of humans recorded directly in the field during escape trials (e.g. Fig. 6 and S6) and found that the link between # of humans and Stringency Index or Google Mobility was weak and noisy, with 95% CIs widely crossing zero (Fig. 6).
 
 Importantly, if we use Google Mobility and # of humans, respectively, as predictors of escape distance, the results are qualitatively very similar to results based on Oxford Stringency Index (Fig. S6), or Period, with tiny effect sizes for both (95%CIs for Google Mobility -0.3 – 0.06, Table S5, for # of humans -0.12 – 0.02, Table S6) supporting our previous conclusions.
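
The reasoning behind "tiny effect sizes" can be made concrete by checking whether each reported interval contains zero. The snippet below is only a sketch: the helper name is hypothetical, and the interval bounds are the ones quoted in the paragraph above (Tables S5 and S6).

```python
# Check whether reported 95% confidence intervals include zero.
# An interval that includes zero is compatible with no effect.

def interval_includes_zero(lower: float, upper: float) -> bool:
    """Return True when the closed interval [lower, upper] contains 0."""
    return lower <= 0.0 <= upper

# Bounds quoted in the response for the two alternative predictors.
reported_intervals = {
    "Google Mobility": (-0.3, 0.06),
    "Number of humans": (-0.12, 0.02),
}

results = {
    name: interval_includes_zero(lower, upper)
    for name, (lower, upper) in reported_intervals.items()
}

for name, includes_zero in results.items():
    print(f"{name}: includes zero = {includes_zero}")
```

Both intervals include zero, which is consistent with the statement that neither predictor shows a clear effect on escape distance.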
 
 Note that Google Mobility and the number of humans have their limitations (see our comment to the editor and the Methods section in the main manuscript, e.g. L418-433). The lack of Google Mobility data for years before the COVID-19 pandemic does not allow us to fully explore whether overall human activity decreased during COVID-19 or not (our test for the periods before and during COVID-19). If the year 2022 reflects a return to “normal” (which is debatable given the COVID-19-driven rise in home office use), then 2020 and 2021 had on average lower levels of human activity (Fig. 4). Whether such a difference is biologically meaningful to birds is unclear given the immense day-to-day change in human mobility and presence (Fig. 4). Moreover, the number of humans captures within- and between-day variation rather than long-term changes in human presence.
 
 We added details on the new analysis to the Methods and Results sections (e.g. Fig. 4-6; L142-165, 418-438, 495-535) and Supplementary Information (Figs. S5-S9 and associated Tables) and discuss the issue accordingly. Moreover, to enhance clarity about country-specific effects (or their lack), we also added country-specific estimates to the Results (Fig. 1 and Fig. S6 and respective Tables). Finally, our statistical design and the random structure of the model allowed us to control for spatial and temporal variation in compliance with government restrictions.
 
 2) The interpretation of the primary finding (that behavioral responses to humans are inflexible) could use a bit more contextualization within the literature. Specifically, the study offers three potential explanations for the observed invariance in escape response: 1) these behaviors are consistent within individuals and this study provides evidence that there was no population turnover as a result of lockdowns; 2) escape response is linked to other urban adaptations such that to be an urban-dwelling species dictates escape response; and/or 3) these populations already exhibit maximum habituation and the reduction in human mobility would only have increased that habituation but that trait is already at a boundary condition. Some comments on each of these respectively:
 
 Thank you for these comments. We incorporated them in the main text (L293-329). Your point 1) corresponds to our point (i): “Most urban bird species in our sample may be relatively inflexible in their escape responses because the species may be already adapted to human presence” (L293-306); your point 2) to our point (ii): “Urban environment might filter for bold individuals (Carrete and Tella, 2013, 2010; Sprau and Dingemanse, 2017). Thus, the lack of consistent change in escape behaviour of urban birds during the COVID-19 shutdowns may indicate an absence (or low influx) of generally shy, less tolerant individuals and species from rural or less disturbed areas into the cities…” (L307-314); your point 3) to our point (iii): “Urban birds might have been already habituated to or tolerant of variation in human presence, irrespective of the potential changes in human activity patterns” (L315-329). To distinguish between (ii) and (iii) or the two from (i), individually-marked birds and comprehensive genetic analyses are needed, which we now note in the Discussion (L330-348). Importantly, we also discuss that the lack of response might be due to relatively small changes in human activity (L253-292), which we unfortunately could not fully quantify.
 
 a) Even had these populations turned over as a result of a massive rural-to-urban dispersal event, it's not clear that the escape distance in those individuals would be different because this paper does not establish that these hypothetical rural birds have a different behavioral response which would be constant following dispersal. Thus the evidence gathered here is insufficient to tell us about possible relocations of the focal species.
 
 Thank you for this point. We address this point in the Introduction and Discussion (L92-101, 307-314). Rural bird populations/individuals are on average less tolerant of humans than urban birds (e.g. Díaz et al. 2013, PloS One 8:e64634; Tryjanowski et al. 2020, J Tropic Ecol 36:1-5; Mikula et al. 2023, Nat Commun 14:2146) and at the same time, bird individuals seem consistent in their escape responses (Carrete & Tella 2010, Biol Lett 23:167–170; Carrete & Tella 2013, Sci Rep 3:1–7).
 
 Additionally, the paper cites several papers that found no changes in abundance or movements of animals in response to lockdowns but ignore others that do. For example: (Wilmers et al. 2021), (Warrington et al. 2022) (though this may have been published after this was submitted...), and (Schrimpf et al. 2021).
 
 We added the papers (L89-91). Thank you!
 
 There is a missed opportunity to consider the drivers of some of these results - the findings in this paper are interesting in light of studies that did observe changes in space use or abundance - i.e. changes in space use could arise precisely because responses to humans are non-plastic but the distribution and activities of humans changed.
 
 Thank you. Indeed, we now address this in the Discussion (L303-306): “However, some studies reported changes in the space use by wildlife (Schrimpf et al., 2021; Warrington et al., 2022; Wilmers et al., 2021) and these could arise, as our results indicate, from fixed and non-plastic animal responses to humans who changed their activities”.
 
 To wit, the primary finding here would imply that the reaction norm to human presence is apparently fixed over such timescales - however, and critically, the putative reduction in human activity/mobility combined with fixed responses at the individual level might then imply changes in avian abundance/movement/etc.
 
 Unfortunately, we have not measured changes in avian abundance or movements. Please note, however, that the change in human mobility in the sampled cities might not be as dramatic as initially thought, and we consider this scenario the most plausible explanation for the lack of significant differences in avian escape responses before and during the COVID-19 shutdowns (see Fig. 4). Nevertheless, we added your point to the Discussion: if our findings imply that in birds the reaction norm to human presence is fixed over the studied temporal scale, the putative changes in human presence might then imply changes in avian abundance or movement (L293 and text below it).
 
 b) If this were the case, wouldn't this be then measurable as a function of some measure of urbanity (e.g. Human Footprint Index) that varies across the cities included here? Site accounted for ~15% of the total variation in escape distance but was treated as a random effect - perhaps controlling for the nature of the urban environment using some e.g. remotely sensed variable would provide additional context here.
 
 Urbanity mirrors the long-term level of human presence in cities, whereas we were interested mainly in the rather short-term effects of potential changes in human presence on bird behaviour. Thus, we are not sure how adding such a variable would help elucidate the current results. Please also note that we added the country-specific analysis. Site indeed accounted for a considerable amount of the total variance in escape distance, and that is why it was included as a random intercept, which controls for the non-independence of data points from each city. This could partly help us to control for differences in habitat type (e.g. urbanization level) within cities.
 
 c) Because it's not clear the extent to which the populations tested had turned over between years, the paper could do with a bit more caution in interpreting these results as behavioral. This study spans several years so any response (or non-response) is not necessarily a measure of behavioral change because the sample at each time point could (likely does) represent different individuals. In fact, there may be an opportunity here to leverage the one site where pre-pandemic measures were taken several years prior to the pandemic. How much variance in the change in escape distance is observed when the gap between time points far exceeds the lifetime of the focal taxa versus measures taken close in time?
 
 We believe the initial Fig. S4, now Figure 2, addresses this point. The between-year temporal variation in FIDs exceeds the variation due to lockdowns. This is true both for measures taken in consecutive years and for measures taken far apart.
 
 d) Finally, I think there are a few other potential explanations not sufficiently accounted for here:
 
 i) These behaviors might indeed be plastic, but not over the timescales observed here.
 
 We agree and have added this point (L301-303). Thank you.
 
 ii) Time of year - this study took place during the breeding season. The focal behavior here varies with the time of year, for example, escape distance for many of these species could be tied up in nest defense behaviors, tradeoffs between self-preservation and e.g. nest provisioning, etc.
 
 Please, note that we controlled for the date in our analyses. Date was used as a proxy for the progress in the breeding season (L463-464 and Fig. 1 caption). Note that we collected data only from foraging or resting individuals, and data were neither collected near the nest sites nor from individuals showing warning behaviours, which we now note (L400-401).
 
 iii) Escape behaviors from humans are adaptively evolved, strongly heritable, and not context dependent - thus we would only expect these behaviors to change on evolutionary timescales.
 
 We discussed this at L307-308 and 381-383. Escape behaviors from humans are highly consistent for individuals, populations, and species (Carrete & Tella 2010, Biol Lett 23:167–170; Díaz et al. 2013, PloS One 8:e64634; Mikula et al. 2023, Nat Commun 14:2146). Whether such behavior is consistent across contexts is less clear (e.g. Diamant et al. 2023, Proc Royal Soc B, in press; but see, e.g. Radkovic et al. 2019, J Ecotourism 18:100-106; Gnanapragasam et al. 2021, Am Nat 198:653-659). Escape distance is often not measured simultaneously with, for example, human presence. In other words, whereas the general level of human presence may have no effect on escape distance, the day-to-day or hour-to-hour variations might. We need studies on fine temporal scales (day-to-day or hour-to-hour) using marked individuals to elucidate this phenomenon.
 
 iv) See point one above - it's possible that the lockdown didn't modify human activity sufficiently to trigger a behavioral response or that the reaction norm to human behavior is non-linear (e.g. a threshold effect).
 
 We agree. We now also use Google Mobility Reports and # of humans data to elucidate this phenomenon and have added such interpretations to L253-292 and, e.g. Fig. 4.
 
 LITERATURE CITED
 
 Geng DC, Innes J, Wu W, Wang G. 2021. Impacts of COVID-19 pandemic on urban park visitation: a global analysis. J For Res 32:553-567. doi:10.1007/s11676-020-01249-w
 
 Noi E, Rudolph A, Dodge S. 2022. Assessing COVID-induced changes in spatiotemporal structure of mobility in the United States in 2020: a multi-source analytical framework. Int J Geogr Inf Sci.
 
 Schrimpf MB, Des Brisay PG, Johnston A, Smith AC, Sánchez-Jasso J, Robinson BG, Warrington MH, Mahony NA, Horn AG, Strimas-Mackey M, Fahrig L, Koper N. 2021. Reduced human activity during COVID-19 alters avian land use across North America. Sci Adv 7:eabf5073. doi:10.1126/sciadv.abf5073
 
 Warrington MH, Schrimpf MB, Des Brisay P, Taylor ME, Koper N. 2022. Avian behaviour changes in response to human activity during the COVID-19 lockdown in the United Kingdom. Proc Biol Sci 289:20212740. doi:10.1098/rspb.2021.2740
 
 Wilmers CC, Nisi AC, Ranc N. 2021. COVID-19 suppression of human mobility releases mountain lions from a landscape of fear. Curr Biol 31:3952-3955.e3. doi:10.1016/j.cub.2021.06.050
 
 Reviewer #2 (Public Review):
 
 Mikula et al. have a large experience studying the escape distances of birds as a proxy of behavioral adaptation to urban environments. They profited from the exceptional conditions of social distance and reduced mobility during the covid-19 pandemic to continue sampling urban populations of birds under exceptional circumstances of low human disturbance. Their aim was to compare these new data with data from previous "normal" years and check whether bird behavior shifted or not as a consequence of people's lockdown. Therefore, this study would add to the growing body of literature assessing the effect of the covid-19 shutdown on animals. In this sense, this is not a novel study. However, the authors provide an interesting conclusion: birds have not changed their behavior during the pandemic shutdown. This lack of effects disagrees with most of the previously published studies on the topic. I think that the authors cannot claim that urban birds were unaffected by the covid-19 shutdown. I think that the authors should claim that they did not find evidence of covid-19-shutdown effects. This point of view is based on some concerns about data collection and analyses, as well as on evolutionary and ecological rationale used by the authors both in their hypotheses and results interpretation. I will explain my criticisms point by point:
 
 We are grateful for your positive appraisal of our manuscript, as well as for your helpful critical comments. We toned down the discussion to claim, as suggested by you, that we did not find evidence for effects of covid-19-shutdowns on escape behaviour of birds in urban settings (see Results and Discussion sections). In general, we attempted to provide a more nuanced discussion and reporting of our findings. We also changed the manuscript title to “Urban birds' tolerance towards humans was largely unaffected by the COVID-19 shutdowns” and added validation using Google Mobility Reports (Fig. 5 & S6, Table S3a and S5) and the actual number of humans (Fig. 6 and S6; Table S3b-e and S6). Note however that there is only a single robust study on the topic of shutdown and animal escape distances (Diamant et al. 2023, Proc Royal Soc B, in press), i.e. the topic is largely unexplored (e.g. L99-101), whereas we discuss our finding in light of shutdown influences on other behaviours (L293-329).
 
 1) The authors used ambivalent, sometimes contradictory, reasoning in their predictions and results interpretation. Some examples:
 
 We tried to clarify our reasoning and increased consistency in our claims in the Introduction. Please, note that we simplified the Introduction and now provide one main expectation: FIDs of urban birds should increase with decreased human presence. This pattern is robustly empirically documented, regardless of the mechanism involved (e.g. Díaz et al. 2013, PloS One 8:e64634; Tryjanowski et al. 2020, J Tropic Ecol 36:1-5; Mikula et al. 2023, Nat Commun 14:2146). Please, see our revised Discussion for a more comprehensive discussion of mechanisms which could explain the patterns described in our study.
 
 1.1) The authors claimed that urban birds perceive humans as harmless (L224), but birds actually escape from us, when we approach them... Furthermore, they escape usually 5 to 20 m away. This is more distance that would be necessary just to be not trampled.
 
 We agree and have deleted mentions that humans are perceived as harmless.
 
 1.2) If we are harmless, why birds should spend time monitoring us as a potential threat (L102)? Indeed, I disagree with the second prediction of the authors. I could argue that reduced human activity should increase animal vigilance because real bird predators (e.g. raptors) may increase their occurrence or activity in empty cities. If birds should increase their vigilance because the invisible shield of human fear of their predators is no longer available, then I would expect longer escape distances.
 
 Thank you for this comment. We deleted this prediction and largely rewrote Introduction based on your comments and comments from the other reviewers.
 
 1.3) To justify the same escape behavior shown by birds in pre- and pandemic conditions from an adaptive point of view, the authors argued a lack of plasticity and a strong genetic determination of such behavior. This contravenes the plasticity proposed in the previous point or the expected effect of the stringency index (L112).
 
 We have now attempted to write this more clearly while incorporating your suggestions. In the Discussion, we now propose various hypotheses that can, but need not, be mutually exclusive. Please note that we simplified the Introduction and now provide one main hypothesis: FIDs of urban birds should increase with decreased human presence.
 
 In my opinion, some degree of plasticity in the escape behavior would be really favorable for individuals from an adaptive perspective, as they may face quite different fear landscapes during their lives. Looking at the figures, one can see notable differences in the escape distance of the same species between sites in the same city. As I can hardly imagine great genetic differences between birds sampled in a park or a cemetery in Rovaniemi, for instance, I would expect a major role of plasticity to explain the observed variability. Furthermore, if escape behavior would not be plastic, I would not expect date or hour effects. By including them in their models, the authors are accepting implicitly some degree of plasticity.
 
 We regret being unclear. We do accept some degree of plasticity. Yet, our study design prohibits the assessment of the degree of individual plasticity because sampled birds were not individually marked and approached repeatedly. We tried to soften the statements in our Discussion to not fully dismiss a possibility that urban birds have some degree of plasticity in their antipredator behaviour (L293-329). Note however, that while our data collection was not designed to test how hour-to-hour changes in human numbers influence escape distance, the effect of the number of humans (i.e. hour-to-hour variation in human numbers) in our sample was tiny.
 
 The date and hour effects simply control for the particularities of the given day and hour (e.g. warm vs cold times or the time until sunset). In other words, the within-species differences (even from the same park) may have little to do with individual plasticity, but instead may reflect between-individual differences. We now add this issue to the Methods (L471-476): “This approach enabled us to control for spatial and temporal heterogeneity and specificity in escape behaviour of birds (e.g. species-specific responses, changes in escape distances with the progress in the breeding season, spatial and temporal variation in compliance with government restrictions or particularities of the given day and hour)....”
 
 2) Looking at the figures I do not see the immense stochasticity (L156, Fig. S3, S5) claimed by the authors. Instead, I can see that some species showed an obvious behavioral change during the shutdown. For instance, Motacilla alba, Larus ridibundus, or Passer domesticus clearly reduced their escape distances, while others like the Dendrocopos major, Passer montanus, or Turdus merula tended to increase it.
 
 At L138-141 and 327-329 we discussed the within- and between-genera and cross-country variation and stochasticity in response to the shutdowns (Fig. 2). The reference to species-specific plots was perhaps a little misleading. We think that the essential figure, which we now reference at this point, is Figure 2, which shows temporal trends and/or stochasticity that seem to have little in common with lockdowns. Please also look at Figure 3 and S3-S4. These show that in all selected genera/species, the trends did not significantly deviate from the central regression line, which indicates no change in FID before and during the COVID-19 shutdowns.
 
 On the other hand, birds in Poland tended to have larger escape distances during the shutdown for most species, while in Rovaniemi there was an apparent reduction of escape distances in most cases. The multispecies and multisite approach is a strength of this study, but it is an Achilles' heel at the same time. The huge heterogeneity in bird responses among species and sites counterbalanced and as a result, there was an apparent lack of shutdown effects overall. Furthermore, as most data comes from a few (European) species (i.e. Columba, Passer, Parus, Pica, Turdus, Motacilla) I would say that the overall results are heavily influenced (or biased) by them. The authors realize that results are often area- or species-specific (L203), therefore, does a whole approach make sense?
 
 We are grateful for this valuable comment. We believe the general approach makes sense as there is a general expectation about how birds should respond to changes in human presence. That is why we control for non-independence of data points in our sample. Thus, although many of the data come from a few European species, this is corrected for by the model. Note that given the sheer number of sampled species, some site- or species-specific trends may have occurred by chance. Importantly, we believe that Figure 2, with species- and site-specific temporal trends, reveals that the between-year stochasticity in escape distances seems greater than any effects of lockdowns. Nevertheless, we have further dealt with this issue in the revised manuscript by running country-specific models, which again clearly showed no significant effect of Period on escape behaviour of birds (including no effects in Poland and Finland).
 
 3) The previous point is worsened by the heterogeneity of cities and periods sampled. For instance:
 
 3.1) I can hardly imagine any common feature between a small city in northern Finland (Rovaniemi) and a megacity in Australia (Melbourne). Thus, I would not be surprised to find different results between them.
 
 3.2) Prague baseline data was for 2014 and 2018, while for the rest of the study sites were for 2018 and 2019. If study sites used a different starting point, you cannot compare differences at the final point.
 
 We are slightly confused by these comments.
 
 3.1) The cities are expected to be different but (i) the difference may be smaller than imagined (e.g. park structures, managed grass cover, few shrubs and deciduous-dominated tree species) and (ii) we expect the effects of lockdowns to be similar across cities. Whether we have no people in Rovaniemi parks (which despite Rovaniemi’s small size are usually extremely well-visited) or no people in Melbourne parks should not make a difference in principle. Note, however, that to avoid overconfident conclusions, we allow for different reaction norms within cities. Please also note that we are now providing country-specific results, which should identify whether shutdowns led to different reactions in the sampled countries. We found no strong effect of shutdowns in any of the sampled countries/cities.
 
 3.2) Because of the possible between-site differences at the starting point, we use study site as a random intercept and control for the between-site reaction norms by including the random slope of the period. In other words, such possible differences do not influence the outcomes of our models. Regardless, our a priori expectation is that the human activity levels in a given park were similar prior to COVID-19 and hence in 2014, 2018, and 2019. Again, we are now providing country-specific results, which identify whether shutdowns led to different reactions in the sampled countries, which they mostly did not.
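
In generic mixed-model notation, a random intercept for site combined with a random slope of Period by site can be sketched as below. The symbols are illustrative assumptions rather than the authors' notation, and the models described in the Methods include further fixed and random terms that are not shown here.

```latex
y_{ij} = \beta_0 + \beta_1\,\mathrm{Period}_{ij} + u_{0j} + u_{1j}\,\mathrm{Period}_{ij} + \varepsilon_{ij}
```

Here y_{ij} is the escape distance of observation i at site j, u_{0j} is the deviation of site j from the overall baseline (absorbing different starting points), u_{1j} is the deviation of site j from the overall Period effect (allowing site-specific reaction norms), and \varepsilon_{ij} is the residual.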
 
 3.3) Due to the obvious seasonal differences between the northern and southern hemispheres, data collection in Australia began five months later than in the rest of the sites (Aug vs Mar 2020). There, urban birds faced already too many months of reduced human disturbances, while European birds were sampled just at the beginning of the lockdown.
 
 We agree that each city, or even each park within a city, has its specific environmental conditions (here including the time point of lockdown). That is why we control for city and park location in the random structure of the model (see Methods section). We now add results per country that show no clear differences (e.g. Fig. 1).
 
 However, the aim of our study was to test for general, global effects of lockdowns, which are minimal. Note that we now specifically test for country-specific effects in separate models on each country (e.g. Fig. 1, Fig S6) but all country-specific effects are small and still centre around zero.
 
 3.4) Some cities were sampled by a single observer, while others by many of them. Even if all of them are skilled birders, they represent different observers from a statistical point of view and consequently, observer identity was an extra source of noise in your data that you did not account for.
 
 We agree. In Finland and Hungary, data were collected by two closely cooperating observers. In Poland, all data were collected by a single observer. In the Czech Republic and Australia, a single observer (P.M. and M.W., respectively) sampled 46 sites out of 56 and 32 sites out of 37, respectively. Each site was sampled by the same observer both before and during the shutdowns. We now clearly state this in the Methods (L352-356). In other words, our models already largely control for the possible observer confound by having site as a random intercept. Moreover, a previous study showed that FID estimates do not vary significantly between trained observers (Guay et al. 2013, Wildlife Research, 40, 289-293).
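
The logic of this argument can be sketched with a toy calculation: when each site is sampled by the same observer in both periods, a constant observer-specific bias cancels in the within-site contrast between periods. The code below is a simplified illustration under that additive-bias assumption, not the authors' mixed model; all site names, observer labels, and distances are hypothetical.

```python
from statistics import mean

def site_contrasts(records):
    """Return {site: mean(during) - mean(before)} from (site, period, distance) rows."""
    by_site = {}
    for site, period, distance in records:
        by_site.setdefault(site, {"before": [], "during": []})[period].append(distance)
    return {
        site: mean(groups["during"]) - mean(groups["before"])
        for site, groups in by_site.items()
    }

def add_observer_bias(records, observer_of_site, bias_of_observer):
    """Shift every distance by the constant bias of the observer who sampled that site."""
    return [
        (site, period, distance + bias_of_observer[observer_of_site[site]])
        for site, period, distance in records
    ]

# Hypothetical escape distances (metres) at two sites, before and during shutdowns.
records = [
    ("park_A", "before", 10.0), ("park_A", "before", 12.0),
    ("park_A", "during", 11.0), ("park_A", "during", 13.0),
    ("park_B", "before", 8.0),
    ("park_B", "during", 7.0), ("park_B", "during", 9.0),
]
# Each site is sampled by one observer in both periods (hypothetical labels and biases).
observer_of_site = {"park_A": "obs_1", "park_B": "obs_2"}
bias_of_observer = {"obs_1": 2.0, "obs_2": -1.5}

unbiased = site_contrasts(records)
biased = site_contrasts(add_observer_bias(records, observer_of_site, bias_of_observer))
print(unbiased)
print(biased)
```

The two printed dictionaries are identical, because the observer bias is added equally to the before and during measurements at each site. A site-level random intercept plays an analogous role in a mixed model, although it would not remove an observer bias that changed between periods.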
 
 4) Although I liked the stringency index as a variable, I am not sure if it captured effectively the actual human activity every day. Even if restrictive measures were similar between countries, their actual accomplishment greatly depended on people's commitment and authorities' control and sanctions. I would suggest using a more realistic measure of human activity, such as google mobility reports.
 
 Thank you for this comment. We now validate the use of the stringency index with the Google Mobility Reports, showing that human mobility generally (albeit in some countries relatively weakly) decreases with the strength of governmental antipandemic measures. Please note that our main research question concerns the general change in human outdoor activity, not the week-to-week, day-to-day or hour-to-hour changes captured by the stringency index, Google Mobility or the number of humans present during an escape trial. Nevertheless, using Google Mobility and the number of humans as predictors led to results similar to those for the stringency index and Period (Fig. 1 and S6). Please see the extended discussion on this topic in our manuscript (L270-292).
 
 5) The authors used escape trials from birds on the ground and perched birds. I think that they are not comparable, as birds on the ground probably perceive a greater risk than those placed some meters above the ground, i.e. I would expect shorter escape distances for perched birds. As this can be strongly dependent on the species preferences or sampling site (i.e, more or less available perches), I wonder how this mixture of observations from birds on the ground and perched birds could be affecting the results.
 
 We have now added information that most birds were sampled when on the ground (79%). Importantly, previous studies have found that perch height has a minimal effect on FIDs (e.g. Bjørvik et al. 2015, J Ornithol 156:239–246; Kalb et al. 2019, Ethology 125:430-438; Ncube & Tarakini 2022, Afr J Ecol 60:533–543; Sreekar et al. 2015, Tropic Conserv Sci 8:505-512). We added this information to the Methods section (L394-395).
 
 6) The authors did not sample the same location in the same breeding season to avoid repeated sampling of the same individuals (L331). This precaution may help, but it does not guarantee a lack of pseudoreplication. Birds are highly mobile organisms and the same individuals may be found in different places in the same city. This pseudoreplication seems particularly plausible for Rovaniemi, where sampling points must be necessarily close due to the modest size of this city.
 
 We appreciate your concern. We cannot fully exclude the possibility of sampling some individuals twice. However, we sampled during the breeding season, when most birds are territorial and active in the areas around their nests, so an individual switching parks is unlikely. Also, most sampled birds in our study are passerines, which have small territories (typically a few hundred square meters). Some larger birds may have larger territories and move larger distances to forage (e.g. kestrels, which often forage outside cities), but these birds represent a minority of our records and we did not sample outside the cities.
 
 7) An intriguing result was that the authors collected data for 135 species during the shutdown, while they collected data only for 68 species before the pandemic. Such a two-fold increase in bird richness would not be expected with a 36% increase in sampling effort during 2020-21. I wonder if this could be reflecting an actual increase in bird richness in urban areas as a positive result of the shutdown and reduced human presence.
 
 There were 141 unique day-years before COVID and 161 during COVID, so the sampling effort as calculated by days does not explain the difference in species numbers. Whether the actual effort, which was 381 vs 463 h of sampling, explains the difference is unclear, which we now note in the Methods (L476-483). If not, your proposition is possible, but we would like to avoid any speculation on this topic in the manuscript, as it is difficult to infer species diversity from FID sampling.
 
 8) The authors dismissed the multicollinearity problem of explanatory variables unjustifiably (L383). However, looking at fig. S1, I can see strong correlations between some of them. For instance, period and stringency index were virtually identical (r=0.95), while temperature and date were also strongly correlated.
 
 We are confused by this comment and think it reflects a misunderstanding. Period and stringency index are explanatory variables of interest that were never included in the same model, and hence their correlation does not contribute to within-model multicollinearity. To avoid further confusion, we note this in the legend of Fig. S2. However, we must be cautious when interpreting the results from the models on period, Google Mobility, # of humans and stringency index, as the four measures are similar.
 
 We discuss multicollinearity of explanatory variables within the manuscript (L458-538, 548-550) and noted that, with the exception of temperature and day within the breeding season (r = 0.48), the correlations among explanatory variables were minimal. We thus used only temperature as an explanatory variable (i.e. fixed factor; also because temperature reflects both season and variation in temperature across a season) whereas the day was included as a random intercept to control for pseudoreplication within day. Collinearity between all other predictors was low (|r| <0.36).
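 As a minimal sketch of the kind of pairwise screen described above (not the authors' actual code), the snippet below computes Pearson correlations between candidate predictors and flags pairs whose |r| reaches a chosen threshold. The function names, the threshold default and the toy values are all assumptions made for illustration; none of the numbers come from the study.

```python
from itertools import combinations
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def flag_collinear(predictors, threshold=0.36):
    """Return (name_a, name_b, r) for every predictor pair with |r| >= threshold.

    `predictors` maps a (hypothetical) predictor name to its list of values.
    """
    flagged = []
    for (name_a, xs), (name_b, ys) in combinations(predictors.items(), 2):
        r = pearson_r(xs, ys)
        if abs(r) >= threshold:
            flagged.append((name_a, name_b, r))
    return flagged


# Made-up illustrative values, NOT data from the study.
toy_predictors = {
    "temperature": [10, 12, 15, 18, 21, 24],
    "day": [1, 9, 20, 31, 40, 52],
    "flock_size": [3, 1, 4, 1, 5, 2],
}
flagged_pairs = flag_collinear(toy_predictors)
print(flagged_pairs)
```

 In this toy example only the temperature and day pair is flagged, which mirrors the decision to keep one of the two as a fixed effect and treat the other as a random intercept.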
 
 9) The random structure of the models is a key element of the statistical analyses but those random factors are poorly explained and justified. I needed to look up the supplementary tables to fully understand the complex architecture of the random part of the models. To the best of my knowledge, random variables aim to account for undesirable correlations in the covariance matrix, which is expected in hierarchical designs, such as the present one. However, the theoretical violation of data independence may happen or not. As the random structure is usually of little interest, you should keep it as simple as necessary, otherwise random factors may be catching part of data variability that you would like to explain by fixed variables. I think that this is what is happening (at least, in part) here, as the authors included a too-complex random structure. For instance, if you include the year as a random factor, I think that you are leaving little room for the period effect. The authors simplified the random structure of the models (L387), but they did not explain how. Nevertheless, this model selection was not important at all, as the authors showed the results for several models. I assume, consequently, that the authors are considering all these models equally valid. This approach seems quite contradictory.
 
 The random structure of the model controls for possible pseudoreplication in the data, that is, for cases where we have multiple data points that may not be independent and hence technically represent one. Apart from that, the random structure tells us where the variance in the data lies. This is often of interest, and your previous questions about city, site or species specificities can be answered with the random part of the model. To follow up on your example, year is included in the model because data from a single year are not independent (for example, because of a delayed breeding season in one year vs. another).
 
 We regret being unclear about the model specification and have attempted to clarify the methods (L466-476). We first specified a model with an ideal random structure that necessarily was complex (perhaps too complex). We then showed that using models with simpler random structures did not influence the outcomes. We now use a simpler model within the main text, but do keep the alternative models to show that the results are not dependent on the random structure of the model (Fig. S1 and Table S2).
 
 Reviewer #3 (Public Review):
 
 This study examined the changes in fear response, as measured by the flight initiation distances (FID), of birds living in urban areas. The authors examined the FIDs of birds during the pandemic (COVID-19 lockdown restrictions) compared to FIDs measured before the pandemic (mostly in 2018 & 2019). The main study justification was that human presence changed drastically during the pandemic lockdowns and the change in human presence might have influenced the fear response of birds as a result of changing the "landscape of fear". Human presence was quantified using a 'stringency' index (government-mandated restrictions). Urban areas were selected from within five different cities, which included four European cities (Czech Republic - Prague, Finland - Rovaniemi, Hungary - Budapest, Poland - Poznan), and one city in the global south (Australia - Melbourne). Using 6369 flight initiation distances across 147 different bird species, the authors found that FIDs were not significantly different before the pandemic versus during the pandemic, nor was the variation in FID explained by the level of 'stringency'.
 
 Major strengths: There are several strengths to this study that allows for understanding the variety of factors that influence a bird's response to fear (measured as flight initiation distances). This study also demonstrates that FIDs are highly variable between species and regions.
 
 Specifically,
 
 1) One of the major strengths of this paper is the focus on birds living in urban areas, a habitat type that is hypothesized to have changed drastically in the 'landscape of fear' experienced by animals during the pandemic lockdown restrictions (due to the presumed decrease in human presence and densities). Maintaining the focus on urban birds allowed for a deeper examination of the effect of human behaviour changes on bird behaviour in urban habitats, which are at the interface of human-wildlife interactions.
 
 2) This study accounted for several variables that are predicted to influence flight initiation distances in birds including species, genus, region (country), variability between years, pandemic year (pre- versus during), the strictness of government-mandated lockdown measures, and ecological factors such as the human observer starting distance, flock size, species-specific body size, ambient air temperature (also a proxy of the timing during the breeding season), time of day, date of data collection (timing within the regional [Europe or Australia] breeding season), and categorization of urban site type (e.g. park, cemetery, city centre).
 
 3) This study examined FIDs in two years previous to the pandemic (mostly 2018 and 2019, one site was 2014) which would account for some of the within- and between-year FID variation exhibited prior to the pandemic.
 
 4) This study uses strong statistical approaches (mixed effect models) which allows for repeat sampling, and a post hoc analysis testing for a phylogenetic signal.
 
 Thank you for your supportive and positive comments.
 
 Major weaknesses: The authors used government 'stringency' as a proxy for human presence and densities, however, this may not have been an accurate measure of actual human presence at the study sites and during measurements of FIDs. Furthermore, although the authors accounted for many factors that are predicted to influence fear response and FIDs in birds, there are several other factors that may have contributed to the high level of variation and patterns in FIDS observed during this study, thus resulting in the authors' conclusion that FIDs did not vary between pre- and during pandemic years.
 
 Thank you for your suggestions. We agree. To capture the general human presence in parks, we have now incorporated an analysis using Google Mobility Reports (Fig S6b), which directly measure human mobility in each of the sampled cities and specifically in urban parks, where most of our data were collected. We also address the further concerns that you detail below. Although it is not the main interest of our study, we have now also incorporated an analysis using the actual # of humans during an escape trial (Fig. S6c).
 
 Moreover, we think that including further possible confounds should not influence our conclusions. In other words, including further confounds will decrease the variance that can be explained by shutdowns and thus such shutdown effects (if any) would be tiny and hence likely not biologically meaningful.
 
 Specifically,
 
 1) The authors used "government stringency" as a measure of change in human activity, which makes the assumption that the higher the level of 'stringency', the fewer humans in urban areas where birds are living. However, the association between "stringency" and actual human presence at the study sites was not measured, nor was 'stringency' compared to other measures of human presence such as human mobility.
 
 Thank you for this essential comment. Initially, we viewed the Oxford Stringency Index as the best available index for our purposes. However, we now further acknowledge its limitations (L) and validate the Oxford Stringency Index with the Google Mobility Reports data, showing that both indices are generally negatively (albeit sometimes weakly) correlated across sampled cities (i.e. human mobility decreases with the increasing stringency index). Although other human presence indices were used in the past, e.g. Cuebiq, Descartes Labs and Maryland Uni index, Apple (see Noi et al. 2022, Int J Geograph Info Sci, 36, 585-616), we used only the Google Mobility index because (a) it is publicly available, (b) it is also available for territories outside the US, and (c) it provides data for urban parks within each city included in our dataset. Note, however, that Google Mobility data are inappropriate to answer our primary question, i.e. whether changes in human presence outdoors due to the COVID-19 shutdowns had any effect on avian tolerance towards humans. First, Google Mobility was available only for 2020-22, i.e. the baseline pre-COVID-19 data for 2018-2019 were unavailable. Thus, there was no way to check whether the human activity levels really changed during the COVID-19 years. Second, Google Mobility data are calculated as a change from a January–February 2020 baseline for each day of the week for each city and its location (here we used parks). In other words, the data are not comparable between days and cities, although we attempted to correct for this within the random structure of the mixed model. Also, the data may be influenced by extreme events within the 2020 Jan–Feb baseline period (see here). Third, Google Mobility varies greatly between days and across the season (see Fig 4 & S5 or the first figure in these responses), likely more than the possible change due to shutdowns.
Nevertheless, we found that results based on Google Mobility are qualitatively very similar to results based on the stringency index. Moreover, we showed that the relationships between # of humans and both Google Mobility and the stringency index (Figure 6) are weak and noisy, with 95% CIs widely overlapping zero (Table S3b-e). Also, similarly to other predictors of human presence, # of humans only poorly predicted changes in avian escape distances. We added details on the new analysis to the Methods, Results and Supplement (L134-165 and associated figures and tables, L415-535).
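 To illustrate why such baseline-relative values are hard to compare between days, here is a small sketch of a percent change computed against a per-weekday baseline. It assumes the baseline for each weekday is the median of the baseline-period values for that weekday; the function names and visit counts are hypothetical and are not taken from the Google Mobility Reports or from the study.

```python
from collections import defaultdict
from datetime import date
from statistics import median


def weekday_baseline(baseline_counts):
    """Median visit count per weekday (0 = Monday) over the baseline period."""
    by_weekday = defaultdict(list)
    for day, count in baseline_counts.items():
        by_weekday[day.weekday()].append(count)
    return {weekday: median(values) for weekday, values in by_weekday.items()}


def percent_change(day, count, baseline):
    """Percent change of `count` relative to the baseline for that weekday."""
    base = baseline[day.weekday()]
    return 100.0 * (count - base) / base


# Hypothetical park visit counts in the baseline period (two Mondays, two Saturdays).
baseline_counts = {
    date(2020, 1, 6): 100,   # Monday
    date(2020, 1, 13): 120,  # Monday
    date(2020, 1, 4): 200,   # Saturday
    date(2020, 1, 11): 220,  # Saturday
}
baseline = weekday_baseline(baseline_counts)

monday_change = percent_change(date(2020, 4, 6), 55, baseline)     # a Monday
saturday_change = percent_change(date(2020, 4, 4), 210, baseline)  # a Saturday
print(monday_change, saturday_change)
```

 Because each weekday has its own reference level, the same percentage on two different days can correspond to very different absolute numbers of visitors, which is one reason the values are not directly comparable between days or cities.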
 
 2) There was considerable variation in FID measurements, which can be seen in the figures, indicating that most of the variation in FID was not accounted for in the authors' models.
 
 We are confused by this statement. The fact that the FIDs varied does not directly imply that our models failed to account for the variation. Nevertheless, we do control for most of the discussed confounds (see further answers below). Importantly, it is unclear how including further possible confounds would influence our conclusions, unless the lockdown effects are tiny, in which case they might not be biologically meaningful.
 
 Factors that may have contributed to variation in FIDs that were not accounted for in this study are as follows:
 
 a. The authors accounted for the date of data collection using the 'day' since the start of the general region's breeding season (Europe: Day 1 = 1 April; Australia: Day 1 = 15 August). Using 'day' since the breeding season started probably was an attempt to quantify the effect of the breeding stage (e.g. territory establishment, nest young, fledgling) on FIDs. However, breeding stages vary both within- and between species, as well as between sub-regions (e.g. Finland vs. Hungary). As different species respond to predation or human presence differently depending on the stage during their breeding cycle, more specificity in the breeding cycle stage may allow for explaining the observed variation and patterns in FID.
 
 We agree. Although we do not have precise city-specific information on the timing of breeding stages in the sampled bird populations, we partly control for these effects by including a random intercept of day within each year and species. This random factor explained a relatively high portion of the variance in our data (see Table S1 and S2), perhaps as you expected.
 
 b. Variation in species-specific FIDs may also vary with habitat features within urban sites, such as the proximity of trees and other protective structures (e.g. perches and cover), the openness of the area, and the level of stressors present (e.g. noise pollution, distance to roads). Perhaps accounting for this habitat heterogeneity would account for the FID variation measured in this study.
 
 We agree. We don’t have such fine-scale data, but we included site identity (typically within a particular park or cemetery) which should account for the habitat heterogeneity among localities. Depending on the model, site explained relatively little variance (1-6%), indicating low heterogeneity between localities in these undescribed characteristics. Also note that park structure may be quite similar both within and between cities, i.e. managed green grass areas, with only a few shrubs and deciduous trees. Therefore, the possible minor habitat heterogeneity should not have any great impacts on our results.
 
 c. The authors accounted for species and genus within their models, however, FIDs may vary with other species-specific (or even specific populations of a species) characteristics such as whether the species/population is neophobic versus neophilic, precocial versus altricial, and the level of behavioural plasticity exhibited. These variables were not accounted for in the analysis.
 
 We agree that FIDs can be correlated with many possible factors. Here, we were interested in general patterns, while controlling for FID differences between species, as well as for possible species-specific reaction norms to lockdowns. Whether neophobic vs neophilic populations or precocial versus altricial species react differently to lockdowns might be of interest, but it is beyond the scope of this study. However, population and population-specific reaction norms explain little variation (Table S2a, 0-6% of variation), so such a confound should not substantially influence our conclusions. We do not have fine-scale data on the level of neophobia, but the effects of lockdowns seem similar for precocial (see Anas, Larus, Cygnus) and altricial (the remaining, mostly passerine) species in our dataset (see Fig. 3 and S3-S4). Please note that we sampled mainly adults (L386). Moreover, the effects for clades, which may differ in their cognitive skills, are also similar (e.g. Corvids vs. Anas or Cygnus; Fig. 3).
 
 d. Three different methods of measuring the distances between flight and the observer location were used, and FIDs were only measured once per bird, such that there were no measures of repeatability for a test subject. Thus, variation surrounding the measurement of FIDs would have contributed to the variation in FIDs seen during this study.
 
 While all observers were trained, the three methods may add some noise to the FID estimates. However, FID estimates from a single method may still differ slightly between observers (as do well-standardized morphological measurements; Wang et al. 2019, PLoS Biology, 17, e3000156). Importantly, FID estimates are highly replicable among skilled observers (Guay et al. 2013, Wildlife Research 40:289-293), and we previously validated this approach and showed that distance measured by counting steps did not differ from distance measured by a rangefinder (Mikula 2014, Ardea 102:53-60), which we now explicitly state (L391-394). Importantly, we control for observer bias by specifying locality as a random intercept (see further details in our response to the Editor). Moreover, each site was sampled by the same observer both before and during the shutdowns.
 
 3) The sample design of this study may have influenced the FID variability associated with specific species, and specific populations of species. A different number of species were sampled across the time periods of interest; 68 species were sampled before the pandemic versus 135 species after the pandemic. However, the authors do not appear to have directly compared the FIDs for the same species before the pandemic compared to during the pandemic (e.g. the FIDs of Eurasian blackbirds before the pandemic versus during the pandemic). Furthermore, within the same country-city, it is unclear whether the species observed before the pandemic were observed at the same location (e.g. same habitat type such as the same park) during the pandemic. As a species' FID response may be influenced by population characteristics and features specific to each site (e.g. habitat openness), these factors may have influenced the variability in FID measurements in this study.
 
 We regret being unclear in our methods. Our full model uses all data, but alternative models (see e.g. Fig. S1) used data with ≥5 as well as ≥10 observations before and during lockdowns for a given species. Importantly, Figure 2 and 3 depict data for species sampled at specific sites. We now clarify this within the Methods (L460-483) and the Results (L125-133 and associated figures) and in the figure legends (Fig. S1).
 
 4) The models in this study accounted for many factors predicted to affect FIDs (see the section on major strengths), however, the number of fixed and random factors are large in number compared to the total sample size (N =6369), such that models may have been over-extended.
 
 The number of predictors and random effects is well within the limits for the given sample size (Korner-Nievergelt et al. 2015. Bayesian Data Analysis in Ecology Using Linear Models with R, BUGS, and Stan). Importantly, simpler models give similar results as the more complex ones (Fig. S1) and the visual (model free) representations of our raw and aggregated data confirm our model results. This, we suggest, makes our findings robust and convincing.
 
 Overarching main conclusion
 
 Overall, this study examines factors influencing FIDs in a variety of bird species and concludes that FIDs did not differ during the pandemic lockdowns compared to before the pandemic (2019 and earlier). Furthermore, FIDs were not influenced by the strictness of government-mandated restrictions. Although the authors accounted for many factors influencing the measurement of FIDs in birds, the authors did not achieve their aim of disentangling the effects of pandemic-specific ecological effects from ecological effects unrelated to the pandemic (such as habitat heterogeneity).
 
 We find this statement confusing. We accounted for most relevant confounding factors and found little evidence for a strong effect of the pandemic. Moreover, we have now added country-specific analyses that confirm the lack of evidence, highlight Figure 3, which shows no clear shutdown effect, and explore how levels of human presence changed over and within the years. Adding more possible confounds (although not many are left to add) might only further reduce the variation that could be explained by the pandemic, and hence any such hypothetical effects of the pandemic would be small and thus likely not biologically meaningful.
 
 Their findings indicate that FIDs are highly variable both within- and between- species, but do not strongly support the conclusion that FIDs did not change in urban species during the pandemic lockdown. Therefore, this study is of limited impact on our understanding of how a drastic change in human behaviour may impact bird behaviour in urban habitats.
 
 It is unclear why you think our study lacks support for the conclusion that FIDs changed little during the pandemic, given that all results show no such effects. However, we toned down our Discussion and also highlighted potential issues linked to our approach (e.g. that sampled individuals were not marked and hence we cannot distinguish between various mechanisms that might explain the described pattern (L293-329), or that human presence may not have changed (L253-269)). For further details, see our previous response.
 
 Overall, the study demonstrates the challenges in using FIDs as a general fear response in birds, even during a pandemic lockdown when fewer humans are presumably present, and this study illustrates the large degree of variation in FIDs in response to a human observer.
 
 We appreciate and agree that our study demonstrates the challenges in quantifying human activity to understand bird escape distance and we added a paragraph on this topic to the discussion (L270-292).
 
 Nevertheless, we hope that our responses above clarify and address most of the issues you had with our manuscript. We tried to show that (a) most of your proposed controls are indeed included in our study design, models, and visualisations, and that (b) multiple lines of evidence (from models and visualisation of raw and aggregated data) support the conclusion of no overall effect. We further emphasize the temporal and between- and within-species variability in FIDs in the Results and now specifically indicate that lockdowns did not influence FIDs above such variability (Fig. 2-3, Fig. S3). In other words, the natural (e.g. temporal) variation in FIDs seems far greater than the potential effects of lockdowns (Fig. 2). We believe that even if lockdowns had tiny effects that could have been detected with a more stringent experimental design (e.g. individually tagged birds) or even more complex models, such effects would be far from biologically meaningful.
 
URL

biorxiv.org/content/10.1101/2022.07.15.500232v1
www.biorxiv.org www.biorxiv.org

New submission 20/08/2023, 17:16:47

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The authors managed to show the broad botanical landscape and not only the main crops. This unique achievement is based on decades of establishing an excellent collection of a full comparative seed collection of the current flora. This allows the identification of species that usually are not identifiable. The authors were able to compare the crops that were grown there and identify the contribution of the Roman period with that of the Arab one. This excellent study is a landmark in how such studies should be done. The list of identified species will be used for many other studies on this subject.
  
  We are very grateful to Reviewer #1 for this generous assessment.
  
  Reviewer #2 (Public Review):
  
  Fuks et al. provide extensive paleobotanical data from several sites in the Negev desert to address hypotheses regarding the relative importance of the Roman Agricultural Diffusion (RAD) and the Islamic Green Revolution (IGR) in the dispersal of crops across Eurasia.
  
  While the overall claims from the authors are convincing, I found the presentation of the data somewhat difficult to follow.
  
  Graphical visualization of the data with respect to the proposed hypotheses would go a long way towards making the argument clearer for a non-specialist audience.
  
  The authors apply appropriate caveats in the discussion about their ability to assess IGR given their timeline only incorporates the first few hundred years and some IGR plants may not leave macrobotanical remains. Yet I think more could be done to explain how the data they do find provides positive evidence for RAD. Many of their findings are inferred to be RAD introductions not because of the timing in their sites, but because of previous evidence of introductions at other sites. It would thus be helpful to be more explicit about what additional evidence these findings provide beyond previously published data of introductions of many of these crops into the Levant.
  
  We thank Reviewer #2 for the positive assessment and helpful comments. We have moved several tables out of the main text to the supplementary tables. We also added a new schematic of the main findings regarding 1st millennium CE introductions to the southern Levant and their significance in the Negev Highlands crop assemblage (Figure 4). We have also added explanatory text to clarify the point about taphonomy vs. period of diffusion.
  
URL

biorxiv.org/content/10.1101/2022.12.01.518650v1
www.medrxiv.org www.medrxiv.org

New submission 06/08/2023, 15:14:00

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  eLife assessment
  
  This paper is of interest to researchers and policy makers involved in cervical cancer prevention. The paper provides insight into how the Covid19 pandemic accelerated changes in organized cervical cancer screening. The claim that self-sampling led to a major improvement of test coverage seems somewhat exaggerated and alternative hypotheses to those provided by the authors on the population who chose self-sampling are possible. Nonetheless, this is a valuable piece of work given the scope of the intervention(s) and the precedent it sets i.e. a crisis can in fact accelerate positive changes in screening that have been academic possibilities rather than practical realities.
  
  Thank you for this supportive summary. We have included exact data on how much of the population test coverage was attributable to self-samples. We have furthermore decided to focus on the population test coverage that results from organised testing (either taken by a clinician at a time and place that the woman was invited to by the organised program or taken by the woman herself using a sampling kit mailed to her by the organised program). These 2 improved analyses are intended to facilitate interpretation of how much of the improved test coverage is attributable to the mailing of self-sampling kits.
  
  Reviewer #1 (Public Review):
  
  During the Covid19 pandemic, most cervical cancer screening programs were temporarily put on hold. The authors describe how Swedish health authorities dealt with this situation by implementing primary self-sampling and by launching a campaign with concomitant vaccination and screening. Besides, they show that the coverage of the screening program was one year after the start of the pandemic at pre-pandemic levels.
  
  Strengths of the paper are the clear presentation of the steps taken by the Swedish health authorities and the high quality of the presented screening coverage data which could be obtained directly from the screening registry. However, the paper would benefit from more in-depth analyses because the presented data raise questions. The number of invitations was >30 percent lower in the first year of the pandemic (Figure 1), but the screening coverage was only 4-5 percent lower. In the second year of the pandemic (year 2021), coverage was back at pre-pandemic levels, but the role of primary self-sampling in restoring screening coverage is a bit unclear. It is obvious that primary self-sampling made it possible to invite women again for screening during the pandemic, but there is no data on acceptance of primary self-sampling. Besides, the increase in coverage in year 2021 was only 4% and it is not clear whether such a modest increase could also have been achieved without primary self-sampling. In addition to self-sampling, the authors describe the launch of a concomitant vaccination and screening campaign. This is an interesting initiative but the authors do not show data on the coverage of this campaign in the target age range.
  
  We now explain that population test coverage is calculated over a whole screening interval. For example, if the screening interval is 3 years, improved attendance would only fully impact the population test coverage after 3 years. Furthermore, we now present the exact data on how much of the test coverage is attributable to the mailing of self-sampling kits.
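  
  The point about coverage being calculated over a whole screening interval can be illustrated with a minimal sketch. It assumes a 3-year interval in which one third of eligible women are due each year and each woman is tested at most once per interval; the attendance figures are hypothetical and are not taken from the Swedish registry.

```python
# Hypothetical illustration: rolling population test coverage over a screening interval.
# Assumption: equal-sized groups of women are due each year, and coverage in a
# given year is the mean attendance over the trailing interval.

def rolling_coverage(attendance_by_year, interval_years=3):
    """Return coverage for each year that has a full trailing interval."""
    coverage = []
    for i in range(interval_years - 1, len(attendance_by_year)):
        window = attendance_by_year[i - interval_years + 1 : i + 1]
        coverage.append(sum(window) / interval_years)
    return coverage

# Made-up attendance among women due that year: 70% for three years, then 80%.
attendance = [0.70, 0.70, 0.70, 0.80, 0.80, 0.80]
coverage = rolling_coverage(attendance)

for year_index, value in enumerate(coverage, start=3):
    print(f"year {year_index}: coverage {value:.3f}")
```

  Under these assumptions, coverage rises stepwise and reaches the new attendance level only once a full interval has elapsed.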
  
  Reviewer #2 (Public Review):
  
  The manuscript by Elfstrom et al describes the impact of implementing self-sampling as the primary screening test in Sweden to address decreases in coverage following the COVID pandemic. The authors have a very rich dataset including all records of invitations to screen and screening results in the Stockholm area. A limitation is that there is no individual record linkage to allow investigation of the profile of the individuals who chose to screen using the self-sample.
  
  The conclusions are generally well supported by the authors with the following exceptions:
  
  1) There was not enough evidence presented in the manuscript to conclude that "The most likely explanation for the large increase in population coverage seen is that the sending of self-sampling kits resulted in improved attendance in particular among previously non-attending women."
  
  2) The authors state there is no evidence that delays in screening have impacted cervical cancer rates however they present no data to this effect in the manuscript.
  
  Although all screening and invitation data are indeed collected in the national screening registry, linking these data is not allowed without permission from the Swedish National Ethical Review Board. We applied for such permission, which was granted on 2023-02-01, and a full set of registry linkage analyses to investigate the point raised by the reviewer is now included.
  
  The mention in the discussion of stable cervical cancer rates referred to public data from the national Cancer Registry. The source is now referenced.
  
  Reviewer #3 (Public Review):
  
  The authors report on the nature of interventions that were applied to aid and improve engagement in cervical screening, brought about by the SARS CoV Pandemic in Sweden.
  
  I appreciate that the impact of these interventions, given that they are recent, will take some time to quantify but the description (and reach) of the policy changes that occurred in a short amount of time is of significant interest to the screening community. The piece on HPV Even Faster is particularly novel; I am not aware of another example of where this has been enacted within a routine programme.
  
  Thank you for this supportive statement.
  
  The authors make reference to (15) where the reader can find greater details relating to the population who received the offer of self sampling (and the nature of the device). However I was a little confused (in this stand alone piece) as to who the self sampling group constituted exactly. Did this group not include pregnant women, women invited for first screen or women on non routine recall?
  
  This is correct: self-sampling kits were mailed to all women due for screening in the ages 26-70. Women due for screening aged 23-25 were invited for midwife-based sampling. Pregnant women were advised to come in for midwife-based screening, to save time. Women under follow-up from previous screens are not due for screening. This is now elaborated more clearly in the paper.
  
  The authors state that "the most likely explanation for the large increase in population coverage seen is that the sending of self-sampling kits resulted in improved attendance in particular among previously non-attending women" - why is this written as speculation at this stage (?) is it not possible to attribute directly the contribution made by self sampling, or is this in hand?
  
  See response to reviewer 2 above: Although all the data is indeed collected, we are not allowed to perform registry linkages without ethical permission. This has now been obtained and the requested analyses made.
  
  While self sampling is certainly an option that can support uptake and enfranchisement in cervical screening - its overall performance is fundamentally contingent on the number of women who then comply with follow up should the HPV test be positive; it is not simply about who returns the sample. It would have been of interest to see the proportion of women who did comply with follow up.
  
  The paper is not about follow-up strategies. Follow-up strategies are different in different settings and reporting is not standardized. They have also changed during the time of the study (e.g. cytology follow-up abandoned). A more detailed analysis of this would require a whole new paper.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2022.07.19.22277806v1
www.medrxiv.org www.medrxiv.org

New submission 13/08/2023, 11:49:35

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  1) It could benefit from fleshing out concepts instead of using parentheses, particularly in the abstract.
  
  We agree and have amended the abstract and methods (please refer to the responses provided to the editor’s comments 1a-1e).
  
  2) There is space to expand on the results presented in Table 1, including an explanation of Affected cohorts 2008 vs Affected cohorts 2008-2009. It may also be useful to explain this analysis in the methods section.
  
  Please refer to response provided to editor on the same question (comment 5).
  
  3) Given that Australia is a best-case scenario and other countries have not had the same success in HPV vaccination coverage, in the discussion would it be possible to give a comparison of how these three scenarios would look different in a population with school-based vaccination but lower coverage volume, such that readers could understand how much of the success / failures of each of the three catch-up scenarios? It would be particularly helpful for readers who are not familiar with the modelling tool used in this analysis.
  
  We have added some commentary in the discussion in response to the reviewer’s comment. In future, further similar work in countries with lower base coverage would be informative.
  
  “Australia is a relatively high HPV vaccination coverage setting. Outcomes may be less favourable in a lower coverage setting, as there would be less protection from herd effects; however, the impact of disruptions might also be smaller in a setting with lower coverage, since a lower coverage program would be less effective. Nevertheless, the finding that if catch-up is performed expeditiously then it mitigates much of the effect from vaccination delays, is likely to hold in other settings. A previous study (Simms et al, Lancet Public Health. 2020 Apr;5(4):e223-e234) modelling the health impacts of HPV vaccination hesitancy in Japan from 2013 to 2019 and the potential effects of restoring coverage to 70% with catch-up vaccination in 2020 is informative, as it demonstrates that multi-age HPV catch-up vaccination, after catastrophic falls in coverage in Japan, would be effective in mitigating the effects.”
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.03.07.23286911v1
www.medrxiv.org www.medrxiv.org

New submission 13/08/2023, 11:45:51

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Ghosh and colleagues report on their multidisciplinary effort to improve cervical cancer screening attendance in the East Boston Neighborhood Health Center (March-August 2021). Specifically, the authors 1) identified using electronic medical records overdue follow-up visits, 2) scheduled screening appointments during regular clinic hours and weekends/evenings, and 3) surveyed patients on their experience. These objectives were clearly defined (although not consistently so throughout the manuscript) and data analyses/presentation were simple and straightforward, appropriate to the study design and methodology used.
  
  Thank you for this comment. We have clarified the objectives in the revised manuscript.
  
  Overall, it is unclear to what extent the overdue appointments were backlogs created by the COVID-19 pandemic or due to pre-pandemic factors that could have been exacerbated by the pandemic. In order to contextualize the current study and its findings, an elaboration is needed on whether the pandemic created the delays in cervical cancer screening or simply compounded the problem. For example, the authors report on page 8, lines 196-197 that in 30% of encounters (not clear how many of the 118 reviewed charts were overdue appointments) the healthcare provider did note the overdue appointments.
  
  We have revised Figure 2 (now Figure 4) and added Figures 2 and 5 to address this comment. In 2019, prior to the COVID-19 pandemic, approximately 70% of patients were up-to-date with cervical cancer screening, corresponding to 8467 patients overdue for screening. In 2020, the up-to-date percentage dropped to 63.5% and the overdue number increased to 8812. Figure 2 is a flowchart of the project which clarifies the “30%” mentioned in the reviewer comment.
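  
  As a rough consistency check on the figures quoted above, the eligible population implied by each year's overdue count and up-to-date percentage can be back-calculated. This is only a sketch: the function name is hypothetical, and because "approximately 70%" is a rounded value, the implied 2019 denominator is indicative at best and is not a number reported by the authors.

```python
# Back-calculate the eligible population implied by an overdue count and an
# up-to-date fraction: overdue = eligible * (1 - up_to_date_fraction).

def implied_eligible(overdue, up_to_date_fraction):
    """Return the eligible population implied by the two quoted figures."""
    return overdue / (1.0 - up_to_date_fraction)

eligible_2019 = implied_eligible(8467, 0.70)   # "approximately 70%" up-to-date
eligible_2020 = implied_eligible(8812, 0.635)  # 63.5% up-to-date

print(f"2019 implied eligible population: {eligible_2019:.0f}")
print(f"2020 implied eligible population: {eligible_2020:.0f}")
```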
  
  In addition, a brief description of the cervical cancer screening program in place would be informative.
  
  We have added this in the “setting” section of the methods (pages 4-5, lines 107-128).
  
  Table 1 provides an effort versus value summary; however, these constructs are ill-defined, with a few inconsistencies with what is reported in the text.
  
  This table is intended to help inform clinics that are considering implementing quality improvement programs about the effort required and value obtained for different aspects of our program. These are based in part on proprietary cost analyses so certain details are not able to be included. We have amended the text/table to eliminate inconsistencies.
  
  Comments specific to Aim 1:
  
  The methodology is missing information on key elements, mainly relating to the decision-making process of establishing and defining the "validated" patient chart list (1375 overdue patients out of 6126 reviewed charts). A chart of the 1375 approached study population is also warranted (459 patients were screened, 622 could not be reached, and 203 cancelled/missed their appointments; what about the remaining 91 patients?). A description of the characteristics of the study population and a comparison of the different groups (screened, not reached, cancelled/missed appointment) along these characteristics are missing.
  
  We have added a flowchart with this information to the results section. See Figure 2.
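  
  The tally behind the reviewer's question about the remaining patients can be reproduced with a short sketch. The counts are those quoted in the comment above; the variable names are illustrative only.

```python
# Tally the outcome groups quoted in the review against the approached population.
approached = 1375
outcomes = {
    "screened": 459,
    "could not be reached": 622,
    "cancelled or missed appointment": 203,
}

accounted = sum(outcomes.values())
unaccounted = approached - accounted

for label, count in outcomes.items():
    print(f"{label}: {count} ({count / approached:.1%})")
print(f"accounted for: {accounted}, not accounted for: {unaccounted}")
```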
  
  Comments specific to Aim 2:
  
  About 63% of the 459 scheduled screenings were done during the evening/weekend clinics, which represents a substantial gain and clearly indicates a window of opportunity to increase screening rates by pinpointing the importance of offering a convenient time for women to attend screening visits. In general, and as expected, offering additional screening clinics was effective in addressing the backlog of patients, although with significant investment and resources as mentioned by the authors. How significant is significant?
  
  We are not able to share these data publicly. We have added the following sentence: “The cost data is proprietary/not shareable but analysis by clinical leadership indicated the program was not cost-effective/sustainable.” (page 22, lines 678-80).
  
  Comments specific to Aim 3:
  
  A more structured and detailed presentation/description of the survey instrument, its administration, response rate, and significance of results are warranted in the manuscript, albeit the joint reporting of this in the appended material.
  
  We have added additional detail about the survey method (page 9, lines 225-6, 228-31) and results (pages 14-5, lines 518-22, 530-3). We also inserted the survey used in the clinics (Figure 1).
  
  Reviewer #2 (Public Review):
  
  The purpose of this study is unclear from the introduction. Additionally, the methods are incomplete and did not describe how data was collected and analyzed. The results do not describe the sample. Once these are described more clearly, further comments can be made about what the authors were trying to achieve and the impact of the work on the field.
  
  We have clarified the study purpose in the introduction: “The purpose of the project was to examine the impact of a Quality Improvement intervention on improving cervical cancer screening, as well as to evaluate the effectiveness and sustainability of different methods for addressing overdue screening.” (page 3, lines 87-90). We have also clarified the methods and results to describe more completely the data extraction from electronic medical records and the statistical analysis using descriptive statistics.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.01.20.23284607v1
www.biorxiv.org www.biorxiv.org

Exposure to high-sugar diet induces transgenerational changes in sweet sensitivity and feeding behavior via H3K27me3 reprogramming

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  This work attempts to connect the diet of a mother to the physiology and feeding behaviors of multiple generations of her offspring. Using genetic and molecular biology approaches in the fruit fly model, the authors argue that this Lamarckian inheritance is mediated by germline-inherited chromatin and is regulated by the general activity of a histone methylase. However, many of the measured effects are small and variable, the statistical tests to prove their significance are missing or poorly described, and some experiments are inadequately described and lack important controls.
  
  1) The authors claim that the diet of a mother can influence the physiology of her progeny for several generations. However, the observed effects of maternal diet on later generations were small and variable for most assays (see Fig1C, S1.1A, B, D). Additionally, the effect size between F0 HSD to ND was often larger than the effect size between the progeny of F0 parents and ND. To put it another way, if the authors were to compare the F1, F2, etc. to the F0 HSD flies, they would conclude that the majority of the response to diet is not maternally transmitted, and is directly controlled by the diet of the individual being measured.
  
  We agree with the reviewer that the effect size of acute HSD exposure (in HSD-F0 flies) was stronger than that of transgenerational inheritance (in HSD-F1/2/3/4 flies). Similar observations were also made in other studies (see Klosin et al., Science, 2017; Bozler et al., eLife, 2019). We would argue that this difference in effect size is expected and has clear biological relevance.
  
  For all living organisms, acute environmental changes (diet change included) have direct and profound influences on their survival and reproduction, and therefore need robust and immediate responses. In comparison, ancestral environmental changes may only provide some vague and indirect indications of the current living environment of the offspring. Such information may be beneficial for the survival and reproduction of the offspring, but the effect size is expected to be much smaller, or at least smaller than that of acute environmental changes.
  
  Studies of the Dutch Famine offer a good example. Human individuals who were prenatally exposed to famine were found to have a greater risk of metabolic diseases (Ravelli et al., NEJM, 1976). Nevertheless, direct high-fat diet exposure was still the much stronger risk factor for obesity and metabolic disorders (Bray et al., Am J Clin Nutr, 1998, Jéquier et al., Int J Obes Relat Metab Disord, 2002).
  
  We have added additional discussions in the manuscript for clarification.
  
  Furthermore, since our current study aimed to investigate the mechanism of behavioral transgenerational inheritance, we focused on the comparison between HSD-F1 flies (and their progeny) vs. ND-fed flies. As the ancestors of HSD-F1/2/3/4 flies were exposed to HSD, whereas HSD-F1/2/3/4 flies themselves were never exposed to HSD, any difference we observed between the two groups could be solely attributed to transgenerational inheritance of ancestral HSD exposure. That said, to better distinguish the effects of acute HSD exposure vs. transgenerational inheritance upon ancestral HSD exposure, we re-analysed and presented the comparisons among ND, HSD-F0, and HSD-F1 data in the manuscript (Figure 1. B-E, Figure 1-figure supplement 1. A-E, Figure 1-figure supplement 2. A-D, Figure 3. D-E, Figure 3-figure supplement 1. B-D, Figure 3-figure supplement 2 and 3. A-B).
  
  2) The authors chose to study PER, which had the largest average effect sizes between conditions. However, PER was highly variable in the averaged data, with some individuals showing large effects and others having no effects. A better characterization of transgenerational PER may increase the robustness of this assay and confidence in its results. For example, the authors could measure PER in lineages derived from individual flies to determine when transgenerational effects on PER decline or disappear. This form of data collection could help to explain the high variation in the averaged data presented in the paper.
  
  We acknowledge that PER in general is quite a variable behavioural trait (as are probably most, if not all, behavioural measures). This is not surprising, since animal behaviours, as complex traits, can be influenced by numerous intrinsic and extrinsic factors, such as genetic background, developmental environment, diet, population density, environmental conditions, etc. Numerous PER studies have exhibited similar variability (Masek et al., PNAS, 2010, Marella et al., Neuron, 2012, Charlu et al., Nature Communications, 2013, Wang et al., Cell Metabolism, 2016, Wang et al., Cell Reports, 2020).
  
  Nevertheless, in our current study we were able to identify statistically significant behavioural difference between ND-fed flies and HSD-F1/2/3 flies, demonstrating that ancestral HSD exposure imposed transgenerational inheritance on sweet sensitivity. To further increase the robustness of the study as suggested by the reviewer, we have conducted additional repetitions of many PER experiments and further confirmed the phenotype with less variability and more statistical power (Figure 1. G-I, Figure 3. D-E, Figure 3-figure supplement 1. B-D, Figure 3-figure supplement 2 and 3. A-B). The reviewer also suggested the use of isogenic flies, which might help to minimize the variations of genetic background. However, we think that demonstrating the behavioural difference in genetically diverse fly populations is a more credible way to show that such transgenerational inheritance is a reliable and generalizable phenomenon.
  
  3) What do the error bars represent on any figure? There are many examples where the data is highly variable and lies completely outside of the error bars. What is the statistical test for significance that is carried out in each figure? The brief comment about statistics in the methods section is inadequate. The authors should also supply the raw data used to generate the figures so that readers can perform their own statistical tests.
  
  Data in the manuscript were represented as means ± SEM (standard error of the mean) in all of our figures, which is a standard practice in the field (Masek et al., PNAS, 2010, Charlu et al., Nature Comm, 2013, Wang et al., Cell Metabolism, 2016). We have provided detailed explanations of the statistical tests in the manuscript. We have also prepared raw data files as suggested by the reviewer.
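  
  For readers unfamiliar with the convention, the mean ± SEM summary mentioned above can be sketched as follows. The values are hypothetical and are not the study's raw data; the sketch also shows why individual data points can fall well outside SEM error bars, since the SEM describes uncertainty in the mean rather than the spread of individual observations.

```python
import math
import statistics

def mean_sem(values):
    """Return (mean, standard error of the mean) using the sample standard deviation."""
    n = len(values)
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(n)
    return mean, sem

# Hypothetical per-fly response fractions (illustrative only).
responses = [0.2, 0.4, 0.4, 0.6, 0.9]
mean, sem = mean_sem(responses)
outside = [v for v in responses if abs(v - mean) > sem]

print(f"mean = {mean:.3f}, SEM = {sem:.3f}")
print(f"{len(outside)} of {len(responses)} points lie outside mean ± SEM")
```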
  
  The model that global H3K27me3 is regulated by ancestral diet is unconvincing without further experimental validation and explanation. Points 4-10 address specific issues.
  
  4) The authors performed ChIP on cycle 11 embryos. This stage is extremely short (11 min) and contains roughly 10 times less chromatin than embryos only 30 minutes older. These features make it very difficult to collect large numbers of precisely staged embryos without significant contamination. It is also debatable whether early cell cycles (including and preceding cycle 11) are slow enough to deposit and propagate histone marks in the presence of new histone incorporation. See the opposing arguments in Zenk et al 2017 and Li et al 2014. The authors could perform ChIP on older embryos to avoid this controversy.
  
  We thank the reviewer for the clarification. Our embryo collection protocol involved allowing flies to lay eggs freely in a cage for 30 minutes, followed by 50 minutes of incubation on a juice plate, and then completing the embryo sorting within 30 minutes. Therefore, to describe it more stringently, our embryos should be between cycles 10 and 12. We have corrected this information in the manuscript (Figure 2. A).
  
  Since all the embryos were sorted using the same morphological criteria within the same time frame, their developmental stages should be comparable (i.e. all from cycle 10-12). In several references we consulted, a broader range (cycle 9-13) was used for ChIP-seq analysis (for example, see Zenk et al., Science, 2017).
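  
  The collection timing described above implies a window of embryo ages at the time of sorting, which can be sketched as simple arithmetic. The assumption here is that an embryo may be laid at any point in the laying period and processed at any point in the sorting period; the mapping from age to nuclear cycle is not attempted, since it depends on conditions not stated in the response.

```python
# Durations stated in the response (minutes).
LAYING_MIN = 30
INCUBATION_MIN = 50
SORTING_MIN = 30

def age_window(laying, incubation, sorting):
    """Return (youngest, oldest) possible embryo age in minutes at processing."""
    # Youngest: laid at the very end of laying, processed at the start of sorting.
    youngest = incubation
    # Oldest: laid at the very start of laying, processed at the end of sorting.
    oldest = laying + incubation + sorting
    return youngest, oldest

youngest, oldest = age_window(LAYING_MIN, INCUBATION_MIN, SORTING_MIN)
print(f"embryo age at processing: {youngest}-{oldest} min after egg laying")
```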
  
  Surely any maternally inherited information will also be present in cycle 14 or 15 embryos if it is to influence the development or physiology of the brain. The observed differences in global H3K27me3 levels in F1 vs ND flies could be explained by slightly different aged embryo collections or technical variations in the ChIP protocol. The authors could strengthen their conclusion by performing more ChIP replicates. Alternatively, the authors could use orthogonal approaches like antibody staining or western blots to measure global H3K27me3 levels in precisely staged embryos.
  
  We chose to use cycle 10-12 embryos because we aimed to identify epigenetic modulations directly transmitted through the maternal germline. Embryos in cycle 14-15 might reveal more profound changes, but since embryos in that stage had entered the zygotic phase and started the remodeling of histone modifications, we think it might mask the maternally transmitted changes we sought to identify.
  
  In addition, we conducted two biological replicates for each group for the ChIP-seq analysis, which is standard in the field (Zenk et al., Nature, 2021, Ing-Simmons et al., Nature Genetics, 2021). In the current study we further verified the genes identified in the ChIP-seq analysis by RNA-seq and qPCR analysis.
  
  We further verified the ChIP-seq results by western blot, which showed a ~2-fold increase in H3K27me3 modification in HSD-F1 early embryos vs. ND-fed embryos, in line with the ChIP-seq data (Figure 2-figure supplement 1. B). We have also provided immunofluorescence results for embryos at cycle 13 and cycle 14, which clearly showed a significant increase in H3K27me3 modifications in HSD-F1 embryos (Figure 2-figure supplement 1. C).
  
  5) The authors measure PRC2 subunit mRNA levels in adult fly heads to attempt to explain the observed differences in inherited H3K27me3 levels in fly embryos. The authors should examine PRC2 components in germ cells and early embryos to understand how germ cells and early embryos generate H3K27me3 patterns.
  
  We have now shown that Pcl and E(z) mRNA expression in HSD-F0 flies was not significantly changed vs. ND-fed flies (Figure 2-figure supplement 2. D-G). Meanwhile, the H3K27me3 demethylase UTX and the H3K27ac acetyltransferase Cbp showed significant decreases (Figure 2-figure supplement 2. H). Therefore, HSD exposure imposed complex epigenetic modifications in HSD-F0 flies, which then led to transmission of epigenetic marks to their progeny. Given that the main scope of this study was to understand which epigenetic program mediated the behavioral transgenerational inheritance upon ancestral HSD exposure (rather than the one mediating the response to acute HSD exposure), we focused our efforts on H3K27me3, which was significantly changed between HSD-F1 flies vs. ND-fed flies.
  
  6) The RNAi experiment targeting PRC2 components in embryos is uninterpretable without appropriate controls and an explanation of the genotypes used in the experimental paradigm. Are the authors crossing nosNGT mothers to UAS-RNAi fathers and assaying the progeny? What is the genotype of the F1 flies and how does it compare to the genotype of the ND flies? The authors should also note that the Gal4 drivers they use are not necessarily restricted to the ovary, and could directly affect other tissues controlling PER like neurons and muscle. Additionally, the authors should supply the appropriate controls to verify that their experimental paradigm has the intended effect. PRC2 proteins are presumably loaded into embryos and would be immune to zygotic-expressed RNAi. The authors could validate when PRC2 RNAi is effective by staining embryos for H3K27me3.
  
  We have now added schematic diagrams and detailed explanations in our revised manuscript to better explain the RNAi experiments (Figure 3-figure supplement 1. A). As shown in the diagram, we compared each RNAi treatment group to appropriate genetic controls. We have also noted in the manuscript that the GAL4 drivers we used were not restricted to the ovary.
  
  We have now verified the effect of PRC2 knockdown to reduce H3K27me3 in female germline by both western blot and immunofluorescence staining (Figure 3. B-C).
  
  7) Although the authors do not note this, nosNGT>RNAi affects the PER of ND flies (compare Gal4>RNAi to just RNAi or just Gal4 in ND columns in Fig3A-D). This could be due to RNAi expression in neurons or muscles or some other indirect effect. Regardless of the mechanism, this result makes it difficult to interpret how RNAi treatments affect the transgenerational inheritance of PER if there is an equivalently strong nontransgenerational effect.
  
  Although nosNGT>RNAi appeared to slightly affect PER response of ND-fed flies, there was no statistically significant difference (Figure 3-figure supplement 1. B and D, Figure 3-figure supplement 2. A-B). Rather, the effect of E(z) knockdown was evident in HSD-F1 flies (Figure 3-figure supplement 1. B), further confirming the involvement of H3K27me3 in transgenerational inheritance of PER reduction.
  
  8) The matalpha gal4 experiment is inadequately explained in the text or methods. Are the authors expressing RNAi in the ovaries of the F0 flies that are fed an HSD? Does the ovary influence their PER somehow? Similar to point 7, there appears to be a nontransgenerational component to the RNAi phenotype that clouds the interpretation of the transgenerational effect (compare F0 in S3.1A-C).
  
  We have now added a schematic diagram and detailed explanations in our revised manuscript to better explain the RNAi experiments (Figure 3. A). As shown in the diagram, we compared each RNAi treatment group to appropriate genetic controls.
  
  Similar to point 7, although Mat-tub-GAL4>RNAi might seem to affect PER responses of ND-fed flies, there was no statistically significant difference (Figure 3. D-E). Rather, the effect of E(z) knockdown was evident in HSD-F1 flies (Figure 3. D), further confirming the involvement of H3K27me3 in transgenerational inheritance of PER reduction.
  
  9) For the EED inhibitor experiments (both PER and calcium imaging), it is unclear whether the authors fed the mothers or their adult progeny the EED inhibitor. If adult progeny were fed, what tissues were affected? The authors should stain various tissues with an H3K27me3 antibody to verify the effectiveness of their inhibitor. Finally, the effect of the EED inhibitor on calcium imaging was not convincing because the variation was so large.
  
  We have added a new schematic diagram and provided more detailed explanations in the manuscript for pharmacological interventions (Figure 4. A-D). To verify the effect of the drug treatment, we showed that compared to the control group fed with DMSO, flies fed with the inhibitor showed a significant decrease in H3K27me3 levels, demonstrating the effectiveness of the inhibitor (Figure 4-figure supplement 1. A).
  
  We acknowledged the unsatisfactory quality of our calcium imaging experiments in our initial submission. We have now improved our experimental procedures to reach better data quality, while the conclusions remained consistent (Figure 4. E).
  
  10) In all of the PRC2 RNAi and inhibitor experiments, are there any other phenotypes that would suggest that the treatments are working? There are many published PRC2 loss-of-function phenotypes (molecular and developmental) in different tissues. The authors could assure the reader that their treatments are working as expected by doing these controls.
  
  As discussed above, we have now used western blot and immunofluorescence staining to validate the efficiency of PRC2 RNAi in female germline (Figure 3. B-C).
  
  11) The authors propose that a transgenerationally inherited state of the caudal gene is responsible for the transgenerationally inherited PER. However, the experiments investigating the methylation state and expression level of caudal are unconvincing. Cad mRNA abundance varied immensely in the ND RNAseq samples. When the authors compared cad levels across generations, the effect size was small. A single outlier in the ND sample in both the RNAseq and the RTPCR experiments appears to drive up its mean and effect size. The H3K27me3 ChIP on cad is very similar in the F1 and ND samples and the acetylation peak on its promoter appears unchanged. The authors could vastly improve the caudal experiments in this paper by simply using cad antibodies to stain the relevant tissues that contribute to PER. For example, the authors could stain GR5a neurons for cad expression in different generations that inherit (or don't inherit) maternal PER to more accurately determine if cad levels are indeed transgenerationally regulated. The authors could also perform more ChIP experiments at a less variable stage to convincingly correlate epigenetic marks on cad with its expression level.
  
  As discussed above, we conducted two biological replicates for each condition of the ChIP-seq analysis, which is standard in the field (Zenk et al., Nature, 2021, Ing-Simmons et al., Nature Genetics, 2021). We have also performed western blot and immunofluorescence for H3K27me3 in ND vs. HSD-F1 embryos to further validate our ChIP-seq data (Figure 2-figure supplement 1. B-C).
  
  As for the Cad gene, H3K27me3 signals showed a statistically significant difference between ND-fed and HSD-F1 flies (Figure 5. D). We have also conducted additional qPCR experiments to verify the gene expression changes of the Cad gene (Figure 5. F, right), which were in line with the ChIP-seq data and further supported its validity.
  
  It is worth noting that during the developmental time window of our ChIP-seq analysis, the acetylation signals in the promoter region of cad were very low (Figure 5. D), making a comparison impossible.
  
  Reviewer #3 (Public Review):
  
  Jie Yang et al. investigated the transgenerational behavioral modification of a high-sugar diet (HSD) in Drosophila and revealed the underlying molecular and neural mechanisms. It has been reported that HSD exposure decreases sweet sensitivity in gustatory sensory neurons, resulting in reduced sugar response (Proboscis extension reflex, PER) in flies. The current study reports that this effect can be transmitted across generations through the maternal germline. Furthermore, the authors show that H3K27me3 modification is enhanced in the first-generation progenies of HSD-treated flies (F1), and genetical or pharmacological disruption of PCL-PRC2 complex blocks the behavioral change and restores the sweet sensitivity in the Gr5a+ sweet sensory neurons. The authors further analyze the differentially expressed genes in the F1 flies. Among H3K27me3 hypermethylated regions, they focus on homeobox genes and find a transcription factor Caudal (Cad), which shows decreased expression in the F1 flies. Knocking down Cad in Gr5a+ neurons results in decreased PER response to sucrose.
  
  Transgenerational changes in physiology and metabolism have been broadly studied, while inherited changes at the behavioral level are much less investigated. This work provides convincing evidence for transgenerational modification of feeding behavior and digs out the underlying molecular and neural mechanisms. However, there still are several concerns that need to be clarified.
  
  1) The epigenetic regulator PRC2 has been found to play an essential role in the 7d-HSD-induced modification of the PER response. In this study, it's important to clarify, for the transgenerational change, whether epigenetic modification is required in the flies exposed to HSD (F0), the progeny (F1), or both. It would be very helpful for better interpretation if the procedures of HSD treatment in RNAi experiments and the drug treatments were stated in more detail. In addition, the F0 flies should be examined as the control.
  
  In the current study, our main aim was to understand the transgenerational influence of HSD exposure on the progeny. To this end, we chose to study the physiological and behavioral differences between ND-fed flies and HSD-F1 flies (and their progeny on ND). HSD-F1 flies (and their progeny) were not exposed to HSD at any point in their life cycle, and therefore the physiological and behavioral changes we observed relative to ND-fed flies could be solely attributed to epigenetic modifications transmitted via germline cells from HSD-F0 flies. Therefore, ND-fed flies were used as the main control.
  
  As for HSD-F0 flies, the acute effects of HSD exposure could be more complex. Epigenetic factors were likely involved, as evident in Figure 3-figure supplement 1. C, Figure 3-figure supplement 3. A-B and Figure 4. C. In addition, HSD exposure might also directly affect gene expression and multiple signaling pathways in HSD-F0 flies (see Chen et al., Science China Life Sciences, 2020). Therefore, we did not aim to investigate how HSD exposure affected HSD-F0 flies in the current study. We have added further discussion to the manuscript for clarification.
  
  That said, we still added more HSD-F0 flies as controls when needed (Figure 2-figure supplement 2. D-G, Figure 3-figure supplement 1. C, Figure 4. C, Figure 5. F, left).
  
  We have also modified the schematic diagrams and added more detailed explanations in the manuscript, in order to provide a clearer illustration of the experimental procedures (Figure 3. A, Figure 3-figure supplement 1. A, Figure 4. A, B and D). Specifically, we employed two different RNAi approaches. Firstly, we used genetic methods to obtain homozygous Mat-tub-gal4>UAS-gene X RNAi fly lines on chromosomes Ⅱ and Ⅲ for germline-specific knockdown (Figure 3, Figure 3-figure supplement 3). Secondly, we used heterozygous nosNGT-gal4>UAS-gene X RNAi flies for embryo-specific knockdown (Figure 3-figure supplement 1 and 2). Our drug experiments involved both treating the flies and measuring their PER (Figure 4. A-C) and treating the parental flies and measuring the PER of their progeny (Figure 4. D).
  
  2) The information on the drug treatment period is also missing for imaging experiments (Fig.4C). Moreover, the response curve is very different from those recorded in the same neurons in previous studies. What’s the reason? Please also provide a representative image showing which part of the Gr5a neurons is recorded.
  
  The experimental procedures for the drug treatments are now shown in Figure 4. A. We fed adult flies with specific compounds for five days after eclosion, and then measured the calcium signals of Gr5a+ neurons when flies were fed with sucrose.
  
  As suggested by the reviewer, we have now conducted calcium imaging experiments more carefully and thoroughly. We have now added the new data into the revised manuscript and the conclusions remained consistent (Figure 4. E). We recorded the calcium signal in the axons of Gr5a+ neurons in the SEZ.
  
  3) It's unclear whether the decreased Cad expression upon HSD treatment specifically occurred in Gr5a+ neurons or in a lot of cells. If the change in gene expression is significant in the qPCR test, it should occur in a large number of cells, most likely including different types of gustatory sensory neurons. If lower cad expression led to lower neural response and thereby lower behavioral response, how does it specifically decrease the PER response to sucrose but not to other tastes? That is, is HSD-induced desensitization specific to sucrose in the offspring?
  
  We agree that Cad expression might decrease in a lot of cells, including Gr5a+ neurons in the proboscis. In order to investigate whether taste perception other than sweet sensing was also affected, we conducted PER experiments with fatty acids, which are another type of appetitive taste cue, like sugars. Perception of fatty acids is mediated by ionotropic receptors such as ir25a, ir76b, and ir56b (Ahn et al., eLife, 2017; Brown et al., eLife, 2021).
  
  Our results indicate that the PER to fatty acids in HSD-F0 and HSD-F1 flies was not significantly reduced compared to the ND-fed controls (Figure 1-figure supplement 2. E-F). This suggests that the impact of Cad on gustatory sensory neurons might be specific to the sweet sensitivity of Gr5a+ neurons.
  
  4) In Fig.2D, data are sorted for genomic regions showing an up-regulated modification of H3K27me3. It's unclear whether similar sorting was performed in panel C. This needs to be clarified.
  
  The analyses shown in Figure 2C and 2D are linked. For 2C, we identified genomic loci with enriched H3K27me3, H3K9me3, and H3K27ac peaks, and found that H3K27me3 peaks showed the most robust changes between ND-fed and HSD-F1 flies. Therefore, we concentrated on the loci where H3K27me3 modifications were significantly changed between the two groups, and further analyzed their differences. As shown in Figure 2D, within these loci, H3K27ac modifications, which are functionally antagonistic to H3K27me3, were significantly reduced, whereas H3K9me3 signals remained unchanged. These results confirmed that ancestral HSD exposure induced robust H3K27me3 modifications at certain genomic loci.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.15.524137v1
www.biorxiv.org www.biorxiv.org

New submission 13/08/2023, 11:21:04

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The paper proposes a novel approach, named ModCRE, which utilizes structure-based learning to predict the DNA binding preferences of transcription factors (TFs). The authors integrate both experimental knowledge of the structures of TF-DNA complexes and large amounts of high-throughput TF-DNA interaction data. Additionally, the authors have developed a server that automatically produces these characteristics for other TFs and their complexes with co-factors.
  
  Strengths: The paper's integration of experimental knowledge and high-throughput data to develop statistical knowledge-based potentials to score the binding capability of TFs in cis-regulatory elements is a powerful strategy. The proposed approach can be applied to more than 80% of TF sequences, making it a general method for characterizing binding preferences.
  
  Weaknesses: The paper is difficult to follow, as it contains many technical details and implementation details. The method applied is not always clear, and the paper focuses on implementation rather than the message. The results indicate that the nearest neighbors approach in Figure 4 outperforms the proposed method in many cases, and the proposed method seems to perform better only when similarity with the target is low. The same applies in Fig. 5 when using normalized ranked scores.
  
  It appears that the authors have successfully developed a structure-based learning approach for predicting DNA binding preferences of transcription factors. However, the paper's technical language and implementation focus make it challenging to follow at times.
  
  It seems the authors have successfully achieved most of their aims in improving predictions for TF-DNA interaction, and the results support their conclusions.
  
  This work has the potential to significantly impact the field of TF-DNA binding and gene regulation, particularly for those interested in predicting PWMs for TFs with limited or unreliable experimental data.
  
  General comment: We wish to thank the reviewer for his/her comments, which have helped us to make the manuscript easier to read, clarify the ideas and improve it overall. We also thank him/her for the comments on the strengths. In the current revision we have tried to correct the faults and address the weaknesses. Certainly, the results section contained many explanations of the method and its implementation rather than its use and application. With regard to Figures 4 and 5, the reviewer is also right: our approach can help to predict the binding motif of a transcription factor in difficult cases, when the PWMs of the closest homologs are unknown but the structure of its complex with DNA can be provided. Otherwise, when binding information is available for close homologs, traditional state-of-the-art approaches perform better than ours and we recommend them.
  
  Reviewer #2 (Public Review):
  
  This work describes the development of a new structure-based learning approach to predict transcription factor binding specificity and its application in the modeling of regulatory complexes in cis-regulatory modules. The development of accurate computer tools to model protein-DNA complexes and to predict DNA binding specificity is a very relevant research topic with significant impact in many areas.
  
  This article highlights the importance of transcriptional regulatory elements in gene expression regulation and the challenges in understanding their mechanisms. Traditional definitions of activating regulatory elements, such as promoters and enhancers, are becoming unclear, suggesting an updated model based on DNA accessibility and enhancer/promoter potential. Experimental techniques can assess the sequence preferences of transcription factors (TFs) for binding sites. Recent models propose a cooperative model in which regulatory elements work together to increase the local concentrations of TFs, RNA polymerase II, and other co-factors. Co-operative binding can be mediated through protein-protein or DNA interactions. The authors developed a structure-based learning approach to predict TF binding features and model the regulatory complex(es) in cis-regulatory modules, integrating experimental knowledge of structures of TF-DNA complexes and high-throughput TF-DNA interactions. They developed a server to characterize and model the binding specificity of a TF sequence or its structure, which was applied to the examples of interferon-β enhanceosome and the complex of factors SOX11/SOX2 and OCT4 with the nucleosome. The models highlight the co-operativity of TFs and suggest a potential role for nucleosome opening.
  
  The results presented by the authors have a large variability in performance upon the different TF families tested. Therefore, it would be ideal if the performance/accuracy of the method is tested in some simple predictions and validated with prospective experimental data before applying it to model difficult scenarios such as those described here: SOX11/SOX2/OCT4 and nucleosome or interferon beta and enhanceosome. This will give more support to the models generated and thus the validity of the conclusions and hypothesis derived from them.
  
  General comment: We wish to thank the reviewer for his/her comments; we really appreciate them and the opportunity to run new tests with our approach. Some of his/her comments coincide with those of reviewer 1. When this is the case, we will refer to our previous answers and modifications in the manuscript. In this revision we have included new tests to validate the approach using available and published experiments different from the ones used in the original submission. We hope the new information is sufficient to support our approach.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.04.17.488557v3
www.biorxiv.org www.biorxiv.org

New submission 13/08/2023, 11:12:29

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Davies et al. examined the role of the malaria parasite's FIKK4.1 protein kinase in trafficking and host membrane insertion of key proteins that are exported by the intracellular P. falciparum parasite. FIKK4.1 is one of 18 FIKK serine/threonine kinases exported into the host erythrocyte; these kinases phosphorylate both host proteins and exported parasite proteins. FIKK4.1 has previously been implicated in rigidification of the erythrocyte cytoskeleton. It is also known to affect trafficking and insertion of PfEMP1, the parasite's primary cytoadherence ligand, on the host cell surface. In the present studies, the authors perform sophisticated gene-editing experiments that combine conditional knockout of FIKK4.1 with tagging of two kinase targets with the TurboID proximity biotin-labeling enzyme to explore phosphorylation-dependent changes in target protein localization, structure, or protein-protein interactions. Using conditional knockout of each exported FIKK kinase, they determine that FIKK4.1 is the only kinase that regulates PfEMP1 surface exposure and that it does not appear to modulate surface translocation of RIFINs, a family of parasite antigens involved in immune evasion. The combination of gene-editing, proximity labeling and mass spectrometry, and biochemical studies in the paper is to be lauded. These findings identify key targets of exported kinases and will guide future studies of host cell remodeling.
  
  Key limitations of the study:
  
  1) TurboID tagging of FIKK4.1 followed by proximity labeling and mass spectrometry of biotinylated proteins revealed parasite-stage dependent labeling of 101 parasite proteins and 39 human proteins that come in contact with FIKK4.1. Although TurboID is a more efficient biotin ligase produced through directed evolution, nonspecific biotinylation of proteins that do not form biologically relevant interactions remains an issue. Biotin addition for 4 hours, as used here and in most studies using this ligase, allows for labeling of proteins that undergo random collisions with the TurboID-tagged protein. While there was clear enrichment of exported proteins in the FIKK4.1-tagged parasite at mature schizont stages when FIKK4.1 is in the host cytosol, only 66% of the proteins labeled were exported, consistent with labeling and recovery of irrelevant proteins. As the authors performed appropriate controls and interpreted their findings cautiously, this limitation results primarily from finite efficiency of TurboID, trace levels of endogenous biotin within cells, and other complexities associated with the technology.
  
  We agree with the reviewer that there are limitations to TurboID and that the mere presence of a protein in a dataset does not imply functional relevance (which is also true for IP data). However, it is highly complementary to data obtained through other methods (in our case previous cytoadhesion data and phosphoproteome data) and, as we show here, can give high-resolution information on the local protein environment of a protein. This is illustrated by highly significant protein-specific interaction datasets for PTP4 and KAHRP obtained from biological triplicate experiments. The site-specific protocol we use later in the paper allows us to eliminate unbiotinylated proteins non-specifically binding to beads, which is a major advantage, evidenced by the much higher ratio of exported proteins observed in the PTP4- and KAHRP-TurboID datasets.
  
  2) The production of dual-edited parasites carrying conditional knockout of FIKK4.1 and TurboID tagging of either KAHRP or PTP4 permitted examination of changes in localization of exported proteins upon their phosphorylation by FIKK4.1. KAHRP and PTP4 are excellent choices for these experiments because they are established targets of the kinase and good candidates for effectors involved in PfEMP1 membrane insertion. Some 30-40 proteins exhibited significant changes in biotinylation by these TurboID-tagged proteins, suggesting altered localization or structure upon loss of FIKK4.1 kinase activity. PfEMP1 trafficking proteins (PTPs), Maurer's cleft proteins, exported heat shock proteins, and components of PSAC, a parasite-associated nutrient uptake channel, all exhibited changes. Although FIKK4.1 is not essential for in vitro parasite propagation, altered localization could result either directly from changes in phosphorylation status of the protein itself or could reflect indirect effects on the cell from loss of FIKK4.1.
  
  The reviewer is correct that we cannot exclude the possibility that the observed changes result not only from the loss of FIKK4.1-mediated phosphorylation but also from effects of losing the FIKK4.1 kinase domain on the localisation of other proteins. Conditional inactivation of the FIKK4.1 kinase domain while retaining the overall protein would have been a more elegant approach. However, we do not predict the kinase domain of FIKK4.1 to be a strong structural component, given that kinase domains often evolved to have low-affinity interactions with their multiple targets and are less likely to act as scaffolds. As the reviewer points out, we observed no growth defect upon deletion of FIKK4.1. Therefore, we can be quite certain that the observed changes are not due to indirect effects caused by differences in growth but are a direct effect of the loss of the kinase domain and FIKK4.1’s enzymatic activity.
  
  3) As a consequence of these two limitations, these experiments could not conclusively implicate either KAHRP or a specific PTP in PfEMP1 surface translocation. Whether specific Maurer's cleft proteins or the nutrient channel components contribute to PfEMP1 surface translocation could also not be addressed. The authors' Discussion section is appropriately cautious in interpreting changes in biotinylation upon FIKK4.1 disruption. Although a large amount of data has been generated in this sophisticated study, the precise mechanism of PfEMP1 trafficking and membrane insertion remains elusive.
  
  We agree with the reviewer that we do not definitively explain the mechanism of FIKK4.1 in PfEMP1 surface translocation. However, we identify several promising candidates for modulating its effect, some of which (for example PTP4) have previously been shown to be relevant for PfEMP1 surface translocation. We also identify unexpected proteins which can now be investigated further. New methods in high-resolution cryo-EM imaging may allow us to image individual protein density in knobs and visualize the observed changes in the future. Further PerTurboID experiments with individual components will likely draw an ever finer picture. Here we focus on emphasising the potential of PerTurboID for identifying connections between proteins and for observing changes to protein characteristics which would be missed by other techniques.
  
  Reviewer #2 (Public Review):
  
  Davies et al combine TurboID with conditional mutagenesis to reveal how a perturbing event alters the accessibility of a sub-cellular proteome to proximity biotinylation. The approach builds on established techniques for antibody-mediated enrichment of biotinylated peptides (rather than purification of whole biotinylated proteins by avidin) to enable mapping of the specific lysines that are biotinylated by TurboID and how access to these sites changes between conditions. The insights gained have a range of potential implications touching on protein trafficking/localization, complex dynamics and membrane topology. The authors apply this strategy to study trafficking of the key P. falciparum adhesin PfEMP1 to the infected erythrocyte surface. This group has previously shown that the exported parasite kinase FIKK4.1 is important for this process but the specific mechanism is unknown. In the first part of the present study, the authors develop PerTurboID and analyze the altered biotinylation patterns upon FIKK4.1 deletion in parasite lines bearing TurboID tags on PTP4 or KAHRP, two proteins required for this pathway and likely direct substrates of FIKK4.1. Numerous changes in site-specific biotinylation are quantitatively assessed on hundreds of proteins and possible implications for these changes are discussed, including topology of parasite integral membrane proteins exported into the RBC compartment as well as how the conformation of the RhopH complex might be altered upon RBC membrane integration. In a final set of experiments, the authors show that among 18 exported FIKK kinases, FIKK4.1 is uniquely important to PfEMP1 surface display but not to the distinct RIFIN class of parasite proteins that are also trafficked to the RBC surface. On the whole, the data are compelling and provide an important new approach that advances the proximity labeling toolkit.
  
  While the resolution of PerTurboID captures the site-specific changes in biotinylation abundance and position that occur upon loss of FIKK4.1, a limitation of the study is that these observations do not necessarily clarify the model for how FIKK4.1 is controlling the PfEMP1 trafficking pathway. The authors convincingly show that FIKK4.1 uniquely supports PfEMP1 surface presentation and cytoadhesion. However, this is not connected to the PerTurboID data in a way that provides a mechanism for how this is achieved by FIKK4.1 activity and in my opinion doesn't deliver on the title claim to "reveal the impact of kinase deletion on cytoadhesion". Certainly the changes in biotinylation suggest a range of interesting possibilities related to the accessibility and topology of proteins within and beyond the PfEMP1 trafficking pathway; however, it is hard to interpret the relationship of these changes to the process in view. For instance, deletion of FIKK4.1 increases biotinylation of several Maurer's cleft proteins in both the PTP4- and KAHRP-TurboID experiments but why this is or whether it is significant for PfEMP1 transport is unclear.
  
  We agree with the reviewer that we do not definitively confirm the relationship between the changes observed in protein accessibility and the role of FIKK4.1 in PfEMP1 transport. We discuss a number of likely options based on what is known of the candidate genes, but validation would require extensive further work beyond the scope of this paper. We have focussed on demonstrating the value of PerTurboID as a technique for measuring molecular-level changes which would be missed by other methods, providing a list of proteins which are likely involved in modulating FIKK4.1 activity and PfEMP1 trafficking through an interconnected network. We believe the technique will be very useful for understanding gene function in other scenarios. However, we changed the title to be more specific to proteins in the cytoadhesion complex and associated proteins, and not cytoadhesion per se.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.02.526785v1
www.biorxiv.org www.biorxiv.org

New submission 29/09/2022, 12:30:56

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The finding that taste memory formation follows the same or highly similar logic and mechanisms as olfactory memory is very interesting. In particular, the new approach to use an operant learning assay developed by the authors to address this outstanding question in the field is very impressive. The shown data are of high quality and very convincing.
  
  While the current version will be of clear interest to fly people dissecting memory formation, it might be less accessible outside this immediate field. Below I list my suggestions, questions and criticisms.
  
  You have developed an operant assay and stress this in the introduction. This is important because it allows you to gain much better insight into how memory is formed and how it is recalled. Nevertheless, I was somewhat disappointed that you did not exploit that aspect more in your study. First, I suggest showing, at least for the initial figures, the traces (e.g. Fig 1D) not only for the test phase but also for the training phase. As you also mention in your discussion, the extent of memory formation will depend critically on the number of pairings during training. And perhaps not only on their number but also on their evolution/change over time. Second, you only show preference indices. I suggest showing the number of actual interactions with the food source in addition. In my opinion and experience, the preference index can be misleading or at least the interpretation might be questioned if the number of actual choices is very low or very high compared to controls or other groups. Third, regarding the same point, you show traces for test phases, but you do not comment or discuss why they might look the way they look. For instance, it appears that in some cases it takes a while to see an actual difference in the preference index while at other times it seems more instantaneous etc.
  
  We have now added plots showing the preference indices over time during both training and testing for all the experiments in Figures 1 and 2. We also comment in the text on our view of their interpretation. Although we recognize that interesting features of the learning process could be revealed by examining the process over time, we also caution that earlier timepoints are inherently less robust because fewer measurements contribute to them (flies tend not to take many sips of the food over the first several minutes). Thus, emergence of a preference after a period of time may not reflect an evolution of the preference as much as a firming up of the data as more sips are recorded. As a notable example, our data in Figure 1E,G show close to a zero preference for activation of sweet sensory neurons during the first 10 minutes of training, despite the innately appetitive nature of this manipulation. This is undoubtedly because it takes some time for flies to sample both choices and build up enough interactions to show a clear preference. This is not to say that the curves are never informative, however. For example, it is reassuring to see that activation of PAM neurons does not produce a positive preference at any time during training (Figure 2F).
  
  We have also added the raw sip/interaction numbers for the experiments in Figure 1 in order to provide an example of how these data relate to the preference. Your concern about reliability differing depending on choice number is certainly warranted (as we also discuss above). However, the raw data does not suggest a major difference in the overall number of choices being made between groups.
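  The sample-size point above can be illustrated with a minimal sketch. It assumes the preference index is defined as (sips on option A minus sips on option B) divided by total sips; that definition, the function names, and the per-bin counts are assumptions for illustration only and are not taken from the study.

```python
from itertools import accumulate
from typing import List, Optional, Sequence, Tuple


def preference_index(sips_a: int, sips_b: int) -> Optional[float]:
    """Return (A - B) / (A + B), or None when no sips were recorded."""
    total = sips_a + sips_b
    if total == 0:
        return None  # undefined: no interactions yet
    return (sips_a - sips_b) / total


def cumulative_preference(
    sips_a_per_bin: Sequence[int], sips_b_per_bin: Sequence[int]
) -> List[Tuple[Optional[float], int]]:
    """For each time bin, return (cumulative preference index, cumulative sip count)."""
    cum_a = accumulate(sips_a_per_bin)
    cum_b = accumulate(sips_b_per_bin)
    return [(preference_index(a, b), a + b) for a, b in zip(cum_a, cum_b)]


if __name__ == "__main__":
    # Hypothetical sip counts per time bin (not data from the study).
    sips_a = [1, 4, 9, 12]
    sips_b = [1, 2, 3, 4]
    for index, n_sips in cumulative_preference(sips_a, sips_b):
        print(f"cumulative sips = {n_sips:3d}, preference index = {index}")
```

  With these made-up counts, the first bin rests on only two sips and gives an index of zero, while later bins rest on many more sips, so reporting the index alongside the cumulative sip count makes clear how much weight each timepoint can bear.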
  
  Along the same lines, I am wondering why you do not observe extinction. Frequently if the CS is re-experienced without the US over several trials, you start to see memory fade. The preference traces as well as the actual interactions might help to explain this.
  
  This is an interesting question, and one that we have certainly wondered about. Our assumption is that the number of exposures to the CS+ during testing is not sufficient to induce extinction. It would be interesting to run a longer testing period to see whether extinction occurs over a longer time course; however, we have not done so at this point.
  
  You use salt as a negative US. I suggest showing at least one experiment with bitter taste (e.g. quinine) to show how general your finding is to negative conditioning. Your optogenetic data suggests it is.
  
  We actually never use natural taste stimuli as the US; we only use salt as the CS+ in our appetitive learning experiments. We have revised the figures and figure legends extensively for clarity and one of the changes is to try to make it clearer what is the CS+ and CS- in each experiment.
  
  You analyze the role of energy state in memory formation. This is very interesting. In light of the importance of feeding state, it would be very helpful to include starvation/metabolic state information not only in the methods but also in the results section (at least briefly).
  
  We have now indicated in all the figure legends and in the text that flies were all food deprived for 24 hours prior to training.
  
  Your data convincingly shows that taste memory is formed in the mushroom body. For instance, you show that inhibition of KCs prevents the change in preference. KC inhibition was done during the entire experiment (training and test). Thus, it's important to show how KC inhibition affects (or does not) training vs. test.
  
  We appreciate the motivation for this suggestion and how extensively this issue has been explored in olfactory classical conditioning. We also agree that it would be interesting to perform this experiment. However, the practical logistics of doing this experiment were not possible with the constraints we were under. We unfortunately don’t currently have the means to operate the STROBE at a temperature high enough to effectively silence neurons using shibire(ts), and silencing with optogenetics is not possible with our current setup either. Thus, we will need to leave this issue unresolved for the time being.
  
  Along the same lines, how do you envision this memory formation to happen at the circuit level? KCs and DANs are likely activated by CS and US. It would be important to at least include a paragraph in the discussion to clarify this.
  
  The bulk of our characterization of this assay (including the demonstration that KCs are required) was done with 75 mM NaCl as the CS+ and optogenetic activation of PAM neurons as the US. Previous studies have shown activation of KCs by tastes (Kirkhart and Scott, 2015), so we believe that KCs are being activated by the CS+ and DANs are being activated by the US (in this case directly through optogenetics). Based on a great deal of beautiful work in olfactory classical conditioning, we believe it is likely that this coincident activation leads to plasticity at KC-MBON synapses, thereby skewing the behaviour in favor of attraction. We have now tried to clarify this mechanism in the paper.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2021.11.12.468444v3
www.biorxiv.org www.biorxiv.org

New submission 21/08/2023, 09:55:23

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  We express our sincere gratitude to the editors and reviewers for their invaluable input. To further improve our manuscript, we have devised a plan to perform additional histological experiments of Bdnf and TrkB expression. Specifically, we will replace the phospho-TrkB antibody with an anti-TrkB antibody to quantify Bdnf/TrkB co-expression. Moreover, we acknowledge the concerns raised by the reviewers regarding the clarity of some explanations and the potential contribution of alternative mechanisms to the defects observed in Bdnf neurons. We aim to provide a clearer explanation and discussion. We also intend to provide a more comprehensive discussion of the limitations of our LM22A-4 drug treatment experiment. By addressing these points, we wish to ensure that our research is informative to the eLife readership.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.07.30.551185v2
www.researchsquare.com www.researchsquare.com

New submission 21/08/2023, 09:37:12

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disorder leading to the loss of innervation of skeletal muscles, caused by the dysfunction and eventual death of lower motor neurons. A variety of approaches have been taken to treat this disease. With the exception of three drugs that modestly slow progression, most therapeutics have failed to provide benefit. Replacing lost motor neurons in the spinal cord with healthy cells is plagued by a number of challenges, including the toxic environment, inhibitory cues that prevent axon outgrowth to the periphery, and proper targeting of the axons to the correct muscle groups. These challenges seem to be well beyond our current technological approaches. Avoiding these challenges altogether, Bryson et al. seek to transplant the replacement motor neurons into the peripheral nerves, closer to their targets. The current manuscript addresses some of the challenges that will need to be overcome, such as immune rejection of the allograft and optimizing maturation of the neuromuscular junction.
  
  Bryson et al. begin by examining the survival of mESC-derived motor neurons allografted into SOD1 mice. The motor neurons, made on a 129S1/SvImJ background, were transplanted into the tibial nerve of SOD1 mice on a C57BL/6J background. Without immunosuppression, most cells were lost between 14 and 35 days, suggesting an immune response had eliminated them. Tacrolimus prevented cell loss, but it also inhibited innervation of the muscle. It also uncovered the tumorigenic potential of contaminating pluripotent cells. In contrast, immunosuppression using H57-597, an antibody targeting T-cell receptor beta, prevented graft rejection while permitting some innervation of muscle. Pretreatment of the cells with mitomycin-C eliminated pluripotent cells, preventing tumor formation. The authors noted that this combination only innervated ~10% of endplates, likely due to the fact that the implanted motor neurons are not active.
  
  The authors then began the process of optimizing the cells themselves, using measurements taken in late-stage SOD1 mice. Fast-firing and slow-firing populations of neurons were first compared. Using optical stimulation, these two cell types appeared to be similar. The authors opted to use slow-firing neurons in the subsequent experiments. Recognizing that neuromuscular junction (NMJ) innervation and maintenance are dependent on motor neuron activity, implantable optical stimulators were also evaluated. 14 days after transplanting the cells, optical stimulation training was initiated for one hour each day. This training led to a nearly 13-fold increase in force generation, although this still remained well below the force generated by electrical stimulation. The enhanced innervation also prevented the atrophy of muscle fibers caused by denervation.
  
  Overall, the data for the function of the implanted cells are convincing. The dCALMS technique that the authors have developed is quite interesting and will likely be applicable to analyze muscles for other therapeutics. The identification of calcineurin inhibitors as inhibitors of reinnervation will also be important for the development of other cell-based therapeutics for ALS.
  
  This is an excellent summary of the state of the field of ALS therapy development and provides a clear rationale for our novel therapeutic strategy, in the near-complete vacuum of conventional treatment options for patients suffering from this devastating disorder. We are delighted that the Reviewer clearly appreciated the value of our alternative therapeutic strategy and found our supporting data to be convincing, as well as drawing attention to the dCALMS technique, which we agree could be of significant value in the investigation of other therapeutic strategies aimed at restoring muscle innervation. We are extremely grateful for the Reviewer’s diligence in assessing our manuscript.
  
  However, there are some issues that should be addressed. These include some common misconceptions about ALS. While ALS is split into familial and sporadic forms based on the presence or absence of a family history of the disease, mutations in the known ALS-associated genes are found in both forms [1]. The authors also state that exercise programs are likely to accelerate degeneration in ALS. This is incorrect. Moderate exercise is part of the current guidelines for treating ALS, and mouse studies have demonstrated a therapeutic effect of moderate exercise [2]. Regarding the experimental design, there are some important details missing. The animals do not appear to have been operated on at the same age, and the criteria for when to perform the operation were not described. A similar problem exists for when the animals were determined to reach endpoint [3]. The authors also do not seem to address a potential pitfall of this approach: acceleration of the disease process. Indeed, some of the data comparing the ipsilateral side to the contralateral side suggest that the implantation of the cells and/or the light source increase the denervation of the muscle [4]. Finally, there is a fairly large difference between the motor output provided by optical stimulation relative to electrical stimulation. It is currently unclear what level needs to be reached to provide an effective response in the intact animal. Thus, it is difficult to determine if the level of reinnervation that this study has achieved will be sufficient to improve a patient's quality of life [5].
  
  The Reviewer raises some extremely important points and highlights some additional constructive issues where more clarity is required (numbered 1-5 above). We have attempted to address each of these points in order to strengthen the key message of our study and the integrity of our manuscript:
  
  1) The Reviewer is absolutely correct in highlighting that causative mutations in identified genes occur in both sporadic and familial forms of ALS and that this classification simply reflects whether or not there is a known family history of the disease (which can also encompass a spectrum of disorders including frontotemporal degeneration). We will revise our manuscript in order to be more accurate and provide clarity on this important point.
  
  2) Regarding the potential acceleration of muscle denervation, we specifically state that the use of electrical nerve stimulation (ENS) to artificially evoke muscle contraction has been shown to accelerate denervation of the diaphragm muscle in clinical trials aimed at maintaining respiratory function in ALS patients, which significantly shortened life-expectancy. It was not our intention to imply that moderate voluntary exercise, as opposed to artificial “ENS-based” muscle stimulation programmes, could accelerate muscle denervation. Indeed, the negative side-effects of ENS that we highlighted provide a clear rationale for developing a safer alternative to artificially control muscle function once innervation by endogenous motor neurons progressively deteriorates in ALS patients; specifically, our selection of optogenetic nerve stimulation (ONS), which is highly selective to the engrafted light-sensitive motor neurons, recruits motor units in the correct physiological order and avoids rapid muscle fatigue, potentially overcomes the safety concerns associated with ENS.
  
  Importantly, unchecked disease progression means that complete paralysis of almost all muscles will eventually occur, due to loss of upper or lower motor neurons and accompanying muscle denervation, which would eventually preclude the ability of ALS patients to undertake voluntary exercise programmes, or even activities of daily life. Our approach is aimed at overcoming this inevitable loss of voluntary muscle control and onset of complete paralysis by providing a safe and effective method of artificially maintaining control of targeted muscles that would otherwise become completely paralyzed, as well as preventing their irreversible atrophy.
  
  To avoid the possibility that readers may infer that we are suggesting voluntary exercise programs accelerate degeneration in ALS and to provide additional clarity, we will revise the manuscript to stress that we specifically refer to “ENS-based” exercise programmes in relation to acceleration of muscle denervation.
  
  3) Regarding our experimental design, the congenic B6.SOD1G93A mouse model of ALS is an extremely well-characterized model, with a highly consistent timeframe of disease phenotype manifestation and progression. In order to maximize the translational value of our study, we selected an early post-symptom onset timepoint (95 ± 4.6 days) that mirrors a time at which human ALS patients would be likely to benefit from the therapeutic strategy: in the vast majority of cases, it is not possible to treat humans until a diagnosis of ALS has been confirmed, which can often take up to 12 months from first presentation. Importantly, ALS patients in the final stages of disease progression would be unlikely to be suitable for this therapy, due to irreversible muscle atrophy, which would preclude the ability of the engrafted motor neurons to form functionally useful connections. Indeed, our strategy is to engraft the replacement motor neurons before severe muscle atrophy occurs, so that they are in place to compensate and take over the function of endogenous lower motor neurons as they progressively degenerate and paralysis ensues. In so doing, the replacement motor neurons could prevent the irreversible atrophy of targeted muscles through ONS-based exercise programmes and thereby indefinitely extend the ability of targeted muscles to perform functionally useful movements.
  
  Although the initial graft optimization component of this study, including the tacrolimus trial, was performed across a variety of disease stages (commencing between 57-101 days of age), once we identified the H57-597 monoclonal antibody as an effective means to promote graft survival (without interfering with target muscle innervation), all subsequent grafts were initiated at an early symptom onset timepoint: 95.7 ± 4.6 days for slow-firing motor neuron grafts and 106.8 ± 7.2 days for fast-firing motor neuron grafts. Transgenic SOD1G93A mice were specifically bred for this study and, due to the complexities of coordinating stem cell differentiation and motor neuron production, optical stimulation device production and access to surgical facilities, with timed matings set up 3-4 months in advance, we feel that this age range was acceptable and doesn’t detract from the findings of our study.
  
  Similarly, we made every effort to ensure that experimental end-point was consistent, at 133 ± 8 days for all grafts involving H57-597 administration, which reflects translationally-relevant late-stage disease progression. Since the physiological experiments performed as part of this study are extremely time-consuming, it was necessary to stagger the experimental end-point over several days. Again, we feel that this range is acceptable and still reflects a consistent, translationally-relevant timepoint. Importantly, since the experimental paradigm tested in this study was aimed at individually targeted muscles, which would have been unlikely to have an effect on disease duration or survival, we did not feel that it was ethically justifiable to allow the B6.SOD1G93A mice to approach end-stage disease (which occurs at an average age of 150 days in this model).
  
  In the interests of full transparency, the age at which treatment commenced and the experimental end-point for every animal used in this study is reported in Supplementary Tables 2 and 3.
  
  4) The Reviewer raises an extremely pertinent question, regarding whether the engrafted motor neurons themselves, or the implanted stimulation device, may accelerate the progressive loss of innervation of targeted muscles by endogenous motor neurons, in light of our data that shows decreased force evoked by electrical stimulation of ipsilateral (engrafted) versus contralateral (control) muscles. It is worth noting that supramaximal electrical nerve stimulation, used to evoke maximal muscle force, should activate both endogenous and engrafted motor neurons, therefore the combined activation of both populations would be expected to result in a summative (greater) contractile response. The fact that we see the converse is unlikely to be due to an accelerated loss of endogenous motor innervation as a result of the engrafted cells, but is much more likely to be caused by physical nerve damage during the surgical engraftment process: we used a customized Hamilton syringe with a 29G needle, which has an outer diameter of 330 μm, to manually inject the cells into the targeted nerve branches, whilst the diameter of the tibial nerve in an adult mouse is approximately 400 μm. This is likely to have led to damage of the endogenous motor (and potentially sensory) axons that may have diminished regenerative capacity due to ongoing disease mechanisms. Fortunately, there is significant scope to refine the engraftment procedure by using smaller gauge needles (potentially made of more flexible materials), bespoke injection systems that can deliver the cells at a controlled rate and micromanipulators that can avoid nerve damage caused by excessive movement of the needle within the nerve. Importantly, the significantly greater scale of human nerves, compared to murine nerves targeted in this study, would also be a significant advantage in terms of physically delivering the cells in ALS patients.
  
  5) The Reviewer’s final comment is entirely justified given that, even in the best cases following optical stimulation training of engrafted SOD1G93A mice, optical stimulation still evoked less contractile force than supramaximal electrical stimulation. The likely reasons for this are complex: there is almost certainly scope to further optimize the optical stimulation training paradigm, which could result in reinforcement of the de novo neuromuscular junctions formed between the engrafted motor neurons and targeted muscle fibres; it is possible that the expression level of the channelrhodopsin-2 protein at the cell surface may require optimization in order to reliably initiate action potentials in the engrafted motor neurons – development of newer channelrhodopsin variants may resolve this potential issue, whilst providing additional advantages (such as enabling transcutaneous stimulation) at the same time. Finally, the maximum contractile response of the triceps surae muscle elicited by optical stimulation that we observed was approximately 13g, which equates to approximately 50% of the body mass of an adult SOD1G93A mouse. Although this is only approximately 10% of the maximal contractile force of a wild-type triceps surae muscle, this would almost certainly provide the ability to perform functionally useful motor tasks if it could be reproduced in ALS patients, particularly if large numbers of targeted muscles could be controlled in a coordinated manner, something that we are actively working on.
  
  Reviewer #2 (Public Review):
  
  The authors provide convincing evidence that optogenetic stimulation of ChR2-expressing motor neurons implanted in muscles effectively restores innervation of severely affected skeletal muscles in the aggressive SOD1 mouse model of ALS, and conclude that this method can be applied to selectively control the function of implicated muscles. This was supported by convincing data presented in the paper.
  
  This is an interesting paper providing new/improved optogenetic methods to restore or improve muscle strength in ALS. In general, it is of high significance in both the techniques and concept, and the paper was well written. The evidence supporting the conclusions is convincing, with rigorous muscle tension physiological analysis, and nerve and muscle histology and image analysis. The work will be of broad interest to medical biologists on muscle disorders.
  
  One weak point is that proper control experiments were not clearly presented - these could be shown in the paper. For example, one control experiment with only YFP but no ChR2 expression with optogenetic stimulation should be performed, following similar procedures and analysis applied to the ChR2-transduced animals.
  
  We are extremely grateful for the Reviewer’s expert appraisal of our manuscript and we are delighted to hear that they found our study to be highly significant, of broad interest and that our supporting evidence for this novel therapeutic approach was convincing and rigorous.
  
  Regarding the inclusion of suggested control experiments, we have extensive negative results data from physiological recordings of muscles in response to optical stimulation in animals where the engrafted motor neurons were rejected (prior to our identification of a 100% effective immunosuppression regimen). This clearly revealed that, in the absence of ChR2-expressing motor neurons, optical stimulation does not elicit any response from the target muscle. However, we do not feel that inclusion of this negative data, which is entirely predictable, would have strengthened the findings of our study. Similarly, if we had engrafted motor neurons that only express YFP, we would have been unable to elicit any muscle contractile activity in response to optical stimulation. As a control, this may have some value in determining the ability of motor neurons derived from other cell lines that do not express ChR2 to survive and innervate target muscles, but we don’t feel that the additional work would get us closer to achieving our ultimate goal of using motor neuron replacement in combination with optogenetic stimulation to restore/maintain muscle function in ALS patients. Moreover, the complex and iterative process of developing the cell line used in this study (reported in detail in our previous study) would make it extremely difficult to produce a suitable control stem cell line expressing only YFP. Having said that, we are actively in the process of developing new, more sophisticated human and mouse stem cell lines, using more translationally-relevant gene targeting methods to stably knock-in a variety of updated channelrhodopsin variants that may have superior properties for our approach. This will be reported in follow-up studies, as we feel that it goes well beyond the scope of the current study.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

researchsquare.com/article/rs-1970365/v2
www.biorxiv.org www.biorxiv.org

New submission 21/08/2023, 09:34:12

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  We appreciate very much your positive assessment and the comments of the two reviewers, all of which will greatly help us to improve our manuscript. In response, therefore, to these constructive comments we will take pleasure in submitting a revised manuscript during the next step of publication.
  
  We take the opportunity to provide a provisional author response.
  
  As for Reviewer #1.
  
  We thank Reviewer 1 very much for her/his very positive and detailed remarks, all of which will be introduced into the revised version of our manuscript.
  
  We will add information about the biological control on the development of phosphatic-shelled brachiopod columns to the introduction so that our later narrative is easier to follow. The Cambrian Explosion is the innovation of metazoan body plans and radiation of animals during a relatively short geological time. The expansion of new body plans in different groups of brachiopods in the early Cambrian was likely driven by the Cambrian Explosion. The columnar shell structures are not developed in living lingulate brachiopods, and thus it is important to get a better understanding of this extinct shell architecture from the fossil record in order to study the evolutionary trend of shell structures and compositions in brachiopods. Furthermore, the adaptive innovation of biomineralized columns in early brachiopods will be discussed in the revised manuscript.
  
  As for Reviewer #2.
  
  We thank Reviewer 2 very much for her/his very constructive and detailed remarks. All the comments have been thoroughly considered, and most of them will be introduced into the revised version of the manuscript.
  
  We agree that knowledge of the shell structures of early linguliform brachiopods is incomplete and that more research would be helpful. We also express the idea in the first part of our manuscript that the shell structural complexity and diversity of linguliform brachiopods (especially their fossil representatives) require further study. As the shell structure and biomineralization process are crucial to unravelling the poorly resolved phylogeny and early evolution of Brachiopoda, in this paper we undertake a primary study of exquisitely well-preserved brachiopods from the Cambrian Series 2. The morphologies, shapes and sizes of cylindrical columns are described in detail in this research, and this work will be useful for further comparative studies. We are very sorry to have missed the important reference paper on brachiopod shells by Butler et al. (2015), which will be added to the revised manuscript. The structure and language of the manuscript will be revised based on the very helpful suggestions.
  
  Concerning the families Eoobolidae and Lingulellotretidae, we are aware of the current problematic situation of these families, and we will add more discussion about the detailed characters of Eoobolidae in the Systematic Palaeontology part of the manuscript. However, the revision of the families Eoobolidae and Lingulellotretidae falls outside the scope of this paper. We prefer to leave it now as it will be part of an upcoming publication based on more global materials from China, Australia, Sweden and Estonia that we are currently working on.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.01.543202v3
www.biorxiv.org www.biorxiv.org

New submission 21/08/2023, 09:30:06

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for thoroughly evaluating our work and for providing constructive and actionable feedback to improve the manuscript. The reviews have left us with a clear direction in which our work can improve, for which we are grateful. We will provide a detailed response to the reviews together with our revised manuscript. At this time, we accept the invitation to provide a provisional reply that addresses the major themes as summarized by the editors.
  
  The goal of our study was to infer an individual’s control strategy from the details of kinematics. We did this using monkey and human data collected under matching experimental conditions. We quantitatively compared these data to simulations that were generated by adapting a reasonable model of sensorimotor function that is standard in the literature. We are pleased that the reviewers and editors felt “that the overall scientific approach is of interest and has scientific merit” and “the approach has promise in aiding future studies that try to link behavior and neurophysiology (allowing homology between humans and primates).”
  
  We agree with the reviewers that additional work is needed to corroborate our main claim that we can unambiguously infer control strategies from behavioral data. This is a known hard problem that we are not the first to address, and we do not claim to have solved it here. We appreciate the suggestions about (1) further testing the classification procedure, (2) considering other metrics that may better distinguish between the control strategies, and (3) investigating the control strategy under perturbation scenarios. We plan to undertake additional simulations, analyses and, in the future, experiments, as suggested by the reviewers to enhance the impact of our work.
  
  In this initial brief response, we wish to focus on one key point noted by the editors, stemming from simulations by one of the reviewers using “a simple fixed controller.” We greatly appreciate that one reviewer went as far as to perform their own simulations. These simulations suggested that subjects do not need to switch between control strategies, but rather could achieve similar behavioral results via “a modest change in gain.” Specifically, the reviewer reports that their simple fixed controller could generate trials that sometimes looked like what we would call position control and sometimes looked like what we would characterize as velocity control. It was noted that “trial-to-trial differences were driven both by motor noise and by the modest variability in gain.”
  
  While we cannot comment with great certainty on the reviewer’s simulation results, since we do not know the specifics, we first wish to note that our controller and experimental subjects demonstrated this same phenomenon, in that there was overlap in the distribution of the metrics for the two strategies (specifically, in Figs. 5, 7 & 8). Hence, in our findings, even under position control some trials looked more like velocity control, and vice versa. We briefly discussed this in the paper, noting that “a large number of trials fall somewhere between the Position and Velocity Control boundaries”, and that “this could be due to a mixed control strategy” or “subjects switch strategies of their own accord”. This point would have been clearer had we included examples of these hand and cursor traces in Fig. 8. We will update Fig. 8 to more clearly illustrate this point and expand our discussion on different possible interpretations.
  
  Second, one may interpret the differences we attributed to changes in “control strategy” as changes simply in the gain of our “fixed” controller. Specifically, similar to the controller implemented by the reviewer, our controller is fixed in terms of the plant, the actuator and the sensory feedback loop; the only change we explored was in the relative weights or gains of position vs. velocity in the Q matrix to generate the motor command. While our intent was primarily to focus on the extremes of position control vs. velocity control, we agree that a mixed strategy of minimizing some combined error in position and velocity is likely. This is something we can readily explore with our controller model.
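  To make the idea of shifting relative position and velocity weights concrete, the following is a minimal sketch and not the controller used in the study. It assumes a hypothetical double-integrator plant (state = position and velocity, control = acceleration), a diagonal Q matrix, a scalar effort cost and a plain Riccati iteration; the function names `lqr_gain` and `gains_for_weights` and all numerical values are illustrative assumptions.

```python
import numpy as np


def lqr_gain(A, B, Q, R, iters=5000):
    """Iterate the discrete-time Riccati recursion and return the feedback gain K."""
    P = Q.copy()
    for _ in range(iters):
        # Gain for the current cost-to-go matrix P.
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        # Riccati update: P = Q + A'P(A - BK).
        P = Q + A.T @ P @ (A - B @ K)
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)


def gains_for_weights(w_pos, w_vel, dt=0.01, effort=0.01):
    """Feedback gains [k_position, k_velocity] for a given weighting in Q.

    The plant is an assumed double integrator, chosen only for illustration.
    """
    A = np.array([[1.0, dt], [0.0, 1.0]])   # position integrates velocity
    B = np.array([[0.0], [dt]])             # control acts as acceleration
    Q = np.diag([w_pos, w_vel]).astype(float)
    R = np.array([[effort]])
    return lqr_gain(A, B, Q, R)


if __name__ == "__main__":
    for label, (wp, wv) in {
        "position-weighted": (1.0, 0.0),
        "mixed": (0.5, 0.5),
        "velocity-weighted": (0.0, 1.0),
    }.items():
        print(label, gains_for_weights(wp, wv))
```

  In this toy setting the plant, actuator and feedback loop stay fixed and only the weights in Q change, so the extremes and any mixture between them differ solely in the resulting feedback gains. With all weight on velocity, the gain on position is zero and position is left free to drift.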
  
  In summary, we consider it worthwhile to investigate how one can infer the control strategy that a subject is employing to complete the task - either in our CST, or any other task that admits multiple strategies that can lead to success. We regard this as a valuable step towards addressing more realistic behaviors and their neural underpinnings in non-human primate research. The suggestions offered by the reviewers regarding additional analyses, simulations and experiments will provide more definitive answers and clarity for our approach.
  
  We are truly grateful for the time and effort the reviewers put into our manuscript. We are in the process of undertaking revisions to address all of their feedback and look forward to submitting an improved manuscript with a more detailed reply in the coming weeks.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.02.539055v1
www.biorxiv.org www.biorxiv.org

New submission 21/08/2023, 09:25:53

1
1. Public_Reviews 21 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This paper falls in a long tradition of studies on the costs of reproduction in birds and its contribution to understanding individual variation in life histories. Unfortunately, the meta-analyses only confirm what we know already, and the simulations based on the outcome of the meta-analysis have shortcomings that prevent the inferences on optimal clutch size, in contrast to the claims made in the paper.
  
  There was no information that I could find on the effect sizes used in the meta-analyses other than a figure listing the species included. In fact, there is more information on studies that were not included. This made it impossible to evaluate the data-set. This is a serious omission, because it is not uncommon for there to be serious errors in meta-analysis data sets. Moreover, in the long run the main contribution of a meta-analysis is to build a data set that can be included in further studies.
  
  It is disappointing that two referees comment on data availability, as we supplied a link to our full dataset and the code we used in Dryad with our submitted manuscript. We were also asked to supply our data during the review process and we again supplied a link to our dataset and code, along with a folder containing the data and code itself. We received confirmation that the reviewers had been given our data and code. We support open science and it was our intention that our dataset should be fully available to reviewers and readers. Our data and code are at https://doi.org/10.5061/dryad.q83bk3jnk.
  
  The main finding of the meta-analysis of the brood size manipulation studies is that the survival costs of enlarging brood size are modest, as previously reported by Santos & Nakagawa on what I suspect to be mostly the same data set.
  
  We disagree that the main finding of our paper is the small survival cost of manipulated brood size. The major finding of the paper, in our opinion, is that the effect sizes for experimental and observational studies are in opposite directions, therefore providing the first quantitative evidence to support the influential theoretical framework put forward by van Noordwijk and de Jong (1986), that individuals differ in their optimal clutch size and are constrained to reproducing at this level due to a trade-off with survival. We show that while the manipulation experiments have been widely accepted to be informative, they are not in fact an effective test of whether within-species variation in clutch size is the result of a trade-off between reproduction and survival.
  
  The comment that we are reporting the same finding as Santos & Nakagawa (2012) is a misrepresentation of both that study and our own. Santos & Nakagawa found an effect of parental effort on survival only in males who had their clutch size increased – but no effect for males who had their clutch size reduced and no survival effect on females for either increasing or reducing parental effort. However, we found an overall reduction in survival for birds who had brood sizes manipulated to make them larger (for both sexes and mixed sex studies combined). In our supplementary information, we demonstrate the overall survival effect of a change in reproductive effort to be close to zero for males, negative (though non-significant) for females and significantly negative for mixed sexes (which are not included in the Santos & Nakagawa study).
  
  The paper does a very poor job of critically discussing whether we should take this at face value or whether instead there may be short-comings in the general experimental approach. A major reason why survival cost estimates are barely significantly different from zero may well be that parents do not fully adjust their parental effort to the manipulated brood size, either because of time/energy constraints, because it is too costly and therefore not optimal, or because parents do not register increased offspring needs. Whatever the reason, as a consequence, there is usually a strong effect of brood size manipulation on offspring growth and thereby presumably their fitness prospects. In the simulations (Fig.4), the consequences of the survival costs of reproduction for optimal clutch size were investigated without considering brood size manipulation effects on the offspring. Effects on offspring are briefly acknowledged in the discussion, but otherwise ignored. Assuming that the survival costs of reproduction are indeed difficult to discern because the offspring bear the brunt of the increase in brood size, a simulation that ignores the latter effect is unlikely to yield any insight in optimal clutch size. It is not clear therefore what we learn from these calculations.
  
  The reviewer’s comment is somewhat of a paradox. We take the best studied example of the trade-off between reproductive effort and parental survival, a key theme in life-history and the biology of ageing, and subject this to a meta-analysis. The reviewer suggests we should interpret our finding as if there must be something wrong with the method or studies we included, rather than maybe considering the original hypothesis could be false or inflated in importance. We consider the reviewer’s inclination to question the premise of the data in favor of a held hypothesis not necessarily the best scientific approach here. In many places in our manuscript we question and address issues in the underlying data and interpretation (L101-105, L149-150, L182-185 and L229-233). Moreover, we make it clear that we focus on the trade-off between current reproductive effort and subsequent parental survival and we are aware that other trade-offs could counter-balance or explain our findings, discussed on L189-191 & L246-253. Note that it is also problematic, when you do not find the expected response, to search for an alternative that has not been measured. In the case here, with trade-offs, there are endless possibilities of where a trade-off might be incurred between traits. We purposefully focus on the one well-studied and theorised trade-off. We clearly acknowledge though that when all possible trade-offs are taken into account a trade-off on the fitness level can occur and cite two famous studies (Daan et al., 1990 and Verhulst & Tinbergen 1991) that have done just that (L250-253).
  
  So, whilst we agree with the reviewer that the offspring may incur costs themselves, rather than costs being incurred by the parents, the aim of our study was to test for a generalised trend across species in the survival costs of reproductive effort. It is unrealistic to suggest that incorporating offspring growth into our simulations would add insight, as a change in offspring number rarely affects all offspring in the nest equally and there can even be quite stark differences; for example, this will be most evident in species that produce sacrificial offspring. This effect will be further confounded by catch-up growth, for example, and so it is likely that increased sibling competition from added chicks alters offspring growth trajectories, rather than absolute growth as the reviewer suggests. There are mixed results in the literature on the effect of altering clutch size on offspring survival, with an increased clutch size through manipulation often increasing the number of recruits from a nest.
  
  There are other reasons why brood size manipulations may not reveal the costs of reproduction animals would incur when opting for a larger brood size than they produced spontaneously themselves. Firstly, the manipulations do not affect the effort incurred in laying eggs (which also biases your comparison with natural variation in clutch size). Secondly, the studies by Boonekamp et al on Jackdaws found that while there was no effect of brood size manipulation on parental survival after one year of manipulation, there was a strong effect when the same individuals were manipulated in the same direction in multiple years. This could be taken to mean that costs are not immediate but delayed, explaining why single year manipulations generally show little effect on survival. It would also mean that most estimates of the fitness costs of manipulated brood size are not fit for purpose, because typically restricted to survival over a single year.
  
  First, our results did show a survival cost of reproduction for brood manipulations. We agree that there could be longer-term costs, and so our estimate of the survival cost for manipulated birds is likely to be an underestimate, meaning that our interpretation still holds – the cost to reproduce prevents individuals from laying beyond their optimal level. Note, however, that much theory is built on the immediate costs of reproduction and as such these costs are likely overinterpreted.
  
  We agree with the reviewer that lifetime manipulations could be even more informative than single-year manipulations. Unfortunately, there are currently too few studies available to be able to draw generalisable conclusions across species for lifetime manipulations. This is, however, the reason we used lifetime change in clutch size in our fitness projections, which the reviewer seems to have missed – please see methods line 360-362, where we explicitly state that this is lifetime enlargement. Of course such interpretations do not include an accumulation of costs that is greater than the annual cost, but currently there is no clear evidence that such an assumption is valid. Such a conclusion can also not be drawn from the study on jackdaws by Boonekamp et al (2014) as the treatments were life-long and, therefore, cannot separate annual from accrued (multiplicative) costs that are more than the sum of annual costs incurred.
  
  Details of how the analyses were carried out were opaque in places, but as I understood the analysis of the brood size manipulation studies, manipulation was coded as a covariate, with negative values for brood size reductions and positive values for brood size enlargements (and then variably scaled or not to control brood or clutch size). This approach implicitly assumes that the trade-off between current brood size (manipulation) and parental survival is linear, which contrasts with the general expectation that this trade-off is not linear. This assumption reduces the value of the analysis, and contrasts with the approach of Santos & Nakagawa.
  
  We thank the reviewer for highlighting a lack of clarity in places in our methods. We will add additional detail to this section in our revised manuscript.
  
  For clarity in our response, each effect size was extracted by performing a logistic regression with survival as a binary response variable and clutch size was the absolute value of offspring in the nest (i.e., for a bird who laid a clutch size of 5 but was manipulated to have -1 egg, we used a clutch size value of 4). The clutch size was also standardised and, separately, expressed as a proportion of the species mean.
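
  The effect-size extraction described above can be illustrated with a minimal sketch. It is not the authors' code (which they state is deposited on Dryad); the function names, the Newton-Raphson fitting routine and the survival records below are all hypothetical and serve only to show a logistic regression of binary survival on the clutch size actually raised.

```python
import math

def raised_clutch(laid, manipulation):
    """Clutch size actually raised, e.g. 5 eggs laid and -1 removed gives 4."""
    return laid + manipulation

def fit_logistic(clutch_sizes, survived, iterations=25):
    """Fit logit(P(survive)) = b0 + b1 * clutch_size by Newton-Raphson.

    Returns (b0, b1); b1 is the change in log-odds of survival per extra egg.
    """
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        # Gradient (g) and information matrix (h) of the log-likelihood
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y in zip(clutch_sizes, survived):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            w = p * (1.0 - p)
            g0 += y - p
            g1 += (y - p) * x
            h00 += w
            h01 += w * x
            h11 += w * x * x
        det = h00 * h11 - h01 * h01
        if det == 0.0:
            break
        # Newton step: solve the 2x2 system h * step = g
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1

# Hypothetical records: (eggs laid, eggs added or removed, parents surviving out of 10)
records = [(4, -1, 8), (4, 0, 7), (4, 1, 6), (5, 1, 5), (5, 2, 4)]

clutch_sizes, survived = [], []
for laid, manipulation, n_alive in records:
    size = raised_clutch(laid, manipulation)
    clutch_sizes += [size] * 10
    survived += [1] * n_alive + [0] * (10 - n_alive)

intercept, slope_per_egg = fit_logistic(clutch_sizes, survived)
```

  Because the predictor is the clutch size raised rather than the size of the manipulation, the same regression can be fitted to unmanipulated nests (manipulation of 0), which is what makes the two kinds of study comparable in this sketch.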
  
  We disagree that our approach reduces the value of our analysis. First, our approach allows a direct comparison between experimental and observational studies, which is the novelty of our study. Our approach does differ from Santos & Nakagawa but we disagree that it contrasts. Our approach allows us to take into consideration the severity of the change in clutch size, which Santos & Nakagawa do not. Therefore, we do not agree that our approach is worse at accounting for non-linearity of trade-offs than the approach used by Santos & Nakagawa.
  
  Our analysis, alongside a plethora of other ecological studies, does assume that the response to our predictor variable is linear. However, it is common knowledge that there are very few (if any) truly linear relationships. We use linear relationships because they serve as a good approximation of the trend and provide a more rigorous test for an underlying relationship than would fitting nonlinear models. For many datasets there is not a range of chicks added for which a non-linear relationship could be estimated. The question also remains of what the shape of this non-linear relationship should be, which is hard to determine a priori. We will address non-linear effects in our revised manuscript.
  
  The observational study selection is not complete and apparently no attempt was made to make it complete. This is a missed opportunity - it would be interesting to learn more about interspecific variation in the association between natural variation in clutch size and parental survival.
  
  We clearly state in our manuscript that we deliberately made a tailored selection of studies that matched the manipulation studies (L279-282). We paired species extracted for observational studies with those extracted in experimental studies to facilitate a direct comparison between observational and experimental studies, and to ensure that the respective datasets were comparable. The reviewer’s focus in this review seems to be solely on the experimental dataset. This comment dismisses the observational component of our analysis and thereby fails to acknowledge the question being addressed in this study.
  
  Reviewer #2 (Public Review):
  
  I have read with great interest the manuscript entitled "The optimal clutch size revisited: separating individual quality from the costs of reproduction" by LA Winder and colleagues. The paper consists in a meta-analysis comparing survival rates from studies providing clutch sizes of species that are unmanipulated and from studies where the clutch sizes are manipulated, in order to better understand the effects of differences in individual quality and of the costs of reproduction. I find the idea of the manuscript very interesting. However, I am not sure the methodology used allows to reach the conclusions provided by the authors (mainly that there is no cost of reproduction, and that the entire variation in clutch size among individuals of a population is driven by "individual quality").
  
  We would like to highlight that we do not conclude that there is no cost of reproduction. Please see lines 258–260, where we state that our lack of evidence for trade-offs driving within-species variation in clutch size does not necessarily mean the costs of reproduction are non-existent. We conclude that individuals are constrained to their optima by the survival cost of reproduction. It is also an over-statement of our conclusion to say that we believe that variation in clutch size is only driven by quality. Our results show that unmanipulated birds who have larger clutch sizes also live longer, and we suggest this is evidence that some individuals are “better” than others, but we do not say, nor imply, that no other factors affect variation in clutch size.
  
  I write that I am not sure, because in its current form, the manuscript does not contain a single equation, making it impossible to assess. It would need at least a set of mathematical descriptions for the statistical analysis and for the mechanistic model that the authors infer from it.
  
  We appreciate this comment, but this is the first time we have been asked to put equations in a manuscript rather than explain them in terms that are accessible to a wider audience. Note however that our meta-analysis is standard and based on logistic regression and standard meta-analytic practices. We do not think we need to repeat such equations and we cite the relevant data. For the simulation, we simply simulated the resulting effects and this is not something that we feel is captured more accurately in equations rather than in text and the associated graphs. We of course supplied our code for this along with our manuscript (https://doi.org/10.5061/dryad.q83bk3jnk), though as we mentioned above, we believe this was not shared with the reviewers despite us making this available for the review process. We therefore understand the reviewer feels the simulations were not explained thoroughly. We will revise our text to see if we can add additional explanation where relevant in our revision.
  
  The text mixes concepts of individual vs population statistics, of within-individual vs among-individuals measures, of allocation trade-offs and fitness trade-offs, etc., which means it would also require a glossary of the definitions the authors use for these various terms, in order to be evaluated.
  
  We would like to thank the reviewer for highlighting this lack of clarity in our text. We will simplify the terminology and define terms in our revised manuscript.
  
  This problem is emphasised by the following sentence to be found in the discussion "The effect of birds having naturally larger clutches was significantly opposite to the result of increasing clutch size through brood manipulation". The "effect" is defined as the survival rate (see Fig 1). While it is relatively easy to intuitively understand what the "effect" is for the unmanipulated studies: the sensitivity of survival to clutch size at the population level, this should be mentioned and detailed in a formula. Moreover, the concept of effect size is not at all obvious for the manipulated ones (effect of the manipulation? or survival rate whatever the manipulation (then how could it measure a trade-off ?)? at the population level? at the individual level ?) despite a whole appendix dedicated to it. This absolutely needs to be described properly in the manuscript.
  
  We would like to thank the reviewer for bringing to our attention the lack of clarity on the details of our methodology. We will make this more clear in our revised manuscript.
  
  For clarity, the effect size for both manipulated and unmanipulated nests was survival, given the brood size raised. We performed a logistic regression with survival as a binary response variable (i.e., number of individuals that survived and number of individuals that died after each breeding season), and clutch size was the absolute value of offspring in the nest (i.e., for a bird who laid a clutch size of 5 but was manipulated to have -1 egg, we used a clutch size value of 4). This allows for direct comparison of the effect size (survival given clutch size raised) between manipulated and unmanipulated birds.
  
  Despite the lack of information about the underlying mechanistic model tested and the statistical model used, my impression is still that the interpretation in the introduction and discussion is not granted by the outputs of the figures and tables. Let's use a model similar to that of (van Noordwijk and de Jong, 1986): imagine that the mechanism at the population level is
  
  a.c_(i,q)+b.s_(i,q)=E_q
  
  Where c_(i,q) and s_(i,q) are respectively the clutch size and the survival rate for individual i, which is of quality q, and E_q is the level of "energy" that an individual of quality q has available during the given time-step (and a and b are constants turning the clutch size and survival rate into energy cost of reproduction and energy cost of survival, and they are both quite "high" so that an extra egg (c_(i,q) is increased by 1) at the current time-step decreases s_(i,q) markedly (E_q is independent of the number of eggs produced), that is, we have strong individual costs of reproduction). Imagine now that the variance of c_(i,q) (when the population is not manipulated) among individuals of the same quality group is very small (and therefore the variance of s_(i,q) is very small also) and that the expectations of both are proportional to E_q. Then, in the unmanipulated population, the variance in clutch size is mainly due to the variance in quality. And therefore, the larger the clutch size c_(i,q), the higher E_q, and the higher the survival s_(i,q).
  
  In the manipulated populations however, because of the large a and b, an artificial increase in clutch size, for a given E_q, will lead to a lower survival s_(i,q). And the "effect size" at the population level may vary according to a,b and the variances mentioned above. In other words, the costs of reproduction may be strong, but be hidden by the data, when there is variance in quality; however there are actually strong costs of reproduction (so strong actually that they are deterministic and that the probability to survive is a direct function of the number of eggs produced)
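
  The reviewer's thought experiment can be made concrete with a small simulation. This is only a sketch of the verbal model above: the values of a and b, the range of E_q, the rule that clutch size is half of E_q/a plus a little noise, and the sample size are all hypothetical choices made for illustration.

```python
import random

A = 1.0   # energy cost per egg (hypothetical value)
B = 10.0  # energy cost per unit of survival probability (hypothetical value)

def survival(energy, clutch):
    """Solve a*c + b*s = E_q for the survival rate s."""
    return (energy - A * clutch) / B

def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

rng = random.Random(1)

# Quality varies a lot among individuals; clutch size within a quality
# level varies very little (small Gaussian noise around 0.5 * E_q / a).
energies = [rng.uniform(8.0, 14.0) for _ in range(2000)]
clutches = [0.5 * e / A + rng.gauss(0.0, 0.1) for e in energies]
survivals = [survival(e, c) for e, c in zip(energies, clutches)]

# Unmanipulated population: slope of survival on clutch size across individuals.
observational_slope = ols_slope(clutches, survivals)

# Manipulated population: add one egg to every individual, holding E_q fixed.
experimental_effect = sum(
    survival(e, c + 1) - survival(e, c) for e, c in zip(energies, clutches)
) / len(energies)
```

  Under these assumptions the across-individual slope is positive while the within-individual effect of one added egg is exactly -a/b, so the two kinds of study recover effects of opposite sign from the same underlying cost.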
  
  We would like to thank the reviewer for these comments. Please note that our simulations only take the experimental effect of brood size on parental survival into account. Our model does not incorporate quality effects. The reviewer is right that the relationship between quality and the effects exposed by manipulating brood size can take many forms and this is a very interesting topic, but not one we aimed to tackle in our manuscript. In terms of quality we make two points: 1) overall quality effects connecting reproduction and parental survival are present 2) these effects are opposite in direction to the effects when reproduction is manipulated and similar in magnitude. We do not go further than that in interpreting our results. The reviewer is right however that we do suggest and repeat suggestions by others that quality can also mask the trade-off in some individuals or circumstances (L63-65, L85-88 & L237-240), but we do not quantify this as this is dependent on the unknown relationships between quality and the response to the manipulation. A focussed set of experiments in that context would be interesting and there is some data that could get at this, i.e. the relationship between produced clutch size and the relative effect of the manipulation. Such information is however not available for all studies and although we explored also analyzing this, currently this is not possible to do with sufficient confidence. We will include this rationale in our revision.
  
  Moreover, it seems to me that the costs of reproduction are a concept closely related to generation time. Looking beyond the individual allocative (and other individual components of the trade-off) cost of reproduction and towards a populational negative relationship between survival and reproduction, we have to consider the intra-population slow fast continuum (some types of individuals survive more and reproduce less (are slower) than other (which are faster)). This continuum is associated with a metric: the generation time. Some individuals will produce more eggs and survive less in a given time-period because this time-period corresponds to a higher ratio of their generation time (Gaillard and Yoccoz, 2003; Gaillard et al., 2005). It seems therefore important to me, to control for generation time and in general to account for the time-step used for each population studied when analysing costs of reproduction. The data used in this manuscript is not just clutch size and survival rates, but clutch size per year (or another time step) and annual (or other) survival rates.
  
  The reviewer is right that this is interesting. There has been an unexplained difference in temperate (seasonal) and tropical reproduction strategies. Most of our data come from seasonal breeders, however. Although there is some variation in second brooding and such, often these species only produce one brood. We do agree that a wider consideration here is relevant, but we are not trying to explain all of life-history in our paper. It is clearly the case that other factors will operate and the opportunity for trade-offs will vary among species according to their respective life histories. However, our study focuses on the two most fundamental components of fitness – longevity and reproduction – to test a major hypothesis in the field, and we uncover new relationships that contrast with previous influential studies, and cast doubt on previous conclusions. We question the assumed trade-off between reproduction and annual survival. We show quality is important and that the effect we find in experimental studies is so small that it can only explain between-species patterns but is unlikely to be the selective force that constrains reproduction within-species. We do agree that there is a lot more work that can be done in this area. We hope we contribute to this, by questioning this central trade-off. We will try and incorporate some of these suggestions in the revision where possible.
  
  Finally, it is important to relate any study of the costs of reproduction in a context of individual heterogeneity (in quality for instance), to the general problem of the detection of effects of individual differences on survival (see, e.g., Fay et al., 2021). Without an understanding of the very particular statistical behaviour of survival, associated to an event that by definition occurs only once per life history trajectory (by contrast to many other traits, even demographic, where the corresponding event (production of eggs for reproduction, for example) can be measured several times for a given individual during its life history trajectory).
  
  Thank you for raising this point. The reviewer is right that heterogeneity can dampen or augment selection. Note that by estimating the effect of quality here we give an example of how heterogeneity can possibly do exactly this. We thank the reviewer for raising that we should possibly link this to wider effects of heterogeneity and we aim to do so in the revision.
  
  References:
  
  Fay, R. et al. (2021) 'Quantifying fixed individual heterogeneity in demographic parameters: Performance of correlated random effects for Bernoulli variables', Methods in Ecology and Evolution, 2021(August), pp. 1-14. doi: 10.1111/2041-210x.13728.
  
  Gaillard, J.-M. et al. (2005) 'Generation time: a reliable metric to measure life-history variation among mammalian populations.', The American naturalist, 166(1), pp. 119-123; discussion 124-128. doi: 10.1086/430330.
  
  Gaillard, J.-M. and Yoccoz, N. G. (2003) 'Temporal Variation in Survival of Mammals: a Case of Environmental Canalization?', Ecology, 84(12), pp. 3294-3306. doi: 10.1890/02-0409.
  
  van Noordwijk, A. J. and de Jong, G. (1986) 'Acquisition and Allocation of Resources: Their Influence on Variation in Life History Tactics', American Naturalist, p. 137. doi: 10.1086/284547.
  
  Reviewer #3 (Public Review):
  
  The authors present here a comparative meta-analysis designed to detect evidence for a reproduction/survival trade-off, central to expectations from life history theory. They present variation in clutch size within species as an observation in conflict with expectations of optimisation of clutch size and suggest that this may be accounted for from weak selection on clutch size. The results of their analyses support this explanation - they found little evidence of a reproduction - survival trade-off across birds. They extrapolated from this result to show in a mathematical model that the fitness consequences of enlarged clutch sizes would only be expected to have a significant effect on fitness in extreme cases, outside of normal species' clutch size ranges. Given the centrality of the reproduction-survival trade-off, the authors suggest that this result should encourage us to take a more cautious approach to applying concepts of the trade-off in life history theory and optimisation in behavioural ecology more generally. While many of the findings are interesting, I don't think the argument for a major re-think of life history theory and the role of trade-offs in fitness maximisation is justified.
  
  The interest of the paper, for me, comes from highlighting the complexities of the link between clutch size and fitness, and the challenges facing biologists who want to detect evidence for life history trade-offs. Their results highlight apparently contradictory results from observational and experimental studies on the reproduction-survival trade-off and show that species with smaller clutch sizes are under stronger selection to limit clutch size.
  
  Unfortunately, the authors interpret the failure to detect a life history trade-off as evidence that there isn't one. The construction of a mathematical model based on this interpretation serves to give this possible conclusion perhaps more weight than is merited on the basis of the results, of this necessarily quite simple, meta-analysis. There are several potential complicating factors that could explain the lack of detection of a trade-off in these studies, which are mentioned and dismissed as unimportant (lines 248-250) without any helpful, rigorous discussion. I list below just a selection of complexities which perhaps deserve more careful consideration by the authors to help readers understand the implications of their results:
  
  We would like to thank the reviewer for their thoughtful response and summary of the findings we also agree are central to our study. The reviewer also highlights areas where our manuscript could benefit from a deeper discussion and we will add detail to our discussion in our revised manuscript.
  
  We would like to highlight that we do not interpret the failure to detect a trade-off as evidence that there isn’t one. First, and importantly, we do find a trade-off but show this is only incurred when individuals lay beyond their optimal level. Secondly, we also state on lines 258-260 that the lack of evidence to support trade-offs being strong enough to drive variation in clutch size does not necessarily mean there are no costs of reproduction.
  
  The statement that we have constructed a mathematical model based on the interpretation that we have not found a trade-off is, again, factually incorrect. We ran these simulations because the opposite is true – we did find a trade-off. There is a significant effect of clutch size when manipulated on annual parental survival. To appreciate whether this effect alone can explain why reproduction is constrained, we ran the simulations. From these simulations we find that this effect size is too small to explain the constraint so something else must be going on and we do spend a considerable amount of text discussing the possible explanations (L182-194). Note the possibly most parsimonious conclusion here is that costs of reproduction are not there so we also give that explanation some thought (L201-205 and L247-253).
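
  The logic of asking whether a given survival cost is large enough to constrain clutch size can be sketched as follows. This is not the authors' simulation (their code is the version deposited on Dryad); the fitness measure, the baseline clutch size of 5, the baseline annual survival of 0.5 and the two per-egg effects are hypothetical, and the sketch assumes the enlarged clutch and its survival cost apply in every year of life.

```python
import math

def annual_survival(clutch, baseline_clutch, baseline_survival, log_odds_per_egg):
    """Annual parental survival when each extra egg shifts the log-odds of survival."""
    base_logit = math.log(baseline_survival / (1.0 - baseline_survival))
    logit = base_logit + log_odds_per_egg * (clutch - baseline_clutch)
    return 1.0 / (1.0 + math.exp(-logit))

def lifetime_output(clutch, baseline_clutch, baseline_survival, log_odds_per_egg):
    """Expected lifetime egg output with a constant clutch size.

    With constant annual survival s, the expected number of breeding
    seasons is 1 / (1 - s), so lifetime output is clutch / (1 - s).
    """
    s = annual_survival(clutch, baseline_clutch, baseline_survival, log_odds_per_egg)
    return clutch / (1.0 - s)

BASELINE_CLUTCH = 5       # hypothetical
BASELINE_SURVIVAL = 0.5   # hypothetical

# Two hypothetical per-egg effects on the log-odds of survival.
weak_cost, strong_cost = -0.05, -1.0

baseline = lifetime_output(5, BASELINE_CLUTCH, BASELINE_SURVIVAL, weak_cost)
enlarged_weak = lifetime_output(6, BASELINE_CLUTCH, BASELINE_SURVIVAL, weak_cost)
enlarged_strong = lifetime_output(6, BASELINE_CLUTCH, BASELINE_SURVIVAL, strong_cost)
```

  In this toy setting a weak per-egg cost leaves the enlarged clutch with higher lifetime output than the baseline, whereas a strong cost reverses the ranking, which is the kind of comparison needed to judge whether a survival cost alone can constrain clutch size.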
  
  We are disappointed by the suggestion that we have dismissed complicating factors which could prevent detection of a trade-off, as this was not our intention. We were aiming to highlight that what we have demonstrated to be an apparent trade-off can be explained through other mechanisms, and that the trade-off between clutch size and survival is not as strong in driving within-species variation in clutch size as previously assumed. We will add further discussion to our revised manuscript to make this clear and give readers a better understanding of the complexity of factors associated with life-history theory, although we do feel we have addressed this (L248-255).
  
  • Reproductive output is optimised for lifetime reproductive success and so the consequences of being pushed off the optimum for one breeding attempt are not necessarily detectable in survival but in future reproductive success (and, therefore, lifetime reproductive success).
  
  We agree this is a valid point, which is mentioned in our manuscript in terms of alternative stages where the costs of reproduction might be manifested (L248-250). We would also like to highlight that in our simulations, the change in clutch size (and subsequent survival cost) was assumed for the lifetime of the individual, for this very reason.
  
  • The analyses include some species that hatch broods simultaneously and some that hatch sequentially (although this information is not explicitly provided (see below)). This is potentially relevant because species which have been favoured by selection to set up a size asymmetry among their broods often don't even try to raise their whole broods but only feed the biggest chicks until they are sated; any added chicks face a high probability of starvation. The first point this observation raises is that the expectation of more chicks= more cost, doesn't hold for all species. The second more general point is that the very existence of the sequential hatching strategy to produce size asymmetry in a brood is very difficult to explain if you reject the notion of a trade-off.
  
  We agree with the reviewer that the costs of reproduction can be absorbed by the offspring themselves, and may not be equal across offspring (we also highlight this at L249 in the manuscript). However, we disagree that for some species the addition of more chicks does not equate to an increase in cost, though we do accept this might be less for some species. This is, however, difficult to incorporate into a sensible model as the impacts will vary among species and some species do also exhibit catch-up growth. So, without a priori knowledge on this, we kept our model simple, to test whether the effect on parental survival (often assumed to be a strong cost) can explain the constraint on reproductive effort, and we conclude it does not.
  
  We would also like to make clear that we are not rejecting the notion of a trade-off. Our study shows evidence that a trade-off between survival and reproductive effort likely does not drive within-species variation in clutch size. We do explicitly say this throughout our manuscript, and also provide suggestions of other areas where a trade-off may exist (L246-250). The point of our study is not whether trade-offs exist or not, it is whether there is a generalisable across-species trend for a trade-off between reproductive effort and survival – the most fundamental trade-off in our field but for which there is a lack of conclusive evidence within species.
  
  • For your standard, pair-breeding passerine, there is an expectation that costs of raising chicks will increase linearly with clutch size. Each chick requires X feeding visits to reach the required fledge weight. But this is not the case for species which lay precocious chicks which are relatively independent and able to feed themselves straight after hatching - so again the relationship of care and survival is unlikely to be detectable by looking at the effect of clutch size but again, it doesn't mean there isn't a trade-off between breeding and survival.
  
  Precocial birds still provide a level of parental care, such as protection from predators. Though we agree that the level of parental care in provisioning food (and in some cases in all parental care given) is lower in precocial than altricial birds, this would only make our reported effect size for manipulated birds to be an underestimate. Again, we would like to draw the reviewer’s attention to the fact we did detect a trade-off in manipulated birds and we do not suggest that trade-offs do not exist. The argument the reviewer suggests here does not hold for unmanipulated birds, as we found that birds that naturally lay larger clutch sizes have higher survival.
  
  • The costs of raising a brood to adulthood for your standard pair-breeding passerine is bound to be extreme, simply by dint of the energy expenditure required. In fact, it was shown that the basal metabolic rate of breeding passerines was at the very edge of what is physiologically possible, the human equivalent being cycling the Tour de France (Nagy et al. 1990). If birds are at the very edge of what is physiologically possible, is it likely that clutch size is under weak selection?
  
  If birds are at the very edge of what is physiologically possible, then indeed it would necessarily follow that if they increase the resource allocated in one area then expenditure in another area must be reduced. In many studies however, the overall brood mass is increased when chicks are added and cared for in an experimental setting, suggesting that birds are not operating at their limit all the time. Our simulations show that if individuals increase their clutch size, the survival cost of reproduction counterbalances the fitness gained by increasing clutch size and so there is no overall fitness gain to producing more offspring. Therefore, selection on clutch size is constrained to the within-species level. We do not say in our manuscript that clutch size is under weak selection – we only ask why variation in clutch size is maintained if selection always favours high-producing birds.
  
  • Variation in clutch size is presented by the authors as inconsistent with the assumption that birds are under selection to lay the Lack clutch. Of course, this is absurd and makes me think that I have misunderstood the authors' intended point here. At any rate, the paper would benefit from more clarity about how variable clutch size has to be before it becomes a problem for optimality in the authors' view (lines 84-85; line 246). See Perrins (1965) for an exquisite example of how beautifully great tits optimise clutch size on average, despite laying between 5-12 eggs.
  
  We would like to thank the reviewer for highlighting that our manuscript may be misleading in places; however, we are unsure which part of our conclusions the reviewer is referring to here. The question we pose is “why all birds don’t lay at the population optimum?”, and it is central to the decades-long field of life-history theory. Why is variation maintained at such a level? As the reviewer outlines, it ranges massively, with some birds laying half of what other birds lay.
  
URL

biorxiv.org/content/10.1101/2022.05.30.493969v1
www.biorxiv.org www.biorxiv.org

New submission 31/07/2023, 09:31:23

1
1. Public_Reviews 18 Aug 2023
 
 in eLife
 
 Author Response:
 
 The following is the authors’ response to the current reviews.
 
 Reviewer #1 (Recommendations For The Authors):
 
 The revision and rebuttal have addressed all concerns raised in the initial review. Upon review of the revised figures, however, it is unclear why Figure 8C shows many significant DEGs in POMC neurons (which according to Figure 8b is the "GABA_24" cluster), whereas Figure 6A shows few to no DEGs in the GABA_24 cluster. Same for Pmch neurons/Glut_25, which seem to be missing from Figure 6A.
 
 Answer: In order to capture changes in these smaller cell populations, we performed an additional DEG analysis with modified and less strict parameters (compared to the first main analysis). We mention the different parameters in the methods part of the revised manuscript (Differential gene expression analysis and case-control based expression shifts (Cacoa)).
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Recommendations For The Authors):
 
 Major issues
 
 1) A key conclusion of this study is that neurons show longer lasting infection-related changes in gene expression than do non-neuronal cells, suggesting that neurons are more persistently affected, which could potentially underlie persistent effects of infection on behavior or physiology. However, the authors also report that over twice as many transcripts were captured in neurons than in non-neuronal cells, and that neurons and non-neurons were not equal in number. The number of transcripts and cells per cell type can affect the likelihood of detecting a differentially expressed gene when comparing cell types. Thus, the difference in infection related DEGs between non-neuronal cells and neurons may be due in part to differences in the numbers of transcripts and cells in each group. How would the number of infection related DEG's compare if the same number of transcripts were detected in neurons as in non-neuronal cells? In addition, is there any relationship between the number of infection related DEGs detected and the number of cells in the respective groups?
 
 We performed an additional analysis, downsampling the transcripts per cell to similar numbers (~1600 transcripts/cell), which showed a pattern similar to that in the original calculation of DEGs: strong downregulation of genes in GABAergic, glutamatergic and non-neuronal cells at 3 and 7 dpi, but long-lasting dysregulation at 23 dpi only in the neuronal subtypes. The analysis results can be found in Supplementary Figure 12 and on page 11 in the results section.
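
 As an illustration only, and not the authors' code, downsampling to a common depth can be sketched as drawing a fixed number of transcripts from each cell without replacement. The function name `downsample_cell`, the gene counts and the random seed below are all made up for the example; only the ~1600 transcripts/cell target comes from the response above.

```python
import random
from collections import Counter

def downsample_cell(counts, target, rng):
    """Subsample one cell's transcript counts to `target` total, without replacement."""
    total = sum(counts.values())
    if total <= target:
        return dict(counts)  # already at or below the target depth; leave unchanged
    # One list entry per observed transcript, labelled by its gene.
    pool = [gene for gene, n in counts.items() for _ in range(n)]
    kept = Counter(rng.sample(pool, target))
    return {gene: kept.get(gene, 0) for gene in counts}

rng = random.Random(0)  # fixed seed so the sketch is reproducible
neuron = {"Snap25": 2000, "Slc17a6": 900, "Pomc": 300}  # 3200 transcripts (made-up counts)
glia = {"Plp1": 1000, "Mbp": 400}                       # 1400 transcripts (made-up counts)

neuron_ds = downsample_cell(neuron, 1600, rng)
glia_ds = downsample_cell(glia, 1600, rng)
```

 In this sketch, cells already below the target are left untouched, so the deeper-sequenced neurons are brought down towards the depth of the non-neuronal cells before DEGs are recomputed.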
 
 2) The rationale for focusing on the LH and DMH is unclear. While these regions do play important roles in control of body weight and wakefulness, the authors do not report whether the cell types relevant to these functions are among those affected by infection. For instance, the authors mention HCRT and MCH neurons in the introduction but do not comment on whether these neurons show any significant changes after H1N1 infection in their analysis. Also, what about the POMC neurons or the Lepr+ DMH neurons? Knowing whether and how these body weight associated cell types are affected could help to connect the phenotypic (e.g., body weight) and molecular changes observed.
 
 We have added an additional analysis of some well-known hypothalamic subtypes. Interestingly, the different neuronal subtypes respond to the infection differently. While most neurons show the strongest response at 3 dpi, POMC+ neurons show consistent changes across all three time points. This could point to different neuronal subtypes playing different roles in the sickness response to the influenza infection. The new data have been added to Figure 8 together with new text in the results section and discussion (Pages 17 & 20).
 
 3) For discriminating neurons and non-neuronal cells based on their expression of neuronal marker genes, was this performed at the single-cell level or the cluster level? Similarly, was the discrimination of GABAergic and glutamatergic neurons done at the cell or cluster level?
 
 The discrimination of the cell types was done at the single-cell level. This information has been added to the revised manuscript on page 25.
 
 4) The authors mention that body weight did not change in some of the mice. Was there any difference in infection related DEGs between the mice that lost weight and the mice that didn't? Was there any correlation between the molecular and phenotypic (i.e., body weight) changes observed?
 
 We agree that this could have been an interesting point to investigate; however, we can only say with certainty for 2 animals in the recovery group (23.7 and 23.8) that they didn’t lose weight (Supplement figure 2). In Figure 4A we show that overall the different time points group well together, with the exception of animal 23.7, which seems to have a better overlap with 7 dpi, indicating that we possibly captured a delayed disease response here. However, we have too few animals without weight loss to make any in-depth analysis.
 
 5) The authors noted that the hypothalamic neurons continue to show infection-related changes in gene expression at 23dpi though body weight has returned to normal. In this H1N1 model, are there any persistent behavioral deficits at 23dpi that could be explained by the persistent changes in gene expression in DMH and LH neurons?
 
 We did not test for long-lasting behavioural changes in these animals. Another study by Hosseini et al. (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6596076/) focuses on long-term cognitive effects of viral infections. Even though they did not include the H1N1 model used here, they included the PR8 strain, but didn’t report any long-lasting behavioural or cognitive changes. So far, only cognitive deficits during the acute phase of the infection caused by the PR8 H1N1 model have been shown. This would be a very interesting follow-up study to perform, but we believe it is out of scope for the current manuscript.
 
 6) In Figure 1F, the 3dpi sample appears to differ from the other samples in terms of its neuron/non-neuron composition. The authors point this out but offer no discussion or further analysis. Was this difference driven by one or more cell types? Is this difference likely to be technical (e.g., less white matter in sample = fewer oligodendrocytes), or could this be related to the infection (e.g., glial death or neurogenesis at 3dpi)?
 
 We have added the location of the punching within the hypothalamus for the different groups to the supplements (Supplementary Figure 3). The differences in neuron/non-neuron composition could originate from differences in the punching location, but we do not have data to support this conclusion. The difference could also stem from biological alterations during the infection.
 
 7) Since influenza viruses replicate in the cell nuclei, did the authors capture any H1N1 RNA in their single-nuclei RNA-seq samples?
 
 We mapped the single-nuclei data against the viral genes, but could not detect any of the viral genes in the data set. We are still optimizing the detection of low amounts of viral genes in snRNA-seq data and have not included this information in the manuscript. We believe that the virus did not manage to migrate into the hypothalamus and infiltrate the cells in the area captured here.
 
 Minor Issues
 
 1) Page 1. The abstract ends with the sentence: This is complemented by increased activity of microglia monitoring their surroundings. Presumably, the authors are basing this statement on the functions of genes altered in microglia by infection. However, saying that microglia behavior has changed is a bit of a stretch here, since the results suggest a change in the molecular phenotype of microglia but do not demonstrate a change in their behavior.
 
 We agree that the phrasing of the end of the abstract was not accurate and didn’t reflect the outcome of the analysis. We adjusted the sentence to: “The change of microglia gene activity suggest that this is complemented by a shift in microglia activity to provide increased surveillance of their surroundings.” This should make it clearer that the findings we present are a suggestion based on the transcriptomic changes in the cell population (Page 1).
 
 2) Page 8. The authors refer to Th+, Ddc+ neurons as dopaminergic. However, adrenergic/noradrenergic neurons also express these genes. How do the authors know the neurons are not adrenergic/noradrenergic?
 
 There are, to our knowledge, no noradrenaline- or adrenaline-producing neurons in the hypothalamus. In contrast, dopaminergic neurons have indeed been identified in this area.
 
 3) In the Methods section, Slc17a6 and Slc32a1 are not "pan-neuronal markers" since they are only expressed by subsets of neurons.
 
 We removed the glutamatergic and GABAergic marker genes (Slc17a6 and Slc32a1) from the list of neuronal markers. They are stated further down in the method section as glutamatergic and GABAergic markers (see the changes on Pages 24/25).
 
 4) Was the hashtagging antibody custom or commercial? If commercial, what was the source, catalog #, lot #? If custom, the authors should describe how it was made and validated.
 
 We used commercial antibodies for hash-tagging. We added the missing information, which can be found on Page 24 of the revised manuscript.
 
 5) In the data processing section of the Methods, SCTransform is mentioned twice. Was normalization with SCTransform applied twice?
 
 The data were normalized only once, using the SCTransform method. We adjusted this part of the method section to make it clearer (Page 24).
 
 6) In the section on gene set enrichment analysis, the first sentence includes this text: "(is a reference needed?)." The answer is yes - Alexa A, Rahnenfuhrer J (2022). topGO: Enrichment Analysis for Gene Ontology. R package version 2.50.0.
 
 The missing reference was added (Page 26).
 
 7) Page 4: "leaved" should be corrected to "left"
 
 The wrong wording was corrected.
 
 8) Figure 2D - gene is labeled as Slc31a1 on the figure and Slc32a1 in the figure legend
 
 We provided a new Figure plate with the right marker genes.
 
 9) Official gene IDs should be italicized
 
 We checked the gene IDs again and italicized those that were wrongly formatted.
 
 10) It is not clear whether the authors are planning to share their code. However, their code would be needed to reproduce their results, since the methods section provides a summary of what was done but lacks key details (e.g., parameters and software packages used during data processing and analysis)
 
 Code will be shared on request. We have also added this to the revised manuscript (Page 26).
 
 Reviewer #2 (Public Review):
 
 The new work from Lemcke et al suggests that the infection with Influenza A virus causes such flu symptoms as sleepiness and loss of appetite through the direct action on the responsible brain region, the hypothalamus. To test this idea, the authors performed single-nucleus RNA sequencing of the mouse hypothalamus in controlled experimental conditions (0, 3, 7, and 23 days after intranasal infection) and analyzed changes in the gene expression in the specific cell populations. The key results are promising.
 
 However, the analysis (cell type annotation, integration, group comparison) is not optimal and incomplete and, therefore should be significantly improved.
 
 More specifically:
 
 1) The current annotation of cell types (especially neuronal but also applicable to the group of heterogeneous "Unassigned cells") did not make a good link to existing cell heterogeneity in the hypothalamus identified with scRNA seq in about 20 recently published works. All information about different peptidergic groups can not be extracted from the current version (except for a few). There are also some mistakes or wrong interpretations (eg, authors assigned hypothalamic dopamine cells to the glutamatergic group, which is not true). This state is feasible to improve (and should be improved) with already existing data.
 
 We repeated the cell label transfer with the newly published HypoMap and added additional information to the supplements about the cell type assignments. Additionally, we agree that the dopaminergic neurons do not belong to the group of glutamatergic neurons; however, we assigned them to this group based on the clustering. We changed the phrasing in the results to better differentiate between the two groups (Page 8).
 
 2) I am confused with the results shown in the label transfer (suppl fig 3 and 4; note, they do not have the references in the text) applied to some published datasets (authors used the Seurat functions 'FindTransferAnchors' and 'TransferData'). The final results don't make sense: while the dataset for the arcuate nucleus (Campbel et al) well covered the GABAergic neurons it is not the case for the whole hypothalamus datasets (Chen et al; Zeisel et al). Similarly, for glutamatergic neurons. Additionally, I could not see that the label transfer works well for PMCH cells which should be present in the dataset for the lateral hypothalamus (Mickelsen et al,2019).
 
 We performed the additional label transfer of the hypothalamus data. Here we accepted a prediction score of 0.5 and transferred a cell type label to our annotated cluster IDs if at least 10% of cells within a cluster were annotated with the 0.5 prediction score. We found that well-defined neuron population types like Hcrt+, Pmch+ and Hdc+ neurons as well as Pomc+ neurons were tagged with high prediction scores (>= 0.9, Supplement Figures 6 and 7) and non-neuronal cell types (Supplement Figure 8) were well annotated. Additionally, we identified an Agrp+ neuron population with the Gaba_1 neurons. This information has been added to the revised manuscript (Pages 6, 8).
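
 One possible reading of this rule, sketched here under stated assumptions rather than taken from the authors' pipeline: a reference label is attached to a cluster when at least 10% of that cluster's cells carry the label with a prediction score of 0.5 or higher. The function name `transfer_labels`, the cluster names and the toy scores are hypothetical; only the 0.5 and 10% thresholds come from the response above.

```python
from collections import Counter, defaultdict

def transfer_labels(cells, score_cutoff=0.5, min_fraction=0.10):
    """cells: iterable of (cluster_id, predicted_label, prediction_score)."""
    cluster_size = Counter()
    confident = defaultdict(Counter)  # cluster -> label -> confident cell count
    for cluster, label, score in cells:
        cluster_size[cluster] += 1
        if score >= score_cutoff:
            confident[cluster][label] += 1
    # Keep a label only if enough of the cluster's cells support it confidently.
    return {
        cluster: sorted(
            label
            for label, n in confident[cluster].items()
            if n / size >= min_fraction
        )
        for cluster, size in cluster_size.items()
    }

cells = (
    [("cluster_A", "Pmch", 0.95)] * 8    # 8 of 10 cells: confident Pmch calls
    + [("cluster_A", "Hcrt", 0.30)] * 2  # below the score cutoff
    + [("cluster_B", "Pomc", 0.90)] * 1  # 1 of 20 cells = 5%, below the 10% rule
    + [("cluster_B", "Agrp", 0.20)] * 19 # below the score cutoff
)
labels = transfer_labels(cells)
```

 Under this reading a cluster can receive no label at all (as `cluster_B` does here) or more than one label if several reference types each pass both thresholds.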
 
 3) There are newly developed approaches to check the shifts in the cell compositions and specific differential gene expression in the cell groups (e.g. Cacoa from Kharchenko lab, scCoda from Büttner et al; etc). Therefore, I did not fully understand why here the authors used the pseudo-bulk approaches for the data analysis (having such a valuable dataset with multiple hashed samples for each timepoint). Therefore it would be great to use at least one of those approaches, which were developed specifically for the scRNAseq data analysis. Or, if there are some reasons - the authors should argue why their approach is optimal
 
 We performed an additional analysis comparing case-control studies (Cacoa). We performed both modalities, cluster-based and cluster-free expression shifts, as well as cell type compositions. We could partly confirm the findings from our pseudo-bulk approach. The cluster-specific density shift (Supplement Figure 15) identified only shifts in non-neuronal cell types between the Control group and 3 dpi. We believe these composition shifts are caused by the lower number of non-neuronal cells at the 3 dpi time point. Cluster-specific expression shifts show similar results to the pseudo-bulk approach, with significant expression shifts identified at 3 and 7 dpi in neuronal and non-neuronal cell clusters (Supplement Figure 16). However, no significant expression shifts were identified in the recovery group at 23 dpi. Using the cluster-free expression shift approach, however, we were able to identify a picture similar to that described with the pseudo-bulk approach. In the recovery group at 23 dpi, we found mainly changed gene programs in neuronal cells, and no transcriptional changes in the non-neuronal cells (Supplement Figures 17-20). This new analysis has been added to the revised manuscript (Pages 4-6, 26), including supplementary figures and tables as stated.
 
 4) When the authors describe the DGE changes upon experimental conditions (Figures 5 and 6), my first comment is again relevant: it is difficult to use the current annotation and cell type description as the reference for testing virus effects and shifts in the DGE in distinct neuronal subtypes.
 
 The cell type annotations have been checked and additional label transfer has been performed. All figures in the manuscript have been updated.
 
 I have to note that the experimental design is well done and logical. Therefore I believe that to strengthen the conclusions, the already obtained datasets can be used for improved analysis.
 
 Reviewer #2 (Recommendations For The Authors):
 
 I have some minor concerns:
 
 1) For the quality check it would be good to see how different hashed samples for each timepoint cover the UMAP embeddings.
 
 We added the UMAP embeddings to the supplement. (Supplement Figure 4)
 
 2) In Fig 1e colors are not optimal - it is impossible to assess it.
 
 We separated the UMAPs for the different time points to make it easier to assess. See updated Figure 1E.
 
 3) In the methods authors started "Single-nucleus RNA-sequencing cell population identification" from the description of using a Gaussian mixture model (GMM). However, I could not clearly understand how this model was used and which kind of result it provided.
 
 We used a GMM with known markers for neurons, and in a second step for glutamatergic and GABAergic cells, to sub-cluster the cells, and then assigned cells to their respective classes based on high or low expression of the marker genes in the cluster. This information has been added to the method section (Page 24/25).
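
 To make the idea concrete, here is a minimal sketch of a two-component, one-dimensional Gaussian mixture fitted by expectation-maximisation on a single marker score, with cells then split into a low and a high expression class. It is an assumption-laden toy, not the authors' implementation: the function name `fit_two_gaussians`, the initialisation at the minimum and maximum, the variance floor and the marker values are all invented for illustration.

```python
import math

def fit_two_gaussians(values, n_iter=50):
    """Fit a 2-component 1D Gaussian mixture by EM; component 1 starts at the high end."""
    mu = [min(values), max(values)]
    var = [1.0, 1.0]
    weight = [0.5, 0.5]
    resp = []
    for _ in range(n_iter):
        # E-step: responsibility of each component for each value.
        resp = []
        for x in values:
            dens = [
                weight[k]
                * math.exp(-((x - mu[k]) ** 2) / (2 * var[k]))
                / math.sqrt(2 * math.pi * var[k])
                for k in range(2)
            ]
            total = sum(dens) or 1e-300  # guard against numerical underflow
            resp.append([d / total for d in dens])
        # M-step: update means, variances and mixing weights.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            mu[k] = sum(r[k] * x for r, x in zip(resp, values)) / nk
            spread = sum(r[k] * (x - mu[k]) ** 2 for r, x in zip(resp, values)) / nk
            var[k] = max(spread, 1e-3)  # floor keeps a component from collapsing
            weight[k] = nk / len(values)
    return mu, resp

# Made-up normalised expression of a neuronal marker in nine nuclei.
marker = [0.0, 0.1, 0.2, 0.0, 0.3, 4.8, 5.1, 5.3, 4.9]
mu, resp = fit_two_gaussians(marker)
is_high = [r[1] > 0.5 for r in resp]  # True = assigned to the high-expression class
```

 The same two-step logic could be repeated on glutamatergic and GABAergic marker scores within the cells called neuronal, which is how the response describes the second step.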
 
 4) Could the authors better clarify why "they calculated normalization factors using the scran function 'computeSumFactors'" when working with pseudobulk analysis?
 
 This size factor normalization was recommended for single-cell data by the authors of the DESeq2 package. http://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html
 
 5) I didn't find logic in "a cell cluster was only included if it contained more than 2 nuclei in at least 3 individual animals" (page 24). Maybe I misinterpreted it.
 
 The rationale for the selection method was based on the finding that not all animals in the recovery group showed the same weight loss. The acute time points didn’t show enough weight loss to decide whether all animals in these groups lost the same amount and were equally sick. Hence, in order to have biological robustness, we decided to only analyse clusters where cells from at least 3 animals at a specific time point contributed to a cell type. In order to have enough cells per cell type for the calculation of DEGs, we decided to only include a cell type at a specific time point if it contained at least 3 cells from each of these individuals. This selection method limits the analysis to cell types with at least 9 cells per time point.
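
 Read this way, the inclusion rule can be sketched as a small filter: count the nuclei each animal contributes to a cell type at one time point, and keep the cell type only if at least 3 animals each contribute at least 3 nuclei. This is one interpretation of the text above, not the authors' code; the function name `keep_cell_type` and the animal IDs are hypothetical.

```python
from collections import Counter

def keep_cell_type(animal_ids, min_cells=3, min_animals=3):
    """animal_ids: one entry per nucleus of this cell type at one time point."""
    per_animal = Counter(animal_ids)
    # Animals that contribute enough nuclei to count towards the rule.
    qualifying = sum(1 for n in per_animal.values() if n >= min_cells)
    return qualifying >= min_animals

ok = keep_cell_type(["a1"] * 3 + ["a2"] * 4 + ["a3"] * 3)      # three animals, >= 3 nuclei each
too_few_animals = keep_cell_type(["a1"] * 10 + ["a2"] * 10)    # many nuclei, only two animals
sparse = keep_cell_type(["a1"] * 3 + ["a2"] * 3 + ["a3"] * 2)  # third animal has only 2 nuclei
```

 With these defaults the smallest cell type that passes has 3 x 3 = 9 nuclei, matching the lower bound stated in the response.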
 
URL

biorxiv.org/content/10.1101/2023.03.06.530999v2
www.biorxiv.org www.biorxiv.org

New submission 18/08/2023, 08:44:53

1
1. Public_Reviews 18 Aug 2023
  
  in eLife
  
  Author Response
  
  On behalf of the authors of the article "Elevated glycolytic metabolism of monocytes limits the generation of HIF-1α-driven migratory dendritic cells in tuberculosis", I would like to provide interim responses noting some relevant points about the eLife assessment and public reviews.
  
  eLife assessment
  
  This useful study tests the hypothesis that Mycobacterium tuberculosis infection increases glycolysis in monocytes, which alters their capacity to migrate to lymph nodes as monocyte-derived dendritic cells. The authors conclude that infected monocytes are metabolically pre-conditioned to differentiate, with reduced expression of Hif1a and a glycolytically exhaustive phenotype, resulting in low migratory and immunologic potential. Unfortunately, the evidence for the conclusions is currently incomplete, as the use of dead mycobacteria will affect bioenergetic readouts. The study will be of interest to microbiologists and infectious disease scientists.
  
  We would like to clarify what may be a misunderstanding. Indeed, the study did not deal with “infected monocytes” per se, but rather with the ability of monocytes purified from TB patients vs. healthy control to differentiate into DCs with different migratory capacities upon Mtb infection or stimulation. Since there is no evidence for the presence of Mtb in the patient’s blood, the metabolic effects we observed are likely a consequence of systemic pulmonary disease rather than of direct interaction of monocytes with Mtb. Although irradiated Mtb was used in most experiments, in particular because Seahorse and other technologies cannot be used in our BSL3 laboratory, we provide evidence (Figure 1) that infecting DCs with live Mtb or stimulating DCs with irradiated Mtb generates comparable glycolytic profiles (release of lactate, glucose consumption, HIF1a expression and LDHA expression). To strengthen the relevance of our data, we will characterize the metabolism of DCs infected with live Mtb using SCENITH.
  
  Reviewer #1 (Public Review):
  
  The manuscript by Maio and colleagues looks at the impact of the heightened glycolytic activity induced by Mtb in monocytes, and its impact on Hif1-a dependent migration of DCs.
  
  Data concerning the biological significance of the impact of enhanced glycolysis on DC migration is strong and convincing. While Hif1-a is obviously a key factor, the evidence that it is a linear component in the cascade falls a little short as the main inhibitor used PX-478 does not have a clear, single mode of action. Additional characterization with the alternative inhibitor (Echinomycin) would make the argument more convincing. 
  
  We would like to thank the reviewer for their positive assessment of our manuscript. Although Echinomycin has been used for validating some of the representative experiments performed in our study (see supplementary figure 2E-F), we agree with the reviewer’s suggestion. Therefore, additional experiments using echinomycin will be carried out to confirm our results.
  
  Reviewer #2 (Public Review):
  
  The manuscript by Maio et al attempts to examine the bioenergetic mechanisms involved in the delayed migration of DC's during Mtb infection. The authors performed a series of in vitro infection experiments including bioenergetic experiments using the Agilent Seahorse XF, and glucose uptake and lactate production experiments. This is a well-written manuscript and addresses an important question in the TB field. A major weakness is the use of dead Mtb in virtually all the experiments. Unfortunately, the authors did not attempt to address this critical confounding factor. As a result, data was interpreted, and conclusions were made as if live Mtb was used. Also, previous studies (PMID: 30444490 and PMID: 31914380) have shown that live Mtb suppresses glycolysis, which contradicts findings in this study, perhaps because dead Mtb was used here. For these reasons, obtaining any pertinent conclusions from the study is not possible, which diminishes the significance of the work.
  
  We thank the reviewer for their evaluation of our study. We agree that using live Mtb in all experiments would have been ideal. However, we do not have a Seahorse Analyzer in our BSL3 facility. Thus, we will characterize the metabolism of DCs infected with live Mtb using SCENITH during revision of our manuscript.
  
  With regard to the differences between our results and those of previous studies showing Mtb-induced suppression of glycolysis, they could be explained by the use of different Mtb strains, different multiplicity of infection (MOI), macrophages of different origins, and different measurement timepoints, as discussed in one of these publications (PMID 30444490). For instance, in PMID 30444490, hMDMs infected at an MOI of 1 showed increased extracellular acidification and glycolytic parameters, as opposed to higher MOI or the same MOI but measured in THP1 cells. Importantly, the aforementioned articles studied macrophage and not DC metabolism. These aspects will be discussed in a revised manuscript.
  
URL

biorxiv.org/content/10.1101/2023.04.03.535400v4
www.biorxiv.org www.biorxiv.org

New submission 18/08/2023, 08:25:56

1
1. Public_Reviews 18 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their thoughtful suggestions, which we will address in the revised manuscript.
  
  Briefly, we purposely fixed the Hill coefficients to h=1 on the grounds that one drug molecule binding to the channel is sufficient to block the channel and there is no strong evidence for co-operative binding in the literature. Doing so also helped to constrain the degrees of freedom in the face of noisy observations in the public datasets. As noted by Reviewer 2, the quality of the drug measurements varies widely across laboratories and this is particularly noticeable in estimates of Hill coefficients which are therefore less reliable.
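
  For readers unfamiliar with the convention, the standard Hill equation with the coefficient fixed at h=1 reduces to simple one-to-one binding. The sketch below is illustrative only and is not the authors' model code: the names `fraction_blocked` and `scaled_conductance`, the use of an IC50 parameter and the example concentrations are assumptions introduced here.

```python
def fraction_blocked(dose, ic50, hill=1.0):
    """Fraction of channels blocked at a drug concentration (standard Hill equation)."""
    if dose < 0 or ic50 <= 0:
        raise ValueError("dose must be >= 0 and ic50 must be > 0")
    return dose**hill / (dose**hill + ic50**hill)

def scaled_conductance(g_max, dose, ic50, hill=1.0):
    """Conductance remaining after block, assuming block scales conductance linearly."""
    return g_max * (1.0 - fraction_blocked(dose, ic50, hill))

# Hypothetical concentrations in arbitrary but matching units.
half = fraction_blocked(10.0, 10.0)             # dose equal to IC50
high = fraction_blocked(30.0, 10.0)             # dose at three times IC50
remaining = scaled_conductance(1.0, 30.0, 10.0)
```

  With h=1, a dose equal to the IC50 blocks half of the channels and tripling it blocks three quarters, so each drug-channel pair is described by a single fitted parameter, which is the reduction in degrees of freedom the response refers to.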
  
  The dose-dependent curves of multi-channel block (Figure 6) are plotted for all four dimensions in the Supplementary Dataset. We omitted GKs and GNaL from Figure 6 in an attempt at brevity since they do not add much to the story.
  
  It is true that pacing frequency was not considered in this study.
  
  The drugs were assessed across a range of doses (1x to 30x) but dosage only had a minimal impact on accuracy (88.1% to 90.8%) as shown in Figure 8A.
  
  Finally, we emphasize that the metric’s novelty lies in deriving a simple linear model from biophysical principles of ion-channel blockade rather than blind statistical model fitting.
  
URL

biorxiv.org/content/10.1101/2023.04.19.537441v4
www.biorxiv.org www.biorxiv.org

New submission 18/08/2023, 08:21:47

1
1. Public_Reviews 18 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Both reviewers strongly suggest that you modify the title of your paper to something that better reflects the data presented.
  
  We have made the title more specific to the findings described in the manuscript and revised the rest of the manuscript in response to the additional reviewer’s comments. We adjusted the abstract accordingly.
  
  Public Reviews:
  
  Reviewer #1 (Public Review):
  
  This manuscript conducts a classic QTL analysis to identify the molecular basis of natural variation in disease resistance. This identifies a pair of glycosyltransferases that contribute to steroidal glycoalkaloid production. Specifically altering the final hexose structure of the compound. This is somewhat similar to the work in tomatine showing that the specific hexose structure mediates the final potential bioactivity. Using the resulting transgenic complementation lines that show that the gene leads to a strong resistance phenotype to one isolate of Alternaria solani and the Colorado potato beetle. This is solid work showing the identification of a new gene and compound influencing plant biotic interactions. While the experiments are solid, the introduction, discussion and associated claims don't accurately reflect my reading of what is known and said in the current literature.
  
  The sentence on line 53-54 is misleading. It provides only three citations on specific links between specialized metabolism and disease resistance. However, there are actually at least 40 on specific links of camalexin and indolic phytoalexins to disease resistance. Similarly there are dozens of uncited papers on benzoxazinoids, indolic glucosinolates, aliphatic glucosinolates and tomatine to both non-host and host based resistance mechanisms. This even goes as far as showing how the pathogens resist an array of these compounds. The choices in the introduction make it appear that little is known about specialized metabolism to disease resistance but I would suggest that this is not an allusion supported by the literature. I would agree that given the breadth of specialized metabolism we have a lot of knowledge about a set of them but that there are hundreds to thousands of untested compounds but to indicate that little is known is unfair to the specialized metabolism community. This is especially true as the introduction and discussion give no image of the large body of literature on specialized metabolism to insect interactions even though this is a major component of this manuscript.
  
  We have rewritten this part of the introduction (lines 50-69). In the original text, we meant to convey our impression that receptor-mediated resistance is studied in a very high degree of detail, and that resistance that is based on secondary metabolites is receiving less recognition in comparison, especially in the plant-microbe interactions field. We agree that our comments might give the (false) impression that there is not much known. There is indeed a lot of data to support the importance of specialised metabolites in resistance, especially against necrotrophic pathogens and insects. The changes that we made should give a better reflection of that knowledge.
  
  I would also agree that specialized metabolism is not a conscious target of breeding programs but the work on benzoxazinoids in maize and glucosinolates in the Brassica's has shown that these compounds have been influenced by breeding programs. Similarly work on de novo domestication of multiple crops is focused on the adjustment of specialized metabolism in these crops.
  
  The reviewer is right to point out that specialized metabolism is influenced by breeding. Specialized metabolites may not only be involved in defence, but they can also affect other properties of the plant such as quality aspects. Potato breeders have made efforts to reduce SGA content in tubers to prevent problems with toxicity and to meet safety regulations. We have adjusted the discussion (lines 255-260).
  
  I would disagree with the hint on line 49-50 and again on lines 236-239 that specialized metabolism may have less pleiotropy. This is not supported by recent work on benzoxazinoids and glucosinolates showing that they have numerous regulatory links to the plant and can be highly pleiotropic. Even the earliest avenicin work in oat showed that the deficient lines had altered root development.
  
  We agree with the reviewer and we have removed the hints that specialized metabolism may have less pleiotropy from the manuscript. We do believe that the broad-spectrum activity of specialized metabolites can be an advantage, but this non-specificity also comes with risks in case of food crops. We note the potential negative effects of SGAs in the discussion (see previous comment and lines 300-303).
  
  My main message from the above three paragraphs is to point out that there are a number of places in the manuscript where the current state of the specialized metabolite literature is not accurately portrayed. To properly place the manuscript in the broader context, I would suggest a more even handed introduction and discussion that takes into account the current state of the specialized metabolism literature.
  
  We rewrote these parts to provide a more balanced view on the role of specialized metabolites in disease resistance.
  
  Is it accurate to say complete resistance to A. solani if only a single isolate of the pathogen is used? Is there evidence that I am unaware of that there are no isolates of this pathogen with saponin resistance? There are pathogens with natural tomatine resistance, and it is a common feature of plant pathogens that they have genetic variation in resistance to specialized metabolism. For example, it should be noted that Botrytis B05.10 is a tomatine-sensitive isolate and the van Kan and Hahn groups have published on isolates that are resistant to saponins. I would suggest caveating across the manuscript that this is a single isolate and that it is possible that there may be isolates with natural resistance to the steroidal glycoalkaloid.
  
  While it is true that we only describe the results of testing a single isolate of A. solani in the submitted manuscript, we previously showed that the S. commersonii resistance is effective against additional Alternaria isolates and species from different locations (1). We included this context to the introduction (lines 71-73) and also added the results of testing a more recent Dutch A. solani isolate (altNL21002, isolated from a potato field in the Netherlands in 2021) and an isolate from the US (ConR1H, isolated from a potato field in Idaho in 2015) to the supplementary material of the revised manuscript (lines 102-104). Of course, this still does not prove that the SGAs protect against all A. solani isolates and we have been more specific in referring to the Alternaria isolate that was tested. Similarly, it is impossible to make a general statement on the lack of detoxification capacity of all isolates of A. solani. It may indeed be possible that there are Alternaria isolates that are tolerant to the tetraose SGAs produced by S. commersonii, especially in natural habitats where Solanum species that produce tetraose SGAs and Alternaria co-occur. We have added this point to the discussion (lines 292-294).
  
  In Figure 4b, is the infection site about 3.5 mm in size such that 3.5 mm means absolutely no infection? If not, that would mean there is some outgrowth by Alternaria and the resistance isn’t complete.
  
  We often observe dead tissue underneath the inoculation droplet on resistant plants, which is measured as a lesion. Such lesions can usually visually be discriminated from the lesions on susceptible genotypes by their colour (dark black for resistant plants versus a more brownish colour of the lesions on susceptible plants), but this information is lost in the quantitative data presented in the figures. Droplets occasionally flow out over the leaf surface, which may explain why larger ‘lesions’ are sometimes observed on resistant plants. In rare cases, there may also be a little bit of outgrowth of Alternaria beyond the inoculation droplet before the infection is stopped on resistant genotypes. Whether the resistance is ‘complete’ in such cases is debatable. We tuned down our statements regarding ‘complete’ resistance throughout the manuscript.
  
  Reviewer #2 (Public Review):
  
  The study focuses on a mechanism of pest/pathogen resistance identified in Solanum commersonii, which appears to offer dominant resistance to Alternaria solani through the activity of specific glycosyltransferases which facilitate the production of tetraose glycoalkaloids in leaf tissue. The authors demonstrated that these glycoalkaloids are suppressive to the growth of multiple pathogenic ascomycetes and furthermore, that transgenic plants expressing these glycosyltransferases in susceptible S. commersonii clones demonstrate improved resistance to a specific strain of A. solani and a genotype of Colorado Potato Beetle. The study design is straightforward, yet thorough, and does a good job demonstrating the importance of these genes in resistance. While the research findings are significant there are statements throughout the manuscript that overstate both the novelty and utility of the findings.
  
  Title: While the protection is impressive, the title suggests that these glycoalkaloids provide protection against all fungi and insects, which is both unlikely and essentially impossible to prove. This should be changed to something more measured. This is especially true given that only a single fungus and insect were tested against transgenic plants, but would be an overstatement even with more robust evaluation.
  
  We appreciate the comment of the reviewer and agree that it is unlikely that the S. commersonii SGAs protect against all fungi and insects and that it would be impossible to prove this. We intended to highlight the fact that these compounds provide a qualitative (‘complete’) resistance against the tested isolates/genotypes, and that they are effective across a wide range of organisms (‘fungi and insects’). We have made the title more specific to the findings described in the manuscript.
  
  Throughout the paper: A single isolate of A. solani and a single genotype of CPB were used in this study. While this is in line with the typical limitations of such a study, the authors need to be careful about claiming broad resistance to either of the species. Variability in fungicide tolerance and detoxification activity have been noted in both fungi and CPB, so more specific language should be used throughout (such as L213 and L221).
  
  Similar points were raised by reviewer 1. We have toned down our statements regarding ‘complete’ resistance and clarified throughout the manuscript that we tested only a limited set of A. solani isolates and a single CPB genotype.
  
  Reviewer #2 (Recommendations For The Authors):
  
  L39: Fix grammar.
  
  Done
  
  L42: Race is a terminology not used in all pathosystems (others include pathovar, subspecies, etc.).
  
  We removed the word race and use the general ‘pathogen’.
  
  L53: The role of pterocarpans, flavonoids, indoles, terpenes, and a number of other compound classes have been linked to plant defense across the entire plant kingdom. Highlighting Avenacin is fine, but it shouldn't be ignored that the role of phytoalexins and phytoanticipins in defense against fungi (and the subsequent detoxification of these compounds by fungi) has been well established in a number of pathosystems.
  
  We have removed the specific reference to avenacin (we still refer to it in the discussion, as there are interesting similarities with the saponins from tomato and potato) and tried to highlight the diversity of plant defence compounds across the plant kingdom and the importance of tolerance mechanisms in different pathosystems in the revised manuscript (lines 52-60).
  
  L234-237: This is broadly an overstatement. To my knowledge there is quite a bit of interest in plant defense compounds for breeding (in plants generally) and we know quite a bit about their mode of action (fungal membrane perturbation through binding to ergosterol). There have been active breeding efforts for decades to reduce glycoalkaloid content in potatoes due to the hemolytic activity of these compounds. While this may or may not be the case with these specific SGAs, a more accurate summary of the state of the field is warranted.
  
  We have rewritten the paragraph to give a more balanced view of breeding for SGAs in potato (lines 63-69 on the mode of action of SGAs and lines 255-260 regarding breeding for specific SGA variants in potato).
  
  L279: "...introgression breeding could help to move these compounds from wild relatives to crop species..." Yes, but at what cost? If it results in increased SGAs in tubers, then the plants would be inedible. This could be made clearer and would support the following statement on alternative deployment techniques, including application as biological protectants.
  
  The reviewer is right to point out the importance of considering negative effects of SGAs in breeding. We paid more attention to this aspect in the discussion and added a sentence to clarify that effects on human health and the environment should be considered before employing these compounds (lines 300-303).
  
  Discussion:
  
  L229-230: the authors state that the tetraose SGA from commersonii can protect against other fungi, but this does not appear to have been tested. Rather, they looked at resistance in the CGN18024_1 and CGN18024_3 lines, which could express other factors unrelated to SGAs to impact resistance or susceptibility. Experiments to support this statement would include screening of the transgenic lines for resistance to other fungi, but this does not appear to have been done.
  
  We believe that the tetraose SGAs have the potential to protect against a range of fungi, but the reviewer correctly points out that these experiments do not provide definitive proof for their role in resistance to other pathogens besides A. solani and CPB. We have adjusted our statement accordingly (lines 247-250 of the discussion, 84-88 of the introduction and the abstract).
  
  Future questions should likely include characterizing the overall SGA content of resistant potatoes, characterizing the saponin content specifically found within tubers, and purifying the compounds to characterize the hemolytic activity of these specific compounds. Even if these aren't your exact plans, they would be necessary steps in any resistance breeding efforts. In particular, it will be important to know if the SGA content is increased in tubers of the tested lines, especially CGN18024_1, CGN18024_3, and the transgenics. Ideally, for breeding purposes there would be a disconnect between SGA production in foliage and tubers. It is unclear whether this is possible in these lines.
  
  These are all good questions, and it would be nice to follow up on them in future research. We explore the different routes towards a safe use of SGAs in resistance breeding in the discussion.
  
  It has been shown that commersonine, one of the tetraose glycoalkaloids, is also present in Solanum chacoense. It would be useful to note both this fact and that the Early Blight resistance which has been noted in Solanum chacoense may additionally be from these compounds (examples below).
  
  o https://www.cabi.org/GARA/FullTextPDF/Pre2000/19871336643.pdf
  
  o https://apsjournals.apsnet.org/doi/pdf/10.1094/PHYTO-06-18-0181-R (breeding line 24-24-12 has S. chacoense parentage)
  
  o https://agris.fao.org/agris-search/search.do?recordID=DJ20220231195
  
  This is indeed an interesting observation and it is well possible that SGAs are responsible for the resistance of S. chacoense. There are additional wild Solanum species that produce similar SGAs as found in S. commersonii that could confer resistance to early blight (or CPB) and we added this to the discussion (lines 263-265).
  
  Reference
  
  Wolters PJ, de Vos L, Bijsterbosch G, Woudenberg JH, Visser RG, van der Linden G, et al. A rapid method to screen wild Solanum for resistance to early blight. European Journal of Plant Pathology. 2019;154:109-14.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.09.28.509958v4
www.biorxiv.org www.biorxiv.org

New submission 11/08/2023, 09:18:05

1
1. Public_Reviews 17 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their work and their thoughtfulness. However, it seems to us that much (but not all) of the critique reflects a misunderstanding of the goals and methods of computational modeling. Details are below. We are grateful for the opportunity to include our views about this in the context of our replies to the Public Critiques of our paper. The comments of the reviewers were very helpful in allowing us to see what might not be clear to our readers.
  
  eLife assessment
  
  This useful modeling study explores how the biophysical properties of interneuron subtypes in the basolateral amygdala enable them to produce nested oscillations whose interactions facilitate functions such as spike-timing-dependent plasticity. The strength of evidence is currently viewed as incomplete because the relevance to plasticity induced by fear conditioning is viewed as insufficiently grounded in existing training protocols and prior experimental results, and alternative explanations are not sufficiently considered. This work will be of interest to investigators studying circuit mechanisms of fear conditioning as well as rhythms in the basolateral amygdala.
  
  Most of our comments below are intended to rebut the sentence: “The strength of evidence is currently viewed as incomplete because the relevance to plasticity induced by fear conditioning is viewed as insufficiently grounded in existing training protocols and prior experimental results, and alternative explanations are not sufficiently considered”. Details are below in the answer to reviewers.
  
  We believe this work will be interesting to investigators interested in dynamics associated with plasticity, which goes beyond fear learning. It will also be of interest because of its emphasis on the interactions of multiple kinds of interneurons that produce dynamics used in plasticity, in the cortex (which has similar interneurons) as well as BLA.
  
  We note that the model has sufficiently detailed physiology to make many predictions that can be tested experimentally. In the revision, we will be more explicit about this.
  
  We thank Reviewer #1 for stressing our work's important contribution in providing concrete hypotheses that can be tested in vivo, and for highlighting the importance of examining, in future work, the synergistic role of BLA interneurons in fear learning. The weaknesses reported by the Reviewer concern deviations of the model from the experimental literature. We describe below why we think those differences are minor in the context of the aims of our model. Specifically,
  
  1) Some connections among neurons in the BLA reported by (Krabbe et al., 2019) have not been taken into account in the model. Some connections between cell types were excluded without adequate justification (e.g. SOM+ to PV+).
  
  In order to constrain our model, we focused on what is reported in (Krabbe et al., 2019) in terms of functional connectivity instead of structural connectivity. Thus, we included only those connections for which there was strong functional connectivity. For example, the SOM+ to PV+ connection is shown to be small (Supp. Fig. 4, panel t). We also omitted PV+ to SOM+, PV+ to VIP+, SOM+ to VIP+, VIP+ to excitatory projection neurons; all of these are shown in (Krabbe et al. 2019, Fig. 3 (panel l), and Supp. Fig. 4 (panels m,t)) to have weak functional connectivity, at least in the context of fear conditioning. See below for comments on modeling strategies. We will explain this better in our revision.
  
  2) The construction of the afferent drive to the network does not reflect the stimulus presentations that are given in fear conditioning tasks. For instance, the authors only used a single training trial, the conditioning stimulus was tonic instead of pulsed, the unconditioned stimulus duration was artificially extended in time, and its delivery overlapped with the neutral stimulus, instead of following its offset. These deviations undercut the applicability of their findings.
  
  Regarding the use of a single long presentation of US rather than multiple presentations (i.e., multiple trials): in early versions of this paper, we did indeed use multiple presentations. We were told by experimental colleagues that the learning could be achieved in a single trial. We note that, if there are multiple presentations in our modeling, nothing changes; once the association between CS and US is learned, the conductance of the synapse is stable. Also, our model does not need a long period of US if there are multiple presentations. This point will be made clearer in our revision.
  
  We agree that, in order to implement the fear conditioning paradigm in our in-silico network, we made several assumptions about the nature of the CS and US inputs affecting the neurons in the BLA and the duration of these inputs. A Poisson spike train to the BLA is a signal that contains no structure that could influence the timing of the BLA output; hence, we used this as our CS input signal. We also note that the CS input can be of many forms in general fear conditioning (e.g., tone, light, odor), and we wished to de-emphasize the specific nature of the CS. The reference mentioned in the Recommendations for authors, (Quirk, Armony, and LeDoux 1997), uses pulses 2 seconds long. At the end of fear conditioning, the response to those pulses is brief. However, in the early stages of conditioning, the response goes on for as long as the figure shows. The authors do show that the number of cells responding decreases from early to late training, which perhaps reflects increasing specificity over training. This feature is not currently in our model, but we look forward to thinking about how it might be incorporated. Regarding the CS pulsed protocol used in (Krabbe et al., 2019), it has been shown that intense inputs (6 kHz and 12 kHz inputs) can lead to metabotropic effects that last much longer than the actual input (200 ms duration) (Whittington et al., Nature, 1995). Thus, the effective input to the BLA may indeed be more like Poisson.
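  
  To make concrete what "a Poisson spike train" means here, a minimal sketch is given below. It is an illustration only and is not taken from the model code: the function name `poisson_spike_train`, the 20 Hz rate, the 10 s duration and the seed are all hypothetical choices. It draws exponentially distributed inter-spike intervals, which is one standard way of generating a homogeneous Poisson process and yields an input with no temporal structure beyond its mean rate.

```python
import random


def poisson_spike_train(rate_hz, duration_s, seed=None):
    """Return increasing spike times (in seconds) of a homogeneous Poisson process."""
    rng = random.Random(seed)
    spike_times = []
    # Inter-spike intervals of a Poisson process are exponentially distributed.
    t = rng.expovariate(rate_hz)
    while t < duration_s:
        spike_times.append(t)
        t += rng.expovariate(rate_hz)
    return spike_times


# Hypothetical rate and duration, chosen only for illustration.
cs_spikes = poisson_spike_train(rate_hz=20.0, duration_s=10.0, seed=0)
intervals = [b - a for a, b in zip(cs_spikes, cs_spikes[1:])]
print(f"{len(cs_spikes)} spikes, mean interval {sum(intervals) / len(intervals):.4f} s")
```

  With these assumed values the mean interval should come out close to 1/20 s, with no preferred phase or periodicity in the spike times.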
  
  Our model requires the effects of the CS and US inputs on BLA neuron activity to overlap in time in order to instantiate fear learning. Although the literature contains paradigms with both overlapping CS/US inputs (delay conditioning, where US coterminates with CS (Lindquist et al., 2004), or immediately follows CS (e.g., Krabbe et al., 2019)) and non-overlapping CS/US inputs (trace conditioning), we hypothesized that concomitant activity of CS- and US-encoding neurons should be crucial in both cases. This may be mediated by the memory effect, as suggested in the Discussion of our paper, or by metabotropic effects as suggested above, or by the contribution from other brain regions. We will emphasize in our revision that the overlap in time, however instantiated, is a hypothesis of our model. It is hard to see how plasticity can occur without some memory trace of US. This is a consequence of our larger hypothesis that fear learning uses spike-timing-dependent plasticity; such a hypothesis about plasticity is common in the modeling literature. We will discuss these points in more detail in our revision.
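  
  The dependence on temporal overlap can be illustrated with a generic pair-based spike-timing-dependent plasticity window. The sketch below is not the plasticity rule used in the paper: the function name `stdp_dw`, the amplitudes and the 20 ms time constants are hypothetical, with the depression amplitude set slightly larger than the potentiation amplitude only to loosely echo a depression-dominated rule. It shows that a pre/post spike pair changes the weight appreciably only when the two spikes fall within a few time constants of each other.

```python
import math


def stdp_dw(dt_ms, a_plus=0.010, a_minus=0.012, tau_plus_ms=20.0, tau_minus_ms=20.0):
    """Weight change for one spike pair, with dt_ms = t_post - t_pre.

    All parameter values are hypothetical and chosen only for illustration.
    """
    if dt_ms > 0:
        # Pre before post: potentiation that decays with the time lag.
        return a_plus * math.exp(-dt_ms / tau_plus_ms)
    if dt_ms < 0:
        # Post before pre: depression that decays with the time lag.
        return -a_minus * math.exp(dt_ms / tau_minus_ms)
    return 0.0


for lag_ms in (5.0, 20.0, 100.0, 1000.0, -5.0, -100.0):
    print(f"dt = {lag_ms:7.1f} ms -> dw = {stdp_dw(lag_ms):+.2e}")
```

  Under these assumptions, spikes separated by a second or more contribute essentially nothing, which is why some overlap (or a lasting trace) of CS- and US-driven activity is needed for such a rule to act.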
  
  We thank Reviewer #2 for their comments. Below, we reply to each of them:
  
  1) Gamma oscillations are generated locally; thus, it is appropriate to model in any cortical structure. However, the generation of theta rhythms is based on the interplay of many brain areas; therefore, local circuits may not be sufficient to model these oscillations. Moreover, to generate the classical theta, a laminar structure arrangement is needed (where neurons form layers like in the hippocampus and cortex) (Buzsaki, 2002), which is clearly not present in the BLA. To date, I am not aware of any study which has demonstrated that theta is generated in the BLA. All studies that recorded theta in the BLA performed the recordings referenced to a ground electrode far away from the BLA, an approach that can easily pick up volume conducted theta rhythm generated e.g., in the hippocampus or other layered cortical structure. To clarify whether theta rhythm can be generated locally, one should have conducted recordings referenced to a local channel (see Lalla et al., 2017 eNeuro). In summary, at present, there is no evidence that theta can be generated locally within the BLA. Though, there can be BLA neurons, firing of which shows theta rhythmicity, e.g., driven by hippocampal afferents at theta rhythm, this does not mean that theta rhythm per se can be generated within the BLA as the structure of the BLA does not support generation of rhythmic current dipoles. This questions the rationale of using theta as a proxy for BLA network function which does not necessarily reflect the population activity of local principal neurons in contrast to that seen in the hippocampus.
  
  In both modeling and experiments, a laminar structure does not seem to be needed to produce a theta rhythm. A recent experimental paper, (Antonoudiou et al. 2021), suggests that the BLA can intrinsically generate theta oscillations (3-12 Hz) detectable by LFP recordings under certain conditions, such as reduced inhibitory tone. The authors draw this conclusion from ex vivo slices from mice. The currents that generate these rhythms are in the BLA, since the hippocampus was removed to eliminate hippocampal volume conduction and other nearby brain structures did not display any oscillatory activity. Also, in the modeling literature, there are multiple examples of the production of theta rhythms in small networks not involving layers; these papers explain the mechanisms producing theta from non-laminated structures (Dudman et al., 2009, Kispersky et al., 2010, Chartove et al. 2020). We are not aware of any model description of the mechanisms of theta that does require layers.
  
  2) The authors distinguished low and high theta. This may be misleading, as the low theta they refer to is basically a respiratory-driven rhythm typically present during an attentive state (Karalis and Sirota, 2022; Bagur et al., 2021, etc.). Thus, it would be more appropriate to use breathing-driven oscillations instead of low theta. Again, this rhythm is not generated by the BLA circuits, but by volume conducted into this region. Yet, the firing of BLA neurons can still be entrained by this oscillation. I think it is important to emphasize the difference.
  
  Many rhythms of the nervous system can be generated in multiple parts of the brain by multiple mechanisms. We do not dispute that low theta appears in the context of respiration; however, this does not mean that other rhythms with the same frequencies are driven by respiration. Indeed, in the above answer we showed that theta can appear in the BLA without inputs from other regions. In our paper, the low theta is generated in the BLA by VIP+ neurons. Using intrinsic currents known to exist in VIP+ neurons (Porter et al., 1998), modeling has shown that such neurons can intrinsically produce a low theta rhythm. This is also shown in the current paper. This example is part of a substantial literature showing that there are multiple mechanisms for any given frequency band. We will emphasize these points in our revision; we note that, for any individual case, such as this one, the mechanism needs to be tested experimentally.
  
  3) The authors implemented three interneuron types in their model, ignoring a large fraction of GABAergic cells present in the BLA (Vereczki et al., 2021). Recently, the microcircuit organization of the BLA has been more thoroughly uncovered, including connectivity details for PV+ interneurons, firing features of neurochemically identified interneurons (instead of mRNA expression-based identification, Sosulina et al., 2010), synaptic properties between distinct interneuron types as well as principal cells and interneurons using paired recordings. These recent findings would be vital to incorporate into the model instead of using results obtained in the hippocampus and neocortex. I am not sure that a realistic model can be achieved by excluding many interneuron types.
  
  The interneurons and connectivity that we used were inspired by the functional connectivity reported in (Krabbe et al., 2019) (see above answer to Reviewer #1). As reported in (Vereczki et al., 2021), there are multiple categories and subcategories of interneurons; that paper does not report on which ones are essential for fear conditioning. We did use all the highly represented categories of the interneurons, except NPY-containing neurogliaform cells.
  
  The Reviewer says “I am not sure that a realistic model can be achieved by excluding many interneuron types”. We agree with the Reviewer that omitting other interneuron subtypes and a description of more specific connectivity (soma-, dendrite-, and axon-targeting connections) may limit the ability of our model to describe all the details in the BLA. However, this work represents a first effort towards a biophysically detailed description of the BLA rhythms and their function. As in any modeling approach, assumptions about what to describe and test are determined by the scientific question; details postulated to be less relevant are omitted to obtain clarity. The interneuron subtypes we modeled, especially VIP+ and PV+, have been reported to have a crucial role in fear conditioning (Krabbe et al., 2019). Other interneurons, e.g. cholecystokinin and SOM+, have been suggested as essential in fear extinction. Thus, in the follow-up of this work to explain fear extinction, we will introduce other cell types and connectivity. In the current work, we have achieved our goals of explaining the origin of the experimentally found rhythms and their roles in the production of plasticity underlying fear learning. Of course, a more detailed model may reveal flaws in this explanation, but this is science that has not yet been done.
  
  4) The authors set the reversal potential of GABA-A receptor-mediated currents to -80 mV. What was the rationale for choosing this value? The reversal potential of IPSCs has been found to be -54 mV in fast-spiking (i.e., parvalbumin) interneurons and around -72 mV in principal cells (Martina et al., 2001, Veres et al., 2017).
  
  A GABA-A reversal potential around -80 mV is common in the modeling literature (Jensen et al., 2005; Traub et al., 2005; Kumar et al., 2011; Chartove et al., 2020). Other computational works of the amygdala, e.g. (Kim et al., 2016), consider GABA-A reversal potential at -75 mV based on the cortex (Durstewitz et al., 2000). The papers cited by the reviewer have a GABA-A reversal potential of -72 mV for synapses onto pyramidal cells; this is sufficiently close to our model that it is not likely to make a difference. For synapses onto PV+ cells, the papers cited by the reviewer suggest that the GABA-A reversal potential is -54 mV; such a reversal potential would lead these synapses to be excitatory instead of inhibitory. However, it is known (Krabbe et al., 2019; Supp. Fig. 4b) that such synapses are in fact inhibitory. Thus, we wonder if the measurements of Martina and Veres were made in a condition very different from that of Krabbe. For all these reasons, we consider a GABA-A reversal potential around -80 mV in amygdala to be a reasonable assumption. We will discuss these points in our revision.
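  
  The point about the sign of the synaptic current can be checked with simple arithmetic on the driving force, V - E_GABA. The sketch below is illustrative only: the function name `gaba_driving_force_mv` and the membrane potential of -65 mV are assumptions that do not come from the paper or the cited measurements, while the reversal potentials are the values quoted in this exchange. It reports only the direction of the current at that assumed potential and says nothing about its net functional effect on firing.

```python
def gaba_driving_force_mv(v_membrane_mv, e_gaba_mv):
    """Driving force V - E_GABA in mV; positive means an outward, hyperpolarizing current."""
    return v_membrane_mv - e_gaba_mv


V_MEMBRANE_MV = -65.0  # hypothetical membrane potential, assumed for illustration

# Reversal potentials quoted in the review and the response above.
for e_gaba_mv in (-80.0, -75.0, -72.0, -54.0):
    force_mv = gaba_driving_force_mv(V_MEMBRANE_MV, e_gaba_mv)
    direction = "hyperpolarizing" if force_mv > 0 else "depolarizing"
    print(f"E_GABA = {e_gaba_mv:6.1f} mV -> driving force {force_mv:+6.1f} mV ({direction})")
```

  At the assumed -65 mV, the values of -80, -75 and -72 mV all give a hyperpolarizing current that differs only in magnitude, whereas -54 mV reverses its sign.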
  
  5) Proposing neuropeptide VIP as a key factor for learning is interesting. Though, it is not clear why this peptide is more important in fear learning in comparison to SST and CCK, which are also abundant in the BLA and can effectively regulate the circuit operation in cortical areas.
  
  We do not think that VIP is necessarily more fundamental in fear learning, and certainly not for fear extinction. We will make this clear in the revision.
  
  We thank Reviewer #3 for their comments and for recognizing that we achieved our modeling aims. We reply to the criticisms below.
  
  Weaknesses:
  
  The main weakness of the approach is the lack of experimental data from the BLA to constrain the biophysical models. This forces the authors to use models based on other brain regions and leaves open the question of whether the model really faithfully represents the basolateral amygdala circuitry. Furthermore, the authors chose to use model neurons without a representation of the morphology. However, given that PV+ and SOM+ cells are known to preferentially target different parts of pyramidal cells and given that the model relies on a strong inhibition from SOM to silence pyramidal cells, the question arises whether SOM inhibition at the apical dendrite in a model representing pyramidal cell morphology would still be sufficient to provide enough inhibition to silence pyramidal firing. Lastly, the fear learning relies on the presentation of the unconditioned stimulus over a long period of time (40 seconds). The authors justify this long-lasting input as reflecting not only the stimulus itself but as a memory of the US that is present over this extended time period. However, the experimental evidence for this presented in the paper is only very weak.
  
  Many of these issues were addressed in the previous responses.
  
  1) Our neurons were constrained by electrophysiological properties in response to hyperpolarizing currents in the BLA (Sosulina et al., 2010). We chose the specific currents, known to be present in these neurons, to replicate those responses.
  
  2) Though a much more detailed description of BLA interneurons was given in (Vereczki et al., 2021), it is not clear that this level of detail is relevant to the questions that we were asking, especially since the experiments described were not done in the context of fear learning.
  
  3) It is true that we did not include the morphology, which undoubtedly makes a difference to some aspects of the circuit dynamics. As we described above, modeling requires the omission of many details to bring out the significance of other details.
  
  4) As described above, some form of memory or overlap in the activity of the excitatory projection neurons is necessary for spike-timing-dependent plasticity. In modeling, one must be specific about hypotheses, and describe why they are plausible, if not proved; indeed, modeling can explain known phenomena by showing how they are consequences of some (plausible) hypotheses, which themselves are open to experimental verification.
  
  5) The 40 seconds is not necessary if there are multiple presentations.
  
  Other critiques:
  
  1) It is correct that PV+ and SOM+ preferentially target different parts of excitatory projection neurons and that the model relies on a strong inhibition from SOM+ and PV+ to silence the excitatory projection neurons. This choice of parameters comes from using simplified models: it is standard in modeling to adjust parameters to compensate for simplifications.
  
  2) The SOM+ inhibition of the pyramidal cell firing can be seen as a hypothesis of our model. It is well known that VIP+ cells disinhibit pyramidal cells through inhibition of SOM+ and PV+ cells, which is all we are using in our model; hence this hypothesis is generally believed.
  
  The authors achieved the aim of constructing a biophysically detailed model of the BLA not only capable of fear learning but also showing spectral signatures seen in vivo. The presented results support the conclusions with the exception of a potential alternative circuit mechanism demonstrating fear learning based on a classical Hebbian (i.e. non-depression-dominated) plasticity rule, which would not require the intricate interplay between the inhibitory interneurons. This alternative circuit is mentioned but a more detailed comparison between it and the proposed circuitry is warranted.
  
  We agree with the reviewer that it would be good to have a more detailed comparison with the classical Hebbian rule (non-depression-dominated rule). However, we demonstrated in Supplementary Materials that the non-depression-dominated rule is less robust and only operates within a limited window of PV+ excitation. We will have a more robust discussion of plasticity in the revision.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.28.538604v2
www.biorxiv.org www.biorxiv.org

New submission 17/08/2023, 10:52:03

1
1. Public_Reviews 17 Aug 2023
  
  in eLife
  
  Author Response
  
  We would like to thank the reviewers for their careful reading of the manuscript and for the positive feedback and constructive criticism that they have provided. We intend to incorporate this feedback into an improved and updated version of the manuscript. We will address the reviewer comments point by point when we submit an updated version but for now we would like to discuss the major points that we intend to address.
  
  The first concern raised by the reviewers related to the specificity of the BDNF and TrkB staining. We agree that this is an important concern. We tested several antibodies and staining protocols and found that the optimal protocol involved the antibody used in this paper (abcam ab108319), in combination with a heat induced epitope retrieval (HIER) step. Together, this gave robust staining of BDNF in cerebellar tissue and the results of quantification of the staining were in agreement with a BDNF ELISA that we carried out to measure levels of BDNF in the cerebellar vermis of WT and SCA6 mice (Cook et al., 2022). We outline the epitope retrieval method briefly in the methods section of this manuscript but in a revised version we will include further details and data showing the troubleshooting and validation experiments that we have conducted.
  
  Another concern raised by the reviewers is that 7,8-DHF may not be acting as a TrkB agonist. There has been controversy over the mechanism of action of 7,8-DHF and we welcome the opportunity to discuss the issue further in the present manuscript. We have some evidence that 7,8-DHF is acting via TrkB in this case, as we had previously shown that 7,8-DHF administration to SCA6 mice leads to increased cerebellar TrkB levels and phosphorylation of Akt, an activation event known to be downstream of TrkB (Cook et al., 2022). This implicates TrkB in the mechanism of rescue in this case, but we have not demonstrated this directly. We acknowledge that 7,8-DHF could be acting via a different mechanism, such as anti-oxidant or anti-inflammatory effects. This would be interesting and could be followed up on in the future, potentially providing further insights into the pathophysiology of SCA6. We plan to revise the manuscript and provide additional discussion of the potential mechanism of action of 7,8-DHF. Despite this uncertainty, we believe that the finding that 7,8-DHF rescues early endosome abnormalities is a valuable addition to the paper. Whatever the mechanism of 7,8-DHF, this compound holds promise for potential treatment of SCA6.
  
  With further staining experiments and addition of information to the text, we feel confident that we can address the concerns of the reviewers and that an updated version will strengthen our manuscript and thereby provide valuable insight into the pathophysiology and potential treatment of SCA6.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.26.546542v2
www.biorxiv.org

Complexes of vertebrate TMC1/2 and CIB2/3 proteins form hair-cell mechanotransduction cation channels

1
1. Public_Reviews 17 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank reviewers for their evaluation of our work and their thorough critiques, which we will address in an upcoming revised version of the manuscript. We note that work on mouse and fish CIB knockouts in our laboratories started over a decade ago and our discoveries are contemporary to those recently presented by Liang et al., 2021 and Wang et al., 2023, which we acknowledge, cite, and give credit as appropriate. We also note that work on fish knockouts and on fish Cib3 is completely novel.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.26.542533v1
www.biorxiv.org

New submission 17/08/2023, 10:31:34

1
1. Public_Reviews 17 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Public Review:
  
  The authors report the first use of the bacterial Tus-Ter replication block system in human cells. A single plasmid containing two divergently oriented five-fold TerB repeats was integrated on chromosome 12 of MCF7 cells. ChIP and PLA experiments convincingly demonstrate the occupancy of Tus at the Ter sites in cells. Using an elegant Single Molecule Analysis of Replicated DNA (SMARD) assay, convincing data demonstrate the replication block at Ter sites dependent on the presence of the protein. As an orthogonal method to demonstrate fork stalling, ChIP data show the accumulation of the replicative helicase component MCM3 and the repair protein FANCM around the Ter sites. It is unclear whether the Ter sites integrated by a single copy plasmid have any effect on the replication of this region but the data show that the observed effects are dependent on expression of the Tus protein. The SMARD data do not reveal what proportion of forks are arrested at Tus/Ter, or how long the fork delay is imposed. Fork stalling led to a highly localized gammaH2AX response, as monitored by ChIP using primer pairs spread along the integrated plasmid carrying the Ter sites. This response was shown to be dependent on ATR using the ATR inhibitor VE-822. This contrasts with a single Cas9-induced DSB between the two Ter sites, which causes a more spread gammaH2AX response. While this was monitored only at a single distal site, the difference between the DSB and the Tus-induced stall is very significant. Interestingly, despite evidence for ATR activation through the gammaH2AX response, no evidence for phosphorylation of ATR-T1989, CHK1-S345, or RPA2-S33 could be found under fork stalling conditions. The global replication inhibitor hydroxyurea (HU) elicited phosphorylation of ATR-T1989, CHK1-S345, or RPA2-S33. In this context, it would have been of interest to examine if a single DSB in the Ter region leads to phosphorylation of ATR-T1989, CHK1-S345, or RPA2-S33 and cell cycle arrest. 
  It is not shown whether the replication inhibitor HU leads to the same widely spread gamma H2AX response. Overall, this is a well-written manuscript, and the data provide convincing evidence that the Tus-Ter system poses a site-specific replication fork block in MCF7 cells leading to a localized ATR-dependent DNA damage checkpoint response that is distinct from the more global response to HU or DSBs.
  
  Author response to public review:
  
  “It is unclear whether the Ter sites integrated by a single copy plasmid have any effect on the replication of this region but the data show that the observed effects are dependent on expression of the Tus protein.”
  
  -The absence of any effect of the TerB sequence alone on fork progression has been studied extensively in both Willis et al., 2014 and Larsen et al., 2014. Furthermore, as the detection of the SMARD signal at the TerB sites depends on the 7.5 kb probe that spans the TerB sites (orange probe, Fig 2B & 2D), it would be impossible to compare replication in this region with and without integration of the single-copy plasmid.
  
  “The SMARD data do not reveal what proportion of forks are arrested at Tus/Ter, or how long the fork delay is imposed.”
  
  -The percentage of fork stalling at the TerB sites, with and without Tus expression, has been quantified in Figure 2E & 2F. Essentially, 36% of forks stall at the TerB block when the Tus-TerB block is active, i.e. 18% in each of the 5’ to 3’ (orange) and 3’ to 5’ (blue) directions.
  
  “It is not shown whether the replication inhibitor HU leads to the same widely spread gamma H2AX response.”
  
  -While we have not shown gH2AX accumulation via ChIP after HU treatment, Supplementary Figure 5A & 5B clearly show increased gH2AX foci when the cells are treated with HU, suggesting a global replication stress response that is in stark contrast to the response to Tus-TerB.
  
  Recommendations for the authors:
  
  Lines 78, 95: In the experimental set-up there are two divergent 5-TerB sites in the orientation that is non-permissive for fork progression regardless of direction. This raises an obvious question: How is the intervening (~1 kb-long) DNA segment replicated? Does it stay under-replicated and then break?
  
  -The reviewers pose an important question about how the intervening sequence flanked by the two TerB sites is replicated, and if this leads to formation of anaphase bridges resulting in breaks. We think this is very plausible and this very question is part of ongoing studies in the lab with the aim to understand how the cell resolves a site-specific block. Unfortunately, this falls outside the scope of the current study.
  
  Also, it is unclear what is meant by non-permissive orientation. This depends on the predominant replication direction. As the construct has Ter repeats in opposite orientations, any direction is non-permissive. These descriptions could be rephrased to avoid confusion.
  
  -The text has been edited to clarify this.
  
  Fig 1A: It would be helpful to annotate the map to show the position of each primer relative to the Ter array. Why is there no signal for pp52?
  
  -Figure 1A has the map of the locus with the annotated primer pairs and their relative positions to the TerB array.
  
  -pp52 is positioned beyond the TerB array so binding of the Tus-His protein there is unlikely, confirming the specificity of the Tus binding to only the TerB array and not to the adjacent chromatin.
  
  Figure 1B: Change Tus to Tus-His to make it easier to understand that the anti-His ChIP is targeting Tus. Provide information what normalization method was used in the ChIP experiments.
  
  -Figure 1B has been edited to reflect this change.
  
  Line 113: Willis et al. 2014 also worked with chromosomal Ter sites, which should be acknowledged here.
  
  -The text has been modified to indicate this. We apologize for the oversight.
  
  Line 126: Define pWB15 and its significance in text.
  
  -The text has been edited to clarify this and mentions pWB15.
  
  Figure 2E, F: Define legend (blue, orange boxes and arrow heads).
  
  -The figure legend corresponding to Figure 2 has a detailed description of the boxes and the arrows.
  
  Figure 3E, 4C: Add map of primers like in Figures 1 and 2.
  
  -A primer map has been added to Figures 3 & 4 and the text has been updated.
  
  Figure 4: Showing that the gammaH2AX response is spread like with the single DSB would bolster the conclusion about the difference between a local and global response. Fig 4A, Lane-3: A loading control for the chromatin fraction is missing.
  
  -Measuring gH2AX chromatin spread after global replication stress can be challenging. We have tried to address the question of global versus local gH2AX response after replication stress by quantifying gH2AX foci in cells treated with and without hydroxyurea, and comparing these with cells that have a functional Tus-TerB block (Supplementary Figure 5A & 5B). A single fork block seems to elicit only a local response, while global replication stress leads to gH2AX accumulation throughout the cell.
  
  -Lamin A/C has been added to Fig 4A as a loading control for the chromatin fraction.
  
  Figure S4: Analyzing ATR, CHK1 and RPA phosphorylation as well as cell cycle profile under single DSB condition may reveal that different localized responses exist. I mention this because it was reported in yeast that a single DSB in G1 cells leads to a similarly localized Mec1 (ATR)-dependent response that does not elicit phosphorylation of Rad53 (CHK1) and other downstream targets, but leads to H2A phosphorylation as well as phosphorylation of RPA and the Rad51 paralog Rad55 (see PMCID: PMC2853130). It might be of interest to the reader to discuss this publication and the commonalities and differences between both localized checkpoint responses.
  
  -The reviewers raise an interesting question about the phosphorylation of ATR/CHK1/RPA and its effect on the cell cycle after a single DSB. The aim of using the Cas9 break site in this study was merely to corroborate previously published observations pertaining to the spread of gH2AX after a DSB and to contrast that with the local response seen with Tus-TerB. Thus, while this is an intriguing question, we do not think this particular experiment will help in understanding the localized checkpoint response after a single replication fork block. However, we have included the observations previously published in the yeast system (PMC2853130) in our discussion, as they help to compare and contrast fork blocks and DSBs further. It is worth noting, though, that the yeast studies examined the cellular response to a DSB in G1.
  
  Lines 256-260: In the discussion of ATRIP, unpublished data are discussed that show no increase in ssDNA. What is the effect of ATRIP depletion? Maybe delete this mention of unpublished data, if no new data can be provided. The authors are aware that this makes the mechanism of ATR activation at the 5-TerB site elusive.
  
  -This statement has been deleted and the text has been modified.
  
  Another possibility discussed by the authors is fork reversal. Since the Tus/Ter complex blocks CMG progression, fork reversal would result in a chicken foot structure with the long single-stranded 3'-overhang of an Okazaki fragment site. Such a structure should be protected from degradation by BRCA2 or RAD52 proteins. Is there any role for these proteins in the checkpoint activation at the TerB site?
  
  -The reviewers suggest an interesting scenario where the reversed fork structure induced by the Tus-TerB block could be protected by the loading of known DNA repair proteins, and this in turn could lead to a signaling mechanism and checkpoint activation. While we have not tested this hypothesis, nor studied the temporal dynamics of the formation of the reversed fork with respect to gH2AX accumulation, we think the localized gH2AX signal observed in the vicinity of the block is what initiates the downstream DDR response, promoting fork stabilization, followed either by fork reversal and restart or fork collapse. If the reversed fork were responsible for the gH2AX signaling, one would expect the signal to be more widespread, perhaps decorating the entire stretch of DNA between the block and the reversed fork. However, further studies are warranted to tease out this mechanism and the spatio-temporal dynamics.
  
  Lines 292-294: The authors state that "unpublished work from our laboratory has demonstrated that replication forks are cleaved at or near the TerB site..." Unless the data are shown, it might be best to eliminate discussion of unpublished work, also because the occurrence of DNA ends at Ter sites was already described in Willis et al. 2017.
  
  -The statement has been deleted and Willis et al. 2017 has been referenced.
  
  Suppl Table 1: It would help to also show representative images of stretched fibers in addition to the summary data shown.
  
  -Since the data is negative, the fiber images do not show any discernible differences and we do not think it adds useful information.
  
  Suppl Fig 4. ChIP for gamma H2AX data. It would be helpful to show the distribution of the gamma H2AX signal along the chromosome for both the DSB response and the Tus/Ter response.
  
  -The gH2AX ChIP signal at PP0-2 and PP10 has been included in Supplementary Fig4D. Though not significant for PP0-2, the data strongly suggests that there is increased spread of gH2AX along the chromosome after a DSB, strongly contrasting with the response after Tus-TerB block. The text has been modified to include both primer pairs.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.26.534293v2
www.biorxiv.org

New submission 20/04/2023, 12:24:30

2
1. Public_Reviews 17 Aug 2023
  
  in eLife
  
  Author Response:
  
  The following is the authors' response to the original reviews.
  
  Thank you for sending our manuscript for review and the positive editorial comments. On behalf of all authors, I would like to thank the reviewers for their critical reading of our manuscript and for providing insightful and valuable suggestions. We have revised the discussion section accordingly, including a new supplemental figure to show the results previously stated as “data not shown”. Please see below for detailed explanations.
  
  Reviewer #1 (Public Review):
  
  The manuscript by Zheng et al. examined the disease-causing mechanisms of two missense mutations within the homeodomain (HD) of CRX protein. Both mutations were found in humans and can produce severe dominant retinopathy. The authors investigated the two CRX HD mutants via in vitro DNA-binding assay (Spec-seq), in vivo chromatin-binding assay (ChIP-seq), in vivo expression assay of downstream target genes (RNA-seq), and retinal histological and functional assays. They concluded that p.E80A increased the transactivation activity of CRX and resulted in precocious photoreceptor differentiation, whereas p.K88N significantly changed the binding specificity of CRX and led to defects in photoreceptor differentiation and maintenance. The authors performed a significant amount of analyses. The claims are sufficiently supported by the data. The results not only uncovered the underlying disease-causing mechanisms, but also can significantly improve our understanding of the interaction between HD-TF and DNA during development.
  
  Thank you for summarizing the key findings and strengths of our manuscript.
  
  Minor concerns:
  
  1. The E80A, K88N and R90W (previously reported by the same group) mutations are located very close to each other in the homeodomain (Figure 1A), but had distinct effects on the activity of CRX. Has the structure of the homeodomain (of CRX) been resolved? If so, could the authors discuss this phenomenon (mutations close to each other but have distinct effects) based on the HD-DNA structure?
  
  In paragraphs 2, 4, and 5 of the discussion section, we have added explanations of how each mutation could affect CRX HD-DNA interactions differently based on published structural studies. We further explain how these biochemical changes relate to the molecular perturbations and cellular phenotypes seen in vivo.
  
  In addition, has this phenomenon been observed in other homeodomain TFs?
  
  Disease-associated missense mutations at residues HD50 (K88) and HD52 (R90) have also been reported in other HD TFs implicated in CNS development (see discussion paragraph 7). Notably, different substitutions at the CRX E80 residue have been reported in multiple CoRD cases, suggesting its essential role in HD-DNA-mediated regulation during retinal development. These new points are now included in the discussion section.
  
  2. The authors should briefly summarize the effects/disease-causing-mechanisms of all the reported CRX mutations in the discussion part. The readers can then have a better overview of the topic.
  
  We have added a concise summary of the previously proposed CRX mutation classification scheme, all characterized Crx mutant mouse models, and their pathogenic mechanisms. Please see paragraph 9 in the discussion section.
  
  3. CRX can also function as a pioneer factor (reported by the same group). Would these HD mutations distinctively affect chromatin accessibility (which then leads to ectopic binding on the genome)?
  
  Prior evidence has demonstrated that regulatory regions for many photoreceptor genes failed to stay accessible upon loss of CRX in the Crx-/- model (PMID: 30068366). It is unclear with the existing data whether CRX could initiate the chromatin remodeling (true pioneering function) of these regions, or it simply maintains the accessibility once these regions became accessible. Future studies comparing epigenomic landscape changes in mutant Crx KI models at various ages can be informative, particularly for the CRX K88N ectopic binding events. Determining how the CRX K88N mutant protein alters chromatin landscape important for photoreceptor fate and/or differentiation during development would shed light on the nature of these ectopic binding events.
  
  4. The discussion part can be shortened and simplified.
  
  We have re-written the discussion section to make it concise and to incorporate discussions on mutant CRX HD structures. Please see the revised manuscript.
  
  Reviewer #2 (Public Review):
  
  Zheng et al. investigated the molecular and functional mechanisms of two homeodomain missense mutations causing human retinal photoreceptor degeneration diseases in photoreceptor development regulated by the CRX transcription factor. They analyzed the E80A mutation associated with dominant cone-rod dystrophy (CRD) and the K88N mutation associated with dominant Leber Congenital Amaurosis (LCA). The authors found that E80A CRX binds to the same target DNA sites as WT CRX, but the binding specificity of K88N CRX is altered from that of WT in an in vitro assay. They generated Crx(E80A) and Crx(K88N) KI mice, performed ChIP assays, and observed that K88N CRX binds to novel genomic regions distinct from the WT-binding sites, while E80A binds to the WT sites. In addition, using the KI mice, they found that E80A and K88N differently affect the expression of Crx target genes. This study is well executed with proper and solid methodologies, and the manuscript is clearly written. This study gives us insight into how single missense CRX mutations lead to different types of human retinal photoreceptor degeneration diseases.
  
  We greatly appreciate the reviewer’s summary and positive comments.
  
  While the study has strengths in principle, it has a couple of weaknesses. One is that this study does not clearly show how well E80A KI mice function as a pathological model of dominant CRD, in which cones are mainly affected first. More data investigating how cones are affected, from histological, molecular, and physiological analyses, would be helpful. For example, in the Discussion, the authors describe the association of E80A with the S-cone opsin promoter only as "data not shown". These data must be presented for the readers. In addition, more molecular insights as to how E80A affects cones will strengthen this study.
  
  The mouse retina is rod dominant and contains only a small number of cones (3% of all photoreceptors) that are born prenatally. This poses technical challenges to appropriately assess cone-specific changes during disease initiation/progression. We are in the process of developing cellular/molecular tools to investigate how cones are being affected in the Crx E80A KI model, but this is beyond the scope of the current study.
  
  At the same time, we have added a supplemental panel showing that, based on P0 retinal immunostaining of the early cone marker RXRγ, cones were initially born and fate-specified in CrxE80A retinas (see Figure S7A). Since the E80A protein also hyper-activated the S-cone opsin promoter-luciferase (Sop-luc) reporter in HEK293 cells (see Figure S7B), we predict that CRX E80A affects cone photoreceptor differentiation in a similar manner as rod photoreceptors. Furthermore, the cone transcriptional program might be more prone to perturbations by abnormal CRX activities. These possibilities require future investigations. For this manuscript, we have included all these points in the discussion section.
  
  Another point is that it will be very valuable if the authors could show how E80A and K88N differently affect the 3D structure of the CRX homeodomain. Even a simulation model would be valuable.
  
  Please see our answer to Point 1 of Reviewer #1. In short, we have added in the discussion section our explanations on how each mutation could affect CRX HD-DNA interactions differently based on structural studies. We further explain how these biochemical changes relate to the molecular perturbations and cellular phenotypes seen in vivo. Additionally, since TF-DNA interactions are diverse and dynamic across binding sites with different sequence features and genomic environments, future studies that systematically and quantitatively evaluate CRX transcriptional activity at different regulatory sequences would be important.
  
  Recommendations for the authors:
  
  Reviewer #2 (Recommendations For The Authors):
  
  As a minor comment, on page 8, second section, "Previous studies have demonstrated the CRX is activated shortly after cell cycle exit in retinal progenitor cells fated to be photoreceptor.", the authors cited refs 66 and 67, which were published in 2015 and 2016. However, this was demonstrated in J. Neurosci. 31(46), 16792-807, 2011, Figure 1. It would be fair for the authors to cite the JN 2011 paper.
  
  Thanks to the reviewer for the suggested reference, we have added it to the revised manuscript.
  
  AuthorResponse
2. Public_Reviews 16 Aug 2023
  
  in eLife
  
  Author Response:
  
  The following is the authors’ response to the previous reviews.
  
  Thank you for sending our revised manuscript for review and the positive editorial comments. On behalf of all authors, I would like to, again, thank the reviewers for their critical reading of our revised manuscript and for providing further suggestions. We have revised the introduction and discussion sections to specifically address the comments made by Reviewer #2. Please see below for detailed explanations.
  
  Reviewer #2 (Public Review):
  
  Overall, the authors have significantly improved the manuscript, but there is still an unclarified point. In response to the inquiry in the initial review about the extent to which E80A KI mice function as a pathological model of dominant CoRD, the authors added data (Figure S7) and addressed this in the sixth section of the discussion. However, the authors mentioned that it is technically too challenging because of the small number of cones. The point is not clear to me: it is possible to analyze cone differentiation and degeneration by immunostaining at multiple stages even though the cone number is small. Cone arrestin and S- and M-opsins become positive at early postnatal stages in the mouse retina. Cone arrestin seems to appear earlier than cone opsins. Cones seem to be born, as judged by detection of RXRg at P0, but are cone arrestin and/or cone opsins expressed in the early postnatal E80A/+ retina? If positive, how about an apoptosis marker? If negative, it seems to be a cone development phenotype rather than a cone degeneration phenotype. If so, the authors should modify the statement that the E80A retina underlies a CoRD-like phenotype. It seems an overstatement.
  
  We greatly appreciate Reviewer 2’s suggestions on further investigating cone photoreceptor phenotypes in the CRX E80A KI mouse model. All the points raised deserve a comprehensive and in-depth study. However, the focus of the current manuscript is to establish a general framework for understanding different missense mutations in homeodomain TFs beyond CRX. We believe that a separate and dedicated study is more appropriate to detail the quantitative molecular and cellular mechanisms of CRX E80A dysfunction in cone and rod photoreceptors, as stated in the last sentence of discussion section paragraph 6: “… quantitative characterization of CRX E80A molecular functions in a cone dominant retina warrants further study to understand its selective effect on the cone differentiation program and help elucidate WT CRX regulatory principles in early photoreceptor development.”.
  
  Clinical diagnosis of cone-rod dystrophy (CoRD) is largely based on functional deficits of cones and rods. The 1-month electroretinogram (ERG) (Figures 5K-M) shows no cone-mediated light responses and reduced rod function in the CrxE80A/+ mouse. These ERG deficits in the CRX E80A KI mouse model are in agreement with CoRD characteristics. Thus, it is reasonable to say that the CRX E80A KI retina phenotype resembles the CoRD phenotype.
  
  Reviewer #2 (Recommendations For The Authors):
  
  As a minor comment, on page 8, second section, "Previous studies have demonstrated the CRX is activated shortly after cell cycle exit in retinal progenitor cells fated to be photoreceptor.", the authors cited refs 66 and 67, which were published in 2015 and 2016. However, it was demonstrated in J. Neurosci. 31(46), 16792-807, 2011, Figure 1. To be scientifically fair, the authors need to cite the JN 2011 paper.
  
  In response to the comment above, the authors cited the JN 2011 paper in a modified sentence, "Animal studies have demonstrated that Crx is first expressed in post-mitotic photoreceptor precursors and maintained throughout life (Refs. 13-15)", moved from the discussion to the introduction. To my knowledge, JN 2011 (new Ref. 15) is the first study to directly demonstrate that Crx begins to be expressed shortly after cell cycle exit of retinal progenitor cells. Refs. 13 and 14 showed Crx expression in adult-stage photoreceptors but did not directly demonstrate Crx expression in post-mitotic photoreceptor precursors. To be scientifically precise, the references should be cited as "Animal studies have demonstrated that Crx is first expressed in post-mitotic photoreceptor precursors (Ref. 15) and maintained throughout life (Refs. 13 and 14)".
  
  Thanks to the reviewer for the precise instruction. We have adjusted the reference order as follows: “Animal studies have demonstrated that Crx is first expressed in post-mitotic photoreceptor precursors13 and maintained throughout life14,15.”, where the JN 2011 paper is reference 13.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.01.526652v3
www.biorxiv.org

New submission 24/07/2023, 08:07:14

1
1. Public_Reviews 16 Aug 2023
  
  in eLife
  
  Author Response:
  
  We thank the reviewers for their thoughtful reviews. We believe that we can address these comments through revisions within the manuscript (writing/analysis) or as matters of clarification. In this preliminary response, we focus on a few aspects of the reviewer comments.
  
  Experimental design
  
  We will ensure that the rationale for our use of 10-minute analytic periods is clear. These time periods were dictated by the sampling duration required to perform accurate neurochemical analyses (and to reserve half of the sample in the event of a catastrophic failure of batch-processing samples). Since neurochemical release may display multiple temporal components (e.g., ACh) during playback stimulation, and these could differ across neurochemicals of interest, we decided to collect, analyze, and report in two periods. Our results suggest that this was appropriate, comparing values across the two stimulus periods and the pre-stimulus control. We decided not to include analyses of the post-stimulus period because this is subject to wider individual and neuromodulator-specific effects and because it weakens statistical power in addressing the core question—the change in neuromodulator release DURING vocal playback.
  
  We called these periods “Stim 1” and “Stim 2”, but each used the same exemplar sequences in the same order.
  
  For behavioral analyses, observation periods were much shorter than 10 mins, but the main purpose of behavioral analyses was to relate to the neurochemical data. As a result, we matched the temporal features of the behavioral and neurochemical analyses. We will ensure that this is clearly described in the revision. We plan a separate report, focused exclusively on a broader set of behavioral responses to playback, that may examine behaviors at a more granular level.
  
  One reviewer expressed concern that we did not utilize a “control” playback stimulus, suggesting white noise as the control. We gave extensive consideration to this in our design. We concluded, based on our previous work, that white noise is not a neutral stimulus and therefore the results would not clarify the responses to the two vocal stimuli. Instead, we opted to use experience as a type of control. This control shows very clearly that temporal patterns and across-group differences in neurochemical response disappear in the absence of experience.
  
  One reviewer comments that our p90-p180 mice are “old”. This is not the case. CBA/CaJ mice display normal hearing for at least 1 year (Ohlemiller, Dahl, and Gagnon, JARO 11: 605-623, 2010) and adult sexual and social behavior throughout our observation period. They are sexually mature adults, appropriate for this study.
  
  Data and statistical analyses
  
  Two reviewers express concerns about our normalization of neurochemical data, suggesting that it diminishes statistical power or is not transparent. We note that normalization is a very common form of data transformation that does not diminish statistical power. It is particularly useful for data forms in which the absolute value of the measurement across experiments may be uninformative. Normalization is routine in microdialysis studies, because data can be affected by probe placement and factors affecting neurochemical processing. Similar to calcium imaging or many electrophysiological recordings, the information is based on a comparison to baseline values. We will consider supplying concentration values in supplemental material.
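As an illustration of the baseline normalization discussed above, the following is a minimal sketch, assuming each sample is expressed as a percentage of the mean of that animal's pre-stimulus samples. The function name, the sample values, and the percent-of-baseline convention are assumptions for illustration; the response does not state the exact formula used.

```python
from statistics import mean


def normalize_to_baseline(samples, baseline_count):
    """Express every sample as a percentage of the mean of the first
    `baseline_count` samples (the assumed pre-stimulus baseline)."""
    if baseline_count <= 0 or baseline_count > len(samples):
        raise ValueError("baseline_count must be between 1 and len(samples)")
    baseline = mean(samples[:baseline_count])
    if baseline == 0:
        raise ValueError("baseline mean is zero; cannot normalize")
    return [100.0 * s / baseline for s in samples]


# Hypothetical concentrations (arbitrary units) from two animals whose
# absolute levels differ fourfold, e.g. because of probe placement.
animal_a = [2.0, 2.0, 3.0, 2.5]
animal_b = [8.0, 8.0, 12.0, 10.0]

norm_a = normalize_to_baseline(animal_a, baseline_count=2)
norm_b = normalize_to_baseline(animal_b, baseline_count=2)
print(norm_a)
print(norm_b)
```

In this toy case the two animals differ in absolute concentration but show the same relative change from baseline, which is the quantity the normalized data are meant to capture.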
  
  Two reviewers comment on correlations we presented, with different perspectives. We will review our correlation analyses to determine if these are appropriate and what should be reported.
  
  Although Reviewer 2 raises several valid issues that we will address in our response and revision, we believe that none represent “major flaws” in the study that challenge the validity of our central conclusions. In brief, we will:
  
  * provide enhanced description of behaviors
  * clarify or modify box-plot representations of data
  * point to our methods that describe corrections for multiple comparisons
  * clarify sample size concerns
  * address questions of correlation between neurochemicals and behavior
  
  Factual Corrections
  
  Two reviewer comments and an associated editorial comment suggest that statistical power is lacking. The reviewer comments are incorrect. If the editorial suggestion is based on those comments, we challenge that as well.
  
  Reviewer 1 states that normalization “creates a baseline period with minimal variation…that could inflate statistical power.” We believe that this statement is incorrect. We will justify elsewhere the rationale for using normalized neurochemical data, but the suggestion that this very common transformation alters statistical power is unwarranted.
  
  Reviewer 2 states, in the 4th Recommendation for the Authors, that sample sizes are too small. The reviewer gives examples of sample sizes of 3, but that is incorrect. In revising figures, we will ensure that sample numbers appear clearly, but the reviewer’s claim that we used a sample size of 3 is not correct. The minimum sample size is 5.
  
  If these reviewer comments are the bases for the editorial recommendation that the manuscript may require additional power, we believe the recommendation is based on incorrect comments.
  
URL

biorxiv.org/content/10.1101/2022.07.02.498564v4
www.biorxiv.org www.biorxiv.org

New submission 15/08/2023, 08:31:26

1
1. Public_Reviews 15 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Summary:
  
  The study by Fang et al. reports a 3D MERFISH method that enables spatial transcriptomics for tissues up to 200um in thickness. MERFISH, as well as other spatial transcriptomics technologies, has been mainly used for thin (e.g., 10um) tissue slices, which limits the dimensions accessible to spatial transcriptomics techniques. Therefore, expanding the capacity of MERFISH to thick tissues represents a major technical advance to enable 3D spatial transcriptomics. Here the authors provide detailed technical descriptions of the new method, troubleshooting, optimization, and application examples to demonstrate its technical capacity, accuracy, sensitivity, and utility. The method will likely have a major impact on future spatial transcriptomics studies to benefit diverse biomedical fields.
  
  Strengths:
  
  The study was well-designed, executed, and presented. Extensive protocol optimization and quality assessments were carried out and conclusions are well supported by the data. The methods were sufficiently detailed and the results are solid and compelling.
  
  We thank the reviewer for the positive comments on our manuscript.
  
  Weaknesses:
  
  The biological application examples were limited to cell type/subtype classification in two brain regions. Additional examples of how the data could be used to address important biological questions will enhance the impact of the study.
  
  We appreciate the reviewer's suggestion that demonstrating the applications of our thick-tissue 3D MERFISH method to addressing important biological questions would enhance the impact of our study. In line with this reviewer comment, we had included examples of how our method could be applied to address various biological questions in the summary (last) paragraph of our manuscript. These examples highlight the versatility and utility of our approach in addressing diverse biological questions beyond cell type classification. However, the goal of this work is to develop a new method and establish its validity. While we are interested in applying it to answer important biological questions in the future, we consider these applications beyond the scope of this current work.
  
  Reviewer #2 (Public Review):
  
  Summary:
  
  In their preprint, Fang et al present data on extending a spatial transcriptomics method, MERFISH, to 3D using a spinning disc confocal. MERFISH is a well-established method, first published by Zhuang's lab in 2015 with multiple follow-up papers. In the last few years, MERFISH has been used by multiple groups working on spatial transcriptomics, including approximately 12 million cell maps measured in the mouse brain atlas project. Variants of MERFISH were used to map epigenetic information complementary to gene expression and RNA abundance. However, MERFISH was always limited to thin ~10um sections to this date. The key contribution of this work by Fang et al. was to perform the optimization required to get MERFISH working in thick (100-200um) tissue sections.
  
  Major strengths and weaknesses:
  
  Overall the paper presents a technical milestone, the ability to perform highly multiplexed RNA measurements in 3D using MERFISH protocol. This is not the first spatial transcriptomics done in thick sections. Wang et al. 2018 - StarMAP used thick sections (150 um), and recently, Wang 2021 (EASI-FISH, not cited) performed serial HCR FISH on 300um sections. Data so far suggest that MERFISH has better sensitivity than in situ sequencing approaches (StarMAP) and has built-in multiplexing that EASI-FISH lacks. Therefore, while there is an innovation in the current work, i.e., it is a technically challenging task, the novelty, and overall contribution are modest compared to recently published work.
  
  This summary is elaborated in more detail in the following paragraphs, and we will address these detailed comments below.
  
  The authors could improve the writing and the manuscript text that places their work in the right context of other spatial transcriptomics work. Out of the 25 citations, 12 are for previous MERFISH work by Zhuang's lab, and only one manuscript used a spatial transcriptomics approach that is not MERFISH. Furthermore, even this paper (Wang et al, 2018) is only discussed in the context of neuroanatomy findings. The fact that Wang et al. were the first to measure thick sections is not mentioned in the manuscript. The work by Wang et al. 2021 (EASI-FISH) is not cited at all, as well as the many other multiplexed FISH papers published in recent years that are very relevant. For example, a key difference between seqFISH+ and MERFISH was the fact that only seqFISH+ used a confocal microscope, and MERFISH has always been relying on epi. As this is the first MERFISH publication to use confocal, I expect citations to previous work in seqFISH and better discussions about differences.
  
  We thank the reviewer for recognizing our work as a technical milestone. Since this work is aimed to build upon the strengths of MERFISH and address some of its limitations, we primarily cited previous MERFISH papers to make it clear what specific improvements have been achieved in this work. Given the rapid growth of the spatial genomics field, it has become impractical to comprehensively cite all method development or improvement papers in this area. Instead, we cited a 2021 review article in the first sentence of the manuscript and limited all discussions afterwards to MERFISH. In the revised manuscript, we will try to find and include more recent review articles to cover method developments since 2021.
  
  Although we presented our work as an advance in MERFISH specifically, we consider the reviewer’s suggestion of citing the 2018 STARmap paper [Wang et al., Science 361, eaat5961 (2018)] in the introduction part of our manuscript reasonable. This STARmap paper was already cited in the results part of our manuscript, and we will further emphasize this paper in the introduction of our revised manuscript, as this 2018 in situ sequencing paper was the first to demonstrate 3D spatial transcriptomic profiling in thick tissues. In addition, we thank the reviewer for bringing to our attention the EASI-FISH paper [Wang et al, Cell 184, 6361-6377 (2021)], which reported a method for thick-tissue FISH imaging and demonstrated imaging of 24 genes using multiple rounds of multi-color FISH imaging. We also recently became aware of a paper reporting 3D imaging of thick samples using PHYTOMap [Nobori et al, Nature Plants 9, 1026-1033 (2023)]. This paper, published a few days after we submitted our manuscript to eLife, demonstrated imaging of 28 genes in thick plant samples using multiple rounds of multicolor FISH and probe targeting and amplification methods previously developed for in situ sequencing. We will include these three papers in the introduction section of our revised manuscript.
  
  However, we do not consider our use of confocal imaging in this work an advance in MERFISH because confocal, like epi-fluorescence imaging, is a commonly used approach that could be applied to MERFISH of thin tissues directly without any alteration of the protocol. Confocal imaging has been broadly used for both DNA and RNA FISH long before any genome-scale imaging was reported. Confocal and epi-imaging geometries have their distinct advantages, and which of these imaging geometries to use is the researcher’s choice depending on instrument availability and experimental needs. Thus, we do not find it necessary to cite specific papers just for using confocal imaging in spatial transcriptomic profiling, but we will see whether it is reasonable to cite these papers in the revised manuscript. Our real advance related to confocal imaging is the use of machine-learning to increase the imaging speed. Without this improvement, 3D imaging of thick tissue using confocal would take a long time and likely degrade image quality due to photobleaching of out-of-focus fluorophores before they are imaged. We thus cited several papers that used deep learning to improve imaging quality and/or speed. Our unique contribution is the combination of machine learning with confocal imaging for 3D multiplexed FISH imaging of thick tissue samples, which had not been demonstrated previously.
  
  To get MERFISH working in 3D, the authors solved a few technical problems. To address reduced signal-to-noise due to thick samples, Fang et al. used non-linear filtering (i.e., deep learning) to enhance the spots before detection. To improve registrations, the authors identified an issue specific to their Z-Piezo that could be improved and replaced with a better model. Finally, the author used water immersion objectives to mitigate optical aberrations. All these optimization steps are reasonable and make sense. In some cases, I can see the general appeal (another demonstration of deep learning to reduce exposure time). Still, in other cases, the issue is not necessarily general enough (i.e., a different model of Piezo Z stage) to be of interest to a broad readership. There were a few additional optimization steps, i.e., testing four concentrations of readout and encoder probes. So while the preprint describes a technical milestone, achieving this milestone was done with overall modest innovation.
  
  We appreciate the reviewer's recognition of the technical challenges we have overcome in developing this 3D thick-tissue MERFISH method. To achieve high-quality thick-tissue MERFISH imaging, we had to overcome multiple different challenges. We agree with the reviewer that the solutions to some of the above challenges are intellectually more impressive than the others that required relatively more mundane efforts. However, all of these are needed to achieve the overall goal, a goal that is considered a milestone by the reviewer. We believe that the impact of a method should be evaluated based on its unique capabilities, potential applications, and its adaptability for broader adoption. In this regard, we anticipate that our reported method will be a valuable and impactful contribution to the field of spatial biology.
  
  Data and code sharing - the only link in the preprint related to data sharing sends readers to a deleted Dropbox folder. Similarly, the GitHub link is a 404 error. Both are unacceptable. The author should do a better job sharing their raw and processed data. Furthermore, the software shared should not be just the MERlin package used to analyze but the specific code used in that package.
  
  We apologize for the invalid Dropbox link. The Dropbox folder was accidentally moved, so the link provided in the manuscript no longer points to the folder. The valid link is now: https://www.dropbox.com/scl/fo/ribx45fnx4zw7kv12sl3w/h?rlkey=fo829wbxmb9mwl6gzivg7vqj3&dl=0. We will also upload the data to a public data repository when submitting the revised manuscript.
  
  The GitHub link that we provided for the MERlin package is, however, valid and will lead to the correct GitHub site. If, for some reason, clicking the link does not work on your computer, copying the URL address into a web browser should work. Following the suggestion by the reviewer, in addition to the MERlin v2.2.7 package itself, we will also share the specific code to use this package for analyzing the data taken in this work in the revised manuscript.
  
URL

biorxiv.org/content/10.1101/2023.07.21.550124v1
www.medrxiv.org www.medrxiv.org

New submission 15/08/2023, 08:24:13

1
1. Public_Reviews 15 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public Review):
 
 Comment 1. The authors used a meta-mask based on previous LC structural studies to delineate the LC on functional scans within two large public datasets (3T CamCAN and 7T HCP).
 
 The rostral part of the LC was characterized by connections to the posterior and anterior cingulate cortices, medial temporal lobe, hippocampus, amygdala and striatum, while the caudal part projected to the parietal cortex, occipital cortex, precentral and postcentral regions, and thalamus. Older ages were associated with less rostral-like connectivity and increased asymmetry. The gradient explained variance above the effects of age, sex and education on some emotional and cognitive measures. In particular, the old-like functional gradient (loss of rostral-like connectivity and more clustered functional organization) was associated with worse performance on emotional memory and emotion regulation tasks but not to executive functioning or self-rated sleep quality.
 
 Participants with higher anxiety and depression also showed less rostral-like connectivity and more asymmetry. Both the aging and the anxiety/depression asymmetry manifested as less rostral-like connectivity in the left LC than the right LC.
 
 A strength of this study is that it is the first to attempt a voxel-based approach to quantifying functional connectivity in the LC. The results finding differences between rostral and caudal LC connectivity patterns are broadly consistent with prior work indicating differences between rostral/caudal LC and should help advance understanding of the LC's connectivity patterns with cortical regions.
 
 We thank the reviewer for the thorough and positive assessment of our manuscript.
 
 Comment 2. A limitation of the study is the challenge of assessing activity not only from the small LC brainstem nucleus but also within it. Given the current spatial limitations of whole-brain functional imaging, the current findings are bolstered by including the 7T 1.6mm isotropic data. Spatial smoothing was applied with a 3mm FWHM isotropic kernel which may have reduced precision.
 
 The reviewer raises valid points. Spatial resolution is indeed a limiting factor for assessing the LC with functional MRI. The choice of including spatial smoothing in the preprocessing was necessary because connectopic mapping requires a measure of spatial smoothness for the gradient calculation (see Haak et al. 2018 NeuroImage). We included a sentence explaining this in the revised version of the manuscript and added it also as an additional limitation.
 
 Comment 3. Another limitation was that the authors made conclusions about clustered functional organization but it was not clear how clustering was quantified.
 
 We thank the reviewer for the comment. Clusterability was quantified in the following way (based on Ngo et al. 2021 NeuroImage). First, gradient maps were clustered into k=2 clusters using the k-means clustering algorithm and then the Calinski-Harabasz criterion was calculated for each individual gradient map, which was used as a measure of clusterability. Higher criterion values were significantly associated with older age (Spearman’s rho = 0.3129, p<0.0089), indicating that the gradient was more clustered in older individuals. This new analysis is now included in the revised version of the manuscript.
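The clusterability measure described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes each gradient map is treated as a flat list of per-voxel gradient values, uses a small hand-written k-means (k=2) with a deterministic initialisation, and computes the Calinski-Harabasz criterion from its standard definition (between-cluster dispersion over within-cluster dispersion, each divided by its degrees of freedom). The function names and example values are hypothetical.

```python
from statistics import mean


def two_means_1d(values, iterations=100):
    """Lloyd's algorithm with k=2 on scalar values.
    Returns one 0/1 cluster label per value."""
    centers = [min(values), max(values)]  # start at the two extremes
    labels = []
    for _ in range(iterations):
        new_labels = [
            0 if abs(v - centers[0]) <= abs(v - centers[1]) else 1
            for v in values
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in (0, 1):
            members = [v for v, lab in zip(values, labels) if lab == j]
            if members:
                centers[j] = mean(members)
    return labels


def calinski_harabasz(values, labels):
    """Calinski-Harabasz criterion: [B / (k - 1)] / [W / (n - k)]."""
    groups = {}
    for v, lab in zip(values, labels):
        groups.setdefault(lab, []).append(v)
    n, k = len(values), len(groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two clusters and more points than clusters")
    grand = mean(values)
    between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups.values())
    within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)
    if within == 0:
        return float("inf")
    return (between / (k - 1)) / (within / (n - k))


# Hypothetical gradient values: a smooth transition vs. two tight clumps.
smooth = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]
clustered = [0.0, 0.05, 0.1, 0.9, 0.95, 1.0]

ch_smooth = calinski_harabasz(smooth, two_means_1d(smooth))
ch_clustered = calinski_harabasz(clustered, two_means_1d(clustered))
print(ch_smooth, ch_clustered)
```

A higher criterion value means the two clusters are tighter and better separated, which is the sense in which a gradient is "more clustered". Relating the per-map values to age would then be a separate rank-correlation step, as in the Spearman analysis reported in the response.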
 
 Reviewer #1 (Recommendations For The Authors):
 
 Comment 1. Would it be equally accurate to state that participants with higher anxiety and depression showed more caudal-like connectivity or are the differences clearly localized to the rostral LC?
 
 We thank the reviewer for the question. Since the gradients are by default a dimensionless scale that was further normalized to the range of 0-1, both interpretations are possible. We hypothesized a loss of rostral-like LC connectivity based on previous literature.
 
 Comment 2. These resting-state findings seem to show some interesting parallels to the structural rostral/caudal LC MRI contrast relationships with cortical thickness in Bachman et al. (2021, Neurobiology of Aging), who found that positive associations between LC contrast and structural thickness were found among older adults for rostral but not caudal LC (corresponding with the rostral regions showing the most age-related change). It is also interesting that in Bachman et al., younger adults showed negative correlations between caudal LC contrast and cortical thickness, which may relate to associations with a more caudal-like connectivity pattern (assuming this is a fair way to interpret the current results) in those with high HADS scores (i.e., rostral LC indicators may reflect stress/anxiety).
 
 We thank the reviewer for pointing out these interesting findings. We included the reference in the revised version of the manuscript.
 
 Comment 3. How was "more clustered functional organization" computed? I could not find description of this in the analysis section. If it is something that is evident from the visual depiction of the surface rendering shown in Fig. 2, please explain as it was not clear to me.
 
 We thank the reviewer for the comment. As mentioned in a previous answer to a comment made by the Reviewer, clusterability was quantified in the following way (based on Ngo et al. 2021 NeuroImage). Gradient maps were clustered into k=2 clusters using the k-means clustering algorithm, then the Calinski-Harabasz criterion was calculated for each individual gradient map, which was then used as a measure of clusterability. Higher criterion values were significantly associated with older age (Spearman’s rho = 0.3129, p<0.0089), indicating that the gradient was more clustered in older individuals. This analysis is now included in the revised version of the manuscript.
 
 Comment 4. In the connectopic mapping methods, it is stated that the analysis starts by calculating functional connectivity matrices between all voxel time series in an ROI and time series from a target mask. That statement sounds as though there would be one time series from the overall target mask. It is then stated that the target mask consisted of brain areas from a cortical and subcortical parcellation. But it is not clarified if (as I assume was the case) time series were extracted for each parcel within the mask (and how many parcels there were - 180?).
 
 We thank the reviewer for the helpful comment. Regarding the target mask, average time series were extracted from each parcel in the atlases separately, and then pairwise correlations were calculated with time series from all voxels in the ROI (the LC). We used the Glasser atlas (which contains 360 parcels) as a cortical parcellation and the Tian atlas (which contains 50 parcels) as a subcortical parcellation. The corresponding section of the manuscript now includes this clarification.
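The parcel-wise connectivity step described above might look like the sketch below, assuming time series are plain lists of equal length. The helper names and the toy data are hypothetical, and the example uses two parcels only; with the atlases named in the response, each ROI voxel would instead be correlated with 360 + 50 = 410 parcel time series.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def parcel_mean_series(voxel_series):
    """Average the voxel time series of one parcel, time point by time point."""
    return [sum(frame) / len(frame) for frame in zip(*voxel_series)]


def connectivity_fingerprint(roi_voxel_series, parcel_series_list):
    """Correlate one ROI voxel with every parcel-average time series."""
    return [pearson(roi_voxel_series, p) for p in parcel_series_list]


# Hypothetical data: two target parcels and one LC voxel, four time points.
parcel_a = parcel_mean_series([[1.0, 2.0, 3.0, 4.0], [3.0, 4.0, 5.0, 6.0]])
parcel_b = parcel_mean_series([[4.0, 3.0, 2.0, 1.0]])
lc_voxel = [0.0, 1.0, 2.0, 3.0]

fp = connectivity_fingerprint(lc_voxel, [parcel_a, parcel_b])
print(fp)
```

Each ROI voxel thus gets one vector of correlation coefficients, with one entry per target parcel rather than a single value for the whole target mask.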
 
 Comment 5. Then it is stated that "Afterwards, we obtained a similarity matrix from the functional connectivity matrices of LC ROI voxels by calculating the eta-squared measure." It would help here to explain a little more to clarify which things are being compared for similarity. Specifically, for which pairs was the eta-squared measure computed?
 
 The eta-squared measure was calculated between the functional connectivity profiles (or “fingerprints”) for all pairs of voxels in the LC ROI. More specifically, one such fingerprint contains the Pearson correlation coefficients between a given LC voxel time series and the regional time series from the target mask. The similarity matrix contains the eta-squared similarity of these fingerprints, therefore one index in the similarity matrix contains the similarity between the fingerprints of two specific LC voxels. The corresponding section of the manuscript now includes this clarification.
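To make the pairwise comparison concrete, here is a minimal sketch of an eta-squared similarity matrix over fingerprints. It uses the form of eta-squared commonly applied to connectivity maps, one minus the ratio of within-pair variation to total variation. Whether the authors' implementation matches this form term for term is an assumption, and the function names and fingerprint values are hypothetical.

```python
def eta_squared(a, b):
    """Eta-squared similarity of two equal-length fingerprints.
    Returns 1.0 for identical fingerprints; lower values mean less similar."""
    if len(a) != len(b) or not a:
        raise ValueError("fingerprints must be non-empty and of equal length")
    grand = (sum(a) + sum(b)) / (2 * len(a))  # mean over both fingerprints
    within = 0.0
    total = 0.0
    for x, y in zip(a, b):
        m = (x + y) / 2  # mean of the two values at this target region
        within += (x - m) ** 2 + (y - m) ** 2
        total += (x - grand) ** 2 + (y - grand) ** 2
    if total == 0:
        return 1.0  # both fingerprints are the same constant
    return 1.0 - within / total


def similarity_matrix(fingerprints):
    """Entry [i][j] holds the eta-squared similarity of voxels i and j."""
    n = len(fingerprints)
    return [
        [eta_squared(fingerprints[i], fingerprints[j]) for j in range(n)]
        for i in range(n)
    ]


# Hypothetical fingerprints for three LC voxels over three target regions.
fingerprints = [
    [0.8, 0.1, -0.3],
    [0.8, 0.1, -0.3],   # identical to voxel 0
    [-0.3, 0.1, 0.8],   # reversed profile
]
sim = similarity_matrix(fingerprints)
print(sim[0][1], sim[0][2])
```

The result is a symmetric voxel-by-voxel matrix with ones on the diagonal, which is the input that the gradient calculation operates on.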
 
 Comment 6. In Fig. 3, I found the labeling of surface renderings confusing (i.e., did high->low apply to both rows? What about 'emotional memory'? Do the top and bottom rows correspond with the R/L LC?).
 
 We thank the reviewer for the helpful comment and made some changes to Fig. 3 to clarify the labels. The upper row shows the right LC, whereas the bottom row shows the left LC. High->low and low->high applies to both rows. Regarding emotional memory, a worse performance on this task resulted in lower scores. With emotional reactivity, higher scores indicate a worse ability to regulate negative ratings on the task, which results in an inverse relationship of this score with the LC gradient features. We also extended the figure label to include this explanation.
 
 Response to Reviewer #2 (Public Review):
 
 Comment 1. One of the major strengths in the current study is the implementation of the fully data-driven, gradient-based method for mapping connectopies of the LC. This approach is especially suited for brain structures that are difficult to localise because the resulted connectopic mapping is relatively robust to ROI definition (Fig. 7 in Haak et al., 2018). However, as a very inclusive definition of the LC (the "meta atlas") was adopted in the study, to what extent the gradient approach can tolerate changes of accuracy and specificity for LC ROI definition is unknown. Some comparative analyses would be helpful to provide assessments on the specificity and stability of the reported gradient pattern.
 
 We thank the reviewer for the positive assessment of our manuscript. Indeed, an advantage of the connectopic mapping approach is that it is less sensitive to minor ROI inaccuracies, which is convenient for the LC. We repeated the gradient calculation using a larger LC mask from Tona et al. (2017), and included a supplementary figure (Figure S3) that shows how the gradients still retain their rostrocaudal pattern using both LC masks.
 
 Comment 2. Haak et al. showed distinct reproducibility within and between subjects when comparing connectopic mappings between M1 and V1. M1 connectopic mapping showed very high consistency across subjects (ICCs > 0.9) compared with V1. This is very reasonable because the functional organisation within M1 is relatively homogeneous. Regarding the reliability of the LC rostro-caudal gradient, the authors only stated that "individual gradient estimation is often not consistent", but direct measurement on the consistency across subjects for the LC gradient was missing. This is important for future LC fMRI studies as more consistent pattern might warrant the application of an atlas-based method otherwise a more individualized pipeline is needed for investigating functional dissociation in LC subregions.
 
 We thank the reviewer for the question. Indeed, investigating the replicability of gradients at the individual level is important. However, regarding the LC, because of the ROI size and the relative shortness of the scans in the Cam-CAN dataset, we did not calculate individual level gradients and resorted to a group-level approach as we described in the method. Therefore, the assessment of individual reliability was outside of the scope of the current study. We included this as a limitation in the Discussion of the revised manuscript.
 
 Comment 3. It puzzles me why a dichotomous rostral vs caudal comparison was used to demonstrate the difference in connectivity patterns along the rostro-caudal gradient, which might be an oversimplistic approach, as described by the authors themselves. In fact, it might be more interesting to include the central "core" LC, which is structurally organized in high density (Fernandes et al., 2012) and functionally distinguishable from the peri-LC "shell" region (Totah et al., 2018; Poe et al., 2022).
 
 We thank the reviewer for the comment. Indeed, during the analyses we tried to delineate a central core region within the LC, however, the functional connections in this region varied greatly between individuals and we failed to reliably detect a functionally distinct central core region using FC. One reason for this might unfortunately be the limited spatial resolution of functional MRI. Instead, we hypothesized that the gradient manifests in fMRI connectivity of the LC by a gradual transition of connectivity profiles between the two dominant extremes of the caudal and rostral LC and we aimed to depict these two extremes in Figure 1. Although it is a simpler approach compared to the results of histological studies, we demonstrate in the paper that it still provides valuable information about LC in aging and LC-related behavioral measures.
 
 Comment 4. The composition of rostral vs caudal connectivity pattern changes over ageing, where the loss of rostral-like connectivity was consistent in bilateral LC whereas the gain of caudal-like connectivity in older subjects was only evident in the left LC. Do the authors have any explanation for this left-lateralised ageing effect, which interestingly coincides with several observations: increased left LC contrast ratios were found during ageing (Betts et al., 2017) and in PD patients (Ye et al., 2022), and reduced left LC-parahippocampal gyrus connectivity was reported in aMCI patients (Jacobs et al., 2015)?
 
 We thank the reviewer for the question. Indeed, we observed lateralized changes in the LC gradients both in connection with aging and cognitive performance. Generally, the LC connects to several highly lateralized cortical networks, e.g. the salience and frontoparietal networks, which might result in an asymmetric plasticity in the LC. Interestingly, neurodegenerative disorders seem to affect the left LC more, e.g. more widespread loss of connectivity between the left LC and resting state networks was found in PD patients, with a correlation between left LC-executive control network connectivity and cognition (Sun et al. 2023). However, the biological basis for this is elusive, as post-mortem studies generally find the bilateral LC symmetric and mostly report pathological changes in the rostral and middle LC (Beardmore et al. 2021). In our case, a possible interpretation is that, with the loss of rostral-like connectivity, previously rostral-like areas lose their specific connections and become more similar to the caudal part in terms of connectivity. In our study, since we did not investigate the cerebellum and the spinal cord, the typical caudal connectivity profile is more non-specific, since some of its dominant connections are not assessed. This interpretation is now included in the revised version of the manuscript.
 
 Reviewer #2 (Recommendations For The Authors):
 
 Comment 1. Minor:
 
 the preprocessing pipeline for HCP 7T data was not reported.
 
 We extended the details of the preprocessing pipeline for the HCP 7T dataset.
 
 Comment 2. - a difference map would be useful to demonstrate the similarity of LC connectivity gradient between CamCAN and HCP dataset.
 
 We have now added a difference map between the CamCAN and HCP gradient in the supplementary material (Figure S2).
 
 Comment 3. - labels for left and right LC were missing in Fig 3.
 
 We corrected the labeling in Figure 3.
 
 Comment 4. - in Statistical Analysis, CamCAN participants were divided into two groups with and without depressive and anxiety symptoms. It is unclear whether participants with high HADS scores were presented with both symptoms or just one of them.
 
 Because of the low number of participants with high depression scores on the HADS test, we defined high HADS scores as individuals scoring above normal on either the anxiety part, the depression part, or both.
 
URL

medrxiv.org/content/10.1101/2023.02.25.23286442v2
www.biorxiv.org www.biorxiv.org

New submission 04/08/2023, 08:44:27

1
1. Public_Reviews 14 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  While there are many models for sequence retrieval, it has been difficult to find models that vary the speed of sequence retrieval dynamically via simple external inputs. While recent works [1,2] have proposed some mechanisms, the authors here propose a different one based on heterogeneous plasticity rules. Temporally symmetric plasticity kernels (that do not distinguish between the order of pre and post spikes, but only their time difference) are expected to give rise to attractor states, asymmetric ones to sequence transitions. The authors incorporate a rate-based, discrete-time analog of these spike-based plasticity rules to learn the connections between neurons (leading to connections similar to Hopfield networks for attractors and sequences). They use either a parametric combination of symmetric and asymmetric learning rules for connections into each neuron, or separate subpopulations having only symmetric or asymmetric learning rules on incoming connections. They find that the latter is conducive to enabling external inputs to control the speed of sequence retrieval.
  
  Strengths:
  
  The authors have expertly characterised the system dynamics using both simulations and theory. How the speed and quality of retrieval vary across phase space has been well studied. The authors are also able to vary the external inputs to reproduce a preparatory followed by an execution phase of sequence retrieval as seen experimentally in motor control. They also propose a simple reinforcement learning scheme for learning to map the two external inputs to the desired retrieval speed.
  
  Weaknesses:
  
  1) The authors translate spike-based synaptic plasticity rules into a way to learn/set connections for rate units operating in discrete time, similar to their earlier work in [5]. The bio-plausibility issues of learning in [5] carry over here, e.g. the authors ignore any input due to the recurrent connectivity during learning and effectively fix the pre and post rates to the desired ones. While the learning itself is not fully bio-plausible, it does lend itself to writing the final connectivity matrix in a manner that is easier to analyze theoretically.
  
  We agree with the reviewer that learning is not ‘fully bio-plausible’. However, we believe that extending the results to a model in which synaptic plasticity depends on recurrent inputs is beyond the scope of this work. We will address this issue in the Discussion in a revised manuscript.
  
  2) While the authors learn to map the set of two external input strengths to speed of retrieval, they still hand-wire one external input to the subpopulation of neurons with temporally symmetric plasticity and the other external input to the other subpopulation with temporally asymmetric plasticity. The authors suggest that these subpopulations might arise due to differences in the parameters of Ca dynamics as in their earlier work [29]. How these two external inputs would connect to neurons differentially based on the plasticity kernel / Ca dynamics parameters of the recurrent connections is still an open question which the authors have not touched upon.
  
  The issue of how external inputs could self-organize to drive the network to retrieve sequences at appropriate speeds is addressed in the last part of the Results section. We believe this issue is independent from how different forms of synaptic plasticity can be achieved using different parameters that describe how calcium triggers synaptic plasticity. We will discuss these issues more clearly in the revised manuscript.
  
  3) The authors require that temporally symmetric and asymmetric learning rules be present in the recurrent connections between subpopulations of neurons in the same brain region, i.e. some neurons in the same brain region should have temporally symmetric kernels, while others should have temporally asymmetric ones. The evidence for this seems thin. Though, in the discussion, the authors clarify 'While this heterogeneity has been found so far across structures or across different regions in the same structure, this heterogeneity could also be present within local networks, as current experimental methods for probing plasticity only have access to a single delay between pre and post-synaptic spikes in each recorded neuron, and would therefore miss this heterogeneity'.
  
  We agree with the reviewer that this is currently an open question. We will describe this issue in more detail in the Discussion of a revised manuscript.
  
  4) An aspect which the authors have not connected to is one of the author's earlier work: Brunel, N. (2016). Is cortical connectivity optimized for storing information? Nature Neuroscience, 19(5), 749-755. https://doi.org/10.1038/nn.4286 which suggests that the experimentally observed over-representation of symmetric synapses suggests that cortical networks are optimized for attractors rather than sequences.
  
  We thank the reviewer for this suggestion. We will add a paragraph in discussion that discusses work on statistics of synaptic connectivity in optimal networks. We expect that in networks that contain two subpopulations of neurons, the degree of symmetry should be intermediate between a network storing fixed point attractors exclusively, and a network storing sequences exclusively. We will also elaborate on predictions our scenario makes on higher order network motifs.
  
  Despite the above weaknesses, the work is a solid advance in proposing an alternate model for modulating speed of sequence retrieval and extends the use of well-established theoretical tools. This work is expected to spawn further works like extending to a spiking neural network with Dale's law, more realistic learning taking into account recurrent connections during learning, and experimental follow-ups. Thus, I expect this to be an important contribution to the field.
  
  We thank the reviewer for the insightful comments.
  
  Reviewer #2 (Public Review):
  
  Sequences of neural activity underlie most of our behavior. And as experience suggests we are (in most cases) able to flexibly change the speed for our learned behavior which essentially means that brains are able to change the speed at which the sequence is retrieved from the memory. The authors here propose a mechanism by which networks in the brain can learn a sequence of spike patterns and retrieve them at variable speed. At a conceptual level I think the authors have a very nice idea: use of symmetric and asymmetric learning rules to learn the sequences and then use different inputs to neurons with symmetric or asymmetric plasticity to control the retrieval speed. The authors have demonstrated the feasibility of the idea in a rather idealized network model. I think it is important that the idea is demonstrated in more biologically plausible settings (e.g. spiking neurons, a network with exc. and inh. neurons with ongoing activity).
  
  Summary
  
  In this manuscript, the authors have addressed the problem of learning and retrieving sequential activity in neuronal networks. In particular, they have focussed on the problem of how sequence retrieval speed can be controlled.
  
  They have considered a model with excitatory rate-based neurons. Authors show that when sequences are learned with both temporally symmetric and asymmetric Hebbian plasticity, by modulating the external inputs to the network the sequence retrieval speed can be modulated. With the two types of Hebbian plasticity in the network, sequence learning essentially means that the network has both feedforward and recurrent connections related to the sequence. By giving different amounts of input to the feed-forward and recurrent components of the sequence, authors are able to adjust the speed.
  
  Strengths
  
  Authors solve the problem of sequence retrieval speed control by learning the sequence in both feedforward and recurrent connectivity within a network. It is a very interesting idea for two main reasons: 1. It does not rely on delays or short-term dynamics in neurons/synapses 2. It does not require that the animal is presented with the same sequences multiple times at different speeds. Different inputs to the feedforward and recurrent populations are sufficient to alter the speed. However, the work leaves several issues unaddressed as explained below.
  
  Weaknesses
  
  The main weakness of the paper is that it is mostly driven by a motivation to find a computational solution to the problem of sequence retrieval speed. In most cases they have not provided any arguments about the biological plausibility of the solution they have proposed e.g.:
  
  Is there any experimental evidence that some neurons in the network have symmetric Hebbian plasticity and some temporally asymmetric? In the references authors have cited some references to support this. But usually the switch between temporally symmetric and asymmetric rules is dependent on spike patterns used for pairing (e.g. bursts vs single spikes). In the context of this manuscript, it would mean that in the same pattern, some neurons burst and some don't and this is the same for all the patterns in the sequence. As far as I see here authors have assumed a binary pattern of activity which is the same for all neurons that participate in the pattern.
  
  There is currently only weak evidence for heterogeneity of synaptic plasticity rules within a single network, though there is plenty of evidence for such a heterogeneity across networks or across locations within a particular structure (see references in our Discussion). The reviewer suggests another interesting possibility, that the temporal asymmetry could depend on the firing pattern of the post-synaptic neuron. An example of such a behavior can be found in a paper by Wittenberg and Wang in 2006, where they show that pairing single spikes of pre and post-synaptic neurons leads to LTD at all time differences in a symmetric fashion, while pairing a pre-synaptic spike with a burst of post-synaptic spikes leads to temporally asymmetric plasticity, with an LTP window at short positive time differences. We will mention this possibility in the Discussion, but we believe fully exploring this scenario is beyond the scope of the paper.
  
  How would external inputs know that they are impinging on a symmetric or asymmetric neuron? Authors have proposed a mechanism to learn these inputs. But that makes the sequence learning problem a two stage problem -- first an animal has to learn the sequence and then it has to learn to modulate the speed of retrieval. It should be possible to find experimental evidence to support this?
  
  Our model does not assume that the two processes necessarily occur one after the other. Importantly, once the correct external inputs that can modulate sequence retrieval are learned, sequence retrieval modulation will automatically generalize to arbitrary new sequences that are learned by the network.
  
  Authors have only considered homogeneous DC input for sequence retrieval. This kind of input is highly unnatural. It would be more plausible if the authors considered fluctuating input which is different for each neuron.
  
  In a revised manuscript, we will add an additional panel to Figure 1 to demonstrate that fluctuating inputs do not qualitatively affect our results.
  
  All the work is demonstrated using a firing rate based model of only excitatory neurons. I think it is important that some of the key results are demonstrated in a network of both excitatory and inhibitory spiking neurons. As the authors very well know it is not always trivial to extend rate-based models to spiking neurons.
  
  I think at a conceptual level authors have a very nice idea but it needs to be demonstrated in a more biologically plausible setting (and by that I do not mean biophysical neurons etc.).
  
  We are confident that our results can be reproduced in networks of excitatory and inhibitory spiking neurons, since previous studies have shown that such networks can exhibit attractor dynamics (e.g. Amit and Brunel 1997, Brunel and Wang 2001) and sequential activity (e.g. Gillett, Pereira, and Brunel 2020). We plan to include a new section with an associated figure in a revised manuscript demonstrating how the flexible speed control can be achieved in an excitatory-inhibitory (E-I) spiking network containing two excitatory populations with distinct plasticity mechanisms.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.22.533836v1
www.biorxiv.org www.biorxiv.org

New submission 14/07/2022, 16:37:36

1
1. Public_Reviews 14 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank the Editor and the Reviewers for the kind words, the helpful suggestions, and the points of critique, which have all helped us substantially strengthen the manuscript. We have made the aesthetic changes requested by Reviewer 2.
  
  Response to Reviewer 2
  
  We thank the Reviewer for their thorough feedback. We provide point by point responses below.
  
  Concern 1
  
  In paragraph 4.2, I found it unclear why the authors find it unsurprising that different experiments would correspond to different betas. I think that this point should be discussed, as beta and N appear in combination in determining the interaction strength. Otherwise, they could try to fit all distributions with the same beta, which would be more natural for me. I guess that the fits would be anyway good to the eye, though quantitatively suboptimal (which could be quantified with the distance introduced).
  
  The reviewer raises valid concerns since as shown in Fig 3, the chosen values for beta, the additional fitting parameter introduced in the agent-based simulation, are: β = 0.18, 0.13, 0.12 and 0.64 respectively for N = 5, 10, 15, 20. We (RS, OM, and OP) find it intriguing that the optimum beta clusters around similar values for N = 5, 10, 15, while the optimum beta for N = 20 is significantly different. We acknowledge that we do not have an explanation why the fitted parameters values are what they are but note that the fitting curve is flat, implying that several beta values could possibly achieve a satisfactory fit. While further agent-based simulations could explore these findings more systematically, we believe that investigating this matter is outside the scope of this paper. Instead, we have acknowledged these points explicitly in the revised discussions.
  
  Portion added to discussions: “As shown in Fig. 3, the chosen values for beta, the additional fitting parameter introduced in the agent-based simulation, are: β = 0.18, 0.13, 0.12 and 0.64 respectively for N = 5, 10, 15, 20. Perhaps it is intriguing that the optimum beta clusters around similar values for N = 5, 10, 15, while the optimum beta for N = 20 is significantly different. While we do not currently have an explanation for why the fitted parameter values are what they are, we note that the fitting curve is flat, implying that several beta values could possibly achieve a satisfactory fit. Further agent-based simulations could explore these findings more systematically, and provide useful insights.”
  
  Concern 2
  
  Citation of previous work on dynamical quorum sensing (lines 51 & 52) I think misses two important points: first these works (and others following them) deal with the appearance of collective oscillations at high density (therefore, the same general problem addressed here); second, Taylor et al. studied also a transition where the oscillators involved did not oscillate at low density, whereas above a density threshold, they display coherent collective oscillations whose period decreases with density - similar to what observed here. I do not think this takes anything away from the originality of this work, which refers to a different system, and models it with different equations, but the parallelism between integrate-and-fire dynamics with quenched noise and excitable dynamics in the presence of noise should in my opinion not be overlooked.
  
  We have explicitly mentioned this in the revised text.
  
  Concern 3
  
  As the authors stress in lines 105 and 132, the analytical model shows that all that really matters in this phenomenon is the fastest frequency of the system. This could be used as an argument to say that the actual frequency distribution of individual fireflies is not all that important, as long as their fastest frequency is comparable. The assumption that they are identical would then sound less radical. Ideally, one could use the numerical simulations to check this, as well as the fact that the phenomenon does not break down when the shortest individual interburst interval Tbmin is narrowly distributed (which could also explain why having a few individuals who can flash at a higher frequency does not affect the outcome).
  
  We thank the reviewer for these observations.
  
  Concern 4
  
  I still feel that the agreement between the model and observations is a bit overstated (line 120). At least, I think the authors may stress that whereas the model predicts that the frequency of the 7-14 minutes oscillations should increase a lot with N, this is not observed in the data. Maybe this mismatch would be reduced if inter-individual variability was added.
  
  Please see the last three paragraphs of the discussion section. In reality, as the swarm size increases, we expect that swarms will no longer be all-to-all connected, and the dynamics of the system will depend upon the speed of propagation of information across the swarm. Precisely how this happens is outside of the scope of the current experimental work and theoretical description presented here.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.03.09.483608v2
www.biorxiv.org www.biorxiv.org

New submission 14/08/2023, 09:18:38

1
1. Public_Reviews 14 Aug 2023
  
  in eLife
  
  Author Response
  
  Thank you for taking the time to manage the reviews of this manuscript. Many helpful suggestions were presented by the two reviewers that will certainly strengthen the revised version of the manuscript. We would like to take the opportunity to provide a provisional response to address concerns and factual errors in the eLife assessment and public reviews. Please see below.
  
  Response to eLife assessment:
  
  The assessment does not appear to reflect reviews entirely accurately. While reviewer 1 was unsatisfied by the “lack of thorough analysis of the experimental outcomes”, the criticism of a lack of sufficient support of our claims was not present in the reviews. Thus, the sentence, “The evidence supporting the claims are interesting although incomplete in some areas”, seems to us excessively negative. Furthermore, while we agree that this work inspires new studies to determine how UBC circuits function in the intact brain and how they promote behaviors, and that “substantial work remains to be conducted” to explore these new avenues, the way the sentence is constructed, and placed directly after “incomplete in some areas”, makes it read as a negative related to the current manuscript, whereas opening doors to new lines of research is certainly positive for the field.
  
  Response to Reviewer 1:
  
  • One of the main criticisms appears to be a lack of quantification of our electrophysiological data and clear explanation of how the model reproduces the behavior of the cells reported here and in previous work. We are thankful for the identification of these omissions. Our extensive work in UBC electrophysiology instructed the development of these models and they reproduce the essential features of ON and OFF UBC spiking responses and mGluR2 and AMPAR conductances accurately, although we agree that we did not present sufficient evidence for this in the manuscript.
  
  • Another major criticism was a lack of consideration of feedback and feedforward inhibition. The goal of Figure 1 was to determine the cell types of labeled UBCs in transgenic mouse lines, which is determined entirely by their synaptic responses to glutamate (Borges-Merjane & Trussell, 2015). Thus, blocking inhibition was essential to produce clear results. Feedback and feedforward inhibition from Golgi cells, which is certainly important in the intact circuit, is not possible to produce in a physiologically realistic way in acute brain slices, because electrical stimulation produces synchronous excitation and inhibition (by directly exciting Golgi cells, rather than their synaptic inputs). The main inhibition that UBCs receive is through mGluR2, which lasts for 100-1000s of milliseconds, and the main excitation that UBCs receive is through mGluR1 and AMPA, which also both last 100-1000s of milliseconds. Thus, these large conductances are unlikely to be significantly shaped by 1-10 ms IPSCs from feedforward and feedback inhibition. For these reasons, it was not our intention to explore GABAergic/glycinergic feedforward and feedback inhibition in the present study.
  
  Factual errors in public reviews:
  
  Reviewer 1, specific point 4:
  
  A) The reviewer accurately points out that the model did not incorporate a change in the amount of glutamate released across release events during trains of presynaptic spikes. We did not find this to be necessary to reproduce the AMPA and mGluR2 currents accurately, because the majority of the response occurs after the last presynaptic stimulus. Short term plasticity during the stimulus train would be expected to change the total amount of glutamate released, but not the time course of the slow current response. We previously showed that the predominant synaptic plasticity that occurs at this synapse during the train is short-term depression that is due in large part to postsynaptic desensitization of AMPA receptors, rather than a change in presynaptic release.
  
  B) The reviewer states that the model does not include desensitization of AMPA receptors. Although there is not a variable that defines desensitization explicitly, the detailed kinetic AMPA receptor model used here accounts for desensitization, which, in fact, mediates slow ON UBC current and is the focus of our previous work. This AMPA receptor model (developed in Balmer et al., 2021 using UBC data from Lu et al., 2017) is a 13-state model, including 4 open states with 1-4 glutamates bound, 4 closed states with 1-4 glutamates bound, 4 desensitized states with 1-4 glutamates bound, and 5 closed states with 0-4 glutamates bound. The transition rates between different states in the model were fit to AMPA receptor currents recorded from dissociated UBCs and they approximate well the ON UBC currents evoked by synaptic stimulation (Balmer et al., 2021).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.11.536335v3
www.biorxiv.org www.biorxiv.org

New submission 14/08/2023, 09:09:18

1
1. Public_Reviews 14 Aug 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 The manuscript is very-well written. Although the study is well-conducted the authors should be more convincing on how bacteria residing in tissues do not induce death. The association with IL-10 cytokine production appears weak and more experiments are needed to make it more robust.
 
 Thank you very much for your thoughtful and constructive feedback on our manuscript. We appreciate your positive assessment of the writing quality and the acknowledgment of the well-conducted nature of the study.
 
 In regard to the reviewer's comment that "The association with IL-10 cytokine production appears weak," we would like to provide a comprehensive response based on the findings and insights presented in our study (Fig 5). We would like to emphasize several key points to further elucidate this association:
 
 The established knowledge underscores IL-10's capacity to hinder the activation and proliferation of macrophages, thereby safeguarding against an overly aggressive immune-inflammatory reaction (as referenced). In our earlier investigations, we demonstrated that NAD+ orchestrates a systemic generation of IL-10, which assumes a pivotal function in curtailing proinflammatory responses across various conditions, such as autoimmune diseases (as referenced), alloimmunity (as referenced), and bacterial infections (as referenced). In our latest research, we divulge that the introduction of NAD+ leads to an elevated occurrence of IL-10-producing CD4+ T cells, CD8+ T cells, and macrophages, although not dendritic cells (depicted in Figure 5B and C). Furthermore, our comprehensive analyses have substantiated that NAD+ administration thwarts pyroptosis by specifically targeting the non-canonical inflammasome pathway. Intriguingly, our in vitro outcomes suggest that the neutralization of the autocrine IL-10 signaling pathway through a neutralizing antibody and an IL-10 receptor antagonist partially reverses the NAD+-mediated blockage of pyroptosis. These in vitro results imply that NAD+ induces the production of IL-10 cytokines by macrophages, contributing to the suppression of pyroptosis. To corroborate our in vitro conclusions, we employed IL-10 knockout mice and wild-type mice, both treated with either NAD+ or a placebo solution. The wild-type mice treated with NAD+ displayed a survival rate exceeding 80%, whereas the IL-10 knockout mice exhibited a survival rate of "only" 40%. These in vivo findings align with our in vitro discoveries, underscoring the crucial role of NAD+-mediated IL-10 cytokine production in impeding pyroptosis through NAD+ and shielding against septic shock. Drawing from our prior and current investigations, we respectfully disagree with the reviewer's characterization of our work as "weak."
 
 Reviewer #2 (Public Review): Iske et al. provide experimental data that NAD+ lessens disease severity in bacterial sepsis without impacting on the host pathogen load. They show that in macrophages, NAD+ prevents Il1b secretion potentially mediated by Caspase11.
 
 Thank you for taking the time to review our manuscript. We appreciate your insightful comments and valuable feedback regarding our study on the protective role and underlying mechanisms of NAD+ in septic shock.
 
 While the in vivo and in vitro data is interesting and hints towards a crucial role of NAD+ to promote metabolic adaptation in sepsis, the manuscript has shortcomings and would profit from several changes and additional experiments that support the claims.
 
 We would like to point out that our current study does not underscore a metabolic adaptation in sepsis but more an immune regulation and a specific blockade of the non-canonical inflammasome signaling machinery.
 
 Conceptually, the definition of sepsis is outdated. Sepsis is not SIRS, as in sepsis-2. Sepsis-3 defines sepsis as infection-associated organ dysfunction. This concept needs to be taken into account for the introduction and when describing the potential effects of NAD+ in sepsis. Also, LPS application cannot be considered a sepsis model, since it only recapitulates the consequence of TLR-4 activation. It is a model of endotoxemia. Also, the LPS data does not allow to draw conclusions about bacterial clearance (L135).
 
 Our study uses highly lethal doses of E. Coli or LPS. These doses have been shown to result in multiple organ failure (1, 2). For many decades, innumerable studies have used LPS as a model of sepsis (3, 4, 5). We used an LPS animal model based on a study published in 2013 by Kayagaki et al. (1), in which the authors reported a novel TLR4-independent mechanism mediated via activated caspase-11. We used the same animal model to demonstrate the specific role of NAD+ in targeting this TLR4-independent, caspase-11-mediated mechanism and to underscore NAD+’s mode of protection.
 
 Moreover, we have used not only LPS but also bacterial infection with E. Coli. We have also previously published an additional research article demonstrating the protective effect against Listeria Monocytogenes (6). The only model we did not use in our current study is the cecal ligation puncture (CLP) model, which is another common animal model for sepsis.
 
 Our conclusions regarding bacterial clearance are based not only on LPS results but also based on the bacterial load measurement and survival (Figure 1B&C) following E. Coli administration in different tissues (kidney and liver) and not LPS.
 
 The authors state that protective effects by NAD were independent of the host pathogen load. This clearly indicates that NAD confers protection via enhancing a disease tolerance mechanism, potentially via reducing immunopathology. This aspect is not considered by the authors. The authors should incorporate the concept of disease tolerance in their work, cite the relevant literature on the topic and discuss their findings in light of the published evidence for metabolic alterations and adaptations in sepsis.
 
 We respectfully disagree with the reviewer’s comment and do not believe that NAD+ enhances disease tolerance. We have supporting data indicating that NAD+ mediates protection via a specific blockade of the non-canonical inflammasome pathway, which prevents an over-zealous immune response that results in organ damage and multiple organ failure (MOF). Moreover, we demonstrate that not only NAD+ mediates protection via a specific blockade of the non-canonical inflammasome pathway but prevents septic shock induced death by an additional immunosuppression mediated by the systemic production of IL-10.
 
 Both Caspase-11 and IL-10 pathways are crucial in NAD+ mediated protection against lethal doses of E. Coli and LPS administration. Figure 5A indicates that caspase-11-/- mice treated with PBS have a modest survival rate (~40% survival) when compared to the group of mice treated with NAD+ (>80% survival). These data indicate that NAD+ promotes survival via a caspase-11-independent mechanism. Similarly, wild type mice subjected to NAD+ administration exhibited >80% survival, while NAD+ administration to IL-10-/- mice resulted only in a 40% survival rate. Based on these findings, we believe that NAD+ mediated protection against septic shock via a caspase-11 blockade and by IL-10 cytokine production that dampened the overzealous immune response rather than a disease tolerance.
 
 For the in vitro data, the manuscript would benefit from additional experiments using in vitro infection models.
 
 In the current study we have used two in vivo models using LPS and E. Coli a gram-negative bacterium. We have also previously reported the protective role of NAD+ in the context of Listeria Monocytogenes (6) a gram-positive bacterium. In the current study, our aim was to demonstrate the inhibitory role of NAD+ on the non-canonical pathway specifically. We believe that additional in vitro experiments for this study are out of scope.
 
 In the merged manuscript, the authors provide two different versions of the figures. In one, bar plots are shown without individual data and in the other with scatter plots. All bar plots need to be provided as scatter plots showing individual values.
 
 As requested by reviewer #2 all bar plots are now provided as scatter plots showing individual values.
 
 The authors should show further serology data for kidney and liver failure etc. as well as further cytokine data such as IL-6 and TNF to better characterize their models.
 
 We did not perform further serology analysis, but we did measure IL-6 and TNFα in mice treated with NAD+ or PBS. Mice treated with NAD+ had a reduced systemic level of both cytokines IL-6 and TNFα. We have now added the figures (Figure 1F). In addition, we performed a long-term survival, and all mice treated with NAD+ recovered fully after 10 days and survived over a year after infection. In addition, the mice that survived following NAD+ treatment died of old age.
 
 Careful revision of the entire manuscript, the figure legends and figures is required. The figure legend should not repeat the methods and materials section. The nomenclature for mouse protein and genes needs to be thoroughly revised.
 
 A careful revision of the entire manuscript has been performed.
 
 L350. The authors write that they dissect the capacity of NAD+ to dampen auto- and alloimmunity. In this work, no data that supports this statement is shown and experiments with autoantigens or alloantigens are not performed.
 
 We thank the reviewer for this comment. We have now re-phrased our last sentence in the discussion and included references for our previous work. We have now stated: “We have previously reported that NAD+ administration can block auto- (7) and allo-immunity (8) via IL-10 cytokine production. Here, we unveiled the capacity of NAD+ to protect against sepsis-induced death via a specific blockade of the non-canonical inflammasome pathway and a robust immunosuppression mediated by IL-10 cytokine production.”
 
 L163 The authors describe pyroptosis but in the figure legend call it apoptosis. Specific markers for each cell death should be measured and determined which cell death mechanisms is involved.
 
 We thank the reviewer for this comment. We have focused on pyroptosis-mediated cell death and not apoptosis. We have now replaced the term “apoptosis” with “pyroptosis-mediated cell death”.
 
 Animal data come from an infection model and LPS application. The RNAseq data are obtained from cells primed with Pam3CSK4 and subsequently subjected to LPS. It is unclear how the cell culture model reflects the animal model. As such, the link between IFN signaling and the bacterial infection/LPS model is not convincing and needs to be further elaborated.
 
 Our findings, depicted in Figure 3, pertain exclusively to in vitro investigations rather than in vivo examinations. Our research has demonstrated the selective inhibition of the non-canonical inflammasome pathway by NAD+, with a primary focus on unraveling the specific signaling pathway influenced by NAD+. Our in vitro outcomes indicate that the introduction of recombinant IFN-β counteracted the inhibitory effect of NAD+ on the non-canonical pathway. However, it is important to note that we have not evaluated the IFN-β pathway in our E. coli and LPS in vivo models. Our primary intention was to decipher the roles of IFN-β and NAD+ in inhibiting the non-canonical inflammasome, without extending our investigation to the broader in vivo scenarios.
 
 Figure 5: It is unclear how many independent survival experiments were done, how many mice per group were used and whether the difference between groups was statistically significant. This information should be added.
 
 We have now included the number of experiments, p values and number of animals used in Figure 5.
 
 Further experiments with primary cells from Il10 k.o. and Caspase11 k.o. animals should be provided that support the findings in macrophages.
 
 We concur with the reviewer's suggestion regarding the need for further experiments involving primary cells from IL-10-/- and Caspase-11-/- mice. However, we are uncertain about the potential contribution of these experiments in generating novel or supplementary findings to the existing study.
 
URL

biorxiv.org/content/10.1101/2020.03.29.013649v3
www.biorxiv.org www.biorxiv.org

New submission 14/08/2023, 09:03:06

1
1. Public_Reviews 14 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The goal of this study is to understand the allosteric mechanism of overall activity regulation in an anaerobic ribonucleotide reductase (RNR) that contains an ATP-cone domain. Through cryo-EM structural analysis of various nucleotide-bound states of the RNR, the mechanism of dATP inhibition is found to involve order-disorder transitions in the active site. These effects appear to prevent substrate binding and a radical transfer needed to initiate the reaction.
  
  Strengths of the manuscript include the comprehensive nature of the work - including numerous structures of different forms of the RNR and detailed characterization of enzyme activity to establish the parameters of dATP inhibition. The manuscript could be improved, however, by performing additional experiments to establish that the mechanism of inhibition can be observed in other contexts and it is not an artifact of the structural approach. Additionally, some of the presentations of biochemical data could be improved to comply with standard best practices.
  
  The work is impactful because it reports initial observations about a potentially new mode of allosteric inhibition in this enzyme class. It also sets the stage for future work to understand the molecular basis for this phenomenon in more detail.
  
  We thank the editor and reviewers for their positive evaluation of the potential impact of our work. We completely agree that hypotheses based on structural data require orthogonal experimental verification. However, the number and consistency of the cryo-EM structures speak in favour of the data being representative of conditions in solution. We feel that in particular cryo-EM data should be relatively free of artefacts, e.g. biased or incorrect relative domain orientations or artificially reduced mobility, compared to crystallography, where crystal packing effects can affect these parameters. As we write in response to Reviewer #2, it has been difficult to propose a direct structural mechanism for transmission of the allosteric signal from the a-site in the ATP-cone to the active site and GRD given that the ATP-cones and linker are disordered in the dATP-bound dimers and only partly ordered in the dATP-bound tetramers. Further verification experiments will be performed in future but are outside the scope of the present article.
  
  We will improve the presentation of the biochemical data in a revised version.
  
  General comments:
  
  1) It would be ideal to perform an additional experiment of some type to confirm the order-disorder phenomena observed in the cryo-EM structures to rule out the possibility that it is an artifact of the structure determination approach. Circular dichroism might be a possibility?
  
  Circular dichroism reports only on the approximate relative proportions of helix, sheet and loop structure in a protein; thus we believe that it would not be a sensitive enough tool to distinguish between ordered and disordered states of the GRD. We are considering what alternative methods might be appropriate.
  
  2) Does the disordering phenomenon of one subunit in the ATP-bound structures have any significance - could it be related to half-of-sites activity? Does this RNR exhibit half-of-sites activity?
  
  Half-of-sites activity has not been biochemically proven in any ribonucleotide reductase although it was first suggested in 1987 (PMID: 3298261). However, a strong structural indication was recently published in the form of the holo-complex of the class Ia ribonucleotide reductase from Escherichia coli, which is highly asymmetrical and in which productive contacts forming an intact proton-coupled electron transfer pathway are only formed between one of two pairs of monomers (PMID: 32217749). We have not been able to prove half-of-sites activity for PcNrdD due to low overall radical content, but the structural results are indeed consistent with such an activity.
  
  3) Does the disordering of the GRD with dATP bound have any long-term impact on the stability of the Gly radical? I realize that the authors tested the ability to form the Gly radical in the presence of dATP in Fig. 4 of the manuscript. But it looks like they only analyzed the samples after 20 min of incubation. Were longer time points analyzed?
  
  Radical content was measured after 5 min and 20 min incubation; 5 min incubations (not included in the manuscript) consistently gave higher radical content than 20 min incubations. Longer time points were not analysed, as we assumed that the radical content would be even lower beyond 20 min.
  
  4) Did the authors establish whether the effect of dATP inhibition on substrate binding is reversible? If dATP is removed, can substrates rebind?
  
  This is an interesting question. We measured KDs for dATP in the micromolar range and are hence confident that dATP binding is reversible. Our measurements do not, however, directly prove that inhibition of the enzyme is reversible. Nevertheless, it is worth noting that the protein as purified contained significant amounts of dATP and purification conditions had to be optimised to remove dATP. This is evidence that PcNrdD that has “seen” dATP can subsequently bind substrate in the presence of ATP. We will describe the purification more clearly in a revision.
  
  5) In some figures (Fig. 6e, for example), the cryo-EM density map for the nucleotide component of the model is not continuous over the entire molecule. Can the authors comment on the significance of this phenomenon? Were the ligands validated in any way to ensure that the assignments were made correctly?
  
  Indeed, we sometimes saw discontinuous density for the nucleotides, both in the active site and in the specificity site. However, the break was almost always near the C5’ carbon atom, which is common to all nucleotides. While we cannot readily explain this phenomenon, the nucleotides refined well with full occupancy, giving B-factors similar to those of the surrounding protein atoms. The identity of the nucleotide could always be inferred from a) the size of the base (purine or pyrimidine); b) the known nucleotide combinations added to the protein before grid preparation; c) prior knowledge on the combinations of effector and substrate that have been found valid for all RNRs since the first studies of allosteric specificity regulation.
  
  Reviewer #2 (Public Review):
  
  This manuscript describes the functional and structural characterization of an anaerobic (Class III) ribonucleotide reductase (RNR) with an ATP cone domain from Prevotella copri (PcNrdD). Most significantly, the cryo-EM structural characterization revealed the presence of a flap domain that connects the ATP cone domain and the active site and provides structural insights about how nucleotides and deoxynucleotides bind to this enzyme. The authors also demonstrated the catalytic functions and the oligomeric states. However, many of the biochemical characterizations are incomplete, and it is difficult to make mechanistic conclusions from the reported structures. The reported nucleotide-binding constants may not be accurate because of the design of the assays, which complicates the interpretation of the effects of ATP and dATP on PcNrdD oligomeric states. Importantly, statistical information was missing in most of the biochemical data. Also, while the authors concluded that dATP binding makes the GRD flexible based on the absence of cryo-EM density for the GRD in the dATP-bound PcNrdD, no other support was provided. There was also a concern about the relevance of the proposed GRD flexibility and the stability of the Gly radical. Overall, the manuscript provides structural insights about Class III RNR with an ATP cone domain and how it binds the ATP and dATP allosteric effectors. However, ambiguity remains about the molecular mechanism by which dATP binding to the ATP cone domain inhibits the Class III RNR activity.
  
  Strengths:
  
  1) The manuscript reports the first near-atomic resolution structures of a Class III RNR with an ATP cone domain in complex with ATP and dATP. These structures revealed the NxN flap domain proposed to form an interaction network between the substrate, the linker to the ATP cone domain, the GRD, and loop 2 important for substrate specificity. The structures also provided insights into how ATP and dATP bind to the ATP cone domain of Class III RNR. Also, the structures suggested that the ATP cone domain is directly involved in tetramer formation by forming an interaction with the core domain in the presence of dATP. These observations serve as an important basis for future study of the mechanism of allosteric regulation of Class III RNR.
  
  2) The authors used a wide range of methodologies including activity assays, nucleotide binding assays, oligomeric state determination, and cryo-EM structural characterization, which were impressive and necessary to understand the complex allosteric regulation of RNR.
  
  3) The activity assays demonstrated the catalytic function of PcNrdD and its ability to be activated by ATP and low-concentration dATP and inhibited by high-concentration dATP.
  
  4) ITC and MST were used to show the ability of PcNrdD to bind NTP and dATP.
  
  5) GEMMA was used successfully to determine the oligomeric state of PcNrdD, which suggested that PcNrdD exists in dimeric and tetrameric forms, whose ratio is affected by ATP and/or dATP.
  
  Weaknesses:
  
  1) Activity assays.
  
  The activity assays were performed under conditions that may not represent the nucleotide reduction activity. The authors initiated the Gly radical formation and nucleotide reduction simultaneously. The authors also showed that the amount of Gly radical formation was different in the presence of ATP vs dATP. Therefore, it is possible that the observed Vmax is affected by the amount of Gly radical. In fact, some of the data fit poorly into the kinetic model. Also, the number of biological and technical replicates was not described, and no statistical information was provided for the curve fitting.
  
  The highest turnover activity of PcNrdD measured in the presence of ATP was 1.3 s⁻¹ (470 nmol/min/mg), a kcat comparable to recently reported values for anaerobic and aerobic RNRs from Neisseria bacilliformis, Leeuwenhoekiella blandensis, Facklamia ignava, Thermus virus P74-23, and Aquifex aeolicus (PMID: 25157154, PMID: 29388911, PMID: 30166338, PMID: 34314684, PMID: 34941255). The general trend illustrated in Figure 1 is that ATP has an activating effect, whereas high concentrations of dATP have an inactivating effect. This cannot be explained by suboptimal assay conditions, since our EPR results consistently show that more radical is formed in incubations with dATP than in incubations with ATP. The curve fitting methods used are listed in Materials and Methods (as specified in the Figure 1 legend), and standard errors for all specified curve fitting results (from triplicate experiments) are shown in Figure 1.
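  The two activity figures quoted in this response are linked by the molar mass of the enzyme. The following is a minimal sketch of that unit conversion, not something taken from the manuscript. The function names are hypothetical, and the molar mass is back-calculated from the two quoted values because the response does not state it or say which molecular species it refers to.

```python
def kcat_from_specific_activity(activity_nmol_min_mg: float,
                                molar_mass_g_mol: float) -> float:
    """Turnover number (s^-1) from specific activity (nmol min^-1 mg^-1)."""
    # nmol/min/mg equals umol/min/g; multiplying by g/mol and 1e-6 gives
    # mol product per min per mol enzyme, i.e. min^-1.
    per_minute = activity_nmol_min_mg * molar_mass_g_mol * 1e-6
    return per_minute / 60.0


def implied_molar_mass(kcat_s: float, activity_nmol_min_mg: float) -> float:
    """Molar mass (g/mol) that makes the two quoted numbers consistent."""
    return kcat_s * 60.0 / (activity_nmol_min_mg * 1e-6)


# Values quoted in the response: 1.3 s^-1 and 470 nmol/min/mg.
# The molar mass below is an inference, not a reported value.
implied_mass = implied_molar_mass(1.3, 470.0)
print(f"implied molar mass: {implied_mass / 1000:.0f} kDa")
print(f"round-trip kcat: {kcat_from_specific_activity(470.0, implied_mass):.2f} s^-1")
```

  Under this conversion the two quoted values are mutually consistent for a molar mass of roughly 166 kDa.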
  
  2) Binding assays.
  
  The interpretation of the binding assays is complicated by the fact that dATP binds both a- and s-sites and ATP binds a- and active sites. dATP may also bind the active site as the product. It is unknown if ATP binds s-site in PcNrdD. Despite this complexity, the binding assays were performed under the condition that all the binding sites were available. Therefore, it is not clear which event these assays are reporting.
  
  Both ITC and MST experiments involving ATP and dATP binding to the a-site were performed in the presence of at least 1 mM GTP substrate (5 mM in MST) to fill the active site, and 1 mM dTTP effector to fill the s-site (specified in the legend to Figure 2). These conditions enable binding of ATP or dATP only to the a-site in the ATP-cone.
  
  3) Oligomeric states.
  
  Due to the ambiguity in the kinetic parameters and the binding constants determined above, the effects of ATP and dATP on the oligomeric states are difficult to interpret. The concentrations of ATP used in these experiments (50 and 100 µM) were significantly lower than the KL determined by the activity assays (780 µM), while they are close to the Kd values determined by ITC or MST (~25 µM). Since it is unclear what binding events ITC and MST are reporting, the data in Figure 3 do not provide support for the claimed effects of ATP binding. For the effects of dATP, the authors did not observe a significant difference in oligomeric states between 50 or 100 µM dATP alone vs 50 µM dATP and 100 µM CTP. The former condition has dATP ~2x higher than the Kd and KL (Figure 1b) and therefore could be considered as "inhibited". On the other hand, NrdD should be fully active under the latter condition. Therefore, these observations show no correlation between the oligomeric state and the catalytic activity.
  
  The results in Figure 3 show that in the presence of 100 µM ATP plus 100 µM CTP the oligomeric equilibrium is 64% dimers plus 36% tetramers, and in the presence of 50-100 µM dATP the oligomeric equilibrium is 32% dimers and 68% tetramers. We agree that there is no clear and strong correlation between oligomeric state and inhibition, and we will try to make this clearer in a revised version. Meanwhile, to add some further clarity, SEC experiments at higher nucleotide concentrations will be included in the revision.
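  The concentration argument in this exchange can be made concrete with a simple occupancy estimate. The sketch below assumes a one-site hyperbolic binding model with free ligand approximately equal to total ligand and no cooperativity, and it treats KL as a half-saturation constant. These are illustrative assumptions, not the model used in the manuscript, and the function name is hypothetical.

```python
def fractional_occupancy(ligand_um: float, half_saturation_um: float) -> float:
    """Fraction of sites occupied in a one-site hyperbolic binding model."""
    return ligand_um / (ligand_um + half_saturation_um)


# ATP concentrations and constants as quoted in the reviewer comment (in uM).
ATP_CONCENTRATIONS_UM = (50.0, 100.0)
CONSTANTS_UM = {
    "Kd from ITC/MST (~25 uM)": 25.0,
    "KL from activity assays (780 uM)": 780.0,
}

for label, constant in CONSTANTS_UM.items():
    for atp in ATP_CONCENTRATIONS_UM:
        occupancy = fractional_occupancy(atp, constant)
        print(f"{label}: {atp:.0f} uM ATP -> occupancy {occupancy:.2f}")
```

  Under these assumptions, 50 and 100 µM ATP would give about 0.67 and 0.80 occupancy if the ~25 µM Kd applies, but only about 0.06 and 0.11 if the 780 µM KL applies, which is why it matters which binding event each constant reports.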
  
  4) Effects of dATP binding on GRD structure
  
  One of the key conclusions of this manuscript is that dATP binding induces the dissociation of GRD from the active site. However, the structures did not provide an explanation for how the dATP binding affects the conformation of GRD or whether the dissociation of GRD is a direct consequence of dATP binding or it is due to the absence of nucleotide substrate. Also, Gly radical is unlikely to be stable when it is not protected from the bulk solvent. Therefore, it is unlikely that the GRD dissociates from the active site unless the inhibition by dATP is irreversible. Further evidence is needed to support the proposed mechanism of inhibition by dATP.
  
  We admit that it has been difficult to propose a direct structural mechanism for transmission of the allosteric signal from the a-site in the ATP-cone to the active site and GRD given that the ATP-cones and linker are disordered in the dATP-bound dimers and that the linker can only be partly modelled in the dATP-bound tetramers. Most likely dATP binding causes a change in the dynamics of the linker region and NxN flap that directly affects substrate binding and simultaneously causes disorder of the GRD, given that all are part of a connected system (described as “nexus” in the manuscript). The structures determined in the presence of dATP and CTP show that CTP cannot bind in the absence of an ordered NxN flap.
  
  In any case a major conclusion of the work is that dATP does not inhibit the anaerobic RNR by prevention of glycyl radical formation but by prevention of its subsequent transfer. We agree that further evidence is required to support the proposed mechanism but, given the extent of the data already presented in the manuscript, we feel that such studies should be the subject of a future publication.
  
  5) Functional support for the observed structures.
  
  Evidence for connecting structural observations and mechanistic conclusions is largely missing. For example, the authors proposed that the interactions between the ATP cone domain and the core domain are responsible for tetramer formation. However, no biochemical evidence was provided to support this proposal. Similarly, the functional significance of the interaction through the NxN flap domain was not proved by mutagenesis experiments.
  
  We did actually make mutants to verify the observed interactions in the tetramer, but several of them did not behave well in our hands, e.g. with regard to protein stability. Since we have no evidence that oligomerisation is coupled to inhibition, and since we did not observe any conservation between protein sequences in the interaction area, we chose not to pursue this point further. The main merit of the tetramer structures is that they allowed a high-resolution view of dATP binding to the ATP-cone and a comparison to previously observed ATP-cones. Nevertheless, mutation experiments, also including the NxN flap, could be the subject of future work.
  
  Reviewer #3 (Public Review):
  
  The manuscript by Bimai et al. describes a structural and functional characterization of an anaerobic ribonucleotide reductase (RNR) enzyme from the human microbe, P. copri. More specifically, the authors aimed to characterize the mechanism by which (d)ATP modulates nucleotide reduction in this anaerobic RNR, using a combination of enzyme kinetics, binding thermodynamics, and cryo-EM structural determination. One of the principal findings of this paper is the ordering of a NxN 'flap' in the presence of ATP that promotes RNR catalysis and the disordering of both this flap and the glycyl radical domain (GRD) when the inhibitory effector, dATP, binds. The latter is correlated with a loss of substrate binding, which is the likely mechanism for dATP inhibition. It is important to note that the GRD is remote (>30 Å) from the binding site of the dATP molecule, suggesting long-range communication of the structural (dis)ordering. The authors also present evidence for a shift in oligomerization in the presence of dATP. The work does provide evidence for new insights/views into the subtle differences of nucleotide modulation (allostery) of RNR through long-range interactions.
  
  The strengths of the work are the impressive, in-depth structural analysis of the various regulated forms of PcRNR by (d)ATP using cryo-EM. The authors present seven different models in total, with striking differences in oligomerization and (dis)ordering of select structural features, including the GRD that is integral to catalysis. The authors present several, complementary biochemical experiments (ITC, MST, EPR, kinetics) aimed at resolving the binding and regulatory mechanism of the enzyme by various nucleotides. The authors present a good breadth of the literature in which the focus of allosteric regulation of RNRs has been on the aerobic orthologues.
  
  Given the resolution of some of the structures in the remote regions that appear to be of importance, the rigor of the work could have been improved by complementing these experimental studies with molecular dynamics (MD) simulations to reveal the dynamics of the GRD and loops/flaps at the active site.
  
  We will discuss this option with expert colleagues.
  
  The biochemical data supporting the loss of substrate binding with dATP association is compelling, but the binding studies of the (d)ATP regulatory molecules are not; the authors noted less-than-unity binding stoichiometries for the effectors.
  
  Most of the methods used measure only binding strength, not the number of binding sites (N), whereas ITC also measures number of sites. N is dependent on the integrity of the protein, i.e. the number of protein molecules in a preparation that are involved in binding, and quite often gives lower values than the theoretical number of binding sites.
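  The point about sub-unity stoichiometry can be expressed as a simple ratio. The sketch below assumes that the fitted ITC stoichiometry N scales linearly with the fraction of protein molecules able to bind, and that concentration errors are negligible. The function name and the numbers are hypothetical and are not taken from the manuscript.

```python
def binding_competent_fraction(n_observed: float, n_theoretical: float) -> float:
    """Estimate the fraction of protein able to bind from a fitted ITC N value."""
    if n_theoretical <= 0:
        raise ValueError("n_theoretical must be positive")
    return n_observed / n_theoretical


# Hypothetical illustration only: a fitted N of 0.5 against one expected
# site per monomer. Neither value comes from the manuscript.
example_fraction = binding_competent_fraction(n_observed=0.5, n_theoretical=1.0)
print(f"estimated binding-competent fraction: {example_fraction:.0%}")
```

  In practice, errors in the protein or ligand concentration also shift N, so such a ratio is at best a rough indicator of sample integrity.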
  
  Also, the work would benefit from additional support for oligomerization changes using an additional biochemical/biophysical approach.
  
  SEC (chromatography), GEMMA (mass spectrometry) and cryo-EM were used to study oligomerization. Since each method has restrictions on the nucleotide and protein concentrations that can be used, the results are not directly comparable, but all three methods indicate nucleotide-dependent oligomerization changes. The SEC results will be included in a revised version.
  
  Overall, the authors have mostly achieved the overall aims of the manuscript. With focused modifications, including additional control experiments, the manuscript should be a welcome addition to the RNR field.
  
URL

biorxiv.org/content/10.1101/2023.06.20.545753v2
www.biorxiv.org www.biorxiv.org

New submission 11/08/2023, 09:44:26

1
1. Public_Reviews 11 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Review 1
  
  Public Review
  
  The authors set out to develop an organoid model of the junction between early telencephalic and ocular tissues to model RGC development and pathfinding in a human model. The authors have succeeded in developing a robust model of optic stalk (OS) and optic disc (OD) tissue with innervating retinal ganglion cells. The OS and OD have a robust pattern with distinct developmental and functional borders that allow for a distinct pathway for pathfinding RGC neurites.
  
  This study falls short on a thorough analysis of their single cell transcriptomics (scRNAseq). From the scRNAseq, the quality and quantity of the targeted cell types in the model are unclear. A comparative analysis of the scRNAseq profiles of their cell types with existing organoid protocols, to determine a technical improvement, or with fetal tissue, to determine fidelity to target cells, would greatly improve the description of this model and determine its utility. This is especially necessary for the RGCs developed in this protocol, as the authors recommend this as an improved model to study RGCs.
  
  Future work targeting RGC neurite outgrowth mechanisms will be exciting.
  
  We are grateful to Reviewer 1 for these constructive comments. We added plots for quality control in supp. Fig. S5 and quantification of cell clusters in Tab. 1. We compared the transcriptomes between CONCEPT organoids, Gabriel et al.’s brain/optic organoids (Gabriel et al., 2021; PMID: 34407456), and human fetal retinas HGW9 (Lu et al., 2020; PMID: 32386599), which strongly support our findings (Figs. 5, 6; see responses below for details). Besides FGFs/FGFR signaling, scRNA-seq identified additional candidate molecules that may provide axon guidance functions, and these candidate molecules are the focus of our future study.
  
  Recommendations For The Authors
  
  This study falls short on a thorough analysis of their single cell transcriptomics (scRNAseq).
  
  The scRNAseq figure needs to be better presented to allow for an adequate assessment of the model. As written, the classification of the different clusters is hard to follow. A representative labeling of the suspected identity of the clusters in an infographic would aid the figure. Since it is hard to follow, it is difficult to determine how well clusters correlate with designated cell types. PAX2 expression designating optic stalk seems to correlate well with group 2 and the designation of the optic disc; however, PAX2 expression for the optic stalk is half in group 4 and half in group 9. What are groups 4 and 9? It is also not clear how the thresholding for the given clusters was reached.
  
  To present the scRNA-seq dataset more clearly, we added dotted red lines in Fig. 4C to delineate eye (mostly retinal), telencephalic, and mixed cell populations. In Tab. 1, we show the assigned cell types, counts, and percentages for each cluster.
  
  PAX2+ VSX2- optic stalk cells were at edges of clusters 4, 8, 9 that had dorsal telencephalic identities. Clusters 4, 8, 9 were largely segregated along cell cycle phases (Fig. 4A, B, F), and these clusters differentially expressed gene markers SOX3, FGFR2, PRRX1, EDNRB, and FOXG1 (supp. Fig. S7A-S7D; Fig. 4C). In E14.5 mouse embryos, mouse orthologs of SOX3, FGFR2, PRRX1, and EDNRB were specifically expressed in dorsal telencephalon (Fig. S8A-S8E); Foxg1 was specifically expressed in both dorsal and ventral telencephalon. Therefore, clusters 4, 8, and 9 have dorsal telencephalic identities, and PAX2+ VSX2- optic stalk cells are at edges of these telencephalic clusters. Lines 259-261; 297-298.
  
  Thresholding of cell clusters was determined by the cell clustering parameters, which are described in Materials and Methods: FindVariableFeatures (selection.method = "vst", nfeatures = 2000), ScaleData, RunPCA, ElbowPlot, FindNeighbors (dims = 1:17), FindClusters (resolution = 0.5), and RunUMAP (dims = 1:17). Lines 717-721.
  
  The authors should make an attempt to calculate which different cell types are present and in what proportions. They should also discuss groups that are confounding. Since this is the first description of this technique it is critical to know how much of the model represents mature well-defined cells of interest.
  
  We assigned cell types to clusters and calculated cell counts and proportions of each cluster (Tab.1). The only undetermined cell cluster was cluster 13, which was the smallest one. We described top DEGs of cluster 13 and discussed the cluster. Lines 266-268.
  
  Concerning the focus on RGC isolation: it is interesting that CNTN2 can be used for an effective isolation; however, there are many protocols for generating RGCs. Is CNTN2 expression unique to this protocol? If the authors claim that this protocol could be used for studying glaucoma, how does this protocol improve on the quality of RGCs compared to other protocols?
  
  RGC-specific CNTN2 expression was not unique to CONCEPT organoids. We isolated RGCs via CNTN2 from both CONCEPT organoids and 3-D retinal organoids in suspension. Indeed, isolated RGCs shown in the manuscript were from 3-D retinal organoids (see Materials and Methods for details). Importantly, our single cell RNA sequencing analysis demonstrated that CNTN2 was also differentially expressed in early RGCs from human fetal retinas (Fig. 5L, 5M). Therefore, isolation of human early RGCs via CNTN2 should be applicable widely.
  
  In CONCEPT organoids, RGC differentiation and directional axon growth were very efficient. Our study supports a model in which FGFs from optic disc cells efficiently induce RGC differentiation and directional axon growth in adjacent retinal progenitor cells, as FGFR inhibition drastically decreased the number of RGC somas and directional axon growth (Fig. 9). Therefore, CONCEPT organoids are useful for studying axon guidance cues in humans, knowledge that is much needed for axon regrowth from RGCs that are damaged in glaucoma. Notably, the juvenile glaucoma gene CYP1B1 was found in assigned optic disc cells in both CONCEPT organoids and human fetal retinas (Fig. 4I, 5D), making CONCEPT organoids a testable model for studying the functions of CYP1B1 in human cells.
  
  A comparative analysis of the scRNAseq profiles of their model with existing organoid protocols, to determine a technical improvement, or with fetal tissue, to determine fidelity to target cells, would greatly improve the description of this model and determine its utility.
  
  In the revised manuscript, we compared the transcriptomes between CONCEPT organoids, Gabriel et al.’s brain/optic organoids (Gabriel et al., 2021; PMID: 34407456), and human fetal retinas HGW9 (Lu et al., 2020; PMID: 32386599). Gabriel et al. (2021) report “axon-like” projections in their “optic vesicle-containing brain organoids”. We found that PAX2+ optic disc, PAX2+ optic stalk, FOXG1+ telencephalic, and VSX2+ neuroretinal cell clusters that were found in CONCEPT organoids did not exist in Gabriel et al.’s organoids (supp. Fig. S12), indicating striking differences between Gabriel et al.’s organoids and our CONCEPT telencephalon-eye organoids.
  
  On the other hand, CONCEPT organoids and human fetal retinas HGW9 had similar expression signatures (Fig. 5). First, we identified a PAX2+ cell cluster in the human retinas HGW9. 64/113 DEGs in the PAX2+ cluster from human fetal retinas HGW9 were also DEGs of cluster 2 (assigned PAX2+ optic disc cells) from CONCEPT organoids. Second, CNTN2 was also differentially expressed in early RGCs of human fetal retinas. Third, when cells in cluster 18 and retinal progenitor clusters from the HGW9 dataset were combined with cells in clusters 2, 4, 5, 7 from the CONCEPT dataset for Seurat anchor-based clustering, cells in cluster 18 from HGW9 (H18) were grouped with cluster 2 from CONCEPT organoids (C2, assigned optic disc; N), and these cells expressed both PAX2 and VSX2 (arrowheads in Fig. 5N-5R). A small portion of H18 cells were grouped with cluster 4 from CONCEPT organoids (C4, assigned optic stalk; N), and these cells expressed PAX2 but not VSX2 (arrows in Fig. 5N-5R). Fourth, CONCEPT organoids and human fetal retinas shared many enriched GO terms in DEGs of assigned optic disc cells (Fig. 6).
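  The reported overlap of differentially expressed genes (DEGs) can be restated as a fraction. The sketch below is only an illustration: the helper function and the toy gene names are hypothetical, and only the counts 64 and 113 come from the response above.

```python
def overlap_fraction(reference_degs: set, query_degs: set) -> float:
    """Fraction of reference DEGs that also appear in the query DEG set."""
    if not reference_degs:
        raise ValueError("reference_degs must not be empty")
    return len(reference_degs & query_degs) / len(reference_degs)


# Toy gene sets with placeholder names, for illustration only.
toy_reference = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
toy_query = {"GENE_B", "GENE_D", "GENE_E"}
toy_fraction = overlap_fraction(toy_reference, toy_query)

# Counts reported in the response: 64 of the 113 DEGs in the PAX2+ cluster
# of human fetal retinas HGW9 were also DEGs of CONCEPT cluster 2.
reported_fraction = 64 / 113
print(f"toy overlap: {toy_fraction:.0%}")
print(f"reported overlap: {reported_fraction:.1%}")
```

  The reported counts correspond to an overlap of about 56.6% of the HGW9 PAX2+ cluster DEGs.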
  
  Collectively, transcriptomic comparisons support that our CONCEPT organoids are innovative and similar to human fetal retinas. Lines 325-392.
  
  Not clear what reporting on Lens cells in Figure 3 adds to the focus of the manuscript. The figure seems out of place with the flow of the manuscript.
  
  Lens cells were obvious in CONCEPT organoids. The presence of lens cells indicates that cysts have the developmental potential for both neural and non-neural anterior ectodermal cells. For a better flow, we added a transitional sentence at the beginning of the lens section. Lines 207-208.
  
  Reviewer #2
  
  Public Review
  
  The study by Liu et al. reports on the establishment and characterization of telencephalon eye structures that spontaneously form from human pluripotent stem cells. The reported structures are generated from embryonic cysts that self-form concentric zones (centroids) of telencephalic-like cells surrounded by ocular cell types. Interestingly, the cells in the outer zone of these concentric structures give rise to retinal ganglion cells (RGCs) based on the expression of several markers, and their neuronal morphology and electrophysiological activity. Single-cell analysis of these brain-eye centroids provides detailed transcriptomic information on the different cell types within them. The single-cell analysis led to the identification of a unique cell-surface marker (CNTN2) for the human ganglion cells. Use of this marker allowed the team to isolate the stem cell-derived RGCs.
  
  Overall, the manuscript describes a method for generating self-forming structures of brain-eye lineages that mimic some of the early patterning events, possibly including the guidance cues that direct axonal growth of the RGCs. There are previous reports on brain-eye organoids with optic nerve-like connectivity; thus, the novel aspect of this study is the self-formation capacity of the centroids, including neurons with some RGC features. Notably, the manuscript further reports on cell-surface markers and an approach to generating and isolating human RGCs.
  
  Recommendations For The Authors
  
  The following significant issues, however, need to be addressed:
  
  The authors show RGC-like cells that grow axons toward the Pax2+ cells, suggesting that this is a model for RGC axon pathfinding. Is there support from transcriptomic data on the expression of guidance molecules? In addition, the authors need to characterize Pax2+ cells further. Do some give rise to astrocyte-like cells?
  
  We assessed the expression of known axon guidance genes in CONCEPT organoids. FGF8 and FGF9 trigger axon outgrowth in motor neuron column explants (Shirasaki et al., 2006). In CONCEPT organoids, FGF8 and FGF9 were differentially expressed in assigned optic disc cells; FGFR inhibition drastically decreased the number of RGC soma and directional axon growth (Fig. 9). In addition, SEMA5a and EFNB1 were expressed in both assigned optic disc and stalk cells, EFNB2 was highly expressed in assigned optic disc cells, and NTN1 was mostly expressed in assigned optic cells (supp. Fig. S12). Lines 307-310.
  
  We compared the transcriptomes between CONCEPT organoids, Gabriel et al.’s brain/optic organoids (Gabriel et al., 2021; PMID: 34407456), and human fetal retinas HGW9 (Lu et al., 2020; PMID: 32386599). Gabriel et al. (2021) report “axon-like” projections in their “optic vesicle-containing brain organoids”. We found that PAX2+ optic disc, PAX2+ optic stalk, FOXG1+ telencephalic, and VSX2+ neuroretinal cell clusters that were found in CONCEPT organoids did not exist in Gabriel et al.’s organoids (supp. Fig. S12), indicating striking differences between Gabriel et al.’s organoids and our CONCEPT telencephalon-eye organoids. Lines 327-345.
  
  To authenticate PAX2+ cells in CONCEPT organoids, we analyzed a single-cell RNA-seq dataset of human fetal retinas HGW9 and identified a similar PAX2+ cell population, cluster 18 (Fig. 5). Expression signatures of PAX2+ cells between CONCEPT organoids and human fetal retinas HGW9 were similar. Notably, cluster 18 differentially expressed PAX2, COL9A3, CYP1B1, SEMA5A, and FGF9 (Fig. 5B-5F), which were top DEGs of cluster 2 in CONCEPT organoids (Fig. 4F, 4G, 4I, 4K; SEMA5A was shown in supp. Fig. S12A). Overall, 64/113 DEGs of cluster 18 in human fetal retinas HGW9 were also DEGs of cluster 2 in CONCEPT organoids. In both HGW9 and CONCEPT organoids, expression of OLIG2, CD44, and GFAP was undetectable (supp. Fig. S14), indicating that astrocytes had not been generated yet at these stages.
  
  When cells in cluster 18 and retinal progenitor clusters from the HGW9 dataset were combined with cells in clusters 2, 4, 5, 7 from the CONCEPT dataset for Seurat anchor-based clustering, cells in cluster 18 from HGW9 (H18) were grouped with cluster 2 from CONCEPT organoids (C2, assigned optic disc; N), and these cells expressed both PAX2 and VSX2 (arrowheads in Fig. 5N-5R). A small portion of H18 cells were grouped with cluster 4 from CONCEPT organoids (C4, assigned optic stalk; N), and these cells expressed PAX2 but not VSX2 (arrows in Fig. 5N-5R).
  
  We then compared functional annotations of DEGs (top 200 genes) of cluster 2 in CONCEPT organoids and DEGs (113 genes) of cluster 18 in human fetal retinas HGW9. Top GO terms in GO:MF, GO:CC, and GO:BP are shown (Fig. 6). For DEGs of cluster 2 in CONCEPT organoids, top enriched GO terms in GO:MF, GO:CC, and GO:BP were extracellular matrix structural constituent, collagen-containing extracellular matrix, and system development, respectively. Additional interesting GO:BP terms included axon development, astrocyte development, eye development, response to growth factor, cell adhesion, cell motility, neuron projection development, glial cell differentiation, and signal transduction. For DEGs of cluster 18 in human fetal retinas HGW9, top enriched GO terms in GO:MF, GO:CC, and GO:BP were cell adhesion molecule binding, extracellular space, and developmental process, respectively. Many GO terms were enriched in both samples, further indicating transcriptomic similarities in PAX2+ optic disc cells between CONCEPT organoids and human fetal retinas. Notably, GO terms astrocyte differentiation, neuron projection development, and glial cell differentiation were enriched in the DEGs of assigned optic disc cells for both CONCEPT organoids and human fetal retinas, consistent with expectations.
  
  Transcriptomic comparisons between CONCEPT organoids and human fetal retinas are found in lines 346-392.
  
  The Vsx2+Pax2+ population is not typically detected in vivo in the developing mouse eye. The authors claim that they detected them in vivo, but the data supporting this statement are lacking.
  
  We demonstrate that assigned optic disc cells expressed both VSX2 and PAX2, and this statement is true for both CONCEPT organoids and human fetal retinas HGW9 (Fig. 5N-5R). Please see the underlined sentence in the response to the comment above.
  
  Do the RGCs express subtype-specific markers? Do they detect markers of other retinal neurons typically born early in development (cones, amacrine cells, horizontal cells)? The authors need to compare the transcriptome of different clusters to the published datasets from human and mouse retinae.
  
  The CONCEPT organoids used for scRNA-seq were at an early stage. In this dataset, subtypes of RGCs were undetectable. RGCs isolated via CNTN2 were at more advanced stages. Distinct expression of POU4F2, ISL1, RBPMS, and SNCG indicates multiple subtypes of RGCs (Fig. 7L-7P).
  
  We did find other early retinal neurons in the scRNA-seq dataset: photoreceptor cells, amacrine/horizontal cells in CONCEPT organoids (Fig. 4U-4X), and these cells were also in cluster 11 in which RGCs were found.
  
  We performed transcriptomic comparisons between CONCEPT organoids, brain/optic organoids, and human fetal retinas. We found that PAX2+ optic disc, PAX2+ optic stalk, FOXG1+ telencephalic, and VSX2+ neuroretinal cell clusters that were found in CONCEPT organoids did not exist in Gabriel et al.’s organoids, indicating striking differences between Gabriel et al.’s organoids and our CONCEPT telencephalon-eye organoids (supp. Fig. S13). On the other hand, we found that expression signatures of CONCEPT organoids and human fetal retinas are similar (Figs. 5, 6).
  
  Transcriptomic comparisons are found in lines 325-392.
  
  Fig. 3: where are the "lens like" cells located? The structures in panels B and D look very different. Are these lens-cells toward the periphery or scattered throughout?
  
  Lens cells were dispersed in the zone in which neural retinal cells are located, which is shown in a low-magnification image (Fig. 3K). Panels B and D in Figure 3 show different stages. At early stages, lens clusters were small (Fig. 3B). At later stages, lens clusters became bigger (Fig. 3D).
  
  Fig. 3K and L, TEM images: how do the authors know that these are lens cells?
  
  Western blot of these transparent cell clusters demonstrated that they were lens cells (Fig. 3L).
  
  Fig. 5: The authors claim that a reduced number of Pax2+ cells is associated with entry of the axons. It is not clear if this is just due to physical barriers or to active axon guidance.
  
  We believe that Reviewer 2 referred to the gap region of PAX2 expression in Fig. 7A, 7F. RGC axons grew toward and along adjacent PAX2+ VSX2+ cells. Since PAX2+ VSX2+ cells grossly formed a circular shape, RGC axons followed this circular shape. In a gap region of PAX2 expression, RGC axons exited the circle. The association of RGC axon growth with PAX2+ VSX2+ cells was very robust. Besides PAX2+ cell populations, we did not find any other cell populations that directed RGC axon growth.
  
  Fig. 5K: The authors refer to ALDH1A3 expression in the optic disk, but the presented section does not include the optic disk. In addition, ALDH1A3 is expressed in other regions of the developing retina (Fig. 5K, ref 71).
  
  We are sorry we did not make it clear. We referred to Li et al.’s (2000) paper (Mech Dev 95, 283-289) for Aldh1a3 expression in the optic stalk. Figure 7K was used to show Aldh1a3 expression in peripheral retinas on sections.
  
  Line 263, Reference 68: The authors claim that col13A1 is specific to the human optic disk. However, col13A1 is expressed in many additional eye lineages (PMID: 10865988).
  
  We are sorry we did not make it clear. We meant that Col13A1 is prominently expressed in the optic disc, which is clearly shown in the referred paper (Figure 3D in the paper PMID: 10865988).
  
  The authors show that inhibiting FgfR results in fewer RGCs and loss of directed axonal growth. The number of cells is drastically reduced; thus, the relevance of the finding directly to axon guidance is not resolved.
  
  FGFR inhibition drastically decreased the number of RGC somas (Fig. 9F-9K). Additionally, the remaining RGCs grew almost no directional axons (arrowheads in Fig. 9K), and a few remaining axons wandered around (arrow in Fig. 9K), indicating the role of FGF/FGFR signaling in RGC differentiation and directional axon growth.
  
  Fig. 1H and J: Vsx2 is outside the centroid in panels H and I, but inside the centroid in panels J and K. It is not clear what part of the centroid is shown. This needs to be clarified by adding a scheme.
  
  We are sorry we did not make it clear. We added separate-channel images showing VSX2 and PAX6 expression (supp. Figs. S1, S2) and a new diagram (left panel in Fig. 1B). Overall, FOXG1, VSX2, and PAX6 expression at days 15-17 formed three concentric zones spanning from the center to the periphery. At days 22-26, VSX2 expression expanded peripherally, largely overlapping PAX6 expression (supp. Figs. S1, S2).
  
  Pax6 should be in all cells, also on day 17. Show the separate channels, including DAPI.
  
  We added separate-channel images (supp. Figs. S1, S2). In cysts, PAX6 was expressed in all cells. After cysts attached to the culture surface and grew as colonies, distinct levels of PAX6 expression emerged in concentric zones. At days 17 and 26, PAX6 expression in the central zone (whose cells expressed FOXG1) became lower, which is obvious in separate-channel images (supp. Figs. S1, S2). Consistently, PAX6 expression was low in FOXG1+ telencephalic cells in the scRNA-seq (Fig. 4C, 4D).
  
  Lines 27-30: this is a long and complex sentence which needs to be clarified.
  
  We broke it into a few sentences to make it clearer.
  
  Line 43: fix "Retina" to "Retinal"
  
  We fixed it.
  
  Lines 376-377: repeated "mechanisms of".
  
  We fixed it.
  
  AuthorResponse
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.22.533827v2
www.biorxiv.org www.biorxiv.org

New submission 11/08/2023, 08:54:59

1
1. Public_Reviews 11 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  We would like to thank the Reviewers for their careful reading and the many thoughtful suggestions to improve our manuscript, as well as both the Editors and Reviewers for the generally positive evaluations and encouraging statements.
  
  Editorial assessment:
  
  This important work presents an interesting perspective for the generation and interpretation of phase precession in the hippocampal formation. Through numerical simulations and comparison to experiments, the study provides solid evidence for the role of the DG-CA3 loop in generating theta-time scale correlations and sequences, which would be reinforced through the clarification of the concepts introduced in the study, in particular the notion of intrinsic and extrinsic sequences. This study will be of interest for the hippocampus and neural coding fields.
  
  We appreciate that our work has been considered important. In our revision we made a considerable effort to improve the presentation of our results and the justification of our model assumptions. In particular, we aimed to clarify the meaning of intrinsic and extrinsic sequences by adding figure panels and by fleshing out their definition via spike-timing correlations that are independent of or dependent on the direction of the running trajectory, respectively. To address all the requests, we added 3 new Figures, multiple new Figure panels and simulated a new model variant.
  
  Reviewer #1 in their public review assessed: “The manuscript has the potential to contribute to the way we interpret hippocampal temporal coding for navigation and memory.”
  
  They criticized
  
  The findings generally relate to network models of phase precession (reviewed in e.g., Maurer and McNaughton, 2007, Jaramillo and Kempter, 2017). An important drawback of these models with respect to explaining specific experimentally observed features of phase precession, is that they cannot straightforwardly explain phase precession upon first exposure onto a novel track. This is because, specific connectivity in network models may require experience-dependent plasticity, which would not be possible upon first exposure. This is essential, given that the manuscript addresses the possible origin of phase precession in terms of network models and at minimum, this weakness should be discussed.
  
  We agree with Reviewer #1 (and also with Reviewer #2, who brought up a similar point) that models based on recurrence struggle to explain how the recurrent connectivity matrix should come about. While we feel that a full model of how the 2-d topology in the recurrent weights can be learned goes far beyond the scope of this paper (and to our knowledge has not been solved so far in any existing model), we added a new model variant (new Figure 6 and Supplementary Figure 1), which explains the basic phenomenology of extrinsic and intrinsic sequences without the need for recurrent connections, using only feed-forward synaptic facilitation. Thus, recurrent connections are not necessary for our main findings. However, we would like to point out that this does not exclude the possibility that recurrent connections, if set up in an appropriate way, also contribute to phase precession and theta sequences.
  
  An important and perhaps essential component of the manuscript, is the distinction between extrinsic and intrinsic models. However, the main concepts on which this hinges, namely extrinsic and intrinsic sequences (and the related extrinsicity and intrinsicity) could be better explained and illustrated. Along these lines, the result suggested by the title, namely, hippocampal theta correlations, may be important yet incidental in light of the new concepts (e.g., extrinsicity, intrinsicity) and computational models (e.g., DG-CA3 recurrent loop) that are put forward.
  
  We have added substantial new explanatory material to the figures, captions and text to more didactically introduce the concepts of intrinsicity and extrinsicity. We have also completely rewritten the abstract and added a subtitle: “extrinsic and intrinsic sequences”.
  
  The study seems to put forward novel computational ideas related to neural coding. However, assessing novelty is challenging as this manuscript builds on previous work from the authors, including published (Leibold, 2020, Yiu et al., 2022) and unpublished (Ahmadi et al., 2022. bioRxiv) work. For example, the interpretation of intrinsic sequences in terms of landmarks had been introduced in Leibold, 2020.
  
  We agree with the reviewer that this paper touches on many related ideas from previous papers (not only of our lab) and is meant to tie up loose ends. Thus, the novel contribution is a biologically plausible mechanistic model of how intrinsic sequences and 2-d place maps interact on the level of interconnected spiking neurons. Such a level of explanation has not yet been available in previous work. We have considerably extended the Discussion section in our revision, detailing the bigger picture underlying this theory. Our addition of the non-recurrent model variant (see above) also adds considerable novelty, since it provides an account of phase precession and preplay in novel environments.
  
  The significance of the readout tempotron neuron could be expanded on. In particular, there is room for interpretation of the output signal of that neuron (e.g., what is the significance of other neurons downstream? What is the rationale for this output being theta-modulated?)
  
  We have added an additional Figure 8 to better illustrate the inner workings of the tempotron. We also extended the discussion to better explain the potential use of the tempotron output (see above). In short, we consider the tempotron to signal a unique behaviorally important context that is independent of remapping induced by changes of sensory cues, which is a new prediction of the model. Since the context signal results from DG loops, it requires a stable code to also exist in the DG. Evidence for such long-term stability in DG has been found in Hainmüller & Bartos (2018).
  
  Reviewer #2, in their public review, finds “this research topic to be both important and interesting” and appreciates “the clarity of the paper”, commending our “efforts to integrate previous theories into their model and conduct a systematic comparison”.
  
  We are very happy about these positive remarks and sincerely would like to thank the reviewer!
  
  Reviewer #1 made the following specific recommendations for changes:
  
  The abstract is somewhat difficult to parse. I have identified some words and/or sections that could be improved.
  
  ‘...inherently 1 dimensional’. This statement seems to be related to an a priori interpretation of the authors. On the other hand, if offline sequences are trivially 1 dimensional because they are sequences (i.e., they constitute a vector), then online sequences would be 1-dimensional as well. What is the key difference between offline and online? Is it the omnidirectional place fields in two dimensions? Perhaps more importantly, how relevant is this fact with respect to the main results of the manuscript, which concern extrinsic and intrinsic sequences?
  
  We indeed meant that the sequences are trivially 1-dimensional. The main challenge that we would like to address in this paper is how a 2-d topology of place cells (and direction dependent theta sequences) and a 1-d sequence topology of intrinsic theta correlations and during (p)replay can be reconciled. We hope this has become clearer in the rewritten abstract.
  
  The language in lines 36-38 is overly technical. I suggest modifying the language; it was less technical and more understandable in the body of the manuscript, which should also be reflected in the Abstract.
  
  We would like to apologize for making the abstract too technical. Also in response to Reviewer #2, we decided to rewrite the abstract entirely.
  
  The authors use a mixture of conductance based models and Izhikevich neurons, presumably for the spiking generating mechanism. The conductance component can be readily interpreted in terms of the underlying biophysics. The Izhikevich neuron model, however, is phenomenological. I suggest you address 1) the rationale for using Izhikevich model, 2) its biophysical interpretation, 3) and its combination with conductance-based currents.
  
  The reviewer is correct that spike generation is modelled using Izhikevich’s model whereas synaptic integration is included in a conductance-based manner. As suggested by the reviewer, we have added further explanation in the Methods part, explaining that the Izhikevich approach allows burst firing properties to be adjusted with only a few parameters by efficiently emulating the bifurcation structure of spike generation in the full biophysical model (1&2) and otherwise has no effect on the integration of conductance-based synaptic currents in a subthreshold regime (3).
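
  As a rough illustration of the combination described in this response, the sketch below couples Izhikevich's two-variable spike generator to a single conductance-based excitatory input. It is not the authors' implementation: the parameter values are Izhikevich's published regular-spiking defaults, and the function name `simulate_izhikevich`, the constant conductance `g_exc` (in arbitrary units) and the reversal potential `e_exc_mv` are assumptions chosen for the example.

```python
def simulate_izhikevich(g_exc, t_stop_ms=200.0, dt_ms=0.5,
                        a=0.02, b=0.2, c=-65.0, d=8.0, e_exc_mv=0.0):
    """Euler-integrate one Izhikevich neuron driven by a constant
    excitatory conductance and return its spike times in ms.

    All parameter values are illustrative assumptions, not the values
    used in the manuscript under review.
    """
    v = -70.0      # stable resting potential for b = 0.2 (mV)
    u = b * v      # recovery variable at rest
    spike_times = []
    n_steps = int(t_stop_ms / dt_ms)
    for step in range(n_steps):
        # Conductance-based synaptic current: scales with the driving force.
        i_syn = g_exc * (e_exc_mv - v)
        # Izhikevich spike-generation dynamics.
        dv = 0.04 * v * v + 5.0 * v + 140.0 - u + i_syn
        du = a * (b * v - u)
        v += dt_ms * dv
        u += dt_ms * du
        if v >= 30.0:
            # Spike: record the time, then apply the after-spike reset.
            spike_times.append((step + 1) * dt_ms)
            v = c
            u += d
    return spike_times
```

  In this sketch the synaptic term enters only through `i_syn`, so the subthreshold integration of the conductance is unchanged by the choice of spike generator, while `a`, `b`, `c` and `d` alone set the firing pattern.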
  
  Line 126: when you say preferred angle, do you mean preferred (heading) direction? If so, please maintain consistency throughout.
  
  We thank the reviewer for pointing out the inconsistency. We have added the word “heading” throughout the manuscript whenever appropriate. To further improve the consistency, we have clarified the meanings of “best” (or “worst”) direction and reserved the use of it solely for cases when trajectory direction is compared with the preferred heading direction, namely, “best” (“worst”) direction when trajectory is along (opposite) the preferred heading direction.
  
  Line 174: When discussing cross-correlation, sometimes you mean a cross-correlation function between two place fields and sometimes the histogram of all such correlations. Please clarify.
  
  We used histograms to empirically estimate the underlying cross-correlation function. For clarity, we have specified that it is a cross-correlation histogram in the revised manuscript whenever we refer to the empirical estimate.
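
  For readers unfamiliar with the term, a cross-correlation histogram of two spike trains can be sketched as follows. This is a generic textbook construction, not the authors' analysis code; the function name, the lag window `max_lag` and the bin width `bin_width` (both in seconds) are assumptions made for the example.

```python
import math


def cross_correlation_histogram(spikes_a, spikes_b, max_lag=0.1, bin_width=0.01):
    """Count spike-time differences (t_b - t_a) that fall within +/- max_lag.

    Returns (bin_centers, counts). Positive lags mean that cell B tends
    to fire after cell A.
    """
    n_bins = int(round(2 * max_lag / bin_width))
    counts = [0] * n_bins
    for t_a in spikes_a:
        for t_b in spikes_b:
            lag = t_b - t_a
            if -max_lag <= lag < max_lag:
                idx = int(math.floor((lag + max_lag) / bin_width))
                # Guard against floating-point rounding at the upper edge.
                counts[min(idx, n_bins - 1)] += 1
    centers = [-max_lag + (i + 0.5) * bin_width for i in range(n_bins)]
    return centers, counts
```

  The lag at which such a histogram peaks is the quantity whose sign is compared across running directions when cell pairs are labelled extrinsic or intrinsic.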
  
  Figure 3:
  
  Understanding the difference between extrinsic and intrinsic sequences is fundamental for the manuscript. I suggest that in the section that refers to Figure 3 (or Figure 3 itself), you kindly provide an example depicting how extrinsic and intrinsic sequences can
  
  1) coexist yet be distinctly identified
  
  2) depend on trajectory
  
  3) depend on DG input
  
  By coexistence, we meant the heterogeneous population of extrinsic and intrinsic cell pairs and, hence, the extrinsic and intrinsic theta correlations, as shown in Figure 3J. To improve the clarity, we added the following sentence in the section that refers to Figure 3: “In our simulation, extrinsically and intrinsically driven cell pairs are both present in the population (Figure 3J), indicating a coexistence of extrinsic and intrinsic sequences.” To illustrate how extrinsic and intrinsic sequences depend on both trajectory and DG recurrence, we have also added annotations in Figure 3F to mark the extrinsic and intrinsic part of the sequence.
  
  Moreover, the caption of Figure 3 refers to the directionality of the theta sequences. How does this again relate to the extrinsic/intrinsic distinction?
  
  We hope the highlighting in panel F of Figure 3 has resolved this problem.
  
  Figure 5:
  
  This is a crucial figure that should illustrate the differences between extrinsic and intrinsic sequences, as the figure caption suggests. Surprisingly, it is not at all clear where (i.e., in which panel) and how (i.e., methodologically) should one distinguish one type of sequence from another. I suggest that at least one such panel is dedicated to illustrating the difference and/or detection of these sequences in time and/or from phase precession plots. Moreover, there is significant visual crowding that makes the interpretation challenging (e.g., insert a space between G and E)
  
  We would like to apologize that the previous version of the manuscript seems to have given the impression that the difference between intrinsic and extrinsic sequences should be mainly illustrated in Figure 5. We hope that our revisions of Figures 1 and 3 have made this point sufficiently clear. The main purpose of Figure 5 was (and is) to illustrate how intrinsic sequences can lead to out-of-field firing. We have modified the figure caption (and text) accordingly. To address the visual crowding problem in Figure 5, we have inserted a space between panels and also removed repeated labels.
  
  Tempotron neuron and Figure 6:
  
  From the reviewer’s questions on Figure 6, we feel that our presentation caused considerable confusion about the motivation and interpretation of the tempotron simulations. We therefore rewrote parts of the associated text and Figure caption. We hope that the revised presentation clarifies the issues. We therefore only briefly respond to the reviewer’s points here, because we think they largely resulted from misunderstandings.
  
  Intuitively, and as the manuscript results suggest, late phases are associated with extrinsic mechanisms while early phases are associated with intrinsic ones. Why not construct a simpler classifier readout based on this fact? How does it compare to a tempotron?
  
  Contrary to the reviewer’s comment, extrinsic mechanisms are visible at early phases (late in the field) and intrinsic mechanisms at late phases (early in the field). In fact, what the tempotron does is learn to identify the intrinsic (late phase) part and to disregard the extrinsic (early phase) part.
  
  What is the significance of theta-modulated output of the tempotron (readout) neuron?
  
  The theta modulation of the tempotron output is a trivial result of the theta-modulation of the input, i.e., the detection of the intrinsic sequence pattern is done once every cycle.
  
  Suggestion for Figure 6 related to Tempotron readout: Focus on ’with DG loop condition’, as the challenge and most important point here is to identify extrinsic and intrinsic sequences. The No-loop condition could be left as a supplementary figure or side panel.
  
  The no-loop condition is the essential control showing that the tempotron only responds to the previously learned intrinsic pattern and cannot identify spatial location based on the extrinsic pattern.
  
  Further work/predictions.
  
  Lines 196-198. “Since intrinsic sequences can also propagate outside the trajectory (Figure 5) and activate place cells non-locally, our model predicts direction-dependent expansion of place fields.” If remote activation is ‘sufficiently’ remote, wouldn’t this predict two separate place fields instead of an expansion?
  
  The reviewer is completely correct. Out-of-field spiking can also affect remote locations if the intrinsic sequences link to remote place fields. This would lead to double fields; however, the intrinsic part would only be active at late theta phases. For simplicity, we have not added such a case to our paper, but we would like to thank the reviewer for this comment, since it leads to a nice prediction of the model, which can be experimentally tested and was therefore included in the Discussion.
  
  Lines 556-558. “In our model, firing rate is determined by both low-phase spiking from sensory input and high-phase spike arrivals of DG-CA3 loops, both producing opposing effects on the phase distribution.” Is it possible to make a differential prediction based on lesions here, e.g., along the lines of reduced range phase precession, for either high phases or for low phases?
  
  We thank the reviewer for this great suggestion. Lesion of DG in the model does indeed reduce the phase range and mean spike phase. This further corroborates the effect of DG-loop on theta compression and high-phase spiking. We have included a new panel D in Figure 4 and a corresponding mention in the result section.
  
  Line 570. “We speculate that the functional roles of intrinsic sequences may not be limited to spatial memories.” Is there any relationship to replay and/or sleep-dependent memory consolidation? Some speculation in the Discussion section would be welcome and appropriate.
  
  We have added some further speculative ideas to the last section of the Discussion. We propose that replay and preplay reflect the intrinsic sequences that express the current expectation of the animal. We have not yet thought well enough about their relation to memory consolidation to phrase this in the manuscript, but would suggest that they could serve to signal multimodal context information to the neocortex where it can evoke retrieval of unimodal memory traces.
  
  The description of the results, as stated in the public review, can be improved. A key component is the definition and identification of extrinsic and intrinsic sequences.
  
  Some comments:
  
  I think that the words ‘extrinsic’ and ‘intrinsic’ are problematic as both types of sequences/models rely on external (spatial) input, hence both are in some sense ‘extrinsic’. On the other hand, both are network mechanisms, thus in some sense ‘intrinsic’, where the asymmetry is either programmed directly onto the weights or due to synaptic depression. To add to the confusion, ‘intrinsic’ mechanisms very often refer to cellular mechanisms in neurophysiology. I kindly ask you to, ideally, reconsider the terminology, or at the very least, be very thorough and precise when describing the mechanisms. For example, sometimes extrinsic (intrinsic) ‘models’ are referred to, sometimes ‘sequences’, sometimes ‘factors’, sometimes ‘pairs’, etc.
  
  We understand and appreciate the reviewer’s argument, but would like to stick to the terminology, since it was already used in our prior publication. We have made considerable effort to improve the explanation and illustration of extrinsic vs. intrinsic pairs in the main text and in Figures 1 and 3 to highlight our definition, which is based on pair correlations: extrinsic pairs flip the correlation lag with reversal of running direction, intrinsic pairs do not. This is simply a functional definition and should not be confused with potential microscopic mechanisms. One of those (DG loops) is suggested in our paper.
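
  The functional definition given in this response can be written down in a few lines. The sketch below is a deliberate simplification for illustration only: it labels a cell pair from the sign of its correlation lag in the two running directions, whereas the manuscript works with graded extrinsicity and intrinsicity measures. The function name `classify_pair` and its arguments are hypothetical.

```python
def classify_pair(lag_forward, lag_reverse):
    """Label a cell pair from its correlation lag in both running directions.

    lag_forward, lag_reverse: peak lag of the pair's cross-correlation
    (any consistent time unit) for runs in one direction and in the
    opposite direction.
    """
    if lag_forward == 0 or lag_reverse == 0:
        # A zero lag carries no sign, so the pair cannot be labelled.
        return "undetermined"
    if (lag_forward > 0) != (lag_reverse > 0):
        # The lag flips sign when the running direction reverses.
        return "extrinsic"
    # The firing order is preserved regardless of running direction.
    return "intrinsic"
```

  For example, `classify_pair(0.02, -0.02)` labels a pair whose firing order reverses with the animal's direction as extrinsic, while `classify_pair(0.02, 0.015)` labels a pair with a fixed firing order as intrinsic.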
  
  As discussed in the public review, network mechanisms may require experience-dependent plasticity and hence cannot easily explain phase precession on the first pass. Please discuss why and/or how your model fits with this observation.
  
  We agree that the two models under consideration both require the recurrent network to be set up appropriately and there is no theory so far that would explain how. The reason we chose these two models is that they are well known in the community and relatively similar. We reasoned that a comparison between an intrinsic model and an extrinsic model would make most sense if the two are as similar as possible. Nevertheless, we extended the manuscript by a new set of simulations in which we do not use recurrent CA3 connections and obtain phase precession solely by feed-forward synaptic facilitation (new Figure 6 and supplementary Figure S1). The new simulations show that the basic phenomenology can also be obtained without using recurrent CA3 connections; however, as expected when removing one mechanism of phase precession, the phase range is somewhat reduced compared to the full model.
  
  Along a similar vein, phase precession in Figure 1E only has a range of pi/2, which is about half of the typical range of phase precession for single runs. This should be characterized as a weakness of the intrinsic model.
  
  The precession range in spiking models is highly sensitive to a large number of parameters such that it is hard to make such definite claims (see also above response). In the original Tsodyks et al. 1996 paper the phase range went up to 270 degrees with a slightly different implementation to ours in terms of current vs. conductance-based synapses, an exponential instead of a Gaussian recurrent weight function, and 1-d (original) vs 2-d (ours). We chose conductance-based synapses, and a Gaussian weight profile for better comparison with the Romani and Tsodyks (2015) model. In the original non-spiking implementation by Romani and Tsodyks (2015), the phase range was hardly 70 degrees. Our model implementation of the Romani and Tsodyks (2015) model fits the experimentally reported phase ranges of about 70 to 180 degrees in CA3 (Harris et al., 2001).
  
  Lines 282-284: ”...since phase precession properties change in relation to running directions, nor are they solely intrinsic since reversal of correlation is still observed in most of the sequences (Huxter et al., 2008; Yiu et al., 2022).”. To which extent is this a consequence of the phase precession model (extrinsic vs intrinsic) or the fact that place fields are sometimes directional?
  
  The reversal of sequences with reversed running direction is how we define extrinsic correlation. We hope our changes in relation to Figure 1 have clarified this point.
  
  Figure 2: Is it i) directional input or ii) short-term facilitation that gives rise to lower phase? (or perhaps both?) Please clarify.
  
  It’s both. This is now clarified in the revised version of the Results section related to Figure 2: higher depolarization always yields earlier phases in spiking models; however, pair correlations are not affected by either of the two mechanisms.
  
  Line 320. ”...onset of phase precession”. Do you mean in CA3/CA1/DG?
  
  Thank you for pointing this out. We have clarified that this statement refers to CA3.
  
  Line 323. ”....at a different location”. Please add rationale why it has to be at a different location and a reference to the appropriate equation.
  
  The sequence rationale as well as the equation number have been added.
  
  Line 384. ” ... predicting that loss of DG inputs is compensated for by the increase of release probability in the spared afferent synapses from the MEC.”. It wasn’t clear whether this was a ’homeostasis prediction’, or an implementation in the model. Please clarify.
  
  Since the model explained the experimental observations by implementing an increased probability of release, the model predicts that in animals with DG lesion the probability of release should be enhanced. We have modified the wording to avoid confusion.
  
  Line 428 ”...and near future locations) is obvious, the potential role of the lesser expressed intrinsic sequence contributions is not straightforward.”. Similar to my comments above regarding terminology, please clarify what are both contributions and why are intrinsic sequences ’lesser expressed’.
  
  We have rewritten this passage to avoid unclear wording.
  
  Line 474. ”...we showed that the trajectory-independent sequences”. Do you mean ’intrinsic sequences’?
  
  We thank the reviewer for the careful reading! We have changed the wording to ”intrinsic sequences” in the revision.
  
  Line 482. ”...field pairs being extrinsic”. Please clarify, as the usage of extrinsic now refers to field pairs.
  
  Thank you for pointing this out. We went through the whole manuscript and clarified the terms.
  
  Line 245 (heading). Consider rewriting as ’Dependence of theta sequences on heading directions’. Extrinsic and Intrinsic models have not yet been introduced.
  
  Since the main purpose of the first Results section is to explain the difference between extrinsic and intrinsic sequences we kept these terms in the heading but modified it to ”Dependence of theta sequences on heading directions: Extrinsic and intrinsic sequences”. Additionally, we have put more emphasis on introducing the terms ”extrinsic” and ”intrinsic” in this section.
  
  Figure 1.
  
  I suggest using the same font - C and D, and F and G are too close to each other, consider adding space. For example, the exponent, 10-2 makes reading cumbersome. Line 300. Phase tail means offset phase? Phase tail may be too informal. Line 325: DG loop. Do you mean CA3-DG projection?
  
  We thank the reviewer for the suggestions. In the revised manuscript, we have ensured that the same font is used in all of the figures. To improve the readability of Figure 1, we have added space between panels as suggested, removed repeated axis labels and downsized the text ”10-2”. Furthermore, we have rewritten the referenced line without using the word ”tail”, and have also clarified the meaning of DG loop as the short form of CA3-DG projection.
  
  Figure 4 caption: ”DG lesion reduces temporal correlations...”. It is more precise to say that the lesion reduces the slope of the fitted lag vs distance. And how is this related to sequence compression?
  
  In the paragraph referring to Figure 4, we have elaborated on the meaning of theta compression and its relation with the lag-distance plot. However, we argue that ”reduces the slope of the fitted curve” is not comprehensive enough to express our summarized conclusion in a caption title. We have modified the wording to be ”DG lesion reduces theta compression”.
  
  In addition, we have changed the slope unit to be radians per cm rather than radians per maximum pair distance, in conformity to unit standards.
  
  General comment about terminology with regards to tuning and connectivity: it is not formally correct to compare connectivity with trajectories (e.g., lines 388-395, caption of Figure 5A, etc). Perhaps compare tuning to particular directions/preference or receptive field?
  
  We have corrected the wording such that the direction of DG-loop projection is compared to the direction of trajectory.
  
  Line 470. ’...fixed recursive loop.” Sentence is not clear, do you mean recurrent loops?
  
  The reviewer is correct. We corrected the wording.
  
  Reviewer #2 had the following recommendations.
  
  M1. The abstract focuses on the differences between online and offline hippocampal replays. However, the replay topic is not touched upon in the rest of the manuscript. I found this very confusing when I first read the paper. I suggest the authors reconsider the best way to approach the opening or at least discuss if and how their model would incorporate replay phenomena.
  
  Also in response to Reviewer #1, we have rewritten the abstract to focus on the problem of how to generate 2-d topology from 1-d sequences. In addition, also in response to Reviewer #1, we added a paragraph in the discussion detailing a hypothesis on how we think replay and intrinsic sequences work together.
  
  m2. On lines 89-91, the authors provide the selection of neuronal parameters for excitatory pyramidal cells and inhibitory cells in the Izhikevich model. While the choice of model is reasonable, it would be helpful to clarify the source of these neuronal parameters, especially for readers who are not familiar with the model.
  
  Again, also in response to reviewer # 1, we have added more motivation for the Izhikevich model.
  
  M3. On lines 94-98, the model considers a 2D sheet of CA3 neurons. One of the most significant assumptions is that each 2x2 tile of place cells is considered a unit with four directional angles. What is the basis for this assumption? Is there any experimental result supporting this, or is it a completely artificial design for the model? This is important since the organization of CA3 cells also affects the network architecture discussed later and impacts the realism of the model.
  
  This comment is related to Reviewer #1’s concern on experience-dependent plasticity: How is this connectivity pattern established? We fully agree that this is an open problem for the Tsodyks et al.-type networks. The main reason for choosing them (as argued in our response to Reviewer #1) is to have two published models, representing one type of sequence each, that are similar enough for comparison. In addition, we added new simulations (new Figure 6 and Supplementary Figure S1), showing that the basic phenomenology can also be obtained in a model without recurrent connections (see also response to Reviewer #1).
  
  m4. Similarly, on lines 111 and 140, the model uses 500 ms for the timescales of short facilitation and short-term synaptic depression. The choices of these two timescales are vital for producing directionality in extrinsic and intrinsic sequences, yet their experimental sources are not clarified.
  
  In the Methods section of the revised manuscript, we have included the sources of previous experimental data and modelling work to support our choice of the time constants.
  
  M5. On line 126, the authors assume that the synaptic strengths between CA3 cells, Wij, are given by the distances between neurons and the similarity between their directional preferences. While this assumption seems reasonable in the sensory cortex, I am unsure if this is also the case in the hippocampus, and the authors should clarify the basis for this assumption.
  
  The distance dependence simply reflects the original Romani and Tsodyks 2015 model (see response to M3) and we share the concern of the reviewers. The increased connectivity for neurons with the same directional preference was necessary to recover the direction dependent phase precession properties (Figure 2) in the realm of the Romani and Tsodyks 2015 model. Please also see our new Figure 6 showing simulations without the recurrent matrix.
  
  More importantly, the existing connections within CA3 and DG cells completely determine the ”intrinsic” sequences. But wouldn’t this be fragile when place cells undergo global remapping, which can take place within only a few seconds? The author should comment on this in the discussion.
  
  We would like to thank the reviewer for bringing up this interesting point. In our thinking, the DG-CA3 connectivity is fixed (multiple 1-d trajectories, not necessarily requiring 2-d topology), i.e., the same intrinsic sequence should show up in multiple environments (and should not remap), although it may just not be active in some environments. This is a prediction of our model and we have added it to the Discussion.
  
  M6. I found the setup of DG place cells unreasonable. DG place cells are found to be granule cells rather than pyramidal cells. Moreover, the model does not consider recurrent connections between DG cells (These setups are closer to CA1 place cells).
  
  We agree with the reviewer, DG granule cells should rather be modelled as high-input resistance EIF neurons. However, the feedback loop via the dentate is not a direct one. It involves hilar mossy cells plus multiple hierarchies of feedback inhibition (this is probably what the reviewer means with recurrent connections between DG neurons, because granule cells are not recurrently connected in the non-pathological state). To our knowledge a biologically realistic model of the hilar-DG network does not exist and it would be far beyond the scope of this paper to develop one. We therefore see our DG feedback model rather as phenomenological. The discussion paragraph on the anatomy of the dentate gyrus touches on these points.
  
  Therefore, a significant concern is: Why should it be the DG feedback projection to CA3 responsible for the ”intrinsic” sequences instead of projections from other brain areas?
  
  The reviewer is generally correct: any brain structure which implements fixed sequences via a loop would do. The reason why we suggest the DG to be the best candidate is purely empirical, referring to papers with dentate lesions: Sasaki et al. 2018 and Ahmadi et al. 2022. We have added a similar argument to the discussion.
  
  m7. On line 166, the authors claim that there are no connections between inhibitory cells at all. While I understand that this is for simplification of the model, the lack of recurrent inhibition between interneurons may have limited the model’s ability to produce gamma-band dynamics (referring to PING and ING mechanisms), which are robust rhythms produced in CA3. I am very curious if the model can incorporate theta-gamma coupling by introducing connections between CA3 inhibitory cells.
  
  We have omitted the gamma oscillation for simplicity, because we do not have a hypothesis for a functional role in the context of distinguishing extrinsic from intrinsic sequences (Occam’s razor) and, as the reviewer correctly anticipates, they unavoidably show up when inhibitory interneurons connect to each other (e.g. Thurley et al. 2013). Of course, one could envision situations in which gamma for intrinsic sequences may have a different frequency than for extrinsic ones, by differentially manipulating the CA3 and DG basket cell networks, but, as long as there is no experimental data, it would be pure speculation and thus we have not included it in the model.
  
  m8. The authors should clarify the source of parameters in Table 1, especially the synaptic strengths. These values are vital for extrinsic and intrinsic theta sequences.
  
  The weight values have been chosen to allow for large theta phase precession range, coexistence of extrinsic and intrinsic sequences, and stability of the network activity. A similar statement has been added to the manuscript.
  
  M9. I have another concern regarding the measurements of ”extrinsicity” and ”intrinsicity” defined on lines 185-196. Are they the best measures? To distinguish the cause of spike correlations, the ”extrinsicity” and ”intrinsicity” of a pair of spikes should not be high at the same time. However, this is clearly not the case in the model, according to Figs 3 and 5. Moreover, in the data analysis carried out later, spike pairs are considered extrinsic or intrinsic merely by comparing the two measurements. I suggest the authors consider counterfactual methods in causal inference. For example, would a spike pair (cell1, cell2) still exist if we change the sensorimotor inputs or the DG-CA3 projections? If this is difficult to implement, the authors should at least discuss how different choices of measurements would impact the conclusions of the paper.
  
  The problem the reviewer has identified arises from the fundamental symmetry of theta phase quantification: if spikes of a pair of place fields have a phase difference of 180° one cannot say which cell leads and which cell follows, hence, the phase difference is both intrinsic (because the peak doesn’t flip) and extrinsic (because the peak flips and ends up at the same phase). The fact that in some cases extrinsicity as well as intrinsicity are high simply means that the field pair has a correlation peak lag close to 180°. Since in the experimental data set in (Yiu et al. 2022) only field pairs were available, we have not been able to use a different quantification then and decided to apply the same quantification in our model for comparison. Moreover, Figure 5F nicely shows that the measures are able to retrieve the ground-truth intrinsic DG-loop structure when considered on the population level.
  
  In our model, though, we can go beyond 2nd order statistics and derive sequence similarity measures including multiple cells, e.g., Chenani et al. 2019. However, since we already know the ground truth by construction, we decided not to use these methods. We added a paragraph in the discussion elaborating on beyond 2nd order sequence quantification.
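The 180° symmetry described in this response can be illustrated with a small sketch. It is not the extrinsicity or intrinsicity measure from the manuscript; the helper names (`wrap_phase`, `reversed_lag`, `lag_distance`) are hypothetical and only show why a lag near 180° looks the same whether or not the firing order flips.

```python
import math

def wrap_phase(phi):
    """Wrap an angle in radians to the interval (-pi, pi]."""
    wrapped = (phi + math.pi) % (2 * math.pi) - math.pi
    # The modulo maps odd multiples of pi to -pi; report them as +pi.
    return math.pi if math.isclose(wrapped, -math.pi) else wrapped

def reversed_lag(phi):
    """Lag expected if the firing order of the pair flips (sign reversal)."""
    return wrap_phase(-phi)

def lag_distance(phi_a, phi_b):
    """Absolute circular distance between two phase lags."""
    return abs(wrap_phase(phi_a - phi_b))

# Hypothetical correlation-peak lags, in degrees.
for degrees in (30, 90, 150, 180):
    lag = math.radians(degrees)
    gap = lag_distance(lag, reversed_lag(lag))
    print(f"lag {degrees:3d} deg -> distance to flipped lag: "
          f"{math.degrees(gap):.1f} deg")
```

For a lag of 180° the flipped and unflipped lags coincide, so a pair of this kind cannot be assigned to one class from the pair lag alone.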
  
  m10. The authors begin discussing ”intrinsic sequences” from line 316. However, it is not defined before that (and in the rest of the paper as well), causing confusion when reading the paper. The exact definitions of extrinsic and intrinsic sequences should come earlier.
  
  We hope that our changes to the beginning of the results section (Figure 1), also asked for by Reviewer # 1 could clarify the confusion.
  
  m11. On lines 345-347, the authors claim that ”the intrinsic sequences are played out backward as determined by the direction of fixed recurrence (Figure 3F),” which is vague. If such sequences are present in that panel, it should be more explicitly indicated graphically.
  
  Also in response to Reviewer #1, we have graphically highlighted the two types of sequences.
  
  M12. On lines 309, 356, 484, 495, 515, and possibly other instances, the authors repeatedly claim that the model simulations are in ”quantitative agreement” with their previous experimental paper. However, no experimental data or comparison with the simulations are presented in this paper. The authors should at least create one figure to demonstrate the degree of consistency between them, instead of merely asking the reader to refer back to their previous paper.
  
  We agree with the reviewer that the experimental data of our previous paper should be presented in the manuscript. However, creating more panels or figures is likely to clutter the already crowded visuals and obscure our main message. We therefore decided to give numerical comparisons to the previous findings in the main text whenever appropriate, specifically, in the sections referring to Figures 2, 3 and in the Discussion.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.05.527133v3
www.biorxiv.org www.biorxiv.org

New submission 31/05/2023, 10:37:40

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  We thank Dr. Carlos Isales and Dr. Jenny Tung as well as the peer Reviewers for their critiques and comments concerning this manuscript and respond here to their key concerns. Some of the Reviewers’ questions raised fascinating points about naked mole-rat biology and social habits, which we are also curious about, but which are too far afield from the central themes of the manuscript to warrant new work or revision. The Reviewers also raised some concerns about our methodological assessments and data interpretation which may warrant further discussion and explanation. We address those comments below. In no case do we feel that the concerns raised undermine our conclusions, so we have not undertaken new analyses nor revised the manuscript.
  
  Median survival and power.
  
  A recurring theme in these reviews is that our conclusion that naked mole-rats do not experience actuarial senescence is spurious, as it is “incomplete for younger animals and inadequate for older animals” due to Kaplan-Meier survival failing to reach median lifespan. We counter that premise, for median survival is an arbitrary threshold with no special bearing on when the Gompertzian hazard increase (onset of actuarial senescence) should become apparent. This point is well illustrated in Figure 5 of our original manuscript (Ruby et al., 2018). For demographic data from lab mice, humans, and horses (panels B, C, and D, respectively), the Gompertzian hazard increase is readily apparent by the time median survival (indicated by vertical dotted lines) is reached.
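For reference, the Gompertzian hazard mentioned above is conventionally written as follows. The symbols $h_0$ (baseline hazard) and $b$ (rate of exponential increase) are generic notation assumed here for illustration, not parameters reported in the manuscript.

```latex
h(t) = h_0 \, e^{b t}, \qquad
S(t) = \exp\!\left( -\frac{h_0}{b}\,\bigl(e^{b t} - 1\bigr) \right)
\qquad\text{(Gompertz)}

S(t) = e^{-h_0 t}, \qquad
t_{\mathrm{median}} = \frac{\ln 2}{h_0}
\qquad\text{(constant hazard, } b \to 0\text{)}
```

Under a constant hazard the median survival time depends only on $h_0$, so whether a population has reached its median says nothing by itself about whether $b$ differs from zero.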
  
  Another concern raised in the reviews is uncertainty about the true increase in power for these updated data since our 2018 report. The Reviewers correctly point out that the distribution of those data, and not just their scale, is relevant to power. The distribution of all data, old and new, is clearly illustrated as a function of age in Figure 2A. The ~doubling of available observation data is consistent across age groups, with one exception: at ~8,000-10,000 days of age. However, we do not agree that this is a shortcoming of the new data’s power for hazard calculation among older animals, given that the animals that formerly occupied that age bin have continued to age, without greater hazard, across the next five years. In other words, the lack of N increase in that particular age bin is balanced by the massive increase in available data at ~10,000-12,000 days of age - an advanced age bin that was previously almost empty.
  
  More surprising is the insinuation that for an approximately 40 gram rodent species, median survival on an order of 30+ years, with no sign of an increase in age-related mortality hazard, is considered a reasonable expectation. Both here and in our 2018 manuscript, we have conservatively used Tsex (180 days) as our benchmark for allometric scaling. Alternatively, one could scale this to the predicted lifespan based on average body weight for the species. According to the equation of de Magalhaes et al. (2007), the maximum lifespan of H. glaber is expected to be merely six years. Here, the Reviewers suggest that we are under-powered to make any statements about demographic aging because we have not reached median lifespan - despite the fact that our observations extend out to seven times the expected maximum lifespan. This is the precise nature of our argument that Gompertzian demographic aging is defied: that the onset of actuarial senescence is not apparent even at ages many-fold beyond when one would expect Gompertzian trends to have wiped out the entire population.
  
  Ironically, the Reviewers seem to have focused on the most striking manifestations of Gompertzian defiance - not reaching median lifespan after decades of population observation, or having few death events after tens of thousands of days of individual lifespan observation - as reasons to doubt the conclusions. Even if we quadrupled the number of sample points and included data for another 35 years, if we still did not detect the onset of actuarial senescence, the same critiques would still apply - and would be similarly illogical.
  
  The appropriateness of Kaplan-Meier, with left & right censorship
  
  Objections were raised about the appropriateness of Kaplan-Meier survival analysis for our data. Reviewer #3 asserts that “a Kaplan-Meier estimator can only take right-censored and uncensored records”, which is incorrect. This perhaps reflects a wider misunderstanding of Kaplan-Meier statistics that warrants further explanation.
  
  Reviewer #3 asserts that “left-censoring occurs when your event can be repeated and some events occur before the start of the study”. This is an oversimplified and far too-limited description of when left-censoring should be applied. We will further explain how left-censorship is applied in various analyses of our data, but for further reading on how this practice can produce unbiased estimates, we recommend the Reviewers consult (Cain et al 2011). We will discuss left and right truncation and censorship in terms of the diagram from Figure 2 of that manuscript, which illustrates a study in which the timing of event Y after event X in an individual’s life is being analyzed, given enrollment in a study at age A and exit from the study at age B. We also remind the Reviewers that methods used previously by us are in the papers (Ruby et al, 2018 & 2019) which were referenced and cited in our manuscript and should also be consulted for a full description.
  
  For our study, ages A and B from (Cain et al 2011) are akin to the edges of our hazard estimation windows: appropriate application of censorship and truncation allows us to accurately, unbiasedly estimate hazard within each age bin, allowing fair evaluation of changes (or lack thereof) as a function of age. For full Kaplan-Meier survival, age A is uniformly defined as Tsex (day 187), and B is not globally defined - rather, it is defined for each animal if observation ended due to exit from the collection (i.e., used in research studies (KFR), donated to another researcher, or continuing to be alive at the time of the study). Since none of the Reviewers seemed confused or concerned about our use of right-censorship in these cases, we will focus this discussion on left-censorship.
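As a minimal sketch of hazard estimation within age bins, one common approach divides the deaths observed in a bin by the animal-days of observation that fall inside it. This is an assumption about the general technique, not a reproduction of the manuscript's code; the function name `binned_hazard`, the record layout, and the example ages are all hypothetical.

```python
def binned_hazard(records, bin_edges):
    """Estimate hazard (deaths per animal-day) in each age bin.

    records: list of (entry_age, exit_age, died) tuples, ages in days.
      entry_age is when observation began, exit_age when it ended,
      died is True for a death and False for right-censoring.
    bin_edges: increasing list of ages; bin i covers (edge[i], edge[i+1]].
    """
    results = []
    for start, end in zip(bin_edges[:-1], bin_edges[1:]):
        deaths = 0
        exposure = 0.0
        for entry_age, exit_age, died in records:
            # Portion of this animal's observation that lies in the bin.
            observed_from = max(entry_age, start)
            observed_to = min(exit_age, end)
            if observed_to <= observed_from:
                continue  # not under observation during this bin
            exposure += observed_to - observed_from
            if died and start < exit_age <= end:
                deaths += 1
        hazard = deaths / exposure if exposure > 0 else float("nan")
        results.append((start, end, deaths, exposure, hazard))
    return results

# Hypothetical records: two animals followed from day 180, one entering
# observation late at day 457, one right-censored at day 4000.
example_records = [
    (180, 1000, True),
    (180, 3000, False),
    (457, 2500, True),
    (180, 4000, False),
]
for row in binned_hazard(example_records, [180, 2000, 4000]):
    print(row)
```

Each animal contributes exposure only to the bins in which it was actually observed, which is what allows hazards in different age bins to be compared on equal terms.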
  
  In our original analysis (Ruby et al., 2018), we did not apply left-censorship because Dr. Buffenstein had maintained the animals since they were born, therefore no events occurred (i.e. observations of an animal being alive or dead on a day) prior to the beginning of the study. In the parlance of (Cain et al, 2011): we knew when the initiating event X had occurred (Tsex), and the animals had been continuously observed thereafter, up until either their death or right-censorship point. Animals were right-censored if they were removed from the study, e.g. due to sacrifice for research or donation to other researchers. Doing so reduced the population size moving forward (to the right) without modifying the survival value, allowing the impact of individual death events to be appropriately amplified (i.e. Kaplan-Meier analysis).
  
  For left-censored data, the same operation occurs but in reverse order: for example, if an animal is left-censored at 457 days of age, then the population size is increased by one on that day, without modifying the survival value. In Kaplan-Meier survival estimation, for each observation period, the current survival value is multiplied by the number of animals surviving that time interval divided by the number of animals in the population in that interval. Since the animal in question was not observed prior to 457 days of age, it would not be counted in the population size prior to that day: had it died, it would not have been in the study population at all. However, once it has entered the population, each day-of-age on which it is observed to be alive is included in the population size tally, since each day it could also perish and thereby impact the survival curve. Should any of the Reviewers who received animals from Dr. Buffenstein wish to extend this data set in the future using those animals, left-censoring them at the age at which they were received (or after some acclimation period) would be the proper method to do so.
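The risk-set bookkeeping described above can be sketched as a product-limit estimator in which each animal joins the at-risk population only at its entry age. This is a minimal illustration under stated assumptions, not the analysis code used for the manuscript; the function name and the example ages are hypothetical. In much of the survival-analysis literature this same adjustment is described as delayed entry.

```python
def kaplan_meier_delayed_entry(records):
    """Product-limit survival estimate with late entry into the risk set.

    records: list of (entry_age, exit_age, died) tuples, ages in days,
      with entry_age < exit_age. died is False for right-censored animals.
    Returns a list of (age, n_at_risk, n_deaths, survival), one row per
    distinct age at which a death was observed.
    """
    death_ages = sorted({exit_age for _, exit_age, died in records if died})
    survival = 1.0
    curve = []
    for age in death_ages:
        # At risk: already entered before this age and still observed at it.
        at_risk = sum(1 for entry, exit_, _ in records if entry < age <= exit_)
        deaths = sum(1 for _, exit_, died in records if died and exit_ == age)
        survival *= (at_risk - deaths) / at_risk
        curve.append((age, at_risk, deaths, survival))
    return curve

# Hypothetical records: four animals observed from day 187, two entering
# observation at day 457.
example_records = [
    (187, 300, True),
    (187, 500, True),
    (187, 900, False),
    (457, 800, True),
    (187, 1200, True),
    (457, 1500, False),
]
for row in kaplan_meier_delayed_entry(example_records):
    print(row)
```

In this example the two late-entering animals are absent from the risk set for the death at day 300 and present for the death at day 500, so the population size grows at their entry age without any change to the survival value at that moment.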
  
  As stated above: in our original analysis (Ruby et al., 2018), we did not generally apply left-censorship because Dr. Buffenstein had maintained the animals since they were born (although beginning the analysis at Tsex qualifies as population-wide left-censorship). In their commentary, Dammann et al. (2019) pointed out that loss of records could modify the hazard distribution through bias towards longer-term survivors: in other words, counting long-lived animals as part of the population in early life is unfair because the death events from the truly larger population at that time had been lost (in that case: perhaps back in the 1980s). In the parlance of (Cain et al, 2011): loss of records would have been the equivalent of left truncation, which if unchecked could produce bias. For our reply (Ruby et al., 2019), we addressed this problem by applying a drastic left-censoring of all animal data on a date where we could be highly confident that all records had been securely maintained, thus removing any potential bias introduced by old, lost records - as illustrated by (Cain et al, 2011). That re-analysis did not change our results, negating loss of decades-old records as a confounder of our conclusions. In this new manuscript, we used this technique again, only analyzing data collected since those data reported in our prior publications. Again, our original conclusions were confirmed: quoting Reviewer #3, “the main figures are virtually the same, with some minor changes due to the extended dataset”.
  
  Independence between studies
  
  In this new manuscript, with substantially more data, we applied left-censorship again in order to conduct an analysis of just the newly-provided data. Importantly, no datum - i.e. no day of observation of an animal being either alive or dead - overlapped between that analysis and those from our original reports (Ruby et al., 2018 & 2019), and data were collected across non-overlapping periods of time. Reviewer #2 questions the independence of this analysis from the original, correctly citing that it is still our own collection whose demographic data we are surveying. We reply that it is as independent of a dataset as we could possibly provide: greater independence would require the publication of substantial demographic data from other members of the H. glaber research community, which we would be happy to see. We also want to remind the Reviewers that Sherman and Jarvis (2002) also reported negligible demographic senescence for animals >15 years of age under their care: a fully-independent observation that concurs with our conclusions, albeit with substantially fewer animals and less statistical power.
  
  “Glossing over” reports of aging phenotypes
  
  Reviewer #1 suggests that our review of our own prior publications in this manuscript has “glossed over data that don’t support our main interpretations”, specifically mentioning the papers by Edrey et al., (2011) and Andziak et al., (2006). However, this is not an accurate reflection of the content of those published papers. The reviewer highlights data pertaining to case studies of two animals, aged 29 and 30 years, exhibiting pathologies that are commonly associated with aging in the Edrey et al., (2011) paper that was entitled “Successful aging and sustained good health in the naked mole-rat……”. But, as per the title of that paper, those were atypical cases. Indeed, we reported that the majority of animals maintained good health and activity well into their third decade. The Andziak et al., (2006) paper revealed that young (2y), healthy naked mole-rats have higher levels of oxidative damage to lipids, proteins and DNA than observed in young mice; but the follow-up paper (Andziak and Buffenstein, 2006) reported that, unlike in mice, the levels of such damage in naked mole-rats do not further increase with advancing age, supporting the premise of sustained tissue homeostasis. Routine pathological assessments undertaken by our group and from zoological specimens in the 12 years since Edrey et al., (2011) have revealed many more instances of “aging phenotype pathologies” - but again, with similar frequency across all age groups (Delaney et al., 2021). We have not “glossed over data that don’t support our main interpretations”: in fact, the data brought up by the Reviewer support our conclusions. Like natural death, “age-associated disease phenotypes” occur stochastically across all age groups of H. glaber, rather than being exponentially enriched in elderly animals as in other species.
  
  Breeding status
  
  Reviewer #1 also states that “this study fails to fully represent the literature with respect to the divergence in aging rates between breeders and non-breeders”. This section of our discussion (lines 326-367) addresses the survival advantage in many cooperative breeding mammals in the wild and in captivity including other mole-rats and meerkats (Sharp and Clutton-Brock, 2010; Dammann et al., 2011, Cram et al., 2018). The lower survival of subordinates in captivity may be due to chronic stress associated with bullying by the dominant animals and their inability to disperse and avoid such unpleasant activities; often being injured and dying after losing fights for a more dominant position in the social hierarchy. Braude et al., (2021) similarly report that compared to subordinates who undertake the more precarious activities of burrow extension, foraging or dispersal, the breeding females remain in their study site for far longer periods.
  
  In captivity, subordinates have two paths to becoming a breeder: If the breeding female dies, some subordinate females within the colony will fight to the death to establish breeding status and inherit the dominant role in the colony. This could imply that they are “higher-quality” individuals as suggested by Reviewer #1 with molecular and physiological mechanisms in place to outlive their “poorer-quality” conspecifics. However, the majority of breeding females in our colony arise through random pairing of a female and a male that has been isolated for a few days from their colony. As such there is no selection for “higher-quality” individuals with concomitant inheritance of better somatic maintenance mechanisms. Rather, breeding status appears to be accompanied by a phenoplastic switch, as suggested by the lower levels of DNA methylation in tissues of breeding females (Horvath et al., 2021) and altered growth patterns when a female changes her status to that of a breeder (O’Riain et al., 2000). This is possibly linked to moving up the dominance hierarchy with concomitant changes in stress, somatotropic, and reproductive hormones as well as augmented tissue repair pathways for the maintenance of homeostasis.
  
  We have not undertaken in depth studies on behavior and social habits and the effect of age, but agree these would be of interest in future studies.
  
  Analysis initiation at 6 months
  
  Mortality rates are highest in the first three months of life, in keeping with increased mortality during the developmental period. While it is true that in captivity most animals continue to grow for the first eighteen months to two years of life and some individuals may continue to gain weight well into their third decade, we and others have shown that animals can successfully breed at 6 months of age, if given the opportunity to do so. Other demographic studies similarly use the age at which animals can reproduce as the starting point for their analyses. Nevertheless, even if we were to use 2 years as the starting point, the same trends would be evident, since there was no increase in mortality risk even at ages beyond 30 years.
  
  Colony size effects
  
  It is intriguing that smaller colonies had higher mortality risk than larger colonies. In many cases smaller colonies represent younger colonies with possibly less well-established breeders and a higher degree of social instability. In other cases, the breeding female may not be very successful in raising her young, and possibly is not producing “high-quality” offspring. We agree with the Reviewer that behavioral assessments are needed to evaluate whether there is more fighting and competition for dominance, or whether other social dynamics or “poorer-quality” offspring are at play. Nevertheless, these findings are intriguing, and we have speculated as to why this is the case. Further work is needed to definitively tease out the underlying cause.
  
  References cited here
  
  Andziak et al., (2006) doi: 10.1111/j.1474-9726.2006.00237
  
  Andziak and Buffenstein (2006) doi: 10.1111/j.1474-9726.2006.00246
  
  Braude et al., (2021) doi: 10.1111/brv.12660
  
  Cain et al., (2011) doi: 10.1093/aje/kwq481
  
  Cram et al., (2018) doi: 10.1016/j.cub.2018.07.021
  
  Dammann et al., (2011) doi: 10.1371/journal.pone.0018757
  
  Dammann et al., (2019) doi: 10.7554/eLife.45415
  
  Delaney et al., (2021) doi: 10.1007/978-3-030-65943-1_15
  
  De Magalhaes et al., (2007) doi: 10.1093/gerona/62.6.583
  
  Edrey et al., (2011) doi: 10.1093/ilar.52.1.41
  
  Horvath et al., (2022) doi: 10.1038/s43587-021-00152-1
  
  O’Riain et al., (2000) doi: 10.1073/pnas.97.24.13194
  
  Ruby et al., (2018) doi: 10.7554/eLife.31157
  
  Ruby et al., (2019) doi: 10.7554/eLife.47047
  
  Sharp and Clutton-Brock (2010) doi: 10.1111/j.1365-2656.2009.01616.
  
  Sherman and Jarvis (2002) doi: 10.1017/S0952836902001437
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.27.534424v1
www.biorxiv.org www.biorxiv.org

An antagonism between Spinophilin and Syd-1 operates upstream of memory-promoting presynaptic long-term plasticity

1
1. Public_Reviews 10 Aug 2023
 
 in eLife
 
 Author Response
 
 Reviewer #2 (Public Review):
 
 The manuscript by Ramesh et al builds upon prior studies from the Sigrist group to examine synergistic interactions between the Spinophilin (Spn) and Syd-1 synaptic proteins and their role in regulating presynaptic homeostatic plasticity at Drosophila larval NMJs and adult olfactory memory in the Mushroom Body (MB). The authors show synergistic interactions between the two proteins in these processes, where late PHP and long-term memory are abolished in Spn mutants, but restored upon reduction of Syd-1 function in the mutants. The authors go on to show that Spn appears to act in PHP by regulating a late stage in AZ remodeling and longer-term increases in the readily releasable SV pool by controlling actin polymerization/dynamics through the Mical protein. Although key aspects of the overall bigger picture have been published before (Mical’s role in PHP, antagonism between Spn and Syd-1 in AZ development, AZ remodeling in MB-dependent memory), the current paper ties together many of these observations into a bigger picture of how PHP plasticity at the NMJ is established and provides support for a role for PHP-required proteins in promoting long-term memory in the adult MB through effects on AZ structure and AZ protein content/amount. The study also provides new links to the role of Spn in regulating local synaptic actin dynamics and how this alters the readily releasable pool and SV release. Some points of note are provided below.
 
 1) I’m a bit confused about the time course experiments the authors describe that seem to be contradictory in Figures 1 and 2. The authors indicate control animals transiently increase BRP AZ levels during PHP at 10 mins, but by 30 minutes this increase is gone, even though PHP remains. As such, the data in these early figures suggests increases in BRP AZ levels may support an early aspect of the PHP effect (though I note this appears controversial, as other data indicate blocking the rapid AZ remodeling by several manipulations such as Arl8 transport disruption, permits early PHP, but disrupts late PHP). In contrast, the authors show that Spn mutants do not display AZ BRP increase at 10 mins, and still show early PHP, but lack late PHP. I assume the early PHP does not require AZ remodeling or an increase in the RRP at this early time point?
 
 We thank the reviewer for this insightful question, which to a degree is also reflected in reviewer 1’s question concerning the variability of Spn mutants when tested for PHP at 10 min PhTx treatment, and thus the temporal and likely functional entanglement of induction and maintenance mechanisms.
 
 Let us start by once again describing our findings: the BRP increase is clear at 10 min PhTx treatment but is no longer measurable at 30 min PhTx treatment. Genetic elimination of BRP does not restrict PHP at 10 min PhTx (Bohme et al. 2019). However, BRP mutants are able to maintain PHP neither when PhTx treatment is extended to 30 minutes, as described in Turrel et al. (2022), nor in a chronic PHP paradigm of BRP, GluRIIA double mutants (Bohme et al. 2019). We suggest that the transient increase of BRP, also previously described specifically in the MB γ-neurons (Zhang et al. 2018), triggers other, longer lasting AZ changes. Indeed, we found that the increase of the critical release factor Unc13A is still present at 30 min PhTx treatment and is dependent on the “transient” BRP increase (Fig. S3B) (Turrel et al. 2022). Turrel et al. also uncovered a more transient upregulation of BRP when compared to Unc13A in the MB. Here, specifically upon paired olfactory conditioning, 1 h after training, animals displayed BRP and Unc13A level increases. At 3 h post training, however, BRP levels had already plateaued, whereas Unc13A levels had increased further (Figure 1B; Turrel et al. 2022).
 
 We have now added to the discussion section: “We suggest that the transient increase of BRP, also previously described specifically in the MB γ-neurons (Zhang et al. 2018), triggers other, longer lasting AZ changes. Indeed, we found that the increase of the critical release factor Unc13A is still present at 30 min PhTx treatment and is dependent on the “transient” BRP increase (Fig. S3B) (Turrel et al. 2022). Turrel et al also uncovered a more transient upregulation of BRP when compared to Unc13A in the MB. Here, specifically upon paired olfactory conditioning, 1 h after training, animals displayed BRP and Unc13A level increases. At 3 h post training, however, BRP levels had already plateaued, whereas Unc13A levels had increased further (Fig. 1B, Turrel et al).” (Line 363)
 
 An RRP increase has been shown at 10 min of PhTx treatment (Weyhersmuller et al. 2011) and remains high after 30 minutes of PhTx treatment (this study).
 
 2) In relation to point 1 above, the time course seems different in MB neurons, where the AZ remodeling (noted by increases in AZ BRP) seems to take 2-3 hours. Do the authors have any ideas into why the time course of PHP AZ remodeling at larval NMJs can occur in 10 minutes, but MB neuron remodeling seems to take hours?
 
 We thank the reviewer for this question. We specifically probed the time intervals of 10 and 30 min at the NMJ due to established protocols and technical reasons, and 1 h and 3 h in the brain due to our interest in MTM. Zhang et al. (2018) previously showed that BRP levels in the γ-lobe were indeed significantly increased already 20 min after conditioning. At the moment, we can only suspect that the following differences might be relevant here: the differences between the peripheral and central nervous system in terms of glutamatergic motoneuron presynapses (NMJ) versus cholinergic KC presynapses might change the temporal dynamics of AZ remodeling. Furthermore, the plasticity induction protocol, using PhTx, is potentially a somewhat more “heavy-handed” approach compared to the more subtle conditioning involving the activation of dopaminergic neurons. The more complex circuitry of the central brain might also be involved in maintaining this increase in BRP levels over longer timescales than at the NMJ, which may serve some yet unknown physiological purpose in maintaining memories.
 
 We use the NMJ PhTx assay to identify proteins involved in AZ remodeling that could also be involved in memory formation in adult flies. As of now, we have no experimental evidence of whether the AZ remodeling observed in the MB actually leads to synaptic depression or instead is a reaction to the initial short-term synaptic depression. This study and Turrel et al. (2022) provide evidence for an overlap of the executory machinery involved in both mechanisms, NMJ PHP plasticity and MTM formation, as BRP, Spn, Arl8, IMAC and Aplip1 are specifically involved both in mid-term NMJ PHP (at 30 min after PhTx treatment) and in MTM.
 
 3) Could the lack of rapid BRP accumulation during early PHP in Spn mutants be secondary to the larger # of AZs in those mutants and a known rate-limiting amount of BRP available that might not be enough to go to the extra AZs?
 
 This per se might be a relevant concern. Notably, however, acute application of Latrunculin-B in Spn mutants allowed for an increase in BRP (Figure 5g-h). Thus, a limitation in the total pool of available BRP should not be responsible for Spn mutants’ inability to accumulate BRP under PhTx treatment.
 
 4) There isn't any validation of the Spn co-IP results shown in Figure 3 through other assays, and a lot of proteins are being pulled down. I can't see some of these being real (mitochondrial translation proteins? - how could Spn gain access to the inside of the mitochondria since it's a cytosolic protein?). As such, I don't know how to value that huge group of pull-down interactions without further validation, making it difficult to sort out how relevant these really are. The genetic validation of similar phenotypes in the Mical mutant, together with rescues, supports that interaction. Not sure about the rest of that list.
 
 We appreciate the opportunity to discuss our primary data and how we used them to generate testable hypotheses for our study. Firstly, the mitochondrial translation proteins which were identified in our Spn IPs are all nuclear encoded, meaning they are transcribed in the nucleus and translated in the cytoplasm. Interestingly, recent work indeed suggests that mitochondrial biogenesis in the synapse is supported by local translation (e.g. see (Kuzniewska et al. 2020)). As Spn IPs are also highly significantly enriched for cytosolic translation machinery, it is an appealing idea that Spn might be involved in coupling local translation, mitochondria and memory stabilization. As this clearly goes beyond the scope of this paper, we did not further discuss this point, and are prepared to remove these data if considered misleading.
 
 Concerning unspecific proteins being pulled down in our IPs, we would like to emphasize that these IPs are the result of an established protocol, which entails laborious synaptosome preparations and which our lab worked out previously (Depner et al. 2014). For each condition, 4 biological replicates were performed, and mitochondrial ribosomal proteins were enriched with p < 10⁻³⁰ significance and were never observed in our extensive systematic work on active zone biochemistry for any other active zone protein.
 
 In this study, we used the Spn IPs to identify putative interaction partners, with the intention to validate the physiological relevance of any positive hits through experiments, like we did in the case of Mical. We were also able to identify previously known interaction partners like Syd-1 and Nrx-1 (Muhammad et al. 2015). Obviously, we did not independently validate these findings for the large number of identified proteins, e.g. by using in vitro purified proteins (we do not consider Western probing of IPs to be independent proof of any complementary value to mass-spectrometry based quantification).
 
 We have now added this sentence to our manuscript:
 
 “As a validation of the list of proteins that were returned as interaction partners of Spn in this work, we were able to reconfirm previously known interactions (Muhammad et al. 2015), e.g., Syd-1 (Figure 3b) and Nrx-1 (not shown).” (Line 148)
 
 5) Are the authors worried about the fact that the Actin-GFP line they use to look at synaptic actin dynamics is driven by a GAL4, and the 2nd top hit of their Spn IP pull downs are translation regulators? Could the changes in actin-GFP they see between control and Spn mutants have anything to do with a different translation of the exogenous UAS-actin-GFP? Would have been helpful to do an endogenous stain for actin levels with an anti-actin antibody so no transcription/translation issues of a transgene would be at play. This would be easy to do for the quantification of total actin levels at the synapse.
 
 This is per se a fully justified concern, which is hard to fully exclude. Indeed, when preparing this manuscript, we attempted to visualize and quantify the endogenous presynaptic actin through immunostaining. However, these attempts were unsuccessful, as the very bright muscle actin staining obscures the relatively low levels of actin present close to the presynaptic AZs, even when using super-resolution light microscopy. Still, we would like to emphasize that Spn and Syd-1 antagonized each other’s function concerning the apparent F-actin level (using Gal4 expression of actin-GFP). Given the known role of Spn as a compartment-specific F-actin breaker (Chia, Patel, and Shen 2012; Ryan et al. 2005; Nakanishi et al. 1997), we remain rather confident about our finding and its interpretation.
 
 Concerning the FRAP analyses, we are fully confident of our findings, as the intensity of actin-GFP is internally normalized within each NMJ. Therefore, the differences in FRAP experiments should be independent of the starting amounts of actin in control and mutant animals. As we can show that the Spn/Syd-1 antagonism acts on actin dynamics as well (Figure 4j), we are confident of the physiological relevance of our observations.
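
 As an illustration only, and not the analysis code used in this study, the internal normalization argument can be sketched in a few lines of Python. The sketch assumes a common full-scale FRAP normalization (pre-bleach mean set to 1, first post-bleach frame set to 0); the function names `normalize_frap` and `mobile_fraction`, the plateau window, and the example intensities are all hypothetical.

```python
def normalize_frap(trace, n_prebleach):
    """Rescale one FRAP trace so the pre-bleach mean is 1 and the first
    post-bleach frame is 0 (assumed full-scale normalization)."""
    pre = sum(trace[:n_prebleach]) / n_prebleach   # mean pre-bleach intensity
    bleach = trace[n_prebleach]                    # first frame after bleaching
    span = pre - bleach
    if span <= 0:
        raise ValueError("bleach frame must be dimmer than the pre-bleach mean")
    return [(value - bleach) / span for value in trace]


def mobile_fraction(trace, n_prebleach, n_plateau=3):
    """Estimate the mobile fraction as the mean of the last n_plateau
    normalized frames (a simple plateau estimate, not a curve fit)."""
    tail = normalize_frap(trace, n_prebleach)[-n_plateau:]
    return sum(tail) / len(tail)


# Hypothetical traces: the second NMJ expresses three times more actin-GFP
# but recovers with identical kinetics.
control = [100, 100, 100, 20, 50, 60, 60, 60]
bright = [3 * value for value in control]

print(mobile_fraction(control, n_prebleach=3))
print(mobile_fraction(bright, n_prebleach=3))
```

 Because each trace is rescaled by its own pre-bleach and post-bleach values, a uniform change in actin-GFP expression cancels out, which is the sense in which the FRAP readout should not depend on starting actin levels under these assumptions.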
 
 6) Are Mical levels normalized in the Spn, Syd1 double mutants, given PHP is recovered?
 
 We thank the reviewer for the comment and agree that Mical levels should be expected to normalize upon Syd-1 heterozygosity in Spn mutants. We have now immunostained for Mical in wildtype, Spn mutants and Spn mutants with Syd-1 heterozygosity to address this question. We found that Mical levels in Spn mutants were indeed normalized upon Syd-1 heterozygosity (Figure 5 - Figure supplement 1 c-d).
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.24.525468v1
www.biorxiv.org www.biorxiv.org

New submission 06/08/2023, 15:06:13

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author response
  
  Reviewer #1 (Public Review):
  
  The usual strategy to combat antimicrobial drug resistance is to administer a combination of two drugs with distinct mechanisms. An alternative, however, would be to use two drugs that attack the same target, if resistance to one is incompatible with resistance to the other. The authors previously studied parasites resistant to the dihydroorotate dehydrogenase (DHODH) inhibitor DSM265 through an E182D mutation and found that resistance to another inhibitor, IDI-6273, resulted in a reversion to wild-type. Here, they screened various other inhibitors and found that TCMDC-125334 is more active on DSM265-resistant parasites than the wild-type. In this case, however, it was possible for the parasites to become resistant to both inhibitors, either by increasing the copy number of DSM265-resistant DHODH genes (with a C276Y mutation) or by the emergence of a different mutation. The selection of wild-type parasites with both compounds resulted in resistance but this took considerably longer than for either compound alone. (The actual frequency of double resistance emergence was not measured.)
  
  Overall the results suggest that for DHODH, when pre-existing resistant parasites are selected with another inhibitor, the results will depend on both the initial mutation and the new inhibitor. The data are solid and convincing and suggest that DHODH has considerable scope for resistance development. The observations do have relevance for other inhibitors and/or enzyme drug targets. However from the data so far, the sweeping statements that the authors make concerning double resistance, in general, are not supported.
  
  The formatting of the Figures requires some improvement and in some cases, more details of the statistical analyses are needed.
  
  We thank Reviewer 1 for their kind and helpful comments. We have answered their specific concerns below. In particular, we have improved the formatting of the figures based on their recommendations. We have also edited the discussion based on reviewer 1’s comments.
  
  Reviewer #2 (Public Review):
  
  This article focuses on drug resistance acquired by Plasmodium falciparum malaria parasites that have been pressured with different inhibitors of the essential enzyme DHODH (dihydroorotate dehydrogenase). The study focuses on collateral sensitivity between DSM265, which has been evaluated in a human clinical trial and found to select for resistance via the point mutation C276Y (C276F and G181S were also implicated; PMID 29909069), and the GSK compound TCMDC-125334, against which a panel of DHODH mutant parasites (including C276Y) were found to have increased sensitivity. The authors herein explore this case of "collateral sensitivity" by examining whether these two inhibitors, when used simultaneously, might preclude the selection of resistant parasites. The answer, in this case, is no; collateral sensitivity did not prevent parasites from acquiring a novel mutation (V532A) that mediated resistance to both. Culture competition assays provide evidence that this mutant retains normal fitness. The authors conclude that for this target the idea of combining these inhibitors is not a viable therapeutic strategy. The authors also illustrate how TCMDC-125334 can select for resistance via a separate mutation (I263S) or amplification of a chromosomal segment containing dhodh. They also present modeling data to examine binding poses and how mutations could impact drug binding, which is allosteric to the enzyme's substrates (orotate and FMN). The data are thorough and provide convincing evidence that in this case collateral sensitization by distinct chemotypes does not translate into a viable strategy to inhibit DHODH in a way that can preclude mutations that confer cross-resistance.
  
  We thank the reviewer for their kind comments and helpful recommendations.
  
  Reviewer #3 (Public Review):
  
  'Collateral sensitivity' occurs when drug-resistance mutations render a drug target more sensitive to inhibition by another drug, which has been previously described in some detail for malaria parasite dihydroorotate dehydrogenase (DHODH - see refs 36, 46, and 47, for example). Although it has been suggested that combinations of such drugs could potentially suppress the emergence of resistance, cross-resistance-associated mutation (or copy-number variation, CNV) could render such combination strategies ineffective. In the current study, the authors assess a new pairing of DHODH-targeting drugs. Cross-resistant parasites with DHODH mutation or CNV arise following either sequential or combined drug selection, suggesting that the drug combination described would likely fail to effectively suppress the emergence of resistance.
  
  The strength of the study is that it describes, for a particular drug combination, different mutations associated either with collateral sensitivity or with cross-resistance, and the authors conclude that "combination treatment with DSM265 and TCMDC-125334 failed to suppress resistance". They go on to say that this "brings into question the usefulness of pursuing further DHODH inhibitors." More specific interpretations and implications of the study are as follows:
  
  a. Other combinations may also fail but there may be combinations that can effectively suppress resistance. A more exhaustive analysis of mutational space will likely be required to determine which combinations if any, would be predicted to succeed in a clinical setting.
  
  b. It was previously reported that "a combination of [DHODH] wild-type and mutant-type selective inhibitors led to resistance far less often than either drug alone. ... Comparative growth assays demonstrated that two mutant parasites grew less robustly than their wild-type parent, and the purified protein of those mutants showed a decrease in catalytic efficiency, thereby suggesting a reason for the diminished growth rate" (Ref 46). Also, "selection with a combination of Genz-669178, a wild-type PfDHODH inhibitor, and IDI-6273, a mutant-selective PfDHODH inhibitor, did not yield resistant parasites" (Ref 36). It is possible that these previously tested combinations would also yield cross-resistant mutants if selected further.
  
  c. Although increased DHODH copy number "confers only moderately reduced susceptibility" to the drug used for selection and although these clones were not assessed here for cross-resistance, it seems likely that CNV may represent a general mechanism that could undermine other collateral resistance strategies.
  
  We thank the reviewer for their kind and helpful comments.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.04.07.486819v1
www.biorxiv.org www.biorxiv.org

New submission 06/08/2023, 15:02:28

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This study applies state-of-the-art single-cell transcriptome analysis to investigate the nature of drug tolerance, a phenomenon distinct from drug resistance, and a problem of considerable importance in the treatment of C. albicans infections. The authors first show that their transcriptomics platform can reveal sub-populations of untreated cells that display distinct transcription profiles related to metabolic and stress responses that are coupled with cell cycle regulation. They note the consistency of these findings with previous work indicating connections between cell cycle phase and expression of genes related to stress responses and metabolism and argue that this validates their experimental approach, which relies on a complex statistical analysis of sparse data from a relatively small number of single cells. They then proceed to analyze drug-treated cells, mostly focusing on fluconazole (FCZ; which targets ERG11, thus disrupting ergosterol biosynthesis and membrane integrity) and examining individual cells at 2-, 3-, and 6-days following treatment. Their primary finding is the identification of two major classes of cells, one of which they call the α response, characterized by high ribosomal protein (RP) gene expression and the absence of either heat shock or hyperosmotic stress gene expression as well as low expression of glycolytic, carbohydrate reserve pathway, and histone genes. The second survival state on day 2 (called the β response) instead displays low RP gene expression and high heat-shock stress response. Interestingly, the proportion of β cells clearly increases on day 3. In addition, responses to caspofungin (CSP) and rapamycin (RAPA) are examined and compared to FCZ or untreated cells. The main conclusion that the authors draw from their data is that the initial α response transitions to the β response, which is similar to a recently characterized ribosome assembly stress response (RASTR) in the budding yeast S. cerevisiae. They argue that the transcriptional state in α cells provokes the transition to the β state.
  
  This manuscript presents an enormous amount of complex data whose significance will be difficult to evaluate for those (e.g., this reviewer) not immersed in the specialized analytical techniques used here. Taken at face value, however, the experimental findings are consistent with the authors' main conclusions. Nevertheless, and consistent with the complexity of the responses observed, there are many findings that remain to be explored in mechanistic detail and for which conclusions are less precise.
  
  We thank Reviewer #1 for their excellent questions. The manuscript does have a large amount of complex data, so this version of the manuscript has a tighter focus on the major findings (i.e. α/Rd versus β/Sd subpopulations in response to FCZ). We have tried to explore these subpopulations in greater depth with supporting data from complementary technologies and additional bioinformatic analyses. We agree that there still remain several observations in the manuscript that are not explored in mechanistic detail. We have tried our best to clearly delineate the evidence that we have for these findings in addition to their potential significance.
  
  Towards the simplification of the manuscript, we have moved the discussion regarding “comets” to Appendix 2 [Changes L837-897] along with the detailed analysis of the response of cells to rapamycin and caspofungin [Changes L899-963]. We have also removed from the manuscript a paragraph (and associated Figure 2 - figure supplement 5 in the original manuscript) from the Discussion that described our inability to assign DNA level chromosomal aberrations to either the Rd or Sd subpopulations using whole genome sequencing. Figures 5 and 6 of the original manuscript depicted GO analysis that compared changes in the molecular processes between α/Rd and β/Sd subpopulations at day 3 and 6 respectively. Although interesting, the figures do not advance the main findings of the manuscript and have been removed from this version.
  
  Reviewer #2 (Public Review):
  
  In this manuscript, Dumeaux et al. assess the heterogeneous cellular response of the fungal pathogen Candida albicans to antifungal agents, using single-cell RNA sequencing. The researchers develop an optimized single-cell transcriptomics platform for C. albicans, and exploit this technique to monitor the cellular response to treatment with three distinct antifungal agents. Through this analysis, they identify two distinct subpopulations of cells that undergo differential transcriptomic responses to antifungal treatment: one involving upregulation of translation and respiration, and the other involving stress responses. This work monitors how different and prolonged antifungal exposure alters and shifts fungal cell populations between these responses. This is an innovative study that exploits novel single-cell transcriptomic techniques to address a very interesting question regarding the heterogeneous nature of the fungal response to antifungal drug treatment. This work optimizes a protocol for single-cell RNA sequencing, which is a significant contribution to the fungal research community and will bolster future research efforts in this area. The identification of two distinct subpopulations of fungal cells with differential responses to antifungal treatment is an exciting and novel finding. While there are aspects of this manuscript that are of significant interest, there are also limitations to this work.
  
  The research is framed as a method to study antifungal drug tolerance, but it is not clear how it does so, based on the methods. This work also compares very different populations of cells (rapidly growing untreated cells compared with cells grown in antifungal for several days), making it difficult to assess the role of antifungal treatment specifically in this analysis. This manuscript is also written with a great deal of highly technical language that makes it difficult to dissect the major findings and outcomes from the study.
  
  We sincerely thank the reviewer for these comments and for making the effort to evaluate the manuscript. We have tried to address these criticisms by improving the introduction to better explain fungal drug tolerance [Changes L53-61] and to explain how our experimental design allows us to investigate this phenomenon (for example for UT cells L184-187, L142-149). We have also re-written subsections of the results to more intuitively explain technical concepts (especially surrounding single cell technologies and analyses) [L250-257, L368-373, L699-707]. Some subsections of the results have been moved to the appendices in order to better emphasize the major findings and outcomes (e.g. comets L837-897 and in-depth analysis of RAPA and CSP treatment L899-963). We address each of the specific concerns below. We have also removed some complicated analyses that did not directly advance the major findings of the manuscript, including the GO analysis in Figures 5 and 6 of the original manuscript.
  
  Before proceeding, we would like to take this opportunity to underscore that these experiments were not primarily designed to investigate the differences between untreated (UT) and treated cells. The major findings (of the α/Rd and β/Sd subpopulations) are not dependent on the UT profiles. That is, the α/Rd and β/Sd subpopulations would be evident even if the UT profiles were removed from the manuscript entirely. Rather, the UT profiles/analyses are intended to contribute to the manuscript by helping establish the technical efficacy of the sc-profiling technique. For example, we might expect - a priori - that a large component of cell-to-cell heterogeneity in isogenic UT cells should correspond to differences in cell cycle, and, indeed, this is what we found.
  
  Indeed, we did embed (via UMAP) and cluster (via Leiden clustering) the UT data alongside data for the drug-treated cells (Figure 3), which reveals that UT cells largely cluster separately from drug-treated cells. The reviewer is absolutely correct to question the sources underlying this separation; in addition to differential cellular responses to the drug itself, some of the separation may be due to differences in the amount of growth media, for example. (The fact that different drugs (FCZ, RAPA and CSP) largely separate from UT cells and from each other may suggest that at least some of this separation could be due to differences in the mode of action of each drug rather than to issues related to, for example, media depletion.) However, this difference is not a major finding of the manuscript. Rather, we agree with the reviewer that “The identification of two distinct subpopulations of fungal cells with differential responses to antifungal treatment is an exciting and novel finding”. As such, the major results begin with data in panels 3D and E that reveal the two distinct cell types within the FCZ-treated sample (a distinction that is not dependent on the status of the UT cells).
  
  Reviewer #3 (Public Review):
  
  The authors described their extensive single-cell analysis of Candida undergoing (sub-inhibitory) antibiotic treatment versus no treatment. To do so, the authors used a microfluidics platform they had previously developed, and they optimized, characterized, and validated it for this particular application. Their findings included: (a) the transcription of untreated cells is driven mostly by cell cycle phase, (b) treated cells can be clustered into several major groups and a few outlier groups that the authors termed comets, (c) cells undergoing FCZ treatment can adopt one of two different states (possibly bistability). I found the results interesting and the approach to be sound, and much of the results confirmed my prior expectations. The authors provide a detailed depiction of what is going on in the transcriptome during sub-inhibitory treatment, although this did not always lead to a mechanistic explanation. The clinical relevance was unclear to me beyond a proof of concept application for single-cell transcriptomics. In my opinion, an interesting follow-up would be to follow the transcriptional trajectory of lineages undergoing antimicrobial switching (on and off). The main issues I identified were the authors' use of the term tolerance versus resistance, interpretation of "comets", clustering approach, description of fitness, and comparison between time points.
  
  We thank the reviewer for their time and effort with this manuscript. In the revised manuscript, we expanded the introduction to better delineate between resistance and tolerance, moved the “comets” section to the appendices because it distracted from the major results, and provided more interpretive analysis of the findings. We also better defined the bioinformatic approaches (changes include, e.g., comets, L837-897, and in-depth analysis of RAPA and CSP treatment, L899-963). With respect to comparisons between time points, we now address these concerns throughout the Response to Reviewer document. We have also moved a comparison of UT versus FCZ cells to Appendix 2 (L828-836), as it was perhaps misleading readers as to our intention. We performed this comparison only as a “sanity” check to see whether single-cell (sc) profiling would detect differences between UT and drug-treated cells.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.07.20.500774v1
www.biorxiv.org

The Reissner fiber under tension in vivo shows dynamic interaction with ciliated cells contacting the cerebrospinal fluid

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This manuscript provides novel and intriguing experiments that aim to elucidate the mechanical properties of the Reissner fiber (RF) and to probe its interactions with the motile cilia in the central canal of the spinal cord. Using in vivo imaging in larval zebrafish, the authors show that the RF is under tension and oscillates dorsoventrally. Importantly, ablation of the RF triggered retraction and relaxation of the fiber cut ends. The retraction speed depends on where the fiber was ablated, with fastest retraction in the rostral side, indicating that tension in the RF builds up rostrally. The authors, based on observations from live imaging of intact and ablated RF and central canal, conjecture that numerous ependymal motile monocilia, that are tilted caudally and interact frequently with the RF, contribute to RF heterogenous tension via weak interactions.
  
  The work is important. The experiments are thorough and intricate. The findings are fascinating and open up the prospect for future investigations and models. I'm particularly curious as to what future experiments can be used to test the hypothesis put forward by the authors about the role of cilia-fiber interactions in the RF mechanical properties and function.
  
  We thank Reviewer#1 for showing enthusiasm and support.
  
  Reviewer #2 (Public Review):
  
  The present manuscript by the Claire Wyart group analyses the behaviour of Reissner's fibre (RF) when it is cut, as well as the behaviour of cells touching RF when contact is interrupted. They show that RF is under tension that is higher in the rostral than in the caudal spinal cord. One of the proposed mechanisms is a caudally oriented movement of the cilia of ependymal radial glial cells (ERG) that is inherent rather than caused by the contact with RF. Kolmer-Agduhr neurons, which are also CSF-contacting (CSF-cN), alter their activity when contact is lost through laser ablation of RF.
  
  This is an interesting paper - RF has long been proposed to be a source of signalling molecules in the development and physiological function of neural cells in the spinal cord. Cilia are the main centre of signalling activity in ciliated cells (e.g. for sonic hedgehog signalling) and the fact that ERG cilia are in direct contact with RF is intriguing. Presumably, signalling molecules could be directly transferred from RF to ERG at the contact points.
  
  Functionally, CSF-cN are augmenting spinal cord intrinsic sensory feedback on body curvature. This had been shown in vitro/ex vivo, but not clearly evaluated in the living animal. The data shown here demonstrate a possible mechanism for how the feedback can be mediated through contact with RF. This is of fundamental interest to understand the functioning of a locomotor network that is under evolutionary pressure to function early, since fish hatch at 3 days post fertilisation.
  
  We thank Reviewer#2 for the interest in our work.
  
  Interestingly, the authors propose (and discuss against the relevant literature) that the presence of RF in the central canal can influence the flow of the CSF, which should be investigated in further work.
  
  To bring readers back in the context of the existing literature:
  
  When using beads to track particles in the flow in the presence or absence of RF, we have not seen a major difference in the bidirectional dorsoventral profile of the embryonic CSF flow (Cantaut-Belarif et al., CB 2018; Sternberg et al., Nature Comm 2019; Thouvenin et al., eLife 2020).
  
  However, we cannot exclude that there could be a very local impact of the RF on CSF flow, due to the fact that the flow has to be null on the surface of the fiber (of 200 nm diameter). With our methods for tracking fluorescent particles in single planes at a time (Cantaut-Belarif et al., CB 2018; Sternberg et al., Nature Comm 2019; Thouvenin et al., eLife 2020), we are likely missing the fiber in the plane, and the fine analysis of the domain surrounding the fiber is not resolved. However, a null flow at the surface of the RF would impose a sharp gradient around the fiber.
  
  Note that our results estimating the effect of cutting the fiber on the beating frequency of motile cilia were not consistent across fish – half the cilia showing an increase while the rest show a decrease, making it hard to conclude. A finer analysis with higher temporal and spatial resolution in 3D will be necessary to decipher the role of the fiber on the beating of cilia and local CSF flow.
  
  Overall, the results are clearly presented, and methods are thoroughly given, including some indication on the reduction of bias (by blinding movies before analysis). The authors also clearly state the limitations of their work, mostly derived from optical limitation (size of the RF in the larval fish, and speed of the recording in the laser-equipped microscope). This doesn't affect the fundamental statements.
  
  Thank you again for your appreciation of our work.
  
  Reviewer #3 (Public Review):
  
  This manuscript by Bellegarda et al. examined the in vivo dynamic behavior of the Reissner fiber and its interactions with cilia and sensory neurons in the central canal of zebrafish larvae. The authors accomplished this by performing live imaging with a transgenic reporter zebrafish line in which the fiber is GFP-tagged and by finely tracking the movement of the fiber. Interestingly, they discovered that the fiber undergoes a dynamic vibratory-like movement along the dorsoventral axis. The authors then utilized a pulsed laser to precisely cut the fiber, which frequently resulted in a fast retraction behavior and a loss of calcium activity in sensory neurons in the central canal called CSF-CNs. Mechanical modeling of the elastic properties of the fiber indicated that the fiber is a soft elastic rod with graded tension along the rostrocaudal axis. Finally, by performing live imaging of motile cilia and the fiber in the central canal, they found that the two interact in close proximity and that cilia motility is affected when the fiber was cut. The authors concluded that the Reissner fiber is a dynamic structure under tension that interacts with sensory neurons and cilia in the central canal.
  
  Strengths:
  
  1) The study utilizes state-of-the-art microscopy techniques and beautiful transgenic zebrafish tools to characterize the in vivo behavior of the Reissner fiber and found that it exhibits surprising dynamic movements along the dorsal-ventral axis. This observation has important implications for the physiology and function of the Reissner fiber.
  
  2) By performing a series of clever laser cutting experiments, the authors reveal that the Reissner fiber is under tension in the central canal of zebrafish. This finding provides direct experimental evidence to support the hypothesis that the Reissner fiber functions in a biomechanical manner during spinal cord development and body axis straightening.
  
  3) By developing a mechanical model of the Reissner fiber and its retraction behavior, the authors estimate the elastic properties of the fiber and found that it is more akin to an elastic polymer rather than a stiff rod. This is a useful finding that illuminates the biophysical properties of the fiber.
  
  4) Through calcium and cilia imaging studies, the authors demonstrate that the Reissner fiber likely interacts with motile cilia and regulates the activity of ciliated sensory neurons (CSF-CNs). The authors propose a model in which fiber-cilia interactions may occur via weak interactions or frictional forces. This model is plausible and opens several new doors for additional investigation.
  
  We thank Reviewer#3 for the support.
  
  Weaknesses:
  
  1) All the live imaging experiments appear to be performed with animals paralyzed via the injection of a chemical agent (bungarotoxin). Does paralysis and/or bungarotoxin negatively impact the behavior of the Reissner fiber? Some data from non-paralyzed animals would ameliorate this concern.
  
  We performed very few experiments on non-paralyzed fish, as the position of the Reissner fiber was difficult or impossible to analyze in 3D. In a movie added to our revision as Movie 3, it is obvious that skeletal muscle contractions result in very large jumps of the fiber that cannot be corrected for using single-plane imaging. Without being able to monitor and correct for muscle contractions, any estimation of the fiber motion in this context would be artefactual.
  
  2) Although the authors convincingly demonstrate that the Reissner fiber is under graded tension, it remains unclear what is the relevance and function of tension on this structure. The photoablation data presented do not delineate between the relevance of the fiber being intact or tension on the fiber as cutting the fiber impacts both. Is fiber tension required for body straightening? At the site of fiber photoablation, does a spinal curvature develop? If cultured, do the ablated animals exhibit a scoliotic phenotype?
  
  We thank Reviewer#3 for asking these important questions. We asked ourselves the same questions but had to restrain the ambition of our study for technical reasons: the ablation experiments, performed on an inverted microscope, required mounting the fish close to the bottom coverslip, and it was extremely difficult to then safely remove the animal from the imaging cuvette without affecting the alignment of its body axis.
  
  3) One of the most potentially impactful conclusions of the paper is that the Reissner fiber interacts with cilia, but the evidence is insufficient to support this. Although some motile cilia are near the fiber (Figure 3A), many cilia are not near the fiber. The provided images and videos do not clearly demonstrate that cilia physically contact or influence the behavior of the Reissner fiber. Further, the data is lacking to conclude that the Reissner fiber directly impacts cilia motility as they did not observe an overall statistically significant difference before and after ablation (Supplemental Figure 1A). Higher magnification, higher resolution, higher acquisition rate and/or colocalization analyses of fiber-cilia interactions could alleviate this concern.
  
  We agree with the reviewer but, for technical reasons, could not yet perform experiments with higher spatial and temporal resolution. Further analysis of cilia and RF translational motion is displayed in Figure 4 - Supplemental Figure 2 and presented in the Results section. We observed that for 7 out of 15 dorsal cilia and 4 out of 9 ventral cilia, the preferred position of the cilium was correlated with a position of the fiber, suggesting that they could interact. However, our current 2D dataset is too incomplete to draw strong conclusions on the nature of interactions between fiber and cilia. A future study relying on 3D analysis of the fiber and cilia should resolve how collective interactions of cilia may determine the position of the fiber.
  
  4) Similarly, how does the Reissner fiber interact with CSF-CN sensory neurons? The authors suggest that the fiber interacts with CSF-CN sensory neurons by modulating their spontaneous calcium activity via weak interactions or frictional forces from motile ciliated ependymal radial glial cells. While the calcium imaging data of the CSF-CNs is convincing and sound, the exact nature of the fiber-neuron interaction is unclear. Do cilia or apical extensions on CSF-CN sensory neurons sense the fiber or forces through a mechanosensing or chemosensing mechanism?
  
  This question is of great interest to us and will be the topic of a future investigation, as it is very difficult to image the CSF-cN motile cilium (see Bohm et al., Nature Comm 2016), and even more so together with the Reissner fiber.
  
  There is some additional confusion as the authors appear to focus their cilia experiments on ependymal radial glial cells in section 4, rather than CSF-CNs. The addition of an illustrative cartoon would add clarity.
  
  We agree and we added a schematic in the last figure (Figure 4A).
  
  Overall, the conclusions of the study are well supported by the data presented. However, the strength of the conclusions could be enhanced by additional controls, alternative experimental approaches and clarifications.
  
  This manuscript is an important contribution to the fields of spinal cord development and body axis development, which are fundamental questions in neurobiology, developmental biology, and musculoskeletal biology. In recent years, the Reissner fiber and motile cilia function have been linked to cerebrospinal fluid flow signaling and body straightening, but the precise form and function of the fiber remain unclear. This study provides new insight into the dynamic and biophysical properties of the Reissner fiber in vivo in zebrafish and proposes a model in which the fiber interacts with cilia and sensory neurons. This study provides novel insight into the cellular mechanisms that underlie the pathogenesis of disorders such as idiopathic scoliosis.
  
  We thank Reviewer #3 and have added further analysis of cilia and RF motion, displayed in the figures below, which were also added as extended data figures in the main manuscript.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.22.529498v1
www.biorxiv.org

New submission 06/08/2023, 14:06:25

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author response
  
  Reviewer #1 (Public Review):
  
  The potential role of the CaMKII holoenzyme in synaptic information processing, storage, and spread has fascinated neuroscientists ever since it has been described that self-phosphorylation of CaMKII at T286 (pT286) can maintain the kinase in an activated state beyond the initial Ca2+ stimulus that induced kinase activation and pT286. The current study by Lučić et al utilizes biochemical and biophysical methods to re-examine two pT286 mechanisms and finds:
  
  (1) that a previously proposed activation-induced subunit exchange within the holoenzyme cannot provide pT286 maintenance or propagation; and
  
  (2) that pT286 can occur not only within a holoenzyme but also between two holoenzymes, at least at sufficiently high concentrations.
  
  For the observation regarding the subunit exchange, the authors go above and beyond to demonstrate that a previously proposed activation-induced subunit exchange does not actually occur in their hands and that the previous appearance of such a subunit exchange may instead be due to activation-induced interactions between the kinase domains of separate holoenzymes. This provides important clarification, as the imagination about the possible functions of this subunit exchange has been running wild in the literature.
  
  By contrast, pT286 between holoenzymes at sufficiently high concentrations was largely predicted by the previously reported concentration-dependence of pT286 between monomeric truncated CaMKII (although these previous experiments did not rule out that such pT286 could have been excluded for intact full-length holoenzymes). Notably, the reaction rate reported here for pT286 between two holoenzymes is more than two orders of magnitude slower compared to the previously described rate of the pT286 reaction within a holoenzyme.
  
  The only point on which we disagree (and we think it’s unarguable) is that the current consensus is that inter-holoenzyme phosphorylation simply doesn’t happen (whether or not monomers can phosphorylate each other). The reviewer is of course right that this view now seems less and less likely. We have now performed new experiments to investigate this critical point further (see below).
  
  The probable reason for the discrepancy between the half-time of phosphorylation measured in earlier reports and in our paper is that earlier reports (for example Bradshaw et al., 2002) measured the autophosphorylation rate of wild-type CaMKII holoenzymes at catalytically competent enzyme concentrations of 0.1-5 µM. We are reporting the rate of phosphorylation of 4 µM kinase-dead CaMKII, which is only a substrate, by 10 nM catalytically competent enzyme (wild-type CaMKII). There is up to 500 times less catalytically competent enzyme in our reactions, which is probably why the reaction itself is several orders of magnitude slower.
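
  The "up to 500 times" figure follows directly from the concentrations quoted above. The snippet below is only an illustrative check of that arithmetic, not part of the authors' analysis; the function and variable names are hypothetical, and the inputs are the 0.1-5 µM range cited for the earlier reports and the 10 nM active enzyme used here.

```python
# Illustrative arithmetic only: the concentrations are those quoted in the
# response above; the names below are hypothetical and not from the manuscript.
NM_PER_UM = 1000.0  # nanomolar per micromolar


def fold_difference(earlier_um: float, current_nm: float) -> float:
    """Return how many times more active enzyme the earlier assay contained."""
    return (earlier_um * NM_PER_UM) / current_nm


earlier_range_um = (0.1, 5.0)  # wild-type CaMKII range cited for earlier reports
current_active_nm = 10.0       # wild-type CaMKII in the reactions described here

low_fold = fold_difference(earlier_range_um[0], current_active_nm)
high_fold = fold_difference(earlier_range_um[1], current_active_nm)
print(f"{low_fold:.0f}x to {high_fold:.0f}x less active enzyme")
```

  Under these assumptions, the amount of catalytically competent enzyme differs by roughly 10-fold at the low end of the earlier range and 500-fold at the high end.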
  
  In summary, this study contains two somewhat disparate parts: (1) one technical tour-de-force to provide evidence that argues against activation-induced subunit exchange, which was a tremendous effort that provides influential novel information, and (2) another set of experiments showing the somewhat predictable potential for pT286 between holoenzymes, but without indication for the functional relevance of this rather slow reaction. Unfortunately, in the current/initial title of the manuscript, the authors chose to emphasize the weaker part of their findings.
  
  We agree with the reviewer that the title should be modified to emphasize both findings of our study. We also hope that our new experiments do bolster our findings with regard to pT286 between holoenzymes, as the reviewer puts it.
  
  The seemingly slow inter-holoenzyme phosphorylation is only slow under conditions in which one of the proteins is kinase-dead. In a situation in which all CaMKII holoenzymes are wild-type and therefore capable of performing phosphorylation (both intra- and inter-holoenzyme), the reaction rates for pT286 are expected to be orders of magnitude faster than those reported here for the phosphorylation of T286 on kinase-dead protein.
  
  Reviewer #2 (Public Review):
  
  This well-written manuscript provides a technical tour-de-force to provide a novel mechanism for sustaining CaMKII autophosphorylation through an interholoenzyme reaction mechanism the authors term inter-holoenzyme phosphorylation (IHP). The authors use molecular engineering to create designer molecules that permit detailed testing of the proposed interholoenzyme reaction mechanism. By catalytically inactivating one population of enzymes, they show using standard assays that the inactive enzyme can be phosphorylated by active holoenzymes. They go on to show that in cells, the inactive enzyme is phosphorylated only in the presence of co-expressed active CaMKII and that this does not appear to be due to active and inactive subunits mixing within the same holoenzyme. The authors suggest reasons for why previous experiments failed to expose IHP and in some experiments provide evidence that reproduces and then extends earlier studies. Some noted differences from earlier experiments are the reaction temperature, the time course of the reactions, and that significantly higher concentrations of the inactive (substrate) kinase in the present study amplify the IHP. These are plausible reasons for earlier studies not finding significant evidence for IHP and the presented data is well-controlled and of high quality.
  
  The authors then take on the idea of subunit exchange employing multiple strategies. Using genetic expansion, they engineer an unnatural amino acid into the hub domain of the kinase (residue 384). In the presence of the photoactivatable crosslinker BZF and UV illumination, a ladder of subunits was generated indicating intraholoenzyme crosslinks were established. Using this cross-linked enzyme, presumably incapable of subunit exchange, the authors show significant phosphorylation of the kinase-dead mutant. This further supports that IHP is the cause of phosphorylation and not subunit exchange. Extending these experiments, when CaMKIIF394BZF was mixed with the kinase-dead mutant and exposed to UV light, they could not find evidence that kinase-dead subunits exchanged into CaMKIIF394 (active) enzymes.
  
  Just a note, instead of residue 384, this should read 394.
  
  With an entirely different approach, the authors use isotopic labeling of different pools of wt CaMKII (N14 or N15) followed by bifunctional cross-linking and mass spec to assess potential intra- and interholoenzyme contacts. Several interesting findings came of these studies detailed in Figure 4, mapped in detail in Figure 5, and extensively documented in supplementary tables. Critically, numerous crosslinks were found between different domains of the enzyme (catalytic, regulatory, hub) that are themselves a nice database of proximity measurements, but critical to the hypothesis, no heterotypic cross-links were found in the hub domains at any activated state or time point of incubation. This data supports two findings, that catalytic domains come into close proximity between holoenzymes when activated, supporting the potential for IHP, but that no subunit exchange occurs.
  
  The authors then pursue the approach used originally to provide evidence of subunit mixing, single molecule-based fluorescence imaging. Using pools of CaMKII labeled with spectrally separable dyes, the authors reproduce the earlier findings (Stratton et al, 2016) showing that under activating conditions, but not basal conditions, colocalized spots were detected. Numerous controls were done that confirm the need for full activation (Ca2+/CaM + Mg2+/ATP) to visualize co-localized CaMKII holoenzymes. Extending these studies, the authors mix holoenzymes, fully activate them, and after sufficient time for subunit exchange (if it occurs), the reactions were quenched, and then samples were analyzed. The result was that no evidence of dual-colored holoenzymes was present; if subunits had mixed between holoenzymes, dual-colored spots should have been evident after quenching the reactions. This was not the case. Further, experiments repeated with pools of differentially labeled kinase dead enzymes produced no colocalization, as predicted, if activation of the catalytic domains is necessary to establish IHP.
  
  Finally, the authors employ mass photometry to investigate the potential for interholoenzyme interactions. At basal conditions, only a mass peak consistent with CaMKII dodecamers was evident. Upon activation, a small fraction of dimeric complexes was evident (with Ca2+/CaM bound) but the majority of the peak was a dodecamer with 12 associated CaM molecules, and importantly, a significant fraction of a mass population was found consistent with a pair of holoenzymes with associated CaM. As an aside, the holoenzyme population appeared to be modestly destabilized as evidence of a minor fraction of dimers appeared as the authors diluted the enzyme, but the pools of holoenzyme and pairs of holoenzymes (with CaM) remained the dominant species when activated under all three enzyme concentrations assessed. Supporting the importance of activation for interactions between holoenzymes, the catalytically dead kinase even under activating conditions, shows no evidence of dimers of holoenzymes.
  
  Each of the approaches is well-controlled, the data is of uniformly high quality, and the authors' interpretations are generally well-supported.
  
  We are very grateful for these supportive comments.
  
  Reviewer #3 (Public Review):
  
  CaMKII is a multimeric kinase of great biologic interest due to its crucial roles in long-term memory, cardiac pacemaking, and fertilization. CaMKII subunits organize into holoenzymes comprised of 12-14 subunits, adopting a donut-like, double-ringed structure. In this manuscript, Lucic et al challenge two models in the CaMKII field, which are somewhat related. The first is a longstanding topic in the field about whether the autophosphorylation of a crucial residue, Thr286, can be phosphorylated between intact holoenzymes (inter-holoenzyme phosphorylation). The second is a more recent biochemical finding, which tested the long-running theory that CaMKII exchanges subunits between holoenzymes to create mixed oligomers. These two models are connected by the idea that subunit exchange could facilitate phosphorylation between subunits of different holoenzymes by allowing subunits to integrate into a different holoenzyme and driving transphosphorylation within the CaMKII ring. Here, the authors attempt to show that one intact holoenzyme phosphorylates another intact holoenzyme at Thr286. The authors also provide evidence suggesting that subunit exchange is not occurring under their conditions, and therefore not driving this phosphorylation event. The authors propose a model where instead of exchanging subunits, two holoenzymes interact via their kinase domains to enable transphosphorylation at Thr286 without integrating into the holoenzyme structure. In order for the authors to successfully convince readers of all three facets of this new model, they need to provide evidence that 1) transphosphorylation at Thr286 happens when subunit exchange is blocked, 2) subunit exchange does not occur under their conditions, and 3) there are interactions between kinases of different holoenzymes that lead to productive autophosphorylation at Thr286.
  
  Strengths:
  
  The authors have designed and performed a battery of cleverly designed and orthogonal experiments to test these models. Using mutagenesis, they mixed a kinase-dead mutant with an active kinase to ask whether transphosphorylation occurs. They observe phosphorylation of the kinase-dead variant in this experiment, which indicates that the active kinase must have phosphorylated it. A few key questions arise here: 1) whether this phosphorylation occurred within a single CaMKII holoenzyme ring (which is the canonical mechanism for Thr286 phosphorylation), 2) whether the phosphorylation occurred between two separate holoenzyme rings, and 3) why was this not observed in previous literature? To address questions 1 and 2, the authors implemented an innovative strategy introducing a genetically encoded photocrosslinker in the oligomerization domain, which when crosslinked using UV light, should lock the holoenzyme in place. The rate of phosphorylation was the same when comparing uncrosslinked and crosslinked CaMKII variants, indicating that phosphorylation is occurring between holoenzymes, rather than through a subunit exchange mechanism that would require some type of disassembly and reassembly (presumably blocked by crosslinking). The 3rd question remains as to why this has not been previously observed, as it has not been for lack of effort. The authors mention low temperature and low concentration as culprits, however, Bradshaw et al, JBC v. 277, 2002 carry out a series of careful experiments that indicated that autophosphorylation at T286 is not concentration-dependent (meaning that the majority of phosphorylation occurs via intra-holoenzyme), and this is done over a concentration and temperature range. It is possible that due to the mutants used in the current manuscript, it allows for the different behavior of the kinase-dead domains, which will have an empty nucleotide-binding pocket. Further studies will need to elucidate these details, and importantly, understand what physiological conditions facilitate this mechanism.
  
  We thank the reviewer for their assessment of our work.
  
  The paper cited by the reviewer (Bradshaw et al, JBC v. 277, 2002) is indeed a carefully designed biochemical investigation of CaMKII activity. As the reviewer pointed out, one of the conclusions of the paper is that the autophosphorylation of CaMKII is not concentration dependent, implying that it has to occur exclusively intra-holoenzyme. However, there are some limitations which colour the interpretation of this classic paper. Bradshaw and colleagues used only CaMKII wild-type protein, so the autophosphorylation taking place in their reactions is possible both within holoenzymes and between holoenzymes, and the two are impossible to distinguish. The authors of the cited paper then used an “autonomous activity assay” (not a measurement of pT286 on CaMKII itself), in which they first stopped the initial autophosphorylation reaction at T286 by adding a quench solution containing a mixture of EDTA and EGTA, and then measured phosphorylation of the peptide substrate of CaMKII (autocamtide-2) in the absence of Calmodulin binding (autonomous activity). They also diluted the autophosphorylation reaction to 10 nM CaMKII before adding it to the “autonomous activity assay”.
  
  As a side point, each reaction was quenched and diluted to the same final CaMKII concentration of 10 nM. They measured the activity of this dilution with phosphorylation of a peptide substrate (autocamtide-2), in the absence of CaM binding. The authors contend that autonomous activity reported in this way reflects the amount of pT286, which is not impossible, but it is not a direct measure of pT286.
  
  All this adds up to allowing the autophosphorylation of wild-type CaMKII at various concentrations ranging from 0.1 to 4.6 µM in the presence of 10 µM Ca/CaM and 500 µM Mg/ATP. This is a very fast reaction: the concentrations of enzyme (CaMKII wild-type), activator (Ca/CaM) and ATP/Mg are all high at the beginning of the autophosphorylation reaction and would be expected to allow maximal autophosphorylation in very short times (seconds). Most importantly, this experiment does not exclude an inter-holoenzyme reaction slower than the intra-holoenzyme one. It certainly could not detect it.
  
  In any case, to relate these concepts to our experiments and current understanding of CaMKII, we performed a new set of experiments modelled on the Bradshaw paper. Critically, we used CaMKII wild-type as the enzyme and CaMKII kinase-dead as the substrate. Intraholoenzyme phosphorylation cannot occur in this reaction, which was designed to detect a concentration-dependent phosphorylation reaction. We used a fixed concentration of the substrate kinase (4 µM) and 4 different concentrations of CaMKIIWT ranging from 0.5 to 100 nM. In our assay, the level of phosphorylation on substrate CaMKII (CaMKIIKD) was dependent on the concentration of enzyme CaMKII (CaMKIIWT) (Figure 1-figure supplement 3), adding more evidence to the hypothesis that CaMKII autophosphorylation can occur inter-holoenzyme.
  
  It is highly unlikely that an empty nucleotide-binding pocket influences the phosphorylation status of T286 in the regulatory domain of kinase-dead CaMKII. One could perhaps envision that an empty nucleotide-binding pocket might expose the regulatory domain in kinase-dead CaMKII for phosphorylation, which would be prevented in CaMKIIWT, but in all available structures of CaMKII (Chao et al., 2011; Myers et al., 2017; Buonarati et al., 2021), the regulatory domain is docked to the kinase domain of CaMKII even though the nucleotide-binding pocket is empty (either by mutation of residue K42 and/or simply by not adding ATP/Mg, to reduce chemical dispersity of the sample). The only time the regulatory domain was not docked on the kinase domain is when CaMKII was in complex with Calmodulin (Rellos et al., 2010). Finally, in our crosslinking mass spectrometry experiments, we used both heavy and light forms of CaMKII wild-type, and there we can clearly see interactions between kinase/regulatory domains of two different species of CaMKIIWT, which are dependent on activation.
  
  The most convincing data that subunit exchange does not occur is from the crosslinking mass spectrometry experiment. The authors created mixtures of 'light' and 'heavy' CaMKII holoenzymes, either activated or not and then used a Lys-Lys crosslinker (DSS) to trap the enzyme in its final state. The results of this experiment indicate that subunit exchange is not occurring under their conditions. A caveat here is that there are not many lysines at hub-hub interfaces, which is the crux of this experiment. If there is no subunit exchange under their conditions, how does transphosphorylation occur between holoenzymes? The authors show very nice mass photometry data indicating that there are populations of 24-mers, which corresponds to a double-holoenzyme. Paired with the data from their crosslinking mass spectrometry which shows crosslinks between kinase domains of different holoenzymes, this indicates that perhaps kinases between holoenzymes do interact, and they do so in a competent manner to allow transphosphorylation to occur.
  
  It is true that there are “only” 6 Lysines in the hub domain of CaMKII. However, it is clear from our crosslinking mass spectrometry data that we can detect hub:hub peptides coming from the same holoenzymes (homocrosslinks, either 14N:14N or 15N:15N species), but never between holoenzymes (14N with 15N). The fact that peptides can be detected in the homocrosslinks speaks to the validity of using Lysine crosslinkers in this experiment.
  
  Weaknesses:
  
  The authors should be commended for performing three orthogonal experiments to test whether CaMKII holoenzymes exchange subunits to form heterooligomers. However, there are technical issues that dampen the strength of the results shown here. For simplicity, let's consider that CaMKII holoenzymes are comprised of two stacked hexameric rings. It has been proposed that the stable unit of CaMKII assembly and perhaps also disassembly and subunit exchange is a vertical dimer unit (comprised of one subunit from each hexameric ring). In the UV crosslinking data shown in this paper, the authors have a significant number of monomers, some crosslinked dimers (of which there are two populations), and fewer higher-order oligomers. To effectively block subunit exchange, robust crosslinking into hexamers is necessary, which the authors have not done. Incomplete crosslinking results in smaller species that can still exchange (and/or dissociate), confounding the results of this experiment. In addition, Figure 3 shows a trapping experiment, where if the exchange was occurring, there would be an oligomeric band in Lane 8, which is visible and highlighted with a blue arrow by the authors. This result is explained by nonspecific UV effects, however by eye it is not clear if there is an equivalent band in lane 10. The overall issue here is inefficient crosslinking.
  
  We agree with the reviewer that the robustness of the UV-induced crosslinking is not extremely high. However, we do observe higher-order oligomers on the gel (Figure 2 and Figure 3B, pT286 blot), which indicates that at least a portion of the holoenzymes is crosslinked. On the other hand, the UV-induced crosslinking does not slow down the trans-phosphorylation reaction, which would be expected if subunit exchange were the prevailing mechanism for the spread of kinase activity between holoenzymes.
  
  In Figure 3, lanes 8 and 10 show a small portion of dimers (less than 5% by densitometry), at the absolute limit of detection. This dimer band is most likely due to nonspecific UV-induced disulfide bridging (we already lessened it by adding 50 mM TCEP prior to UV treatment; Figure 3-figure supplement 1B and C). Previous reviewers of this manuscript criticized the small dimer band in lane 8, and we wanted to address this transparently in the submission to eLife.
  
  Unfortunately, if we absolutely crank up the contrast to see this band in lane 10, we start to see other features in the noise as well. We have now edited the image in Figure 3B to highlight these minor bands more clearly, but this is also not ideal.
  
  With regard to the trapping experiment, the overall problem is not inefficient crosslinking, because we see that the P-T286 signal is quite nicely represented in higher-order bands from the F394BzF protein, whereas the kinase-dead protein (Avi-tagged signal in Figure 3) is almost entirely absent. Any crosslinking of Avi-tagged protein (possibly corresponding to subunit exchange) is a minor process at the limit of detection on WB.
  
  Unfortunately, we have not yet found any better crosslinking sites than the two we report (we have tried about 10). But the results we did obtain encouraged us to employ other techniques to probe subunit exchange (for example, the MS X-linking).
  
  The authors also employ a single-molecule TIRF experiment to further interrogate subunit exchange. Upon inspection of the TIRF images, it is not clear that the authors are achieving single molecule resolution (there are evident overlapping and distorted particles). The analysis employed here is Pearson's correlation coefficient, which is not sufficient for single molecule analysis and would not account for particle overlap, particles that are too bright, and/or particles that are too dim. For example, an alternative explanation for the authors' results is that activation results in aggregation (high correlation), and subsequent EGTA treatment leads to dissociation at these low concentrations (low correlation). However, further experimentation and analysis are necessary.
  
  In the manuscript we present raw images, not processed ones. As we wrote in the Materials and Methods, we thresholded the images for further processing. All colocalization methods have drawbacks, but we found that our thresholding combined with the Pearson coefficient was highly reproducible. We also looked at Manders coefficients, but these are less straightforward to understand, whilst still giving the same answer in our hands. We agree that there are more experiments that can be done, with particular predictions based on our new mechanism. We are doing these experiments and will report them when they are ready.
  
  At the risk of repeating ourselves, the reversible loss of overlap of the two labelled populations is the key result and cannot be explained by spurious dim or bright particles, or by a few overlapping profiles.
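For readers unfamiliar with the two colocalization measures named in this exchange, the following is a minimal sketch of how Pearson and Manders coefficients are commonly computed from two thresholded channels. It is not the authors' analysis pipeline: the function names, the flattened pixel lists, and the toy intensities are all assumptions made for illustration, and the cutoff passed to `threshold` is hypothetical.

```python
from math import sqrt


def threshold(pixels, cutoff):
    """Set intensities below a (hypothetical) cutoff to zero."""
    return [p if p >= cutoff else 0.0 for p in pixels]


def pearson(a, b):
    """Pearson correlation of two equally sized, flattened channels.

    Assumes neither channel is constant (otherwise the variance is zero).
    """
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def manders(a, b):
    """Manders coefficients (M1, M2) for already thresholded channels.

    M1: fraction of channel-a intensity in pixels where channel b is non-zero.
    M2: fraction of channel-b intensity in pixels where channel a is non-zero.
    """
    m1 = sum(x for x, y in zip(a, b) if y > 0) / sum(a)
    m2 = sum(y for x, y in zip(a, b) if x > 0) / sum(b)
    return m1, m2


# Toy data (illustrative only): two channels that overlap, then two that do not.
overlapping = ([0.0, 10.0, 20.0, 0.0], [0.0, 5.0, 10.0, 0.0])
separated = ([10.0, 0.0, 20.0, 0.0], [0.0, 5.0, 0.0, 10.0])

for label, (ch_a, ch_b) in (("overlapping", overlapping), ("separated", separated)):
    print(label, round(pearson(ch_a, ch_b), 3), manders(ch_a, ch_b))
```

In this toy case both measures fall when the two labelled populations stop overlapping, which is the kind of agreement between coefficients the response describes. Pearson is sensitive to intensity relationships, whereas Manders depends more directly on the chosen threshold.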
  
  Taken together, the authors have provided important food for thought regarding inter-holoenzyme phosphorylation and subunit exchange. However, given the shortcomings discussed here, it remains unclear exactly what mechanisms are at play within and between CaMKII holoenzymes once activated.
  
  We thank the reviewer for their critical assessment of our manuscript. We will continue to investigate the relevant points and refine the overall picture of CaMKII, to better clarify the mechanisms.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.08.03.502606v2
www.biorxiv.org www.biorxiv.org

New submission 06/08/2023, 13:20:40

1
1. Public_Reviews 10 Aug 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 The Authors of this study have investigated the consequence of knocking out protein 4.1B on hippocampal interneurons. They observed that in 4.1B KO mice, the myelinization of axons of PV and SST interneurons was altered. In addition, the molecular organization of the nodal, heminodal, and juxtaparanodal parts of the interneuron axons was disrupted in 4.1B KO mice. Further, the authors found some changes in spiking features of SST, but not PV interneurons as well as synaptic inhibition recorded in CA1 pyramidal cells. Lastly, 4.1B KO mice showed some impairment in spatial memory.
 
 Strengths
 
 One of the strengths of this MS is the multilevel approach to the question of how myelinization of interneuron axons can contribute to hippocampal functions. Further, the cell biological results support the claim of the reorganization of channel distributions at axonal nodes.
 
 Weaknesses
 
 1) Although the authors acknowledge that SST is expressed in different GABAergic cell types in the hippocampus, they claim that OLM cells, which express SST are subject to changes in 4.1B KO mice. However, this claim is not supported by data. Both OLM cells and GABAergic projection cells expressing SST have many long-running axons in the stratum radiatum, where the investigations have been conducted (e.g. Gulyas et al., 2003; Jinno et al., 2007). Thus, the SST axons can originate from any of these cell types. In addition, both these GABAergic cells have a sag in their voltage responses upon negative current injections (e.g. Zemankovics et al., 2010), making it hard to separate these two SST inhibitory cell types based on the single-cell features. In summary, it would be more appropriate to name the sampled interneurons as SST interneurons. Alternatively, the authors may want to label intracellularly individual interneurons to visualize their dendrites and axons, which would allow them to verify that the de-myelinization occurs along the axons of OLM cells, but not SST GABAergic projection neurons.
 
 We agree and have named the sampled interneurons SST interneurons throughout the text. We acknowledge that SST GABAergic projection cells have long-running axons in the stratum radiatum (Gulyas et al., 2003; Jinno et al., 2007) that may also be dysmyelinated. See Results, lines 200 and 350.
 
 2) Although both the cellular part and the behavioral part are interesting, there is no link between them at present. The changes observed in spatial memory tests may not be caused by the changes in the axonal de-myelinization of hippocampal interneurons. Such a claim can be made only using rescue experiments, since changes in 4.1B KO mice leading to behavioral alterations may occur i) in other cell types and ii) in other regions, which have not been investigated.
 
 Alteration of spatial memory has not been previously reported in the 4.1B KO mice. Our results leave open the possibility that dysmyelination of inhibitory interneurons in the hippocampus may induce impaired cognitive ability (see preprint). We agree that future studies investigating a putative rescue of spatial memory by means of virus-mediated expression of 4.1B in hippocampal Lhx6 interneurons would be very informative.
 
 Reviewer #2 (Public Review):
 
 In this study, Pinatel et al. address the role of interneuron myelination in the hippocampus using a 4.1B protein mouse knockout model. They show that deficiency in 4.1B significantly reduces myelin in CA1 stratum radiatum, specifically myelin along axons of parvalbumin and somatostatin hippocampal interneurons. In addition, there are striking defects in the distribution of ion channels along myelinated axons, with misplacement of Na channel clusters along the nodes of Ranvier and the heminodes, and a pronounced decrease in potassium channels (Kv1) at juxtaparanodes. The axon initial segments of SST are also shorter. Because the majority of myelinated axons in the stratum radiatum of the hippocampus belong to PV and SST interneurons such profound changes in myelination are expected to affect interneuronal function. Interestingly, the authors show that PV basket cells' properties appear largely unaffected, while there are substantial changes in stratum oriens O-LM cells. Inhibitory inputs to pyramidal neurons are also changed. Behaviorally, the 4.1B KO mice exhibit deficits in spatial working memory, supporting the role of interneuronal myelination in hippocampal function. This study provides important insights into the role of myelination for the function of inhibitory interneurons, as well as in the mechanisms of axonal node development and ion channel clustering, and thus will be of interest to a broad audience of circuit and cellular neuroscientists. However, the claims of the specificity of the reported changes in myelination need to be better supported by evidence.
 
 Strengths:
 
 The authors combine a wide array of genetic, immunolabeling, optical, electrophysiological, and behavioral tools to address a still unresolved complex problem of the role of myelination of locally projecting inhibitory interneurons in the hippocampus. They convincingly show that changing myelination and ion channel distribution along nodes and heminodes significantly impairs the function of at least some interneuron types in the hippocampus and that this is accompanied by behavioral deficits in spatial memory.
 
 Regarding the organization of myelinated axons, the lack of 4.1B causes striking changes at the nodes of Ranvier that are convincingly and beautifully presented in the Figures. While the reduction in Kv1 in 4.1B KO mice has been previously reported, the mislocalization of sodium channels at the nodes and heminodes had only been observed in developing but not adult spinal cords. This difference in the dependence of the sodium channel distribution on 4.1B in adult hippocampus vs spinal cord may hold important clues for the varying role of myelin along axons of different neuronal types.
 
 The manuscript is very well written, the discussion is comprehensive, and provides detailed background and analysis of the current findings and their implications.
 
 Weaknesses:
 
 Because of the wide diversity of interneuron types in the hippocampus, and also the presence of myelinated axons from other neuron types as well, including pyramidal neurons, it is very difficult to disentangle the effects of the observed changes in the 4.1 B KO mouse model. While the authors have been careful to explore different possibilities, some of the claims of the specificity of the reported changes in myelination are not completely founded. For example, there is no compelling evidence that the myelination of axons other than the local interneurons is unchanged. The evidence strongly supports the claims of changes in interneuronal myelination, but it leaves open the question of whether 4.1B lack affects the myelination of hippocampal pyramidal neurons or of long-range projections.
 
 This is an important question also raised by Reviewer 1. We have now quantified the density of paranodes in the alveus, as shown in Figure 1I. Paranode density was not affected in the alveus or in the stratum lacunosum-moleculare, suggesting that myelinated axons connecting extra-hippocampal areas may be preserved. In particular, this is an indication that the axons of pyramidal neurons that run into the alveus should be properly myelinated.
 
 To be able to better interpret the changes in the 4.1B KO mice, knowledge of the distribution of 4.1B in the hippocampus of control mice will be very helpful. The authors state that 4.1B is observed in PV neurons but not in pyramidal neurons, however, the evidence is not convincing. Thus, the lack of immunolabeling at the pyramidal neuron cell bodies does not indicate that 4.1B is missing at the axonal level. The analysis also leaves out the question of whether 4.1 B is seen in the axons of somatostatin neurons.
 
 We agree and do not exclude that 4.1B may be expressed along the axons of pyramidal neurons. We performed double-staining for SST and 4.1B to show that 4.1B is localized along the internode and enriched at the paranodes of SST axons as observed for PV axons (Figure 4F). The enrichment of 4.1B in GABAergic neurons was previously observed in premyelinated hippocampal cell culture (Bonetto et al. 2019).
 
 Reviewer #3 (Public Review):
 
 Pinatel and colleagues addressed a currently understudied topic in neurobiology, namely, the architecture and function of myelination in subsets of Parvalbumin (PV)- and Somatostatin (SST)-positive GABAergic hippocampal interneurons and its dependence on juxtaparanodal organizer proteins. In order to elucidate the structural and functional implications of interneuron myelination, the authors visualized inhibitory neurons by utilizing a Lhx2-Lhx6 tdTomato reporter line in combination with mutants for crucial membrane and cytoskeletal linker proteins such as Contactin2/TAG-1, Caspr2, and Protein 4.1B. They then applied a comprehensive set of histological, electrophysiological, and behavioral experiments to dissect the role these proteins play in proper myelination and function of PV- and SST-interneurons.
 
 The bulk of the study's data is based on immunofluorescence, which is presented in a number of figures comprised of high-quality images. As much as this is a strength of the study, the underlying image analysis as described in the methods falls short. All structural data rely on the measurements of physical parameters such as length of internodes, the distance between (juxta)paranode and node, the distance between node and myelin sheath, length of the axon initial segment (AIS), etc. In light of this, and considering the small physical dimensions of the nodal region in general, the methods remain unclear about the depth of 3D reconstruction/deconvolution applied to the samples. Measurements presented in the results show significant differences in sub-micrometer dimension, which at least according to the stated methods, are unlikely to be precise given that the confocal imaging parameters do not seem to reach Nyquist conditions. For a study in which a third of all data is aimed at elucidating (sub)micrometer changes, this is crucial and the study would benefit from a more rigorous method description by the authors.
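The sampling concern raised here can be made concrete with a simplified rule of thumb. The sketch below assumes the Abbe lateral limit, d = λ / (2·NA), and asks for at least two pixels per resolvable distance. The wavelength and numerical aperture are hypothetical values, not taken from the study, and real confocal sampling criteria (including the axial step) are stricter and depend on the instrument.

```python
def abbe_lateral_limit_nm(wavelength_nm, numerical_aperture):
    """Approximate lateral resolution limit, d = lambda / (2 * NA)."""
    return wavelength_nm / (2.0 * numerical_aperture)


def max_pixel_size_nm(wavelength_nm, numerical_aperture, samples_per_period=2.0):
    """Largest pixel size that still gives `samples_per_period` samples per d."""
    return abbe_lateral_limit_nm(wavelength_nm, numerical_aperture) / samples_per_period


def meets_sampling(pixel_nm, wavelength_nm, numerical_aperture):
    """True if the pixel size satisfies the simplified two-samples criterion."""
    return pixel_nm <= max_pixel_size_nm(wavelength_nm, numerical_aperture)


# Hypothetical settings: 488 nm emission, NA 1.4 oil objective.
print(round(max_pixel_size_nm(488.0, 1.4), 1), "nm maximum lateral pixel size")
print(meets_sampling(80.0, 488.0, 1.4), meets_sampling(200.0, 488.0, 1.4))
```

Under these assumptions, a measured difference smaller than the pixel size cannot be resolved reliably, which is why the reviewer asks for the acquisition parameters to be reported.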
 
 Another methodological weakness is the somewhat small n, and its incoherence across the experiments and therefore, the statistics performed in some of the experiments. Statistics are based on either n for animals, or n for individual data points from several animals. Why is not all data represented as mean/animal? Also, the sampling in general with n = 3 animals is borderline acceptable; in some cases, it seems that only 2 animals were used, and in others, no number is given at all (please refer to author comments for details). This needs to be addressed, either by explaining why so few animals were used, or by adding more data from individual animals. Assigning structures (AIS, nodes) as n results in overstating effects, since especially for AIS, there is significant heterogeneity in the length across neurons from the same type, and this is masked when 100 AIS are considered as individual n instead of 100 AIS per animal, and the animal is (correctly) the n.
 
 Since the study seems to switch back and forth between these assignments, it would be helpful to level these data across all experiments unless there are specific reasons not to do so, which then need to be explained. As outlined in the methods, all values are given as means ± SEM; this needs to be corrected for those cases where the standard deviation is the appropriate choice (e.g. all graphs showing n = individual structure, and not the mean of an animal).
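The statistical point in the two preceding paragraphs can be illustrated with a minimal sketch that compares the SEM obtained when every structure counts as n with the SEM obtained when each animal's mean counts as n. The animal labels and AIS lengths are invented for illustration and are not data from the study.

```python
from math import sqrt
from statistics import mean, stdev


def sem(values):
    """Standard error of the mean (sample SD divided by sqrt(n))."""
    return stdev(values) / sqrt(len(values))


def summarize(by_animal):
    """Compare pooled-structure statistics with per-animal statistics."""
    pooled = [v for values in by_animal.values() for v in values]
    animal_means = [mean(values) for values in by_animal.values()]
    return {
        "pooled_n": len(pooled),
        "pooled_sd": stdev(pooled),
        "pooled_sem": sem(pooled),
        "animal_n": len(animal_means),
        "animal_sem": sem(animal_means),
    }


# Hypothetical AIS lengths (µm) from three animals; values are illustrative only.
ais_lengths = {
    "mouse_1": [20.0, 22.0, 24.0, 26.0],
    "mouse_2": [28.0, 30.0, 32.0, 34.0],
    "mouse_3": [24.0, 26.0, 28.0, 30.0],
}

summary = summarize(ais_lengths)
for key, value in summary.items():
    print(key, round(value, 3))
```

With these invented numbers, the pooled SEM is smaller than the per-animal SEM because animal-to-animal variation is ignored when structures are treated as independent. This is the overstatement the reviewer warns about, and the pooled SD is the more honest spread when individual structures are plotted.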
 
 As far as the analysis of geometrical AIS changes is concerned, the method section should be extended to address how, if at all, AIS length and position were analyzed in 3D, also considering the somewhat "spotty" immunosignal outlined in Fig. 8D.
 
 We agree with all these comments. We improved Fig. 1I and J by adding more data (n=4 mice). We would like to point out that the phenotype of the 4.1B KO mice is highly penetrant. The selective loss of myelin in the hippocampus was observed in the two different genetic backgrounds (4.1B-/- and 4.1B-/-;Lhx6;tdTomato mice) and at all the ages examined (P25-P180).
 
 For the quantitative morphological analysis: We considered “n=number of animals” in Figure 1 to describe the massive and selective alteration of myelin in the hippocampus of 4.1B KO mice. In the following Figures, we considered n=ROIs (Figure 2, Figure 3, Figure 6) for the density of SST and PV interneurons or oligodendroglial cells and n=individual structures (Figure 4, Figure 5, Figure 8) for a more precise sampling of the structure heterogeneity (internode, node, AIS). Means ± SEM are indicated in the text, corresponding to the box plots and distribution plots in the Figures.
 
 Concerning AIS measurements, we considered “n” as individual AIS, consistent with the electrophysiological recordings, in which “n” is the number of individual cells. We hope that we have now better illustrated the AIS of SST cells in the stratum oriens in the new Figure 8 with single-channel images. In contrast to the AIS of pyramidal neurons, which display a sinuous shape, the AIS of SST neurons (and especially O-LM cells, whose axons run straight across the stratum radiatum) show a rather straight organization.
 
 We improved our measurements of the AIS structural parameters (onset, length) of SST neurons of the stratum oriens using confocal imaging with a 20x objective, 0.54 µm steps, Nyquist conditions. Indeed, these new measurements confirmed that the AIS of SST neurons was significantly shorter in the 4.1B KO mice.
 
 The observed AIS length change is then discussed in the context of a study conducted in a pharmacological model of myelin loss, however, that particular study (Hamada & Kole, 2015) found not only a length change but a position change after cuprizone-induced AIS plasticity. The authors should therefore discuss this finding in a bit more detail than simply stating "Adaptation of the AIS has been reported in the cuprizone chemical model of demyelination" (p. 14, ll. 512).
 
 We added these sentences in the Discussion:
 
 Line 527: Supporting this notion, previous studies have reported an adaptive response of the AIS of cortical pyramidal neurons in the cuprizone chemical model of demyelination. Specifically, it was observed that the length of the AIS is reduced together with a more proximal site of the onset. These changes reduce the AIS excitability, suggesting a compensatory mechanism to ectopic action potentials generated in demyelinated axons (Hamada and Kole, 2015).
 
 Line 556: Interestingly, in cortical pyramidal neurons, demyelination induced by cuprizone causes the restructuring of AIS and reduces excitability at this site. “Acute demyelination leads to a more proximal onset of AIS without a change in the length of βIV spectrin expression. However, the AIS of these acutely demyelinated axons display a reduced length of Nav1.6 channel expression and extended Kv7.3 channel expression at the distal site (Hamada and Kole, 2015).”
 
 Similarly to the points made about structural data above, the data from electrophysiological recordings should be presented in such a way that e.g. the number of cells and/or animals is readily accessible from the graph or legend. In its current form, this information - while available - needs to be pieced together from in-text information supplemented by figure legends. Sometimes, the authors do not include the number of animals behind individual cell data (for details please see author comments). Please carefully review all figures and edit accordingly.
 
 The behavioral data presented in the study is interesting, but the conclusions drawn are not supported by the data presented, as many unknown factors remain in place that could contribute to the observed phenotype.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.10.523413v2
www.biorxiv.org www.biorxiv.org

New submission 14/12/2022, 10:19:02

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #3 (Public Review):
  
  Wernet et al. show that there are intrinsic protein oscillations at the hyphal tips of A. flagrans, a nematode trapping fungus, that become coordinated when two hyphae become close. They create a mathematical model of this synchronization phenomenon, and then go on to show that calcium is critical to the functioning of these oscillations and hyphal fusion. The concept of interhyphal communication through signal synchronization is fascinating, and the visual matching of the output of the model to the data is compelling. However, given that the authors already showed synchronized oscillations in the SofT protein in A. flagrans in Hammadeh et al. 2022 (Figure 4), this diminishes the novelty of the findings in this study. Additionally, as it also has been established that calcium drives other oscillatory communications, the characterization of calcium dependence is not especially novel or bringing new insights into the problem especially since it is unclear if the chelation is having effects due to loss of intracellular supplies and/or because it is the key signal in the dialogue. Right now the mathematical model feels a bit vague with discussion of hypothetical molecules, so the paper would be greatly strengthened if any key regulatory molecules that promote desynchronization could be identified or there were some manipulations of the core known proteins that examined consequences of altering the oscillations. As it is, the reader is left intrigued but there are few concrete conceptual advancements.
  
  We thank Reviewer #3 for the thoughtful comments on our manuscript! We would like to emphasize that the main finding of this paper is the discovery of a monologue of individual hyphae before fusion and the transition into a dialogue. This had not been shown in any fungus, and it explains nicely the onset of the communication. During the revision process, we performed co-localization of SofT-GFP and MakB-mCherry in the same hyphae and observed that both proteins oscillated in phase when no other hyphae were in the vicinity, which is the opposite of the anti-phasic oscillations observed so far during the cell dialogue. Additionally, we observed that decoupling of the oscillations into the anti-phasic cell dialogue occurred during the transitory phase. We included our results in the text (L167) and updated the figures to create a new figure 3 and supplementary figure Fig. SS.
  
  We agree that it would be great to isolate the signaling molecule. However, this has been tried by several groups, so far without success. Therefore, we think that this one main finding is exactly the scope of short reports for eLife.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.09.25.509415v1
www.biorxiv.org www.biorxiv.org

New submission 19/02/2023, 12:18:32

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The paper reports important work in which the Fub-1 boundary of the Drosophila bithorax complex is characterized in detail. Fub-1 separates the bxd/pbx regulatory domain, which is active in PS6/A1, from the abx/bx regulatory domain, which is active in PS5/T3. The work presented provides compelling evidence that Fub-1 consists of two key elements: an insulating boundary region called HS1, which is regulated by an adjacent region called HS2. HS2 contains a promoter that is activated in PS6/A1 by enhancers in the bxd/pbx region. Read-through of HS1 by transcripts from the HS2 promoter blocks the insulating activity of HS1, allowing the bxd/pbx regulatory regions to activate Ubx transcription in PS6/A1. It has long been appreciated that boundary elements within the BX-C are regulated in a segment-specific fashion. The work presented in the Ibragimov manuscript provides a very nice example of how this segment-specific regulation can take place. For the most part, the work is very thorough and the conclusions are well-supported. However, there are a few important issues that should be addressed.
  
  First, throughout the manuscript, it is stated that the read-through transcription of HS1 eliminates its blocking activity. Missing, however, is a test of whether the direction of transcription of HS1 is important. That is, no construct is tested in which HS1 is inverted so that RNAs from the HS2 promoter are transcribed from the opposite strand of HS1. If read-through transcription of HS1 is all that is required to abrogate its blocking activity, such a construct should behave identically to constructs in which HS1 is not inverted. However, if the structure of the F1HS2 RNA is critical to preventing the blocking activity of HS1, inversion of HS1 relative to HS2 may render it immune to inactivation by transcripts initiated at HS2.
  
  This is a good point. The sequence/structure of the transcript could be important—e.g., it recruits a factor that disrupts boundary activity.
  
  While we didn’t do such an experiment, this scenario seems unlikely. As noted above, we have replaced Fub-1 with two other BX-C boundaries, Mcp and Fab-8. Their sequences are different from each other and from Fub-1. Both block bxd/pbx from regulating Ubx and give an A1 LOF phenotype. To test the effects of transcription on boundary activity, we placed a P-element promoter upstream of both boundaries (so transcripts from the P-element promoter would read through the boundaries towards bxd/pbx). We found that inclusion of the P-element promoter rescued the LOF phenotypes.
  
  Second, the terminology used to designate the constructs tested is very hard to follow and needs simplification. Since the orientation of HS1 in isolation is unimportant, perhaps just HS1 HS2, HS1 Inv(HS2), HS2 HS1, and Inv(HS2) HS1 could be used.
  
  We wanted to keep the terminology as consistent as possible with publications on other BX-C boundaries.
  
  Third, in many places in the manuscript genotypes are shown in which the HS1 insulator is placed into F7attP50. For these genotypes, H1 is said to block the interaction between iab-6 and iab-7, but not to support bypass activity. Readers need some help here, as they will not understand why A5 and A6 tergites are black in these genotypes, as this implies that iab-5 is able to act over the HS1 element to activate Abd-B. One explanation may be that iab-5 can promote pigmentation by acting on abd-A.
  
  The likely explanation is that the Fab-6 boundary is able to "bypass" the intervening HS1 insulator and target iab-5 enhancers to the Abd-B promoter. There are other Fab-7 replacements in which the iab-5 enhancers are also blocked. We added an explanation to the text and cited a review article describing this.
  
  Fourth, a more complete description of the HS1248 HS2505R genotype is needed. In this genotype, the H1 insulator is constitutively active, as H2 is inverted. Do animals of this genotype show a bxd phenotype in the larval cuticle? Do adults show a transformation of the halteres like that shown by classical bxd mutations? Answers to these questions would shed light on when H1 is active as an insulator, and whether it is active throughout PS6/A1.
  
  The phenotype of the larval cuticle indicates an LOF transformation towards T3. We added Figure 6-figure supplement 5 showing this. The haltere shows evidence of an LOF phenotype (Figure 6-figure supplement 6).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.11.13.516321v1
www.biorxiv.org www.biorxiv.org

New submission 16/08/2022, 11:15:33

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer 2 (Public Review):
  
  Weaknesses: The paper is largely written within the 'accidental virulence' framework, ref [2]. This is a valuable framework, but it is worth noting that the ideas overlap with the earlier concept of 'coincidental selection for virulence', first developed by Bruce Levin and C Svanborg-Eden (1990) Parasitology 100, S103-S115 (more recent experimental work in this thread is reviewed by ref [1]).
  
  We wholeheartedly agree. As noted above, the manuscript has been updated to reflect this omission.
  
  Missing this thread leads to a number of statements that are just not supported by the literature. Some examples -
  
  Summary: 'The existence of microbes that are not normally pathogenic, yet are well-suited to host exploitation, is an evolutionary paradox.' - No, this is potentially explained by coincidental selection for virulence, which has been documented in several studies.
  
  Summary: "Our results support the idea that selection in the environment for a trait unrelated to virulence can inadvertently generate opportunistic, "accidental" pathogens." - The trait is not unrelated to virulence, as they are correlated, likely shaped by adhesion behaviors. This correlation is not surprising as 'adhesins' are a classic category of virulence factor, presenting a potential common cause between sticking to a bead and killing an insect more rapidly.
  
  Line 26 "hypothesis has not been directly tested experimentally". - this is probably the main concern, as the paper currently does not address related prior experimental work that has been developed within the coincidental selection tradition. Please see refs that are cited by paper [1] for a start, and then follow forward for more recent work. One recent study comes to mind, as it looks at correlated effects of in vitro bead attachment: https://www.nature.com/articles/s41396-020-0652-0 (virulence was not directly assayed but, of interest, the study also noted a shift in antibiotic resistance following bead-attachment selection without drugs or a host).
  
  We agree with these suggested edits and have updated the manuscript in the appropriate places.
  
  Turning to experimental choices, the use of a 'no bead' experimental control is an important point of comparison, to ensure that the evolutionary effects of interest are particular to the presence of the bead. But if there were no beads, how are you measuring 'cells on bead' (y-axis in Figure 1)? I assume this is an oversight and you're measuring cells per x volume.
  
  This is a good question. To be clear, Figure 1 does not represent measurements taken throughout the course of the experiment; rather, it represents a large phenotyping effort after the experiment ended. Ancestors and evolved populations from multiple timepoints (including the control populations) were started from cryopreserved stocks, then challenged to grow in the presence of a bead. As can be seen in the figure, both ancestors had some plastic adherence ability, which was maintained in the control populations.
  
  Moving into the key virulence assays, I was expecting a similar and simple design: compare the virulence of ancestor versus 'evolved with bead' versus 'evolved without bead'. This would allow answers to the key question of whether bead attachment leads to the evolution of increased virulence, with appropriate controls for adaptation to the general passaging environment. Why not use this simple and standard design?
  
  Instead, we get a more complex design, contrasting isolates that are filtered on the adhesion-related traits (biofilm, etc), but sampled across timepoints. This does establish that less adhesive and less biofilmy isolates are less virulent, so this remains useful information, but the motivation for only using this design is not well spelled out. In principle, you could do this purely on standing variation and not require an experimental evolution step.
  
  We understand and respect this criticism. Please see the response above (in the section responding to the editor’s summary).
  
  As for the question of using standing variation, it is true that a large part of the evolution observed in this experiment is from the sorting of standing genetic variation. We did not anticipate the evolved phenotypes we observed. Perhaps if we had known they were possible, we could have searched for them in the standing genetic variation in F1 offspring/segregants. Related to this idea, we have investigated 350+ segregants from each of these clinical backgrounds (that were generated in order to map the genetic basis of plastic adherence for a manuscript in preparation). The evolved populations occupy different phenotypic space than the mapping population, although there is obviously overlap. Thus, in order to get to the hypermulticellular phenotype observed in the experiment, either multiple rounds of recombination were required to get many high alleles into one background, or new mutations were required.
  
  Concerning the role of plastic, I would encourage caution in the interpretation, given the experimental design. Consider this line from the discussion "In this experiment, favoring the ability to adhere to plastic, a surface that is alarmingly common in industrial, medical, and domestic settings [69], led to a suite of aggregative phenotypes and increased virulence." - by bringing up applied consequences of plastic exposure, this really raises the stakes. At present, the data does not separate the role of bead attachment from the specific role of plastic as a material. What would happen if you repeated with glass beads? I suspect a similar pattern, again driven by adhesin changes. The data at present does not resolve this issue.
  
  We respect this note of caution, which is in opposition to Reviewer #1, who thinks we should add more information about the increase in microplastics. We agree that the results are likely due to selection for adherence, rather than specifically adherence to plastic. That being said, the experiment does show that plastic is a surface on which these yeast can be selected to adhere. And it is also true that this surface is increasingly common. As a compromise, we took out the word alarming and added references to the effect of microplastics on other microbes.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.06.03.494655v1
www.biorxiv.org

A neural network model of hippocampal contributions to category learning

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Sučević and Schapiro investigated a neurobiologically inspired model of human hippocampal structure and computation in category learning. In three separate simulations, the model (C-HORSE) is presented with learning tasks defined by various category structures from prior work and evaluated for its ability to learn the category structure, generalize categorization to novel stimuli, and accurately recognize previously encountered stimuli. Although originally conceived of as a computational model of associative memory, C-HORSE is demonstrated to quite naturally account for human-like learning of the three category tasks. Notably, the authors characterize the mechanisms underlying the model's learning by way of additional simulations in which "lesions" to the model's monosynaptic pathway (MSP; direct connections between ERC and CA1) are contrasted with lesions to its trisynaptic pathway (TSP; pathway connecting ERC-DG-CA3-CA1). These in silico lesions offer key insight into the computational principles underlying theorized hippocampal functions in category learning: whereas MSP provides incremental learning of shared features diagnostic to category membership that are important for category generalization, TSP learns item-specific information that drives recognition behaviour. The authors propose that C-HORSE's successful account of a broad set of category learning datasets provides clear support for the role of complementary hippocampal functions mediated by MSP and TSP in category learning. This work adds compelling computational evidence to a growing literature linking hippocampus to a broader role in cognition that extends beyond declarative memory.
  
  The model simulations are clear and properly conducted. The three datasets examined offer a relatively broad set of findings from the category learning literature; that the models provide reasonable accounts of human performance in all three speaks to the model's generalizability. Overall, I find this work exciting and an important step in linking longstanding, well-established formal learning theories of psychology with neurobiological mechanism. Several weaknesses dampen this excitement, each of which is detailed below:
  
  1) C-HORSE is presented as a new entry into a rich field of formal computational models of category learning. As noted above, the datasets examined span a broad range of learning contexts and structures and the model's ability to account for learning behaviour is compelling. However, no other models are leveraged to perform a direct evaluation. In other words, C-HORSE's predictions are compelling, but is it better than other competing models in the literature? To be clear, C-HORSE offers a novel alternative with its fundamental mechanisms originating from anatomical structure and connectivity. As such, a proof-of-concept showing that such a neurobiologically inspired framework can account for category learning behaviour is a worthwhile contribution in its own right and a clear strength of this paper. However, how to consider this model relative to existing theoretical frameworks is not well described in the manuscript.
  
  We very much appreciate this point — see response to Editor summary point #3 above.
  
  2) Relatedly, C-HORSE is evaluated in terms of qualitative fit to behaviour measures from prior studies and, in all three simulations, is restricted to measures of end-of-learning performance. Again, an appeal to the proof-of-concept nature of the current work may provide an appropriate context for this paper. But a hallmark of well-established category learning models (e.g., SUSTAIN, DIVA, EBRW, SEA, etc.) is their ability to account for both end-of-learning generalization (and in some cases, recognition) and behaviour throughout the learning process. C-HORSE does provide predictions of how learning unfolds over time, but how well this compares to human measures is not considered in the current manuscript. Such comparisons would strengthen the support for C-HORSE as a viable model of category learning and help position it in the busy field of related formal models.
  
  We completely agree about the value of this, and we have added empirical timecourse data for comparison with all simulations, as described in response to Editor summary point #7, above.
  
  3) A consistent finding across all three simulations is that the TSP provides item-specific encoding. Evidence for this can be inferred by contrasting categorization and recognition performance across the TSP- and MSP-only model variants. In the discussion, the authors draw a parallel between exemplar theories of category learning and the TSP, which is a compelling theoretical position. However, as noted by the authors, unlike exemplar theories, the TSP-only model was notably impaired at categorization. The authors' suggestions for extensions to C-HORSE that would enable better TSP-based categorization are interesting. But I think it would be helpful to understand something about the nature of the representations being formed in the TSP-only model. For example, are they truly item-specific, are the shared category features simply lost to heightened encoding of item-unique features, are category members organized similarly to the intact model just with more variability, and so on. Characterizing the nature of these representations to understand the limitations of the TSP-only model seems important to understanding the representational dynamics of C-HORSE, but is not included in the current manuscript.
  
  The RSA results, now included for Simulations 2 and 3 in addition to Simulation 1, provide the information needed to characterize the nature of the TSP representations. Generally speaking, they are truly item specific, meaning that each item is represented by its own distinct set of units. This is a demonstration of the classic pattern separation function of this pathway, taking similar inputs and projecting them to orthogonal populations of neurons. Simulation 1 is the clearest example of this, where there is virtually no similarity and very low variability in the item similarity structure in DG and CA3. The new Simulation 3 RSA shows us where the limit is to this pattern separation ability of the TSP, with highly typical items being represented by somewhat overlapping populations of neurons in DG and CA3. To the extent that the TSP can succeed in generalization, it seems to involve this pattern separation failure.
  
  We have made these points more explicit in new discussion of the RSA results:
  
  • Simulation 1: “In the initial response, there was no sensitivity at all to category structure in DG and CA3: items were represented with distinct sets of units. This is a demonstration of the classic pattern separation function of the TSP, applied to this domain of category learning, where it is able to take overlapping inputs and project them to separate populations of units in DG and CA3.”
  
  • Simulation 3: “As in the prior simulations, DG and CA3 represented the items more distinctly than CA1, and settled activity after big-loop recurrence increased similarity, especially in CA1. This simulation was unique, however, in that DG and CA3 showed clear similarity structure for the prototype and highly prototypical items. There is a limit to the pattern separation abilities of the TSP, and these highly similar items exceeded that limit. This explains why, at high typicality levels, the TSP could be quite successful on its own in generalization (Figure 5e), and why it struggled with atypical feature recognition for these items (Figure 5f).”
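  To make the contrast between pattern-separated and overlapping representations concrete, the following is a minimal sketch of the kind of pairwise similarity computation that underlies an RSA. It is not the authors' analysis code: the activity vectors, the layer labels in the comments, and the helper names `pearson` and `mean_pairwise_similarity` are all hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length activity vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def mean_pairwise_similarity(patterns):
    """Average correlation over all distinct pairs of item patterns."""
    pairs = list(combinations(patterns, 2))
    return sum(pearson(a, b) for a, b in pairs) / len(pairs)


# Hypothetical unit activities for three items from the same category.
# "Separated": every item drives its own distinct units (as described for DG/CA3).
separated = [
    [1, 1, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1, 0, 0],
]
# "Overlapping": items share two units and differ in one (as described for CA1).
overlapping = [
    [1, 1, 1, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 0, 0, 0],
]

sep_sim = mean_pairwise_similarity(separated)
ovl_sim = mean_pairwise_similarity(overlapping)
print(f"separated: {sep_sim:.2f}, overlapping: {ovl_sim:.2f}")
```

  With these toy patterns, items that use disjoint units show no positive within-category similarity (the Pearson value is slightly negative because the active units never coincide), whereas items that share units show clearly positive similarity. A different similarity measure, such as cosine similarity, would give exactly zero for the disjoint case.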
  
  4) In general, a detailed description that links model mechanisms and analyses to the learning constructs of interest for the different simulations is lacking. For example, RSA results for simulation 1 are contrasted for initial and settled representations, but what is meaningful about these two timepoints is not directly stated (moreover, what initial and settled response mean in terms of the current model is not explained). The authors do briefly suggest that differences between initial and settled representations may reflect encoding dynamics before and after big-loop recurrence, but this is not established as a key metric for evaluating the nature of the model representations. In general, more motivation is needed to understand what the chosen analyses reveal about the nature of the model's learning process and representations.
  
  We have added more description of the motivation for our analyses. See response to Editor summary point #6 above.
  
  5) I appreciate the comparison in the discussion to extant models of categorization. Certainly, the exemplar and prototype models are fixtures of the category learning literature and they somewhat align with the type of learning that TSP and MSP, respectively, provide. REMERGE and SUSTAIN are also briefly mentioned, but their discussion is limited which is unfortunate as they are actually more functionally equivalent to C-HORSE. I think, however, that the authors are missing an opportunity to discuss how C-HORSE offers a means for bridging levels of analysis to connect neurobiological mechanisms with these notably successful psychological models of category learning. Rather than framing C-HORSE as a competitor to existing models, it should be viewed as an account existing on a different level of analysis. In this sense, it complements existing approaches and potentially extends a theoretical olive branch between the psychology and neuroscience of category learning.
  
  We love this point about bridging levels of analysis and have added it to our discussion of the model’s relationship to other models, see Editor summary point #3 above.
  
  6) The discussion takes a broad perspective on covering evidence concerning hippocampal contributions to category learning. Although comprehensive, some sections are not well connected back to the main thrust of the paper. For example, a section on neuropsychological accounts of the hippocampus and category learning summarizes central aspects of this literature but is never reflected on through the lens of the current findings. I do think this prior work is relevant, especially since it has a central theme of the hippocampus not being necessary for category/concept learning, but its connection back to the current study is not well argued. Similarly, the section on consolidation and sleep is relevant, but in its current form does not seem to fit with the rest of the paper.
  
  We have implemented these suggestions through very significant revisions to the Discussion. We now better connect the sections to the main argument of the paper and made cuts throughout, including removing the section on consolidation and sleep.
  
  Reviewer #2 (Public Review):
  
  The authors present a model of the hippocampal region that incorporates both the (indirect) trisynaptic and (direct) mono-synaptic pathways from entorhinal cortex (EC) to CA1 - the former incorporating projections from EC to dentate gyrus (DG), DG to CA3, and CA3 to CA1, and exhibiting a higher learning rate. They demonstrate that exposing this network to stimuli consistent with standard empirical tests of category learning (e.g. where within-category exemplars share a set of common features) allows the network to reliably assign both novel and previously encountered stimuli to the correct category (e.g. the network can learn to classify stimuli and generalise this knowledge to new examples). They show that the tri-synaptic pathway (TSP) preferentially supports the encoding of individual exemplars (e.g. analogous to episodic memory) while the mono-synaptic pathway (MSP) preferentially supports category learning.
  
  The manuscript is well written, the simulation details appear sound, and the results are clearly and accurately presented. This model builds on a long tradition of computational modelling of hippocampal contributions to human memory function, strongly grounded in anatomical and electrophysiology data from both rodents and humans, and is therefore able to link phenomena at the level of individual cells and circuits to emergent behaviour - a major strength of this, and similar, work. However, I have two major concerns relating to the relationship between these findings and previously published work by the same and other authors.
  
  First, it is not clear to me - from the manuscript - whether these results represent a significant novel advance on previous publications from the same senior author. Figures 1 and 3D are almost identical to figures published in Schapiro et al. (2017) Phil Trans B, and the take-home message (that the MSP might support statistical learning) is the same. In brief, it seems that the authors have subjected an identical network to some new (but related) tasks and reached the same set of conclusions. I see no distinction between learning to extract 'statistical regularities' (in previous work) and learning 'the structure of new categories' (described here). As an aside, demonstrating that an autoencoder network can learn stimulus categories and generalise to new exemplars is also well established.
  
  We appreciate the opportunity to better articulate the novelty and importance of applying the model to the domain of category learning. There are crucial differences between statistical learning and category learning that make these simulations nontrivial (it did not have to be the case that the results would replicate for these category learning paradigms), and, importantly, many of the insights in the current work are category-learning specific (e.g., the effects of atypical features, trade-offs between generalization and recognition of exemplar-specific features). On the other hand, we of course agree that there are principles in common between statistical learning and category learning that are leading to the consistent findings. We added new material to the Introduction to explain the importance of these new simulations in the domain of category learning, and the value we see in demonstrating convergence across domains. See response to Editor point #1 above.
  
  Second, I have some concerns with the relationship between the properties of this hippocampal network model and well described properties of single cells in the rodent and human hippocampus. In particular, the CA1 units in this model (and to some extent, also the CA3 units) come to respond strongly to all exemplars from within each category (e.g. as shown in Figure 3D, bottom right panel). This appears to be at odds with the known properties of place and concept cells from the rodent and human hippocampus, respectively, which show little generalisation across related concepts (i.e. the Jennifer Aniston neuron does not fire in response to other actors from Friends, for example). If the emergent properties of this model are not consistent with existing data, then it is not a valid model.
  
  We appreciate the opportunity to discuss connections to the physiology literature. See response to Editor summary point #2 above.
  
  More generally, the authors are clear that this model is "a microcosm of [the] hippocampus-neocortex relationship" and that the properties of the MSP "mirror those of neocortex". Why not assume that category learning is supported by an interaction between hippocampus and neocortex, then, as in the complementary learning systems (CLS) model? Aside from some correlational fMRI data and partial deficits in hippocampal amnesics - either of which could have a myriad of different explanations - what empirical data is better accounted for by this model than CLS? Put differently, what grounds are there for rejecting the CLS model? To some extent, this model appears to account for less empirical data than CLS, with the exception of a few recent neuroimaging studies (which are hard to interpret at the level of single cells).
  
  This is an important point for us to clarify, so we very much appreciate this comment. The crucial issue with CLS that motivated the microcosm theory is that the neocortex in the CLS framework learns far too slowly to support the kind of category learning studied in these paradigms, which unfolds over the course of minutes or hours. The neocortex in CLS was proposed to learn novel structure across days, months, and years.
  
  We have added the following to the Introduction:
  
  • “Despite its analogous properties, the MSP is not redundant with neocortex in this framework: the MSP allows rapid structure learning, on the timescale of minutes to hours, whereas the neocortex learns more slowly, across days, months, and years. The learning rate in the MSP is intermediate between the TSP (which operates as rapidly as one shot) and neocortex. The proposal is thus that the MSP is crucial to the extent that structure must be learned rapidly.”
  
  We also have this description in the Discussion:
  
  • “The MSP in our model has properties similar to the neocortex in that framework, with relatively more overlapping representations and a relatively slower learning rate, allowing it to behave as a miniature semantic memory system. The TSP and MSP in our model are thus a microcosm of the broader Complementary Learning Systems dynamic, with the MSP playing the role of a rapid learner of novel semantics, relative to the slower learning of neocortex.”
  
  Reviewer #3 (Public Review):
  
  The current work aimed to determine how the hippocampus may be able to detect regularities across experiences and how such a mechanism may serve to support category learning and generalization. Rapid learning in the hippocampus is critical for episodic memory and encoding of individual episodes. However, the rapid binding of arbitrary associations and one-shot learning was long thought suboptimal for finding regularities across experiences to support generalization, which was instead ascribed to other, slower-learning memory systems. More recent work has started to highlight a hippocampal role in generalization, renewing the question of how generalization can be accomplished alongside memory for episodic details within a single memory structure. The current paper offers a reconciliation, presenting a biologically-inspired model of the hippocampus that is able to learn categories alongside stimulus-specific information comparably to human performance. The results convincingly demonstrate how distinct pathways within the hippocampus may differentially serve these complementary memory functions, enabling the single structure to support both episodic memory and categorization.
  
  Major strengths and contributions
  
  The paper includes simulation of three distinct categorization tasks, with a clear explanation of the unique aspects of each task. The key results are consistent across tasks, lending further support to the main conclusions of the role of distinct hippocampal pathways in learning specific details vs. regularities. Together with prior work on how the same architecture can support statistical learning in other types of tasks, this work provides important evidence of the broad role of the hippocampus in rapid integration of related information to serve many forms of cognition.
  
  Throughout the paper, the authors nicely explain in conceptual terms how the same underlying computations may serve all three categorization tasks as well as statistical learning and episodic inference tasks. Thus, the paper will be of broad interest, beyond researchers focused on modeling and/or categorization.
  
  On a conceptual level, this work provides a fruitful framework for understanding hippocampal functions, representations and computations. It provides a highly plausible mechanistic explanation of how category learning and generalization can be accomplished in the hippocampus and how distinct types of representations may emerge in distinct hippocampal subfields. The framework can be used to derive new testable predictions, some of which the authors themselves introduced. It also provides new insights into how the outputs of different pathways influence each other, providing a more nuanced view of the division of labor and interactions between hippocampal subfields. For example, the big loop recurrence would eventually lead to category influences even on the initially sparse, pattern separated representations in the CA3, which is an idea consistent with empirical observations.
  
  The presented computational model of the hippocampus is currently the most detailed and biologically plausible hippocampal model easily applicable in the area of cognitive neuroscience and behavioral simulations. The commonalities and differences with other related models (conceptual and computational) are well explained. Both the conceptual and technical descriptions of the model are exceptionally clear and detailed. The model is also publicly available for download for any researcher to use with their own task and data. All these aspects make it likely that other researchers may adopt the model in a wider range of tasks, stimulating new discoveries.
  
  The autoencoder nature of the model and the use of categorization tasks meant that some measures of interest, like recognition of exemplar-specific information, could not be evaluated by direct reading of the output layer to compare with some label (like old/new). The authors, however, came up with clever and sensible ways to evaluate recognition performance in each task, which highlighted the multiple ways one may think about the information contained in neural representations in each layer. This approach can also be utilized by others for evaluating item-specific and category information in activation patterns, for example in analyses of fMRI.
  
  Finally, I thought the current paper and provided model may also serve as an excellent introduction to computational modeling for those new to this approach. The exceptional clarity of the conceptual and technical description of this model and the clear logic of how one may model a cognitive task and interpret results made this paper fairly accessible. Furthermore, the paper offered new insights and predictions based on analyzing the model's hidden layers, lesion performance, and/or noting some patterns of behavior unique to specific tasks. This was also instructive for highlighting the distinctive contributions that the computational modeling approach can have for furthering our understanding of cognition and the brain.
  
  We are extremely appreciative of the value the Reviewer sees in this work.
  
  Weaknesses
  
  The paper's strengths far outnumbered the weaknesses, which are minor. For one, the selected categorization tasks nicely complemented each other, but only covered stimuli with discrete-value dimensions (features like color, shape, symbol, etc). The degree to which the results generalize (or not) to continuous-value stimuli and different category structures (for instance information-integration or rule-based in COVIS framework) is not clear. How the model could be adjusted for continuous-value stimuli was not specified.
  
  We agree that the simulation of only discrete-valued dimensions is a limitation. We chose to do this simply because it is easier to use discrete values in the model as currently implemented, but future work will certainly need to test whether the model can simulate the various paradigms that make use of continuous-valued dimensions. We have added an explicit acknowledgement of this issue in the Methods:
  
  • “The inhibition simulates the action of inhibitory interneurons and is implemented using a set-point inhibitory current with k-winner-take-all dynamics (O’Reilly, Munakata, Frank, Hazy, & Contributors, 2014). All simulations involved tasks with discrete-valued dimensions, as these are more easily amenable to implementation across input/output units whose activity tends to become binarized as a result of these inhibition dynamics. It will be important for future work to extend to implementations of category learning tasks with continuous-valued dimensions.”
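  As an illustration of why such inhibition dynamics favour discrete-valued dimensions, the sketch below implements a deliberately simplified, hard k-winner-take-all rule. This is an assumption made for illustration only: the model itself uses a set-point inhibitory current, not this rule, and the function name `k_winners_take_all` and the input values are hypothetical.

```python
def k_winners_take_all(net_inputs, k):
    """Return binary activity in which only the k most strongly driven units are on.

    Simplified stand-in for set-point inhibition: real kWTA dynamics are graded,
    but the end result tends toward this kind of binarized activity.
    """
    if not 0 <= k <= len(net_inputs):
        raise ValueError("k must be between 0 and the number of units")
    # Rank units by net input, breaking ties in favour of the lower index.
    ranked = sorted(range(len(net_inputs)), key=lambda i: (-net_inputs[i], i))
    winners = set(ranked[:k])
    return [1 if i in winners else 0 for i in range(len(net_inputs))]


# Hypothetical graded inputs, e.g. units coding a continuous-valued feature.
graded = [0.9, 0.55, 0.5, 0.1, 0.8, 0.45]
binary = k_winners_take_all(graded, k=2)
print(binary)
```

  In this toy case the intermediate inputs (0.55, 0.5, 0.45) are all silenced alongside the weak one, so graded differences among them are lost. That loss is the difficulty a continuous-valued stimulus coding scheme would have to work around.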
  
  There is compelling evidence for the dissociation between different hippocampal pathways and subfields (CA1 vs. CA3) that the model is based on. As the authors noted, there is also compelling evidence for functional dissociations along the long hippocampal axis, with anterior portions more geared towards coarse, generalized representations and posterior portions towards more detailed, specific representations. The authors nicely pointed out that these proposals of within-hippocampus division of labor are less orthogonal than they may first appear, as there is a greater proportion of CA1 in the anterior hippocampus. However, it is premature to imply that this resolves the CA1/CA3 vs. anterior/posterior question; the idea that existing anterior findings may be simply CA1 findings is currently only speculation. Furthermore, first studies indicating that anterior/posterior representational gradients may exist within each subfield are beginning to emerge.
  
  We completely agree that this is speculative at this point, which needed acknowledgment. See response to Editor summary point #2 above.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.01.12.476051v1
www.biorxiv.org

New submission 20/06/2023, 09:42:21

1
1. Public_Reviews 10 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer 1
  
  Question 1: While the CTD human brain organoids show a decrease in Cr (in absence of Cr in the culture medium) as compared to control organoids (4 times less), they are not devoid of Cr. Do these organoids express the two enzymes allowing Cr synthesis (AGAT and GAMT), and in which brain cell types? If yes, how to explain the decrease in Cr in the CTD organoids?
  
  The CTD human brain organoids lack functional CRT, and their basal creatine level is significantly lower than that of healthy human brain organoids. Intracerebral creatine synthesis depends on the differential expression of the AGAT and GAMT enzymes and relies on functional CRT to transport the GAA intermediate. The literature indicates that the two enzymes are rarely co-expressed (Braissant et al., 2001, PMID: 11165387), meaning that the GAA intermediate must be transported by CRT into neurons for creatine synthesis to be completed. Although we detected slight mRNA expression of the AGAT and GAMT enzymes, creatine synthesis is not effective because the GAA intermediate cannot be transported into cells expressing GAMT, owing to the non-functional creatine transporter in the CTD human brain organoids.
  
  Question 2. The rescue experiment, re-establishing a functional Cr transporter (CRT or SLC6A8) in the CTD human brain organoids, is very interesting, as this may help the design and development of new treatments for CTD. However, authors claim that the functional CRT expressed in the rescued CTD organoids was expressed in each cell. This may be a difficulty in the development of new CTD treatments, as CRT should be expressed in neurons and oligodendrocytes, but not in astrocytes. Authors may want to comment on this point.
  
  As shown in Figure S2C, the whole brain organoid in the rescue experiment expresses the GFP protein, and thus also the co-expressed wild-type CRT. In these experiments, we did not perform a detailed cellular characterization of the rescued organoids; this could be the aim of a separate study that characterizes cell-specific CRT expression and function in the rescued brain organoids. Accordingly, we corrected the statement on page 6 of the revised manuscript to the following: “SLC6A8 expressing brain organoids showed GFP fluorescence in the whole area of the organoid (Fig S2C).”
  
  Reviewer #1 (Recommendations for The Authors):
  
  Authors may cite the recent review by Fernandes-Pires (2022) exposing the challenges to treat CTD (introduction, lines 57-58 for example).
  
  The reference has been added (lines 57-58 of the revised version).
  
  Authors may specify in their introduction (lines 60-61) that, while creatine (Cr) supplementation is not effective to treat CTD male patients, a proportion of female CTD patients is responsive to Cr supplementation (due to the differential inactivation of one of the X chromosomes depending on the cells).
  
  Treating CTD appears simple: transport creatine into the brain cells. In individuals with creatine synthesis disorders, increasing brain creatine levels through oral supplementation of creatine monohydrate and/or precursors improves neurodevelopmental outcomes. This task has proven more daunting than expected in CTD, since oral creatine supplementation does not increase brain creatine concentrations. The literature, and more specifically the data reported by Van de Kamp (“X-linked creatine transporter deficiency: clinical aspects and pathophysiology”, J Inherit Metab Dis 37(5):715-733), describes 3 female CTD patients without improvement of clinical outcomes. Bruun et al., 2018 (“Treatment outcome of creatine transporter deficiency: international retrospective cohort study”, Metab Brain Dis 33:875-884) reports 2/3 CTD females with improvement of clinical outcome. Taken together, the sentence has been modified in the revised version of the manuscript as follows: “Several combinations of nutritional supplements or Cr precursors l-arginine and l-glycine, have been studied as therapeutic approaches for CTD, but they have shown limited success (Bruun et al., 2018, Valayannopoulos et al., 2013)” (lines 61-63, page 4).
  
  When comparing their new in vitro CTD model of human brain organoids with existing in vivo rodent models, authors may add the citation of the rat model of Duran-Trio et al (2021 & 2022), in particular for its description of CNS tissue alterations (dendritic spines density for example).
  
  The reference Duran-Trio et al (2021) has been added (page 4, line 70). The reference Duran-Trio et al (2022) has been added (page 11) and the sentence has been modified in the revised version of the manuscript as follows: “Reduced cortical spine density and reductions in protein levels of several synaptic markers have been observed in the brains of Slc6a8-/y mice and rats (Chen et al., 2021; Duran-Trio et al., 2022)”.
  
  Reviewer #2 (Recommendations For The Authors):
  
  There are only minor suggestions for improvement in this manuscript. The authors strongly link creatine uptake, the GSK3β pathway, and intellectual disability. Enhancing this claim with data on phosphorylation differences between organoids derived from healthy individuals and those from CTD patients could solidify this foundation and facilitate a more holistic understanding of the disease. In addition, the in vitro model based on organoids might be closer than other experimental setups; however, proving that those differences are also present in vivo would greatly benefit the story.
  
  As shown in Fig 6A-B, GSK3β is less phosphorylated on Ser9 in CTD brain organoids than in healthy organoids, indicating that GSK3β is more active in organoids with reduced creatine levels. Studying the level of GSK3β phosphorylation in the mouse brain could be part of future experiments and a separate study.
  
  There is also some uncertainty around the rescue experiment using the exogenous SLC6A8 gene. Could the difference in creatine uptake between the rescue iPSCs and the healthy control be due to CRT overexpression? Higher levels of the transporter may explain the elevated levels of intracellular creatine. Thus, a comparison using Western blotting experiments could be a valuable addition to evaluating the expression levels of this protein.
  
  For the rescue experiment, we used a vector in which SLC6A8 and eGFP were connected by an IRES2 sequence, providing simultaneous but independent expression of the two proteins. CTD-rescue iPSC clones were selected based on high eGFP fluorescence. These clones probably carry several copies of the transgene in their genome, which could result in a higher abundance of SLC6A8 than in healthy iPSCs. The difference in creatine uptake between the CTD-rescue iPSCs and the healthy control is therefore probably due to CRT overexpression. However, no satisfactory anti-SLC6A8 antibodies are commercially available to quantify CRT by Western blot. We would like to add that, although creatine uptake is higher in CTD-rescue iPSCs than in the healthy control, the basal level of creatine (which corresponds to the culture conditions for the rest of the experiments) is similar.
  
  Overall, this study provides valuable insights into CTD and potential therapeutic targets. It enriches our understanding of CTD and opens up new avenues for future research in this field.
  
  We thank the reviewer for their kind words and hope this study will be useful for other researchers in the CTD field.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.01.543271v1
www.biorxiv.org www.biorxiv.org

New submission 10/08/2023, 08:57:29

1
1. Public_Reviews 10 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 We appreciate the thoughtful feedback provided by the editor and the three reviewers and have addressed their comments, which we believe has resulted in significant improvements to the manuscript. A point-by-point response to the comments is included below.
 
 Reviewer #1
 
 Line 229: The wording of "highly valuable" seems slightly vague. Consider rephrasing to something more specific, such as: "...using individual animal recorders provide valuable new insight into locomotor behavior when ..."
 
 Thank you for your advice. The sentence was revised as you suggested.
 
 Lines 518-527: Consider adding quantitative details for the four conditions. It is apparent in Figure 4 (dashed lines associated with peaks of the distributions), but in the text it would be helpful to add the speeds and heights chosen to sort the data.
 
 Quantitative descriptions were added in the Materials and Methods section. We also moved detailed information about the curve-fitting function from the Results section to the Materials and Methods section.
 
 “Threshold values were decided based on the peak in the curve of the fitted probability density distribution (wind speed: 6.0 m/s, wave height 2.8 m). Weibull distribution and log normal distribution were used as the fitting function for wind speed and wave height, respectively (Ferreira and Guedes Soares, 2000; Carta et al., 2009).”
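
As a rough illustration of how peak-based thresholds could sort observations into four conditions, the Python sketch below computes the mode (peak) of a Weibull and of a log-normal density from their closed forms, then labels an observation using the reported thresholds (6.0 m/s and 2.8 m). The function names, the condition labels, and the choice to place values exactly at a threshold in the upper group are assumptions made for illustration; they are not taken from the manuscript, and no fitted parameter values are implied.

```python
import math

# Thresholds quoted in the response (peaks of the fitted densities).
WIND_THRESHOLD_MS = 6.0  # wind speed, m/s
WAVE_THRESHOLD_M = 2.8   # wave height, m


def weibull_mode(shape, scale):
    """Peak of a Weibull density; the density peaks at zero when shape <= 1."""
    if shape <= 1:
        return 0.0
    return scale * ((shape - 1) / shape) ** (1 / shape)


def lognormal_mode(mu, sigma):
    """Peak of a log-normal density with log-scale mean mu and sd sigma."""
    return math.exp(mu - sigma ** 2)


def classify(wind_speed, wave_height):
    """Assign one of four conditions (labels are hypothetical)."""
    wind = "strong wind" if wind_speed >= WIND_THRESHOLD_MS else "weak wind"
    wave = "high wave" if wave_height >= WAVE_THRESHOLD_M else "low wave"
    return f"{wind}, {wave}"
```

In practice the shape, scale, mu, and sigma values would come from fitting the two distributions to the recorded wind and wave data; the closed-form modes then give the threshold candidates directly.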
 
 Reviewer #2
 
 Line 51 - Climatic models - climatic model cannot, by definition, provide prediction of specific weather conditions as they focus on large and long-term values and trends. I suggest the authors to review their use of climatic conditions throughout the manuscript, and use instead weather conditions, where appropriate.
 
 Thank you for pointing out this terminology issue. Most occurrences of the phrase “climatic models” in the manuscript were replaced by “mathematical weather models”, for example in lines 51, 226, 230, and 312. We also checked that the phrase “climatic condition” never appears in the manuscript.
 
 Lines 59-61 require editing. It is true that take-off is associated with high rate of energy expenditure, but it is phrased in an unclear way. I suggest writing instead "Therefore, the high energy expenditure associated with take-off is strongly influencing the total energy expenditure of wandering albatross during the foraging trip, unlike the duration or distance of the flight (Shaffer et al., 2001)."
 
 Thank you for your advice. As you noted, the phrasing did not properly describe the previous study. The sentence was revised following your suggestion.
 
 “Therefore, the high energy expenditure associated with take-off strongly influences the total energy expenditure of wandering albatross during the foraging trip, unlike flight duration or distance (Shaffer et al., 2001a)”
 
 Line 213 - I suggest "Among the LMMS, models..."
 
 The sentence was revised as you suggested.
 
 Line 286 - I suggest using the word "difference" or "delta" AIC instead of variation which is confusing.
 
 The sentence was revised as follows.
 
 “For instance, the AIC difference in running speed between the best model and the second-lowest AIC model was only 0.27.”
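
To make the quantity concrete, here is a minimal Python sketch of how AIC differences between candidate models can be tabulated: each model's AIC minus the lowest AIC in the set. The model names and AIC values are invented for illustration (chosen so that the runner-up differs by 0.27, as in the revised sentence) and do not come from the manuscript.

```python
def delta_aic(aic_by_model):
    """Return each model's AIC minus the lowest AIC in the candidate set."""
    best = min(aic_by_model.values())
    return {name: aic - best for name, aic in aic_by_model.items()}


# Hypothetical candidate models and AIC values, for illustration only.
example = {"wind + wave": 412.10, "wind only": 412.37, "null": 420.55}

deltas = delta_aic(example)
# Sort from best (difference of zero) to worst.
ranked = sorted(deltas.items(), key=lambda item: item[1])
```

A small difference such as 0.27 means the two top-ranked models are supported almost equally by the data.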
 
 Line 385 - Please provide actual percentage even if it is < 1%.
 
 We added the actual mass percentages for both the small and large recorder types in lines 385 and 386.
 
 “Small Ninja-scans weighed 28 g, which is 0.3 ~ 0.4% of wandering albatross body mass, and are expected to record for 7 h. Large Ninja-scans weighed 91 g, which corresponds to 0.8 ~ 1.3% of wandering albatross body mass, and are expected to record for 65 h.”
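
The percentages above are recorder mass divided by body mass. The short Python sketch below shows the arithmetic. The 7 kg and 11 kg body masses are illustrative assumptions, not values from the manuscript; they were picked only because they yield percentages close to the quoted ranges.

```python
def tag_load_percent(tag_mass_g, body_mass_kg):
    """Recorder mass as a percentage of body mass."""
    return 100.0 * tag_mass_g / (body_mass_kg * 1000.0)


# Hypothetical light and heavy birds (kg); assumptions for illustration.
ASSUMED_BODY_MASS_KG = (7.0, 11.0)

loads = {}
for name, mass_g in (("small", 28), ("large", 91)):
    light, heavy = ASSUMED_BODY_MASS_KG
    # The heavier bird gives the lower percentage, the lighter bird the higher.
    loads[name] = (tag_load_percent(mass_g, heavy), tag_load_percent(mass_g, light))
```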
 
 Reviewer #3
 
 Thank you for the marked-up manuscript and the many comments on it. Most of your grammatical suggestions and rephrasings are reflected in the new version, and we double-checked the whole manuscript using an English proofreading service. Please see below for our answers to each major comment.
 
 Line 304 – not sure volume is best word choice.
 
 We changed the word “volume” to “amplitude”.
 
 Line 309 – Are you sure that Pennycuick 1982 didn’t document this?
 
 His article mainly focused on the morphology and steady flight mechanism of albatrosses and petrels. It contains no description of seabird take-offs.
 
 Line 320 – add after Weimerskirch citation, and similar to predicted best glide speeds (Shaffer et al. 2001, Funct. Ecol. 15)
 
 Thank you for the beneficial information. We added the phrase and the citation in the sentence.
 
 “The mean air speed of wandering albatrosses at the end of the running phase was close to the average flight speed (approximately 15 m/s) (Weimerskirch et al., 2002), and similar to predicted best glide speeds (Shaffer et al., 2001b), indicating that wandering albatrosses gain sufficient lift at the end of the running phase and efficiently utilize ocean wind.”
 
 Line 684 – citation information incomplete.
 
 Thank you for finding the incomplete citation. Authors of the reference paper were corrected. “Weimerskirch H, Bonadonna F, Bailleul F, Mabille G, Dell’Omo G, Lipp H-P. 2002. GPS tracking of foraging albatrosses. Science 295:1259–1259. doi:10.1126/science.1068034”
 
 Fig.4. – In Part B, reorient the y-axis labels to match the other figures. Change the orientation of y-axis labels like shown in Figure 3.
 
 We rearranged the labels and ticks in Fig. 4B to improve readability and to match the graphs in Fig. 3.
 

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.24.534087v2
www.biorxiv.org www.biorxiv.org

New submission 09/08/2023, 08:54:59

1
1. Public_Reviews 09 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  [...] Based on these results, the authors support a model whereby kinetic regimes are encoded in the cis-regulatory sequences of a gene instead of imposed by an evolving trans-regulatory environment.
  
  The question asked in this manuscript is important and the eve locus represents an ideal paradigm to address it in a quantitative manner. Most of the results are correctly interpreted and well-presented. However, the main conclusion pointing towards a potential "unified theory" of burst regulation during Drosophila embryogenesis should be nuanced or cross-validated.
  
  We thank the reviewer for their careful and insightful assessment of our manuscript. The reviewer is right in that our claims should have been more nuanced. Indeed, our proposed unified strategy only concerns even-skipped transcription under the variable conditions that exist in ectopic and endogenous eve expression regions.
  
  Our results and those of others suggest that different developmental genes follow unified—yet different—transcriptional control strategies whereby different combinations of bursting parameters are regulated to modulate gene expression: burst frequency and amplitude for eve (Berrocal et al., 2020), and burst frequency and duration for gap genes (Zoller et al., 2018). In light of the aforementioned works, we can only claim that our results suggest a unified strategy for eve, our case of study, as we observe that eve regulatory strategies are robust to disruption of enhancers and binding sites. In the Discussion section of our revised manuscript, we will emphasize that the bursting control strategy we uncovered for eve does not necessarily apply to other genes, and speculate in more detail that genes that employ the same strategy of transcriptional bursting may be grouped in families that share a common molecular mechanism of transcription.
  
  In addition to the lack of novelty (some results concerning the fact that koff does not change along the A/P axis/the idea of a 'unified regime' were already obtained in Berrocal et al 2020),...
  
  Unfortunately, we believe there is a misunderstanding in terms of what we construe as novelty in our work. In our previous work (Berrocal et al., 2020), we observed that the seven stripes of even-skipped (eve) expression modulate transcriptional bursting through the same strategy—bursting frequency and amplitude are controlled to yield various levels of mRNA synthesis, while burst duration remains constant. We reproduce that result in our paper, and do not claim any novelty. However, what was unclear is whether the observed eve bursting control strategy would only exist in the wild-type stripes, whose expression—we reasoned—is under strong selection due to the dramatic phenotypic consequences of eve transcription, or if eve transcriptional bursting would follow the same strategy under trans-regulatory environments that are not under selection to deliver specific spatiotemporal dynamics of eve expression. Our results—and here lies the novelty of our work—support the second scenario, and point to a model where eve bursting strategies do not result from adaptation of eve activity to specific trans-regulatory environments. Instead, we speculate that a molecular mechanism constrains eve bursting strategy whenever and wherever the gene is active. This is something that we could not have known from our first study in (Berrocal et al., 2020) and constitutes the main novelty of our paper. To put this in other words, the novelty of our work does not rest on the fact that both burst frequency and amplitude are modulated in the endogenous eve pattern, but that this modulation remains quantitatively indistinguishable when we focus on ectopic areas of expression. We will make this point clearer in the Introduction and Discussion section of our revised manuscript.
  
  … note i) the limited manipulation of TF environment;...
  
  We acknowledge that additional genetic manipulations would make it possible to further test the model. However, we hope that the reviewer will agree with us that the manipulations that we did perform are sufficient to provide evidence for common bursting strategies under the diverse trans-regulatory environments present in wild-type and ectopic regions of gene expression. In the Discussion section of our revised manuscript, we will elaborate further on the kind of genetic manipulations (e.g., probing transcriptional strategies that result from swapping promoters in the context of eve-MS2 BAC; or quantifying the impact on eve transcriptional control after performing optogenetic perturbations of transcription factors and/or chromatin remodelers) that could shed further light on the currently undefined molecular mechanism that constrains eve bursting strategies, as a means to motivate future work.
  
  … ii) the simplicity with which bursting is analyzed (only a two-state model is considered, and not cross-validated with an alternative approach than cpHMM) and…
  
  Based on our previous work (Lammers et al., 2020), and as described in the SI Section of the current manuscript: Inference of Bursting Parameters, we selected a three-state model (OFF, ON1, ON2) under the following rationale: transcription of even-skipped in pre-gastrulating embryos occurs after DNA replication, and promoters on both sister chromatids remain paired. Most of the time these paired loci cannot be resolved independently using conventional microscopy. As a result, when we image an MS2 spot, we are actually measuring the transcriptional dynamics of two promoters. Thus, each MS2-fluorescent spot may result from none (OFF), one (ON1) or two (ON2) sister promoters being in the active state. Following our previous work, we analyzed our data assuming the three-state model (OFF, ON1, ON2), and then, for ease of presentation, aggregated ON1 and ON2 into an effective single ON state. As for the lack of an alternative model, we chose the simplest model compatible with our data and our current understanding of transcription at the eve locus. With this in mind, we do not rule out the possibility that more complex processes—that are not captured by our model—shape MS2 fluorescence signals. For example, promoters may display more than two states of activity. However, as shown in (Lammers et al., 2020 - SI Section: G. cpHMM inference sensitivities), model selection schemes and cross-validation do not give consistent results on which model is more favorable; and for the time being, there is not a readily available alternative to HMM for inference of promoter states from MS2 signal. For example, orthogonal approaches to quantify transcriptional bursting, such as smFISH, are largely blind to temporal dynamics. As a result, we choose to entertain the simplest two-state model for each sister promoter. 
  
  We appreciate these observations, as they point out the need to devote a section in the supplemental material of our revised manuscript to clarifying the motivations behind model selection.
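
The relationship between the per-promoter two-state picture and the three spot states (OFF, ON1, ON2) can be sketched numerically. The Python example below is not the cpHMM inference used in the manuscript. It assumes, purely for illustration, that the two sister promoters are identical and independent, each switching on at rate `k_on` and off at rate `k_off`, and it reports steady-state occupancies only. The function names and rate values are hypothetical.

```python
def spot_state_occupancy(k_on, k_off):
    """Steady-state occupancy of OFF, ON1, ON2 for two independent promoters."""
    # Probability that a single promoter is active at steady state.
    p = k_on / (k_on + k_off)
    return {
        "OFF": (1 - p) ** 2,    # neither sister promoter active
        "ON1": 2 * p * (1 - p), # exactly one active
        "ON2": p ** 2,          # both active
    }


def effective_on_fraction(k_on, k_off):
    """Occupancy of the aggregated ON state (ON1 and ON2 combined)."""
    occupancy = spot_state_occupancy(k_on, k_off)
    return occupancy["ON1"] + occupancy["ON2"]
```

Under these assumptions, equal on and off rates give a spot that is OFF a quarter of the time and in the aggregated ON state three quarters of the time, which shows why the effective ON state of a spot is not the same quantity as the ON state of a single promoter.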
  
  … iii) the lack of comparisons with published work.
  
  We thank the reviewer for pointing this out. In the current discussion of our manuscript, we compare our findings to recent articles that have addressed the question of the origin of bursting control strategies in Drosophila embryos (Pimmett et al., 2021; Yokoshi et al., 2022; Zoller et al., 2018). Nevertheless, we acknowledge that we failed to include references that are relevant to our study. Thus, our revised Discussion section must include recent results by (Syed et al., 2023), which showed that the disruption of Dorsal binding sites on the snail minimal distal enhancer results in decreased amplitude and duration of transcription bursts in fruit fly embryos. Additionally, we have to incorporate the study by (Hoppe et al., 2020), which reported that the Drosophila bone morphogenetic protein (BMP) gradient modulates the bursting frequency of BMP target genes. References to thorough studies of bursting control in other organisms, like Dictyostelium discoideum (Tunnacliffe et al., 2018), are due as well.
  
  Reviewer #2 (Public Review):
  
  The manuscript by Berrocal et al. asks if shared bursting kinetics, as observed for various developmental genes in animals, hint towards a shared molecular mechanism or result from natural selection favoring such a strategy. Transcription happens in bursts. While transcriptional output can be modulated by altering various properties of bursting, certain strategies are observed more widely. As the authors noted, recent experimental studies have found that even-skipped enhancers control transcriptional output by changing burst frequency and amplitude while burst duration remains largely constant. The authors compared the kinetics of transcriptional bursting between endogenous and ectopic gene expression patterns. It is argued that since enhancers act under different regulatory inputs in ectopically expressed genes, adaptation would lead to diverse bursting strategies as compared to endogenous gene expression patterns. To achieve this goal, the authors generated ectopic even-skipped transcription patterns in fruit fly embryos. The key finding is that bursting strategies are similar in endogenous and ectopic even-skipped expression. According to the authors, the findings favor the presence of a unified molecular mechanism shaping even-skipped bursting strategies. This is an important piece of work. Everything has been carried out in a systematic fashion. However, the key argument of the paper is not entirely convincing.
  
  We thank the reviewer, as these comments will enable us to improve the Discussion section and overall logic of our revised manuscript. We agree that the evidence provided in this work, while systematic and carefully analyzed, cannot conclusively rule out either of the two proposed models, but rather provides evidence supporting the hypothesis of a specific molecular mechanism constraining eve bursting strategies. Our experimental evidence offers valuable insights into the mechanism of eve bursting control. For instance, had we observed quantitative differences in bursting strategies between ectopic and endogenous eve domains, we would have rejected the hypothesis that a common molecular mechanism constrains eve transcriptional bursting to the observed bursting control strategy of frequency and amplitude modulation. Thus, we consider our proposition of a common molecular mechanism underlying unified eve bursting strategies despite changing trans-regulatory environments to be the better supported one. On the other hand, while our model suggests that this undefined bursting control strategy is not subject to selection acting on specific trans-regulatory environments, it is not trivial to completely discard selection for specific bursting control strategies given our current lack of understanding of the molecular mechanisms that shape these strategies. Indeed, we cannot rule out the hypothesis that the observed strategies are optimal for the expression of eve endogenous stripes according to natural selection, and that these control strategies persist in ectopic regions as an evolutionarily neutral “passenger phenotype” that does not impact fitness. We recognize the need to acknowledge this last hypothesis in the updated Introduction and Discussion sections of our manuscript. Further studies will be needed to determine the mechanistic and molecular basis of eve bursting strategies.
  
  Reviewer #3 (Public Review):
  
  In this manuscript by Berrocal and coworkers, the authors do a deep dive into the transcriptional regulation of the eve gene in both an endogenous and ectopic background. The idea is that by looking at eve expression under non-native conditions, one might infer how enhancers control transcriptional bursting. The main conclusion is that eve enhancers have not evolved to have specific behaviors in the eve stripes, but rather the same rates in the telegraph model are utilized as control rates even under ectopic or 'de novo' conditions. For example, they achieve ectopic expression (outside of the canonical eve stripes) through a BAC construct where the binding sites for the TF Giant are disrupted along with one of the eve enhancers. Perhaps the most general conclusion is that burst duration is largely constant throughout at ~ 1 - 2 min. This conclusion is consistent with work in human cell lines that enhancers mostly control frequency and that burst duration is largely conserved across genes, pointing to an underlying mechanistic basis that has yet to be determined.
  
  We thank the reviewer for the assessment of our work. Indeed, evidence from different groups (Berrocal et al., 2020; Fukaya et al., 2016; Hoppe et al., 2020; Pimmett et al., 2021; Senecal et al., 2014; Syed et al., 2023; Tunnacliffe et al., 2018; Yokoshi et al., 2022; Zoller et al., 2018) is coming together to uncover commonalities, discrepancies, and rules that constrain transcriptional bursting in Drosophila and other organisms.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.09.527927v1
www.biorxiv.org www.biorxiv.org

New submission 26/07/2023, 08:59:53

1
1. Public_Reviews 09 Aug 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This article is interested in how butterfly, or more precisely, butterfly wing scale precursor cells, each make precisely patterned ultrastructures made of chitin.
  
  To do this, the authors sought to use the butterfly Parides eurimedes, a papilionid swallowtail, that carries interesting, unusual structures made of 1) vertical ridges, that lack a typical layered stacking arrangement; and 2) deep honeycomb-like pores. These two features make the organism chosen a good point of comparison with previous studies, including classic papers that relied on electronic microscopy (SEM/TEM), and more recent confocal microscopy studies.
  
  The article shows good microscopy data, including detailed, dense developmental series of staining in the Parides eurimedes model. The mix of cell membrane staining, chitin precursor, and F-actin staining is well utilized and appropriately documented with the help of 3D-SIM, a microscopy technique considered to provide super-resolution (here needed to visualize sub-cellular processes).
  
  The key message from this article is that F-actin filaments are later repurposed, in papilionid butterflies, to finish the patterning of the inter-ridge space, elaborating new structures (this was not observed so far in other studies and organisms). The model proposed in Figure 6 summarizes these findings well, with F-actin reshaping itself into a tulip that likely pulls down a chitin disk to form honeycombs. These interpretations of the microscopy data are interesting and novel.
  
  There are two other points of interest, that deserve future investigation:
  
  1) The authors performed immunolocalizations of Arp2 and pharmacological inhibitions of Arp2/3, and found some possible effect on honeycomb lattice development. The inter-ridge region of the butterfly Papilio polytes, which lacks these structures, did not seem to be affected by drug treatments. Effects were time-dependent, which makes sense. These data provide circumstantial evidence that Arp2/3 is involved in the late role of F-actin formation or re-organisation.
  
  2) The authors perform a comparative study in additional papilionids (Fig. 6 in particular). I find these data to be quite limited without a dense sampling, but they are nonetheless interesting and support a second-phase role of F-actin re-organisation.
  
  The article is dense, well produced and succinctly written. I believe this is an interesting and insightful study on a complex process of cell biology, that inspires us to look at basic phenomena in a broader set of organisms.
  
  We thank the reviewer for the positive appraisal.
  
  Reviewer #2 (Public Review):
  
  The manuscript by Seah and Saranathan investigates the cell-based growth mechanism of so-called honeycomb structures in the upper lamina of papilionid wing scales by investigating a number of different species. The authors chose Parides eurimedes as a focus species, with the developmental pathway of five other papilionids as a comparative backup. Through state-of-the-art microscopy images of different developmental steps, the authors find that the intricate F-actin filaments reorganise and support cuticular discs that template the air holes that form the honeycomb lattice. The manuscript is well written and easy to follow, yet based on a somewhat limited sample size for their focus species, limiting attempts to suppress expression and alter structure shape.
  
  The fact that the authors find a novel reorganisation mechanism is exciting and warrants further research, e.g. into the formation of other microscale features or smaller scale structures (e.g. the mentioned gyroid networks).
  
  We thank the reviewer for the positive appraisal.
  
  The authors place their results in the discussion in the light of current literature (although the references could be expanded further to include the breadth of the field). However, the mechanistic explanation completely ignores the mechanical properties of the membranes as an origin of some of the observed phenomena (see McDougal's work for example) and attributes the occurrence of some features to Turing patterns and Ostwald ripening, which I find somewhat unlikely, and I suggest that the authors explore these aspects further in the discussion.
  
  We thank the reviewer for these suggestions. We have added more references from the current literature to more accurately reflect the breadth of the field. McDougal et al. (2021) discuss the nature of biomechanical forces (differential growth and buckling) on the membrane and deposited cuticle shaping the formation of longitudinal ridges. However, here it is the invagination of the plasma membrane bearing the deposited cuticle that is our main concern. Nevertheless, we agree that future studies should indeed consider the mechanical properties of the membranes, as these may help explain some of the observed features. We have clarified this in our discussion.
  
  I have few concerns regarding the experimental approach beyond the somewhat limited sample size. One thing the authors should mention more clearly is the pupation periods for all investigated species, as only the periods for two species are named.
  
  Yes, unfortunately, we were only able to obtain pupae with pupation dates for two species. We have clarified this point in the methods.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2020.11.30.404111v3
www.biorxiv.org

New submission 09/08/2023, 09:12:17

1
1. Public_Reviews 09 Aug 2023
  
  in eLife
  
  Author Response
  
  Joint Public Review
  
  “Using computational modeling, this manuscript explores the effect of growth feedback on the performance of gene networks capable of adaptation. The authors selected 425 hypothetical synthetic circuits that were shown to achieve nearly perfect adaptation in two earlier computational studies (see Ma et al. 2009, and Shi et al. 2017). They examined the effects of cell growth feedback by introducing additional terms to the ordinary differential equation-based models, and performed numerical simulations to check the retainment and the loss of the adaptation responses of the circuits in the presence of growth feedback. The authors show that growth feedback can disrupt the gene network adaptation dynamics in different ways, and report some exceptional core motifs which allow for robust performance in the presence of growth feedback. They also used a metric to establish a scaling law between a circuit robustness measure and the strength of growth feedback. These results have important implications in the field of synthetic biology, where unforeseen interactions between designed gene circuits and the host often disrupt the desired behavior. The paper’s conclusions are supported by their simulation results, although these are presented in their summary formats and it would be useful for the community if the detailed results for each topology were available as a supplementary file or through the authors’ GitHub repository.”
  
  We will update our GitHub repository with detailed results for each topology, along with other simulation details and results that might be of interest to the readers.
  
  Strengths: “This work included a detailed investigation of the reasons for adaptation failure upon introducing cell growth to the systems. The comprehensiveness of the analysis makes the work stand out among studies of functional screening of network topologies of gene regulation.” “The authors’ approaches for assessment of robustness, such as the survival ratio Q, can be useful for a wide range of topologies beyond adaptation. The scaling law obtained with those approaches is interesting.”
  
  We are grateful to the referees and editors for their positive assessment of our work.
  
  Weaknesses 1: “The title suggests that the work investigates the ’effects of growth feedback on gene circuits’. However, the performance of ’nearly perfect adaptation’ was chosen for the majority of the work, leaving the question of whether the authors’ conclusion regarding the effects of growth feedback is applicable to other functional networks.”
  
  We will change the title of the paper from “Effects of growth feedback on gene circuits: A dynamical understanding” to “Effects of growth feedback on adaptive gene circuits: A dynamical understanding,” because the focus of our current work was on gene circuits with adaptation. Our work provided a framework that can be readily generalized to investigate the effects of growth feedback in other functional networks such as bistable gene circuits.
  
  Weaknesses 2: “This work relies extensively on an earlier study, evaluating only a selected set of 425 topologies that were shown to give adaptive responses (Shi et al., 2017). This limited selection has two potential issues. First, as the authors mentioned in the introduction, growth feedback can also induce emerging dynamics even without existing function-enabling gene circuits, as an example of the ”effects of growth feedback on gene circuits”. Limiting the investigation to only successful circuits for adaptation makes it unclear whether growth feedback can turn the circuits that failed to produce adaptation by themselves into adaptation-enabling circuits. Secondly, as the Shi et al. (2017) study also used numerical experiments to achieve their conclusions about successful topologies, it is unclear whether the numerical experiments in the present study are compatible with the earlier work regarding the choice of equation forms and ranges of parameter values. The authors also assumed that all readers have sufficient understanding of the 425 topologies and their derivation before reading this paper.”
  
  We will make the following revisions.
  
  We will modify the title of the paper as discussed above. The reviewers/editors are insightful that growth feedback could turn a non-adaptive circuit into an adaptation-enabling one - an interesting possibility worth further study.
  
  We will provide details of all the pertinent numerical simulations, highlighting the differences from those in the previous work (Shi et al., 2017). Briefly, our adaptation criteria are stricter than those utilized in that work. As a result, out of the 425 topologies, random sampling based on our criteria identified adaptation in 414 topologies. For the remaining 11 topologies, either our stricter criteria have eliminated the possibility for the gene circuits to be adaptive, or the adaptive region in the high-dimensional parameter space is too small to be detected by random sampling.
  
  We will describe the 425 topologies utilized in our study and provide more detail in the GitHub repository, including the topological structures and the parameter sets leading to adaptation.
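  
  As an illustration of what an adaptation criterion of this kind can look like, the sketch below computes a sensitivity and a precision for a step response, in the spirit of the measures used in the earlier screening literature (Ma et al. 2009). The function names, the example trace and the threshold values are assumptions made for illustration; they are not the criteria used in the study, which are described as stricter.

```python
def adaptation_metrics(output, input_before, input_after):
    """Return (sensitivity, precision) for a response to a step in input.

    `output` is a time series that starts at the pre-stimulus steady state
    and ends at the post-stimulus steady state.
    """
    o1, o2 = output[0], output[-1]
    # Point of largest excursion from the pre-stimulus baseline.
    o_peak = max(output, key=lambda o: abs(o - o1))
    rel_input = abs((input_after - input_before) / input_before)
    sensitivity = abs((o_peak - o1) / o1) / rel_input
    rel_error = abs((o2 - o1) / o1) / rel_input
    precision = float("inf") if rel_error == 0 else 1.0 / rel_error
    return sensitivity, precision


def is_adaptive(output, input_before, input_after,
                min_sensitivity=1.0, min_precision=10.0):
    # Threshold values are placeholders; stricter criteria would raise them.
    s, p = adaptation_metrics(output, input_before, input_after)
    return s > min_sensitivity and p > min_precision


# Made-up output trace: transient peak, then near-return to baseline.
trace = [1.0, 1.8, 1.4, 1.1, 1.01]
sensitivity, precision = adaptation_metrics(trace, input_before=1.0, input_after=1.5)
```

  Raising `min_precision` or `min_sensitivity` shrinks the region of parameter space that counts as adaptive, which is one way a stricter criterion could leave some topologies without any detected adaptive parameter set under random sampling.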
  
  Weaknesses 3: “The authors’ model does not describe the impact of growth via a biological mechanism: they model growth as an additional dilution rate and calculate growth rate based on a phenomenological description with growth rate occurring at a maximum (kg) scaled by the circuit ’burden’ b(t). Therefore, the authors’ model does not capture potential growth rate changes in parameter values (e.g., synthetic protein production falls with increasing growth rate; see Scott & Hwa, 2023).”
  
  We considered dilution due to cell growth as the dominant factor of growth feedback. In fact, we studied the adaptive circuits without growth and their ability to maintain their adaptive behaviors after dilution into a fresh medium, based on a recent work [Zhang, et al., Nature Chemical Biology 16.6 (2020): 695-701]. A higher growth rate can change synthetic protein production. However, the dynamic roles of the dilution and growth-affected production rate should be analogous, given that they both act as inhibitory factors arising from cell growth as mentioned by the reviewers/editors. Taking the growth effect on the production rate into account would require a more comprehensive study. We will elaborate on the limitation of our modeling framework and include the pertinent references (e.g., Scott & Hwa, 2023).
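  
  The modeling choice discussed here, growth entering as an extra dilution term with a growth rate capped at a maximum kg and reduced by the circuit burden, can be sketched for a single protein. The one-variable model, the burden definition `x / J`, the growth law `kg / (1 + burden)` and all parameter values are illustrative assumptions, not the equations of the manuscript.

```python
def simulate(kg, production=1.0, degradation=0.1, J=5.0, dt=0.01, t_end=200.0):
    """Integrate dx/dt = production - (degradation + mu) * x with forward Euler.

    mu = kg / (1 + x / J) is a hypothetical burden-dependent growth rate:
    kg is the maximal growth rate and x / J plays the role of the burden b(t).
    """
    x = 0.0
    for _ in range(int(t_end / dt)):
        burden = x / J
        mu = kg / (1.0 + burden)                      # growth slows as burden rises
        dxdt = production - (degradation + mu) * x    # growth acts as extra dilution
        x += dt * dxdt
    return x


x_no_growth = simulate(kg=0.0)     # settles near production / degradation
x_with_growth = simulate(kg=0.5)   # dilution by growth lowers the steady state
```

  In this toy setting the production rate is held constant, so the sketch shares the limitation raised by the reviewers: a growth-dependent production term would have to be added separately.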
  
  Weaknesses 4: “The authors made several claims about the bifurcations (infinite-period, saddle-node, etc) underlying the abrupt changes leading to failures of adaptations. There is a lack of evidence supporting these claims. Both local and global bifurcations can be demonstrated with semi-analytic approaches such as numerical continuation along with investigations of eigenvalues of the Jacobian matrix. The claims based on ODE solutions alone are not sound.”
  
  We will add this material to our next version of the paper. A further semi-analytic analysis can better justify the numerically discovered bifurcations.
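  
  The kind of semi-analytic check requested by the reviewers can be sketched on a toy system. The example below follows an equilibrium branch by natural-parameter continuation with Newton's method and tracks the leading Jacobian eigenvalue, which approaches zero as a saddle-node bifurcation is neared. The system (dx/dt = r + x^2, dy/dt = -y) is a textbook normal form chosen for illustration, not one of the gene circuits, and all function names are hypothetical.

```python
import cmath


def rhs(x, y, r):
    # Toy system with a saddle-node bifurcation at r = 0 (not a gene-circuit model).
    return r + x * x, -y


def jacobian(x, y, r):
    # Analytic Jacobian of rhs with respect to (x, y).
    return ((2.0 * x, 0.0), (0.0, -1.0))


def eigenvalues_2x2(m):
    (a, b), (c, d) = m
    tr, det = a + d, a * d - b * c
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return (tr + disc) / 2.0, (tr - disc) / 2.0


def newton_equilibrium(x, r, tol=1e-12, max_iter=50):
    # y* = 0 for every r, so only the x-equation needs solving.
    for _ in range(max_iter):
        f, _unused = rhs(x, 0.0, r)
        if abs(f) < tol:
            return x
        x -= f / (2.0 * x)
    return None  # no convergence: the branch may have ended


def continue_branch(r_values, x_start):
    """Follow one equilibrium branch, reusing each solution as the next guess."""
    branch = []
    x = x_start
    for r in r_values:
        x_new = newton_equilibrium(x, r)
        if x_new is None:
            break
        x = x_new
        lead = max(ev.real for ev in eigenvalues_2x2(jacobian(x, 0.0, r)))
        branch.append((r, x, lead))
    return branch


r_values = [-1.0 + 0.05 * k for k in range(20)]   # r from -1.0 up to about -0.05
branch = continue_branch(r_values, x_start=-1.0)  # stable branch x* = -sqrt(-r)
```

  Along the stable branch the leading real part stays negative and moves toward zero as r approaches the bifurcation point. A dedicated continuation package would be needed for global bifurcations such as the infinite-period case.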
  
  Weaknesses 5: “The impact of biochemical noise is not evaluated in this work; the author’s analysis is only carried out in a deterministic regime.”
  
  Our work focused on uncovering the deterministic dynamical mechanisms underlying growth-feedback-induced circuit failures in situations where all protein concentrations are high, so that neglecting the effects of biochemical noise can be justified. Incorporating noise into our analysis would significantly complicate the study and likely prevent the dynamical origin of the failures from being unveiled. Nonetheless, the effects of biochemical noise are important, and we will provide a discussion in the revised manuscript.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.06.543915v1
www.biorxiv.org

New submission 09/08/2023, 09:05:13

1
1. Public_Reviews 09 Aug 2023
  
  in eLife
  
  Author Response
  
  We would like to express our sincere gratitude for the detailed examination of our manuscript titled "Specific Modulation of CRISPR Transcriptional Activators through RNA-Sensing Guide RNAs in Mammalian Cells and Zebrafish Embryos." We deeply appreciate the time and effort put into the review process, especially considering the unforeseen delays. Your insightful comments and recommendations have provided critical perspectives that we believe will significantly enhance the quality of our work. In this letter, we will address the reviewers' concerns and outline the revisions we will make in response to the feedback.
  
  eLife assessment
  
  The authors aim to develop a CRISPR system that can be activated upon sensing an RNA. As an initial step to this goal, they describe RNA-sensing guide RNAs for controlled activation of CRISPR modification. Many of the data look convincing and while several steps remain to achieve the stated goal in an in vivo setting and for robust activation by endogenous RNAs, the current work will be important for many in the field.
  
  We wish to acknowledge and thank you for the thoughtful eLife assessment, which succinctly summarizes our ambition to create a CRISPR system controlled by RNA sensing. The synopsis provided encapsulates the essence of our research, emphasizing both the progress we have made and the challenges that lie ahead. This assessment fully resonates with our views.
  
  Reviewer #1 (Public Review):
  
  This paper describes RNA-sensing guide RNAs for controlled activation of CRISPR modification. This works by having an extended guide RNA with a sequence that folds back onto the targeting sequence such that the guide RNA cannot hybridise to its genomic target. The CRISPR is "activated" by the introduction of another RNA, referred to as a trigger, that competes with this "back folding" to make the guide RNA available for genome targeting. The authors first confirm the efficacy of the approach using several RNA triggers and a GFP reporter that is activated by dCas9 fused to transcriptional activators. A major potential application of this technique is the activation of CRISPR in response to endogenous biomarkers. As these will typically be longer than the first generation triggers employed by the authors they test some extended triggers, which also work though not always to the same extent. They then introduce MODesign which may enable the design of bespoke or improved triggers. After that, they determine that the mode of activation by the RNA trigger involves cleavage of the RNA complexes. Finally, they test the potential for their system to work in a developmental setting - specifically zebrafish embryos. There is some encouraging evidence, though the effects appear more subtle than those originally obtained in cell culture.
  
  Overall, the potential of a CRISPR system that can be activated upon sensing an RNA is high and there are a myriad of opportunities and applications for it. This paper represents a reasonable starting point having developed such a system in principle.
  
  The weakness of the study is that it does not demonstrate that the system can be used in a completely natural setting. This would require an endogenous transcript as the RNA trigger with a clear readout. Such an experiment would clearly strengthen the paper and provide strong confidence that the method could be employed for one of the major applications discussed by the authors. The zebrafish data relied on exogenous RNA triggers whereas the major applications (as I understood them) would use endogenous triggers.
  
  Related, most endogenous RNAs are longer than the various triggers tested and may require extensive modification of the system to be detected or utilised effectively. While additional data would clearly be beneficial, there should nevertheless be a more detailed discussion of these caveats and/or the strengths and applications of the system as it is presented (i.e. utility with synthetic triggers).
  
  We would like to thank Reviewer #1 for the thoughtful and comprehensive analysis of our work as well as for the constructive feedback provided. We agree with the observation regarding the subtler effects in the zebrafish embryos and the reliance on exogenous RNA triggers. Indeed, the utilization of endogenous transcripts as triggers in a natural setting is a logical next step. We further acknowledge the need to delve deeper into the complexities and challenges of our system, particularly concerning the detection of endogenous RNA, thus offering valuable insights for researchers looking to adapt our system for various applications. In the final version of our paper, we will indeed discuss these challenges in detail and provide a clear path for future users who might be interested in employing this system. Our expanded discussion will encompass the considerations required for high-throughput screens, combining both quantitative and experimental approaches for identifying endogenous RNAs that could act as triggers. We will also elaborate on the potential biotech applications related to the detection of synthetic RNA triggers. This includes its use in Synthetic Biology circuit design and the implementation of logic gates for mammalian cell engineering.
  
  Reviewer #2 (Public Review):
  
  In this work, the authors describe engineering of sgRNAs that render Cas9 DNA binding controllable by a second RNA trigger. The authors introduce several iterations of their engineered sgRNAs, as well as a computational pipeline to identify designs for user-specified RNA triggers which offers a helpful alternative to purely rational design. Also included is an investigation of the fate of the engineered sgRNAs when introduced into cells, and the use of this information to inform installation of modified nucleotides to improve engineered sgRNA stability. Engineered sgRNAs are demonstrated to be activated by trigger RNAs in both cultured mammalian cells and zebrafish.
  
  The conclusions made by the authors in this work are predominantly supported by the data provided. However, some claims are not consistent with the data shown and some of the figures would benefit from revision or further clarification.
  
  Strengths:
  
  The sgRNA engineering in this paper is performed and presented in a systematic and logical fashion. Inclusion of a computational method to predict iSBH-sgRNAs adds to the strength of the engineering.
  
  Investigation into the cellular fate of the engineered sgRNAs and the use of this information to guide inclusion of chemically modified nucleotides is also a strength.
  
  Demonstration of activity in both cultured mammalian cells and in zebrafish embryos increases the impact and utility of the technology reported in this work.
  
  Weaknesses:
  
  While the methods here represent an important step forward in advancing the technology, they still fall short of the dynamic range and selectivity likely required for robust activation by endogenous RNA.
  
  While the iSBH-sgRNAs where the RNA trigger overlaps with the spacer appear to function robustly, the modular iSBH-sgRNAs seem to perform quite a bit less well. The authors state that modular iSBH-sgRNAs show better activity without increasing background when the SAM system is added, but this is not supported by the data shown in Figure 3D, where in 3 out of 4 cases CRISPR activation in the absence of the RNA trigger is substantially increased.
  
  There is very little discussion of how the performance of the technology reported in this work compares to previous iterations of RNA-triggered CRISPR systems, of which there are many examples.
  
  We are very grateful to Reviewer #3 for the meticulous examination of our work, highlighting both the systematic approach in sgRNA engineering and areas for improvement. The insights offered in this review are extremely useful, and we are committed to addressing these points in the following sections of our response.
  
  Concerning the methods falling short of the dynamic range and selectivity required for robust activation by endogenous RNA, we acknowledge this limitation and recognize the need for improvement in this area. In the final version of our manuscript, we will provide a detailed discussion on how the selection of appropriate triggers might partially improve dynamic ranges and selectivity. This includes an exploration of various strategies and considerations that may enhance the robustness of our system. Regarding the inconsistent performance of the modular iSBH-sgRNAs, as observed in Figure 3D, we recognize this discrepancy and will clarify it in the following iteration. For the concern about the lack of comparison with previous iterations of RNA-triggered CRISPR systems, we recently published a comprehensive literature review on the existing systems for activation of CRISPR in response to RNA detection (doi/full/10.1089/crispr.2022.0052). In the final version of our manuscript, we will include a comparison between our system and existing technologies, thereby addressing this valid observation.
  
  Thank you once again to the entire review team for the meticulous examination and for your thoughtful and constructive feedback. Your insights are instrumental in refining our research, and we look forward to incorporating these changes as we finalize our manuscript.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.08.539738v1
www.biorxiv.org

New submission 08/08/2023, 09:06:50

1
1. Public_Reviews 08 Aug 2023
  
  in eLife
  
  Author Response
  
  We are very grateful to the editors and reviewers for their valuable comments on our manuscript. We have carefully considered all the comments and will provide a revised manuscript with our point-by-point responses as soon as possible. In the meantime, we will try our best to carry out additional experiments to bolster our conclusions. Here, we would like to respond provisionally to the public reviews.
  
  We appreciate the concerns raised by Reviewer 1 regarding the identification of cell types in our study. Specifically, they noted that the high proportion of NSCs within the astroglial lineage clusters is inconsistent with classic histology studies. We apologize for not clearly specifying in the text and figure legend that the data presented in Figure 2C were obtained from neonatal samples, which may explain the higher presence of NSCs. To rectify this issue, we will revise the text to ensure clarity regarding the age group from which the data in Figure 2C were obtained. Additionally, we commit to providing additional UMAP plots and quantitative analysis separately for different age groups to support our findings. This will allow a more accurate representation of the cell type composition, taking into consideration any potential variations that may occur with age.
  
  We appreciate Reviewer 2's acknowledgment that the findings of our study are interesting and relevant to a broader audience. However, the reviewer raised two major concerns that could weaken the conclusions drawn from our study. First, the reviewer noted that the number of sequenced nuclei in our study is lower than the calculated number required for detecting rare cell types. We note that, according to the computational modeling conducted by Tosoni et al. (Neuron, 2023), at least 21 neuroblast cells (NBs) can be identified out of 30,000 granule cells (GCs) from a total of 180,000 dentate gyrus (DG) cells. In our dataset, we sequenced 24,671 GC nuclei and 92,966 total DG cell nuclei, which also includes neonatal samples. The number of nuclei we sequenced is 4.5 times higher than that of Wang et al. (Cell Research, 2022), who also detected NBs. Therefore, it is reasonable to conclude that we were able to detect NBs. Moreover, the presence of these rare cell types has been demonstrated in our study through immunostaining techniques, which provides further evidence. Secondly, Reviewer 2 raised concerns about the low number of donors included in some of the groups, with only one donor (n=1) being represented in certain cases. We acknowledge these limitations and understand that the inclusion of a larger number of donors would strengthen the statistical power and generalizability of our findings. However, due to the scarcity of stroke or neonatal human samples, it is not feasible to collect a larger sample size within the expected timeframe. Although one sample is not enough to show the precise changes in cells and molecular mechanisms caused by stroke, it can provide a typical example to demonstrate our hypothesis that neural stem cells could be activated under conditions of injury. The latter is what we really want to address in the manuscript.
  Regarding the donors, we will provide more details, including any clinical characteristics available, to enhance the transparency of our study. Importantly, we have implemented strict quality control measures to support the reliability of our sequencing data. These measures include: 1) Prompt collection of tissue samples (3-4 hrs postmortem) to ensure the quality of isolated nuclei. 2) Only nuclei expressing more than 200 genes but fewer than 5000-8600 genes (depending on the peak of enrichment genes) were considered. On average, around 3000 genes were detected per cell. 3) The average proportion of mitochondrial genes in each sample was approximately 1.8%, with no sample exceeding 5%.
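  
  The gene-count filter and the per-sample mitochondrial summary described above can be sketched in a few lines. The record layout, the function names and the example nuclei are made up for illustration; the upper gene bound is passed per sample because the response states that it varied between 5000 and 8600.

```python
def filter_nuclei(nuclei, min_genes=200, max_genes=5000):
    """Keep nuclei with more than min_genes and fewer than max_genes detected genes."""
    return [n for n in nuclei if min_genes < n["n_genes"] < max_genes]


def mean_pct_mito(nuclei):
    """Average percentage of mitochondrial gene counts across the given nuclei."""
    return sum(n["pct_mito"] for n in nuclei) / len(nuclei)


# Made-up nuclei for illustration only.
sample = [
    {"barcode": "A", "n_genes": 150,  "pct_mito": 0.9},   # too few genes
    {"barcode": "B", "n_genes": 3000, "pct_mito": 1.5},
    {"barcode": "C", "n_genes": 2800, "pct_mito": 2.1},
    {"barcode": "D", "n_genes": 9000, "pct_mito": 1.0},   # too many genes (possible doublet)
]
kept = filter_nuclei(sample, max_genes=5000)
sample_mito = mean_pct_mito(kept)
```

  Here only nuclei B and C pass the filter. In practice this step would usually be run with a single-cell analysis toolkit rather than hand-written code.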
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.15.540723v3
www.medrxiv.org

New submission 07/08/2023, 10:13:36

1
1. Public_Reviews 07 Aug 2023
 
 in eLife
 
 Author Response:
 
 Reviewer #1 (Public Review):
 
 Summary: In this study, the authors generate a Drosophila model to assess disease-linked allelic variants in the UBA5 gene. In humans, variants in UBA5 have been associated with DEE44, characterized by developmental delay, seizures, and encephalopathy. Here, the authors set out to characterize the relationship between 12 disease-linked variants in UBA5 using a variety of assays in their Drosophila Uba5 model. They first show that human UBA5 can substitute all essential functions of the Drosophila Uba5 ortholog, and then assess phenotypes in flies expressing the various disease variants. Using these assays, the authors classify the alleles into mild, intermediate, and severe loss-of-function alleles. Further, the authors establish several important in vitro assays to determine the impacts of the disease alleles on Uba5 stability and function. Together, they find a relatively close correlation between in vivo and in vitro relationships between Uba5 alleles and establish a new Drosophila model to probe the etiology of Uba5-related disorders.
 
 Strengths: Overall, this is a convincing and well-executed study. There is clearly a need to assess disease-associated allelic variants to better understand human disorders, particularly for rare diseases, and this humanized fly model of Uba5 is a powerful system to rapidly evaluate variants and relationships to various phenotypes. The manuscript is well written, and the experiments are appropriately controlled.
 
 Reviewer #2 (Public Review):
 
 Relative simplicity and genetic accessibility of the fly brain make it a premier model system for studying the function of genes linked to various diseases in humans. Here, Pan et al. show that human UBA5, whose mutations cause developmental and epileptic encephalopathy, can functionally replace the fly homolog Uba5. The authors then systematically express in flies the different versions of the gene carrying clinically relevant SNPs and perform extensive phenotypic characterization such as survival rate, developmental timing, lifespan, locomotor and seizure activity, as well as in vitro biochemical characterization (stability, ATP binding, UFM-1 activation) of the corresponding recombinant proteins. The biochemical effects are well predicted by (or at least consistent with) the location of affected amino acids in the previously described Uba5 protein structure. Most strikingly, the severity of biochemical defects appears to closely track the severity of phenotypic defects observed in vivo in flies. While the paper does not provide many novel insights into the function of Uba5, it convincingly establishes the fly nervous system as a powerful model for future mechanistic studies.
 
 One potential limitation is the design of the expression system in this work. Even though the authors state that "human cDNA is expressed under the control of the endogenous Uba5 enhancer and promoter", it is in fact the Gal4 gene that is expressed from the endogenous locus, meaning that the cDNA expression level would inevitably be amplified in comparison. The fact that different effects were observed when some experiments were performed at different temperatures (18 °C vs. 25 °C) is also consistent with this. While I do not think this caveat weakens the conclusions of this paper, it may impact the interpretation of future experiments that use these tools, and thus should be clearly discussed in the paper. Especially considering the authors argue that most disease variants of UBA5 are partial loss-of-functions, the amplification effect could potentially mask the phenotypes of milder hypomorphic alleles. If the authors could also show that the T2A-Gal4 expression pattern in the brain matches well with that of endogenous RNA or protein (e.g. using HCR-FISH or antibody), it would help to alleviate this concern.
 
 We thank the reviewer for pointing out this limitation.
 
 Regarding the humanization strategy we used in the study, we agree that this is a binary system which may lead to overexpression of the target protein. However, as the reviewer also points out, this temperature-sensitive system also enables us to flexibly adjust the expression level of the target protein, which is especially useful to study partial LoF variants such as the UBA5 variants in this study. In our study we have successfully compared the relevant allelic strength of most of the variants, which supports the use of our system in future studies. However, we do agree that the gene dosage effect could vary widely, so it is difficult to directly predict the effects of one variant in humans based upon results obtained in a model organism.
 
 We agree with the reviewer that a masking effect may exist in our system due to its gene overexpression nature. However, we cannot conclude that this masking effect really affects the interpretation of Group IA variants in our tests. The three variants are mild LoF, which is also supported by the biochemical assays. Hence, the variants may not cause any phenotype even when they are expressed at a physiological level.
 
 Regarding the temporal and spatial expression pattern of the T2A-GAL4, the Bellen lab has generated T2A-GAL4 lines for more than 3,000 genes. The expression pattern of the vast majority of these GAL4 lines faithfully reflects the expression pattern of the endogenous genes, which has been documented in our previous publications (PMIDs 25824290, 29565247, 31674908, 35723254).
 
 Reviewer #3 (Public Review):
 
 Summary: Variants in the UBA5 gene are associated with rare developmental and epileptic encephalopathy, DEE44. This research developed a system to assess in vivo and in vitro genotype-phenotype relationships between UBA5 allele series by humanized UBA5 fly models and biochemical activity assays. This study provides a basis for evaluating current and future individuals afflicted with this rare disease.
 
 Strengths: The authors developed a method to measure the enzymatic reaction activity of UBA5 mutants over time by applying the UbiReal method, which can monitor each reaction step of ubiquitination in real time using fluorescence polarization. They also classified fruit flies carrying humanized UBA5 variants into groups based on phenotype. They found a correlation between biochemical UBA5 activity and phenotype severity.
 
 Weaknesses: In the case of human DEE44, compound heterozygotes with both loss-of-function and hypomorphic forms (e.g., p.Ala371Thr, p.Asp389Gly, p.Asp389Tyr) may cause disease states. The presented models have failed to evaluate such cases.
 
 We agree with the reviewer that our model did not reflect the situation of the individuals who are compound heterozygous for a Group IA variant (p.Ala371Thr, p.Asp389Gly, or p.Asp389Tyr) and a strong LoF variant. However, we argue that our results do show that the Group IA variants alone do not cause disease. As discussed in the manuscript, individuals homozygous for the p.Ala371Thr variant are healthy and do not present with an obvious phenotype. This is consistent with our findings in flies, and shows that the p.Ala371Thr variant is a mild LoF variant.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.07.17.23292782v1
www.biorxiv.org

New submission 02/08/2023, 08:27:36

1
1. Public_Reviews 07 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  We greatly appreciate the thoughtful suggestions made by the Reviewers. We have addressed all of their comments below, with our responses bulleted and in italics. We believe these changes have helped clarify the manuscript and strengthen it overall.
  
  Reviewer 1
  
  1) Figures 1B and Supp. Figure 1A: It would be worth mentioning that the wave-form in the 129 strain in response to QLA starts out like AJ and B6, but transitions to looking like the wild-derived strain. So, although not quite as drastic as the NZO and NOD strains, it is not quite like the other classical inbred strains.
  
  • We thank the reviewer for pointing this out. We have added further language to clarify the point:
  
  “Additionally, even with the clear separation between the clusters, inter-strain variation was still observed within the clusters (e.g. more 129 islets had plateau responses to 8G/QLA than the B6 or AJ).”
  
  2) The figures are generally excellent and really help to clarify the work in the paper. For Figure 2A, it would help even further if you could number the six different Ca++ parameters that are measured. They're all there, but it takes a bit of time to find them on the figure and numbering will make it easier on your reader.
  
  • We appreciate this suggestion and have implemented it in our revised Figure 2A. The Ca2+ parameters are now numbered, and the description of this figure has been adjusted accordingly in the results section.
  
  We added the revised text in the results section:
  
  “To elucidate strain differences in Ca2+ dynamics, we focused on six parameters of the Ca2+ waveform (Figure 2A): 1) peak Ca2+ (the top of each oscillation); 2) period (the length of time between two peaks); 3) active duration (the length of time for each Ca2+ oscillation measured at half of the peak height, also known as the oxidative “secretory” phase, or “MitoOx” (8)); 4) pulse duration (active duration plus extra time for Ca2+ extrusion); 5) silent duration (the electrically-silent “triggering” phase, also known as “MitoCat” (8), which culminates in KATP closure and membrane depolarization); and 6) plateau fraction (the active duration divided by the period, or the fraction of time spent in the active “secretory” phase).”
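  
  Several of the parameters defined above can be estimated from a sampled trace by thresholding at half height. The sketch below is a minimal illustration under stated assumptions: the half-height threshold is taken halfway between the trace minimum and maximum, the period is approximated by the spacing of upward threshold crossings rather than of peaks, and pulse and silent durations are omitted because their exact boundaries are not given here. The function name and the synthetic trace are hypothetical.

```python
def waveform_parameters(trace, dt):
    """Estimate peak, period, active duration and plateau fraction from a Ca2+ trace.

    trace: evenly sampled signal; dt: sampling interval (results use the same unit).
    """
    baseline, peak = min(trace), max(trace)
    half = baseline + 0.5 * (peak - baseline)   # half-height threshold (assumed definition)
    above = [v > half for v in trace]

    # Upward and downward crossings of the half-height threshold.
    rises = [i for i in range(1, len(above)) if above[i] and not above[i - 1]]
    falls = [i for i in range(1, len(above)) if not above[i] and above[i - 1]]

    # Period: mean spacing between successive upward crossings.
    periods = [(b - a) * dt for a, b in zip(rises, rises[1:])]
    period = sum(periods) / len(periods)

    # Active duration: mean time spent above threshold per complete oscillation.
    durations = []
    for rise in rises:
        fall = next((j for j in falls if j > rise), None)
        if fall is not None:
            durations.append((fall - rise) * dt)
    active = sum(durations) / len(durations)

    return {"peak": peak, "period": period,
            "active_duration": active, "plateau_fraction": active / period}


# Synthetic trace: 3 cycles of 10 samples, each with 4 samples high and 6 low, 1 s sampling.
trace = ([0.0] * 3 + [1.0] * 4 + [0.0] * 3) * 3
params = waveform_parameters(trace, dt=1.0)
```

  For this synthetic trace the period is 10 s, the active duration is 4 s and the plateau fraction is 0.4. Real recordings would need smoothing and baseline correction before thresholding.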
  
  3) Figure 4A, B: I was expecting to see Ca++ vs insulin parameters in the different strains/sexes. In addition to the heat maps, it would be useful to see the regression plots, showing where each strain and sex falls for the insulin and Ca++ parameters.
  
  • This is an excellent suggestion, and we have added a new Supplemental Figure 5 to provide examples of various strain/sex patterns that drive the correlations used for the heatmap and histogram in Figure 4A and B.
  
  We added text in the results section referring to this point:
  
  “Clustering the Ca2+ responses into distinct groups based on our observations of the waveforms (Figure 1B, Figure 4C-E, and Supplemental Figures 1 and 2) also occurs when correlating individual Ca2+ parameters to ex vivo secretion and clinical data (Supplemental Figure 5). For example, the anticorrelation between the 1st frequency component in 8G and percent insulin secreted in 8.3G/QLA (Supplemental Figure 5A) separates the classic inbred, wild-derived, and diabetes-susceptible strains into distinct groups despite the variability in the trait. The correlation between the silent duration in 8G/QLA and insulin secretion in 8.3G/QLA likewise groups by strain (Supplemental Figure 5B). Finally, some correlations, such as that between 8G/QLA/GIP silent duration and plasma insulin at sacrifice (Supplemental Figure 5C), can be strongly influenced by outlier strains; e.g., NZO. Collectively, these data demonstrate that genetics has a profound influence on key parameters of islet Ca2+ oscillations.”
  
  4) Please include methods for the insulin measurements collected in Fig. 4.
  
  • Thank you for pointing out this missing information. We have clarified that prior insulin measurements (plasma insulin and ex vivo static insulin secretion that were used in Figure 4 for correlation analysis) were completed in another previously published cohort of mice (reference 17: Mitok KA, Freiberger EC, Schueler KL, Rabaglia ME, Stapleton DS, Kwiecien NW, et al. Islet proteomics reveals genetic variation in dopamine production resulting in altered insulin secretion. The Journal of biological chemistry. 2018;293(16):5860-77).
  
  We added this new text (highlighted) to the results section to help clarify this point:
  
  “Fasting blood glucose and insulin levels were measured in mice at 19 weeks of age, except for the NZO males which were measured at 12 weeks of age. Glucose was analyzed by the glucose oxidase method using a commercially available kit (TR15221, Thermo Fisher Scientific), and insulin was measured by radioimmunoassay (RIA; SRI13K, Millipore). This is the same assay that was used to measure plasma insulin for the previously published cohort used for the correlation analysis in Figure 4 (17).”
  
  5) In the methods, please include details on the four conditions used for Ca++ imaging of the islets, and the timing for each condition.
  
  • We appreciate this guidance in clarifying our manuscript, and we have now included the conditions and timing for each condition in the methods section.
  
  We added the following text to the results section to help clarify this:
  
  “The solutions included 8 mM glucose (8G), 8 mM glucose + 2 mM glutamine, 0.5 mM leucine, and 1.5 mM alanine (8G/QLA), 8G/QLA + 10 nM glucose-dependent insulinotropic polypeptide (8G/QLA/GIP), and 2 mM glucose (2G), each of which were kept in a 37°C water bath.”
  
  Reviewer 2
  
  One major critique is that the authors studied "the human orthologues of the correlated mouse proteins that are proximal to the glycemia-associated SNPs in human GWAS". This implies two assumptions - (1) human and mouse proteins do not differ in terms of islet physiology and calcium signaling; (2) the proteins proximal to the SNPs are the causal factors for functional differences, though the SNPs could affect protein/gene function distant from the SNPs.
  
  • Thank you very much for highlighting this limitation in our study. We think this is very important to address, and we have done so in our Discussion section.
  
  We have added the following text to discuss this important issue:
  
  “Our approach to merge human GWAS with our findings in mouse assumes that the glycemic-related SNPs we nominated alter the abundance or function of the human orthologues. Most SNPs that are strongly associated with phenotypes in human GWAS are noncoding, residing within introns, promoters, 3’UTRs, or intergenic regions (e.g. Figure 6). Therefore, a limitation of our approach is the assumption that SNPs regulate the gene they are proximal to, which is not always accurate (76-78). To infer a more direct link between SNPs and potential target genes, we incorporated human islet chromatin data (37). Physical contact between a region containing SNPs and a distal gene supports a regulatory role, as for ACP1 (Figure 6B). Additionally, SNPs within regions of open chromatin (ATAC-seq) and actively transcribed regions (histone markers) suggest a higher likelihood of regulating transcription factor access. While this approach does not conclusively show a link between the SNPs and expression of the orthologue for our candidate proteins, these chromatin data more strongly suggest that the orthologue expression may be regulated by the candidates’ SNPs.”
  
URL

biorxiv.org/content/10.1101/2022.11.26.517741v3
www.biorxiv.org www.biorxiv.org

New submission 07/08/2023, 10:03:00

1
1. Public_Reviews 07 Aug 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 In response to the eLife assessment that “the analysis of the data is inadequate”, we strongly disagree and wish to point out that we in fact follow the latest IUPHAR community guidelines on bias identification and quantification (Kolb et al, 2022). These protocols are not yet being used in the RTK and FGF fields, and thus the reviewer is not familiar with them, or with the concept of ligand bias. Our responses to the technical comments start at the bottom of page 7 of this document.
 
 We have edited the paper by adding a scaling step-by-step protocol in the Supplementary Data. We have also expanded the Discussion to help readers understand what is measured and how it is very novel. We have also changed the title of the manuscript. The edits in the Manuscript are marked in yellow. Our response to the reviewer is given below.
 
 Question/comment: 1. Previous studies have demonstrated that the variability of signal transduction stimulated by different FGF family members originates from their preferential activation of different members of the FGFR family (Ornitz et al., 1996). For example, it was previously shown that members of the FGF8 subfamily preferentially activate FGFR3c, whereas members of the FGF4 subfamily activate FGFR1c more potently than other FGFs. Moreover, it was shown that FGF18, a member of the FGF8 subfamily, preferentially binds to and activates the FGFR3c isoform. Indeed, this can be seen in the data shown in Figure 3 in this manuscript, where maximum levels of FGFR1 pY653/4 and pFRS2 are reached at different concentrations when stimulated with increasing concentrations of each ligand in HEK293T cells.
 
 The reviewer is correct that there are differences in the signaling of the different FGFRs; however, these differences are not relevant for this work. This paper is only about FGFR1c, as this is the only FGF receptor which is expressed in the mesenchyme of the developing limb bud (early limb bud stage, before the onset of mesenchymal condensations) and encounters different FGF ligands. In the article, we analyze the mechanism by which one FGFR recognizes and responds to three different FGFs.
 
 The reviewer also correctly points out that differences in our work “can be seen in the data shown in Figure 3 in this manuscript, where maximum levels of FGFR1 pY653/4 and pFRS2 are reached at different concentrations when stimulated with increasing concentrations of each ligand in HEK293T cells”. This is correct, but this is a statement about the potencies of the ligands, which is just one of three characteristics we explore here, namely potencies, efficacies, and bias. To determine if ligand bias exists or not, we need to compare two ligands and two responses (such as growth arrest and ECM degradation, or pY653/4 and pFRS2 phosphorylation). Ours is the first report of ligand bias in FGFR1 signaling, and the presence of bias goes far beyond simply differences in potencies (Kolb et al, 2022). Ligand bias in FGFR1 has never been demonstrated before. In part, this is because there have been no cell lines that give us the opportunity to compare two functional responses to FGF stimulus, via just one endogenously expressed FGFR variant. Notice that the paper that the reviewer is citing, (Ornitz et al., 1996), compares only 1 (one) type of response, when induced by different ligands, i.e. proliferation, and thus cannot answer the question if ligand bias exists or not. We have edited the Discussion to emphasize this fact. We have also changed the title.
 
 Two studies meant to characterize FGF binding to the FGFRs (Ornitz et al., 1996; Zhang et al., 2006) have defined the main rules of the FGF-FGFR interaction, such as exclusivity of the FGF3 subfamily (FGF3, FGF7, FGF10) for the ‘b’ variants of the FGFR1 and FGFR2. These studies, however, do not measure ligand binding. They were carried out in BAF/3 cells, where the transfected FGFRs are treated with exogenous FGFs to cause cell proliferation. As such, the studies have several limitations. First, in BAF/3 cells, cell proliferation is used as a surrogate for FGF binding to the FGFR. The FGFRs activate cell proliferation via the RAS-ERK MAP kinase pathway. However, many other pathways of downstream signaling are initiated by FGFRs, regulating cell differentiation, migration, metabolism and apoptosis, in biological contexts. Using a single cellular response (cell proliferation) as a surrogate for FGF binding to their receptors will favor FGF ligands causing cell proliferation. FGFs which have a preference for other responses will incorrectly appear weakly binding and weakly activating in BAF/3 cells. Further, an FGF ligand binding with high affinity to the receptor but inducing a lower proliferative response will be recognized as less ‘preferential’ for the particular receptor in the BAF/3 assay. Second, the significant diversity of signaling of 18 FGFs through seven FGFR variants in mammalian development suggests that many previously unappreciated nodules of FGF-FGFR signaling exist, including the recently discovered FGF signaling towards primary cilia, or interaction with the insulin receptor system (Kunova Bosakova et al., 2019; Neugebauer et al., 2009; Nies et al., 2022). This diversity is not reflected in the BAF/3 assay, which responds to FGFs with only one phenotype. This is why we have used the RCS cells in the manuscript. In RCS cells, at least two qualitatively different cell responses can be induced by the FGF signaling, making the cell model ideal for elucidating biased signaling.
 
 The so-called ‘binding preferences’ based on the Ornitz articles are not binding measurements and should not be used universally to describe the FGF interactions with FGFRs, because we do not know what the term really means, nor what it is based on; the molecular basis of FGFR signaling in BAF/3 cells is poorly characterized. In our article, we model the processes occurring in every developing mammalian limb, where three FGF ligands (FGF4, FGF8, FGF9), released by the ectoderm at the surface of the limb bud, signal to the underlying mesenchymal cell expressing just one FGF receptor, the FGFR1c (Mariani and Martin, 2003; Tabin and Wolpert, 2007). Unlike the BAF/3 cells engineered to ectopically express one FGFR and treated by recombinant FGFs in the lab, all three FGFs are recognized by cells expressing FGFR1c, and each of the three FGFs delivers unique morphogenetic information. The mechanisms underlying differential signaling of multiple FGFs via one FGFR are poorly defined, as the term ‘preferential signaling’ does not provide a mechanistic explanation. Our article is a step towards understanding the complex processes of FGF ligand recognition and response. In our article, we evaluate the potency, the efficacy, the FGF-induced FGFR1c oligomerization and downregulation, and the conformation of the active FGFR1c dimers in response to FGF4, FGF8 and FGF9. We show that FGF4, FGF8, and FGF9 are biased ligands, and that bias can explain differences in FGF4, FGF8 and FGF9-mediated cellular responses in development.
 
 References
 
 Kolb P, Kenakin T, Alexander SPH, Bermudez M, et al. Community guidelines for GPCR ligand bias: IUPHAR review 32. Br J Pharmacol. 2022;179, 3651-3674.
 
 Kunova Bosakova M, Nita A, Gregor T, Varecha M, et al. Fibroblast growth factor receptor influences primary cilium length through an interaction with intestinal cell kinase. Proc Natl Acad Sci U S A. 2019;116(10):4316-4325.
 
 Mariani FV, Martin GR. Deciphering skeletal patterning: clues from the limb. Nature. 2003;423(6937):319-25.
 
 Nies VJM, Struik D, Liu S, Liu W, et al. Autocrine FGF1 signaling promotes glucose uptake in adipocytes. Proc Natl Acad Sci U S A. 2022;119(40):e2122382119.
 
 Neugebauer JM, Amack JD, Peterson AG, Bisgrove BW, Yost HJ. FGF signalling during embryo development regulates cilia length in diverse epithelia. Nature. 2009;458(7238):651-4.
 
 Ornitz DM, Xu J, Colvin JS, McEwen DG, et al. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.
 
 Tabin C, Wolpert L. Rethinking the proximodistal axis of the vertebrate limb in the molecular era. Genes Dev. 2007;21(12):1433-42.
 
 Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.
 
 Question/comment: In order to be sure that the 'biased agonist' described in this manuscript for FGF8 binding is not caused by binding preference towards different FGFR members, the authors should present data comparing cell signaling via FGFR3c stimulated by FGF4, FGF8, and FGF9.
 
 Here, we study signaling by FGFR1, which is the only receptor that is expressed in the mesenchyme of the developing limb bud. FGFR3 is not expressed there, and thus we do not study FGFR3 in this paper. FGFR3 is an important regulator of skeletal development, but is not involved in the early stages like FGFR1. When the bones are formed, FGFR3 regulates chondrocyte proliferation and differentiation in the growth plate cartilage (Colvin et al., 1996). In fact, we are currently performing experiments with FGFR3 and multiple FGF ligands, and we see that it also engages in biased signaling. However, these FGFR3 studies have no relevance to the current work and will be published separately.
 
 The so-called ‘binding preferences towards different FGFR members’, based on the Ornitz articles (Ornitz et al., 1996; Zhang et al., 2006), provide no mechanistic explanation for differential FGF signaling via the activation of a single FGFR. Our article is a step forward towards the mechanism, demonstrating, for the first time, that ‘ligand bias’ may explain differential signaling by FGF4, FGF8 and FGF9 via FGFR1c.
 
 References
 
 Colvin JS, Bohne BA, Harding GW, McEwen DG, Ornitz DM. Skeletal overgrowth and deafness in mice lacking fibroblast growth factor receptor 3. Nat Genet. 1996;12(4):390-7.
 
 Ornitz DM, Xu J, Colvin JS, McEwen DG, MacArthur CA, Coulier F, Gao G, Goldfarb M. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.
 
 Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.
 
 Question/comment: 2. It is well-established that FGFR signaling by canonical FGF family members including FGF4, FGF8, and FGF9 is dependent on interactions of heparin or heparan sulfate proteoglycans (HSPG) with the ligand and the receptors. There may be differential contributions of heparin to cell signaling mediated by FGF4, FGF8, and FGF9 binding and activation of different FGFRs expressed in RCS cells, as these cells express endogenous HSPG molecules. This question should be addressed by comparing cell signaling via FGFRs ectopically expressed in BAF/3 cells (which do not possess endogenous FGFRs and HSPG) stimulated by FGF4, FGF8, and FGF9 in the absence or presence of different heparin concentrations. This approach has been applied many times in the past to explore and establish the role of heparin in control of ligand-induced FGFR activation.
 
 The work cannot be done with BAF/3 cells, since the topic of the study is ligand bias so we need to compare at least two measurable responses. In RCS cells, the two functional responses are growth arrest and extracellular matrix degradation. In BAF/3 cells, ligand stimulation leads to one single response: proliferation.
 
 The HSPG and other sulphated proteoglycans work as low-affinity FGF co-receptors. They stabilize the FGF secondary structure, present the FGFs to the FGFRs, and participate in FGF-FGFR interactions (Yayon et al., 1991; Schlessinger et al., 2000; Zakrzewska et al., 2009). In the FGF field, the FGF-FGFR interaction is commonly supported by addition of exogenous heparin, which is a highly sulphated glycosaminoglycan capable of full substitution of the cell-bound HSPGs in their function as low-affinity FGF co-receptors.
 
 Most cells produce proteoglycans, including BAF/3 cells. The analysis of expression of FGFR overexpressed in BAF/3 cells demonstrated that FGFR1, FGFR2 and FGFR3 migrate as proteins of approximately 130-150 kDa (Ornitz et al., 1996; Fig. 1A), which implies extensive glycosylation in Golgi. For instance, the full-length amino acid sequence for human FGFR3 is 806 residues, which on acrylamide gel migrates as a band of approximately 85 kDa; heavier FGFR3 variants are Golgi-glycosylated proteins. The treatment with de-glycosylation enzymes reduces the molecular weight to the one expected from the amino acid sequence.
 
 To carry out the BAF/3 experiment with FGF4, FGF8, and FGF9 in the absence or presence of different heparin concentrations, as the referee suggests, makes no sense. In BAF/3 cells, all FGF stimulations were done in the presence of 2 μg/ml heparin (Ornitz et al., 1996; Zhang et al., 2006), because without heparin there would be no signaling. Even if the BAF/3 cells produce ample HSPGs, the heparin would still have to be used, because without it many of the FGFs would likely cause no response, regardless of the FGFR variant expressed. We and others have demonstrated that most of the FGFs require stabilization by heparin to elicit signaling in cells expressing abundant amounts of HSPG (Buchtova et al., 2015; Chen et al., 2012).
 
 Why should we compare the FGF signaling in BAF/3 cells transfected with FGFR1 with the RCS cells, which express endogenous FGFR1? In RCS cells, several cellular phenotypes caused by FGF signaling can be easily detected and quantified, in comparison with BAF/3 cells, which only respond to FGF signaling by proliferation. No bias in signaling can be established in cells which display only a single type of response. The RCS cells used in our paper represent one of the most tractable cellular models of FGFR signaling. There are more than 40 articles exploring the mechanisms of FGF-FGFR signaling in RCS cells, including mechanisms of FGF signal transduction, FGF regulation of cell cycle, cell proliferation, differentiation, premature senescence, loss of extracellular matrix, interaction of FGF signaling with WNT, cytokine and natriuretic peptide signaling, and others (Raucci et al., 2004; Priore et al., 2006; Kamemura et al., 2017; Kolupaeva et al., 2013; Krejci et al., 2005; Krejci et al., 2007; Krejci et al., 2010; Dailey et al., 2003; Rozenblatt-Rosen et al., 2002; Fafilek et al., 2018). In addition, the three treatments to inhibit pathological FGFR signaling which are now in human trials (RBM007, meclozine) or FDA-approved (vosoritide) were initially developed in RCS cells, benefiting from the well-characterized molecular mechanisms of FGF signaling (Krejci et al., 2005; Wendt et al., 2015; Kimura et al., 2021; Matsushita et al., 2013). In comparison with RCS cells, very little is known about the mechanisms of FGF signaling in BAF/3 cells, as the BAF/3 proliferation assay is used mostly to evaluate FGFR agonists and antagonists (Yamada et al., 2020; Kamatkar et al., 2019; Motomura et al., 2008). We have added this information to the revised Discussion.
 
 References
 
 Buchtova M, Oralova V, Aklian A, Masek J, et al. Fibroblast growth factor and canonical WNT/β-catenin signaling cooperate in suppression of chondrocyte differentiation in experimental models of FGFR signaling in cartilage. Biochim Biophys Acta. 2015 May;1852(5):839-50.
 
 Buchtova M, Chaloupkova R, Zakrzewska M, Vesela I, et al. Instability restricts signaling of multiple fibroblast growth factors. Cell Mol Life Sci. 2015 Jun;72(12):2445-59.
 
 Chen G, Gulbranson DR, Yu P, Hou Z, Thomson JA. Thermal stability of fibroblast growth factor protein is a determinant factor in regulating self-renewal, differentiation, and reprogramming in human pluripotent stem cells. Stem Cells. 2012 Apr;30(4):623-30.
 
 Fafilek B, Balek L, Bosakova MK, Varecha M, et al. The inositol phosphatase SHIP2 enables sustained ERK activation downstream of FGF receptors by recruiting Src kinases. Sci Signal. 2018 Sep 18;11(548):eaap8608.
 
 Kamemura N, Murakami S, Komatsu H, Sawanoi M, et al. Biochem Biophys Res Commun. 2017 Jan 29;483(1):82-87.
 
 Kamatkar N, Levy M, Hébert JM. Development of a Monomeric Inhibitory RNA Aptamer Specific for FGFR3 that Acts as an Activator When Dimerized. Mol Ther Nucleic Acids. 2019 Sep 6;17:530-539.
 
 Kimura T, Bosakova M, Nonaka Y, Hruba E, Yasuda K, et al. An RNA aptamer restores defective bone growth in FGFR3-related skeletal dysplasia in mice. Sci Transl Med. 2021;13(592):eaba4226.
 
 Kolupaeva V, Daempfling L, Basilico C. The B55α regulatory subunit of protein phosphatase 2A mediates fibroblast growth factor-induced p107 dephosphorylation and growth arrest in chondrocytes. Mol Cell Biol. 2013 Aug;33(15):2865-78.
 
 Krejci P, Masri B, Salazar L, Farrington-Rock C, et al. Bisindolylmaleimide I suppresses fibroblast growth factor-mediated activation of Erk MAP kinase in chondrocytes by preventing Shp2 association with the Frs2 and Gab1 adaptor proteins. J Biol Chem. 2007;282(5):2929-36.
 
 Krejci P, Masri B, Fontaine V, Mekikian PB, et al. Interaction of fibroblast growth factor and C-natriuretic peptide signaling in regulation of chondrocyte proliferation and extracellular matrix homeostasis. J Cell Sci. 2005 Nov 1;118(Pt 21):5089-100.
 
 Krejci P, Prochazkova J, Smutny J, Chlebova K, et al. FGFR3 signaling induces a reversible senescence phenotype in chondrocytes similar to oncogene-induced premature senescence. Bone. 2010;47(1):102-10.
 
 Matsushita M, Kitoh H, Ohkawara B, Mishima K, et al. Meclozine facilitates proliferation and differentiation of chondrocytes by attenuating abnormally activated FGFR3 signaling in achondroplasia. PLoS One. 2013;8(12):e81569.
 
 Motomura K, Hagiwara A, Komi-Kuramochi A, Hanyu Y, et al. An FGF1:FGF2 chimeric growth factor exhibits universal FGF receptor specificity, enhanced stability and augmented activity useful for epithelial proliferation and radioprotection. Biochim Biophys Acta. 2008 Dec;1780(12):1432-40.
 
 Ornitz DM, Xu J, Colvin JS, McEwen DG, MacArthur CA, Coulier F, Gao G, Goldfarb M. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.
 
 Priore R, Dailey L, Basilico C. Downregulation of Akt activity contributes to the growth arrest induced by FGF in chondrocytes. J Cell Physiol. 2006 Jun;207(3):800-8.
 
 Raucci A, Laplantine E, Mansukhani A, Basilico C. Activation of the ERK1/2 and p38 mitogen-activated protein kinase pathways mediates fibroblast growth factor-induced growth arrest of chondrocytes. J Biol Chem. 2004;279(3):1747-56.
 
 Robinson JW, Egbert JR, Davydova J, Schmidt H, et al. Dephosphorylation is the mechanism of fibroblast growth factor inhibition of guanylyl cyclase-B. Cell Signal. 2017;40:222-229.
 
 Rozenblatt-Rosen O, Mosonego-Ornan E, Sadot E, Madar-Shapiro L, et al. Induction of chondrocyte growth arrest by FGF: transcriptional and cytoskeletal alterations. J Cell Sci. 2002 Feb 1;115(Pt 3):553-62.
 
 Schlessinger J, Plotnikov AN, Ibrahimi OA, Eliseenkova AV, et al. Crystal structure of a ternary FGF-FGFR-heparin complex reveals a dual role for heparin in FGFR binding and dimerization. Mol Cell. 2000 Sep;6(3):743-50.
 
 Wendt DJ, Dvorak-Ewell M, Bullens S, Lorget F, et al. Neutral endopeptidase-resistant C-type natriuretic peptide variant represents a new therapeutic approach for treatment of fibroblast growth factor receptor 3-related dwarfism. J Pharmacol Exp Ther. 2015 Apr;353(1):132-49.
 
 Yamada R, Fukumoto R, Noyama C, Fujisawa A, et al. An epidermis-permeable dipeptide is a potential cosmetic ingredient with partial agonist/antagonist activity toward fibroblast growth factor receptors. J Cosmet Dermatol. 2020 Feb;19(2):477-484.
 
 Yayon A, Klagsbrun M, Esko JD, Leder P, Ornitz DM. Cell surface, heparin-like molecules are required for binding of basic fibroblast growth factor to its high affinity receptor. Cell. 1991 Feb 22;64(4):841-8.
 
 Zakrzewska M, Wiedlocha A, Szlachcic A, Krowarsch D, et al. Increased protein stability of FGF1 can compensate for its reduced affinity for heparin. J Biol Chem. 2009 Sep 11;284(37):25388-403. doi: 10.1074/jbc.M109.001289.
 
 Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.
 
 Question/comment: It is impossible to interpret the FGFR binding characteristics and cellular activates of FGF4, FGF8, and FGF9 in the absence of information about the role of heparin in their binding and activation.
 
 We do not measure ligand binding to FGFR1 in this study. We record biological responses when we treat with different FGF ligands, and thus we measure the efficacy and the potency of each ligand to induce a response, and then we compare 2 ligands and 2 responses to determine if bias exists or not. We do not ask questions about the role of heparin, as it is always there no matter if we treat with FGF4, FGF8, or FGF9.
 
 Why is it not possible to interpret our cellular data? In our article, the RCS cells were treated with FGFs in the presence of 1 μg/ml heparin, as clearly stated in the Methods section. Using heparin at 1 μg/ml or more, to stabilize FGFs and negate the effect of endogenous HSPG, is a standard approach in the FGF field. This includes the two articles which the whole field has used for more than 20 years as a basic reference for FGF-FGFR interactions (Ornitz et al., 1996; Zhang et al., 2006). In these studies, 2 μg/ml of heparin along with FGFs was used to treat BAF/3 cells; no experiments were conducted without heparin, as it does not make sense. Most likely, without heparin the obtained FGF-FGFR ‘preferences’ would, in fact, be the differences in FGF thermal stability, as we clearly demonstrate in our previous study (Buchtova et al., 2015). The latter article gives detailed information about the role of heparin in the signaling of multiple FGFs in RCS cells.
 
 References
 
 Buchtova M, Chaloupkova R, Zakrzewska M, Vesela I, Cela P, Barathova J, Gudernova I, Zajickova R, Trantirek L, Martin J, Kostas M, Otlewski J, Damborsky J, Kozubik A, Wiedlocha A, Krejci P. Instability restricts signaling of multiple fibroblast growth factors. Cell Mol Life Sci. 2015 Jun;72(12):2445-59.
 
 Ornitz DM, Xu J, Colvin JS, McEwen DG, MacArthur CA, Coulier F, Gao G, Goldfarb M. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.
 
 Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.
 
 Technical Comments/Answers
 
 Question/comment: 3. It is not clear how some of the experimental data were analyzed. Blots in Figures 3A and 3B should include controls (total FGFR1 for pY653/4 and total FRS for pFRS2). How are the data shown in Figure 3C normalized? It does look like the level of phosphorylation was all normalized against the strongest signals irrespective of which ligand was used. Each data representing each ligand should be separately normalized.
 
 The reviewer is correct that most often in the RTK literature “each data representing each ligand is separately normalized”. But this approach will eliminate all the information about ligand efficacies and about ligand bias; it will only yield information about the potencies. Here we are not only interested in the potencies, as we are also interested to determine if bias exists or not. As such, we follow scaling protocols that have been established and are currently recommended for ligand bias studies (Kolb et al, 2022).
 
 One way to explain why the scaling that the reviewer is recommending is not correct for this work is to look at equation 2. What the reviewer is suggesting is to set all values of Etop to 1. In this case, the bias coefficient will depend only on the measured potencies, EC50. But this contradicts the very definition of bias, as it is NOT a difference in potencies only. In the literature, differences in potencies are called “quantitative differences”, while ligand bias describes differences which are called “qualitative” or “fundamental” (Kenakin, 2019).
 
 To eliminate confusion, we have added a scaling protocol to the Supplement of the paper.
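 The argument about Etop can be illustrated numerically. Equation 2 is not reproduced in this response, so the sketch below assumes a commonly used form of the bias coefficient, the difference between a ligand and a reference ligand in log10[(Etop,A/EC50,A)·(EC50,B/Etop,B)] for two responses A and B. That form, the function name, the dictionary keys, and all numbers are assumptions for illustration, not the authors' protocol.

```python
import math


def bias_coefficient(ligand, reference):
    """Assumed bias coefficient between two responses, A and B.

    Each argument is a dict with keys 'etop_a', 'ec50_a', 'etop_b', 'ec50_b'
    (hypothetical names). A value of 0 means no bias relative to the reference.
    """
    def log_ratio(p):
        return math.log10((p["etop_a"] / p["ec50_a"]) * (p["ec50_b"] / p["etop_b"]))

    return log_ratio(ligand) - log_ratio(reference)


# Hypothetical ligands with identical potencies (EC50) but different efficacies.
reference = {"etop_a": 1.0, "ec50_a": 1.0, "etop_b": 1.0, "ec50_b": 1.0}
ligand = {"etop_a": 1.0, "ec50_a": 1.0, "etop_b": 0.5, "ec50_b": 1.0}

beta_scaled = bias_coefficient(ligand, reference)

# Normalizing each ligand separately amounts to forcing every Etop to 1.
ligand_normalized = dict(ligand, etop_a=1.0, etop_b=1.0)
beta_normalized = bias_coefficient(ligand_normalized, reference)
```

 In this made-up case the bias is nonzero when efficacies are retained and collapses to zero once every Etop is forced to 1, because only the EC50 values remain in the expression.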
 
 References
 
 Kolb P, Kenakin T, Alexander SPH, Bermudez M, et al. Community guidelines for GPCR ligand bias: IUPHAR review 32. Br J Pharmacol. 2022;179, 3651-3674.
 
 Kenakin T. Biased Receptor Signaling in Drug Discovery. Pharmacol Rev 2019;71, 267-315.
 
 Question/comment: 4. On page 6, the authors used the plot shown in Figure 3 for 'FGFR downregulation' to conclude that "the effect of FGF4 on FGFR1 downregulation is smaller when compared to the effects of FGF8 and FGF9". However, it is unclear how the data shown in the plot was normalized - none of the data seem to reach "1.0". Moreover, the plot seems to suggest that FGF4 can strongly downregulate FGFR as it can downregulate FGFR with higher potency.
 
 The Western blots assessing FGFR1 expression are easy to scale, as the value in the absence of ligand is set to 1. The expression decreases as a function of the ligand concentration. We plot FGFR1 downregulation, so we subtract the scaled FGFR1 band intensities from 1. The total amount of FGFR1 never becomes undetectable (i.e. zero) as the ligand concentration is increased. Thus, a value of 1 in the downregulation curve is never obtained.
 
 We have added a protocol for this scaling in the Supplement.
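 The downregulation scaling described in this answer can be sketched as follows, assuming downregulation is computed as 1 minus the band intensity scaled to the no-ligand lane. The function name and the intensity values are hypothetical; the authors' actual protocol is the one in their Supplement.

```python
def downregulation(band_intensities, no_ligand_intensity):
    """Scale band intensities to the no-ligand lane, then return 1 - scaled.

    A result of 0 means no downregulation; 1 would require the receptor band
    to become undetectable.
    """
    if no_ligand_intensity <= 0:
        raise ValueError("no-ligand intensity must be positive")
    scaled = [intensity / no_ligand_intensity for intensity in band_intensities]
    return [1.0 - value for value in scaled]


# Hypothetical band intensities at increasing ligand concentration.
curve = downregulation([200, 150, 100], no_ligand_intensity=200)
```

 As long as some FGFR1 remains detectable at every concentration, every point on such a curve stays below 1.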
 
 Question/comment: 5. The structural basis of FGFR1 ligand bias and the different dimeric configurations and interactions between the kinase domain of FGFR1 dimers are not warranted (Figure 6). In the absence of any structural experimental data of different forms of FGFR dimers stimulated by FGF ligands, the model presented in the manuscript is speculative and misleading.
 
 This statement about Figure 6 is not fully correct because Figure 6A and B show experimental data. These are FRET experiments which show that the biased ligand, FGF8, induces a different FGFR1 transmembrane domain conformation, as compared to FGF4 and FGF9.
 
 The rest of the panels in Figure 6 show modeling using PyRosetta. These are indeed not experimental data, but to the best of our knowledge this is the very first time PyRosetta has been used to predict kinase-kinase interfaces.
 
URL

biorxiv.org/content/10.1101/2022.01.06.475273v4
www.biorxiv.org

New submission 07/08/2023, 09:51:13

1
1. Public_Reviews 07 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  The present study examined the physiological mechanisms through which impaired TG storage capacity in adipose tissues affects systemic energy homeostasis in mice. To accomplish this, the authors deleted DGAT1 and DGAT2, crucial enzymes for TG synthesis, in an adipocyte-specific manner. The authors found that ADGAT DKO mice substantially lost the adipose tissues and developed hypothermia when fasted; however, surprisingly, ADGAT DKO mice were metabolically healthy on a high-fat diet. The authors found that it was accompanied by elevated energy expenditure, enhanced glucose uptake by the BAT, and enhanced browning of white adipose tissues. This unique animal model provided exciting opportunities to identify new mechanisms to maintain systemic energy homeostasis even in a compromised energy storage capacity. Overall, the data are compelling and well support the conclusion of this paper. The manuscript is clearly written.
  
  We thank the reviewer for the time invested to critically review our paper.
  
  Reviewer #2 (Public Review):
  
  Here, Chitraju et al have studied the phenotype of mice with an adipocyte-specific deletion of the diacylglycerol acyltransferases DGAT1 and DGAT2, the two enzymes catalyzing the last step in triglyceride biosynthesis. These mice display reduced WAT TG stores but contrary to their expectations, the TG loss in WAT is not complete and the mice are resistant to a high-fat diet intervention and display a metabolically healthier profile compared to control littermates. The mechanisms underlying this are not entirely clear, but the double knockout (DKO) animals have increased EE and a lower RQ suggesting that enhanced FA oxidation and WAT "browning" may be involved. Moreover, both adiponectin and leptin are expressed in WAT and are detectable in circulation. The authors propose that "the capacity to store energy in adipocytes is somehow sensed and triggers thermogenesis in adipose tissue. This phenotype likely requires an intact adipocyte endocrine system...." Overall, I find this to be an interesting notion.
  
  We thank the reviewer for the time invested to critically review our paper.
  
  Reviewer #3 (Public Review):
  
  In this study, the authors sought to test the hypothesis that blocking triglyceride storage in adipose tissue by knockout of DGAT1 and DGAT2 in adipocytes would lead to ectopic lipid deposition, lipodystrophy, and impaired glucose homeostasis. Surprisingly, the authors found the opposite result, with DGAT1/2 DKO in adipocytes leading to increased energy expenditure, minimal ectopic lipid deposition, and improved glucose homeostasis with HFD feeding. These metabolic improvements were largely attributed to increased beiging of the white fat and increased brown adipose tissue activity. This study provides an interesting new paradigm whereby impairing fat storage, the major function of adipose tissue, does not lead to severe metabolic disease, but rather improves it. The authors provide a comprehensive assessment of the metabolism of these DKO mice under chow and HFD conditions, which support their claims. The study lacks in mechanistic insight, which would strengthen the study, but does not detract from the authors' major conclusions.
  
  We thank the reviewer for the time invested to critically review our paper.
  
  The conclusions of this paper are mostly well-supported, but some aspects should be clarified and extended.
  
  1) The authors claim the beiging of WAT of ADGAT DKO mice is partially through the SNS; however, housing these mice at thermoneutrality did not block the beiging, which seems to negate that claim. Is there evidence of increased cAMP/PKA activation in the adipose tissues of ADGAT DKO to support the premise that the beiging is activated by the SNS, even at thermoneutrality? Alternatively, if the authors block beta-adrenergic receptors with antagonists, such as propranolol, does this block the beiging?
  
  We are currently unsure of the mechanism(s) for WAT beiging and whether it requires the SNS. We attempted denervation experiments to ablate SNS input; however, the results were consistent with partial denervation and not clearly interpretable, so we elected not to include them in the manuscript. Unfortunately, we did not measure cAMP/PKA activation or utilize beta blockers in an attempt to block SNS activation. Due to a recent laboratory move, there are no study mice available to perform these experiments.
  
  2) It's been shown that autocrine FGF21 signaling is sufficient to promote beiging of iWAT (PMID 34192547). The authors show Fgf21 mRNA is increased in iWAT of chow-fed ADGAT DKO mice. Is Fgf21 also increased in iWAT of HFD-fed mice? This and measurement of local FGF21 secretion by adipocytes would strengthen this study.
  
  We thank the reviewer for this question. Unfortunately, we did not measure Fgf21 mRNA levels in iWAT of HFD-fed mice or FGF21 secretion by adipocytes, and mice are not currently available. We agree, however, that FGF21 is a candidate for mediating this phenotype. Testing this idea would likely require crossing the ADGAT DKO mice with FGF21 KO mice. Arguing against FGF21 as contributing systemically, plasma levels were similar in HFD-fed ADGAT DKO mice and controls.
  
  3) The primary adipocytes in Figure 5–figure supplement 2A do not appear to have any depletion in TG stores, suggesting this may not be an appropriate model to study the cell autonomous effects of ADGAT DKO on beiging. The authors should use DGAT inhibitors instead to corroborate or investigate this question.
  
  We agree with the reviewer that primary adipocytes from ADGAT DKO mice may not be the best model to study the cell autonomous effects of beiging, particularly since they are accumulating lipids. On the other hand, it’s not likely that DGAT inhibitors would be any better than the genetic deletions of the enzymes. Presumably, the neutral lipids are being synthesized by enzymes other than DGAT1 or DGAT2.
  
  4) Multiple studies have shown the importance of lipolysis for the activation of brown and beige thermogenic programs (PMID 35803907, 34048700) and can be potentiated by HFD feeding (PMID 34048700). In the absence of DGAT activity in ADGAT DKO mice, it seems plausible that free fatty acids could be elevated, especially in the context of HFD. Are free fatty acids elevated in the adipose tissues, which could promote thermogenic gene expression?
  
  We thank the reviewer for pointing this out. Although we cannot exclude this mechanism, arguing against it, we found lower levels of almost all free fatty acid species in iWAT of chow diet fed ADGAT DKO (Figure 5–figure supplement 1, metabolomics). Additionally, plasma FFA were reduced in these mice.
  
  5) The lack of ectopic lipid deposition in the ADGAT DKO mice is striking, especially under HFD conditions. Can the increased energy expenditure fully account for the difference in whole body fat accumulation between Control and DKO mice or have the mice activated other energy disposal mechanisms? Please discuss or include measurement of fat excretion in the feces to strengthen this study.
  
  Although decreased lipid absorption may conceivably contribute to energy loss, we would not expect this to occur in adipocyte-specific knockout mice, and we did not measure the lipid content in the feces. We have added a discussion point to the manuscript.
  
  Reviewer #1 (Recommendations for the Authors):
  
  The authors wish to clarify the following points to strengthen this exciting work further.
  
  1) The authors demonstrated that DKO mice exhibited enhanced browning of WAT even under a thermoneutral condition, and this occurred in a non-cell autonomous fashion. Accordingly, the authors suggested the possibility that SNS activity was enhanced in DKO mice. It would be intriguing to examine the extent to which lipolysis is indeed enhanced in DKO mice. For instance, do DKO mice have higher FFA and glycerol levels than controls in circulation? This could explain a part of the phenotype, as a recent work suggested that WAT lipolysis triggers beige progenitor cell proliferation in WAT.
  
  We thank the reviewer for this question. Although this is an interesting idea, we found that ADGAT DKO mice have lower levels of free fatty acids and glycerol in the circulation, indicating that lipolysis is not likely the underlying mechanism for increased beiging in ADGAT DKO mice. FFA were also reduced in the iWAT of the ADGAT DKO mice (as shown in supplemental data for Figure 5).
  
  2) The authors suggested the possibility that other candidates in the DGAT2 gene family might compensate for the lack of DGAT1 and DGAT2. It will be insightful if the authors elaborate on this part - e.g., discussing any transcriptional changes of DGAT2 family members in the WAT of DKO mice.
  
  We thank the reviewer for this question. Previous studies showed (Yen et al., 2005) that monoacylglycerol acyltransferase (MGAT) enzymes also possess some TG synthesis activity. We found increased mRNA expression of MGAT1 and MGAT2 enzymes in white adipose tissue of ADGAT DKO mice. We have now included these data in Figure 1–figure supplement 1G.
  
  3) Minor: Statistics of the AUC for the GTT and ITT (Fig. 3G and 3H).
  
  We thank the reviewer for this point. We have now updated the figures with statistics of the AUC for GTT and ITT.
  
  Reviewer #2 (Recommendations for the Authors):
  
  1) The authors suggest that the DKOs are protected against a high-fat diet due to an intact endocrine function combined with increased FA oxidation and WAT browning. This phenotype is interesting but as the authors write, the underlying mechanisms remain unclear. Furthermore, how important is retained endocrine function, in relation to WAT browning, in explaining the resistance to a high-fat diet in the DKOs? As these mice are born with the double DGAT KO, it is possible that compensatory mechanisms explain some of the observed effects. What happens with the endocrine function/browning effect if DGAT1/2 is inhibited in cells that already contain full TG stores? While I understand that studies in an inducible KO model are outside the scope of this study, data in cells where the effects of DGAT inhibition are studied early and late during differentiation would be interesting and could at least be discussed.
  
  We thank the reviewer for this interesting question. It is possible that ADGAT DKO mice have adipose tissue-derived factors that act in an endocrine or paracrine manner to induce beiging. Unfortunately, we do not have data addressing this point, but we have added a discussion of this point to the revised manuscript.
  
  2) While the possibility to dissociate TG storage from the endocrine function of WAT is attractive, the authors have only studied two adipokines. Do they have data on any other adipokine(-s) supporting the claim that the secretory function is intact?
  
  We thank the reviewer for this question. Regrettably we did not measure additional adipokines, and the mice are no longer available for study due to a recent lab move.
  
  Reviewer #3 (Recommendations for the Authors):
  
  Minor comments/suggestions:
  
  1) The authors show multiple phospholipid species were increased by ADGAT DKO. Cardiolipin has been shown to promote brown fat thermogenesis (PMID 29861389). Were cardiolipin levels changed by ADGAT DKO?
  
  We thank the reviewer for this question. We found cardiolipin levels were increased in iWAT of ADGAT DKO mice. However, we have not measured cardiolipin levels in brown fat.
  
  2) A recent study (PMID 36914626) has shown that inhibition of lipogenesis in adipose tissue impairs autophagy and also causes beiging of white adipose tissue. Is autophagy affected by ADGAT DKO? Are de novo lipogenesis enzymes affected by the DKO?
  
  We thank the reviewer for this interesting suggestion. We did find that mRNA levels of genes involved in de novo lipogenesis (Srebp1c, Acc, Fas) were decreased in iWAT of ADGAT DKO mice, as expected from some of our other studies involving DGAT inactivation. Unfortunately, we did not measure autophagy per se in iWAT of ADGAT DKO mice.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.05.05.490833v3
www.biorxiv.org

New submission 07/08/2023, 09:44:25

1
1. Public_Reviews 07 Aug 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  We thank the editors and reviewers for their thoughtful consideration of our manuscript. Here, we address the reviewers’ points.
  
  Reviewer #1 (Public Review):
  
  Tomasi et al. performed a combination of bioinformatic and next-generation tRNA sequencing experiments to predict the set of tRNA modifications and their corresponding genes in the tRNAs of the pathogenic bacterium Mycobacterium tuberculosis. Long known to be important for translation accuracy and efficiency, tRNA modifications are now emerging as having regulatory roles. However, the basic knowledge of the position and nature of the modifications present in a given organism is very sparse beyond a handful of model organisms. Studies that can generate the tRNA modification maps in different organisms along the tree of life are good starting points for further studies. The focus here on a major human pathogen that is studied by a large community raises the general interest of the study. Finally, deletion of the gene mnmA responsible for the insertion of s2U at position 34 revealed defects in growth in macrophages but not in test tubes, suggesting regulatory roles that will warrant further studies. The conclusions of the paper are mostly supported by the data but the partial nature of the bioinformatic analysis and absence of Mass-Spectrometry data make it incomplete. The authors do not take advantage of the Mass spec data that is published for Mycobacterium bovis (PMID: 27834374) to discuss what they find.
  
  1) The authors say they took a list of proteins involved in tRNA modifications from Modomics and manually added a few, but we do not know the exact set of proteins that were used to search the M. tuberculosis genome.
  
  Thank you for pointing out this issue. We added the complete list of proteins used for the BLAST query as Supplemental Table 1.
  
  2) The absence of mnmGE genes in TB suggested that the xcm5U derivatives are absent. These are present in M. bovis (PMID: 27834374). Are the MnmEG genes found in M. bovis? If yes, then the authors should perform a phylogenetic distribution analysis in the Mycobacterial clade to see when they disappeared. If they are not present in M. bovis then maybe a non-orthologous set of enzymes do the same reaction and then the authors really do not know what modification is present or not at U34 without LC-MS. The exact same argument can be given for the xmo5U derivatives that are also found in M. bovis but not predicted by the authors in M. tuberculosis.
  
  The reviewer raises a valid point. In M. bovis mnm5U and cmo5U derivatives were observed in LC-MS analysis. However, we did not identify candidate genes known to be involved in the biogenesis of mnm5U and cmo5U in the Mycobacteriaceae, including M. bovis and Mtb, suggesting that if these modifications are indeed present, they are not synthesized through canonical biogenesis pathways in this family. There are several examples where the same modification is generated by distinct modification enzymes (Kimura, 2021). These observations raise the interesting possibility that in the Mycobacteriaceae and most species in actinomycetota (except for Bifidobacterium, Corynebacterium and Rhodococcus species), major wobble modifications are generated by biosynthesis pathways that are distinct from those employed by well-characterized organisms. Future studies will examine this hypothesis.
  
  3) Why is the Psi32 predicted by the authors because of the presence of the Rv3300c/Pus9 gene not detected by CMC-treated tRNA seq while the other Psi residues are? Members of this family can modify both rRNA and tRNA. So the presence of the gene does not guarantee the presence of the modification in tRNAs.
  
  Thank you very much for the careful read. We did not include RluA in the list of query proteins because it is not classified as a tRNA modification enzyme in Modomics. Additionally, the CMC-coupled tRNA-seq is imperfect for detection of all pseudouridylated positions. Due to this limitation, we only assigned modifications that are supported both by the presence of putative biosynthetic enzymes and by RT-derived signatures. As the reviewer points out, we cannot rule out that this homolog targets only rRNAs. We clarified this possibility in the revised manuscript by adding the following sentence: “Additionally, CMC treatment may not identify Ψ at all positions, thus, the targets of Rv3300 and Rv1540 remain unclear. Since these genes are similar to E. coli rluA, which also targets rRNA, these genes may target rRNAs instead of tRNAs” (lines 298-300).
  
  In the revised manuscript, RluA was added to the BLAST query for creating Fig. 2. Interestingly, Rv3300c is more similar to Pus9 than RluA, while Rv1540 is the Mtb gene most similar to E. coli RluA suggesting that these two genes encode pseudouridylases that target different species of tRNAs/rRNAs.
  
  4) Why are tsaBED not essential but tsaC (called sua5 by the authors) essential?
  
  Thank you for pointing out this interesting observation. We are also curious about differences in the essentiality among t6A biogenesis genes. We speculate that TsaC has critical roles in cell viability other than t6A synthesis. TsaC synthesizes threonylcarbamoyl-AMP as an intermediate for t6A biogenesis. Thus, it is possible that this intermediate has a role in other essential cellular activities besides t6A biogenesis. Further study of these factors in Mtb could reveal interesting crosstalk between modification synthesis and other cellular activities.
  
  Reviewer #2 (Public Review):
  
  In this study, Tomasi et al identify a series of tRNA modifying enzymes from Mtb, show their function in the relevant tRNA modifications and by using at least one deleted strain for MnmA, they show the relevance of tRNA modification in intra-host survival and postulate their potential role in pathogenesis.
  
  Conceptually it is a wonderful study, given that tRNA modifications are so fundamental to all life forms, showing their role in Mtb growth in the host is significant. However, the authors have not thoroughly analyzed the phenotype. The growth defect aspect or impact on pathogenesis needs to be adequately addressed.
  
  The authors show that ΔmnmA grows equally well in the in vitro cultures as the WT. However, they show attenuated growth in the macrophages. Is it because Glu1_TTC and Gln1_TTG tRNAs are not the preferred tRNAs for incorporation of Glu and Gln, respectively? And for some reason, they get preferred over the alternate tRNAs during infection? What dictates this selectivity?
  
  Thank you very much for raising this excellent point. As the reviewer suggests, the attenuation of ΔmnmA Mtb growth inside of macrophages could be caused by disparate codon usage between genes required for in vitro growth and intracellular growth. Among multiple codons encoding Glu, Gln, or Lys, s2U modification-dependent codons might be preferentially distributed in genes associated with intracellular growth. For example, Mtb has two tRNA isoacceptors, Glu1_TTC and Glu2_CTC, to decipher two Glu codons, GAA and GAG. According to the wobble pairing rule, GAA is only decoded by Glu1_TTC, whereas GAG is decoded by both Glu1_TTC and Glu2_CTC; i.e., GAG can be deciphered by an s2U-independent tRNA. Thus, genes required for intracellular growth might be enriched with GAA, an s2U-dependent codon. Similar codon usage differences could be present in Gln and Lys codons deciphered by s2U-containing tRNAs. In the revised manuscript, we included a new paragraph in the discussion explaining the possibility that differences in codon usage could contribute to the intracellular fitness defect of the ΔmnmA Mtb mutant (lines 323-332).
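
  The codon-usage idea in this response can be illustrated with a short Python sketch. It counts the two Glu codons in an in-frame coding sequence and reports the fraction that are GAA, the codon described above as decoded only by the s2U-containing Glu1_TTC tRNA. The function name and the toy sequence are hypothetical and chosen for illustration only; this is not the analysis performed in the manuscript.

```python
from collections import Counter


def glu_codon_usage(cds):
    """Count GAA and GAG codons in an in-frame coding sequence (DNA)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    # Split the sequence into consecutive, non-overlapping codons.
    codons = Counter(cds[i:i + 3] for i in range(0, len(cds), 3))
    gaa, gag = codons["GAA"], codons["GAG"]
    total = gaa + gag
    # Fraction of Glu codons that depend on the s2U-containing tRNA.
    fraction = gaa / total if total else None
    return {"GAA": gaa, "GAG": gag, "GAA_fraction": fraction}


# Toy sequence (hypothetical): ATG GAA GAG GAA GAA TAA
usage = glu_codon_usage("ATGGAAGAGGAAGAATAA")
print(usage)
```

  Comparing this fraction between genes required for intracellular growth and genes required for in vitro growth would be one simple way to look for the enrichment proposed above; the same approach could be extended to the Gln and Lys codons.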
  
  As such the growth defect shown in macrophages would be more convincing if the authors also show the phenotype of complementation with WT mnmA.
  
  The reviewer raises a valid point. We note, however, that Rv3023c, a putative transposase, is downstream of MnmA and unlike MnmA, Rv3023c appears to be dispensable for in vivo growth, according to the Tn-seq database (references 44 and 45). Therefore, it is likely that the intracellular growth defect is caused by loss of mnmA.
  
  An important consideration here is the universal nature of these modifications across the life forms. Any strategy to utilize these enzymes as the potential therapeutic candidate would have to factor in this important aspect.
  
  This is a valid point. Targeting a pathogen-specific system enables avoidance of the adverse side effects caused by many therapeutic reagents. There are a couple of Mtb modification enzymes that are specific to bacteria and critical for Mtb fitness (e.g., TilS). These enzymes represent ideal potential therapeutic targets to impede Mtb intracellular growth.
  
  Reviewer #3 (Public Review):
  
  The work presented in the manuscript tries to identify tRNA modifications present in Mycobacterium tuberculosis (Mtb) using reverse transcription-derived error signatures with tRNA-seq. The study identified enzyme homologs and correlates them with the presence of the respective tRNA modifications in Mtb. The study used several chemical treatments (IAA and alkali treatment) to further enhance the reverse transcription signals and confirms the presence of modifications in the bases. tRNA modifications by two enzymes, TruB and MnmA, were established by doing tRNA-seq of the respective deletion mutants. Ultimately, the authors show that MnmA-dependent tRNA modification is important for intracellular growth of Mtb. Overall, this report identifies multiple tRNA modifications and discusses their implications in Mtb infection.
  
  Important points to be considered:
  
  The presence of tRNA-based modifications is well characterised across life forms including genus Mycobacterium (Mycobacterium tuberculosis: Varshney et al, NAR, 2004; Mycobacterium bovis: Chionh et al, Nat Commun, 2016; Mycobacterium abscessus: Thomas et al, NAR, 2020). These modifications are shown to be essential for pathogenesis of multiple organisms. A comparison of tRNA modification and their respective enzymes with host organism as well as other mycobacterium strains is required. This can be discussed in detail to understand the role of common as well as specific tRNA modifications implicated in pathogenesis.
  
  The reviewer raises a fair point. However, with the exception of Chionh et al., the other studies cited here are not genome-wide characterizations of tRNA modification. Re-analysis showed that the distribution of the tRNA modifying enzymes is very similar across mycobacterium strains, e.g., Mycobacterium smegmatis, Mycobacterium tuberculosis, and Mycobacterium abscessus, suggesting that modifications related to pathogenesis in Mtb may have different physiological roles in other Mycobacterium species. We included the distribution of tRNA modification enzymes across multiple mycobacterium species in a revised Fig. 1.
  
  Authors state in line 293 "Several strong signatures were detected in Mtb tRNAs but not in E. coli". Authors can elaborate more on the unique features identified and their relevance in Mtb infection in the discussion or result section.
  
  Thank you for the suggestion. However, the identity of these RT signatures and the relevance of these modifications for Mtb pathogenicity remain speculative at this point.
  
  MnmA is shown to be essential for E. coli growth under oxidative stress (Zhao et al, NAR, 2021). Along similar lines, MnmA-deleted Mtb struggles to grow in macrophages. Is oxidative stress in macrophages responsible for slow Mtb growth?
  
  This is an excellent hypothesis which we have added to the revised manuscript (lines 320-322). “In fact, the absence of mnmA is reported to sensitize E. coli to oxidative stress, raising the possibility that s2U modification promotes Mtb growth under oxidative stress elicited by the host.”
  
  Authors state in line 311-312 "Mtb does not contain apparent homologs of the tRNA modifying enzymes that introduce the additional modifications to s2U". This can be characterised further to rule out the possibility of other enzyme specifically employed by Mtb to introduce additional modification.
  
  The reviewer raises a valid point. As discussed above (Reviewer #1, pt 2), Mtb may employ distinct enzymes to generate certain tRNA modifications. Future mass spec-based analyses of Mtb tRNAs will be carried out to identify the precise chemical structure of the sulfurated uridine, and subsequent studies will attempt to determine the enzymes that account for the biogenesis of these modifications.
  
  Kimura, S. (2021). Distinct evolutionary pathways for the synthesis and function of tRNA modifications. Brief Funct Genomics, 20(2), 125-134. doi:10.1093/bfgp/elaa027
  
  Reviewer #1 (Recommendations For The Authors):
  
  Additional data and Analyses
  
  The Modomics database is far from complete so it would be more rigorous to give the full set of genes that was used to do the searches as supplemental data.
  
  Thank you for the suggestion. We added the list of the query genes as Supplemental Table 1.
  
  Minor points to be fixed
  
  1) The authors name the psi32 synthase Rv3300c Pus9 when it is a member of the RluA family. It is not clear why the yeast/eukaryotic name was used.
  
  We included enzymes from diverse species in our query, including eukaryotic genes. Indeed, we found that Rv3300c showed the lowest E-value among our query genes; therefore, we named Rv3300c Pus9.
  
  2) The sua5 gene name was used it should be tsaC to follow the accepted nomenclature.
  
  We renamed Sua5 to TsaC2.
  
  3) The statement in lines 203-296 was totally unclear. I did not understand what the authors were trying to say at all.
  
  This paragraph described how sequence context can result in different reverse transcription-derived signatures from dihydrouridine (D). We added a schematic illustrating this paragraph as Supplemental Fig. 6.
  
  4) In reference, names with special characters should be fixed such as Börk.
  
  We fixed the names with special characters.
  
  Reviewer #2 (Recommendations For The Authors):
  
  The authors state that at least some of tRNA modifying enzymes, while redundant for growth in vitro, may play a role during growth inside the macrophages, mostly due to the diverse stresses they could encounter.
  
  We added a sentence, “In fact, the absence of mnmA is reported to sensitize E. coli to oxidative stress, suggesting that s2U modification is required for Mtb growth under oxidative stress elicited by the host” in the discussion.
  
  • Ideally, authors could have tested the impact of diverse intracellular stresses that Mtb encounters, like redox stress, nitrosative, pH or nutritional stress, to check whether any of these stresses cause in vitro growth defects in ΔmnmA strain.
  
  Thank you for the suggestion. This point will be addressed in future experiments.
  
  This would be a wonderful way to show that under stress, the essentiality of tRNA modification enzymes changes.
  
  Reviewer #3 (Recommendations For The Authors):
  
  • In general, the clarity of the presentation can be improved.
  
  • Authors state that "MiaA, is non-essential in E. coli, but apparently essential in Mtb". However, MiaA is shown to be critical for the fitness and virulence of extraintestinal pathogenic E. coli (Fleming et al, NAR, 2022). This can be clarified.
  
  We rephrased this as follows: “Unexpectedly, one modifying enzyme, MiaA, is non-essential in E. coli grown in nutrient-rich medium, but apparently …”
  
  • Line numbers 130-132 is a repetition of line numbers 103-105
  
  We repeated these sentences because the same claim was deduced from different experiments, i.e., BLAST search and tRNAseq.
  
  • Line number 228: The presence of U at position 55 in the tRNAs can be included in the text for a better understanding.
  
  We changed the text as follows: “… Termination signatures derived from position 55, which is exclusively uridine in all tRNA species, increased in most tRNA species, suggesting …”
  
  • A detailed pictorial depiction on comparing the modifications and enzymes from E. coli and Mtb can be included for easy understanding.
  
  We created an E. coli tRNA modification map in the same format as Figure 2C and added it to the revised manuscript as a new Supplementary Fig. 1.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.20.529267v2
www.biorxiv.org

New submission 07/08/2023, 09:22:16

1
1. Public_Reviews 07 Aug 2023
  
  in eLife
  
  Author Response
  
  We express our gratitude to the editors for acknowledging the significance of our findings and facilitating the review process. We would also like to thank the reviewers for dedicating their time to thoroughly read the manuscript and provide valuable insights.
  
  During the revision process, we will address the raised issues and concerns, confident that our revisions will enhance the clarity and strength of the paper.
  
  In response to the reviewers' feedback, we acknowledge that some of the relevant information was previously presented in our published papers (Meng, Dev Cell. 2017; Xia, Elife. 2021). However, we recognize that in the current version of the manuscript, we may not have expounded on these details as clearly as needed. We will rectify this shortcoming in the revised version to provide a more comprehensive account of our research.
  
  We also explain our perspective on why the discovery of MYRF controlling lin-4 upregulation is crucial in addressing unanswered key questions in developmental biology.
  
  The Loss of Function Characteristics of myrf-1(ju1121 G274R)
  
  We would like to present the evidence supporting the characteristics of myrf-1(ju1121) as a loss-of-function mutation affecting both myrf-1 and myrf-2. In our initial paper (Meng, Dev Cell. 2017), the nature of this mutation was a significant focus of our research.
  
  Our investigation involved analyzing multiple alleles (tm, ok, gk alleles from CGC, and indel alleles made in-house) of myrf-1 and myrf-2, as well as their double mutants. Here is a summary of our current understanding based on these analyses:
  
  myrf-1 single loss-of-function (l.f.) mutants exhibit penetrant arrest at the end of L1 or early L2 stages. However, they only display a very mild deficiency in DD synaptic remodeling at 21 hours, primarily caused by a delay.
  
  myrf-2 single l.f. mutants behave similarly to the wild type, exhibiting no significant developmental abnormalities, including in synaptic remodeling.
  
  myrf-1 and myrf-2 double l.f. mutants exhibit penetrant arrest during L2, occurring approximately half a stage later than in myrf-1 single mutants.
  
  Remarkably, myrf-1 and myrf-2 double l.f. mutants exhibit severe blockage in synaptic remodeling, indicating that both genes act collaboratively to drive this essential process (Meng, Figure 5).
  
  The myrf-1(ju1121 G274R) mutation exhibits severe synaptic remodeling blockage and arrest during L2, closely resembling myrf-1 myrf-2 double mutants (Meng, Figure 1 and 2).
  
  Therefore, despite myrf-1's more significant role in development based on the arrest phenotype, synaptic remodeling requires the combined function of myrf-1 and myrf-2. This redundancy is further supported by the analysis of the new set of specific myrf-1 mutants (Xia, Figure 6).
  
  Both myrf-1 and myrf-2 are broadly expressed (Meng, Figure 3 and S5), and they undergo developmentally regulated cell-membrane to nucleus translocation (Xia, Figure 4 and Supplement 1). Overexpressing N-MYRF-1 and full-length MYRF-2 in DD neurons leads to precocious synaptic remodeling (Meng, Figure 4 and 5). Interestingly, overexpressing full-length myrf-1 does not have the same effect, indicating potential regulatory differences between these two factors.
  
  The myrf-1(ju1121 G274R) mutation is located in the N-terminal region of the Ig-fold type DNA-binding domain, specifically within the loop between a and b Ig-fold strands. This site is conserved across all metazoan MYRFs (Meng, Figure 1D and 6A). The mutant myrf-1(G274R) loses its DNA binding ability, as demonstrated by a gel mobility shift assay using the counterpart residue mutation in mammalian MYRF (Meng, Figure 6B).
  
  That the MYRF-1(ju1121 G274R) mutant interferes with normal MYRF function is supported by molecular genetics experiments (Meng, Figure 6C-E) and biochemical analysis. In essence, the MYRF-1(G274R) mutant does not impact MYRF trimerization or MYRF-1-MYRF-2 interaction, but blocks DNA binding. Substantial evidence has confirmed the physical binding of MYRF-1 and MYRF-2 both in vitro and in vivo (Meng, Figure 5G and S6; Xia, Figure 1A). Importantly, MYRF-1(ju1121 G274R) is still able to bind to MYRF-2, as supported by coIP analysis (Meng, Figure S7), indicating that the G274R mutation does not disrupt the MYRF-1-MYRF-2 interaction. This observation is consistent with the characteristics of the MYRF structure (PMID: 28160598; PMID: 34345217). The critical interface of the MYRF trimer is located in the alpha-helix upstream of the ICE domain, the beta sheets of the ICE, and the beta-helix of the bridge region between ICE and DBD. Therefore, since MYRF-1(ju1121 G274R) is not situated in this critical interface of the MYRF trimer, it is unlikely that the mutation affects MYRF trimerization.
  
  Based on all available evidence, we propose a model in which myrf-1(ju1121) has two effects: rendering myrf-1 defective in DNA binding, and negatively interfering with MYRF-2 by forming a non-functional trimer consisting of MYRF-1(ju1121) and wild-type MYRF-2 monomers.
  
  Regarding the potential neomorphic function of myrf-1(ju1121), the myrf-1(ju1121)/+ individuals appear superficially wild type and show no defects in synaptic remodeling. Furthermore, we have generated a myrf-1 minigene array that results in a complete rescue of the developmental phenotype in myrf-1(ju1121) (Meng, Figure 3A-D). Notably, the transgene is expected to be low copy number, as it was generated by injecting at a very low concentration of 0.1 ng/μl. The complete rescue of the phenotype strongly suggests that any potential aberrant effects caused by the myrf-1(ju1121) mutation are minimal.
  
  In summary, myrf-1(ju1121) behaves similarly to myrf-1 myrf-2 double mutants, and we utilized this allele for the convenience of analysis.
  
  Given the essential role of MYRF-controlled processes in larval development and the lack of detectable phenotypic effects in myrf-2 single loss-of-function mutants, it is evident that myrf-2 plays a minor role in these developmental events. Considering that developmental regulation rarely follows a simple linear or cumulative fashion, deciphering the relative contributions of myrf-1 and myrf-2 to specific developmental events may not be straightforward. Consequently, our primary focus remains on investigating the functions of myrf-1.
  
  Nevertheless, we concur that providing a clear description of the impact of myrf-1 and myrf-2 single mutants on lin-4 expression is crucial. We are actively conducting ongoing analyses, and the new findings will be incorporated in the revised version of our manuscript.
  
  Characterizing myrf-1(syb1313, 1-700) as a Hyperactive Allele of myrf-1
  
  The cleavage and release of N-MYRF are developmentally regulated and occur in late L1. We have substantial evidence supporting the interaction between the non-cytoplasmic region of MYRF and another transmembrane protein, PAN-1, which is crucial for delivering MYRF onto the cell membrane (Xia, Figure 1, 7, 8, 10, 11 and 13). The myrf-1(syb1313, 1-700) mutant lacks the non-cytoplasmic region of MYRF, which is the interaction site for PAN-1. Initial analyses revealed that in the mutants, MYRF-1(syb1313) remains in the cytoplasmic, ER-like structure, resulting in larval arrest during L2 (Xia, Figure 8).
  
  However, a more careful analysis unveiled that a small amount of N-MYRF is processed and enters the nucleus, but this process is not dependent on the normal developmental timing and may take place during early-mid L1. Consequently, this leads to precocious yet discordant DD synaptic remodeling and M-cell lineage division (Xia, Figure 6 and 9). Considering the precocious development, the low quantity of nuclear N-MYRF, and the overall larval arrest phenotype observed in the mutants, we conclude that myrf-1(syb1313) represents an inconsistent, weak hyperactive form of MYRF-1. Moreover, the hyperactive function may be context-dependent. For instance, the presence of myrf-1(syb1313) may be sufficient for certain needs in neurons but insufficient for the epidermis. Our ongoing research to identify the downstream targets of MYRF also supports this notion.
  
  Given that the myrf-1(syb1313) mutant has been thoroughly characterized and published, it is the most suitable option for use in our current investigations on lin-4 expression.
  
  Furthermore, we employed the MYRF-1(delete 601-650) deletion mutant construct, which is a significantly more effective hyperactive MYRF-1 mutant when overexpressed. This reagent stems from our ongoing study, which is dedicated to identifying the self-inhibitory mechanisms of MYRF cleavage. The extensive volume of data that led to this discovery makes it impractical to include in the current manuscript. However, we are eager to share the substantial effects of MYRF-1(delete 601-650) mutants in activating lin-4 expression, which strengthens the role of MYRF in regulating lin-4. We will take care to revise this section to provide clearer references.
  
  The lin-4p::nls::mScarlet(umn84) knock-in reporter is loss-of-function for lin-4; however, lin-4 mature microRNA does not affect lin-4 expression.
  
  Indeed, the lin-4 knock-in reporter umn84 removes the lin-4 coding sequence. As a result, the homozygous reporter strain is also a lin-4 null mutant. Since both lin-4 and myrf-1 are located on Chr II and are less than 4 m.u. apart, the constructed strain is myrf-1 lin-4(umn84) / mIn1 (balanced by mIn1). Consequently, the myrf-1 homozygous animal is also homozygous for the lin-4 reporter.
  
  Regarding the endogenous function of the "auto-regulating element," we are aware of the follow-up paper by Frank Slack's group, in which they concluded that the previously reported sequence is dispensable for lin-4 expression, and the loss of lin-4 does not affect the expression of its primary transcript (PMID: 29324872). To avoid confusion, we will remove or revise the introductory sentences as necessary to accurately reflect this information.
  
  Additionally, besides analyzing the expression of the knock-in reporter of lin-4 (umn84), we also conducted a thorough analysis of mature microRNA expression using targeted qPCR and genomic analysis via microRNA sequencing. Both sets of results indicate severely defective upregulation of lin-4 mature microRNA in myrf-1(ju1121).
  
  No evidence indicates that the 2.4 kb reporter of Plin-4-gfp (maIs134) is an inappropriate reporter for lin-4 transcription.
  
  maIs134 originated from the Ambros lab, and to date, there is no evidence demonstrating that maIs134 cannot be regarded as a reliable transcription reporter for lin-4 expression. The Stec et al. (Curr Biol 2021. PMID: 33357451) paper suggests that the PCE or CE-A site (at ~ -2.8 kb) outside the 2.4 kb region confers enhancing effects for lin-4 transcription, but no other published paper has studied lin-4 transcription and cited this finding.
  
  While the Stec et al. paper provides elaborate mechanistic descriptions, the basic characterization of the importance of CE-A and blmp-1 to lin-4 expression is lacking. Deletion of CE-A in the lin-4 promoter reporter using an Ex array transgene resulted in highly variable reporter expression (Stec, Figure 4D). Notably, two high expression data points indicated that a transgene reporter without CE-A can be highly expressed, suggesting that CE-A is unnecessary for lin-4 transcription. Only when both CE-A and CE-D (within 2.4 kb) were deleted was the reporter expression significantly decreased. Moreover, deletion of CE-C (proximal region) alone caused severe loss of reporter activity, supporting that proximal CE-C is the essential element, while CE-A is not.
  
  It is important to note that the effect of CE-A on lin-4 expression has not been analyzed using stable transgenes or genetic deletions in the endogenous lin-4 region. Furthermore, there are no data on how blmp-1 mutants affect the expression of the wild-type lin-4 promoter reporter, the CE-A deletion reporter, or lin-4 mature microRNA, despite the paper’s main claim that blmp-1 boosts lin-4 expression. While CE-A can confer an enhancing effect in epidermal expression when fused to the gst-5 promoter, there are no data showing that CE-A is sufficient to drive lin-4 transcription by itself.
  
  In summary, there is currently insufficient evidence to establish whether CE-A is necessary or sufficient for regulating lin-4 expression. In fact, the data presented in Stec et al. (Curr Biol 2021) suggest that CE-A is unnecessary for lin-4 expression. As such, we do not see any reason to consider the 2.4 kb reporter in maIs134 as inappropriate for analyzing lin-4 transcription. Furthermore, our presented data using the knock-in reporter of lin-4 (umn84) demonstrated that its regulation by myrf is essentially consistent with the observations drawn from the maIs134 analysis.
  
  The Significance of the Finding: MYRF Regulating lin-4 Upregulation
  
  We are grateful that the editors find our results valuable for those interested in lin-4 expression. However, we acknowledge that the editors may not share the same enthusiasm as we do, seeing this as a landmark discovery in understanding postembryonic development, a fundamental question in the field of developmental biology.
  
  Importance of Understanding lin-4 Upregulation in Development
  
  The foundation of developmental biology has been built on the principles derived from studying embryonic development in model organisms like Drosophila, exemplified by the work of the Nobel laureates Lewis, Nüsslein-Volhard, and Wieschaus. These principles explain what occurs during embryonic development, including pattern formation, morphogenesis, and differentiation. However, these existing principles do not fully explain the phenomena of postembryonic development, including growth. For instance, during C. elegans development in L1, it remains unclear what controls the initiation of P cell division. Even if we exclude dividing cells from the discussion, numerous stage-specific changes occur in non-dividing cells, including neurons. The extensive, systematic expression studies of transcription factors in C. elegans have failed to provide evidence that such developmental progression is driven by sequential activation of transcriptional cascades, as commonly observed during embryonic differentiation. A different way to approach a similar question is to ask how developmental timing is controlled, e.g., "why does it take a boy 12 years to reach adolescence?" This perspective highlights the need to identify as yet unidentified checkpoints that control postembryonic stages (an example of an insightful review: The Systemic Control of Growth. Cold Spring Harb Perspect Biol. 2015. PMID: 26261282).
  
  The upregulation of lin-4 represents a system’s checkpoint during postembryonic development. Deciphering the mechanism controlling lin-4 expression is instrumental in understanding the principles of postembryonic development, even extending to adult development, including life span control.
  
  Importance of the Finding: MYRF's Control of lin-4 Upregulation
  
  To date, no other essential, positive regulator of lin-4 transcription has been identified, although several negative regulators have been reported. A landmark paper by Victor Ambros identified FLYWCH as a repressor of lin-4 expression during embryogenesis (PMID: 18794349). FLYWCH mutants fail to progress to normal hatched larvae, implying that FLYWCH is crucial. The paper indeed suggested that FLYWCH has additional functions beyond suppressing lin-4, although these functions have not been thoroughly characterized. The significance of the FLYWCH finding lies in the elaborate control during the transition from embryo to larval development, where lin-4 is actively suppressed. This control may ensure the robustness of subsequent lin-4 activation. The process during the embryo-to-larva transition, as well as the counterpart process in mammalian perinatal development, remains poorly understood.
  
  Another negative regulator of lin-4 is lin-42, as reported in three papers in 2014 (PMID: 25319259; PMID: 24699545; PMID: 25032706). Lin-42 negatively regulates lin-4 expression, despite the main focus of the papers being lin-42's repression of let-7. However, the precise mechanisms by which this repression is achieved are not fully understood.
  
  Amy Pasquinelli's lab conducted a genome-wide screen to identify factors responsible for driving lin-4 upregulation but did not identify a critical factor that promotes lin-4 transcription (PMID: 20937268).
  
  The recent paper by Stec et al. (Curr Biol 2021. PMID: 33357451) reported blmp-1's role in enhancing lin-4 expression. However, the significance of blmp-1 in regulating lin-4 remains vaguely described, despite a large amount of data describing elaborate epigenetic controls. The paper did not provide data on how endogenous lin-4 expression is affected in blmp-1 mutants, nor did it demonstrate how full-length reporter expression is affected in blmp-1 mutants. The only relevant data appear to be on the CE-A-gst-5 promoter reporter in blmp-1 mutants. As a result, it remains unclear how blmp-1 affects lin-4 transcription.
  
  In summary, no single factor had previously been identified whose loss leads to significant deficiencies in lin-4 upregulation. MYRF is the first critical factor identified in this context. This finding represents a significant advancement in our understanding of lin-4 regulation and its crucial role in development.
  
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.22.546200v3
www.biorxiv.org www.biorxiv.org

New submission 02/08/2023, 08:13:53

1
1. Public_Reviews 07 Aug 2023
 
 in eLife
 
 Author Response:
 
 We thank eLife for carrying out the peer review of our preprint. In this letter, we will provide a response to the eLife assessment, and the editor’s public review, and will also address the major points raised in the peer-review of our study.
 
 First, we wish to inform the readers that including this review, our manuscript has now been reviewed 5 times. These have included three reviews at an earlier journal, a review at eLife under the older model, and the current review at eLife under the new model. In an effort to provide transparency and increase the reader’s confidence in our study, all the prior reviews and our rebuttals to them have been uploaded to bioRxiv and are publicly available for all readers to peruse [1]. These reviews will show that we have responded comprehensively with additional data and analyses over the last 3 years. Of the current reviewers, Reviewer #1 (who was also Reviewer #1 at the earlier journal) has reviewed our manuscript all 5 times. At the prior journal, an additional Reviewer (#2) carried out 3 cycles of review – and we responded fully and comprehensively to all the issues and comments of that Reviewer. It is our understanding that the prior Reviewer #2 did not respond to the review request from eLife, after which eLife recruited two new Reviewers (current Reviewers #2 and #3), who have now reviewed our work twice – once under the older model and now again under the newer model.
 
 Next, to ease readability, we will respond to the review in three parts. Part A will be dedicated to the editors’ public review. Part B will be dedicated to the response to eLife assessment, and we will respond to the reviewers’ comments in Part C.
 
 Part A: Response to editor’s public review: We thank the editor for his nuanced and fair read of our data and our inferences, and of the multiple back-and-forth cycles of reviews and rebuttals. The editor’s public review highlights key points put forth in our data, and succinctly discusses the evidence provided for our claims. Here, we respond to each of these highlights.
 
 (i) The editor agrees that subject to the broader limits of lineage fate-mapping experiments, which are universal for every prior and current study of vertebrate development, we have provided sufficient evidence for the presence of a population of cells within the myenteric ganglia, which shows mesodermal and not neural crest derivation, and which expresses the pan-neuronal marker Hu among other neuronal and mesenchymal/mesodermal markers.
 
 Given that the currently accepted annotation for enteric neurons depends on their expression of pan-neuronal markers (which we show are expressed by MENs), expression of neurotransmitter-encoding genes and proteins (such as CGRP, NOS1, ChAT, etc, which we show are expressed by MENs), and their localization within the enteric plexuses (we show evidence of intra-ganglionic localization of MENs in the myenteric plexus), our data suggest that in describing MENs, ours is the first report describing the presence of a mesoderm-derived neuronal population in a significant neural tissue. By virtue of the continual expansion of the MENs population with maturation and aging, we show evidence that MENs contribute to the post-natal maturation and aging of the enteric nervous system (ENS), and that by reducing the proportions of MENs in aging tissue, we can rejuvenate the ENS to normalize gut function in aging mice.
 
 (ii) The editor comments on whether beyond the accepted norm of their intraganglionic localization and expression of pan-neuronal markers, MENs can be described as functional neurons. We agree that in our manuscript, we did not test how MENs function. This is expressly because the current report is the first step in the study of MENs and does not aim to understand how MENs regulate various gut functions. In this response however, we wish to put forth a few arguments that would clarify some of the existing evidence on the functional nature of MENs as well as the current state of knowledge on ENS functions. These would help the readers understand the current evidence on the functional nature of MENs, and in addition, why it would be premature to expect MENs to exhibit canonical neuronal behavior.
 
 a. MENs generate neurotransmitters and neuropeptides: Enteric neurons release various neurotransmitters, and their ability to generate important neurotransmitters such as nitric oxide (NO) and acetylcholine depends on their expression of enzymes Nitric Oxide Synthase 1 (NOS1) and choline acetyltransferase (ChAT). Our work shows that sub-populations of MENs express these important neurotransmitter-generating enzymes (Fig 3). Further, our data also shows that MENs express CGRP, which is an important neuropeptide for regulating various gut functions (Fig 3). These important data show that at the protein level, many MENs have the same cellular machinery as that of NENs that can help carry out regulation of important gut functions.
 
 b. MENs have been shown to be functional in a prior study: Recently, enteric neurons have been shown to carry out significant immunomodulatory functions. These have included the expression of cytokines such as IL-18, which regulates the intestinal barrier (as shown by Jarret et al. [2]), and CSF1, which regulates macrophage recruitment [3]. Jarret et al. show that the enteric neuron-derived IL-18 regulates immunity at the mucosal barrier. We show that the IL-18-expressing enteric neurons are MENs (Fig 4), and thus, the data from Jarret et al. [2] provide evidence that MENs are indeed functional in the in vivo environment.
 
 c. We do not quite know how many enteric neurons work at the electrophysiological level: Canonical vertebrate neurons exhibit resting membrane potentials (RMP) in the range of -70 to -80 mV, and during neuronal activation, an increase in membrane potential beyond the threshold of -55 mV activates their action potential [4]. By contrast, past and recent studies have shown that the average RMP of rodent and human enteric neurons is significantly more positive than -70 mV (for human ENS: -48 ± 8 mV, for mouse ENS: -46 ± 6 mV for S neurons, -56 ± 5 mV for AH neurons) [5, 6]. These data suggest that enteric neurons show significant departures from canonical neuronal behaviors and thus, expecting MENs to adhere to canonical neuronal behavior – when most of the ENS does not adhere to expected norms - would be incorrect.
 
 d. A neuron is not defined by its ability to generate an action potential: Neuronal behavior does not require the presence of action potentials, as observed in the neurons of C. elegans [7]. Conversely, action potentials are not restricted to neurons, as they also occur in non-neuronal cells, including the enteroendocrine cells of the mammalian gut [8]. Thus, the presence or absence of action potentials cannot be the basis for adjudicating whether or not a neurotransmitter-expressing cell in a neural tissue is a functional neuron.
 
 (iii) The Editor, after reading the extensive prior and recent correspondence between the authors and the reviewers on whether the cells analyzed in the transcriptomic experiments are the same as those observed in tissues (called tissue MENs by a reviewer), opined that he found “the authors' assertions that they have described a cluster of cells that express both neuronal and mesodermal genes, and that this cluster corresponds to the tissue MENs described in lineage tracing, to be broadly sound”.
 
 We are enthused by the Editor’s opinion, as we had previously argued that our data connecting the transcriptomic data to tissue MENs is robust on the basis of extensive immunohistochemical validations of marker genes found in our single cell transcriptomic analyses. The Editor notes some confusion on why some marker genes not specific to MENs were used for the analyses and further points to the prior rebuttals we have posted on Biorxiv [1], where detailed clarifications on the choice of marker genes have been made. In the interest of readability, we direct the readers to these prior rebuttals at Biorxiv for more details. Succinctly, we initially tested canonical neuronal genes by immunolabeling (such as NOS1, ChAT, CGRP, etc) in NENs and MENs before performing single cell transcriptomic experiments. After performing the transcriptomic experiment, we next chose to validate neuronal and mesenchymal genes that were found expressed in the MENs cluster (such as DCN, SLPI, IL-18, NT-3, etc). Finally, in previous cycles of review, on the reviewer’s insistence, we included data on the expression of a host of neuronal genes and their encoded proteins (including Vsnl1, Pde10a, etc) to provide further evidence of neuronal identity of MENs.
 
 Without a sufficiently large cluster of NENs, it is impossible to know from our transcriptomic data whether a gene expressed by MENs would be similarly expressed by NENs. It is also important to note that lack of detection of a gene in the single cell experiments cannot be inferred as lack of its expression in those cells, and hence, our inferences on whether any marker gene was exclusively expressed by neurons of a particular lineage were determined by immunohistochemistry. Additionally, we wish to reiterate and inform the readers that our study provides detailed analysis of prior work by May-Zhang et al [9], where they have described a small cluster of Phox2b-expressing cells from the murine myenteric plexus that shows the expression of neuronal and mesenchymal markers. Our analyses show that the transcriptomic profile of MENs matches the molecular signature of these cells. In the longitudinal muscle – myenteric plexus layer, only glial cells and neurons express Phox2b [10], suggesting that the cells in this cluster sequenced by May-Zhang et al are cells of the myenteric plexus. We provide evidence that the majority of the MENs were left unsequenced by May-Zhang et al and that this minimized the representation of MENs in their data (Fig 5). These data together provide important confirmation of our argument that the transcriptomic MENs point to no other cell type but the tissue MENs.
 
 (iv) The Editor opines that a weakness in our current data is the significant overrepresentation of MENs in the single cell experiment, while also noting that our “explanation - that some cells are more sensitive to manipulations required to prepare cells for sequencing - is certainly well-represented in the literature and is therefore plausible….But it isn't fully satisfactory”. In our prior arguments (as well as in Part C), we have provided explanations based on prior observations that the disproportionate representation of cell types is a technical limitation of the single cell transcriptomic methodology, which is prevalent in other experimental conditions for ENS (including the gut cell atlas study by Elmentaite et al [11]), and for other cell types in various organs. Due to this limitation, proportions of cells in the single cell space should not be inferred as their proportions in tissues. We also agree with the Editor that owing to the low representation of NENs, our data do not allow for a detailed comparison of the similarities and differences between the neurons of the two lineages, and that “an ideal analysis would have more cells, deeper sequencing, and comprehensive validation of the identity of each cluster of cells.” While in this study our aim was to describe the existence of MENs and not to perform an in-depth characterization of their sub-populations, we agree that this is the logical next step in creating a better understanding of the true diversity of ENS neurons. To that end, we are currently developing methodologies to allow for deeper and more comprehensive analyses and validation of the various MENs populations, and to study how they differ from NENs. We aim to publish these data in our next study.
 
 (v) We agree with the Editor’s assessment on our transcriptomic data that “these data and analyses bolster the authors' claims, without conclusively establishing them. That is, these data should neither be dismissed nor, on their own, considered definitive.” We have only used our single cell transcriptomic data to provide additional support for our claims (which are based on extensive lineage fate mapping and immunohistochemical analyses) and are not using these as stand-alone definitive proof of a mesodermal origin. The data from the transcriptomic experiments were used to learn additional molecular markers, whose expression in MENs in tissue could be tested by immunohistochemistry. With this methodology, we provide data on the co-expression of neuronal and mesenchymal markers by MENs, and test by computational analyses whether a similar neuronal population exists in other murine and human transcriptomic datasets.
 
 In addition, we completely agree with the Editor that “at this stage in the history of single-cell analysis, the criteria for using single cell sequencing data to establish cell type and cell origin is are not well established, and that neither the presence nor absence of specific sets of genes in single cells should not, for both technical and biological reasons, be considered dispositive as to identity.” We are very mindful of this limitation of these analyses and hence have continually ensured that our study only uses transcriptomic data of postnatal MENs to define a preliminary molecular signature of MENs, and not to infer developmental origins of MENs.
 
 (vi) We thank the Editor for his summary and for highlighting that despite using multiple lines of evidence to support our hypothesis, the current reviewers are not yet convinced of the mesodermal origin of MENs. Our study utilizes well established tools for lineage fate-mapping (which are the only tools that currently are widely disseminated and accepted in the field of developmental biology) to show that MENs are not derived from the (Wnt1-cre, Pax3-cre -expressing) neural crest and instead are derived from the (Mesp1-cre, Tek-cre -expressing) mesoderm. The reviewers agree that by using multiple lines of evidence, we have established that our results of lineage fate-mapping are real and not due to any artifact. With this rationale, the reviewers would agree that MENs observed in tissue do not show evidence of derivation from neural crest while showing evidence of derivation from the mesoderm. Despite this, we cannot ascertain the scientific rationale for why despite agreeing with our lineage fate-mapping methods and analyses, the reviewers remain unconvinced as to the developmental origins of MENs. We do not know what other experiment would pass the reviewers’ muster to definitively annotate the mesodermal origins of MENs.
 
 We wish to highlight that a recent study in ctenophores, where the investigators show evidence of a syncytial neural net [12], shows that much of the dogmatic view of how neurons are supposed to work is being overturned and newer paradigms that support broader interpretations for the definitions of neurons and how they regulate functions are being established. Our work on the developmental origins of a large population of neurons of the ENS, which is regarded as a primordial and conserved neural tissue, should be viewed in a similar vein.
 
 Part B: Response to eLife assessment: Ours is the first report on the mesodermal derivation of a large population of neurons in a significant nervous system in mammals. We show that this population of neurons, called MENs, is molecularly distinct from the canonical neural crest-derived lineage of neurons, and that the post-natal ENS shows evidence of increasing presence of MENs as it matures and ages. We show that the two neuronal lineages are sensitive to their own growth factors, which can be used to manipulate their proportions in tissue, and thereby provide a potential rejuvenating therapy for age-associated intestinal dysmotility. We also show that on the basis of MENs’ marker expression, MENs may be present in the human ENS, and that disproportionate changes in their proportions are associated with chronic gut dysmotility disorders. Our work has profound implications in multiple fields, including those of enteric and peripheral neurobiology, developmental biology, medicine, and aging. We are thankful that the eLife assessment found that we provide sufficient evidence for this important work.
 
 Part C: Response to Reviewers: Here, we wish to note that all the comments of the reviewers have been sufficiently addressed in prior reviews. All prior reviews, and our extensive rebuttals are available at our preprint for the readers’ perusal [1]. In this response, we wish to succinctly address some comments that have continued to emerge in this round of peer-review.
 
 (i) We wish to highlight that Reviewers 1 and 2 agree that our lineage fate-mapping experiments are correct and that the results are not due to any artifact. Together with the additional reviewer from the prior review at an earlier journal, whose comments were addressed in full, a total of three reviewers agree that our results on lineage fate-mapping are robust. Reviewer 3 comments on the possibility of ‘cre mosaicism’ or the deleterious issues with long-term expression of cre. Our prior rebuttals have dealt with this comment at length, but succinctly, our results (a) are based on extensive cre and floxed reporter controls for both the lineages, and (b) replicate observations made by other labs, including the Pachnis, the Heuckeroth, and the Southard-Smith labs, which provides confidence that these are not due to any artifacts in cre or reporter gene expression. Finally, cre in the two lineage fate-mapping systems (Wnt1-cre and Mesp1-cre) is only developmentally expressed and thus, there is no reasonable possibility that our results would be impacted by long-term expression of cre. Thus, our results and inferences on lineage fate-mapping, which is central to our annotation of the two distinct developmental lineages, correctly describe the developmental origin of MENs.
 
 (ii) By using extensive immunolabeling for (~21) markers that were learnt from our transcriptomic experiments, we provide evidence of the firm connection between the cluster of cells we annotated as MENs in the single cell transcriptomic experiments and the MENs we observe in tissues. Thus, we have performed more validation for these neurons than any other studies that have traditionally used 2 - 3 markers to validate a cell cluster in the ENS.
 
 In addition, by providing evidence of the expression of the pan-neuronal marker Hu, of other ENS markers that include NOS1, ChAT, CGRP, etc., and of ~40 neuronally significant genes, we have established the neuronal nature of MENs. With regard to the annotation of MENs as neurons, we expected and understand the confusion in the field with our discovery of mesoderm-derived neurons that coexpress neuronal and mesenchymal markers. We wish to put forth the following arguments for the readers to consider.
 
 a. The annotation of Hu-expressing cells within the myenteric ganglia has been traditionally accepted as an enteric neuron. In those terms, by virtue of their intra-ganglionic presence and expression of Hu (and our data shows that Hu antibodies do not discriminate between the three neuronal isoforms of Hu) and other neuronal markers such as NOS1, ChAT, and CGRP, MENs should be annotated as neurons. We had addressed the semantic nature of this question in our last rebuttal (review #3, reviewer 1), which is available on the preprint [1].
 
 b. As the molecular data on MENs suggest that they have significantly different biology, it would not be unreasonable to expect that their neuronal behavior may be quite different. This is underscored by the fact that many of the MENs we observe lack expression of the protein SNAP-25, whose presence is thought to be central to canonical neuronal behavior. We also cite evidence that neurons without SNAP-25 expression occur in the CNS as well. In light of these discoveries, gauging the biology and neuronal behavior of MENs is a significant undertaking, as it cannot be assumed that the behavior of MENs will be similar to that of NENs.
 
 c. It is not logical to say that “Expressing one of the Hu proteins (Elavl2) probably isn't enough to call these "neurons" especially when neurons usually express Elavl3-4 (HuC/D)” especially when there are currently no antibodies to discriminate between the three neuronal gene products.
 
 d. While at the outset it may be an easy proposition to suggest that we provide evidence of neuronal activity in MENs by calcium flux or by electrophysiological means, it is important to know that calcium flux exists in all cells of the gut wall, including smooth muscle cells, enteric glia, and neurons, and thus studying calcium flux will not provide definitive proof of neuronal behavior in MENs. Further, we reiterate from Part A of this response letter that “neuronal behavior does not require the presence of action potentials, as observed in the neurons in C. elegans [7], much in the same way that the presence of action potentials is not restricted to neurons as it occurs in non-neuronal cells, including in enteroendocrine cells of the mammalian gut [8]. Thus, the presence or absence of action potentials cannot be the basis for adjudicating whether or not a neurotransmitter-expressing cell in a neural tissue is a functional neuron.”
 
 (iii) Our identification and validation of the molecular identity of MENs using single cell transcriptomic experiments helps us establish the congruency of our cell cluster with a similar cluster of enteric neurons previously observed by the Southard-Smith lab in their analyses. Thus, similar to our observations on the lineage fate-mapping models, observations on our transcriptomic data are also in line with the observations made by other labs in the field.
 
 (iv) To address any remaining confusion in the minds of the reviewers and of the readers about the correct methodology for interpreting single cell transcriptomic data and the limitations of this technique, we wish to reiterate that:
 
 a. Single cell or nucleus RNA sequencing methods are biased towards sequencing transcripts that are abundant relative to all other transcripts for that individual cell (detection and amplification bias). Thus, while the same transcript may be equally expressed at an absolute level in two different cells, it will be more readily sequenced and detected in the cell where the transcript is relatively more abundant.
 
 b. Correct interpretation of single cell/nucleus transcriptomic data relies on an understanding that not all transcripts of a cell can be sequenced and detected, and thus a lack of detected transcripts in a cell does not imply absent gene expression. Together this shows the fallacy of an argument often put forth by the reviewers that a lack of detection of a gene transcript (e.g., Phox2b) in MENs in a scRNAseq experiment should be inferred as a lack of expression of this transcript, even though we provide evidence of the expression of PHOX2B protein in MENs, and of the expression of this transcript in the MENs in the data from the Southard-Smith lab.
 
 c. scRNAseq is not a technique where annotation of a previously unknown cluster should be biased by the detection of expression of one or two genes, and instead establishing identity or conferring novel annotation of that cluster is defined by co-expression of several genes which must be validated in tissue.
 
 d. It is well known that enzyme-based dissociation methods are unequally tolerated by diverse cell types, which causes over- or underrepresentation of several cell types in scRNAseq (Uniken Venema et al. [13] showed that the dissociation method drives the detection and abundance of the cells sequenced; Wu et al. [14] showed the existence of a similar dissociation bias in the kidney; Tiklova et al. [15] showed that specific subpopulations of Dat-expressing neurons in the developing mammalian brain were underrepresented in scRNAseq). The Gut Cell Atlas study (Elmentaite et al. [16]) was not able to detect NENs in the adult intestinal tissue. The lack of detectable canonical enteric neurons (NENs) in the adult tissue in their study should not be viewed as an absence of NENs in those tissues, and by the same logic, a restricted abundance of NENs and a larger abundance of MENs in our dataset cannot and should not be viewed as a reliable indicator of their actual proportions in tissues. The aim of our study is not to provide a comprehensive molecular atlas for all cells that reside in the LM-MP tissue layer, but to use the information in this atlas to identify a cell cluster that best describes MENs, and then use additional tools to validate this information.
 
 e. Without extensive validation by immunohistochemical or other means, detection of transcripts of a particular gene ‘Z’ (which is known to be expressed in cell type ‘X’) in a particular cell cluster ‘A’ of a single cell transcriptomic dataset does not directly imply that cell cluster ‘A’ points to cell type ‘X’. Thus, the detection of transcripts of the gene Wt1 (which is known to be expressed in mesothelial cells) in MENs does not in itself mean that the MENs cluster comprises mesothelial cells. It simply suggests that, in addition to its expression in mesothelial cells, the Wt1 gene is also expressed by MENs, an inference which is supported by data that show the expression of LacZ in myenteric ganglia cells in the WT1-cre transgenic mouse (Wilm et al., 2005 [17]).
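 
 Points a and b can be illustrated with a toy calculation. The sketch below assumes a deliberately simplified capture model in which a fixed number of molecules is sampled per cell, independently and with replacement; the function name and all numbers are hypothetical and are not taken from any dataset. It shows only that an identical absolute copy number can yield very different detection probabilities when the total transcript pool differs.

```python
def detection_probability(gene_copies, total_transcripts, captured_molecules):
    """Probability that at least one captured molecule comes from the gene.

    Assumption (hypothetical model): each captured molecule is drawn
    independently, with replacement, from the cell's transcript pool.
    """
    fraction = gene_copies / total_transcripts
    return 1.0 - (1.0 - fraction) ** captured_molecules


# Same absolute expression (10 copies) and same capture depth,
# but the gene is relatively rarer in the cell with the larger pool.
small_cell = detection_probability(10, 100_000, 5_000)
large_cell = detection_probability(10, 500_000, 5_000)
print(f"small transcript pool: {small_cell:.2f}")
print(f"large transcript pool: {large_cell:.2f}")
```

 Under these assumed numbers the gene is detected in roughly 39% of cells with the small pool and roughly 10% of cells with the large pool, although every cell expresses it. A zero count is therefore weak evidence of absent expression.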
 
 (v) Our study has performed two scRNAseq studies, first to establish the distinct molecular signature of MENs, and second to provide transcriptomic evidence of MENs-genesis. In the last and current review, Reviewer 2 opines that we should perform an additional single cell RNA sequencing experiment just to show that the MENs cluster is represented in the mesoderm-enriched transcriptomic data. There is no doubt that, owing to the expression of various mesodermal markers that we show are expressed by MENs (both transcriptomically in scRNAseq and at the level of proteins in tissues), the cluster of MENs is mesodermal in origin. Thus, we have already provided evidence and met a higher burden of proof on the mesodermal identity of MENs, and we do not consider the costly scRNAseq experiment proposed by the reviewer a definitive experiment that would justify the time or the cost.
 
 (vi) Our prior rebuttals have provided the reviewers with evidence that shows that our study has used standard bioinformatic pipelines to analyze our data, and our inferences of the transcriptomic data are sound and well validated by additional methods.
 
 (vii) Many comments of the reviewers that required textual edits were already carried out after the prior review at eLife. While a revised version of our manuscript was submitted to eLife for the current review, it is unfortunate that the reviewers have not updated many of their comments. For the sake of brevity, we will not be responding further to the comments that we have already addressed at length in prior rebuttals or in the form of textual edits.
 
 References
 
 1. Kulkarni, S., et al., Age-associated changes in lineage composition of the enteric nervous system regulate gut health and disease. bioRxiv, 2022: p. 2020.08.25.262832.
 
 2. Jarret, A., et al., Enteric Nervous System-Derived IL-18 Orchestrates Mucosal Barrier Immunity. Cell, 2020. 180(1): p. 50-63 e12.
 
 3. Muller, P.A., et al., Crosstalk between muscularis macrophages and enteric neurons regulates gastrointestinal motility. Cell, 2014. 158(2): p. 300-13.
 
 4. Chrysafides, S.M., S.J. Bordes, and S. Sharma, Physiology, Resting Potential, in StatPearls. 2023: Treasure Island (FL).
 
 5. Yew, W.P., et al., Electrophysiological and morphological features of myenteric neurons of human colon revealed by intracellular recording and dye fills. Neurogastroenterol Motil, 2023. 35(4): p. e14538.
 
 6. Furukawa, K., G.S. Taylor, and R.A. Bywater, An intracellular study of myenteric neurons in the mouse colon. J Neurophysiol, 1986. 55(6): p. 1395-406.
 
 7. Liu, Q., G. Hollopeter, and E.M. Jorgensen, Graded synaptic transmission at the Caenorhabditis elegans neuromuscular junction. Proc Natl Acad Sci U S A, 2009. 106(26): p. 10823-8.
 
 8. Gribble, F.M. and F. Reimann, Enteroendocrine Cells: Chemosensors in the Intestinal Epithelium. Annu Rev Physiol, 2016. 78: p. 277-99.
 
 9. May-Zhang, A.A., et al., Combinatorial Transcriptional Profiling of Mouse and Human Enteric Neurons Identifies Shared and Disparate Subtypes In Situ. Gastroenterology, 2021. 160(3): p. 755-770 e26.
 
 10. Corpening, J.C., et al., A Histone2BCerulean BAC transgene identifies differential expression of Phox2b in migrating enteric neural crest derivatives and enteric glia. Dev Dyn, 2008. 237(4): p. 1119-32.
 
 11. Elmentaite, R., et al., Cells of the human intestinal tract mapped across space and time. Nature, 2021. 597(7875): p. 250-255.
 
 12. Burkhardt, P., et al., Syncytial nerve net in a ctenophore adds insights on the evolution of nervous systems. Science, 2023. 380(6642): p. 293-297.
 
 13. Uniken Venema, W.T.C., et al., Gut mucosa dissociation protocols influence cell type proportions and single-cell gene expression levels. Sci Rep, 2022. 12(1): p. 9897.
 
 14. Wu, H., et al., Comparative Analysis and Refinement of Human PSC-Derived Kidney Organoid Differentiation with Single-Cell Transcriptomics. Cell Stem Cell, 2018. 23(6): p. 869-881 e8.
 
 15. Tiklova, K., et al., Single-cell RNA sequencing reveals midbrain dopamine neuron diversity emerging during mouse brain development. Nat Commun, 2019. 10(1): p. 581.
 
 16. Elmentaite, R., et al., Single-Cell Sequencing of Developing Human Gut Reveals Transcriptional Links to Childhood Crohn's Disease. Dev Cell, 2020. 55(6): p. 771-783 e5.
 
 17. Wilm, B., et al., The serosal mesothelium is a major source of smooth muscle cells of the gut vasculature. Development, 2005. 132(23): p. 5317-28.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2020.08.25.262832v4
www.biorxiv.org www.biorxiv.org

New submission 30/06/2023, 13:11:59

1
1. Public_Reviews 01 Aug 2023
 
 in eLife
 
 Author Response
 
 Many thanks for the detailed and sometimes sharp, yet appropriate criticism of our study. It was an incentive for us to carry out additional analyses and to devote more effort to an elaboration of concepts. The outcome is that the results have changed slightly and that we now give more space to a discussion of concepts. We first address here the points raised by more than one reviewer before responding to comments contributed by individual reviewers.
 
 The points raised can be divided into three thematic groups: 1) conceptual issues, 2) experimental and analytical questions, and 3) comments challenging the novelty of our results. On the first theme, we think it is essential to make a clear distinction between the conceptual and observational domains. As such, the criteria defining a “mirror neuron” and what is meant by the term "mirror mechanism" belong to the conceptual domain. This understanding of terms requires agreement among scientists, but is not experimentally testable. Unfortunately, there is no agreement on how to define a “mirror neuron” and what is meant by “mirror mechanism”. Thus, for the present work, the only option is to refer to specific definitions or to use our own definitions, which try to capture what others, and here most importantly Rizzolatti and colleagues, probably meant. We have adjusted the introduction in an attempt to convey our understanding and usage of the two terms in a hopefully comprehensible manner. Briefly, we use a definition for "mirror neuron" that we take from the first paragraph of the results section of Gallese et al. (Brain, 1996). We do not consider the "properties of mirror neurons" described in that paper as defining a mirror neuron (MN). Classifying neurons as MNs only on the basis of a modulation of discharge rate during an executed and an observed action compared with a baseline is common practice in other single neuron studies on MNs as well, and is consistent with this definition. Regarding "mirror mechanism", we refer to Rizzolatti and Sinigaglia (2016) and make a distinction between a broad and a strict definition. 
Given our finding that there are almost no F5 MNs whose activity during observation is a motor representation according to our strict definition of a mirror mechanism, and also given the problem that the term “mirror mechanism” itself is not uniformly understood, the question arises whether and how the term "mirror neuron" should be used in the future. The answer to this may vary and belongs to the conceptual domain. We briefly address this question at the end of the discussion of the revised manuscript.
 
 Conceptual hypotheses are to be distinguished from that understanding of terms; they must of course allow experimental predictions, i.e., must be falsifiable. We now distinguish more clearly between a "representation hypothesis" and an "understanding hypothesis". Both hypotheses focus on F5 MNs and are based on the strictly defined mirror mechanism. We test the “representation hypothesis” in our study, and precisely because it is the basis for the “understanding hypothesis”, falsifying the “representation hypothesis” would allow us to conclude that the “understanding hypothesis” is not valid. In contrast, confirmation of the “representation hypothesis” would not, of course, allow us to conclude that the “understanding hypothesis” holds. That would really be circular reasoning (this conclusion was drawn by some and rightly criticized). However, support for the “representation hypothesis” would be the necessary prerequisite for the “understanding hypothesis” to be true. These two hypotheses take up the original argument that a certain understanding of observed actions could follow from an equality of action-specific F5 MN activity during execution and observation. Because we considered the data on equality of action-specific F5 MN activity to be insufficient, we designed this study. Since our result largely argues against the "representation hypothesis" and thus against the "understanding hypothesis," we now discuss alternative concepts for the function of F5 MNs in more detail. It should be noted here that our fourth concept ("goal-pursuit-by-actor") could well represent the observed action without contradiction to our broad definition of a mirror mechanism, which in principle could also serve a subjective experience (which could be conceived as a kind of understanding). The way we structure the concepts in the discussion of this revised manuscript is, in our opinion, a useful overview of the concepts. The third concept is new in this context. 
We would like to emphasize that we focus on F5 MNs and intentionally avoid a discussion of mirror neurons beyond F5 in this paper. With the data from this study, we cannot say anything about MNs outside of F5.
 
 Regarding the key question of how the "understanding hypothesis" is testable, or whether it may not be testable at all, we agree, of course, that for the conclusion of whether F5 MNs contribute to perception, only a manipulation of F5 MNs can clarify it. We now say that explicitly in the introduction. We agree with reviewer #2 that "understanding" here is not limited to "action recognition" or "action categorization", which in principle could be implemented by purely sensory processing. Therefore, we also do not believe that the approach proposed by reviewer #3, which builds on the distinction of actions, would allow for a critical examination of the "understanding hypothesis". But we disagree that the "understanding hypothesis" is not testable at all. Operationalization is necessary. If we accept that we can measure certain visual or auditory perceptions of an animal by operationalization (e.g., the subjective visual vertical, see for example Khazali et al., PNAS, 2020), then we must also accept that we can, in principle, measure other subjective experiences by operationalization, such as pain or aiming at a goal or even the co-experience of pain. An example of how to approach this is the study by Carrillo et al. (Curr Biol, 2019), which reviewer #2 and colleagues discussed in a recent review article (Bonini et al., TCS, 2022).
 
 With regard to the second theme, experimental and analytical questions, we noticed while reading the comments that in our first version we did not distinguish clearly enough between statements about single neurons and statements about populations of neurons. Therefore, we now clearly separate single neuron analysis and population code analysis in the structure of the article. In view of the fact that statements about mirror neurons in the literature mostly refer to single neurons, we added extensive single neuron analyses, so that only now statistically reliable statements about single neurons are possible. This has led to the realization that the number of neurons with exclusively shared code is so small that these neurons should be considered a rare exception. Given the small number of time periods with shared code, we additionally tested against a hypothesis already rightly proposed as an alternative explanation by G. Csibra in 2005 (Mirror neurons and action observation: Is simulation involved? In: What do mirror neurons mean? Interdisciplines Web Forum 2005). We were able to reject this hypothesis based on two of three methods for testing for a shared code. This is the second piece of evidence besides the clustering of time periods with shared code already described in the first version that time periods with shared code cannot be considered random.
 
 We discuss in more detail the question of whether neurons that exhibit a shared code at least at times support the representation hypothesis. To this end, we additionally examined whether certain action segments are more frequently represented with a shared than with a non-shared code, whether neurons with shared code differ from those with non-shared code in anatomical location, and whether population cross-task classifiers restricted, time bin by time bin, to neurons with shared code can reach the accuracy that within-task classifiers reach on the whole population.
 
 Another issue was how to test for shared code and how to decide if a code has enough sharing. To answer the question, the exact hypothesis we intended to test here is crucial. The representation hypothesis states that the representation of the observed actions in F5 MNs corresponds to the representation as it occurs during the execution of the same actions. Therefore, the relationship between discharge rate and actions that holds during execution should also hold during observation, which is measurable with a classifier trained on execution trials and tested on observation trials. Moreover, the actions should not be more distinguishable during observation with a classifier other than the execution-trained classifier, because if that were so, it would mean that the representation of observed actions is different from that of executed actions. The detection of a cluster of time bins for which both conditions are satisfied confirms that it is possible to discover in this way the shared codes postulated by the representation hypothesis.
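 
 The two conditions above can be sketched on synthetic data. This is a minimal illustration, not the analysis pipeline of the study: it assumes a nearest-centroid classifier as a simple stand-in for the classifiers actually used, simulated firing-rate vectors, and hypothetical function names and parameters. A code counts as shared in this sketch when an execution-trained classifier decodes observation trials (cross-task) about as well as an observation-trained classifier does (within-task).

```python
import random

ACTIONS = ["twist", "shift", "lift"]


def make_trials(patterns, n_trials, noise, rng):
    """Simulate trials as noisy firing-rate vectors around each action's mean."""
    trials = []
    for action, mean in patterns.items():
        for _ in range(n_trials):
            trials.append(([m + rng.gauss(0, noise) for m in mean], action))
    return trials


def fit_centroids(trials):
    """'Train' a nearest-centroid classifier: one mean vector per action."""
    sums, counts = {}, {}
    for rates, action in trials:
        acc = sums.setdefault(action, [0.0] * len(rates))
        for i, value in enumerate(rates):
            acc[i] += value
        counts[action] = counts.get(action, 0) + 1
    return {a: [v / counts[a] for v in acc] for a, acc in sums.items()}


def predict(centroids, rates):
    """Return the action whose centroid is closest in squared distance."""
    return min(
        centroids,
        key=lambda a: sum((x - c) ** 2 for x, c in zip(rates, centroids[a])),
    )


def accuracy(centroids, trials):
    return sum(predict(centroids, r) == a for r, a in trials) / len(trials)


def demo(shared, seed=0, n_neurons=20, n_trials=40, noise=1.0):
    """Return (cross-task accuracy, within-task accuracy) on observation trials."""
    rng = random.Random(seed)
    exec_patterns = {a: [rng.uniform(0, 5) for _ in range(n_neurons)] for a in ACTIONS}
    if shared:
        obs_patterns = exec_patterns  # same rate-to-action mapping in both tasks
    else:
        # Assumed non-shared code: actions are still distinct, but mapped differently.
        obs_patterns = {
            "twist": exec_patterns["shift"],
            "shift": exec_patterns["lift"],
            "lift": exec_patterns["twist"],
        }
    exec_train = make_trials(exec_patterns, n_trials, noise, rng)
    obs_train = make_trials(obs_patterns, n_trials, noise, rng)
    obs_test = make_trials(obs_patterns, n_trials, noise, rng)  # held-out trials
    cross = accuracy(fit_centroids(exec_train), obs_test)
    within = accuracy(fit_centroids(obs_train), obs_test)
    return cross, within


for label, flag in [("shared", True), ("non-shared", False)]:
    cross, within = demo(shared=flag)
    print(f"{label}: cross-task={cross:.2f}, within-task={within:.2f}")
```

 In the simulated non-shared case the observed actions remain well discriminated by the observation-trained classifier while the execution-trained classifier fails, which is the pattern that would argue against a shared code.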
 
 With respect to concerns that the monkey may not have used the cue at all when the action was executed, we added a comparison with control trials with a non-informative cue and also compared the duration of the approach phase between the three actions. Regarding oculomotor behavior, we verified that the monkey had actually directed his gaze toward the action during action observation for all three actions.
 
 On the third issue, concerning the novelty of our results, we have now explained in more detail in the introduction why we felt it necessary to conduct a study we considered fundamental. As a result of our study, it can be clearly stated now that representations of observed actions as predicted by the strictly defined mirror mechanism are rare in F5 MNs, but nevertheless cannot be dismissed as random. This dispels the objection rightly raised by Csibra in 2005 and contradicts the currently prevailing view that such a representation can only be found at a population level. Even if these representations are ultimately explained by a concept other than the strictly defined mirror mechanism, their existence must be accounted for by any theory of the function of F5 neurons. Moreover, it is also shown that the observed actions are well discriminated with a non-shared code, at times even optimally. This contradicts the notion – which has been widespread for a long time since the work of Gallese et al. (Brain, 1996) – that mapping to motor representations in terms of broad congruence is simply not perfect. The applied cross-task decoding approach seems promising to test also in the future for a shared action code. Finally, reconsideration of alternative concepts has led us to highlight the possibility of a representation of a goal pursuit by the observer.
 
 Reviewer #1 (Public Review):
 
 The authors set out to investigate the hypothesis that mirror neurons in ventral premotor area F5 code actions in a common motor representation framework. To achieve this, they trained a linear discriminant classifier on the neural discharge of three types of action trials and test whether the thus trained classifier could decode the same categories of actions when observed. They showed that codes were fully matched for a small subset of neurons during the action epoch, while a wider set of "mirror neurons" showed only poorly matched codes for different epochs.
 
 This is one of the descriptions of our results, where we realized that in our first version we did not distinguish clearly enough between statements about single neurons and statements about populations of neurons. This prompted us to perform a detailed single neuron analysis.
 
 The authors controlled for potential visual object confounds by having identical objects be manipulated in three different ways and by having the animal carry out the motor execution in the dark. The main strength of the study lies in the clever decoding approach testing the matched tuning to behavioural categories in a model-free way. The central result is in the identification of the small sub-group of mirror neurons that show true matching during the execution epoch, which can dissociate the three types of action almost perfectly. This aligns well with some previous work while offering a novel avenue to identify and investigate those neurons. The underlying neuronal mechanism and behavioural relevance of these neurons remain an open question. It would have been interesting to understand better whether the specific motor representations at a recording site, for instance identified through microstimulation prior to recording (see Methods), the reaction times on individual trials or the specific gaze targets (object/hand) had a bearing on the decoding performance for a neuron/trial.
 
 We agree that these are interesting questions.
 
 In this study, the focus is on testing for a shared code according to a strictly defined mirror mechanism. We have now compared the anatomical locations of neurons with only time bins in which observed actions were discriminated with a shared code (according to one of the methods) to the locations of neurons with only time bins with non-shared code (see last paragraph in Results). We did not find any relevant difference and this is why one cannot expect topographically specific effects of microstimulation.
 
 We do not expect the reaction time (i.e., the time interval between LED onset and start button release, or the duration of the approach epoch) during execution or observation to have any effect on our results on shared coding as the analysis was based on relative time bins. The observed actions were predominantly distinguished late in the approach epoch, but especially in the manipulation epoch. At this time, reaction time is not expected to have a relevant influence.
 
 The relationship between gaze/eye position and the activity of mirror neurons, during execution or observation, is an interesting topic in itself. However, for testing for a shared code according to a strictly defined mirror mechanism, it is only relevant that the observing monkey actually observes the action. We have ensured this in our experiment by a fixation window and have now also confirmed that the monkey actually looked into the area of the object during all three actions (see Results, lines 209-219 in the manuscript with tracked changes).
 
 Ultimately, the uncovered matched mirror representations should in future experiments be tested with causal interventions and linked trial-by-trial to action selection performance.
 
 The authors put the focus of their discussion on the wider, less well-matched neuronal pool to support an action selection framework, which is of course a valid view and well established in motor representations. From a sensory perspective, sparse coding, as suggested by the small group of "true" mirror neurons identified with the decoding approach, should also be considered as the basis for a possible neuronal mechanism. A particular strength of the paper is that it could give new data and impetus to the important discussion about how motor and sensory coding frameworks come together in cortical processing.
 
 We have expanded the discussion considerably and also address the possibility of sparse coding.
 
 Reviewer #2 (Public Review):
 
 The paper by Pomper and coworkers is an elegant neurophysiological study, generally sound from a methodological point of view, which presents extremely relevant data of considerable interest for a broad audience of neuroscientists. Indeed, they shed new light on the mirror mechanism in the primate brain, trying to approach its study with a novel paradigm that successfully controls for some important factors that are known to impact mirror neuron response, particularly the target object. In this work, a rotating device is used to present the very same object to the monkey or the experimenter, in different trials, and neurons are recorded while the monkey (motor response) or the experimenter (visual response) performed a different action (twist, shift, lift) cued by a colored LED.
 
 The results show that there is a small set of neurons with congruent visual and motor selectivity for the observed actions, in line with classical mirror neuron studies, whereas many more cells showed temporally unstable matched or even completely non-matched tuning for the observed and executed actions. Importantly, the population codes allow to accurately decode both executed and observed actions and, to some extent, even to cross-decode observed actions based on the coding principles of the executed ones.
 
 In my view, however, the original hypothesis that an observer understands the actions of others by the activation of his/her motor representations of the observed actions constitutes circular reasoning that cannot be challenged or falsified, as the author may want to claim. Indeed, 1) there is no causal evidence in the paper favoring or ruling out this hypothesis (and there couldn't be), 2) there is no independent definition (neither in this paper nor in the literature) of what "action understanding" should mean (or how it should be measured). Instead, the findings provide important and compelling evidence to the recently proposed hypothesis that observed actions are remapped onto (rather than matched with) motor substrates, and this recruitment may primarily serve, as coherently hypothesized by the authors, to select behavioral responses to others (at least in monkeys).
 
 1) One of the main problems of this manuscript is, in my view, a theoretical one. The authors follow a misleading, though very influential, proposal, advanced since the discovery of mirror neurons: if there are (mirror) neurons in the brain of a subject with an action tuning that is matched between observation and execution contexts, then the subject "understands" the observed action. This is clearly circular reasoning because the "understanding" hypothesis uniquely derives from the neuron firing features, which are what the hypothesis should explain. In fact, there is no independent, operational definition of the term "understanding". Not surprisingly there is no causal evidence about the role of mirror neurons in the monkey, and the human studies that have claimed to provide causal evidence of "action understanding" ended up using, practically, operational definitions of "recognition", "match-to-sample", "categorization", etc. Thus, "action understanding" is a theoretical flaw, and there is no way "to challenge" a theoretical flaw with any methodologically sound experiment, especially when the flaw consists of circular reasoning. It cannot be falsified, by definition: it must simply be abandoned. On these bases, I strongly encourage the authors to rework the manuscript, from the title to the discussion, by removing any useless attempt to falsify or challenge a circular concept and, instead, constructively shed new light on how mirror neurons may work and which may be their functional role.
 
 Please see the response to all.
 
 2) An important point to be stressed, strictly related to the previous one, concerns the definition of "mirror neuron". Let me state at the outset that I am perfectly fine with the definition used by the authors, which is in line with the very permissive one adopted in most studies of the last 20 years in this field. However, it does not at all fulfill the very restrictive original criteria of the study in which the "action understanding" concept was proposed (see Gallese et al. 1996 Brain): no response to object, no response to pantomimed action or tool actions, activation during execution in the dark and during the observation of another's action.
 
 We do not agree that the enumerated "very restrictive original criteria" emerge from the Gallese et al. (Brain, 1996) study. Except for the first paragraph in the results section, there is no clear statement on how mirror neurons should be defined.
 
 If the idea (which I strongly disagree with) was to simply challenge a (very restrictive) definition of mirroring (a very out-of-date one, indeed, and different from the additional implication of "action understanding"), the original definition of this concept should be at least rigorously applied. In the absence of additional control conditions, only the example neuron in Figure 2A could be considered a mirror neuron according to Gallese et al. 1996.
 
 We have the impression that the question does not distinguish clearly enough between the definition of "mirror neuron" and the definition of "mirror mechanism". In defining "mirror mechanism", we refer to the work of Rizzolatti and Sinigaglia (Nat Rev Neurosci, 2016). We do not think that this definition is out-of-date (see for example the 2018 article by Rizzolatti and Rozzi in Handbook of Clinical Neurology). If the term "mirror mechanism" is to be defined differently, then another term should be used for a new definition or an annotation should be added (such as "version 2"). This would be necessary to avoid unnecessary confusion resulting from unclear terms.
 
 Permissive criteria imply that more "non-mirror" neurons are accepted as "mirror": the fact that they are permissively named "mirror" does not imply that they are mirroring anything as initially hypothesized.
 
 Even for a neuron that would be classified as a "mirror neuron" according to your previously stated "very restrictive original criteria", it does not follow that it "mirrors" according to a mirror mechanism. And, of course, it is quite possible that more neurons do not "mirror" according to a mirror mechanism if one tests more neurons.
 
 (Example neuron in Fig 2B, for example, could be related to mouth, rather than hand, movements, since it responds strongly and similarly around the reward delivery also during the observation task, when the monkey should be otherwise still).
 
 We agree, it is not excluded that this neuron has a relation to mouth movements. However, since the neuron meets the conditions to be classified as a "mirror neuron", an additional relation to mouth movements would not be relevant. If mouth movements are to be an exclusion criterion, then this would have to be included and justified in the definition of a "mirror neuron".
 
 Clearly, these concerns impact all the action preference analyses. To practically clarify what I mean, it should be sufficient to note that 74% (reported in this study) is the highest percentage ever reported so far in a study of neurons with "mirror" properties in F5 (see Kilner and Lemon 2013, Curr Biol) and it is similar to the 68% recently reported by these same authors (Pomper et al. 2020 J Neurophysiol) with very similar criteria. Clearly, there is a bias in the classification criteria relative to the original studies: again, no surprise if by rendering most of the recorded neurons "mirror by definition" then they don't "mirror" so much. I suggest keeping the authors' definition but removing the pervasive idea to challenge the (misleading) concept of understanding.
 
 We think that it is very important to clearly separate "mirror neuron" from "mirror mechanism". And the question arises whether one should not include a mirroring criterion, which is derived from a definition of a mirror mechanism, in the definition of mirror neurons. We address this briefly in the discussion. Ultimately, the point of our study is to find out how many of the - if you want to put it that way - "permissively defined" mirror neurons actually “mirror”. And the answer depends on how one defines “mirror mechanism”. We provide an answer by resorting to a “strictly defined mirror mechanism”. We have now also given throughout the results section the percentages of neurons with certain properties with respect to all measured F5 neurons. This is a reference that allows comparisons among studies, provided that no neurons were directly discarded during recording, which we avoided in our study.
 
 3) It would be useful to provide more information on the task. Panel B in Figure 1 is the unique information concerning the type of actions performed by the monkey and the experimenter. Although I am quite convinced of the generally low visuomotor congruence, there are no kinematics data nor any other evidence of the statement "the experimental monkey was asked to pay attention to the same actions carried out by a human actor". First, although the objects were the same, the same object cannot be grasped or manipulated in the same way by a human and a macaque, even just because of the considerable difference in the size of their hands; this certainly changes the way in which monkeys' and experimenter's hands interact with the same object, and this is a quantifiable (but not quantified) source of visuomotor difference between observed and executed actions and a potential source of reduced congruency.
 
 We agree, of course, that there are kinematic differences in how a monkey and how a human manipulate the same object. We have not measured the kinematics and thus cannot make a systematic statement about this. We now report in the results section the rather incidental observation that already the reaching trajectories for the three actions differed and show corresponding differences in the timing of the approach epoch. However, for the question of this study, how many neurons are eligible to represent observed actions according to a strictly defined mirror mechanism, the kinematic repertoire of the observed actor is irrelevant. The reference is the F5 mirror neuron activity during the monkey's own action, i.e., how the monkey approaches the object with his hand, how he grasps it, and how he brings it to a certain target position and holds it there. The observed action, according to the strictly defined mirror mechanism, is to be mapped to this reference. Therefore, we did not collect kinematic data. But it is of course a possible explanation for a non-shared code if the strictly defined mirror mechanism does not apply.
 
 Second, there is little information about the monkey's oculomotor behavior in the two conditions, which is known to affect mirror neuron activity when exploratory eye movements are allowed (Maranesi et al. 2013 Eur J Neurosci), potentially influencing the present findings: a ±7 (vertical) and ±5 (horizontal) window at 49 cm implies that the monkey could explore a space larger than 10 cm horizontally and 14 cm vertically, which is fine, but certainly leaves considerable freedom to perform different exploratory eye movements. These may differ among observed actions and hence could account for different "attention" paid by the monkey to different conditions, making them a source of neural variability in addition to action tuning.
 
 We agree that the topic of the relationship between F5 MNs activity and eye movements is interesting. And we know from the work of Maranesi et al. (2013) that at least larger eye movements during action observation are related to the activity of F5 MNs. In our study, we ensured that the observing monkey was actually observing the action. For this purpose, we used a fixation window. We now additionally verified that the monkey really looked into the area of the object during all three actions (see Results, lines 209-219 in the manuscript with tracked changes). In our study, the fixation window was so small that the monkey could not see the face of the human actor, in contrast to the study of Maranesi et al. (2013). It was mainly the face that attracted the monkey's attention in that study (measured by gaze position). In our study, the risk that the gaze of the observing monkey left the fixation window was high when he looked at the human actor's hand above the wrist. The execution of the action by the monkey took place in darkness. We did not use a fixation window there because the monkey's own execution of the action can be assumed to direct his attention to the action.
 
 We cannot rule out the possibility that smaller eye movements during observation, larger eye movements during execution in darkness, covert shifts of spatial attention, or more generally attentional fluctuations have an influence on F5 MNs that might have counteracted a shared action code in our study. However, if this were the case, then the investigated hypothesis that the activity of F5 MNs during action observation is a motor representation according to the strictly defined mirror mechanism would also have to be rejected.
 
 4) Information about error trials and their relationship with action planning. The monkey cannot really "make errors" because, despite the cue, each object can be handled in a unique way. The monkey may not pay attention to the cue and adjust the movement based on what the object permits once grasped, depending on online object feedback. From the behavioral events and the times reported in Table 1, I initially thought that "shift" action was certainly planned in advance, whereas "lift" and "twist" could in principle be obtained by online adjustments based on object feedback; nonetheless, from the Methods section it appears that these times are not at all informative because they seem to depend on an explicit constraint imposed by the experimenters (in a totally unpredictable way). Indeed, it is stated that "to motivate the monkey even more to use the LED in the execution task, another timeout was active in 30% (rarely up to 100%) of trials for the time period between touch of object to start moving the object: 0.15 (rarely 0.1) for a twist and shift, 0.35 (rarely 0.3s) for a lift". This is totally confusing to me; I don't understand 1) why the monkey needed to be motivated, 2) how can the authors be sure/evaluate that the monkeys were actually "motivated" in this way, and 3) what kind of motor errors the monkey could actually do if any. If there is any doubt that the monkeys did actually select and plan the action in advance based on the cue, there is no way to study whether the activity during action execution truly reflects the planned action goal or a variety of other undetermined factors, that may potentially change during the trials. Please clarify.
 
 It is true that the three actions could in principle be performed without using the LED as an informative cue. While this is unlikely under the assumption that a monkey prefers the easiest and fastest way to get reward, it remains a possibility. For this reason, we introduced time constraints in a part of the trials. The selection of time constraints and the proportion of trials in which they were applied, was a pragmatic compromise between a time limit, at which the LED must be used as an informative cue for action selection in order to comply with the task, and a time span that allows the task to be completed even when overall motivation is low. The latter takes into account the general experimental experience that a monkey's engagement or motivation in such experiments varies across trials, sessions, and days. To evaluate whether the LED color was, indeed, used as a cue for action planning in the execution task, we randomly interleaved trials with a different LED, non-informative regarding the type of object, as a control in 5% of the trials. We compared the behavioral responses in trials with informative cues and those with a non-informative cue. The behavioral analysis established that both monkeys indeed used the informative cues to guide their choices (see Fig. 1D).
 
 Further evidence that the monkey used the cue for action selection and planning is the finding that the type of action was encoded before the release of the start button and then further during the approach phase, i.e., much earlier than somatosensory feedback about the manipulability of the object was available (see Fig. 3A and Fig. 6A).
 
 Regarding the question, which "motor errors" were possible: The answer can be found in the description of the cases in which a trial was aborted (see Material and methods): releasing the start button too early (< 100 ms after turning on the LED), manipulating the object too slowly after touching it (the time constraints mentioned), not holding the object until the reward was given, or not performing the task at all (10 s timeout).
 
 5) Classification analysis. There seems to be no statistical criterion to establish where and when the decoding is significantly higher than chance: the classifier performance should be formally analyzed statistically. I would expect that, in this way, both the exe-obs and the obs-exe decoding may be significant. Together with the considerations of the previous point 2 about the permissive inclusion criteria for mirror neurons, this is a remarkable (even quite unexpected) result, which would prove somehow contrary to what the authors claim in the title of the paper. The fact that in any classification the "within task" performance is significantly better than the "between task" performance does not appear in any way surprising, considering both the inclusive selection criteria for "mirror neurons" and the unavoidably huge different sources of input (e.g. proprioceptive, tactile, top-down, etc. afferences) between execution and observation. So, please add a statistical criterion to establish and show in the figures when and where the classifications are significantly above chance.
 
 We have added - in addition to the statistics already performed in the first version (Fig. 3A in the previous version, now Fig. 6A) - a number of analyses including statistics. This mainly concerns the analyses regarding a shared code at the single neuron level, in which we additionally tested against the null hypothesis proposed by Csibra in 2005 using permutation tests. And we have now also calculated confidence intervals for the population classifications that allow the comparison with chance level. We re-performed the classification analyses using eight-fold cross-validation. We also added a statistical analysis to the finding of clustering of time periods with shared code (Fig. 4). In Figure 5, we additionally compared the frequency of action segments with shared and non-shared codes, which is a descriptive, exploratory analysis. For this reason, it does not make sense to perform inferential statistics. Overall, these analyses represent a significant expansion of the analyses in the first version. We have done this primarily to arrive at statistically sound conclusions at the single neuron level.
 
 Regarding the comparison between within-task classification (o2o) and cross-task classification (e2o), it is important to keep in mind that the goal was to test the hypothesis that the activity of F5 MNs during action observation is a motor representation of the observed action according to the strictly defined mirror mechanism. This hypothesis requires both, 1) an above chance level accuracy of the e2o classifier and 2) no better accuracy of the o2o classifier as compared to the e2o classifier. If the o2o classifier were better, then the actions would not be represented as they are executed. And the reference in this hypothesis is the motor representation, that is, the code at execution. Thus, the direction e2o classification is the crucial one, not the reverse direction (o2e). One explanation for the fact that o2o shows better accuracy in the population may be the different sensory inputs mentioned above. In this case, the tested hypothesis has to be rejected and replaced by another one, which should then have a different name.
 
 Nevertheless, we also show the result of the o2e cross-task classification in Fig. 6 (yellow curve), which was already included in Fig. 3 of the first version. However, we do not address it in more detail in the main text because it is not relevant for the hypothesis to be tested. It is only a reportable additional result.
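
The logic of this comparison can be illustrated with a minimal sketch. It is not the analysis pipeline used in the study: the simulated firing rates, the nearest-centroid readout (standing in for a linear classifier), the trial counts, and all names such as `simulate` and `cross_validated_accuracy` are assumptions made purely for illustration. The sketch trains a classifier on execution trials and tests it on observation trials (e2o), and compares this with an eight-fold cross-validated classifier trained and tested on observation trials (o2o), once for a population whose tuning is shared between tasks and once for a population whose observation tuning is unrelated to its execution tuning.

```python
import random

ACTIONS = ("lift", "twist", "shift")


def simulate(tuning, n_trials, noise_sd, rng):
    """Return a list of (population_rates, action_index) trials with Gaussian noise."""
    trials = []
    for label, means in enumerate(tuning):
        for _ in range(n_trials):
            trials.append(([rng.gauss(m, noise_sd) for m in means], label))
    return trials


def fit_centroids(trials):
    """Mean population vector per action (a nearest-centroid classifier)."""
    sums, counts = {}, {}
    for rates, label in trials:
        acc = sums.setdefault(label, [0.0] * len(rates))
        for i, r in enumerate(rates):
            acc[i] += r
        counts[label] = counts.get(label, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}


def predict(centroids, rates):
    """Assign a trial to the action whose centroid is closest (squared distance)."""
    def dist2(c):
        return sum((r - ci) ** 2 for r, ci in zip(rates, c))
    return min(centroids, key=lambda lab: dist2(centroids[lab]))


def accuracy(centroids, trials):
    hits = sum(predict(centroids, rates) == label for rates, label in trials)
    return hits / len(trials)


def cross_validated_accuracy(trials, n_folds, rng):
    """Within-task accuracy: each fold is held out once and decoded from the rest."""
    shuffled = trials[:]
    rng.shuffle(shuffled)
    hits = 0
    for k in range(n_folds):
        test = shuffled[k::n_folds]
        train = [t for i, t in enumerate(shuffled) if i % n_folds != k]
        centroids = fit_centroids(train)
        hits += sum(predict(centroids, r) == lab for r, lab in test)
    return hits / len(shuffled)


rng = random.Random(0)
n_neurons = 20  # hypothetical population size

# Hypothetical mean rates (Hz) per action and neuron.
exe_tuning = [[rng.uniform(5, 30) for _ in range(n_neurons)] for _ in ACTIONS]
independent_obs_tuning = [[rng.uniform(5, 30) for _ in range(n_neurons)] for _ in ACTIONS]

exe_trials = simulate(exe_tuning, 40, 2.0, rng)
obs_shared = simulate(exe_tuning, 40, 2.0, rng)                  # shared code
obs_independent = simulate(independent_obs_tuning, 40, 2.0, rng)  # non-shared code

exe_centroids = fit_centroids(exe_trials)
e2o_shared = accuracy(exe_centroids, obs_shared)
o2o_shared = cross_validated_accuracy(obs_shared, 8, rng)
e2o_independent = accuracy(exe_centroids, obs_independent)
o2o_independent = cross_validated_accuracy(obs_independent, 8, rng)

print("shared code:     e2o =", e2o_shared, " o2o =", o2o_shared)
print("non-shared code: e2o =", e2o_independent, " o2o =", o2o_independent)
```

Under these assumptions, a shared code yields e2o accuracy that is above the chance level of 1/3 and comparable to o2o accuracy, which are the two requirements stated above. With unrelated tuning, o2o remains high, while e2o depends only on how the unrelated tunings happen to line up and carries no systematic information about the observed action.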
 
 6) "As the concept of a mirror mechanism posits that the observation performance can be led back to an activation of a motor representation, we restricted this analytical step to a comparison of the exe-obs and the obs-obs discrimination performance". I don't understand the rationale of this choice. The so-called "concept" of mirror mechanism in classical terms posits that mirror neurons have a motor nature and hence their functioning during observation should follow the same principle as during action execution. But this logical consideration has never been demonstrated directly (it is indeed contested by several papers), and when motor neurons are concerned (e.g. pyramidal tract neurons, see Kraskov et al. 2009) their behavior during action observation is by far more complex (e.g. suppression vs facilitation) than that hypothesized for classical "mirror neurons". Furthermore, when across-task decoding for execution and observation code has been used, both in neurophysiological (e.g. Livi et al. 2019, PNAS) and neuroimaging (Fiave et al. 2018 Neuroimage) data, the visual-to-motor direction typically produces better performance than the opposite one. Thus, I don't see any good reason not to show also (if not even just) the obs-exe results. Furthermore, I wonder whether the authors considered the possible impact of a rescaling in the single neuron firing rate across contexts, as the observation response is typically less strong than the execution response in basically all brain areas hosting neurons with mirror properties, and this should not impact the matching if the tuning for the three actions remains the same (e.g. see Lanzilotto et al. 2020 PNAS). The analysis shown in Figures 4 and 5 is, for the rest, elegant and very convincing - somehow surprising to me, as the total number of "congruent" neurons (7.5%) is even greater than in the original study by Gallese et al. (5.4%).
 
 As to the rationale of our approach, please see our response to the previous point.
 
 On the issue of rescaling: the hypothesis tested here requires that the F5 MNs activity on observation is a motor representation of the observed action. Hence, from the activity during observation the action should be just as readable as from the execution-related activity. If we had to use rescaling to find a shared code, then observed actions would not be represented in F5 MNs in the same way as on execution. Additional information on whether the action is being executed or observed would be needed. This would of course be possible in principle, but would contradict the hypothesis. And we then not only have the difficulty of which readout is the physiological one (here we make a parsimonious assumption with a linear readout), but we would have to make an additional assumption about rescaling. For this study, we have now chosen the solution of performing the action preference analysis on a single neuron level in a statistically clean way. This represents a very liberal form of rescaling, as it only tests whether the action with the highest or lowest discharge rate is the same when executed and observed. That is, if the result here is not fundamentally different, which is the case, then it can also be assumed that one does not get qualitatively different results for other forms of rescaling.
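
The difference between a rate-matching readout and the action preference analysis can be sketched for a single hypothetical neuron. The rates, the gain factor of 0.4, and the helper names below are illustrative assumptions, not values or code from the study. The observation response has the same tuning as the execution response but a weaker gain, which is the rescaling scenario raised by the reviewer.

```python
ACTIONS = ("lift", "twist", "shift")


def preferred_action(mean_rates):
    """Index of the action with the highest mean discharge rate."""
    return max(range(len(mean_rates)), key=lambda i: mean_rates[i])


def least_preferred_action(mean_rates):
    """Index of the action with the lowest mean discharge rate."""
    return min(range(len(mean_rates)), key=lambda i: mean_rates[i])


def nearest_rate_readout(reference_rates, rate):
    """Decode an action by matching a rate to the closest reference mean rate."""
    return min(range(len(reference_rates)), key=lambda i: abs(reference_rates[i] - rate))


# Hypothetical mean rates (Hz) for lift, twist, shift during execution.
exe_rates = [30.0, 18.0, 12.0]
# Same tuning during observation, scaled down by an assumed gain of 0.4.
obs_rates = [0.4 * r for r in exe_rates]

# Preference analysis: insensitive to the gain change.
same_preferred = preferred_action(exe_rates) == preferred_action(obs_rates)
same_least_preferred = least_preferred_action(exe_rates) == least_preferred_action(obs_rates)

# Execution-referenced readout applied to observation rates: every observed
# action is matched to the lowest execution rate.
decoded = [nearest_rate_readout(exe_rates, r) for r in obs_rates]

print("same preferred action:", same_preferred)
print("same least preferred action:", same_least_preferred)
print("decoded from observation:", [ACTIONS[i] for i in decoded])
```

In this toy case the preferred and least preferred actions agree across tasks, whereas the execution-referenced readout assigns all three observed actions to "shift". This is why a preference comparison tolerates a gain change that an execution-trained readout does not, and why recovering the action from the rescaled response would require knowing whether the action was executed or observed.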
 
 7) The discussion may need quite deep revision depending on the authors' responses and changes following the comments; for sure it should consider more extensively the numerous recent papers on mirror neurons that are relevant to frame this work and are not even mentioned.
 
 The discussion has been thoroughly revised considering the comments raised and suggestions of this and the other two reviewers.
 
 Reviewer #3 (Public Review):
 
 Mirror neurons are a big deal in the neuroscience literature and have been for thirty years. I (and many others) remain skeptical of whether they serve the functions often attributed to them - specifically, whether they are motor planning neurons that contribute to understanding the actions of others. Testing their functions, therefore, is of great interest and importance. The present study, however, is not a cogent or convincing test. I do not think this study helps to answer the questions surrounding mirror neurons. It purports to provide a crucial test, that comes out mostly against the mirror neuron hypothesis, but the test has too many weaknesses to be convincing.
 
 Thank you for the clear words. We take from it, first of all, that in the first version of the manuscript we failed to convey the relevance of our study for the discussion of mirror neuron function. The concerns of this reviewer are in line with those of the others and are addressed in our response to all three reviewers.
 
 First, consider that the motor tuning and the visual tuning match "poorly." How poor or good must the match be before the mirror neuron hypothesis is rejected? I do not know, and the study does not help here. Even a "poor" match could contribute significantly to a social perception function.
 
 The specific hypothesis tested here assumes that an action-specific activity of F5 MNs evoked by observed actions corresponds to an action-specific activity of these actions if executed. The approach taken here to compare cross-task classification accuracy (execution-trained, tested in observation) with within-task classification accuracy (observation-trained, tested in observation) tests this hypothesis. The fact that we found a cluster of time periods of single neurons in which both accuracies are almost equal supports this approach and also the hypothesis for these time periods. In principle, of course, the decision for the presence of a difference or equality is always only a statistical statement and contains assumptions. For example, the assumption that a linear readout has physiological relevance enters here. But this problem exists in all studies that ultimately try to understand biological neuronal networks in order to explain perceptions and behavior. However, it is such studies that attempt to elucidate what information is contained in which neurons that set the stage for experiments that, in the optimal case, manipulate certain neurons in a particular way in order to then measure the behavior of an animal that is just right for those neurons.
 
 Second, the results remind me in some ways of other multi-modal responses in the brain. For example, in the visual area MST, neurons are tuned to optic flow fields that imply specific directions of self-motion. Many of the same neurons are tuned to vestibular signals that also imply specific directions of self-motion. But the optic flow tuning and the vestibular tuning are not perfectly matched. There is considerable slop and complexity in how the two tunings compare within individual neurons. That complexity is not evidence against multi-modal tuning. Instead, it suggests a hidden-layer complexity that is simply not fully understood yet. Just so here, the fact that the apparent motor tuning and apparent visual tuning match "poorly" is not evidence against both a motor planning and a visual encoding function.
 
 We hope that it is now clearer, in contrast to the first version, that we tested a specific hypothesis that is only a prerequisite for the hypothesis of a very specific form of understanding. Referring to the example, the hypothesis analogous to ours would be that the representation of self-motion direction due to optic flow ("observation") corresponds to the representation of self-motion direction due to vestibular stimulation ("execution"). If it were then found that the self-motion direction due to optic flow cannot be predicted from a classifier trained on vestibular stimulation, and that another classifier trained on optic flow performs better, then the hypothesis would have to be rejected. This is then a reason to realize that "everything is a bit more complex" and to search for better explanations.
 
 Third, the animals are massively over-trained in three actions. They perform these actions and see them performed thousands of times toward the same object. Surely, if I were in the place of the monkey, every time I saw the object, I'd mentally imagine all three actions. As I saw a person act on the object, I'd mentally imagine the alternative two actions at the same time. Even if the mirror neuron hypothesis is strictly correct, this experiment might still find a confusion of signals, in which neurons that normally might respond mainly to one action begin to respond in a less predictable way during all three trial types.
 
 In our study, we tested a specific hypothesis related to the time an action is observed. Here, you suggest an alternative hypothesis. The question is whether this alternative hypothesis better explains the result of our study. The alternative hypothesis can be formulated as follows: the F5 MNs activity elicited by an observed action in this experiment corresponds to a mixture of the activities that occur when the other two actions are executed. This hypothesis is to be rejected because it fails to explain why a shared code occurs in single neurons and why cross-task population classifiers show an accuracy above chance level. A modified alternative hypothesis, which states that what is represented in the experiment during observation is a mixture of all three actions, cannot explain why the three actions are very well represented in the population and are optimally represented exactly when the target position of the object is reached.
 
 Fourth, the experiment relies on a colored LED that acts as an instructional cue, telling the monkey which action to perform. What is to stop the neurons from developing a cue-sensitive response, as in classic studies from Steve Wise and others in the premotor cortex? Perhaps the neuronal signal that the experimenters are trying to measure is partly obscured by other, complex responses influenced in some manner by the instructional cue?
 
 In principle, there is the possibility that purely sensory information is also represented in area F5, at least in some neurons or at certain points in time. We take your suggestion and discuss this as one of the alternative concepts (we call it "sensory concept"). However, several findings argue against this concept. For example, neural responses to cues usually represent the subsequent action, but not sensory information of the cue such as the color of the cue. In our study, it is evident from Figure 3A, 6A and 6B that during action execution, actions are discriminated even before the start button is released. Since this discrimination of actions occurs with a time delay after the cue and then increases continuously, this is evidence that the action to be executed is represented, but not the cue itself.
 
 Fifth, finally, and most importantly, the fundamental problem with this study is that it is correlational. Studies that purport to test the function of a set of neurons, and do so by use of correlational measurements, cannot provide strong answers. There are always half a dozen different interpretations and caveats, such as the ones I raised here. Both sides of a debate can always spin the results, and the arguments are never resolved. To test the mirror neuron hypothesis properly would require a causal study. For example, lesion area F5 and test if the monkey is less able to discriminate the actions of others. Or, electrically microstimulate in area F5 and test if the stimulation interferes (either constructively or destructively) with the task of discriminating the actions of others. Only in this way will it be possible to answer the question: do mirror neurons functionally participate in understanding the actions of others? The present study does not answer that question.
 
 We would like to reiterate that studies aimed at elucidating what information is contained in which neurons or areas are necessary to understand neural network processes and are a prerequisite for conducting well-considered experiments that measure behavioral effects through specific manipulation of the neural network. Without the work of Gallese, Rizzolatti and colleagues, the idea of associating F5 neurons with action understanding would not have occurred in the first place. The current tricky question is whether at all, and if so, to what understanding, to what perception, to what behavior that uses information about mental states of another, F5 MNs might be able to contribute. And for this, it helps to have a clearer idea of what information is contained in F5 MNs during action observation.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.02.01.478703v1
Jul 2023
www.biorxiv.org

New submission 31/07/2023, 09:07:20

1
1. Public_Reviews 31 Jul 2023
  
  in eLife
  
  Author Response:
  
  Reviewer #1 (Public Review):
  
  This is a short but important study. Basically, the authors show that α-synuclein overexpression's negative impact on synaptic vesicle recycling is mediated by its interaction with E-domain containing synapsins. This finding is highly relevant for synuclein function as well as for the pathophysiology of synucleinopathies. While the data is clear, functional analysis is somewhat incomplete.
  
  We will perform all additional functional analyses requested by the reviewer (listed in “recommendations for the authors”) and report them in the revised version. These include dissociation of exo/endocytosis in the context of synapsin-E domain, and further quantification of the rise and fall of pHluorin curves.
  
  Reviewer #2 (Public Review):
  
  In this manuscript the authors established synapsin's E-domain as an essential functional binding partner that allows α-syn functionality. They show very elegantly that only synapsin isoforms that have an E-domain bind α-syn and allow the inhibition mediated by α-syn. Deletion of the C-terminus (α-syn 96-110) eliminated this interaction. Hence, synapsin E-domain binds to α-syn, enabling the inhibitory effect of α-syn on synaptic transmission.
  
  The paper will be improved significantly if additional experiments are added to expand and provide a more mechanistic understanding of the effect of α-syn and the intricate interplay between synapsin, α-syn, and the SV. For an enthusiastic reader, the manuscript as it stands now, with only 3 figures, ends prematurely. Some of the experiments above or others could complement, expand and strengthen the current manuscript, moving it from a short communication describing the phenomenon to a coherent textbook topic. Nevertheless, this work provides new and exciting evidence for the regulation of neurotransmitter release by synapsin and α-syn.
  
  We will address all the technical and conceptual points raised by the reviewer, do all the necessary experiments (listed in “recommendations for the authors”), and report them in the revised version. These include quantification of the expression levels of various proteins, evaluation of the dispersion of synapsin and α-syn under the stimulation conditions used in our studies, and consideration of other proposed roles of α-syn.
  
URL

biorxiv.org/content/10.1101/2023.06.24.546170v2
www.biorxiv.org

The PMA Phorbol Ester Tumor Promoter Increases Canonical Wnt Signaling Via Macropinocytosis

1
1. Public_Reviews 31 Jul 2023
  
  in eLife
  
  Author Response:
  
  We thank the reviewers for their very thoughtful suggestions. We will submit a revised manuscript addressing these comments and including a point-by-point response to reviewers. We will provide evidence that Wnt3a treatment increases macropinocytosis and that PMA increases this cellular response in cultured cells, but only in the presence of Wnt3a. This will be done using the current gold standard for macropinocytosis assays, the uptake of high molecular weight Dextran sensitive to the Na+/H+ exchanger inhibitor EIPA. A time-lapse video of rapid macropinocytosis cup induction by PMA in colorectal cancer cells will also be provided. Other new experiments will show that levels of the upstream macropinocytosis regulator Rac1 are increased by β-catenin DNA, constitutively active Lrp6, or LiCl. The criticism that, by taking a broad approach, our study lacks depth of mechanistic analysis is a valid one. The reason we used a multiplicity of approaches – Xenopus embryo assays, cancer cells in culture, colon cancer tissue arrays and mouse xenografts – was to validate, in as many different ways as possible, a central finding: that the classical phorbol ester tumor promoters can act by potentiating Wnt/β-catenin signaling through membrane trafficking.
  
URL

biorxiv.org/content/10.1101/2023.06.02.543509v2
www.biorxiv.org

New submission 28/07/2023, 09:07:26

1
1. Public_Reviews 28 Jul 2023
  
  in eLife
  
  Author Response:
  
  We are very grateful to the Editors and the three Reviewers for their valuable reviews of our submission. We will take into account all the comments and provide a revised manuscript with our point-by-point responses as soon as possible. In the meantime, we would like to respond provisionally to the reservation expressed in the eLife editorial assessment and by Reviewer #3 about the validity of our models for the study of the neurobehavioral consequences of purine deficiency and the pathogenesis of Lesch-Nyhan disease (LND) in Drosophila.
  
  Two enzymes are responsible for purine recycling in mammals: APRT and HGPRT. Only HGPRT deficiency causes neurobehavioral disturbances and LND in humans, while APRT deficiency leads to metabolic deficits without neurological or behavioral symptoms. In contrast, as we have been able to confirm, Drosophila expresses a single purine recycling enzyme, Aprt, and no HGPRT or HGPRT-like activity. Here we propose different ways to model LND in Drosophila, based either on Aprt deficiency or the expression of mutant HGPRT.
  
  Although it may be difficult to accept that the inactivation of a different gene in a distant organism could be a good model for LND, we have found that, in contrast to humans, Aprt deficiency has both metabolic and neurobehavioral consequences in Drosophila. This suggested that Aprt, being the unique fly purine recycling enzyme, might share the enzymatic function of human APRT and the neurodevelopmental function of human HGPRT, because its inactivation should recapitulate all pathological consequences of a lack of purine recycling in this organism, and in particular in the brain.
  
  The statement by Reviewer #3 that “it is unknown whether Aprt is also a structural homologue [of HGPRT]” is not accurate. APRT and HGPRT are known to be functionally and structurally related. Both human APRT and HGPRT belong to the type I PRTase family, identified by a conserved binding motif for phosphoribosyl pyrophosphate (PRPP), the substrate from which phosphoribosyl is transferred to purines. This binding motif is only found in PRTases from the nucleotide synthesis and salvage pathways (see: Sinha and Smith (2001) Curr Opin Struct Biol 11(6):733-9. PMID: 11751055). The purine substrates adenine, hypoxanthine and guanine share the same chemical skeleton and APRT can bind hypoxanthine, indicating that APRT and HGPRT also share similarities in their substrate binding sites (Ozeir et al. (2019) J Biol Chem. 294(32):11980-11991. PMID: 31160323). Moreover, Drosophila Aprt and human APRT are closely related, as the amino acid sequences of APRTs have been highly conserved throughout evolution (shown in Fig. S3B of our paper). We apologize for not providing this information in our original submission. This point will be made clearer in the revised article.
  
  Here we report a body of evidence that Drosophila can be used as a model to study LND. A strong argument, we believe, is that the same drugs have been found effective in rescuing the seizure-like phenotype in Aprt-deficient flies (Figure 7 in our manuscript) and the viability of fibroblasts and neural stem cells derived from iPSCs of LND patients, in which de novo purine synthesis was prevented (as discussed on page 37). This is a good sign that Drosophila could be used to identify new genetic targets and pharmacological compounds capable of rescuing HGPRT mutations in humans.
  
  Finally, we would like to emphasize that Reviewer #1 and Reviewer #2 expressed confidence in the potential usefulness of our work to better understand and treat LND in their public reviews. Reviewer #1 indeed stated that: “The findings provide a new example of how manipulating specific genes in the fruit fly allows the study of fundamental molecular processes that are linked to a human disease”, and Reviewer #2 further wrote: “Altogether, these are very important and fundamental findings that convincingly demonstrate the establishment of a Drosophila model for the scientific community to investigate LND, to carry out drug testing screens and find cures”, and added: “To conclude, this is a fundamental piece of work that opens the opportunity for the broader scientific community to use Drosophila to investigate LND”.
  
URL

biorxiv.org/content/10.1101/2023.06.23.546306v2
www.biorxiv.org

New submission 28/07/2023, 09:03:28

1
1. Public_Reviews 28 Jul 2023
  
  in eLife
  
  Author Response:
  
  We thank the reviewers for the constructive feedback and detailed reviews. To avoid any misunderstandings, we would like to add the following clarification. The comments from Reviewer 3 seem to indicate that in our simulator, synthspot, we mix cells from different data sets and even different species to create synthetic spots. The comment is the following:
  
  The choice to blend mouse and human scRNA-seq datasets in the simulation setup for generating synthetic spots is not ideal due to its departure from a realistic biological scenario.
  
  We would like to point out that the synthetic spots we create for the silver standard data sets are always sampled from the same scRNAseq or snRNAseq data set to keep the simulations as biologically plausible as possible.
  
  For each of the 6 public data sets, we create 9 different synthetic data sets, resulting in a total of 54 synthetic data sets. Each of these 9 data sets corresponds to a different abundance pattern, with spots representing combinations of cells sampled from this same public data set. Hence, these synthetic data sets always reflect cell types that actually co-occur in the tissue sections used to generate the underlying public scRNAseq or snRNAseq data set.
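
The sampling scheme described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the synthspot implementation: the function names `enumerate_designs` and `make_synthetic_spots`, the tuple layout of a cell, and the abundance weights are all hypothetical. The point it shows is that every spot is built from cells of one and the same reference data set, so no mixing across data sets or species can occur.

```python
import random
from collections import Counter


def enumerate_designs(n_datasets=6, n_patterns=9):
    """List every (dataset, abundance pattern) pair: 6 x 9 = 54 designs."""
    return [(d, p) for d in range(n_datasets) for p in range(n_patterns)]


def make_synthetic_spots(cells, weights, n_spots, cells_per_spot, seed=0):
    """Build synthetic spots from the cells of ONE reference data set.

    cells   : list of (cell_id, cell_type, {gene: count}) tuples
    weights : hypothetical abundance pattern, {cell_type: relative weight}
    """
    rng = random.Random(seed)
    # Group cells by annotated type so a type can be drawn first, then a cell.
    by_type = {}
    for cell in cells:
        by_type.setdefault(cell[1], []).append(cell)
    types = [t for t in by_type if weights.get(t, 0) > 0]
    type_weights = [weights[t] for t in types]

    spots = []
    for _ in range(n_spots):
        expression, composition = Counter(), Counter()
        for cell_type in rng.choices(types, weights=type_weights, k=cells_per_spot):
            _, _, counts = rng.choice(by_type[cell_type])
            expression.update(counts)      # sum gene counts over sampled cells
            composition[cell_type] += 1    # ground-truth cell-type composition
        spots.append({"expression": dict(expression),
                      "composition": dict(composition)})
    return spots
```

Because the only source of cells is the `cells` argument, the ground-truth composition of a spot can only contain cell types that are present in that single data set.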
  
URL

biorxiv.org/content/10.1101/2023.03.22.533802v3
www.medrxiv.org

New submission 28/07/2023, 08:55:57

1
1. Public_Reviews 28 Jul 2023
  
  in eLife
  
  Author Response:
  
  Reviewer #1 (Public Review):
  
  In this paper, Hui and colleagues investigate how the predictive accuracy of a polygenic score (PGS) for body mass index (BMI) changes when individuals are stratified by 62 different covariates. After showing that the PGS has different predictive power across strata for 18 out of 62 covariates, they turn to understanding why these differences arise and whether predictive performance could be improved. First, they investigated which types of covariates result in the largest differences in PGS predictive power, finding that covariates with larger "main effects" on the trait and covariates with larger interaction effects (interacting with the PGS to affect the trait) tend to better stratify individuals by PGS performance. The authors then see if including interactions between the PGS and covariates improves predictive accuracy, finding that linear models only result in modest increases in performance but nonlinear models result in more substantial performance gains.
  
  Overall, the results are interesting and well-supported. The results will be broadly interesting to people using and developing PGS methods. Below I list some strengths and minor weaknesses.
  
  Strengths:
  
  A major impediment to the clinical use of PGS is the interaction between the PGS and various other routinely measured covariates, and this work provides a very interesting empirical study along these lines. The problem is interesting, and the work presented here is a convincing empirical study of the problem.
  
  The result that PGS accuracy differs across covariates, but in a way that is not well-captured by linear models with interactions is important for PGS method development.
  
  Thank you for all of the positive comments.
  
  Weakness:
  
  While arguably outside the scope of this paper, one shortcoming is the lack of a conceptual model explaining the results. It is interesting and empirically useful that PGS prediction accuracy differs across many covariates, but some of the results are hard to reconcile simultaneously. For example, it is interesting that triglyceride levels are associated with PGS performance across cohorts, but it seems like the effect on performance is discordant across datasets (Figure 2). Similarly, many of these effects have discordant (linear) interactions across cohorts (Figure 3). Overall it is surprising that the same covariates would be important but for presumably different reasons in different cohorts. Similarly, it would be good to discuss how the present results relate to the conceptual models in Mostafavi et al. (eLife 2020) and Zhu et al. (Cell Genomics 2023).
  
  Thank you for the comments. We agree that more generalizable explanations would be useful, which may be worth exploring in future work. Specifically, if there is heteroskedasticity in the relationship between PGS and BMI (e.g., phenotypic variance increases for higher values of BMI while PGS variance does not, or at least by a different amount), then that may partially explain the performance differences when stratifying by covariates that have main effects on BMI – somewhat similarly to what is presented in Figure 2 of Mostafavi et al. Such results may imply that similar performance differences could occur when stratifying by the phenotype itself, although this still may not explain differences in PGS effects, and differences in performance when using nonlinear methods (such as in this work and in Figure 4 of Zhu et al.). While we observe discordant effects for certain covariates across datasets, the findings from the correlation analyses use all cohorts and ancestries, and we expect that these differences in effects across datasets may be due to differences in their relationship with BMI across datasets (triglyceride levels may be especially noisy due to their sensitivity to fasting, which may have been controlled for differently across datasets).
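
The heteroskedasticity argument can be made concrete with a toy simulation. This is a sketch under invented assumptions, not the authors' analysis: the effect size (0.5), the two noise levels (1.0 and 2.0) and the helper names are made up for illustration. Two strata share the same PGS effect and the same PGS variance but differ in residual phenotypic variance, so the PGS R² differs between them even though the PGS effect does not.

```python
import random


def pearson_r2(x, y):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy * sxy / (sxx * syy)


def simulate_stratum(n, pgs_effect, noise_sd, rng):
    """Simulate one covariate stratum and return the PGS R² within it.

    A main effect of the covariate would only shift the stratum mean,
    which does not change R² within the stratum, so it is omitted here.
    """
    pgs = [rng.gauss(0.0, 1.0) for _ in range(n)]
    bmi = [pgs_effect * g + rng.gauss(0.0, noise_sd) for g in pgs]
    return pearson_r2(pgs, bmi)


rng = random.Random(0)
# Same PGS effect in both strata; only the residual variance differs.
r2_low = simulate_stratum(20000, pgs_effect=0.5, noise_sd=1.0, rng=rng)
r2_high = simulate_stratum(20000, pgs_effect=0.5, noise_sd=2.0, rng=rng)
```

Under these assumptions the expected R² is 0.25 / (0.25 + 1) = 0.20 in the low-variance stratum and 0.25 / (0.25 + 4) ≈ 0.06 in the high-variance stratum, which is the kind of performance gap that stratification by a covariate with a main effect on BMI could produce.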
  
  Reviewer #2 (Public Review):
  
  This work follows in the footsteps of earlier work showing that BMI prediction accuracy can vary dramatically by context, even within a relatively ancestrally homogenous sample. This is an important observation that is worth the extension to different context variables and samples.
  
  Many of the follow-up analyses commendably try to take us a step further, towards explaining the underlying observed trends of variable prediction accuracy for BMI. Some of these analyses, however, are somewhat confounded and hard to interpret.
  
  For example, many of the covariates which the authors use to stratify the sample by may drive range restriction effects. Further, the covariates considered could be causally affected by genotype and causally affect BMI, with reverse causality effects; other covariates may be partially causally affected by both genotype and BMI, resulting in collider bias. Finally, population structure differences between quintiles of a covariate may drive variable levels of stratification. These can bias estimation and confound interpretation, and at least one of them intuitively seems like a concern for each of the context variables (e.g., the covariates SES, LDL, diet, age, smoking, and alcohol drinking).
  
  The increased prediction accuracy observed with some of the age-dependent prediction models is notable. Despite the clear utility of this investigation, I am not aware of much existing work that shows such improvements for context-aware prediction models (compared to additive/main effect models). I would be curious to see if the predictive utility extends to held-out data from a data set distinct from the UKB, where the model was trained, or whether it replicates when predicting variation within families. Such analyses could strengthen the evidence for these models capturing direct causal effects, rather than other reasons for the associations existing in the UKB sample.
  
  Thank you for the comments. We agree there are certain biases that may be introduced in these analyses. For population structure between quintiles, the analyses are already stratified by ancestry and have the top 5 genetic principal components included, which may help with this issue. In the interaction models we also included separate terms for the PGS of the covariate, which was meant to better capture the environmental component of the covariates and which may partially ameliorate issues of collider bias, as SNPs that causally affect both BMI and the covariate would be partially adjusted for. While range restriction effects could introduce bias, in the correlation analyses the main effects and interaction effects (which were estimated without range restriction) show strong and reproducible correlations with PGS R² differences across datasets.
  
  We agree the increased prediction performance using PGS created directly from GxAge GWAS effects is notable, as it is an essentially “free” performance increase that doesn’t require any new data, and it is likely generalizable to additional covariates. It would be useful to validate its performance in other datasets, especially ones that are outside the 40-69 age range of UKBB.
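
The structure of the interaction models mentioned in this response can be sketched as an ordinary least squares fit with a PGS × covariate term and a separate term for the PGS of the covariate. This is a toy example under stated assumptions, not the authors' pipeline: the simulated effect sizes, the helper names `solve`, `fit_ols` and `design_row`, and the omission of ancestry principal components are all choices made for illustration.

```python
import random


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def fit_ols(X, y):
    """Least-squares coefficients via the normal equations (X'X) b = X'y."""
    p = len(X[0])
    XtX = [[sum(row[i] * row[j] for row in X) for j in range(p)] for i in range(p)]
    Xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    return solve(XtX, Xty)


def design_row(pgs, cov, pgs_cov):
    """Intercept, PGS, covariate, PGS x covariate, PGS of the covariate."""
    return [1.0, pgs, cov, pgs * cov, pgs_cov]


# Simulated data with invented effect sizes (hypothetical, for illustration).
rng = random.Random(1)
TRUE_BETA = [1.0, 0.5, 0.3, 0.2, 0.1]
X, y = [], []
for _ in range(10000):
    pgs = rng.gauss(0.0, 1.0)
    pgs_cov = rng.gauss(0.0, 1.0)
    cov = 0.3 * pgs_cov + rng.gauss(0.0, 1.0)  # covariate is partly genetic
    row = design_row(pgs, cov, pgs_cov)
    X.append(row)
    y.append(sum(b * v for b, v in zip(TRUE_BETA, row)) + rng.gauss(0.0, 1.0))

beta_hat = fit_ols(X, y)  # beta_hat[3] estimates the PGS x covariate effect
```

Including the covariate's own PGS as a separate column means the covariate coefficient is estimated conditional on that genetic component, which is the rationale given above for better capturing the environmental part of the covariate.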
  
  Reviewer #3 (Public Review):
  
  Polygenic scores (PGS), constructed based on genetic effect sizes estimated in genome-wide association studies (GWAS) and used to predict phenotypes in humans, have attracted considerable recent interest in human and evolutionary genetics, and in the social sciences. Recent work, however, has shown that PGSs have limited portability across ancestry groups, and that even within an ancestry group, their predictive accuracy varies markedly depending on characteristics such as the socio-economic status, age, and sex of the individuals in the samples used to construct them and to which they are applied. This study takes further steps in investigating and addressing the latter problem, focusing on body mass index, a phenotype of substantial biomedical interest. Specifically, it quantifies the effects of a large number of covariates and of interactions between these covariates and the PGS on prediction accuracy; it also examines the utility of including such covariates and interactions in the construction of predictors using both standard methods and artificial neural networks. This study would be of interest to investigators that develop and apply PGSs.
  
  I should add that I have not worked on PGSs and am not a statistician, and apologize in advance if this has led to some misunderstandings.
  
  Strengths:
  
  The paper presents a much more comprehensive assessment of the effects of covariates than previous studies. It finds many covariates to have a substantial effect, which further highlights the importance of this problem to the development and application of PGSs for BMI and more generally.
  
  The findings re the relationships between the effects of covariates and interactions between covariates and PGSs are, to the best of my knowledge, novel and interesting.
  
  The development of predictors that account for multiple covariates and their interaction with the PGS is, to the best of my knowledge, novel and may prove useful in future efforts to produce reliable PGSs.
  
  The improvement offered by the predictors that account for PGS and covariates using neural networks highlights the importance of non-linear interactions that are not addressed by standard methods, which is both interesting and likely to be of future utility.
  
  Thank you for the positive feedback.
  
  Weaknesses:
  
  The paper would benefit substantially from extensive editing. It also uses terminology that is specific to recent literature on PGSs, thus limiting accessibility to a broader readership.
  
  The potential meaning of most of the results is not explored. Some examples are provided below:
  • The paper emphasizes that 18/62 covariates examined show significant effects, but this result clearly depends on the covariates included. It would be helpful to provide more detail on how these covariates were chosen. Moreover, many of these covariates are likely to be correlated, making this result more difficult to interpret. Could these questions at least be partially addressed using the predictors constructed using all covariates and their interactions jointly (i.e., with LASSO)? In that regard, it would be helpful to know how many of the covariates and interactions were used in this predictor (I apologize if I missed that).
  • While the relationship between covariate effects and covariate-PGS interaction effects is intriguing, it is difficult to interpret without articulating what one would expect, i.e., what would be an appropriate null.
  • The finding that using artificial neural networks substantially improves prediction over more standard methods is especially intriguing, and highlights the potential importance of non-linear relationships between PGSs and covariates. These relationships remain hidden in a black box, however. Even fairly straightforward analyses, based on using different combinations of the PGS and/or covariates may shed some light on these relationships. For example, analyzing which covariates have a substantial effect on the prediction or varying one covariate at a time for different values of the PGS, etc.
  
  The relationship to previous work should be discussed in greater detail.
  
  Thank you for the comments. Regarding running LASSO with all covariates along with each of their interactions with PGS in one model: upon rereading those sections of the text, we agree that they are a little unclear. However, we had already done something very similar (the related sections have been edited for clarity in our revised manuscript), with these results presented later in the neural network section (second paragraph, S Table 7; those results specifically are not in Figure 5). We only looked at changes in prediction performance, and did not try to interpret the model coefficients. We agree that many of the covariates are probably correlated, but based on the correlation results (Figure 4) it doesn’t seem like any covariate is especially important separately from its effect on BMI itself, i.e., whatever covariates were chosen by LASSO may still not be especially important. This explanation is related to the interpretation of the neural network results, where neural networks improved performance even over linear models with just age and sex and their interactions with PGS as additional covariates, which may suggest that increased performance is coming from nonlinearities apart from multiplicative interaction effects with the PGS. So examining the LASSO coefficients, which still come from a linear model, may not substantially aid in explaining the relationships that increase prediction performance using neural networks (additionally, this analysis may be difficult to replicate since many of the covariates are not present in multiple datasets). Still, such a replication would be valuable in future studies if suitable datasets exist. In terms of the null relationship between covariate main and interaction effects, if they are from the same model they will inherently be correlated, but the main effects from Figure 4 are from a main effects model only. Regarding the other points, the text will be edited for clarity and elaboration on specific topics.
  
URL

medrxiv.org/content/10.1101/2023.05.10.23289777v1
www.biorxiv.org

New submission 26/07/2023, 08:48:27

1
1. Public_Reviews 26 Jul 2023
  
  in eLife
  
  Author Response:
  
  We would like to express our gratitude for the valuable feedback provided by the editors and reviewers. In response to the reviewers' comments, we have outlined a plan to carry out additional experiments to strengthen our paper. Our primary objectives are to explore the developmental roles of both Polr1F and Regnase-1 and to provide further clarification regarding the involvement of Regnase-1 in memory consolidation. We will utilize the newly available Polr1F RNAi transgenes and confirm the efficacy of both the previous and new RNAi lines through quantitative polymerase chain reaction (qPCR) analysis. Additionally, we aim to investigate the impact of Regnase-1 overexpression on sleep and memory consolidation. We will also clarify some points that may not have been clear to reviewers.
  
URL

biorxiv.org/content/10.1101/2023.05.31.543136v1
www.biorxiv.org

New submission 30/10/2022, 18:44:15

1
1. Public_Reviews 25 Jul 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 This paper has significant strengths in taking a rich, quantitative, neurally-grounded approach to the development of human walking. It provides a rich empirical dataset of EMG and kinematic data at this challenging age, as well as sophisticated analyses of these data in terms of motor primitives, which are a concept that has recently been usefully applied to understanding human walking and its development.
 
 STRENGTHS
 
 It builds on emerging literature in this field and adds data at the key age of infancy-toddlerhood.
 
 It takes a longitudinal approach, sampling children at the ages of newborn, 3 months, and newly walking. This is still reasonably rare in developmental research and allows for a powerful, robust interpretation of data: the authors should be commended for taking this approach.
 
 WEAKNESSES
 
 Some aspects of the work could have been more clearly introduced. This includes neural aspects: the location of the CNS control centres at the spinal level, and which higher centres control them (e.g. brainstem); the justification for understanding primitives as modular (no cross-talk or feedback). It also includes developmental aspects: introducing the stepping reflex, and behavioural aspects of infant motor variability (e.g. Adolph, Hoch & Cole, TICS, 2018).
 
 The patterns relate to walking in a stereotypical manner, yet children's walking is full of skips, jumps, and climbs - both in relation to external obstacles and on even ground. Indeed, it is a challenge to get children to 'walk normally' in a lab. Thus, variability is in fact greater than is discussed here and this should be acknowledged.
 
 Thanks for the remarks. We reviewed the introduction and clarified these points. Mainly, we realized that we were not clear enough about the type of variability that we focused on, and added a paragraph at the top of the introduction to clearly define the different types of variability that exist during development and to specify that we only focus on the ability to produce a given coordination mode (for example, alternated leg movements) with various muscle activities (lines 34-44). We also added some details about the neural structures that are known to be involved in modularity in animals (lines 53-58).
 
 The analyses are based on a limited sample of the data. (1) I am not clear on what basis the coders selected cycles, and why 5 cycles were selected. (2) It is not clear why certain movement parameters (cycle duration and flexion/extension proportions) and not others (e.g. step length, double support time) were selected. In particular, it is not clear why the authors focus on temporal, rather than spatial, variability. (3) Some data are based on stepping, and some on kicking. Because it's not clear that these are really equivalent, and because there are small samples of each (n<10), it's not clear that there is enough data to allow us to come to strong conclusions. The sample size should be justified - on the basis of power analyses and/or previous work in this area (e.g. Dominici, Science, n=40). From the results, where p values often hover around p=0.06, the paper seems underpowered to detect a decrease in variability with age for stepping kinematics and primitives.
 
 We initially limited the number of cycles analyzed for each individual and age to 5 in order to make the indexes of variability comparable across individuals and ages. However, as detailed in the general response above, in the new version of the manuscript we preprocessed (i.e. filtered and normalized) a different amount of data for each individual and age (i.e. between 5 and 22 cycles depending on what was available) and we reproduced every analysis of the paper for 5 randomly chosen combinations of 5 cycles when strictly more than 5 cycles were available (i.e. we used a bootstrapping approach, limiting the number of combinations to 5 because of the processing time of the algorithm). Therefore, every result presented in the paper corresponds to an average value computed across these 5 random combinations (except when the number of available cycles was strictly equal to 5), which allowed us to include a different number of steps in the analysis while keeping the indexes comparable across individuals and ages. This raised the number of cycles included in the analysis from 200 to 586.
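
The resampling procedure described in this paragraph can be sketched as follows. It is a minimal illustration under stated assumptions rather than the authors' code: the function name `averaged_index`, the example index (coefficient of variation of cycle duration) and the way random combinations are drawn (independent draws without replacement within each combination) are hypothetical.

```python
import random
from statistics import mean, pstdev


def coefficient_of_variation(durations):
    """Example variability index: SD of cycle duration divided by its mean."""
    return pstdev(durations) / mean(durations)


def averaged_index(cycles, index_fn, n_cycles=5, n_draws=5, seed=0):
    """Average a variability index over random subsets of fixed size.

    With exactly n_cycles cycles the index is computed once; with more,
    it is averaged over n_draws random combinations of n_cycles cycles,
    so every individual contributes an index based on 5 cycles.
    """
    if len(cycles) < n_cycles:
        raise ValueError("not enough cycles for this individual and age")
    if len(cycles) == n_cycles:
        return index_fn(cycles)
    rng = random.Random(seed)
    draws = [index_fn(rng.sample(cycles, n_cycles)) for _ in range(n_draws)]
    return mean(draws)
```

Fixing the subset size keeps the index comparable between an infant with 5 recorded cycles and one with 22, while averaging over several draws lets all available cycles contribute to the estimate.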
 
 We do not present data on step length and double support time because we wanted to apply our analyses to the two behaviors (i.e. stepping and kicking) and there is no step length or support phase in kicking. Moreover, we do not have access to these data. In fact, the available space on the skin of newborns was limited after the EMG sensors had been placed, and we could not place enough 3D markers to analyze step length. Furthermore, we had to record toddlers’ walking in a room that was not equipped with motion capture, so we did not have access to any marker positions at walking onset. As such, we report the kinematic parameters that were available for each behavior, which are stride duration, variability of stride duration and percentage of extension phase. This was clarified in the manuscript (lines 581-583).
 
 As detailed in our general response, we had chosen a very conservative approach which reduced the amount of data presented in the original manuscript. However, we systematically reviewed our data and we now present our analyses on 18 infants, of which 11 stepped at birth, 15 stepped at 3 months old, 15 kicked at birth, 15 kicked at 3 months old, and 15 were recorded at walking onset. Each infant was followed longitudinally and we only present data if they are available for at least two time points. The results were reinforced with this new number of included individuals, and the p-values are stronger (see Table S1, where all the p-values are reported, line 979).
 
 There are some points of interpretation that could have been clearer, for example highlighting how one might distinguish between variability as incidental (motor noise) or purposeful (for exploration); and how studying the time around walking onset can contribute to the broader literature on this topic.
 
 The main result of this study is that the structure of EMG variability evolves during the first year of life, but the origin of this variability (incidental or purposeful) remains unknown. Be it purposeful or incidental, variability might arise at any level of the nervous system (Dhawale et al., 2017), and here we propose that it arises at the level of primitives’ activation during early development. As this is coherent with the fact that different pharmacological or electrical stimuli applied to the spinal cord of neonatal rodents can generate variability (Kiehn and Kjærulff, 1996; Klein et al., 2010), we can hypothesize that such variability could be purposely generated at a supra-spinal level during early development. However, even if it is generated at this level, variability could result from an instability of the command rather than from purposeful explorations. Interestingly, the distinction itself might be challenging, because both types of variability (incidental or purposeful) might contribute to exploration: theoretically, variability might be useful for exploration and learning even if it has not been purposely generated by the individual (Dhawale et al., 2017). As such, we used the animal literature to formulate hypotheses about the origin of this variability, but we are not aware of any protocol that could have helped to discriminate between the two. This was specified in lines 388-389: “As similar neurophysiological investigations cannot be conducted in human infants, discriminating among purposeful and incidental variability remains challenging,”.
 
 The time around walking onset was chosen to match previous literature on the topic (mainly Dominici et al., 2011), but it also matches the period that is increasingly recommended for early therapeutic intervention. This was discussed in lines 469-471: “Overall, when compared with adult values (Figure 3, Figure 5, Table S3), our results suggest an immaturity of the modular system before and around walking onset, which confirms that infancy should be an ideal period of plasticity to benefit from in therapy (Ulrich et al., 2010; Morgan et al., 2021).”. As the age of walking onset is highly variable across infants (Martorell et al., 2006), we also chose to focus on walking onset rather than age in order to standardize recruitment by experience, as EMG variability is known to evolve rapidly after a few months of walking experience (Chang et al., 2006). In the new version of the manuscript, we highlighted this variability by allocating legends according to the age of walking onset (Figure 2, Figure 3 and Figure 5, see Figure 3E detailing this legend).
 
 Reviewer #3 (Public Review):
 
 Hinnekens et al. examined the development of humans' leg movements as they learn to step, kick, and independently walk during infancy. An established theory argues that motor movements can be composed of a finite set of building blocks ("motor primitives"), just like any word can be composed of a finite set of letters. In their paper, Hinnekens et al. follow up this theory by longitudinally recording muscle activations of infants using EMG (at three time points: a few days after birth, at 3 months, and shortly after they learned to walk independently). The authors examined two modules that underlie the infants' stepping and two modules that underlie toddler walking, all based on previous literature. The authors also examined different modules that underlie infants' upright stepping and supine kicking. The authors used supervised machine learning (an advanced version of factor analysis) to identify the modules and to track their change at the different developmental time points. The authors found that trial-to-trial variability in the structure of primitives reduces from newborns to toddlers, even though the number of primitives increased. The authors relate these findings to motor exploration by arguing that newborns generate high variability with a low number of primitives.
 
 The paper has one clear strength - its longitudinal recordings. Unlike most papers in this area of research, the authors follow the same individuals from birth until they learn to walk and the comparison between the use of primitives is done on the same infants. This is certainly novel.
 
 That said, the contribution of the paper to the literature is unclear and it suffers from some critical weaknesses that challenge the current conclusions in the paper, based on the existing data.
 
 1) Although the data is based on longitudinal recordings, and this is certainly desirable, the paper is based only on 10 infants. Moreover, only seven infants contributed supine data at the first time points and only six infants contributed upright data at the different time points. The paper would benefit from a more reliable dataset that includes more infants and time points to compare. To conclude the authors' conclusions, much richer data is required.
 
 As detailed in our general response, we had chosen a very conservative approach that reduced the amount of data presented in the original manuscript. We have since systematically reviewed our data, and we now present our analyses on 18 infants, of whom 11 stepped at birth, 15 stepped at 3 months old, 15 kicked at birth, 15 kicked at 3 months old, and 15 were recorded at walking onset. Each infant was followed longitudinally, and we only present data if they are available for at least two time points. These analyses confirmed the results and yielded stronger p-values (see Table S1, line 979).
 
 2) Relatedly, although the strength of longitudinal data is compared between individuals and has significant insights into individual differences in development, this was not clearly (sometimes not at all) discussed in the paper. The work would benefit from more focus on individual differences and a clear explanation of its contribution to the field from that aspect. The key arguments in the paper focus on the ratio between the number of primitives and the variability in each time point, but none of this from the lens of individual differences. This is challenging to do because there are not many individuals who contribute to the dataset but otherwise, it is not clear what the paper contributes to previous work and more critically.
 
 Thanks for the suggestion. To follow this remark, we modified each figure of the paper so the 18 individuals would each have their own color and could be followed across figures. Also, as the age of walking onset was different across infants, we allocated colors to each infant based on when they started to walk (Figure 2, Figure 3, Figure 5). Moreover, increasing our cohort highlighted some interindividual differences in the development of kicking only between birth and three months old (Figure 3, Figure 5, Table S1). This was discussed in a new paragraph of the discussion (line 469-487).
 
 3) The motivation for the paper is unclear. Why did the authors do what they did? Why is this important to do it the way they did? In the current manuscript, it is not clear why they used this design to get those conclusions.
 
 The main rationale of the paper was to explore a paradox in the literature on early locomotor development: on the one hand, newborn infants produce highly variable muscular activity (Teulier et al., 2012), but on the other hand, authors report that they produce rhythmic movement with a small number of invariant modules (Dominici et al., 2011; Sylos-Labini et al., 2020). As the latter studies were based on averaged or single-step data, our main goal was to assess both EMG variability and features of modularity in the same cohort, in an attempt to refute or explain this paradox. We reviewed and clarified the introduction on this matter by clarifying the place of our study among the broader literature on variability in development (lines 34-44) and by deepening explanations about the abovementioned paradox in relation to previous studies on infants’ modularity (lines 72-96).
 
 4) The data selection process is also not clear. At each time point and from each infant, the authors examined 5 cycles from the same leg. The definition of a cycle was hip-flexion onset to another hip-flexion onset on one side of hip extension. It is not clear what variability (measured by % of the cycle in flexion and extension) means in this case because infants hold their legs in one position for a long time. What are those 5 cycles? Why five? A lot of information is missing there about the arbitrary selection of analytic parameters. In addition, the authors argue they performed the same analyses with different parameters and that they got similar results. However, those results are not given in detail and it is hard to compare them with the authors' report.
 
 We entirely reviewed our data, and fewer selection criteria were applied in the current manuscript. Here is the complete data selection process:
 
 Among the 18 infants that we followed from birth on, 15 were followed until walking onset (among the other three, one had moved and the other two could not be seen around walking onset because of the COVID-19 pandemic). Around birth and at three months old, we tried to elicit stepping in each infant (by holding the infant in an upright position with his feet above a surface) and kicking (by placing the infant in a supine position). We then systematically analyzed each video from every infant and every age to spot and count every alternated leg movement within the two behaviors. After this step, we checked the quality of EMG data for the 10 muscles that were recorded. As our analysis has to be based on the same number of muscles for each individual, if the quality of the signal was too low for even one muscle during a leg cycle, the cycle had to be removed from the analysis. After this check, if fewer than 5 alternated leg cycles were available, the whole trial was removed from the analysis. The rationale is that the hypotheses that we tested were mainly about intra-individual variability, and therefore analyses had to be based on a minimal number of cycles. In newborns this was particularly challenging because we were limited in recording time (1 to 2 minutes); moreover, we did not always have good-quality EMG data because we always reduced the amount of adhesive surface on the skin for ethical reasons. Therefore, for several babies we could not observe enough cycles to include them in the analysis. However, in the current version of the manuscript we present data on 11 babies for newborn stepping, 15 babies for 3 mo stepping, 15 babies for newborn kicking, 15 babies for 3 mo kicking, and 15 babies for toddler walking. The trials that were not included are grey in Table 1 of this document. For every other trial, the exact number of remaining cycles is reported in the same table.
 
 In the previous version of the manuscript, as we wanted the indexes to be comparable across individuals and ages, we had systematically analyzed 5 cycles that were randomly chosen among the available ones. However, this created data loss. As detailed in the general response above, to be less selective and to include every available cycle, we now rely on a new approach: if more than 5 cycles were available, we computed every variable of the study 5 times (for 5 random combinations of 5 cycles that were randomly chosen among every available cycle). The variables were averaged afterwards. Thanks to this new approach, 586 cycles are now included in the analysis, which confirmed the robustness of our findings.
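
 The resampling rule described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function name `resampled_metric`, the `metric` callback, and the fixed seed are hypothetical, and the handling of trials with exactly 5 cycles (computed once, without resampling) is an assumption that the response does not spell out.

```python
import random
from statistics import mean


def resampled_metric(cycles, metric, n_draws=5, subset_size=5, seed=0):
    """Average `metric` over random subsets of the available cycles.

    `cycles` is a list of per-cycle data and `metric` is any function
    mapping a list of cycles to a number (both hypothetical here).
    """
    if len(cycles) < subset_size:
        # Mirrors the exclusion rule: too few cycles, the trial is dropped.
        raise ValueError("trial excluded: fewer cycles than subset_size")
    if len(cycles) == subset_size:
        # Assumption: with exactly `subset_size` cycles there is one subset.
        return metric(cycles)
    rng = random.Random(seed)
    # Draw `n_draws` random subsets, compute the metric on each, then average.
    values = [metric(rng.sample(cycles, subset_size)) for _ in range(n_draws)]
    return mean(values)


if __name__ == "__main__":
    # Toy data: each "cycle" is reduced to a single number for illustration.
    toy_cycles = [0.8, 1.1, 0.9, 1.4, 1.0, 1.2, 0.7]
    spread = lambda subset: max(subset) - min(subset)
    print(resampled_metric(toy_cycles, spread))
```

 Keeping the subset size fixed at 5 means every index is computed on the same number of cycles for every infant and age, so a trial with more cycles contributes a more stable estimate without changing the scale of the metric.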
 
 Infants can indeed hold their legs in one position for a long time, but all of our results were obtained after having normalized each phase of flexion or extension to a given number of time points (see Figure 6, Temporal normalization). Our results were also verified with a different temporal normalization, directly normalizing the cycle instead of the phases. We chose not to report more results in the main text for the overall readability of the paper, but here is the same table of p-values as in the appendix of the paper, with a normalization based on cycles instead of phases.
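
 The difference between the two temporal normalizations can be sketched as below. This is an illustrative sketch only: linear interpolation, the function names, and the number of output points are assumptions, since the response does not state how the resampling was implemented.

```python
def resample(signal, n_points):
    """Linearly interpolate `signal` onto `n_points` evenly spaced samples."""
    if len(signal) < 2 or n_points < 2:
        raise ValueError("need at least two samples and two output points")
    last = len(signal) - 1
    out = []
    for k in range(n_points):
        pos = k * last / (n_points - 1)   # fractional index into `signal`
        i = min(int(pos), last - 1)       # left neighbour, clamped at the end
        frac = pos - i
        out.append(signal[i] * (1 - frac) + signal[i + 1] * frac)
    return out


def normalize_by_phase(signal, flexion_end, n_per_phase):
    """Resample flexion and extension separately to the same length.

    `flexion_end` is the (hypothetical) index where flexion ends and
    extension begins; that sample is shared by both phases.
    """
    flexion = signal[: flexion_end + 1]
    extension = signal[flexion_end:]
    return resample(flexion, n_per_phase) + resample(extension, n_per_phase)


def normalize_by_cycle(signal, n_total):
    """Resample the whole cycle at once, ignoring the phase boundary."""
    return resample(signal, n_total)


if __name__ == "__main__":
    # A cycle with a short flexion (3 samples) and a long extension (9 samples).
    cycle = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
    print(normalize_by_phase(cycle, flexion_end=2, n_per_phase=5))
    print(normalize_by_cycle(cycle, n_total=10))
```

 With phase-based normalization, a leg held in extension for a long time still occupies the same number of points as a brief flexion, which is why a long hold does not dominate the normalized cycle.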
 
 5) The recording times are not common across individuals. One newborn was recorded after 1 day and the other after 21 days. Not sure this is comparable, especially if the main contribution of the paper is the longitudinal data. Moreover, the second recording was conducted between 74 days to 122 days. This range is too broad. Same for the third time point - one walk onset is not reported, some infants were recorded at <380 days and some >500 days. This difference challenges the reliability of the data.
 
 Given the high inter-individual variability in the age of walking onset (Martorell et al., 2006), it is often a challenge in developmental sciences to choose between standardizing recruitment according to age or according to experience. In the present study, we chose to recruit toddlers of similar experience (i.e. around walking onset) rather than of similar age, because motor variability is known to depend strongly on experience, in particular regarding EMG data during the first months of walking (Chang et al., 2006). However, we agree with the reviewer that the age of walking onset is an important source of inter-individual variability, and therefore we modified each figure of the paper so that the 18 individuals each have their own color, ordered and allocated according to the age of walking onset (see Figure 3E detailing this legend).
 
 Regarding the other recording points, and the experience after walking onset, the recording time can indeed vary across individuals despite our efforts during data collection to prevent this. The main reasons were benign illnesses of infants or work constraints for parents that led to postponed appointments. However, we report the precise age of each infant for each recording as well as the individual data underlying each global figure (see source data of Figure 2, Figure 3 and Figure 5). Based on these data, we checked that the individuals who were recorded later than the others (for example, subject 1 and subject 14, who were recorded at 21 days for the 1st time point) did not show aberrant values.
 
 6) Conceptually, I'm not sure I understand why the authors selected leg alternation (and not other types of movements) as their modules. I was not convinced that leg alternations reflect their real-life locomotor experience (e.g., short bouts in all directions), and therefore the variability measured in this work does not reflect the variability of infants' natural locomotor behaviour.
 
 We fully agree that leg alternation does not reflect the whole variability that underlies the real-life locomotor experience of these infants. However, we did not intend to focus here on all the variability that exists during development, but more specifically on the variability that makes it possible to produce a given type of behavior with different inputs. This variability is interesting to study because infants tend to use steadier and steadier patterns of coordination to produce a given movement (Teulier et al., 2012), suggesting that they explore among different possible muscular associations before choosing some. As we wanted to study this phenomenon, it appeared methodologically pertinent to fix other sources of variability (i.e. to study different behaviors separately and to study only one coordination mode), as is commonly done in other EMG-based studies of the field (Dominici et al., 2011; Sylos-Labini et al., 2020; Teulier et al., 2012). This choice allowed us to remain comparable between infants and toddlers. Indeed, while infants produce numerous coordinative patterns while stepping or kicking, such as parallel cycles or single cycles, toddlers only produce alternated flexion and extension cycles of the lower limb when walking. Therefore, by selecting only alternating cycles of flexion and extension in infants, we ensure that the differences in variability that we observe between ages are not due to the ability to produce various movements, but really due to the ability to produce a given movement with various muscle outputs. Accordingly, and following our results, it allows us to conclude that the structure of the variability used to command a given movement evolves between birth and independent walking. To explain this notion, which relates to the redundancy of motor control, we added a new paragraph at the top of the introduction to better explain the place of our study among the broader literature on infant variability (lines 34-44). We also clearly wrote in the discussion that our conclusions did not apply to every developmental source of variability (lines 393-395): “As we observed such structure within alternated leg movements, other studies are needed to explore the extent of these results to other early behaviors or coordination modes”.
 
 7) There is not enough rationale for why the specific measurements (IEV, VAF, IRV, etc.) were used and why those are the appropriate ones to address the questions in the paper. What is the justification for using those measurements?
 
 As our main goal is to characterize how EMG variability can be generated in a modular system, we defined those metrics as directly representative of the different features that we wanted to study: variability of the EMG output, dimensionality of the underlying modular organization, variability of module activations and selectivity of the command (be it through module activations or within module themselves). While VAF is commonly used in muscle synergies studies, this study is the first to explore how cycle-to-cycle variability could be generated in a modular system, and therefore these indexes were defined for its proper needs. As such to clarify their role to a broad audience we included a new table at the beginning of the Results section (see Table 1 of the ms, line 176).
 
 8) Some of the conclusions, especially those that relate to motor exploration, are not based on sufficient data. Motor exploration was not explicitly measured in this study, and how motor exploration is reflected by the current data and analyses is not clear.
 
 We fully agree with the reviewer: while we observed that the structure of EMG variability evolves during the first year of life, the origin of this variability (incidental or purposeful) remains unknown. This was specified line 388-389 “As similar neurophysiological investigations cannot be conducted in human infants, discriminating among purposeful and incidental variability remains challenging,”.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.05.05.490063v1
www.biorxiv.org www.biorxiv.org

New submission 25/07/2023, 10:46:12

1
1. Public_Reviews 25 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Summary
  
  While DNA sequence divergence, differential expression, and differential methylation analysis have been conducted between humans and the great apes to study changes that "make us human", the role of lncRNAs and their impact on the human genome and biology has not been fully explored. In this study, the authors computationally predict HSlncRNAs as well as their DNA Binding sites using a method they have developed previously and then examine these predicted regions with different types of enrichment analyses. Broadly, the analysis is straightforward and after identifying these regions/HSlncRNAs the authors examined their effects using different external datasets.
  
  Strengths/weaknesses
  
  By and large, the analysis performed is dependent on their ability to identify HSlncRNAs and their DBS. I think that they have done a good job of showing the performance metrics of their methods in previous publications. Thereafter, they perform a series of enrichment-type analyses that have been used in the field for quite a while now to look at tissue-specific enrichment, or region-specific enrichment, or functional enrichment, and I think these have been carried out well. The authors achieved the aims of their work. I think one of the biggest contributions that this paper brings to the field is their annotation of these HSlncRNAs. Thus a major revisionary effort could be spent on applying their method to the latest genomes that have been released so that the community could get a clean annotation of newly identified HSlncRNAs (see comment 2).
  
  Comments
  
  1) Though some of their results about certain HSlncRNAs having DBSs in all genes is rather surprising/suspicious, I think that broadly their process to identify and validate DBSs is robust, they have multiple lines of checks to identify such regions, including functional validation. These predictions are bound to have some level of false positive/negative rate and it might be nice to restate those here and on what experiment/validation data these were conducted. However, the rest of their analysis comprises different types of enrichment analysis which shouldn't be affected by outlier HSlncRNAs if indeed their FPR/FNR are low.
  
  2) There are now several new genomes available as part of the Zoonomia consortium and 240 Primate consortium papers released. These papers have re-examined some annotations such as Human Accelerated Regions (HARs) and found with a larger dataset as well as better reference genomes, that a large fraction of HARs were actually incorrectly annotated - that is that they were also seen in other lineages outside of just the great apes. If these papers have not already examined HSlncRNAs, the authors should try and re-run the computational predictions with this updated set and then identify HSlncRNAs there. This might help to clarify their signal and remove lncRNAs that might be present in other primates but are somehow missing in the great apes. This might also help to mitigate some results that they see in section 3 of their paper in comparing DBS distances between archaics and humans.
  
  3) The differences between the archaic hominins in their DBS distances to modern humans are a bit concerning. At some level, we expect these to be roughly similar when examining African modern humans and perhaps the Denisovan being larger when examining Europeans and Asians, but they seem to have distances that aren't expected given the demography. In addition, from their text for section 3, they begin by stating that they are computing two types of distances but then I lost track of which distance they were discussing in paragraph 3 of section 3. Explicitly stating which of the two distances in the text would be helpful for the reader.
  
  (1) According to Figure 1A (based on Meyer et al., 2012, Prüfer et al., 2017, and Prüfer et al., 2013), the phylogenetic distance from modern humans to Denisovan is shorter than the distance to Altai Neanderthal. However, also according to these studies, the branch of Denisovan is more remote from modern humans than that of Altai Neanderthal. Thus, it is not unreasonable to find that 2514 and 1256 DBSs have distances > 0.034 in genes in Denisovans and Altai Neanderthals, respectively. Probably, both the phylogenetic distances and the DBS distances depend considerably on the sampled genomes of Altai Neanderthals and Denisovans, who lived on the earth for quite a long time. When new samples are obtained, these distances may change somewhat.
  
  (2) Regarding “they are computing two types of distances but then I lost track of which distance they were discussing in paragraph 3 of section 3”, the second type of distances were discussed in section 3, and the distances computed in the first way were not further analyzed because “This defect may be caused by that the human ancestor was built using six primates without archaic humans”.
  
  4) Isn't the correct control to examine whether eQTLs are more enriched in HSlncRNA DBSs a set of transcription factor binding sites? I don't think using just promoter regions is a reasonable control here. This does not take away from the broader point however that eQTLs are found in DBSs and I think they can perform this alternate test.
  
  Indeed, the TF-TFBS and lncRNA-DBS relationships are comparable, and which one contains more QTLs is an interesting question. In this sense, it is reasonable to use TFBSs as the control. However, for three reasons, we did not perform the comparison using TFBSs as the control. First, most TFBSs are predicted by varied methods, which raises concerns about the reliability of comparing two sets of predictions. Second, most QTLs in DBSs are mQTLs but most QTLs in TFBSs are eQTLs. Third, a greater portion of TFBSs than DBSs are probably not in promoters, and the running time of LongTarget made us unable to predict DBSs truly genome-wide. Nevertheless, this is an interesting question that deserves further exploration.
  
  5) In the discussion, they highlight the evolution of sugar intake, which I'm not sure is appropriate. This comes not from GO enrichment but rather from a few genes that are found at the tail of their distribution. While these signals may be real, the evolution of traits is often highly polygenic and they don't see this signal in their functional enrichment. I suggest removing that line. Moreover, HSlncRNAs are ones that are unique across a much longer time frame than the transition to agriculture, which is when sugar intake rose greatly. Thus, enrichment for something that arose in the past 6000-7000 years is unlikely to appear in an annotation that is designed to detect human-chimp or human-neanderthal level divergence.
  
  Multiple sugar metabolism-related pathways, including “glucose homeostasis” and “glucose metabolic process”, are found to be enriched only in Altai Neanderthal but not in chimpanzees (Figure 2). Indeed, HS lncRNAs span a much longer time frame than the transition to agriculture. However, given that apes and monkeys know how to pick ripe, sugar-rich fruits at the right time and place, we conjecture that archaic humans, as hunter-gatherers, could effectively explore natural sugars.
  
  Reviewer #2 (Public Review):
  
  Lin et al. attempt to examine the role of lncRNAs in human evolution in this manuscript. They apply a suite of population genetics and functional genomics analyses that leverage existing data sets and public tools, some of which were previously built by the authors, who clearly have experience with lncRNA binding prediction. However, I worry that there is a lack of suitable methods and/or relevant controls at many points and that the interpretation is too quick to infer selection. While I don't doubt that lncRNAs contribute to the evolution of modern humans, and certainly agree that this is a question worth asking, I think this paper would benefit from a more rigorous approach to tackling it.
  
  At this point, my suggestions are mostly focused on tightening and strengthening the methods; it is hard for me to predict the consequence of these changes on the results or their interpretation, but as a general rule I also encourage the authors to not over-interpret their conclusions in terms of what phenotype was selected for when as they do at certain points (eg glucose metabolism).
  
  I note some specific points that I think would benefit from more rigorous approaches, and suggest possible ways forward for these.
  
  1) Much of this work is focused on comparing DNA binding domains in human-unique long-noncoding RNAs and DNA binding sites across the promoters of genes in the human genome, and I think the authors can afford to be a bit more methodical/selective in their processing and filtering steps here. The article begins by searching for orthologues of human lncRNAs to arrive at a set of 66 human-specific lncRNAs, which are then characterised further through the rest of the manuscript. Line 99 describes a binding affinity metric used to separate strong DBS from weak DBS; the methods (line 432) describe this as being the product of the DBS or lncRNA length times the average Identity of the underlying TTSs. This multiplication, in fact, undoes the standardising value of averaging and introduces a clear relationship between the length of a region being tested and its overall score, which in turn is likely to bias all downstream inference, since a long lncRNA with poor average affinity can end up with a higher score than a short one with higher average affinity, and it's not quite clear to me what the biological interpretation of that should be. Why was this metric defined in this way?
  
  Length is an important metric of DBS, but it has a defect – a triplex of 100 bp may have 50% or 70% of nucleotides bound; in the two situations, the binding affinity of DBD and DBS is very different.
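
  The reviewer's concern about the length-times-identity score can be made concrete with a short worked example. The lengths and identity values below are hypothetical numbers chosen for illustration; only the product form of the score is taken from the review's description of the methods.

```python
def affinity(length_bp, mean_identity):
    """Score as described in the review: region length times mean identity."""
    return length_bp * mean_identity


# Hypothetical sites: a long, weakly matching one and a short, strongly matching one.
long_weak = affinity(length_bp=200, mean_identity=0.5)
short_strong = affinity(length_bp=100, mean_identity=0.9)

# The long site scores higher despite its lower per-base identity.
print(long_weak, short_strong, long_weak > short_strong)
```

  Under this definition the score behaves like an estimate of the number of matched bases, so it rewards length and identity jointly rather than identity alone.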
  
  2) There is also a strong assumption that identified sites will always be bound (line 100), which I disagree is well-supported by additional evidence (lines 109-125). The authors show that predicted NEAT1 and MALAT1 DBS overlap experimentally validated sites for NEAT1, MALAT1, and MEG3, but this is not done systematically, or genome-wide, so it's hard to know if the examples shown are representative, or a best-case scenario.
  
  More details are described in the citation Wen et al. 2022. We will put the sites into Supplementary Tables in the revised version.
  
  It's also not quite clear how overlapping promoters or TSS are treated - are these collapsed into a single instance when calculating genome-wide significance? If, eg, a gene has five isoforms, and these differ in the 3' UTR but their promoter region contains a DBS, is this counted five times, or one? Since the interaction between the lncRNA and the DBS happens at the DNA level, it seems like not correcting for this uneven distribution of transcripts is likely to skew results, especially when testing against genome-wide distributions, eg in the results presented in sections 5 and 6. I do not think that comparing genes and transcripts putatively bound by the 40 HS lncRNAs to a random draw of 10,000 lncRNA/gene pairs drawn from the remaining ~13500 lncRNAs that are not HS is a fair comparison. Rather, it would be better to do many draws of 40 non-HS lncRNAs and determine an empirical null distribution that way, if possible actively controlling for the overall number of transcripts (also see the following point).
  
  (1) If, say, three transcripts of a gene share the same promoter region (i.e., they have the same TSS) and differ only in the 3’UTR, the promoter region was used to predict DBSs only once. Otherwise, if the three transcripts have different TSSs, the three promoter regions were each used to predict DBSs.
  
  (2) A gene may have many DBSs if it has many transcripts, or few ones if it has just a few transcripts. We did not correct for this uneven distribution of transcripts, because our GTEx analysis was on the transcript level; it is well recognized that transcripts of the same gene can be expressed in different tissues.
  
  (3) We randomly sampled a pair of one non-HS lncRNA and one transcript 10000 times (i.e., 10000 pairs). It is a fair point that multiple draws of 40 non-HS lncRNAs should be made to make the statistics more robust.
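
  The reviewer's suggestion of repeated draws of 40 non-HS lncRNAs can be sketched as an empirical null test. This is a hedged illustration, not the authors' workflow: the function name, the `statistic` callback, the number of draws, and the add-one p-value estimator are all assumptions introduced here.

```python
import random


def empirical_p_value(observed, null_pool, statistic,
                      set_size=40, n_draws=1000, seed=0):
    """One-sided empirical p-value from repeated random draws.

    `null_pool` stands in for the non-HS lncRNAs, and `statistic` is any
    function that summarizes a drawn set as a number (both hypothetical).
    """
    rng = random.Random(seed)
    at_least_as_extreme = 0
    for _ in range(n_draws):
        draw = rng.sample(null_pool, set_size)   # sample without replacement
        if statistic(draw) >= observed:
            at_least_as_extreme += 1
    # Add-one estimator so that the p-value is never exactly zero.
    return (at_least_as_extreme + 1) / (n_draws + 1)


if __name__ == "__main__":
    # Toy pool: each lncRNA is represented by a made-up number of target genes.
    rng = random.Random(42)
    pool = [rng.randint(0, 500) for _ in range(13500)]
    mean_targets = lambda items: sum(items) / len(items)
    print(empirical_p_value(observed=320.0, null_pool=pool, statistic=mean_targets))
```

  Drawing sets of the same size as the HS set keeps the null comparable to the observed statistic; controlling for transcript number, as the reviewer suggests, would require matching the draws on that covariate as well.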
  
  3) Thresholds for statistical testing are not consistent, or always well justified. For instance, in line 142 GO testing is performed on the top 2000 genes (according to different rankings), but there's no description of the background regions used as controls anywhere, or of why 2000 genes were chosen as a good number to test? Why not 1000, or 500? Are the results overall robust to these (and other) thresholds? Then line 190 the threshold for downstream testing is now the top 20% of genes, etc. I am not opposed to different thresholds in principle, but they should be justified.
  
  The over-representation analysis using g:Profiler was performed taking the whole genome as the background. Analyzing more DBSs (especially weak DBSs) would generate more results, but the results could be less reliable. Thus, there is a trade-off between analyzing fewer DBSs with relatively high reliability and analyzing more DBSs with relatively low reliability. Inevitably, the handling of this trade-off is somewhat subjective, and a careful comparison of the two classes of DBSs could be an independent question. Although weak DBSs were not systematically analyzed, the results from the strong DBSs undoubtedly suggest that HS lncRNAs have contributed greatly to human evolution.
  
  Likewise, comparing Tajima's D values near promoters to genome-wide values is unfair, because promoters are known to be under strong evolutionary constraints relative to background regions; as such it is not surprising that the results of this comparison are significant. A fairer comparison would attempt to better match controls (eg to promoters without HS lncRNA DBS, which I realise may be nearly impossible), or generate empirical p-values via permutation or simulation.
  
  We examined Tajima’s D in DBSs (Supplementary Figure 9) and in HS lncRNA genes (Supplementary Figure 18). In both cases, we compared the Tajima’s D values with the genome-wide background.
  
  4) There are huge differences in the comparisons between the Vindija and Altai Neanderthal genomes that to me suggest some sort of technical bias or the such is at play here. e.g. line 190 reports 1256 genes to have a high distance between the Altai Neanderthal and modern humans, but only 134 Vindija genes reach the same cutoff of 0.034. The temporal separation between the two specimens does not seem sufficient to explain this difference, nor the difference between the Altai Denisovan and Neanderthal results (2514 genes for Denisovan), which makes me wonder if it is a technical artefact relating to the quality of the genome builds? It would be worth checking.
  
  We used the same workflow (and the same cutoff of 0.034) to analyze the Vindija and Altai Neanderthals and the Denisovan. If a smaller cutoff were used, one would see more Vindija genes. The question again is that there is a trade-off. Analyzing the epigenome and epigenetic regulation in archaic genomes is an interesting direction, and many more studies are needed before related parameters and cutoffs can be set more reasonably.
  
  5) Inferring evolution: There are some points of the manuscript where the authors are quick to infer positive selection. I would caution that GTEx contains a lot of different brain tissues, thus finding a brain eQTL is a lot easier than finding a liver eQTL, just because there are more opportunities for it. Likewise, claims in the text and in Tables 1 and 2 about the evolutionary pressures underlying specific genes should be more carefully stated. The same is true when the authors observe high Fst between groups (line 515), which is only one possible cause of high Fst - population differentiation and drift are just as capable of giving rise to it, especially at small sample sizes.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.31.543169v2
www.biorxiv.org www.biorxiv.org

New submission 25/07/2023, 09:45:58

1
1. Public_Reviews 25 Jul 2023
  
  in eLife
  
  Author Response
  
  The primary concern of Reviewer 1 is that Ne might affect gBGC and hence GC, and this might act as a confounding effect. The reviewer suggests that we should investigate how gBGC (with GC presumably as its proxy) might affect CAIS, and to what extent any relationship here could explain the relationship between CAIS and body mass. We believe that we have already dealt with this both in Supplementary Figure S5A (where we regret having inserted the wrong figure panel, a mistake we will correct), and its PIC-corrected counterpart in S5B. These two panels show (or will show) that CAIS is not correlated with GC. Note that we expect our genomic-GC-based codon usage expectations to reflect unchecked gBGC in an average genomic region, independently of whether that species has high or low Ne. Our working model is that mutation biases, including but not limited to the strength of gBGC, vary among species, and that they rather than selection determine each species’ genome-wide %GC. By correcting for genome-wide %GC, our CAIS thus corrects for mutation bias, in order to isolate the effects of selection.
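
  One way to picture a genomic-GC-based codon usage expectation is sketched below. This is an assumed, simplified model for illustration only and may differ from the authors' actual calculation: each base is treated as independent, G and C each occur with probability GC/2, A and T each with probability (1 - GC)/2, and probabilities are renormalized among the synonymous codons of one amino acid.

```python
def codon_probability(codon, gc):
    """Probability of a codon if bases are drawn independently at GC content `gc`."""
    p = 1.0
    for base in codon:
        p *= gc / 2 if base in "GC" else (1 - gc) / 2
    return p


def expected_synonymous_frequencies(codons, gc):
    """Expected relative usage of synonymous codons from GC content alone."""
    raw = {codon: codon_probability(codon, gc) for codon in codons}
    total = sum(raw.values())
    return {codon: value / total for codon, value in raw.items()}


if __name__ == "__main__":
    lysine = ["AAA", "AAG"]
    # At 50% GC both codons are expected equally often; a higher GC
    # content shifts the expectation toward the G-ending codon.
    print(expected_synonymous_frequencies(lysine, gc=0.5))
    print(expected_synonymous_frequencies(lysine, gc=0.6))
```

  Codon usage that departs from such a GC-only expectation is then a candidate signal of selection rather than of mutation bias, which is the logic of correcting for genome-wide %GC described above.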
  
  Reviewer 1 also suggests that we examine the relationship between gene expression and GC-corrected RSCU, as we would expect codon adaptation to be stronger in more highly expressed genes, as was previously shown for the non-GC-corrected CAI metric (Sharp et al 1987). Correlations with gene expression are outside the scope of the current work, which is focused on producing a single value of codon adaptation per species. It is indeed possible that our general approach could be useful in future work investigating differences among genes.
  
  One key difference between our work and that of Galtier et al. 2018 is that our approach does not rely on identifying specific codon preferences per species. Our approach thus remains appropriate even in scenarios where, for example, different cell types, different environmental conditions, and/or different genes have different codon preferences (Gingold et al. 2014 https://doi.org/10.1016/j.cell.2014.08.011). At a high level, our results are in broad agreement with those of Galtier et al., 2018, who found that gBGC affected all animal species, regardless of Ne, and who, like us, found that the degree of selection on codon usage depended on Ne. Through use of a more sensitive methodology, we believe we have expanded our ability to detect codon adaptation into animals of somewhat higher Ne than in previous work.
  
  We thank Reviewer 2 for explicitly laying out the math that was implicit in our Figures 1 and 2. In our revisions, we will more clearly acknowledge that the per-site codon adaptation bias depicted in Figure 1 has limited sensitivity to s*Ne. We believe our approach worked despite this because the phenomenon is driven by what is shown in Figure 2. I.e., where Ne makes a difference is by determining the proteome-wide fraction of codons subject to significant codon adaptation, rather than by determining the strength of codon adaptation at any particular site or gene.
  
  Simulated datasets would be great, but we think them a nice addition rather than a must-have, in particular because we are skeptical about whether our understanding of all relevant processes is good enough for simulations to add much to our more heuristic argument along the lines of Figure 2. For example, we believe the complications documented by Gingold et al. 2014 cited above are pertinent, but incorporating them into simulations would require a complex set of assumptions.
  
  In response to the final comment of reviewer 2, the reason that we hard-coded genome-wide %GC values is that we took them from the previous study of James et al. (2023) https://doi.org/10.1093/molbev/msad073. As summarized in the manuscript, genome-wide %GC was a byproduct of a scan conducted in that work, of all six reading frames across genic and intergenic sequences available from NCBI with access dates between May and July 2019. The code used in the current work to calculate the intergenic %GC, as well as that used to calculate amino acid frequencies, is located at https://github.com/MaselLab/Codon-Adaptation-Index-of-Species. We agree that more user-friendly tools would be useful, but producing robust tools falls outside the scope of the current manuscript.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.02.530449v1
www.biorxiv.org www.biorxiv.org

New submission 25/07/2023, 09:42:15

1
1. Public_Reviews 25 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  eLife assessment
  
  This important study addresses both the native role of the Plasmodium falciparum protein PfFKBP35 and whether this protein is the target of FK506, an immunosuppressant with antiplasmodial activity. The genetic evidence for the essentiality of FKBP35 in parasite growth is compelling. However, the conclusion that the role of FKBP35 is to secure ribosome homeostasis and the claim that FK506 exerts its antimalarial activity independently of FKBP35 rely on incomplete evidence.
  
  We thank the Reviewers and Editors for their careful evaluation of our manuscript and the constructive criticism. We realized that some of our conclusions may be regarded/misunderstood as overstatements. This was by no means our intention and we apologize for the unnecessary inconvenience. The phenotype of FKBP35 knock-out parasites clearly centers on failing ribosomes and protein synthesis, which in our opinion, provides an important leap towards understanding the role of this drug target in P. falciparum biology. It is however correct that, at this point, we can only make evidence-based hypotheses about direct interaction partners and we will emphasize this more clearly in a revised version of the manuscript. In order to prevent misinterpretation of our work, and as detailed in the point-by-point responses to the reviewer comments, we propose changing the manuscript title to “Genetic validation of PfFKBP35 as an antimalarial drug target”. To address the criticism regarding the effects of FK506, we will perform specific additional experiments. We are convinced that this new data set will resolve any remaining ambiguities and allow for a conclusive assessment of FK506 drug activity in P. falciparum.
  
  Reviewer #1 (Public Review):
  
  In this study, the authors investigate the biological function of the FK506-binding protein FKBP35 in the malaria-causing parasite Plasmodium falciparum. Like its homologs in other organisms, PfFKBP35 harbors peptidyl-prolyl isomerase (PPIase) and chaperoning activities, and has been considered a promising drug target due to its high affinity to the macrolide compound FK506. However, PfFKBP35 has not been validated as a drug target using reverse genetics, and the link between PfFKBP35-interacting drugs and their antimalarial activity remains elusive. The manuscript is structured in two parts addressing the biological function of PfFKBP35 and the antimalarial activity of FK506, respectively.
  
  The first part combines conditional genome editing, proteomics and transcriptomics analysis to investigate the effects of FKBP35 depletion in P. falciparum. The work is very well performed and clearly described. The data provide definitive evidence that FKBP35 is essential for P. falciparum blood stage growth. Conditional knockout of PfFKBP35 leads to a delayed death phenotype, associated with defects in ribosome maturation as detected by quantitative proteomics and stalling of protein synthesis in the parasite. The authors propose that FKBP35 regulates ribosome homeostasis but an alternative explanation could be that changes in the ribosome proteome are downstream consequences of the abrogation of FKBP35 essential activities as chaperone and/or PPIase. It is unclear whether FKBP35 has a specific function in P. falciparum as compared to other organisms. The knockdown of PfFKBP35 has no phenotypic consequence, showing that very low amounts of FKBP35 are sufficient for parasite survival and growth. In the absence of quantification of the protein during the course of the experiments, it remains unclear whether the delayed death phenotype in the knockout is due to the delayed depletion of the protein or to a delayed consequence of early protein depletion. This limitation also impacts the interpretation of the drug assays.
  
  We thank the Reviewer for the compliments regarding our experimental setup and the clarity of our manuscript. We agree that the link between FKBP35 knock-out and ribosome homeostasis is indirect and we now emphasize this more clearly in the revised manuscript. To prevent a general misinterpretation of our manuscript, we will adapt the title accordingly.
  
  We would still like to reiterate that the phenotype of FKBP35 knock-out parasites is best described by their defects in maintaining functional ribosomes. It is for several reasons that we believe the links between FKBP35 and ribosome function are purely evidence driven: First, pre-ribosomal and nucleolar factors are the first proteins (in generation 1 schizonts) to be affected upon knock-out of fkbp35 (Figure 2A, Table S1). We realized that Figure 2A falls short in showing this observation, which is why we will update the figure accordingly. Second, the dysregulation of ribosomal factors and the general stall in protein synthesis dominate the phenotype of FKBP35 knock-out parasites in generation 2. We thus believe it is appropriate to say that knock-out cells are most likely killed in response to defective ribosome maintenance – which is a consequence of reduced FKBP35 levels. We are aware that our experiments (and possibly any other reverse genetics approach) cannot rule out that FKBP35 affects ribosomal factors indirectly. Clearly, more work is required to disentangle this question in more detail in the future.
  
  We agree with the Reviewer that it is not possible to tell if the delayed death-like phenotype is due to a “delayed protein depletion”. We would however like to note that the DiCre/loxP approach allows for an immediate knock-out at the genome level and is thus as precise as possible. Further, in addition to the substantial depletion of FKBP35 in knock-out cells during the phenotypically silent generation, knocking out of fkbp35 at earlier time points (TPs 24-30 and 34-40 hpi in the preceding generation) resulted in the very same phenotype cycle (Figure 1). Here, parasite death was delayed substantially longer, i.e. more than one complete cycle. Together with the dysregulation of early ribosome maturation in generation 1, these findings point towards a delayed death phenotype. It is of course still possible to explain the delayed death-like phenotype by remnant activity of proteins synthesized prior to the genomic knock-out. We address this possibility and describe the two scenarios mentioned by the Reviewer in lines 141-144. Disentangling the two possibilities in future experiments will be difficult, not only with regards to FKBP35, but regarding “delayed death” phenotypes in general.
  
  In the second part, the authors investigate the activity of FK506 on P. falciparum, and conclude that FK506 exerts its antimalarial effects independently of FKBP35. This conclusion is based on the observation that FK506 has the same activity on FKBP35 wild type and knock-out parasites, suggesting that FK506 activity is independent of FKBP35 levels, and on the fact that FK506 kills the parasite rapidly whereas inducible gene knockout results in delayed death phenotype. However, there are alternative explanations for these observations. As mentioned above, the delayed death phenotype could be due to delayed depletion of the protein upon induction of gene knockout. FK506 could have a similar activity on WT and mutant parasites when added before sufficient depletion of FKBP35 protein. In some experiments, the authors exposed KO parasites to FK506 later, presumably when the KO is effective, and obtained similar results. However, in these conditions, the death induced by the knockout could be a confounding factor when measuring the effects of the drug. Furthermore, the authors show that FK506 binds to FKBP35, and propose that the FK506-FKBP35 complex interferes with ribosome maturation, which would point towards a role of FKBP35 in FK506 action. In summary, the study does not provide sufficient evidence to rule out that FK506 exerts its effects via FKBP35.
  
  Notably, we were also very much surprised by data indicating that the antimalarial activity of FK506 is independent of FKBP35. It is for this reason that we conducted a comprehensive set of experiments to disprove our initial observations, but couldn't find any evidence for an FKBP35-dependent mode of action of FK506:
  
  We were not able to see altered FK506 sensitivity in (i) inducible knock-down parasites, (ii) inducible overexpression parasites and (iii) inducible knock-out parasites. Parasites with altered FKBP35 levels (as assessed by Western blot and quantitative proteomics at 36-42 hpi, respectively) were equally sensitive to FK506. Importantly, at no sub-lethal FK506 concentration did lower FKBP35 levels lead to an altered response of FKBP35KO compared to the wild-type control population. Furthermore, (iv) induction of the knock-out in the cycle preceding FK506 exposure also had no effect on parasite sensitivity. As mentioned by the Reviewer, we also exposed the parasites to FK506 at 30-36 hpi and (v) did not see any effect, even though we measured a 19-fold difference in FKBP35 protein levels between the parasite populations at 36-42 hpi. At this point, parasite death induced by the knock-out cannot be a confounding factor (as it was mentioned by the Reviewer), because the FKBP35 knock-out has no effect on parasite survival in generation 1 in the absence of FK506 (Figure 1F). This demonstrates that the observed effect is only due to drug-mediated killing and not due to the FKBP35 knock-out.
  
  To account for a scenario in which the drop in FKBP35 levels only occurs after 36 hpi, we will perform an additional set of experiments, in which we induce the knock-out at 0-6 hpi and treat the parasites at 36-42 hpi (i.e. the time point at which the 19-fold difference in protein levels was measured by quantitative proteomics). This setup will allow determining whether or not the parasite killing activity of FK506 depends on FKBP35 levels.
  
  So far, our experiments cannot support any scenario in which FK506 kills P. falciparum parasites via inhibiting the essential role of FKBP35 and we would therefore want to insist that this statement is based on highly solid evidence. In this context, it is important to note that our conclusion includes two scenarios: “This indicates that either the binding of FK506 does not interfere with the essential role of PfFKBP35, or that PfFKBP35 is inhibited only at high FK506 concentrations that also inhibit other essential factors.” While this phrase is already present in our initial submission, we will emphasize this point more clearly in the revised manuscript. We are convinced that this information is of high importance for ongoing and future drug development.
  
  Reviewer #2 (Public Review):
  
  The manuscript by Thommen et al., "FKBP secures ribosome homeostasis in Plasmodium falciparum", focuses on the importance of the PfFKBP35 protein, its interaction with the FK506 compound, and the role of PfFKBP35 in ribosome biogenesis. The authors showed the interaction of PfFKBP35 with FK506, but the part played by FK506 and PfFKBP35 in ribosome biogenesis is unclear from the data.
  
  The introduction is plotted with two parallel stories about PfFKBP35 and FK506, with ribosome biogenesis as the central question at the end. In its current form, the manuscript suffers from two stories that are not entirely interconnected, unfinished, and somewhat confusing. Both stories need additional experiments to make the manuscript(s) more complete. The results from PfFKBP35 need more evidence for the proposed ribosome biogenesis pathway control. On the other hand, the results from the drug FK506 point to different targets with lower EC50, and other follow-up experiments are needed to substantiate the authors' claims.
  
  The strengths of the manuscript are the figures and experimental design. The combination of omics methods is informative and gives an opportunity for follow-up experiments.
  
  We thank the Reviewer for the evaluation of the manuscript. We apologize for the fact that the Reviewer found the manuscript to be inaccessible. We will use the comments as an incentive to restructure the manuscript and do our best to clarify the presentation, interpretation and conclusion of the presented data in the revised version. We believe that the FKBP35 data are strongly interlinked with the findings on FK506. We will emphasize these links more clearly and are convinced that the complementary nature of the datasets is a particular strength of the presented work.
  
  Reviewer #3 (Public Review):
  
  The study by Thommen et al. sought to identify the native role of the Plasmodium falciparum FKBP35 protein, which has been identified as a potential drug target due to the antiplasmodial activity of the immunosuppressant FK506. This compound has multiple binding proteins in many organisms; however, only one FKBP exists in P. falciparum (FKBP35). Using genetically-modified parasites and mass spectrometry-based cellular thermal shift assays (CETSA), the authors suggest that this protein is involved in ribosome homeostasis and that the antiplasmodial activity of FK506 is separate from its activity on the FKBP35 protein. The authors first created a conditional knockdown using the destruction domain/shield system, which demonstrated no change in asexual blood stage parasites. A conditional knockout was then generated using the DiCre system. FKBP35KO parasites survived the first generation but died in the second generation. The authors called this "a delayed death phenotype", although it was not secondary to drug treatment, so this may be a misnomer. This slow death was unrelated to apicoplast dysfunction, as demonstrated by lack of alterations in sensitivity to apicoplast inhibitors. Quantitative proteomics on the FKBP35KO vs FKBP35WT parasites demonstrated enrichment of proteins involved in pre-ribosome development and the nucleolus. Interestingly, the KO parasites were not more susceptible to cycloheximide, a translation inhibitor, in the first generation (G1), suggesting that mature ribosomes still exist at this point. The SunSET technique, which incorporates puromycin into nascent peptide chains, also showed that in G1 the FKBP35KO parasites were still able to synthesize proteins. But in the second generation (G2), there was a significant decrease in protein synthesis. Transcriptomics were also performed at multiple time points. The effects of knockout of FKBP35 were transcriptionally silent in G1, and the parasites then slowed their cell cycles as compared to the FKBP35WT parasites.
  
  The authors next sought to evaluate whether killing by FK506 was dependent upon the inhibition of PfFKBP35. Interestingly, both FKBP35KO and FKBP35WT parasites were equally susceptible to FK506. This suggested that the antiplasmodial activity of FK506 was related to activity targeting essential functions in the parasite separate from binding to FKBP35. To identify these potential targets, the authors used MS-CETSA on lysates to test for thermal stabilization of proteins after exposure to drug, which suggests drug-protein interactions. As expected, FK506 bound FKBP35 at low nM concentrations. However, given that the parasite IC50 of this compound is in the uM range, the authors searched for proteins stabilized at these concentrations as putative secondary targets. Using live cell MS-CETSA, FK506 bound FKBP35 at low nM concentrations; however, in these experiments over 50 ribosomal proteins were stabilized by the drug at higher concentrations. Of note, there was also an increase in soluble ribosomal factors in the absence of denaturing conditions. The authors suggested that the drug itself led to these smaller factors disengaging from a larger ribosomal complex, leading to an increase in soluble factors. Ultimately, the authors conclude that the native function of FKBP35 is involved in ribosome homeostasis and that the antiplasmodial activity of FK506 is not related to the binding of FKBP35, but instead results from inhibition of essential functions of secondary targets.
  
  Strengths:
  
  This study has many strengths. It addresses an important gap in parasite biology and drug development, by addressing the native role of the potential antiplasmodial drug target FKBP35 and whether the compound FK506 works through inhibition of that putative target. The knockout data provide compelling evidence that the FKBP35 protein is essential for asexual parasite growth after one growth cycle. Analysis of the FKBP35KO line also provides evidence that the effects of FK506 are likely not solely due to inhibition of that protein, but instead must have secondary targets whose function is essential. These data are important in the field of drug development as they may guide development away from structure-based FK506 analogs that bind more specifically to the FKBP35 protein.
  
  Weaknesses:
  
  There are also a few notable weaknesses in the evidence that call into question the conclusion in the article title that FKBP35 is definitely involved in ribosomal homeostasis. While the proteomics supports alterations in ribosome biogenesis factors, it is unclear whether this is a direct role of the loss of the FKBP35 protein or is more related to non-specific downstream effects of knocking down the protein. The CETSA data clearly demonstrate that FK506 binds PfFKBP35 at low nM concentrations, which is different than the IC50 noted in the parasite; however, the evidence that the proteins stabilized by uM concentrations of drug are actual targets is not completely convincing, especially given the high uM amounts of drug required to stabilize these proteins. This section of the manuscript would benefit from validation of at least one or two of the putative candidates noted in the text. In the live cell CETSA, it is noted that >50 ribosomal components are stabilized in drug treated but not lysate controls. Similarly, the authors suggest that the soluble fraction of ribosomal components increases in drug-exposed parasites even at 37°C and that this is likely from smaller ribosomal proteins disengaging from larger ribosomal complexes. While the evidence is convincing that this protein may play a role in ribosome homeostasis in some capacity, it is not sure that the title of the paper "FKBP secures ribosome homeostasis" holds true given the lack of mechanistic data. A minor weakness, but one that should nonetheless be addressed, is the use of the term "delayed death phenotype" with regards to the knockout parasite killing. This term is most frequently used in a very specific setting of apicoplast drugs that inhibit apicoplast ribosomes, so the term is misleading. It is also possible that the parasites are able to go through a normal cycle because of the kinetics of the knockout and the time needed for protein clearance in the parasite to a level that is lethal.
  
  Overall, the authors set out to identify the native role of FKBP35 in the P. falciparum parasites and to identify whether this is, in fact, the target of FK506. The data clearly demonstrate that FKBP35 is essential for parasite growth and provide evidence that alterations in its levels have proteomic but not transcriptional changes. However, the conclusion that FKBP35 actually stabilizes ribosomal complexes remains intermediate. The data are also very compelling that FK506 has secondary targets in the parasite aside from FKBP35; however, the high uM concentrations of the drug needed to attain results and the lack of biological validation of the CETSA hits make it difficult to know whether any of these are actually the target of the compound or instead are nonspecific downstream consequences of treatment.
  
  We appreciate the detailed and valuable suggestions to improve the manuscript. We agree that CETSA could only identify potential targets of FK506 in the micromolar range, while FK506 showed a high affinity for FKBP35, consistent with earlier reports (2). We would however like to point out that FK506 kills P. falciparum at exactly these relatively high concentrations and not at those presumed from the high affinity interactions between FK506 and FKBP35. The relatively high FK506 concentration required to stabilize potential off target proteins is therefore not a concerning observation, but rather corroborates our conclusion that FK506 fails to inhibit the essential function of FKBP35 at concentrations that leave off targets unaffected. As mentioned in response to Reviewer 1, we will describe and discuss these data more clearly in the revised manuscript. We thank the Reviewer for pointing out the potential issues regarding the use of the term “delayed death phenotype”. We now refer to the FKBP35 phenotype as “delayed death-like” in the revised manuscript.
  
  We believe that follow-up work on specific FK506 CETSA hits is out of scope of the current and already quite complex manuscript.
  
  As mentioned in the response to Reviewer 1, we realize that the short title of the manuscript can be regarded as an overstatement. Again, this was clearly not our intention and we apologize that the Reviewers had to indicate this issue. While we believe that the message of the title holds true (see response to Reviewer 1), we recognize the misconception that might arise from it, which is why we propose the new title: “Genetic validation of PfFKBP35 as an antimalarial drug target”.
  
  Reviewer #1 (Recommendations For The Authors):
  
  1) Documentation of FKBP35 protein levels over time in knockout, knockdown and overexpressing parasites is missing here. Since the knockdown of PfFKBP35 has no phenotypic consequence, very low amounts of FKBP35 are probably sufficient for parasite survival and growth. In the absence of quantification of the protein during the course of the experiments, it remains unclear whether the delayed death phenotype in the knockout is due to the delayed depletion of the protein or to a delayed consequence of early protein depletion. This limitation also impacts the interpretation of the drug assays. In particular, the delayed death phenotype could simply reflect delayed protein depletion, contrasting with the immediate inhibition of FKBP35 by FK506. The quantification by mass spectrometry does indicate substantial depletion but provides no information on the kinetics and levels. What is 19 fold compared to the knockdown condition? Also, expression of FKBP35 in overexpressing parasites should be compared side by side with the iKD (in the presence of Shield).
  
  We agree with the Reviewer that low FKBP35 levels are likely sufficient for parasite survival. This is addressed in the manuscript (lines 141-143). Assessing protein levels in the transgenic parasites side by side in time course experiments would be interesting. However, our conclusions are independent of the outcome of such experiments because the relative difference in FKBP35 levels resulting from conditional expression systems did not change the parasites' susceptibility to FK506. We believe that comparing isogenic populations is much more informative than comparing independent cell lines with each other.
  
  2) The authors claim that FK506 fails at inhibiting the essential function of PfFKBP35 (line 103), however this is not directly supported by data. FK506 kills the parasite and so inhibits essential functions. The data indicate that FK506 antimalarial activity does not seem to be influenced by FKBP35 levels, which would support the authors' claim. However, as mentioned above, it is important to better define experimentally FKBP35 expression levels. Also, in experiments where FK506 is added late after rapamycin treatment, the authors need to clarify how they could distinguish drug killing and death due to the knockout.
  
  In the experiment described by the Reviewer, the FKBP35 knock-out was induced in young ring stages (0-6 hpi) and FK506 was added at 30-36 hpi; we then measured parasite survival from G1 to G2 (see figure 5A). In the absence of FK506, the FKBP35 knock-out has no effect on parasite survival (Figure 1), demonstrating that the observed effect is only due to drug killing and not due to the KO.
  
  To address the concerns regarding delayed depletion of FKBP35, we have performed an additional set of experiments. These data corroborate that the effect of FK506 is independent of FKBP35 levels. We discuss this topic in more detail in the Public Review. In brief, the additional experiment included exposing knock-out parasites (KO induced 0-6 hpi) to FK506 at 36-42 hpi, i.e. at a time point when FKBP35 protein levels are reduced by more than 90% (19-fold difference compared to the control parasites based on quadruplicate quantitative mass spectrometry data). However, despite the clear difference, the IC50 of FK506 remained the same as determined before (see new figure 4F).
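
  The relation between the two figures quoted above (a 19-fold difference and a reduction of more than 90%) can be checked with a short calculation. The sketch below assumes that "19-fold" means the knock-out level equals the control level divided by 19; the function name is a hypothetical helper introduced only for this example.

```python
def fold_to_percent_reduction(fold_difference):
    """Percent reduction implied by an N-fold lower level (ko = control / N)."""
    if fold_difference <= 0:
        raise ValueError("fold difference must be positive")
    return (1.0 - 1.0 / fold_difference) * 100.0


# A 19-fold lower protein level corresponds to a reduction of roughly 94.7%.
reduction = fold_to_percent_reduction(19)
```

  Under that assumption, a 10-fold difference would correspond to a 90% reduction, so the 19-fold figure is consistent with the "more than 90%" statement.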
  
  3) Since FK506 is known to inhibit FKBP35 PPIase activity, it could be relevant to compare the effects of FK506 versus KO on ribosomes and translation. This could provide further evidence supporting a FKBP35-independent activity of FK506.
  
  We agree with the Reviewer that this would be very informative. However, it would be difficult to disentangle indirect downstream processes on translation caused by both the FK506 drug treatment and the FKBP35 knock-out in a cellular assay. Establishing a biochemical in vitro assay to study the role of PPIase activity in translation is out of scope of this manuscript.
  
  Minor points
  
  -The title is rather vague, which reflects the fact that the function of PfFKBP35 is not precisely defined in the study.
  
  We thank the Reviewer for this assessment, which is in agreement with Reviewer 3. Based on these concerns, and in order to prevent misinterpretation of our manuscript, we propose changing the title to “Genetic validation of PfFKBP35 as an antimalarial drug target” (see public response above).
  
  -The transcriptomics data (Fig 3) provide little information on the function of FKBP35 and could be included as supplemental material. On the contrary, data in FigS5 convey important information and should be moved to the main figures.
  
  We believe that the transcriptomics data are important to characterize the effect of limiting FKBP35 levels in G2, as they show that, unlike certain homologs of other organisms (3), FKBP35 has no role in transcriptional control and its knock-out does not have any downstream consequences on the transcriptional level (except for the death-related stall in cell cycle progression). We would therefore like to keep this dataset represented in the main figures. The updated Figure 4F now includes more information about the effect of adding FK506 at different time points, which was only addressed in Figure S5 in the previous version of the manuscript. We believe that the key message of Figure S5 is now covered in Figure 4.
  
  -Line 30: "action" rather than "role"
  
  We corrected this.
  
  Reviewer #2 (Recommendations For The Authors):
  
  I have no comments on data, code, or other issues.
  
  General comments:
  
  The introduction is plotted with two parallel stories about PfFKBP35 and FK506, with ribosome biogenesis as the central question at the end. In its current form, the manuscript suffers from two stories that are not entirely interconnected, unfinished, and somewhat confusing. I recommend focusing only on one story - either characterizing PfFKBP35 and its role in Plasmodium falciparum biology - future investigation of PfFKBP35 control of cellular processes or focusing on the actual targets of the FK506 drug (identified in figure 4). Both stories need additional experiments to make the manuscript(s) more complete and ready for publication. The results from PfFKBP35 need more evidence for the proposed ribosome biogenesis pathway control. On the other hand, the results from the drug FK506 point to different targets with lower EC50, and other follow-up experiments are needed to substantiate the authors' claims.
  
  The strengths of the manuscript are the figures and experimental design. The combination of omics methods is informative and gives an opportunity for follow-up experiments.
  
  Detailed points and suggestions for authors:
  
  Line 99
  
  There is no such thing as "protein translation"; it is mRNA translation or protein synthesis, which needs to be updated throughout the manuscript.
  
  We thank Reviewer 2 for pointing out this error, which we have now corrected.
  
  Line 174
  
  The statement needs a reference(s).
  
  We added an appropriate review reference.
  
  Lines 229-235
  
  While transcriptomics and proteomics data can argue that FKBP35 maybe acts at the post-transcriptional level, its function, as well as presented data, could point to post-translational mechanisms as well, cell cycle checkpoint misregulation, and multiple other pathways that control cell size, cell proliferation, translation, and ribosome biogenesis. More solid and direct evidence on ribosome biogenesis (rRNA processing, polysome profiles, or similar experiments) would be needed to show the function of FKBP35 in this cellular process.
  
  We have changed the term “post-transcriptional processes” to “transcription-independent processes”. As detailed in the Public Review, we agree with the Reviewer and have toned down our statements regarding the function of FKBP35 throughout the manuscript.
  
  Lines 237-313
  
  The authors showed again the interaction of PfFKBP35 with the FK506 drug, but the phenotype differs from the protein deletion. Moreover, EC50s for multiple other proteins (i.e., PF3D7_1138700 or PF3D7_1325900, among others) are lower than for PfFKBP35 but are never further tested. This would be necessary to characterize FK506 drug targets, and it would be a different study.
  
  We believe that characterizing putative targets of FK506 is out of the scope of this already complex study and should be addressed, as suggested by the Reviewer, in future, independent efforts.
  
  Lines 293 - 301
  
  The point of lower EC50 for PfFKBP35 and FK506 in the in vitro cell lysate experiment compared to in vivo IC50 data is not surprising, given that drug delivery is not an issue in a lysate experiment. It is unclear why the authors pick some proteins and not others for further characterization of FK506 binding. There is no explanation for this selection. They did not follow up on the best targets of the FK506 drug from Fig 4 (above comment).
  
  As mentioned above, validation of FK506 targets is out of scope of this study.
  
  Lines 313 -352
  
  An alternative scenario for the FK506 drug data in CETSA experiments is that they bind directly to ribosomes interacting with rRNA, as many macrolides do. One should note that these are not ribosomal factors (line 334) but ribosomal proteins mentioned in Fig.4 F, mainly associated with large ribosomal subunit.
  
  We agree with Reviewer 2 that FK506 could bind indirectly to ribosomal proteins. This scenario is already described in the initial version of the manuscript (see lines 285-287: “Of note, these ribosomal proteins were stabilized at virtually identical FK506 concentrations (Figs. 4D, and S7), indicating that the drug – directly or indirectly – interacts with ribosomal complexes.”).
  
  We thank the Reviewer for pointing out that we are indeed talking about “ribosomal proteins” rather than “ribosomal factors”. We have now corrected this.
  
  Reviewer #3 (Recommendations For The Authors):
  
  Please see Public review for suggestions about experimental validation of the link to ribosome homeostasis.
  
  We would like to thank Reviewer 3 for the detailed suggestions.
  
  References
  
  1) Kennedy K, Cobbold SA, Hanssen E, Birnbaum J, Spillman NJ, McHugh E, et al. Delayed death in the malaria parasite Plasmodium falciparum is caused by disruption of prenylation-dependent intracellular trafficking. PLoS Biol. 2019;17(7):e3000376.
  
  2) Kotaka M, Ye H, Alag R, Hu G, Bozdech Z, Preiser PR, et al. Crystal structure of the FK506 binding domain of Plasmodium falciparum FKBP35 in complex with FK506. Biochemistry. 2008;47(22):5951-61.
  
  3) Kasahara K, Nakayama R, Shiwa Y, Kanesaki Y, Ishige T, Yoshikawa H, et al. Fpr1, a primary target of rapamycin, functions as a transcription factor for ribosomal protein genes cooperatively with Hmo1 in Saccharomyces cerevisiae. PLoS Genet. 2020;16(6):e1008865.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.09.519720v4
www.biorxiv.org www.biorxiv.org

New submission 25/07/2023, 09:24:01

1
1. Public_Reviews 25 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their comments, and their evident close reading of the manuscript. Generally, we agree with the reviewers on the strengths and weaknesses of our manuscript. We plan to submit a revised version which has a more extensive discussion of alternative explanations for initial high ribosome density as seen by ribosome profiling, and which more specifically points out the limitations of our work.
  
  As a preface to specific responses to the reviewers, we will say that we could divide observations of slow initial translation into two categories, which we will call “encoded slow codons”, and “increased ribosome density”. With respect to the first category, Tuller et al. documented initial “encoded slow codons”, that is, there is a statistical excess of rare, slowly-translated codons at the 5’ ends of genes. Although the size of this effect is small, statistical significance is extremely high, and the existence of this enrichment is not in any doubt. At first sight, this appears to be a strong indication of a preference for slow initial translation. In our opinion, our main contribution is to show that there is an alternative explanation for this initial enrichment of rare, slow codons—that they are a spandrel, a consequence of sequence plasticity at the 5’ (and 3’) ends of genes. The reviewers seem to generally agree with this, and we are not aware that any other work has provided an explanation for the 5’ enrichment of rare codons.
  
  The second category of observations pertaining to slow initial translation is “increased ribosome density”. Early ribosome profiling studies used cycloheximide, and these showed a much greater density of ribosomes near the 5’ end of genes than elsewhere. This high initial ribosome density helped motivate the paper of Tuller et al., though their finding of “encoded slow codons” could explain only a very small part of the increased ribosome density. More modern ribosome profiling studies do not use cycloheximide as the first step in arresting translation, and in these studies, the density of ribosomes near the 5’ end of genes is greatly reduced. And yet, there remains, even in the absence of cycloheximide at the first step, a significantly increased density of ribosomes near the 5’ end (e.g., Weinberg et al., 2016). (However, at least some of these studies do use cycloheximide at later steps in the protocol, and the possibility of a cycloheximide artefact is difficult to exclude.) It appears to us that one of the reviewers’ main concerns is that we do not explain the increased 5’ ribosome density seen by ribosome profiling. We agree, but we feel it is not the main point of our manuscript. In revision, we will more extensively discuss other work on increased ribosome density, and more explicitly point out the limitations of our manuscript in this regard. We also note, though, that increased ribosome density is not a direct measure of translation speed; it can have other causes.
  
  Specific Responses.
  
  Reviewer 1 was concerned that we did not more fully discuss other work on possible reasons for slow initial translation. We will discuss such work more extensively in our revision. However, as far as we know, none of this work proposes a reason for the 5’ enrichment of rare, slow codons.
  
  Reviewer 1 was also concerned about confounding effects in our reporter gene analysis of the effects of different codons on efficiency of translation. We have two comments. First, it is important to remember that although we changed codons in our reporters, we did not change any amino acids. We changed codons only to synonymous codons. Thus at least one of the reviewer’s possible confounding effects—interactions of the nascent peptide chain with the exit channel of the ribosome—does not apply. However, of course, the mRNA nucleotide sequence is altered, and this would cause a change in mRNA structure or abundance, which could matter. We agree this is a limitation to our approach. However, to fully address it, we feel it would be necessary to examine a really large number of quite different sequences, which is beyond the scope of this work.
  
  Reviewer 2 was concerned that the conservation scores for the 5’ 40 amino acids, and the 3’ 40 amino acids were similar, but slow translation was only statistically significant for the 5’ 40 amino acids. As we say in the manuscript, we are also puzzled by this. We note that 3’ translation is statistically slow, if one looks over the last 100 amino acids. Our best effort at an explanation is a sort of reverse-Tuller explanation: that in the last 40 amino acids, the new slow codons created by genome plasticity are fairly quickly removed by purifying selection, but that in the first 40 amino acids, for genes that need to be expressed at low levels, purifying selection against slow codons is reduced, because poor translation is actually advantageous for these genes. To expand on this a bit, we feel that the 5000 or so proteins of the proteome have to be expressed in the correct stoichiometric ratios, and that poor translation can be a useful tool to help achieve this. In this explanation, slow translation at the 5’ end is bad for translation (in agreement with our reporter experiments), but good for the organism, whereas in Tuller, slow translation at the 5’ end is good for translation.
  
  Reviewer 2 wondered whether the N-terminal fusion peptide affects GFP fluorescence in our reporter. This specific reporter, with this N-terminus, has been characterized by Dean and Grayhack (2012), and by Gamble et al. (2016), and the idea that a super-folder GFP reporter is not greatly affected by N-terminal fusions is based on the work of Pedelacq (2006). None of these papers show whether this N-terminal fusion might have some effect, but together, they provide good reason to think that any effect would be small. We will add these citations to the revision.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.06.27.497802v3
www.biorxiv.org www.biorxiv.org

New submission 24/07/2023, 11:47:24

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  During meiosis, mitotic cohesin complexes are replaced by meiosis-specific cohesins to enable a stepwise loss of sister chromatid cohesion. The identity of the cohesin complex is defined by its kleisin subunit. In the early meiotic prophase, the mitotic kleisin Scc1 is replaced by a meiotic counterpart Rec8. C. elegans expresses two additional meiotic kleisins, COH-3 and COH-4; however, how meiotic cohesin complexes differ in their loading and function has been unclear. In this paper, Castellano-Pozo and colleagues unveil their differential dynamics and functions using elegant approaches that include auxin-mediated depletion and TEV-mediated removal of meiotic kleisins. The association of COH-3/4 with chromosomes is dynamic and is under the control of two cohesin regulators, WAPL-1 and SCC-2, while REC-8 remains more stably associated. The authors established that COH-3/4 is involved in maintaining the structural integrity of chromosome axes, whereas the REC-8 cohesin is solely responsible for sister chromatid cohesion throughout meiosis. They further demonstrated the role of REC-8 in the repair of meiotic DSBs.
  
  Overall, this solid work unequivocally establishes the distinct regulation and requirements for REC-8 and COH-3/4 cohesin complexes during C. elegans meiosis.
  
  We thank the reviewer for their overall support.
  
  However, as the authors acknowledged, the role of REC-8 cohesins in sister chromatid cohesion has been shown previously using genetic mutants (Crawley et al., 2016 eLife). While the authors highlighted the advantages of removing cohesin subunits in establishing their distinct requirements, many of the results were recapitulated from their previous work (e.g. rec-8; spo-11 and coh-3/4; spo-11). It might be helpful for the readers to compare the results between the two studies and point out uniquely illuminating results.
  
  Although we and others have previously suggested that REC-8 cohesin provides SCC in worms based on observations made in different meiotic mutants, a convincing demonstration of this possibility was lacking, and an alternative model in which COH-3/4 cohesin provides SCC had been proposed (Severson et al 2014). Using TEV-tagged versions of REC-8 and COH-3/4 we unequivocally establish that SCC is uniquely provided by REC-8 complexes in metaphase I oocytes. We have introduced modifications in the text and figures (including a model shown in Figure 5) to highlight the main results of our study.
  
  The role of REC-8 in DNA repair has also been shown in different contexts. Chromosome fragmentation and DNA bridges are observed in rec-8; syp-1 or rec-8; syp-2 (RNAi) animals (Colaiacovo et al., 2003 Dev Cell; Crawley et al., 2016 eLife), suggesting a role of REC-8 in inter-sister repair. Persistent RAD-51 foci are also observed on asynapsed chromosomes in rec-8 mutants, suggesting a role for REC-8 in DNA repair (Cahoon et al., 2019 Genetics). The authors must cite these papers and discuss the results in the context of prior work.
  
  We agree with the reviewer that the studies mentioned above are consistent with the possibility that REC-8 complexes contribute to inter-sister repair. We now include citations of the manuscripts mentioned by the reviewer. The experiments presented in Figures 4A-B are different from those in the studies mentioned by the reviewer in that by introducing exogenous DSBs by IR (including in a spo-11 mutant background (Figure 4B)) we can more directly address the contribution of REC-8 and COH-3/4 complexes in pachytene nuclei under a situation in which similar numbers of DSBs are introduced. These experiments show that low abundance REC-8 complexes play a much more prominent role in DSB repair than highly-abundant COH-3/4 complexes and suggest that this activity is coupled to REC-8’s role in SCC.
  
  Reviewer #3 (Public Review):
  
  The study, performed in the animal model C. elegans, aims at characterizing functional differences in the meiosis-specific kleisins, REC-8 and COH-3/4.
  
  The authors conclude that in worms the identity of the kleisin subunit of the cohesin complex determines whether cohesin promotes cohesion, or controls higher-order chromosome structure. COH-3/4 is highly abundant and dynamic and responds to SCC-2 and WAPL-1. In contrast, REC-8 complexes associate stably and in low abundance and are resistant to SCC-2 and WAPL-1 perturbations.
  
  Main points:
  
  This study is a continuation and partially a repeat of a study Castellano-Pozo & Martinez-Perez published in Nat. Comm. 2020, in which they depleted COH-3/4 and REC-8 by injecting TEV and cleaved artificially engineered TEV sites in these kleisins. The results were slightly different though, as the authors concluded: "Disassembly of axial elements requires simultaneous removal of REC-8 and COH-3/4."
  
  The current study uses a degron instead of TEV and SIM to revisit the same result. This time, degradation of COH-3/4 alone, but not of Rec8 alone completely eliminates axial elements. It seems that, if the conclusion is now correct, the previous headline must be incorrect, showing that more care has to be taken in the conclusions.
  
  The reviewer is referring to data shown in Figure 1C, saying that we used a degron system to degrade COH-3/4 and REC-8 from pachytene nuclei. This is incorrect: the images in this figure correspond to rec-8 and coh-3 coh-4 double mutants (as indicated in main text and figure legend) and therefore to germlines lacking REC-8 or COH-3/4 from the onset of meiosis. In contrast, in the Castellano-Pozo et al 2020 study REC-8 or COH-3/4 were removed from pachytene chromosomes using the TEV approach following normal chromosome morphogenesis at meiosis onset, to specifically address how kleisin removal in nuclei at the pachytene stage impacted meiotic progression. In addition to this, Figure 1C does not show that lack of COH-3/4 “completely eliminates axial elements”, as stated by the reviewer, but rather that “SMC-1::GFP signals appeared as discontinuous weak signals in pachytene nuclei” (see description of this result in lines 103-104 of first version). This finding is consistent with the Castellano-Pozo et al 2020 study where we reported that staining of HORMADs (used to visualise axial elements) became weaker and more discontinuous following removal of COH-3/4 than REC-8 from pachytene axial elements (this observation is also mentioned in lines 96-97 of the first version of our manuscript).
  
  One new experiment in this study is the degradation of scc-2::AID::GFP. The authors treat the germline with auxin for 14 hours. How long scc-2::AID actually needs for degradation and thus, how long cells actually remain without SCC-2, is unknown. What is definitely needed is a serious analysis of the speed of degradation of Scc2 in the various stages.
  
  It is currently not possible to estimate, as the authors do, how long cells have been without SCC-2. This estimation assumes an immediate depletion of SCC-2.
  
  If this were indeed the case, then depletion intervals should be much shorter, because the important primary phenotypes occur immediately after depletion, not 14 hours later.
  
  We now analyse REC-8::HA and COH-3/4 staining after auxin treatment for 8 and 14 hours, showing that 8 hours results in weaker effect on COH-3/4 depletion in pachytene nuclei and a smaller section of the germline lacking REC-8::HA staining in early prophase. We also include cartoons in Figure 2B to explain how nuclei progress through pachytene (35 hours in total).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.12.511771v1
www.medrxiv.org www.medrxiv.org

New submission 24/07/2023, 11:41:30

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  This manuscript tackles the important and vexing problem of mapping alleles for TB. It is a really important problem, and this paper presents the largest genetic data set. It does so by amalgamating data from multiple cohorts. The manuscript rightly points out that many studies have not produced reproducible results, and most alleles are population specific, and rarely seen in multiple studies.
  
  1) Authors find a strong HLA associated SNP. They do conduct HLA imputation, but there is little effective fine-mapping. Authors should report which classical alleles are consistent with this allelic association (e.g. which classical alleles are in phase with it). Authors comment on DQA1-0301, but it isn't clear in the main text how significant it is. I think the authors should dig a little deeper. Imputing amino acids and assessing association might be useful. Finding classical alleles that explain the SNP associations and are seen across populations might be useful. If the authors think that the SNP might be a regulatory allele, the authors should make a case for that based on genomic annotations, eQTL analyses etc.
  
  We thank the reviewer for pointing out the issues with the HLA section. We also received feedback from another reviewer about the HLA section. Based on this we have completely reworked the HLA section with more rigorous analysis to make the results easier to interpret and to detect potential underlying HLA alleles that could explain the significant SNP detected in the MR-MEGA meta-analysis. This includes comparing our summary statistics for the DQA1*02:01 allele with those available from studies that were not included in our genome-wide meta-analysis. The HLA section has been updated on pages 7-9, as shown below, and a figure has been added to the main manuscript (Figure 3B) and the supplementary data (Figure S2):
  
  Notwithstanding inconsistency across populations the strongest signal in the combined global analyses is at DQA1*02:01, revealing a protective effect (OR 0.88, 95% CI 0.82-0.93, p-value = 1.3e-5, Figure 3B). The signal remains apparent in the six populations with the lead SNP at MAF >2.5% and individual level data available (p-value = 0.0003). However, conditioning on the significant SNP (rs28383206) in this subset, we find the signal at DQA1*02:01 all but disappears (Figure S2) suggesting the classical allele is tagging the rs28383206 association (p-value = 0.44). This observation is consistent with previous observations of HLA analysis in Icelandic (DQA1*02:01: OR 0.82, p-value = 7.39e-4) and Han Chinese populations (DQA1*02:01: OR 0.82, p-value = 7.39e-4), but showed opposite direction of effect in another Chinese population (DQA1*02:01: OR 1.28, p-value = 0.0193, Figure 3B)19,21,23.
  
  The discussion was also updated (page: 14-15) to incorporate and discuss the updated results as shown below:
  
  Based on the significant association, rs28383206, in the HLA region identified in this multi-ancestry meta-analysis (Figure 3A), HLA specific imputation and association testing was done to fine map the region and identify potential HLA epitopes driving this association. HLA DQA1*02:01 had the strongest signal in the meta-analysis across the 8 included studies (Figure 3B), but this signal disappeared when conditioning on the significant SNP (rs28383206). HLA DQA1*02:01 has previously been identified in an Icelandic and two Chinese populations, but the direction of effect was not consistent19,21,23. Despite these inconsistencies the association between Mtb and HLA class II should be explored in more detail in future studies. A study investigating outcomes of Mtb exposure in individuals of African Ancestry identified protective effects of HLA class II alleles for individuals resistant to TB, highlighting the importance of HLA class II and susceptibility to TB62. HLA class II is a key determinant of the immune response in TB and Mtb has mechanisms to directly interfere with MHC class II antigen presentation63. This is supported by studies in mice, where mice in which the MHC class II genes were deleted died quickly when exposed to Mtb and died faster than mice in which MHC class I genes were deleted63.
  
  2) The authors comment on ancestry. Are ancestry components disease associated in any cohort? It might be interesting to demonstrate this.
  
  We thank the reviewers for this recommendation. While ancestry components have been shown to be disease associated in the admixed (RSA) populations in previous studies, we have considered the fact that effects of genetic ancestry can be severely confounded by socioeconomic factors. Factors such as housing, employment, poverty and access to healthcare have significant impact on TB incidence rates, especially in African populations. We cannot account for these socioeconomic differences in our analysis, but we have updated the manuscript (page: 15) to highlight this issue and the potential impact of socioeconomic factors on our results.
  
  This is supported by the fact that previous TB genetic association studies have identified significant effects of ancestry on TB susceptibility11,26. However, the effects of genetic ancestry can be confounded by other factors not accounted for in this analysis, such as differences in socioeconomic factors (including differences in housing, employment, poverty, and access to healthcare) between the included study populations59–61. For the ancestry-specific analysis, fewer studies result in there being less input heterogeneity to account for, but the reduced sample size was not sufficient to detect any ancestry-specific genome-wide associations. This is particularly evident for the African ancestry-specific meta-analysis where the large degree of heterogeneity, which could be a result of the high genetic diversity within Africa, in combination with differences in socioeconomic factors compared to other populations included in this study, resulted in no observable suggestive association peaks59,60.
  
  Reviewer #3 (Public Review):
  
  This paper was a significant and commendable effort, given all the challenges in TB genetics research. It was generally well written and analyses well done. Analytical methods were appropriate. The inclusion of polygenic heritability estimates is also nice to have within this large work. There is also a wealth of supplemental data provided, which will be useful to the field.
  
  However, there are a number of important weaknesses that need to be addressed. These are listed here, and recommended revisions are addressed in the recommendations section:
  
  1) As the authors point out, one of the challenges in this work is the varying phenotype definitions (diagnosis of TB cases, definition of controls) across all the included genetic studies. Table S1 is critical for this, however it is missing information, and some of the information is unclear. More importantly, the authors state multiple times that there is no evidence of heterogeneity due to these variable phenotype definitions, and that genetic ancestry contributes more to differences in effect sizes between GWAS than study design. However, these two things are confounded - different study designs / phenotype definitions were used in studies of different ancestry.
  
  We thank the reviewer for pointing this out and we have updated Table S1 to define the phenotype definitions and how cases and controls were identified. All datasets should now have clear definitions. As for the impact of different phenotype definitions on the heterogeneity we do agree that these are confounding factors and we do not claim that there is no evidence of phenotype definitions influencing heterogeneity, but rather we claim that the genetic ancestry of the included populations has a larger impact on heterogeneity than other factors investigated in this study. We updated the manuscript to clarify this in the discussion (page: 15) as shown below:
  
  The p-values of residual heterogeneity in genetic effects between the studies in the multi-ancestry meta-analysis show no significant inflation between the studies suggesting that differences in study characteristics (phenotype definition, infection pressure, Mtb strain) are not the main contributor to the lack of significant associations, but they certainly have an impact and are compounded with ancestry-correlated heterogeneity and other factors. However, the ancestry-correlated heterogeneity p-values are generally lower than the residual heterogeneity, suggesting that genetic ancestry has a stronger impact on the differences in effect sizes between the studies. This is supported by the fact that previous TB genetic association studies have identified significant effects of ancestry on TB susceptibility11,26. However, the effects of genetic ancestry can be confounded by other factors not accounted for in this analysis, such as differences in socioeconomic factors (including differences in housing, employment, poverty, and access to healthcare), phenotype definitions and differences in infection pressure between the included study populations 60–62.
  
  And we also updated the polygenic heritability in the results section (page: 4) as shown below:
  
  Furthermore, variations in phenotype definition can have an impact on heritability estimates (Table S1).
  
  2) The polygenic heritability analysis table is not explained very well.
  
  We thank the reviewer for pointing out this issue. The polygenic heritability table (Table S2) has been updated and some columns were removed (as they contained results from a discarded analysis). We have added footnotes to the table to define the variables and make the table more understandable and we have also updated the results section to clarify the analysis (page: 19) as shown below:
  
  The genetic relationship matrix was calculated for each autosomal chromosome (un-imputed data) which were pruned for SNPs in linkage disequilibrium (LD) using a 50 SNP window, sliding by 10 SNPs at a time and removing all variants with LD greater than 0.5.
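  The pruning step quoted above can be illustrated with a minimal sketch. It assumes that "LD" means the squared correlation (r²) between genotype dosages and that, within each window, the later SNP of a correlated pair is dropped; the function names `r_squared` and `ld_prune` are hypothetical and this is not the authors' pipeline, which may use different tie-breaking (for example, dropping the SNP with the lower minor allele frequency).

```python
from statistics import mean


def r_squared(x, y):
    """Squared Pearson correlation between two genotype dosage vectors."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # monomorphic SNP: treat as uncorrelated (assumption)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy * sxy / (sxx * syy)


def ld_prune(genotypes, window=50, step=10, threshold=0.5):
    """Return indices of SNPs kept after greedy windowed LD pruning.

    genotypes: list of per-SNP dosage lists (0/1/2), ordered by position.
    window/step/threshold default to the values quoted in the text.
    """
    n = len(genotypes)
    keep = [True] * n
    start = 0
    while start < n:
        end = min(start + window, n)
        for i in range(start, end):
            if not keep[i]:
                continue
            for j in range(i + 1, end):
                if keep[j] and r_squared(genotypes[i], genotypes[j]) > threshold:
                    keep[j] = False  # drop the later SNP of the pair
        start += step  # slide the window forward by `step` SNPs
    return [i for i in range(n) if keep[i]]


# Toy data: SNP 1 duplicates SNP 0, SNP 2 is uncorrelated with SNP 0.
toy = [
    [0, 1, 2, 1, 0, 2],
    [0, 1, 2, 1, 0, 2],
    [1, 0, 1, 2, 1, 1],
]
kept = ld_prune(toy)
```

  In this toy input the duplicated SNP is removed and the other two are retained.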
  
  And page 19:
  
  Heritability estimations were transformed onto the liability scale using the GCTA software to account for the difference in the proportion of cases in the data compared to the population prevalence74.
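  For readers unfamiliar with the liability-scale transformation, the sketch below shows the commonly used correction for case-control ascertainment (Lee et al., 2011), which to our understanding is the form applied by GCTA when a population prevalence is supplied. The function name and the example values are illustrative assumptions, not figures from the manuscript.

```python
from statistics import NormalDist


def liability_scale_h2(h2_observed, prevalence, case_fraction):
    """Convert observed-scale heritability to the liability scale.

    h2_observed:   heritability estimated on the observed 0/1 scale
    prevalence:    population prevalence of the disease (K)
    case_fraction: proportion of cases in the study sample (P)
    """
    nd = NormalDist()
    # Liability threshold above which an individual is affected.
    threshold = nd.inv_cdf(1.0 - prevalence)
    # Height of the standard normal density at that threshold.
    z = nd.pdf(threshold)
    k, p = prevalence, case_fraction
    return h2_observed * (k * (1 - k)) ** 2 / (z ** 2 * p * (1 - p))


# Hypothetical example: 40% cases in the sample, 5% population prevalence.
example = liability_scale_h2(0.20, prevalence=0.05, case_fraction=0.40)
```

  The correction matters most when the case fraction in the sample is far from the population prevalence, which is typical of case-control GWAS.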
  
  3) The supplemental data file is not very helpful without some sort of guide. It isn't clear whether the wealth of candidate genes that have been studied in TB were examined in these data. That would be a great benefit of this work.
  
  We thank the reviewer for pointing this out and we agree that the supplemental data excel sheet was difficult to understand. We have included a readme file (also on sheet 1 of the excel sheet) to explain which information is in the sheets of the excel document. This includes a list of candidate SNPs and genes that we investigated along with the meta-analysis results of these candidate SNPs and genes. We also updated the “Prior associations” section of the manuscript in which we cover the results of candidate SNPs and genes (page: 13-14).
  
  4) There needs to be clarity on how unpublished works were sought. In non-genetic meta-analyses, there is usually some detail about a process of contacting authors, etc. There needs to be some assurance that every attempt was made to collect all the relevant data. It is also not clear why family-based analyses could not be included considering that summary statistics were the basis of analysis.
  
  I updated the manuscript to address this (page: 18):
  
  This analysis includes 12 of the 17 published (and un-published, Table 1 and S1) GWAS studies of TB (with HIV negative cohorts) prior to 2022 10–17,26. For unpublished works we contacted researchers that were funded for genetic TB research and acquired data sharing agreements to obtain summary statistics (or raw data) along with any meta-data that was available. It excludes data from Iceland and Vietnam 18,31, as they declined to share data. It excludes data from China, Korea, Peru and Japan6,20,21,23,31, as data sharing agreements could not be finalized in time for this analysis. The Indonesian and Moroccan data were too sparsely genotyped and not suitable for reliable imputation and the Moroccan data was also family-based and thus also not suitable for this meta-analysis, as this would introduce confounding effects from the inclusion of related individuals24,25.
  
  5) It is rather surprising that only one locus meets genome-wide significance. The authors do explain this well in terms of the ancestry-specific effects driving these results, but it is also surprising that no candidate genes (that had not been discovered in GWAS studies, but were rather studied separately) rose to some higher significance threshold.
  
  We agree that it is surprising that we did not detect more significant associations and failed to replicate any candidate SNPs or genes at a genome wide significance level. We aim to have future iterations of this analysis with more data to increase power to detect more variants of interest, but this is beyond the scope of this manuscript.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2022.08.26.22279009v1
www.biorxiv.org www.biorxiv.org

New submission 24/07/2023, 11:37:40

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  “Liu et al present a very interesting manuscript investigating whether there are distinct mechanisms of learning in children with ASD. What they found was that children with ASD showed comparable learning to typically developing children, but that there was a difference in learning strategy, with less plasticity and more stable learning representations in children with ASD. In other words, children with ASD showed similar learning performance to typically developing children but were more likely to use different learning rules to get there. Interestingly greater fMRI-measured brain plasticity was associated with learning gains in typically developing children, whereas more stable (less plasticity) neural patterns were associated with learning gains in autistic children. This was mediated by insistence on sameness (from the RRIB) in the ASD group. This is a good paper, well reasoned and with strong methods.”
  
  We appreciate the positive comments from the reviewer.
  
  1.1) “The biggest issue is related to subject numbers...With n=35 it is only possible to make a generalized statement about autism.”
  
  Thank you for this comment. Although the sample size in the current study was modest, we would like to note that acquiring high-quality behavioral and brain imaging data at multiple time points is a challenge in children with ASD. The current training study with unique longitudinal behavioral and brain imaging data provides an unprecedented opportunity to investigate the potentially atypical training-induced learning and brain plasticity in children with ASD relative to TD peers. To our knowledge, the present longitudinal sample is the largest of its kind in studies of neurocognitive function in children with ASD. We have acknowledged these points in the revised Discussion section (Page 15), including the following statement:
  
  “First, larger sample sizes are required to further characterize heterogeneous patterns of atypical learning and whether the findings can be generalized to a broader ASD population.” (Page 15)
  
  1.2) “[Another] issue is related to [heterogeneity of autism-related findings]. For example, take the following statement from the results: "while most TD children used the memory-based strategy most frequently following training, nearly half of the children with ASD used rule-based strategies most frequently for trained problems." Is this the heterogeneity of autism at play, or the noisiness of the task and measures?”
  
  We hypothesize that group differences in changes in strategy use following training are due to atypical learning style or high level of inter-individual differences, i.e., greater heterogeneity, in autism, rather than noisiness of the measures. This hypothesis is based on the fact that we used the same tasks before and after training and a standardized training protocol across the two groups, which (i) allowed us to systemically examine atypical learning of these tasks in children with ASD compared to TD children and (ii) provided ecologically valid measures. This design minimized potential differences in measurement error between the two groups. We have clarified these points in the revised Introduction section (Page 4), including the following statement: “Crucially, we employed identical tasks before and after training and a standardized training protocol across the two groups. This approach enabled systemic analysis of learning in children with ASD relative to TD children.” (Page 4)
  
  1.3) “Conceptually, is it realistic to expect a unitary learning strategy in all of autism?”
  
  We agree with the sentiment expressed by the reviewer, and indeed this notion led to the hypothesis that our study set out to test. We hypothesized that children with ASD would not show a unitary learning strategy at the stage of development examined. Our results reveal that a disproportionate number of children with ASD use a rule-based strategy, reflecting atypical learning styles.
  
  1.4) “Lastly, the task itself can only be solved in a subset of autistic children and therefore presents a limited view of the condition.”
  
  We thank the reviewer for this important point and agree that additional studies tailored to more severely affected children with ASD are required for a more comprehensive characterization of learning in children with autism.
  
  Reviewer #2 (Public Review):
  
  “Overall, the authors sought to determine whether children with autism spectrum disorder (ASD) or typical development (TD) would both benefit from a 5-day intervention designed to improve numerical problem-solving. They were particularly interested in how learning across training would be associated with pre-post intervention changes in brain activity, measured with functional magnetic resonance imaging (fMRI). They also examined whether brain-behavior associations driven by learning might be moderated by a classic cognitive inflexibility symptom in ASD ("insistence on sameness"). The study is reasonably well-powered, uses a 5-day evidence-based intervention, and uses a multivariate correlation-based metric for examining neuroplastic changes that may be less susceptible to random variation over time than conventional mass univariate fMRI analyses. The study did have some weaknesses that call into question the specific claims made based on the present set of analyses, as well as limit the generalizability of the findings to the significant proportion of individuals with ASD that are outside of the normative range of general cognitive functioning. The study also found minimal evidence for transfer between trained and untrained mathematical problems, limiting enthusiasm for the intervention itself. The majority of the authors' claims were rooted in the data and the team was generally able to accomplish their aims. I am sensitive to the fact that one of the main limitations I noted would have significant ethical implications, i.e., NOT offering potentially beneficial numerical training to children randomized to a sham or control group. I think the authors' work will represent a welcome addition to a growing corpus of studies showing similar neuropsychological test performance across several cognitive domains (e.g. learning, memory, proactive cognitive control, etc.) in ASD and TD. However, these relatively preserved cognitive functions still appear to be implemented by unique neural systems and demonstrate unique correlations to clinical symptoms in youth with ASD relative to TD, which may have implications for both educational and clinical contexts.”
  
  We thank the reviewer for the positive feedback and helpful suggestions.
  
  Reviewer #3 (Public Review):
  
  “Liu and colleagues examined learning and brain plasticity in neurotypical children and children with autism. The main findings include autistic children relying more on rule-based versus memory-based learning strategies, altered associations between learning gains and brain plasticity in children with autism, and insistence on sameness as a moderator between brain plasticity and learning in autism. Although the sample size is limited in this study, the findings provide a significant contribution to the field. The major strengths of this paper include an extensive pre and post training protocol, a detailed methods section, rationale behind the study, investigation of a potential moderator of learning gains and neural plasticity, and investigation of "neural plasticity" in association to learning in autism. Weaknesses of the study include a small sample size, and some missing information/analyses from the study. The authors laid out four clear aims of the study. They investigated these aims and the analytic approaches were appropriate. The paper included significant findings toward better understanding the mechanisms underlying differences in learning strategies and behavior in children diagnosed with autism spectrum disorder. This holds significant value in educational and classroom settings. Further, the investigation of a potential moderator of learning gains and neural plasticity provides a potential mechanism to improve the relationship. Overall, this is a significant contribution to the field. The autism literature is limited in understanding differences in learning styles and the underlying neural mechanisms of these differences.”
  
  We thank the reviewer for the positive comments and detailed suggestions.
  
URL

biorxiv.org/content/10.1101/2023.01.25.525594v1
www.medrxiv.org

New submission 24/07/2023, 11:33:41

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  As part of a special issue on COVID-19 and cancer, Fuzzell and colleagues report findings from their mixed method study on the impact of the pandemic on cervical cancer screening and colposcopies, consisting of a national (United States) survey (March-August 2021) of 1251 clinicians (675 perform colposcopy) and qualitative interviews (June-December 2021) with 55 of these clinicians. The study looked specifically at perceived pandemic-related practice changes and disruptions over one year into the pandemic after the lockdowns had been lifted.
  
  The overall focus is on three pandemic-related questions (impact on cervical cancer screening practice, colposcopy practice, ability to provide LEEP) that were asked as part of a larger survey related to cervical cancer screening and management of abnormal results, details of which are however not fully described in terms of the survey's general aim and items, but seem to have been designed within the context of adherence to guidelines (following Cabana's Guideline Based Practice Improvement Framework).
  
  The authors thank the reviewer for their thoughtful feedback. The survey topics assessed are now described more fully in the Method, and measures are available upon request. The survey covered several areas related to cervical cancer screening practices and management of abnormal screening results, including presentation of vignettes focused on screening intervals, management or treatment, and screening exit or continuation in relation to 2019 ASCCP risk-based management guidelines adoption, as well as a sub-set of items for clinicians who perform colposcopy. There were also items related to HPV self-sampling, as well as the impact of the COVID-19 pandemic on screening and follow-up (which is the focus of the present manuscript).
  
  Reviewer #2 (Public Review):
  
  Lindsay Fuzzell and her team of researchers have performed an extremely well-executed survey study, which captures a wide spectrum of providers who perform cervical cancer screening in the US. The researchers have captured a vast amount of demographic data in this study in attempting to determine whether cervical cancer screening continued to be reduced in the year immediately after the lockdown period caused by the COVID-19 pandemic.
  
  The authors have uncovered some important and revealing concerns regarding the current state of cancer screening during the public health crisis caused by the COVID-19 pandemic. The most notable implication from their survey was a statistically higher reported reduction in cervical cancer screening in Internal medicine and family medicine providers as well as for community health and safety net clinics. These findings are important as they represent a large portion of primary care and a vulnerable patient population that has been shown to have worse cancer-related outcomes.
  
  This study is more sobering information about the magnitude of ramifications of the COVID-19 pandemic on the US public health system. Decreases in cancer screening may have lasting implications for cancer-related mortality for many years to come. The implications of not going back to pre-pandemic cancer screening rates are daunting, to say the least.
  
  The scope of this survey, the amount of data attained, and the sound methodology of the data acquisition and statistical analysis are the strengths of this study. Weaknesses are inherent to the study relying on survey answers rather than data from cervical cancer screening registries. Reporting biases are complex in surveys and answers given may not reflect the true rates of screening. The authors have also reported a disproportionate and statistically significant reduction in cervical cancer screening for Black and Asian providers. I would conclude more cautiously here with confidence intervals crossing one in both for this statistical analysis.
  
  Overall, this is a survey study with a great magnitude, which has important implications for cancer screening and public health in the US.
  
  The authors thank the reviewer for their kind assessment. The discussion now includes an acknowledgement of the weaknesses inherent in using self-report surveys, namely that they carry inherent biases and may not accurately represent the screening and colposcopy practices that could be ascertained via medical record or claims databases. Additionally, regarding confidence intervals that cross one, given the few studies that have explored factors associated with clinician perspectives on screening and colposcopy changes due to the pandemic, we desired a more broad-based approach to identifying factors associated with our outcomes of interest, and thus elected to use p = 0.10 as the significance level. This strikes a balance between the commonly accepted method of using the AIC (Akaike's Information Criterion, which implicitly assumes a significance level of 0.157) and the often-used significance level of 0.05. We now describe the choice of 0.10 in the text. However, we acknowledge that by using 0.10 as a significance level, some 95% confidence intervals for factors we consider significant cross one. We have tempered language in the discussion for findings with p-values between 0.05 and 0.10. Additionally, in examination of the confidence intervals for findings related to race that the reviewer mentions, we identified an error in the labelling of Table 3. Marginally significant findings for Asian clinicians actually apply to mixed race/other clinicians. We have corrected this error in Table 3 and throughout the manuscript. We thank the reviewer for bringing the confidence intervals that cross one to our attention as this triggered an examination of our findings.
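
  The 0.157 figure quoted for the AIC can be checked numerically. The sketch below is illustrative only and is not taken from the manuscript: it assumes the usual argument that AIC keeps one extra parameter when the likelihood-ratio statistic exceeds 2 (the AIC penalty per parameter), and that this statistic is approximately chi-square distributed with 1 degree of freedom. The function name is hypothetical.

```python
import math


def chi2_1df_upper_tail(x: float) -> float:
    """Upper-tail probability P(X > x) for a chi-square variable with 1 df.

    With 1 df, X is the square of a standard normal Z, so
    P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    """
    return math.erfc(math.sqrt(x / 2.0))


# Assumption: AIC adds a penalty of 2 per parameter, so one extra parameter
# is retained when the likelihood-ratio statistic exceeds 2.
aic_implied_alpha = chi2_1df_upper_tail(2.0)

# For comparison, the cutoff that corresponds to the conventional 0.05 level.
conventional_alpha = chi2_1df_upper_tail(3.841459)

print(f"AIC-implied significance level: {aic_implied_alpha:.3f}")
print(f"Level at the 3.84 cutoff:       {conventional_alpha:.3f}")
```

  Under these assumptions, the chosen level of 0.10 sits between the conventional 0.05 and the level implied by AIC-based selection, which is the balance the response describes.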
  
URL

medrxiv.org/content/10.1101/2023.01.11.23284437v1
www.biorxiv.org

New submission 23/03/2023, 10:21:16

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  This paper addresses the topic of how T cells migrate in different tissues. The authors provide experimental evidence that T cell migration in the lung is more confined than in lymph nodes and gut villi. While prior studies have started to define the way T cells migrate during normal and pathological conditions, there is still a lot to learn about the factors that control this process. Thus, the topic is significant and timely. The authors use previously acquired data with two-photon microscopy from murine tissues. They compare multiple motility parameters of T cells in lymph nodes, gut villi, and inflamed lungs. Experiments demonstrate that T cells in the lung have a particular mode of migration characterized by low speeds, back-and-forth motions, and confinement.
  
  Strengths:
  
  Overall, this is a very well-performed study. The data presented is of excellent quality and, for the most part, supports the authors' conclusions. The imaging techniques used to track T cells in various organs and the mouse models implemented are very relevant and robust. The functional analysis of the different migration features of T cells is compelling and should be of use to the community. The conclusion that T cells use different migration modes depending on the organ appears novel. This is considered of major significance.
  
  We appreciate these comments by the reviewer that the study is relevant, robust, and timely.
  
  Weaknesses:
  
  The main weakness of the manuscript is that the study remains descriptive and comparative. It is important to analyze and describe different migration modes depending on the organ. Still, it would have been desirable for the authors to provide information on the reason for such differences. One of the striking observations is the back-and-forth motion of T cells in the lung. Searching for mechanisms underlying this unique mode of displacement would strengthen the quality of the study.
  
  We agree that the next step is to determine the underlying cells, signals, and structures that determine motility differences between tissues. However, we believe that such a detailed study is beyond the scope of this manuscript, which is the first to directly compare the types of motility that distinguish T cell behavior in individual tissues such as villi and lung.
  
  Reviewer #3 (Public Review):
  
  The ability of T cells to move through a variety of complex and disparate tissue environments is fundamental to their success in surveying and responding to infectious challenges. A better understanding of the molecular cues that regulate T cell motility in tissues is needed in order to inform therapeutic targeting of T cell migration. Contributions that are intrinsic and extrinsic to the T cells themselves have been shown to shape the pattern of T cell movement. This study uses advanced quantitative image analysis tools to dissect differences in T cell motility in different tissue locations, to better define how the tissue environment shapes the pattern of motility and scope of tissue explored. The combination of different quantitative measures of motion enables the extensive characterization of CD8 T cell motility in the lymph node, lung, and villi of the small intestine. However, there are too many variables with respect to the CD8 T cell populations used for analysis to be able to gain new insight into the impact of the tissue microenvironment itself.
  
  The use of these advanced quantitative imaging analysis tools has the potential to significantly expand our analysis capabilities of T cell movement within and across tissues. The strength of the paper is the comprehensive analysis of multiple motility parameters designed with T cell function in mind. Specifically, with respect to the need for T cells to search a tissue area to identify antigen-bearing cells for T cell activation and identify cellular targets for the delivery of anti-microbial effector functions. The inclusion of an analysis of the "patrolled volume per time" is seen as a particularly useful advance to compare T cell behaviors across tissues.
  
  However, with the current data sets, it is difficult to draw definitive conclusions on the impact of the tissue environment on how T cells move, given the considerable variability in the CD8 T cells themselves. Extended experimentation would be needed to fully support their key claims. In particular:
  
  1) The authors have separated out naïve and activated CD8 T cells for their analysis, but this is a marked over-simplification. There are too many variables within these groups to be able to distinguish between differences in the T cell populations versus differences in the tissue environment. Variables include:
  
  a) T cells pre-activated in vitro before in vivo transfer (LPS-lung) versus transfer of naïve T cells for activation in vivo (Flu-lung, LCMV-villi)
  
  b) Polyclonal CD8 T cells (naïve, LPS-lung, Flu-lung) versus monoclonal (P14) CD8 T cells (LCMV-villi)
  
  c) Presence of cognate-antigen (Flu-lung, LCMV-villi) versus absence of antigen (LPS-lung)
  
  d) Cell numbers, 10^4 polyclonal naïve for Flu-lung versus 5 x 10^4 monoclonal (P14 T cells) for LCMV-villi
  
  e) Intravital imaging (LCMV-villi) versus tissue explants (Flu-lung)
  
  The reviewer is absolutely correct that many factors differ, and we have added details about these potential differences. However, we can conclude that there are similarities in motility despite tissue and T cell activation differences, particularly between naive T cells in LN and d8 activated CD8 T cells in the gut villi. We report that the most significant differences between T cell motility parameters are in activated CD8 T cells in the lung compared to those in other tissues, regardless of antigen specificity. These lead us to suggest that the specific motility differences we see in T cells in the lung are likely to be the result of a combination of factors which we hypothesize are likely to be due to molecular changes in both the T cells (chemokine receptors) and the tissue (cell types, chemokines, and structural components). Future work will include defining specific differences that lead to changes in motility.
  
  The authors do present data that suggest similarities of motility patterns within the same tissue occur despite variabilities in the CD8 T cell source, for example, the MSD is not significantly different in the two lung groups despite differences in the way the CD8 T cells were activated. However, these similarities are lost when other parameters are analyzed suggesting additional variability independent of the tissue itself.
  
  In addition to the MSD (Fig 3), we also include commonly analyzed parameters such as cell-based speed (Fig 2A). Regardless of the type of T cell, the median cell-based speeds range from 4.3 µm/min to 6.5 µm/min. Meandering ratio is also commonly used to analyze motility dynamics, and naive T cells (0.70) and activated T cells in villi (0.63) show similar meandering ratios (Fig 5).
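
  For readers unfamiliar with these motility parameters, the sketch below shows one common way to compute them from a tracked cell path. It is illustrative only and is not the authors' analysis pipeline: it assumes the meandering ratio is net displacement divided by total path length, and that the MSD at a given lag is the squared displacement averaged over all pairs of positions that many frames apart. The function names and the example coordinates are hypothetical.

```python
import math


def meandering_ratio(track):
    """Net displacement divided by total path length.

    A value of 1.0 means a perfectly straight path; values near 0 mean the
    cell ends up close to where it started despite travelling far.
    """
    path_length = sum(math.dist(a, b) for a, b in zip(track, track[1:]))
    if path_length == 0:
        return 0.0
    return math.dist(track[0], track[-1]) / path_length


def mean_squared_displacement(track, lag):
    """Squared displacement averaged over all position pairs `lag` frames apart."""
    pairs = list(zip(track, track[lag:]))
    return sum(math.dist(a, b) ** 2 for a, b in pairs) / len(pairs)


# Hypothetical 2D track (positions in micrometres, one per frame).
example_track = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (2.0, 1.0)]

ratio = meandering_ratio(example_track)
msd_lag1 = mean_squared_displacement(example_track, 1)
msd_lag2 = mean_squared_displacement(example_track, 2)
print(ratio, msd_lag1, msd_lag2)
```

  A confined, back-and-forth track of the kind described for T cells in the lung would give a low meandering ratio and an MSD that grows slowly with lag, whereas a more directed track gives a ratio closer to 1.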
  
  2) Controlled experiments are needed, where the input CD8 T cell population is kept constant and the target tissue differs, to substantiate any of the current conclusions. This could be done by using a single source and/or specificity of CD8 T cells (e.g., P14 or OT-I TCR transgenics, or polyclonal in vitro activated CD8 T cells) transferred into mice where the tissue providing the antigen or inflammation source is varied (lung with pOVA-flu versus small intestine with pOVA-LCMV for example).
  
  Alternatively, activated polyclonal CD8 T cells could be analyzed in the LPS-lung draining LN as well as in the LPS-lung to make a direct comparison between the tissues (LN versus lung) using CD8 T cells of the same activation status.
  
  The experimental systems cannot be directly compared except in some circumstances. For example, we included LPS-induced lung injury because we wanted to directly compare non-antigen specific with antigen specific activated T cells in the lung. We have compared motility of OTI Tg T cells responses in the lung with non-OTI Tg T cells and found similar motility and effector characteristics [15]. We have not repeated the additional controls requested here as OVA is a model antigen and commonly used as a tag to simply track CD8 T cell effector responses. There is vast literature showing similar responses between OVA-specific versus antigen specific CD8 T cell responses in multiple tissues, with OTI Tg T cells analyzed as “normal CD8 T cells”. Thus, while it is possible that imaging OTIs in multiple tissues could confirm that the type of T cells is “more similar” in each tissue, we do not believe adding this analysis would add to the overall conclusions of the manuscript as there is no data to suggest that OTIs would behave differently in different tissues. Adding in vitro activated CD8 T cells imaged in activated lymph nodes would add more variables (activated lymph node versus naive lymph node) which we do not believe would shed new light on our primary finding which is that the lung appears to induce specific types of T cell behavior compared to the naive lymph node and the gut.
  
  3) Differences in the micro-anatomical regions of the tissues studied may also contribute to tissue differences in movement patterns between the lung and the small intestine. The region of the small intestine imaged was specifically focused on the villi, close to the gut epithelium. Details of the location within the lung where images were taken are missing, therefore the motility differences between the lung and small intestine could reflect differences in the micro-anatomical position of the CD8 T cells within the tissue (proximal to epithelium versus parenchymal), rather than differences between the tissues themselves.
  
  The reviewer is absolutely correct and we have added greater discussion of this in both the Introduction and Discussion.
  
  Overall, the authors have developed a quantitative multi-parameter approach to the study of T-cell motility in different tissues. Application of these analytical tools to the study of T-cell behavior in different tissue locations has the potential to reveal tissue and/or T-cell-specific patterns of movement that may help to identify molecular requirements for context-specific dynamic T-cell behavior. Their quantitative approach reveals small but statistically significant differences in particular motility parameters, the functional significance of which will require further study. The careful design of experiments to reduce as many variables as possible will be needed to increase the impact of the work and ensure new insights into this important aspect of T-cell function.
  
URL

biorxiv.org/content/10.1101/2022.11.17.516891v2
www.biorxiv.org

New submission 27/01/2023, 10:04:07

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The manuscript by Curtis et al. reports the interaction between CaMKII and alpha-actinin-2. The authors found that the interaction was elevated after NMDA receptor activation in dendritic spines. In addition, this study reveals that NMDA receptor binding to CaMKII facilitates alpha-actinin-2 access to the CaMKII regulatory segment, indicating that the NMDA receptor is involved in this interaction. The authors identified that the EF1-4 motifs mediate this interaction, and overexpression of this motif inhibited structural LTP. Moreover, biochemical measurements of affinities from various combinations of protein fragments, including autoinhibited CaMKII 1-315, regulatory segments of CaMKII, and the EF-hand motif, reveal that autoinhibited CaMKII has limited access to alpha-actinin-2. The authors also solved the structure of the interaction, supporting their finding in neurons at the molecular level. The authors claim that the interaction between CaMKII and alpha-actinin-2 is essential for structural LTP through cooperative action by the NMDA receptor and actin cytoskeleton.
  
  Overall, the experiments are well-designed and the results are largely convincing and well-interpreted. But some aspects of the experiments need to be clarified.
  
  1) Time resolution of the interaction analysis appears to be poor, as calcium elevation in a dendritic spine would be on the order of milliseconds. What is the time window for alpha-actinin-2 to interact with CaMKII during NMDA receptor activation or LTP?
  
  We have performed additional time-course experiments to determine how quickly interactions between alpha-actinin-2 and CaMKII are elevated following NMDAR activation. The results of these experiments are shown in Figure 2A and Figure 2-Figure Supplement 1. We found that the change in association was established rapidly after NMDAR activation (t50% = 22±1 s, Figure 2A), which is consistent with proposed time-courses for CaMKII interactions following the induction of LTP (see Yasuda, Hayashi & Hell, Nat Rev Neuroscience, 2022, PMID 36056211). We have included additional text in the results (lines 138-147), methods (lines 609-611 & 650-652), and discussion (lines 426-427) sections explaining these experiments, and figure legends are provided for the new figures on lines 1006-1009 and lines 1096-1101.
  
  2) The authors analyzed the binding of CaMKII and alpha-actinin-2 with partial fragments. It remains unknown whether CaMKII can form a protein complex with GluN2B and alpha-actinin-2 in a single CaMKII protomer.
  
  The reviewer is referring to experiments shown in Figure 5, in which we found that a fragment of GluN2B (1260-1492) increases pull-down of full-length CaMKIIα with a fusion of GST to the EF3-4 region of α-actinin-2. This region of GluN2B contains a CaMKII phosphorylation sequence (positions 1290-1309) that occupies the substrate binding groove of the kinase domain (Stratton et al., Cell Reports, 2023, PMID 35830796). Therefore, the most logical explanation for the results of the pull-down experiment is that GluN2B increases α-actinin-2 access to the regulatory segment by binding to the substrate binding groove of the same CaMKII protomer. Nevertheless, we discuss the difficulty of conceptualising and investigating interactions between oligomeric proteins within the PSD on lines 451-461.
  
  3) Besides synaptic localization, the effect of the interaction on the enzymatic activity of CaMKII is not known.
  
  The Colbran laboratory has previously examined the effect of α-actinin-2 on CaMKII activity. Jalan-Sakrikar and colleagues (JBC, 2012, PMID 22427672) showed that a fragment of α-actinin-2 corresponding to EF hands 3 and 4 is able to weakly activate CaMKII (~10% compared to Ca2+/CaM) towards peptide substrates autocamtide-2 and GluN2B but not syntide-2 (see Figure 1B&C of this paper). An earlier study by Robison and colleagues (JBC, 2005, PMID 16172120) found that α-actinin-2 antagonises Ca2+/CaM-dependent activation of unphosphorylated CaMKII towards autocamtide-2, but does not affect the activity of pT286 auto-activated CaMKII (see Figure 4A of this paper). This work is referred to on lines 63-65 of the introduction.
  
  4) Although the authors quantify the effect of the EF-hand disruptor by measuring numbers of the dendritic spine by its shape, the specificity of the EF-hand disruptor needs to be clarified.
  
  There are two known interaction partners for the EF hand region of α-actinin-2: CaMKII and titin (Young et al., EMBO J, 1998, PMID 9501083; Atkinson et al., Nat Struct Biol, 2001, PMID 11573089). Titin is an extremely long sarcomeric protein that is expressed in striated muscle cells but not neurons. Therefore, the effects of the disruptor are highly likely to reflect disruption of interactions with CaMKII. We also performed control experiments with EF3-4 L854R, which does not bind CaMKII effectively (Figure 3-figure supplement 1C). We have added a sentence to clarify the specificity of the EF-hand disruptor on lines 182-184, as follows: “Furthermore, the only known interaction partner for the EF1-4 region of α-actinin-2 besides CaMKII is the muscle-specific protein titin (Young et al., 1998), so any effects of EF1-4 in neurons are likely to reflect destabilisation of native interactions between CaMKII and α-actinin-2”.
  
URL

biorxiv.org/content/10.1101/2022.12.04.519035v1
www.biorxiv.org

New submission 24/07/2023, 09:05:50

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This study uses electrophysiological techniques in vitro to address the role of the Na+ leak channel NALCN in various physiological functions in cartwheel interneurons of the dorsal cochlear nucleus. Comparing wild type and glycinergic neuron-specific knockout mice for NALCN, the authors show that these channels 1) are required for spontaneous firing, 2) are modulated by noradrenaline (NA, via alpha2 receptors) and GABA (through GABAB receptors), and 3) mediate the enhancement of IPSCs by NA in these neurons.
  
  This work builds on previous results from the Trussell lab on the physiology of cartwheel cells, and from other labs on the role of NALCN channels, which have recently been characterized in a growing number of brain areas; for this reason, this study could also be of interest to researchers who work in other preparations. The general conclusions are strongly supported by results that are clearly and elegantly presented.
  
  I have a few comments that, in my opinion, might help clarify some aspects of the manuscript.
  
  1) It is mentioned throughout the manuscript, including the abstract, that the results suggest a close apposition of NALCN channels and alpha2 and GABAB receptors. From what I understand, this conclusion comes from the fact that GABAB receptors activate GIRK channels through a membrane-delimited mechanism. Is it possible that these receptors converge on other effectors, for example adenylate cyclase (see https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6374141/)?
  
  It will be of interest to test the roles of adenylyl cyclase modulation in the control of NALCN, as a complement to the studies we have presented here.
  
  2) In Figure 2G, the neurons from NALCN KO mice appear to reach a significantly higher frequency than those from WT (figure 2E, 110 vs. 70 spikes/s). Was this higher frequency a feature of all experiments? The results mention a rundown of peak firing rate due to whole-cell dialysis, but, from what I understand, the control conditions should be similar for all experiments.
  
  The peak firing rates in control solutions for WT and KO CWC are not statistically different.
  
  3) Also in Figure 2, the firing patterns for neurons from WT and NALCN KO mice appear to be quite different, with spikes appearing to be generated during the hyperpolarization of the bursts in the second half of the current step for WT neurons but always during the depolarization in KO neurons. Was this always the case? If so, could NALCN channels be involved in this type of firing? Along these lines, it would be interesting to show an example of a firing pattern of neurons from WT mice in the presence of NA, which inhibits NALCN channels.
  
  The specific pattern of spikes in CWC is quite variable from trial to trial or cell to cell, as it depends on multiple CaV and calcium-dependent K channel subtypes, and it does not depend on the genotypes used here. The primary effects observed in the KO are in background firing and sensitivity to NA, both of which are reflected in alterations in rheobase. The firing pattern example requested was shown in the raster plot of Figure 2B2.
  
  4) It might be interesting to discuss how the hyperpolarization induced by the activation of GIRK channels and inhibition of NALCN channels could have different consequences due to their opposite effect on the input resistance.
  
  We considered this as a point of discussion, but decided that making sense of it would depend on assumptions about the location of the channels (dendritic vs somatic, distance to AIS) that we do not have data for. For example, a dendritic increase in resistance through NALCN block, leading to a hyperpolarization of the soma, might have actions similar to a somatic hyperpolarizing conductance increase by GIRK, as far as the voltage at the AIS is concerned.
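  
  The opposite effects on input resistance raised by the reviewer can be illustrated with a minimal sketch of a passive single compartment. This is not the authors' model: the conductance names, the reversal potentials, and all numerical values below are illustrative assumptions, and the sketch deliberately ignores the channel-location issues discussed above.

```python
# Passive single-compartment sketch (illustrative assumptions only).
# Resting potential is the conductance-weighted mean of the reversal
# potentials; input resistance is the inverse of the total conductance.

def steady_state(conductances_nS, reversals_mV):
    """Return (resting potential in mV, input resistance in GOhm)."""
    g_total = sum(conductances_nS.values())
    v_rest = sum(g * reversals_mV[name]
                 for name, g in conductances_nS.items()) / g_total
    r_in_gohm = 1.0 / g_total  # 1 / nS equals GOhm
    return v_rest, r_in_gohm

# Hypothetical values, not measurements from the study.
REVERSALS_MV = {"leak_K": -90.0, "nalcn": 0.0, "girk": -90.0}
baseline = {"leak_K": 4.0, "nalcn": 1.0, "girk": 0.0}
nalcn_blocked = {**baseline, "nalcn": 0.5}  # partial inhibition of the Na+ leak
girk_open = {**baseline, "girk": 1.0}       # added K+ conductance

v_base, r_base = steady_state(baseline, REVERSALS_MV)
v_block, r_block = steady_state(nalcn_blocked, REVERSALS_MV)
v_girk, r_girk = steady_state(girk_open, REVERSALS_MV)
```

  Under these assumed values both manipulations hyperpolarize the compartment, but closing NALCN raises the input resistance whereas opening GIRK lowers it.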
  
  Reviewer #3 (Public Review):
  
  The study by Ngodup and colleagues describes the contribution of the sodium leak NALCN conductance to the effects of noradrenaline on cartwheel interneurons of the DCN. The manuscript is very well-written and the experiments are well-controlled. The scope of the study is of high biological relevance and recapitulates a primary finding of the Khaliq lab (Philippart et al., eLife, 2018) in ventral midbrain dopamine neurons, that Gi/o-coupled receptors inhibit NALCN current to reduce neuronal excitability. Together these studies provide unequivocal evidence for NALCN as a downstream target of these receptors. There are no major concerns. I have only minor suggestions:
  
  Minor
  
  1) As noted in the Introduction, NALCN is inhibited by extracellular calcium, which has led to some debate about the relevance of NALCN when recorded in 0.1 mM calcium. A strength of this study is that the effect of NA on NALCN is recorded in physiological levels of calcium (1.2 mM). I suggest including the concentration of extracellular calcium in the aCSF in the Results section instead of relying on the reader to look to the Methods.
  
  Will do.
  
  2) It would be interesting to include the basal membrane properties of the KO compared to wildtype, including membrane resistance and resting membrane potential. From the example recording in Figure 2, one might think that the KOs have lower membrane resistance, so it is interesting that the 2 mV hyperpolarization produced similar effects on rheobase. In addition, from the example in Figure 2G, it appears that NA has an effect on firing frequency with large current injection in the KO. Is this true in grouped data and if so, is there any speculation into how this occurs?
  
  Will do.
  
  3) Please expand on the rationale for why GABAB and alpha2 must be physically close to NALCN. To my knowledge, the mechanism by which these receptors inhibit NALCN is not known. Must it be membrane-delimited?
  
  Given the known membrane-delimited modulation of GIRK by GABAB, and that alpha2 and GABAB receptors appear to share the same population of NALCN channels, and that alpha2 receptors do not appear to target GIRK channels, we felt the simplest explanation would be coupling through G-proteins, with spatial segregation of different receptor/channel pools providing the means for separating GIRK and NALCN effects. However, the involvement of an additional second messenger is testable.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.23.546323v2
www.biorxiv.org www.biorxiv.org

A Synergistic Workspace for Human Consciousness Revealed by Integrated Information Decomposition

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  We wish to thank the Reviewers for the appreciation they have expressed for our work, and for the constructive feedback that they offered. We agree that clarifying the interpretation of synergy and information decomposition in the context of macroscale BOLD signals and loss of consciousness will be a valuable addition to the manuscript, as will improving the quality of our figures, and we will endeavour to do both. Briefly, at this stage we just wish to clarify that it is not our intention to claim that Phi-R and synergy, as measured at the level of regional BOLD signals, represent a direct cause of consciousness, or are identical to it. Rather, our work is intended to use these measures similarly to the use of sample entropy and LZC for BOLD signals: as theoretically grounded macroscale indicators, whose empirical relationship to consciousness may reveal the relevant underlying phenomena. We will ensure that our updated manuscript reflects this additional nuance.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2020.11.25.398081v3
www.biorxiv.org www.biorxiv.org

New submission 24/07/2023, 08:42:16

1
1. Public_Reviews 24 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their very thorough and detailed comments as well as the overall positive reception of the work. Additionally, the reviewers provided excellent detailed suggestions for future work.
  
  Specific response to Reviewer 1:
  
  “Indeed, the major disappointment of this work is the clinical relevance that was highlighted in the Introduction but was never really studied in the end. iPSC from patients could be added to the study.”
  
  We completely agree that it would be very exciting to use patient-derived iPSCs in the platform that we describe in this manuscript. We recognize that extensive work to characterize and validate BMECs differentiated from patient-derived iPSCs would be required, including validating BBB-like properties, before retinol transport data could be collected and interpreted. This work is beyond the scope of the current manuscript. We hope that in the future the in vitro model we describe in this manuscript will be used for exactly this type of clinically relevant application.
  
  Specific response to Reviewer 2:
  
  “1) The authors assume that there is a significant fraction of free ROL, 20% for ROH/RBP and 7% for RBP/TTR complexes (summarized in Table 1). This implies that at the physiological concentration of ROH/RBP in the plasma of 2 uM, free ROL represents 0.4 uM. However, the concentration of free ROL is limited by its poor solubility in the aqueous phase, which is around 0.06 uM (Szuts EZ, 1991, Arch Biochem Biophys). Moreover, taking into account the large concentration of other potential nonspecific carriers for lipids, it is safe to assume that there is virtually no free ROH in the plasma. There is also an important physiological reason for the limited amount of free ROL. Its rapid and nonspecific partition into cells (also observed in this study) would work against the highly specific RBP/STRA6-dependent ROH uptake pathway, undermining its physiological function.”
  
  The reviewer raises an important point that we considered carefully during the design of the research. As the reviewer says, Szuts (1991) reported retinol (ROH) solubility of ~0.06 µM (range of 0.03 – 0.11 µM). Szuts defined ROH solubility as ‘the amount of dissolved solute in equilibrium with its solid state…includ[ing] all its dissolved forms (monomers, multimers, and micelles)’. We are using a definition of ‘free’ ROH as ‘ROH not bound to protein’; in our work ‘free’ ROH could include retinol multimers and micelles, which likely do exist under our experimental conditions. (We did not see any evidence of solid ROH.) That said, we calculate that the concentration of free ROH (ROH not bound to protein) is ~0.14 µM when both RBP and TTR are present. In more complex biological mixtures containing other ROH carriers, the concentration of unbound ROH is expected to be lower, in agreement with the reviewer.
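  
  The arithmetic behind the concentrations discussed above can be sketched as follows. The unbound fractions (20% with RBP alone, 7% with RBP and TTR) and the 2 µM total are the figures quoted in the reviewer's comment, and the solubility range is the one attributed to Szuts (1991); the function and variable names are hypothetical and chosen only for illustration.

```python
# Worked arithmetic: unbound ROH = total ROH x unbound fraction.
# Fractions and totals are those quoted in the exchange above;
# names are illustrative, not taken from the manuscript.

def unbound_concentration_uM(total_uM, unbound_fraction):
    """Concentration of ROH not bound to protein, in micromolar."""
    return total_uM * unbound_fraction

TOTAL_ROH_UM = 2.0                     # plasma ROH/RBP concentration cited by the reviewer
SZUTS_SOLUBILITY_RANGE_UM = (0.03, 0.11)

free_rbp_only = unbound_concentration_uM(TOTAL_ROH_UM, 0.20)  # RBP only
free_rbp_ttr = unbound_concentration_uM(TOTAL_ROH_UM, 0.07)   # RBP and TTR present

# Ratio to the upper end of the reported solubility range.
excess_over_solubility = free_rbp_ttr / SZUTS_SOLUBILITY_RANGE_UM[1]
```

  With these inputs the sketch gives 0.4 µM without TTR and 0.14 µM with TTR, the latter being the value stated in the response; both lie above the upper end of the reported solubility range, which is why the definition of "free" ROH (including multimers and micelles) matters for the comparison.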
  
  One key point is that the free ROH concentration depends on the experimental setup, and must be correctly accounted for. For example, in some of the literature investigating STRA6-mediated uptake and signaling in vitro, purified ROH-RBP is used as the retinol source and samples do not include TTR. In such a case, the unbound ROH concentration in an equilibrated sample is anticipated to be significantly higher than the physiological concentration. Our investigation demonstrates that unbound ROH can accumulate intracellularly; thus, failure to include TTR and/or to account for the action of unbound ROH could lead to errors in mechanistic interpretation of experimental studies on retinol transport into cells or across barriers such as the BBB.
  
  2) “However, a question remains: would the outcome of the experiment be different if the basolateral chamber contained an ROH acceptor (retinol-binding proteins) rather than Hank's balanced salt solution, to which the partition of ROL is limited by its water solubility?”
  
  We agree with the reviewer that it would be very interesting to determine whether retinol permeability changes in the presence of RBP and/or TTR on the basolateral side. This is a logical next step and can readily be performed in the Transwell setup. We chose not to do this for this project because we wanted to compare our setup with other in vitro models (e.g., with porcine BMECs) where no retinol-binding proteins were present basolaterally.
  
  3) “The authors claim that transthyretin (TTR) increases BMECs permeability when compared to ROH/RBP. However, the mechanistic explanation for this phenomenon remains unclear. Do the authors imply the presence of a putative TTR receptor whose signaling could affect the efflux of ROL at the basolateral side of BMECs? TTR is an ubiquitous plasma protein. The concentration of TTR is tightly regulated and maintained between 300 - 330 mg/L. Therefore, it is questionable how TTR can serve as a signaling molecule modulating retinoid homeostasis in the brain.”
  
  We disagree with the reviewer about the TTR concentration. Per Johnson et al (Clin Chem Lab Med 2007, 45:419-426), TTR concentration varies with age, gender, inflammation and nutritional status, with typical concentrations for adults ranging from 150-450 mg/L. We were surprised at our observations that TTR enhanced ROH permeability across BMECs and that LRAT expression increased in the presence of TTR. We do not currently have a mechanistic interpretation and agree with the reviewer that further exploration of these tantalizing observations is warranted.
  
  “Additional technical issues that could affect the experimental outcomes: The formation of the ROH/RBP-TTR complex should be confirmed and purified using gel filtration to separate free TTR and ROH/RBP. Only fractions containing the complex should be used in the experiments. Assuming that the complex is formed with 100% efficiency is overly optimistic.”
  
  We respectfully disagree with the reviewer regarding using gel filtration to isolate TTR/ROH/RBP complexes. Any such isolated complexes will fairly rapidly re-equilibrate so that some protein and some ROH are unbound. It is important to note that we do not assume that the complex is formed with 100% efficiency. On the contrary, we explicitly take into account the distribution of materials (free TTR, free RBP, free ROH, RBP-ROH, TTR-RBP-ROH) in any sample; values are reported in the manuscript. This issue is also relevant to the first point raised by the reviewer. We routinely validated binding of ROH to RBP by FRET and ROH-RBP to TTR by fluorescence anisotropy.
  
  “Reloading RBP with isotopically labeled ROH requires an additional purification step. Stripping ROL from the ROH/RBP complex with organic solvent (diethyl ether) is appropriate but relatively harsh, causing partial unfolding of a fraction of RBP. Therefore, assuming that 100% of stripped RBP remains functional and can be reloaded with ROH is inaccurate. Reloading apo-RBP with a stoichiometric amount of ROH without an additional purification step (e.g., ion exchanger) leads to an excess of free ROL and/or its nonspecific association with nonfunctional RBP fractions. Measuring absorbance at 330 nm is not sufficient proof of binding since free ROH also absorbs at the same wavelength.”
  
  We produced RBP by refolding of guanidine-denatured RBP in an excess of ROH to ensure near 100% ROH loading. The quality of refolded RBP can be assessed qualitatively from the A330/280 absorbance ratio, which should be ~1.0. We then extract ROH to completion with diethyl ether to produce pure apo-RBP (ROH-free). We used this diethyl ether-stripped apo-RBP stock for all subsequent characterizations, including binding to ROH and TTR. We found our stripped apo-RBP was a suitable replacement for serum sources in every biophysical assay performed. Reloaded ROH-RBP elutes as a single peak on ion exchange chromatography, indicating the vast majority of stripped RBP is available for ROH binding. We provide detailed information about RBP characterization in Est and Murphy, Prot. Exp. Purif. (2020), to which the interested reader is referred.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.11.536348v1
www.biorxiv.org www.biorxiv.org

New submission 21/07/2023, 09:17:24

1
1. Public_Reviews 21 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their careful reading of our manuscript and for their constructive and positive comments. We will revise the manuscript to address their key points. Here, we address the reviewers’ scepticism about sleep-learning being mediated by the episodic memory system. We agree that the reported unconscious learning of novel verbal associations during sleep may not match textbook definitions of episodic memory. However, the traditional definitions of episodic memory have long been criticized (e.g., Henke, 2010; Hannula, Minor, Slabbekoorn, 2023; Shohamy & Turk-Browne, 2013; Dew & Cabeza, 2011; Reder et al, 2009). We stand by our claim that sleep-learning was of an episodic nature. Here, we provide arguments for this claim:
  
  In the introduction and the discussion, we report that we use a computational definition of episodic memory (Cohen & Eichenbaum, 1993; Henke, 2010; O’Reilly et al., 2014; O’Reilly & Rudy, 2000), and not the traditional definition of episodic memory that ties episodic memory to wakefulness and conscious awareness (Gabrieli, 1998; Moscovitch, 2008; Schacter, 1998; Squire & Dede, 2015; Tulving, 2002). Consciousness and wakefulness are not properties of episodic memory according to the computational definition. Instead, the core computational features of episodic memory according to this definition are 1) rapid learning, 2) association formation, and 3) a compositional and flexible representation of the associations in long-term memory. We designed the retrieval task in the current study to assess only the retention of sleep-formed flexibly and compositionally stored word-word associations. Reviewer 3 suggests that sound-sound associations may have been formed during sleep and may have been reactivated at test resulting in the translation of the sound pattern of the translation word to the meaning of the translation word and further to the correct superordinate semantic category of the translation word. Although these processing steps during sleep and during the wake retrieval are possible, the rapid sound-sound associative encoding, long-term storage, and the flexible sound retrieval would still require hippocampal processing and hence computations in the episodic memory system. The interpretation in terms of associative auditory learning with a double semantic translation at wake testing is laborious and inefficient and hence a less parsimonious interpretation of sleep-learning than conceptual associative encoding during sleep. Our view resonates with the findings by Andrillon et al. (2017) that mere auditory perceptual learning during slow-wave sleep was not possible at all or led to suppressive memory traces that could not be retrieved following awakening.
  
  Importantly, Züst et al. (Current Biology, 2019) had also presented pseudowords and translation words for paired-associative word encoding during slow-wave sleep. Retrieval testing was performed in the waking state following sleep by use of a cued-recall task, as in the current study. During retrieval testing, Züst et al. recorded brain blood oxygenation using functional magnetic resonance imaging. Importantly, the hippocampus was activated during correct, but not incorrect, retrieval of memories that had been formed during sleep. Crucially, activation resulting from this contrast within the posterior and anterior hippocampus and within lexical-semantic storage sites in the left temporal pole correlated between participants with retrieval performance (Züst et al., 2019). These correlation results demonstrate that those participants who learned the vocabulary best during slow-wave sleep activated the hippocampus and lexical-semantic storage sites the most during wake retrieval testing. Because the learning and retrieval tasks in the current study were similar to those of Züst et al. (2019), the hippocampus was likely mediating the retrieval of the sleep-formed associations in the current study. We have also measured brain oxygenation using functional magnetic resonance imaging in five persons while they learned pairs of pseudowords and translation words during slow-wave sleep and found the hippocampus activated (besides language areas) in all persons (unpublished).
  
  For these reasons, we believe that vocabulary presentations during sleep had triggered a hippocampus-mediated rapid conceptual-associative encoding process that provided for flexible representations of combinations of pseudowords and translation words in episodic memory.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.24.513503v3
www.biorxiv.org www.biorxiv.org

New submission 20/07/2023, 09:43:40

1
1. Public_Reviews 20 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their insightful reviews of our work, including both its strengths and limitations. Below we present minor corrections to the preprint and responses to the main points brought up by each reviewer.
  
  Erratum:
  
  Line 330 refers to Fig. 7F (instead of 7D).
  
  Line 331 refers to Fig. 7G (instead of 7E).
  
  Reviewer #1 (Public Review):
  
  The experimental design presented cannot clearly show that the effect of passive exposure was due to the specific exposure to task-relevant stimuli since there is no control group exposed to irrelevant stimuli.
  
  We acknowledge the possibility that exposure to task-irrelevant stimuli could result in improvements in learning. Testing this possibility would be a worthwhile goal of future experiments, but it is outside the scope of our current study. We have been careful in our paper to only draw conclusions about the effects of exposure to task-relevant stimuli compared to no exposure. We will also add a discussion of this point and references to the literature pointed out by the reviewer to the final version of our manuscript.
  
  The conclusion that "passive exposure influences responses to sounds not used during training" (line 147) does not seem fully supported by the authors' analysis. The authors show that there is an increase in accuracy for intermediate sweep speeds despite the fact that this is the first time the animals encounter them in the active session. However, it seems impossible to exclude that this effect is simply due to the increased accuracy of the extreme sounds that the animals had been trained on.
  
  The conclusion that the reviewer quotes from our paper is drawn from Figure 3, in which we show that mice exhibit an improvement on non-extreme stimuli after training on extreme stimuli. Panel 3D illustrates that the observed improvements are not just changes in psychometric performance driven by the extreme sounds. In the context of this result, the conclusion relates to generalization in performance on task-relevant stimuli that are closely related to the training stimuli. In our view, it was not entirely obvious a priori that this result would have to occur, since it is possible that performance could improve at the extremes without improving at the intermediate stimuli.
  
  In the modelling section, the authors adjusted the hyper-parameters to maximize the difference between pure active and passive/active learning. This makes a comparison of learning rates between models somewhat confusing.
  
  We apologize for the confusion. None of our conclusions are based on comparisons of learning speed between models, but perhaps this was not pointed out sufficiently clearly. The relevant comparisons between conditions for each specific model are made using the same hyperparameters. We will clarify this in the updated version of our manuscript.
  
  The description of the sound does not state whether when reducing the slope of the sweeps the center or the onset frequency of the sounds is preserved.
  
  Frequency-modulated sounds of different FM slopes were generated such that the center frequency was always the same. This will be clarified in the updated version of our manuscript.
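  
  One way to hold the center frequency fixed while varying the FM slope is sketched below. The response does not state whether the sweeps are linear or logarithmic in frequency, so this sketch assumes sweeps that are linear in octaves (so that "center" means the geometric mean of the start and end frequencies); the function name and all numerical values are hypothetical.

```python
import math

# Sketch, assuming sweeps linear in octaves: the start and end frequencies
# are placed symmetrically (in octaves) around a fixed center frequency.

def sweep_endpoints(center_hz, slope_oct_per_s, duration_s):
    """Return (start_hz, end_hz) for a sweep centered on center_hz.

    A negative slope gives a downward sweep.
    """
    half_span_oct = slope_oct_per_s * duration_s / 2.0
    start_hz = center_hz * 2.0 ** (-half_span_oct)
    end_hz = center_hz * 2.0 ** half_span_oct
    return start_hz, end_hz

# Hypothetical stimulus set: the same center frequency for every slope.
CENTER_HZ = 10_000.0
DURATION_S = 0.1
endpoints = {slope: sweep_endpoints(CENTER_HZ, slope, DURATION_S)
             for slope in (-8.0, -4.0, 4.0, 8.0)}
```

  In this parameterisation, reducing the magnitude of the slope narrows the frequency range covered by the sweep but leaves its geometric center unchanged.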
  
  Reviewer #2 (Public Review):
  
  One limitation here is that the presented analysis is somewhat simplistic, does not include any detailed psychometric analysis (bias, lapse rates etc), and primarily focuses on learning speed.
  
  In our analyses of trials that included extreme and intermediate stimuli, we investigated some metrics of the type that the reviewer suggests here. However, since such additional psychometric analyses generally led to null results and would in any case be somewhat tangential to our main results, which are about learning speed and responses to sounds not included during training, we did not include these in our manuscript. A limitation of our study is that the available data does not allow for an analysis of psychometrics during the initial learning stages, since only the extreme stimuli were presented during the task.
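  
  For readers unfamiliar with the metrics the reviewer mentions, a common way to express bias and lapse rate is a logistic psychometric function with symmetric lapses. The sketch below is a generic textbook parameterisation, not the analysis used in the manuscript; the function name, parameter names, and default values are assumptions made for illustration.

```python
import math

# Generic logistic psychometric function with symmetric lapses
# (illustrative parameterisation, not the authors' analysis).

def psychometric(x, bias=0.0, slope=1.0, lapse=0.0):
    """Probability of one of the two choices for stimulus value x.

    bias  : stimulus value at which both choices are equally likely
    slope : scale of the logistic (smaller values give a steeper curve)
    lapse : probability of a stimulus-independent error at each extreme
    """
    core = 1.0 / (1.0 + math.exp(-(x - bias) / slope))
    return lapse + (1.0 - 2.0 * lapse) * core

# Hypothetical stimulus axis, e.g. normalised FM slope from -1 to 1.
stimuli = [-1.0, -0.5, 0.0, 0.5, 1.0]
curve = [psychometric(x, bias=0.0, slope=0.25, lapse=0.05) for x in stimuli]
```

  Fitting all three parameters requires responses at intermediate stimulus values, which is consistent with the limitation noted above that only extreme stimuli were presented during initial learning.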
  
  Reviewer #3 (Public Review):
  
  The first [major weakness] is that even Model 5 differs from their data. For example, the A+P (passive interleaved condition) learning curve in Figure 7 seems to be non-monotonic, and has some sort of complex eigenvalue in its decay to the steady state performance as trials increase. This wasn't present in their experimental data (Figure 2D), and implies a subtle but important difference. There also appear to be differences in how quickly the initial learning (during early trials) occurs for the A+P and A:P conditions. While both A+P and A:P conditions learn faster than A only in M5, A+P and A:P seem to learn in different ways, which isn't supported in their data.
  
  The reviewer is correct that there are subtle differences between the two learning curves produced by Model 5. Due to noise in the experimental data, however, it is possible that such subtle distinctions also appear in the learning curves of the mice. Further, the slight overshoot of the learning curve that the reviewer mentions is not constrained by the experimental data due to the fact that different mice reach asymptotic performance at different times, and many of them have not even reached asymptotic performance by the end of the training period.
  
  However, even if there are minor discrepancies between the learning curves produced by the final version of the model and by the mice, we do not see this as being especially surprising or problematic. As in any model, there are a large number of potentially important features that are not included in any of our models, for example, realistic spectrotemporal neural responses, nonlinearity in neural activations, heterogeneity across mice, and many others. The aim of our modeling was to choose a space of possible models (which is inevitably restricted) and show which model version within that space best captures our experimental observations. Expanding the space of possible models that we considered to capture further nuances in the data will be a task for future work.
  
  The second major weakness is that the authors also don't generate any predictions with M5. Can they test this model of learning somehow in follow-up behavioural experiments in mice? ... Without follow-up experiments to test their mechanism of why passive exposure helps in a schedule-independent way, the impact of this paper will be limited.
  
  Although testing behavioral predictions from our models was beyond the scope of the current study, M5 does generate specific predictions about neural representations and the ways in which they evolve through learning, and we hope to test these predictions in future work.
  
  I believe the authors need to place this work in the context of a large amount of existing literature on passive (unsupervised) and active (supervised) learning interactions. This field is broad both experimentally and computationally. For example, there is an entire sub-field of machine learning, called semi-supervised learning that is not mentioned at all in this work.
  
  We thank the reviewer for pointing this out. The updated version of our manuscript will include a discussion on how our results fit in with this literature.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.04.535463v2
www.biorxiv.org www.biorxiv.org

New submission 09/05/2023, 15:27:34

1
1. Public_Reviews 20 Jul 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public Review):
 
 This study presents an important finding on human m6A methyltransferase complex (including METTL3, METTL14 and WTAP). The evidence supporting the claims of the authors is convincing, although the model and assays need to be further modified. The work will be of interest to biologists working on RNA epigenetics and cancer biology.
 
 In mammals, a large methyltransferase complex (including METTL3, METTL14 and WTAP) deposits m6A across the transcriptome, and METTL3 serves as its catalytic core component. In this manuscript, the authors identified two cleaved forms of METTL3 and described the function of METTL3a (residues 239-580) in breast tumorigenesis. METTL3a mediates the assembly of the METTL3-METTL14-WTAP complex, global m6A deposition and breast cancer progression. Furthermore, the METTL3a-mTOR axis was uncovered to mediate the METTL3 cleavage, providing a potential therapeutic target for breast cancer. This study is properly performed and the findings are very interesting; however, some problems with the model and assays need to be addressed. It is widely known that METTL3 and METTL14 form a stable heterodimer with a stoichiometric ratio of 1:1 (Wang X et al. Nature 534, 575-578 (2016), Su S et al. Cell Res 32(11), 982-994 (2022), Yan X et al. Cell Res 32(12), 1124-1127 (2022)); however, the numbers of METTL3 and METTL14 in the model of Fig 7P are not equivalent and need to be modified.
 
 We thank the reviewer for this good suggestion. We have modified the model in Fig. 7P.
 
 Reviewer #2 (Public Review):
 
 In this study, Yan et al. report that a cleaved form of METTL3 (termed METTL3a) plays an essential role in regulating the assembly of the METTL3-METTL14-WTAP complex. Depletion of METTL3a leads to reduced m6A level on TMEM127, an mTOR repressor, and subsequently decreased breast cancer cell proliferation. Mechanistically, METTL3a is generated via 26S proteasome in an mTOR-dependent manner.
 
 The manuscript follows a smooth, logical flow from one result to the next, and most of the results are clearly presented. Specifically, the molecular interaction assays are well-designed. If true, this model represents a significant addition to the current understanding of m6A-methyltransferase complex formation.
 
 A few minor issues detailed below should be addressed to make the paper even more robust. The specific comments are contained below.
 
 1) The existence of METTL3a and METTL3b. In this study, the author found the cleaved form of METTL3 in breast cancer patient tissues and breast cancer cell lines. Is it a specific event that only occurs in breast cancer? The author may examine the METTL3a in other cell lines if it is a common rule.
 
 We thank the reviewer for pointing this out. We discovered the cleaved form of METTL3 in breast cancer, and we further examined this cleaved METTL3 in other cell lines, including lung cancer cell lines, renal cancer cell lines, HCT116 and MEF (new Supplementary Figures 1A-1C). These data suggest that the cleavage is a general phenomenon. Therefore, we speculate that METTL3a may be ubiquitously expressed. We have added this to the revised manuscript; please see Lines 118-120.
 
 2) Generation of METTL3a and METTL3b.
 
 1) Figure 1 shows that METTL3a and METTL3b were generated from the C-terminal of full-length METTL3. Because the sequence of METTL3a is involved in the sequences of METTL3b, can METTL3b be further cleaved to produce METTL3a?
 
 Although the sequence of METTL3a is contained within the sequence of METTL3b, overexpression of METTL3b in T47D, MDA-MB-231 and 293T cells did not produce METTL3a expression in these cells (please see Figures 3A, 3C, 3G), suggesting that METTL3b cannot be further cleaved to produce METTL3a, and that METTL3 cleavage may require its N-terminal region. We have added this to the discussion; please see Lines 358 to 360.
 
 2) Based on current data, the generation of METTL3a and METTL3b are separated. Are there any factors that affect the cleavage ratio between METTL3a and METTL3b?
 
 We thank the reviewer for this excellent question. In this study, we show that both METTL3a and METTL3b are produced through proteasomal cleavage, and that both are positively regulated by the mTOR pathway. On the other hand, we did observe differential cleavage ratios between METTL3a and METTL3b across different cell lines. For example, the METTL3a/METTL3b ratio was greater than 1 in MDA-MB-231 cells (see Figure 7C), less than 1 in T47D and 293T cell lines (see Figure 7A and 7B), and equal to 1 in MEF cells (see Figure 7O). Based on these results, we speculate that there may be some factors that control the cleavage ratio between METTL3a and METTL3b, which warrants further investigation. We have added this to the discussion; please see Lines 374 to 379.
 
 3) In Figure 2G, the author shows the result that incubation of the Δ198+Δ238 METTL3 protein with T47D cell lysates cannot produce the METTL3a and METTL3b variants. The author may also show the results that Δ198 METTL3 protein or Δ238 METTL3 protein incubates with T47D cell lysates, respectively.
 
 Following the reviewer’s suggestion, we performed in vitro cleavage assays by incubating METTL3-Δ238 or METTL3-Δ198 with T47D cell lysates, and have incorporated this result in the revised manuscript. Please see our new Supplementary Figure 3A.
 
 4) As well as many results published in previous studies, the in vitro methylation assay shows that WT METTL3 is capable of methylating RNA probe (figure 2H). The main point of this study is that METTL3a is required for the METTL3-METTL14 assembly. However, the absence of METTL3a in the in vitro system did not inhibit METTL3-METTL14 methylation activity. Moreover, the presence of METTL3a even resulted in a weak m6A level.
 
 The main point of this study is that METTL3a is required for the METTL3-WTAP interaction but dispensable for the METTL3-METTL14 assembly (see Figures 4A-4B). In the in vitro methylation assay, METTL3 and METTL14 are capable of methylating the RNA probe in the absence of WTAP. Under this condition, we found that METTL3 WT and its variants (METTL3-Δ238, METTL3-Δ198, METTL3b and METTL3a), but not the catalytically dead mutant METTL3 APPA, showed methylation activity in vitro.
 
 5) In Figure 4A, the author suggests that WTAP cannot be immunoprecipitated with METTL3a and 3b because WTAP interacted with the N-terminal of METTL3. If this assay is performed in WT cells, the endogenous full-length METTL3 may help to form the complex. In this case, WTAP is supposed to be co-immunoprecipitated.
 
 We thank the reviewer for pointing this out. METTL3 interacts with WTAP through its N-terminal region (1-33 aa) (1). Consistently, we find that the two cleaved forms, METTL3a and METTL3b, which lack the N-terminal region, are not able to bind WTAP. In Figure 4A, we overexpressed METTL3 WT as well as its variants METTL3-Δ238, METTL3-Δ198, METTL3b and METTL3a in WT cells, and compared the ability of these overexpressed variants to bind WTAP or METTL14. We acknowledge that exogenous METTL3a and METTL3b interact with endogenous full-length METTL3, and that endogenous full-length METTL3 may help them form a complex with WTAP. However, the expression levels of exogenous METTL3a and METTL3b are much higher than that of endogenous full-length METTL3 (see Figures 3A and 3C). In this case, METTL3a or METTL3b predominantly interacts with itself, METTL3, METTL14 or other potential interacting proteins through its C-terminal region, which may greatly dilute the interaction between WTAP and endogenous full-length METTL3. Moreover, the comparison in Figure 4A is among overexpressed METTL3 variants; weak indirect interactions mediated by the much less abundant endogenous protein are probably not comparable to direct interactions between overexpressed METTL3 variants and WTAP.
 
 Reference:
 
 1) Schöller, E., Weichmann, F., Treiber, T., Ringle, S., Treiber, N., Flatley, A., Feederle, R., Bruckmann, A., and Meister, G. (2018). Interactions, localization, and phosphorylation of the m6A generating METTL3–METTL14–WTAP complex. RNA 24, 499-512.
 
 Reviewer #1 (Recommendations For The Authors):
 
 Major points:
 
 1) It is widely known that METTL3 and METTL14 form a stable heterodimer with the stoichiometric ratio of 1:1 (Wang X et al. Nature 534, 575-578 (2016), Su S et al. Cell Res 32(11), 982-994 (2022), Yan X et al. Cell Res 32(12), 1124-1127 (2022)), the numbers of METTL3 and METTL14 in the model of Fig 7P are not equivalent and need to be modified.
 
 We thank the reviewer for this good suggestion. We have modified the model in Fig. 7P.
 
 2) The in vitro methylation activity was detected by the m6A antibody, which has limited linear range. The MTase-Glo™ Methyltransferase Assay is a SAM-dependent enzyme assay with wide applications (Please refer to the references below).
 
 Could this assay be performed by authors?
 
 Wilkinson AW et al. Nature 565(7739), 372-376 (2019).
 
 Yu D et al. Nucleic Acids Res 49(20),11629-11642 (2021).
 
 Yan X et al. Cell Res 32(12), 1124-1127 (2022).
 
 Chen J et al. Nat Commun 13(1), 3257 (2022).
 
 We thank the reviewer for this good suggestion. We performed the in vitro methylation assay using the MTase-Glo kit, and the data are consistent with the dot blot results. Please see the new Figure 2H-J.
 
 3) When expressed alone in mammalian cell lines, METTL14 is unstable and is easily contaminated with endogenous METTL3 during purification (Yang W et al. Nat Cell Biol 16(2), p.191-8 (2014), Fig 1e). In Fig 2I, co-expressing METTL3 and METTL14 may be a good choice.
 
 We thank the reviewer for this good suggestion. In fact, we did co-express METTL3 and METTL14 for the in vitro methylation assay in Fig 2I (new Figure 2J in the revised version): METTL3-Flag or its Flag-tagged mutants and METTL14-Flag were co-transfected into 293T cells and co-purified from the cell lysates using Flag M2 magnetic beads. We have added these details to the corresponding method section; please see Lines 574-585.
 
 Other minor points:
 
 1) In Fig 5D, the protein domain information of METTL3 and relevant references need to be added (Su S et al. Cell Res 32(11), 982-994 (2022), Fig 6g; Yan X et al. Cell Res 32(12), 1124-1127 (2022), Fig 1a).
 
 We have added these references in the revised manuscript.
 
 2) In Fig 5, would METTL3b contribute to the METTL3-METTL3 interaction?
 
 Our data showed that METTL3a, but not METTL3b, is responsible for the METTL3-WTAP interaction, breast cancer cell proliferation and the m6A modification. We then investigated how METTL3a regulates the METTL3-WTAP interaction, and found that METTL3a is essential for the METTL3-METTL3 interaction, which is a prerequisite step for WTAP recruitment in the MTC complex. We therefore speculated that METTL3b is not required for the METTL3-METTL3 interaction. Indeed, through Co-IP assays, we found that METTL3b has no effect on the METTL3-METTL3 interaction (new Supplementary Figure 4D), which is consistent with our data above showing that METTL3b is dispensable for the METTL3-WTAP interaction. We have added this comment on Page 6, Lines 226 to 228.
 
 3) In Fig 3F, the color in the legend and figure is inconsistent.
 
 We have corrected the inconsistent color in the revised manuscript.
 
 Reviewer #2 (Recommendations For The Authors):
 
 1) In Figure 5D, the construction details of METTL3-HA and Flag should have been included in the method section. Are these tag sequences in the N-terminal of METTL3 protein?
 
 These tags are all in the C-terminal of METTL3. We have added the construction details of these plasmids in the method section. Please see Line 434.
 
 2) In Figure 7A, the labels of the inhibitors are overlapped with the figures.
 
 We have corrected the labels of the inhibitors in Figure 7A in the revised manuscript.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.17.528944v3
www.biorxiv.org www.biorxiv.org

New submission 20/07/2023, 09:01:39

1
1. Public_Reviews 20 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers and editors for their thoughtful evaluation of our preprint. We felt that the reviews were fair and that addressing them will improve the rigor and clarity of our presentation. We are working to address all of the comments, with intent to submit a revised manuscript in the near future.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.03.539131v2
www.medrxiv.org www.medrxiv.org

New submission 19/07/2023, 11:56:09

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This cross-sectional study examined the results of a survey about cancer treatment disruption during June-August 2020 in 82 counties located in Missouri and Illinois in the U.S. The main outcome was disruption in cancer care. Authors reported that higher education, being a female, experiencing more discrimination in healthcare settings, and having scheduled a telehealth appointment were associated with higher odds of care disruption. Lack of a research focus, lack of following any conceptual framework, the cross-sectional nature of the study, and the small sample size were the noted shortcomings of the manuscript.
  
  We thank Reviewer 1 for their comments. We agree that it is important to understand COVID-related care disruptions using causal methods. However, this manuscript aimed to examine the local impact of COVID care disruptions. We focused on the Siteman Cancer Center’s (SCC) catchment area because the co-author team includes the SCC’s Associate Director of the Community Outreach and Engagement (COE) program, the SCC Associate Director for Diversity, Equity, and Inclusion, and multiple members of the SCC COE leadership team. Thus, we are uniquely positioned to mobilize and identify outreach opportunities and/or programs that address any gaps we discover. Moreover, this focus on our catchment area and the motivation for this survey align with the National Cancer Institute’s priorities of population health assessments to characterize cancer-relevant knowledge, attitudes, beliefs, and behaviors across cancer center catchment areas. While this is a cross-sectional study, this snapshot of care disruption will be helpful in planning local outreach strategies. Lastly, our catchment area is challenged with multiple cancer disparities patterned by social identities. Therefore, our analysis was guided by the theory that social identities related to race, ethnicity, class, and gender shape access to healthcare and disease processes and are the fundamental drivers of health. Thus, we included variables that impact health and are patterned by these social factors.
  
  Reviewer #2 (Public Review):
  
  Dr. Kia Davis and colleagues present a thoughtful analysis of disruptions to cancer care during COVID-19 in the article, "Understanding disruptions in cancer care to reduce increased cancer burden: a cross-sectional study." The article is based on an online survey of 680 residents in the Siteman Cancer Center catchment area in Summer 2020. The authors aim to characterize demographic differences in cancer care disruptions. Information about the causes and distribution of care disruption can help reduce the impacts of COVID-19 and guide the recovery of programs and services. The article provides a clear and detailed assessment of factors associated with care disruption and return to care during the first six months of the pandemic.
  
  A strength of the study is the focus on the catchment area of the cancer center during a period of dramatic change. The results would provide timely and actionable data to address emerging barriers to care and associated social or contextual factors. This information helps the Community Outreach and Engagement efforts to be responsive to community priorities despite rapidly evolving circumstances.
  
  The analysis would benefit from greater detail in three areas. First, it would be helpful to have more information about how the outcome measures were originally developed or tested. Second, for the regression analysis, it would be helpful to show the demographic characteristics of the two strata to better understand the sample composition. Third, the authors should demonstrate that the data do not violate the assumptions for conducting logistic regression to improve confidence in the findings.
  
  COVID-19 affected all aspects of the cancer continuum. The study reports factors associated with postponing or canceling cancer-related appointments during the pandemic. It will be of great interest to researchers and practitioners in cancer prevention and control.
  
  We thank Reviewer 2 for their thoughtful critique of our work. Their suggestions have strengthened our manuscript. Since our article was submitted, the questionnaire from which we derived our outcome measure has been published. The questions were drawn from validated measures assessing the impact of pandemics such as H1N1, and major life disruptions such as natural disasters. This language was updated in the manuscript, as were the references. Moreover, we added a supplemental Table 2 to show the demographic characteristics by race strata. Finally, we tested and can confirm that the analysis does not violate the assumptions of logistic regression. We believe that our results will aid in the understanding of how COVID impacted cancer care in our catchment area so that we can better mobilize resources. While we understand this is a cross-sectional study with the potential for unmeasured confounding, we believe this snapshot of cancer care during the pandemic will also be of interest to researchers, clinicians, and other practitioners in cancer prevention and control in locations like ours.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2022.12.26.22283886v1
www.biorxiv.org www.biorxiv.org

New submission 19/07/2023, 10:05:22

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #3 (Public Review):
  
  In this manuscript, Castano et al generate and test a small molecule inhibitor of CDKL5, an X-linked kinase whose loss-of-function is the cause of a severe neurodevelopmental disorder. Since the current knowledge of CDKL5 functions mainly relies on genetic models it is still unclear which effects are caused directly by CDKL5 loss and which can be ascribed to indirect effects. A specific inhibitor would therefore be an important tool for the field.
  
  Castano and colleagues therefore tested a panel of twenty kinase inhibitors for their capacity to block phosphorylation of EB2, a bona fide CDKL5 substrate, in rat neurons. Among the three that could inhibit EB2 phosphorylation at low concentrations, one was found to inhibit CDKL5 while not affecting GSK3 kinases, which share significant homology to CDKL5. Considering that genetic studies have previously linked CDKL5 to excitatory synaptic transmission, acute hippocampal slices were exploited to test the consequences of CDKL5 inhibition. While CDKL5 loss in the past was found to affect both AMPA- and NMDA-Rs, the small molecule-based inhibition affected only AMPA-R responses at the post-synaptic level. Since pharmacokinetic analyses showed that the inhibitor has a low capacity for brain penetration the molecule remains limited for testing the acute inhibition of CDKL5 in vitro and ex vivo. Such a tool represents an important aspect in the CDKL5 field and the findings suggesting a direct role of CDKL5 in regulating AMPA-R functions are interesting. However, the manuscript could be improved to render it more readable.
  
  Thank you for this positive feedback and we hope that our adjustments improve the readability.
  
  The description of the binding and orthogonal assays, which are the basis for the selection of the small molecule inhibitor, is not straightforward to understand for non-expert readers and could be improved.
  
  We have added additional text to the Methods and Results to better explain the assays.
  
  While the in vitro and ex vivo assays are well presented, it is not clear why the myelin basic protein is used as a substrate for CDKL5 in the in vitro kinase assays. Does this protein contain a CDKL5 consensus site?
  
  To execute the in vitro kinase assays, myelin basic protein (MBP; Active Motif, 31314) was employed as a substrate for recombinant CDKL5. MBP is used as a substrate for multiple kinases, both serine/threonine and tyrosine kinases, in in vitro kinase assays because it contains multiple phosphorylation sites. As such, we and others have used this protein as a substrate for evaluating kinase activity[2, 4]. MBP does not contain the CDKL5 consensus site RPXS/T* and could therefore be considered a less than ideal substrate for studying CDKL5 activity. However, MBP is still suitable for in vitro kinase assays, as it can be phosphorylated by CDKL5. In addition, CDKL5 is known to phosphorylate substrates that do not contain a consensus motif[3].
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.24.538049v1
www.biorxiv.org www.biorxiv.org

New submission 19/07/2023, 09:40:16

1
1. Public_Reviews 19 Jul 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 This study demonstrates that a hybrid measurement method increases 3-fold the resolution of mouse USV localization. This increased resolution makes it possible to revise previous occurrence frequency measures for female vocalizations and establishes the existence of vocal dominance in triadic interactions. The method is well described and its efficiency is carefully quantified. A limitation of the study is the absence of ground truth data, which may have been generated eventually with miniaturized loudspeakers in mouse puppets. However, a careful error estimation partially compensates for the absence of these likely challenging calibrations. In addition, the conclusions take into account this uncertainty. The gain in accuracy with respect to previous methods is clear and the impact of localisation accuracy on biological conclusions about vocalisation behavior is clearly exemplified. This study demonstrates the impact of the new method for understanding vocal interactions in the mouse model, which should be of tremendous interest for the growing community studying social interactions in mice.
 
 We have performed the requested additional ground truth estimate using a movable miniature speaker; for more details, see point 2 of Reviewer 2 and the new supplementary figure.
 
 Reviewer #2 (Public Review):
 
 Past systems for identifying and tracking rodent vocalizations have relied on triangulating positions using only a few high-quality ultrasonic microphones. There are also large arrays of less sensitive microphones, called acoustic cameras, that don't capture the detail of the sounds, but do more accurately locate the sound in 3D space. Therefore the key innovation here is that the authors combine these two technologies by primarily using the acoustic camera to accurately find the emitter of each vocalization, and matching it to the high-resolution audio and video recordings. They show that this strategy (HyVL) is more accurate than other methods for identifying vocalizing mice and also has greater spatial precision. They go on to use this setup to make some novel and interesting observations. The technology and the study are timely, important, and have the potential to be very useful. As machine learning approaches to behavior become more widespread in use, it is easy to imagine this being incorporated and lowering entry costs for more investigators to begin looking at rodent vocalizations. I have a few comments.
 
 1) What is the relationship of the current manuscript to this: https://www.biorxiv.org/content/10.1101/2021.10.22.464496v1 which has a number of very similar figures and presents a SLIM-only method that reportedly has lower precision than the current HyVL approach. Is this superseded by the submitted paper?
 
 The referred manuscript (now published in Scientific Reports) is indeed related to the current work: The currently presented system is based on the integration between SLIM (based on 4 high quality microphones) and Beamforming (based on the 64-channel microphone array). The accuracy of SLIM is generally lower than that of HyVL, but it makes essential contributions to the overall accuracy of HyVL through the integration of the complementary strengths of the two methods/microphone arrays (see Fig. 3A, L-shape of errors). To our knowledge, SLIM was the previously most accurate technique (based on 4 microphones, see comparison in the Discussion), but HyVL exceeds this by a substantial margin. Some figures appear similar mostly due to related code in the underlying analysis pipeline and visualization scripts (e.g. the half-disc densities). However, the set of dyadic and triadic recordings was collected specifically for the present study, and all top-level analyses were performed separately. The single mouse (C57Bl/6 WT) ground truth dataset is shared between the two studies, where in the SLIM paper only the USM4/SLIM part was evaluated (leading to a correspondingly lower, single animal accuracy).
 
 We felt that the level of detail above would probably impede the reading of the manuscript, and we have therefore added a subset of the above clarifications to the methods and the first time the other study is mentioned.
 
 2) Can the authors provide any data showing the accuracy of their system in localizing sounds emitted from speakers as a function of position and amplitude? I am imagining that it would be relatively easy to place multiple speakers around the arena as ground truth emitting devices to quantify the capabilities of the system.
 
 Ground truth data is critical for any meaningful comparison. First, we would like to highlight that we already provided ground truth data in the previous version of the manuscript: In Fig. 3C, we analyzed vocalization data from trials with (1) just a single mouse as well as (2) vocalizations at times when all mice were far apart in relation to the accuracy of HyVL (>100 mm, i.e. >25x the accuracy of HyVL), where the chances of erroneous assignment are negligible. We think that these tests are the most relevant, as they are conducted with the relevant sounds, at their actual intensity, spectral profile and emitter acoustics.
 
 In addition, we have now conducted a series of tests with sounds produced by a miniature speaker placed in 25 different locations to demonstrate the lower bound of accuracy achievable with the system. The tests indicate an accuracy of MAE < 1 mm under these ideal conditions, i.e. without the absorption of the mouse bodies, varying direction of emission of the mouse snout, varying intensity, varying spectral content, duration, etc. Exploring the dependence on all these parameters is interesting, but requires a detailed study of its own. The detailed experimental conditions and results are now provided in Supplementary Fig. 4, including a quantification of the dependence on amplitude.
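 The MAE figure quoted above can be illustrated with a minimal sketch. It assumes that MAE here means the mean Euclidean distance between the known speaker position and the estimated sound origin; the 5 x 5 grid, the spacing, and the estimated positions are hypothetical placeholders, not the measured data from Supplementary Fig. 4.

```python
import math


def mean_absolute_error(true_xy, est_xy):
    """Mean Euclidean distance (in mm) between true and estimated 2D positions.

    Assumption: "MAE" is taken to be the average localization error over
    all test positions; this is an illustrative definition.
    """
    if len(true_xy) != len(est_xy) or not true_xy:
        raise ValueError("need two non-empty position lists of equal length")
    errors = [math.dist(t, e) for t, e in zip(true_xy, est_xy)]
    return sum(errors) / len(errors)


# Hypothetical 5 x 5 grid of speaker positions on the platform (mm).
speaker_xy = [(x, y) for x in range(0, 250, 50) for y in range(0, 250, 50)]

# Hypothetical estimates: each one is offset by 0.3 mm in x and 0.4 mm in y,
# i.e. 0.5 mm away from the true position.
estimated_xy = [(x + 0.3, y + 0.4) for x, y in speaker_xy]

mae_mm = mean_absolute_error(speaker_xy, estimated_xy)
print(f"MAE over {len(speaker_xy)} positions: {mae_mm:.2f} mm")
```

 With real data, `speaker_xy` would hold the measured speaker positions and `estimated_xy` the localization output for each playback.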
 
 3) How is the system's performance affected by overlapping vocalizations? It might be useful to compare the accuracy of caller identification for periods where only one animal is calling at a time vs. periods where multiple animals are simultaneously calling.
 
 This is an excellent question. Our current code for detecting vocalizations cannot automatically determine if one or multiple vocalizations are concurrently present. We have therefore manually checked all vocalizations for overlapping instances, including those in triadic recordings with two males, where this would be expected to occur most frequently.
 
 We considered vocalizations to be overlapping if the overlapping constituent time-frequency traces did not form a harmonic stack. Overall, overlaps were surprisingly rare. We did find a couple of cases (<0.1%) where our detection algorithm produced a longer vocalization interval that contained multiple, differently shaped vocalization traces that, when re-analyzed in shortened time-frequency bins with beamforming, belonged to two different males. Note here that beamforming is separately performed from the onset to the end of each vocalization, so the cumulative heatmap can change depending on these onset and end times, which are normally determined by our detection algorithm.
 
 However, although the identity of the assigned vocalizer could shift in these very rare cases depending on which time bin was re-analyzed, the system’s localization performance remained in principle unaffected: as mentioned above, shorter time bins on non-overlapping parts correctly show the origin of the vocalizations in this case. A solution to this issue could therefore be a USV detection algorithm that detects the overlap based on the spectral shapes and parses the vocalizations apart. During beamforming, each vocalization can then be localized separately by restricting the beamforming to the corresponding time and frequency range. Further, the analysis could be refined so that multiple salient peaks can be detected in the soundfield estimate. This would, however, substantially change the analysis approach, i.e. rather than a single estimate per USV, a sequence of soundfield estimates would have to be computed and later fused again. Since such a procedure uses less data per single estimate, it also increases the possibility of false positives, which, in the current situation with very few overlaps in time, would likely reduce the overall accuracy of the system. We therefore decided not to modify the algorithm in this direction, but we agree that ideally a joint approach, combining separation on the spectrogram and soundfield level, should be pursued. For the present data, if a time window was analyzed such that the intensity map of the sound field contains multiple hotspots of an approximately equal magnitude, the USV would likely remain unassigned, because the within-soundfield uncertainty would be higher than for a single peak, and this would reduce the MPI. However, given the rarity of these cases in our dataset, we do not think that their exclusion would change the results appreciably. This information was added as a paragraph to the Discussion.
 
 It is worth noting that HyVL is very robust: There were a number of cases (<5%) where environmental dampening in combination with harmonic stacking produced interesting time-frequency traces in some of the USM4 microphones, but our system did not have any issue spatially localizing what appears to be a smeared vocalization trace. We provide a few examples of this kind in a short video (see Rebuttal Video 2 and the legend at the bottom of this document), where the overlap is also reflected in the intensity map of the sound field, overlaid onto the platform.
 
 4) Can the authors comment on how sound shadows cast by animals standing between the caller and a USM4 affect either the accuracy of identification or the fidelity of the vocal recording?
 
 An important point to raise. Sound scattering and dampening caused by the conspecifics of the vocalizing animal can impede the accuracy of any sound localization system, but can unfortunately not be avoided in a social setting. To address this issue, we raised all USM4 microphones by ~12 cm above the interaction platform to minimize the instances of sound blocked by the mice. Further, the Cam64 device should largely be unaffected by sound shadows as it is centrally located above the platform. We have added a modified version of the above comment to the discussion under the heading "Current limitations and future improvements of the presented system".
 
 5) I'm a bit confused about how the algorithm uses the information from the video camera. Reading through the methods, it seems like they primarily calculate competing location estimates by the two types of microphone data and then make sure that a mouse is in close proximity to one location, discarding the call if there isn't. Why did the authors choose this procedure rather than use the tracked position of the snouts as constrained candidate locations and use the microphone data to arbitrate between them? Do they think that their tracking data are not reliable or accurate enough?
 
 Thanks for this important suggestion, which we have actually grappled with a lot during the analysis. First of all, the visual tracking data, in particular the manual data, is in our opinion (based on human visual identification) near perfect (within the limits of the video resolution, pixel resolution = 0.8 mm), i.e. on the order of 1-2 mm, and is therefore not the source of any unattributable vocalizations. If we understand the reviewer correctly, then we indeed perform the attribution as they indicate, based on the tracked snouts of all mice, specifically by measuring the MPIs of both acoustic location estimates for all mice and then choosing the most reliable one. Specifically, the attributions can be grouped into 3 cases: (i) estimated origin close to one snout, and snouts rather far apart, (ii) estimated origin close to one snout and snouts close, and (iii) estimated origin not close to either snout. (i) is easy to address, (ii) is appropriately handled by the mouse probability index, but (iii) is tricky. Since the vocalization has to come from one of the mice, this already indicates that the localization is not working well in this case. Therefore we found it prudent (similar to Neunuebel et al. 2015) to not assign in these cases. Interestingly, the MPI is not useful in these cases: due to the exponential dependence of the normal density on distance, a case with a distance of 50 mm to one snout and 60 mm to another snout, for example, could lead to an MPI close to 1, which is likely not trustworthy. We have described this in the Methods as follows:
 
 "This distance threshold mainly serves to compensate for a deficiency of the MPI: if all mice are far from the estimate, all P_k are extremely small, however, the MPI_k will often exceed 0.95." Due to the inherent limit for localizing very quiet, short USVs by any system, we think this kind of selection (introduced originally by Neunuebel et al. 2015) is a valuable and necessary step in the processing to avoid confusions (which are of course already substantially reduced through HyVL here).
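 The deficiency described above can be reproduced with a small sketch. It assumes that each mouse's probability is a normal density evaluated at the distance between its snout and the acoustic estimate, and that the MPI is this density normalized across mice; this form is inferred from the description here and may differ from the exact definition in the Methods. The width `sigma_mm` and the cutoff `max_dist_mm` are hypothetical values chosen for illustration; only the 0.95 criterion is taken from the text.

```python
import math


def mouse_probability_index(distances_mm, sigma_mm):
    """Normalized probability per mouse (assumed form of the MPI).

    Each mouse k gets an unnormalized log-density -0.5 * (d_k / sigma)^2.
    The normalization constant of the normal density is the same for all
    mice and cancels, so it is omitted. The maximum log-density is
    subtracted before exponentiating to avoid floating-point underflow
    when all mice are far from the estimate.
    """
    log_p = [-0.5 * (d / sigma_mm) ** 2 for d in distances_mm]
    shift = max(log_p)
    weights = [math.exp(lp - shift) for lp in log_p]
    total = sum(weights)
    return [w / total for w in weights]


def assign_vocalizer(distances_mm, sigma_mm, max_dist_mm, mpi_min=0.95):
    """Return the index of the assigned mouse, or None if unassigned.

    A USV is only assigned if the best mouse has MPI >= mpi_min AND its
    snout lies within max_dist_mm (hypothetical cutoff) of the estimate.
    """
    mpi = mouse_probability_index(distances_mm, sigma_mm)
    best = max(range(len(mpi)), key=lambda k: mpi[k])
    if mpi[best] < mpi_min or distances_mm[best] > max_dist_mm:
        return None
    return best


SIGMA_MM = 4.0      # hypothetical localization uncertainty
MAX_DIST_MM = 25.0  # hypothetical distance threshold

# Case (iii): both snouts far from the estimate. The MPI is still close
# to 1 for the nearer mouse, so only the distance cutoff prevents assignment.
far_mpi = mouse_probability_index([50.0, 60.0], SIGMA_MM)
print("MPI with snouts at 50 and 60 mm:", far_mpi)
print("assigned:", assign_vocalizer([50.0, 60.0], SIGMA_MM, MAX_DIST_MM))

# Case (i): estimate close to one snout, snouts far apart.
print("assigned:", assign_vocalizer([3.0, 60.0], SIGMA_MM, MAX_DIST_MM))

# Case (ii): snouts close together, so the MPI itself stays below 0.95.
print("assigned:", assign_vocalizer([3.0, 4.0], SIGMA_MM, MAX_DIST_MM))
```

 Because the normalization only compares the mice with each other, the MPI carries no information about whether any mouse is plausibly near the estimate, which is why a separate distance criterion is needed.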
 
 6) I guess the authors have code that we can run, but I couldn't access it. The manuscript describes the algorithms and equations that are used to calculate the location, but this doesn't really give me a feel for how it works. If you want to have the broadest impact possible, I think you would do well to make the code user-friendly (maybe it is, I don't know). In pursuit of that goal, I would suggest that the authors devote some of the paper to a guided example of how to use it.
 
 While the code was made available to the reviewers via the link at the beginning of the manuscript (p2, before abstract), we completely agree that this method of distribution is not very accessible. We have therefore created a publicly available GitHub repository (https://github.com/benglitz/HyVL) which hosts the code and details its use on the basis of a sample data set (which is available to the reviewers in the repository link, and later to the public under https://doi.org/10.34973/7kgc-ta72). While we do provide a sample video and analysis workflow there, our data analysis pipeline is quite integrated and other labs will likely use different pipelines. We have therefore tried to make the core functions independent of our pipeline and thus easy to integrate by others into their analysis pipelines.
 
 Reviewer #3 (Public Review):
 
 The present manuscript describes a new method to identify the emitter of ultrasonic vocalisations during social interactions between 2 or 3 mice. The method combines two technologies (an "acoustic camera" and a set of four microphones) and succeeds in increasing the spatial precision and the attribution of USV emission to one of the mice. The manuscript describes the characteristics and advantages of each method and the advantages of using both to optimize the identification of USV emitter. The authors used the method to confirm that females are also vocalising during male-female interactions and that females emit USV mostly during nose-nose contact while this was not the case for males. Interestingly, the authors identified that the vocal behaviour of two competing males was strongly asymmetric when facing a female. This was not the case for two females facing one male.
 
 The method is really promising since the identification of the emitter of USVs during mouse social interactions is a necessary step to speed up our understanding of this communication modality. The increase in spatial precision and in the proportion of attributed vocalisations is non-negligible and will be of great utility in the future.
 
 We would like to thank the reviewer for this positive perspective on the future utility of our system.
 
 Generally, the statistical analyses should be adjusted. Indeed, the statistical analyses do not consider the fact that the same individuals were recorded several times (if we understood well the methods). Each point was considered independent (in non-parametric Wilcoxon tests), while this is not the case given the repetitions with the same individuals (the number of repeated encounters per individual should be given in the methods section, by the way). We strongly recommend revising the statistical analyses of the results in Figures 4 and 5. In addition, it could be interesting to check whether the vocal behaviour is stable within each individual (i.e., a male that is vocalising frequently in one situation vocalises always frequently in other situations).
 
 We generally agree with this suggestion: In order to properly conduct the analysis for individuals as you suggest, a balanced dataset should be used. We had initially collected such a balanced dataset, which was previously not detailed in the manuscript, as the focus was on USV localization/attribution and hence only the recordings containing USVs were analyzed (detailed now in the beginning of Results and Methods). However, overall, the probability of a recording containing vocalizations at all is low: in our balanced set only 23/112 recordings contained vocalizations. We therefore had collected additional recordings with the best vocalizers which created the previously analyzed set of 83 recordings containing USVs recorded with all microphones. This dataset is therefore dominated by recordings from mice that are active vocalizers. While this does not raise any issue for the estimation of the accuracy of the method (Figure 3) or the female vocalizations (Figure 4, because recordings were always randomized across female mice), it precludes an encompassing analysis of individual differences in Figure 5, i.e. the dyadic-triadic comparison. In the new Figure 5, we address the reviewer's question for the dyadic recordings, finding that the current set of recordings does not provide sufficient evidence that individual male mice had significantly different vocalization rates. We would, however, like to point out that this is likely a consequence of the n=4 recordings that are compared here. For the female mice, we also did not find differences in vocalization rates, which is based on n=14 recordings and thus a more reliable result (p=0.16, 1-way ANOVA with factor individual).
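
 The per-individual comparison mentioned above (a 1-way ANOVA with factor individual) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' analysis: the vocalization rates are synthetic, the function names are hypothetical, and the p-value is obtained with a permutation test as a stand-in, because the Python standard library has no F distribution. With SciPy available, `scipy.stats.f_oneway` would give the classical ANOVA p-value directly.

```python
import random
from statistics import mean

def f_statistic(groups):
    """One-way ANOVA F statistic: between-group over within-group mean squares."""
    all_values = [v for g in groups for v in g]
    grand = mean(all_values)
    k, n = len(groups), len(all_values)
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

def permutation_p_value(groups, n_perm=2000, seed=0):
    """Share of label shufflings whose F is at least as large as the observed F."""
    rng = random.Random(seed)
    observed = f_statistic(groups)
    pooled = [v for g in groups for v in g]
    sizes = [len(g) for g in groups]
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        shuffled, start = [], 0
        for size in sizes:
            shuffled.append(pooled[start:start + size])
            start += size
        if f_statistic(shuffled) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Synthetic vocalization rates (USVs per minute), one list per individual.
# These numbers are made up for illustration only.
rates_by_individual = [
    [12.0, 15.5, 9.0, 14.0],
    [11.0, 16.0, 13.5],
    [10.5, 12.5, 15.0, 13.0],
]
f_value = f_statistic(rates_by_individual)
p_value = permutation_p_value(rates_by_individual)
```

 Grouping recordings by individual, as here, is also the step that addresses the non-independence raised by the reviewer: the individual becomes a factor in the test instead of each recording being treated as an independent sample.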
 
 For the triadic recordings, however, we unfortunately do not have complete information at the level of individual experiments: the video stream was accidentally started only after all mice had been placed on the platform, and same-sex animals are not visually separable (the female mice can be distinguished from the males by a slightly shaved region on their heads). We therefore cannot fully assess this question in the triadic recordings based on the available data. When the triadic recordings are included and a single male vocalizer is assumed (combining all male USVs; see below for why the males could not be assigned in the triadic condition), the comparison between individual males can be approximately performed with n=8 recordings, and the dependence on individual then becomes borderline significant (p=0.028, 2-way ANOVA with factors individual and condition).
 
 For the comparison of vocalization rates in the previous Figure 5 that the reviewer was referring to, we cannot perform a rigorous analysis on the individual level, due to the lack of balance. While we thus agree that differences between individual mice can contribute to the differences observed, we do not think that this would change the conclusion that one of the mice dominates the vocal emissions. If the reviewers agree, we would thus leave Figures 6 (old Fig. 5) and new Figure 7 (behavioral confirmation of dominant/subordinate division) as part of the manuscript, with a clear cautioning about the possible contribution of individual differences to the observed differences. If the reviewers find it inappropriate to leave the results based on the unbalanced dataset in, all results after figure 5 could also be excluded (although we would find this unfortunate, given the additional time and effort we have invested in these).
 
 It is not easy to understand the rationale behind testing animals in pairs and in triads from the beginning of the manuscript. The authors should better introduce this aspect in the manuscript, especially given the fact that biological results deal with this aspect in Figure 5. The authors might strengthen the parts of the biological results extracted from their new method.
 
 Thank you for pointing out the need for clarification regarding the rationale behind testing animals in pairs and in triads. Because courtship interactions are particularly vocal and social, they are of interest to many fields, e.g. neurodevelopmental disorders (refs. 3, 4). Due to the natural competitiveness between mice during courtship interactions, high accuracy is particularly beneficial in this regard because it allows disentangling USVs at close distances. We adapted the introduction to better reflect this reasoning and included an extra paragraph in the introduction and also where the biological results from old Fig. 5 / new Fig. 6 are summarized.
 
 More specifically, the fact that one male takes over the vocal behaviour within a triad is of high interest. Nevertheless, some behavioural data would be needed to strengthen these findings.
 
 We agree that this is an interesting finding and also agree that some additional behavioral analysis is useful to complement it. In order to arrive at this analysis, we performed all-frame, 3-animal tracking on the 14 triadic recordings with two males. This required switching to skeleton tracking with SLEAP (ref. 5) in addition to manual post-processing to ensure that no identity switches occur. In each recording the dominant male was then defined as the one that emitted more vocalizations, and then the vocalization-independent spatial interaction histogram was computed, similar to the ones in Fig. 4, but now separating between the dominant and the subordinate males (see new Figure 7). The results are consistent with the most typical location of vocalization of the male, in proximity to the female abdomen: The dominant male's spatial interaction histogram (Fig. 7A) was more clearly peaked in the location of the female abdomen very close to the male's snout, in comparison with the subordinate male's histogram (Fig. 7B), which shows up very clearly in the difference between the normalized histograms (Fig. 7C). Significance analysis was performed using 100x bootstrapping on the relative spatial positions to estimate p=0.99 confidence bounds around the histograms of the dominant and subordinate males, respectively. Significance at a level of p<0.01 highlights multiple relative spatial positions (Fig. 7D), including the one proximal to the snout, which has the largest absolute difference (Fig. 7C). Note that these analyses were conducted on the basis of the non-balanced dataset, which contained enough vocalizations to assess the dominant male based on the vocalization rates, and thus individual traits of certain animals remain a possible confound.
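
 The bootstrap comparison of the two spatial histograms can be sketched as follows. This is a minimal illustration, not the authors' code: the bin layout, the function names, the synthetic positions, and the rule that a bin counts as significant when the two bootstrap intervals do not overlap are all assumptions. Only the 100 resamples and the 0.99 bounds are taken from the description above.

```python
import math
import random

def normalized_histogram(positions, n_bins, lo, hi):
    """2D histogram of (x, y) positions on [lo, hi) x [lo, hi), scaled to sum to 1."""
    counts = [[0.0] * n_bins for _ in range(n_bins)]
    width = (hi - lo) / n_bins
    kept = 0
    for x, y in positions:
        if lo <= x < hi and lo <= y < hi:
            ix = min(int((x - lo) / width), n_bins - 1)
            iy = min(int((y - lo) / width), n_bins - 1)
            counts[iy][ix] += 1.0
            kept += 1
    return [[c / kept if kept else 0.0 for c in row] for row in counts]

def bootstrap_bounds(positions, n_bins, lo, hi, n_boot=100, level=0.99, seed=0):
    """Per-bin percentile bounds obtained by resampling positions with replacement."""
    rng = random.Random(seed)
    samples = [
        normalized_histogram([rng.choice(positions) for _ in positions], n_bins, lo, hi)
        for _ in range(n_boot)
    ]
    tail = (1.0 - level) / 2.0
    i_lo = math.floor(tail * (n_boot - 1))
    i_hi = math.ceil((1.0 - tail) * (n_boot - 1))
    lower = [[0.0] * n_bins for _ in range(n_bins)]
    upper = [[0.0] * n_bins for _ in range(n_bins)]
    for r in range(n_bins):
        for c in range(n_bins):
            values = sorted(s[r][c] for s in samples)
            lower[r][c], upper[r][c] = values[i_lo], values[i_hi]
    return lower, upper

def significant_bins(pos_a, pos_b, n_bins, lo, hi):
    """Mark bins whose bootstrap intervals for the two animals do not overlap."""
    lo_a, hi_a = bootstrap_bounds(pos_a, n_bins, lo, hi, seed=1)
    lo_b, hi_b = bootstrap_bounds(pos_b, n_bins, lo, hi, seed=2)
    return [[lo_a[r][c] > hi_b[r][c] or hi_a[r][c] < lo_b[r][c]
             for c in range(n_bins)] for r in range(n_bins)]

# Synthetic relative positions in arbitrary units (illustration only):
# the "dominant" animal clusters near (0.5, 0.5), the "subordinate" one
# is spread uniformly over the arena.
rng = random.Random(42)
dominant = [(rng.gauss(0.5, 0.2), rng.gauss(0.5, 0.2)) for _ in range(400)]
subordinate = [(rng.uniform(-2, 2), rng.uniform(-2, 2)) for _ in range(400)]
significant = significant_bins(dominant, subordinate, n_bins=4, lo=-2.0, hi=2.0)
```

 With only 100 resamples, the 0.99 bounds in this sketch coincide with the smallest and largest bootstrap values in each bin, so a larger number of resamples would be needed to resolve the tails more finely.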
 
 A small proportion of USVs was not assigned. The authors did not discuss the potential reason for this failure (Were the USVs too soft? Did they include specific acoustic characteristics that render them difficult to localise?). These points could be of interest when testing other mouse strains or other species.
 
 Good point, we agree that it is interesting to know the reasons for failure. As so often, there is not a single property that makes localization hard, but multiple factors contribute. In the SLIM paper, we already identified duration and intensity as important contributors (Fig. 3E/F), and in the speaker test (see new Supplementary Fig. 4) we again demonstrated the influence of intensity. In addition, frequency bandwidth and acoustic occlusion are two other main contributors that each influence the availability of the information/signal-to-noise ratio at the microphones:
 
 Frequency bandwidth: In signals that are very narrowband, there are more opportunities for phase ambiguity, in particular for very high-frequency signals. These are avoided/reduced for more wideband signals.
 
 Acoustic occlusion: Ultrasonic sounds can be quite directional. If an animal vocalizes away from a microphone, which in addition puts its body in the path of the sound to that microphone, the intensity at the microphone can drop to a level where the signal from this microphone is no longer usable. This mostly affects the 4 microphones surrounding the platform, while the overhead Cam64 will likely not be affected by acoustic occlusion in the plane.
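
 The phase-ambiguity point about narrowband, high-frequency signals can be made concrete with a small calculation. For a pure tone, two inter-microphone delays that differ by a whole number of periods produce the same phase difference, so every such delay that is physically possible for the given microphone spacing is an equally valid candidate. The sketch below counts these candidates. The 1 cm spacing and the function name are illustrative assumptions and do not describe the geometry of the actual system.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value for air at about 20 degrees C

def candidate_delays(true_delay, freq_hz, spacing_m, c=SPEED_OF_SOUND):
    """Delays that give the same inter-microphone phase as true_delay for a pure tone.

    Only delays with magnitude up to spacing_m / c are physically possible.
    """
    period = 1.0 / freq_hz
    max_delay = spacing_m / c
    k_min = math.ceil((-max_delay - true_delay) / period)
    k_max = math.floor((max_delay - true_delay) / period)
    return [true_delay + k * period for k in range(k_min, k_max + 1)]

# Hypothetical 1 cm microphone spacing, source equidistant from both microphones.
high = candidate_delays(0.0, 80_000.0, 0.01)  # 80 kHz tone
low = candidate_delays(0.0, 20_000.0, 0.01)   # 20 kHz tone
n_high, n_low = len(high), len(low)
```

 In this example the 80 kHz tone leaves several candidate delays while the 20 kHz tone leaves one. A wideband signal helps because the spurious candidates fall at different delays for each frequency component, and only the true delay is consistent across all of them.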
 
 We have added a brief version of this explanation to the discussion under the heading: "Current limitations and future improvements of the presented system"
 
URL

biorxiv.org/content/10.1101/2023.01.18.524540v1
www.medrxiv.org www.medrxiv.org

New submission 16/07/2023, 14:12:27

1
1. Public_Reviews 19 Jul 2023
 
 in eLife
 
 Author Response
 
 Reviewer #2 (Public Review):
 
 This manuscript reports on an important study that aims to identify symptom trajectories for the early detection of pancreatic cancer. The study's findings are based on the analysis of two complementary data sources: structured data obtained from the Danish National Patient Registry and unstructured information extracted from the free-text sections of patient notes. The researchers successfully identified various symptoms and disease trajectories that are strongly associated with pancreatic cancer, with compelling evidence from both data sources. Additionally, the study provides a detailed comparison and contrast of the results obtained from each data source, adding valuable insights into the strengths and limitations of each method.
 
 Strengths:
 
 The work is well motivated by the urgent need for early detection of pancreatic cancer, which is often difficult due to the lack of effective (computational) methods. The manuscript is generally well-written and includes relevant studies, providing a comprehensive overview of the current state of the field.
 
 One of the unique contributions of this work is its use of both structured registry data and unstructured clinical notes to leverage complementary information. This approach enables a more nuanced and comprehensive understanding of the disease symptom trajectories, which is critical for improving early disease diagnosis and prognosis.
 
 The methodology employed in this study is sound and robust, and the authors have candidly discussed its limitations. The results are significant and highlight previously unknown insights into symptom disease trajectories, which have important implications for the management of pancreatic cancer.
 
 Overall, this is a well-designed and executed study that makes an important contribution to the field of cancer/informatics research, and it should be of great interest to both researchers and clinicians.
 
 Weaknesses:
 
 To complement the results in Figure 1, I'd also suggest that the authors compile a list of the most common (known) symptoms of pancreatic cancer as a reference. In other words, not only can you compare results found from the two sources but also compare them with existing knowledge. This is something you discussed partly in lines 245 but including this early as part of the results in Figure 1 would be more informative.
 
 We agree that this would be informative to include in the Venn diagram. Hence, we have created a list of the most established and well-known symptoms of pancreatic cancer (Supplementary table S1) and converted these to the comparable ICD-10 level that we also use for the text mining and registry counts in Fig. 1. We have included the Venn diagram as Supplementary Figure S1.
 
 In terms of the text mining evaluation results, providing information on recall errors would be beneficial to better understand the performance of the method. Additionally, line 144 mentions 53 words, but it is still not clear to me what these words refer to. Could you please clarify this point or provide more context?
 
 We have added sensitivity/recall measures on the text mining procedure and furthermore added two references in the Discussion of the Tagcorpus program which was used for text mining the clinical notes. These references also mention similar sensitivities for the studies. The 53 words are false positives and we have clarified why these have been captured as false positives by the Tagcorpus (negations).
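
 For readers less familiar with the metrics, sensitivity (recall) and precision relate to the error counts as sketched below. Only the 53 false positives come from the text above. The true-positive and false-negative counts are made-up placeholders, and the function names are hypothetical.

```python
def recall(true_pos, false_neg):
    """Sensitivity: share of real mentions that the text mining found."""
    return true_pos / (true_pos + false_neg)

def precision(true_pos, false_pos):
    """Share of extracted mentions that are real."""
    return true_pos / (true_pos + false_pos)

FALSE_POSITIVES = 53   # from the evaluation described above
TRUE_POSITIVES = 900   # placeholder, not a reported value
FALSE_NEGATIVES = 100  # placeholder, not a reported value

example_recall = recall(TRUE_POSITIVES, FALSE_NEGATIVES)
example_precision = precision(TRUE_POSITIVES, FALSE_POSITIVES)
```

 False positives caused by negations lower precision but leave recall unchanged, which is why the reviewer asked for recall errors to be reported separately.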
 
 The disparities between Figure 2A and 2B are noteworthy, from very different initial symptoms to the proportion of short median survival dates (<=90 days), with much more pronounced differences than those observed in Figure 1 comparing two data sources. The highlighted trajectories are almost completely different. Should this be expected? I was hoping to see at least some overlap between the two results.
 
 After updating the case population (via the cancer registry) and showing only symptom trajectories in this revised version, we can clearly see that the trajectories are more similar. This indicates that the methods pick up on similar pancreatic-cancer symptoms, but there are also differences that show how each data type can complement the other, such as the text-mined trajectories being able to pick up longer symptom trajectories prior to the cancer.
 
 All trajectories shown in Figure 2 include three symptoms. Is this by design? Could there be meaningful trajectories with different numbers of symptoms (e.g. 4 or more)?
 
 We agree and have added the significant length 4 trajectories (for the registry data) as supplementary figure S2. No trajectories with length 5 or higher were found in the registry-based analysis. No length 4 (or higher) trajectories were found for the text-mined patients (presumably due to the data set size).
 
 Considering those patients with both clinical notes and registry data, it may be beneficial to merge their symptoms to generate more informative trajectories.
 
 This could be interesting but is out of scope for this paper. Here we would like to stress the proof-of-concept that the two data types can complement each other. The next steps would be to generate these multimodal trajectories to for example test if they are predictive of pancreatic cancer. Nonetheless, we acknowledge the significance of this perspective and have incorporated it into the Discussion section of the manuscript.
 
 Given that results from two sources are being compared in Figures 1 and 2, have you considered calculating the top 20 most significant symptoms from the registry data as well?
 
 We have done this and added them to Supplementary figure S3.
 
 While there is a discussion related to cardiovascular diseases, I noticed no mention of cataracts or gonarthrosis, which were found to be prevalent among patients with short survival in Figure 2.
 
 Since we now only include symptom trajectories in the Results, we have chosen not to include these results in the Discussion for the final version of the manuscript. However, the diagnosis-wide trajectories are included in Supplementary figure S2. Cataract and gonarthrosis were still found to be significant in the results even though they are not shown in the Supplementary figure due to its visualization threshold of min. 400 patients per trajectory.
 
 Ultimately, the goal of this research is to improve the early detection and prognosis of pancreatic cancer, thus it is important to discuss how the findings of this work could be applied in practice towards this goal (e.g. used by disease prediction algorithms?)
 
 We agree that this is very important and have added a small section on this in the Discussion. We have also cited a recent publication using deep learning algorithms to predict pancreatic cancer based solely on registry data (Placido et al. 2023).
 
URL

medrxiv.org/content/10.1101/2023.02.13.23285861v1
www.medrxiv.org www.medrxiv.org

New submission 16/07/2023, 14:08:50

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  In general, in the discussion, I miss two of the main points that led to suspend screening programs in most countries during the pandemic:
  
  1) protecting women from the risk of infection linked to attending a clinic during pandemic when health facilities were mostly attended by symptomatic people seeking care for Covid-19;
  
  We agree. We have added this to the background and Discussion section (page 3, lines 76-78 & page 9, lines 296-299).
  
  2) the shortage of health professionals, because they were mostly involved in COVID-related activities: lack of radiologists (reassigned to the emergency department to ensure diagnoses of pneumonia), lack of anesthesiologists (due to the expansion of intensive care), thus risking not having timely surgical treatment; lack of screening organization personnel for invitations and phone calls (working on contact tracing).
  
  We agree. We have added this to the background and Discussion section (page 3, lines 76-78 & page 9, lines 296-299).
  
  Without the rationale for suspending screening, it is not clear to the reader how the Danish program dealt with these issues and was able to keep the program open.
  
  We have elaborated on this in the Discussion section (page 9, lines 296-299), arguing that Denmark may have partly reduced the issue of staff shortage due to, e.g., a lower burden of COVID-19, the use of laymen and medical students for testing and vaccinations, and a high vaccine coverage.
  
URL

medrxiv.org/content/10.1101/2022.09.26.22280381v1
www.biorxiv.org www.biorxiv.org

New submission 16/07/2023, 11:52:30

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Hoang, Tsutsumi and colleagues use 2-photon calcium imaging to study the activity of Purkinje cells during a Go/No-go task and relate this activity to their location in Aldolase-C bands. Tensor component analysis revealed that a substantial part of the calcium responses can be linked to four functional components. The manuscript addresses an important question with an elegant technical approach and careful analysis. There are a few points that I think could be addressed to further improve the quality of the manuscript.
  
  1) The authors should be careful not to overstate the goal and results. For instance, in the abstract it is stated that dynamical functional organization is necessary for dimension reduction. However, the statement that the 4 TCs together account for about half of the variance (line 220) indicates that dimensionality may not be reduced that much. I would suggest revising the first and last sentence of the abstract accordingly.
  
  Dynamic functional organization of TC1 and TC2 by synchronization is the major finding of this study and we believe that it is one of the most efficient mechanisms of dimension reduction, given the unique anatomy of the cerebellum. In the revised manuscript, we added a supplemental result showing that the dimensionality of TC1 and TC2 neurons decreased and increased, respectively, in accordance with bi-directional changes in their synchronization (Figure 3 – figure supplement 1DE). Dimension reduction was further confirmed by conventional PCA (Figure 6 – figure supplement 1). However, we agree that the statement that the cerebellum reduces dimensions by self-organization of components is speculative, and we revised the abstract accordingly.
  
  At the end of the introduction, the authors refer to "the first evidence supporting the two major theories of cerebellar function", but which two theories are referred to and how this manuscript supports them is not very obvious. Similarly, they state that "This study unveiled the secret of cerebellar functional architecture", which I would consider to be an unnecessary overstatement of the impact of the work described.
  
  In the revised Introduction, we explicitly stated that TC1 and TC2 are related to timing control and cognitive error learning, respectively, with some indirect causal evidence. We also revised the last paragraph of the Introduction to emphasize that this study provides the first evidence to support the view that distinct cerebellar components may serve divergent cerebellar functions in a single task. The statement "This study unveiled the secret of cerebellar functional architecture" was removed.
  
  In the title, the authors use the word modular. In the consensus paper on cerebellar modules (Apps et al., 2018) an attempt is made to unify the terms used to describe cerebellar anatomical structures. Here "module" is used for the longitudinal zone of interconnected PCs, CN neurons and olivary neurons. As the authors only studied PC activity (and indirectly the IO), I would suggest using band, stripe or subpopulation instead.
  
  Because we used TCA to identify functional components underlying the Go/No-go data, we changed the word “module” to “component” in the title.
  
  Finally, the term "CF firing" or "CF activity" is used when referring to the recorded signals. However, the authors measure postsynaptic calcium responses that are indeed likely driven by CF inputs, but could also be influenced by PF inputs. At the very least, because Purkinje cells and not climbing fibers are being imaged, "complex spike" should be used instead. It would be more accurate still to use the more general "calcium response" and make less of an assumption about the origin of the calcium response.
  
  In this study, CF-dependent dendritic Ca2+ signals in adjacent AldC compartments were recorded by the two-photon imaging. The HA_time algorithm (Hoang et al. 2020) was then applied to extract spike timings from the recorded signals. In the revised manuscript, we used the terms “calcium responses” and “complex spikes” when referring to the recorded Ca2+ signals and the estimated spikes, respectively.
  
  2) For some figure panels and statements in the manuscript error bars or confidence intervals and statistics are missing. This is the case for, for example, the changes in fraction correct, lick latency, fraction incorrect, etc. (Fig 1B, 2E-F, TC levels in 3, 4D-E and 5A-C). Including these is particularly relevant in Fig 4E as this is a key result, mentioned also in the abstract. Please indicate clearly if these plots are cumulative for all mice or per mouse and averaged. I advise the authors to statistically support the claim that the changes are significant and in opposite direction as this element of the study is referred to in the abstract and discussion (summary).
  
  We added the error bars / confidence intervals to the related figures. Most importantly, we added histograms of synchrony strength for TC1/TC2 neurons (Figure 4E) and conducted statistical tests to strengthen the claim of bi-directional changes in synchronization of TC1/TC2.
  
  3) Data presentation sometimes does not do the work justice. For example, the data in Figure 6 are very interesting, but hard to read because of the design of the figure. It is clear how the components are mostly confined to Aldolase-C domains, but within the domains the distribution is not clear. I would advise to also more clearly indicate what the locations of the colors within the bands refers to. The spatial distribution of the selected top 300 cells for each TC could be added.
  
  We added pie-chart plots for the fraction of TC1-4 neurons in each Ald-C zone and learning stage. We also indicated in the figure legend that the location of a single-color bar referred to the geographic distance of the corresponding neuron relative to Ald-C boundaries. We included spatial distribution of the selected neurons in Figure 4 – figure supplement 1D.
  
URL

biorxiv.org/content/10.1101/2022.12.05.518634v1
www.biorxiv.org www.biorxiv.org

New submission 16/07/2023, 11:47:57

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The authors investigate the mechanistic underpinning of paradoxical activation (PA) of RAF by small molecule kinase inhibitors using mathematical modeling. The main novelty of the study is the consideration of RAF conformational autoinhibition by its N-terminal regulatory domains as a new determinant of PA. This mechanism has not been explicitly considered in previous theoretical studies, which are based on two other mechanisms: drug-induced RAF oligomerization into active dimers (dimer potentiation DP) and negative cooperativity (NC) of inhibitor binding by a second monomer in the inhibitor-induced RAF kinase dimerization. An important discovery of this study is that conformational autoinhibition is a critical determinant of PA and that in some cases, it can contribute to PA in the absence of DP and NC. Another novelty is the consideration of RAF interaction with 14-3-3 proteins, as a determinant of PA. The 14-3-3 dimeric scaffolds play an important role in the regulation of both autoinhibited and active states of RAF and thus understanding how their interaction with RAF influences PA by RAF inhibitors is important. Using mathematical modeling the authors show that 14-3-3 binding does indeed enhance PA in response to a spectrum of RAF inhibitors.
  
  We thank Reviewer #1 for reviewing our manuscript, and we agree with this summary.
  
  Strengths
  
  The overall strength of this study is that it increases the mechanistic understanding of how PA of RAF originates in response to its inhibitors. Consideration of the effect that the inhibitors play in breaking the autoinhibited conformation has been overlooked by previous mathematical analyses of PA, and this study bridges this gap. By doing so, the authors discover that breaking that autoinhibited state is in fact the biggest contribution to PA by RAF inhibitors. In my opinion, this is the most impactful finding of this study, which additionally speaks to how important the autoinhibitory mechanisms are for constraining basal RAF signaling in cells. The presented analysis also shows that consideration of conformational autoinhibition can explain PA by all different types of RAF inhibitors (1, 1.5, and 2), which until now has been difficult to reconcile.
  
  Another important contribution of this study is the investigation of how the 14-3-3 scaffold proteins can further contribute to PA. This is exciting, especially in light of recent elegant structural studies that unveiled complex regulation of RAF by 14-3-3 (which are both important for RAF inhibition and stabilization of the active dimers). The authors dissect these opposing roles of 14-3-3 in their model and show that the autoinhibitory interaction with 14-3-3, but not the activating one, significantly increases the PA response. Their finding that an increase in the 14-3-3 levels amplifies PA is very interesting and somewhat provocative as it is unclear how much 14-3-3 levels in cells can oscillate. To this end, the authors show that elevated 14-3-3 levels are observed with increased time of RAF inhibitor treatment, which might point to a new mechanism of resistance to RAF inhibitors.
  
  We thank reviewer #1 for the enthusiastic review and for highlighting the value of bringing conformational autoinhibition into the study and understanding of paradoxical activation. We also appreciate the positive consideration of the 14-3-3 section of the manuscript and the helpful suggestions later in the review. In this revision, we have taken the offered option of removing all of the 14-3-3 theoretical and experimental work. We plan to expand the 14-3-3 work in our ongoing work, in accordance with the thoughtful input from reviewers #1, #2, and #3 on this topic. Thank you.
  
  Weaknesses
  
  The main weakness of the study is the limited experimental analysis conducted to test the predictions that arise from the mathematical models. While some of these predictions might be challenging to test, the one which is tested is not tested rigorously. The experiments focus on 14-3-3-based regulation and are conducted in cells by observing the effect of 14-3-3 overexpression on the inhibition of RAF signaling by its different kinase inhibitors. While the authors acknowledge that too, 14-3-3 overexpression will have a multifaceted effect on signaling as these scaffold proteins participate in the regulation of almost all signaling events. Thus, the proposed experiments are not sufficient to conclude that the observed effects are in fact a result of 14-3-3/RAF interaction.
  
  The authors consider conformational autoinhibition and 14-3-3 stabilization of autoinhibited RAF as two different mechanisms. While it is not a weakness, I am curious how accurate is the consideration of the autoinhibited state of RAF in the absence of 14-3-3. Is it known how the proportion of RAF in cells in its inactive state exists while not bound to 14-3-3?
  
  We thank Reviewer #1 for this input on how we can significantly improve the 14-3-3 section of the manuscript. We have removed the 14-3-3 sections due to the consensus input of all three reviewers and the presented option of focusing on the theoretical results of how conformational autoinhibition influences PA. We do plan to continue this research program beyond this manuscript, and we therefore very much appreciate these insights into which aspects should be supported with additional experiments and the challenges that follow from the pleiotropic activities of 14-3-3 proteins. The suggestion of quantifying the ratio of autoinhibited to non-autoinhibited forms of RAF when 14-3-3 proteins are present and absent is an experiment we plan to pursue in our future work. It will require us to learn new methods and/or to form new collaborations, and we therefore appreciate the consensus opinion that this would be outside of our current expertise and outside of the scope of the focused manuscript on modeling the impact of conformational autoinhibition on PA.
  
  Reviewer #2 (Public Review):
  
  In this study, the authors set out to investigate factors that have been neglected in existing mathematical models for the paradoxical activation (PA) of RAF by pharmacological inhibitors. The PA phenomenon is well known and is thought to be an important factor in limiting the effectiveness of RAF inhibitors. The authors primarily use mathematical models, first to examine the importance of conformational autoinhibition of RAF monomers, and later to investigate the potential role played by binding of 14-3-3 proteins to either autoinhibited monomers or active dimers. The authors develop several model variants containing different candidate mechanisms and generate analytical solutions that demonstrate under which parameter conditions PA may occur within these models. The use of analytical solutions is a strong point of the paper, as it allows evaluation of the models independently of specific parameter values. This analysis suggests that conformational autoinhibition is a very strong contributor to paradoxical activation, as models that include this mechanism show substantially larger concentration ranges under which RAF is activated by inhibitors. Fitting the parameters of the model to a published dataset on multiple inhibitors further suggests that conformational activation is important, as models containing this mechanism can fit the dataset with significantly lower error. Another interesting observation is that the different types of RAF inhibitors (1, 1.5, 2) fit the data with parameter values that are reasonably similar within each type. A moderate weakness in this analysis is that all of these observations provide indirect evidence for the importance of conformational autoinhibition. A direct test of whether PA is reduced when conformational autoinhibition is removed would be more compelling, but such a test could be difficult to set up experimentally.
  
  We thank Reviewer #2 for reviewing our manuscript, and we agree with this summary. We agree that an experimental test where conformational autoinhibition is removed from the system would be a very compelling experiment, but that it would be difficult to set up experimentally. We appreciate the option to focus on the theoretical advance in our revision, and we will be working toward such an experiment.
  
  The authors then perform an analysis of how 14-3-3 binding to either autoinhibited monomers or active dimers might enhance PA. A new model is constructed that contains these binding events in the context of conformational activation, but without negative cooperativity or dimer potentiation included, for the sake of limiting complexity. These models implicate monomer binding, but not dimer binding as a contributor to PA. They follow up on this model result by overexpressing 14-3-3 proteins in two RAS-mutant cell lines, which leads to both higher baseline ERK phosphorylation and to a wider range of inhibitor-induced PA, as predicted by the model. A cell-based RAF dimerization assay also shows higher dimerization effects when 14-3-3 plasmids are transfected as well. This experimental evidence provides strong support for the model, although one drawback, which is noted by the authors in the discussion, is that 14-3-3 overexpression could potentially exert effects on RAF activity through pleiotropic effects other than the binding actions included in the model.
  
  We thank Reviewer #2 for the input on the 14-3-3 section of the manuscript. Although it has been removed from the revision, all of the comments from the review will be helpful for our ongoing work.
  
  Overall, this study makes a strong contribution to understanding the paradoxical effects of RAF inhibitors on the RAS/ERK signaling pathway, which remains a significant problem in the use of targeted inhibitors for cancer. Demonstrating that both conformational activation and 14-3-3 binding strongly contribute to the PA effect is an important step forward, as it establishes that these mechanisms should not be overlooked when designing strategies to use Raf inhibitors.
  
  We appreciate the thoughtful review and helpful comments to improve the manuscript.
  
  Reviewer #3 (Public Review):
  
  The authors describe a mathematical and computational modeling study of RAF paradoxical activation (PA), a phenomenon in which RAF inhibitors exhibit a bell-shaped dose-response curve of Erk phosphorylation - activating signaling through wild-type RAF at low drug concentrations before inhibiting it at higher concentrations. They explore three distinct mechanisms that may contribute to PA - conformational autoinhibition, negative cooperativity, and drug-induced dimerization - and conclude that all three are required to best fit published data that show the PA phenomenon. They explore the effect of 14-3-3 binding to RAF both computationally and experimentally and reach the conclusion that 14-3-3 can potentiate the PA phenomenon via stabilization of the autoinhibited conformation.
  
  We thank Reviewer #3 for reviewing our manuscript, and for the helpful comments in the review.
  
  Strengths:
  
  One key finding will be quite valuable to the field - that paradoxical activation can arise in the absence of negative cooperativity and without any effect of the inhibitor on the propensity of RAF to dimerize, provided that there exists a "conformationally autoinhibited" state that cannot dimerize and cannot bind inhibitor. This finding is important because negative cooperativity and dimer-induction have been a major focus - arguably the main focus - of prior studies of the phenomenon and also a source of considerable confusion. Inhibitors with very different chemical structures and binding properties - type 1.5 inhibitors that are dimer-breakers (and may or may not exhibit negative cooperativity) and type I and II inhibitors that can promote dimers (and almost certainly do not exhibit negative cooperativity) can nevertheless both exhibit PA. Thus the authors' modeling provides a unifying explanation - it is not dimer-induction or negative cooperativity that is at the root of PA, rather it is that there exists an autoinhibited state that can neither bind inhibitor nor dimerize. The authors further show that negative cooperativity and dimer-induction can act in concert with "conformational autoinhibition" to modify the PA response in a drug-specific manner.
  
  We thank Reviewer #3 for highlighting these strengths and their value to the field. In the focused paper, we have updated our discussion of the fits and of the model to highlight these points better.
  
  Weaknesses:
  
  Unfortunately, the authors don't really explain in a straightforward manner what is going on with the conformational autoinhibition model (Figure 2A). One has to read carefully and all the way to section 3 of appendix 1 to piece it together. In short, what the math shows is that at least for certain ranges of parameter values, the presence of an inhibitor can increase the concentration of dimers, even when it does not change the equilibrium constant for dimer formation, and some of those dimers will have an active, drug-free protomer. This is because the inhibitor effectively traps open monomers, which can then capture drug-free open monomers to form active dimers (active in one subunit, inactive and drug-bound in the other). As inhibitor concentration increases, the pool of autoinhibited RAF is diminished, and eventually, it is shifted completely to fully inhibited dimers. But at low concentrations of inhibitor, there is a net increase in dimerized (active) but drug-free protomers (see figure on page 27 of the appendix). Voila, paradoxical activation, with no need to invoke negative cooperativity.
  
  We apologize for the confusion, and agree that the description/walk-through in the appendix should be featured more prominently in the manuscript. To this end, we have added a section to the main manuscript (titled “Paradoxical activation reflects a shifting balance of signaling complexes”) that includes the content that was previously in the appendix, and we have added a supplementary figure (Figure 2 – figure supplement 2) which includes the figures from the appendix. Thank you for your thorough review and for working through the appendix; we appreciate this suggestion.
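
  The mechanism the reviewer walks through above can be illustrated with a minimal equilibrium sketch. This is not the authors' model or code: the function name, parameter names, and default values are hypothetical, free drug is assumed to equal total drug, and drug binding is assumed to leave the dimerization constant unchanged (no negative cooperativity, no drug-induced dimer potentiation). Under those assumptions, the only ingredient beyond plain binding is an autoinhibited monomer that can neither bind drug nor dimerize.

```python
import math


def active_protomers(drug, r_total=1.0, k_auto=100.0, k_dim=1.0, k_drug=1.0):
    """Concentration of drug-free protomers that sit in dimers at equilibrium.

    Hypothetical parameters (illustrative, not taken from the manuscript):
      k_auto = [autoinhibited monomer] / [open monomer]; 0 disables autoinhibition
      k_dim  = dissociation constant for dimerization of open monomers
      k_drug = dissociation constant for drug binding to open monomers
    Assumes free drug equals total drug (drug in excess over RAF).
    """
    x = drug / k_drug                     # ratio of drug-bound to drug-free open RAF
    a = 2.0 * (1.0 + x) ** 2 / k_dim      # a * O^2 = protomers held in all dimer species
    b = k_auto + 1.0 + x                  # b * O = autoinhibited + open + drug-bound monomers
    # Mass balance a*O^2 + b*O = r_total, solved for the free open monomer O
    # (written in the numerically stable form of the quadratic root).
    open_free = 2.0 * r_total / (b + math.sqrt(b * b + 4.0 * a * r_total))
    # Two drug-free protomers per drug-free dimer, one per half-occupied dimer.
    return 2.0 * open_free ** 2 * (1.0 + x) / k_dim


if __name__ == "__main__":
    for dose in (0.0, 1.0, 100.0, 1e4, 1e6):
        with_auto = active_protomers(dose)
        without_auto = active_protomers(dose, k_auto=0.0)
        print(f"drug={dose:g}  with autoinhibition={with_auto:.3g}  without={without_auto:.3g}")
```

  With these illustrative values, the autoinhibited case rises above its drug-free baseline at intermediate doses and falls below it at very high doses, whereas setting k_auto to 0 gives a response that only decreases with dose.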
  
  Considering the potential for confusion around what is meant by "drug-induced dimerization" as an effect distinct from the effect of the drug in promoting RAF dimerization in their conformational autoinhibition model, it would have been helpful for the authors to explicitly address the distinction (drug-induced dimerization alters the equilibrium constant for dimerization; this is not a feature of the conformational autoinhibition model).
  
  Thank you for this suggestion. We have clarified our text by rewriting it to read: … some RAF inhibitors have been shown to result in an increased level of RAF dimerization (Hatzivassiliou et al., 2010; Jin et al., 2017; Karoulia et al., 2016; Lavoie et al., 2013). This drug-induced dimer potentiation is commonly thought of as manifesting in a higher affinity between RAF protomers when one (or both) are bound to a RAF inhibitor (Kholodenko, 2015).
  
  Also, I am confused by Figure 3C. The figure shows, and the authors state in the text, that for type II inhibitors an f > ~1 indicates a propensity to break dimers. But type 1.5 inhibitors should break dimers, and Type I and II inhibitors should promote dimers (at least some Type I and II drugs have been shown to promote kinase dimers). Seems that the predictions of the model are inconsistent with experimental data, at least for some inhibitors.
  
  We agree that discussing the fits, relating them to experimental data and current thinking in the field, is important. We have therefore significantly extended our discussion of the fits in Figure 3C in the Discussion of the text. The new text reads:
  
  It has previously been difficult to reconcile PA for Type I.5 inhibitors, which are sometimes thought of as dimer breakers because they position the alpha-C helix in the “out” position (in contrast to Type I and Type II inhibitors). Studies with recombinant protein and analytical ultracentrifugation clearly found type I.5 inhibitors to predominantly be in the monomeric form (Lavoie et al., 2013). Within-cell assays have similarly found type I.5 inhibitors to promote dimerization less than other Type I and Type II RAF inhibitors (Hatzivassiliou et al., 2010; Peng et al., 2015; Thevakumaran et al., 2015); however, RAF inhibitors still appeared to promote some dimerization in those in-cell assays. 14-3-3 binding proteins, which can help stabilize RAF dimers, may help explain this discrepancy (Kondo et al., 2019; Liau et al., 2020; Park et al., 2019). For example, by promoting the non-autoinhibited form, a type I.5 inhibitor-bound RAF monomer is more dimerization capable than an autoinhibited (and non-inhibitor bound) RAF monomer, and even if the affinity is reduced compared to a non-autoinhibited and non-inhibitor bound RAF monomer, 14-3-3 proteins may be able to bind and overcome the effect. As our model does not explicitly include 14-3-3 proteins, this effect may contribute to our parameter estimation process finding an elevated binding affinity for type I.5 bound RAF monomers.
  
  Although negative cooperativity has been difficult to precisely measure experimentally, it has widely been assumed to be present to help explain the paradoxical activation caused by Type I.5 inhibitors that do not promote dimerization as strongly as other RAF inhibitors. Our best fit parameters did tend to have g values that were larger than 1, indicating that the model fit best when there was some negative cooperativity. This could suggest that negative cooperativity is more abundant than widely believed. Alternatively, the model without negative cooperativity was able to fit the data nearly as well as the full model that included negative cooperativity (i.e., Figure 3D). This may suggest that other processes not included in the model may be modulating paradoxical activation and the g parameter, as the only other term in the model, is contributing to the model's ability to account for these otherwise not included effects.
  
  We found parameter sets that reproduced available published data in order to test our model and investigate the potential for it to help illuminate aspects of PA. The best fit parameter sets further support a role for conformational autoinhibition and its modulation by RAF inhibitors in PA. However, it is also important not to read too deeply into the fits. For example, the data for the type II inhibitors AZ-628, LY3009120, and TAK-632 had small total fold-change PA magnitudes, and our fits for them have even less PA. We anticipate that the model-fitting approach would converge to increasingly accurate estimates for the parameters as the set of data being fit to expands. Additionally, quantitative experimental measurements of the parameters being fit should also cascade to impact other parameters and result in better estimates (Gutenkunst et al., 2007).
  
  A large part of the paper deals with the effect of 14-3-3 binding. In my view, this part of the manuscript is not particularly helpful. There is no evidence (that I am aware of) that 14-3-3 concentrations vary significantly, or that their variation affects RAF activity/signaling. Considering their abundance relative to RAF, and relatively high affinity for RAF, it is likely that both autoinhibited and active RAF are saturated with 14-3-3. (RAF that is not 14-3-3-bound is likely mostly bound to chaperones and not active). That said, the authors' conclusion (based on modeling) that 14-3-3 can increase the extent of paradoxical activation by stabilizing the autoinhibited state seems sensible, but hard to reconcile with their experimental result where they find increased basal signaling with 14-3-3 over-expression. It is also difficult to understand how increased 14-3-3 binding to RAF could lead to active RAF dimers that are not inhibited at 10-100 uM concentrations of potent RAF dimer inhibitors like LY3009120 (Fig. 5C). It seems more likely that 14-3-3 overexpression is promoting Erk phosphorylation in a manner that is (at least partially) Raf-independent. To their credit, the authors acknowledge this concern.
  
  We thank Reviewer #3 for the helpful critique of the section on 14-3-3. Although we have cut this section as part of the consensus review and suggestions for how to proceed with the revision, these points are very helpful for us as we consider how to interpret the modeling and experimental results of this section, how it fits into what is known, and what we should investigate next. Thank you.
  
  Finally, one comment regarding the presentation. The authors discuss conformational inhibition and 14-3-3 binding as if they are promoting and/or inducing paradoxical activation. This is pervasive in the paper, including in the title, and is distracting and potentially will mislead some readers. Obviously, it is RAF inhibitor that induces or promotes paradoxical activation. Conformational autoinhibition - mediated by 14-3-3 - is a feature of the system that makes paradoxical activation possible.
  
  We completely agree. We have rephrased to avoid this interpretation and we apologize for not recognizing it previously. Thank you for catching this and noting it for us to fix. As examples of the revisions to address this point, the last sentence of our abstract now reads:
  
  Overall, this work establishes conformational autoinhibition as a robust mechanism for RAF-inhibitor driven PA based solely on equilibrium dynamics of canonical interactions that comprise RAF signaling and inhibition.
  
  And as another example, the third to last sentence in our Introduction now reads:
  
  Our modeling reveals that, under certain conditions, RAF autoinhibitory conformational changes and their modulation by RAF inhibitor binding can be sufficient to drive PA.
  
  Lastly, we have a final paragraph in the discussion that summarizes the work and hypothesizes how it generalizes:
  
  Our analysis was motivated by RAF inhibitors and PA in RAS mutant cells treated with a RAF inhibitor. Our model, however, is generalizable to other systems that share the modeled features. We anticipate that PA will be observed for other proteins (a) that have a dynamic-equilibrium of conformations, (b) where not all conformations can dimerize, and (c) where drug binding the protein stabilizes one or more of the conformations that can dimerize. As dimerization and conformational autoinhibition are both common features for kinase regulation (Huse & Kuriyan, 2002; Lavoie et al., 2014), it seems reasonable to hypothesize PA will be observed for more kinases through modulation of the conformation and dimerization dynamic-equilibrium. Thank you for suggesting these changes.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/849489v2
www.biorxiv.org

New submission 16/07/2023, 11:43:35

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This manuscript reports a study to investigate the reporting practices in three top cardiovascular research journals for articles published in 2019. The study was preregistered, which makes the intent and methodology transparent, and the authors also make their materials, data, and code open. While the preregistration and sampling strategy are strengths, the study suffers from a higher-than-expected number of non-empirical articles, which decreases the sample size and thus the inferences that can be drawn. The authors' focus was mainly on transparency of reporting and not on the actual reproducibility or replicability of the articles; however, the accessibility of data, code, materials, and methods is a prerequisite. While the authors were still able to draw inferences to their main objectives, they could not perform some of their proposed analyses because of a small sample size (due partly to fewer than half of the articles in their sample being empirical, as well as the low number of papers with accessible information to code). One of the descriptive analyses they performed, the country level scores (Figure 6), in particular suffers from the small sample size, and while the authors indicate this in their manuscript, I do not think it would be reasonable to include, as it has the potential to be misinterpreted since so many scores are based on n=1. Overall, I found the authors' presentation and discussion clear and concise; however, a lack of a more in-depth discussion is an area to improve the current manuscript. The manuscript outlines opportunities for researchers, journals, funders, and institutions to improve the way cardiovascular research is reported to enable discovery, reuse, and reproducibility.
  
  We appreciate the reviewer’s recognition of our pre-registration, methodology, and resource sharing and also their feedback regarding the small sample size of empirical research articles and need for a more in-depth discussion of the impacts of our study. We have now increased the number of empirical studies to a total of 393 out of 639 articles screened. We also agree that our study focuses more on transparency than reproducibility and replicability, and we have changed our title to reflect this. While the sample size of empirical papers has increased, a comparison of accessibility scores across countries continued to suffer from small sample size and we have removed it based on the recommendation of the reviewers. We have updated the Materials and Methods section to reflect our updated analyses, as well as included additional paragraphs on Limitations and Future Work in our Discussion to acknowledge future improvements that could be made to the accessibility score used in our study.
  
  Reviewer #2 (Public Review):
  
  This is a descriptive paper in the field of metascience, which documents levels of accessibility and reproducible research practices in the field of cardiovascular science. As such, it does not make a theoretical contribution, but it argues, first, that there is a problem for this field, and second, it provides a baseline against which the impact of future initiatives to improve reproducibility can be assessed. The study was pre-registered and the methods and data are clearly documented. This kind of study is extremely labour-intensive and represents a great deal of work.
  
  I have a major concern about the analysis. It is stated that to be fully reproducible, publications must include sufficient resources (materials, methods, data and analysis scripts). But how about cases where materials are not required to reproduce the work? In lines 128-129 it is noted that the materials criterion was omitted for meta-analyses, but what about other types of study where materials may be either described adequately in the text, readily available (e.g., published questionnaires), or impossible to share (e.g., experimental animals)?
  
  To see how valid these concerns might be, I looked at the first 4 papers in the deposited 'EmpricalResearchOnly.csv' file. Two had been coded as 'No Materials availability statement' and for two the value was blank.
  
  Study 1 used registry data and was coded as missing a Materials statement. The only materials that I could think might be useful to have might be 'standardized case report forms' that were referred to. But the authors did note that the Registry methods were fully documented elsewhere (I am not sure if that is the case).
  
  Study 2 was a short surgical case report - for this one the Materials field was left blank by the coder.
  
  Study 3 was a meta-analysis; the Materials field was left blank by the coder.
  
  Study 4 was again coded as lacking a Materials statement. It presented a model predicting outcome for cardiac arrhythmias. The definitions of the predictor variables were provided in supplementary materials. I am not clear what other materials might be needed.
  
  These four cases suggest to me that it is rather misleading to treat lack of a Materials statement as contributing to an index of irreproducibility. Certainly, there are many studies where this is the case, but it will vary from study to study depending on the nature of the research. Indeed, this may also be true for other components of the irreproducibility index: for instance, in a case study, there may be no analysis script because no statistical analysis was done. And in some papers, the raw data may all be present in the text already - that may be less common, but it is likely to be so for case studies, for instance.
  
  A related point concerns the criteria for selecting papers for screening: it was surprising that the requirement for studies to have empirical data was not imposed at the outset: it should be possible to screen these out early on by specifying 'publication type'; instead, they were included and that means that the numbers used for the actual analysis are well below 400. The large number of non-empirical papers is not of particular relevance for the research questions considered here. In the Discussion, the authors expressed surprise at the large number of non-empirical papers they found; I felt it would have been reasonable for them to depart from their pre-registered plan on discovering this, and to review further papers to bring the number up to 400, restricting consideration to empirical papers only - also excluding case reports, which pose their own problems in this kind of analysis.
  
  A more minor point is that some of the analyses could be dropped. The analysis of authorship by country had too few cases for many countries to allow for sensible analysis.
  
  Overall, my concern is that the analysis presented here may create a backlash against metascientific analyses like this because it appears unfair on authors to use a metric based on criteria that may not apply to their study. I am strongly in favour of open, reproducible science, and agree it is important to document the state of the science for different disciplines. But what this study demonstrates to me is that if you are going to evaluate papers as to whether they include things like materials/data availability statements, then you need to have an N/A option. Unfortunately, I suspect it may not be possible to rely on authors' self-evaluation of N/A and that means that metascientists doing an evaluation would need to read enough of the paper to judge whether such a statement should apply.
  
  We thank the reviewer for the time taken to review our paper, the appreciation of the work we conducted, and for the suggestions for improving our research methods. To address the initial concern about our analytical approach, the definition for fully reproducible publications that we used was only applicable to research that utilized empirical research methods. We recognize that publications such as editorials and reviews are not inherently reproducible experimental studies; thus, such papers were not provided with an accessibility score, were only screened for components such as funding and conflict of interest information, and were only compared amongst each other. Additionally, articles such as meta-analyses and systematic reviews that do not include materials had adjusted accessibility scores. We expanded our Methods and Discussion sections to further explain our screening process and our assumption that all empirical research articles contain methods, data, and analysis scripts and to acknowledge the limitations of our approach. We also agree that screening more empirical research articles is more in line with the intent of our pre-registration and we expanded the number of empirical research articles screened to 393. We also agree with the reviewer that the analysis by country should be excluded because of the small sample size for most countries, and we have adjusted the manuscript accordingly.
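
  The N/A option raised by the reviewer, and the adjusted scores mentioned in the response, can be sketched as a score computed only over the components that apply to a given article. This is a hypothetical illustration, not the scoring used in the study: the component names, the use of None for N/A, and the function name are all assumptions.

```python
from typing import Dict, Optional

# Hypothetical component names, following the four resources listed in the review.
COMPONENTS = ("materials", "methods", "data", "analysis_scripts")


def accessibility_score(article: Dict[str, Optional[bool]]) -> Optional[float]:
    """Fraction of applicable components that are accessible.

    True = accessible, False = not accessible, None = not applicable (N/A).
    A component missing from the dict is treated as N/A.
    Returns None when no component applies to the article.
    """
    applicable = [article[name] for name in COMPONENTS if article.get(name) is not None]
    if not applicable:
        return None
    return sum(applicable) / len(applicable)


if __name__ == "__main__":
    # A meta-analysis: materials are N/A, so the score is taken over three components.
    meta_analysis = {"materials": None, "methods": True, "data": True, "analysis_scripts": False}
    print(accessibility_score(meta_analysis))
```

  Scoring over applicable components keeps an article from being penalized for a statement it could not sensibly provide, at the cost of requiring a coder to judge applicability for each article.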
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.07.06.498942v1
www.medrxiv.org

New submission 19/07/2023, 09:06:56

1
1. Public_Reviews 19 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the reviewers for their insightful comments, which raise several important points regarding our study. As the reviewers have recognised, we introduced a number of simplifications in order to perform this complex optimisation problem, such as by restricting the analysis to a single intervention (insecticide-treated nets) and modelling countries at a national level. Despite their clear relevance to the study, computationally it would not have been feasible to run the multitude of scenarios suggested by reviewer 1, which we recognise as a limitation. As such we agree with the assessment that this study primarily represents a thought experiment to assess whether current policies are aligned with an optimal allocation strategy or whether there might be a need to consider alternative strategies. The findings are relevant primarily to global funders and should not be used to inform individual country allocation decisions. This perspective also underlies our decision to start the analysis from a baseline of year 2000 as opposed to modelling the current 2023 malaria situation: the largest international donor (the Global Fund) also uses baseline malaria levels in the period 2000-2004 as the basis of their allocation calculations (The Global Fund, Description of the 2020-2022 Allocation Methodology, December 2019). A simplified version of this method is represented by our “proportional allocation” strategy. We will further address these points in a revised manuscript and detailed responses to the reviewer comments.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.04.16.23288647v2
www.biorxiv.org

New submission 16/07/2023, 08:57:50

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  Machold and colleagues develop and describe an intersectional genetic mouse (Id2Cre:Dlx5/6FlpE) that allows for the targeting of a cortical interneuron subpopulation predominantly consisting of the neurogliaform cell subtype (NGFCs). The strategy is a modification of that previously published by the authors (Id2cre:Nkx2-1Flpo; Valero et al., 2021) in which a subset of deep layer 6 NGFCs with distinct embryonic origins were targeted. Conversely, using the NDNF transgenic mouse lines, previous studies, including those from the Rudy laboratory, have clearly shown the prevalence of NGFCs in the outermost cortical Layer 1 region. Thus, the Id2Cre:Dlx5/6FlpE mouse poses an advantage over these previous approaches permitting the targeting of NGFCs in Layers 2-5. NGFCs in these regions have been hitherto difficult to study in an expedited manner.
  
  The manuscript is of the resource/toolbox type and the authors are thorough in their description of the distribution and molecular characteristics of the ID2 neurons labelled by this intersectional approach. Furthermore, the authors perform a series of in vivo experiments. These entail the identification of NGFCs, the assessment of their influence on other neuronal populations, and the ability to delineate their activity during various network and behavioral states. Indeed, the authors reveal an activity pattern that is unique to NGFCs across epochs of specific network states. Therefore, this clearly demonstrates the applicability of the ID2Cre:Dlx5/6Flpe mouse to study the role of L2-5 NGFCs in a whole brain setting and these in vivo experiments constitute a major strength of the current study.
  
  However, as with many transgenic mice, they are not always perfect, and the authors are very transparent regarding the additional, albeit a relatively smaller number of reported non-NGFCs particularly those of the CCK IN subtype. Indeed, clear morpho-functional divergence is revealed by the authors between these ID2 IN subpopulations. Furthermore, it is possible that this variability may differ across varying cortical regions. Thus, careful consideration of this caveat is necessary when using this mouse for future in vitro and in vivo studies. Related to this matter is a concern regarding the framing of the manuscript. The authors term the ID2 mixed population as the "4th group" since they do not express PV, SST, and VIP. One could argue this is a matter of semantics but to combine IN types that display distinct morphological and physiological properties into a single "group" based on one molecular feature is not consistent with that proposed by the widely accepted Petilla terminology (Ascoli et al., 2008).
  
  We agree that the definition of “group” here for INs delineated by the molecular markers PV, SST, VIP and Id2 is oversimplified, but in practice, the use of the corresponding genetic tools (e.g., Pvalb-Cre, Sst-Cre etc.) has resulted in widespread adoption of this marker-based organization of IN diversity. For example, PV+ INs targeted with PV-Cre encompass both basket cells and chandelier cells that while sharing some electrophysiological properties (e.g., fast-spiking behavior) are completely distinct morphologically, and innervate different subcellular compartments (soma vs. axon initial segment). The same is true for SST INs, in that there appear to be at least three main subtypes – Martinotti, non-Martinotti, and long range projecting – each with distinct axonal projections and electrophysiology. Thus, while the molecular targeting approaches developed to date have greatly facilitated functional studies of IN subtypes, they have prioritized marker expression over the other aspects of IN diversity outlined in the Petilla framework.
  
  Of interest to many who investigate cortical INs is the ability to genetically target specific subtypes during development. To this end, a potential and welcome addition to the manuscript would be an analysis (perhaps restricted to distribution/molecular characterization) highlighting whether the Id2cre:Dlx5/6Flpe strategy allows genetic access to layer 2-5 NGFCs during postnatal development following maternal tamoxifen administration.
  
  We agree that a method to target NGFCs at early postnatal ages would be useful; however, the expression of Id2 is dynamic during development, and is robust in ventricular zone progenitors at embryonic stages (Neuman et al., 1993 Dev. Biol. PMID 8224536) so maternal tamoxifen administration is likely to result in nonspecific labeling. Furthermore, we found that multiple doses of tamoxifen were necessary to achieve decent labeling of the Id2 IN population in adult animals, a protocol that would be difficult to perform in pregnant dams or early postnatal animals due to pup lethality.
  
  Regardless, the experiments in the current study are, in general, well performed and clearly presented with the authors' conclusions supported by the results. Thus, it is clear that further refinements to genetic strategies are obviously required to exclusively target NGFCs throughout the cortical depth. Nevertheless, in the interim, the approach described in this current manuscript will be of use to the neuroscience community and help to further unravel the physiological role of this relatively understudied neuronal subtype.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.01.518752v2
www.biorxiv.org

New submission 16/07/2023, 08:48:56

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #3 (Public Review):
  
  Because of the position of pigeon embryos in eggs, light exposure will only stimulate the right eye, leading to lateralisation of brain responses and behaviour. Lorenzi and colleagues injected manganese chloride into pigeon eggs, to assess neuronal activation in the embryonic brain. While the eggs were placed in the light or dark, manganese ions accumulated in neurons that were activated (in cell bodies and axons), which was then visualized with MRI of the embryos before hatching. The authors report lateralisation of neuronal activity in three brain regions, which could potentially be important for our understanding of experience-dependent development of lateralised neural activation.
  
  The tectofugal pathway in pigeons projects from the retina to the optical tectum, then to the nucleus rotundus in the thalamus, and then to the entopallium. The thalamofugal pathway projects from the retina to the GLd in the thalamus, and then to the wulst in the hyperpallium. The two pathways involve different thalamic nuclei (e.g., Deng 2006). In the methods and throughout the manuscript it should be specified which thalamic region is used as ROI.
  
  Here we refer to the GLd in the thalamofugal visual pathway; we did not estimate activity in the n. rotundus. We have now clarified this point in the revised MS (ll. 54, 80, 86).
  
  This manuscript only describes neural activity, but the MEMRI technique should also be used to assess the effect of experimental manipulations on axonal connectivity. It is important to learn about the asymmetry of contralateral projections in the light vs dark groups for answering the research question.
  
  Here we used systemic administration of Mn through the CAM. The blood-brain barrier at this embryonic stage is not completely developed, and its permeability to ions and small molecules is much higher in the embryo than at later stages of development (Engelhardt, B. (2003). Development of the blood-brain barrier. Cell and Tissue Research, 314(1), 119-129). Other studies involving direct, local injection in selected brain regions are more apt to investigate connectivity, but this is not the protocol used here. We appreciate the reviewer’s suggestion, and this will be the object of future experiments. However, we would like to disseminate the current protocol and the results it led to at an early stage to enable and encourage its use by other researchers in the field.
  
  There is an overinterpretation of post-hoc statistics that are reported without correction for multiple testing. The wulst light group lateralization is probably not actually different from zero (uncorrected p=0.04).
  
  We considered the reviewer's observation regarding the need for improvements in the statistical methods. In response, we have made amendments to the relevant section of the manuscript, explicitly stating that significant findings were obtained using a two-way ANOVA. For comparisons between conditions within specific brain regions, we conducted two-sample t-tests, and the results were corrected for Type I errors using the false discovery rate (FDR) method. Post-hoc one-sample t-tests were employed to assess lateralization across brain regions and conditions, and the corresponding p-values were reported without correction for multiple comparisons (as explicitly reported in the text, to avoid any confusion).
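The effect of an FDR correction on a borderline p-value can be illustrated with a minimal sketch. The response does not say which FDR procedure was used, so the Benjamini-Hochberg step-up procedure is assumed here; the function name `benjamini_hochberg` and the p-values are illustrative only and are not taken from the manuscript.

```python
import numpy as np

def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values (one common FDR procedure).

    Assumption: the manuscript only says "FDR method"; BH is used here
    purely for illustration.
    """
    p = np.asarray(p_values, dtype=float)
    m = p.size
    order = np.argsort(p)                          # indices of p-values, ascending
    ranked = p[order] * m / np.arange(1, m + 1)    # p_(i) * m / i
    # Enforce monotonicity, working down from the largest p-value
    ranked = np.minimum.accumulate(ranked[::-1])[::-1]
    adjusted = np.empty(m)
    adjusted[order] = np.clip(ranked, 0.0, 1.0)    # restore the original order
    return adjusted

# Hypothetical uncorrected p-values for three comparisons (illustrative numbers)
raw = [0.04, 0.01, 0.30]
adj = benjamini_hochberg(raw)
print(adj)
```

With these illustrative inputs, an uncorrected p-value of 0.04 no longer falls below 0.05 after adjustment, which is the kind of outcome the reviewer's comment on uncorrected post-hoc tests is concerned with.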
  
  The first line in the discussion states that there is thalamofugal lateralization, but no lateralization in the tectofugal pathway. To my understanding, previous literature reported it the other way around: in altricial pigeons, light exposure in the egg mainly affected the tectofugal pathway (Deng & Rogers 2002), while the thalamofugal pathway in pigeons was not lateralized (Strockens et al., 2013). The manuscript should compare the current findings with the literature and discuss differences.
  
  We are aware of the substantial differences in brain lateralization of the two visual pathways between pigeons and chicks after embryonic light exposure. However, in the present work we employed chick embryos (Gallus gallus domesticus), and the space limitations of a Brief Communication do not allow for an in-depth discussion of these differences between avian species.
  
  Moreover, the tectum is the only region shown here from the tectofugal pathway. However, lateralization of contralateral connections is expected from tectum to the nucleus rotundus in the thalamus, and thus lateralization of activation may only arise in downstream brain regions from the optical tectum. Therefore, the conclusion that there is no lateralization in the tectofugal pathway is not supported by the data.
  
  In conclusion, I think it is interesting and worthwhile that the authors assessed neural activity in response to visual stimulation in the embryo prior to hatching, but multiple methodological weaknesses and unclarities should be addressed.
  
  The ROI that we here named Thalamus does not include the nucleus rotundus but refers to the nucleus geniculatus lateralis (GLd). We have now clarified this point in the revised MS (ll. 54, 80, 86), and we now refer only to the tectum, without generalizing to the entire tectofugal pathway, which will be the subject of future investigations.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.02.526801v1
www.biorxiv.org www.biorxiv.org

New submission 16/07/2023, 08:40:02

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #3 (Public Review):
  
  This manuscript proposes to tackle a very interesting and methodologically challenging topic: the mechanistic underpinnings of neural specialization in the infant brain. The authors presented 4- to 7-month-old infants with social and non-social stimuli while their neural, hemodynamic, and metabolic activity was monitored, and they report a complex pattern of relationships between neural and metabolic or hemodynamic responses during social processing on the one hand, and during non-social processing on the other hand.
  
  The approach described in this manuscript is very interesting and the combined use of EEG and bNIRS data appears very promising. However, there is some confusion between the initial aims of the study, and the analyses performed, which jeopardizes the clarity and the impact of this manuscript. Besides, the predictions of the authors are often underspecified which complexifies the interpretation of the results.
  
  Based on its abstract, the goal of this work is to "combine simultaneous measures of coordinated neural activity metabolic rate and oxygenated blood supply to measure emerging specialization in the infant brain". The introduction nicely elaborates on the "interactive specialization theory" and the potential role of the interplay between brain energy consumption and neural activity in the emergence of functionally specialized brain regions during development. The authors present a novel multimodal approach, with potentially important implications for the study of brain specialization as a function of experience or maturation. Yet the experimental procedure presented in this manuscript only assesses specialized brain activity in response to social processing in 4- to 7-month-old infants, using multimodal neuroimaging.
  
  Indeed, the authors presented 4- to 7-month-old infants with social and non-social stimuli while their neural, hemodynamic, and metabolic activity was monitored. The authors report significant differences between the two conditions in terms of neural activity in the delta, alpha, beta, and gamma bands; as well as in the pattern of hemodynamic to metabolic coupling. Using a GLM approach, the authors report on fNIRS channels and EEG sensors showing significant relationships between the evoked neural activity in the beta and gamma frequency bands, and each of the bNIRS signals (HbO, HbR & CCO), in the social and in the non-social conditions. The authors identify a particular fNIRS channel overlaying posterior STS, showing a positive relationship between Pz EEG beta activity and HbO, as well as CCO, together with a negative relationship between that same neural activity and HbR, in the social condition. This pattern of activity was not observed in the non-social condition.
  
  Overall, these results indicate differential neural responses to social and non-social stimuli, and coupled metabolic and hemodynamic activity in response to social as well as non-social stimuli.
  
  These results additionally indicate coordinated metabolic, hemodynamic, and neural responses in brain regions selective for social processing, but they do not allow us to conclude that this coordinated activity is actually related to the functional specialization process (e.g. last sentence of the abstract).
  
  We would like to thank the reviewer for their detailed comments. Based on their suggestions, we have made several changes to the manuscript. This study was the first to combine EEG and broadband NIRS and therefore served as a proof of principle study. At the onset of this work, there were many elements to develop such as the technical aspect of simultaneous bNIRS – EEG measurements as well as the methodology to combine the signals from both techniques with such different time resolutions. Therefore, we focused on one age group of infants rather than performing a study involving multiple age groups. The 4-to-7-month-old age group has been studied extensively using fNIRS, particularly to look at social brain development using similar stimuli as those used in the present study. Previous studies have demonstrated that social selectivity can be detected at 4 – 8 months of age (Grossmann et al., 2010; Lloyd-Fox et al., 2012, 2013, 2017). As this was a proof of principle study, we wanted to ensure that we were able to replicate results from previous studies with this new methodology. We therefore used one age group of 4-to-7-months. This has also been added to the introduction of the manuscript to provide clearer reasoning for using this age group.
  
  The reviewer is correct that the current study does not provide direct evidence of developmental change in functional specialisation or the hypothesised interactive process through which functional specialisation may occur. Rather, we are measuring the status of functional specialisation (the idea that different areas in the brain are specialised for different functions) at the age we study, by testing whether the signals we observe are selective to social but not non-social stimuli. We have reframed the abstract and introduction of the manuscript to ensure this is clear, and we additionally now focus more on the methodology developed to answer such questions. Future studies can leverage our methodology to study different age groups to establish how the relationships between neural and vascular/metabolic signals change over developmental time, which may provide greater insight into the specialisation process.
  
  Grossmann, T., Oberecker, R., Koch, S. P., & Friederici, A. D. (2010). The Developmental Origins of Voice Processing in the Human Brain. Neuron, 65(6), 852–858. https://doi.org/10.1016/j.neuron.2010.03.001
  
  Lloyd-Fox, S., Begus, K., Halliday, D., Pirazzoli, L., Blasi, A., Papademetriou, M., Darboe, M. K., Prentice, A. M., Johnson, M. H., Moore, S. E., & Elwell, C. E. (2017). Cortical specialisation to social stimuli from the first days to the second year of life: A rural Gambian cohort. Developmental Cognitive Neuroscience, 25, 92–104. https://doi.org/10.1016/j.dcn.2016.11.005
  
  Lloyd-Fox, S., Blasi, A., Elwell, C. E., Charman, T., Murphy, D., & Johnson, M. H. (2013). Reduced neural sensitivity to social stimuli in infants at risk for autism. Proceedings of the Royal Society B: Biological Sciences, 280(1758), 20123026. https://doi.org/10.1098/rspb.2012.3026
  
  Lloyd-Fox, S., Blasi, A., Mercure, E., Elwell, C. E., & Johnson, M. H. (2012). The emergence of cerebral specialization for the human voice over the first months of life. Social Neuroscience, 7(3), 317–330. https://doi.org/10.1080/17470919.2011.614696
  
  Another weakness of this manuscript relates to the unclear or underspecified motivations behind some of the performed analyses. For example, the authors contrast brain responses to social vs. baseline, non-social vs. baseline, and social vs. non-social. For clarity in the manuscript, the authors should specify the motivation behind each of these contrasts and their predictions.
  
  We thank the reviewer for their suggestion. We have added the predictions for each of the analyses in the introduction section, lines 436 – 527. We have removed the “social minus non-social” comparison for the EEG topographical maps from Figure 2 as there was no value added by including this comparison.
  
  Another example is the hemodynamic and metabolic coupling analysis: here the authors analyze only the social vs. baseline and non-social vs. baseline contrasts, and they do not analyze the social vs. non-social contrast. It would be useful for the reader to understand why only these two contrasts are performed and not the social vs. non-social contrast, and what the authors' predictions are.
  
  We have now added this into the manuscript and the results can be seen in Figure 3c. We have clarified our predictions both at the end of the introduction (lines 436 - 527) and at the beginning of the discussion (lines 685 – 755).
  
  The following has been added to the introduction:
  
  For EEG, we expected an increase in neural activity in response to the social condition and a decrease in neural activity in response to the non-social condition. Based on previous work, this was expected to be strongest in the theta frequency band [3]. Moreover, for the combined bNIRS-EEG analyses, we hypothesised differentiated haemodynamic/metabolic coupling with neural activity for the social and non-social stimulus conditions. We performed two types of statistical tests: a) individual comparisons of the social and non-social conditions and b) comparison of the social condition versus the non-social condition. The individual condition tests were performed to show the scale and spatial location/sensitivity of the coupling between haemodynamics/metabolism and neural activity for each condition. Meanwhile, the social versus non-social comparison was performed to show where there was a significant difference in the coupling between the two conditions. With comparison (a) we aimed to identify regions involved in the processing of social and non-social stimuli by identifying the regions where the coupling was significant. With comparison (b) we aimed to identify regions where coupling was significantly different between conditions. We predicted that for the individual comparison of the social condition, we would observe positive associations between bNIRS and EEG measures, i.e. coordinated increases in haemodynamics/metabolism and neural oscillatory activity in the beta and gamma frequency bands (based on previous combined EEG – fMRI studies [16], [18]–[21], [23], [30]) which would be localised to core social brain regions. We hypothesised that for the non-social condition, over the same brain regions, positive associations would be observed between bNIRS and EEG measures, but they would be coordinated decreases in haemodynamics/metabolism and oscillatory activity. 
  We also expected coordinated increases in haemodynamics/metabolism and oscillatory activity localised to the parietal brain region. These predictions are based on our previous work [29], where we demonstrated that stronger coupling between haemodynamics and metabolism was observed in the temporo-parietal regions for the social condition and in the parietal region, which is known to play an important role in object processing [31], [32], for the non-social condition. For the social versus the non-social contrast, we predicted that haemodynamic activity and metabolism would be coupled with neuronal oscillatory activity more strongly for the social stimuli in comparison to the non-social stimuli, with significant differences being observed in the temporo-parietal regions.
  
  The following has been added to the discussion:
  
  As a proof of principle, we examined the relationship between these measures to identify regional selectivity to social versus non-social stimuli. To first demonstrate the scale and spatial sensitivity of the coupling between haemodynamic/metabolic activity and neuronal oscillatory activity, comparisons were performed individually for the social and non-social conditions. For this, we predicted coordinated increases in haemodynamics/metabolism and neural activity in the beta and gamma frequency band. We predicted that for the social condition this would be localised to the core social brain regions (temporo-parietal region) while for the non-social condition, we expected the coupling to be localised to parietal regions, known to be involved in object processing [31], [32]. We additionally expected coordinated decreases in haemodynamic/metabolic activity and neural activity over the temporo-parietal region for the non-social condition, in accordance with our previous work [29]. Next, to demonstrate differential coupling for social and non-social stimuli, we performed a comparison of the social condition versus the non-social condition. For this, we hypothesised that in the beta and gamma frequency bands, there would be stronger coupling between haemodynamics/metabolism and neural activity for the social condition over the temporo-parietal region.
  
  Finally, the core result of this work derives from the final GLM analysis which relates EEG activity to hemodynamic or metabolic responses. This analysis implies the inspection of interactions between 3 neuroimaging modalities, with 4 EEG measures, 2 hemodynamic measures, and 1 metabolic measure, which represents a very rich and relatively complex analytic approach. Unfortunately, the predictions are not clearly specified, which makes results interpretation difficult.
  
  We appreciate that the methods are complex, and the hypotheses should be stated more clearly. The hypotheses have now been explicitly stated both at the end of the introduction (lines 436 - 527) and at the beginning of the discussion (lines 685 – 755).
  
  Based on the results (L160-162) and discussion (L233-235) sections, it appears that the authors aim at identifying brain regions showing a precise pattern of activity, with a positive relationship between EEG activity and HbO/CCO responses together with a concurrent negative relationship between EEG and HbR responses in response to social events, but not in response to non-social events. Importantly, the social vs. non-social contrast seems crucial to assess the selectivity of the response. Yet, the authors analyze the 3 chromophores separately, and they do not contrast the two conditions (figure 3). As a result, the authors are limited to reporting a descriptive pattern of relationships between EEG and HbO/HbR/CCO activations for the social condition. And another one for the non-social condition. Overall, the authors conclude that channel 14, overlaying the right TPJ, shows the expected pattern of activity, specifically in response to social stimuli. Yet, this statement is only supported by visual inspection/comparison of the results between the social vs baseline and non-social vs baseline conditions. The authors do not assess analytically the differential patterns of activations between the two conditions. Instead, a GLM including all 3 chromophores and contrasting the two experimental conditions would allow us to directly test the predicted pattern of activity, and the selectivity of the activity for social stimuli.
  
  As per the reviewer’s comment, we have now included the comparison of the social and non-social conditions, shown in Figure 3c. The results from this comparison showed that haemodynamic and metabolic activity at channels 11 and 14 (located spatially close to one another) had a significantly greater association with EEG electrode “Pz” for the social condition than for the non-social condition in the beta and gamma bands. These results analytically support the selectivity of the response to the social condition.
  
  We have kept the results showing the individual comparison of the social and non-social conditions. The individual condition tests were performed to show the scale and spatial location/sensitivity of the coupling between haemodynamics/metabolism and neural activity for each condition. Meanwhile, the social versus non-social comparison was performed to show where there was a significant difference in the coupling between the two conditions. With comparison (a) we aimed to identify regions involved in the processing of social and non-social stimuli by identifying the regions where the coupling was significant. With comparison (b) we aimed to identify regions where coupling was significantly different between conditions. The following has been added on lines 533 – 541 to explain the reasoning behind the comparisons performed:
  
  We performed two types of statistical tests: a) individual comparisons of the social and non-social conditions and b) comparison of the social condition versus the non-social condition. The individual condition tests were performed to show the scale and spatial location/sensitivity of the coupling between haemodynamics/metabolism and neural activity for each condition. Meanwhile, the social versus non-social comparison was performed to show where there was a significant difference in the coupling between the two conditions. With comparison (a) we aimed to identify regions involved in the processing of social and non-social stimuli by identifying the regions where the coupling was significant. With comparison (b) we aimed to identify regions where coupling was significantly different between conditions.
  
  As our interest was in looking at the selectivity of the response and not comparing the chromophores, we did not perform a comparison between chromophores.
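The distinction between comparison (a) and comparison (b) described above can be sketched in code. This is a simplified illustration under stated assumptions, not the authors' GLM pipeline: coupling is reduced to a per-infant ordinary least-squares slope between one EEG band-power series and one bNIRS signal, the data are simulated, and the names `coupling_beta` and `one_sample_t` are hypothetical.

```python
import numpy as np

def coupling_beta(eeg_power, nirs_signal):
    """Slope of an ordinary least-squares fit: nirs ~ intercept + eeg."""
    X = np.column_stack([np.ones_like(eeg_power), eeg_power])
    beta, *_ = np.linalg.lstsq(X, nirs_signal, rcond=None)
    return beta[1]

def one_sample_t(values):
    """t statistic testing whether the mean of `values` differs from zero."""
    v = np.asarray(values, dtype=float)
    return v.mean() / (v.std(ddof=1) / np.sqrt(v.size))

# Simulated data (assumption): 20 infants, 40 trials each, with coupling
# present in the social condition and absent in the non-social condition.
rng = np.random.default_rng(0)
n_infants, n_trials = 20, 40
social, non_social = [], []
for _ in range(n_infants):
    eeg = rng.normal(size=n_trials)
    social.append(coupling_beta(eeg, 0.8 * eeg + rng.normal(size=n_trials)))
    non_social.append(coupling_beta(eeg, rng.normal(size=n_trials)))
social = np.array(social)
non_social = np.array(non_social)

# Comparison (a): is coupling different from zero within each condition?
t_social = one_sample_t(social)
t_non_social = one_sample_t(non_social)

# Comparison (b): does coupling differ between conditions (paired contrast)?
t_contrast = one_sample_t(social - non_social)
print(t_social, t_non_social, t_contrast)
```

In this sketch, only comparison (b) tests selectivity directly; two separate within-condition results from comparison (a) do not by themselves establish a difference between conditions.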
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.11.08.514512v1
www.medrxiv.org www.medrxiv.org

New submission 16/07/2023, 08:36:16

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer 2 (Public Review):
  
  1) My major criticism of the study is that the authors argue for CD8+ Trm activity as a key mechanism for OLP pathogenesis but have presented mostly descriptive datasets. The data strongly argue for CD8+ Trm cells as a defining feature of erosive OLP, but there is no data to support their involvement in disease pathogenesis. The authors note the lack of a mouse model for OLP which represents a significant technical barrier to interrogating the role of CD8+ Trm cells in OLP pathogenesis.
  
  Thank you for bringing this to our attention, and please accept our apologies for any confusion caused by our previous article. OLP is an immune disease whose pathogenesis involves multiple factors, but no corresponding animal model exists at present, which clearly limits research. We therefore focus on the reasons for changes in the clinical state of the disease. Our study found that CD8+ Trm cells play an important role in the changes observed in the local presentation of OLP, specifically erosions. However, it is important to note that they are not the primary driver of the disease. In addition, we use cohort studies combined with transcriptome data to increase the strength of evidence for causal effects. We have revised and emphasized this point in the updated text.
  
  The modified description in the introduction is as follows:
  
  Notably, EOLP has a significantly higher risk of malignant transformation than non-erosive oral lichen planus (NEOLP) (Danielsson et al., 2013). To reduce the psychological and economic burden of OLP patients, improve their quality of life, and decrease the risk of cancer, it is crucial to maintain the disease in a relatively stable non-erosive stage for as long as possible. However, clinical experience suggests that OLP often exhibits a prolonged and recurrent disease course, with alternating periods of non-erosive and erosive lesions. Despite this, the underlying causes and mechanisms of lesion type switching remain unclear (Husein-ElAhmed and Steinhoff, 2022). (Page 4, lines 13-21)
  
  2) Another criticism is the lack of strong findings in the analysis of CD8+ Trm cells isolated from non-erosive and erosive OLP tissues. The authors note increases in CD8+ Trm cell recovery, however, they only observe minor changes in CD8+ Trm activity upon restimulation. Analyzing the activation status or proliferative capacity of CD8+ Trm cells from non-erosive and erosive OLP could be informative and more robust measures of functional changes.
  
  We appreciate your suggestion to test the activation status and proliferation of sorted CD8+ Trm cells to further investigate the differences between the two groups. However, due to the limited amount of tissue available for our study, it was difficult to obtain sufficient numbers of CD8+ Trm cells for these experiments. Additionally, there is a lack of established methods for in vitro culture of CD8+ Trm cells, which further limited our options for functional studies.
  
  To investigate the function of CD8+ Trm cells in the two tissue groups, we instead measured inflammatory factors in the supernatant of CD8+ Trm cells after in vitro stimulation. This allowed us to indirectly assess the activity of CD8+ Trm cells in non-erosive and erosive OLP. We used ELISA to measure the levels of several inflammatory cytokines that are known to be produced by activated T cells, including CD8+ Trm cells.
  
  We acknowledge that this method has limitations and is an indirect measure of CD8+ Trm cell function. However, we believe that our approach provides useful information on the potential role of CD8+ Trm cells in oral lichen planus and represents a valuable contribution to the field.
  
  3) A minor criticism is the formatting of the data presented in Figure 4. The authors should clearly label each marker used in the flow cytometry experiments as well as clearly labeling y-axes for graphs 4H and 4I.
  
  Thank you for your valuable comments. I have modified the flow cytometry diagram accordingly, labeling each step of the gating strategy, and have also modified the other two graphs. Panels 4H and 4I have been renumbered as 4G and 4H.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2022.10.18.22281149v1
www.biorxiv.org www.biorxiv.org

New submission 21/12/2022, 11:45:11

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This paper investigates whether bistable rhodopsins can be used to manipulate GPCR signalling in zebrafish. As a first step, the authors compared the performance of bistable rhodopsins fused with a flag tag or with a fluorescent protein tag (TagCFP). Constructs were compared by expressing in HEK cells followed by calcium imaging with aequorin or cAMP monitoring with GloSensor. This showed that the protein with a smaller flag tag performed better. Then, a series of transgenic zebrafish lines were made, in which tagged rhodopsins were expressed in reticulospinal neurons or cardiomyocytes.
  
  The data indicate that bistable rhodopsin can be used to manipulate Gq and Gi/o signalling in zebrafish. The Gq-coupled SpiRh1 was effective in manipulating reticulospinal neurons, as indicated by analysis of tail movements and calcium imaging of the neurons. Gi/o signalling could be manipulated by Opn3 from mosquitoes, TMT from pufferfish, and parapinopsin from lamprey, as shown by their effects on the heartbeat. Lamprey parapinopsin has the interesting property that it can be turned on and off by different wavelengths of light, and this was used to stop and restart the heart. Finally, the authors show that the cardiac effects are mediated by an inward-rectifier K+ channel, through the use of pharmacological inhibitors.
  
  A strength of this paper is the testing of a range of bistable rhodopsins, with a total of 10 proteins tested. This provides a good resource for future experiments. A weakness is the failure to show that some experiments involved repeated sampling of the same animal. Figure 3 gives the impression that there are 48 independent datapoints. However, there are 8 animals, with 6 datapoints coming from each. Similarly, Figure 4 shows the data from 6 trials of 4 animals, not 24 independent animals. Repeated sampling should be reflected in the data presentation, and in the statistical analysis. Was there an effect of trial number, which is suggested in Figure 6?
  
  In response to the reviewer’s comments, we modified the graph to show the average data for individual animals in Figure 3A-E, Figure 3-supplement 2, Figure 4D-F, H, and Figure 4-supplement 2B. We also showed the effect of trial number (difference between trials 1 and 6) in Figure 3-supplement 1 and Figure 4-supplement 1. In addition, we also showed all data as source data. We believe that more accurate statistical analyses were conducted using data from each individual animal.
  
  Delta F/F refers to relative change, which should be (F-F0)/F0. This should be zero when t = 0. The values in Figure 3E, and 3F are ~ 1 when t = 0, however. Are these figures showing F/F0?
  
  The reviewer is correct. It is indeed (F-F0)/F0 (ΔF/F0). In Figure 3F (3E in the original manuscript), t=0 was the time when 470-495 nm light (for both stimulation of SpiRh1 and detection of GCaMP6s fluorescence) started to be applied. In the experiment in Figure 3G (3F in the original manuscript), 405 nm light was applied to activate SpiRh1[S186F] for 2 s and then 470-495 nm light was applied to detect GCaMP6s fluorescence. In other words, t=0 is the time when 405 nm light started to be applied.
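The difference between ΔF/F0 and F/F0 that the reviewer raises can be shown with a short numerical sketch. The fluorescence values, the choice of the first three samples as baseline, and the function name `delta_f_over_f0` are illustrative assumptions and are not taken from the manuscript.

```python
import numpy as np

def delta_f_over_f0(trace, n_baseline):
    """Relative fluorescence change (F - F0) / F0.

    F0 is taken as the mean of the first `n_baseline` samples
    (an assumption made for this sketch).
    """
    f = np.asarray(trace, dtype=float)
    f0 = f[:n_baseline].mean()
    return (f - f0) / f0

# Illustrative trace: flat baseline followed by a rise in fluorescence
trace = [100.0, 100.0, 100.0, 150.0, 200.0]
dff = delta_f_over_f0(trace, n_baseline=3)   # 0 during baseline
ratio = np.asarray(trace) / 100.0            # F/F0, 1 during baseline
print(dff, ratio)
```

A trace plotted as ΔF/F0 starts at 0 during the baseline, whereas the same trace plotted as F/F0 starts at 1, so a curve that sits near 1 at t = 0 is consistent with the F/F0 convention.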
  
  The authors' conclusions that the bistable rhodopsins are useful tools in the zebrafish system appear largely justified. This is consistent with findings from other organisms, including mouse (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8097317/, https://www.sciencedirect.com/science/article/pii/S0896627321001616). The tools here are likely to find broad use by scientists who use the zebrafish as the experimental system for a variety of different areas.
  
  For the studies on LamPP and MosOpn3, we cited the references mentioned by the reviewer. We believe that our study substantiates that LamPP and MosOpn3, as well as other bistable rhodopsins, are valuable tools for zebrafish research, as pointed out by the reviewer.
  
  Reviewer #2 (Public Review):
  
  The presented study aims at deciphering the physiological function of GPCR signaling in excitable cells. To this end, the authors developed transgenic zebrafish models expressing a selection of Gq- and Gi/o-coupled bistable rhodopsins in either reticulospinal neurons or cardiomyocytes and elucidated behavioral responses (tail movements) or physiological responses (heartbeat) as well as intracellular Ca2+ dynamics following optical stimulation of rhodopsins.
  
  One of the major strengths of the presented study is the functional comparison of five Gq- and five Gi/o-coupled rhodopsins in two major classes of excitable cells; however, the rationale for the selection of rhodopsins tested remains elusive. More importantly, it is not obvious why some of the effects of rhodopsin activation were assessed in both neurons and cardiomyocytes, while others were only tested in one of the two systems without further explanation. The main chosen experimental readouts (swimming/tail bending or cardiac contractions) have limited informative value regarding GPCR signaling, as they will only report the tip of the iceberg, namely whether movements are elicited or heartbeats inhibited. No analysis of subtle changes in heart rate and contraction force was included, but such modulation of cardiac activity (e.g. positive or negative chronotropic, inotropic, dromotropic, bathmotropic, and/or lusitropic responses) would better represent the physiological modulation of the heart via GPCR and downstream signaling events. In line with this, the presented data only represent behavior at one light intensity tested, whereas a light titration of observed effects could provide more meaningful insight into both rhodopsin responses and signaling mechanisms. Also, the potential promiscuity of G protein activation of selected receptors has not been addressed, either experimentally or in the discussion. As a result of the above-mentioned limitations, it is difficult to follow the logic of the study and especially to interconnect the data obtained in reticulospinal neurons (where activation of jumping spider rhodopsin elicited tail bending) with the myocyte data (where three Gi-coupled rhodopsins suppressed cardiac activity). Moreover, as such, the study does not provide explanations of why a certain tool might or might not evoke an effect in one system or the other, which could be the main deliverable of such a comparative analysis.
  
  We are grateful for helpful and insightful comments from the reviewer. We believe that the presentation of experimental findings in the original manuscript may have led to a misunderstanding. We examined the effects of Gq and Gi/o-coupled bistable rhodopsins on both reticulospinal V2a neurons and cardiomyocytes. We observed noticeable effects of Gq rhodopsins on reticulospinal V2a neurons, but no significant effects on cardiomyocytes. Similarly, we found effects of Gi/o-coupled rhodopsins on cardiomyocytes, but no significant effects on reticulospinal V2a neurons. These discrepancies could be attributed to differences in the target cells and experimental conditions, suggesting the need for further optimization. We described the data on page 13, lines 16-22 and page 16, lines 9-10 in the Results section and Table 1, and discussed the relationship between the activity of bistable rhodopsins and their effects on target cells on page 21, lines 6-15 and page 24, line 19-page 25, line 2 in the Discussion section of the revised manuscript.
  
  In order to clarify the function of Gi/o-coupled rhodopsins on the heart in more detail, we conducted experiments in which we activated cardiomyocytes expressing bistable rhodopsins at various light intensities to observe the effects on heartbeats. We analyzed cardiac arrest rate, latency to cardiac arrest, and time to resumption of heartbeat. The results of these experiments are shown in Figure 4 and Figure 4-supplement 2, 3 in the revised manuscript. We described the data on page 15, line 16-page 16, line 1 in the revised manuscript, as follows.
  
  To analyze the photosensitivity of Gi/o-coupled rhodopsins, we applied light of various intensities for 1 s and examined their effect on HBs (Figure 4-supplement 2). Cardiac arrest was induced and sustained for over 20 s after stimulation of MosOpn3 with 0.05 mW/mm2 light for 1 s. Photoactivation of PufTMT and LamPP at lower light intensities (0.2 or 0.05 mW/mm2) resulted in cardiac arrest, but faster HB recovery than stimulation with 0.5 mW/mm2 light (Figure 4-supplement 2). The data indicate that the ability of MosOpn3 to suppress HBs is more photosensitive than that of PufTMT and LamPP in the zebrafish heart. We further examined atrial-ventricular (AV) conductivity by measuring the time difference between atrial and ventricular contractions before and after light stimulation when HBs had slightly recovered. There was no significant difference in AV conductivity before and after light stimulation (Figure 4-supplement 3).
  
  We performed experiments to the best of our ability with current technology regarding cardiac function. However, we hope that the reviewer is willing to acknowledge that there are certain limitations in conducting a detailed analysis of the zebrafish larval heart, since many experimental techniques, such as electrophysiological analysis, have not yet been fully or effectively established for this animal model.
  
  While the presented data is interesting, the graphical presentation and description of the data are insufficient. Most importantly, the current version of the text does not include a quantitative description of effects and statistical analyses (which are found in the figures and legends!). The lack of quantitative description also extends to both the introduction and discussion, which remain general without a specific dissection of observed effects.
  
  We have described quantitative data in the Results section.
  
  One major concern is the selective citation of own work. While single statements in both the introduction and discussion are supported by up to ten own papers, recent studies using rhodopsins for dissecting GPCR signaling in neurons are not sufficiently discussed and new data is not compared to published results by other teams. Moreover, relevant papers on cardiomyocytes (e.g. PMID: 35579776, 35365606, 34987414, 30894542) are not cited at all, despite the use of similar rhodopsins and/or optogenetic activation of the same signaling pathways. Taking into account these published studies may help to better understand the observed responses.
  
  We apologize for not citing important relevant papers in the original manuscript. We have now cited all four papers (Dai et al., 2022; Wagdi et al., 2022; Cokic et al., 2021; Makowka et al., 2019) mentioned by the reviewer, as well as a new paper describing the use of MosOpn3 and LamPP in C. elegans neurons (Koyanagi et al., 2022) in the Introduction section. We also discussed the differences between our findings and previously published data in the Discussion section.
  
  Additional comment: Data were obtained from larval zebrafish. It would be useful to include a discussion on how GPCR signaling might be different in adult fish compared to larvae, and how to test whether the observed effects are more generally applicable.
  
  We discussed the differences between the hearts of zebrafish larvae and adults, and the differences in GPCR signaling, on page 27, lines 10-16, as follows. In this study, we used zebrafish larvae to study the role of GPCR signaling in cardiac function, and there are differences in heart structure and function between larvae and adult zebrafish. As a zebrafish grows, blood pressure increases and the heart becomes more complex with the development of valves and ventricular trabeculae. Therefore, GPCR signaling, which regulates heart structure and function, may differ between juvenile and adult fish. Optogenetic manipulation of the heart’s function in adult zebrafish using bistable opsins should clarify this issue.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.25.513732v1
www.biorxiv.org www.biorxiv.org

New submission 21/12/2022, 11:48:30

1
1. Public_Reviews 17 Jul 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 This paper aims to test whether a series of light-activated ion channels (GtCCR4, KnChR) and enzymes that regulate second messengers (BeGC1, bPAC, OaPAC) can be used to manipulate cells in the zebrafish.
 
 Among the strengths of the paper are the use of several independent methods to test whether the tools are functional - e.g. electrophysiology of mammalian cells for GtCCR4, calcium and cAMP imaging in zebrafish cells in vivo, behaviour tests (tail movement) and monitoring of heart beat. Multiple transgenic lines were established, to select for lines with optimal expression levels. Experiments are carried out in two cell types - reticulospinal neurons in the hindbrain and cardiomyocytes.
 
 The authors have largely achieved their aim of determining whether the rhodopsins can be used in zebrafish. They demonstrate that the cation channel KnChR is particularly sensitive in triggering depolarization of the reticulospinal neurons, as indicated by tail movement. They show that the photoactivatable adenylyl cyclase bPAC and cation channels have an effect on heartbeat. Two other photoactivatable enzymes OaPAC and BeGC1 have no effect on heartbeat, although it is not evident whether this is due to lack of effect on cAMP and cGMP levels.
 
 The abstract sets out to investigate the role of second messengers, emphasizing the need for specificity. However, KnChR is not specific for Na+. As noted by Tashiro et al, the channel can also conduct H+, Ca2+ and Mg2+. The knowledge gap that is being addressed by the manuscript thus needs to be reframed. The concluding statement of the abstract, that the tools tested here can be used to investigate second messengers, is not accurate given the broad conductance of KnChR.
 
 We agree with the reviewer. We changed the title to “Optogenetic manipulation of neuronal and cardiomyocyte functions in zebrafish using microbial rhodopsins and adenylyl cyclases” and revised the abstract and introduction, accordingly. The last sentence of the abstract was modified to “These data suggest that these optogenetic tools can be used to reveal the function and regulation of zebrafish neurons and cardiomyocytes.”
 
 The tools described here have been tested previously in other species, either in cultured mammalian cells (GtCCR4, KnChR, OaPAC) or in vivo (bPAC and BeGC1). The current work thus does not introduce novel tools, but provides evidence that some of these tools can be used in zebrafish. Overall, the lines characterized here will be of use to scientists using zebrafish as the experimental system in a variety of areas.
 
 We appreciate the positive comments from the reviewer. It was worthwhile generating and analyzing so many transgenic zebrafish.
 
 Reviewer #2 (Public Review):
 
 Optogenetic proteins are important tools for circuit neuroscience. The authors characterize five proteins, GtCCR4, KnChR, BeGC1, bPAC, and OaPAC with respect to their ability to suppress normal cell excitability and compare the results to those for the more established GtACR1 and CrChR2[T159]. The study makes use of expression in the zebrafish heart and hindbrain, as well as in a cell line. Electrophysiology in the cell line demonstrates that GtCCR4 photo-activation induces similar currents as CrChR2 activation and shows fewer signs of desensitization. Using a transgenic vsx2:Gal4 zebrafish line, immunohistochemistry shows that the tools are expressed. When activated, they triggered the expected behavioral responses (swimming) at short latency (<4s). This was true even for the three tools that are guanylyl or adenylyl cyclases (BeGC1, bPAC, OaPAC) and thus affect cell excitability only indirectly. At the tested light intensity, the Klebsormidium nitens channelrhodopsin (KnChR) had the shortest latency (<0.5 s) and highest (100%) probabilities of inducing locomotion. When expressing the tools in the zebrafish heart, brief illumination (100 ms) induces brief (100 ms - 1500 ms) suppression of the heartbeat. Notably, also tools that evoke depolarization induce heartbeat suppression. Heartbeat movies and calcium imaging demonstrate that this is caused by prolonged cardiomyocyte contraction. The optogenetic guanylyl and adenylyl cyclases were not effective in perturbing zebrafish heartbeat (except for bPAC over longer time scales).
 
 Given the large number of optogenetic proteins available to date and the challenge of employing them in well-controlled neuroscience experiments, this study presents an important contribution for neuroscientists performing optogenetic research in animal models. Two light-gated cation channels, GtCCR4 and KnChR, are tested for the first time in vivo. The evidence supporting the claims regarding heartbeat and induced swimming behavior is solid. Since GtCCR4 is more Na+-selective than other channelrhodopsins, it should allow better control of experimental variables and is a valuable addition to the optogenetic tool box. The created transgenic zebrafish lines will be useful for the zebrafish neuroscience community.
 
 The expression in zebrafish was compared using immunohistochemical staining (of a single Gal4 driver line). From this experiment alone, it is difficult to judge the expression level, the in vivo visibility of the fluorescence under the microscope, and the proportion of target cells that do express the optogenetic gene of interest.
 
 The evidence for optogenetically induced alteration of swimming behavior is compelling. However, the associated neuronal responses and their dependence on different light intensity levels remain uncharacterized. Therefore, if anyone plans to use these tools to investigate a neural circuit in the future, the needed light levels and the specificity of the manipulation would still need to be determined.
 
 We stimulated neuronal ND7/23 cells, reticulospinal V2a neurons or cardiomyocytes expressing microbial optogenetic tools at various light intensities and examined their effects on neuronal activities and behaviors (tail movements and cardiac arrest). These data are shown in revised Figure 1, Figure 1-supplement 1, Figure 3, Figure 3-supplements 2, 3, Figure 5, and Figure 5-supplements 1, 2. We described the data on page 12, line-page 13, line 1 and page 14, lines 10-13 in the revised manuscript.
 
 For the optogenetic guanylyl and adenylyl cyclases, which clearly were able to alter behavioral responses, the signaling and circuit mechanisms that lead to neuronal depolarization remain unknown, but possible activation pathways are discussed.
 
 Reviewer #3 (Public Review):
 
 In this study, the authors set out to test several new optogenetic tools in zebrafish. They motivate the study by citing differences in ion selectivity of channelrhodopsins and the potential utility of photoactivatable adenylyl and guanylyl cyclases to control cell functions. Although the study provides some useful new information about the utility of these tools in zebrafish, the characterization is limited and there are serious caveats around interpretation of behavioral responses.
 
 The latency of behavioral responses is often extremely long and there is a lack of control data from opsin negative animals, raising serious doubts as to whether these responses are optogenetically mediated.
 
 In other words, many of these responses may not result from optogenetic activation of V2a cells, but instead arise from indirect effects such as visual stimulation of the animal. Previous zebrafish studies have shown swimming responses in opsin-negative control animals at latencies above ~100 ms and used a 50 ms cut-off for optogenetically evoked swims. One can see evidence suggestive of this issue in the authors' data: latency data for GtCCR4 appears bimodal with a cluster of short latency swims and a second spread at latencies >2 s; this could be a mix of fast optogenetic and slow artifactual responses. As the authors have already tested opsin-negative control animals, they should examine the latency distribution of these responses. The long latency is even more striking in the case of BeGC1, bPAC and OaPAC where in all cases mean latency exceeds 2 seconds. No short latency responses are apparent and the delay is too long to be solely a result of second messenger action (e.g. activation of cyclic nucleotide gated ion channels). In any case, no explanation is provided.
 
 We understand the reviewer’s concern that the responses were too slow. However, the neurons responded after accumulation of cAMP or cGMP, which bind and activate cyclic nucleotide-gated (CNG) channels in the neurons. Similar delayed responses were observed when G protein-coupled bistable rhodopsins were activated in reticulospinal V2a neurons (please see the accompanying manuscript).
 
 We compared the latency of zebrafish larvae expressing each tool with those not expressing the tool. The data are shown in Figure 3, Figure 3-supplement 1, Figure 5, Figure 6, Figure 7, and Figure 7-supplement 1. Statistically, we considered responses within 8 s after the start of light stimulation as positive, and significant differences in responses were observed depending on the presence or absence of tool expression, suggesting that tail movements were induced by tool activation. In the absence of tool expression, spontaneous movements were occasionally observed, but they did not often occur within 8 s. We have described the data on page 15, line 20-page 16, line 4 in the revised manuscript.
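
 The dependence of the scored response rate on the chosen latency window can be illustrated with a minimal sketch. It assumes per-trial latencies are available as a list, with `None` marking trials without a tail movement. The function name `response_rate` and all latency values below are invented for illustration and are not data or code from the study. The cut-offs correspond to the 50 ms threshold mentioned by the reviewer and the 8 s window used in the response.

```python
def response_rate(latencies_s, cutoff_s):
    """Fraction of trials with a tail movement at or before cutoff_s.

    latencies_s: per-trial latency in seconds; None means no movement was seen.
    """
    if not latencies_s:
        raise ValueError("at least one trial is required")
    responded = sum(1 for t in latencies_s if t is not None and t <= cutoff_s)
    return responded / len(latencies_s)


# Invented illustrative latencies in seconds (NOT data from the study).
tool_positive = [0.3, 0.4, 2.5, 3.1, None, 0.2]
tool_negative = [None, 9.5, None, 6.0, None, None]

# Compare how the scored response rate changes with the latency window.
for cutoff in (0.05, 0.5, 8.0):
    pos = response_rate(tool_positive, cutoff)
    neg = response_rate(tool_negative, cutoff)
    print(f"cutoff {cutoff} s: expressing {pos:.2f}, non-expressing {neg:.2f}")
```

 Tabulating both groups across several cut-offs in this way would show whether the difference between expressing and non-expressing larvae depends on the window chosen; a formal statistical comparison would still be needed on real data.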
 
 Although this study is motivated by the need to precisely control the flux of specific ions and modulate specific second messenger pathways, there is almost no characterisation of these processes in zebrafish cells. As such, the degree to which these tools are useful to "precisely control second messengers in vivo" is unclear and the lack of mechanistic data also leaves open questions about unexpected aspects of behavioral results (e.g. the long latency of presumed cyclic-nucleotide induced behavior, above).
 
 We believe that the description "controlling second messengers" was misleading. Since Reviewer #3 has taken issue with this aspect, we note that this paper does not provide a detailed analysis of second(ary) messengers. We have restructured the entire manuscript to focus on optogenetic regulation of zebrafish neurons and cardiomyocytes rather than on "control messenger regulation".
 
 Finally, there is little comparison with other commonly used optogenetic actuators. CrChR2[T159C] is used as the only control but more recent tools (e.g. CoChR, ChRmine, ChroME) are not considered. Thus, beyond showing that the new tools have behavioral effects in zebrafish, the usefulness of this report for researchers wanting to compare and select between tools is limited.
 
 We examined the activity of CoChR and ChrimsonR in neuronal ND7/23 cells. In addition, we generated transgenic zebrafish expressing CoChR or ChrimsonR, and examined their activity in V2a neurons and cardiomyocytes. We thereby compared the activity of GtCCR4, KnChR, and CrChR2[T159C] with that of CoChR and ChrimsonR. The data are shown in Figure 1, Figure 1-supplement 1, Figure 2, Figure 3, Figure 3-supplement 3, and Figure 5-supplements 1, 2. We described the data for CoChR and ChrimsonR in the relevant part of the Results section (pages 8-14) and discussed a comparison on page 18, lines 2-16 in the revised manuscript.
 
 We found that KnChR was a more potent optogenetic tool than GtCCR4, CrChR2, and ChrimsonR in zebrafish reticulospinal V2a neurons. Optogenetic activity of KnChR was comparable to that of CoChR in both reticulospinal V2a neurons and cardiomyocytes (Figures 1, 3, 5). Truncation of KnChR prolonged the channel open lifetime by more than 10-fold (Tashiro et al., 2021) (Figure 1). KnChR conducts various monovalent and divalent cations, including H+, Na+, and Ca2+, while KnChR has a higher permeability to Na+ and Ca2+ and a higher permeability ratio of Ca2+ to Na+ than CrChR2 (Tashiro et al., 2021). These properties may contribute to the high photo-inducible activity of KnChR. Activation of KnChR may induce influx of more cations with a longer channel open time than CrChR2 and ChrimsonR, leading to stronger cell depolarization. Optogenetic activity of KnChR was comparable to that of GtCCR4 in cultured cells, but higher than that of GtCCR4 in zebrafish reticulospinal V2a neurons and cardiomyocytes. While the exact reason is unclear, it is possible that the expression of functional KnChR protein may be high in zebrafish cells.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.25.513731v1
www.biorxiv.org www.biorxiv.org

New submission 30/12/2022, 17:34:29

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  Dipeptide repeat (DPR) proteins produced from both sense GGGGCC (poly-GA, poly-GP and poly-GR) and antisense CCCCGG (poly-PR, poly-PG, poly-PA) repeat RNAs are found in C9ORF72-linked ALS/FTD and contribute to neurodegeneration. The translation of the repeat RNA can initiate without the AUG start codon, a process known as repeat associated non-AUG (RAN) translation. In this manuscript, the authors used a luciferase reporter construct to show that the translation of PR and PG from the CCCCGG repeats initiated from in-frame AUG codons in the C9 sequences before the repeats. After mutating candidate AUG codons, the translation can initiate from other AUG codons, so there is redundancy. But when all the in-frame AUG codons were mutated, the luciferase signal was dramatically reduced, supporting the translation initiated at the AUG start codon. The translation initiation factor eIF2D has been shown to be important for CUG start codon-dependent poly-GA translation from GGGGCC repeats. Here it is shown that eIF2D is not required for poly-PG and poly-PR translation from CCCCGG repeats using both reporter and patient iPS-neurons. The data using the luciferase reporter to study antisense repeat translation is solid; the translation initiates from an AUG start codon, as there are AUG codons in frame with PG and PR in the constructs containing the antisense sequences.
  
  We thank the reviewer for the constructive feedback.
  
  On the other hand, as the reporter construct includes the sequences containing the AUG codon, it is not surprising that AUG was used. This is canonical translation.
  
  We completely agree. In the revised Introduction, we now point out that, before our study, it was not clear which mode of translation (RAN vs AUG canonical) is employed for DPR synthesis.
  
  Also, in the revised Discussion (lines 251-257), we mention the following: “Hence, our findings together with these previous studies suggest that DPR synthesis may involve at least three different modes of translation: (a) near-cognate start codon (e.g., CUG, AGG) dependent-translation for poly-GA and poly-GR from sense GGGGCC transcripts, (b) canonical AUG-dependent translation for poly-PR and poly-PG synthesis from antisense CCCCGG transcripts, and (c) DPR synthesis may also occur through RAN translation mechanisms that solely utilize the repeat. It is conceivable that all three modes of translation may occur simultaneously in disease, and that the use of non-canonical and canonical initiation codons may be the primary contributors of DPR production”.
  
  The 1,000bp intronic sequence included in our antisense 35xCCCCGG constructs (Figure 1A) is the authentic human intronic sequence. We agree that it does contain multiple putative initiation codons, and this was our motivation for conducting systematic mutagenesis of all these codons. To narrow down the list of putative initiation codons, we used our recently developed machine-learning algorithm for initiation codon prediction (PMID: 35648796). We found a CUG and an AUG in the poly-PR frame, and a CUG and three AUGs in the poly-PG frame, all of which had a good Kozak sequence (as mentioned in Results). Systematic mutagenesis of these codons (single and multiple codon mutations were generated) revealed that an AUG at -273bp is necessary for poly-PR synthesis (Figure 2). Of note, poly-PR is one of the most toxic DPRs, for which an initiation codon had not been previously identified in the literature.
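
  The idea of listing candidate initiation codons that lie in frame with a repeat can be illustrated with a naive scan. This is only a sketch under stated assumptions: it is not the machine-learning predictor cited above, it ignores Kozak context, and the function name `upstream_start_codons`, its parameters, and the toy sequences are hypothetical.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def upstream_start_codons(rna, repeat_start, frame=0, starts=("AUG", "CUG")):
    """List (offset, codon) for candidate start codons upstream of a repeat.

    rna: RNA sequence as a string of A, C, G, U.
    repeat_start: 0-based index of the first base of the repeat.
    frame: 0, 1 or 2, the reading frame relative to the first repeat base.
    offset: position of the codon's first base relative to repeat_start
            (negative means upstream).
    The scan walks backwards codon by codon and stops at the first
    in-frame stop codon, since translation could not read through it.
    """
    hits = []
    pos = repeat_start + frame - 3
    while pos >= 0:
        codon = rna[pos:pos + 3]
        if codon in STOP_CODONS:
            break
        if codon in starts:
            hits.append((pos - repeat_start, codon))
        pos -= 3
    return hits


# Toy sequence (hypothetical, not the C9ORF72 intron): AUG, two codons, repeat.
toy = "AUG" + "GCU" * 2 + "CCCCGG" * 3
print(upstream_start_codons(toy, repeat_start=9))
```

  In this scheme a codon can only be in frame 0 if its offset is a multiple of three, which holds for the reported position of -273 (273 = 3 × 91).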
  
  Additionally, the AUG-initiated translation of antisense repeats has been reported previously. Therefore, the novelty is limited.
  
  We agree that an AUG initiation codon was previously described for poly-PG (Boivin et al., EMBO J, 2020, PMID: 31930538). However, our findings significantly extend this observation because redundancy at the level of AUG initiation codon usage was not reported in that study.
  
  We believe our study significantly contributes to the field of C9ORF72 ALS/FTD in the following way:
  
  (i) We identified for the first time an AUG (at -273nt) necessary for synthesis of poly-PR, one of the most toxic DPRs.
  
  (ii) We propose the concept of initiation codon redundancy for poly-PG, which may apply to other DPRs in C9ORF72 ALS/FTD, as well as in other neurological disorders caused by nucleotide repeat expansion mutations.
  
  (iii) Our findings merged with those of previous studies suggest that DPR synthesis may involve at least three different modes of translation: (a) near-cognate start codon (e.g., CUG, AGG) dependent-translation for poly-GA and poly-GR from sense GGGGCC transcripts, (b) canonical AUG-dependent translation for poly-PR and poly-PG synthesis from antisense CCCCGG transcripts, and (c) DPR synthesis may also occur through RAN translation mechanisms that solely utilize the repeat. It is conceivable that all three modes of translation may occur simultaneously in disease, and the use of non-canonical and canonical initiation codons may be the primary contributor of DPR production.
  
  (iv) We found that the non-canonical translation initiation factor eIF2D is mainly responsible for poly-GA (sense DPR) production without affecting anti-sense DPRs. Hence, we propose a model where DPR translation occurs in a “piecemeal manner”, i.e., a distinct machinery of translation initiation factors may be needed for the synthesis of each DPR.
  
  In the revised manuscript, we now better highlight these key contributions.
  
  How the antisense DPRs are translated endogenously, AUG-canonical translation or RAN translation, depends on whether the AUG is included in the antisense RNA in patients and where the transcription of the antisense starts, upstream or downstream of the AUG start codons. However, this is not considered in the manuscript.
  
  Thank you for this important point. Zu et al. (PNAS, 2013) observed antisense DPR aggregation in brain samples of C9ORF72 ALS/FTD patients. In the same study, the authors conducted 5’ Rapid Amplification of cDNA Ends (RACE). Although this analysis did not identify the exact transcription start site for the antisense CCCCGG RNA, it did show that the region that includes the AUG codons, which we found to be important for poly-PR or poly-PG, is included in the antisense RNA from human C9ORF72 ALS/FTD samples. On page E4969, Zu et al. write: “RACE analysis of FCX samples showed intron 1b antisense transcripts begin at varying sites 251–455 bp upstream of the G2C4 repeat”. The same study also detected antisense RNA foci in brain samples of C9ORF72 ALS/FTD patients.
  
  The exact transcription start site for the antisense (and sense) transcript remains unknown. In the near future, we plan RACE experiments to identify it and share these findings with the community in a separate manuscript.
  
  We have modified the Results (lines 133-136) to: “These results strongly suggest that AUG at -273 bp is the start codon for translation of poly-PR, one of the most toxic DPRs in C9ORF72 ALS/FTD. This AUG is predicted to be included in the endogenous antisense CCCCGG transcript based on 5’ Rapid Amplification of cDNA Ends (RACE) analysis on brain samples of C9ORF72 ALS/FTD patients14.”
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.08.06.503063v1
www.biorxiv.org www.biorxiv.org

New submission 17/07/2023, 09:20:00

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  1) While the current dataset aims to demonstrate a "correlation" between grid cell encoding and task performance, the other variables that could confound this correlation should be carefully examined.
  
  (1) The exact breakdown of the fraction of beaconed/non-beaconed/probe trials is never shown. If the session makeup has a significant effect on the coding scheme or other results, this variable should be accounted for.
  
  (2) The manuscript did not provide information about whether individual mice experienced sessions with different combinations of the three trial types, and whether they show different preferences in position or distance encoding even in comparable sessions. This leads to the question of whether different behaviour and activity encoding were dominated by experimental or natural differences between individual mice. Presenting the data per mouse will be helpful.
  
  (3) Related to the above point, in Figure 5, the mice appeared to behave worse in probe trials than non-beaconed trials. If the mouse did not know if a trial is a probe or a non-beacon trial, they should behave equivalently until the reward location and thus should stop an equal amount. If this difference is because multiple probe trials are placed consecutively, did the mouse learn that it will not get a reward and then stop trying to get rewards? Did this affect switching between position and distance coding?
  
  (4) It is not shown how the behaviours (e.g., running speed away from the reward zone, licking for reward) in beaconed/non-beaconed/probe trials were different and whether the difference in behaviours led to the different encoding schemes.
  
  We appreciate these suggestions and will add all of the requested analyses in a revised manuscript. We note here that while the proportion of trial types differed between sessions, in all sessions trial types were varied in a repeating sequence, so blocks of behaviour where grid firing is anchored (or not anchored) to the track coordinates cannot be explained as a consequence of a particular trial type. We will make this clearer in a revised manuscript.
  
  2) Regarding the behaviour and activity encoding on a trial-by-trial basis, did the behavioural change occur first, or did the encoding switch occur first, or did they happen within the same trial? This analysis will potentially determine whether the encoding is causal for the behaviour, or the other way around.
  
  We agree this is an important point and the corresponding analyses will be reported in a revised manuscript.
  
  3) The authors determined that the grid cell coding schemes were limited to distance encoding and position encoding. However, there could be other schemes, such as switching between different position encodings (with clear spatial fields but at different locations), as indicated by Low et al., 2021, and switching between different distance encodings (with different distance periods). If these other schemes indeed existed in the data, they might contribute to the variation of the behaviours.
  
  We did not observe switching between coding schemes of the same type within our dataset and so did not document this. We agree it is important to do so and will provide additional analyses in the revised manuscript.
  
  4) The percentage of neurons categorised in each coding scheme was similar between non-grid and grid cells. This implies that non-grid cells might switch coding schemes in sync with grid cells, which would mean the whole MEC network was switching between distance and position coding. This raises the question of whether the grid cell coding scheme was important per se, or just the MEC network coding scheme.
  
  We appreciate the suggestion and very much agree that looking at cells outside of just grid cells is important in determining which cells are functionally relevant in spatial behaviours. We will provide additional analyses in a revised manuscript.
  
  5) In Figure 2 there are several cell examples that are categorised as distance or position coding but have a high fraction of the other coding scheme on a per-trial basis. Given this variation, the full session data in F should be interpreted carefully, since this included all cells and not just "stable" coding cells. It will be cleaner to show the activity comparison only between the stable cells.
  
  We agree that showing stable examples before introducing examples that switch on a per-trial basis will be helpful. We will amend this in a revised manuscript.
  
  6) The manuscript is not well written. Throughout the manuscript, there are many unexplained concepts (especially in the introduction) and methods, mis-referenced figures, and unclear labels.
  
  We appreciate the feedback and will work to address the concerns in a revised manuscript.
  
  Reviewer #2 (Public Review):
  
  This study is very timely as there is a pressing need to identify/delimitate the contribution of grid cells to spatial behaviors. More studies in which grid cell activity can be associated with navigational abilities are needed. The link proposed by Clark and Nolan between "virtual position" coding by grid cells and navigational performance is a significant step toward better understanding how grid cell activity might support behavior. It should be noted that the study by Clark and Nolan is correlative. Therefore, the effect of selective manipulations of grid cell activity on the virtual task will be needed to evaluate whether the activity of grid cells is causally linked to the behavioral performance on this task. In a previous study by the same research group, it was shown that inactivating the synaptic output of stellate cells of the medial entorhinal cortex affected mice's performance of the same virtual task (Tennant et al., 2018). Although this manipulation likely affects non-grid cells, it is still one of the most selective manipulations of grid cells that are currently available.
  
  We appreciate the additional context provided here. In our view, it is critical to narrow down the space of possible behaviours that grid cells might contribute to. As the reviewer notes, our previous work provided evidence that speaks to this question by targeting genetic manipulations (Tennant et al., 2018), but while this approach was specific to stellate cells it does not discriminate grid from non-grid cells and so does not tell us specifically about roles for grid cells. As far as we are aware there is currently no manipulation that will do this. In the experiments here, we take a complementary approach, leveraging the variability inherent in behaviour and the fact that in our location memory task animals will perform many trials in a session. By showing that spatially anchored grid firing does not predict behavioural success on cued trials, but does predict success on trials that are solved by path integration, we substantially narrow the space of behaviours that grid cells could contribute to. Importantly, stellate cells appear necessary for both cued and uncued behaviour in the task (Tennant et al., 2018), suggesting that their roles are more general than those of the grid cell population, which is likely to be only a subset of stellate cells. We will more carefully address this point in a revised manuscript.
  
  When interpreting the "position" and "distance" firing mode of grid cells, it is important to appreciate that the "position" code likely involves estimating distance. The visual cues on the virtual track appear to provide mainly optic flow to the animal. Thus, the animal has to estimate its position on the virtual track by estimating the distance run from the beginning of the track (or any other point in the virtual world).
  
  We agree this terminology has the potential for causing confusion. A simpler descriptive definition would be track-anchored and track-independent rather than position and distance coding. We will consider this and other alternatives for a revised manuscript.
  
  Reviewer #3 (Public Review):
  
  This study addresses the major question of 'whether and when grid cells contribute to behaviour'. There is no doubt that this is a very important question. My major concern is that I'm not convinced that this study gives a significant contribution to this question, although this study is well-performed and potentially interesting. This is mainly due to the fact that the relation between grid cell properties and behaviour is exclusively correlative and entirely based on single cell activity, although the introduction mentions quite often the grid cell network properties and dynamics. In general, this study gives the impression that grid cells exclusively support the cognitive processes involved in this task. This problem is in part related to the text. However, it would be interesting to look at the population level (even beyond grid cells) to test whether at the network level, the link between behavioural performance and neural activity is more straightforward compared to the single-cell level.
  
  We appreciate the feedback and suggestions. As we note in our response to Reviewer #2, there is currently no method for selective manipulation of grid cells, while testing correlation is a critical step on the path to establishing causation. Our study contributes by reducing the space of possible functions of grid cells to exclude behaviours in which local cues are available, while providing evidence for a clear relationship between anchoring of grid cells and successful outcomes when path integration is used for localisation. We’re unclear here about what the reviewer means by ‘more straightforward’ as the relationships we establish do not appear overly complicated, and as strong relationships between activity of single grid cells and populations of grid cells are already well established (Gardner et al., 2021; Waaga et al., 2021; Yoon et al., 2013).
  
  The authors used a statistical method based on the computation of the frequency spectrum of the spatial periodicity of the neural firing to classify grid cells as 'position-coding' (with fields anchored to the virtual track) and 'distance-coding' (with fields repeating at regular intervals across trials). This is an interesting approach that nonetheless has the drawback of being based exclusively on autocorrelograms. It would be interesting to compare it with a different method based on the similarities between raw maps.
  
  We’re not sure we understand the point here. The manuscript provides analyses comparing rate maps for activity periods in which grid cells are / are not anchored to the task environment (e.g. Figure 2A-C, Figure 3B-E); when grid cells are anchored the rate maps are clearly spatial, when they are not anchored we show that spatial information (in the track reference frame) is very substantially reduced.
  
  Beyond this minor point, cell categorization is performed using all trial types. Each trial type (i.e. beacon or non-beacon) is supposed to force mice to use different strategies and should induce different spatial representations within the entorhinal-hippocampal circuit (and not only in the grid cell system). In that context, since all trials are mixed, it is difficult to extrapolate general information.
  
  Again, we’re not sure we understand the point. We appreciate this likely reflects a lack of clarity on our part in the writing of the manuscript. As noted in our response to Reviewer #1, we will include additional details about the organisation of trials and relationships between trials, behavioural outcomes and neural codes observed. We should note here that mice are not ‘forced’ to adopt any particular strategy. Rather, on uncued trials a path integration strategy is the most efficient way to solve the task. Mice could instead use a less efficient strategy of stopping at short intervals and still obtain rewards, although the behavioural evidence suggests they do not choose to do this after learning the task.
  
  On page 5 the authors state that 'Since only position representations should reliably predict the reward location, ..., we reasoned that the presence of positional coding could be used to assess whether grid firing contributes to the ongoing behaviour'. I do not agree with this statement. First of all, position coding should be more informative only in a cue-guided trial. Second, distance coding could be as informative as position coding since at the network level may provide information relevant to the task (such as distance from the reward).
  
  Again, this point perhaps reflects a lack of clarity on our part in writing the manuscript. When grid cells are anchored to the track reference frame (position encoding in the manuscript), then the location of the rate peaks in grid firing is reliable from trial to trial. This is the case whether or not the trial is cued. When grid cells are independent of the track reference frame (distance encoding in the manuscript, but we now appreciate this is a poor choice of words), then the location of the firing rate peaks varies from trial to trial; thus position cannot be read out directly from trial to trial. In principle, when grid cells are not anchored to the track the mouse could read out track position by storing the grid network configuration at the start of each trial and then subtracting this from readouts of distance as it moves along the track. If mice do use this computation we would expect them to do so equally well on cued and uncued trials, whereas our results clearly show a dissociation between trial types in the relationship between grid firing and behavioural outcome. We will highlight this possibility in a revised manuscript.
  
  Third, position-coding is interpreted as more relevant because it predominates in correct trials. However, this does not imply that this coding scheme is indeed used to perform correct trials.
  
  As we note above, our analyses reduce the space of behaviours to which grid cells might contribute, by providing evidence that anchoring of grid firing is associated with successful outcomes specifically when mice adopt a path integration strategy. We agree that alternative models remain plausible; for example, perhaps the behaviourally relevant computations are implemented elsewhere in the brain with grid anchoring to the track as an indirect consequence. Nevertheless, the space of alternative models is substantially reduced given our experiments and analyses, while our approach complements tests of grid-behaviour functions that rely on manipulations which leave open alternative explanations based on off-target effects. We expect that inclusion in a revised manuscript of the further analyses suggested above should provide further tests of the grid-behaviour relationship.
  
  It could be more informative to push forward the correlative analysis by looking at whether behavioural performance can be predicted by the coding scheme on a trial-by-trial basis.
  
  Figure 5E shows the recommended analysis.
  
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.12.540491v2
www.biorxiv.org www.biorxiv.org

New submission 17/07/2023, 15:10:02

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  eLife assessment
  
  This useful study emphasizes some previously ignored aspects of synaptic communication between Purkinje neurons and their targets in the cerebellar nuclei. Reviewers felt that some aspects of the evidence were solid but that others were incomplete.
  
  We think this is an extensive and complete study. The major issue that the reviewers raised is about the usage of high chloride internals in our recordings. We feel that this single issue does not really match the statement “others were incomplete”, which suggests that this study is incomplete in some way. Please note that in our complete revision we will respond to the issue of chloride by pointing out: (1) the advantages of using high chloride internals to determine the distribution of input sizes, (2) the challenges of estimating the relationship between input sizes for different chloride internals, (3) the previous studies that have established the relationship between input sizes and chloride levels at other synapses, and (4) additional simulations indicating that subtle changes in the input sizes would have minor quantitative effects on the influences of individual inputs, but would not affect the main conclusions of the paper.
  
  Reviewer #1 (Public Review):
  
  This manuscript explores physiological properties of Purkinje-to-nuclear synapses. The report provides largely incremental advances over what has already been discovered about this synaptic relationship. The main findings, as articulated by the authors, are that Purkinje-to-nuclear synaptic strength is variable, with a few very strong inputs to the cerebellar nuclei. They show that single inputs effectively inhibit nuclear firing and that the diversity of synaptic strength influences nuclear neuron responsivity to inputs by enhancing synaptic variance. In addition, while not necessarily surprising, it's nice to see that stronger inputs would have a stronger influence on a postsynaptic cell, both in terms of rates and temporal coding transfer. Overall, as it stands, the manuscript is not very scholarly, overstates the novelty of findings, and frames a straw-man. That said, buried in here are some potentially interesting observations.
  
  This review provides us with an opportunity to more clearly summarize what is new in our findings. Our study builds upon Person and Raman (2012) and other studies, and makes a number of important advances. (1) We provide a much more extensive characterization of input sizes (n=157) than previous studies, and show that the distribution of input sizes is skewed, with the largest inputs almost 100 times larger than the smallest inputs. This distribution is clearly different from that of Person and Raman (2012), where the estimation of unitary PC input sizes was based on a small sample from a broad range of ages (n=30, P13-29 animals). The high Cl- concentration internal we used in our recordings provides us with superior stability and sensitivity in detecting such variability in input size. (2) We show for the first time that the distribution of input sizes is more skewed in juvenile animals than in young animals, suggesting that PC-CbN synapses are modified by plasticity mechanisms during development. (3) Our dynamic clamp approach is based on the skewed distribution of input sizes we observed, and the Purkinje cell firing patterns we recorded in vivo, whereas Person and Raman (2012) primarily focused their dynamic clamp studies on 40 uniformly sized inputs (even though they recognized that there are also somewhat larger inputs), with their firing interspike intervals drawn from Gaussian distributions (which lack refractory periods and do not represent realistic PC firing patterns). We also complement our dynamic clamp studies with simulations using an integrate-and-fire model that does a good job of replicating our dynamic clamp studies. This allowed us to more thoroughly explore the effects of different-sized inputs in ways that would not be practical with dynamic clamp studies. (4) We show that individual PC inputs powerfully regulate the rate and timing of CbN neuron firing, without requiring a high degree of PC synchrony.
  (5) We further show that timing control by PCs leads to strong inhibition of CbN firing and, surprisingly, a brief elevation prior to the inhibition. This results from the refractory period of PCs, which generates a period of disinhibition prior to the inhibition, and is shaped by the firing statistics of PC inputs. If such an elevation prior to inhibition were observed in vivo, it could be misinterpreted as excitation of CbN neurons by other inputs (e.g., mossy fiber collaterals) preceding the PC inputs. (6) We show that the total inhibitory conductance and the coefficient of variation (CV) of this conductance are both important factors in controlling the firing rate of CbN neurons. Variable input sizes and synchronized inputs both lead to a higher CV of the inhibitory conductance and therefore higher firing rates. (7) We show that PC inputs of all sizes transmit a robust rate code that simply depends on their size. (8) Our study helps to resolve a long-standing controversy in the field. Some thought that PC synchrony is an effective way of controlling CbN neuron firing, while others doubted the physiological relevance of PC synchrony. Here we show that a single large input is functionally equivalent to many small, perfectly synchronized inputs, which can influence the rate and timing of CbN firing as previously proposed (Person and Raman, 2012a), but without requiring a high degree of PC synchrony. We also suggest that a high degree of synchrony is not a prerequisite for an appreciable influence, because synchronizing a few large inputs can have large effects on CbN neuron firing. We strived to be fair and thorough, and we think that the study is scholarly. Prior to the initial submission, we sought advice from experts in the field, Indira Raman and Nicolas Brunel, and their input was very helpful in this regard. We will revise the manuscript to more clearly articulate what has been done previously, and what aspects of our study are new.
  
  Reviewer #2 (Public Review):
  
  In this manuscript, the authors address how cerebellar Purkinje cells (PC) control the firing of nuclear cells (CbN), the output stage of the cerebellum. They used patch-clamp recordings in acute cerebellar slices, and combined dynamic clamp with simulations of nuclear cell firing rate.
  
  This article addresses one of the most fundamental unresolved questions in cerebellar physiology: how do inhibitory PCs control the output stage of the cerebellum?
  
  They first describe a developmental evolution of PC-CbN synapses. Inhibitory synaptic weights become highly variable after three weeks of age, with a group of very large PC inputs. They used dynamic clamp to examine the influence of these variable inputs on CbN firing rate. They demonstrate that while inputs of all sizes affect CbN discharge, larger ones can stop it for a few milliseconds. Using a distribution of variable input sizes, they showed that increasing the variability of PC inputs favors CbN discharge, while increasing the magnitude of a constant inhibitory conductance decreases the firing rate. By varying the frequency of PC inputs, they suggest that CbNs faithfully transmit a rate code, but that larger inputs are more effective at decreasing their firing rate. Finally, addressing how synchrony of variable PC inputs influences CbN discharge, dynamic clamp studies and simulations showed that input synchronization enhances firing, but that this effect is driven by the total charge of the inhibitory input.
  
  The keystone observation that PC inputs are highly variable is very interesting and convincing, and opens new questions about PC-CbN plasticity. More importantly, the combination of dynamic clamp and simulations is a real strength of the study, allowing the authors to test many combinations of inputs in real cells and to extrapolate their hypotheses in silico. Weaknesses result from the assumptions made in constructing the distribution of inputs and from the many different conditions explored. The organization of the article could make it difficult to read for a non-specialist in cerebellar physiology.
  
  We thank the reviewer for their kind comments. We will revise the manuscript to clarify the assumptions made to construct the distribution of input sizes. We will do our best to revise the manuscript to make it easier for a non-specialist to read.
  
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.25.542308v1
www.biorxiv.org www.biorxiv.org

New submission 17/07/2023, 09:16:47

1
1. Public_Reviews 17 Jul 2023
  
  in eLife
  
  Author Response
  
  We thank the editors and the reviewers for their comments. In response, we plan to revise the manuscript in order to provide the details requested and include additional bioinformatic analysis of the data, along the lines suggested by the reviewers. We will also take into account individual variations among the subjects investigated in this study, and discuss the extent to which factors other than age might contribute to the results. And we will expand the discussion to consider how our results may apply to other cells/tissues and how they relate to other findings in the field.
  
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.19.537477v1
www.biorxiv.org www.biorxiv.org

New submission 14/07/2023, 09:28:16

1
1. Public_Reviews 14 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the current reviews.
  
  We will make some minor changes to address the issues in the revised manuscript during preparation of the Version of Record.
  
  1) Acknowledge the previous discovery that COUP-TFII expression is confined to the ventral hippocampus in early human fetal forebrain (doi: 10.1093/cercor/bhx185).
  
  We agree. We will incorporate the previous discovery that COUP-TFII expression is confined to the ventral hippocampus in early human fetal forebrain (doi: 10.1093/cercor/bhx185) in the discussion section of "COUP-TFII governs the distinct characteristics of the ventral hippocampus".
  
  2) Give some consideration to this observation from my original review "Abnormalities in the trisynaptic circuit. No studies of actual synapses, either physiological or morphological, were carried out. I wonder to what extent these immunohistochemical studies just further reflect the abnormalities in hippocampal morphology presented earlier in the manuscript without specifically telling us about synaptic circuits? Although the immunohistochemical preparations are beautiful, they are inadequate on their own in telling us much about what sort of synaptic circuitry exists in the transgenic animals".
  
  Our data in Figure 4 show clearly that at the neural circuit level, compared with the corresponding control, the trisynaptic circuit is abnormal in all three models; therefore, in the discussion section of "COUP-TF genes are imperative for the formation of the trisynaptic circuit", we will add the following sentence, "We would like to investigate what sort of synaptic circuitry is compromised either physiologically or morphologically in the trisynaptic circuit of individual animal model in detail in the future studies."
  
  In addition, we will correct a reference related to the COUP-TFII gene and congenital heart defects.
  
  The reference of "High, F. A., Bhayani, P., Wilson, J. M., Bult, C. J., Donahoe, P. K., & Longoni, M. (2016). De novo frameshift mutation in COUP-TFII (NR2F2) in human congenital diaphragmatic hernia. Am J Med Genet A, 170(9), 2457-2461. doi:10.1002/ajmg.a.37830" was replaced with "Al Turki, S., Manickaraj, A. K., Mercer, C. L., Gerety, S. S., Hitz, M. P., Lindsay, S., . . . Hurles, M. E. (2014). Rare variants in NR2F2 cause congenital heart defects in humans. Am J Hum Genet, 94(4), 574-585. doi:10.1016/j.ajhg.2014.03.007".
  
  —————
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1(Recommendations For The Authors):
  
  1) Better presentation of the western blot results
  
  We agree with the reviewer. Based on the suggestion, new information about the western blot results has been added in the revised Figure 1Ap. We added a dash to each western blot image to indicate the target band of COUP-TFI (46 kDa), COUP-TFII (45 kDa), and GAPDH (37 kDa), respectively. There were two bands in the blot of COUP-TFII, with the upper band corresponding to mouse IgG at 50 kDa, and the bottom band corresponding to COUP-TFII protein at 45 kDa. Therefore, only the lower bands of COUP-TFII were used for the quantitative analysis. The expression of COUP-TFII in the ventral hippocampus is clearly higher than that in the dorsal hippocampus.
  
  2) Full presentation of the immunohistochemistry and qPCR results at E11.5 and E14.5 in double knockdown mice.
  
  Thanks for the suggestion. Based on the suggestion, we added immunofluorescent data for the double knockout mice at E11.5 in Figure 5Ba-h. Meanwhile, given that it takes time to prepare animal samples at E14.5 for RT-qPCR assays, we performed immunofluorescent assays at both E13.5 and E14.5 to make sure that the changes in Lhx5 and Lhx2 expression in the hippocampal regions between the control and mutant mice were consistent. As shown in the new Figure 5B, consistent with the downregulated expression of Lhx5 transcripts in the double mutant, the expression of the Lhx5 protein was reduced in the CH in the double mutants at E11.5; moreover, the numbers of Lhx5-positive Cajal-Retzius cells decreased in the double mutant embryos at E11.5, E13.5 and E14.5 (Figure 5Ba-d, a’-d’, a’’-d’’, i-l, i’-l’, q-t, q’-t’). Consistent with RT-qPCR data, the expression of Lhx2 was comparable between the control and double-mutant mice at E11.5 (Figure 5Be-h, e’-h’). Interestingly, the expression of the Lhx2 protein was increased in the hippocampal primordium in the COUP-TF double-mutant mice at E13.5 and E14.5 (Figure 5Bm-p, m’-p’, u-x, u’-x’). Please find the altered descriptions on Page 15, lines 347-351, 353-358 and Page 21, lines 500-503 in the revised manuscript.
  
  3) Minor corrections. Lines 159-162, "prospected" is not quite the right word. I would suggest "an ectopic CA-like region was observed medially in the temporal hippocampus in the COUP-TFII mutant, where the prospective posterior part of the medial amygdaloid nucleus (MeP) was situated, indicated by the star (Figure 1Ba-f). The presence of the ectopic CA-like region in the ventral but not dorsal hippocampus of the mutant was further confirmed by the presence of the prospective MeP and amygdalohippocampal area (AHi) in sagittal sections, as indicated by the star." See also line 251. Line 437/438: I would suggest "... most important breakthroughs in understanding the role of the hippocampus in memory."
  
  Thanks for the suggestion. We made the changes based on the suggestion. Please find the amendments on Page 8, lines 178-181; Page 12, lines 270, 276; Page 14, line 318; Page 19, line 451; Page 20, lines 461-462 in the revised manuscript.
  
  Reviewer #2 (Recommendations For The Authors):
  
  1) It is also important to point out that the immunofluorescence data in Figure 5B is contrary to what is known for Lhx5 (it's not expressed in the neocortical and hippocampal vz) and Lhx2 (it's not expressed in the choroid plexus). Authors should explain how their conclusions could align more clearly, and consider the possibility that their results are due to a possible artifact of image setting issues or worse, antibody specificity issues.
  
  Very good point. Based on the comments and suggestions, we first tested another Lhx5 antibody, R&D, Cat # AF6290, in the immunofluorescence assays. Indeed, there was something wrong with the previous Lhx5 antibody, Millipore, Cat # AB5762. With the new Lhx5 antibody, consistent with the reported in situ data, the expression of Lhx5 was detected specifically in the CH at E11.5, and in the Cajal-Retzius cells in the marginal zone of the telencephalon. The same Lhx2 antibody, Santa Cruz, Cat # sc-19344, which has been used successfully in one of our previous studies (Tang et al., Development, 2012) (PMID: 22492355), was used in the present study. We believe that the observations at the MP and DP of the samples are really associated with the expression of Lhx2 protein. We performed new immunofluorescence assays with the new Lhx5 antibody and confirmed with the Lhx2 antibody. As shown in new Figure 5B, consistent with the downregulated expression of Lhx5 transcripts in the double mutant, the expression of the Lhx5 protein was reduced in the CH in the double mutants at E11.5; moreover, the numbers of Lhx5-positive Cajal-Retzius cells decreased in the double mutant embryos at E11.5, E13.5 and E14.5 (Figure 5Ba-d, a’-d’, a’’-d’’, i-l, i’-l’, q-t, q’-t’). Consistent with RT-qPCR data, the expression of Lhx2 was comparable between the control and double-mutant mice at E11.5 (Figure 5Be-h, e’-h’). Interestingly, the expression of the Lhx2 protein was increased in the hippocampal primordium in the COUP-TF double-mutant mice at E13.5 and E14.5 (Figure 5Bm-p, m’-p’, u-x, u’-x’). Please find the changed descriptions on Page 15, lines 347-351, 353-358 and Page 21, lines 500-503 in the revised manuscript.
  
  The reference:
  
  Tang, K., Rubenstein, J. L., Tsai, S. Y., & Tsai, M. J. (2012). COUP-TFII controls amygdala patterning by regulating neuropilin expression. Development, 139(9), 1630-1639. doi:10.1242/dev.075564
  
  2) The expression domain of RxCre remains poorly explained, and the early expression of COUP-TFI and II (E10.5-E12.5) could be considered major weaknesses of the paper.
  
  Thanks for the suggestion. The generation of RXCre was reported by Swindell et al., Genesis, 2006 (PMID: 16850473). Given that the activation of the LacZ expression serves as an indicator for the deletion of the COUP-TFII gene (Tang et al., Development, 2012) (PMID: 22492355), we performed immunofluorescence assays with antibodies against COUP-TFII and LacZ on the sagittal sections of RXCre/+; COUP-TFIIF/+ heterozygous mutant and RXCre/+; COUP-TFIIF/F homozygous mice at E11.5. As shown in the new Figure 1—figure supplement 1Da-f, COUP-TFII was readily detected at the hippocampal primordium of the heterozygous mutant embryo at E11.5 (Figure 1—figure supplement 1Da, c, g); in contrast, the expression of COUP-TFII significantly decreased in the homozygous mutant (Figure 1—figure supplement 1Dd, f, j). In addition, compared with the heterozygous mutant embryo, the LacZ signals increased distinctly in the hippocampal primordium of the homozygous mutant embryo at E11.5 (Figure 1—figure supplement 1Db-c, e-f, h, k), suggesting that RXCre recombinase can efficiently excise the COUP-TFII gene in the hippocampal primordium as early as E11.5. Please find the corresponding changes on Page 7, lines 149-159 and Page 8, lines 160-164 in the revised manuscript.
  
  Meanwhile, we also added the early expression of COUP-TFI and -TFII at E10.5 and E11.5 in new Figure 1—figure supplement 1Aa-d. At embryonic day 10.5 (E10.5), COUP-TFI was detected in the dorsal pallium (DP) laterally and COUP-TFII was expressed in the MP and CH medially (Figure 1—figure supplement 1Aa, b). At E11.5, the expression of COUP-TFII remained in the hippocampal primordium, including MP and CH (Figure 1—figure supplement 1Ac, d). Please find the corresponding changes on Page 6, lines 129-132 and Page 9, lines 202-203 in the revised manuscript.
  
  The references:
  
  Swindell, E. C., Bailey, T. J., Loosli, F., Liu, C., Amaya-Manzanares, F., Mahon, K. A., . . . Jamrich, M. (2006). Rx-Cre, a tool for inactivation of gene expression in the developing retina. Genesis, 44(8), 361-363. doi:10.1002/dvg.20225
  
  Tang, K., Rubenstein, J. L., Tsai, S. Y., & Tsai, M. J. (2012). COUP-TFII controls amygdala patterning by regulating neuropilin expression. Development, 139(9), 1630-1639. doi:10.1242/dev.075564
  
  Reviewer #3 (Recommendations For The Authors):
  
  1) Regarding the RxCre line, I was also confused about its spatiotemporal expression, as this line is not a commonly used Cre line and no detailed description is provided in the manuscript. Searching this line shows a previous paper by the authors (PMID: 22492355) in which they tested the RxCre recombinase activity. At E12.5, RxCre induced high LacZ expression in the ventral telencephalon but much less in the dorsal telencephalon. But they did not check later stages. Therefore, it's hard to explain the defective dorsal hippocampus in RxCre, CFI CKO. They should check later stages.
  
  The generation of RXCre was reported by Swindell et al., Genesis, 2006 (PMID: 16850473), which reveals high Cre recombinase activity of RXCre in the eye and ventral telencephalon. Given that the activation of the LacZ expression serves as an indicator for the deletion of the COUP-TFII gene (Tang et al., Development, 2012) (PMID: 22492355), we performed immunofluorescence assays with antibodies against COUP-TFII and LacZ on the sagittal sections of RXCre/+; COUP-TFIIF/+ heterozygous mutant and RXCre/+; COUP-TFIIF/F homozygous mice at E11.5. As shown in new Figure 1—figure supplement 1D, compared with the heterozygous mutant embryo, the expression of COUP-TFII was significantly decreased in the homozygous mutant; in addition, the LacZ signals evidently increased in the hippocampal primordium of the homozygous mutant embryo at E11.5, suggesting that RXCre recombinase can efficiently excise the target gene in the hippocampal primordium as early as E11.5. The expression of COUP-TFI is barely detectable in the early developing hippocampal primordium including MP at E10.5, E11.5 and E12.5. The expression of COUP-TFI is high in the MP of the control (Figure 1Cj, l); in contrast, the COUP-TFI expression is barely detectable in the MP of the homozygous double mutant at E14.5, indicating that RXCre can efficiently delete the COUP-TFI gene in the hippocampal primordium at E14.5. The loss of the COUP-TFI gene in the MP as early as E14.5 by RXCre initiates the defective dorsal hippocampus in RXCre/+; COUP-TFIF/F knockout mice.
  
  2) Authors should check and review extensively for improvements to the use of English.
  
  We carefully checked and made changes throughout the manuscript accordingly. For example, “imperative” was used 6 times in the previous manuscript, lines 20, 255, 486, 499, 522, 553; “imperative” was used only once in Page 22, line 522 in the revised manuscript.
  
  3) Please correct the manuscript; 1-month-old mice are not adult mice.
  
  Thanks for the suggestion. Based on the suggestion, we have corrected related words and sentences in the manuscript. Please find the amendments in the revised manuscript (Page 7, line 146; Page 9, lines 203-204; Page 10, line 213; Page 13, lines 299-300; Page 17, line 406; Page 20, line 476).
  
  4) Additional ref should be added at line 93 on page 5.
  
  Based on the suggestion, we added some new references (Bertacchi et al., EMBO J, 2020) (PMID: 32572460); (Del Pino et al., Cereb Cortex, 2020) (PMID: 32484994); (J. Feng et al., Sci Adv, 2021) (PMID: 34215582) at line 96 on page 5.
  
  The references:
  
  Bertacchi, M., Romano, A. L., Loubat, A., Tran Mau-Them, F., Willems, M., Faivre, L., . . . Studer, M. (2020). NR2F1 regulates regional progenitor dynamics in the mouse neocortex and cortical gyrification in BBSOAS patients. EMBO J, 39(13), e104163. doi:10.15252/embj.2019104163
  
  Del Pino, I., Tocco, C., Magrinelli, E., Marcantoni, A., Ferraguto, C., Tomagra, G., . . . Studer, M. (2020). COUP-TFI/Nr2f1 Orchestrates Intrinsic Neuronal Activity during Development of the Somatosensory Cortex. Cereb Cortex, 30(11), 5667-5685. doi:10.1093/cercor/bhaa137
  
  Feng, J., Hsu, W. H., Patterson, D., Tseng, C. S., Hsing, H. W., Zhuang, Z. H., . . . Chou, S. J. (2021). COUP-TFI specifies the medial entorhinal cortex identity and induces differential cell adhesion to determine the integrity of its boundary with neocortex. Sci Adv, 7(27). doi:10.1126/sciadv.abf6808
  
  5) I am confused why the authors analyzed 1-month-old mice in some instances but 3-month-old mice in others.
  
  The RXCre/+; COUP-TFIF/F; COUP-TFIIF/F double mutant mice barely survived beyond 3 weeks after birth. To keep our findings consistent and comparable, we mainly prepared figures from observations on mice of about 1 month of age in the RXCre-related single and/or double gene mutant mouse models. In the study of the Emx1Cre-related COUP-TFI mouse model, behavioral tests such as the Morris water maze test required adult animals, so experiments were performed at about 3 months of age. To be consistent with the stage of the mice used for the behavioral tests, we displayed morphological data only from the control and Emx1Cre/+; COUP-TFIF/F mutant mice at about 3 months of age.
  

URL

biorxiv.org/content/10.1101/2023.02.17.528915v3
www.biorxiv.org www.biorxiv.org

New submission 14/07/2023, 09:25:17

1
1. Public_Reviews 14 Jul 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 We thank the reviewers for their comments. We have now addressed all the comments in a revised version of the manuscript, which we believe has strengthened our paper.
 
 1) Introduction LINE 60: the authors cite Funato et al 2016 as the paper first describing a role for Sik3 in sleep regulation. In fact, the role for this kinase was first identified nearly a decade earlier in C. elegans (Van der Linden et al, Genetics 2008 PMID 18832350).
 
 Thank you for pointing us to this reference. Van der Linden et al. demonstrated that the C. elegans homolog of Sik3 (KIN-29) regulates satiety quiescence, in which worms stop moving following feeding on high quality food. However, as pointed out in Trojanowski and Raizen “Call it Worm Sleep” (2016), not all of the behavioral criteria for sleep have been applied to C. elegans satiety quiescence, and we cannot find any references that unequivocally demonstrate satiety quiescence is a sleep state. As McClanahan et al. (2020) show, quiescent states following mild sensory arousal do not fulfill the sleep criteria of changes in arousal threshold and homeostatic regulation, so not all quiescent states in C. elegans are sleep. Then again, Grubbs et al. (2020) do demonstrate that KIN-29 regulates both developmentally timed and stress-induced sleep states in worms, suggesting that the observations of Van der Linden et al. were ahead of their time and that these behavioral states are possibly inter-related. We believe, though, that our line “the roles of… SIK3 kinase in modulating sleep homeostasis in mice (Funato et al. 2016) were identified in genetic screens” remains accurate.
 
 2) Introduction LINE 71: remove the word "known" from "...while some known human sleep/wake regulators, such as the..."
 
 Good idea. Done.
 
 3) I was confused regarding Supplemental data 1 describing the genes they targeted with their forward genetic screen. Am I understanding correctly from the "Summary stats" tab that 702 fish lines with virus insertions were screened behaviorally? In Figure S1, it looks like about 60 are shown in the histograms but in the text (in the Discussion) they say 25 were screened. Were all the genes listed under the Excel tabs (GPCRs, channels, etc) tested? Or was just a subset tested? Where are the sleep data for these lines? Negative results may be relevant to their manuscript since they listed (tested??) a number of ion channel genes under tab "channels" which appear to NOT have a sleep phenotype.
 
 We apologize for the confusion on these points. As highlighted in the legend to Supplementary Figure S1, we had planned a screening strategy with the following pipeline: Candidate mammalian gene → Zebrafish ortholog → ID viral insertion from “Zenemark” library → grow viral insertion lines from frozen sperm → phenotype F3 heterozygous and homozygous mutant generation. Unfortunately, the company, Znomics, which held the Zenemark library, could not reliably reconstitute the correct live fish from the sperm library, and of the 702 lines we planned to screen, we could screen only 26 (the “25” in the text was a typo). We treated heterozygous and homozygous animals for each line independently, for a total of 52 screened lines in the histograms.
 
 To make this clearer, we have edited the main text as follows (lines 104-105): “For screening, we identified zebrafish sperm samples from the Zenemark collection (Varshney et al., 2013) that harboured viral insertions in genes of interest and used these samples for in vitro fertilization and the establishment of F2 families, which we were able to obtain for 26 lines.” And lines 111-112: “While most screened heterozygous and homozygous lines had minimal effects on sleep-wake behavioural parameters (Figure S1B-S1C),”
 
 We believe it is important to include the full set of Supplementary Data 1, even though the vast majority of these candidate lines were not tested.
 
 4) Results LINE 117: remove the word "prominent", which is subjective, from the sentence "...showed a prominent decrease in sleep during the..."
 
 Good point. Done.
 
 5) LINES 185-186: did you see any circadian variation in your dmist:GFP protein abundance or localization? Protein trafficking has been described as a mechanism of circadian regulation of excitability.
 
 For practical reasons, we imaged the membrane localization of Dmist:GFP in plasmid-injected embryos at 90% epiboly, which is about 9 hours after fertilization and when the cells remain large and in a relatively flat epithelium. Thus, we could not follow circadian fluctuations in abundance or localization. For circadian studies, we believe the best method will be to raise an antibody that recognizes Dmist.
 
 6) LINE 203: does the GFP-tagged Dmist rescue the loss-of-function phenotype? This is relevant to Figure 2E. It is also relevant to the issue of structure-function. If it rescues, then the C-terminus may not be essential to protein function.
 
 As noted, for practical reasons, we observed Dmist-GFP only transiently at early stages of development, expressed using a strong, ubiquitous promoter. A rescue experiment is a good idea for future experiments, where we carefully control the expression of Dmist in neurons.
 
 7) LINE 220: explain what you mean by "...consistent with nonsense-mediated decay." and/or give a reference.
 
 In zebrafish and other species including humans, mutant transcripts that have premature stop codons often undergo “nonsense mediated decay”, whereby the expression levels are largely reduced (Wittkopp et al., 2009). In the zebrafish community, this is often used as secondary evidence of a loss of function mutation, as relatively few antibodies are available to directly observe zebrafish proteins. We have added a reference that describes this phenomenon (Wittkopp et al., 2009).
 
 8) LINE 225: define "LME model"
 
 Now reads: “Linear mixed effects (LME).”
 
 9) LINES 227-229: could the vir/vir phenotype be explained by specific effects on protein structure? could vir/vir be a gain-of-function allele?
 
 We can’t rule this out formally, and vir/+ animals do show some sleep phenotypes, albeit weaker than those of vir/vir animals (Figure 1G). However, it is not uncommon for heterozygous mutants to show significant phenotypes that are weaker than those of their homozygous mutant siblings, and the strong suppression of dmist expression by the viral insertion (which is located in the dmist intron) is more consistent with a hypomorphic loss-of-function phenotype for the vir allele.
 
 10) LINES 229-230: I don't quite follow the argument for pursuing further studies only of i8/i8. i8/i8 seems to also be a hypomorphic allele based on your qPCR data.
 
 First, the dmist viral line was generated by an insertional mutagenesis method followed by sequencing, and each line has multiple other inserts in a background that does not match the background of the other animals reported in this paper. Second, the dmist vir allele is an insertion in the intron, leading to reduced expression but not a complete loss. In contrast, the i8 allele was generated on the same background strain as our other existing and newly reported lines. Moreover, our i8 line is likely a loss-of-function allele and not a hypomorph. Yes, dmist expression is reduced in the i8 allele; however, this is likely due to nonsense-mediated decay of dmist mRNA. The mutation introduces a frameshift in the dmist coding sequence, and as a result the amino acid sequence of the protein is altered after the N-terminal signal sequence.
 
 11) LINES 241-243: grammar.
 
 Fixed
 
 12) LINE 245: define "JackHMMR iterative search"
 
 We’ve added the phrase: “and seeding a hidden Markov model iterative search (JackHMMR)”
 
 13) LINE 246 is missing the word "we" prior to "...found distant homology between..."
 
 Added
 
 14) LINE 301: show data demonstrating deviation from Mendelian ratios. Also, comment on meaning of such data (embryonic lethality??).
 
 We have added these data at line 301:
 
 “atp1a3b mutant larvae were not obtained at Mendelian ratios (55 wild type [52.5 expected], 142 [105] atp1a3b+/-, 13 [52.5] atp1a3b-/-; p<0.0001, Chi-squared) suggesting some impact on early stages of development leading to lethality.”
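 The arithmetic behind the quoted test can be reproduced with a short sketch. It assumes a goodness-of-fit test of the three genotype counts against a 1:2:1 Mendelian ratio with 2 degrees of freedom; the response does not state which software or exact procedure was used, and the function name below is hypothetical. For 2 degrees of freedom the chi-squared upper-tail probability has the closed form exp(-x/2), so only the standard library is needed.

```python
import math

# Genotype counts quoted in the response (wild type, heterozygous, homozygous mutant).
observed = {"+/+": 55, "+/-": 142, "-/-": 13}
# Assumed Mendelian expectation for a heterozygous incross: 1:2:1.
ratios = {"+/+": 1, "+/-": 2, "-/-": 1}


def mendelian_chi_squared(observed, ratios):
    """Goodness-of-fit chi-squared against fixed ratios (hypothetical helper).

    The p-value formula is valid only for three classes (2 degrees of freedom).
    """
    total = sum(observed.values())
    ratio_sum = sum(ratios.values())
    expected = {g: total * r / ratio_sum for g, r in ratios.items()}
    chi2 = sum((observed[g] - expected[g]) ** 2 / expected[g] for g in observed)
    # Survival function of the chi-squared distribution with 2 df is exp(-x/2).
    p_value = math.exp(-chi2 / 2)
    return expected, chi2, p_value


expected, chi2, p_value = mendelian_chi_squared(observed, ratios)
print(expected)        # expected counts per genotype
print(round(chi2, 2))  # test statistic
print(p_value)         # upper-tail probability
```

 Under these assumptions the expected counts match the bracketed values in the quoted sentence (52.5, 105, 52.5), and the resulting p-value is well below the reported threshold of 0.0001.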
 
 15) Discussion LINES 362-372: This paragraph seems to be of only tangential relevance to the paper. Consider removing.
 
 Our screening strategy was a large-scale reverse genetic screen, but the number of lines was limited by the technical issues described above. We think it is important to mention that the strategy, if employed today, could benefit from newer technologies.
 
 16) Discussion. Another model is that Dmist and NaK pump have a developmental effect. Arguing against this developmental model is the ouabain experiment.
 
 This is an important point. We’ve added the following at lines 454-457: “We also cannot exclude a role for Dmist and the Na+/K+ pump in developmental events that impact sleep, although our observation that ouabain treatment, which inhibits the pump acutely after early development is complete, also impacts sleep, argues against a developmental role.”
 
 17) FIGURE 1G: Are these significance cut offs corrected for multiple comparisons?
 
 Yes, all the data is corrected for multiple comparisons.
 
 18) Performing neuronal activity measures, either via neural activity imaging or phospho-ERK labeling in different mutants at day or night conditions, to determine whether baseline neuronal activity brain-wide or in specific brain regions is altered.
 
 These are excellent experiments that we plan to perform in the future.
 
 19) Please check all Figure numbers for accuracy.
 
 We have double checked these.
 
 20) The authors emphasize the role of increased cellular sodium, but equally plausibly, the phenotypes could be due to decreased cellular potassium. The potassium channel shaker has been previously identified as a critical sleep regulator in Drosophila.
 
 We completely agree. We would like to highlight that we did devote an entire paragraph to the possibility of changes in extracellular potassium in the discussion: “A third possibility is that Dmist and the Na+,K+-ATPase regulate sleep not by modulation of neuronal activity per se but rather via modulation of extracellular ion concentrations. Recent work has demonstrated that interstitial ions fluctuate across the sleep/wake cycle in mice. For example, extracellular K+ is high during wakefulness, and cerebrospinal fluid containing the ion concentrations found during wakefulness directly applied to the brain can locally shift neuronal activity into wake-like states (Ding et al., 2016). Given that the Na+,K+-ATPase actively exchanges Na+ ions for K+ , the high intracellular Na+ levels we observe in atp1a3a and dmist mutants is likely accompanied by high extracellular K+. Although we can only speculate at this time, a model in which extracellular ions that accumulate during wakefulness and then directly signal onto sleep-regulatory neurons could provide a direct link between Na+,K+ ATPase activity, neuronal firing, and sleep homeostasis. Such a model could also explain why disruption of fxyd1 in non-neuronal cells also leads to a reduction in night-time sleep.”
 
 We also agree that Shaker may be an important component of this sleep regulatory mechanism. Indeed, we previously showed that another potassium channel in zebrafish regulates sleep (Rihel et al., 2010).
 
 We have emphasized sodium homeostasis in our title and paper only because we were able to directly observe intracellular sodium levels, so we are confident that these have been altered in our mutants. We can only presume that potassium levels have also been altered, but we could not directly observe this.
 
 21) The similar phenotype between dmist and Fxyd1 in sleep reduction yet very different expression patterns, with dmist being mostly neuronal while fxyd1 being mostly non-neuronal, raise many possible questions: 1) are the sleep phenotypes due to neuronal Na/K imbalance? Or 2) Are the sleep phenotypes due to extracellular Na/K imbalance? Or 3) both? Some feasible experiments may help achieve a better mechanistic understanding of the observed sleep defects.
 
 Yes, we think these are excellent studies for future work. As noted in the previous point (20), we did discuss the possibility that changes to extracellular potassium might be a parsimonious explanation for the similar phenotypes of fxyd1 and dmist mutants.
 
 Future experiment suggestions (not required)
 
 1) Perform a double mutant analysis of fxyd1 and atp1a3a, to determine whether an epistatic relationship similar to that of dmist and atp1a3a is observed in the case of fxyd1 and atp1a3a.
 
 This is a great experiment that we will do in the future. Unfortunately, the fxyd1 mutant had been sperm frozen during the COVID-19 pandemic, so we cannot do this experiment at this time.
 
 2) Given the differences in the sleep phenotypes between vir/vir and i8/i8 mutants, it would be informative to see the phenotype of the vir/i8 trans-heterozygote.
 
 This is also a good experiment to perform in the future. Since obtaining the cleaner i8 allele, the dmist vir/vir lines were sperm frozen.
 

URL

biorxiv.org/content/10.1101/2020.11.18.388736v4
www.biorxiv.org www.biorxiv.org

New submission 11/07/2023, 12:45:51

1
1. Public_Reviews 11 Jul 2023
  
  in eLife
  
  Author Response:
  
  We would like to thank the eLife reviewers for the considerable time and effort they have invested to review these manuscripts. We have also benefited from a previous round of review of the manuscript describing the proposed burial features, which underwent two rounds of revisions in a high-impact journal over a period of approximately 8 months during 2022 and early 2023. Both sets of reviews have reflected mixed responses to the evidence we have presented, with one reviewer recommending acceptance with minor editorial revisions, two recommending acceptance with minor revisions, and the fourth recommending rejection based upon arguments similar to those reflected by some of the reviewers in this current round of reviews in eLife. Ultimately the managing editor of this first journal took the decision that the review process could not be completed in a timely manner and rejected the manuscript, although the submission here reflected our consideration of these reviewers’ suggestions.
  
  We have chosen in this initial response to the eLife reviews to include some references to the previous anonymous reviews in order to illustrate differences of opinion and differences in revision suggestions within the review process. Our goal is to offer maximal insight into our decision-making process and to acknowledge the considerable time and effort put into the assessment of these manuscripts by reviewers (for eLife and in the case of the earlier review process). We hope that this approach will assist the readers, and reviewers, of our manuscripts in understanding why we are proceeding with certain decisions during the revision process.
  
  This is a new process for us and the reviewers, and one way in which it significantly differs from more traditional review is that both the reviews and our reply will be public well in advance of our revisions to the manuscript. Indeed, considering the scope of the reviews, some of those revisions may take considerable time, although many can be accomplished fairly easily. Thus, we are not in a position to say that we have solved every issue raised by the reviewers. Instead, we will examine what appear to be the key critical issues raised regarding the data and the analyses and how we propose to address these as we revise the papers. We will also address several philosophical and ethical issues raised by the reviews and our proposal for dealing with these. More specific editorial and citational recommendations will be dealt with on a case-by-case basis, and we do not address these point-by-point in this reply. Please note, this response to the reviewers is not the revision of the manuscript and is only the initial opinion of the corresponding authors with some guidance from the larger group of authors of all three papers. Our final submitted revision will reflect the input of all authors included on those submissions.
  
  We took the decision to submit three separate papers consciously. The two different categories of evidence, burials and engravings, involve different kinds of analysis and different (although overlapping) teams of researchers, and we recognized that each deserved its own presentation and assessment. Meanwhile, together they inform the context of H. naledi in a way that requires some synthetic discussion, in which both kinds of evidence are relevant, leading to a third paper. But the mutual relevance of these different kinds of evidence and their review by a common set of reviewers naturally raises cross-cutting issues, and the reviewers have cross-referenced the three articles. This has sometimes led to suggestions about one manuscript based on the contents of another. Considering the situation, we accepted the recommendation that it would be clearer to consider all three articles in a single reply. Thus, while each of the three papers will proceed separately during the revision process, it will occasionally be necessary to refer across all three papers in our responses.
  
  Scientific Issues:
  
  In reading the reviews, we feel there are 9 critical points/assertions raised by one or more of the reviewers that present a problem for, or challenge to, our hypothesis that the observed evidence (bone accumulations and engravings) described in the Dinaledi subsystem are of intentional naledigenic origin. These are:
  
  The evidence presented does not demonstrate a clear interruption of the floor sediments, thus failing to demonstrate excavated holes.
  
  The sediments infilling the holes where the skeletal remains are found have not been demonstrated to originate from the disruption of the floor sediments and thus could be part of a natural geological process (e.g. water movement, slumping) or carnivore accumulations.
  
  Previous geological interpretations by our research group have given alternative geological explanations for formation of the bony accumulations that contradict the present evidence presented here and result in alternative origins hypotheses.
  
  Burial cannot be effectively assessed without complete excavation of the features and site.
  
  The skeletal remains as presented do not conform clearly to typical body arrangement/positions associated with human (Homo sapiens) burials.
  
  There is no evidence of grave goods or lithic scatters that are typically associated with human burials.
  
  Humans may have been involved with the creation of either the Homo naledi bone accumulations, the engravings, or both.
  
  Without a date of the engravings, the null hypothesis should be the engravings were created by Homo sapiens.
  
  The null hypothesis for explanation of the skeletal remains in this situation should be “natural accumulation”.
  
  Our analysis of the Dinaledi Feature 1 leads us to accept that the laminated orange-red mudstone (LORM) sedimentary layer is interrupted, indicating a non-natural intervention, and that the hole created by the interruption was then filled by a fleshed body (and perhaps parts of other bodies), which was then covered by sediment that originated from the hole that was dug. We recognize that the four eLife reviewers are not convinced that our presentation is sufficient to establish this. Interestingly, this was not the universal opinion of earlier reviewers of the initial manuscript, several of whom felt we had adequately supported this hypothesis. The lack of clarity in this current version of the burial manuscript is our responsibility. In the upcoming revision of this paper, we will take the reviewers’ critiques to heart and add figures that better illustrate the disruption of the LORM and clarify the sedimentological data showing that the material covering the skeletal remains in the hole is the disrupted sediment excavated from the same hole. We are proposing to isolate this most critical evidence for burial into a separate section in the revised submission based on the reviewers’ comments. The fact that the LORM layer is disrupted, a fleshed body was placed in the hole created by this disruption, and the body (and perhaps parts of other bodies) was/were then covered by the same sediments from the hole is the central feature of our hypothesis that the bone accumulations observed reflect a burial and not a natural process.
  
  The possibility of fluvial transport or involvement in the subsystem is a topic that we have addressed extensively in past work, and it is clear from these reviews that we must enhance our current manuscript to discuss this issue at greater length. Our previous work (Dirks et al. 2015; Dirks et al. 2017) emphasized that fluvial transport of whole bodies into the subsystem was precluded by several lines of sedimentological evidence. We excavated a rich accumulation of skeletal remains, including articulated limbs and other elements in subvertical orientations inconsistent with slow sedimentary infill, which were difficult to explain without positing either a large and dense pile of bodies and/or sediment movement. We encountered fractured chunks of laminated orange-red mudstone (LORM) in random orientations within our excavation area, within and among skeletal remains, which directly refuted that the remains were inundated with water at the time of burial, and this limited the possibility of fluvial transport. Water flow sufficient to displace bodies or complete skeletal evidence would also transport large and coarse sediment, which is absent from the subsystem, and would sort the commingled skeletal material that we found by size, which we do not observe. But our excavation only covered less than a square meter at very limited depth, and this was the limit to our knowledge of subsurface sediment. We thus were left with uncertainty that led us to suggest the possibility of sediment slumping or movement into subsurface drains, although these were not observed near our excavation. Our current work expands our knowledge of the subsurface and presents an alternative explanation for the disposition of skeletal remains from our earlier excavation. But we acknowledge that this new explanation is vulnerable to our own previous published proposals, and we must do a better job of explaining how the new information addresses our previous suggestions. By not clearly creating a section where we explained how these previous hypotheses were now nullified by new evidence, we clearly confused the reviewers with our own previous work. We will revise the manuscript by enhancing the review of the significant geological evidence demonstrating that there is no significant fluvial action in the system and making it clear how the burial hypothesis provides a clearer explanation for the situation of skeletal remains from our previous excavation work.
  
  One of the central issues raised by reviewers has been a perceived need to excavate these features completely, totally exhuming all skeletal remains from them. Reviewers have written that it is necessary to identify every skeletal element that is present and account for any missing elements. On this point, we have both ethical and scientific differences from these reviewers. We express our ethical concerns first. Many of the best-preserved possible burials ever discovered by archaeologists were subjected to total excavation and exhumation. Cases like La Chapelle-aux-Saints, La Ferrassie, and Skhūl were fully excavated at a time when data recording and excavation methods did not include the range of spatial and geomorphological approaches that later became routine. The judgment of early investigators that these situations were intentional burials was challenged by later workers, and the kind of information that might enable better tests had been irrevocably lost (Gargett 1999; Dibble et al. 2015; Rendu et al. 2014).
  
  Later, improved excavation standards have not sufficed to remove uncertainty or debate about possible burials. For example, it was long presumed that well-preserved remains of young children were by themselves diagnostic of intentional burial, such as those from Dederiyeh, Border Cave, or Roc de Marsal. Such cases were also fully excavated, with adequate documentation of the positioning of skeletal remains and their surrounding stratigraphic situation, but such cases were later challenged on several bases and the complete exhumation of material has confused or precluded testing of new hypotheses (e.g. Gargett 1999). The case of Roc de Marsal is one in which data from the initial excavation combined with re-excavation and geoarchaeological analysis led to a naturalistic interpretation of the skeletal material (Sandgathe et al. 2011; Goldberg et al. 2017). But even in this case, the researchers erred in their interpretation of the skeleton’s situation due to a lack of identification of parts of the infant’s skeleton (Gómez-Olivencia and García-Martinez 2019). That is to say, it is not only the burial hypothesis but other hypotheses that suffer from complete excavation. Researchers concerned with preserving all possible information have sometimes taken extraordinary measures to remove and study possible burials at high resolution in the laboratory. Such was the case of the Shanidar IV burial, removed from the site and transported in a plaster jacket by Solecki, which led to the disruption and loss of internal stratigraphic information (Pomeroy et al. 2020). Arguably, the current state of the art is full excavation with partial preparation, such as that undertaken at Panga ya Saidi (Martinón-Torres et al. 2021). But again, any future attempt to reinterpret or test the hypothesis of burial must rely on the adequacy of documentation as the original context has been removed.
  
  In our decision to leave material in place as much as possible, we are expanding upon standard practice to leave witness sections and unexcavated areas for future research. The situation is novel, representing possible burials by a nonhuman species, and that makes it doubly important in our opinion to be conservative in not fully exhuming the skeletal material from its context. We anticipate that many other researchers, including future investigators, will suggest additional methods to further test the hypothesis of burial, something that would be impossible if we had excavated the features in their entirety prior to publishing a description of our work. We believe strongly that our ethical responsibility is to publish the work and the most likely interpretation while leaving as much evidence in place as possible to enable further testing and replication. We welcome the suggestions of additional methods/analyses to test the H. naledi burial hypothesis.
  
  This being said, we also observe that total exhumation would not resolve the concerns raised by the reviewers. The recommendation of total exhumation is in pursuit of a full account of all skeletal material present and its preservation and spatial situation, in order to demonstrate that they conform to body positions comparable to human burials. As has been highlighted in forensic casework, the excavation of an inhumation feature does not necessarily provide an accurate spatial or anatomical manifest of the stratigraphical relationships between the body, encapsulating matrix, and any cut present due to preservational, taphonomic and operational factors (Dirkmaat and Cabo, 2016; Hunter, 2014). In particular, in cases where skeletal elements are highly fragmented, friable, or degraded (such as through bioerosion) then complete excavation—even under controlled laboratory conditions—may destroy bone and severely limit skeletal identification (Henderson, 1997; Hochrein, 2002; Owsley and Compton, 1997), particularly in elements where the ratio of trabecular to cortical bone is high (Darwent and Lyman, 2002; Lyman, 1994). As such, non-invasive methods of 3D and 4D modelling (preservation in situ) are often considered preferable to complete necropsy or excavation (preservation by record) where appropriate (Bolliger and Thali, 2009; Dell’Unto and Landeschi, 2022; Randolph-Quinney et al., 2018; Silver, 2016).
  
  The test of burial is not primarily positional, but taphonomic and geological. The position and number of bones can elaborate on process-driven questions of decay and destruction in the burial environment, or post-mortem modification, but are not singularly indicative of whether the remains were intentionally buried – the post-mortem narrative of all the processes affecting the cadaveric island is required (Knüsel and Robb, 2016). In previous cases, researchers have disputed or accepted the hypothesis of intentional hominin burial based upon assumptions about how modern humans or Neandertals would have positioned bodies, with the idea that some positions reflect ritual intent while others do not. But applying such assumptions is unjustifiable, particularly for a species like H. naledi, whose culture may have differed fundamentally from our own. Our work acknowledges that the present evidence does not enable a full reconstruction of the burial positions, but it does show that fleshed remains were encased in sediment prior to decomposition of soft tissue, and that subsequent spatial changes can be most parsimoniously explained by natural decomposition within sedimentary matrix contained within a burial feature (after Green, 2022; Mickleburgh and Wescott, 2018; Mickleburgh et al., 2022). If the argument is that extraordinary claims require extraordinary evidence, we feel that the evidence documents excavation and interment (and will do so more clearly in the revision), and the fact that the remains do not match a “typical” human burial in body positioning is not in itself evidence that these are not H. naledi burials.
  
  We feel that the reviewers (in keeping with many palaeoanthropologists) have a clear idea of what they “think” a burial should look like in an idealised sense, but this platonic ideal of burial form is not matched by the extensive literature in archaeothanatology, funerary archaeology and forensic science which indicates enormous variability in the activity, morphology and post-mortem system experienced by the human body in cases of interment and body disposal (e.g. Aspöck, 2008; Boulestin and Duday, 2005 and 2006; Connolly et al., 2005; Channing and Randolph-Quinney, 2006; Cherryson, 2008; Donnelly et al., 1999; Finley, 2000; Hunter, 2014; Parker Pearson, 1999; Randolph-Quinney, 2013). Decades of experience in the identification, recovery and interpretation of clandestine, deviant, and non-formal burials indicate the platonic ideal is rare, and in many contexts, the exception (Cherryson, 2008; Parker Pearson, 1999). This variability is particularly relevant to morphological traits in burial context, such as the informal nature of the grave cut in plan and section, shallow burial depth, and initial disposition of body (placement) during the early post-mortem period. These might run counter to the expectations of reviewers or others referencing the fossil hominin record, but are well accepted within the communities of researchers investigating Holocene archaeological sites and forensic contexts.
  
  It is encouraging to see reviewers beginning to incorporate the extensive (often experimentally derived) literature from archaeothanatology and forensic taphonomy in their deliberations, and we will be taking these comments on board going forward. In particular, we acknowledge reviewers’ comments and the need to construct a more detailed post-mortem narrative, accounting for joint disarticulation (labile versus persistent joints etc), displacement, and final disposition of elements within the burial space. As such we will incorporate the hierarchy of decomposition (rank order disarticulation), associations between regions of anatomical association, areas of disassociation, and the voids produced during decomposition (after Mickleburgh and Wescott, 2018; Mickleburgh et al., 2022) into our narrative. In doing so we acknowledge the tensions between the inductive archaeothanatological narrative-driven approach (e.g. Duday, 2005 & 2009) versus robust decomposition data derived from human forensic taphonomic experimentation recently articulated by Schotsmans and colleagues (2022) - noting that we will highlight comparative data based on forensic experimental casework and actualistic modelling over inductive intuitive approaches which come with significant evidential shortcomings (Bristow et al. 2011).
  
  Finally, from a taphonomic perspective it is worth pointing out to reviewers that we have already addressed the issue of lack of taphonomic evidence for carnivore involvement in the formation of the Dinaledi assemblage (Dirks et al., 2016). Absence of any carnivore-induced bone surface modifications, patterns of skeletal part representation, and a total absence of any carnivore remains found within the Dinaledi Chamber (following Kuhn and colleagues, 2010) lead us to reject carnivores as possible vectors of body accumulation within the Dinaledi Chamber and Hill Antechamber.
  
  Reviewers suggest that without a date derived from geochronological methods, the engravings cannot be associated with H. naledi, and that it is possible (or probable) that the engravings were done in the recent past by H. sapiens. This suggestion neglects the context of the site. We have previously documented the structure and extremely limited accessibility of the Dinaledi subsystem. This subsystem was not recorded on maps of the documented Rising Star Cave system prior to our work and its discovery by our teams. Furthermore, there is no evidence of prehistoric human activity in the areas of the cave related to possible subterranean entrances. There is no evidence that humans in the past typically ventured into such extreme spaces like those of Rising Star. It is clear from the presence of the remains of many individuals that H. naledi ventured into these spaces again and again. It is likely that H. naledi moved through these spaces more easily than humans do based on their physique. We show that the engravings overlay each other, suggesting multiple engraving events. These engravings took time and effort, and the only evidence for use of the Dinaledi subsystem by any hominin is by H. naledi. The context leads to the null hypothesis that H. naledi made the marks. In our revision, we will elaborate on this argument to clarify the evidence for our stance on this hypothesis. Several reviewers took issue with the title of the engraving paper as we did not insert a qualifier in front of the suggested date range for the engravings. We deliberately left out qualifying language so that the title took the form of a testable hypothesis rather than a weak assertion. Should future work find the engravings were not produced within this time range, then we will restate this hypothesis.
  
  Finally, with regard to the engravings, we have chosen to report them because they exist. Not reporting the presence of engraved marks on the walls of a cave above hypothesized burials would be tantamount to leaving relevant evidence out of the description of an archaeological context. We recognize and state in our manuscript that these markings require substantial further study, including attempts at geochronological dating. But the current evidence is clearly relevant to the archaeological context of the subsystem. We take a similar stance with reporting the presence of the tool-shaped artefact near the hand of the H. naledi skeleton in the Hill Antechamber. It is evident that this object requires further study, as we stated in our manuscript, but again omitting it from our study would be leaving out relevant evidence.
  
  Some have suggested that the null hypothesis should be that all of these observed circumstances are of natural origin. Our team took this approach in our early investigation of the Dinaledi subsystem (Dirks et al. 2015). We adopted the null hypothesis that the geological processes involved in the accumulation of H. naledi skeletal remains were “natural” (e.g., non-naledigenic involvement), and we were able to reject many alternative explanations for the assemblage, including carnivore accumulation, “death trap” accumulation, and fluvial transport of bodies or bones (Dirks et al. 2015). This led us to the hypothesis that H. naledi were involved in bringing the bodies into the spaces where they were found. But we did not hypothesize their involvement in the formation of the deposit itself beyond bringing the bodies to the location.
  
  This approach seems conservative. It followed the traditional view that small-brained hominins do not engage in cultural practices. But we recognize in hindsight that this null hypothesis approach did harm to our analyses. It impeded us from recognizing within our initial excavations of the puzzle box area and other excavations between 2014 – 2017 that we might be encountering remains that were intrusive in the sedimentary floor of the chamber. If we had approached the accumulation of a large number of hominins from the perspective of the null hypothesis being that the situation was likely cultural, we perhaps would have collected evidence in a slightly different manner. We certainly note that if the Dinaledi system had been full of the remains of modern humans, there would have been little doubt that the null hypothesis would have been that this was a cultural space and not a “natural space”. We therefore respectfully disagree with the reviewers who continue to support the idea that we should approach hominin excavations with the null hypothesis that they will be natural (specifically non-cultural) in origins. If excavations continue with this mindset we believe that potential cultural evidence is almost certain to be lost.
  
  There has been a gradient across paleoanthropological excavations, archaeological work, and forensic investigation, with increasing precision of context. The reality is that the recording precision and frame of approach is typically different in most paleontological excavations than in those related to contemporary human remains. If anything comes from the present discussion of whether the Dinaledi system is a burial site for H. naledi or not, we hope that by taking seriously the possibility of deep cultural dynamics of hominins, we will encourage other teams to meet the highest standards of excavation in order to preserve potential cultural evidence. Given H. naledi’s cranial capacity, we suggest that even very early hominin skeletal assemblages should be re-examined, if there is sufficient evidence or records available. These would include examples such as the A.L. 333 Au. afarensis site (the so-called First Family site in Hadar, Ethiopia), the Dikika infant skeleton, WT 15000 (Turkana Boy) and even A.L. 288 (Lucy), as such unusual taphonomic situations where skeletons are preserved cannot be simply explained away as “natural” in origin, based solely on the cranial capacity and assumed lack of cognitive and cultural complexity of the hominins, as emphasized by us in Fuentes et al. (2023). We are not the first to observe that some very early hominin situations may represent early mortuary activity (Pettitt 2013), but we would advocate a step further. We suggest it may be damaging to take “natural accumulation” as the standard null hypothesis for hominin paleoanthropology, and that it is more conservative in practice to engage remains with the null hypothesis of possible cultural formation.
  
  We are deeply grateful for the time and effort all of the 8 reviewers (across three reviews) have taken with this work. We also acknowledge the anonymous reviewers from previous submissions whose opinions and comments will have made the final iterations of these manuscripts better for their efforts. As this process is rather public and includes commentary outside of the eLife forum, we ask that the efforts of all 37 authors and 8 reviewers involved be respected and that the discourse remain professional in all venues as we study this fascinating and quite complex occurrence. We appreciate also the efforts of members of the public who have engaged with this relatively new process where preprints are posted prior to the reviews, allowing comments and interactions from colleagues and the public who are normally not part of the internal peer review process. We believe these interactions will make for better final papers. We feel we have met the standards of demonstrating burials in H. naledi and that the engravings are most likely associated with H. naledi. However, given the reviews we see many areas where our clarity and context, and analyses, were less strong than they can be. With the clarifications and additions taken on board through these review processes the final papers will be stronger and clearer. We recognize that this is an ongoing process of scientific investigation and further work will allow continued, and possibly better, evaluation of these hypotheses and others.
  
  Lee R Berger, Agustín Fuentes, John Hawks, Tebogo Makhubela
  
  Works cited:
  
  Aspöck, E. (2008). What Actually is a ‘Deviant Burial’?: Comparing German-Language and Anglophone Research on ‘Deviant Burials.’ In E. M. Murphy (Ed.). Deviant Burial in the Archaeological Record. Oxford: Oxbow Books. pp 17–34.
  
  Bolliger, S.A. & Thali, M.J. (2009). Thanatology. In S.A. Bolliger and M.J. Thali (eds) Virtopsy Approach: 3D Optical and Radiological Scanning and Reconstruction in Forensic Medicine. Boca Raton: CRC Press. pp 187-218.
  
  Boulestin, B. & Duday, H. (2005). Ethnologie et archéologie de la mort: de l’illusion des références à l’emploi d’un vocabulaire. In: C. Mordant and G. Depierre (eds) Les Pratiques Funéraires à l’Âge du Bronze en France. Actes de la table ronde de Sens-en-Bourgogne. Paris: Éditions du Comité des Travaux Historiques et Scientifiques. pp. 17–30.
  
  Boulestin, B. & Duday, H. (2006). Ethnology and archaeology of death: from the illusion of references to the use of a terminology. Archaeologia Polona 44: 149–169.
  
  Bristow, J., Simms, Z. & Randolph-Quinney, P.S. (2011). Taphonomy. In S. Black and E. Ferguson (eds.) Forensic Anthropology 2000-2010. Boca Raton, FL: CRC Press. pp 279-318.
  
  Channing, J. & Randolph-Quinney, P.S. (2006). Death, decay and reconstruction: the archaeology of Ballykilmore Cemetery, County Westmeath. In J. O’Sullivan and M. Stanley (eds.) Settlement, Industry and Ritual: Archaeology. National Roads Authority Monograph Series No. 3. Dublin: NRA/Four Courts Press. pp 113-126.
  
  Cherryson, A. K. (2008). Normal, Deviant and Atypical: Burial Variation in Late Saxon Wessex, c. AD 700–1100. In E. M. Murphy (Ed.). Deviant Burial in the Archaeological Record. Oxford: Oxbow Books. pp 115–130.
  
  Connolly, M., F. Coyne & L. G. Lynch (2005). Underworld: Death and Burial in Cloghermore Cave, Co. Kerry. Bray, Co. Wicklow: Wordwell.
  
  Darwent, C. M. & R. L. Lyman (2002). Detecting the postburial fragmentation of carpals, tarsals and phalanges. In M. H. Sorg and W. D. Haglund (eds). Advances in Forensic Taphonomy: Method, Theory and Archeological Perspectives. Boca Raton, FL, CRC Press. pp 355-378.
  
  d’Errico, F., & Backwell, L. (2016). Earliest evidence of personal ornaments associated with burial: The Conus shells from Border Cave. Journal of Human Evolution, 93, 91–108.
  
  De Villiers, H. (1973). Human skeletal remains from Border Cave, Ingwavuma District, KwaZulu, South Africa. Annals of the Transvaal Museum, 28(13), 229–246.
  
  Dell’Unto, N. and Landeschi, G. (2022). Archaeological 3D GIS. London: Routledge.
  
  Dibble, H. L., Aldeias, V., Goldberg, P., McPherron, S. P., Sandgathe, D., & Steele, T. E. (2015). A critical look at evidence from La Chapelle-aux-Saints supporting an intentional Neandertal burial. Journal of Archaeological Science, 53, 649–657.
  
  Dirkmaat, D. C., & Cabo, L. L. (2016). Forensic archaeology and forensic taphonomy: basic considerations on how to properly process and interpret the outdoor forensic scene. Academic Forensic Pathology 6, 439–454.
  
  Dirks, P. H., Berger, L. R., Roberts, E. M., Kramers, J. D., Hawks, J., Randolph-Quinney, P. S., Elliott, M., Musiba, C. M., Churchill, S. E., de Ruiter, D. J., Schmid, P., Backwell, L. R., Belyanin, G. A., Boshoff, P., Hunter, K. L., Feuerriegel, E. M., Gurtov, A., Harrison, J. du G., Hunter, R., … Tucker, S. (2015). Geological and taphonomic context for the new hominin species Homo naledi from the Dinaledi Chamber, South Africa. ELife, 4, e09561.
  
  Dirks, P.H.G.M., Berger, L.R., Hawks, J., Randolph-Quinney, P.S., Backwell, L.R., and Roberts, E.M. (2016). Comment on “Deliberate body disposal by hominins in the Dinaledi Chamber, Cradle of Humankind, South Africa?” [J. Hum. Evol. 96 (2016) 145-148]. Journal of Human Evolution 96: 149-153.
  
  Dirks, P. H., Roberts, E. M., Hilbert-Wolf, H., Kramers, J. D., Hawks, J., Dosseto, A., Duval, M., Elliott, M., Evans, M., Grün, R., Hellstrom, J., Herries, A. I., Joannes-Boyau, R., Makhubela, T. V., Placzek, C. J., Robbins, J., Spandler, C., Wiersma, J., Woodhead, J., & Berger, L. R. (2017). The age of Homo naledi and associated sediments in the Rising Star Cave, South Africa. ELife, 6, e24231.
  
  Donnelly, S., C. Donnelly & E. Murphy (1999). The forgotten dead: The cíllíní and disused burial grounds of Ballintoy, County Antrim. Ulster Journal of Archaeology 58, 109-113.
  
  Duday, H. (2005). L’archéothanatologie ou l’archéologie de la mort. In: O. Dutour, J.-J. Hublin and B. Vandermeersch (eds) Objets et Méthodes en Paléoanthropologie. Paris: Comité des Travaux Historiques et Scientifiques. pp. 153–215.
  
  Duday, H. (2009). Archaeology of the Dead: Lectures in Archaeothanatology. Oxford: Oxbow Books.
  
  Finley, N. (2000). Outside of life: Traditions of infant burial in Ireland from cillin to cist. World Archaeology 31, 407-422.
  
  Gargett, R. H. (1999). Middle Palaeolithic burial is not a dead issue: The view from Qafzeh, Saint-Césaire, Kebara, Amud, and Dederiyeh. Journal of Human Evolution, 37(1), 27–90.
  
  Goldberg, P., Aldeias, V., Dibble, H., McPherron, S., Sandgathe, D., & Turq, A. (2017). Testing the Roc de Marsal Neandertal “Burial” with Geoarchaeology. Archaeological and Anthropological Sciences, 9(6), 1005–1015.
  
  Gómez-Olivencia, A., & García-Martínez, D. (2019). New postcranial remains from the Roc de Marsal Neandertal child. PALEO. Revue d’archéologie Préhistorique, 30–1, 30–1.
  
  Green, E.C. (2022). An archaeothanatological approach to the identification of late Anglo-Saxon burials in wooden containers. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 436-455.
  
  Henderson, J. (1987). Factors determining the state of preservation of human remains. In A. Boddington, A. Garland and R. Janaway (eds). Death, Decay and Reconstruction: Approaches to Archaeology and Forensic Science. Manchester: Manchester University Press. pp 43-54.
  
  Hunter, J. R. (2014). Human remains recovery: archaeological and forensic perspectives. In C. Smith (ed). Encyclopedia of Global Archaeology. New York: Springer New York. pp 3549-3556.
  
  Hochrein, M. (2002). An Autopsy of the Grave: Recognizing, Collecting and Preserving Forensic Geotaphonomic Evidence. In M. H. Sorg and W. D. Haglund (eds). Advances in Forensic Taphonomy: Method, Theory and Archeological Perspectives. Boca Raton, FL, CRC Press: 45-70.
  
  Knüsel, C.K. & Robb, J. (2016). Funerary taphonomy: An overview of goals and methods. Journal of Archaeological Science: Reports 10, 655-673.
  
  Kuhn, B.F., Berger, L.R. & Skinner, J.D. (2010). Examining criteria for identifying and differentiating fossil faunal assemblages accumulated by hyenas and hominins using extant hyenid accumulations. International Journal of Osteoarchaeology 20, 15-35.
  
  Lyman, R. (1994). Vertebrate Taphonomy. Cambridge, Cambridge University Press.
  
  Martinón-Torres, M., d’Errico, F., Santos, E., Álvaro Gallo, A., Amano, N., Archer, W., Armitage, S. J., Arsuaga, J. L., Bermúdez de Castro, J. M., Blinkhorn, J., Crowther, A., Douka, K., Dubernet, S., Faulkner, P., Fernández-Colón, P., Kourampas, N., González García, J., Larreina, D., Le Bourdonnec, F.-X., … Petraglia, M. D. (2021). Earliest known human burial in Africa. Nature, 593(7857), 7857.
  
  Mickleburgh, H.L & Wescott, D.J. (2018). Controlled experimental observations on joint disarticulation and bone displacement of a human body in an open pit: implications for funerary archaeology. Journal of Archaeological Science: Reports 20: 158-167.
  
  Mickleburgh, H.L., Wescott, D.J., Gluschitz, S. & Klinkenberg, V.M. (2022). Exploring the use of actualistic forensic taphonomy in the study of (forensic) archaeological human burials: An actualistic experimental research programme at the Forensic Anthropology Center at Texas State University (FACTS), San Marcos, Texas. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 542-562.
  
  Owsley, D. & B. Compton (1997). Preservation in late 19th Century iron coffin burials. In W. Haglund and M. Sorg (eds). Forensic Taphonomy: The Postmortem Fate of Human Remains. Boca Raton, FL, CRC Press: 511-526.
  
  Parker Pearson, M. (1999). The Archaeology of Death and Burial. College Station: Texas A&M University Press.
  
  Pettitt, P. (2013). The Palaeolithic Origins of Human Burial. Routledge.
  
  Pomeroy, E., Bennett, P., Hunt, C. O., Reynolds, T., Farr, L., Frouin, M., Holman, J., Lane, R., French, C., & Barker, G. (2020). New Neanderthal remains associated with the ‘flower burial’ at Shanidar Cave. Antiquity, 94(373), 11–26.
  
  Randolph-Quinney, P.S. (2013). From the cradle to the grave: the bioarchaeology of Clonfad 3 and Ballykilmore 6. In N. Brady, P. Stevens and J. Channing (eds.). Settlement and Community in the Fir Tulach Kingdom. Dublin: National Roads Authority Press. pp A2.1-48.
  
  Randolph-Quinney, P.S., Haines, S. and Kruger, A. (2018). The use of three-dimensional scanning and surface capture methods in recording forensic taphonomic traces: issues of technology, visualisation, and validation. In: W.J. M. Groen and P. M. Barone (eds). Multidisciplinary Approaches to Forensic Archaeology. Berlin: Springer International Publishing, pp. 115-130.
  
  Rendu, W., Beauval, C., Crevecoeur, I., Bayle, P., Balzeau, A., Bismuth, T., Bourguignon, L., Delfour, G., Faivre, J.-P., Lacrampe-Cuyaubère, F., Tavormina, C., Todisco, D., Turq, A., & Maureille, B. (2014). Evidence supporting an intentional Neandertal burial at La Chapelle-aux-Saints. Proceedings of the National Academy of Sciences, 111(1), 81–86.
  
  Sandgathe, D. M., Dibble, H. L., Goldberg, P., & McPherron, S. P. (2011). The Roc de Marsal Neandertal child: A reassessment of its status as a deliberate burial. Journal of Human Evolution, 61(3), 243–253.
  
  Silver, M. (2016). Conservation Techniques in Cultural Heritage. In E. Stylianidis and F. Remondino (eds) 3D Recording, Documentation and Management of Cultural Heritage. Dunbeath: Whittles Publishing. pp 15-106.
  
  Schotsmans, E.M.J., Georges-Zimmermann, P., Ueland, M. and Dent, B.B. (2022). From flesh to bone: Building bridges between taphonomy, archaeothanatology and forensic science for a better understanding of mortuary practices. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 501-541.
  
  AuthorResponse
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.01.543135v1
www.biorxiv.org www.biorxiv.org

New submission 11/07/2023, 12:29:07

1
1. Public_Reviews 11 Jul 2023
  
  in eLife
  
  Author Response:
  
  We would like to thank the eLife reviewers for the considerable time and effort they have invested to review these manuscripts. We have also benefited from a previous round of review of the manuscript describing the proposed burial features, which underwent two rounds of revisions in a high-impact journal over a period of approximately 8 months during 2022 and early 2023. Both sets of reviews have reflected mixed responses to the evidence we have presented, with one reviewer recommending acceptance with minor editorial revisions, two recommending acceptance with minor revisions and the fourth recommending rejection based upon similar arguments to those reflected by some of the reviewers in this current round of reviews in eLife. Ultimately the managing editor of this first journal took the decision that the review process could not be completed in a timely manner and rejected the manuscript, although the submission here reflected our consideration of these reviewers’ suggestions.
  
  We have chosen in this initial response to the eLife reviews to include some references to the previous anonymous reviews in order to illustrate differences of opinion and differences in revision suggestions within the review process. Our goal is to offer maximal insight into our decision-making process and to acknowledge the considerable time and effort put into the assessment of these manuscripts by reviewers (for eLife and in the case of the earlier review process). We hope that this approach will assist the readers, and reviewers, of our manuscripts in understanding why we are proceeding with certain decisions during the revision process.
  
  This is a new process for us and the reviewers, and one way in which it significantly differs from more traditional review is that both the reviews and our reply will be public well in advance of our revisions to the manuscript. Indeed, considering the scope of the reviews, some of those revisions may take considerable time, although many can be accomplished fairly easily. Thus, we are not in a position to say that we have solved every issue raised by the reviewers. Instead, we will examine what appear to be the key critical issues raised regarding the data and the analyses and how we propose to address these as we revise the papers. We will also address several philosophical and ethical issues raised by the reviews and our proposal for dealing with these. More specific editorial and citational recommendations will be dealt with on a case-by-case basis, and we do not address these point-by-point in this reply. Please note, this response to the reviewers is not the revision of the manuscript and is only the initial opinion of the corresponding authors with some guidance from the larger group of authors of all three papers. Our final submitted revision will reflect the input of all authors included on those submissions.
  
  We took the decision to submit three separate papers consciously. The two different categories of evidence, burials and engravings, involve different kinds of analysis and different (although overlapping) teams of researchers, and we recognized that each deserved their own presentation and assessment. Meanwhile, together they inform the context of H. naledi in a way that requires some synthetic discussion, in which both kinds of evidence are relevant, leading to a third paper. But the mutual relevance of these different kinds of evidence and their review by a common set of reviewers naturally raises cross-cutting issues, and the reviewers have cross-referenced the three articles. This has sometimes led to suggestions about one manuscript based on the contents of another. Considering the situation, we accepted the recommendation that it would be clearer to consider all three articles in a single reply. Thus, while each of the three papers will proceed separately during the revision process, it will be necessary to highlight across all three papers occasionally in our responses.
  
  Scientific Issues:
  
  In reading the reviews, we feel there are 9 critical points/assertions raised by one or more of the reviewers that present a problem for, or challenge to, our hypothesis that the observed evidence (bone accumulations and engravings) described in the Dinaledi subsystem are of intentional naledigenic origin. These are:
  
  1. The evidence presented does not demonstrate a clear interruption of the floor sediments, thus failing to demonstrate excavated holes.
  
  2. The sediments infilling the holes where the skeletal remains are found have not been demonstrated to originate from the disruption of the floor sediments and thus could be part of a natural geological process (e.g. water movement, slumping) or carnivore accumulations.
  
  3. Previous geological interpretations by our research group have given alternative geological explanations for formation of the bony accumulations that contradict the present evidence presented here and result in alternative origins hypotheses.
  
  4. Burial cannot be effectively assessed without complete excavation of the features and site.
  
  5. The skeletal remains as presented do not conform clearly to typical body arrangement/positions associated with human (Homo sapiens) burials.
  
  6. There is no evidence of grave goods or lithic scatters that are typically associated with human burials.
  
  7. Humans may have been involved with the creation of either the Homo naledi bone accumulations, the engravings, or both.
  
  8. Without a date of the engravings, the null hypothesis should be the engravings were created by Homo sapiens.
  
  9. The null hypothesis for explanation of the skeletal remains in this situation should be “natural accumulation”.
  
  Our analysis of the Dinaledi Feature 1 leads us to accept that the laminated orange-red mudstone (LORM) sedimentary layer is interrupted, indicating a non-natural intervention, and that the hole created by the interruption was then filled by a fleshed body (and perhaps parts of other bodies), which was then covered by sediment that originated from the hole that was dug. We recognize that the four eLife reviewers are not convinced that our presentation is sufficient to establish this. Interestingly, this was not the universal opinion of earlier reviewers of the initial manuscript, several of whom felt we had adequately supported this hypothesis. The lack of clarity in this current version of the burial manuscript is our responsibility. In the upcoming revision of this paper to be submitted, we will take the reviewers’ critiques to heart and add figures that better illustrate the disruption of the LORM and clarify the sedimentological data showing the material covering the skeletal remains in the hole are the disrupted sediments excavated from the same hole. We are proposing to isolate this most critical evidence for burial into a separate section in the revised submission based on the reviewers’ comments. The fact that the LORM layer is disrupted, a fleshed body was placed in the hole created by this disruption, and the body (and perhaps parts of other bodies) was/were then covered by the same sediments from the hole is the central feature of our hypothesis that the bone accumulations observed reflect a burial and not a natural process.
  
  The possibility of fluvial transport or involvement in the subsystem is a topic that we have addressed extensively in past work, and it is clear from these reviews that we must enhance our current manuscript to discuss this issue at greater length. Our previous work (Dirks et al. 2015; Dirks et al. 2017) emphasized that fluvial transport of whole bodies into the subsystem was precluded by several lines of sedimentological evidence. We excavated a rich accumulation of skeletal remains, including articulated limbs and other elements in subvertical orientations inconsistent with slow sedimentary infill, which were difficult to explain without positing either a large and dense pile of bodies and/or sediment movement. We encountered fractured chunks of laminated orange-red mudstone (LORM) in random orientations within our excavation area, within and among skeletal remains, which directly refuted that the remains were inundated with water at the time of burial, and this limited the possibility of fluvial transport. Water flow sufficient to displace bodies or complete skeletal evidence would also transport large and coarse sediment, which is absent from the subsystem, and would sort the commingled skeletal material that we found by size, which we do not observe. But our excavation only covered less than a square meter at very limited depth, and this was the limit to our knowledge of subsurface sediment. We thus were left with uncertainty that led us to suggest the possibility of sediment slumping or movement into subsurface drains, although these were not observed near our excavation. Our current work expands our knowledge of the subsurface and presents an alternative explanation for the disposition of skeletal remains from our earlier excavation. But we acknowledge that this new explanation is vulnerable to our own previous published proposals, and we must do a better job of explaining how the new information addresses our previous suggestions. By not clearly creating a section where we explained how these previous hypotheses were now nullified by new evidence, we clearly confused the reviewers with our own previous work. We will revise the manuscript by enhancing the review of the significant geological evidence demonstrating that there is no significant fluvial action in the system and making it clear how the burial hypothesis provides a clearer explanation for the situation of skeletal remains from our previous excavation work.
  
  One of the central issues raised by reviewers has been a perceived need to excavate these features completely, totally exhuming all skeletal remains from them. Reviewers have written that it is necessary to identify every skeletal element that is present and account for any missing elements. On this point, we have both ethical and scientific differences from these reviewers. We express our ethical concerns first. Many of the best-preserved possible burials ever discovered by archaeologists were subjected to total excavation and exhumation. Cases like La Chapelle-aux-Saints, La Ferrassie, and Skhūl were fully excavated at a time when data recording and excavation methods did not include the range of spatial and geomorphological approaches that later became routine. The judgment of early investigators that these situations were intentional burials was challenged by later workers, and the kind of information that might enable better tests had been irrevocably lost (Gargett 1999; Dibble et al. 2015; Rendu et al. 2014).
  
  Later, improved excavation standards have not sufficed to remove uncertainty or debate about possible burials. For example, it was long presumed that well-preserved remains of young children were by themselves diagnostic of intentional burial, such as those from Dederiyeh, Border Cave, or Roc de Marsal. Such cases were also fully excavated, with adequate documentation of the positioning of skeletal remains and their surrounding stratigraphic situation, but such cases were later challenged on several bases and the complete exhumation of material has confused or precluded testing of new hypotheses (e.g. Gargett 1999). The case of Roc de Marsal is one in which data from the initial excavation combined with re-excavation and geoarchaeological analysis led to a naturalistic interpretation of the skeletal material (Sandgathe et al. 2011; Goldberg et al. 2017). But even in this case, the researchers erred in their interpretation of the skeleton’s situation due to a lack of identification of parts of the infant’s skeleton (Gómez-Olivencia and García-Martínez 2019). That is to say, it is not only the burial hypothesis but other hypotheses that suffer from complete excavation. Researchers concerned with preserving all possible information have sometimes taken extraordinary measures to remove and study possible burials at high resolution in the laboratory. Such was the case of the Shanidar IV burial removed from the site and transported in a plaster jacket by Solecki, which led to the disruption and loss of internal stratigraphic information (Pomeroy et al. 2020). Arguably, the current state of the art is full excavation with partial preparation, such as that undertaken at Panga ya Saidi (Martinón-Torres et al. 2021). But again, any future attempt to reinterpret or test the hypothesis of burial must rely on the adequacy of documentation as the original context has been removed.
  
  In our decision to leave material in place as much as possible, we are expanding upon standard practice to leave witness sections and unexcavated areas for future research. The situation is novel, representing possible burials by a nonhuman species, and that makes it doubly important in our opinion to be conservative in not fully exhuming the skeletal material from its context. We anticipate that many other researchers, including future investigators, will suggest additional methods to further test the hypothesis of burial, something that would be impossible if we had excavated the features in their entirety prior to publishing a description of our work. We believe strongly that our ethical responsibility is to publish the work and the most likely interpretation while leaving as much evidence in place as possible to enable further testing and replication. We welcome the suggestions of additional methods/analyses to test the H. naledi burial hypothesis.
  
  This being said, we also observe that total exhumation would not resolve the concerns raised by the reviewers. The recommendation of total exhumation is in pursuit of a full account of all skeletal material present and its preservation and spatial situation, in order to demonstrate that they conform to body positions comparable to human burials. As has been highlighted in forensic casework, the excavation of an inhumation feature does not necessarily provide an accurate spatial or anatomical manifest of the stratigraphical relationships between the body, encapsulating matrix, and any cut present due to preservational, taphonomic and operational factors (Dirkmaat and Cabo, 2016; Hunter, 2014). In particular, in cases where skeletal elements are highly fragmented, friable, or degraded (such as through bioerosion) then complete excavation—even under controlled laboratory conditions—may destroy bone and severely limit skeletal identification (Henderson, 1987; Hochrein, 2002; Owsley and Compton, 1997), particularly in elements where the ratio of trabecular to cortical bone is high (Darwent and Lyman, 2002; Lyman, 1994). As such, non-invasive methods of 3D and 4D modelling (preservation in situ) are often considered preferable to complete necropsy or excavation (preservation by record) where appropriate (Bolliger and Thali, 2009; Dell’Unto and Landeschi, 2022; Randolph-Quinney et al., 2018; Silver, 2016).
  
  The test of burial is not primarily positional, but taphonomic and geological. The position and number of bones can elaborate on process-driven questions of decay and destruction in the burial environment, or post-mortem modification, but are not singularly indicative of whether the remains were intentionally buried – the post-mortem narrative of all the processes affecting the cadaveric island is required (Knüsel and Robb, 2016). In previous cases, researchers have disputed or accepted the hypothesis of intentional hominin burial based upon assumptions about how modern humans or Neandertals would have positioned bodies, with the idea that some positions reflect ritual intent while others do not. But applying such assumptions is unjustifiable, particularly for a species like H. naledi, whose culture may have differed fundamentally from our own. Our work acknowledges that the present evidence does not enable a full reconstruction of the burial positions, but it does show that fleshed remains were encased in sediment prior to decomposition of soft tissue, and that subsequent spatial changes can be most parsimoniously explained by natural decomposition within sedimentary matrix contained within a burial feature (after Green, 2022; Mickleburgh and Wescott, 2018; Mickleburgh et al., 2022). If the argument is that extraordinary claims require extraordinary evidence, we feel that the evidence documents excavation and interment (and will do so more clearly in the revision), and the fact that the remains do not match a “typical” human burial in body positioning is not in itself evidence that these are not H. naledi burials.
  
  We feel that the reviewers (in keeping with many palaeoanthropologists) have a clear idea of what they “think” a burial should look like in an idealised sense, but this platonic ideal of burial form is not matched by the extensive literature in archaeothanatology, funerary archaeology and forensic science which indicates enormous variability in the activity, morphology and post-mortem system experienced by the human body in cases of interment and body disposal (e.g. Aspöck, 2008; Boulestin and Duday, 2005 and 2006; Connolly et al., 2005; Channing and Randolph-Quinney, 2006; Cherryson, 2008; Donnelly et al., 1999; Finley, 2000; Hunter, 2014; Parker Pearson, 1999; Randolph-Quinney, 2013). Decades of experience in the identification, recovery and interpretation of clandestine, deviant, and non-formal burials indicate that the platonic ideal is rare, and in many contexts, the exception (Cherryson, 2008; Parker Pearson, 1999). This variability is particularly relevant to morphological traits in burial context, such as the informal nature of the grave cut in plan and section, shallow burial depth, and initial disposition of body (placement) during the early post-mortem period. These might run counter to the expectations of reviewers or others referencing the fossil hominin record, but are well accepted within the communities of researchers investigating Holocene archaeological sites and forensic contexts.
  
  It is encouraging to see reviewers beginning to incorporate the extensive (often experimentally derived) literature from archaeothanatology and forensic taphonomy in their deliberations, and we will be taking these comments on board going forward. In particular, we acknowledge reviewers’ comments and the need to construct a more detailed post-mortem narrative, accounting for joint disarticulation (labile versus persistent joints etc.), displacement, and final disposition of elements within the burial space. As such we will incorporate the hierarchy of decomposition (rank order disarticulation), associations between regions of anatomical association, areas of disassociation, and the voids produced during decomposition (after Mickleburgh and Wescott, 2018; Mickleburgh et al., 2022) into our narrative. In doing so we acknowledge the tensions between the inductive archaeothanatological narrative-driven approach (e.g. Duday, 2005 & 2009) versus robust decomposition data derived from human forensic taphonomic experimentation recently articulated by Schotsmans and colleagues (2022), noting that we will highlight comparative data based on forensic experimental casework and actualistic modelling over inductive intuitive approaches which come with significant evidential shortcomings (Bristow et al. 2011).
  
  Finally, from a taphonomic perspective it is worth pointing out to reviewers that we have already addressed the issue of lack of taphonomic evidence for carnivore involvement in the formation of the Dinaledi assemblage (Dirks, et al., 2016). Absence of any carnivore-induced bone surface modifications, patterns of skeletal part representation, and a total absence of any carnivore remains found within the Dinaledi chamber (following Kuhn and colleagues, 2010) lead us to reject carnivores as possible vectors of body accumulation within the Dinaledi Chamber and Hill Antechamber.
  
  Reviewers suggest that without a date derived from geochronological methods, the engravings cannot be associated with H. naledi, and that it is possible (or probable) that the engravings were done in the recent past by H. sapiens. This suggestion neglects the context of the site. We have previously documented the structure and extremely limited accessibility of the Dinaledi subsystem. This subsystem was not recorded on maps of the documented Rising Star Cave system prior to our work and its discovery by our teams. Furthermore, there is no evidence of prehistoric human activity in the areas of the cave related to possible subterranean entrances. There is no evidence that humans in the past typically ventured into extreme spaces such as those of Rising Star. It is clear from the presence of the remains of many individuals that H. naledi ventured into these spaces again and again. It is likely that H. naledi moved through these spaces more easily than humans do based on their physique. We show that the engravings overlay each other, suggesting multiple engraving events. These engravings took time and effort, and the only evidence for use of the Dinaledi subsystem by any hominin is by H. naledi. The context leads to the null hypothesis that H. naledi made the marks. In our revision, we will elaborate on this argument to clarify the evidence for our stance on this hypothesis. Several reviewers took issue with the title of the engraving paper as we did not insert a qualifier in front of the suggested date range for the engravings. We deliberately left out qualifying language so that the title took the form of a testable hypothesis rather than a weak assertion. Should future work find the engravings were not produced within this time range, then we will restate this hypothesis.
  
  Finally, with regard to the engravings, we have chosen to report them because they exist. Not reporting the presence of engraved marks on the walls of a cave above hypothesized burials would be tantamount to leaving relevant evidence out of the description of an archaeological context. We recognize and state in our manuscript that these markings require substantial further study, including attempts at geochronological dating. But the current evidence is clearly relevant to the archaeological context of the subsystem. We take a similar stance with reporting the presence of the tool-shaped artefact near the hand of the H. naledi skeleton in the Hill Antechamber. It is evident that this object requires further study, as we stated in our manuscript, but again omitting it from our study would be leaving out relevant evidence.
  
  Some have suggested that the null hypothesis should be that all of these observed circumstances are of natural origin. Our team took this approach in our early investigation of the Dinaledi subsystem (Dirks et al. 2015). We adopted the null hypothesis that the geological processes involved in the accumulation of H. naledi skeletal remains were “natural” (e.g., non-naledigenic involvement), and we were able to reject many alternative explanations for the assemblage, including carnivore accumulation, “death trap” accumulation, and fluvial transport of bodies or bones (Dirks et al. 2015). This led us to the hypothesis that H. naledi were involved in bringing the bodies into the spaces where they were found. But we did not hypothesize their involvement in the formation of the deposit itself beyond bringing the bodies to the location.
  
  This approach seems conservative. It followed the traditional view that small-brained hominins do not engage in cultural practices. But we recognize in hindsight that this null hypothesis approach did harm to our analyses. It impeded us from recognizing within our initial excavations of the puzzle box area and other excavations between 2014 – 2017 that we might be encountering remains that were intrusive in the sedimentary floor of the chamber. If we had approached the accumulation of a large number of hominins from the perspective of the null hypothesis being that the situation was likely cultural, we perhaps would have collected evidence in a slightly different manner. We certainly note that if the Dinaledi system had been full of the remains of modern humans, there would have been little doubt that the null hypothesis would have been that this was a cultural space and not a “natural space”. We therefore respectfully disagree with the reviewers who continue to support the idea that we should approach hominin excavations with the null hypothesis that they will be natural (specifically non-cultural) in origins. If excavations continue with this mindset we believe that potential cultural evidence is almost certain to be lost.
  
  There has been a gradient across paleoanthropological excavations, archaeological work, and forensic investigation, with increasing precision of context. The reality is that the recording precision and frame of approach is typically different in most paleontological excavations than in those related to contemporary human remains. If anything comes from the present discussion of whether the Dinaledi system is a burial site for H. naledi or not, we hope that by taking seriously the possibility of deep cultural dynamics of hominins, we will encourage other teams to meet the highest standards of excavation in order to preserve potential cultural evidence. Given H. naledi’s cranial capacity, we suggest that even very early hominin skeletal assemblages should be re-examined, if there is sufficient evidence or records available. These would include examples such as the A.L. 333 Au. afarensis site (the so-called First Family site in Hadar, Ethiopia), the Dikika infant skeleton, WT 15000 (Turkana Boy) and even A.L. 288 (Lucy), as such unusual taphonomic situations where skeletons are preserved cannot be simply explained away as “natural” in origin, based solely on the cranial capacity and assumed lack of cognitive and cultural complexity of the hominins, as emphasized by us in Fuentes et al. (2023). We are not the first to observe that some very early hominin situations may represent early mortuary activity (Pettitt 2013), but we would advocate a step further. We suggest it may be damaging to take “natural accumulation” as the standard null hypothesis for hominin paleoanthropology, and that it is more conservative in practice to engage remains with the null hypothesis of possible cultural formation.
  
  We are deeply grateful for the time and effort all of the 8 reviewers (across three reviews) have taken with this work. We also acknowledge the anonymous reviewers from previous submissions whose opinions and comments will have made the final iterations of these manuscripts better for their efforts. As this process is rather public and includes commentary outside of the eLife forum, we ask that the efforts of all 37 authors and 8 reviewers involved be respected and that the discourse remain professional in all venues as we study this fascinating and quite complex occurrence. We appreciate also the efforts of members of the public who have engaged with this relatively new process where preprints are posted prior to the reviews, allowing comments and interactions from colleagues and the public who are normally not part of the internal peer review process. We believe these interactions will make for better final papers. We feel we have met the standards of demonstrating burials in H. naledi and that the engravings are most likely associated with H. naledi. However, given the reviews, we see many areas where our clarity and context, and analyses, were less strong than they can be. With the clarifications and additions taken on board through these review processes, the final papers will be stronger and clearer. We recognize that this is an ongoing process of scientific investigation and further work will allow continued, and possibly better, evaluation of these hypotheses and others.
  
  Lee R Berger, Agustín Fuentes, John Hawks, Tebogo Makhubela
  
  Works cited:
  
  Aspöck, E. (2008). What Actually is a ‘Deviant Burial’?: Comparing German-Language and Anglophone Research on ‘Deviant Burials.’ In E. M. Murphy (Ed.). Deviant Burial in the Archaeological Record. Oxford: Oxbow Books. pp 17–34.
  
  Bolliger, S.A. & Thali, M.J. (2009). Thanatology. In S.A. Bolliger and M.J. Thali (eds) Virtopsy Approach: 3D Optical and Radiological Scanning and Reconstruction in Forensic Medicine. Boca Raton: CRC Press. pp 187-218.
  
  Boulestin, B. & Duday, H. (2005). Ethnologie et archéologie de la mort: de l’illusion des références à l’emploi d’un vocabulaire. In: C. Mordant and G. Depierre (eds) Les Pratiques Funéraires à l’Âge du Bronze en France. Actes de la table ronde de Sens-en-Bourgogne. Paris: Éditions du Comité des Travaux Historiques et Scientifiques. pp. 17–30.
  
  Boulestin, B. & Duday, H. (2006). Ethnology and archaeology of death: from the illusion of references to the use of a terminology. Archaeologia Polona 44: 149–169.
  
  Bristow, J., Simms, Z. & Randolph-Quinney, P.S. (2011). Taphonomy. In S. Black and E. Ferguson (eds.) Forensic Anthropology 2000-2010. Boca Raton, FL: CRC Press. pp 279-318.
  
  Channing, J. & Randolph-Quinney, P.S. (2006). Death, decay and reconstruction: the archaeology of Ballykilmore Cemetery, County Westmeath. In J. O’Sullivan and M. Stanley (eds.) Settlement, Industry and Ritual: Archaeology. National Roads Authority Monograph Series No. 3. Dublin: NRA/Four Courts Press. pp 113-126.
  
  Cherryson, A. K. (2008). Normal, Deviant and Atypical: Burial Variation in Late Saxon Wessex, c. AD 700–1100. In E. M. Murphy (Ed.). Deviant Burial in the Archaeological Record. Oxford: Oxbow Books. pp 115–130.
  
  Connolly, M., F. Coyne & L. G. Lynch (2005). Underworld : Death and Burial in Cloghermore Cave, Co. Kerry. Bray, Co. Wicklow: Wordwell.
  
  Darwent, C. M. & R. L. Lyman (2002). Detecting the postburial fragmentation of carpals, tarsals and phalanges. In M. H. Sorg and W. D. Haglund (eds). Advances in Forensic Taphonomy: Method, Theory and Archeological Perspectives. Boca Raton, FL, CRC Press. pp 355-378.
  
  d’Errico, F., & Backwell, L. (2016). Earliest evidence of personal ornaments associated with burial: The Conus shells from Border Cave. Journal of Human Evolution, 93, 91–108.
  
  De Villiers. H. (1973). Human skeletal remains from Border Cave, Ingwavuma District, KwaZulu, South Africa. Annals of the Transvaal Museum, 28(13), 229–246.
  
  Dell’Unto, N. and Landeschi, G. (2022). Archaeological 3D GIS. London: Routledge.
  
  Dibble, H. L., Aldeias, V., Goldberg, P., McPherron, S. P., Sandgathe, D., & Steele, T. E. (2015). A critical look at evidence from La Chapelle-aux-Saints supporting an intentional Neandertal burial. Journal of Archaeological Science, 53, 649–657.
  
  Dirkmaat, D. C., & Cabo, L. L. (2016). Forensic archaeology and forensic taphonomy: basic considerations on how to properly process and interpret the outdoor forensic scene. Academic Forensic Pathology 6, 439–454.
  
  Dirks, P. H., Berger, L. R., Roberts, E. M., Kramers, J. D., Hawks, J., Randolph-Quinney, P. S., Elliott, M., Musiba, C. M., Churchill, S. E., de Ruiter, D. J., Schmid, P., Backwell, L. R., Belyanin, G. A., Boshoff, P., Hunter, K. L., Feuerriegel, E. M., Gurtov, A., Harrison, J. du G., Hunter, R., … Tucker, S. (2015). Geological and taphonomic context for the new hominin species Homo naledi from the Dinaledi Chamber, South Africa. ELife, 4, e09561.
  
  Dirks, P.H.G.M., Berger, L.R., Hawks, J., Randolph-Quinney, P.S., Backwell, L.R., and Roberts, E.M. (2016). Comment on “Deliberate body disposal by hominins in the Dinaledi Chamber, Cradle of Humankind, South Africa?” [J. Hum. Evol. 96 (2016) 145-148]. Journal of Human Evolution 96: 149-153.
  
  Dirks, P. H., Roberts, E. M., Hilbert-Wolf, H., Kramers, J. D., Hawks, J., Dosseto, A., Duval, M., Elliott, M., Evans, M., Grün, R., Hellstrom, J., Herries, A. I., Joannes-Boyau, R., Makhubela, T. V., Placzek, C. J., Robbins, J., Spandler, C., Wiersma, J., Woodhead, J., & Berger, L. R. (2017). The age of Homo naledi and associated sediments in the Rising Star Cave, South Africa. ELife, 6, e24231.
  
  Donnelly, S., C. Donnelly & E. Murphy (1999). The forgotten dead: The cíllíní and disused burial grounds of Ballintoy, County Antrim. Ulster Journal of Archaeology 58, 109-113.
  
  Duday, H. (2005). L’archéothanatologie ou l’archéologie de la mort. In: O. Dutour, J.-J. Hublin and B. Vandermeersch (eds) Objets et Méthodes en Paléoanthropologie. Paris: Comité des Travaux Historiques et Scientifiques. pp. 153–215.
  
  Duday, H. (2009). Archaeology of the Dead: Lectures in Archaeothanatology. Oxford: Oxbow Books.
  
  Finley, N. (2000). Outside of life: Traditions of infant burial in Ireland from cillin to cist. World Archaeology 31, 407-422.
  
  Gargett, R. H. (1999). Middle Palaeolithic burial is not a dead issue: The view from Qafzeh, Saint-Césaire, Kebara, Amud, and Dederiyeh. Journal of Human Evolution, 37(1), 27–90.
  
  Goldberg, P., Aldeias, V., Dibble, H., McPherron, S., Sandgathe, D., & Turq, A. (2017). Testing the Roc de Marsal Neandertal “Burial” with Geoarchaeology. Archaeological and Anthropological Sciences, 9(6), 1005–1015.
  
  Gómez-Olivencia, A., & García-Martínez, D. (2019). New postcranial remains from the Roc de Marsal Neandertal child. PALEO. Revue d’archéologie Préhistorique, 30–1, 30–1.
  
  Green, E.C. (2022). An archaeothanatological approach to the identification of late Anglo-Saxon burials in wooden containers. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 436-455.
  
  Henderson, J. (1987). Factors determining the state of preservation of human remains. In A. Boddington, A. Garland and R. Janaway (eds). Death, Decay and Reconstruction: Approaches to Archaeology and Forensic Science. Manchester: Manchester University Press. pp 43-54.
  
  Hunter, J. R. (2014). Human remains recovery: archaeological and forensic perspectives. In C. Smith (ed). Encyclopedia of Global Archaeology. New York: Springer New York. pp 3549-3556.
  
  Hochrein, M. (2002). An Autopsy of the Grave: Recognizing, Collecting and Preserving Forensic Geotaphonomic Evidence. In M. H. Sorg and W. D. Haglund (eds). Advances in Forensic Taphonomy: Method, Theory and Archeological Perspectives. Boca Raton, FL, CRC Press: 45-70.
  
  Knüsel, C.K. & Robb, J. (2016). Funerary taphonomy: An overview of goals and methods. Journal of Archaeological Science: Reports 10, 655-673.
  
  Kuhn, B.F., Berger, L.R. & Skinner, J.D. (2010). Examining criteria for identifying and differentiating fossil faunal assemblages accumulated by hyenas and hominins using extant hyenid accumulations. International Journal of Osteoarchaeology 20, 15-35.
  
  Lyman, R. (1994). Vertebrate Taphonomy. Cambridge, Cambridge University Press.
  
  Martinón-Torres, M., d’Errico, F., Santos, E., Álvaro Gallo, A., Amano, N., Archer, W., Armitage, S. J., Arsuaga, J. L., Bermúdez de Castro, J. M., Blinkhorn, J., Crowther, A., Douka, K., Dubernet, S., Faulkner, P., Fernández-Colón, P., Kourampas, N., González García, J., Larreina, D., Le Bourdonnec, F.-X., … Petraglia, M. D. (2021). Earliest known human burial in Africa. Nature, 593(7857), 7857.
  
  Mickleburgh, H.L & Wescott, D.J. (2018). Controlled experimental observations on joint disarticulation and bone displacement of a human body in an open pit: implications for funerary archaeology. Journal of Archaeological Science: Reports 20: 158-167.
  
  Mickleburgh, H.L., Wescott, D.J., Gluschitz, S. & Klinkenberg, V.M. (2022). Exploring the use of actualistic forensic taphonomy in the study of (forensic) archaeological human burials: An actualistic experimental research programme at the Forensic Anthropology Center at Texas State University (FACTS), San Marcos, Texas. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 542-562.
  
  Owsley, D. & B. Compton (1997). Preservation in late 19th Century iron coffin burials. In W. Haglund and M. Sorg (eds). Forensic Taphonomy: The Postmortem Fate of Human Remains. Boca Raton, FL, CRC Press: 511-526.
  
  Parker Pearson, M. (1999). The Archaeology of Death and Burial. College Station: Texas A&M University Press.
  
  Pettitt, P. (2013). The Palaeolithic Origins of Human Burial. Routledge.
  
  Pomeroy, E., Bennett, P., Hunt, C. O., Reynolds, T., Farr, L., Frouin, M., Holman, J., Lane, R., French, C., & Barker, G. (2020). New Neanderthal remains associated with the ‘flower burial’ at Shanidar Cave. Antiquity, 94(373), 11–26.
  
  Randolph-Quinney, P.S. (2013). From the cradle to the grave: the bioarchaeology of Clonfad 3 and Ballykilmore 6. In N. Brady, P. Stevens and J. Channing (eds.). Settlement and Community in the Fir Tulach Kingdom. Dublin: National Roads Authority Press. pp A2.1-48.
  
  Randolph-Quinney, P.S., Haines, S. and Kruger, A. (2018). The use of three-dimensional scanning and surface capture methods in recording forensic taphonomic traces: issues of technology, visualisation, and validation. In: W.J. M. Groen and P. M. Barone (eds). Multidisciplinary Approaches to Forensic Archaeology. Berlin: Springer International Publishing, pp. 115-130.
  
  Rendu, W., Beauval, C., Crevecoeur, I., Bayle, P., Balzeau, A., Bismuth, T., Bourguignon, L., Delfour, G., Faivre, J.-P., Lacrampe-Cuyaubère, F., Tavormina, C., Todisco, D., Turq, A., & Maureille, B. (2014). Evidence supporting an intentional Neandertal burial at La Chapelle-aux-Saints. Proceedings of the National Academy of Sciences, 111(1), 81–86.
  
  Sandgathe, D. M., Dibble, H. L., Goldberg, P., & McPherron, S. P. (2011). The Roc de Marsal Neandertal child: A reassessment of its status as a deliberate burial. Journal of Human Evolution, 61(3), 243–253.
  
  Silver, M. (2016). Conservation Techniques in Cultural Heritage. In E. Stylianidis and F. Remondino (eds) 3D Recording, Documentation and Management of Cultural Heritage. Dunbeath: Whittles Publishing. pp 15-106.
  
  Schotsmans, E.M.J., Georges-Zimmermann, P., Ueland, M. and Dent, B.B. (2022). From flesh to bone: Building bridges between taphonomy, archaeothanatology and forensic science for a better understanding of mortuary practices. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 501-541.
  
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.01.543133v2
www.biorxiv.org www.biorxiv.org

New submission 11/07/2023, 11:01:44

1
1. Public_Reviews 11 Jul 2023
  
  in eLife
  
  Author Response:
  
  We would like to thank the eLife reviewers for the considerable time and effort they have invested to review these manuscripts. We have also benefited from a previous round of review of the manuscript describing the proposed burial features, which underwent two rounds of revisions in a high-impact journal over a period of approximately 8 months during 2022 and early 2023. Both sets of reviews have reflected mixed responses to the evidence we have presented, with one reviewer recommending acceptance with minor editorial revisions, two recommending acceptance with minor revisions and the fourth recommending rejection based upon similar arguments to those reflected by some of the reviewers in this current round of reviews in eLife. Ultimately the managing editor of this first journal took the decision that the review process could not be completed in a timely manner and rejected the manuscript, although the submission here reflected our consideration of these reviewers’ suggestions.
  
  We have chosen in this initial response to the eLife reviews to include some references to the previous anonymous reviews in order to illustrate differences of opinion and differences in revision suggestions within the review process. Our goal is to offer maximal insight into our decision-making process and to acknowledge the considerable time and effort put into the assessment of these manuscripts by reviewers (for eLife and in the case of the earlier review process). We hope that this approach will assist the readers, and reviewers, of our manuscripts in understanding why we are proceeding with certain decisions during the revision process.
  
  This is a new process for us and the reviewers, and one way in which it significantly differs from more traditional review is that both the reviews and our reply will be public well in advance of our revisions to the manuscript. Indeed, considering the scope of the reviews, some of those revisions may take considerable time, although many can be accomplished fairly easily. Thus, we are not in a position to say that we have solved every issue raised by the reviewers. Instead, we will examine what appear to be the key critical issues raised regarding the data and the analyses and how we propose to address these as we revise the papers. We will also address several philosophical and ethical issues raised by the reviews and our proposal for dealing with these. More specific editorial and citational recommendations will be dealt with on a case-by-case basis, and we do not address these point-by-point in this reply. Please note, this response to the reviewers is not the revision of the manuscript and is only the initial opinion of the corresponding authors with some guidance from the larger group of authors of all three papers. Our final submitted revision will reflect the input of all authors included on those submissions.
  
  We took the decision to submit three separate papers consciously. The two different categories of evidence, burials and engravings, involve different kinds of analysis and different (although overlapping) teams of researchers, and we recognized that each deserved its own presentation and assessment. Meanwhile, together they inform the context of H. naledi in a way that requires some synthetic discussion, in which both kinds of evidence are relevant, leading to a third paper. But the mutual relevance of these different kinds of evidence and their review by a common set of reviewers naturally raises cross-cutting issues, and the reviewers have cross-referenced the three articles. This has sometimes led to suggestions about one manuscript based on the contents of another. Considering the situation, we accepted the recommendation that it would be clearer to consider all three articles in a single reply. Thus, while each of the three papers will proceed separately during the revision process, it will occasionally be necessary to refer across all three papers in our responses.
  
  Scientific Issues:
  
  In reading the reviews, we feel there are 9 critical points/assertions raised by one or more of the reviewers that present a problem for, or challenge to, our hypothesis that the observed evidence (bone accumulations and engravings) described in the Dinaledi subsystem are of intentional naledigenic origin. These are:
  
  The evidence presented does not demonstrate a clear interruption of the floor sediments, thus failing to demonstrate excavated holes.
  
  The sediments infilling the holes where the skeletal remains are found have not been demonstrated to originate from the disruption of the floor sediments and thus could be part of a natural geological process (e.g. water movement, slumping) or carnivore accumulations.
  
  Previous geological interpretations by our research group have given alternative geological explanations for formation of the bony accumulations that contradict the present evidence presented here and result in alternative origins hypotheses.
  
  Burial cannot be effectively assessed without complete excavation of the features and site.
  
  The skeletal remains as presented do not conform clearly to typical body arrangement/positions associated with human (Homo sapiens) burials.
  
  There is no evidence of grave goods or lithic scatters that are typically associated with human burials.
  
  Humans may have been involved with the creation of either the Homo naledi bone accumulations, the engravings, or both.
  
  Without a date of the engravings, the null hypothesis should be the engravings were created by Homo sapiens.
  
  The null hypothesis for explanation of the skeletal remains in this situation should be “natural accumulation”.
  
  Our analysis of the Dinaledi Feature 1 leads us to accept that the laminated orange-red mudstone (LORM) sedimentary layer is interrupted, indicating a non-natural intervention, and that the hole created by the interruption was then filled by a fleshed body (and perhaps parts of other bodies), which was then covered by sediment that originated from the hole that was dug. We recognize that the four eLife reviewers are not convinced that our presentation is sufficient to establish this. Interestingly, this was not the universal opinion of earlier reviewers of the initial manuscript, several of whom felt we had adequately supported this hypothesis. The lack of clarity in this current version of the burial manuscript is our responsibility. In the upcoming revision of this paper, we will take the reviewers’ critiques to heart and add figures that better illustrate the disruption of the LORM and clarify the sedimentological data showing that the material covering the skeletal remains in the hole is the disrupted sediment excavated from the same hole. Based on the reviewers’ comments, we propose to isolate this most critical evidence for burial in a separate section of the revised submission. The fact that the LORM layer is disrupted, a fleshed body was placed in the hole created by this disruption, and the body (and perhaps parts of other bodies) was then covered by the same sediments from the hole is the central feature of our hypothesis that the bone accumulations observed reflect a burial and not a natural process.
  
  The possibility of fluvial transport or involvement in the subsystem is a topic that we have addressed extensively in past work, and it is clear from these reviews that we must enhance our current manuscript to discuss this issue at greater length. Our previous work (Dirks et al. 2015; Dirks et al. 2017) emphasized that fluvial transport of whole bodies into the subsystem was precluded by several lines of sedimentological evidence. We excavated a rich accumulation of skeletal remains, including articulated limbs and other elements in subvertical orientations inconsistent with slow sedimentary infill, which were difficult to explain without positing either a large and dense pile of bodies and/or sediment movement. We encountered fractured chunks of laminated orange-red mudstone (LORM) in random orientations within our excavation area, within and among skeletal remains, which directly refuted that the remains were inundated with water at the time of burial, and this limited the possibility of fluvial transport. Water flow sufficient to displace bodies or complete skeletal evidence would also transport large and coarse sediment, which is absent from the subsystem, and would sort the commingled skeletal material that we found by size, which we do not observe. But our excavation covered less than a square meter at very limited depth, and this was the limit to our knowledge of subsurface sediment. We thus were left with uncertainty that led us to suggest the possibility of sediment slumping or movement into subsurface drains, although these were not observed near our excavation. Our current work expands our knowledge of the subsurface and presents an alternative explanation for the disposition of skeletal remains from our earlier excavation. But we acknowledge that this new explanation is vulnerable to our own previously published proposals, and we must do a better job of explaining how the new information addresses our previous suggestions.
By not clearly creating a section where we explained how these previous hypotheses were now nullified by new evidence, we clearly confused the reviewers with our own previous work. We will revise the manuscript by enhancing the review of the significant geological evidence demonstrating that there is no significant fluvial action in the system and making it clear how the burial hypothesis provides a clearer explanation for the situation of skeletal remains from our previous excavation work.
  
  One of the central issues raised by reviewers has been a perceived need to excavate these features completely, totally exhuming all skeletal remains from them. Reviewers have written that it is necessary to identify every skeletal element that is present and account for any missing elements. On this point, we have both ethical and scientific differences from these reviewers. We express our ethical concerns first. Many of the best-preserved possible burials ever discovered by archaeologists were subjected to total excavation and exhumation. Cases like La Chapelle-aux-Saints, La Ferrassie, and Skhūl were fully excavated at a time when data recording and excavation methods did not include the range of spatial and geomorphological approaches that later became routine. The judgment of early investigators that these situations were intentional burials was challenged by later workers, and the kind of information that might enable better tests had been irrevocably lost (Gargett 1999; Dibble et al. 2015; Rendu et al. 2014).
  
  Later, improved excavation standards have not sufficed to remove uncertainty or debate about possible burials. For example, it was long presumed that well-preserved remains of young children were by themselves diagnostic of intentional burial, such as those from Dederiyeh, Border Cave, or Roc de Marsal. Such cases were also fully excavated, with adequate documentation of the positioning of skeletal remains and their surrounding stratigraphic situation, but such cases were later challenged on several bases and the complete exhumation of material has confused or precluded testing of new hypotheses (e.g. Gargett 1999). The case of Roc de Marsal is one in which data from the initial excavation combined with re-excavation and geoarchaeological analysis led to a naturalistic interpretation of the skeletal material (Sandgathe et al. 2011; Goldberg et al. 2017). But even in this case, the researchers erred in their interpretation of the skeleton’s situation due to a lack of identification of parts of the infant’s skeleton (Gómez-Olivencia and García-Martínez 2019). That is to say, it is not only the burial hypothesis but other hypotheses that suffer from complete excavation. Researchers concerned with preserving all possible information have sometimes taken extraordinary measures to remove and study possible burials at high resolution in the laboratory. Such was the case of the Shanidar IV burial removed from the site and transported in a plaster jacket by Solecki, which led to the disruption and loss of internal stratigraphic information (Pomeroy et al. 2020). Arguably, the current state of the art is full excavation with partial preparation, such as that undertaken at Panga ya Saidi (Martinón-Torres et al. 2021). But again, any future attempt to reinterpret or test the hypothesis of burial must rely on the adequacy of documentation as the original context has been removed.
  
  In our decision to leave material in place as much as possible, we are expanding upon standard practice to leave witness sections and unexcavated areas for future research. The situation is novel, representing possible burials by a nonhuman species, and that makes it doubly important in our opinion to be conservative in not fully exhuming the skeletal material from its context. We anticipate that many other researchers, including future investigators, will suggest additional methods to further test the hypothesis of burial, something that would be impossible if we had excavated the features in their entirety prior to publishing a description of our work. We believe strongly that our ethical responsibility is to publish the work and the most likely interpretation while leaving as much evidence in place as possible to enable further testing and replication. We welcome the suggestions of additional methods/analyses to test the H. naledi burial hypothesis.
  
  This being said, we also observe that total exhumation would not resolve the concerns raised by the reviewers. The recommendation of total exhumation is in pursuit of a full account of all skeletal material present and its preservation and spatial situation, in order to demonstrate that they conform to body positions comparable to human burials. As has been highlighted in forensic casework, the excavation of an inhumation feature does not necessarily provide an accurate spatial or anatomical manifest of the stratigraphical relationships between the body, encapsulating matrix, and any cut present due to preservational, taphonomic and operational factors (Dirkmaat and Cabo, 2016; Hunter, 2014). In particular, in cases where skeletal elements are highly fragmented, friable, or degraded (such as through bioerosion), complete excavation, even under controlled laboratory conditions, may destroy bone and severely limit skeletal identification (Henderson, 1987; Hochrein, 2002; Owsley and Compton, 1997), particularly in elements where the ratio of trabecular to cortical bone is high (Darwent and Lyman, 2002; Lyman, 1994). As such, non-invasive methods of 3D and 4D modelling (preservation in situ) are often considered preferable to complete necropsy or excavation (preservation by record) where appropriate (Bolliger and Thali, 2009; Dell’Unto and Landeschi, 2022; Randolph-Quinney et al., 2018; Silver, 2016).
  
  The test of burial is not primarily positional, but taphonomic and geological. The position and number of bones can elaborate on process-driven questions of decay and destruction in the burial environment, or post-mortem modification, but are not singularly indicative of whether the remains were intentionally buried; the post-mortem narrative of all the processes affecting the cadaveric island is required (Knüsel and Robb, 2016). In previous cases, researchers have disputed or accepted the hypothesis of intentional hominin burial based upon assumptions about how modern humans or Neandertals would have positioned bodies, with the idea that some positions reflect ritual intent while others do not. But applying such assumptions is unjustifiable, particularly for a species like H. naledi, whose culture may have differed fundamentally from our own. Our work acknowledges that the present evidence does not enable a full reconstruction of the burial positions, but it does show that fleshed remains were encased in sediment prior to decomposition of soft tissue, and that subsequent spatial changes can be most parsimoniously explained by natural decomposition within sedimentary matrix contained within a burial feature (after Green, 2022; Mickleburgh and Wescott, 2018; Mickleburgh et al., 2022). If the argument is that extraordinary claims require extraordinary evidence, we feel that the evidence documents excavation and interment (and will do so more clearly in the revision), and the fact that the remains do not match a “typical” human burial in body positioning is not in itself evidence that these are not H. naledi burials.
  
  We feel that the reviewers (in keeping with many palaeoanthropologists) have a clear idea of what they “think” a burial should look like in an idealised sense, but this platonic ideal of burial form is not matched by the extensive literature in archaeothanatology, funerary archaeology and forensic science, which indicates enormous variability in the activity, morphology and post-mortem system experienced by the human body in cases of interment and body disposal (e.g. Aspöck, 2008; Boulestin and Duday, 2005 and 2006; Connolly et al., 2005; Channing and Randolph-Quinney, 2006; Cherryson, 2008; Donnelly et al., 1999; Finley, 2000; Hunter, 2014; Parker Pearson, 1999; Randolph-Quinney, 2013). Decades of experience in the identification, recovery and interpretation of clandestine, deviant, and non-formal burials indicate that the platonic ideal is rare and, in many contexts, the exception (Cherryson, 2008; Parker Pearson, 1999). This variability is particularly relevant to morphological traits in burial context, such as the informal nature of the grave cut in plan and section, shallow burial depth, and initial disposition of the body (placement) during the early post-mortem period. These might run counter to the expectations of reviewers or others referencing the fossil hominin record, but are well accepted within the communities of researchers investigating Holocene archaeological sites and forensic contexts.
  
  It is encouraging to see reviewers beginning to incorporate the extensive (often experimentally derived) literature from archaeothanatology and forensic taphonomy in their deliberations, and we will be taking these comments on board going forward. In particular, we acknowledge reviewers’ comments and the need to construct a more detailed post-mortem narrative, accounting for joint disarticulation (labile versus persistent joints etc.), displacement, and final disposition of elements within the burial space. As such we will incorporate the hierarchy of decomposition (rank order disarticulation), associations between regions of anatomical association, areas of disassociation, and the voids produced during decomposition (after Mickleburgh and Wescott, 2018; Mickleburgh et al., 2022) into our narrative. In doing so we acknowledge the tensions between the inductive archaeothanatological narrative-driven approach (e.g. Duday, 2005 and 2009) and the robust decomposition data derived from human forensic taphonomic experimentation recently articulated by Schotsmans and colleagues (2022), noting that we will highlight comparative data based on forensic experimental casework and actualistic modelling over inductive intuitive approaches, which come with significant evidential shortcomings (Bristow et al. 2011).
  
  Finally, from a taphonomic perspective it is worth pointing out to reviewers that we have already addressed the lack of taphonomic evidence for carnivore involvement in the formation of the Dinaledi assemblage (Dirks et al., 2016). The absence of any carnivore-induced bone surface modifications or carnivore-like patterns of skeletal part representation, and the total absence of any carnivore remains within the Dinaledi Chamber (following Kuhn and colleagues, 2010), lead us to reject carnivores as possible vectors of body accumulation within the Dinaledi Chamber and Hill Antechamber.
  
  Reviewers suggest that without a date derived from geochronological methods, the engravings cannot be associated with H. naledi, and that it is possible (or probable) that the engravings were done in the recent past by H. sapiens. This suggestion neglects the context of the site. We have previously documented the structure and extremely limited accessibility of the Dinaledi subsystem. This subsystem was not recorded on maps of the documented Rising Star Cave system prior to our work and its discovery by our teams. Furthermore, there is no evidence of prehistoric human activity in the areas of the cave related to possible subterranean entrances. There is no evidence that humans in the past typically ventured into such extreme spaces as those of Rising Star. It is clear from the presence of the remains of many individuals that H. naledi ventured into these spaces again and again. It is likely that H. naledi moved through these spaces more easily than humans do based on their physique. We show that the engravings overlay each other, suggesting multiple engraving events. These engravings took time and effort, and the only evidence for use of the Dinaledi subsystem by any hominin is by H. naledi. The context leads to the null hypothesis that H. naledi made the marks. In our revision, we will elaborate on this argument to clarify the evidence for our stance on this hypothesis. Several reviewers took issue with the title of the engraving paper as we did not insert a qualifier in front of the suggested date range for the engravings. We deliberately left out qualifying language so that the title took the form of a testable hypothesis rather than a weak assertion. Should future work find the engravings were not produced within this time range, then we will restate this hypothesis.
  
  Finally, with regard to the engravings, we have chosen to report them because they exist. Not reporting the presence of engraved marks on the walls of a cave above hypothesized burials would be tantamount to leaving relevant evidence out of the description of an archaeological context. We recognize and state in our manuscript that these markings require substantial further study, including attempts at geochronological dating. But the current evidence is clearly relevant to the archaeological context of the subsystem. We take a similar stance with reporting the presence of the tool-shaped artefact near the hand of the H. naledi skeleton in the Hill Antechamber. It is evident that this object requires further study, as we stated in our manuscript, but again omitting it from our study would be leaving out relevant evidence.
  
  Some have suggested that the null hypothesis should be that all of these observed circumstances are of natural origin. Our team took this approach in our early investigation of the Dinaledi subsystem (Dirks et al. 2015). We adopted the null hypothesis that the geological processes involved in the accumulation of H. naledi skeletal remains were “natural” (e.g., non-naledigenic involvement), and we were able to reject many alternative explanations for the assemblage, including carnivore accumulation, “death trap” accumulation, and fluvial transport of bodies or bones (Dirks et al. 2015). This led us to the hypothesis that H. naledi were involved in bringing the bodies into the spaces where they were found. But we did not hypothesize their involvement in the formation of the deposit itself beyond bringing the bodies to the location.
  
  This approach seems conservative. It followed the traditional view that small-brained hominins do not engage in cultural practices. But we recognize in hindsight that this null hypothesis approach did harm to our analyses. It impeded us from recognizing within our initial excavations of the puzzle box area and other excavations between 2014 and 2017 that we might be encountering remains that were intrusive in the sedimentary floor of the chamber. If we had approached the accumulation of a large number of hominins from the perspective of the null hypothesis being that the situation was likely cultural, we perhaps would have collected evidence in a slightly different manner. We certainly note that if the Dinaledi system had been full of the remains of modern humans, there would have been little doubt that the null hypothesis would have been that this was a cultural space and not a “natural space”. We therefore respectfully disagree with the reviewers who continue to support the idea that we should approach hominin excavations with the null hypothesis that they will be natural (specifically non-cultural) in origin. If excavations continue with this mindset, we believe that potential cultural evidence is almost certain to be lost.
  
  There has been a gradient across paleoanthropological excavations, archaeological work, and forensic investigation, with increasing precision of context. The reality is that the recording precision and frame of approach are typically different in most paleontological excavations than in those related to contemporary human remains. If anything comes from the present discussion of whether the Dinaledi system is a burial site for H. naledi or not, we hope that by taking seriously the possibility of deep cultural dynamics of hominins, we will encourage other teams to meet the highest standards of excavation in order to preserve potential cultural evidence. Given H. naledi’s cranial capacity, we suggest that even very early hominin skeletal assemblages should be re-examined, if there is sufficient evidence or records available. These would include examples such as the A.L. 333 Au. afarensis site (the so-called First Family site in Hadar, Ethiopia), the Dikika infant skeleton, WT 15000 (Turkana Boy) and even A.L. 288 (Lucy), as such unusual taphonomic situations where skeletons are preserved cannot be simply explained away as “natural” in origin, based solely on the cranial capacity and assumed lack of cognitive and cultural complexity of the hominins, as emphasized by us in Fuentes et al. (2023). We are not the first to observe that some very early hominin situations may represent early mortuary activity (Pettitt 2013), but we would advocate a step further. We suggest it may be damaging to take “natural accumulation” as the standard null hypothesis for hominin paleoanthropology, and that it is more conservative in practice to engage remains with the null hypothesis of possible cultural formation.
  
  We are deeply grateful for the time and effort all of the 8 reviewers (across three reviews) have taken with this work. We also acknowledge the anonymous reviewers from previous submissions, whose opinions and comments will have made the final iterations of these manuscripts better. As this process is rather public and includes commentary outside of the eLife forum, we ask that the efforts of all 37 authors and 8 reviewers involved be respected and that the discourse remain professional in all venues as we study this fascinating and quite complex occurrence. We also appreciate the efforts of members of the public who have engaged with this relatively new process, in which preprints are posted prior to the reviews, allowing comments and interactions from colleagues and members of the public who are normally not part of the internal peer review process. We believe these interactions will make for better final papers. We feel we have met the standards of demonstrating burials in H. naledi and that the engravings are most likely associated with H. naledi. However, given the reviews, we see many areas where our clarity, context, and analyses were less strong than they can be. With the clarifications and additions taken on board through these review processes, the final papers will be stronger and clearer. We recognize that this is an ongoing process of scientific investigation and that further work will allow continued, and possibly better, evaluation of these hypotheses and others.
  
  Lee R Berger, Agustín Fuentes, John Hawks, Tebogo Makhubela
  
  Works cited:
  
  Aspöck, E. (2008). What Actually is a ‘Deviant Burial’?: Comparing German-Language and Anglophone Research on ‘Deviant Burials.’ In E. M. Murphy (Ed.). Deviant Burial in the Archaeological Record. Oxford: Oxbow Books. pp 17–34.
  
  Bolliger, S.A. & Thali, M.J. (2009). Thanatology. In S.A. Bolliger and M.J. Thali (eds) Virtopsy Approach: 3D Optical and Radiological Scanning and Reconstruction in Forensic Medicine. Boca Raton: CRC Press. pp 187-218.
  
  Boulestin, B. & Duday, H. (2005). Ethnologie et archéologie de la mort: de l’illusion des références à l’emploi d’un vocabulaire. In: C. Mordant and G. Depierre (eds) Les Pratiques Funéraires à l’Âge du Bronze en France. Actes de la table ronde de Sens-en-Bourgogne. Paris: Éditions du Comité des Travaux Historiques et Scientifiques. pp. 17–30.
  
  Boulestin, B. & Duday, H. (2006). Ethnology and archaeology of death: from the illusion of references to the use of a terminology. Archaeologia Polona 44: 149–169.
  
  Bristow, J., Simms, Z. & Randolph-Quinney, P.S. (2011). Taphonomy. In S. Black and E. Ferguson (eds.) Forensic Anthropology 2000-2010. Boca Raton, FL: CRC Press. pp 279-318.
  
  Channing, J. & Randolph-Quinney, P.S. (2006). Death, decay and reconstruction: the archaeology of Ballykilmore Cemetery, County Westmeath. In J. O’Sullivan and M. Stanley (eds.) Settlement, Industry and Ritual: Archaeology. National Roads Authority Monograph Series No. 3. Dublin: NRA/Four Courts Press. pp 113-126.
  
  Cherryson, A. K. (2008). Normal, Deviant and Atypical: Burial Variation in Late Saxon Wessex, c. AD 700–1100. In E. M. Murphy (Ed.). Deviant Burial in the Archaeological Record. Oxford: Oxbow Books. pp 115–130.
  
  Connolly, M., F. Coyne & L. G. Lynch (2005). Underworld : Death and Burial in Cloghermore Cave, Co. Kerry. Bray, Co. Wicklow: Wordwell.
  
  Darwent, C. M. & R. L. Lyman (2002). Detecting the postburial fragmentation of carpals, tarsals and phalanges. In M. H. Sorg and W. D. Haglund (eds). Advances in Forensic Taphonomy: Method, Theory and Archeological Perspectives. Boca Raton, FL, CRC Press. pp 355-378.
  
  d’Errico, F., & Backwell, L. (2016). Earliest evidence of personal ornaments associated with burial: The Conus shells from Border Cave. Journal of Human Evolution, 93, 91–108.
  
  De Villiers, H. (1973). Human skeletal remains from Border Cave, Ingwavuma District, KwaZulu, South Africa. Annals of the Transvaal Museum, 28(13), 229–246.
  
  Dell’Unto, N. and Landeschi, G. (2022). Archaeological 3D GIS. London: Routledge.
  
  Dibble, H. L., Aldeias, V., Goldberg, P., McPherron, S. P., Sandgathe, D., & Steele, T. E. (2015). A critical look at evidence from La Chapelle-aux-Saints supporting an intentional Neandertal burial. Journal of Archaeological Science, 53, 649–657.
  
  Dirkmaat, D. C., & Cabo, L. L. (2016). Forensic archaeology and forensic taphonomy: basic considerations on how to properly process and interpret the outdoor forensic scene. Academic Forensic Pathology 6, 439–454.
  
  Dirks, P. H., Berger, L. R., Roberts, E. M., Kramers, J. D., Hawks, J., Randolph-Quinney, P. S., Elliott, M., Musiba, C. M., Churchill, S. E., de Ruiter, D. J., Schmid, P., Backwell, L. R., Belyanin, G. A., Boshoff, P., Hunter, K. L., Feuerriegel, E. M., Gurtov, A., Harrison, J. du G., Hunter, R., … Tucker, S. (2015). Geological and taphonomic context for the new hominin species Homo naledi from the Dinaledi Chamber, South Africa. ELife, 4, e09561.
  
  Dirks, P.H.G.M., Berger, L.R., Hawks, J., Randolph-Quinney, P.S., Backwell, L.R., and Roberts, E.M. (2016). Comment on “Deliberate body disposal by hominins in the Dinaledi Chamber, Cradle of Humankind, South Africa?” [J. Hum. Evol. 96 (2016) 145-148]. Journal of Human Evolution 96: 149-153.
  
  Dirks, P. H., Roberts, E. M., Hilbert-Wolf, H., Kramers, J. D., Hawks, J., Dosseto, A., Duval, M., Elliott, M., Evans, M., Grün, R., Hellstrom, J., Herries, A. I., Joannes-Boyau, R., Makhubela, T. V., Placzek, C. J., Robbins, J., Spandler, C., Wiersma, J., Woodhead, J., & Berger, L. R. (2017). The age of Homo naledi and associated sediments in the Rising Star Cave, South Africa. ELife, 6, e24231.
  
  Donnelly, S., C. Donnelly & E. Murphy (1999). The forgotten dead: The cíllíní and disused burial grounds of Ballintoy, County Antrim. Ulster Journal of Archaeology 58, 109-113.
  
  Duday, H. (2005). L’archéothanatologie ou l’archéologie de la mort. In: O. Dutour, J.-J. Hublin and B. Vandermeersch (eds) Objets et Méthodes en Paléoanthropologie. Paris: Comité des Travaux Historiques et Scientifiques. pp. 153–215.
  
  Duday, H. (2009). Archaeology of the Dead: Lectures in Archaeothanatology. Oxford: Oxbow Books.
  
  Finley, N. (2000). Outside of life: Traditions of infant burial in Ireland from cillin to cist. World Archaeology 31, 407-422.
  
  Gargett, R. H. (1999). Middle Palaeolithic burial is not a dead issue: The view from Qafzeh, Saint-Césaire, Kebara, Amud, and Dederiyeh. Journal of Human Evolution, 37(1), 27–90.
  
  Goldberg, P., Aldeias, V., Dibble, H., McPherron, S., Sandgathe, D., & Turq, A. (2017). Testing the Roc de Marsal Neandertal “Burial” with Geoarchaeology. Archaeological and Anthropological Sciences, 9(6), 1005–1015.
  
  Gómez-Olivencia, A., & García-Martínez, D. (2019). New postcranial remains from the Roc de Marsal Neandertal child. PALEO. Revue d’archéologie Préhistorique, 30–1, 30–1.
  
  Green, E.C. (2022). An archaeothanatological approach to the identification of late Anglo-Saxon burials in wooden containers. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 436-455.
  
  Henderson, J. (1987). Factors determining the state of preservation of human remains. In A. Boddington, A. Garland and R. Janaway (eds). Death, Decay and Reconstruction: Approaches to Archaeology and Forensic Science. Manchester: Manchester University Press. pp 43-54.
  
  Hunter, J. R. (2014). Human remains recovery: archaeological and forensic perspectives. In C. Smith (ed). Encyclopedia of Global Archaeology. New York: Springer New York. pp 3549-3556.
  
  Hochrein, M. (2002). An Autopsy of the Grave: Recognizing, Collecting and Preserving Forensic Geotaphonomic Evidence. In M. H. Sorg and W. D. Haglund (eds). Advances in Forensic Taphonomy: Method, Theory and Archeological Perspectives. Boca Raton, FL, CRC Press: 45-70.
  
  Knüsel, C.K. & Robb, J. (2016). Funerary taphonomy: An overview of goals and methods. Journal of Archaeological Science: Reports 10, 655-673.
  
  Kuhn, B.F., Berger, L.R. & Skinner, J.D. (2010). Examining criteria for identifying and differentiating fossil faunal assemblages accumulated by hyenas and hominins using extant hyenid accumulations. International Journal of Osteoarchaeology 20, 15-35.
  
  Lyman, R. (1994). Vertebrate Taphonomy. Cambridge, Cambridge University Press.
  
  Martinón-Torres, M., d’Errico, F., Santos, E., Álvaro Gallo, A., Amano, N., Archer, W., Armitage, S. J., Arsuaga, J. L., Bermúdez de Castro, J. M., Blinkhorn, J., Crowther, A., Douka, K., Dubernet, S., Faulkner, P., Fernández-Colón, P., Kourampas, N., González García, J., Larreina, D., Le Bourdonnec, F.-X., … Petraglia, M. D. (2021). Earliest known human burial in Africa. Nature, 593(7857), 7857.
  
  Mickleburgh, H.L & Wescott, D.J. (2018). Controlled experimental observations on joint disarticulation and bone displacement of a human body in an open pit: implications for funerary archaeology. Journal of Archaeological Science: Reports 20: 158-167.
  
  Mickleburgh, H.L., Wescott, D.J., Gluschitz, S. & Klinkenberg, V.M. (2022). Exploring the use of actualistic forensic taphonomy in the study of (forensic) archaeological human burials: An actualistic experimental research programme at the Forensic Anthropology Center at Texas State University (FACTS), San Marcos, Texas. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 542-562.
  
  Owsley, D. & B. Compton (1997). Preservation in late 19th Century iron coffin burials. In W. Haglund and M. Sorg (eds). Forensic Taphonomy: The Postmortem Fate of Human Remains. Boca Raton, FL, CRC Press: 511-526.
  
  Parker Pearson, M. (1999). The Archaeology of Death and Burial. College Station: Texas A&M University Press.
  
  Pettitt, P. (2013). The Palaeolithic Origins of Human Burial. Routledge.
  
  Pomeroy, E., Bennett, P., Hunt, C. O., Reynolds, T., Farr, L., Frouin, M., Holman, J., Lane, R., French, C., & Barker, G. (2020). New Neanderthal remains associated with the ‘flower burial’ at Shanidar Cave. Antiquity, 94(373), 11–26.
  
  Randolph-Quinney, P.S. (2013). From the cradle to the grave: the bioarchaeology of Clonfad 3 and Ballykilmore 6. In N. Brady, P. Stevens and J. Channing (eds.). Settlement and Community in the Fir Tulach Kingdom. Dublin: National Roads Authority Press. pp A2.1-48.
  
  Randolph-Quinney, P.S., Haines, S. and Kruger, A. (2018). The use of three-dimensional scanning and surface capture methods in recording forensic taphonomic traces: issues of technology, visualisation, and validation. In: W.J. M. Groen and P. M. Barone (eds). Multidisciplinary Approaches to Forensic Archaeology. Berlin: Springer International Publishing, pp. 115-130.
  
  Rendu, W., Beauval, C., Crevecoeur, I., Bayle, P., Balzeau, A., Bismuth, T., Bourguignon, L., Delfour, G., Faivre, J.-P., Lacrampe-Cuyaubère, F., Tavormina, C., Todisco, D., Turq, A., & Maureille, B. (2014). Evidence supporting an intentional Neandertal burial at La Chapelle-aux-Saints. Proceedings of the National Academy of Sciences, 111(1), 81–86.
  
  Sandgathe, D. M., Dibble, H. L., Goldberg, P., & McPherron, S. P. (2011). The Roc de Marsal Neandertal child: A reassessment of its status as a deliberate burial. Journal of Human Evolution, 61(3), 243–253.
  
  Silver, M. (2016). Conservation Techniques in Cultural Heritage. In E. Stylianidis and F. Remondino (eds) 3D Recording, Documentation and Management of Cultural Heritage. Dunbeath: Whittles Publishing. pp 15-106.
  
  Schotsmans, E.M.J., Georges-Zimmermann, P., Ueland, M. and Dent, B.B. (2022). From flesh to bone: Building bridges between taphonomy, archaeothanatology and forensic science for a better understanding of mortuary practices. In C.J. Knüsel and E.M.J. Schotsmans (eds.) The Routledge Handbook of Archaeothanatology. London: Routledge. pp 501-541.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.01.543127v2
www.biorxiv.org www.biorxiv.org

New submission 10/07/2023, 10:20:14

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response:
  
  We thank eLife and the reviewer for the nice summary of our manuscript. We largely agree with the summary and review, and just add a few small points.
  
  First, the review asks about the reproducibility of our findings, and suggests that they are only from a single experiment. In fact, our manuscript reports data from two independent single-cell experiments: one performed at low multiplicity of infection (MOI), and another at higher MOI. The broad trends, including the lack of strong correlations between viral mRNA transcription and progeny production, are consistent across both experiments.
  
  Second, the reviewer asks what happens when two different virions bearing the same viral barcode infect two different cells, given that we estimate 4-8% of barcodes to be shared between multiple infecting virions. When two cells are infected by different virions with the same barcode, this breaks the one-to-one link between transcription in that cell and progeny in the supernatant, since it is not possible to determine which cell contributed the progeny with that barcode. As a result, 4-8% of the points on our correlation plots could be affected, so a few outliers should be expected. Another scenario, where a single cell is infected by two barcodes, is not problematic for our method because we can simply sum the progeny output for both barcodes from that cell.
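  The bookkeeping described in this paragraph can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the input format (a list of `(cell_id, barcode)` infection calls plus a dictionary of progeny counts per barcode) and the function name `progeny_per_cell` are hypothetical.

```python
from collections import defaultdict


def progeny_per_cell(infections, progeny_counts):
    """Assign supernatant progeny counts to cells via viral barcodes.

    infections: iterable of (cell_id, barcode) pairs (hypothetical input format).
    progeny_counts: dict mapping barcode -> progeny count in the supernatant.
    Returns (per_cell, ambiguous): summed progeny per cell, and the set of
    barcodes seen in more than one cell, which cannot be assigned to a cell.
    """
    cells_by_barcode = defaultdict(set)
    for cell, barcode in infections:
        cells_by_barcode[barcode].add(cell)

    # A barcode found in several cells breaks the one-to-one link.
    ambiguous = {b for b, cells in cells_by_barcode.items() if len(cells) > 1}

    per_cell = defaultdict(int)
    for barcode, cells in cells_by_barcode.items():
        if barcode in ambiguous:
            continue  # progeny with this barcode cannot be attributed
        (cell,) = cells
        # Several barcodes in one cell are fine: their progeny are summed.
        per_cell[cell] += progeny_counts.get(barcode, 0)
    return dict(per_cell), ambiguous
```

  In this sketch, cells whose only barcodes are shared with other cells are simply left out of the per-cell result; a real analysis might instead keep them and flag them as possible outliers.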
  
  Finally, the reviewer notes that some cells appear to produce progeny virions despite failing to express one or more viral genes. Such cells can be explained in one of two ways. First, as noted immediately above, we expect a small fraction (4-8%) of the points to be erroneous due to a lack of a guaranteed one-to-one link between cell and progeny for non-unique barcodes. Second, in some cases the missing viral gene could be a technical artifact caused by a stochastic failure to capture modestly expressed transcripts from the gene; this phenomenon, known as gene dropout, occurs at a fairly high rate in single-cell experiments (see Qiu, Nature Communications, 2020, for a detailed discussion). Genes that are expressed at lower levels, like the influenza virus polymerase genes, are more likely to be missed during single-cell RNA sequencing. The absent viral genes in each infected cell can be explored in detail using the interactive plots at https://jbloomlab.github.io/barcoded_flu_pdmH1N1/
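  To see why lowly expressed genes are more prone to dropout, consider a deliberately simple model. It is an assumption made for illustration, not the model used by the authors or in the cited discussion: each transcript molecule is captured independently with probability `capture_rate`, so a gene with `n_transcripts` molecules goes undetected with probability (1 - capture_rate)^n_transcripts. The 10% capture rate and the transcript counts below are hypothetical.

```python
def dropout_probability(n_transcripts, capture_rate):
    """Probability that none of n transcripts is captured.

    Assumes every molecule is captured independently with the same
    probability, which is a simplifying assumption for illustration.
    """
    if not 0.0 <= capture_rate <= 1.0:
        raise ValueError("capture_rate must be between 0 and 1")
    if n_transcripts < 0:
        raise ValueError("n_transcripts must be non-negative")
    return (1.0 - capture_rate) ** n_transcripts


# Hypothetical expression levels: the fewer molecules a gene contributes,
# the more likely it is to be missed entirely.
for n in (5, 50, 500):
    print(n, dropout_probability(n, capture_rate=0.10))
```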
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.08.30.505828v2
www.biorxiv.org www.biorxiv.org

New submission 04/07/2023, 10:06:08

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response:
  
  The following is the authors’ response to the original reviews.
  
  Major Revisions:
  
  1) Although we appreciate this work was carried out independently, it would improve this paper if the structure presented here was compared to the recently published structure of Cx43 (Nat Commun 14, 931 (2023)), with the conclusions added to the discussion.
  
  We encourage the readers to read both our study on Cx43 and the one mentioned by the reviewer. However, we believe the optimal format for such a comparison is going to be a more comprehensive review article, which is outside the scope of our study.
  
  2) Please elaborate on the lipid-binding pockets observed for lipid 1, lipid 2, and the N-lipid/PGL. For example, what are the residues involved in these lipid-protein interactions? Are these residues conserved in other connexin isoforms? Do these lipid-binding pockets match with previous structures, including the recent Cx43 structure? Please clarify what lipid sites are ambiguous due to insufficient resolution.
  
  Within the scope of our study, we have shown that some of the disease-linked residues are located in close proximity to the lipid sites (Fig. 4b). This suggests a possible role of the lipid sites in diseases associated with Cx43 mutations (and possibly with the mutations in other connexins, as the structures of other connexin channels also feature bound lipids inside the pore region). We feel that a more in-depth comparison will require a careful study, beyond the analysis that we have performed here, and for this reason we would like to reserve such a detailed comparison for our future work (possibly a comprehensive review article on connexin structure and function).
  
  3) The NT domain and TM2 segments are referred to as the gate region. If there is no strong evidence to support this claim then please use "putative" gate region.
  
  We have updated the text accordingly, referring to this region as a putative gate region where appropriate.
  
  4) It is mentioned that there is a reorientation of extracellular loops 1 and 2 after gap junction formation. Based on their structures, I wonder how this rearrangement alters the channel conduction pathway. For example, do the electrostatic surface and hydrophobic properties change? Please consider adding further details, as this information could be useful to understand why some properties of hemichannels differ from intercellular GJ channels.
  
  We have updated Fig. 5 with an illustration of the Cx43 HC surface coloured according to electrostatic potential (to match the same representation of the Cx43 GJC). The rearrangement of extracellular loops 1 and 2 does not dramatically alter the electrostatic properties of the HC relative to the GJC. A more obvious difference is in the local environment of the ECLs: it is radically different in a “free” HC (exposed to the solvent or to the extracellular space of a cell), compared to the ECL environment in a connexon within a GJC (which is sealed by a docked connexon from the opposite membrane).
  
  5) Related to the previous point, the pore profile shown in Figure 5C shows that there is a constriction site in the extracellular part with the same diameter as the observed constriction caused by the NT domain. This constriction point seems to be associated with the high energies calculated for Cl-. Please clarify if this constriction is produced by the formation of the GJC or is also present in HC?
  
  This is the same constriction zone, and the Cl- barriers are further down the channel axis where the electrostatic potential of the protein is negative. We have included a similar calculation for the HC simulation in Fig. 5 (revised Fig. 5f).
  
  6) Related to the MD simulations shown in Figure 5d: if the voltage is applied across the whole GJC, the free energy under voltage should not be symmetric. Please clarify.
  
  The symmetry observed in the free energies is due to the fact that the ions enter and exit from the same hemichannel. Only at very high voltages do we observe some rare full GJC permeation events, which slightly unbalance the free energy at 500 mV.
  
  7) The scheme in Figure 6 may need further editing. The authors propose a putative closed state in which lipids are bound next to the NT, but we suggest it should be made clearer in the figure that this is a putative model, since there is no functional evidence supporting the role of these lipids in the gating/permeation properties of Cx43. Also, please clarify what is meant by a "semi-permeable gate": a channel that only permeates ions but not molecules?
  
  We have updated the legend of Figure 6 to clearly reflect that this is a putative model. The “semi-permeable” state of the channel was suggested previously by the authors of the Cx31.3 study, and we refer to that structure in the figure.
  
  Minor comments:
  
  1) In the results section there are some statements that currently lack solid experimental support. Please consider editing or moving this text to the discussion section only. A good example of this is the Disease-linked mutation section, specifically lines 199-206. In another example, in lines 237-238 the authors state that the NT can move laterally and vertically, but this idea still requires experimental validation.
  
  We feel that the original formulations of these portions of the text are appropriate. Disrupting them would interrupt the flow of the manuscript, and we prefer to stay with the original text in this case.
  
  2) Line 283. "With these structures in mind, we can now establish the existence of several structurally defined gating substates of the connexin channels". Please, tone down this statement. Replace "establish" with "propose" or another more appropriate word.
  
  We have updated the text as suggested (“propose” instead of “establish”).
  
  3) Line 313-314. "The presence of such molecules could have important implications for HC or GJC assembly, substrate permeation, and molecular gating". Currently, this entire statement does not have any support. Is there any paper that the authors can discuss to suggest, with some basis, that lipids might have a role in assembly, permeation or gating?
  
  We feel that this statement is sufficiently careful, conveying a thought that the presence of such molecules could have important implications for various HC- or GJC-related processes. It is not a particularly strong claim and seems to be appropriate in this context.
  
  4) It seems that the structures in panels A and C of Figure 2 are shown in opposite orientations, which makes the figure confusing. If needed, please rotate the structure in panel A to show the cytosolic part of the protein as in panel C. Also, in the same figure, panels G and F are wrongly labeled. Please correct.
  
  For Fig. 2a, the angle is very different from anything else we show in the figure, so we would rather keep this as it is now. We have corrected the labelling for Fig. 2g-h.
  
  5) Check spelling mistakes in the legend of Extended data Fig.2, Extended data Fig.9, and line 243.
  
  We are grateful to the reviewers for pointing out the typos, which have now been corrected.
  
  6) The colors for G-L isoforms are not specified in Extended Data Fig.10. Please correct this.
  
  We updated the figure, removing the PGL label (the correct label is “lipid-N”).
  
  7) It is not clear what the difference is between PGL and the N-lipid density. Does PGL refer to the lipid-like density observed in nanodiscs, as indicated in Extended Fig. 4 and 10? Please clarify this issue in the manuscript.
  
  The labeling has been corrected in line with the revised version of the manuscript (this density element is now referred to as the “lipid-N”).
  
  8) Page 7 line 234-235 "The pore opening has a solvent-accessible radius of ~6Å (Figure 5c) very close to the effective hydrated radius of K+ (~6.6 Å) and Cl- (~7.2 Å). This makes it the most narrow pore opening...", it should be diameter, not radius.
  
  We have added a calculation for the HC (new Fig. 5f) and corrected the text as follows (line 234):
  
  “The pore opening observed in our cryo-EM structures has a solvent-accessible radius of ~3 Å (Figure 2b). This makes it the most narrow pore opening observed for a connexin channel to date (a comparison of the pore openings in the cryo-EM structures of connexin channels is shown in Extended Data Fig. 12). However, the average solvent-accessible radius of the pore during molecular dynamics was ~6 Å (Figure 5c); note that the effective hydrated radius of K+ and Cl- is ~3.3 Å and ~3.6 Å, respectively.”
  
  And line 277:
  
  “The average pore radius during the simulations was consistent with that observed in the cryo-EM structure (Fig. 5f).”
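  The radius-versus-diameter correction comes down to a simple comparison, sketched here with the values quoted in the revised text (cryo-EM pore radius ~3 Å, average pore radius during molecular dynamics ~6 Å, hydrated radii ~3.3 Å for K+ and ~3.6 Å for Cl-). The hard-sphere criterion used below, where an ion fits if its hydrated radius does not exceed the pore radius, is a simplifying assumption for illustration; real permeation also depends on electrostatics and partial dehydration of the ion.

```python
# Values in angstroms, taken from the quoted revised manuscript text.
HYDRATED_RADIUS_A = {"K+": 3.3, "Cl-": 3.6}
PORE_RADIUS_A = {"cryo-EM structure": 3.0, "MD average": 6.0}


def fits(ion, pore_radius):
    """Hard-sphere check (illustrative assumption): compare radius to radius."""
    return HYDRATED_RADIUS_A[ion] <= pore_radius


for state, pore_radius in PORE_RADIUS_A.items():
    for ion in HYDRATED_RADIUS_A:
        verdict = "fits" if fits(ion, pore_radius) else "does not fit"
        print(f"{ion} (r = {HYDRATED_RADIUS_A[ion]} A) {verdict} "
              f"in the {state} pore (r = {pore_radius} A)")
```

  Comparing like with like matters here: the ~6.6 Å and ~7.2 Å figures in the original sentence are twice the hydrated radii quoted in the revision, that is, diameters, so setting them against a pore radius would understate how easily the ions pass.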
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.03.26.485947v3
www.biorxiv.org www.biorxiv.org

New submission 09/07/2023, 09:29:19

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  In this study, Shin and colleagues investigate the role of the posttranslational modification of the DNA methyltransferase DNMT1 by covalent linkage of N-acetylglucosamine (O-GlcNAc).
  
  The authors present compelling evidence showing that a prolonged high fat/sucrose diet causes global protein O-GlcNAcylation in the liver and DNMT1 is among the proteins that increase their O-GlcNAc level. This result is significant because of the paucity of in vivo data addressing the interplay between metabolism and protein O-GlcNAcylation. The paper also shows that DNMT1's O-GlcNAcylation level correlated to the extracellular glucose levels in other cell types.
  
  Using mass spectrometry, the authors identify S878 as the main site for O-GlcNAcylation. It is noteworthy that the mapping was performed with hyper-O-GlcNAcylated cells and may be different in a physiological situation. To investigate how O-GlcNAcylation of S878 of DNMT1 impacts its activity and ultimately DNA methylation patterns, Shin and colleagues mostly use a cellular model of hyper O-GlcNAcylation induced by the combination of high glucose and a chemical inhibitor of OGA (the only enzyme responsible for O-GlcNAc removal). The data shows that increased O-GlcNAcylation resulting from the combination of high glucose and OGA inhibition causes a reduction of DNMT1 activity and local loss of DNA methylation specifically at partially methylated domains.
  
  This study brings completely new knowledge on the regulatory function of glycosylation of DNMT1 and its impact on its methyl-transferase activity and downstream genomic methylation. Furthermore, the manuscript introduces new data on the interplay between cellular metabolism and O-GlcNAcylation on DNMT1 and other proteins. The experiments are well-controlled, and their interpretation is sound. This study should be of special interest to the fields of fundamental and environmental epigenetics, as well as metabolism.
  
  The main limitation of the study is the convolution of the functional experiments where the perturbation is a combination of high glucose and chemical inhibition of OGA. The relative contribution of the two variables is partially addressed in Figure 3-figure supplement 1B which shows that high glucose increases DNMT1 activity (Hep3B cells) while Figure 3D shows that high glucose when combined with OGA inhibitor decreases DNMT1 activity (Hep3B cells). As discussed, the data suggest that high-glucose and OGA inhibition may have an antagonistic effect on DNMT1 activity. An experiment of treatment of the cells with the OGA inhibitor in physiological glucose conditions would address this gap of knowledge.
  
  We thank the reviewer for the suggestion. Physiological glucose levels are between 5 and 7 mM, and 25 mM is in the hyperglycemic range, which corresponds to severe diabetes. The new Figure 1A shows TMG treatment under physiological glucose conditions. We have included new WB data for 5 mM glucose, 5 mM glucose + TMG, 25 mM glucose, and 25 mM glucose + TMG (Figure 1A).
  
  To understand the impact of the environment (in this study: extracellular glucose level) on the epigenome, one should keep in mind the variation of cytosine methylation patterns between individuals and over time. A recent large-scale profiling of DNA methylation of 137 individuals shows a near absence of individual variation between replicates of the same cell type, suggesting that genomic methylation patterns are largely insensitive to the environment (https://doi.org/10.1038/s41586-022-05580-6).
  
  Comparative methylomes of healthy and diabetic individuals are needed to examine the medical significance of the findings presented here. It is possible that the modulation of DNMT1 activity by O-GlcNAc modification is relevant for a specific cell type or developmental stage that remains to be discovered.
  
  We thank the reviewer for the suggestion. While the present study is focused on the functional impact of glucose concentrations on O-GlcNAcylation of DNMT1, the extension of this work to diabetic individuals is a goal for a follow up project.
  
  Reviewer #2 (Public Review):
  
  I've read the manuscript by Shin et al with great interest. The authors describe the identification of O-GlcNAcylation of DNMT1 and the impact this modification has on the maintenance activity of DNMT1 genome-wide and that modification of S878 leads to enzyme inhibition. The manuscript is written in a clear and understandable way making it easy for the reader to understand the logic as well as the steps of the experimental approach.
  
  The authors identify O-GlcNAcylation of DNMT1 in a number of different cell lines by combining inhibition studies and WB and further on they identify the modification sites with LC/MS, predictions, and mutational studies. I really like the experimental approach, which while being straightforward (albeit technically challenging), is powerful and well-controlled in this case to unequivocally prove the modification of DNMT1 and identify the site. However, mutation of the two identified modification sites does not remove all the O-GlcNAcylation signal associated with DNMT1, thus possibly not all the possible sites were identified. While this is not a criticism of this manuscript, it would be interesting to know what other sites are modified and the enzymatic/biological effects associated.
  
  We completely agree with the reviewer. As the O-GlcNAc band was also detected in double mutated DNMT1 (Figure 2D), it is expected that undetected O-GlcNAcylated sites exist. This is a limitation of current MS analysis: modified sites located at the N- and C-terminal ends of the protein, or around sites cut by an endoprotease such as trypsin, are known to be difficult to detect. In follow-up work we plan to detect more diverse O-GlcNAc modified sites using more types of endoproteases and to observe the corresponding changes in the phenotype of various cells.
  
  Also, the authors isolate the modified DNMT1 from cells using immunoprecipitation, which is indeed useful to study the changes in catalytic activity but does not provide any information if the cellular localisation of modified DNMT1 changes.
  
  We apologize for this oversight. We have added a DNMT1 localization assay via immunofluorescence (IF) in the revised manuscript (Figure 3—figure supplement 3). We found no difference in DNMT1 localization between wild type and S878A mutants.
  
  Subsequently, the authors checked the impact of high glucose diet on the genome-wide DNA methylation patterns. The observed effects (Fig 4A) are very strong, almost as strong as observed with Aza treatment and therefore I wonder if LINE/IAP or other elements are getting activated (as observed with genome-wide demethylation with Aza).
  
  We thank the reviewer for the suggestion. Changes in methylation of LINE-1 under hyperglycemic conditions are displayed in Figure 4—figure supplement 4. In the case of LINE-1, DNA methylation is lost globally under hyperglycemic conditions. While beyond the scope of this study, a more thorough examination of the impact of the observed loss of methylation under high glucose conditions is of interest.
  
  Do the authors see any changes in cell phenotype, slower/faster proliferation, or increased apoptosis due to the activation of mobile elements (not only ROS)?
  
  This is also a very interesting idea. We plan on further investigating this as part of a follow up study.
  
  Another point is that the S878A mutant seems not to be able to fully maintain the DNA methylation (Fig 4A). Does O-GlcNAcylation recruit any additional interactors? Given that the authors immunoprecipitated DNMT1 and use it for activity assay, it is possible, that the modification attracts an additional protein factor that could in turn inhibit DNMT1 activity (as observed). Therefore, the observed kinetic effect could be indirect, while still interesting and important, the mechanism of inhibition would be different.
  
  We thank the reviewer for the great suggestions. According to Figure 4A, in the case of mutated DNMT1, a slight methylation loss appears to occur in both conditions. There could be a number of reasons for this. It may be due to interacting proteins or it may be caused by some damage to DNMT1 itself. A further investigation of this is planned as a follow-up project.
  
  DNA methylation clock can be used to estimate the biological age of a tissue/cells. While not directly in the line of the manuscript, I was wondering if the DNA methylation changes in the high glucose diet would affect the methylation sites used for the DNAme clock. Meaning, would the cells/tissue epigenetically age faster when in high glucose media, and if the Ala mutant could provide resistance to that?
  
  We thank the reviewer for the interesting suggestion. We believe this is beyond the scope of this manuscript, but we'll consider this with interest in the future.
  
  In the discussion, the authors write that this is the first investigation of O-GlcNAcylation in relation to DNA methylation. While this is true for DNMTs, TET enzymes, which oxidise 5mC and trigger active DNA demethylation, have previously been shown to be modified as well.
  
  We have toned down the language throughout the revised manuscript. This is the first investigation into the maintenance of DNA methylation. Although there is a great deal of evidence regarding the important regulatory role of O-GlcNAcylation in gene regulation, a direct link with maintenance of DNA methylation has not previously been established.
  
  A nice and rigorous study, with important observations and connections to biological effects. It would be nice to prove that the effects are direct and not associated with other factors that could be recruited by the modification and impact the activity of DNMT1. I find it a bit surprising that phosphorylation of the target serine does not impact DNMT1 activity as well.
  
  We thank the reviewer for the positive comments and agree that there are many interesting avenues to follow up on this.
  
  Reviewer #3 (Public Review):
  
  The authors investigate the potential effect of O-GlcNAcylation on the activity of the DNA methyltransferase DNMT1.
  
  Some results that are convincingly obtained include:
  
  There is more overall O-GlcNAcylation when glucose concentration in the culture medium or the feed is high;
  
  DNMT1 is O-GlcNAcylated, and more so in high glucose or on rich chow;
  
  The position S878 can be O-GlcNAcylated;
  
  The activity of transfected DNMT1 is decreased in high glucose conditions. This effect is lessened when S878 is mutated to A or D.
  
  Some results that are suggested but not fully backed by experimental data include:
  
  This process happens to the endogenous protein under physiologically relevant conditions;
  
  We agree that we could not completely rule out endogenous DNMT1 in our experiments. We have adjusted the language in the revised manuscript to acknowledge this. However, we confirmed the change in activity of recombinant DNMT1 (Figure 3D), and also demonstrated the change in activity under physiological conditions (normal physiological glucose level vs hyperglycemic range) in Figure 3—figure supplement 1B. This result directly shows that the activity of DNMT1 changes under physiological conditions. In addition, DNA hypomethylation due to high glucose has been previously reported (Kandilya et al., 2020; Lan et al., 2016). Our results suggest a possible mechanism for this.
  
  Kandilya, D., Shyamasundar, S., Singh, D.K., Banik, A., Hande, M.P., Stunkel, W., Chong, Y.S., and Dheen, S.T. (2020). High glucose alters the DNA methylation pattern of neurodevelopment associated genes in human neural progenitor cells in vitro. Sci Rep 10, 15676.
  
  Lan, C.C., Huang, S.M., Wu, C.S., Wu, C.H., and Chen, G.S. (2016). High-glucose environment increased thrombospondin-1 expression in keratinocytes via DNA hypomethylation. Transl Res 169, 91-101 e101-103.
  
  This process is responsible for changes in DNA methylation, leading to changes in gene expression, leading to increased ROS and increased apoptosis.
  
  We confirmed that ROS levels increased under high glucose conditions through DCFH-DA fluorescence experiments (Figure 5A). In addition, γH2A.X fluorescence experiments showed that DNA damage was increased under high glucose conditions (Fig. 5B). On the other hand, in the case of the S878A mutant, DNA damage was reduced under hyperglycemic conditions compared to wild type DNMT1 despite an increase in ROS levels (Fig. 5B). Moreover, we verified that the DNA damage did not come from oxidative stress through 8-OHdG analysis (Figure 5—figure supplement 4). Therefore, DNA oxidative stress is suppressed by DNMT1 due to the increase of ROS under high glucose conditions. However, the reduction of DNA methylation by O-GlcNAcylation of DNMT1 induces apoptosis due to oxidative stress.
  
  Studying the connection between cellular metabolism and epigenetic phenomena is interesting. However, I feel that the article falls short of its aims because of the limits of the experimental system, some missing controls, and some data overinterpretation.
  
  We hope the reviewer finds our revised manuscript more suitable.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.05.11.491514v6
www.medrxiv.org www.medrxiv.org

New submission 09/07/2023, 09:24:37

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Overall, this manuscript exposes key gaps in patient care resulting from the pandemic, as well as the challenges and unmet needs felt by healthcare workers in cervical cancer screening. The authors’ findings on the struggles while regaining screening volume across the nation in a sustainable way, demonstrate that pre-existing weaknesses in the cancer control system were exacerbated by the pandemic and are integral to amend. The authors were able to identify these gaps in care and work environments through their synthesis of qualitative interviews. I applaud the use of such mixed methods, which emphasizes the complementary need for both quantitative and qualitative data. What could be better strengthened in the manuscript is the authors’ justification for statistical analyses within the context of the research question, and reporting of survey administration and management.
  
  The authors thank the reviewer for a thorough assessment of the manuscript. We have addressed the reviewer’s concerns regarding justification of statistical analyses in the Data Analysis, Quantitative survey data section, and reporting of survey administration and management in the Results, Quantitative survey data section.
  
  Reviewer #2 (Public Review):
  
  Fuzzell et al. conducted a mixed-method study looking into the possible impact of COVID-19 on clinician perceptions of cervical cancer screening. The authors examined how the pandemic-related staffing changes might have affected the screening and abnormal results follow-up during the period October 2021 through July 2022.
  
  They found that 80% of the clinicians experienced decreased screening during the start of the pandemic and that ≈67% reported a return to pre-pandemic levels. The general barriers for not returning to pre-pandemic levels were staffing shortages and problems with structural systems for tracking overdue patients and those in need of follow-up after abnormal screening tests.
  
  Strengths:
  
  There is a high focus on the consequences and the need for action to prevent the ongoing impact of COVID-19 on cervical cancer screening. Some of the actions mentioned by the authors could be the use of HPV self-sampling kits, and it is interesting to be provided knowledge on the clinicians' views on HPV self-sampling. Both are of high interest to the general population in the US. Throughout the discussion, the authors and their claims are supported by other studies.
  
  Weaknesses:
  
  The lack of a nationally representative sample, where 63% of the responding clinicians were practicing in the Northeast, limits the generalizability of the results found in the study. The overrepresentation of white females is not addressed in the discussion. This composition could have affected the results, especially when the authors report a need to look at higher salaries and better childcare to maintain adequate staffing.
  
  The conclusions are mostly supported by the data, however, some aspects of the data analysis need to be clarified.
  
  We thank the reviewer for their constructive feedback. Despite our best efforts, we were unable to recruit a sample more representative of all US regions. We note this limitation in the discussion: “Notwithstanding efforts to achieve a regionally diverse sample, 63% of responding clinicians were practicing in the Northeast at the time of their participation. Given that COVID-19 policies varied widely by state, this regional imbalance may limit the generalizability of our results. Despite the oversample of clinicians in the Northeast, region was not a significant predictor of either outcome.” Also, we acknowledge the high enrollment of White women in our provider sample and now address this point in the discussion: “Similarly, our sample was 85% female and 70% White. Although ideally we would have included a sample that was more diverse with respect to race and gender, these characteristics are not disparate from the majority of clinicians who perform cervical cancer screening (e.g., race: Women’s Health NPs [77% White], active Ob/Gyns [67% White], all active physicians [64% White]; gender: all NPs [92% female], Ob/Gyns [64% female], all active physicians [37% female]).” Data describing these characteristics are reported in the Association of American Medical Colleges (AAMC) 2022 Physician Specialty Data Report and Executive Summary, the 2018 NPWH Women’s Health Nurse Practitioner Workforce Demographics and Compensation Survey: Highlights Report, and a published paper describing the characteristics of nurse practitioners in the US, which are cited in text.
  
  Reviewer #3 (Public Review):
  
  This US study presents findings from an online survey and in-person interviews of healthcare providers regarding themes associated with cervical screening in federally qualified health centres (FQHCs). The study provides insights during the post-acute phase of the pandemic into a range of areas, including perceived changes in the provision of cervical cancer screening services and the impact of the pandemic, staffing and systems barriers to cervical cancer screening, strategies for tracking missed screens and catch-ups, follow-up of abnormal screening results, as well as attitudes towards HPV self-sampling. Results indicate persisting pandemic-related impacts on patient engagement and staffing, as well as system barriers to effective screening, catch-up of missed screens and follow-ups. Taken together, these issues may lead to increases in cervical cancer in the long term in populations serviced by these centres, if measures are not taken to adequately support them. Participants were recruited from various regions in the US; however, the study was not conducted using a nationally representative sample. Although the highlighted issues are informative, findings cannot be generalised and larger studies are warranted in the future to monitor cervical screening provision and outcomes in FQHCs.
  
  We thank the reviewer for their thorough assessment of the manuscript. In the discussion, we have made sure to note the non-nationally representative sample and need for continued monitoring of cervical cancer screening and related outcomes in underserved settings and communities.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.01.27.23285111v1
www.medrxiv.org

New submission 09/07/2023, 08:25:34

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public review):
  
  1) The systematic review includes data from some studies where PCOS is self-reported. While self-reported PCOS information has been found to be largely sensitive and specific, it would be of interest to know if prevalence ratios of mental health-related outcomes were impacted by self-reporting.
  
  Thank you for your insightful comment regarding the potential impact of self-reporting on the prevalence ratios of mental health-related outcomes in women with PCOS. We agree that this is an important factor to consider.
  
  In response, we have revisited all the studies included in our review. We have updated Supplemental Tables 2-4 to provide greater transparency and understanding. These revised tables now include a new column specifying the mental health assessment method used in each study. This update should allow for a more nuanced interpretation of the results, taking into account the potential impact of self-reporting.
  
  Furthermore, we conducted a sensitivity analysis by rerunning the meta-analysis to discern the potential influence of self-reported PCOS on our results, excluding the studies that relied solely on self-reported PCOS diagnosis. After we excluded studies where PCOS was self-reported, the point estimate for anxiety was similar whereas point estimates for depression and eating disorder were slightly higher but none of the estimates were different beyond chance compared to the original analysis. We believe these steps significantly strengthen the clarity and robustness of our findings (Line 314; Supplemental Tables 7 and 8).
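
A sensitivity analysis of this kind can be sketched as follows. This is a minimal illustration, not the authors' analysis: it assumes fixed-effect inverse-variance pooling on the log scale (the pooling model actually used is not stated in this response), and every study value and the `self_reported_pcos` flag below are hypothetical.

```python
import math

def pooled_ratio(studies):
    """Fixed-effect inverse-variance pooling of ratios on the log scale.

    Each study is (ratio, ci_low, ci_high), where the interval is a 95% CI.
    Returns (pooled ratio, pooled CI low, pooled CI high).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for ratio, ci_low, ci_high in studies:
        # Standard error of the log ratio, recovered from the 95% CI width.
        se = (math.log(ci_high) - math.log(ci_low)) / (2 * 1.96)
        weight = 1.0 / se ** 2
        weighted_sum += weight * math.log(ratio)
        weight_total += weight
    pooled_log = weighted_sum / weight_total
    pooled_se = math.sqrt(1.0 / weight_total)
    return (
        math.exp(pooled_log),
        math.exp(pooled_log - 1.96 * pooled_se),
        math.exp(pooled_log + 1.96 * pooled_se),
    )

# Hypothetical studies: (prevalence ratio, CI low, CI high, self_reported_pcos)
studies = [
    (1.40, 1.10, 1.78, False),
    (1.55, 1.20, 2.00, False),
    (1.30, 1.05, 1.61, True),
    (1.70, 1.15, 2.51, False),
]

# Primary estimate uses every study; the sensitivity estimate drops the
# studies that relied on self-reported PCOS.
all_est = pooled_ratio([s[:3] for s in studies])
sens_est = pooled_ratio([s[:3] for s in studies if not s[3]])
```

Comparing `all_est` with `sens_est`, and checking whether their intervals overlap, mirrors the comparison described above.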
  
  2) Likewise, the screening vs self-reported nature of the mental health disorders is not clear from the information included in the characteristics table.
  
  We have modified our Supplemental Tables 2-5 to include a column detailing the method of ‘Mental Health Assessment’. We should note that the majority of the studies directly assessed mental health using a variety of validated questionnaires. We have also included in the Discussion a section emphasizing that some of the studies included in the review relied on self-reported PCOS diagnosis and its potential impact. We also highlighted that while self-reported information is generally reliable, it is subject to potential bias that could impact the prevalence ratios of mental health-related conditions (Line 460).
  
  3) Calculated prevalence ratios were compared with prevalence values for the general population to determine the excess prevalence. However, the source of these general population statistics (i.e., whether these figures come from the control data in the included studies or other sources) is not clear.
  
  Thank you for raising this important point. We have now clarified in our Methods section that the general population statistics used for determining excess prevalence were derived from the control data in the included studies. We hope this provides the necessary transparency for our approach in calculating and interpreting the prevalence ratios (Line 210).
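
One plausible reading of "excess prevalence" can be written out as a short calculation. The formula below is an assumption for illustration (exposed prevalence taken as the prevalence ratio times the control prevalence); the response does not spell out the exact computation, and the input numbers are hypothetical.

```python
def excess_prevalence(prevalence_ratio, baseline_prevalence):
    """Excess prevalence in the exposed group over the baseline.

    Assumes exposed prevalence = prevalence_ratio * baseline_prevalence,
    so the excess is baseline_prevalence * (prevalence_ratio - 1).
    """
    return baseline_prevalence * (prevalence_ratio - 1.0)

# Hypothetical inputs, not taken from the study.
baseline = 0.10   # prevalence among controls
ratio = 1.5       # pooled prevalence ratio
excess = excess_prevalence(ratio, baseline)
```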
  
  4) The estimated costs for anxiety-, depression- and eating disorder-related care are accessed in published papers and used to calculate the excess costs. Conclusions would be strengthened by a defence of these figures, particularly for anxiety where the source paper is from 1999.
  
  Thank you for your insightful comment. We agree that providing a justification for our choice of cost estimates, especially for the anxiety care cost from a 1999 study, would strengthen our conclusions. The 1999 source was selected because it is a seminal study that offers a comprehensive breakdown of anxiety-related care costs. Despite its age, this paper is often cited in contemporary research due to its rigorous methodology and the granularity of its cost analysis. Adjusted for inflation, its findings still provide an insightful comparison point for current data. To ensure that these figures accurately represent present-day costs, we have adjusted them for inflation using the medical care inflation calculator. Our choice of these specific studies was based on their rigorous methodology, the detailed breakdown of costs, and their relevance to our targeted age groups. The aforementioned adjustments and justifications ensure that these figures aptly represent the present-day costs of treating these conditions.
  
  Similarly, the 2021 papers on depression and eating disorders present comprehensive and up-to-date analyses of the economic burdens associated with these conditions. These papers were selected for their rigorous methodologies, comprehensive cost breakdowns, and alignment with our age-specific focus. The Greenberg et al. (2021) paper, for example, is an authoritative source that provides detailed analysis on the economic burden of adults with major depressive disorder. Likewise, the paper by Streatfeild et al. (2021) offers a meticulous investigation into the socio-economic cost of eating disorders in the U.S., making it an apt choice for our study. We recognize the necessity of providing a robust justification for our choice of these particular papers, and we have added this justification to our Methods section to make our approach more transparent to readers (Line 225).
  
  5) An inflation tool is used to adjust the figure, but this does not take into account changes in treatment or practice since this estimate was made. The accuracy of these estimated figures is central to the final conclusions.
  
  Thank you for your valuable comment. We do note that the inflation figures used are a healthcare-specific inflation factor, as healthcare inflation differs from general consumer inflation. However, we agree that the inflation-adjusted figures do not necessarily account for changes in treatment practices since the original estimate was made, assuming these changes would alter the cost of care. We have added a discussion of this limitation in our manuscript and proposed future studies to validate these estimates using more recent data (Line 473).
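
The arithmetic behind an index-based inflation adjustment is a simple ratio, sketched below. The function name and index values are hypothetical placeholders, not figures from the medical care inflation calculator the authors used. The sketch also makes the reviewer's point concrete: the adjustment rescales price levels only and carries no information about changes in treatment practice.

```python
def adjust_for_inflation(cost, index_then, index_now):
    """Scale a historical cost by the ratio of price-index values."""
    return cost * (index_now / index_then)

# Hypothetical values for illustration only.
historical_cost = 1000.0   # per-patient cost in the source year
index_then = 250.0         # medical care price index in the source year
index_now = 500.0          # medical care price index in the target year
adjusted_cost = adjust_for_inflation(historical_cost, index_then, index_now)
```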
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.01.05.23284220v1
www.biorxiv.org

New submission 09/07/2023, 08:21:04

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  GSK3 is a multi-tasking kinase that recognises primed (i.e. phosphorylated) substrates. One of the mechanisms by which the activity of GSK3 can be regulated is through N-terminal (pSer9) phosphorylation. In this case, the phosphorylated N-terminus turns into a pseudo-substrate that occupies the substrate binding pocket and thus inhibits the activity of GSK3 towards its real substrates.
  
  One outstanding question is how this autoinhibitory mechanism can affect some, but not all signaling pathways that GSK3 is involved in. One example is WNT/CTNNB1 signaling. Here, GSK3 plays a central role in the turnover of CTNNB1 in the absence of WNT, but this pool of GSK3 is not affected by pSer9 phosphorylation.
  
  Gavagan et al. address this question using an in vitro approach with purified proteins. They identify a role for AXIN1 in protecting the "WNT signaling pool" of GSK3 from the auto-inhibition that occurs upon pSer9 phosphorylation.
  
  Specifically, they show that i) GSK3-pSer9 is less capable of binding and phosphorylating primed CTNNB1 - thus suggesting that GSK3-pSer9 does not contribute to WNT signaling, ii) in the presence of AXIN1, GSK3-pSer9 becomes more capable of binding and phosphorylating CTNNB1 - suggesting that Axin can promote binding of GSK3 and CTNNB1 even when the primed binding pocket on GSK3 is blocked initially, iii) AXIN1 specifically prevents the PKA mediated phosphorylation of GSK3B on pSer9 - while leaving the phosphorylation of other PKA substrates unaffected.
  
  Strengths:
  
  The authors use an in vitro system in which they can reconstitute different interactions and reactions using purified proteins, thus allowing them to zoom in on specific biochemical events in isolation.
  
  The authors measure the phosphorylation of primed substrates (pSer45-CTNNB1 or WNT-independent substrates) and quantify specific kinetic parameters (kcat, KM, and kcat/KM) of wildtype non-phosphorylated GSK3B, pSer9-GSK3B, or the non-phosphorylatable S9A-GSK3B, either in the presence or absence of AXIN1 (or an AXIN1 fragment).
  
  The experiments appear to be well-controlled and the results appear to be interpreted correctly.
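
For readers less familiar with the kinetic parameters named above, the sketch below shows how kcat, KM, and kcat/KM relate under the standard Michaelis-Menten rate law. All parameter values are hypothetical and are not the measured values from the manuscript.

```python
def mm_rate(substrate, enzyme_total, kcat, km):
    """Michaelis-Menten initial rate: v = kcat * [E] * [S] / (KM + [S])."""
    return kcat * enzyme_total * substrate / (km + substrate)

def catalytic_efficiency(kcat, km):
    """Catalytic efficiency kcat/KM, which governs the rate when [S] << KM."""
    return kcat / km

# Hypothetical parameter sets for an active and an inhibited enzyme form.
active = {"kcat": 2.0, "km": 1.0}
inhibited = {"kcat": 0.5, "km": 4.0}

# Fold change in catalytic efficiency between the two forms.
fold_change = catalytic_efficiency(**active) / catalytic_efficiency(**inhibited)

# At [S] = KM the rate is half of Vmax = kcat * [E].
half_max_rate = mm_rate(substrate=1.0, enzyme_total=1.0, **active)
```

Because the rate law is linear in `enzyme_total`, doubling the enzyme concentration doubles the rate at any fixed substrate concentration.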
  
  Weaknesses:
  
  Key experiments (e.g. Figures 2 and 3) are described as being performed as n=3 technical replicates rather than independent/biological replicates.
  
  We suggest that the replicates described in our work can properly be described as biological replicates, and we have updated the manuscript accordingly. We apologize for the confusion and elaborate on our reasoning below.
  
  Each replicate reported for our in vitro kinetic assays is an independent reaction prepared in a separate reaction vessel, and replicates were analyzed on separate gels. Thus, each reaction is a distinct biological sample and should have been described as a biological replicate. A technical replicate would have been repeat measurements of the same timepoint from a single reaction.
  
  Our original description as technical replicates was based on the notion that each replicate came from the same protein purification (biological sample). However, an analogy to cell culture experiments can illustrate why our initial reasoning was incorrect. In a cell culture experiment, cells from the same initial source are typically split into independent wells for biological replicates. Similarly, our proteins come from the same initial source but are split into independent reaction vessels for biological replicates.
  
  The critical point is that, regardless of the precise terminology, our replicates capture the variability between independent experiments.
  
  The validation in a biologically relevant setting (i.e. a cellular context) is limited to Figure 4C, which shows that over-expression of AXIN1 reduces the total levels of pSer9-GSK3.
  
  The biochemical experiments presented in our work address a critical gap in the signaling field and, together with the in vivo validation in Figure 4C, establish a model that was previously speculative. We suggest that further in vivo experiments are beyond the scope of the current manuscript.
  
  The authors convincingly show that AXIN1 can play a role in shielding GSK3 from auto-inhibition. As it stands, the impact of this work on the field of WNT/CTNNB1 signaling is likely to remain limited. This is mainly because the mechanism by which AXIN1 shields the WNT/CTNNB1 signaling pool of GSK3 from pSer9 inhibition remains unresolved. Based on the fact that a mini AXIN1 (i.e. an AXIN1 fragment) behaves the same as WT AXIN1, the authors conclude that AXIN1 likely causes allosteric changes on GSK3 but is less likely to block PKA from binding. They cannot conclusively show this, however, as they do not have evidence in favour of one or the other explanation.
  
  We thank the reviewer for this important comment which details the central concern raised in the review process. To address this point, we have collected additional biochemical data that conclusively shows that the Axin shielding effect is allosteric and not a steric block. We demonstrated that a minimal, 27 amino acid Axin peptide produces the same GSK3β shielding behavior as full length Axin and miniAxin. The minimal Axin peptide does not sterically occlude the GSK3β phosphorylation site. This data is included in a revised Fig 4A and described on lines 115-120 of the revised manuscript.
  
  However, this study does offer more insight into the compartmentalisation of GSK3 and the quantitative parameters may be used in computational models describing the different cellular activities of GSK3.
  
  This work also has conceptual significance: Scaffold proteins are known to promote signal transduction by bringing proteins together (often: kinases and substrates). Here, Gavagan et al. show that AXIN1 also plays a second role, namely in protecting one of its binding kinases (GSK3) from inhibitory signals. This could potentially hold for other scaffolding proteins as well.
  
  Reviewer #2 (Public Review):
  
  Gavagan et al. investigated the role of the scaffolding protein, Axin, in the cross-pathway inhibition of GSK3b. The authors utilize reconstituted Axin, b-catenin, GSK3b, and protein kinase A to test 2 models. In the first model, the formation of the complex consisting of Axin, b-catenin, and GSK3b overcomes inhibitory phosphorylation of serine 9 of GSK3b. In the second model, the binding of Axin to GSK3b inhibits serine 9 phosphorylation through allosteric effects. Previous literature has established that the phosphorylation of serine 9 of GSK3b inhibits its kinase activity. To provide a quantitative measure of inhibition, the authors determine the binding affinity and catalytic efficiency of GSK3b in comparison to GSK3b phosphoS9 towards b-catenin. Interestingly, the data demonstrate a 200-fold decrease in Kcat/Km and a 7-fold increase in Km. It is unclear why serine 9 mutation to alanine increases the rate of b-catenin phosphorylation more than the unphosphorylated GSK3b protein in figure S10.
  
  We thank the reviewer for catching this inconsistency. In the Michaelis-Menten plots presented in the main text (Figure 2 & Figure 3D), rates for unphosphorylated GSK3β and GSK3β_S9A are indistinguishable. These plots were used to determine the kinetic parameters reported in Table S1 (now Supplementary file 1a). The purpose of Figure S10 (now Figure 2-figure supplement 8) was to confirm that these reactions were first order (linear) in enzyme concentration, but the reviewer is correct to flag the inconsistency in absolute rates. In Figure S10A (now Figure 2-figure supplement 8A), the rates for unphosphorylated GSK3β were ~2-3-fold lower than expected.
  
  We have reanalyzed the original frozen reaction timepoints on new western blots. The results were identical for unphosphorylated GSK3β and GSK3β_S9A, resolving the apparent discrepancy. Upon review of the original western blot images, we noted that they were relatively noisy, potentially indicating incomplete blot transfer or an antibody going bad. Because we were able to reanalyze the original samples and obtained internally consistent results, we suggest that the updated data should replace the original data. The updated data are included in a revised Figure S10A (now Figure 2-figure supplement 8A).
  
  Next, the authors tested if the addition of Axin could overcome this inhibition. Although the addition of Axin decreases the Km, thereby producing a 20-fold increase in catalytic efficiency, the addition of Axin does not rescue the catalytic turnover of the phosphorylated GSK3b. Hence, the authors propose that Axin does not rescue the kinase activity of GSK3b from the inhibitory effects of serine 9 phosphorylation.
  
  Next, the authors test if Axin protects GSK3b from phosphorylation by the upstream kinase PKA. Excitingly, the data show a decrease in binding affinity and catalytic efficiency of PKA with GSK3b phosphoS9 in comparison to GSK3b. The binding of Axin inhibits GSK3b serine 9 phosphorylation by PKA but does not inhibit the phosphorylation of other PKA substrates such as Creb. The authors demonstrate that a fragment of Axin, residues 384-518, behaves similarly to the full-length Axin to shield GSK3b from phosphorylation. However, it is unclear how this fragment may bind in the destruction complex and if Axin has allosteric effects on GSK3b.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.05.519208v1
www.biorxiv.org

New submission 09/07/2023, 08:10:40

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Various parts of the premotor cortex have been implicated in choices underlying decision-making tasks. Further, norepinephrine has been implicated in modulating behavior during various decision-making tasks. Less work has been done on how noradrenergic modulation would affect M2 activity to alter decision-making, nor is it clear whether noradrenergic modulation effects on activity would differ between the male and female sexes.
  
  This manuscript addresses some of these questions.
  
  In particular, clear sex differences in task engagement are seen.
  
  The manuscript may also show some interesting differences and distributions of β2 adrenergic receptors in M2 between males and females.
  
  We thank the reviewer for their summary of our findings and thoughtful critique of our manuscript. In our revised manuscript we have taken measures to address the reviewer’s comments in line (blue edits in text and revised figures) with direct responses outlined below. We believe these revisions improve the scientific rigor of our findings and provide relevant context for our studies. We hope that they have sufficiently addressed the reviewer’s concerns.
  
  Less clear is the specificity of systemic antagonism of β adrenergic receptors on the changes in M2 activity reported. As propranolol was given systemically, changes in M2 firing rates could also be due to broader circuit (indirect) activity changes. As it was not given locally, nor were local receptor populations manipulated, one is unable to make the conclusion that changes in neural activity are due to the direct effects of adrenergic receptors within M2 populations.
  
  We agree that propranolol-driven changes in anterior M2 activity may arise via multiple mechanisms, including direct action on the adrenoreceptors within M2, and indirect action via other regions that project to M2. Although locally activating inhibitory interneurons within M2 is sufficient to disrupt cue-guided action plans and behavior in a 2AFC task (Inagaki et al., 2018), our noradrenergic manipulation was not restricted to M2. We have clarified our conclusions and provided additional discussion to highlight that propranolol actions were multifaceted and that direct actions in M2 are likely working in concert with propranolol-mediated actions in other regions.
  
  Also not clear, is the contribution of M2 to this task, and whether the changes in M2 activity patterns observed are directly responsible for the behavioral disruptions measured.
  
  We have revised our introduction and discussion to more clearly outline the critical role of cue-guided action plans in M2 for successful behavior in 2AFC tasks. Suppression of cue-guided activity in M2 results in behavioral performance at near chance levels, similar to what we saw in females after propranolol (Guo et al., 2017; Inagaki et al., 2018; Li et al., 2016). Furthermore, targeted photostimulation of action plan encoding neurons in M2 is sufficient to drive behavioral responses (Daie et al., 2021). In our investigations it is plausible to expect propranolol related disruptions in other cognitive, sensory or motor regions. Based on the strong foundational evidence for M2 activity in 2AFC, the propranolol driven changes in anterior M2 in females, whether direct or indirectly mediated, are likely sufficient to drive behavioral disruptions in accuracy and/or trial completion.
  
  Reviewer #2 (Public Review):
  
  This paper by Rodbarg et al describes an interesting study on the role of beta noradrenergic receptors in action-related activity in the premotor cortex of behaving rats. This work is precious because even if the action of neuromodulatory systems in the cortex is thought to be critical for cognition, there is very little data to actually substantiate the theories. The study is well conducted and the paper is well written. I think, however, that the paper could benefit from several modifications since I can see 3 major issues:
  
  We thank the reviewer for their generous comments on the potential impact of our manuscript as well as their suggestions to improve this work. Below we outline responses to specific comments raised by the reviewer in addition to addressing them in the revised manuscript. We hope these responses sufficiently address the reviewer’s concerns.
  
  Both from a theoretical and from a practical point of view, the emphasis on 'cue-related' activity and the potential influence of NA on sensory processing is problematic. First, recent studies in rodents and primates have clearly demonstrated that LC activation is more closely related to actions than to stimulus processing (see Poe et al, 2020 for review).
  
  Indeed, during optimal performance the peaks of LC activity are larger when PETHs are aligned to action initiation rather than to the cue itself (Clayton et al., 2004). This alignment resolves variability in decision processing times and omitted cues. Although LC responses align with action, they are evoked by, and occur after, cue presentation, with LC responses to visual cues occurring ~60 ms after presentation (Aston-Jones & Bloom, 1981). The same behavioral action without preceding task-relevant cues does not evoke an LC response (Rajkowski et al., 2004).
  
  In our current study, cues initiate activity in anterior M2, which is our primary interest and where our electrodes are placed. The window between cue delivery and action completion focuses on our goal of investigating the role of β noradrenergic signaling in target cortical processing, rather than LC explicitly. In both NHP and rodents, NE signaling (and evoked LC) promotes sustained cortical representations between cue onset and actions across cortical regions (dlPFC, S1) (Ramos & Arnsten, 2007; Vazey et al., 2018; Wang et al., 2007). In the current study we aligned neural data to either cue presentation (Figure 3) or action (lever press; Figure 4). Both presentations support a critical role for β adrenoreceptor signaling in suppressing irrelevant information and in resolving and maintaining action plans. A unique feature of aligning the data to cue onset is that it allows us to see how the neural activity changes not only on completed trials (that end with a lever press) but also on omitted trials (which strongly increase after propranolol). We propose that the large increase in omitted trials occurs because β adrenoreceptor blockade either directly or indirectly prevents anterior M2 from resolving an action plan.
  
  Second, the analysis of neural activity around cue onset should be examined with spikes aligned on the action, since M2 is a motor region and raster plots suggest that activity is strongly related to action (I'll be more specific below).
  
  We agree that M2 shows important action plan activity, which we highlight throughout the manuscript. In cued tasks, M2 neurons have been shown to represent action plans starting at cue onset and continuing up to behavioral execution. Neural data were examined and results presented aligned to cue onset (illustrated in Figure 3) and aligned to action, i.e. lever press (illustrated in Figure 4). The impact of propranolol in diminishing action plan selection was similar in both action- and cue-aligned analyses.
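
The difference between cue-aligned and action-aligned analyses comes down to which event defines time zero in the peri-event time histogram (PETH). The sketch below is a generic illustration with hypothetical spike and event times; the window, bin width, and variable names are assumptions and do not reproduce the authors' analysis pipeline.

```python
def peth(spike_times, event_times, window=(-0.5, 1.0), bin_width=0.1):
    """Peri-event time histogram: mean firing rate (Hz) per bin around events.

    Spike and event times are in seconds. Each spike is re-expressed
    relative to each event, binned, and averaged across events (trials).
    """
    start, stop = window
    n_bins = int(round((stop - start) / bin_width))
    counts = [0] * n_bins
    for event in event_times:
        for spike in spike_times:
            rel = spike - event
            if start <= rel < stop:
                # Clamp guards against floating-point edge effects.
                idx = min(int((rel - start) / bin_width), n_bins - 1)
                counts[idx] += 1
    n_trials = len(event_times)
    return [c / (n_trials * bin_width) for c in counts]

# Hypothetical single-unit data from two trials (seconds).
spikes = [1.05, 1.15, 3.05, 3.25]
cue_times = [1.0, 3.0]      # cue onset on each trial
press_times = [1.2, 3.3]    # lever press on each trial

# Same spikes, two alignments: activity after the cue vs. before the press.
cue_aligned = peth(spikes, cue_times, window=(0.0, 0.4), bin_width=0.1)
press_aligned = peth(spikes, press_times, window=(-0.4, 0.0), bin_width=0.1)
```

Because the cue-to-press interval varies across trials, the same spikes fall into different bins under the two alignments. Cue alignment also remains defined on omitted trials, where no lever press exists to align to.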
  
  The distinction between neural activity and behavior or cognition is not always clear. I understand that spike count can be related to motor preparation or decision, but it should not be taken for granted that neuronal activity is action planning. The analysis should be clarified and the relation between neural activity, behavior, and potential hidden cognitive operations should be explicated more clearly.
  
  We have worked to clarify in our revised introduction, results and discussion the specifics of the known roles of neural activity in M2 in both action planning and decision making. We further expand that the neuronal activity in our study may reflect potential changes in cognitive processing and thus alter resultant behavioral outcomes.
  
  The sex difference is interesting, but at the moment it seems anecdotal. From a theoretical point of view, is there any ecological/ biological reason for a sex dependency of noradrenergic modulation of the cortex? Is there any background literature on sex differences in motor functions in rats, or in terms of NA action? If not, why does it matter (how does it change the way we should interpret the data?) From a practical point of view, is there a functional sex difference in absence of treatment, or is it that the drug has a distinct effect on males vs females? This has very distinct consequences, I think.
  
  We did not find overt differences in behavior in the absence of treatment. Only when noradrenergic function was challenged using propranolol did we identify functional sex differences. We agree that this has very distinct consequences – specifically it supports sex differences that can be revealed by perturbations of normal function. These functional sex differences may be a result of differences in the anatomy of central noradrenergic systems, a hypothesis further supported by our mRNA expression findings and existing literature on LC anatomy across species (Bangasser et al., 2011, 2016; Luque et al., 1992; Mulvey et al., 2018; Ohm et al., 1997; Pinos et al., 2001). Collectively these results have potential ramifications for understanding sex differences in disease prevalence and targeted treatments.
  
  Background literature supports some innate sex differences in motor function and executive function in rodents and humans. Of particular relevance to our investigation is an established difference in behavioral strategy, with females being more risk-averse than males (Grissom & Reyes, 2019). Ethologically, risk-averse strategies may support parental care roles, and increased inhibitory mechanisms may be selected for in females. Although this strategy was not directly tested in our study, the large increase in omissions after propranolol seen in females is in line with avoiding risk (incorrect choices) during uncertainty (disrupted neural signaling). As with other executive functions, the utilization of norepinephrine within the cortex, along with other neuromodulators and local microcircuit interactions, would all contribute to promoting risk-averse behavior.
  
  These issues could be clarified both in the introduction and in the discussion, but the authors might have a different view on what is theoretically relevant here. In the result section, however, I think that both the lack of specificity in the description of behavior and cognitive operation and the confusion between 'sensory' and 'motor' functions make it very difficult to figure out what is going on in these experiments, both at a behavioral and at a neurophysiological level. First, the description of the behavior in the task is clearly not sufficient, which makes the interpretation of the measures very difficult.
  
  We have made an effort to better specify the task and relevant behavioral operations in both the methods and results and have included a clearer task schematic (Figure 1A). We agree that the confusion between ‘sensory’ and ‘motor’ functions may make it more difficult to understand the findings in this study. Anterior M2 plays a unique role in representing motor/action plans that can be informed by sensory information. This integrative function creates difficulty in parsing the neural activity of anterior M2 as strictly motor, sensory or cognitive. In attempts to improve clarity we have expanded and highlighted relevant information on the known roles of M2 in the introduction and discussion.
  
  One possible interpretation of the effects of the drug is a decrease in motivation, for instance, due to a decrease in reward sensitivity or an increase in sensitivity to effort. But there are others. More importantly, none of these measures can be used to tease apart action preparation from action execution, even though the study is supposed to be about the former.
  
  Neural activity during action planning, prior to action execution is known to be an essential function of M2 (Barthas & Kwan, 2017; Gremel & Costa, 2013; Guo et al., 2017; Inagaki et al., 2018, 2022; Li et al., 2016; Siniscalchi et al., 2016; Sul et al., 2011; Wei et al., 2019) for optimal performance in 2AFC tasks. In all, we found that the representation/separation of opposing action plans (a well validated function of M2) prior to responses (lever press) is degraded after propranolol, especially in females. We have provided additional emphasis on these foundational studies throughout our revised manuscript.
  
  To minimize the impact of motivational factors, effort and reward size remain consistent within our task, and all trials require a random initiation hold prior to cue delivery. As described in our general response to the editor above (Figure 1, above), we investigated whether motivational changes may be reflected in our M2 recordings. PETHs from the first and last 10 trials within saline sessions did not identify potential motivation-related differences in anterior M2 activity. Similarly, across propranolol sessions the neural activity was consistent between early and late trials. We used early and late trials as there was a mild decrease in trial rate during saline sessions in both males and females, potentially indicative of motivation/reward sensitivity changes during these sessions. M2 neural responses consistently separated action plans (after saline) or failed to separate action plans (propranolol sessions).
  
  Also, but this is less critical: In Figures 2C and D, it looks like there is a bimodal distribution for the effect of propranolol in females. Is there something similar in the neuronal effects of the drug? And in the distribution of receptors? Can it be accounted for by hormonal cycles/ anything else?
  
  Although there is some clustering in behavioral outcomes, all data passed the normality assumption as appropriate. Propranolol treatments were not synchronized to hormonal cycles, and the data likely include animals at various hormonal stages. Similar clustering was not apparent in the neuronal effects of propranolol, although propranolol increased variability in many measures.
  
  In a pilot experiment we did not see any difference in baseline performance on our 2AFC task across the hormonal cycle (diestrous, proestrous, estrous or metestrous) of females in any measure including accuracy (F(3,33)=0.59, p=0.63, one-way ANOVA) and omissions (F(3,33)=0.51, p=0.68).
  
  The description of neural activity is also very superficial. In general, it is not clear how spike count measures have been extracted. For example, legend and figure C are not clear, is the (long) period of cue presentation included in the 'decision time'?? "Cues were presented at a variable interval 200-700ms after initiation and until animals left the well, 'Well Exit'. The time from cue onset to well exit was identified as the decision time (yellow)." Yet on the figure only the period after cue presentation is in yellow. This is critical because, given the duration of the cue, the animals are probably capable of deciding (to exit the well) before the cue turns off. Indeed, as shown in fig 2D, the animals can decide within about 500 ms. So to what extent is the 'cue response' actually a 'decision response'?
  
  We have clarified the task and spike count measurements in methods and added a revised task schematic. It is correct that the cues are available throughout the decision time (for up to 5 seconds or until well exit), and an action plan is generated before well exit/cues turn off as reflected by the separation of neural action plans (Fig 3, saline). Anterior M2 neurons maintain action plan representation from cue onset until the lever press under normal conditions (Fig 4, saline). These action plans encapsulate “cue responses” and “decision responses”. We have aligned neural data to discrete timestamps at either end of the window in which M2 processing is known to be critical, specifically between cues and actions (lever press) and focus on neural activity relative to those points. We refer to this activity throughout the manuscript as an ‘action plan’ as action planning functions of M2 activity have been well established in prior studies.
  
  When looking at figure 3A, there is clearly a pattern on the raster, a line going from top left to bottom right. If the trials are sorted chronologically, something is happening over time. If, as I suspect, trials are sorted by ascending response time, this raster is showing that what authors are calling a 'response to cues' is actually a response around action. Basically, if propranolol slows down reaction time, the spikes will be delayed from cue onset only because they remain locked to the action. Then the whole analysis and interpretation need to be reconsidered. But it might be for the best: as I mentioned earlier, recent work on LC activity has clearly emphasized its influence on motor rather than sensory processing (Poe et al, 2020).
  
  Figure 3A is a single neuron example, and data analyses focus on population-wide activity. Neural data are presented both aligned to cues, for all trials in which a cue was received, and aligned to lever press (action), for all trials on which a lever press occurred. In both cases, aligned to cue or aligned to action, the impact of propranolol is the same: β adrenoreceptor blockade reduces the separation of action plans in M2, severely so in females. However, a major finding is that females receive a cue but omit a large number of trials after propranolol; for this outcome the action does not occur. We propose this is due to the lack of action plan separation in anterior M2 (either directly or indirectly). When no behavioral response occurs, these trials cannot be aligned to action, yet we are still interested in the neural activity during the critical window between cue delivery and actions. We are not assigning this neural activity to sensory processing but using this discrete sensory event within our trials (cue) to align the data, as there is substantial evidence that action plans in M2 arise after cue presentation in tasks such as ours where performance is guided by external cues.
  
  Fig 2D-F: it is hard to believe that the increase in firing rate induced by propranolol in females is not significant. Presumably, because the range of the median firing rate is so high in the first place, distribution (2E) really indicates an increase in firing. Maybe some other test? e.g paired t.test, or standardized values (z.score) to get rid of variability in firing across neurons?
  
  We agree that the session-wide firing rate appears rightward shifted in females after propranolol. As our recordings were taken on different days, several days apart, we cannot assume they are the same neurons for paired analyses. In our revised manuscript we evaluated these distributions using a Mann-Whitney test to increase power and decrease the impact of variability within the population. Previously we had used a Kolmogorov-Smirnov test. Using our new analysis, we can confirm that propranolol significantly increases session-wide firing rates in anterior M2 of females (p=0.027) but not males. This finding increases evidence for direct actions of propranolol within M2 and supports our hypothesis that propranolol leads to local disinhibition by reducing β noradrenergic signaling in interneurons and that without this noradrenergic tone anterior M2 is less efficient at suppressing irrelevant action plans.
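  
  For readers unfamiliar with the unpaired rank test mentioned above, the following is a minimal sketch of a two-sided Mann-Whitney U test. It assumes a normal approximation with tie and continuity corrections, which may differ from whatever software the authors used; the function name and the toy firing rates are hypothetical and are not data from the study.

```python
import math

def mann_whitney_u(x, y):
    """Two-sided Mann-Whitney U test (normal approximation, tie-corrected).

    Returns (U statistic for sample x, approximate p-value).
    """
    n1, n2 = len(x), len(y)
    # Pool both samples, remembering which group each value came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    ranks = [0.0] * len(pooled)
    tie_term = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1
    rank_sum_x = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    u_x = rank_sum_x - n1 * (n1 + 1) / 2
    n = n1 + n2
    mu = n1 * n2 / 2
    sigma = math.sqrt(n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1))))
    if sigma == 0:
        return u_x, 1.0
    # Continuity correction; clamp at zero so p never exceeds 1.
    z = max((abs(u_x - mu) - 0.5) / sigma, 0.0)
    p = math.erfc(z / math.sqrt(2))
    return u_x, p

# Hypothetical session-wide firing rates (Hz) from two independent unit populations.
saline_rates = [1.0, 2.0, 3.0, 4.0, 5.0]
propranolol_rates = [6.0, 7.0, 8.0, 9.0, 10.0]
u_stat, p_value = mann_whitney_u(saline_rates, propranolol_rates)
```

  Because the test compares ranks across two independent samples, it does not require that the same neurons were recorded in both sessions, which is the constraint described in the response.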
  
  Along those lines, would it be worth looking for effects on specific populations (interneurons) which are sometimes characterized by thinner spikes and higher mean firing rates? Given the distribution of beta receptors RNA on interneurons, one would actually expect an effect of propranolol on the firing rate irrespective of task events. Or what is it that prevents the influence of propranolol on interneurons from changing the firing rate? In any case, one of the strengths of this study is the localization of beta receptors on specific neuronal populations in the cortex, so I think that the authors should really try to build on it and find something related to the neurophysiological effects. Otherwise, one cannot exclude the possibility that the behavioral effects are not related to the influence of the drug on these receptors in that region.
  
  Data were collected using stainless steel electrode arrays and our sample population of task related neurons is likely biased to pyramidal neurons, with a small number of fast spiking interneurons. We used validated spike waveform parameters of interneurons in premotor cortex (peak-to-trough ratio and duration; Giordano et al., 2023) in an attempt to isolate putative interneurons and found only a very small number of these cells in our recordings (n=5-7 per group). This population is too small to make any inferences about specific impacts. We have focused on the collective population activity of M2 as this is most strongly related to optimal action planning.
  
  You are correct that from the given findings we cannot conclusively show that the results found here are a result of propranolol acting solely within anterior M2. We have made sure to clarify throughout our revised manuscript that the behavioral and physiological changes we identified are a result of collective direct and indirect actions of propranolol.
  
  The conclusion that neuronal discrimination decreases because the proportion of neurons showing no effect increases is confusing (negative results, basically). It would be clearer if they were reporting the number of neurons that do show an effect, and presumably that this number shows a significant decrease.
  
  The reviewer is correct that the number of neurons that do show an effect (task-related activity) does significantly decrease with propranolol (from n=70 to 27 in females and n=71 to 48 in males). These n are now given adjacent to the proportions rather than at the end of the paragraph. Proportions were used for statistical analysis due to an overall decrease in the total number of units after propranolol. All PETHs presented are from neurons that show some task-related activity; these PETHs confirm that neural activity no longer effectively discriminates/separates action plans in M2.
  
  Figs 3F-I: a good proportion of neurons (at least 20%) show a significant encoding before cue onset. How is it possible? This raises the issue of noise level/ null hypothesis for this kind of repeated analysis. How did the author correct for multiple comparison issues?
  
  In response to reviews, we have altered the manner in which we identify the significantly modulated neurons to increase rigor and no longer include these figures or analyses. The proportion of neurons showing action plan encoding prior to cue onset was likely an artifact of how the data was analyzed and an insufficient correction for multiple comparisons, allowing inclusion of internally generated action plans in some neurons.
  
  The description of the action-related activity is globally confusing. Again, how can the authors discriminate between activity related to planning vs action itself? What is significant and what is not, in males vs females? What is being measured here? For example, a very unclear statement on line 238: "Propranolol primarily disrupted active inhibition of irrelevant action selection in M2 activity, reducing the ability to maintain action plan representation in M2, delaying lever press responses (Figure 4L, 4M)." What is 'active inhibition? What is an irrelevant action plan? What is selection? All of that should be defined using objective behavioral criteria and tested formally.
  
  We have changed our wording to clarify what we are describing and why we have chosen the words we have, and to ensure consistency and objectivity throughout the manuscript. Much of the wording we have used, for example action planning or action plan selection, is the wording used in the literature to describe M2 neural activity. We call the activity in M2 action planning (either externally/cue guided or internally guided) because that is what has been previously demonstrated. In our task design and analysis we are tracking cue-guided actions, as opposed to internally guided actions.
  
  We also separate the electrophysiology data as preferred and nonpreferred because the literature has shown individual M2 neurons show specific directional tuning as noted in our results, using the term ‘preferred’ encapsulates that tuning regardless of left/right direction. An example M2 neuron that increases activity for left cues and responses (preferred direction), will show active inhibition (low/negative z scores) on trials with right cues and responses (nonpreferred), other neurons would show the inverse relationship with direction.
  
  A primary impact of propranolol was the loss of negative z-scores for nonpreferred trials, i.e., neurons with a left preference that are usually inhibited on right trials were still firing, and vice versa. After propranolol, neurons continue to fire for an irrelevant action plan (for the opposite direction), and the resulting population activity is not significantly different for opposing cues/responses. Behavioral responses normally occur after opposing action plans have significantly separated in M2; collapsing action plans by preventing relevant signaling (Guo et al., 2017; Inagaki et al., 2018; Li et al., 2016) or by facilitating irrelevant signaling, as we see here with propranolol, leads to impairments in 2AFC performance.
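  
  The preferred/nonpreferred z-score logic described above can be illustrated with a small sketch. It assumes that z-scores are computed relative to a pre-cue baseline for each neuron, which the response does not state explicitly; the function name and all firing rates below are hypothetical.

```python
from statistics import mean, stdev

def zscore_to_baseline(baseline_rates, trial_rates):
    """Express each trial's firing rate in units of baseline standard deviations."""
    mu = mean(baseline_rates)
    sd = stdev(baseline_rates)
    if sd == 0:
        raise ValueError("baseline has no variability; z-score is undefined")
    return [(rate - mu) / sd for rate in trial_rates]

# Hypothetical left-preferring neuron: baseline firing around 5 Hz.
baseline = [4.0, 6.0, 5.0, 5.0]
preferred = zscore_to_baseline(baseline, [9.0, 10.0])    # left trials: excitation
nonpreferred = zscore_to_baseline(baseline, [2.0, 1.0])  # right trials: inhibition
```

  Under this convention, the loss of negative z-scores on nonpreferred trials would show up as `nonpreferred` values drifting toward or above zero.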
  
  Also, the description of the classifier analysis should be more thorough. Referencing the toolbox is not sufficient to understand what has been done.
  
  We have added additional explanation in both the methods and the description of the results to clarify the functions of the neural decoding toolbox and how we are using it to evaluate information encoding within M2. We have provided detail on how the algorithm was trained, how shuffled data were generated and how we determined the significance of decoding accuracy.
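  
  As a rough illustration of how decoding accuracy can be tested against shuffled data, the sketch below uses a leave-one-out nearest-centroid classifier and a label-shuffling permutation test. This is an assumption-laden stand-in and not the toolbox or classifier the authors used; the function names, the number of shuffles, and the toy population vectors are all hypothetical.

```python
import random

def loo_accuracy(features, labels):
    """Leave-one-out accuracy of a nearest-centroid classifier."""
    correct = 0
    for i, held_out in enumerate(features):
        sums, counts = {}, {}
        # Build class centroids from every trial except the held-out one.
        for j, (vec, lab) in enumerate(zip(features, labels)):
            if j == i:
                continue
            acc = sums.setdefault(lab, [0.0] * len(vec))
            for k, value in enumerate(vec):
                acc[k] += value
            counts[lab] = counts.get(lab, 0) + 1

        def sq_dist(lab):
            return sum((held_out[k] - sums[lab][k] / counts[lab]) ** 2
                       for k in range(len(held_out)))

        predicted = min(sums, key=sq_dist)
        correct += predicted == labels[i]
    return correct / len(features)

def permutation_p_value(features, labels, n_shuffles=200, seed=0):
    """Compare observed accuracy with accuracies obtained after shuffling labels."""
    rng = random.Random(seed)
    observed = loo_accuracy(features, labels)
    shuffled = list(labels)
    at_least_as_good = 0
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        if loo_accuracy(features, shuffled) >= observed:
            at_least_as_good += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (at_least_as_good + 1) / (n_shuffles + 1)

# Hypothetical population vectors (two units) for left and right trials.
left = [[5.0, 1.0], [6.0, 1.2], [5.5, 0.8], [5.2, 1.1], [6.1, 0.9], [5.8, 1.0]]
right = [[1.0, 5.0], [1.2, 6.0], [0.8, 5.5], [1.1, 5.2], [0.9, 6.1], [1.0, 5.8]]
accuracy, p_value = permutation_p_value(left + right, ["L"] * 6 + ["R"] * 6)
```

  Shuffling labels breaks any relationship between activity and trial type, so the shuffled accuracies approximate the chance distribution against which the observed accuracy is judged.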
  
  Measuring Beta adrenoceptors is a great idea, and the results are interesting, especially the difference between neuron types. But again, how does that fit with neurophysiological results? Note, that since this is RNA measures, it should not be phrased as 'receptors' but 'receptors RNA' throughout. One possible interpretation of these anatomical results that cannot be reconciled with physiology is that protein expression at the membrane shows a distinct pattern.
  
  We have changed the references to β receptor expression to β receptor mRNA expression throughout the manuscript. Although mRNA provides a valuable proxy for adrenoreceptor production, as noted by the reviewer, protein expression at the membrane may differ. Reliable antibodies that allow quantitative analysis of membrane-bound adrenoreceptors in situ with co-labeling of specific cell types are limited. The goal of assessing mRNA expression within M2 was to determine if the functional sex differences we identified in M2 neurophysiology when manipulating β adrenoreceptor function could be mediated by basal differences in adrenoreceptors. The causal impact of differential mRNA expression in anterior M2 was not directly tested, but our findings provide preliminary evidence that adrenoreceptor regulation may differ across sexes. Our results provide a plausible avenue for differential sensitivity to β adrenoreceptor manipulation across sexes that may also be found in other brain regions.
  
  In conclusion, I think that this is a very interesting study and that the results are potentially relevant for a wide audience. But the paper would clearly benefit from revisions. If the authors could clearly identify a significant relationship between the action of NA on beta receptors on specific cortical neurons, at a physiological and behavioral level, that would be a seminal study. At the moment, the evidence is not convincing enough but the data suggest that it is the case.
  
  We thank the reviewer for the kind remarks. We have undertaken a number of new analyses, refined existing analysis and clarified our claims in the manuscript to improve rigor. Collectively our data reflect that the behavioral and neural deficits after systemic propranolol are likely due to both direct and indirect actions on M2. We believe this work is compelling and that it will inform future work investigating potential sex differences in central noradrenergic anatomy and functional sex differences after perturbations of noradrenergic signaling.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.06.519304v1
www.biorxiv.org www.biorxiv.org

New submission 10/07/2023, 10:40:12

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  (1) What's the rationale of trypsinizing the tissue prior to mitochondrial isolation? This is not standard for subsequent proteomics analysis. This step will inevitably cause protein loss, especially for the post mitochondrial fractions (PMF). Treating samples with 0.01 µg/µL trypsin at 37 °C for 30 min is sufficient to partially digest a substantial portion of the proteome. If samples from different subjects were not of the same weight, then this partial digestion step may introduce artificial variability as variable proportions of proteins from different subjects would be lost during this step. In addition, the mitochondrial protein enrichment in the mito fraction, despite being statistically significant, does not look striking (Figure 1E, ~30% mitochondrial proteins in the mito fraction). As a comparison, Williams et al., MCP 2018 seem to have obtained high mitochondrial protein content in the mito fraction without trypsinizing the frozen quadriceps using a similar SWATH-MS-based approach.
  
  Trypsinisation of the tissue prior to mitochondrial isolation is based on previous work and a Nature Protocol (1, 2) which isolated mitochondria from skeletal muscle. The rationale is that it aids mechanical homogenisation of highly fibrous tissues such as quadriceps muscle by digesting extracellular matrix proteins. The trypsin/protein ratio used to aid in this process is at least 400 times lower than the amount of trypsin used for formal proteomic tryptic digestion. Three pieces of evidence suggest this step has negligible effect on downstream proteomic analysis. First, because the trypsinisation buffer is detergent free, trypsin will only affect extracellular or exposed membrane proteins. Filtering our PMF dataset for proteins with ‘extracellular matrix’ gene ontology identifies at least 90 unique extracellular matrix proteins, indicating good retention of proteins susceptible to partial digestion. Second, the trypsin dose used is 50 times lower than the concentration used for passaging cultured cells, which retain viability after trypsinisation. Third, and contrary to the point raised by the reviewer, we observe less missingness in PMF samples compared to mitochondrial samples. We thank the reviewer for bringing the Williams et al. 2018 MCP paper to our attention. We note that mitochondrial enrichment between the two papers is comparable (~2-fold). To improve clarity line 408 now reads: “Whole quadriceps muscle samples were prepared as previously described with modification (99, 100). First, tissue was snap frozen with liquid nitrogen…” and line 95 reads: “Mitochondrial proteins were defined based on their presence in MitoCarta 3.0 (24) and consistent with previous work (25) were approximately two-fold enriched in the mitochondrial fraction relative to the PMF (Fig 1E).”
  
  (2) The authors mentioned that the proteomics data were Log2 transformed and median-normalized. Would it be possible to provide a bit more details on this? Were the subjects randomized?
  
  Samples were randomised prior to sample processing and mass spectrometry analysis. Because of possible variation in total protein content, it is critical to normalise protein intensities between samples. Median normalisation adjusts the samples so that they have the same median, thereby accounting for technical variation. Log2 transformation helps to achieve normal distributions, which are critical for many downstream statistical tests. Line 471 now reads: “…to achieve normal distributions and account for technical variation in total protein.”
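  
  The two steps described above can be sketched as follows. This is a minimal illustration that assumes median normalisation is applied by subtracting each sample's median in log2 space so that every sample ends with a median of zero; the response does not specify the target median, and the function name and intensities are hypothetical.

```python
import math
from statistics import median

def log2_median_normalise(samples):
    """Log2-transform intensities, then centre every sample on its own median.

    samples maps sample name -> {protein: raw intensity}. Non-positive
    intensities are dropped because their logarithm is undefined.
    """
    normalised = {}
    for name, intensities in samples.items():
        logged = {p: math.log2(v) for p, v in intensities.items() if v > 0}
        centre = median(logged.values())
        normalised[name] = {p: x - centre for p, x in logged.items()}
    return normalised

# Hypothetical toy data: sample "b" carries twice the total protein of "a".
raw = {
    "a": {"P1": 100.0, "P2": 200.0, "P3": 400.0},
    "b": {"P1": 200.0, "P2": 400.0, "P3": 800.0},
}
norm = log2_median_normalise(raw)
```

  In this toy case the twofold loading difference between the samples disappears after normalisation, which is the kind of technical variation the step is meant to remove.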
  
  (3) In Figure 1D, what were the numbers of mice the authors used for the CV comparisons in each group? Were they of similar age and sex? Were the differences in CV values statistically significant?
  
  The mitochondrial and PMF proteomes originated from the same quadriceps sample from the same mouse, and thus the age and sex are the same across both proteomes. After quality control, we had mitochondrial proteomes for 194 mice and PMF proteomes for 215 mice. The overall CV in the mitochondrial fraction was significantly greater than in the PMF; however, whether the source of this variation is biological or the result of mitochondrial isolation is unclear, and as such we have avoided making a statement within the body of the manuscript. We have now more clearly described the nature of the samples in the revised manuscript and added sample sizes to figure 1F.
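  
  For reference, the coefficient of variation discussed here is the standard deviation divided by the mean. The sketch below assumes it is computed per protein across mice on non-log intensities, which the response does not state; the function name and values are hypothetical.

```python
from statistics import mean, stdev

def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (unitless)."""
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return stdev(values) / m

# Hypothetical intensities of one protein measured in four mice.
cv = coefficient_of_variation([10.0, 12.0, 8.0, 10.0])
```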
  
  (4) The authors stated in lines 155-157 that proteins negatively associated with the Matsuda index were further filtered by presence of their cis-pQTLs. Perhaps more explanations would be needed to justify this filtering criterion? Having a cis-pQTL would mean the protein abundance variation is explained by the variation in its coding gene, this however conceptually would not be relevant to its association with the Matsuda index. With the data that the authors have in hand, would it not be natural to align the Matsuda index QTL with the pQTLs (cis and trans if available), and/or to perform mediation analysis to examine causal relationships with statistical significance?
  
  The rationale for filtering by cis-pQTL was not to study the genetics of either Matsuda or associated proteins but rather to identify proteins that were more likely to be causally associated with Matsuda Index as opposed to adaptively associated. To clarify this line 165 now reads: “Filtering based on cis-pQTL presence was based on the rationale that if genetic variation can explain protein abundance differences between mice, then we can be confident that phenotype (Matsuda Index) is not driving the observed differences and therefore the protein-phenotype associations are likely causal. Importantly, this assumption can only be made for cis-acting pQTLs.” Previous work by Matthew et al. (see https://qtlviewer.jax.org/) has demonstrated that cis-pQTL have markedly higher LOD scores than trans-pQTLs, and our own unpublished work suggests that trans-pQTLs do not reproduce well between datasets. The reviewer rightfully suggests aligning protein QTL with those for Matsuda. This is our long-term goal but to identify genome wide significant peaks associated with altered Matsuda will require many more mice than studied here.
  
  (5) It seems a bit odd that the first half of the paper focused extensively on the authors' discoveries in the mitochondrial proteome, and how proteins involved in mitochondrial processes (such as complex I) were associated with Matsuda Index, but the final fingerprint list of insulin resistance, which contained 76 proteins, only had 7 mitochondrial proteins. Was this because many mitochondrial proteins were filtered out due to no cis-pQTL presenting?
  
  There are three reasons our fingerprint is lacking mitochondrial proteins: 1) there are more non-mitochondrial than mitochondrial proteins in the muscle proteome; 2) we focussed on negatively associated proteins, and as demonstrated in figure 2c, the mitochondrial proteome is enriched for positively associated proteins; 3) as implied by the reviewer, we filtered for pQTL presence, further reducing the number of mitochondrial proteins in our fingerprint. To improve clarity, line 170 now reads: “Low mitochondrial representation in the fingerprint is the result of selecting negatively associating proteins, and as seen (Figure 2C) previously, the mitochondrial proteome is enriched for positive contributors to insulin resistance.”
  
  (6) The authors found that thiostrepton-induced insulin resistance reversal effects were not through insulin signalling. It activated glycolysis but the mechanism of action was not clear. What are the proteins in the fingerprint list that led to identification of thiostrepton on CMAP?
  
  Is thiostrepton able to bind or change the expression of these proteins? Since thiostrepton was identified by searching the insulin resistance fingerprint protein list against CMAP, it would be rational to think that it exerts the biological effects by directly or indirectly acting on these protein targets.
  
  This is indeed the implication of our data. Because of the timescales involved, it is unlikely that thiostrepton is changing fingerprint protein levels, but it could be binding to and inhibiting them. Searching the CMAP thiostrepton signature reveals ARHGDIB and NAGK as the fingerprint proteins with the most positive and negative fold-changes, respectively, perhaps suggesting they play a role in thiostrepton’s mechanism of action. Experiments are underway to test this hypothesis; however, these are beyond the scope of the current paper.
  
  Reviewer #2 (Public Review):
  
  Line 105: The observation that variance in respiratory proteins is stable while lipid pathways is variable is quite interesting. Is this due to lower overall levels of lipid metabolism enzymes (ex. do these differ substantially from similar pathways ranked from high-low abundance?).
  
  The relationship between coefficient of variation (CV) and relative abundance of proteins is important to consider. To address this, we have now also performed GSEA on proteins ranked from high to low relative abundance. These comparisons have been added to supplementary figure 1 and line 110 now reads: “As a control experiment, we also performed enrichment analysis on proteins ranked by LFQ relative abundance. High CV pathways (enriched for high CV proteins) tended to be lower in relative abundance (enriched for low relative abundance proteins) (Supplementary Fig 1a, b). However, many high variability pathways, lipid metabolism for example, were not enriched in either direction based on relative abundance suggesting differences in relative abundance do not fully explain pathway variability differences.”
  
  Line 154: the 664 associations are impressive and potentially informative. It would be valuable to know which of these co-map to the same locus - either to distinguish linkage in a 2 Mb window or identify any cis-proteins which directly exert effects in trans-
  
  To assess this, we have analysed pQTL position relative to gene position to generate a ‘hotspot’ plot. We have also generated a histogram of this pQTL density (in a 2 Mbp window) and added these figures to figure 3. We did not detect any obvious pQTL hotspots, and the distribution of pQTLs across the genome appears fairly uniform. Line 159 now reads: “These were distributed across the genome and were predominately cis acting (Figure 3A)...”
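  
  The hotspot analysis described above can be approximated with a short sketch. It assumes that a pQTL is called cis when it lies on the same chromosome as its gene and within 2 Mbp of it, and that density is counted in non-overlapping 2 Mbp bins; the response gives only the window size, so the cis threshold, the function names and the coordinates are hypothetical.

```python
from collections import Counter

WINDOW_BP = 2_000_000  # 2 Mbp, the window size quoted in the response

def classify_pqtl(pqtl_chrom, pqtl_pos, gene_chrom, gene_pos, window_bp=WINDOW_BP):
    """Label a pQTL 'cis' if it sits near its own gene, otherwise 'trans'."""
    if pqtl_chrom == gene_chrom and abs(pqtl_pos - gene_pos) <= window_bp:
        return "cis"
    return "trans"

def window_density(pqtls, window_bp=WINDOW_BP):
    """Count pQTLs per (chromosome, window index) bin."""
    return Counter((chrom, pos // window_bp) for chrom, pos in pqtls)

# Hypothetical pQTL peak positions in base pairs.
peaks = [("1", 500_000), ("1", 1_900_000), ("1", 2_100_000), ("2", 100)]
density = window_density(peaks)
```

  A hotspot would appear as a bin whose count is far above the others; a roughly flat `density` corresponds to the uniform distribution the authors report.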
  
  Line 194: Cross-platform validation of the CMAP fingerprint results is an admirable set of validations. It might be good to know general parameters like how many compounds were shared/unique for each platform. Also the concordance between ranking scores for significant and shared compounds.
  
  The Connectivity Map (CMap) query included 5163 compounds, the Prestwick library included 1120, and the overlap was 420. We have added these comparisons to supplementary figure 2. Supplementary figure 2 now also contains a comparison of CMap scores between overlapping compounds (found in CMap and the Prestwick library) against all significant compounds identified by CMap (supplementary figure 2b). Interestingly, compounds present in both platforms scored higher on average, suggesting the Prestwick library captures a significant proportion of highly scoring CMap candidates. Line 206 now reads: “In total, 420 compounds were found across both platforms, and these consensus compounds captured a significant proportion of highly scoring CMap compounds (Supplementary Figure 2A, B).”
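  
  The platform counts quoted above imply the following breakdown of unique and shared compounds. The sketch only performs the arithmetic on the three reported numbers; the function name and dictionary keys are hypothetical.

```python
def platform_breakdown(n_first, n_second, n_shared):
    """Split two overlapping libraries into unique and combined counts."""
    if n_shared > min(n_first, n_second):
        raise ValueError("overlap cannot exceed the smaller library")
    return {
        "first_only": n_first - n_shared,
        "second_only": n_second - n_shared,
        "union": n_first + n_second - n_shared,
    }

# Counts reported in the response: CMap 5163, Prestwick 1120, shared 420.
counts = platform_breakdown(5163, 1120, 420)
prestwick_fraction_in_cmap = 420 / 1120
```

  On these numbers, 4743 compounds are unique to CMap, 700 are unique to the Prestwick library, and 37.5% of the Prestwick library is also present in CMap.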
  
  Line 319: Another consideration in the molecular fingerprint is how unique these are for muscle. While studies evaluating gene expression have shown that many cis-eQTLs are shared across tissues, to my knowledge, this hasn't been performed systematically for pQTLs. Therefore, consider adding a point to the discussion pointing out that some of the proteins might be conserved pQTLs whereas others which would be more relevant here present unique druggable targets in muscle.
  
  To examine tissue specificity, we determined whether our skeletal muscle fingerprint proteins were detected and contained a pQTL in two metabolically important tissues, liver and adipose. Despite detecting almost all the fingerprint proteins in both adipose and liver tissue, they were depleted for pQTL compared to skeletal muscle. These data have now been added to figure 3c. Line 172 now reads: “To assess the tissue specificity of our fingerprint we searched for the same proteins in metabolically important adipose and liver tissues. Despite detecting 94% and 82% of muscle fingerprint proteins across each tissue respectively, both adipose and liver were depleted for pQTL presence (Figure 3C) suggesting that regulation of our fingerprint protein abundance is specific to skeletal muscle.”
  
  Line 332: These are fascinating observations. 1, that in general insulin signaling and ampk were not themselves shown as top-ranked enrichments with matsuda and that this was sufficient to alter glucose metabolism without changes in these pathways. While further characterization of this signaling mechanism is beyond the scope of this study, it would be good to speculate as to additional signaling pathways that are relevant beyond ROS (ex. CNYP2 and others)
  
  We have now added further discussion to the manuscript to address this point. Line 347 now reads: “Aside from glycolysis, other pathways may be involved in enhancing insulin sensitivity. For example, the negatively associated protein ARHGDIA (Figure 2F) is a potent negative regulator of insulin sensitivity, and our fingerprint of insulin resistance contained its homologue ARHGDIB. Both ARHGDIA and ARHGDIB have been reported to inhibit the insulin action regulator RAC1 thus lowering GLUT4 translocation and glucose uptake. Further investigations may uncover a role for thiostrepton in modulating the RAC1 signalling pathway via ARHGDIB.”
  
  Line 314: Remove the statement: "While this approach is less powerful than QTL co-localisation for identifying causal drivers,", as I don't believe that this has been demonstrated. Clearly, the authors provide a sufficient framework to pinpoint causality and produce an actionable set of proteins.
  
  We have edited line 314, which now reads: “Moreover, our approach has the major advantage that it requires far fewer mice to obtain meaningful outcomes (222 mice in this study) compared to that required for genetic mapping of complex traits like Matsuda Index.”
  
  Line 346: I would highlight one more appeal of the approach adopted by the authors. Given that these compound libraries were prioritized from patterns of diverse genetics, these observations are inherently more likely to operate robustly across target backgrounds.
  
  This point is further supported by our thiostrepton results in both C57BL/6J and BXH9 mice. Line 317 now reads: “Furthermore, because we have used genetically diverse datasets (DOz mice and multiple cell lines in Connectivity Map) our findings are likely robust across diverse target backgrounds.”
  
  Line 434: I might have missed but can't seem to find where the muscle data are available to researchers. Given the importance and novelty of these studies, it will be important to provide some way to access the proteomic data.
  
  These data are now available via the ProteomeXchange Consortium. Line 465 now reads: “The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (104) partner repository with the dataset identifier PXD042277.”
  
  Frezza C, Cipolat S, Scorrano L. Organelle isolation: functional mitochondria from mouse liver, muscle and cultured fibroblasts. Nat Protoc. 2007;2(2):287-95.
  
  Acin-Perez R, Benador IY, Petcherski A, Veliova M, Benavides GA, Lagarrigue S, et al. A novel approach to measure mitochondrial respiration in frozen biological samples. The EMBO Journal. 2020;39(13):e104073.
  
  Chick JM, Munger SC, Simecek P, Huttlin EL, Choi K, Gatti DM, et al. Defining the consequences of genetic variation on a proteome-wide scale. Nature. 2016;534(7608):500-5.
  
  Gatti DM, Svenson KL, Shabalin A, Wu L-Y, Valdar W, Simecek P, et al. Quantitative Trait Locus Mapping Methods for Diversity Outbred Mice. G3 Genes|Genomes|Genetics. 2014;4(9):1623-33.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.01.530673v2
www.ncbi.nlm.nih.gov www.ncbi.nlm.nih.gov

New submission 10/07/2023, 10:32:59

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  In this study, the authors set out to investigate spatial RNA processing events, specifically alternative splicing and 3' UTR usage, in mouse brain and kidney tissues using ReadZS and SpliZ methodologies on spatial transcriptomics data. The research contributes to understanding tissue-specific gene expression regulation from a spatial perspective. The study introduces a novel approach for analyzing spatial transcriptomics data, allowing for the identification of RNA processing and regulation patterns directly from 10X Visium data. The authors present convincing evidence supporting the identification of novel RNA processing patterns using their methodology, which holds significant implications for researchers in the field of spatial transcriptomics and the study of alternative splicing and 3' UTR usage.
  
  Thank you for this thorough overview of our work.
  
  The conclusions of the study are mostly well-supported by the data; however, certain aspects could be improved to strengthen the findings.
  
  1) The conclusions of this study would be strengthened by conducting a more extensive tissue sample analysis and including biological replicates. Additionally, appropriate batch effect corrections should be applied when dealing with biological replicates.
  
  We agree that including biological replicates would strengthen our findings. We will include biological replicates of the mouse brain tissues in the revision.
  
  2) The 3' UTR usage and alternative splicing should be compared among clearly labeled clusters for a more comprehensive analysis.
  
  We understand that it can be difficult to see how the SpliZ quantiles map spatially onto the tissue images. For the splicing of Gng13, Myl6, and Rps24, we will include box plots broken down by spatial quadrant in the revision. However, this oversimplifies the spatial patterns found in the tissue slices, which in our view makes the box plots less informative than the quantile plots.
  
  3) The authors should clarify their rationale for choosing ReadZS and SpliZ approaches and provide comparisons with other methods to demonstrate the advantages and potential limitations of their chosen methodologies.
  
  Thank you for pointing out the lack of sufficient discussion of ReadZS and SpliZ in the manuscript. The ReadZS and SpliZ were chosen for this analysis because both methods provide an individual score for each cell-gene pair, which is easily adapted to provide a score for each spot-gene pair. Due to the sparsity and 3’ bias of Visium data, approaches designed to analyze RNA processing in full-length sequencing data are not applicable. The SpliZ and ReadZS are two of the limited number of available tools designed for the analysis of RNA processing in droplet-based data. Other available tools tend to rely on aggregating data across multiple cells, a method called pseudo-bulking (Li et al., 2021; Patrick et al., 2020). It is not clear how this could be applied to spatial transcriptomics data without potentially obscuring subtle spatial patterns. Others are based on PSI measurements, which are vulnerable to artifacts due to sparsity (Buen Abad Najar et al., 2020; Olivieri et al., 2022; Wen et al., 2022). The tradeoff of using a single score per spot-gene pair rather than pseudo-bulking is that the ReadZS and SpliZ lack the power to detect changes in genes with very low read counts. We will add text in the revision to clarify this point.
  
  Reviewer #2 (Public Review):
  
  The authors applied the existing ReadZS and SpliZ methods, previously developed to analyze RNA processing in scRNA-seq data, to Visium data to study spatial splicing and RNA processing events in tissues using Moran's I. The authors showed several example genes in mouse brain and kidney whose processing is spatially regulated, such as Rps24, Myl6, and Gng13.
  
  Thank you for this thorough overview of our work.
  
  The paper touches on an important question in RNA biology about how RNA processing is regulated spatially. Both experimental and computational challenges remain to address it. Despite some potentially interesting findings, most claims remain to be validated by orthogonal methods such as RNA FISH and simulations.
  
  We appreciate that the reviewer finds the question important, and that the findings are potentially interesting. In the revision we will include biological replicates for our findings in the mouse brain. Unfortunately, experimental validation is outside of our budget for this project. It is unclear what further simulations could validate the biological discoveries in this manuscript: permutations were used to calculate the p value of each discovery, and the false positive and negative rates of the SpliZ have been assessed through simulation (Olivieri et al., 2022).
  
  In addition, the percentage of spatial processing events (splicing in 0.8-2.2% of detected genes, i.e. 8-17 genes and RNA processing in 1.1-5.5% of detected genomic windows, i.e. 57-161 windows) discovered is low. Does it suggest that most of RNA processing events were not spatially regulated across the tissue? Or does it question the assumption of treating spatial transcriptomics data similar to scRNA-seq data?
  
  We agree that the question of the prevalence of spatial RNA processing regulation is critical. Rather than the two options proposed here, we believe that the sparsity of the data limits our ability to call more of these events. In the revision, we will provide a supplemental figure showing the relationship between read depth and p value for each gene to quantify how the fraction of observed regulation changes with sequencing depth. It is worth noting that as these technologies improve, we expect the sequencing depth of spatial technologies to increase which would likely result in more discoveries.
  
  The unique features of ST data, such as the mixture of neighboring cells, different capture biases, and a much smaller number of spots (pseudo cells here), may have significant effects on the power of scRNA-seq based methods, but this is not discussed in the manuscript. The lack of careful evaluation and the low discovery rates could limit application of the approach to other tissues and subcellular data.
  
  We appreciate the concern that technical differences between scRNA-seq data and spatial transcriptomics data could affect our results. We agree that this point could be addressed more thoroughly in the text. None of the specificities of spatial transcriptomics data invalidate the assumptions of the SpliZ or ReadZS. The method we use to identify genes with significant spatial regulation of RNA processing was specifically created for Visium data. It takes into account the mixture of RNAs in neighboring cells by randomly sampling scores of neighboring cells rather than randomizing the locations of the spots themselves; the latter does indeed result in a high false positive rate (see “Permutations for Moran’s I” in the Methods). We do note that there is a limit to the power of this kind of analysis based on the number of spots and the read depth, which we will quantify in a plot in the revision.
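
  For readers unfamiliar with the statistic named above, the following is a minimal sketch of Moran's I with a permutation p value. It is not the authors' implementation: the function names, the toy six-spot chain, and the binary neighbor weights are assumptions made for illustration. The null used here is the naive one (shuffling scores across all spot locations), which is the scheme the response says inflates false positives on Visium data; the neighbor-based resampling described in the authors' Methods is not reproduced.

```python
import random


def moran_i(values, neighbors):
    """Moran's I with binary weights.

    values    -- one score per spot (e.g. a per-spot SpliZ-like score)
    neighbors -- neighbors[i] lists the indices of spots adjacent to spot i
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    denom = sum(d * d for d in dev)
    if denom == 0:
        raise ValueError("all scores are identical; Moran's I is undefined")
    w_total = sum(len(nb) for nb in neighbors)  # sum of all weights
    cross = sum(dev[i] * dev[j] for i, nb in enumerate(neighbors) for j in nb)
    return (n / w_total) * cross / denom


def naive_permutation_pvalue(values, neighbors, n_perm=999, seed=0):
    """Two-sided p value from shuffling scores over all spot locations.

    This is the naive null, shown only to illustrate the mechanics.
    """
    rng = random.Random(seed)
    observed = moran_i(values, neighbors)
    shuffled = list(values)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(moran_i(shuffled, neighbors)) >= abs(observed):
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Toy example: six spots in a line, each adjacent to its immediate neighbors.
chain = [[1], [0, 2], [1, 3], [2, 4], [3, 5], [4]]
clustered = [1, 1, 1, 0, 0, 0]    # spatially clustered scores, I = 0.6
alternating = [1, 0, 1, 0, 1, 0]  # perfectly dispersed scores, I = -1.0

i_clustered, p_clustered = naive_permutation_pvalue(clustered, chain)
i_alternating = moran_i(alternating, chain)
```

  Positive values indicate that neighboring spots carry similar scores and negative values indicate dispersion. With only six spots the permutation p value has little power, which mirrors the response's point that power is limited by the number of spots and the read depth.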
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

ncbi.nlm.nih.gov/pmc/articles/PMC10054993/
www.biorxiv.org

New submission 10/07/2023, 11:46:27

1
1. Public_Reviews 10 Jul 2023
 
 in eLife
 
 Author Response:
 
 We thank Reviewer #1 for their positive assessment of our work.
 
 Reviewer #2 (Public Review):
 
 […] Although these results confirm what we already know about processes involved in the meninges in MS and its models and gradients of pathology in sub-pial regions, this is the first to use spatial transcriptomics to demonstrate such gradients at a molecular level in an animal model that demonstrates lymphoid like tissue development in the meninges and associated grey matter pathology. The mouse EAE model being used here does reproduce many, although not all, of the pathological features of MS and the ability to look at longer time points has been exploited well. However, this particular spatial transcriptomics technique cannot resolve at a cellular level and therefore there is a lot of overlap between gene expression signatures in the meninges and the underlying grey matter parenchyma.
 
 We appreciate the reviewer’s concise summary and comments on our manuscript. We agree that the Visium spatial sequencing technology we applied is limited in its resolution and cannot precisely distinguish individual cells or anatomic regions. For that reason, there is undoubtedly some overlap between gene expression signatures in the meninges and underlying parenchyma, particularly in spots on the borders of the meningeal inflammation clusters. However, we believe that the majority of meningeal inflammation (“cluster 11”) spots are indeed in the meninges and represent the spatial transcriptome of that niche. To support this, in the revised manuscript we will provide H&E images with the UMAP clusters overlaid to demonstrate the anatomic borders that correlate with the clusters.
 
 The short nature of this report means that the results are presented and discussed in a vague way, without enough molecular detail to reveal much information about molecular pathogenetic mechanisms.
 
 We thank the reviewer for this comment. The goal of this work is to transcriptomically characterize the spatial relationship between areas of meningeal inflammation and the underlying parenchyma. While we agree that mechanistic studies are needed to further evaluate the role of presented signaling pathways, those experiments are beyond the scope of this brief report.
 
 The trajectory analysis is a good way to explore gradients within the tissues and the authors are to be applauded for using this approach. However, the trajectory analysis does not tell us much if you only choose 2 genes that you think might be involved in the pathogenetic processes going on in the grey matter. It might be more useful to choose some genes involved in pathogenetic processes that we already know are involved in the tissue damage in the underlying grey matter in MS, for which there is already a lot of literature, or genes that respond to molecules we know are increased in MS CSF, although the animal models may be very different. Why were C3 and B2m chosen here?
 
 We appreciate the reviewer’s points here. C3 and B2m were chosen as examples of genes that have differential fit to the gradient descending pattern to assist the reader in interpreting subsequent gene set trajectory analysis. However, we agree that there are many other genes of interest and will expand the number of genes displayed in our revised manuscript.
 
 Strengths: - The mouse model does exhibit many of the features of the compartmentalized immune response seen in MS, including the presence of meningeal immune cell infiltrates in the central sulcus and over the surface of the cortex, with the presence of FDCs, HEVs, PNAd+ vessels, and CXCL13 expression, indicating the formation of lymphoid-like cell aggregates. In addition, disruption of the glia limitans is seen, as in MS. Increased microglial reactivity is also present at the pial surface. - Spatial transcriptomics is the best approach to studying gradients in gene expression in both white matter and grey matter and their relationship between compartments. - It would be useful to have more discussion of how the upregulated pathways in the two compartments fit with what we know about the cellular changes occurring in both, for which presumably there is prior information from the group's previous publications.
 
 Limitations: - EAE in the mouse is not MS and may be far removed when one considers molecular mechanisms, especially as MS is not a simple anti-myelin protein autoimmune condition. Therefore, this study could be following gene trajectories that do not exist in MS. This needs a significant amount of discussion in the manuscript if the authors suggest that it is mimicking MS. - The model does not have the cortical subpial demyelination typical of MS and it is unknown whether neuronal loss occurs in this model, which is the main feature of cytokine-mediated neurodegeneration in MS. If it does not then a whole set of genes will be missing that are involved in the neuronal response to inflammatory stimuli that may be cytotoxic. - Visium technology does not get down to single cell level and does not appear to allow resolution of the border between the meninges and the underlying grey matter. - Neuronal loss in the MS cortex is independent of demyelination and therefore not related to remyelination failure. There does not appear to be any cortical grey matter demyelination in these animals, so it is difficult to relate any of the gene changes seen here to demyelination. - No mention of how the ascending and descending patterns of gene expression may be due to the gradient of microglial activation that underlies meningeal inflammation, which is a big omission.
 
 We thank the reviewer for their insightful comments on the strengths and limitations of our study. The SJL EAE model we use in this paper is certainly not a perfect model of meningeal inflammation in MS (indeed, we believe that no such animal model exists), but it does recapitulate several key features of human disease, as described by the reviewer. Spatial transcriptomics of cortical grey matter lesions and overlying meninges in samples derived from patients with MS would be ideal, though access to this tissue is highly limited. In the revised manuscript we will include a more detailed discussion of the limitations in applying these findings to MS. However, in addition to potential implications for MS research, our data contribute more generally to the understanding of meningeal inflammation and the penetrance of inflammation into brain tissue.
 
 We acknowledge that sub-pial neuronal loss has not been assessed in SJL EAE, and if present it would increase the relevance of this model to neurodegeneration. We are currently working to assess this.
 
 We agree with the reviewer that Visium technology is limited in its ability to discriminate individual cells, as discussed above (2.2).
 
 We agree that gene expression by activated microglia is likely a major driver of the transcriptomic changes observed in the parenchyma, and thank the reviewer for highlighting this. We will add discussion of this to our revised manuscript, and intend to generate additional data regarding the contribution of subpial microglial activation to the measured transcriptomic changes.
 
 Finally, we thank Reviewer #3 for their assessment of our work.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.02.543421v1
www.biorxiv.org

New submission 10/07/2023, 10:28:37

1
1. Public_Reviews 10 Jul 2023
  
  in eLife
  
  Author Response
  
  eLife assessment:
  
  Trypanosoma brucei evades mammalian humoral immunity through the expression of different variant surface glycoprotein genes. In this fundamental paper, the authors extend previous observations that TbRAP1 both interacts with PIP5pase and binds PI(3,4,5)P3, indicating a role for PI(3,4,5)P3 binding and suggesting that antigen switching is signal dependent. While much of the evidence is compelling, one reviewer suggested that the work would benefit from further controls.
  
  We appreciate the evaluation of the work and agree that the findings substantially advance our understanding of antigenic variation. A detailed response to the public review is included below, which addresses and clarifies the issues raised by the reviewers, including those concerning controls. We also want to highlight the comment by Reviewer #3 “The methods used in the study are rigorous and well-controlled…. their results support the conclusions made in the manuscript.”. We hope this and our comments will help address the issue of controls in this eLife statement.
  
  Reviewer #1 (Public Review):
  
  Trypanosoma brucei undergoes antigenic variation to evade the mammalian host’s immune response. To achieve this, T. brucei regularly expresses different VSGs as its major surface antigen. VSG expression sites are exclusively subtelomeric, and VSG transcription by RNA polymerase I is strictly monoallelic. It has been shown that T. brucei RAP1, a telomeric protein, and the phosphoinositol pathway are essential for VSG monoallelic expression. In previous studies, Cestari et al. (ref. 24) have shown that PIP5pase interacts with RAP1 and that RAP1 binds PI(3,4,5)P3. RNAseq and ChIPseq analyses have been performed previously in PIP5pase conditional knockout cells, too (ref. 24). In the current study, Touray et al. did similar analyses except that catalytic dead PIP5pase mutant was used and the DNA and PI(3,4,5)P3 binding activities of RAP1 fragments were examined. Specifically, the authors examined the transcriptome profile and did RAP1 ChIPseq in PIP5pase catalytic dead mutant. The authors also expressed several C-terminal His6-tagged RAP1 recombinant proteins (full-length, aa1-300, aa301-560, and aa 561-855). These fragments’ DNA binding activities were examined by EMSA analysis and their phosphoinositides binding activities were examined by affinity pulldown of biotin-conjugated phosphoinositides. As a result, the authors confirmed that VSG silencing (both BES-linked and MES-linked VSGs) depends on PIP5pase catalytic activity, but the overall knowledge improvement is incremental. The most convincing data come from the phosphoinositide binding assay as it clearly shows that N-terminus of RAP1 binds PI(3,4,5)P3 but not PI(4,5)P2, although this is only assayed in vitro, while the in vivo binding of full-length RAP1 to PI(3,4,5)P3 has been previously published by Cestari et al (ref. 24) already. 
Considering that many phosphoinositides exert their regulatory role by modulating the subcellular localization of their bound proteins, it is reasonable to hypothesize that binding to PI(3,4,5)P3 can remove RAP1 from the chromatin. However, no convincing data have been shown to support the authors’ hypothesis that this regulation is through an “allosteric switch”. Therefore, the title should be revised.
  
  We appreciate the reviewer’s detailed evaluation of our work. There are a few general comments that we would like to clarify. We will break them into three points. All data included here are new and were not previously published.
  
  i) “RNAseq and ChIPseq analyses have been performed previously …(ref. 24).” Reference 24 is Cestari et al. 2019, Mol Cell Biol. Neither we nor others have published ChIP-seq of RAP1 in T. brucei. Previous work showed ChIP-qPCR, which analyses specific loci. The ChIP-seq shows genome-wide binding sites of RAP1, and new findings are shown here, including binding sites in the BES, MESs, and other genome loci such as centromeres. We also identified DNA sequence bias defining RAP1 binding sites (Fig 2A). We also show by ChIP-seq how RAP1 binding to these loci changes upon expression of catalytically inactive PIP5Pase. As for the RNA-seq, this is also the first time we show RNA-seq of T. brucei expressing catalytically inactive PIP5Pase, which establishes that the regulation of VSG silencing and switching is dependent on PIP5Pase enzyme catalysis, i.e., PI(3,4,5)P3 dephosphorylation. To improve clarity in the manuscript, we edited page 4, line 122, as follows: “We showed that RAP1 binds telomeric or 70 bp repeats (24), but it is unknown if it binds to other ES sequences or genomic loci.”
  
  ii) “The in vivo binding of full-length RAP1 to PI(3,4,5)P3 has been previously published by Cestari et al. (ref. 24) already.”. We published in reference 24 that RAP1-HA can bind synthetic PI(3,4,5)P3 conjugated to agarose beads. Here, we were able to measure T. brucei endogenous PI(3,4,5)P3 associated with RAP1-HA (Fig 4F). Moreover, we showed that the binding of RAP1-HA to endogenous PI(3,4,5)P3 is about 100-fold higher in cells expressing catalytically inactive PIP5Pase than in cells expressing WT PIP5Pase. The data establish that endogenous PI(3,4,5)P3 binds to RAP1-HA in vivo and show how the binding changes in cells expressing mutant PIP5Pase; these data are new and relevant to our conclusions.
  
  iii) “no convincing data have been shown to support the author’s hypothesis that this regulation is through an “allosteric switch””. We show here in vitro and in vivo data supporting the conclusion. We show that PI(3,4,5)P3 binds to the N-terminus of rRAP1-His with a calculated Kd of about 20 µM (Fig 4B-E, Table 1). In contrast, we show by EMSA and binding kinetics by microscale thermophoresis that rRAP1-His binds to 70 bp and telomeric repeats via protein regions encompassing the Myb (central) or Myb-L domains (C-terminal) but not the N-terminus containing the VHP domain (Fig 3C-G, and Fig S5). Using microscale thermophoresis, we also show that rRAP1-His binds to 70 bp and telomeric repeats with Kd of 10 and 24 nM, respectively (Fig 3 and Table 1). Notably, we show that 30 µM of PI(3,4,5)P3, but not PI(4,5)P2, used as a control, disrupts rRAP1-His binding to 70 bp and telomeric repeats, changing Kds to about 188 and 155 nM, respectively (Fig 5A-C). We also show that PI(3,4,5)P3 does not disrupt the binding of rRAP1-His fragments (Myb or Myb-L) without the N-terminus domain (Fig S5), implying that binding of PI(3,4,5)P3 to the RAP1 N-terminus is required for displacement of the RAP1 DNA binding domains (Myb and Myb-L) from telomeric and 70 bp repeats, and that PI(3,4,5)P3 is not competing for Myb or Myb-L binding to DNA. Moreover, we show that RAP1-HA binding to 70 bp and telomeric repeats in vivo is displaced in T. brucei cells expressing catalytically inactive PIP5Pase (Fig 5D-G), which we show results in RAP1-HA binding about 100-fold more endogenous PI(3,4,5)P3 than in T. brucei expressing WT PIP5Pase (Fig 4F). The in vivo data agree with the in vitro data. The data show a typical allosteric regulator system, in which binding of a ligand to one site of the protein, here PI(3,4,5)P3 binding to the RAP1 N-terminus, affects the binding of other domains (RAP1 Myb and Myb-L domains) to DNA.
To improve the clarity of the title, we will change it in the revised version to imply a direct role of PI(3,4,5)P3 regulation of RAP1 in the process. This will provide more specific information to the readers and address the reviewer's concern related to the “allosteric switch”. The new title will be: PI(3,4,5)P3 allosteric regulation of RAP1 controls antigenic switching in trypanosomes
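
  The size of the affinity shift reported in point iii can be made concrete with a short calculation. The sketch below uses only the Kd values quoted in the response. The 1:1 binding isotherm and the 50 nM protein concentration are assumptions chosen for illustration and are not taken from the manuscript.

```python
# Apparent Kd values (nM) quoted in the response:
# (without PI(3,4,5)P3, with 30 uM PI(3,4,5)P3)
KD_NM = {
    "70 bp repeats": (10.0, 188.0),
    "telomeric repeats": (24.0, 155.0),
}


def fold_change(kd_without, kd_with):
    """Fold increase in apparent Kd (higher Kd means weaker binding)."""
    return kd_with / kd_without


def fraction_bound(protein_nM, kd_nM):
    """Fraction of DNA bound under a simple 1:1 isotherm.

    Assumes free protein is approximately equal to total protein,
    which is a simplification made for this illustration.
    """
    return protein_nM / (protein_nM + kd_nM)


HYPOTHETICAL_PROTEIN_NM = 50.0  # illustrative value, not from the manuscript

for probe, (kd_without, kd_with) in KD_NM.items():
    print(
        f"{probe}: Kd {kd_without:g} -> {kd_with:g} nM "
        f"({fold_change(kd_without, kd_with):.1f}-fold weaker); "
        f"fraction bound at {HYPOTHETICAL_PROTEIN_NM:g} nM protein: "
        f"{fraction_bound(HYPOTHETICAL_PROTEIN_NM, kd_without):.2f} -> "
        f"{fraction_bound(HYPOTHETICAL_PROTEIN_NM, kd_with):.2f}"
    )
```

  Under these assumptions the apparent Kd rises about 18.8-fold for 70 bp repeats and about 6.5-fold for telomeric repeats, so a protein concentration that nearly saturates the DNA without PI(3,4,5)P3 leaves most of it unbound in its presence.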
  
  There are serious concerns about many conclusions made by Touray et al., according to their experimental approaches:
  
  1) The authors have been studying RAP1’s chromatin association pattern by ChIPseq in cells expressing a C-terminal HA tagged RAP1. According to data from tryptag.org, RAP1 with an N-terminal or a C-terminal tag does not seem to have identical subcellular localization patterns, suggesting that adding tags at different positions of RAP1 may affect its function. It is therefore essential to validate that the C-terminally HA-tagged RAP1 still has its essential functions. However, this data is not available in the current study. RAP1 is essential. If RAP1-HA still retains its essential functions, cells carrying one RAP1-HA allele and one deleted allele are expected to grow the same as WT cells. In addition, these cells should have the WT VSG expression pattern, and RAP1-HA should still interact with TRF. Without these validations, it is impossible to judge whether the ChIPseq data obtained on RAP1-HA reflect the true chromatin association profile of RAP1.
  
  Tryptag data show that both N- and C-terminally tagged RAP1 localize to the nucleus in procyclic forms, although there are differences in signal intensities in the images (http://tryptag.org/?id=Tb927.11.370). It is important to note that Tryptag data are from procyclic forms, and DNA constructs are not validated for their integration in the correct locus. As for the RAP1-HA localization in bloodstream forms, we demonstrated that C-terminally HA-tagged RAP1 co-localizes with telomeres by a combination of immunofluorescence and fluorescence in situ hybridization (Cestari and Stuart, 2015, PNAS), and RAP1-HA co-immunoprecipitates telomeric and 70 bp repeats (Cestari et al. 2019 Mol Cell Biol). We also showed by immunoprecipitation and mass spectrometry that HA-tagged RAP1 interacts with nuclear and telomeric proteins, including PIP5Pase (Cestari et al. 2019). Others have also tagged T. brucei RAP1 in bloodstream forms with HA without disrupting its nuclear localization (Yang et al. 2009, Cell; Afrin et al. 2020, Science Advances). As for the experiment suggested by the reviewer, there is no guarantee that cells lacking one allele of RAP1 will behave as wildtype, i.e., with normal growth and repression of VSG genes. Also, less than 90% of T. brucei TRF was reported to interact with RAP1 (Yang et al. 2009, Cell), and this interaction might be indirect, via their binding to telomeric DNA repeats, rather than a direct protein-protein interaction.
  
  2) Touray et al. expressed and purified His6-tagged recombinant RAP1 fragments from E. coli and used these recombinant proteins for EMSA analysis: The His6 tag has been used for purifying various recombinant proteins. It is most likely that the His6 tag itself does not convey any DNA binding activities. However, using His6-tagged RAP1 fragments for EMSA analysis has a serious concern. It has been shown that His6-tagged human RAP1 protein can bind dsDNA, but hRAP1 without the His6 tag does not. It is possible that RAP1 proteins in combination with the His6 tag can exhibit certain unnatural DNA binding activities. To be rigorous, the authors need to remove the His6 tag from their recombinant proteins before the in vitro DNA binding analyses are performed. This is a standard procedure for many in vitro assays using recombinant proteins.
  
  We show in Fig 3C-G that His-tagged full-length rRAP1 does not bind to scrambled telomeric dsDNA sequences, which indicates that His-tagged rRAP1 does not bind nonspecifically to DNA. Moreover, in Fig 3G, we show that His-tagged rRAP1(aa 1-300) also does not bind to 70 bp or telomeric repeats. In contrast, full-length His-tagged rRAP1, rRAP1(aa 301-560), or rRAP1(aa 561-855) bind to 70 bp or telomeric repeats (Fig 3C-G). Since all proteins were His-tagged, the His tag cannot be responsible for the DNA binding.
  
  As for the statement that human rRAP1-His has nonspecific DNA binding properties, we could not find a reference for it, and we cannot make a comparison without knowing the details of the experiment. Biochemical assays can result in nonspecific binding depending on binding/buffer conditions. Also, human and T. brucei RAP1 share only 15% amino acid identity; nonspecific binding to DNA could be specific to human RAP1.
  
  3) It is unclear why Nanopore sequencing was used for RNAseq and ChIPseq experiments. The greatest benefit of Nanopore sequencing is that it can sequence long reads, which usually helps with mapping, particularly at genome loci with repetitive sequences. This seems beneficial for RAP1 ChIPseq analysis as RAP1 is expected to bind telomere repeats. However, for ChIPseq, the chromatin needs to be fragmented. Larger DNA fragments from ChIPseq experiments will decrease the accuracy of the final calculated binding sites. Therefore, ChIPseq experiments are not supposed to have long reads to start with, so Nanopore sequencing does not seem to bring any advantage. In addition, compared to Illumina sequencing, Nanopore sequencing usually yields smaller numbers of reads, and the sequencing accuracy rate is lower. The Nanopore sequencing accuracy may be a serious concern in the current study. All telomeres have the perfect TTAGGG repeats, all VSG genes have a very similar 3’ UTR, and all 70 bp repeats have very similar sequences. In fact, the active and silent ESs have 90% sequence identity. Are sequence reads accurately mapped to different ESs? How is the sequencing and mapping quality controlled? Furthermore, it is unclear whether the read depth for RNAseq is deep enough.
  
  The mean sequence length for the ChIP-seq was about 500 bp (see Table S3), which helps to align reads to ESs and distinguish the different ESs, and it is a reasonable size range to define RAP1 binding sites. Although sequencing depths are usually higher in Illumina than in nanopore (all depending on the amount of sequencing), most Illumina short reads map to multiple genomic sequences, making it difficult to distinguish ESs. This is particularly important for RAP1 because it binds to repeats such as 70 bp and telomeric repeats. Mapping short reads to those regions would be virtually impossible; hence, our choice of nanopore sequencing. For RNA-seq, the ~500 bp read length helps sequence alignment to the subtelomeric regions containing many VSG genes. The nanopore reads obtained here had an average sequencing quality score of 12 (i.e., base call accuracy of 94%). Filtering reads with MAPQ ≥ 20 (99% probability of correct alignment) helped us to distinguish RAP1 binding to specific ESs, including silent vs active ES (ChIP-seq) or VSG sequences (RNA-seq). The details of the analysis and sequencing metrics (i.e., sequencing depth and read length) were described in the Methods section “Computational analysis of RNA-seq and ChIP-seq” and Table S3, respectively.
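
  The two percentages quoted above follow from the standard Phred scale, on which a score Q corresponds to an error probability of 10^(-Q/10); MAPQ uses the same scale for the probability that an alignment is wrong. A minimal sketch of the conversion, with function names chosen here for illustration:

```python
def phred_to_error(q):
    """Error probability implied by a Phred-scaled score (base quality or MAPQ)."""
    return 10 ** (-q / 10.0)


def phred_to_accuracy(q):
    """Probability that the base call (or alignment) is correct."""
    return 1.0 - phred_to_error(q)


base_accuracy = phred_to_accuracy(12)       # about 0.937, i.e. ~94% per base
mapping_confidence = phred_to_accuracy(20)  # 0.99, i.e. 99% correct placement

# Expected number of miscalled bases in a read of the quoted mean length,
# assuming every base has the average quality (a simplification).
expected_errors_500bp = 500 * phred_to_error(12)  # about 32 bases
```

  The last line shows why the MAPQ filter matters: a 500 bp read at Q12 is expected to carry roughly 30 miscalled bases, so confident placement depends on the read as a whole rather than on any single base.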
  
  4) Many statements in the discussion section are speculations without any solid evidence. For example, lines 218 - 219 “likely due to RAP1 conformational changes”, no data have been shown to support this at all. In lines 224-226, the authors acknowledged that more experiments are necessary to validate their observations, so it is important for the authors to first validate their findings before they draw any solid conclusions. Importantly, RAP1 has been shown to help compact telomeric and subtelomeric chromatin a long time ago by Pandya et al. (2013. NAR 41:7673), who actually examined the chromatin structure by MNase digestion and FAIRE. The authors should acknowledge previous findings. In addition, the authors need to revise the discussion to clearly indicate what they “speculate” rather than make statements as if it is a solid conclusion.
  
  The statement “likely due to RAP1 conformational changes” in lines 218-219 (page 6) is part of the Discussion. We did not make a strong statement but discussed a possibility. We believe that it is beneficial to the reader to have the data discussed, and we do not feel this point is overly speculative.
  
  For lines 224-226 (page 6), the statement refers to the finding of RAP1 binding to centromeric regions by ChIP-seq, which is a new finding but not the focus of this work. Hence, future studies are necessary for this finding, and we believe it is appropriate in the Discussion to be upfront and highlight this point to the readers. However, for the RAP1 binding to telomeric ES sites, e.g., 70 bp repeats and telomeric repeats (the focus of this work), we validated the binding by EMSA and by performing binding kinetics using microscale thermophoresis.
  
  We did not include Pandya et al. 2013 NAR because the authors demonstrated RAP1 compaction of chromatin to occur in procyclic forms only. Pandya et al. stated in their abstract: “no significant chromatin structure changes were detected on depletion of TbRAP1 in BF cells”. Hence, the suggested reference is not relevant to the context of our conclusions in bloodstream forms. Nevertheless, we have reviewed the Discussion to avoid broad speculations in the revised version of the manuscript.
  
  There are also minor concerns:
  
  1) In the PIP5Pase conditional knockout system, the WT or mutant PIP5Pase with a V5 tag is constitutively expressed from the tubulin array. What’s the relative expression level of this allele and the endogenous PIP5Pase? Without a clear knowledge of the mutant expression level, it is hard to conclude whether the mutant has any dominant negative effects or whether the mutant phenotype is simply due to a lower than WT PIP5pase expression level.
  
  The relative mRNA levels of the exclusively expressed PIP5Pase Mut compared to the WT are available in Data S1, RNA-seq. The Mut allele’s relative expression level is 0.85-fold that of the WT allele (both from tubulin loci). We also showed by Western blot the WT and Mut PIP5Pase protein expression (Cestari et al. 2019, Mol Cell Biol). Concerning PIP5Pase endogenous alleles, we compared RNA-seq read counts per million from the conditional null PIP5Pase cells exclusively expressing WT or the Mut PIP5Pase alleles (Data S1, this work) to our previous RNA-seq of the single-marker 427 strain (Cestari et al. 2019, Mol Cell Biol). We used the single-marker 427 strain because the conditional null cells were generated in this strain background. The PIP5Pase WT and Mut mRNAs expressed from tubulin loci are 1.6 and 1.3-fold the endogenous PIP5Pase levels in single-marker 427, respectively. We include a statement in the Methods, page 7, lines 265-268: “The WT or Mut PIP5Pase mRNAs exclusively expressed from tubulin loci are 1.6 and 1.3-fold the WT PIP5Pase mRNA levels expressed from endogenous alleles in the single marker 427 strain. The fold-changes were calculated from RNA-seq reads counts per million from this work (WT and Mut PIP5Pase, Data S1) and our previous RNA-seq from single marker 427 strain (24).”
  
  2) In EMSA analysis, what are the concentrations of the protein and the probe used in each reaction? The amount of protein used in the binding assay appears to be very high, and this can contribute to the observation that many complexes are stuck in the well. Better quality EMSA data need to be shown to support the authors’ claims.
  
  All concentrations were provided in the Methods section. See page 9, Electrophoretic mobility shift assays: “100 nM of annealed DNA were mixed with 1 μg of recombinant protein…”. For microscale thermophoresis, also see page 9, Microscale thermophoresis binding kinetics: “1 μM rRAP1 was diluted in 16 two-fold serial dilutions in 250 mM HEPES pH 7.4, 25 mM MgCl2, 500 mM NaCl, and 0.25% (v/v) NP-40 and incubated with 20 nM telomeric or 70 bp repeats…”. Note that two different biochemical approaches, EMSA and microscale thermophoresis, were used to assess rRAP1-His binding to DNA. Both show similar results (Fig 3 and 5, and Fig S5; microscale thermophoresis shows the binding kinetics, data available in Table 1). The EMSA images clearly show the binding of RAP1 to 70 bp or telomeric repeats but not to scrambled telomeric repeat DNA.
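  For readers unfamiliar with the titration layout, the concentrations in a two-fold dilution series can be sketched as follows. This is a minimal illustration that assumes the first of the 16 points is the undiluted 1 μM protein; the Methods excerpt does not state this explicitly, and the function name is hypothetical.

```python
def twofold_dilution_series(start_nm: float, points: int) -> list[float]:
    """Concentrations (nM) of a two-fold serial dilution with `points` samples.

    Assumption: the first point is the undiluted starting concentration.
    """
    if points < 1:
        raise ValueError("points must be at least 1")
    return [start_nm / 2**i for i in range(points)]


# 1 uM = 1000 nM starting protein concentration, 16 points.
series = twofold_dilution_series(1000.0, 16)
for index, concentration in enumerate(series, start=1):
    print(f"point {index:2d}: {concentration:.4f} nM")
```

  Under that assumption the series spans 1000 nM down to roughly 0.03 nM, while the labeled DNA is held constant at 20 nM.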
  
  Reviewer #2 (Public Review):
  
  This manuscript by Touray, et al. provides a significant new twist to our understanding of how antigenic variation may be regulated in T. brucei. Key aspects of antigenic variation are the mutually exclusive expression of a single antigen per cell and the periodic switching from expression of one antigen isoform to another. In this manuscript, the authors show, as they have previously shown, that depletion of the nuclear phosphatidylinositol 5-phosphatase (PIP5Pase) results in a loss of mutually exclusive VSG expression. Furthermore, using ChIP-seq, the authors show that the repressor/activator protein 1 (RAP1) binds to regions upstream and downstream of VSG genes located in transcriptionally repressed expression sites and that this binding is lost in the absence of a functional PIP5Pase. Importantly, the authors decided to further investigate this link between PIP5Pase and RAP1, a protein that has previously been implicated in antigenic variation in T. brucei, and found that inactivation of PIP5Pase results in the accumulation of PI(3,4,5)P3 bound to the RAP1 N-terminus and that this binding impairs the ability of RAP1 to bind DNA. Based on these observations, the authors suggest that the levels of PI(3,4,5)P3 may determine the cellular function of RAP1, either by binding upstream of VSG genes and repressing their function, or by not binding DNA and allowing the simultaneous expression of multiple VSG genes in a single parasite.
  
  While I find most of the data presented in this manuscript compelling, there are aspects of Figure 1 that are not clear to me. Based on Figure 1F, the authors claim that transient inactivation of PIP5Pase results in a switch from the expression of one VSG isoform to another. However, I am not exactly sure what the authors are showing in this panel, nor do the data in Figure 1F seem to be consistent with those shown in Figure 1C. Based on Figure 1F, a transient inactivation of PIP5Pase appears to result in an almost exclusive switch to a VSG located in BES12. However, based on Figure 1E, the VSG transcripts most commonly found after a transient inactivation of PIP5Pase are those from the previously active VSG (BES1) and VSGs located on chr 1 and 6 (I believe). The small font and the low resolution make it impossible to infer the location of the expressed VSG genes, nor to confirm that ALL VSG genes located in expression sites are activated, as the authors claim. Also, I was not able to access the raw ChIP-seq and RNA-seq reads. Thus, I could not evaluate the quality of the sequencing data.
  
  We appreciate the reviewer’s comments and evaluation of our work. Fig 1E shows VSG-seq of a population after transient (24h) exclusive expression of the PIP5Pase mutant, followed by re-expression of the WT PIP5Pase allele for 60 hours (multiple VSGs are detected). As a control, it also shows VSG-seq in cells continuously expressing WT PIP5Pase (mostly VSG2, BES1 is detected). Fig 1F and Fig S1 show the sequencing of VSGs expressed by clones isolated (5-6 days of growth) after a temporary knockdown (24h) of PIP5Pase (tet -), followed by its re-expression. For comparison, no knockdown (tet +) was included. Fig 1E shows potential switchers in the population; Fig 1F confirms VSG switching in clones.
  
  To clarify the difference between Fig 1E and 1F, we edited the manuscript on page 3, lines 103-110: “To verify PIP5Pase role in VSG switching, we knocked down PIP5Pase for 24h (Tet -), then restored its expression (Tet +) and isolated clones by limiting dilution and growth for 5-6 days. Analysis of isolated clones after temporary PIP5Pase knockdown (Tet -/+) confirmed VSG switching in 93 out of 94 (99%) of the analyzed clones (Fig 1F, Fig S1). The cells switched to express VSGs from silent ESs or subtelomeric regions, indicating switching by transcription or recombination mechanisms. Moreover, no switching was detected in 118 isolated clones from cells continuously expressing WT PIP5Pase (Tet +, Fig 1F).” We also edited Fig 1F to indicate temporary knockdown (Tet -/+) vs no knockdown (Tet +). The modifications will be available in the resubmitted version of the manuscript.
  
  We agree that the heat map is difficult to read due to the amount of information. We will include in the revised version of the manuscript a table with the data in the supplementary information; the reader will be able to evaluate the data in detail.
  
  A preference for switching to specific ESs has been observed in T. brucei (Morrison et al. 2005, Int J Parasitol; Cestari and Stuart, 2015, PNAS), which may explain several clones switching to BES12. Many potential switchers were detected in the VSG-seq (Fig 1E; the whole cell population is over 10⁷ parasites), but not all potential switchers were detected in the clonal analysis because we analyzed 212 clones in total, a small fraction of the over 10⁷ cells analyzed by VSG-seq (Fig 1E). Also, it is possible that not all potential switchers are viable. However, the point of the clonal analysis is to validate the VSG switching after genetic perturbation of PIP5Pase.
  
  Fig 1C shows examples of ES derepression by RNA-seq after 24h exclusive expression of the mutant compared to WT PIP5Pase. The RNA-seq shows that all ESs are derepressed (Fig 1B). This can be visualized in the volcano plot (Fig 1B, BES and MES VSGs are labelled) and on the spreadsheet Data S1. Although all ESs are derepressed after PIP5Pase mutant expression, not all ESs are selected during switching, as observed in Fig 1E-F. This agrees with our previous observations in switching assays with proteins that control VSG switching (Cestari and Stuart, 2015, PNAS).
  
  As for sequencing metrics and raw sequencing data, see the Methods section, page 13, lines 483-485: “Sequencing information is available in Table S3 and fastq data is available in the Sequence Read Archive (SRA) with the BioProject identification PRJNA934938.” Table S3 has a summary of sequencing data. Metrics information such as sequencing quality and analysis can be found in the Methods section “Computational analysis of RNA-seq and ChIP-seq”. The latter includes information about nanopore reads, i.e., mean Q-score of 12.
  
  Reviewer #3 (Public Review):
  
  In this manuscript, Touray et al investigate the mechanisms by which PIP5Pase and RAP1 control VSG expression in T. brucei and demonstrate an important role for this enzyme in a signalling pathway that likely plays a role in antigenic variation in T. brucei.
  
  The methods used in the study are rigorous and well-controlled. The authors convincingly demonstrate that RAP1 binds to PI(3,4,5)P3 through its N-terminus and that this binding regulates RAP1 binding to VSG expression sites, which in turn regulates VSG silencing. Overall their results support the conclusions made in the manuscript.
  
  There are a few small caveats that are worth noting. First, the analysis of VSG derepression and switching in Figure 1 relies on a genome that does not contain minichromosomal (MC) VSG sequences. This means that MC VSGs could theoretically be misassigned as coming from another genomic location in the absence of an MC reference. As the origin of the VSGs in these clones isn’t a major point in the paper, I do not think this is a major concern, but I would not over-interpret the particular details of switching outcomes in these experiments.
  
  The authors state that “our data imply that antigenic variation is not exclusively stochastic.” I am not sure this is true. While I also favor the idea that switching is not exclusively stochastic, evidence for a signaling pathway does not necessarily imply that antigenic variation is not stochastic. This pathway could be important solely for lifecycle-related control of VSG expression, rather than antigenic variation during infection. Nevertheless, these data are critical for establishing a potential pathway that could control antigenic variation and thus represent a fundamental discovery.
  
  Another aspect of this work that is perhaps important, but not discussed much by the authors, is the fact that signalling is extremely poorly understood in T. brucei. In Figure 1B, the RNA-seq data show many genes upregulated after expression of the Mut PIP5Pase (not just VSGs). The authors rightly avoid claiming that this pathway is exclusive to VSGs, but I wonder if these data could provide insight into the other biological processes that might be controlled by this signaling pathway in T. brucei.
  
  Overall, this is an excellent study that represents an important step forward in understanding how antigenic variation is controlled in T. brucei. The possibility that this process could be controlled via a signalling pathway has been speculated for a long time, and this study provides the first mechanistic evidence for that possibility.
  
  We thank the reviewer for the evaluation of our work. We agree that it is difficult to ascertain the origin of all VSG genes without minichromosome sequences in the reference; hence we did not emphasize this point in the manuscript. We used the 427-2018 reference genome assembled by PacBio and Hi-C (Muller et al. 2018, Nature), which we believe is the best assembly for the 427 strain, especially with respect to the VSG genes.
  
  We also agree that signaling controlling switching in vitro does not necessarily mean that switching occurs by signaling in vivo. Nevertheless, although stochastic switching is an accepted model, it has not been proven, whereas we provide molecular evidence that signaling can cause switching. To address the reviewer’s comment, we edited the Discussion, page 7, line 250: from “our data imply that antigenic variation is not exclusively stochastic” to “our data suggest that antigenic variation is not exclusively stochastic”.
  
  Most of the upregulated genes in the RNA-seq data were VSG genes/pseudogenes. Other upregulated genes included retrotransposons and DNA/RNA processing enzymes such as endonucleases and polymerases. We included in the Results, page 3, line 100: “Other genes upregulated include primarily retrotransposons, endonucleases, and polymerase proteins.”
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.11.540368v3
www.biorxiv.org

New submission 02/07/2023, 10:48:57

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  Associative learning assigns valence to sensory cues paired with reward or punishment. Brain regions such as the amygdala in mammals and the mushroom body in insects have been identified as primary sites where valence assignment takes place. However, little is known about the neural mechanisms that translate valence-specific activity in these brain regions into appropriate behavioral actions. This study identifies a small set of upwind neurons (UpWiNs) in the Drosophila brain that receive direct inputs from two mushroom body output neurons (MBONs) representing opposite valences. Through a series of behavioral, imaging, and electrophysiological experiments, the authors show that UpWiNs are differentially regulated by the two MBONs, i.e., inhibited by the glutamatergic MBON-α1 (encoding negative valence) while activated by the cholinergic MBON-α3 (encoding positive valence). They also show that UpWiNs control the wind-directed behavior of flies. Activation of UpWiNs is sufficient to drive flies to orient and move upwind, and inhibition of UpWiNs reduces flies' upwind movement toward the source of reward-predicting odors (CS+). These results, together with existing knowledge about the function of the mushroom body in memory processing, suggest an appealing model in which reward learning decreases and increases the responses of MBON-α1 and MBON-α3 to the CS+ odor, respectively, and these changes cause UpWiNs to respond more strongly to the CS+ odor and drive upwind locomotion. Interestingly, in the final part of the results, the authors reveal a wind-independent function of UpWiNs: increasing the probability that flies will revisit the site where UpWiNs were activated. Thus, UpWiNs guide learned reward-seeking behavior with and without airflow.
  Although the mushroom body has been extensively studied for its role in learning and memory, the downstream neural circuits that read the information from the mushroom body to guide memory-driven behaviors remain poorly characterized. This study provides an important piece of the puzzle for this knowledge gap.
  
  Strength
  
  1) Memory studies have predominantly relied on binary choice (go or no-go) assays as measures of memory performance. While these assays are convenient and efficient, they fall short of providing a comprehensive understanding of underlying behavioral structures. In an effort to overcome this limitation, the current study used video recording and tracking software to delve deeper into memory-guided behavior. This innovative approach allowed the authors to uncover novel neurons and examine their contribution to behavior with a level of detail not possible with binary choice assays.
  
  2) This study used electron microscopy-based Drosophila hemibrain connectome data to reveal the synaptic connection between UpWiNs and MBON-α1 and MBON-α3. Using this method, the study shows that a single UpWiN receives direct input from both MBON-α1 and MBON-α3, which is confirmed by a functional imaging experiment. The connectome dataset also reveals several neurons downstream of UpWiNs, opening avenues for further research into the neural mechanisms linking memory and behavior.
  
  Weakness
  
  1) The authors repeatedly state in the manuscript that MBON-α1 and MBON-α3 convey appetitive or aversive memories, respectively. This assertion may not be entirely accurate. Evidence from sugar reward conditioning experiments suggests that MBON-α3 is potentiated and required for sugar reward memory retrieval. Therefore, the compartmentalization for appetitive and aversive memories appears not as obvious at the level of MBONs.
  
  What we intended was that activation of DANs in these compartments can induce aversive and appetitive memories, respectively, when paired with odors, and that these are the sole output pathway from these compartments to read out the memories in these compartments. As we previously proposed (Aso et al., 2014a eLife), these MBONs can integrate inputs from MBONs of other compartments and their activity can reflect appetitive memory stored as synaptic plasticity in other compartments. Since DANs in the α3 compartment respond to heat, bitter and electric shock but not sugar, the observation that MBON-α3 acquires an enhanced CS+ odor response after appetitive conditioning is presumably due to these intercompartmental connections rather than plasticity of KC-MBON synapses in the α3 compartment. In any case, the fact that excitatory activity of MBON-α1 and MBON-α3 conveys opposite valence of memory still holds true since appetitive conditioning induces depression and potentiation of odor responses, respectively.
  
  To clarify this point, we now cite related literature in the following sentence in the final paragraph of the Introduction: “UpWiNs receive inputs from several types of lateral horn neurons and integrate inhibitory and excitatory inputs from MBON-α1 and MBON-α3, which are the output neurons of MB compartments that store long-lasting appetitive or aversive memories, respectively (Aso and Rubin, 2016; Ichinose et al., 2015; Jacob and Waddell, 2022a; Pai et al., 2013; Yamagata et al., 2015).”
  
  2) This study did not conclusively establish the importance of the MBON-α1/α3 to UpWiN pathways in memory-driven behavior. In the experiments shown in Figure 5, flies were trained to associate the activation of reward-related DANs with a specific odor (CS+). After conditioning, UpWiNs were observed to show enhanced responses to the CS+ odor. However, the results should be interpreted with caution because the driver line used to activate DANs (R58E02-LexAp65) labels not only DANs projecting to the MBON-α1 compartment, but all DANs in the protocerebral anterior medial (PAM) cluster. Thus, it remains unclear to what extent the observed enhanced responses are influenced by changes in inhibitory inputs from MBON-α1. While UpWiNs have been shown to play a critical role in the expression of sugar reward memory (Figure 7), it should be noted that UpWiNs receive inputs from multiple upstream neurons, making it difficult to accurately assess the contribution of MBON-α1/α3 to UpWiN pathways in UpWiN recruitment. Further research is needed to fully address this issue.
  
  We agree with this point and have added text to explain an alternative mechanism: “This enhancement of CS+ response can be most easily explained as an outcome of disinhibition from MBON-α1 whose output had been decreased by memory formation; MBON-α1 is inhibitory to UpWiNs (Figure 4B) and MBON-α1 response to the CS+ is reduced following the same training protocol (Yamada et al. 2023). In addition to such a mechanism, plasticity in the β1 compartment may contribute to the enhanced CS+ response in UpWiNs because the driver R58E02 contains DANs in the β1 and glutamatergic MBON from the β1 directly synapse on the dendrites of MBON-α1 and MBON-α3.”
  
  3) UpWind neurons (UpWiNs) were so named because their activation promotes upwind locomotion. However, when activated in the absence of airflow, flies show increased locomotor speed and an increased probability of revisiting the same location (Figure 7 and Figure 7-figure supplement 1). The revisiting behavior can be observed during the activation of UpWiNs, which is distinct from the local search behavior that typically begins after a reward stimulus is turned off (e.g., Gr64f-GAL4 results in Figure 7-figure supplement 1).
  
  Return probability was calculated within a 15-s time window. A high return probability during the LED ON period (10-20 s) in Figure 7-figure supplement 1 does not necessarily mean that flies returned during the LED ON period. If a fly is at position A at t = 10 s, then to be counted as “returned” it needs to move more than 10 mm away from A and move back to a position less than 3 mm from A by t = 25 s. In the case of sugar sensory neuron activation with Gr64f-GAL4, the peak of return probability is shifted toward a later time point because flies stop and extend their proboscis during the activation period.
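  The return criterion described above can be sketched in code. This is a minimal illustration built only from the thresholds stated in the response (15-s window, more than 10 mm away, back within 3 mm); the function names, the track representation as a list of (x, y) positions in mm, and the fixed frame rate are assumptions, not the authors' analysis code.

```python
import math


def returned(track: list[tuple[float, float]], start_idx: int, fps: float,
             window_s: float = 15.0, away_mm: float = 10.0,
             near_mm: float = 3.0) -> bool:
    """True if the fly leaves its start position by more than `away_mm`
    and then comes back within `near_mm` of it inside the time window."""
    x0, y0 = track[start_idx]
    end_idx = min(len(track) - 1, start_idx + int(round(window_s * fps)))
    has_left = False
    for x, y in track[start_idx + 1:end_idx + 1]:
        distance = math.hypot(x - x0, y - y0)
        if not has_left:
            # The fly must first move far enough away from the start position.
            has_left = distance > away_mm
        elif distance < near_mm:
            return True
    return False


def return_probability(tracks: list[list[tuple[float, float]]],
                       start_idx: int, fps: float) -> float:
    """Fraction of flies that satisfy the return criterion."""
    return sum(returned(t, start_idx, fps) for t in tracks) / len(tracks)


# Hypothetical toy tracks sampled at 1 frame per second (positions in mm).
leaves_and_returns = [(0, 0), (6, 0), (12, 0), (6, 0), (1, 0)]
never_leaves = [(0, 0), (8, 0), (0, 0)]
leaves_for_good = [(0, 0), (12, 0), (20, 0)]
tracks = [leaves_and_returns, never_leaves, leaves_for_good]
print(return_probability(tracks, start_idx=0, fps=1.0))
```

  Because the window extends 15 s past the reference time point, a value plotted at t = 10 s can reflect a return that happened as late as t = 25 s, which is the point made in the response.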
  
  Because revisiting a location can also be a consequence of repeated turns, it seems more accurate to describe UpWiNs as controlling the speed and likelihood of turns and promoting upwind movement by integrating with neurons that sense the direction of airflow.
  
  The return probability plotted in Figure 7E is the probability of return to the position at the end of the LED period within 15 s after the LED period, when the angular speeds of SS33917>CsChrimson and SS33918>CsChrimson flies are identical to that of empty-split-GAL4>CsChrimson control flies (Figure 7-figure supplement 1). Thus, revisiting behavior cannot be explained by a simple increase in turning probability.
  
  Although the functions of UpWiNs are not limited to promotion of wind-directed walking, we still think that “UpWind Neurons” is a practical name for broad readers and oral communications at the current stage of investigations, because the EM neuron IDs and names (SMP348, SMP353, SMP354, SLP399 and SLP400) are too lengthy and do not contain any functional information. We initially defined a set of 11 neurons labeled by SS33917 split-GAL4 as “UpWind Neurons (UpWiNs)” based on initial optogenetic screening (Figure 2A). We found other driver lines for mushroom body interneuron cell types that can promote release of dopamine and a more robust returning phenotype (e.g. SS49755), but SS33917 remained the champion driver line for the upwind locomotion phenotype.
  
  Reviewer #3 (Public Review):
  
  Aso et al. provide insight into how learned valences are transformed into concrete memory-driven actions, using a diverse set of proven techniques.
  
  Here the authors use a four-armed arena to evaluate flies' preference for a reward-predicting odor and measure upwind locomotion. This behavioral paradigm was combined with the photoactivation of different memory-eliciting neurons, revealing that appetitive memories stored in different compartments of the mushroom bodies (center of olfactory memory) induce different levels of upwind locomotion. The authors then proceed to a non-exhaustive optogenetic screen of the neurons located downstream of the output neurons of the mushroom bodies (MBONs) and identify a group of 8-11 Cholinergic neurons promoting significant changes in upwind locomotion, the UpWins. By combining confocal immunolabelling of these neurons with electron microscope images, they manage to establish the UpWins' connectome within themselves and with the MBONs. Then, using two in vivo cell recording techniques, electrophysiology, and calcium imaging, they define that UpWins integrate both inhibitory and excitatory synaptic inputs from the MBONs encoding appetitive and aversive memory, respectively. In addition, they show that the UpWins' response to a reward-predicting odor is increased after appetitive training. On a behavioral level, the authors establish that the UpWins respond to wind direction only and are not involved in lower-level motor parameters, such as turning direction and acceleration. Finally, they demonstrate that the UpWins' activity is necessary for long-term appetitive memory retrieval, and even suggest a broader role for the UpWins in olfactory navigation, as their photoactivation increases the probability of revisiting behavior. In the end, the authors state that they provide new insights into how memory is translated into concrete behavior, which is fully supported by their data. Altogether, the authors present a pretty complete study that provides very interesting and reliable data, and that opens a new field of investigation into memory-driven behaviors.
  
  Strengths of the study:
  
  To support their conclusions, the authors provide detailed data from different levels of analysis (behavioral, cellular, and molecular), using multiple sophisticated techniques.
  
  The measurement of multiple parameters in the behavioral analysis supports the strong changes in upwind locomotion. In addition, taken individually these parameters provide precise insights into how upwind locomotion changes, and allow the authors to more precisely define the role of the UpWins.
  
  The authors use split-Gal4 drivers instead of Gal4, allowing them to better refine neuron labelling.
  
  The authors discussed and investigated all possible biases, making their data very reliable. For example, they demonstrated that the phenotypes observed in the behavioral assay were wind-directed behaviors and could not be explained by bias avoidance of the arena's center area.
  
  Limitations of the study:
  
  In the absence of more precise drivers, the UpWins' labelling lacks precision. For example, there is no way to know exactly which UpWin is responding in the electrophysiological experiment presented in Figure 4.
  
  We have ongoing efforts to generate split-GAL4 and split-LexA driver lines for specific subsets of UpWiN neurons, but the data using those lines are not ready for this manuscript. However, we would like to point out that historically, identification of a group of neurons with striking phenotype has been foundational to promote follow-up studies. A good example is P1 neurons for courtship behavior.
  
  The screening of neurons located downstream of the MBONs is not exhaustive, meaning that other groups of neurons might be involved in memory-driven upwind locomotion. However, this does not diminish the authors' conclusions.
  
  UpWiNs are certainly not the only cell type mediating memory-driven upwind locomotion, since our and other groups’ studies (e.g. Matheson et al., 2022; PMCID: PMC9360402) identified a collection of cell types that can promote upwind locomotion upon optogenetic activation.
  
  In 2021, we released images and driver lines of a larger collection of split-GAL4 driver lines at https://splitgal4.janelia.org. We are preparing a manuscript to provide anatomical descriptions of these lines. This collection of new drivers will help elucidate more comprehensive views of circuits for memory-driven actions.
  
  All data were obtained with walking flies. So far, there have been no experiments on flying flies.
  
  This is an intriguing question, and we mention in the Discussion that “Our study was limited to walking behaviors, and the role of UpWiNs in flight behaviors remains to be investigated.”
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.21.521497v2
www.medrxiv.org

New submission 02/07/2023, 10:40:46

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This is an interesting study which attempts to assess the effect of the pandemic on diagnoses of pancreatic cancer. The authors have used a large national database to evaluate this, however, it should be noted that this database only captures 40% of the population in England. The authors have looked at specific parameters including Body Mass Index (BMI) as well as markers of diabetes and liver function. Only BMI had a difference in the frequency of measurements during the pandemic, presumably due to reduced face-to-face visits to allow weight and height to be captured.
  
  Interestingly the authors noticed a reduction in surgery for pancreatic cancer by 25%, yet reported that there were no differences in the frequency of death within 6 months following the diagnosis of pancreatic cancer. The reduction in surgery is likely related at least in part to the loss of operating lists due to pandemic restrictions, however, this paper is not equipped to address another important possibility behind this, which is that pancreatic cancers were presenting too late for surgical intervention. It is not sufficient to comment that pancreatic cancer treatment was not affected by the pandemic based on the data presented on deaths within 6 months of the diagnosis of pancreatic cancer alone, as the median survival of patients diagnosed with pancreatic cancer within the pandemic has not been captured and compared to that of patients diagnosed in the preceding 5 years.
  
  Therefore while the study can conclude no difference in pancreatic cancer diagnoses before and during the pandemic, more work needs to be done to truly assess if the pandemic had any effect on the outcomes from pancreatic cancer for patients diagnosed within this timeframe.
  
  Thank you for taking time to undertake the review and for all the constructive comments. This study was designed to assess the effect of the pandemic on pancreatic cancer services in England. We focused on the quantity of healthcare.
  
  We acknowledge and understand the comments by the reviewer with regards to the limitations of this study in relation to the effect of the COVID-19 pandemic on diagnosis and survival. We did not assess the effect of the pandemic on the staging information and survival length.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2022.12.02.22283026v1
www.biorxiv.org

New submission 01/07/2023, 16:52:53

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This research aimed to discern the pattern of methylation changes that occur during aging, distinguishing between a unified specific mechanism and stochastic changes. To date, no unified hypothesis exists to guide our understanding of the changes in chromatin geography observed during the aging of cells. This work analysed six different types of purified blood-borne white blood cells, allowing comparison across different immune cell subsets to determine if similar patterns occurred in all cell populations. Intriguingly, each subset exhibited its own distinct differential methylation rather than a single program. However, a core set of gene changes close to age-associated CpGs was identified, suggesting that a central program existed, but that individual cell type function and metabolism shaped the overall chromatin landscape for the population. These findings establish a new framework for considering the aging process and open new questions about how the individual clocks of different populations might be regulated. While circulating cells are readily accessible for evaluation in humans, the majority of immune cells that regulate immune homeostasis are found within the tissues of the body. Whether these cells exhibit a similar profile to circulating cells or are rather shaped by their tissue or organ-specific ecosystem remains to be determined. In this setting, these tissue-resident cells are exposed to very different oxygen tensions and metabolic substrates. Furthermore, while the genes identified have been associated with aging, they concurrently appear to be associated with inflammation; thus it is not clear whether aging and low-grade inflammation are inherently linked, or whether these two pathways can be segregated. Thus, a number of questions remain that warrant further investigation.
  
  The reviewer makes a very good point regarding different tissue-resident cells being exposed to different oxygen and metabolic stress. In the reviewed manuscript, Arid3a emerges as one of the transcription factors with motifs in and around probes hypermethylated with age in monocytes. Arid3a is known to target inflammatory genes, but future research is warranted to establish the link between aging and low-grade inflammation. To address the comment about the connection between aging and low-grade inflammation, in the revised manuscript we have incorporated a new analysis of SomaScan array-derived protein levels of seven cytokines from the same cohort of donors. We tested the hypothesis that part of the age-associated changes in DNA methylation are connected with the well-known age-related proinflammatory state. We have now added the details in the Results and Methods sections. Briefly, we ran two regression models (CpG_i ~ age + sex and CpG_i ~ age + sex + analyte_j, where i indexes each CpG probe from the EPIC array and j indexes each of the seven cytokines). We find that changes in DNA methylation at nearly 7000-9000 CpG sites in CD4 cells and 124 CpG sites in B cells that were originally age-associated are also associated with increasing levels of TNFRSF1A, TNFRSF1B and TNF-alpha, thereby indicating a link between DNA methylation change, aging, and inflammatory cytokine levels.
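  To make the two-model comparison concrete, the sketch below fits CpG ~ age + sex and CpG ~ age + sex + analyte for a single probe by ordinary least squares. It is an illustration under stated assumptions, not the authors' analysis: the data are synthetic and noise-free, the helper `ols` is hypothetical, and no significance testing or multiple-testing correction is shown, although a real analysis over EPIC array probes would need both.

```python
def ols(rows: list[list[float]], y: list[float]) -> list[float]:
    """Ordinary least squares via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * yi for r, yi in zip(rows, y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("design matrix is rank deficient")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [v - factor * p for v, p in zip(a[r], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]


# Synthetic donors (hypothetical values): age in years, sex coded 0/1,
# and one cytokine level that tends to rise with age.
ages = [25, 32, 41, 48, 55, 63, 70, 78]
sexes = [0, 1, 0, 1, 1, 0, 1, 0]
analyte = [1.0, 1.4, 1.3, 2.0, 2.1, 2.2, 3.0, 2.9]
# Methylation generated from a known linear rule so the fit can be checked.
meth = [0.5 + 0.002 * a + 0.01 * s + 0.05 * c
        for a, s, c in zip(ages, sexes, analyte)]

# Model 1: CpG ~ age + sex. Model 2: CpG ~ age + sex + analyte.
reduced = ols([[1.0, a, s] for a, s in zip(ages, sexes)], meth)
full = ols([[1.0, a, s, c] for a, s, c in zip(ages, sexes, analyte)], meth)
print("age coefficient without analyte:", reduced[1])
print("age coefficient with analyte:   ", full[1])
print("analyte coefficient:            ", full[3])
```

  In this toy example the age coefficient shrinks once the cytokine is added, because part of the apparent age effect is carried by the age-correlated analyte; that is the kind of overlap the second model is designed to expose.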
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.23.525162v1
www.biorxiv.org

Melanocortin 1 receptor regulates cholesterol and bile acid metabolism in the liver

1
1. Public_Reviews 07 Jul 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 The work described herein would have an impact on the field in multiple ways. Firstly, it demonstrates a novel metabolic role for MSH in the regulation of hepatic cholesterol metabolism. This may prove to be a viable therapeutic strategy for the treatment of dyslipidemia. Furthermore, the authors demonstrate an alternative signaling cascade elicited by MSH independent of cAMP, but rather relying on AMPK. This novel interaction between AMPK and MC1R could have more widespread implications beyond the control of hepatic cholesterol metabolism.
 
 For the most part, the conclusions offered by the authors are supported by the data that is presented. There are, however, a number of concerns in the current version of this manuscript detailed below.
 
 We thank the reviewer for the encouraging and insightful comments, and we are pleased to read that the manuscript has raised considerable interest.
 
 1) The authors demonstrate the expression of MC1R in hepatocytes through IHC staining and western blot analysis. Furthermore, the authors show an alteration in systemic bile acid homeostasis in MC1R KO mice. However, no mention of MC1R expression or function in cholangiocytes is discussed. This is important to assess both experimentally and within the discussion given the profound role of the biliary epithelium in modulating bile acid homeostasis. Furthermore, in figure 1 the authors validate the MC1R knockdown only through mRNA expression. Given that panels A and C of figure 1 show there is clearly a functional antibody for MC1R, validation of protein knockdown is needed.
 
 The reviewer raises an important point, which we addressed by performing immunofluorescence staining using an antibody against the cholangiocyte marker cytokeratin 19 (CK-19). These colocalization studies demonstrate the presence of MC1-R in CK19-positive cholangiocytes (Figure 1-figure supplement 1). Furthermore, we have now added a discussion on the possible role of MC1-R in modulating bile acid homeostasis in cholangiocytes (page 12, lines 456-462). We also quantified MC1-R protein expression by Western blotting in the liver of L-Mc1r-/- mice. MC1-R protein level was significantly reduced in L-Mc1r-/- mice compared to L-Mc1+/- mice (Figure 2-figure supplement 2).
 
 2) Figure 2 demonstrates a steatotic effect of MC1R knockdown in hepatocytes. The authors attempt to provide mechanistic insight into this phenomenon through assessing the mRNA expression of genes involved in cholesterol and fatty acid synthesis. The data provided is modest at the gene level and no protein validation was provided to demonstrate functional alterations of these proteins in MC1R KO mice. Key proteins proposed such as SREBP2 and HMGCR need to be validated via a western blot or IHC analysis.
 
 As requested by the reviewer, we quantified the expression of key proteins in the liver of L-Mc1r-/- mice by Western blotting. We observed that the protein levels of HMGCR and DHCR7 as well as the ratio between the mature and precursor forms of SREBP2 were reduced in L-Mc1r-/- mice (Figure 2F-H, page 6/lines 182-191 & page 10-11/lines 390-401). This is likely a result of the feedback regulation, whereby cholesterol accumulation suppresses the cleavage of SREBP2 and leads to a consequent downregulation of the key cholesterol synthesis enzymes such as HMGCR and DHCR7 (Brown S & Goldstein JL, Cell. 1997 May 2;89(3):331-40).
 
 We discussed in the original submission (page 11) as follows: ‘In the presence of excess cellular cholesterol, transcriptional induction and posttranslational activation of SREBP-2 should be attenuated, which in turn downregulates Hmgcr and Dhcr7 and reduces cholesterol synthesis as a counterregulatory mechanism. Therefore, given the increase in hepatic cholesterol content, it was unexpected that Srebp2 expression was upregulated in the liver of L-Mc1r-/- mice’. The finding of reduced SREBP2/HMGCR protein expression is thus more logical, but admittedly, it is discordant with increased Srebp2/Hmgcr mRNA expression (as reported in the original submission), which might be a compensatory response to suppressed SREBP2 cleavage. Taking into account that activation of MC1-R did not affect the protein expression of HMGCR or DHCR7 in HepG2 cells, it is plausible that hepatic cholesterol accumulation in L-Mc1r-/- mice is driven by a defect in bile acid metabolism, rather than by a direct effect of MC1-R signaling on cholesterol synthesis. To avoid unnecessary confusion, we decided to omit the qPCR data and related text parts from the manuscript and report the protein expression data instead.
 
 4) The authors suggest the involvement of AMPK in mediating the cholesterol-lowering effects of MSH. However, MSH is still able to lower free cholesterol levels even in the presence of an AMPK inhibitor. This suggests that MSH does not in fact rely on the activation of AMPK to elicit these cholesterol-lowering effects. The authors' conclusions are stronger than the actual data support. Furthermore, the authors claim LD211 phenocopies the effects of MSH in the presence of an AMPK inhibitor. However, the authors only measured the phosphorylation of Akt as their outcome. This begs the question, does LD211 still lower total cholesterol in the presence of AMPK inhibitors? This experiment is essential to conclude whether or not LD211 phenocopies the effects of MSH.
 
 The reviewer may have missed that we postulate in the manuscript that ‘MC1-R activation engages multiple signaling mechanisms to regulate cholesterol metabolism in HepG2 cells’ (manuscript page 8, lines 310-311 & page 13, lines 498-508), since a low concentration of α-MSH was still able to lower the free cholesterol level in the presence of the AMPK inhibitor dorsomorphin. We have been careful not to claim that the effects of α-MSH are solely dependent on AMPK phosphorylation. Likewise, we have not claimed in the original submission that LD211 phenocopies the effects of MSH in the presence of an AMPK inhibitor. However, as suggested by the reviewer, we performed new experiments to investigate the effects of LD211 on cellular cholesterol levels in the absence and presence of dorsomorphin. We found that AMPK inhibition with dorsomorphin completely abolished the cholesterol-lowering effect of LD211 (Figure 7-figure supplement 2), which might indicate that this synthetic agonist has a stronger signaling bias toward the AMPK pathway compared to α-MSH.
 
 5) The authors initiate the project by showing high-fat diet disrupts the expression of MC1R. However, all of the subsequent experiments in hepatic MC1R KO mice are performed under normal chow. This begs the question of what is the phenotype of the hepatic MC1R KO mice fed a high-fat diet. Does KO of MC1R in the liver exacerbate HFD-induced obesity, glucose intolerance, and dyslipidemia? Inversely, can WT mice challenged with an HFD be rescued metabolically by treatment with either MSH or LD211? Providing data along these lines of investigation will provide physiological/clinical relevance to their findings.
 
 As suggested by the reviewer, we phenotyped the hepatic MC1R KO (L-Mc1r-/-) mice after feeding them a cholesterol- and fat-rich Western diet for 12 weeks (RD Western Diet, D12079B, Research Diets Inc, NJ, USA). This was exactly the same dietary regimen (product and duration) that was used to study the changes in hepatic MC1-R expression in wild-type C57Bl mice (Figure 1B&C). We observed that 12-week Western diet feeding induced a significant gain in body weight and total fat mass as well as an increase in plasma and hepatic cholesterol and TG levels (Figure 2-figure supplement 2). L-Mc1r-/- mice did not show a difference in body weight gain, but the weight gain was attributable to enhanced gain in fat mass and a blunted increase in lean mass compared to control Mc1rfl/fl mice (Figure 2-figure supplement 2A, D & E). Furthermore, liver weight and plasma cholesterol and TG concentrations were unchanged in HFD-fed L-Mc1r-/- mice (Figure 2-figure supplement 2B, C, F & G). Importantly, recapitulating the phenotype observed in chow-fed mice, hepatic cholesterol and TG content was significantly increased in L-Mc1r-/- mice after an HFD challenge (Figure 2-figure supplement 2H & I). Taken together, it appears that the phenotype of HFD-fed L-Mc1r-/- mice was slightly diluted compared to the phenotype observed in chow-fed L-Mc1r-/- mice. This phenotypic difference might relate to the finding that Western diet feeding reduced the hepatic expression of MC1-R, thus limiting the incremental effect of genetically induced MC1-R deficiency on hypercholesterolemia and hepatic lipid accumulation.
 
 We have previously studied the effects of pharmacological MC1-R activation in Western diet-fed mice and observed that chronic treatment with a selective MC1-R agonist reduced plasma cholesterol level and upregulated hepatic Ldlr expression without affecting body weight gain (Rinne P et al, Circulation. 2017 Jul 4;136(1):83-97). These findings are also discussed on manuscript page 12, lines 475-478. Although the selective MC1-R agonist was different in that particular study, it is expected that LD211 would also elicit a similar cholesterol-lowering effect in Western diet-fed mice. Chronic treatment with α-MSH, on the other hand, would likely produce wide-ranging metabolic effects. In addition to MC1-R activation in hepatocytes and its consequent effect on liver cholesterol metabolism, α-MSH would affect feeding, energy expenditure and cholesterol metabolism via MC4-R activation in the central nervous system as well as fatty acid and glucose metabolism via MC5-R activation in the skeletal muscle. Therefore, the phenotype associated with α-MSH treatment would be complex and mediated by multiple mechanisms and MC-R subtypes, thus making it difficult to interpret the exact contribution of hepatic MC1-R signaling to the observed phenotype.
 
 Reviewer #2 (Public Review):
 
 Keshav Thapa et al. investigated the role of melanocortin 1 receptor (MC1-R) in cholesterol and bile acid metabolism in the liver. First, they observed that MC1-R is present in the mouse liver and that its expression is reduced in response to a cholesterol-rich diet. To determine the role of MC1-R in the liver, they generated hepatocyte-specific MC1-R KO mice (L-Mc1r-/-). These animals exhibited a significant increase in liver weight, lipid accumulation, triglycerides and cholesterol levels, and fibrosis in comparison with control mice. By performing liquid chromatography-mass spectrometry, the authors also found that L-Mc1r-/- mice have fewer bile acids in the plasma and faeces, but not in the liver. In accordance with these findings, mRNA/protein expression of different genes involved in these processes was altered in L-Mc1r-/- animals.
 
 Secondly, in an attempt to evaluate the underlying mechanisms, they measured the expression of MC1-R in HepG2 cells under different treatments (i.e., palmitic acid, LDL, and atorvastatin). Moreover, they stimulated these cells with the endogenous MC1-R agonist α-MSH and showed that this molecule decreases the free cholesterol content while increasing LDL and HDL uptake, and recapitulates some previously observed phenotypes in the proportions of bile acids. These effects were also encountered when using a selective agonist for MC1-R (i.e., LD211), further supporting the specific role of MC1-R. Finally, some experiments indicated that α-MSH evokes not one single, but multiple intracellular signalling cascades through which MC1-R activation effects might take place.
 
 Overall, this work provides novel and interesting findings on the role of MC1-R in cholesterol and bile acid metabolism in the liver, which undoubtedly will have some crucial implications for future research. Nevertheless, some experimental details should be better explained for the correct interpretation of the data. Besides, discrepant results exist regarding the molecular mechanisms behind MC1-R action that require additional experimentation to support the conclusions drawn.
 
 We thank the reviewer for the encouraging and insightful comments, and we are pleased to read that the manuscript has raised considerable interest.
 

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.11.08.515543v1
www.biorxiv.org

New submission 01/07/2023, 16:38:58

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The authors aim to understand the role of clonal heterogeneity of tumors in immunogenicity of clonally expressed antigens. This is a significant problem with many basic as well as translational implications.
  
  The strength of the manuscript lies in the novel demonstration that a poorly immunogenic tumor antigen, when paired with a stronger tumor antigen, begins to elicit significant immune response. The weakness lies in the fact that the actual mechanism of the key demonstration is never shown. There is a lot of speculation and tangential experimentation, but little actual evidence of a mechanism.
  
  By making the key observation (mentioned in the strength section in the previous paragraph), the authors did achieve their objective albeit very partially. Their observation is based on excellent experimental tools and design. This study will stimulate further experiments in this important field.
  
  Their key observation is somewhat reminiscent of the practice of conjugating small "non-immunogenic" antigens (such as some carbohydrates) to large protein carriers (such as serum albumin) in order to elicit strong antibody response to the weaker antigen. It is interesting to contemplate if the underlying mechanisms have any commonality.
  
  We thank the reviewer for their consideration of our work and their constructive feedback. We concur that our study has limitations and further work will be necessary to fully deconstruct the mechanism leading to the observed phenotype. We have revised the text to better reflect the aim and scope of our study. However, the goal of our work was to establish a trackable model that would allow us to model different, albeit limited, degrees of antigen expression patterns reflecting what is observed in patients with different levels of ITH. Our key observation reproduces what is observed clinically, adding strength to the model. Next, we wanted to study what was different about the induced immune responses to develop strategies to better treat tumors with heterogeneous NeoAg expression patterns that currently do not respond to checkpoint blockade therapy. Studying KP-HetHigh and KP-HetLow tumors revealed that tumor debris-carrying cDC1 draining from KP-HetLow tumors phagocytosed both NeoAgs. This population of cDC1, carrying both NeoAgs, had a more stimulatory phenotype compared to cDC1 without tumor debris or cDC1 that had engulfed only one NeoAg. We were able to develop a targeted therapy including CD40 agonism based on our key observations: KP-HetLow had a more robust response towards the weaker NeoAg which was associated with more stimulatory cDC1 presenting both NeoAgs compared to KP-HetHigh tumors. The stronger immune response increased responsiveness to CBT.
  
  The reviewer makes an interesting point about conjugate vaccines, which canonically elicit greater responses because they engage multiple immune cells, namely T cells with B cells, resulting in stronger antibody responses. The prevalence of tumor debris-carrying cDC1 with both neoantigens in KP-HetLow does make us consider that this population of cDC1 may be engaging multiple immune populations, i.e., different neoantigen-specific T cells. We suggest this as a possible mechanism for greater Aatf responses, but further work is necessary to determine if the same cDC1 can directly interact with both neoantigen-specific T cells.
  
  Reviewer #2 (Public Review):
  
  There are data to suggest that intratumour mutational heterogeneity (ITH; the proportion of all mutations that are found only within cancer subclones) is associated with worse therapeutic outcomes. Specifically, patients with more mutations (and thus neoantigens) mostly expressed by subclones (high ITH) have poorer responses to checkpoint immunotherapy. The authors set out to explore the mechanisms underlying this by studying 2 dimensions of neoantigen biology: firstly, distribution (clonal vs subclonal) and secondly, immunogenicity (weak vs strong binding to MHC class I). Using a panel of lung cancer cell lines modified to express individual or dual neoantigens in order to model clonal and subclonal expression, elegant studies show that clonal co-expression with a "strong" neoantigen can boost the immunogenicity of a "weak" neoantigen and result in tumour control. Mechanistically, this is related to engulfment of both neoantigens by cross presenting type 1 conventional dendritic cells and the associated enhanced activation state of this cell type. This is an interesting and potentially important finding that may be related to mechanisms of epitope spreading as immune responses diverge from targeting more to less immunogenic epitopes. Overall, the study is thought-provoking, informative in relation to how neoantigen immunogenicity is shaped and may have practical relevance.
  
  We greatly appreciate the reviewer's constructive comments and insightful questions on our work. We have edited the text in response to their feedback. We believe these changes have made the writing clearer and more effectively communicate the scope of our study and our results to the reader.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.16.520773v1
www.biorxiv.org

New submission 07/07/2023, 08:22:50

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response:
  
  We would like to thank the Editors and Reviewers for their positive evaluations, constructive comments, and for the opportunity to revise our manuscript. We feel that the comments and suggestions will further improve our manuscript.
  
  In the updated manuscript we aim to incorporate all suggested changes and considerations provided by the Reviewers. In particular, we will provide further information on the quality-control ratings per subfield, as suggested by Reviewer 1. Moreover, we will evaluate whether the training-related changes were specific to CA1-3, rather than just showing significant alterations in CA1-3 and not in the other subfields. Last, as suggested by Reviewer 2, we will additionally test for multivariate associations between hippocampal subfield structure and function, to further evaluate the specificity of hippocampal subfield change as a function of training and cortisol.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.03.531039v2
www.biorxiv.org

New submission 07/07/2023, 09:24:03

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Recommendations For The Authors):
  
  This study is well presented and contains all the necessary experiments to support their claims. They made the interesting finding of an additional factor Dyn2. However, it is unclear whether it is present in the human complex. Hence, it would be interesting to see whether Dyn2 co-purifies when expressed with the other complex components in insect cells. Also, purification of a tagged complex from yeast would have indicated whether Dyn2 is part of the complex and whether other factors, like RBM15 or Hakai, present in humans are also present in yeast.
  
  We agree that the Dyn2 subunit is an exciting new finding that is worth further investigation. The IP-MS experiments suggest that Dyn2 is a subunit of the complex and that the Dyn2 interaction is mediated via Slz1. We also noticed a reduction in m6A levels (50%) in the dyn2 deletion mutant. What the function of Dyn2 is and whether it is conserved remains to be determined.
  
  Our IP-MS experiments with Mum2 identified the complex as described in the manuscript; however, we did not find evidence of orthologs of RBM15 and Hakai. More follow-up work using in vivo and in vitro assays is needed to determine how m6A deposition by the yeast MTC is regulated.
  
  P3 top: Although m6A is the most abundant internal methylation variant, it is far below the methylation levels of cap-adjacent nucleotides in mammalian mRNAs (PMID: 35970556).
  
  We have added the word “internal” to the first sentence of the introduction.
  
  A list of author contributions is missing.
  
  We have added this in the revised version.
  
  Reviewer #2 (Recommendations For The Authors):
  
  Most of the conclusions of this paper are well supported by data, and the text is clearly written and easy to read. Here are my suggestions and comments:
  
  1) In Fig.2, why not use LC-MS to measure m6A levels in Ygl036w, Dyn2, Pab1, Npl3 mutants, as in Fig.1?
  
  For measuring m6A levels, we use a combination of LC-MS, m6A-ELISA and m6A-seq2 throughout the manuscript. We used ELISA in Fig. 2 because we had established this assay in the lab (Ensinck et al, RNA Journal, 2023). The m6A-ELISA technique was more accessible and easier to execute compared to LC-MS. Additionally, our collaborator for the LC-MS moved his lab to another country, which made it impractical to continue the use of LC-MS.
  
  2) The protein purification experiment described in Fig. 4D is informative. Can they include Dyn2 in the expression system as well?
  
  Thank you for the suggestion. Dyn2 was not the focus of the manuscript as Dyn2 has, at best, only a minor role in m6A deposition in vivo. We are also currently aiming to dissect how Dyn2 regulates m6A and the yeast MTC in follow-up work. Hence, we decided not to add more experiments on Dyn2 to the current manuscript.
  
  3) Among the MTC components identified in this study, Dyn2 is a new and interesting subunit. It was shown that in C. elegans Dlc1 is involved in stabilizing the m6A writer Mett10. I wonder if yeast has a homolog of C. elegans Mett10?
  
  As far as we know, no ortholog of Mett10 (METTL16 in mammals) has been identified in budding yeast.
  
  4) The authors have emphasized "the m6A dependent and independent functions"; however, this is only based on previous observations. Is it possible that the less severe phenotype associated with ime4 catalytic mutant is due to residual catalytic activity? I think the data presented in Fig. 5 tell us that Ime4 and other MTC subunits have no additional moonlighting function. It is not entirely clear to me what "the m6A-independent function" is.
  
  The observation that the yeast MTC has m6A-dependent and m6A-independent functions is based on previous observations and the current work. In Agarwala et al. 2012 (PLOS Genetics), it was shown that mum2 and ime4 deletion mutants have a more severe phenotype than the slz1 deletion mutant or the catalytically inactive mutant of Ime4. We confirmed these observations in the revised manuscript (see Figure S5A and S5B). In this work, we showed that kar4 and vir1 deletion mutants have a delay in the onset of meiosis comparable to that of mum2 and ime4 deletion mutants. Also, the MTC remains intact in the absence of Slz1, but falls apart in ime4Δ, mum2Δ and vir1Δ mutants or shows strongly reduced RNA binding (kar4 deletion mutant). Based on this, we conclude that an m6A-independent function of the MTC exists.
  
  We have included data demonstrating that the catalytically inactive mutant has no residual m6A and a milder meiotic phenotype compared to the ime4 deletion mutant (see Figure S5A and S5B).
  
  5) In Mum2-TEV-ProA IP (1B) and Kar4-TEV-ProA IP (S1A), Slz1 was not significantly enriched; however, in the repeated Mum2-TEV-ProA IP with/without RNAse (S1B, 4C), Slz1 was strongly enriched. Why are the Slz1 results so variable?
  
  This is an astute observation, for which we do not have a definitive answer. One possibility is that Slz1 is the only subunit that is induced during meiosis. It is possible that induction of Slz1 varied between the different IP-MS experiments, hence leading to variability in its association with the MTC complex.
  
  6) The last paragraph on page 11, "Collectively...", and the first paragraph on page 12, "Collectively...", seem redundant.
  
  We have removed the duplicated paragraph in the revised manuscript.
  

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.10.528004v2
www.biorxiv.org

New submission 07/07/2023, 08:47:49

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  Membrane receptor guanylyl cyclases are important for many physiological processes but their structures in full-length and their mechanism are poorly understood. Caveney et al. determined the cryo-EM structure of a highly engineered GC-C in a complex with endogenous HSP90 and CDC37. The structural work is solid and the structural information will be useful for the membrane receptor guanylyl cyclases field and the HSP90 field. However, a detailed characterization of the protein sample is lacking. Moreover, the physiological significance of this structure is not fully exploited by supporting experiments and the mechanistic insight is currently limited.
  
  We thank Reviewer #1 for constructive reviews and agree that this work forms the basis for future exploration by the guanylyl cyclase and HSP90 fields.
  
  1) The characterization of the protein sample is lacking. SDS-PAGE would be useful to identify potential proteolysis, leading to the dissociation of GC dimer. Further size-exclusion chromatography would be helpful to estimate the molecular weight of the complex and to determine if only GC-C monomer is purified.
  
  We have included a representative SDS-PAGE gel in our revised version of the manuscript (Figure 1—figure supplement 1). While we agree that SEC could be beneficial to further explore the stoichiometry of the imaged sample, we see no significant degradation of the guanylyl cyclase via SDS-PAGE, and therefore believe that the zippered construct would remain dimeric. Relatively poor yields of these samples precluded further exploration in this regard.
  
  2) The orientation distribution of the particles is not homogenous in Fig. S1D. It would be helpful to present the 3DFSC curve to evaluate the effect of preferred orientation on the reconstruction.
  
  While the orientational distribution is not perfectly uniform, the provided angles allowed for sufficient reconstruction of maps with no notable anisotropy. We have included 3DFSC curves in our revised version of Figure 1—figure supplement 1.
  
  3) Description of protein expression details is lacking. Did the author use transient transfection, stable cell line or virus-mediated transduction?
  
  We have clarified that the protein was expressed in transiently transfected ExpiCHO cells.
  
  4) HSP90 binds ATP and is often co-purified with endogenous ATP/ADP. Is there ATP or ADP present in the sample/cryo-EM maps? Is the conformation of NBD similar to ATP-bound HSP90? The author needs to include the description/figures about the nucleotide state of HSP90.
  
  There is clear density for bound nucleotide in our reconstruction. Given the mechanistic role of ATP turnover in the release of HSP90 clients (Young, Hartl, 2000 – PMID 11060043) and the resolved density, we believe this nucleotide is ATP. We have added a comment to this effect in the revised manuscript: “…the C2 pseudosymmetric, ATP bound, closed state Hsp90 dimer.”
  
  5) The catalytic domains of GC have to be dimerized to perform cyclase function. The presence of only one GC-PK monomer in the cryo-EM structure indicates the structure does not represent an active state of GC. These results suggest the GC expressed in this way is not functional. The authors need to explain why most of the GC protein is trapped in this inactive form.
  
  Indeed, we do believe that this regulatory state is non-functional, as observed for active kinases. We have clarified this in the revised manuscript: “In addition, this disruption of the native state of GC-C, as observed in our structure, would likely leave GC domains out of each other’s proximity, precluding their catalytic activity while Hsp90 is bound.”
  
  6) The GC-C construct used here is a highly engineered "artificial" construct, which has not been fully characterized in this work. Does this construct have similar activity as the activated wt GC-C? Does the protein (this engineered construct) expressed in CHO cells show activity?
  
  While our original goal in developing this construct was to create an imageable construct that was locked in the active state, our current interpretation of the data is that the leucine-zipper induced, putative active geometry leads to the majority of this construct falling into the regulatory state with HSP90 binding. We make no claim to have resolved an active conformation in this work, yet believe that this state is of note due to the previously unresolved nature of these regulatory complexes for guanylyl cyclase receptors.
  
  7) Are the residues on the interface between GC and HSP conserved in other members of membrane receptor guanylyl cyclases? Would mutations on this interface affect the activity of GC?
  
  Given the role this structure plays in our understanding that HSP90 client recruitment is largely not driven by specific residue interactions and the ~30% identity of GC-C to NPR-A and NPR-B, we do not believe that mutations that do not significantly change the stability or fold of the PK domain would significantly modify recruitment to HSP.
  
  8) The authors propose that targeting HSP90 would tune the activity of GC. Is there any experimental data supporting this idea?
  
  Based on the work of Kumar et al., 2001 (PMID 11152473), we do believe that there is a functional link between HSP90 recruitment and GC activity. We hope that this work will spark further exploration of these concepts.
  
  9) The model in Fig. S3 is largely speculative due to the lack of supporting functional data. In addition, it would be better to change the title to "structure of the protein kinase domain of guanylyl cyclase receptor in complex with HSP90 and cdc37" because the mechanistic insight is limited.
  
  We agree that our supplemental figure is more speculative. We have referenced this in the discussion section of the manuscript and put this figure in the supplement to ensure that this is understood to be more speculative in nature.
  
  Reviewer #2 (Public Review):
  
  Caveney et al have overexpressed an engineered construct of the human membrane receptor guanyl cyclase GC-C in hamster cells and co-purified it with the endogenous HSP90 and CDC37. They have then determined the structure of the resultant complex by single particle cryoEM reconstruction at sufficient resolution to dock existing structures of HSP90 and CDC37, plus an AlphaFold model of the pseudo-kinase domain of the guanylyl cyclase. The novelty of the work stems from the observation that the pseudo-kinase domain of GC-C associates with CDC37 and HSP90 similarly to how the bona fide protein kinases CDK4, CRAF and BRAF have been previously shown to interact.
  
  The experimentation is limited to the cryoEM analysis, and is lacking additional studies that would give deeper insight into the oligomeric nature - if any - of the GC-C when bound to HSP90-CDC37 as compared to the free protein. This is relevant, as the dimerization domain downstream of the pseudokinase is evident in the maps - albeit not well resolved - and it is not clear whether it is still able to mediate dimerization with a second free or HSP90-CDC37-bound GC-C. It would also be good to see some experimentation that asks whether association with HSP90-CDC37 inhibits the guanylyl cyclase activity. It is clear from previous work that HSP90-CDC37 silence the kinase activity of their bound client kinases, but in this case the catalytic guanylyl cyclase is not directly associated with the chaperone complex and may still be able to function.
  
  Given the geometry of the interaction, the dimerization domain of the GC would likely be monomerized, albeit with global dimerization remaining, contributed by the ECD or, in our case, by the leucine zipper that mimics the liganded ECD. Experimentally, it has been shown in live cells (Kumar et al., 2001, PMID 11152473) that the HSP90 association is required for maximal GC-A function. This is likely because the association resets the receptor to allow further activity, rather than because the receptor is active during the association; the latter is unlikely based on our structure, in which the two GC domains would not be able to form the active dimerized state. Further dissection of this, while outside the scope of the current work, is of great interest.
  
  Although the sequence alignment presented in SuppFig 2 shows that GC-C conserves the classic DFG motif that plays a critical role in the regulation of most kinases, the numbering of the sequence is absent, making it very difficult to relate this to the structural detail shown in Fig 2B. This needs to be clarified, as the interaction of CDC37-Trp31 with the DFG motifs and downstream activation loops in CRAF and BRAF have been proposed as important features of the selectivity of these kinases for the HSP90-CDC37 system, and it would be good to be able to see clearly how much of this is also conserved in the GC-C pseudokinase domain interaction. For example, is the much shorter activation segment (DFG -> APE) ordered in the complex or disordered?
  
  We have clarified Figure 2—figure supplement 1 with additional numbering. While we agree that the DFG motif may play a role in recognition, only the first residue of this motif is interacting with CDC37 in our structure, so it may be likely that the role of this motif is more structural in maintaining a CDC37 complementary fold, as opposed to direct residue interactions. Additionally, many kinases which are not regulated by CDC37/HSP90 contain this motif. The shorter DFGAPE of GC-C is traceable with the exception of N613, S614, I615, though the density in this region reflects this loop not being well stabilized.
  
  It was not easy to follow what was in the sample used for cryoEM. The cloning of the guanylyl cyclase (GC) component is described in the methods and they have shown some illustrations in Fig 1, but a proper numbered figure of the domain organisation clearly showing domain boundaries and linker segments is really needed for a reader not familiar with the structure of GCs, especially since they have replaced the ECD with a leucine zipper in their construct. It is important to show a domain figure of what this construct looks like as well, as from the illustrations in Fig 1, for example, it is hard to see which are the PK, DD and GC domains. It would also be helpful to see in the supplementary a gel of the complex they put on the grids, to make it clearer what exactly the sample is and to reassure that the GC-C domains that are not resolved in the cryoEM are nonetheless present in the sample.
  
  We have added in a gel figure to the supplement and clarified the content of the imaged construct in the methods section: “This construct contains all domains of the native GC-C, with the exception of the ECD.”
  
  Overall there is only minimal proposal of mechanism or biological function based on the structure. The speculation in the Discussion of two fates - PP5 dephosphorylation or E3 ligase recruitment, is not supported by any experimentation, which is reasonable for speculation, but is also not underpinned by reference to any previously published work suggesting that these additional processes may be important. In the absence of any work by the authors can they put these speculations more in context with previously published work that supports the importance of these processes specifically for GC regulation?
  
  We have ensured that these potential pathways only appear in the discussion section. It has been observed, for instance by Oberoi et al., 2022, that phosphatases can act on all components of a HSP90–CDC37–client system. Given that there are well-characterized phosphorylation sites for membrane GC receptors, we believe this is worth discussing in this manuscript, to stimulate further exploration of these mechanisms in the field. In addition, it has been reported that many E3 ligases are recruited to HSP90 complexes and can degrade clients rather non-specifically. It has been shown that one can generate PROTAC-like molecules to target non-specific clients to HSP90–E3 ligase machinery for degradation (Li et al., 2023). Given the proximity-induced nature of E3-mediated degradation of HSP90 clients, it is highly likely that, at least in some cases, mGCs are degraded by this mechanism as well.
  
  Reviewer #3 (Public Review):
  
  A detailed understanding of how membrane receptor guanylyl cyclases (mGC) are regulated has been hampered by the absence of structural information on the cytoplasmic regions of these signaling proteins. The study by Caveney et al. reports the 3.9 Å cryo-EM structure of the human mGC cyclase, GC-C, bound to the Hsp90-Cdc37 chaperone complex. This structure represents a first view of the intracellular functional domains of any mGC and answers without doubt that Hsp90-Cdc37 recognizes mGCs via their pseudokinase (PK) domain. This is the primary breakthrough of this study. Additionally, the new structural data reveal that the manner in which Hsp90-Cdc37 recognizes the GC-C PK domain C-lobe is akin to how kinase domains of soluble kinases dock to the chaperone complex. This is the second major finding of this study, which provides a concrete framework to understand, more broadly, how Hsp90-Cdc37 recruits a large number of other diverse client proteins containing kinase or pseudokinase domains. Finally, the Hsp90-Cdc37-GC-C structure offers clues as to how GC-C may be regulated by phosphorylation and/or ubiquitinylation by serving as a platform for recruitment of PP5 and/or E3 ligases.
  
  Comments:
  
  1) The authors used an interesting approach to obtain the GC-C-Hsp90-Cdc37 complex. Flag-tagged human GC-C was overexpressed in CHO cells with the expectation of co-purifying endogenous hamster homologs of Hsp90 and Cdc37. There are several points worth noting:
  
  a) It is not clear from the data presented (Figure 1C, Suppl Fig 1A) or the Methods the percentage of particles in the cryo-EM specimen that represent the GC-C-Hsp90-Cdc37 complex. Presumably, some fraction of GC-C isolated will not be associated with Hsp90-Cdc37. If a very large portion of GC-C is associated with Hsp90-Cdc37, it would be good to explain why this is to be expected. Are 2D/3D classes corresponding to the activated GC-C dimer found? If not, why?
  
  While we see some traces of GC-C not bound by Hsp90, there is, at the least, a significant alignment bias for the Hsp90-bound complex. We believe that the engineered construct, which we designed to be locked in a putative active conformation, goes through catalytic cycles up to the point where the regulatory mechanism kicks in. It may be that, for proper resetting, the receptor needs to cycle back through an unliganded, inactive conformation, which our leucine zipper construct cannot adopt, thus locking our GC in the regulatory complex, though this is speculation.
  
  b) Figure 1A suggests that GC-C is phosphorylated before recruitment of Hsp90-Cdc37. What is the phosphorylation status of the GC-C specimen that was imaged by cryo-EM?
  
  We had placed the P in grey in this figure to represent the potential for the active state to be phosphorylated. For GC-C in particular, the phosphorylation state does not affect activity as much as GC-A and GC-B for example. We have removed this P from the figure for clarity.
  
  c) The resolution of the cryo-EM map (3.9 Å) is too low for unambiguous identification of proteins. Please provide more precise justification for the claim that the densities observed do in fact correspond to hamster Hsp90 and Cdc37.
  
  While we agree that the resolution is limiting for protein identification, the fact that we are using a very stringent FLAG purification allows confidence in the ID for our target, GC-C. For Hsp90 and Cdc37, we are confident that they are endogenous hamster Hsp90 and Cdc37, given the large structural similarity observed in comparison to prior Hsp90/Cdc37/client complex structures, and the ID/register well confirmed by the placement of bulky residues.
  
  d) The authors state that human GC-C pulls down hamster Hsp90-cdc37 but soluble kinases cannot, despite the high sequence identity between human and hamster Hsp90-cdc37. Is this because GC-C recognition is more promiscuous? Can this difference be understood in light of the new structural information presented?
  
  “This native pulldown strategy contrasts with the structures of Hsp90–Cdc37 in complex with soluble kinases (García-Alonso et al., 2022; Oberoi et al., 2022; Verba et al., 2016), for which Hsp90 and Cdc37 had to be overexpressed to obtain complex suitable for imaging.”
  
  It is our understanding, from reading the papers cited above, that Hsp90/Cdc37 needed to be overexpressed to obtain these samples for imaging. We use a different strategy because our sample does not require overexpression of Hsp90 and Cdc37. This may be because of something specific to hamster cells, which were (presumably) not tested in the above studies, or it could be something specific to do with GC-C.
  
  2) A large portion of the enforced GC-C dimer was not visible in the cryo-EM maps. It is not easy to learn from Figure 1 exactly which parts of the GC-C construct were sufficiently ordered and observed structurally. Please improve Figure 1.
  
  We have adjusted Figure 1 to better depict what is observed in the cryoEM density.
  
  3) On page 4, the authors claim that they are able to orient the GC-C-Hsp90-Cdc37 complex "as it would sit on a membrane" and referred to Figure 1B. It is not clear what is implied here. Does Hsp90-Cdc37 binding constrain the complex to face the inner leaflet of the membrane in a specific orientation as shown in Figure 1B? If true, this could potentially have important functional implications. Please illustrate how this was deduced based on the information available.
  
  Given the observed density for the PK domain, which is membrane proximal, we can safely assume that the TM would be located immediately above this region. Given the size of Hsp90 and assuming the soluble Hsp90 must sit below the membrane, we can determine, with some accuracy, the relative orientation of this complex next to the membrane. This orientation is depicted in Figure 1B.
  
  4) Also on page 4, it is stated that it is sterically unlikely an additional Hsp90-Cdc37 complex is associated with the other copy of GC-C in the leucine zippered dimer. It is not obvious to the reader how this may be the case. An additional figure could help make this more clear. Additional biochemical evidence will also help. The absence of GC-C-Hsp90-Cdc37 dimers in cryo-EM micrographs can also support the argument.
  
  We have clarified this: “is sterically unlikely that an additional regulatory complex is forming on the second GC-C in a concurrent fashion, given the large size of the first Hsp90–Cdc37 and the requisite proximity of the second GC-C.”
  
  5) Some comments on Figure 2:
  
  a) NTD and CTD are mislabeled in Figure 2A.
  
  Thank you for catching this; we have fixed it.
  
  b) The authors should show cryo-EM density to support their modeling of GC-C in Figures 2B and C.
  
  We have provided maps and models to the reviewer and will release these maps and models upon publication so that all relevant densities can be interpreted to their fullest extent by readers. In addition, we have added representative density panels to Figure 1-figure supplement 2.
  
  6) The authors claim that Hsp90-Cdc37 clients are more similar structurally near the cdc37 interface. Please illustrate this with additional figures. Suppl. Figure 2 is inadequate for this purpose.
  
  We have added a structural overlay to Figure 2—figure supplement 1A to illustrate this.
  
  The authors can also consider adding a more detailed discussion comparing the interactions between the pseudokinase/kinase C-lobe and Cdc37 in known structures. Is shape/charge complementarity a universal feature of cdc37-dependent kinase/pseudokinase recruitment? It would be interesting to also consider if it would be possible to predict which of the ~60 human pseudokinases are possible Hsp90-Cdc37 clients. New structural findings from this study and publicly available AI-predicted protein structures could help.
  
  While the use of AI to predict pseudokinase interactions would indeed be interesting, we believe this is outside the scope of this work. Given methodology is in place for determination of kinase clients for Hsp90 (Taipale et al., 2012), this could be an additional route to obtain this information in future work.
  
  Reviewer #2 (Recommendations For The Authors):
  
  In Figure 1B the authors show a large unaccounted-for region of density which they speculate may be due to the dimerization domain. That this is lost in the sharpened maps suggests that it is more mobile than the core which probably dominates the automatic mask generation used by cryoSPARC. It would be very interesting to try and resolve this region further by using focussed classification and refinement - probably in RELION. This would add further novelty, as so far in the three HSP90-CDC37 kinase complexes previously described, little is seen outside the C-terminal lobe of the kinase (or in this case pseudokinase) lobe.
  
  Given the structurally uncharacterized nature of the DD and GC domains for mGCs, using computational means to further our understanding of these regions was attempted. Across several software packages, these attempts were unsuccessful. We will be uploading these micrographs to EMPIAR shortly after publication, which will allow for other groups to re-process this data as they see fit and as new software techniques emerge in this rapidly developing field. We believe that the partially unfolded nature of the PK domain is providing too much of a hinge point prior to the DD for the software to be able to resolve this currently.
  
URL

biorxiv.org/content/10.1101/2023.02.14.528495v3
www.biorxiv.org www.biorxiv.org

New submission 07/07/2023, 08:43:27

1
1. Public_Reviews 07 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer # 1
  
  Specific comments
  
  1) Figure 1: it is unclear how many mice were used for the described phenotypic analyses (panels D and E). Please clarify.
  
  We acknowledge that we failed to describe the phenotypic analyses clearly. In Figure 1D and E, we performed statistical analysis on the number of TEBs in mammary whole mounts. One mammary whole mount per mouse was stained with Carmine-alum; thus, "n" represents the number of mice analyzed (10 per genotype). We have modified the legend of Figure 1 to "D, E. Quantification of the average number of TEBs and bifurcated TEBs in littermate Crb3fl/fl (n=10) and Crb3fl/fl;MMTV-Cre (n=10) mice at 8 weeks old" in lines 909-911.
  
  2) Figure 2: in panels B and C it is unclear how the data was quantified; the legend states "n=10", does this mean the experiment in B was done 10 times? And that 10 acini per condition were measured in panel C? In panel D a difference of 0.3% between NC and shCRB3 seems minuscule; do the authors mean 30% instead? And how many acini were counted per condition per (how many) experiments? Same applies to panels G and H, it is unclear how many cells were analyzed per (how many) experiments.
  
  Thank you for your suggestions. We did not describe the details of the statistical analysis adequately in the experimental methods. Briefly, we took 3-4 random bright-field micrographs of each well in the chamber slide system and repeated the experiment three times. We then counted the number of acini in all micrographs (Figure 2B) and measured the diameter of all acini in each photograph, using the average as the data value (Figure 2C). We also determined the percentage of aberrant acini in each photograph, which was used as the analysis value (Figure 2D). We confirmed that the vertical axis of Figure 3D was indeed mislabeled and should read 30%, and we have revised the original figure. For IF analysis of mitotic spindle orientation during lumen formation, we measured the division angle of one mitotically dividing cell per acinus; 3-4 acini were randomly examined in each well of the chamber slide system, and the experiment was repeated three times (Figure 2G and H). We have provided a detailed description of these points in the Figure 2 legend. The revised parts are found in lines 922-924, lines 926-927, lines 929-930, and line 932.
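  
  The per-micrograph quantification described above can be sketched as follows. This is a minimal illustration and not the authors' analysis code: the function names and the example counts are hypothetical. It also makes explicit that a fraction (0.3) and a percentage (30%) differ only by a factor of 100, which is the kind of axis mislabeling raised in this comment.

```python
from statistics import mean


def percent_aberrant(aberrant: int, total: int) -> float:
    """Percentage (0-100) of aberrant acini counted in one micrograph."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * aberrant / total


def summarize(experiments):
    """Average per-micrograph percentages within each experiment,
    then average those experiment means.

    `experiments` is a list of experiments; each experiment is a list of
    (aberrant, total) count pairs, one pair per micrograph.
    """
    per_experiment = [
        mean(percent_aberrant(a, t) for a, t in micrographs)
        for micrographs in experiments
    ]
    return per_experiment, mean(per_experiment)


# Hypothetical counts: three independent experiments, 3-4 micrographs each.
example = [
    [(3, 10), (6, 20), (9, 30)],
    [(2, 10), (4, 20), (8, 40), (6, 30)],
    [(4, 10), (8, 20), (12, 30)],
]
per_exp, overall = summarize(example)
print(per_exp, overall)
```

  Averaging within each experiment first treats the experiment, rather than the micrograph, as the unit of replication; whether the authors pooled micrographs this way is not stated in the response.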
  
  3) Figure 2: it would be desirable if authors were able to quantify the data in panels E and I.
  
  Thank you for your comments. According to your suggestions, we performed the quantitative analysis of Figure 2E and I, which is now presented in the new Figure 2D and H.
  
  4) For all cell-based assays using shRNA to knock down CRB3 (Fig. 2A-H; Fig. 3A-F; Fig. 4C-E; Fig. 5G-J; Fig. 6C; Fig. 7C, D; Fig. 8E-G), it would be desirable to perform rescue experiments to ensure that the observed phenotype of CRB3 depleted cells is specific and not due to off-target effects of the shRNA.
  
  Yes, rescue experiments involving overexpression of CRB3 in CRB3 depleted cells can accurately account for the specific phenotype as well as eliminate the off-target effects of shRNA. However, our group has long focused on the role of the cell polarity protein CRB3 in contact inhibition and tumorigenesis. Our previous studies have ruled out the off-target effects of shRNA and reported that CRB3 regulates contact inhibition and tumorigenesis through Hippo or Wnt signaling pathways (Cell Death Dis 2017;8(1):e2546, Oncogenesis 2017;6(4):e322, J Cell Mol Med 2018;22(7):3423-33). Therefore, we will pay close attention to rescue experiments to ensure experimental integrity and phenotypic specificity in our subsequent studies.
  
  5) Figure 3: how many cells were counted/measured per condition (in how many experiments) in panels B, D, H, F, G and H? In panels C and D, what is the CRB3 protein level in these cells? This is of relevance as protein overexpression per se could impinge on ciliation frequency. This question could be addressed by performing a western blot analysis with CRB3 antibody.
  
  We did not clearly describe the measurement and statistical analysis methods in the previous manuscript. As above, we took 3-4 random IF and SEM micrographs of each sample in one experiment, and the experiment was repeated three times. The numbers of ciliated cells and total cells were then counted, and the proportion of ciliated cells was calculated (Figure 3B, D and F). In these figures, the cilium length of representative ciliated cells was measured in each photograph. In the knockout mouse model, we identified intact mammary ductal lumens and renal tubules in IF-stained mouse mammary and renal tissue sections, took 5-6 random-field micrographs per section, and measured the proportion of ciliated cells by counting and averaging. A total of ten mice were analyzed in these experiments (Figure 3G and H). Accordingly, the legend of Figure 3G and H has been partially modified and a detailed description has been added to the Figure 3 legend. The revised parts are in lines 945-946, lines 950-951, and line 953.
  
  Thank you for suggesting a western blot analysis with CRB3 antibody for Figure 3C and D. We have added the CRB3 western blot analysis in the new Supplementary Figure 3A.
  
  6) Figure 3G: it is very difficult to see that the red stained structures are primary cilia.
  
  Yes, the stained primary cilia in the mammary ductal lumen are less clear than those in individual cells and in the renal tubule in Figure 3G. We used the established markers acetylated tubulin and γ-tubulin to stain the primary cilia, which were clearly labeled in individual cells. However, the labeled primary cilia in the renal tubule were longer and showed a more pronounced structure than those in the mammary ductal lumen. In the mammary ductal lumen of the 10 mice we analyzed, the primary cilia were shorter and less distinctly stained than the others shown in Figure 3G. This difference may be due to the distinct characteristics of primary cilia in different tissues.
  
  7) Figure 4B: how many cells were analyzed in how many experiments?
  
  Our statistical methods for analyzing cellular experiments using IF were essentially the same. We randomly selected 3-4 IF micrographs of each sample in one experiment, and the experiment was repeated three times. The number of cells showing colocalization and the total number of cells were then counted, and the proportion of cells with pericentrin and CRB3 colocalization was calculated (Figure 4B). The detailed description has been added to the Figure 4 legend. The revised part is in lines 962-963.
  
  8) Lines 217-219: since the cells were not stained with a cilia marker, only a centrosome marker, the claim that CRB3 localizes to the base of cilia is unsubstantiated.
  
  Thank you for your comments. The base of cilia is the basal body, which develops from the mother centriole of the centrosome (Cancer Res. 2006;66(13): 6463-7). First, we found colocalization of CRB3 and pericentrin, a centrosome marker, in MCF10A cells (Figure 4A and B). Second, we verified the colocalization of CRB3 with γ-tubulin, a marker of the basal body in primary cilia, in confluent quiescent cells (Figure 4C and D). In addition, we found that CRB3 was localized at the base of primary cilia labeled with acetylated tubulin (Figure 4E and F). Because of the host species of the commercially available CRB3 antibodies, we could only show indirectly, through these experiments, that CRB3 localizes to the base of cilia.
  
  9) Figure 3 and Figure 4: is it problematic to use gamma tubulin as centrosome marker if CRB3 depletion causes reduced centrosomal recruitment of gamma tubulin ring complex components? Also, in Figure S3A no gamma tubulin staining can be seen in the lower panel, why?
  
  Thank you for your positive comments. γ-Tubulin is a well-known marker of the centrosome, and we found that CRB3 depletion reduces centrosomal recruitment of gamma tubulin ring complex components. However, Figure 3 illustrates the effect of CRB3 on ciliary assembly, and Figure 4 analyzes the localization of CRB3 in primary cilia. In some reports on ciliary assembly, fluorescent double staining of acetylated tubulin and γ-tubulin has been used to label primary cilia, and the effects of target genes on ciliary number and assembly were analyzed with these markers (Nature. 2013;502(7470): 254-7, Cell. 2007;130(4): 678-90 and so on). Although CRB3 affects the recruitment of gamma tubulin ring complex components, this does not affect the analysis of ciliary number and localization in Figures 3 and 4.
  
  In Figure S3A, green staining labeled with γ-tubulin could be clearly found in the lower left panel. The representative area from the left amplification may have been poorly selected, resulting in no γ-tubulin staining on the right side. We have updated the lower right panel in the new Supplementary Figure 3B.
  
  10) Figure S4A: the grouping of indicated proteins is factually wrong. For example, FBF1, SCLT1 and ODF2 are not IFT-B components, and several of the proteins indicated as localizing to the basal body also localize to (unciliated) centrioles. In contrast, CP110 is usually only found on unciliated centrioles and not mature basal bodies. Authors should consult the relevant literature and correct the figure accordingly. Alternatively, this misleading text/grouping could be removed from the figure. Furthermore, in the legend to Figure S4 there is no information provided about this quantitative analysis (how many independent experiments, which cells were analyzed etc.).
  
  Thank you for your helpful suggestions. We have taken your advice and removed this misleading grouping from the manuscript, from Supplementary Figure 4A, and from its corresponding legend. We have also added detailed information about this quantitative analysis to the legend of Supplementary Figure 4A. The revised legend is shown in lines 1098-1100.
  
  11) Figure S4B: how do authors know which of the bands correspond to CRB3 fusion protein?
  
  Based on the construction strategy of the CRB3-GFP fusion protein (Figure 6D) and its base sequence, we were able to calculate its molecular weight. The molecular weight of the CRB3-GFP fusion protein was then verified by western blotting (Figure 6F and 7A). In addition, exogenous overexpression produces the CRB3-GFP fusion protein in large quantities. From these features, we infer that the band indicated by the black arrow is most likely the CRB3-GFP fusion protein. To allow the molecular weight to be checked, we have labeled the key molecular weight markers in the new Supplementary Figure 4B.
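  
  As a rough illustration of this kind of calculation, the mass of a fusion protein can be approximated from the length of its coding sequence using the common rule of thumb of about 110 Da per amino acid residue. This is a sketch only, not the authors' calculation: the function names and the coding-sequence length below are hypothetical placeholders, and the true mass depends on the actual sequence, any linker, and post-translational modifications.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average mass per residue


def residues_from_cds(n_bases: int, has_stop: bool = True) -> int:
    """Number of amino acid residues encoded by a coding sequence."""
    if n_bases <= 0 or n_bases % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = n_bases // 3
    # The stop codon does not encode a residue.
    return codons - 1 if has_stop else codons


def estimate_mw_kda(n_residues: int) -> float:
    """Approximate molecular weight in kDa from residue count."""
    if n_residues < 0:
        raise ValueError("n_residues must be non-negative")
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


# Hypothetical fusion construct: 1083 bases including the stop codon.
fusion_residues = residues_from_cds(1083)
fusion_kda = estimate_mw_kda(fusion_residues)
print(fusion_residues, fusion_kda)
```

  An estimate of this kind only indicates where a band is expected relative to the molecular weight markers; it does not replace verification by western blotting.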
  
  12) Lines 251-253: this seems like data overinterpretation.
  
  Thank you for your comments. We have revised this sentence in lines 252-254.
  
  13) Lines 260-261: the data showing perturbed gamma tubulin localization is not convincing as data was not quantified.
  
  According to your suggestions, we performed the quantitative analysis of Figure 4C, which is now presented in the new Figure 4E.
  
  14) Figure 5H and Figure 6C: to show that the GCP6 IP actually worked, these blots should be probed also for GCP6.
  
  Thank you for your good suggestions. We have added these blots probed for GCP6 in new Figure 5H and 6C.
  
  15) Figure 5I: how many cells were analyzed in how many experiments?
  
  Our statistical methods for analyzing cellular experiments using IF were essentially the same. We took 3-4 random IF micrographs of each sample in one experiment, and this experiment was repeated three times. The detailed description has been added to the Figure 5 legend. The revised part is in lines 992-994.
  
  16) Figure S5: it looks like GPC6 and Rab11 are localizing all over the cell, are the antibodies used for the IFMs specific for these proteins?
  
  After checking the specificity of these antibodies used for the IFMs, we have decided to delete the corresponding results in the Supplementary Figure 5 and their description in the original manuscript.
  
  17) Lines 43, 89, and 314-315: the claim that CRB3 directly binds Rab11 is not supported by the data. The data provided only shows that these proteins interact indirectly. To show direct interaction, yeast-2-hybrid analysis or pull-down assays with purified proteins would be required.
  
  Thank you for your positive comments. Since we were unable to complete the relevant experiments to demonstrate direct interaction of the two proteins, we have revised our conclusions. We have replaced "CRB3 directly binds Rab11" with "CRB3 binds Rab11" in the manuscript.
  
  18) Figure 6G and lines 314-315: this result is surprising as it indicates GTP- and GDP-locked versions of Rab11 have the same inhibitory effect on CRB3 binding? Please comment, and also indicate how data in Figure 6G was quantified (and how many independent experiments were used for the quantification).
  
  We were also puzzled by the results shown in Figure 6G. Based on the western blotting bands, we suspected that there may have been some issues with the experiment. Specifically, we believed that the inefficient transfection of Flag-Rab11aWT, Flag-Rab11a[Q70L], Flag-Rab11a[S20V], and Flag-Rab11a[S25N] plasmids, as well as the insufficient amount of GFP antibody used in the co-IP experiment, led to the corresponding bands being too weak and masking the true differences.
  
  To address this, we optimized the experimental conditions, tightened the experimental controls, and repeated the experiment in triplicate. The new results are shown in the revised Figure 6G. The statistics from the three independent experiments revealed that CRB3b had a stronger interaction with Rab11a[Q70L] and Rab11a[S20V], and a weaker interaction with Rab11a[S25N], compared to Rab11aWT. Accordingly, we revised the original manuscript in lines 308-310 and added a detailed description to the Figure 6 legend in lines 1012-1013.
  
  19) Figure 8G: data needs to be quantified.
  
  Thank you for your comments. We replaced the low-quality western blot bands in Figure 8G with better-quality ones. The statistical analysis of the Figure 8G data is shown in Supplementary Figure 6.
  
  Further minor comments
  
  1) Abstract should indicate that this study describes conditional knockout of Crb3 in mouse mammary gland epithelial cells.
  
  This is good writing advice. We have added the relevant description in lines 40-42.
  
  2) Line 87: specify which gland (mammary?).
  
  We have changed this to "mammary gland" in line 87.
  
  3) Line 140: sentence states that knockout of Crb3 is essential for branching morphogenesis in mammary gland development, I do not think this is correct.
  
  We have removed the inappropriate finding.
  
  4) Line 152: "formed more number" should be "formed more" or "formed higher number of".
  
  We modified "formed more number" to "formed more" in line 154.
  
  5) Lines 157-163: text and logic are difficult to follow for a non-expert.
  
  We have modified the logic of this paragraph, as detailed in lines 158-165.
  
  6) Figure 4A, C: figure resolution could be improved. It is difficult to see what the authors claim these figures are showing.
  
  The clarity of the original images in Figure 4A and C is acceptable, while the images on the right are electronically enlarged. Although the enlargement reduces the resolution, the images still show our findings.
  
  7) Figure 7D, E: images look pixelated.
  
  The clarity of the original images in Figure 7D and E is acceptable using a laser confocal microscope, while the images on the right are electronically enlarged.
  
  8) Line 222: unclear what authors mean by "detected a series".
  
  We modified "detected a series" to "some important" in line 226.
  
  9) Lines 221-225: which cells were used for the analysis in Fig. S4?
  
  We used MCF10A cells for the analysis in Supplementary Figure 4, and modified its legend in line 1098.
  
  10) Line 245: what is "cytomembrane"?
  
  We modified "cytomembrane" to "cell membrane" in lines 246-247.
  
  11) Lines 246-250: wording is unclear/difficult to understand.
  
  We have modified this paragraph, as detailed in lines 248-251.
  
  12) Line 273: should "regimented" be "sedimented"?
  
  We modified "regimented" to "sedimented" in line 274.
  
  13) Line 287-288: sentence does not make sense.
  
  We have removed this sentence.
  
  14) Figure 5A: it would be desirable to show the original dataset (Excel file) used for generating this figure.
  
  To maintain data integrity, we should provide the original dataset (Excel file). However, there are some unpublished data in this file that we must withhold for the time being. If needed, the corresponding author can be requested to provide the file.
  
  15) Lines 298-299: wording is unclear.
  
  We have modified this sentence, as detailed in lines 296-298.
  
  16) Lines 285-287: replace "instead of" with "but not".
  
  We modified "instead of" to "but not" in line 286.
  
  17) For all IFMs showing merged images of the green and red channel, please also show the red and green channel separately.
  
  Most of our fluorescence images are presented separately for each channel in this manuscript, with only a few merged images due to space limitations. This type of presentation is commonly used in published papers.
  
  18) Lines 326 and 327: replace "bonded" with "bound".
  
  We have made this change in lines 322-323.
  
  19) Lines 327-328 and 361-364: wording is unclear/grammatically incorrect.
  
  We have modified these paragraphs, as detailed in line 323 and lines 357-360.
  
  20) Line 342: what is meant by "the combination of"?
  
  We modified "the combination of" to "the binding of" in line 338.
  
  21) Line 365: localization of what?
  
  This means "subcellular localization" in lines 360-361.
  
  Reviewer #2
  
  Major points
  
  1) CRB3 is present in mammals as 2 isoforms, A and B, originating from alternative splicing. In this study, the authors never mention this fact and when using approaches to KO or KD CRB3A/B they are likely to deplete both isoforms which have been shown to have different C-terminal domains and functions (Fan et al., 2007). This is also important for the CRB3 antibodies used in the study since according to the material and methods section they are either against the extracellular domain common to both isoforms or the intracellular domain which is only similar in the domain close to transmembrane between the 2 isoforms. Since the antibodies used in each figure are not detailed it is impossible to know if the authors are detecting CRB3A or B or both. Please provide the information and correct for the actual isoform detected in the data and conclusions.
  
  Thanks for your positive comments. In mammals, CRB3 has two isoforms, CRB3a and CRB3b, distinguished by alternative splicing within the fourth exon of the CRB3 gene, which in turn produces a protein with 23 amino acid differences at the C terminus. CRB3a and CRB3b have mostly identical amino acid sequences and indistinguishable molecular weights. As a result, the knockout mouse construction strategy and the design of the RNAi sequences target both CRB3a and CRB3b. This is described in lines 100-104 and lines 149-150. Additionally, commercially available antibodies detect both CRB3a and CRB3b, as mentioned in line 123 and lines 636-637 of the revised manuscript.
  
  However, it should be noted that our CRB3 overexpression, as shown in the CRB3 structural domain in Figure 6D, refers specifically to the sequence of CRB3b. As a result, we have updated the original manuscript as well as the legends of Figures 3C, 3E, 4A, 5A, 5B, 6D-G, 7A, 7B and Supplementary Figure 2F-H, 3A, 4B, 6B to reflect this change. All instances of overexpressed CRB3 have been changed to CRB3b.
  
  2) CRB3A and B have been localized in the cilium itself (Fan et al., 2004; 2007) but in the study CRB3A/B does not enter the cilium but is localized in the basal body (figure 4). How the authors reconcile these different localizations?
  
  Indeed, we found that CRB3 is mainly localized at the basal body of the primary cilium, which differs from previous reports in the literature (Curr Biol. 2004;14(16):1451-61 and J Cell Biol. 2007;178(3):387-98). However, upon closer examination of one of these reports (Curr Biol. 2004;14(16):1451-61), it appears that CRB3 was actually scattered on the primary cilia, with a strong focus at the basal body. Additionally, in rat kidney collecting ducts, the localization of CRB3 on primary cilia was significantly reduced, with obvious localization at the basal body. Another study (J Cell Biol. 2007;178(3):387-98) also reported the co-localization of CRB3b and γ-tubulin in MDCK cells, which is consistent with our conclusion. We further verified the co-localization of CRB3 with the centrosome by overexpressing CRB3b in mammary epithelial cells, indicating that CRB3 mainly localizes to the basal body of the primary cilium. This information is discussed in the Discussion section of the manuscript (lines 400-410).
  
  3) The authors use GFP-CRB3A/B, it is not stated which isoform, over-expression to localize CRB3A/B in MCF10A cells (figure 4A). The levels of expression appear to be very high in the GFP panel and it is likely that the secretory pathway of the cells is clogged with GFP-CRB3A/B in transit from the ER to the plasma membrane. Thus, the colocalization with pericentrin might be due to the accumulation of ER and Golgi around the centrosome. This colocalization should be done with the endogenous CRB3A/B and with a better resolution.
  
  Thank you for your comments. We were also interested in the co-localization of endogenous CRB3 and centrosome proteins. However, the only commercially available CRB3 antibody is raised in rabbit, and the pericentrin antibody that works well (Abcam, ab4448) is also raised in rabbit. We had difficulty finding commercial antibodies against centrosome-associated proteins raised in other species. Therefore, we examined the co-localization of endogenous CRB3 with γ-tubulin in Figure 4C and combined the results with those of exogenous CRB3 to illustrate the co-localization of CRB3 with centrosomes.
  
  4) The staining for CRB3A/B in figure 4C (red) is striking with a very strong accumulation in an undefined intracellular structure and the authors do not provide any explanation for such a difference with the GFP-CRB3A/B just above.
  
  Thank you for your good suggestions. The immunofluorescence images of GFP-CRB3 in Figure 4a were obtained using a fluorescence microscope, while the images of endogenous CRB3 were obtained using a laser confocal microscope. The fluorescence microscope excites a fluorescent dye to emit a signal, which is amplified into a visible light signal and presents a full fluorescent signal. In Figure 4a, we can clearly see the full distribution of exogenous CRB3 in MCF10A cells, including its tight junctional localization consistent with previous reports in the literature and its co-localization with centrosomal proteins. On the other hand, laser confocal microscopy uses a laser as the light source to excite the fluorescence within the sample point by point. It employs a precision pinhole filtering technique with strong laminar imaging capabilities. In the specific analysis of endogenous CRB3 co-localization studies with centrosomes and primary cilium, signals at tight junctions must be excluded. Therefore, Figure 4c represents the fluorescence signal at the level of intracellular CRB3 co-localization with γ-tubulin. The two methods use different detection means and techniques, and are not directly comparable.
  
  5) The staining in figure 4E is also different from those shown in figure 4F in which the CRB3A/B staining is right at the base of the axoneme while it is not the case in figure 4E where we can see a red dot close to but not right at the base of the axoneme.
  
  Thank you for your comments. The new Figure 4F displays the localization relationship between CRB3 and primary cilium, analyzed using laser confocal microscopy. With the unique single-level detection function of this microscope, the problem of level selection may cause the red dots to appear close to, rather than right at the basal body of the primary cilium. However, the new Figure 4G, based on the use of 3D reconstruction scanning technique, clearly demonstrates the localization of CRB3 at the basal body of the primary cilium under the same cells and conditions.
  
  6) The authors claim that CRB3A/B interacts directly with Rab11 but they only show co-immunoprecipitation experiments from cell lysates which do not support direct interactions. The only way to show a direct interaction is to produce both proteins in vitro. Thus, the term direct interaction should be removed.
  
  Thank you for your positive comments. Since we were unable to complete the experiments needed to demonstrate a direct interaction between the two proteins, we have revised our conclusions. We replaced "CRB3 directly binds Rab11" with "CRB3 binds Rab11" in the manuscript.
  
  7) In addition, the authors claim (Line 251/252) that Rab11 is necessary for the transport of CRB3A/B but they should KD Rab11 to show this.
  
  Thank you for your good suggestions. It is essential to observe CRB3 trafficking after Rab11 knockdown. However, in Figure 5C, we used the endocytosis inhibitor dynasore, which also inhibits Rab11-positive endosomes. This result shows that dynasore can significantly inhibit CRB3 trafficking in MCF10A cells. We believe that this experiment partially demonstrates that inhibiting Rab11 function can affect CRB3 trafficking.
  
  8) The domain of CRB3A/B that is necessary for the interaction with Rab11 is the N-terminal part of the extracellular domain. This domain is thus inside the transport vesicles and not accessible from the cytoplasm. Given that Rab11 is a cytoplasmic protein, how the 2 proteins could interact across the membrane? The authors do not even discuss this essential point for their hypothesis.
  
  Thank you for your positive comments. As shown in the schematic model in Figure 9, we believe that when cells form tight junctions, CRB3 is primarily located on the cell membrane. Subsequently, endosomes are involved in the intracellular degradation process of CRB3 on the cell membrane. Intracellular CRB3 can bind to Rab11 through the extracellular domain, which in turn participates in primary cilia assembly. We have made detailed modifications to lines 418-421.
  
  9) Figures are not numbered.
  
  Thank you for your comments. We have updated the numbers in the original manuscript as well as the legends of Figures 1D, 1E, 2B, 2D, 2F, 2G, 3B, 3D, 3F-H, 4B, 4E, 5I, 6, 8G and Supplementary Figure 1E, 2, 3C, 4A, 5B, 6.
  
  Minor points
  
  1) The authors cite several studies showing that a down regulation of CRB3A/B in human cells promotes cancer but other studies show the contrary: Lin et al., 2015 for example. Please discuss these discrepancies.
  
  Thanks for your good suggestion. We have included additional studies with contrasting results in the discussion section, specifically in lines 378-380.
  
  2) Line 98: "exhibit smaller" smaller than what?
  
  We modified "exhibit smaller" to "exhibit smaller size" in line 97.
  
  3) Line 152: "form more number, ..." ???
  
  We modified "formed more number" to "formed more" in line 154.
  
  4) Line 180: "Compared with the control, the number of cells with primary cilium was significantly increased". To me it is the contrary! This part is not clear at all. Please rewrite.
  
  We have revised the sentence in lines 183-185.
  
  5) Authors should check and review extensively for improvements to the use of English.
  
  Thanks for your good writing advice. We have carefully reviewed and revised the entire manuscript to improve its readability.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.15.528649v4
www.biorxiv.org www.biorxiv.org

New submission 06/07/2023, 08:31:20

1
1. Public_Reviews 06 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The objective of this investigation was to determine whether experimental pain could induce alterations in cortical inhibitory/facilitatory activity observed in TMS-evoked potentials (TEPs). Previous TMS investigations of pain perception had focused on motor evoked potentials (MEPs), which reflect a combination of cortical, spinal, and peripheral activity, as well as restricting the focus to M1. The main strength of this investigation is the combined use of TMS and EEG in the context of experimental pain. More specifically, Experiment 1 investigated whether acute pain altered cortical excitability, reflected in the modulation of TEPs. The main outcome of this study is that relative to non-painful warm stimuli, painful thermal stimuli led to an increase on the amplitude of the TEP N45, with a larger increase associated with higher pain ratings. Because it has been argued that a significant portion of TEPs could reflect auditory potentials elicited by the sound (click) of the TMS, Experiment 2 constituted a control study that aimed to disentangle the cortical response related to TMS and auditory activity. Finally, Experiment 3 aimed to disentangle the cortical response to TMS and reafferent feedback from muscular activity elicited by suprathreshold TMS applied over M1. The fact that the authors accompanied their main experiment with two control experiments strengthens the conclusion that the N45 TEP peak could be implicated in the perception of painful stimuli.
  
  Perhaps, the addition of a highly salient but non-painful stimulus (i.e. from another modality) would have further ruled out that the effects on the N45 are not predominantly related to intensity/saliency of the stimulus rather than to pain per se.
  
  We thank the reviewer for their comment on the possibility of whether stimulus salience influences the N45 as opposed to pain per se. However, we note that in Experiment 1, despite the same level of stimulus salience/intensity for all participants (46 degrees), individual differences in pain ratings were associated with the change in the N45 amplitude, suggesting that the results cannot be explained by stimulus intensity/salience.
  
  Reviewer #2 (Public Review):
  
  The authors have used transcranial magnetic stimulation (TMS) and motor evoked potentials (MEPs) and TMS-electroencephalography (EEG) evoked potentials (TEPs) to determine how experimental heat pain could induce alterations in these metrics. In Experiment 1 (n = 29), multiple sustained thermal stimuli were administered over the forearm, with the first, second, and third block of stimuli consisting of warm but non-painful (pre-pain block), painful heat (pain block) and warm but non-painful (post-pain block) temperatures respectively. Painful stimuli led to an increase in the amplitude of the fronto-central N45, with a larger increase associated with higher pain ratings. Experiments 2 and 3 studied the correlation between the increase in the N45 in pain and the effects of a sham stimulation protocol/higher stimulation intensity. They found that the centro-frontal N45 TEP was decreased in acute pain.
  
  The study comes from a very strong group in the pain fields with long experience in psychophysics, experimental pain, neuromodulation, and EEG in pain. They are among the first to report on changes in cortical excitability as measured by TMS-EEG over M1.
  
  While their results are in line with reductions seen in motor-evoked responses during pain and effort was made to address possible confounding factors (study 2 and 3), there are some points that need attention. In my view the most important are:
  
  1) The method used to calculate the rest motor threshold, which is likely to have overestimated its true value: calculating highly abnormal RMT may lead to suprathreshold stimulations in all instances (Experiment 3) and may lead to somatosensory "contamination" due to re-afferent loops in both "supra" and "infra" (aka. less supra) conditions.
  
  The method used to assess motor threshold was the TMS motor threshold Assessment Tool (Awiszus et al., 2003). This was developed as a quicker alternative for calculating motor threshold compared to the traditional Rossini-Rothwell method which involves determining the lowest intensity that evokes 5/10 MEPs of at least 50 microvolts. The method has been shown to achieve the same accuracy of determining motor threshold as the traditional Rossini-Rothwell method, but with fewer pulses (Qi et al., 2011; Silbert et al., 2013). Therefore, the high RMTs in our study cannot be explained by the threshold assessment method. Instead, they are likely explained by aspects of the experimental setup that increased the distance between the TMS coil and the scalp, including the layer of foam placed over the coil, the EEG cap and the fact that the electrodes we used had a relatively thick profile.
  
  Awiszus, F. (2003). TMS and threshold hunting. In Supplements to Clinical neurophysiology (Vol. 56, pp. 13-23). Elsevier.
  
  Qi, F., Wu, A. D., & Schweighofer, N. (2011). Fast estimation of transcranial magnetic stimulation motor threshold. Brain stimulation, 4(1), 50-57.
  
  Silbert, B. I., Patterson, H. I., Pevcic, D. D., Windnagel, K. A., & Thickbroom, G. W. (2013). A comparison of relative-frequency and threshold-hunting methods to determine stimulus intensity in transcranial magnetic stimulation. Clinical Neurophysiology, 124(4), 708-712.
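
  For readers unfamiliar with the relative-frequency criterion mentioned in the response above (the lowest intensity that evokes 5/10 MEPs of at least 50 microvolts), the following is a minimal sketch of that decision rule. It is not the authors' procedure, which used the threshold-hunting tool instead; the function names, the data layout (a mapping from stimulator intensity to ten recorded MEP amplitudes) and the example values are all hypothetical.

```python
from typing import Dict, Optional, Sequence

MEP_CRITERION_UV = 50.0    # amplitude criterion quoted in the text (microvolts)
REQUIRED_HITS = 5          # at least 5 MEPs must reach the criterion ...
TRIALS_PER_INTENSITY = 10  # ... out of 10 pulses at a given intensity


def meets_criterion(mep_amplitudes_uv: Sequence[float]) -> bool:
    """Return True if at least 5 of the 10 MEPs are >= 50 microvolts."""
    if len(mep_amplitudes_uv) != TRIALS_PER_INTENSITY:
        raise ValueError("expected exactly 10 trials per intensity")
    hits = sum(1 for amp in mep_amplitudes_uv if amp >= MEP_CRITERION_UV)
    return hits >= REQUIRED_HITS


def resting_motor_threshold(
    trials_by_intensity: Dict[int, Sequence[float]]
) -> Optional[int]:
    """Lowest tested intensity (hypothetical units: % of maximum stimulator
    output) that satisfies the 5/10 criterion, or None if none does."""
    passing = [i for i, amps in trials_by_intensity.items() if meets_criterion(amps)]
    return min(passing) if passing else None


# Hypothetical recordings, for illustration only.
example_trials = {
    60: [10.0] * 10,              # no MEP reaches 50 microvolts
    62: [60.0] * 5 + [10.0] * 5,  # exactly 5 of 10 reach the criterion
    64: [60.0] * 10,              # all 10 reach the criterion
}
example_rmt = resting_motor_threshold(example_trials)
```

  Threshold-hunting methods aim to estimate the same quantity with fewer pulses by choosing each next intensity adaptively; that adaptive step is deliberately not modelled in this sketch.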
  
  2) The low number of pulses used for TEPs (close to ⅓ of the usual and recommended)
  
  We agree that increasing the number of pulses can increase the signal to noise ratio. During piloting, participants were unable to tolerate the painful stimulus for long periods of time and we were required to minimize the number of pulses per condition.
  
  We note that there is no set advised number of trials in TMS-EEG research. According to the recommendations paper, the number of trials should be based on the outcome measure e.g., TEP peaks vs. frequency domain measures vs. other measures and based on previous studies investigating test-retest reliability (Hernandez-Pavon et al., 2023). The choice of 66 pulses per condition was based on the study by Kerwin et al., (2018) showing that optimal concordance between TEP peaks can be found with 60-100 TMS pulses delivered in the same run (as in the present study). The concordance was particularly higher for the N40 peak at prefrontal electrodes, which was the key peak and electrode cluster in our study.
  
  Further supporting the reliability of the TEP data in our experiment, we note that the scalp topographies of the TEPs for active TMS at various timepoints (Figures 5, 7 and 9) were similar across all three experiments, especially at 45 ms post-TMS (frontal negative activity, parietal-occipital positive activity).
  
  In addition to this, the intraclass correlation coefficient (two-way fixed, single measure) for the N45 to active suprathreshold TMS across timepoints for each experiment was 0.90 for Experiment 1 (across pre-pain, pain, post-pain time points), 0.74 for Experiment 2 (across pre-pain and pain conditions), and 0.95 for Experiment 3 (across pre-pain conditions). This suggests that even with the fluctuations in the N45 induced by pain, the N45 for each participant was stable across time, further supporting the reliability of our data. These ICCs will be reported in the next revision.
  
  Hernandez-Pavon, J. C., Veniero, D., Bergmann, T. O., Belardinelli, P., Bortoletto, M., Casarotto, S., ... & Ilmoniemi, R. J. (2023). TMS combined with EEG: Recommendations and open issues for data collection and analysis. Brain Stimulation, 16(3), 567-593.
  
  Kerwin, L. J., Keller, C. J., Wu, W., Narayan, M., & Etkin, A. (2018). Test-retest reliability of transcranial magnetic stimulation EEG evoked potentials. Brain stimulation, 11(3), 536-544.
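
  The reliability figures in the response above are described as two-way fixed, single-measure coefficients. As a rough illustration, and assuming this corresponds to the consistency form often written ICC(3,1), the coefficient can be computed from a participants-by-timepoints table of N45 amplitudes as sketched below. The function name and the example values are hypothetical; the authors' own software and exact ICC variant are not stated in the response.

```python
from typing import List


def icc_3_1(scores: List[List[float]]) -> float:
    """Two-way, single-measure, consistency ICC (assumed ICC(3,1)).

    `scores` has one row per participant and one column per timepoint.
    ICC = (MS_rows - MS_error) / (MS_rows + (k - 1) * MS_error)
    """
    n = len(scores)      # participants
    k = len(scores[0])   # timepoints / conditions
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need a rectangular table with n >= 2 and k >= 2")

    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Hypothetical amplitudes: every participant shifts by the same amount
# between timepoints, so the ranking of participants is perfectly preserved.
example_scores = [
    [1.0, 2.0, 3.0],
    [2.0, 3.0, 4.0],
    [4.0, 5.0, 6.0],
]
example_icc = icc_3_1(example_scores)
```

  Because the consistency form removes the timepoint (column) effect, a uniform shift of the N45 during pain does not lower the coefficient; only changes in the relative ordering of participants do.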
  
  Lack of measures to mask auditory noise.
  
  In TMS-EEG research, various masking methods have been proposed to suppress the somatosensory and auditory artefacts resulting from TMS pulses, such as white noise played through headphones to mask the click sound (Ilmoniemi and Kičić, 2010), and a thin layer of foam placed between the TMS coil and EEG cap to minimize the scalp sensation (Massimini et al., 2005). However, recent studies have shown that even when these methods are used, sensory contamination of TEPs is still present, as shown by studies that show commonalities in the signal between active and sensory sham conditions that mimic the auditory/somatosensory aspects of real TMS (Biabani et al., 2019; Conde et al., 2019; Rocchi et al., 2021). This has led many authors (Biabani et al., 2019; Conde et al., 2019) to recommend the use of sham conditions to control for sensory contamination. To separate the direct cortical response to TMS from sensory evoked activity, Experiment 2 (n = 10) included a sham TMS condition that mimicked the auditory/somatosensory aspects of active TMS to determine whether any alterations in the TEP peaks in response to pain were due to changes in sensory evoked activity associated with TMS, as opposed to changes in cortical excitability. Therefore, the lack of auditory masking does not impact the main conclusions of the paper.
  
  Ilmoniemi, R. J., & Kičić, D. (2010). Methodology for combined TMS and EEG. Brain topography, 22, 233-248.
  
  Massimini, M., Ferrarelli, F., Huber, R., Esser, S. K., Singh, H., & Tononi, G. (2005). Breakdown of cortical effective connectivity during sleep. Science, 309(5744), 2228-2232.
  
  Biabani, M., Fornito, A., Mutanen, T. P., Morrow, J., & Rogasch, N. C. (2019). Characterizing and minimizing the contribution of sensory inputs to TMS-evoked potentials. Brain stimulation, 12(6), 1537-1552.
  
  Conde, V., Tomasevic, L., Akopian, I., Stanek, K., Saturnino, G. B., Thielscher, A., ... & Siebner, H. R. (2019). The non-transcranial TMS-evoked potential is an inherent source of ambiguity in TMS-EEG studies. Neuroimage, 185, 300-312.
  
  Rocchi, L., Di Santo, A., Brown, K., Ibáñez, J., Casula, E., Rawji, V., ... & Rothwell, J. (2021). Disentangling EEG responses to TMS due to cortical and peripheral activations. Brain stimulation, 14(1), 4-18.
  
  3) A supra-stimulus heat stimulus not based on individual HPT, that oscillates during the experiment and that lead to large variations in pain intensity across participants is unfortunate.
  
  The choice of whether to calibrate or fix stimulus intensity is a contentious question in experimental pain research. A recent discussion by Adamczyk et al. (2022) explores the pros and cons of each approach and recommends situations where one method may be preferred over the other. That paper suggests that the choice of methodology is related to the research question: when the main outcome of the research is objective (neurophysiological measures) and researchers are interested in the variability in pain ratings, the fixed approach is preferable. Given we explored the relationship between MEP/N45 modulation by pain and pain intensity, this question is better explored by using the same stimulus intensity for all participants, as opposed to calibrating the intensity to achieve a similar level of pain across participants.
  
  Adamczyk, W. M., Szikszay, T. M., Nahman-Averbuch, H., Skalski, J., Nastaj, J., Gouverneur, P., & Luedtke, K. (2022). To calibrate or not to calibrate? A methodological dilemma in experimental pain research. The Journal of Pain, 23(11), 1823-1832.
  
  So is the lack of report on measures taken to correct for a fortuitous significance (multiple comparison correction) in such a huge number of serial paired tests.
  
  Note that we used a Bayesian approach for all analyses as opposed to a traditional frequentist approach. In contrast to the frequentist approach, the Bayesian approach does not require corrections for multiple comparisons (Gelman and Tuerlinckx, 2000), given that it provides a ratio representing the strength of evidence for the null vs. alternative hypotheses as opposed to accepting or rejecting the null hypothesis based on p-values. As such, throughout the paper, we frame our interpretations and conclusions based on the strength of evidence (e.g. anecdotal/weak, moderate, strong, very strong) as opposed to referring to the significance of the effects.
  
  Gelman, A., & Tuerlinckx, F. (2000). Type S error rates for classical and Bayesian single and multiple comparison procedures. Computational Statistics, 15(3), 373-390.
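
  To make the evidence categories named in the response above concrete, the sketch below maps a Bayes factor to a verbal label. The cut-offs (3, 10, 30, 100) are an assumption based on a commonly used convention and are not stated in the response; the function name is hypothetical. The example value 1.015 is the Bayes factor for the MEP comparison quoted by Reviewer #3 in this same exchange.

```python
def evidence_label(bf10: float) -> str:
    """Verbal label for a Bayes factor BF10 (alternative over null).

    Cut-offs are an assumed convention: <3 anecdotal, <10 moderate,
    <30 strong, <100 very strong, otherwise extreme. Values below 1
    are read as evidence for the null via the reciprocal.
    """
    if bf10 <= 0:
        raise ValueError("a Bayes factor must be positive")

    favoured = "alternative" if bf10 >= 1 else "null"
    strength = bf10 if bf10 >= 1 else 1.0 / bf10

    if strength < 3:
        label = "anecdotal"
    elif strength < 10:
        label = "moderate"
    elif strength < 30:
        label = "strong"
    elif strength < 100:
        label = "very strong"
    else:
        label = "extreme"
    return f"{label} evidence for the {favoured} hypothesis"


example_label = evidence_label(1.015)
```

  Under these assumed cut-offs a Bayes factor close to 1 is read as essentially uninformative, which is why such results are described as inconclusive rather than as "non-significant".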
  
  Reviewer #3 (Public Review):
  
  The present study aims to investigate whether pain influences cortical excitability. To this end, heat pain stimuli are applied to healthy human participants. Simultaneously, TMS pulses are applied to M1 and TMS-evoked potentials (TEPs) and pain ratings are assessed after each TMS pulse. TEPs are used as measures of cortical excitability. The results show that TEP amplitudes at 45 msec (N45) after TMS pulses are higher during painful stimulation than during non-painful warm stimulation. Control experiments indicate that auditory, somatosensory, or proprioceptive effects cannot explain this effect. Considering that the N45 might reflect GABAergic activity, the results suggest that pain changes GABAergic activity. The authors conclude that TEP indices of GABAergic transmission might be useful as biomarkers of pain sensitivity.
  
  Pain-induced change in cortical excitability is an interesting, timely, and potentially clinically relevant topic. The paradigm and the analysis are sound, the results are mostly convincing, and the interpretation is adequate. The following clarifications and revisions might help to improve the manuscript further.
  
  1) Non-painful control condition. In this condition, stimuli are applied at warmth detection threshold. At this intensity, by definition, some stimuli are not perceived as different from the baseline. Thus, this condition might not be perfectly suited to control for the effects of painful vs. non-painful stimulation. This potential confound should be critically discussed.
  
  In Experiment 3, we also collected warmth ratings to confirm whether the pre-pain stimuli were perceived as different from baseline. We did not include this data initially in the first submission, but will do so in the supplemental material in our next revision. This data showed warmth ratings were close to 2/10 on average. This confirms that the non-painful control condition produced some level of non-painful sensation.
  
  2) MEP differences between conditions. The results do not show differences in MEP amplitudes between conditions (BF 1.015). The analysis nevertheless relates MEP differences between conditions to pain ratings. It would be more appropriate to state that in this study, pain did not affect MEP and to remove the correlation analysis and its interpretation from the manuscript.
  
  The interindividual relationship between changes in MEP amplitude and individual pain rating is statistically independent from the overall group level effect of pain on MEP amplitude. Therefore, conclusions for the individual and group level effects can be made independently.
  
  It is also important to note that in the pain literature, there is now increasing emphasis placed on investigating the individual level relationship between changes in cortical excitability and pain as opposed to the group level effect (Seminowicz et al., 2019; Summers et al., 2019). As such, it is important to make these results readily available for the scientific community.
  
  Summers, S. J., Chipchase, L. S., Hirata, R., Graven-Nielsen, T., Cavaleri, R., & Schabrun, S. M. (2019). Motor adaptation varies between individuals in the transition to sustained pain. Pain, 160(9), 2115-2125.
  
  Seminowicz, D. A., Thapa, T., & Schabrun, S. M. (2019). Corticomotor depression is associated with higher pain severity in the transition to sustained pain: a longitudinal exploratory study of individual differences. The Journal of Pain, 20(12), 1498-1506.
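
  The argument above, that an individual-level relationship can be assessed separately from the group-level effect, can be illustrated with a small synthetic example. The numbers below are invented purely for illustration and are not study data; the function names are hypothetical. The sketch constructs MEP changes whose group mean is zero yet which track pain ratings perfectly across individuals.

```python
from math import sqrt
from typing import Sequence


def mean(values: Sequence[float]) -> float:
    return sum(values) / len(values)


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Synthetic, illustrative values only (not data from the study).
pain_ratings = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0]
# Participants with higher ratings show MEP decreases and those with lower
# ratings show increases, so the changes cancel out at the group level.
mep_change = [-0.1 * (rating - 5.0) for rating in pain_ratings]

group_effect = mean(mep_change)                      # approximately 0
individual_r = pearson_r(pain_ratings, mep_change)   # approximately -1
```

  In this constructed case a group-level test would find no mean change, while the correlation with pain ratings is as strong as it can be, which is the sense in which the two questions can be answered independently.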
  
  3) Confounds by pain ratings. The ISI between TMS pulses is 4 sec and includes verbal pain ratings. Considering this relatively short ISI, would it be possible that verbal pain ratings confound the TEP? Moreover, could the pain ratings confound TEP differences between conditions, e.g., by providing earlier ratings when the stimulus is painful? This should be carefully considered, and the authors might perform control analyses.
  
  It is unlikely that the verbal ratings contaminated the TEP response as the subsequent TMS pulse was not delivered until the verbal rating was complete and given that each participant was cued by the experimenter to provide the pain rating after each pulse (rather than the participant giving the rating at any time). As such, it would not be possible for participants to provide earlier ratings to more painful stimuli. We will make this part of the protocol clearer in the next revision of the manuscript.
  
  4) Confounds by time effects. Non-painful and painful conditions were performed in a fixed order. Potential confounds by time effects should be carefully considered.
  
  Previous research suggests that pain alters neural excitability even after pain has subsided. In a recent meta-analysis (Chowdhury et al., 2022) we found effect sizes of 0.55-0.9 for MEP reductions 0-30 minutes after pain had resolved. As such, we avoided intermixing pain and warm blocks given subsequent warm blocks would not serve as a valid baseline, as each subsequent warm block would have residual effects from the previous pain blocks.
  
  At the same time, given there was no conclusive evidence for a difference in N45 amplitude between pre-pain and post-pain conditions of Experiment 1 (Supplementary Figure 1), it is unlikely that the effect of pain was an artefact of time, i.e., the explanation that successive thermal stimuli applied to the skin result in an increase in the N45, regardless of whether they are painful or not. We will make this point in our next revision.
  
  Chowdhury, N. S., Chang, W. J., Millard, S. K., Skippen, P., Bilska, K., Seminowicz, D. A., & Schabrun, S. M. (2022). The Effect of Acute and Sustained Pain on Corticomotor Excitability: A Systematic Review and Meta-Analysis of Group and Individual Level Data. The Journal of Pain, 23(10), 1680-1696.
  
  5) Data availability. The authors should state how they make the data openly available.
  
  We will upload the MEP, TEP and pain data to the Open Science Framework at the time of the next revision.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.20.537735v2
www.biorxiv.org www.biorxiv.org

New submission 05/07/2023, 08:55:42

1
1. Public_Reviews 05 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Sun and colleagues investigated the cross-reactive antibodies between E. coli and the host in severe alcoholic hepatitis (SAH). The study found that IgA and IgG were deposited in the liver of SAH patients. Complements C3d and C4d were also deposited in the SAH patient's liver. Moreover, they found that the Ig accumulated in the SAH liver, but not in the SAH serum, induced hepatocyte killing, suggesting that liver Ig is important. Then, they found that these Ig can recognize both human and E. coli antigens. Very interestingly, SAH-derived Ig shows cross-reactivity to both human and E. coli antigens, suggesting E. coli-primed Ig in SAH may damage hepatocytes through host antigen recognition. These Ig are not observed in alcoholic cirrhosis patients. The liver RNA-seq data suggested that Ig was also produced in the liver, not only gut-derived Ig. This is a very interesting study showing the novel mechanism of SAH mediated by the Ig with the cross-reactivity with bacteria and host antigens, which is not observed in AC patients. Overall, the study design is reasonable and the data are consistent to support their central hypothesis. There are a few comments.
  
  We thank the Reviewer for his/her positive comments on our manuscript!
  
  Specific comments:
  
  1) Figures 1 and 2 show Ig deposition in the liver (it seems on hepatocytes). Not only Ig reaction to the specific antigen but also non-specific Fc receptor-mediated binding to hepatocytes could have contributed.
  
  2) Similarly, in Figure 2G Ig-mediated hepatocyte killing, Fc receptor-mediated hepatocyte killing may be involved.
  
  Anti-IgG antibody (ab200699) recognizes a protein of 75 kDa, identified as gamma heavy chain of human immunoglobulins. It is possible that non-specific Fc receptor-mediated binding to hepatocytes in the SAH liver can also be recognized by this anti-IgG antibody staining.
  
  However, no IgG or IgA deposition was identified in the healthy donor livers by anti-IgG or anti-IgA staining. These results suggest that there was no antigen-specific or Fc receptor-mediated binding to healthy hepatocytes.
  
  In the ADCC assay, hepatocytes isolated from healthy donor livers were used as the target cells. Immune cell (NK) mediated ADCC is mainly triggered by IgG (binding to antigens of hepatocytes) through the interaction between IgG Fc fragment and Fc-receptors (FcγRs) of NK cells. If IgG deposition in the SAH liver were mainly due to non-specific Fc receptor-mediated binding to hepatocytes, we would expect IgG binding to FcγRs of hepatocytes and no activation of NK cells. Ig-mediated hepatocyte killing (Figure 2G) indicates the Ig (from SAH liver) reaction to the specific antigens.
  
  3) The study examined the possibility of liver resident B cell and plasma cell-mediated Ig. As the authors mentioned in the discussion, B cells may be translocated from the intestine to the liver. Or the resident B cells (not from the gut) are also involved.
  
  We agree with the Reviewer on this point. Resident B cells may also be involved in Ig production.
  
  Reviewer #2 (Public Review):
  
  In this paper, Ahmadi et al demonstrated that antibodies produced locally in the liver by infiltrating B cells can enhance liver damage caused by fat accumulation. The main finding is that human samples extracted from severe alcoholic hepatitis showed antibody accumulation that may be related to an enhanced immune response to self-antigens, which could ultimately fuel liver damage - which was already present due to alcohol consumption. Their data are corroborated by arrays and gene ontology assays, and I strongly believe that these data could add to the future options we have to treat patients.
  
  We thank the Reviewer for his/her positive comments on our manuscript!
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.23.529702v1
www.biorxiv.org www.biorxiv.org

New submission 03/07/2023, 08:01:40

1
1. Public_Reviews 05 Jul 2023
  
  in eLife
  
  Author Response:
  
  We thank the reviewers and eLife editorial team for their valuable assessment. While additional experiments could further strengthen the theoretical framework proposed in this study, we believe that we have successfully established the delayed nuclear export of hemagglutinin and neuraminidase mRNAs by quantifying the FISH observation with the mathematical model. We agree that this finding raises a further important question to be addressed regarding the molecular mechanism underlying the prolonged nuclear retention of these segments. Our ongoing investigation is focusing on identifying potential cis-elements that contribute to the delay of these segments.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.07.536075v1
www.biorxiv.org www.biorxiv.org

New submission 04/07/2023, 10:30:09

1
1. Public_Reviews 04 Jul 2023
  
  in eLife
  
  Author Response:
  
  We are grateful to the three referees for their overall positive evaluation of our work and valuable constructive suggestions. We will address their public reviews with utmost care, as well as their private recommendations.
  
  To Reviewer #1: thanks for the positive comments and for the appreciation of our "impressive approaches".
  
  We will add a more comprehensive description of the neuronal migration analysis to the Materials and Methods section. We apologize for this lack of precision.
  
  We will address the comments about the sinuosity index definition and interpretations.
  
  We will enhance the clarity of our writing and delve deeper into the discussion. As mentioned to Reviewer #3, the brevity of the text was influenced by the Short report format.
  
  To Reviewer #2: thanks for the overall positive appreciation.
  
  We will also consider the recommendations for authors with care.
  
  To Reviewer #3: thanks a lot for the feedback.
  
  We will further develop the introduction and discussion sections as suggested. Regrettably and as mentioned to Reviewer #1, we had to significantly condense them due to the space constraints imposed by the Short Report format.
  
  We will attempt to overexpress Map1B in order to assess the potential phenotypic similarity to the Fmr1 null condition, as suggested. However, it is important to acknowledge that this experiment may not yield a definitive answer due to potential differences in the level of Map1B expression driven by a CMV promoter compared to its endogenous expression in Fmr1 null neurons, as well as variations in the subcellular distribution of the overexpressed Map1B.
  
  Regarding the anatomical consequences of aberrant migration, we acknowledge that neither our present work nor our previous study by Scotto-Lomassese et al. provides evidence in this regard, as pointed out by the reviewer. Indeed, based on our findings, the delayed neurons do reach the olfactory bulb. However, other studies have demonstrated that a delay in migration can have important functional consequences (e.g., Bocchi et al., 2017, doi: 10.1038/s41467-017-01046-w). Accordingly, we will revise and moderate our conclusions on this specific point.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.06.530447v4
www.biorxiv.org www.biorxiv.org

New submission 30/06/2023, 13:06:04

1
1. Public_Reviews 04 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Cedillo et al. address the critically important question of how biguanides exert their positive effects on longevity using the powerful C. elegans model. Biguanides metformin and phenformin have been widely prescribed in the clinic to address metabolic challenges of diabetes; more recently the value of metformin in addressing specific cancers has emerged, and testing for impact on healthy human aging is getting underway. The need to understand the mechanism of biguanide action and the metabolic consequences of biguanide administration is clear.
  
  The authors report that three genes that suppress longevity associated with metformin or phenformin treatment affect a common pathway for ether lipid biosynthesis; this ether lipid biosynthesis pathway is required for mitochondrial lifespan extension, eat-2 mediated dietary restriction longevity, and TOR inhibition-associated longevity, but not insulin pathway mediated longevity. Authors document with lipid profiling how ether lipids and some other lipids are impacted by phenformin vs. genetic disruption of ether lipid biosynthesis, define the tissue primarily responsible for the ether lipid biosynthesis, show that over-expression of enzyme fard-1 is sufficient to confer most of the phenformin effect, and implicate conserved stress transcription factor SKN-1 as a downstream outcome of the ether lipid change.
  
  Strengths include the exploitation of the nematode model to address requirements not readily discerned in other models, the rigor of genetic documentation, the inclusion of metabolic profiling, the testing of multiple potential pathways that have been in the general discourse regarding metformin action, and the elaboration of a reasonably supported model that ether lipid biosynthesis is required for phenformin to activate longevity-promoting metabolic defenses downstream of conserved stress-responsive transcription factor SKN-1/NRF2. The novelty includes that ether lipids are directly linked to lifespan, ether lipid biosynthesis is needed for specific longevity pathways, and that ether lipids might play a role in a shift to pro-longevity metabolism.
  
  There are some points that require clarification and could benefit from additional study, some wording and presentation issues, and a few missing points of potential discussion.
  
  Overall, the data reported in this paper contribute a highly valuable advance in the biguanide field and add stimulating hypotheses for the scientific community to pursue in this biomedically important area.
  
  We thank Reviewer #1 for their positive feedback regarding our work, and for their insightful suggestions to improve the rigor and impact of this manuscript.
  
  Reviewer #2 (Public Review):
  
  This manuscript pulls together a series of integrated genetic and metabolomic data sets to examine the molecular basis for biguanide action in C. elegans. Biguanides such as Metformin are important anti-diabetic drugs as well as being explored as a therapeutic mechanism for increasing human longevity. Understanding the molecular basis of biguanide action is of general interest to those in the ageing and age-related health fields as well as to those studying metabolism and obesity. The work here has been carried out in C. elegans but the work can be picked up by those working in mammalian systems. More could be done to highlight the conserved aspects of the mechanisms involved to assist with this translatability.
  
  The methodology used is in general standard in the field and experiments are reported in detail. The successful use of metabolomics in C. elegans and its associated protocols is helpful as more labs expand to do this type of work.
  
  Strengths: In general all the experiments presented are logical and well executed with the conclusions supported by the data. I am convinced that: 1) Metformin and Phenformin extend C. elegans lifespan (although that has previously been shown), 2) biguanides induce changes in ether lipids, 3) genes required for ether lipid biogenesis are required for the lifespan incurred with biguanide treatment and, in the case of fard-1 oe, can also promote longevity when levels are increased, 4) ether lipid biogenesis is also needed for other specific key longevity processes to extend lifespan, and 5) that some key ageing regulators (skn-1, aak-2 and daf-16) are required for fard-1 oe to extend lifespan.
  
  Weaknesses: I was less convinced by the fat accumulation data and felt that the link between skn-1 gain of function and ether lipid genes was not clear and that the results were more correlative than mechanistic. If age-associated somatic depletion of fat is important for the lifespans seen here then this is interesting and important and identifying an epistatic, genetic link between the implicated genes and fat levels is desirable. Additionally, biguanides are reported to have major effects on the metabolism and growth of bacteria. As C. elegans grows on and eats E. coli, it is important that the biguanides in question do not alter the worm's food source. If bacterial growth is restricted or metabolically altered this would have a major impact on fat metabolism and the other outputs examined here (see Cabreiro et al 2013). Therefore the impact of these biguanide treatments on the C. elegans foods used here should be clearly addressed. Additionally, biguanide treatment is subject to dose dependence. Different concentrations of biguanide are used for different types of experiments to make correlative points e.g. growth inhibition at 160mM metformin, and metformin uptake measured in C. elegans treated with 50mM. It is not clear why, or whether this could impact the results. Can the authors be sure that these different doses do not alter metformin action and/or uptake either by the worms or the way the bacteria metabolise it? I appreciate that it is interesting and important to understand what biguanides are doing in the organism irrespective of whether this is a direct or indirect effect but knowing how the effects are achieved could be important for treatment strategies moving forwards.
  
  We thank Reviewer #2 for their favorable comments on our manuscript and for their helpful feedback regarding the weaknesses in our initial manuscript submission. We address the major comments below:
  
  Regarding the genetic link between SKN-1 and ether lipid biosynthetic machinery in regulation of fat accumulation, we have performed Asdf analysis in skn-1(zu135) total loss-of-function animals, rigorously indicating that biguanides require SKN-1 to drive somatic lipid depletion (Figure 6D-E). We additionally show that biguanides activate the innate immune response sensor dod-24, previously shown by us to be activated by a transcriptionally redirected SKN-1 metabolic stress response program (ref. 2), in a manner that requires both SKN-1 and all ether lipid biosynthetic machinery (Figure 6F and Figure 6 – figure supplement 1C). Combined with our previous result showing that fard-1 (oe3) requires SKN-1 to extend lifespan (Figure 5D), and our observation that SKN-1 gain-of-function animals do not mimic the ether lipid pattern seen in FARD-1 overexpressing animals (Reviewer Response 1), our results rigorously corroborate that biguanides activate SKN-1 downstream of ether lipid machinery to exert a metabolic stress defense response. This activation results in alterations of somatic lipid homeostasis, innate immune response, and pro-longevity outcomes.
  
  Regarding possible indirect effects of biguanides on bacterial growth and metabolism to modulate ether lipid biosynthetic activity, we performed FAME GC/MS of Adult Day 1 nematodes treated with or without phenformin and grown on live or dead, metabolically inactive OP50-1 E. coli food sources using a rigorously established 1% PFA treatment protocol (Figure 6 – figure supplement 2; ref. 3). We additionally performed lifespan analyses in the same experimental design, with the inclusion of lifespan-extending doses of metformin (Figure 6 – figure supplement 3). Both experiments show, with biological replication, that biguanide-mediated induction of ether lipid synthesis, biguanide-mediated lifespan extension, and the dependency of biguanide-mediated lifespan extension on ether lipid machinery all operate through direct interactions in the worm, as opposed to indirect effects on bacterial growth and metabolism.
  
  Regarding the use of different doses of biguanides: this point was also raised by Reviewer 1 and is responded to above in Author Response 4. Briefly, the goal of the 160 mM dosage of metformin used in our prior genetic screens (ref. 10) and subsequently highlighted in Figure 1 – figure supplement 1A is to enhance the sensitivity and specificity of our discovery approach to identify effectors of the biological action of biguanides. The 160 mM dose causes potent growth inhibition in C. elegans. Our prior published work indicates that use of this dose to identify growth inhibitory effectors of biguanides can also identify longevity effectors of metformin (ref. 10). Thus, we used a similar strategy here to identify fard-1 and acl-7, which were initially identified as gene knockdowns that block the growth inhibitory effects of 160 mM metformin. The justification for the different biguanide concentrations used in this work is now included in the text for clarity (lines 135 to 153).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2021.09.02.457410v3
www.biorxiv.org www.biorxiv.org

New submission 30/06/2023, 13:01:44

1
1. Public_Reviews 04 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  "... the fact that MGN-BLA circuit disruptions were done during the conditioning phase of associative threat learning, and not during the recall phase only, complicates the side-by-side comparison: it could be argued that in this case what is disturbed is the processing of the unconditioned innately aversive stimulus in the task, the foot shock, instead of the learnt threat of the sound".
  
  In our previous email to the editors, we mentioned work by Barsy et al. showing that inhibition of this input during the recall phase does indeed reduce the freezing response (please see Fig. 8 in Barsy et al.). In the new revision, we refer to this experiment.
  
  Specific comments (weaknesses):
  
  e) There are not enough analysis and method descriptions to demonstrate the specificity of the targeting approach
  
  We have included these data as supplementary figures (S2A and B, S5B, S7, S9A and S10K) and added a more detailed methodology in the method section.
  
  f) …the authors administer blockers of beta-adrenergic receptors systemically. This reveals differences between MGN-BLA projecting neurons, BLA neurons, and innate and learnt threat, but the mechanistic implications are not clear and should be discussed.
  
  In the revised manuscript, we extensively discuss these points: (This indicates that the looming stimulus conveyed through the thalamic input…may contribute to the variability in the effect of the drug in freezing response); (...in mice injected with propranolol, the defensive responses…The differences in species or strains used, or experimental parameters may contribute to the variability in the effect of the drug in freezing response.)
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.10.523445v1
www.biorxiv.org www.biorxiv.org

New submission 06/03/2023, 12:44:02

1
1. Public_Reviews 04 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  Mahbub et al further elucidate the structural and functional consequences of the ARL15-CNNM2 interaction for divalent cation transport. They show that ARL15 has low GTP binding affinity and could not detect GTPase activity, questioning whether ARL15 functions as a GTPase. Although the interaction of ARL15 and CNNMs has been demonstrated by multiple groups before, this study addresses some of the key questions that are central within the TRPM-CNNM-PRL-ARL15 field. Particularly, the authors have identified residues in both ARL15 and CNNM proteins which are required for their binding to one another. In addition, they have also illustrated how PRL proteins compete with ARL15 for their binding to CNNMs. Lastly, the functional consequences of ARL15 binding to CNNMs are shown by TRPM7-mediated Zn2+ transport assays.
  
  We thank the reviewer for the many positive comments.
  
  However, the current dataset also comes with limitations. Previous studies demonstrated that PRLs interact with the CBS domains of CNNMs and lock them in their so-called "flat" conformation. It remains unclear how ARL15 affects the structure of the CBS domains, especially in the presence of ATP. The subcellular localisation of these interactions has not been examined. Moreover, the consequences of ARL15 on TRPM7 activity are not completely elucidated. It remains unclear whether this functional effect is CNNM-dependent. Moreover, how the zinc uptake results translate to the transport of other divalent ions, such as magnesium, has not been examined. These questions should be answered to confirm the model as presented in Figure 7.
  
  We agree that CBS-pair domain dimerization is important. Structural studies of a prokaryotic CNNM homolog from our group showed large conformational changes in an ATP-binding mutant (Chen et al., Nat Comm, 2021).
  
  While most crystal structures of PRL-CNNM complexes do indeed show the flat conformation, it is unclear whether that is a consequence of crystal packing or of PRL binding. We do not see an effect of ATP on PRL binding affinity. The CBS-pair domain dimerization interface appears to be very adaptable; our recent structure of PRL-CNNM proteins from flies shows a completely different dimerization interface (Fakih et al., JBC, 2023).
  
  In contrast, the ARL15-CNNM interaction is affected by ATP. As suggested by the reviewer, we have examined ARL15 binding to a CNNM2 mutant (T568I) that is unable to bind ATP. These results confirm that the roughly two-fold improvement in affinity is due to ATP binding to the CNNM2 CBS-pair domain and the resulting conformational changes.
  
  As requested by all the reviewers, we have added experiments to Figure 7 that investigate the effect of ARL15 on Mg2+ transport.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.19.524765v1
www.biorxiv.org www.biorxiv.org

New submission 11/01/2023, 14:01:09

1
1. Public_Reviews 04 Jul 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  It has recently been shown that the HIV-1 protease can cleave and activate the inflammasome-forming sensor CARD8 upon treatment of infected cells with non-nucleoside reverse-transcriptase inhibitors (Wang et al., Science 2021). Here, Kulsuptrakul and colleagues show that the high susceptibility to proteolytic activation by the HIV-1 protease is a specific feature of human CARD8. They show that changes in the human-specific F-F motif render the CARD8 protein of non-human primates largely resistant to cleavage. Interestingly, the protease of SIVcpz, the direct precursor of pandemic HIV-1 strains, is also capable of cleaving human but not chimpanzee CARD8. Thus, the authors propose that a human-specific CARD8 motif may contribute to the increased levels of inflammation and disease progression in HIV-infected humans compared to non-human primates that are naturally infected with SIV.
  
  Strengths of the study are that the authors convincingly show that a single human-specific amino acid change in CARD8 determines its susceptibility to cleavage by the HIV-1 protease and that the results shown are well controlled and presented. It is also interesting that SIVcpz can cleave human CARD8 and activate an inflammatory response. The major weakness is that it remains unclear whether HIV-1 or SIVcpz may induce CARD8-dependent inflammatory responses in primary CD4+ T cells or macrophages. The most relevant setting in the study was the infection of THP-1 cells with the T cell line-adapted X4-tropic HIV-1 LAI molecular clone. However, the effects on cell death were modest (Figure 3A) and those on IL-1β secretion were not dose-dependent (Figure 3B). Altogether, stronger effects were observed with VSV-G-pseudotyped HIV-1 and only those were used in subsequent experiments involving human CARD8 cleavage mutants (Figure 4). Additional evidence that primary HIV-1 molecular clones and/or SIVcpz may indeed induce CARD8-dependent inflammatory responses in primary viral target cells would greatly increase the significance of the study. In the absence of such data, conclusions about the potential role of CARD8 sensing of the viral protease for the pathogenesis of AIDS should be cautioned throughout.
  
  We have now added an experiment using the HIV-1 strain BG505, which uses a distinct co-receptor and is from a different clade than LAI. The results show that BG505 infection also induces CARD8-dependent inflammasome activation (Figure 3E).
  
  We have also more specifically measured caspase-1 activation using a FLICA assay (which specifically measures active CASP1) in WT, CARD8 KO and CASP1 KO THP-1 cells (Figure 3D, right panel). In experiments with both VSV-G pseudotyped and infectious virus, we observed increased FLICA signal in WT but not CASP1 KO THP-1 cells. Moreover, the FLICA signal and other readouts of inflammasome activation in CARD8 KO THP-1 cells were indistinguishable from those in the CASP1 KO THP-1 cells (Figure 3D). Thus, our results are consistent with HIV-1 infection inducing CASP1-dependent pyroptosis downstream of CARD8.
  
  While we agree with the reviewers that primary cell data would be informative, we believe that this is not the main point of our paper. Moreover, others have already shown CARD8-dependent cell death after infection of primary T cells with HIV-1 (Wang et al., 2021, Science; Clark et al. 2022, Nature Chem Biol; Balibar et al. 2023, Science Trans Med; Wang & Shan, 2023, BioRxiv). We therefore have not extensively pursued primary cell experiments in this manuscript and instead have elected to use a more easily manipulatable cell line to focus on the evolutionary and mechanistic basis of CARD8 activation by simian lentiviruses.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.04.510817v1
www.biorxiv.org www.biorxiv.org

New submission 03/07/2023, 13:08:10

1
1. Public_Reviews 03 Jul 2023
  
  in eLife
  
  Author Response:
  
  Reviewer #1 (Public Review):
  
  […] Collective variable choice:
  
  The explanation for the choice of CVs on page 5 is not sufficient to understand the process and its likely success. How were the most important and unimportant CVs identified exactly? Table 2 on page 19 shows only gate distances, cavity-filter distances and a single variable related to filter structure itself (77 CA - 77 CA) representing a pinch. Is that pinching really the only slow variable associated with inactivation changes in the filter? Why are there no variables, say for carbonyl flipping, E71 or D80 movements or even for ion and water occupancy (although water may be sampled with control of other interactions, such as involving L81)?
  
  CVs for steering simulations were selected based on structural comparisons between the X-ray structures as well as the information about inactivation available in the literature. These steering CVs were later used as CVs for the string method, with the exception of those found to be irrelevant in preliminary string simulations (see methods for details). For example, we discarded CVs that would just oscillate freely and thus represent fast-equilibrating CVs. We will add additional explanations to the methods section of the manuscript in revisions.
  
  Carbonyl flipping, E71 and D80 movement and SF occupancy were observed in the initial steering simulation to correlate with the 77 CA - 77 CA opening and the opening of the L81-W67 contact. They were not biased but followed the expected path as a consequence of the motion of the imposed selectivity filter constriction. Therefore, they did not need to be explicitly biased. The same can be said with respect to water occupancy behind the selectivity filter, which correlates with the opening of the L81-W67 contact.
  
  I understand that the X-ray structure is the one source of information used to define an inactivated structure and is one with just a pinch and no complete carbonyl flipping away from the pore, as has been identified in past studies and discussed as being involved by the authors on page 14. Key changes like carbonyl flipping surely are part of the story and may be slow variables. At the very least, if not part of the CV space, could be analysed.
  
  The reviewer is correct in stating that there are molecular motions of interest aside from the ones included in the CV space. Figure 3 and the associated supplementary figures extensively investigate the probability distributions of many of those as the system progresses along the inactivation pathway. These results are presented in the section titled “Free energy landscapes offer insights into atomistic-resolution mechanistic details”. Carbonyl flipping seemed to be highly correlated with the 77 CA - 77 CA distance and this analysis was therefore not presented.
  
  On page 10 the authors discuss possible differences in Amber and Charmm involving the extent to which the 4 subunits change in respect to the L81-W67 water pathway and W67-D80 hydrogen bond, arguing the different results for force field could be to do with different numbers of subunits doing different things. If I understand, the chosen CVs are all tetramer-based distances (including across subunits) and not subunit-based CVs, so that random and incomplete changes may occur to subunits for a given point in CV space.
  
  In fact, some of the CVs represent intrasubunit distances (for example, L81-W67), while others represent distances across subunits. This distinction never represented a criterion to select CVs.
  
  There is thus potential for the string to converge on a local minimum pathway with partial changes to its interactions within and between subunits, and may not be a unique global solution. Can the authors please explain whether or not this is possible and what analysis has been done to check it?
  
  This indeed represents a well-recognized shortcoming of all string-based enhanced sampling methods. The string-of-swarms method used herein assumes that there is a dominant minimum free energy path and requires a reasonable starting path. One major advantage of this methodological choice, however, is that the path can be described in high dimension, thus avoiding the stark dimensionality reduction required by many collective-variable based methods such as metadynamics.
  
  We do note that though the initial path was the same for the two force fields, the final pathway is different, which tends to indicate that the results do not only depend on the initial path but also on the force field guiding the dynamics of the process.
  
  X-ray endpoints and initial pathway:
  
  The string was created from a pulling/steered MD between existing X-ray structures for the closed (5VKH), partially open (3FB5), fully open (5VK6) and finally inactivated (5VKE) states. The authors write on page 12 that "The block of conduction during inactivation appears to result from pinching at the selectivity filter...", but given the end point was forced to be the X-ray structure with pinching, wasn't this outcome predetermined? This raises a significant point: how much has the choice of endpoints predetermined the final states of the string, i.e., how far is an end state actually allowed to drift away from the initial X-ray structure? Was a bead placed at the very endpoint and allowed to update via swarms, or was it fixed and all beads just interpolate between those fixed end states? The reason this is important is that it is plausible the inactivated crystal structure with pinching but not other changes (such as complete V76 carbonyl flipping or outer filter splaying), may not be the actual free energy minimum structure for that state and that force field.
  
  The reviewer is right to point out that this observation is most likely a consequence of the choice of the end points of the initial string. The string method assumes that the end points of the string are fairly representative of the initial and final states of the process studied. In this case, for ease of use, the endpoints of the simulation were fixed. When endpoints are left free to relax, they drift towards the closest minima, which makes comparisons between force fields, between simulation conditions, etc. more difficult.
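
To make the role of the fixed endpoints and of the starting path concrete, the following is a minimal sketch of a string-of-swarms update on a toy two-dimensional potential. Everything in it is an illustrative assumption rather than the authors' setup: the potential, the overdamped Langevin dynamics, the parameter values, and the function names `grad_v`, `reparametrize` and `string_of_swarms` are all hypothetical.

```python
import numpy as np

def grad_v(p):
    """Gradient of a toy potential V = (x^2 - 1)^2 + 5*(y - 0.5*(1 - x^2))^2.

    The two minima sit at (-1, 0) and (1, 0); the valley joining them is curved.
    """
    x, y = p[..., 0], p[..., 1]
    c = y - 0.5 * (1.0 - x**2)
    gx = 4.0 * x * (x**2 - 1.0) + 10.0 * c * x
    gy = 10.0 * c
    return np.stack([gx, gy], axis=-1)

def reparametrize(path):
    """Redistribute beads at equal arc length along the current string."""
    seg = np.linalg.norm(np.diff(path, axis=0), axis=1)
    s = np.concatenate([[0.0], np.cumsum(seg)])
    s_new = np.linspace(0.0, s[-1], len(path))
    cols = [np.interp(s_new, s, path[:, d]) for d in range(path.shape[1])]
    return np.stack(cols, axis=1)

def string_of_swarms(path, n_iter=100, n_traj=32, n_steps=20,
                     dt=1e-3, kT=0.1, seed=0):
    """Iterate: launch a swarm of short trajectories from each bead,
    move each bead to the swarm average, then reparametrize.
    The two end beads are held fixed (an assumption mirroring the text)."""
    rng = np.random.default_rng(seed)
    path = path.copy()
    start, end = path[0].copy(), path[-1].copy()
    for _iteration in range(n_iter):
        # One copy of every bead per swarm member: (n_beads, n_traj, 2).
        swarm = np.repeat(path[:, None, :], n_traj, axis=1)
        for _step in range(n_steps):
            noise = rng.standard_normal(swarm.shape)
            swarm += -grad_v(swarm) * dt + np.sqrt(2.0 * kT * dt) * noise
        path = swarm.mean(axis=1)          # average drift of each swarm
        path[0], path[-1] = start, end     # endpoints stay fixed
        path = reparametrize(path)
    return path

# Straight-line initial guess between the two minima.
initial = np.stack([np.linspace(-1.0, 1.0, 15), np.zeros(15)], axis=1)
final = string_of_swarms(initial)
```

In this toy case the interior beads relax from the straight-line guess into the curved valley while the end beads never move, so any feature of the end states is fixed by construction. On a landscape with several valleys, the same update would relax only into the valley nearest the initial guess.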
  
  We do agree that the selection of initial and final states as well as the starting string are important modeling choices. For this reason, we were very mindful and made these choices based on the existing published evidence (available at the time).
  
  We will make these details explicit in a revised version of the manuscript.
  
  Another obvious concern is the possible reliance on the initial pulling procedure used before string optimisation began. Fig.2 Supp 1 shows generally that the Amber path stayed pretty close to the initial steered MD path, whereas Charmm drifted downward away from that path. One could justifiably ask, if a very different initial path was chosen, might different local minimum pathways result, including Amber sampling a path like Charmm? How does one test whether or not the final path has not been trapped in some local trough of free energy? e.g. Imagine starting the Amber string using an initial path like the more diagonal Charmm-like path, or even a more extreme unphysiological one, such as a steered trajectory that initially inactivates before opening the gate. Would the final results be the same? I appreciate the simulations are very expensive and such trials may not be possible, but what evidence is there that the final path has not been trapped away from the global minimum?
  
  As stated above, the reviewer is right to point out the weakness of the method of converging to the closest local minimum free energy path. It is unfortunately computationally infeasible to test many possible paths. For this reason, we chose to initiate our calculations with a pathway based on experimental data; in this case based on available X-ray structures. In addition, it is necessary to contrast the results of the simulation with available experimental evidence: the string method with swarms of trajectories, when aptly used, has a history of bringing useful insights to several biological systems (Lev et al. 2017b; Suh et al. 2019; Fleetwood et al. 2019, 2021; McComas et al. 2022).
  
  As already noted, the fact that the two force fields yield very different energy landscapes is evident since they would otherwise converge to the same final pathway given the same initial pathway guess.
  
  One test offered by the authors is a set of unbiased MD simulations launched from points on the string. The authors ran 200ns simulations and write on page 5 that "These simulations have the expected stability based on their starting values. This is a good quality test to check the correct estimation of the general features of the free energy surface". While this sounds reasonable, 200ns MD may only be sufficient to begin to explore locally within the solved free energy trough, much like the swarms in the iterations were able to do. My own examination of Fig2 Supp 5 is that some of these simulations linger around the expected states and some drift away within the general trough of sampling, which is a good sign. What those 200ns simulations may not be able to do is escape that trough and see evidence of other possible solutions, beyond what was sampled with the string that was tied to Xray endpoints and trapped in the solution pathway that was already formed after 100-300 iterations. Overall, the string involved 800 iterations of 10ps swarms (80ns around each bead; albeit 32 trajectories in parallel), allowing good local sampling around the beads in the free energy trough, but in terms of ability to diffuse away from that point, only being comparable in contiguous trajectory time to the unbiased MD tests. It therefore would have been interesting to see if longer simulations remain in this trough; though I understand the challenges in running so much MD. Such simulations may, however, lead to exploration beyond what was seen in the string solutions.
  
  We agree with the reviewer that longer simulations would be very interesting to understand the behavior of the string-of-swarms method and how it behaves for this intricate FES. Note however, that 80 ns divided over 32 trajectories yields an overall trajectory length that is ~two orders of magnitude below a single 200 ns-long simulation. We thus still stand by our statement that the fact that these simulations behave as expected from the free energy landscapes is a good quality check of the CVs and of the resulting free energy landscapes.
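The comparison of time scales in this response can be made explicit with a short calculation. This is only a sketch that takes the figures quoted in the exchange at face value (80 ns of swarm sampling per bead, 32 trajectories, 200 ns unbiased runs); the variable names are hypothetical.

```python
import math

swarm_time_per_bead_ns = 80.0  # aggregate swarm sampling per bead, as quoted above
n_trajectories = 32            # trajectories per swarm, as quoted above
unbiased_run_ns = 200.0        # length of each unbiased validation simulation

# Sampling time attributable to a single swarm trajectory.
per_trajectory_ns = swarm_time_per_bead_ns / n_trajectories

# How much longer one unbiased run is than one swarm trajectory.
ratio = unbiased_run_ns / per_trajectory_ns
orders_of_magnitude = math.log10(ratio)

print(f"{per_trajectory_ns} ns per trajectory; ratio {ratio:.0f}x "
      f"(about {orders_of_magnitude:.1f} orders of magnitude)")
```

Under these assumptions each swarm trajectory accounts for 2.5 ns, so a 200 ns run is 80 times longer, which is the "~two orders of magnitude" referred to in the response.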
  
  Force field effects and origin:
  
  Regarding the effect of the chosen force field, the authors state that "Given that our simulations were conducted under activating conditions, we had expected the open states to be more populated than the closed ones. Simulations carried out at higher pH may be able to resolve this inconsistency". Also running at high pH would be a nice thing to do to prove the method is in fact sensitive to conditions to see a shift in the distribution of states.
  
  Indeed this is the logical next step for future work.
  
  But the question is why were open states not more occupied under low pH and 50mM K+? From my analysis of the figures, the results show that the Charmm force field tends to allow for opening of the channel somewhat (at least with similar free energy for partially and fully open to closed) whereas Amber tends to close the channel more (with more uphill energy as the channel opens than Charmm; Fig 2). i.e. at low pH and 50 K+, isn't the Amber model incorrectly reporting fairly strong bias against opening? Moreover, regarding the free energy of the inactivated state itself, why should we not expect equilibrated channels under activating conditions to eventually fall into an inactivated state, in which case we should expect low free energy of that state (as found with Charmm and not Amber in Fig2), but with a slow rate. While much discussion in the manuscript appears to discuss limitations in Charmm (although on page 12 discussion leans either way), these factors may seem to favour Charmm over Amber.
  
  We would like to thank the reviewer for raising these points. We can only speculate about what might be the reasons for these discrepancies, and we have tried to be as honest as possible in our manuscript and avoid overinterpretation of our results. It is interesting that Reviewer 2 gathered from our data that the AMBER results were more consistent with expectations while this reviewer thought the opposite. This does reinforce our decision to avoid taking sides and present both options. Our personal opinion is currently that both force fields are imperfect at describing all the aspects of the activation-inactivation gates coupling. We will include more discussion in the revisions of the manuscript.
  
  On page 12 the authors explain the possible causes for force field dependence, although this seems limited to ion interactions, glutamate charges and dihedrals. But it would be nice to get a bit more insight into what terms may have influenced the pathway, in particular involving interactions between TM2 and the base of the selectivity filter and hydration behind the filter. Regarding ion interactions, is there a good reason to believe ions are key to the difference seen? i.e. How were ions involved differently in the state transitions involving Amber and Charmm? The authors have noted a role for ion-carbonyl interactions.
  
  We agree that this would be interesting, but judged that this would be better done in a separate study. We do note that the K-carbonyl interactions have been reported as candidates for these discrepancies, as mentioned and cited in the manuscript. Very recent simulations using ab initio MD support the view that the overestimation of the K-carbonyl interaction is the reason for the low conductance of potassium channels in classical MD; see Hui et al., Biophysical Journal, vol. 122, issue 3, p. 520a. We will add this reference in revisions.
  
  It is important that the authors explain which of the two competing models has been used and why. i.e. Off-the-shelf Charmm36 force field includes strong K+-backbone carbonyl interaction, previously seen to promote high ion occupancy, similar to Amber, whereas Lennard-Jones parameters modified to match N-methyl-acetamide and water partitioning (such as early Berneche, Noskov and Roux work) reduce ion occupancy and increase water content inside the filter.
  
  We have used “off-the-shelf” or conventional CHARMM36 as described in the literature cited.
  
  Reviewer #2 (Public Review):
  
  […] The study is impressive and interesting. However, I have a number of concerns that the authors may wish to address in a revised version of the manuscript.
  
  First, concerning a set of unbiased simulations spawned at different regions of the investigated free energy landscapes, the authors write: "These simulations have the expected stability based on their starting values".
  
  Fig 2.c shows a rather smooth downhill slope in the free energy curve towards the closed state for AMBER, so wouldn't the expected behavior in that case be that all unbiased trajectories end up in the closed state, or at least travel a substantial amount in that direction? However, that is not observed. This should be further investigated.
  
  It is true that this would be the effect we should observe after a significant simulation time. Resorting to 200ns-long simulations, our goal was to test whether the local free energy basins identified by the string-of-swarms method were indeed metastable. If that were the case, we would expect the trajectories to remain within the basins on medium timescales due to the kinetic barriers that would need to be overcome to transfer to other basins. Of course, if simulations were long enough, all basins would eventually be explored by the trajectory with a probability related to the relative free energy of the basins.
  
  Second, "This suggests that stabilization of the partially open state by the removal of bound lipids can explain the increase in open probability" is an odd statement, as "stabilization of the partially open state" means almost the same as "increase in open probability".
  
  It is true that one appears to necessarily imply the other. An increase in open probability could potentially come from two effects: a stabilization of the open state or a destabilization of the closed one. In a two-state system, the two cases are indistinguishable since only relative differences in free energy matter. However, this is a three-state system: if one takes the energy of the inactivated state as a reference, the physics of the system differs depending on whether a stabilization of the open state or a destabilization of the closed state occurs.
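The two-state versus three-state argument can be sketched numerically with Boltzmann populations. The free energies below are hypothetical values in units of kT, with the inactivated state as the reference (G = 0); they are not taken from the manuscript.

```python
import math

def populations(free_energies_kT):
    """Boltzmann populations p_i = exp(-G_i) / Z for free energies given in kT."""
    weights = {state: math.exp(-g) for state, g in free_energies_kT.items()}
    z = sum(weights.values())
    return {state: w / z for state, w in weights.items()}

# Hypothetical baseline: closed (C), open (O), inactivated (I, reference state).
baseline = {"C": -1.0, "O": 0.0, "I": 0.0}

# Scenario A: the open state is stabilized by 1 kT.
stabilize_open = populations({**baseline, "O": baseline["O"] - 1.0})

# Scenario B: the closed state is destabilized by 1 kT.
destabilize_closed = populations({**baseline, "C": baseline["C"] + 1.0})

for name, pops in (("stabilize open", stabilize_open),
                   ("destabilize closed", destabilize_closed)):
    print(name, {state: round(p, 3) for state, p in pops.items()},
          "O/C ratio:", round(pops["O"] / pops["C"], 3))
```

Both scenarios give the same open-to-closed ratio, which is why they cannot be told apart in a two-state picture, but they differ in how much population ends up in the inactivated state.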
  
  The statement "both force fields yield inactivation barriers that are orders of magnitude lower than what is expected from electrophysiology experiments" seems inaccurate. Perhaps the authors mean "inactivation rates that are orders of magnitude lower" rather than barriers?
  
  Yes, this was a mistake on our part. We will amend the manuscript.
  
  In addition, the assertion "The CHARMM force field, on the other hand, results in landscapes in agreement with the fact that one of the dominant states in activating conditions is the partially open state, as revealed by a combination of ssNMR+MD." seems to hold for the AMBER force field without PG lipids rather than for CHARMM?
  
  AMBER simulations with or without bound PG lipids have a fully open state basin within the minimum free energy path (Fig 4a, 4b), which is not the case for CHARMM (Fig 2b). In that sense, the CHARMM force field seems to be in better agreement with the ssNMR data. The ssNMR+MD study, however, suggests that the PO state basin should be the lowest in free energy. In both cases, however, the C basin is lower in free energy than the PO. We can only speculate about why that may be.
  
  Together with the higher barrier towards the inactivated state as well as covering most known x-ray structures along the inactivation pathway, this would seem to point all in the direction that the studied AMBER force field provides a more faithful picture of the inactivation pathway than CHARMM. I, therefore, find the somewhat inconclusive summary as presented in Fig. 5 a bit uninformative, as it suggests that both mechanisms might be equally likely.
  
  Although the X-ray structures do suggest an AMBER-like path, structural information in isolation is not sufficient to fully understand a phenomenon of dynamical nature. X-ray structures of metastable states, particularly open states, require the use of engineered mutations and other techniques to trap these states. We of course do not question that a lot of very valuable information can be derived from them, but they should be considered in the context of other computational and experimental techniques. We believe we are very explicit in the text in discussing the weaknesses and strengths of either possibility. In fact, we find it interesting that Reviewer 1 gathered from our data that the CHARMM results were more consistent with expectations. This does reinforce our decision to avoid taking sides and present both options. Our personal opinion is currently that both force fields are imperfect at describing all the aspects of the activation-inactivation gates coupling.
  
  Overall, the study would benefit from a follow-up step to become more conclusive. This could be either in the form of the suggested L81 mutation or changing the simulation conditions to inactivating conditions such as low salt, in which case the inactivated state would be expected to become a minimum, which would provide an additional reference point for validation. Either of these would narrow down the spectrum of possible mechanisms.
  
  We absolutely agree with this reviewer. These are great suggestions for further investigations that will definitely be considered in future studies.
  
  Reviewer #3 (Public Review):
  
  […] The analysis is careful and is state-of-the-art. The results reveal remarkable differences between the CHARMM and AMBER force fields.
  
  Unfortunately, the "elephant in the room" with regards to K+ channel inactivation is the significance of the dilated structures more recently obtained by Xray and EM. While it is worthwhile doing our best to really understand the constriction mechanism of KcsA, and the present manuscript does an excellent job at that, the ground has shifted and understanding finer points about KcsA constriction has become, unfortunately, not the most prominent issue in the field at the present time.
  
  Let's discuss the current situation about the inactivation of K+ channels. The situation is fairly unsettled. The KcsA channel was the first for which some atomic structure and mechanism, centered on a constriction of the selectivity filter, were proposed. The constricted conformation really does not conduct because the filter is too narrow. More recently a few structures (Xray and EM) for channel mutants known to have more propensity to inactivate have revealed a different conformation of the filter which appears to be dilated toward the extracellular side. This is a conformation that had never been seen previously. Different "camps" co-exist in the K+ channel community about inactivation. Those who were very skeptical about the constricted conformation claim that the new dilated structures are the final truth. The dilated structures are certainly part of the body of information that we have now, but their significance remains somewhat unclear, if only because they are not perfectly occluded and they allow ion conduction!
  
  We appreciate the reviewer’s comments and we are also grateful for the contextualization of the current state of the literature with respect to KcsA inactivation.
  
  Although we acknowledge the importance of these new findings and look forward to a lively debate in the literature regarding the importance of this alternative mechanism, this information was not available at the time when this study was started. In any case, for an initial study with a novel technology and with methodological choices such as the force field choice, studying the more established path still seems a valid choice. Of course, the techniques used in this study can be used to investigate new hypotheses and contrast them with our current work. This will be an important line of work going forward. We will add further literature discussion to the manuscript and better outline how we decided on the scope of this study.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.05.535698v1
www.biorxiv.org www.biorxiv.org

Inhibition of microtubule detyrosination by parthenolide facilitates functional CNS axon regeneration

1
1. Public_Reviews 03 Jul 2023
  
  in eLife
  
  Author Response:
  
  Weaknesses:
  
  1) In vivo studies are limited to select outcomes of recovery and do not validate or address mechanism of action in vivo.
  
  2) Known activities of DMAPT beyond microtubule detyrosination, such as oxidative stress, mitochondrial function and NFkB inhibition, are not considered in experimental examinations or in the interpretation of findings.
  
  Response: Our research indicates that parthenolide exhibits a regenerative effect within a nanomolar range and with a bell-shaped concentration-response curve in culture. Moreover, we demonstrate a close correlation between the inhibition of detyrosinated microtubules and regeneration and consider the effects of hIL-6 or PTEN-KO on detyrosination in mouse and human RGCs. Therefore, we offer a coherent and satisfactory mechanistic explanation for the effects of parthenolide. We, therefore, feel the request to experimentally explore additional, somewhat speculative possibilities is not reasonable or helpful, and this issue should not be considered as a weakness.
  
  Moreover, to the best of our knowledge, no evidence suggests profound antioxidative effects of DMAPT or parthenolide within these low-concentration ranges or that these would affect axon regeneration. Antioxidative effects may also not explain the observed bell-shaped curve. Furthermore, we have already considered the effect of NFkappaB in our previous work (Gobrecht et al., 2016) and shown that NFkappaB remains unaffected by low concentrations of parthenolide. Hence, conducting additional experiments addressing oxidative stress or other speculative causes would not strengthen our findings and would not justify the additional sacrifice of animal lives. Nevertheless, we will consider discussing these points in a revised version.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.05.535722v3
www.biorxiv.org www.biorxiv.org

New submission 03/07/2023, 08:08:16

1
1. Public_Reviews 03 Jul 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  Jamge et al. sought to identify the relationships between histone variants and histone modifications in Arabidopsis by systematic genomic profiling of 13 histone variants and 12 histone modifications to define a set of "chromatin states". They find that H2A variants are key factors defining the major chromatin types (euchromatin, facultative heterochromatin, and constitutive heterochromatin) and that loss of the DDM1 chromatin remodeler leads to loss of typical constitutive heterochromatin and replacement of this state with features common to genes in euchromatin and facultative heterochromatin. This study deepens our understanding of how histone variants shape the Arabidopsis epigenome and provides a wealth of data for other researchers to explore.
  
  Strengths:
  
  1) The manuscript provides convincing evidence supporting the claims that: A) Arabidopsis nucleosomes are homotypic for H2A variants and heterotypic for H3 variants, B) that H3 variants are not associated with specific H2A variants, and C) H2A variants are strongly associated with specific histone post-translational modifications (PTMs) while H3 variants show no such strong associations with specific PTMs. These are important findings that contrast with previous observations in animal systems and suggest differences in plant and animal chromatin dynamics.
  
  2) The authors also performed comprehensive epigenomic profiling of all H2A, H2B, and H3 variants and 12 histone PTMs to produce a Hidden Markov Model-based chromatin state map. These studies revealed that histone H2A variants are as important as histone PTMs in defining the various chromatin states, which is unexpected and of high significance.
  
  3) The authors show that in ddm1 mutants, normally heterochromatic transposable element (TE) genes lose H2A.W and gain H2A.Z, along with the facultative heterochromatin and euchromatin signatures associated with H2A.Z at silent and expressed genes, respectively.
  
  Weaknesses:
  
  1) Following up on the finding that H2A.Z replaces H2A.W at TE genes in ddm1 mutants, the authors provide in vitro evidence that DDM1 binds to H2A.Z-H2B dimers. These results are taken together to conclude that DDM1 normally removes H2A.Z-H2B dimers from nucleosomes at TE genes and replaces them with H2A.W-H2B dimers. However, the evidence for this model is circumstantial and such a model raises a variety of other questions that are not addressed by the authors.
  
  The Reviewer raises a series of interesting questions. We proposed that DDM1 exchanges H2A.Z to H2A.W because it is the simplest model and also because LSH, the mammalian ortholog of DDM1, exchanges H2A to macroH2A. However, we do stress in the revised manuscript that this is a model and that other models, which could involve chaperones and additional remodelers, are possible. Addressing why the loss of DDM1 results in a net exchange of H2A.W to H2A.Z is not the purpose of this study. Here we use the perturbation caused by ddm1 as a means to address the importance of the dynamic exchange of H2A variants in setting up the chromatin states. We do observe that perturbing this dynamic exchange causes an important perturbation of chromatin states. This further supports our main conclusion: H2A variant dynamics are one important factor that organizes chromatin states.
  
  For example: if DDM1 does remove H2A.Z from TE genes, how does H2A.Z normally come to occupy these sites, given that they are highly DNA methylated and that H2A.Z is known to anticorrelate with DNA methylation in plants and animals?
  
  The anticorrelation between H2A.Z and DNA methylation is observed at steady state. The exchange of H2A.Z to H2A.W that results from the action of DDM1 would indeed remove unwanted H2A.Z from regions occupied by DNA methylation as suggested by the Reviewer.
  
  Given that H2A.Z does not accumulate in TEs in h2a.w mutants, how would H2A.X and H2A instead become enriched at these sites if DDM1 cannot bind these forms of H2A?
  
  This is a valid question: We envisage that H2A.X and H2A are deposited by remodelers and chaperones other than DDM1 in the h2a.w mutant.
  
  Given that there are no apparent regions with common sequence between H2A.Z and H2A.W variants that are not also shared with other H2A classes, how would DDM1 selectively bind to H2A.W-H2B and H2A.Z-H2B dimers to the exclusion of H2A(.X)-H2B dimers?
  
  It was shown by the Muegge Lab both in vitro and in vivo that LSH, the mammalian ortholog of DDM1, binds to macroH2A and H2A, and these two H2A variants do not share a similar specific region. It remains to be determined which region of H2A.Z and H2A.W binds to DDM1, which is beyond the scope of this study.
  
  Reviewer #2 (Public Review):
  
  Jamge et al. set out to delineate the relationship between histone variants, histone modifications and chromatin states in Arabidopsis seedlings and leaves. A strength of the study is its use of multiple types of data: the authors present mass-spec, immunoblotting and ChIPseq from histone variants and histone modifications. They confirm the association between certain marks and variants, in particular for H2A, and nicely describe the loss of constitutive heterochromatin in the ddm1 mutant.
  
  The support for some of the conclusions is weak. The title of the discussion, "histone variants drive the overall organization of chromatin states" implies a causation which wasn't investigated, and overstates the finding that some broad chromatin states can be further subdivided when one considers histone variants (adding variables to the model).
  
  We have removed subtitles in the discussion and have taken care to avoid over simplified statements.
  
  Adding variables to a ChromHMM model naturally increases the complexity of the models that can be built, however it is difficult to objectively define which level of complexity is optimal. The differences between states may be subtle to the point that they may be considered redundant. The authors claim that the sub-states they define are biologically important, but provide little evidence to support this claim. It is not obvious whether the 26 states model is much more useful than a 9-states model. Removing variables naturally affects the definition of states that depend on these variables, but it is also hard to define the biological significance of that change. This sensitivity analysis is thus not very developed.
  
  We agree that adding more input tracks/ data will increase the complexity.
  
  But we would like to mention the differences between this study and the 9-state model:
  
  1) We have included the histone variants which have been previously missed in chromatin state definition.
  
  2) The previous 9-state model used data from different tissue types. In this study all the data generated and analyzed are from seedlings.
  
  3) Increasing the number of states allowed us to resolve heterochromatin states that were missed in the 9-state model. (BioRXiv)
  
  4) The biological relevance of the 26 states model is analyzed and described in depth (States BioRxiv paper).
  
  In addition, we have now updated Figure 2F to include a more direct comparison of the marks used in both models, and we have expanded the description in the Methods section of our reasoning for analyzing the 26-state model in depth.
  
  There are issues with the logical sequence of arguments in Fig1 and Fig3. Fig1A shows that nucleosomes often contain both H3.1 and H3.3. Therefore pulling-down H3.1-containing nucleosomes also pulls down H3.3 and whether specific H2A variants associated with H3.1 cannot be answered in this way (Fig1B).
  
  We thank the Reviewer for pointing this out. If 60% of nucleosomes are homotypic and if they associated with a specific H2A variant, this would be clearly visible on WB as a much stronger band. Also, the MS data presented in Figure 1 figure supplement 1D clearly show that all H2A variants associate with both H3.1 and H3.3. We have included a more detailed explanation in the revised version to clarify this point.
  
  The same issue likely carries to the investigation of the association with H3 modifications in Fig1C and 1D, since the H3.1-HA pull-down also pulls down endogenous H3.1 (so presumably the rest of the nucleosome, with H3.3, as well).
  
  We disagree on this point. The H3 band corresponding to the transgene copy is either H3.1 or H3.3, so all signals on upper band (T) in Figure 1C are associated with either H3.1 (H3.1 IP) or H3.3 (H3.3 IP), thus unambiguously showing that all modifications we analyzed are present on both H3.1 and H3.3. Furthermore, data shown in Figure 1D and E, where we analyzed modifications on K27 and K36 which are in the H3 region that can be distinguished between H3.1 and H3.3 by MS clearly demonstrate that these modifications are present on both H3.1 and H3.3. In order to make this clearer, we also extended the description of this part in the Results section to emphasize this.
  
  In Fig3, the conclusion that it is the loss of H2A.Z -> H2A.W exchange in the ddm1 mutant that causes loss of constitutive heterochromatin is rushed. The fact that the h2a.w mutant does not recapitulate the loss of constitutive heterochromatin seen in ddm1 argues against this interpretation.
  
  We agree that at first the minimal impact of the loss of H2A.W alone is surprising. However, we point to the preprint https://www.biorxiv.org/content/10.1101/2022.05.31.493688v1. There it is shown that the joint loss of H2A.W and H3K9 methylation (also observed in ddm1) affects silencing of a large range of transposons that also lose silencing in ddm1.
  
  It's also difficult to conclude about the importance of dynamic exchanges when the ddm1 mutation has been present for generations and the chromatin landscape has fully readapted. Further work is needed to support the authors' hypothesis.
  
  We apologize that the Reviewer could not find the information regarding the origin of the ddm1 mutant material. We did not use a mutant in which the ddm1 mutation had been kept for generations. We were in fact very careful on this point and used leaves from first-generation homozygous ddm1 plants segregated from a line in which ddm1 was kept heterozygous.
  
  The study also relies on a large number of custom (polyclonal) antibodies with no public validation data. Lack of specificity, a common issue with antibodies, would muddle the interpretation of the data.
  
  We added information about validation of custom-made antibodies into Methods: “Specificities of custom-made polyclonal antibodies against Arabidopsis H2A.Z.9, H2A.X, H2A.W.6, H2A.13, H2A.W.7, H2Bs, and linker histone H1 were validated in previous publications (Yelagandula et al., 2014; Lorkovic et al., 2017; Jiang et al., 2020; Osakabe et al., 2021).” For H2A.2 and H2A.Z.11 antibodies we provide validation data as Figure 2 figure supplement 1.
  
  Overall, this study nicely illustrates that, in Arabidopsis, histone variants (and H2A variants in particular) display specificity in modifications and genomic locations, and correlate with some chromatin sub-states. This encourages future work in epigenomics to consider histone variants with as much attention as histone modifications.
  
  Reviewer #3 (Public Review):
  
  How chromatin state is defined is an important question in the epigenetics field. Here, Jamge et al. proposed that the dynamics of histone variant exchange control the organization of histone modifications into chromatin states. They found 1) there is a tight association between H2A variants and histone modifications; 2) H2A variants are major factors that differentiate euchromatin, facultative heterochromatin, and constitutive heterochromatin; 3) the mutation in DDM1, a remodeler of H2A variants, causes the mis-assembly of chromatin states in TE region. The topic of this paper is of general interest and results are novel.
  
  Overall, the paper is well-written and results are clearly presented. The biochemical analysis part is solid.
  
  Reviewer #4 (Public Review):
  
  This work aims at analyzing the impact of histone variants and histone modifications on chromatin states of the Arabidopsis genome. Authors claim that histone variants are as significant as histone modifications in determining chromatin states. They also study the effect of mutations in the DDM1 gene on the exchange of H2A.Z to H2A.W, which convert the silent state of transposons into a chromatin state normally found on protein coding genes.
  
  This is an interesting and well done study on the organization of the Arabidopsis genome in different chromatin states, adding to the previous reports on this issue.
  
  Reviewer #1 (Recommendations For The Authors):
  
  1) The rationale for switching from using 10-day old seedlings for chromatin profiling to using mature leaves in Figure 3 and beyond is not explained and introduces additional complexity into the analyses. The reasoning should be clearly explained in the text, and there are several additional suggestions or questions related to this that should be addressed:
  
  This was done for practical reasons. We had already obtained some profiles of marks in ddm1 mutants and extended the dataset using the same stage of development because this tied this study with our previous study. Using different stages of development provides an additional benefit. The same chromatin states are observed in 10 day old seedlings and leaves of older plants. Constitutive heterochromatin is occupied by the same chromatin states and logically euchromatin is positioned on different genes as expected by the distinct pattern of gene expression at the two stages of development.
  
  A) In the 16-state model (Figure 3A), euchromatin states were not well defined compared to the 26-state model. Why did the authors not profile these marks also, and could this explain why ddm1 mutants did not show a significant effect on euchromatin states in this model?
  
  We apologize for the lack of detailed explanation: In our previous study we used leaves of five weeks ld plants to show the impact of ddm1 on the profiles of H2A.W.6, H2A.X, H1, H3K9me2, H3K36me3 and H3K27me3 in leaves (Jamge, Osakabe et al., 2021). This study showed that DDM1 causes the deposition of H2A.W.6 to heterochromatin and we thus used leaves to extend this investigation to the two other marks of heterochromatin (constitutive or facultative) H3K9me1, H2A.W.7 and H2A.Z.9 and H2A.Z.11.
  
  B) The authors state that the tissue types do not impact the definition of chromatin states. However, there is a clear difference in the portion of the genome occupied by each chromatin state between leaf and seedling (states 1, 5, 8, 13, and 14; Figure S3A).
  
  We had missed a comment on supFig3B and have now provided more explanation: “Although the composition of the chromatin states did not vary significantly between seedlings and leaves, each state occupied a similar proportion of the genome in seedling or leaves to the exception of state 5 present primarily in leaves and state 13 only present in seedlings (Figure 3 figure supplement 3A, right column with green bars) and the euchromatin states occupied different genes (Figure 3 figure supplement 3B) as expected by the dissimilar transcriptomes of these two developmental stages.”
  
  2) The naming of supplemental figures throughout the text is confusing as the legends refer to them as "Figure SX" but they are called out in the text as "Figure X figure supplement XA-B". The eLife convention is "Figure X figure supplement XA-B".
  
  This was changed.
  
  3) In Figure 4, Panel D is mislabeled as C in the figure, and C is lacking a label.
  
  4) Please remove the word "the" from the title.
  
  This was done
  
  Reviewer #2 (Recommendations For The Authors):
  
  Fig1D legend should also mention K37.
  
  This was corrected.
  
  Fig2F legend should say "no H3 modifications" rather than "no histone modifications".

  This was corrected.
  
  Fig4 labels C/D do not correspond to the legend. D is missing and C should go to the ddm1 stacked barplot.
  
  This was corrected.
  
  H3 variants analysis: Taking the relative abundance of H3.1 and H3.3 (and transgenes) into account would be useful to interpret the results of the nucleosome composition results. If they are at equivalent amounts, the null hypothesis of independent association would give 50% heterotypic nucleosomes and 50% homotypic.
  
  This is a valid comment. In an ideal system the last statement would be correct, but this does not take into account chromatin dynamics associated with replication, transcription, etc. Also, the total amounts of H3.1 and H3.3 in the tissue we used for the experiment are not known. They could possibly be inferred from RNAseq data, but whether this would reflect real amounts of the protein is highly questionable. In Arabidopsis there are 5 H3.1 genes and 3 H3.3 genes. Nevertheless, we recalculated data for H3.1 and H3.3 and this has been updated in the main text (~60% of H3.1 and ~42% of H3.3 immunoprecipitated nucleosomes contained both H3 variants). Thus, from the available data these numbers are the best we can get.
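
  The null expectation raised in this exchange can be made concrete with a small sketch. It assumes that each nucleosome draws its two H3 copies independently from a pool in which H3.1 has frequency p, an idealisation that ignores the replication- and transcription-coupled dynamics mentioned in the response. Under that assumption the heterotypic fraction is 2p(1 - p). The function names and example frequencies are illustrative and are not taken from the manuscript.

```python
def heterotypic_fraction(p_h31: float) -> float:
    """Expected fraction of nucleosomes with one H3.1 and one H3.3 copy,
    assuming the two H3 positions are filled independently."""
    if not 0.0 <= p_h31 <= 1.0:
        raise ValueError("p_h31 must be between 0 and 1")
    return 2.0 * p_h31 * (1.0 - p_h31)


def homotypic_fraction(p_h31: float) -> float:
    """Expected fraction of nucleosomes with two identical H3 copies."""
    return 1.0 - heterotypic_fraction(p_h31)


if __name__ == "__main__":
    # Equal pool sizes reproduce the 50% / 50% split described by the reviewer.
    print(heterotypic_fraction(0.5), homotypic_fraction(0.5))
    # Unequal pools lower the expected heterotypic fraction.
    print(heterotypic_fraction(0.8))
```

  Because the relative abundance of the two variants in the tissue is not known, the value of p here is a free parameter rather than a measured quantity.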
  
  p. 5 bottom paragraph. Repetition.
  
  This was corrected
  
  p12. The reference to LSH is dropped in without making clear how it is relevant. Expand on mechanism to suggest similar DDM1 mechanism?
  
  This section was expanded to provide more background in the interpretation of the results.
  
  p13. inversion between H2A.W and H2A.Z in "the loss of DDM1 prevents the replacement of H2A.W by H2A.Z".
  
  This was corrected
  
  p13. make it clear that the last sentence of the results is a working model, not a fully backed up conclusion.
  
  Alternative models are mentioned in this section and in the discussion in the revised version.
  
  p14 middle paragraph. Not clear what "in silico simulation" refers to. Simply chromatin-state classification with ChromHMM?
  
  This refers to the Jaccard index calculation in Fig. 2F that models the impact of the loss of H2A variants (or other elements of chromatin) on the definition of chromatin states by ChromHMM. This is now clarified.
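
  For readers unfamiliar with the measure, a Jaccard index compares two sets by dividing the size of their intersection by the size of their union. The sketch below assumes that a chromatin state can be represented as a set of genomic bin identifiers, which is an illustrative simplification and not the authors' pipeline; the bin values are made up.

```python
def jaccard_index(a: set, b: set) -> float:
    """Size of the intersection divided by the size of the union."""
    union = a | b
    if not union:
        # Convention chosen for this sketch: two empty sets count as identical.
        return 1.0
    return len(a & b) / len(union)


if __name__ == "__main__":
    # Hypothetical bins assigned to one state by a full model and by a
    # model trained without a subset of the marks.
    full_model_bins = {1, 2, 3, 4, 5}
    reduced_model_bins = {4, 5, 6}
    print(jaccard_index(full_model_bins, reduced_model_bins))
```

  A value near 1 means the two models assign nearly the same bins to the state, and a value near 0 means the state is largely redefined when the marks are omitted.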
  
  p14 bottom paragraph: the H2A.Z tail repression of ubiquitin ligase but its being the favoured substrate for H2AK121Ub is apparently contradictory. Can this be explained?
  
  This refers to H2B Ubiquitination and is now clarified
  
  p15. Correlation between variants and modifications/chromatin states does not necessarily mean causation.
  
  We agree and have improved the revised version in this respect.
  
  p15 "forward feedback loop" is ambiguous (is it a feed-forward loop? A feedback loop?), just use "positive feedback loop".
  
  This was corrected.
  
  p23 top "$(Ingouff et al)" doesn't seem properly formatted.
  
  This reference did not belong there and has been removed.
  
  Data availability: GSE226469 is not public. The manuscript also mentions availability of source data for all the main figures, but I could not find it. It would be great to make the code publicly available too.
  
  All the data and code will be public upon posting the revised version of the manuscript.
  
  Reviewer #3 (Recommendations For The Authors):
  
  My major concern is authors only used DDM1 as an example to show that the exchange of the histone variant contributes to definition and distribution of chromatin state on transposons (i.e., constitutive heterochromatin regions associated with H2A.W). Readers may wonder whether similar mechanisms also work at the euchromatin region. This point should be clearly discussed and mentioned in the Results (for example, cite recent work on INO80).
  
  We discuss the impact of other remodelers in the Discussion in the revised version. We hope that the reviewer will understand that a study of the impact of other remodelers on chromatin states would require dozens of new ChIP profiles and is clearly beyond the scope of revising a manuscript.
  
  Minor:
  
  1) Fig. 2A and 2B, what does color mean? I guess the color code is referred to chromatin states (Fig. 2F).
  
  We have clarified on Figure 2A the attribution of a specific color to each chromatin state. This same color is used also in other panels of Figures 2 and S2.
  
  2) Supplemental Figures: All the figure panels should be on the same page.
  
  We rearranged supplemental figures so that each figure fits on one page. In places where this was not possible, we created additional supplemental figures.
  
  3) "We observed that increasing state numbers from 26 to 27 gave rise to biologically redundant states.": Where are the data? Fig S2A? This figure is hard to understand.
  
  In the updated manuscript, we have described the legend and the methods for FigS2A in more detail.
  
  Reviewer #4 (Recommendations For The Authors):
  
  A general concern refers to the text that frequently falls into excessive oversimplifications and/or overstatements, with the danger of being misleading for the reader. This needs to be thoroughly revised.
  
  We added more careful statements and proposed alternative models when it was possible.
  
  Specific comments.
  
  1) Fig 1A. Authors found the ~40% of nucleosomes contained both H3.1 and H3.3. This is a significant finding that deserves a more detailed comment.
  
  We now provide a more detailed description of IP and MS data presented in Figure 1. This should also help to avoid oversimplifications and/or overstatements as criticized in a general comment.
  
  2) Fig 1C. "H3.1 and H3.3 bore the same sets and comparable levels of methylation and acetylation...". Too general a statement, please specify. Is this also the case for H3K9me2? Others?

  We now describe this part in more detail to convey more precisely what Figure 1 shows. We also included data on K9me in Figure 1 figure supplement 1H.
  
  3) Fig 1D. Could you confirm the high level of H3K27me1 on H3.3?
  
  H3K27me1 data are shown both by WB (Figure 1C) and Mass spectrometry (Figure 1D and E). We also provide a possible explanation for high levels of this mark on H3.3 by taking into account the fact that H3K27me1 is also produced by demethylation of H3K27me3 by JMJ demethylases.
  
  4) All WB in Fig 1. They need to be quantified and normalized (plus statistical analysis) in order to provide strong support to the conclusions.
  
  The conclusions of all WBs are supported by quantified mass spectrometry data, and many WBs were shown repeatedly in Figure 1F (for example, IPs for H2A variants and a large set of H3 marks used for WBs) with the same results. Also, association of H3K4me3 and H3K36me3 with H2A variants was analyzed in both ways (Figure 1F): IPs of variants with WBs of variants and marks, and IPs of marks with WBs of marks and variants. For most of the data we do not have more than two repeats, so statistical analysis may not be possible.

  Nevertheless, we are convinced that our major conclusions from data presented in Figure 1 and Supporting figure 1 (these are: that H3 variants form both homotypic and heterotypic nucleosomes, that H3 marks do not preferentially associate with H3 variants but some of them do so with H2A variants and that H3 modifications show a very complex pattern of associations with each other) are fully valid as they were drawn from two orthogonal approaches and further supported by the chromatin states identified.
  
  5) Fig. 2A. Authors focus on "the most parsimonious model" based on 26 chromatin states. This needs to be justified in a more explicit manner. It is surprising that this number emerges from an analysis of 27 independent variants and marks. What are the differences in the conclusions when other numbers of states are used? See also below (reduced number of states derived from the "concatenated model").
  
  Why 26 states were chosen is now explained in great detail in the Methods section. With the exception of H2A variants, which are invariably homotypic, nucleosomes can be heterotypic for all other histone variants and histone modifications. The number of random combinations of the 27 marks in one nucleosome representing one state is therefore 4 (H2A, without the subtypes) × 4 (H3) × 2 (H1) × 2^16 (for each mark), which is well above the circa 26 states observed. This shows that our probabilistic model reduces the potential complexity of a theoretical random association in a remarkable manner.
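
  The size of this combinatorial space can be checked with a few lines of arithmetic. The sketch below takes the counts quoted in the response at face value (4 H2A options, 4 H3 options, 2 H1 options, and 16 marks each treated as present or absent); the variable names are illustrative assumptions.

```python
# Counts as quoted in the response; each of the 16 marks is treated as binary.
h2a_options = 4
h3_options = 4
h1_options = 2
binary_marks = 16

# Number of distinct combinations if all elements associated at random.
total_combinations = h2a_options * h3_options * h1_options * 2 ** binary_marks

observed_states = 26

if __name__ == "__main__":
    print(total_combinations)
    # Rough ratio of the theoretical space to the number of states in the model.
    print(total_combinations / observed_states)
```

  With these inputs the theoretical space holds 2,097,152 combinations, roughly 80,000 times the 26 states retained in the model.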
  
  6) As a summary, it would be very helpful to generate a table (or similar) where each proposed chromatin state is ascribed to functional genomic elements.

  This aspect of the work is presented in a preprint where the biological association with the chromatin is described in detail. See Jamge et al., 2022, https://www.biorxiv.org/content/10.1101/2022.06.02.494419v1

  7) Fig 2F (and S2B). A comprehensive comparison of various approaches should include others and estimate the Jaccard similarity index: (1) the same set of marks and variants used in the Sequeira-Mendes et al paper, and (2) the subset of marks and variants added in this study. In this way, a direct evaluation of the contributions could be more properly made.

  We thank the reviewer for this suggestion and have now included a new column with the combination of marks and variants as used in Sequeira-Mendes et al., 2014 (see Figure 2F). These data clearly demonstrate that adding histone variants significantly contributes to the definition of chromatin states.
  
  8) Fig. 3. Explain in more detail the concatenated model used here. Does the reduction in the number of chromatin states mean that the other do not add new information?
  
  The ChromHMM concatenated model allows the identification of a common definition of chromatin states across multiple tissue types. Here multiple cell types are concatenated, leading to a shared definition of chromatin states, while the assignment of states remains specific to each cell type.

  In our paper we used the concatenated model to identify common chromatin states in two different genotypes (WT and ddm1). The data for WT and ddm1 were obtained from leaves. As we had a limited number of ChIP-seq profiles in the leaf dataset, the complexity of the concatenated model was also reduced compared to the extensive 26 chromatin state model. We chose to analyze 16 states in the concatenated model because this was the minimal number of states that gave rise to a similar complexity of heterochromatic states.

  9) The ddm1 mutant. The text in page 14 is a bit confusing. It seems that H2A.Z is deposited on TEs and then exchanged by H2A.W.
  
  We have provided additional alternative models that could explain our observations.
  
  10) Page 15: link between H2A.Z and H3K27me3. Gomez-Zambrano et al (2018, cited in the text) found that only a relatively small subset of (putative) targets are common to H2A.Z and H3K27me3. How do authors reconcile this with their statement supporting a link between both of them?

  We refer to Gomez-Zambrano et al to illustrate the link between H2A.Z and H2AK121ub so we do not understand this comment. The strong link between H2A.Z and H3K27me3 is shown without ambiguity by our work and also Carter et al., 2018.
  
URL

biorxiv.org/content/10.1101/2023.03.08.531698v2
Jun 2023
www.biorxiv.org

New submission 29/06/2023, 08:26:58

1
1. Public_Reviews 30 Jun 2023
  
  in eLife
  
  Author Response:
  
  Reviewer #1 (Public Review):
  
  The study investigates the nature of "trailblazer" cells in distinct tumor models, including luminal B (MMTV/PyMT) and triple negative (TNBC) tumors (C3-TAg). The authors note that the trailblazer phenotypes in the TNBC model are more complex relative to the Luminal B model and represent distinct EMT programs associated with the expression of distinct EMT-TFs (Zeb1, Zeb2 and Fra-1). They demonstrated that of numerous EMT-TFs, Zeb1 and Fra-1 were required for increased cancer cell migration and invasion. They reveal that TGF-beta and EGF-mediated signaling are required for the diverse EMT states that are required for trailblazer cell activity and increased cell migration/invasion. TGF-beta signaling engaged Zeb1 and Zeb2 while EGF signaling activated Fra-1. Indeed, inhibitors of either TGF-beta or EGF signaling could impair cell migration/invasion. While both pathways contributed to trailblazer phenotypes, EGF signaling was shown to interfere with certain TGF-beta induced transcriptional responses, including the expression of genes encoding extracellular matrix proteins.

  One concern was the heavy reliance on the C3-TAg as the sole TNBC model in which the distinct trailblazer phenotypes were described. The data in Fig. 3 of the submission reveals that the phenotypes observed in the C3-TAg model could be recapitulated in a TNBC patient-derived xenograft model (PDX). Using this PDX, the authors were able to show vimentin expression in lung metastatic TNBC cells that were intravascular, those that had extravasated and clusters of cancer cells fully within the lung parenchyma. This was an important addition to the manuscript. The additional experiments to investigate the role of Zeb1 and Zeb2 more fully, beyond the focus on Fra-1 in the initial submission, were an additional strength of the new submission. Additional clarifications to the discussion also clarified the concepts articulated in the study. The study employs multiple breast cancer models, utilizes numerous in vitro and in vivo assessments of the trailblazer phenotypes, and the experimental design is rigorous and the interpretation of the data is sound. The manuscript will be of general interest to the research community.
  
  Thank you for the supportive comments. We are glad that the revisions addressed your prior concerns.
  
  Reviewer #2 (Public Review):
  
  This represents an important study that demonstrates a high degree of heterogeneity within trailblazer cells in clusters that participate in collective migration. Solid methods highlight this heterogeneity and show that in TNBC cancers, trailblazer cells are defined by vimentin (and not Keratin 14) and are dependent on both TGFbeta and EGFR signaling. Additional single cell studies would further support this work.

  Thank you for the suggestion. Our current data establishes that trailblazer cells are heterogeneous using FACS, immunostaining and functional studies of fresh tumor organoids and established tumor organoid lines. In addition, our RNA-seq experiments provided deep insight into the nature of gene expression changes that corresponded with the evolution of new trailblazer states. This discovery of trailblazer cell heterogeneity was one of multiple key new discoveries in this manuscript, along with revealing a Krt14-independent invasion mechanism, the regulation of trailblazer cells by Tgfβ and Egfr signaling and a new compromise mode of signal integration. We agree that our results support further investigation of the nature and function of basal-like breast cancer heterogeneity during the progression to metastasis. However, a comprehensive implementation of scRNA-seq is most likely required to further unravel new aspects of heterogeneity that substantially advance upon the conclusions supported by our current data. Such an undertaking is beyond the scope of this investigation.
  
  We agree that scRNA-seq would be confirmatory of trailblazer cell heterogeneity that has been demonstrated with multiple approaches rather than a new discovery of heterogeneity.
  
  Strengths:
  
  The paper highlights that collective migration and the nature of trailblazer cells can be highly heterogeneous. This is important as it suggests that the ability to move between states may supersede a singular phenotype.

  The paper uses animal models and organoids and in several areas attempts to correlate findings to human tissues.
  
  The experiments are logically described.
  
  Reviewer #3 (Public Review):
  
  Cancer is a disease of many faces and in particular, the ability of cancer cells to change their phenotypes and cell behaviors - cancer cell plasticity - is a major contributor to cancer lethality and the therapeutic challenge of treating this disease. In this study, Nasir, Pearson et al., investigate tumor cell plasticity through the lens of invasive heterogeneity, and in particular in models of triple-negative breast cancer (TNBC), a subtype of breast cancer with particularly poor clinical prognosis and more limited treatment modalities. Using organoid models in a variety of matrix systems, microscopy, and signaling pathway inhibitors, they find that invading TNBC breast tumors, primarily in the C31-Tag genetically engineered mouse model of TNBC, are composed of heterogeneous invasive/"trailblazer" type tumor cells that in many cases express vimentin, a classical intermediate filament marker of epithelial-mesenchymal transition, and reduced keratin-14, another filament marker of basal epithelial cells associated with collective invasion in different breast cancer models. Supportive genetic and pharmacologic evidence is provided that generation of these cells is TGF-beta signaling pathway driven, likely in vivo from the surrounding tumor microenvironment, in accord with published studies in this space. Another important aspect of this study is the good transcriptional evidence for multiple migratory states showing differing degrees of partial overlap with canonical EMT programs, dependent on TGF-beta, and suggestive but at present incomplete understanding of a parallel program involving Egfr/Fra-1 mediated effects on invasion. When taken in context with other recent studies (Grasset et al. Science Translational Medicine 2022), these data are broadly supportive of the concept of targeting vimentin-dependent invasion programs in TNBC tumors.
  
  The core conclusions of this paper are generally supported by the data, but there are some conceptual and technical considerations that should be taken into account when interpreting this study. Specific comments:
  
  1) The contribution of the different vimentin-positive trailblazer cells to distant metastasis was not directly confirmed in vivo in this study. Given the limited proliferative potential of many fully EMT'd cells and in light of recent studies indicating that invasion can be uncoupled from metastatic potential, it seems important to directly test whether the different C31-tag isolates, varying in invasive potential in this study, produce metastases and, if so, whether metastasis abundance correlates with the invasive potential in 3D culture. The collection of lungs at 34 days post injection described in methods is too short to evaluate metastatic frequency.

  We agree that it is important to determine the contribution of trailblazer cells towards metastatic dissemination. In this manuscript, we show that Vimentin expressing cells in a triple negative breast cancer (TNBC) PDX model disseminate to the lungs (Figure 3F). We have also shown that Vimentin expressing SUM159 breast cancer (BC) trailblazer cells spontaneously metastasize to the lungs in previous publications (Fig. 2–figure supplement 1C) and (Westcott et al, J Clin Invest, 2015, 10.1172/JCI77767 and Maine et al, Oncotarget, 2016, 10.18632/oncotarget.7408). Notably, the depletion of genes specifically expressed in trailblazer cells reduced spontaneous metastasis without significantly impinging on primary tumor growth (Westcott et al, J Clin Invest, 2015, 10.1172/JCI77767 and Maine et al, Oncotarget, 2016, 10.18632/oncotarget.7408). Our new results in Figure 5D show that Tgfβ activates genes that define the trailblazer state in the metastatic SUM159 trailblazer cell model. Thus, features of the Tgfβ regulated trailblazer program in the C3-TAg cells are active in the SUM159 trailblazer model of spontaneous metastasis. In addition, commonly employed BC cell line metastasis models, such as MDA-MB-231 derivatives, are highly mesenchymal (Fig. 2–figure supplement 1C) and (Kang et al, Cell, 2003, 10.1016/S1535-6108(03)00132-6 and Minn et al, Nature, 2005, 10.1038/nature03799, as examples).

  It is not technically feasible to establish a correlation between the relative invasion of the C3-TAg GEMM primary tumors and spontaneous metastasis. C3-TAg GEMM primary tumors develop rapidly and the mice must be euthanized prior to the detection of metastasis. This limitation of the model is mentioned in the Results section “Trailblazer cells are specified by Vimentin expression in basal-like breast cancer patient tumors”. The aggressive primary tumor growth and limited spontaneous metastasis of the C3-TAg model has also been previously reported by others (Green et al, Oncogene, 2000, 10.1038/sj.onc.1203280). Surgical resection of the original primary tumor is not a feasible option to allow metastases to form since additional tumors develop in multiple mammary glands.

  In response to reviewer requests, we initiated the growth of orthotopic primary tumors from control or Tgfβ treated 1339-org cells to address the relationship between induction of the trailblazer state and primary tumor cell dissemination. We had to euthanize the mice at day 34 (d34) because tumors within both cohorts had reached the maximum permitted diameter of 2 cm. This will be indicated in the Methods section with revised text. We detected CTCs from the mice bearing control and Tgfβ treated 1339-org cell tumors. However, no micrometastases were detected, which is indicated in the text describing Figure 4–figure supplement 3A-B. Thus, performing surgical resection in new experiments would not be expected to allow the later detection of metastasis, as there did not appear to be DTCs in the lungs that could initiate colonization. In addition, we would have to resect the tumors prior to d34 to successfully and humanely remove the primary tumors, further reducing the odds of metastases developing. We will continue our work to identify an experimental balance that permits sufficient primary tumor growth to initiate spontaneous metastasis. However, the time scale of resolving this technical challenge is uncertain and we believe that our published analysis of trailblazer cell metastasis and new findings here showing the dissemination of Vimentin expressing cells in a PDX model addresses the question of whether Vimentin expressing trailblazer cells metastasize.

  We agree that certain cell states induced by EMT programs can limit the proliferative potential of tumor cells. As described in the Introduction, we previously found that the induction of a trailblazer state in a subset of breast cancer cell line models triggers a collateral cost in fitness that limits the ability of trailblazer cells to initiate tumor growth (Westcott et al, Cancer Res, 2020, 10.1158/0008-5472.CAN-20-0014). The traits that distinguish trailblazer cells which are capable of tumor initiation and metastasis versus trailblazer cells with reduced fitness have begun to be delineated. Our prior report suggested that cells that were dependent on p63 for growth lost their proliferative capacity when converting to a trailblazer state (Westcott et al, Cancer Res, 2020, 10.1158/0008-5472.CAN-20-0014). C3-TAg cells are not dependent on p63 for growth, which is indicated by the vast majority of the tumor cells lacking p63 expression in primary tumors and primary tumor organoids (Westcott et al, Cancer Res, 2020, 10.1158/0008-5472.CAN-20-0014), similar to the metastatic SUM159 breast cancer cell line model. We were also able to derive clonal trailblazer cell lines that lacked detectable p63 expression from a C3-TAg tumor (Figure 2–figure supplement 1B) and grow organoids even when the limited extent of p63 expression was further reduced by Tgfβ (Figure 5C). Additionally, the persistent Tgfβ treated 1339-org cells, which were enriched for trailblazer cells and had reduced p63 expression, were capable of initiating primary tumor growth (Figure 4F). Together, these results indicate that C3-TAg trailblazer cells are capable of initiating metastatic colonization. However, given the heterogeneity in trailblazer states that we discovered, it is possible that a subset of trailblazer cell states have reduced proliferative capacity. Our analysis approach in this manuscript would not necessarily detect these low fitness trailblazer cells if they were a relatively small fraction of the total trailblazer population. We will clarify this point in the Discussion section in the revised manuscript. Our results have begun to reveal mechanisms for the transcriptional regulation of trailblazer cell heterogeneity. We plan to continue delineating the regulatory programs conferring specific transcription states, defining approaches for the prospective isolation of distinct trailblazer subpopulations and determining trailblazer subpopulation specific biomarkers to understand the specific contribution of distinct trailblazer subpopulations towards metastasis. Given the scope of this analysis, it is not feasible to incorporate these future studies into this manuscript.
  
  2) The invasion of cancer cells is dependent on 3D matrix composition. In other studies, collective cancer invasion is performed in exclusively collagen type 1 gels or in other instances entirely in 3D reconstituted basement membrane gel, e.g. lung cancer invasion studies. In this study, the authors use a mixture composed of both matrices. Given the invasion suppressive effects of matrigel, particularly for epithelial type cells, further studies would be important to determine whether the invasion phenotypes seen in this study are generalizable across matrix environments.

  The invasion of C3-TAg and PyMT organoids embedded in a 100% pure reconstituted basement membrane is shown in Fig. 1–figure supplement 1G. We will emphasize that trailblazer invasion was evaluated in multiple ECM compositions with revised text and figure graphic. We also provide images for the reviewer showing that C3-TAg organoids collectively invade in a pure Collagen I ECM. Importantly, these findings are consistent with our results showing that Vimentin expressing cells are associated with basal-like mammary tumor cell invasion in the complex ECM of C3-TAg GEMM primary tumors (Figure 2G) and patient primary tumors (Figure 3D). Moreover, Vimentin expressing cells disseminated to the lungs in the TNBC PDX that we evaluated (Figure 3F).

  The ECM composition selected for experiments is dictated by the experimental question(s) being addressed. It is unlikely that mammary tumor cells would only ever collectively invade through an ECM that is either pure Collagen I or pure reconstituted basement membrane (BM). Indeed, it has been proposed that mixtures of Collagen I and BM proteins best reconstitute the complexity of primary tumor ECM (Hooper et al, Methods Enzymol, 2006, 10.1016/S0076-6879(06)06049-6). In line with this observation, mixtures of Collagen I and BM proteins have been routinely used for the past 20 years to define mechanisms of 3D invasion (Xiang and Muthuswamy, Methods Enzymol, 2006, 10.1016/S0076-6879(06)06054-X; Calvo et al, Nat Cell Biol, 2013, 10.1038/ncb2756; and Kato et al, eLife, 2023, 10.7554/eLife.76520, as examples).
  
  Consistent with the known complexity of the ECM in the tumor microenvironment (TME), we detect Collagen I and Collagen IV (a key component of experimental BM) in the TME of primary breast cancer tumor models (Westcott et al, J Clin Invest, 2015, 10.1172/JCI77767). Importantly, we have found that a mixture of collagen I and experimentally derived BM proteins reliably reveals breast cancer trailblazer cell invasion mechanisms that promote the malignant progression and metastasis of primary tumors and whose expression correlates with poor patient outcome (Westcott et al, J Clin Invest, 2015, 10.1172/JCI77767 and Westcott et al, Cancer Res, 2020, 10.1158/0008-5472.CAN-20-0014, as examples). Notably, the relative differences in trailblazer and opportunist cell invasive phenotypes are not dictated by the ECM composition used in our 3D assays. We have previously tested the invasion of trailblazer and opportunist subpopulations in different ECM compositions using both spheroid vertical invasion assays (Westcott et al, J Clin Invest, 2015, 10.1172/JCI77767). Increasing collagen I concentration enhanced the relative rate of trailblazer cell invasion, with trailblazer cells always showing a significantly enhanced invasion relative to opportunist cells.

  The relationship between trailblazer and opportunist cells that we have detected in primary tumors is recapitulated when using mixtures of Collagen I and BM proteins in our past publications and in this manuscript. The clonal opportunist cell lines derived from a C3-TAg tumor expressed high levels of the transcription factor p63 (Figure 2–figure supplement 1A-B). We previously showed that p63 restricts induction of a trailblazer state in human breast cancer trailblazer cell lines (Westcott et al, Cancer Res, 2020, 10.1158/0008-5472.CAN-20-0014). Notably, we showed that p63 expressing C3-TAg cells were not able to initiate collective invasion in the same ECM composition used in our current manuscript. Moreover, p63 cells in primary C3-TAg tumors were noninvasive opportunist cells that were limited to trailing p63-low trailblazer cells when collectively invading in primary tumors and in organoids (Westcott et al, Cancer Res, 2020). We now show that p63 expressing opportunist cell lines are limited to invading behind primary C3-TAg trailblazer cells and trailblazer cell lines in our 3D invasion assays (Figure 1B and Figure 1–figure supplement 1D-E). Together, these results indicate that the ECM employed in our 3D assays reveals the mechanistic underpinnings of both trailblazer and opportunist cell invasion in primary tumors.
  
  With respect to lung cancer invasion, leader cells that we would classify as trailblazer cells have been isolated from 2 non-small cell lung cancer cell line spheroid models grown in pure reconstituted BM extract (Konen et al, Nat Comm, 2017, 10.1038/ncomms15078). However, it is unclear whether these cell line derived NSCLC trailblazer cells are more intrinsically invasive than non-trailblazer siblings in primary NSCLC tumors or if the traits associated with cell line NSCLC trailblazer cells are required for metastasis. These tests have never been reported to the best of our knowledge. Similarly, it is not clear whether these NSCLC cell line derived trailblazer cells reflect features of primary NSCLC tumor cells, as we are unaware of any such comparisons being reported. Thus, there is no reason to consider pure reconstituted BM to be an equivalent or preferred experimental option to define trailblazer cell features. Nevertheless, as we mentioned before, our discovery approach identifies trailblazer cells that are intrinsically more invasive than opportunist siblings across multiple ECM conditions, including pure reconstituted BM and, importantly, in primary tumors.
  
  3) TGF-beta is well known to induce EMT. Although this study identifies potential transcriptional mediators of the invasion/trailblazer program, is this program reversible?
  
  We have previously shown that breast cancer trailblazer cells can convert to an opportunist state, demonstrating that trailblazer states are reversible (Westcott et al, J Clin Invest, 2015, 10.1172/JCI77767). In this manuscript, we show that C3-TAg organoid lines derived in the Tgfbr1 inhibitor A83-01 have few if any cells with a trailblazer phenotype relative to C3-TAg primary tumors, suggesting a reversion of the trailblazer state (Fig. 4C and Figure 4–figure supplement 2A-C). However, our results do not entirely rule out the possibility that only non-trailblazer cells grew to establish the organoid lines. Indeed, the problem of tracing phenotypic conversions when evaluating heterogeneous populations is a systemic challenge that extends beyond our analysis of trailblazer cells. Clearly defining the conversion rates for trailblazer cells will require multiple genetic markers to distinguish the different trailblazer states we have now identified, in addition to phenotypic and molecular analysis over multiple days, or possibly weeks. Thus, further definition of the rate of reversion of different trailblazer cells is a worthy line of future investigation rather than a feasible objective of this study.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2020.11.14.383232v5
www.biorxiv.org www.biorxiv.org

New submission 29/06/2023, 12:32:52

1
1. Public_Reviews 30 Jun 2023
  
  in eLife
  
  Author Response:
  
  We thank the reviewers for their careful and overall positive assessment of our work.
  
  Reviewer #1 (Public Review):
  
  This paper describes the discovery, functional analysis and structure of TcaP, a protein encoded by the Vibrio phage satellite PLE that forms a size-determining scaffold around PLE procapsids made from helper phage ICP1 structural proteins. The system displays a fascinating similarity to the P2/P4 system, which had previously been unique in its use of a size-determining external scaffolding protein, Sid. The work is interesting, comprehensive and of high quality. The presentation could be improved as listed in the suggestions below.
  
  An interesting observation is that PLE appears to be dependent on small capsids for efficient transduction. This is not completely surprising if the element uses a cos site type mechanism for packaging, since this requires an integer number of genomes to be packaged when the capsid is full, and this might be more difficult to accomplish when the helper capsid is much larger than the satellite, as is the case with ICP1. The authors mention in a few places that this is the first known satellite to have this requirement. However, this is not quite correct: a similar defect was seen in phi12/SaPIbov5, where the large phi12 capsid was not quite the right size for either two or three copies of the wildtype ("unevolved") SaPIbov5 (Carpena et al. 2016).
  
  We thank the reviewer for bringing up this point. First, we agree that for cos type packaging systems, this would not be surprising. However, ICP1 is a pac type phage and we have evidence that PLE is also a pac rather than a cos type packaging satellite; therefore, PLE is the first headful satellite to show such a defect. For cos packaging elements, both SaPIbov5 and P4, non-integer genome lengths have been shown to pack less efficiently into capsids as pointed out above and shown in Carpena et al 2016 and Shore 1978. However, in both of these cases, the genomes were manipulated to change their size, suggesting that naturally occurring cos satellites maintain their genome sizes to be proportional to their capsid sizes or in integer proportion to their helper capsids. We will include a short summary of these previous findings in the main text to provide context for the rare decreases in transduction efficiency reported in the cos satellites.
  
  The authors present several micrographs showing capsids formed in the presence or absence of wildtype or mutant TcaP and CP (Fig. 1, Fig. 2, Fig. 3). However, each micrograph shows only a handful of particles of the "correct" size, in addition to a few shells that are aberrant or of a different size. I miss a more statistically rigorous enumeration of shells of different size (PLE or ICP1 sized, or different), empty vs. full, aberrant shells etc. This could be presented as a size distribution graph, a histogram or in table form.
  
  We thank the reviewer for this recommendation and agree that it would add to the manuscript. We will quantify these particles and present the data in the main text.
  
  In the abstract, the term "divergent satellite P4" is vague and unclear. Divergent from what? Probably they mean distinct from or unrelated to PLE. Please clarify.
  
  Yes, we did mean unrelated to PLE, and we will clarify in the text.
  
  How do they know that gp123 is a decoration protein? Was this previously determined, does it have (sequence) similarity to other known decoration proteins, or is it simply the most likely designation based on its position in the genome?
  
  Gp123 was annotated based on its position. While there is sequence similarity to other annotated Vibrio phages’ decoration proteins, we will clarify in the text that Gp123 is a putative decoration protein.
  
  Although the reconstruction and modeling statistics are good, it is difficult to assess the quality of the map and the model from the presented figures. Details of the density and FSC curves (half-map and model-to-map) should be shown. It is also difficult to see the TcaP structure and how it compares to Sid from the figures presented.
  
  We will address this concern in the revised manuscript.
  
  Introduction, Paragraph 3: "...which is the number of coat proteins divided by 60" is not strictly speaking the definition of T number. The T number corresponds to the number of subtriangles that one triangular face of the icosahedron is divided into. It corresponds to the number of coat proteins divided by 60 in the canonical case, but in tailed phages, 5 copies are removed to make way for the portal protein. (Other viruses could be described as having architecture corresponding to a specific T number, but with divergent numbers of subunits, e.g. adenoviruses or polyomaviruses.)
  
  We agree that our simplified explanation of the T number is not entirely accurate and will modify the sentence appropriately.
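The reviewer's distinction can be made concrete with a short sketch. It is an illustration only, not taken from the manuscript: the function names are hypothetical, and it assumes the standard Caspar-Klug relation T = h² + hk + k² together with the reviewer's statement that tailed phages lose 5 coat protein copies to make way for the portal.

```python
def t_number(h: int, k: int) -> int:
    """Triangulation number for icosahedral lattice indices (h, k)."""
    return h * h + h * k + k * k


def coat_protein_copies(t: int, has_portal: bool = False) -> int:
    """Coat protein count for a shell with triangulation number t.

    The canonical shell has 60 * t subunits. With has_portal=True, five
    copies are subtracted, as the reviewer describes for tailed phages.
    """
    copies = 60 * t
    return copies - 5 if has_portal else copies


# Canonical T = 7 shell versus the same architecture with a portal.
t7 = t_number(2, 1)
canonical = coat_protein_copies(t7)
with_portal = coat_protein_copies(t7, has_portal=True)
```

In this sketch the subunit count divided by 60 recovers T only in the canonical case, which is why the count alone is not a definition of the T number.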
  
  Reviewer #2 (Public Review):
  
  Phage satellites are fascinating elements that have evolved to hijack phages for induction, packaging, and transfer, promoting their widespread dissemination in nature. It is remarkable how different satellites use conserved strategies of parasitism, utilising unrelated proteins that perform similar roles in their cognate elements. In the current manuscript, Dr. Seed and coworkers elucidated the mechanism used by one family of satellites, the PLEs, to produce small capsids, a process that inhibits phage reproduction while increasing PLE transmission. The work is presented beautifully, and the results are astonishing. The authors identified the gene responsible for generating the small capsids, characterised its role in the PLE transfer and phage inhibition, and determined the structure of the PLE-sized small capsids. It is a truly impressive piece of work.
  
  We thank the reviewer for their positive evaluation of our work.
  
  Reviewer #3 (Public Review):
  
  The manuscript by Boyd and co-authors "A Vibrio cholerae viral satellite maximizes its spread and inhibits phage by remodelling hijacked phage coat proteins into small capsids" reports important results related to self-defending mechanisms that bacteria use against phages that infect them. It has been shown previously that bacteria produce phage-inducible chromosomal island-like elements (PLE) that encode proteins that are integrated into the bacterial genome. These proteins are used by bacteria to amend the phage capsids and to create phage-like particles (satellites) that move between cells and transfer the genetic material of PLE to other bacteria. This study highlights the interactions between a PLE-encoded protein, TcaP, and capsid proteins of the phage ICP1.
  
  The manuscript is well written, provides a lot of new information and the results are supported by biochemical analysis.
  
  We thank the reviewer for their supportive evaluation of our work.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.01.530633v1
www.biorxiv.org www.biorxiv.org

New submission 29/06/2023, 10:52:31

1
1. Public_Reviews 30 Jun 2023
  
  in eLife
  
  Author Response:
  
  We would like to thank the reviewers for their time in evaluating our manuscript. The reviewers provided constructive comments and suggested changes to improve our manuscript. The main comment was about the framing. We agree with the reviewers and will rewrite the manuscript to focus more on migration patterns than conservation. We will add and expand the paper's theoretical framework and include the studies and descriptions of migration patterns of individual species suggested by the reviewers. At the same time, some of the reviewers' comments (especially on the terms and suggestions for changing the title of the paper) are mutually exclusive. We will pay particular attention to this issue and improve the paper's theoretical basis.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.13.532370v2
www.biorxiv.org www.biorxiv.org

New submission 30/06/2023, 08:47:48

1
1. Public_Reviews 30 Jun 2023
  
  in eLife
  
  Author Response
  
  We are grateful for the constructive feedback and the possibility of further improving our manuscript in terms of quality and clarity. Below, we have prepared a brief answer to the points raised in the reviewers’ feedback. We plan to address all these issues fully in the revised version of the manuscript.
  
  We agree that some of our claims were overly enthusiastic. We will rewrite parts of the manuscript to tame our statements. Additionally, we are thankful for the comments on the use of language, which we will certainly apply while editing the manuscript. Below, we focus on the main comments.
  
  Both reviewers: We appreciate advice on possible confounding factors. We should note here that there is substantial evidence on the effects of alpha rhythm amplitude on the excitability of a neuronal network and, as a consequence, on the amplitude of evoked responses (Baumgarten et al., 2016 Cerebral Cortex; Iemi et al., 2017 eLife; Stephani et al., 2021 eLife). This effect is due to changing the gain for evoked responses, and it is quite different compared to the baseline-shift mechanism (BSM). In BSM, the changes in the amplitude of evoked responses occur due to the generation of an additional evoked response component, which we tried to reveal in our current work. Still, we agree with suggestions to test additional factors, such as earlier evoked responses, baseline window, and head size, and we will test those.
  
  Reviewer #2 Comment 2: Certainly, for low-density recordings, some method of data transformation is required. Here we would like to show our reasoning for why we did not use current-source density (CSD) but rather utilised other approaches. First, the CSD transform performs well for spatially localised activities since it is a spatial high-pass filter. In our case, P300 and alpha amplitude dynamics are fairly widespread with low spatial frequency, and we believe we would not benefit from applying CSD. Second, CSD has been shown to be more sensitive to surface sources in the crowns of gyri. For activity in the P300 window, we have no reason to believe that this is the case. Third, as we completely agree that the low-density montage is a limitation, we used source reconstruction with eLORETA (Fig. 5) to refine the spatial localisation of potential sources of P300 and alpha amplitude change.
  
  Reviewer #1 Comment 4: Our study is indeed based on a sample of older participants. However, in our previous work (Studenova et al., 2022), we compared young and elderly participants using resting-state data. There, we measured the baseline-shift index (BSI) and found that BSIs for elderly participants were lower than those for young participants. Despite this limitation, in the current study we were still able to detect a correspondence between BSIs and evoked responses in elderly participants. Therefore, we believe that for a sample of young participants, the results should not be different.
  
  Reviewer #2 Comment 4: We agree that mediation analysis will provide additional insights, and we will add it to the revised version of the manuscript.
  
  Overall, we found the reviewers' comments very helpful. We will update the manuscript accordingly.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.20.529191v3
www.biorxiv.org www.biorxiv.org

New submission 28/06/2023, 14:43:02

1
1. Public_Reviews 30 Jun 2023
  
  in eLife
  
  Author Response:
  
  We would like to thank the reviewers for their comments on the manuscript. The primary concern that they raised is that the imaging data are largely qualitative. This is a fair assessment, and we agree that a careful quantitative characterization of TF clustering with and without IDRs using high-resolution imaging would provide valuable insight that would extend our findings. Our goal for this study was to conduct a high-level survey of IDR localization, for which we believe a qualitative overview was sufficient. We hope that this work can serve as a useful foundation for future studies of the complex roles that IDRs play in TF function.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.27.534457v2
www.biorxiv.org www.biorxiv.org

New submission 29/06/2023, 08:31:12

1
1. Public_Reviews 30 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  1) Only one PITAR siRNA was tested in the majority of the experiments, which compromises the validity of the results. Some results are inconsistent. For example, Fig 2G indicates that PITAR siRNA caused G1 arrest. However, PITAR overexpression in the same cell line did not show any effect on cell cycle progression in Fig 5I.
  
  We thank the reviewer for this comment. Indeed, we have used two siRNAs in experiments related to Fig. 2C, 2D, and 2E. Keeping the reviewer’s comment in mind, we plan to reproduce the results of Fig. 2F, 2G, 2H, 2I, 5A, 5B, 5E, and supplementary Fig. 5A using an additional siRNA targeting PITAR.
  
  The reason that “PITAR silencing showed a robust G1 arrest, but PITAR overexpression failed to show any effect on the cell cycle profile” is as follows: since glioma cells overexpress PITAR (which keeps p53 suppressed), silencing PITAR (which will elevate p53 levels) in glioma cells will show a robust phenotype in the cell cycle profile (in the form of increased G1 arrest). In contrast, the overexpression of PITAR in glioma cells (which already have high levels of PITAR and hence drastically reduced p53 levels) is unlikely to show any significant change in the cell cycle profile. However, a phenotype for PITAR overexpression on the cell cycle profile can be shown in DNA-damaged glioma cells (DNA damage induces p53 levels). Indeed, we have done this experiment in Fig. 5L, which shows that the G2/M arrest (42.34%) induced by DNA damage is reduced significantly (by 19%) in the PITAR-overexpressed condition (34.42%). However, taking the reviewer's comments in the right spirit, we plan to repeat this experiment with appropriate modifications to arrive at a more robust phenotype for PITAR overexpression.
  
  2) The conclusion that PITAR inactivates p53 through regulating TRIM28, which is highlighted in the title of the manuscript, is not supported by convincing results. Although the authors showed that a PITAR siRNA increased while PITAR overexpression decreased p53 level, the siRNA only marginally increased the stability of p53 (Fig 5E). The p53 ubiquitination level was barely affected by PITAR overexpression in Fig 5F. To convincingly demonstrate that PITAR regulates p53 through TRIM28, the authors need to show that this regulation is impaired/compromised in TRIM28-knockout conditions. The authors only showed that TRIM28 overexpression suppressed PITAR siRNA-induced increase of p53, which is not sufficient. Note that only one cell line was investigated in Fig 5.
  
  To address this issue, we will overexpress PITAR in TRIM28 silenced cells to show the requirement of TRIM28 for PITAR to inhibit p53. In addition, we also plan to carry out PITAR silencing and overexpression experiments in another glioma cell line as recommended by the reviewer.
  
  3) Another major weakness of this manuscript is that the authors did not provide any evidence indicating that the glioblastoma-promoting activities of PITAR were mediated by its regulation of p53 or TRIM28 (Fig 6 and Fig 7). Thus, the regulation of glioblastoma growth and the regulation of TRIM28/p53 appear to be disconnected.
  
  We would like to respectfully disagree with the reviewer on this particular point. We have indeed provided the following evidence in the current version of the manuscript that the glioblastoma-promoting activities of PITAR are mediated by its regulation of p53 or TRIM28.
  
  A) In Fig. 6, we demonstrate that PITAR silencing-induced reduction in the neurosphere growth is accompanied by a reduction in TRIM28 RNA and an increase in the CDKN1A RNA without a change in p53 RNA levels. We also demonstrate that PITAR overexpression-induced neurosphere growth is accompanied by an increase in the TRIM28 RNA, and a decrease in CDKN1A RNA without a change in p53 RNA levels.
  
  B) To add strength to the above results, we plan to do western blot experiments under similar conditions to demonstrate the appropriate changes in TRIM28, p53, and CDKN1A levels. Also, we will do a TRIM28 rescue experiment in RG5 neurosphere cells.
  
  C) In supplementary Fig. 6 (related to Fig. 6), we show that PITAR silencing failed to decrease neurosphere growth in the mutant p53-containing GSC line (MGG8).
  
  D) In supplementary Fig. 7 (related to Fig. 6), we show that PITAR silencing failed to inhibit colony growth of p53-silenced U87 glioma cells (U87/shp53#1). We also show that while PITAR silencing decreased TRIM28 RNA levels in U87/shNT and U87/shp53#1 glioma cells, it failed to increase CDKN1A and MDM2 (p53 targets) at the RNA level.
  
  E) In Fig. 7, we show that the TRIM28 protein level is drastically reduced in small tumors formed by U87/siPITAR cells.
  
  F) In supplementary Fig. 8 (related to Fig. 7), we show that glioma tumors formed by U87/PITAR OE cells express high levels of TRIM28 protein but reduced levels of p21 protein.
  
  G) We also plan to do additional experiments, as described below, to demonstrate that the glioblastoma-promoting activities of PITAR are indeed mediated by its regulation of p53 or TRIM28. We will demonstrate the inability of PITAR overexpression to induce the growth of glioma tumors initiated by TRIM28-silenced U87 cells.
  
  4) It is not clear what kind of message the authors tried to deliver in Fig 7F/G. Based on the authors' hypothesis, DNA-damaging agents like TMZ would induce PITAR to inactivate p53, which would compromise TMZ's anti-cancer activity. However, the data show that TMZ was very effective in the inhibition of U87 growth. The authors may need to test whether PITAR downregulation, which would increase p53 activity, have any effects on TMZ-insensitive tumors. Such results are more therapeutically relevant.
  
  Reviewer #1 rightly pointed out that TMZ induces PITAR expression, which should compromise TMZ's anti-cancer activity. In addition, overexpression of PITAR also promotes glioma tumor growth. Figure 7F&G demonstrates the following two facts: (1) PITAR overexpression increases glioma tumor growth (Figure 7G, compare the red line with the blue line), and (2) PITAR-overexpressing glioma tumors are resistant to TMZ chemotherapy (Figure 7G, compare the pink line with the green line).
  
  In addition, in Figure 2I, we indeed show that PITAR-silenced cells are more sensitive to TMZ and Adriamycin chemotherapy.
  
  However, considering reviewers’ comments, we plan to repeat Figure 7A, combining TMZ chemotherapy and PITAR silencing to demonstrate that TMZ chemotherapy-induced PITAR indeed promotes chemo-resistance.
  
  5) Lastly, the model presented in Fig 7H is confusing. It is not clear what the exact role of PITAR in the DNA damage response is based on this model. If DNA damage would induce PITAR expression, this would lead to inactivation of p53 as revealed by this manuscript. However, DNA damage is known to activate p53. Do the authors want to imply that PITAR induction by DNA damage would help to bring down the p53 level at the end of DNA damage response? The presented data do not support this role unfortunately.
  
  We appreciate Reviewer #1's comments. Based on our model in Fig. 7H, we believe DNA damage-induced PITAR attenuates the DNA damage response by increasing TRIM28 protein levels. TRIM28 ubiquitinates p53 in an MDM2-dependent manner (Wang et al., 2005). Based on this, we hypothesised that PITAR-induced TRIM28 also contributes to the MDM2-mediated ending of the DNA damage response.
  
  Considering the reviewers' comments, we plan to do the following experiment.
  
  The kinetics of p53, TRIM28, p21, MDM2 protein levels, and PITAR RNA levels after DNA damage will be monitored in PITAR-silenced conditions. It is known that reduction in the DNA damage-induced p53 levels coincides with high levels of MDM2 accumulation. We believe that in PITAR-silenced cells, p53 levels will remain high for a longer time compared to control cells because of the lack of PITAR-induced TRIM28-mediated degradation of p53.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.04.11.536370v1
psyarxiv.com psyarxiv.com

New submission 30/06/2023, 09:25:18

1
1. Public_Reviews 30 Jun 2023
 
 in eLife
 
 Author Response:
 
 Reviewer #1 (Public Review):
 
 […] The major strength of the study is the elegant and well-powered data set. Longitudinal data on this scale is very difficult to collect, especially with patient cohorts, so this approach represents an exciting breakthrough. Analysis is straightforward and clearly presented. However, no multiple comparison correction is applied despite many different tests. While in general I am not convinced of the argument in the citation provided to justify this, I think in this case the key results are not borderline (p<0.001) and many of the key effects are replications, so there are not so many novel/exploratory hypotheses and in my opinion the results are convincing and robust as they are. The supplemental material is a comprehensive description of the data set, which is a useful resource.
 
 The authors achieved their aims, and the results clearly support the conclusion that the AD and mean confidence in a perceptual task covary longitudinally. I think this study provides an important impact to the project of computational psychiatry. Specifically, it shows that the relationship between transdiagnostic symptom dimensions and behaviour is meaningful within as well as across individuals.
 
 Response: We thank the reviewer for their appraisal of our paper and positive feedback on the main manuscript and supplementary information. We agree with the reviewer that the lack of multiple comparison corrections can also be justified by the key findings being replications and not of borderline significance. We have added this additional justification to the manuscript (Methods, Statistical Analyses, page 15, line 568: “Adjustments for multiple comparisons were not conducted for analyses of replicated effects”).
 
 Reviewer #2 (Public Review):
 
 […] The major strength and contribution of this study is the use of a longitudinal intervention design, allowing the investigation of how the well-established link between underconfidence and anxious-depressive symptoms changes after treatment. Furthermore, the large sample size of the iCBT group is commendable. The authors employed well-established measures of metacognition and clinical symptoms, used appropriate analyses, and thoroughly examined the specificity of the observed effects.
 
 However, due to the small effect sizes, the antidepressant and control groups were underpowered, reducing comparability between interventions and the generalizability of the results. The lack of interaction effect with treatment makes it harder to interpret the observed differences in confidence, and practice effects could conceivably account for part of the difference. Finally, it was not completely clear to me why, in the exploratory analyses, the authors looked at the interaction of time and symptom change (and group), since time is already included in the symptom change index.
 
 Response: We thank the reviewer for their succinct summary of the main results and strengths of our study. We apologise for the confusion in how we described that analysis. We examine state-dependence, i.e., the relationship between symptom change and metacognition change, in two ways in the paper, perhaps somewhat redundantly: (1) by correlating change indices for both measures (e.g., as plotted in Figure 3D) and (2) by doing a very similar regression-based repeated-measures analysis, i.e., mean confidence ~ time*anxious-depression score change, where mean confidence is entered with two datapoints, one for pre- and one for post-treatment (i.e., within-person), and anxious-depression change is a single value per person (between-person change score). This allowed us to test if those with the biggest change in depression had a larger effect of time on confidence. This has been added to the paper for clarification (Methods, Statistical Analysis, page 14, line 553-559: “To determine the association between change in confidence and change in anxious-depression, we used (1) Pearson correlation analysis to correlate change indices for both measures and, (2) regression-based repeated-measures analysis: mean confidence ~ time*anxious-depression score change, where mean confidence is entered with two datapoints (one for pre- and one for post-treatment i.e., within-person) and anxious-depression change is a single value per person (between-person change score)”).
 
 The analyses have also been reported as regression in the Results for consistency (Treatment Findings: iCBT, page 5, line 197-204: ‘To test if changes in confidence from baseline to follow-up scaled with changes in anxious-depression, we ran a repeated measure regression analyses with per-person changes in anxious-depression as an additional independent variable. We found this was the case, evidenced by a significant interaction effect of time and change in anxious-depression on confidence (b=-0.12, SE=0.04, p=0.002)… This was similarly evident in a simple correlation between change in confidence and change in anxious-depression (r(647)=-0.12, p=0.002)”).
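The near-redundancy of the two analyses described above can be sketched as follows. This is a minimal illustration and not the authors' code. It assumes plain ordinary least squares with time coded 0/1 (the manuscript's exact model specification is not given here), and the toy data and function names are hypothetical. Under those assumptions, the time × anxious-depression-change interaction coefficient equals the slope of confidence change on anxious-depression change, and it has the same sign as the Pearson correlation between the two change scores.

```python
from statistics import mean


def ols_slope(x, y):
    """Slope of the least-squares line of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    return sxy / sxx


def pearson_r(x, y):
    """Pearson correlation coefficient between x and y."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    return sxy / (sxx * syy) ** 0.5


# Hypothetical toy data: one anxious-depression change score per person
# (between-person) and two confidence values per person (within-person).
ad_change = [-1.0, -0.5, 0.0, 0.5, 1.0]
conf_pre = [3.0, 3.2, 3.1, 2.9, 3.0]
conf_post = [3.6, 3.5, 3.3, 3.0, 2.9]

# Analysis (1): correlate the two change indices.
conf_change = [post - pre for pre, post in zip(conf_pre, conf_post)]
r = pearson_r(ad_change, conf_change)

# Analysis (2): in confidence ~ time * ad_change fitted by OLS with time
# coded 0/1, the interaction term is the post-treatment slope minus the
# pre-treatment slope.
interaction = ols_slope(ad_change, conf_post) - ols_slope(ad_change, conf_pre)
change_slope = ols_slope(ad_change, conf_change)
```

With only two timepoints per person, both approaches ask whether people with larger symptom change show larger confidence change, so a shared sign and similar strength is the expected pattern.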
 
 This longitudinal study informs the field of metacognition in mental health about the changeability of biases in confidence. It advances our understanding of the link between anxiety-depression and underconfidence consistently found in cross-sectional studies. The small effects, however, call the clinical relevance of the findings into question. I would have found it useful to read more in the discussion about the implications of the findings (e.g., why is it important to know that the confidence bias is state-dependent; given the effect size of the association between changes in confidence and symptoms, is the state-trait dichotomy the right framework for interpreting these results; suggestions for follow-up studies to better understand the association).
 
 Response: Thank you for this comment. We have elaborated on the implications of our findings in the Discussion, including the relevance of the state-trait dichotomy to future research and how more intensive, repeated testing may inform our understanding of the state-like nature of metacognition (Discussion, Limitations and Future Directions, page 10, line 378-380: “More intensive, repeating testing in future studies may also reveal the temporal window at which metacognition has the propensity to change, which could be more momentary in nature.”).
 
 Reviewer #3 (Public Review):
 
 […] I think these findings are exciting because they directly relate to one of the big assumptions when relating cognition to mental health - are we measuring something that changes with treatment (is malleable), so might be mechanistically relevant, or even useful as a biomarker?
 
 This work is also useful in that it replicates a finding of heightened confidence in those with compulsivity, and lowered confidence in those with elevated anxious-depression.
 
 One caveat to the interest of this work is that it doesn't allow any causal conclusions to be drawn, and only measures two timepoints, so it's hard to tell if changes in confidence might drive treatment effects (but this would be another study). The authors do mention this in the limitations section of the paper.
 
 Another caveat is the small sample in the antidepressant group.
 
 Some thoughts I had whilst reading this paper: to what extent should we be confident that the changes are not purely due to practice? I appreciate there is a relationship between improvement in symptoms and confidence in the iCBT group, but this doesn't completely rule out a practice effect (for instance, you can imagine a scenario in which those whose symptoms have improved are more likely to benefit from previously having practiced the task).
 
 Response: We thank the reviewer for commenting on the implications of our findings and we agree with the caveats listed. We thank the reviewer for raising this point about practice effects. A key thing to note is that this task does not have a learning element with respect to the core perceptual judgement (i.e., accuracy), which is the target of the confidence judgment itself. While there is a possibility of increased familiarity with the task instructions and procedures with repeated testing, the task is designed to adjust the difficulty to account for any improvements, so accuracy is stable. We see that we may not have made this clear in some of our language around accuracy vs. perceptual difficulty and have edited the Results to make this distinction clearer (Treatment Findings: iCBT, pages 4-5, lines 184-189: “Although overall accuracy remained stable due to the staircasing procedure, participants’ ability to detect differences between the visual stimuli improved. This was reflected as the overall increase in task difficulty to maintain the accuracy rates from baseline (dot difference: M=41.82, SD=11.61) to follow-up (dot difference: M=39.80, SD=12.62), (b=-2.02, SE=0.44, p<0.001, r²=0.01)”.)
 
 However, it is true that there can be a ‘practice’ effect in the sense that one may feel more confident (despite the same accuracy level) due to familiarity with a task. One reason we do not subscribe to the proposed explanation for the link between anxious-depression change and confidence change is that the other major aspect of behaviour that improved with practice did so in a manner unrelated to clinical change. As noted above in the quoted text, participants’ discrimination improved from baseline to follow-up, reflected in the need for a higher difficulty level to maintain accuracy around 70%. Crucially, this was not associated with symptom change. This speaks against a general mechanism where symptom improvement leads to increased practice effects in general. Only changes in confidence specifically are associated with improved symptoms. We have provided more detail on this in the Discussion (page 9, lines 324-326: “This association with clinical improvements was specific to metacognitive changes, and not changes in task performance, suggesting that changes in confidence do not merely reflect greater task familiarity at follow-up.”).
 
 Relatedly, to what extent is there a role for general task engagement in these findings? The paper might be strengthened by some kind of control analysis, perhaps using (as a proxy for engagement) the data collected about those who missed catch questions in the questionnaires.
 
 Response: Thank you for your comment. We included the details of data quality checks in the Supplement. Given the small number of participants that failed more than one attention check (1% of the iCBT arm) and that all those participants passed the task exclusion criteria, we made the decision to retain these individuals for analyses. We have since examined whether excluding this small number of individuals impacts our findings. Excluding those that failed more than one catch item did not affect the significance of results, which has now been added to the Supplementary Information (Data Quality Checks: Task and Clinical Scales, page 5, lines 181-185: “Additionally, excluding those that failed more than one catch item in the iCBT arm did not affect the significance of results, including the change in confidence (b=0.16, SE=0.02, p<0.001), change in anxious-depression (b=-0.32, SE=0.03, p<0.001), and the association between change in confidence and change in anxious-depression (r(638)=-0.10, p=0.011)”).
 
 I was also unclear what the findings about task difficulty might mean. Are confidence changes purely secondary to improvements in task performance generally - so confidence might not actually be 'interesting' as a construct in itself? The authors could have commented more on this issue in the discussion.
 
 Response: Thank you for this comment and sorry it was not clear in the original paper. As we discussed in a prior reply, accuracy (i.e., the proportion of correct selections, which is the target of confidence judgements) is different from the difficulty of the dot discrimination task that each person receives on a given trial. We had provided more details on task difficulty in the Supplement. Accuracy was tightly controlled in this task using a ‘two-down one-up’ staircase procedure, in which equally sized changes in dot difference occurred after each incorrect response and after two consecutive correct responses. The task is more difficult when the dot difference between stimuli is lower, and less difficult when the dot difference between stimuli is greater. Therefore, task difficulty refers to the average dot difference between stimuli across trials. Crucially, task accuracy did not change from baseline to follow-up, only task difficulty. Moreover, changes in task difficulty were not associated with changes in anxious-depression, while changes in confidence were, indicating that confidence is the clinically relevant construct for change in symptoms.
 
 We appreciate that this may not have been clear from the description in the main manuscript, and have added more detail on task difficulty to the Methods (Metacognition Task, page 14, lines 540-542: “Task difficulty was measured as the mean dot difference across trials, where more difficult trials had a lower dot difference between stimuli.”) and Results (Treatment Findings: iCBT, pages 4-5, lines 184-186: “Although overall accuracy remained stable due to the staircasing procedure, participants’ ability to detect differences between the visual stimuli improved.”). We have also elaborated more on how improvements in symptoms are associated with change in confidence, not task performance in the Discussion (page 9, lines 324-326: “This association with clinical improvements was specific to metacognitive changes, and not changes in task performance, suggesting that changes in confidence do not merely reflect greater task familiarity at follow-up”).
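 The distinction between accuracy and difficulty under a ‘two-down one-up’ staircase can be illustrated with a small simulation. The sketch below is not the task code used in the study: the function name `run_staircase`, the simulated observer, the starting dot difference, and the step size are all hypothetical choices made for illustration. It relies only on the standard property that a two-down one-up rule settles where the probability of two consecutive correct responses is 0.5, i.e. at roughly 70.7% accuracy.

```python
import random


def run_staircase(threshold, n_trials=2000, start=50.0, step=1.0, seed=0):
    """Simulate a two-down one-up staircase for a hypothetical observer.

    `threshold` controls how quickly the simulated observer's accuracy
    rises with dot difference (smaller = better discrimination).
    Returns (mean accuracy, mean dot difference) across all trials.
    """
    rng = random.Random(seed)
    dot_diff, streak = start, 0
    n_correct, level_sum = 0, 0.0
    for _ in range(n_trials):
        # Assumed observer model: chance (0.5) at zero dot difference,
        # approaching 1.0 as the dot difference grows.
        p_correct = 1.0 - 0.5 * 2 ** (-dot_diff / threshold)
        correct = rng.random() < p_correct
        n_correct += correct
        level_sum += dot_diff
        if correct:
            streak += 1
            if streak == 2:
                # Two consecutive correct responses: make the task harder.
                dot_diff = max(1.0, dot_diff - step)
                streak = 0
        else:
            # Any incorrect response: make the task easier.
            dot_diff += step
            streak = 0
    return n_correct / n_trials, level_sum / n_trials


# Two simulated observers who differ only in perceptual ability.
acc_weak, diff_weak = run_staircase(threshold=40.0)
acc_strong, diff_strong = run_staircase(threshold=20.0)
print(f"weaker observer:   accuracy={acc_weak:.2f}, mean dot difference={diff_weak:.1f}")
print(f"stronger observer: accuracy={acc_strong:.2f}, mean dot difference={diff_strong:.1f}")
```

 Under these assumptions both simulated observers end up with similar accuracy, while the better discriminator is tracked to a lower mean dot difference. This is the sense in which an improvement in perceptual ability shows up as a change in task difficulty rather than in accuracy.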
 
 To make code more reproducible, the authors could have produced an R notebook that could be opened in the browser without someone downloading the data, so they could get a sense of the analyses without fully reproducing them.
 
 Response: Thank you for your comment. We appreciate that an R notebook would be even better than how we currently share the data and code. While we will consider using Notebooks in future, we checked and converting our existing R script library into R Notebooks would require a considerable amount of reconfiguration that we cannot devote the time to right now. We hope that nonetheless the commitment to open science is clear in the extensive code base, commenting and data access we are making available to readers.
 
 Rather than reporting full study details in another publication I would have found it useful if all relevant information was included in a supplement (though it seems much of it is). This avoids situations where the other publication is inaccessible (due to different access regimes) and minimises barriers for people to fully understand the reported data.
 
 Response: We agree this is good practice – the Precision in Psychiatry study is very large, with many irrelevant components with respect to the present study (Lee et al., BMC Psychiatry, 2023). For this reason, we tried to provide all that was necessary and only refer to the Precision in Psychiatry study methods for fine-grained detail. Upon review, the only thing we think we omitted that is relevant is information on ethical approval in the manuscript, which we have now added (Methods, Participants, page 11, lines 412-417: “Further details of the PIP study procedures that are not specific to this study can be found in a prior publication (21). Ethical approval for the PIP study was obtained from the Research Ethics Committee of School of Psychology, Trinity College Dublin and the Northwest-Greater Manchester West Research Ethics Committee of the National Health Service, Health Research Authority and Health and Care Research Wales”). If any further information is lacking, we are happy to include it here also.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

psyarxiv.com/uk7hr
www.biorxiv.org www.biorxiv.org

New submission 26/05/2023, 09:44:58

1
1. Public_Reviews 29 Jun 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public Review):
 
 She et al studied the evolution of gene expression reaction norms when individuals colonise a new environment that exposes them to physiologically challenging conditions. Their objective was to test the "plasticity first" hypothesis, which suggests that traits that are already plastic (their value changes when facing a new environment compared to the original environment) facilitate the colonisation of novel environments, which, if true, would be predicted to result in the evolution of gene expression values that are similar in the population that colonised the new environment and evolved under these particular selection pressures. To test this prediction, they studied gene expression in cardiac and muscle tissues in individuals originating from three conditions: lowland individuals in their natural environment (ancestral state), lowland individuals exposed to hypoxia (the plastic response state), and a highland population facing hypoxia for several generations (the coloniser state). They classified gene expression patterns as maladaptive or adaptive in lowland individuals responding to short term hypoxia by classifying gene expression patterns using genes that differed between the ancestral state (lowland) and colonised state (highland). Genes expressed in the same direction in lowland individuals facing hypoxia (the plastic state) as what is found in the colonised state are defined as adaptive, while genes with the opposite expression pattern were labelled as maladaptive, using the assumption that the colonised state must represent the result of natural selection. Furthermore, genes could be classified as representing reversion plasticity when the expression pattern differed between the plasticity and colonised states and as reinforcement when they were in the same direction (for example more expressed in the plastic state and the colonised state than in the ancestral state).
They found that more genes had a plastic expression pattern that was labelled as maladaptive than adaptive. Therefore, some of the genes have an expression pattern in accordance with what would be predicted based on the plasticity-first hypothesis, while others do not.
 
 Thank you for a precise summary of our work. We appreciate the very encouraging comments recognizing the value of our work. We have addressed concerns from the reviewer in greater detail below.
 
 Q1. As pointed out by the authors themselves, the fact that temperature was not included as a variable, which would make the experimental design much more complex, misses the opportunity to more accurately reflect the environmental conditions that the colonizer individuals face at high altitude. Also pointed out by the authors, the acclimation experiment in hypoxia lasted 4 weeks. It is possible that longer term effects would be identifiable in gene expression in the lowland individuals facing hypoxia on a longer time scale. Furthermore, a sample size of 3 or 4 individuals per group depending on the tissue for wild individuals may miss some of the natural variation present in these populations. Stating that they have a n=7 for the plastic stage and n= 14 for the ancestral and colonized stages refers to the total number of tissue samples and not the number of individuals, according to supplementary table 1.
 
 We share the same concerns as the reviewer. This is partly because it is quite challenging to bring wild birds into captivity to conduct the hypoxia acclimation experiments. We had to work hard to perform acclimation experiments by keeping lowland sparrows in hypoxic conditions for a month. We have indeed recognized the same set of limitations that the reviewer pointed out and have discussed them in the study, i.e., considering the hypoxic condition alone, the short acclimation period, etc. Regarding sample sizes, we have collected cardiac muscle from nine individuals (three individuals for each stage) and flight muscle from 12 individuals (four individuals for each stage). We have clarified this in Supplementary Table 1.
 
 Q2. Finally, I could not find a statement indicating that the lowland individuals placed in hypoxia (plastic stage) were from the same population as the lowland individuals for which transcriptomic data was already available, used as the "ancestral state" group (which themselves seem to come from 3 populations Qinhuangdao, Beijing, and Tianjin, according to supplementary table 2) nor if they were sampled in the same time of year (pre reproduction, during breeding, after, or if they were juveniles, proportion of males or females, etc). These two aspects could affect both gene expression (through neutral or adaptive genetic variation among lowland populations that can affect gene expression, or environmental effects other than hypoxia that differ in these populations' environments or because of their sexes or age). This could potentially also affect the FST analysis done by the authors, which they use to claim that strong selective pressure acted on the expression level of some of the genes in the colonised group.
 
 The reviewer asked how individual tree sparrows used in the transcriptomic analyses were collected. The individuals used for the hypoxia acclimation experiment and those representing the ancestral lowland population were collected from the same locality (Beijing) and in the same season (i.e., pre-breeding) of the year. They are all adults and weigh approximately 18 g. We have clarified this in the Supplementary Table S1 and Methods. We did not distinguish males from females (both sexes look similar) under the assumption that both sexes respond similarly to hypoxia acclimation in their cardiac and flight muscle gene expression.
 
 Supplementary Table 2 lists the individuals that were used for sequence analyses. These individuals were only used for sequence comparisons but not for the transcriptomic analyses. The population genetic structure analyzed in a previously published study showed that there is no clear genetic divergence within the lowland population (i.e., individuals collected from Beijing, Tianjin and Qinhuangdao) or the highland population (i.e., Gangcha and Qinghai Lake). In addition, there was no clear genetic divergence between the highland and lowland populations (Qu et al. 2020).
 
 Q4. Impact of the work
 
 There has been work showing that populations adapted to high altitude environments show changes in their hypoxia response that differ from the short-term acclimation response of lowland populations of the same species. For example, in humans, see Erzurum et al. 2007 and Peng et al. 2017, where they show that the hypoxia response cascade, which starts with the gene HIF (Hypoxia-Inducible Factor) and includes the EPO gene, which codes for erythropoietin, which in turn activates the production of red blood cells, is LESS activated in high altitude individuals compared to the activation level in lowland individuals (which gives it its name). The present work adds to this body of knowledge showing that the short-term response to hypoxia and the long term one can affect different pathways and that acclimation/plasticity does not always predict what physiological traits will evolve in populations that colonize these environments over many generations and additional selection pressure (UV exposure, temperature, nutrient availability). Altogether, this work provides new information on the evolution of reaction norms of genes associated with the physiological response to one of the main environmental variables that affects almost all animals, oxygen availability. It also provides an interesting model system to study this type of question further in a natural population of homeotherms.
 
 Erzurum, S. C., S. Ghosh, A. J. Janocha, W. Xu, S. Bauer, N. S. Bryan, J. Tejero et al. "Higher blood flow and circulating NO products offset high-altitude hypoxia among Tibetans." Proceedings of the National Academy of Sciences 104, no. 45 (2007): 17593-17598.
 
 Peng, Y., C. Cui, Y. He, Ouzhuluobu, H. Zhang, D. Yang, Q. Zhang, Bianbazhuoma, L. Yang, Y. He, et al. 2017. Down-regulation of EPAS1 transcription and genetic adaptation of Tibetans to high-altitude hypoxia. Molecular biology and evolution 34:818-830.
 
 Thank you for highlighting the potential novelty of our work in light of the broader field. We found it very interesting to discuss our results (from a bird species) together with similar findings from humans. In the revised version of the manuscript, we have discussed the short-term acclimation response and long-term adaptive evolution to a high-elevation environment, as well as how our work contributes to understanding the relative roles of short-term plasticity and long-term adaptation. We appreciate the two important works pointed out by the reviewer and we have also cited them in the revised version of the manuscript.
 
 Reviewer #2 (Public Review):
 
 This is a well-written paper using gene expression in tree sparrow as model traits to distinguish between genetic effects that either reinforce or reverse initial plastic response to environmental changes. Tree sparrow tissues (cardiac and flight muscle) collected in lowland populations subject to hypoxia treatment were profiled for gene expression and compared with previously collected data in 1) highland birds; 2) lowland birds under normal condition to test for differences in directions of changes between initial plastic response and subsequent colonized response. The question is an important and interesting one but I have several major concerns on experimental design and interpretations.
 
 Thank you for a precise summary of our work and constructive comments to improve this study. We have addressed your concerns in greater detail below.
 
 Q1. The datasets consist of two sources of data. The hypoxia treated birds collected from the current study and highland and lowland birds in their respective native environment from a previous study. This creates a complete confounding between the hypoxia treatment and experimental batches that it is impossible to draw any conclusions. The sample size is relatively small. Basically correlation among tens of thousands of genes was computed based on merely 12 or 9 samples.
 
 We appreciate the critical comments from the reviewer. The reviewer raised the concerns about the batch effect from birds collected from the previous study and this study. There is an important detail we didn’t describe in the previous version. All tissues from hypoxia acclimated birds and highland and lowland birds have been collected at the same time (i.e., Qu et al. 2020). RNA library construction and sequencing of these samples were also conducted at the same time, although only the transcriptomic data of lowland and highland tree sparrows were included in Qu et al. (2020). The data from acclimated birds have not been published before.
 
 In the revised version of the manuscript, we also compared log-transformed transcripts per million (TPM) across all genes and determined the most conserved genes (i.e., coefficient of variation ≤ 0.3 and average TPM ≥ 1 for each sample) for the flight and cardiac muscles, respectively (Hao et al. 2023). We compared the median expression levels of these conserved genes and found no difference among the lowland, hypoxia-exposed lowland, and highland tree sparrows (Wilcoxon signed-rank test, P<0.05). As these results suggested little batch effect on the transcriptomic data, we used TPM values to calculate gene expression level and intensity. This methodological detail has been further clarified in the Methods and we also provided a new supplementary Figure (Figure S5) to show the comparative results.
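 The conserved-gene filter described above can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: the function name `conserved_gene_mask` and the toy matrix are hypothetical, the coefficient of variation is computed on untransformed TPM, and "average TPM ≥ 1 for each sample" is read here as requiring TPM ≥ 1 in every sample. Both of these are assumptions about an ambiguous description.

```python
import numpy as np


def conserved_gene_mask(tpm, max_cv=0.3, min_tpm=1.0):
    """Flag conserved genes in a genes x samples TPM matrix.

    A gene passes if its coefficient of variation (sample SD / mean)
    across samples is <= max_cv and its TPM is >= min_tpm in every sample.
    """
    tpm = np.asarray(tpm, dtype=float)
    mean = tpm.mean(axis=1)
    sd = tpm.std(axis=1, ddof=1)
    # Genes with zero mean get an infinite CV so that they never pass.
    cv = np.divide(sd, mean, out=np.full_like(mean, np.inf), where=mean > 0)
    return (cv <= max_cv) & (tpm >= min_tpm).all(axis=1)


# Toy matrix: 4 genes (rows) x 4 samples (columns), values are made up.
toy_tpm = [
    [10.0, 11.0, 9.0, 10.0],     # stable and expressed
    [5.0, 50.0, 1.0, 20.0],      # highly variable
    [0.5, 0.6, 0.5, 0.4],        # stable but below the TPM floor
    [100.0, 120.0, 90.0, 110.0], # stable and expressed
]
mask = conserved_gene_mask(toy_tpm)
print(mask)

# Median log-transformed expression of the conserved genes in each sample,
# which could then be compared between groups of samples.
medians = np.median(np.log2(np.asarray(toy_tpm)[mask] + 1.0), axis=0)
print(medians)
```

 If the per-sample medians of such conserved genes are similar across the lowland, hypoxia-exposed, and highland groups, that is consistent with little systematic shift between groups, which is the logic of the check described in the response.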
 
 The reviewer also raised the issue of sample size. We certainly would have liked to have more individuals in the study, but this was not possible due to the logistical problem of keeping wild birds in a common garden experiment for a long time. We have acknowledged this in the manuscript. To mitigate this, we have tested the hypothesis of plasticity followed by genetic change using two different tissues (cardiac and flight muscles) and two different datasets (co-expressed gene-set and muscle-associated gene-set). As all these analyses show similar results, they indicate that the main conclusion drawn from this study is robust.
 
 Q2. Genes are classified into two classes (reversion and reinforcement) based on arbitrarily chosen thresholds. More "reversion" genes are found and this was taken as evidence reversal is more prominent. However, a trivial explanation is that genes must be expressed within a certain range and those plastic changes simply have more space to reverse direction rather than having any biological reason to do so.
 
 Thank you for the critical comments. Two questions were raised, which we would like to address separately. The first concern centered on the issue of arbitrarily chosen thresholds. In our manuscript, we used a range of thresholds, i.e., 50%, 100%, 150% and 200% of change in the gene expression levels of the ancestral lowland tree sparrow to detect genes with reinforcement and reversion plasticity. With this design we wanted to explore the magnitudes of gene expression plasticity (e.g., Ho & Zhang 2018), and whether the strength of selection (i.e., genetic variation) changes with the magnitude of gene expression plasticity (e.g., Campbell-Staton et al. 2021).
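 A threshold-based classification of this kind can be sketched as below. The sketch assumes the convention of Ho & Zhang (2018), in which the plastic change is the difference between the plastic and ancestral expression levels, the genetic change is the difference between the colonised and plastic levels, and both must exceed a fraction of the ancestral level. The function name `classify_gene`, the variable names, and the example values are hypothetical, and the exact rule used in the manuscript may differ.

```python
def classify_gene(l_o, l_p, l_a, threshold=0.5):
    """Classify one gene as reinforcement, reversion, or unclassified.

    l_o: expression in the ancestral (lowland, normoxia) state
    l_p: expression in the plastic (lowland, hypoxia) state
    l_a: expression in the colonised (highland) state
    threshold: minimum change, as a fraction of l_o (0.5 = 50%)
    """
    plastic_change = l_p - l_o
    genetic_change = l_a - l_p
    cutoff = threshold * l_o
    # Both changes must be large enough to count as a real shift.
    if abs(plastic_change) <= cutoff or abs(genetic_change) <= cutoff:
        return "unclassified"
    # Same direction: evolution reinforces the plastic response.
    # Opposite direction: evolution reverses it.
    if plastic_change * genetic_change > 0:
        return "reinforcement"
    return "reversion"


# Made-up expression levels for three genes.
for levels in [(10.0, 20.0, 35.0), (10.0, 20.0, 11.0), (10.0, 12.0, 30.0)]:
    print(levels, classify_gene(*levels))

# The same gene can fall out of a category under a stricter threshold.
print(classify_gene(10.0, 20.0, 11.0, threshold=1.5))
```

 Running the classification at several thresholds, as in the last line, shows why a range of cutoffs is informative: a gene with a modest plastic change is counted at 50% but drops out at 150%.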
 
 As the reviewer pointed out, we now realize that this threshold selection is arbitrary. We have thus implemented two other categorization schemes to test the robustness of the observation of unequal proportions of genes with reinforcement and reversion plasticity. Specifically, we used a parametric bootstrap procedure as described in Ho & Zhang (2019), which aims to identify genes whose classification results from genuine differences rather than random sampling errors. Bootstrap results suggested that genes exhibiting reversing plasticity significantly outnumber those exhibiting reinforcing plasticity, suggesting that our inference of an excess of genes with reversion plasticity is robust to random sampling errors. We have added these analyses to the revised version of the manuscript, and provided results in Figure 2d and Figure 3d.
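 The general idea of a parametric bootstrap for this classification can be sketched as follows. This is an illustration of the technique under stated assumptions, not a reproduction of the procedure in Ho & Zhang (2019) or in the manuscript: the normal sampling model, the 1,000 replicates, the 95% agreement rule, and the function name `bootstrap_class` are all assumptions chosen for the example.

```python
import numpy as np


def bootstrap_class(means, ses, n_boot=1000, agree=0.95, seed=0):
    """Classify a gene only if most bootstrap replicates agree.

    means, ses: (ancestral, plastic, colonised) mean expression levels
    and their standard errors. Each replicate draws the three levels
    from normal distributions (an assumed sampling model) and checks
    whether the plastic and genetic changes share a sign.
    """
    rng = np.random.default_rng(seed)
    l_o, l_p, l_a = (rng.normal(m, s, n_boot) for m, s in zip(means, ses))
    plastic_change = l_p - l_o
    genetic_change = l_a - l_p
    product = plastic_change * genetic_change
    if np.mean(product > 0) >= agree:
        return "reinforcement"
    if np.mean(product < 0) >= agree:
        return "reversion"
    return "unclassified"


# Made-up examples: two clear-cut genes and one dominated by noise.
print(bootstrap_class((10.0, 20.0, 30.0), (0.5, 0.5, 0.5)))
print(bootstrap_class((10.0, 20.0, 11.0), (0.5, 0.5, 0.5)))
print(bootstrap_class((10.0, 10.1, 10.2), (1.0, 1.0, 1.0)))
```

 The third example is the case such a procedure is meant to screen out: when the differences between states are small relative to the sampling error, replicates disagree on the direction and the gene is left unclassified instead of being counted toward either category.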
 
 In addition, we adopted a bin scheme (i.e., 20%, 40% and 60% bin settings along the spectrum of the reinforcement/reversion plasticity). These analyses based on different categorization schemes revealed similar results, and suggested that our inference of an excess of genes with reversion plasticity is robust. We have provided these results in Supplementary Figures S2 and S4.
 
 The second issue that the reviewer raised is that the plastic changes simply have more space to reverse direction rather than having any biological reason to do so. While the causal reason why more genes have their expression levels reversed than reinforced at the late stages is still contentious, an increasing number of studies show that gene expression plasticity at the early stage may be functionally maladaptive in the novel environments that species have recently colonized (e.g., lizard, Campbell-Staton et al. 2021; Escherichia coli, yeast, guppies, chickens and babblers, Ho and Zhang 2018; Ho et al. 2020; Kuo et al. 2023). Our comparisons based on the two gene sets that are associated with muscle phenotypes corroborated these previous studies and showed that initial gene expression plasticity may be nonadaptive to the novel environments (e.g., Ghalambor et al. 2015; Ho & Zhang 2018; Ho et al. 2020; Kuo et al. 2023; Campbell-Staton et al. 2021).
 
 Q3. The correlation between plastic change and evolved divergence is an artifact due to the definitions of adaptive versus maladaptive changes. For example, the definition of adaptive changes requires that plastic change and evolved divergence are in the same direction (Figure 3a), so the positive correlation was a result of this selection (Figure 3d).
 
 The reviewer raised the issue that the correlation between plastic change and evolved divergence is an artifact of the definition of adaptive versus maladaptive changes, for example, in Figure 3d. We agree with the reviewer that the correlation analysis is circular because the definition of adaptive and maladaptive plasticity depends on whether the direction of plastic change matches or opposes that of the colonized tree sparrows. We have thus removed the previous Figure 3d-e and related text from the revised version of the manuscript. Meanwhile, we have changed Figure 3a to further clarify the schematic framework.
 
 Reviewer #1 (Recommendations For The Authors):
 
 Q1. Here are private recommendations that I think could help improve the manuscript. West-Eberhard was a pioneer back in 2003 in explicating the hypothesis of "plasticity first". I think it is important to cite their main work in the first paragraph of introduction and to use the term "plasticity-first", which is widely known among evolutionary biologists studying phenotypic plasticity, instead of "plasticity followed by genetic change", since the three papers cited in paragraph 1 call it « plasticity first ».
 
 West-Eberhard, M.J. (2003) Developmental Plasticity and Evolution, Oxford University Press.
 
 Thank you for suggesting West-Eberhard (2003) and we have cited this important work. We have also changed “plasticity followed by genetic change” to “plasticity first”.
 
 Q2. Introduction. Line 5, Change for « On the one hand, if plasticity changes ... »
 
 We have modified as suggested.
 
 Q3. Line 52, Change for « ...same direction as adaptive evolution does ...»
 
 We have modified as suggested.
 
 Q4. Line 66, When presenting papers that address the plasticity and evolution of gene expression in response to environmental variables, paper by Morris et al is another example that could be useful to include (but this is only a suggestion in case the authors missed it).
 
 Thank you for suggesting this nice work. We have cited Morris et al. (2014).
 
 Q5. Line 94, Change for "We acclimated"
 
 We have modified as suggested.
 
 Q6. In Figure 3, the figure in panel A and B is labelled "normaxia", but I think that "normoxia" is usually the term used.
 
 Thank you for spotting the typo. We have modified Figure 3a and we no longer use the term “normaxia”.
 
 Material and methods
 
 It would be important to merge supplementary table 1 and 2 and only present the individuals that were used with their respective cardiac and muscle libraries (if they come from the same individual?). Also, the origin of the individuals used in the hypoxia experiment should be explained at the beginning of the methods section and explicated in the supplementary table. Information on sex or stage of development (juvenile? Adult? Male? female?) and time of year (in breeding stage? Pre-migration (if any), etc) would allow the reader to see that individuals from lowland differed only in their exposure to hypoxia or not, or if other variables may affect gene expression patterns. Similarly, if all individuals from the highland are males and the lowland hypoxia exposed individuals are females (or juveniles versus breeders, or different time of year, etc) this should be stated in the methods. Gene expression is labile so the reader should know if other variables influence the results presented or not.
 
 Thank you for the suggestion. We have added detailed information (i.e., age, collecting time and season) to Supplementary Table 1. We have also added this information to the Methods. Because the birds used in transcriptomic analysis (Supplementary Table 1) were different individuals from those used in the sequence analyses (Supplementary Table 2), these two tables cannot be merged.
 
 References:
 
 Campbell-Staton SC, Velotta JP, Winchell KM. 2021. Selection on adaptive and maladaptive gene expression plasticity during thermal adaptation to urban heat islands. Nat. Commun. 12: 6195.
 
 Ghalambor CK, Hoke KL, Ruell EW, Fischer EK, Reznick DN, Hughes KA. 2015. Non-adaptive plasticity potentiates rapid adaptive evolution of gene expression in nature. Nature 525:372–375.
 
 Hao et al. 2023. Divergent contributions of coding and noncoding sequences to initial high-altitude adaptation in passerine birds endemic to the Qinghai–Tibet Plateau. Mol. Ecol. Doi: 10.1111/mec.16942.
 
 Ho WC, Zhang J. 2018. Evolutionary adaptations to new environments generally reverse plastic phenotypic changes. Nat. Commun. 9: 350.
 
 Ho WC, Zhang J. 2019. Genetic gene expression changes during environmental adaptations tend to reverse plastic changes even after correction for statistical nonindependence. Mol. Biol. Evol. 36: 604–612.
 
 Ho WC, Li D, Zhu Q, Zhang J. 2020. Phenotypic plasticity as a long-term memory easing readaptations to ancestral environments. Sci. Adv. 6: eaba3388.
 
 Kuo KC, Yao CT, Liao BY, Weng MP, Dong F, Hsu YC, Hung CM. 2023. Weak gene-gene interaction facilitates the evolution of gene expression plasticity. BMC Biol. 21: 57.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.20.529215v1
www.biorxiv.org www.biorxiv.org

New submission 29/06/2023, 12:37:34

1
1. Public_Reviews 29 Jun 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Recommendations For The Authors):
  
  I would recommend the authors check the results section, it seems to me that the first two paragraphs are not results, but methods.
  
  We would like to express our appreciation to both reviewers for bringing this to our attention. Indeed, we discussed this in detail, but because the methods come after the results section, we believe that providing the basic methodological approach to readers before the results is essential for better comprehension. Once again, we sincerely thank the reviewers for their valuable feedback; however, we would prefer to leave this part as it is.
  
  In Figure 3B, why are males and females not shown as separate lines, as in the rest of the figures? I recommend following the same pattern everywhere.
  
  Has been changed accordingly, and the respective sex-specific lines were also added to Figure 4.
  
  I recommend checking carefully all the articles included in Table 2. Maybe some of the included information here is not precise.
  
  We thank the reviewer for highlighting this. We carefully checked the articles again, and made some small adjustments.
  
  In Material and methods: just note that when ages are estimated, usually there is a variable accounting for the amount of estimated years, that should be included in the model, and see that it has no effect on the dependent variable. I recommend including this variable.
  
  We sincerely appreciate the helpful comment from the reviewer, which we have carefully considered and implemented in our manuscript. However, we would like to highlight that addressing age estimation error is complex, as it involves measurement error. Thus, simply adding it as an independent variable may not fully capture its potential impact, as the effect may be positive or negative depending on the individual. Hence, the potential effect would be better accounted for by the implementation of individual random intercepts and smooths to adjust the confidence intervals, which is part of our model structures. Furthermore, we would like to emphasize that we have also conducted analyses on a reduced dataset that only included zoo-born individuals with precisely known birthdates, and the results remained consistent. So instead of changing our analyses, we now emphasize how our approach also addresses this aspect.
  
  Creatinine: Is there any other reference, more recent and in English, to complement the original one cited?
  
  We have now supplemented the original citation with an additional English citation: Anestis et al. 2009.
  
  Reviewer #2 (Recommendations For The Authors):
  
  Minor corrections
  
  Please, in Study population, the citation of table 2 is in fact Table 3. For table 3 (in Methodology), please provide the units. Body weight has a mean of 32.4; does it have a median of 9?
  
  Please, provide results separately for males and females
  
  We changed the table as requested, though the table only reports sample sizes and thus only numbers without units. The values for body weight are accurate.
  
  In Results
  
  The two first paragraphs have to be included in methods and structured with those already present.
  
  We would like to express our appreciation to both reviewers for bringing this to our attention. Indeed, we discussed this in detail, but because the methods come after the results section, we believe that providing the basic methodological approach to readers before the results is essential for better comprehension. Once again, we sincerely thank the reviewers for their valuable feedback; however, we would prefer to leave this part as it is.
  
  In Table 1, indicate what 'Est' means.
  
  Has been changed accordingly
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.26.525764v3
www.biorxiv.org www.biorxiv.org

New submission 28/06/2023, 14:48:18

1
1. Public_Reviews 28 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The cerebral cortex, or surface of the brain, is where humans do most of their conscious thinking. In humans, the grooves (sulci) and bumps (convolutions) have a particular pattern in a region of the frontal lobe called Broca's area, which is important for language. Specialists study features imprinted on the internal surfaces of braincases in early hominins by casting their interiors, which produces so-called endocasts. A major question about hominin brain evolution concerns when, where, and in which fossils a humanlike Broca's area first emerged, the answer to which may have implications for the emergence of language. The researchers used advanced imaging technology to study the endocast of a hominin (KNM-ER 3732) that lived about 1.9 million years ago (Ma) in Kenya to test a recently published hypothesis that Broca's remained primitive (apelike) prior to around 1.5 Ma. The results are consistent with the hypothesis and raise new questions about whether endocasts can be used to identify the genus and/or species of fossils.
  
  We would like to thank Rev. 1 for their comments on our paper.
  
  Reviewer #2 (Public Review):
  
  The authors tried to support the hypothesis that early Homo still had a primitive condition of Broca's cap (the region in fossil endocasts corresponding to Broca's area in the brain), being more similar to the condition in chimpanzees than in humans. The evidence from the described individual points to this direction but there are some flaws in the argumentation.
  
  We are grateful to Rev. 2 for their comments, although we agree only partially with some of them.
  
  First, we would like to rectify the statement of Rev. 2 that we “tried to support the hypothesis that early Homo still had a primitive condition of Broca's cap”. Our aim was to test this hypothesis, not to validate it.
  
  First, only one human and one chimpanzee were used for comparison, although we know that patterns of brain convolutions (and in addition how they leave imprints in the endocranial bones) are very variable.
  
  We understand the point raised by Rev. 2 about the variation of brain convolutions in humans and chimpanzees. We used atlases published by Connolly (1950), Falk et al. (2018) and de Jager et al. (2019, 2022) to analyse the endocast of KNM-ER 3732 and compare it to the extant human and chimpanzee cerebral conditions. However, in Figure 2, for the sake of clarity, only two specimens (one Homo and one Pan) were used to illustrate the comparison (as has been done in other published papers, e.g., Carlson et al., 2011, Science; Gunz et al., 2020, Sci Adv). In the revised version, we modified the manuscript to explain our approach further (line 156): “We used brain and endocast atlases published in Connolly (1950), Falk et al. (2018) and de Jager et al. (2019, 2022; see also www.endomap.org) for comparing the pattern identified in KNM-ER 3732 to those described in extant humans and chimpanzees. To the best of our knowledge, these atlases are the most extensive atlases of extant human and chimpanzee brains/endocasts available to date and are widely used in the literature to explore variability in sulcal patterns. In Figure 2, the extant human and chimpanzee conditions are illustrated by one extant human (adult female) and one extant chimpanzee (adult female) specimens from the Pretoria Bone Collection at the University of Pretoria (South Africa) and in the Royal Museum for Central Africa in Tervuren (Belgium), respectively (Beaudet et al., 2018).”
  
  Second, the evidence from this fossil specimen adds to the evidence from previously described individuals but does not yet fully prove the hypothesis.
  
  We tempered our discussion by concluding that (line 116) “Overall, the present study not only demonstrates that Ponce de León et al.’s (2021) hypothesis of a primitive brain of early Homo cannot be rejected, but also adds information […]”.
  
  Third, there is a vicious circle in using primitive and derived features to define a fossil species and then using (the same or different) features to argue that one feature is primitive or derived in a given species. In this case, we expect members of early Homo to be derived compared to their predecessors of the genus Australopithecus and that's why it seems intriguing and/or surprising to argue that early Homo has primitive features. However, we should expect that there is some kind of continuum or mosaic in a time in which a genus "evolves into" another genus. This discussion requires far more discussions about the concepts we use, maybe less discussion about what is different between the two groups but more discussion about the evolutionary processes behind them.
  
  We fully agree with Rev. 2 on this aspect. We believe that identifying these differences/similarities between fossil and extant hominids constitutes the first step toward a better understanding of the evolutionary mechanisms. Our work indeed suggests a certain continuity between genera and raises questions about the genus concept and how to interpret the specimens currently attributed to early Homo. In the revised version of the manuscript we included a reference to this possible scenario (line 134): “[…] or to the absence of a definite threshold between the two genera based on the morphoarchitecture of their endocasts (Wood and Collard, 1999).”
  
  Fourth, the data of convolutional imprints presented are rather subjective when identifying which impressions represent which brain convolutions. Not seeing an impression does not necessarily mean that the corresponding brain feature did not exist. Interestingly, the manuscript does not mention and discuss at all the frontoorbital sulcus. This is a sulcus that usually runs from the orbital surface of the frontal lobe up to divide the inferior frontal gyrus in chimpanzees, a condition totally different than in humans who do not have a frontoorbital sulcus. Could such a sulcus be identified, this would provide a far more convincing argument for a primitive condition in this specimen. In Australopithecus sediba, e.g., the condition in this region seems to be a mosaic in which some aspects of the morphology seem to be more modern while one of the sulcal impressions can well be interpreted as a short frontoorbital sulcus. For this specimen, by the way, I would come back to my third point above: some experts in the field might argue that this specimen could belong to Homo rather than Australopithecus...
  
  We agree that the presence of a fronto-orbital sulcus would be more conclusive. However, this sulcus has not been identified in KNM-ER 3732 and the region in which we would expect to find it is not preserved. As demonstrated by Ponce de León et al. (2021), because of the topographic relationships between sulci (and cranial structures), it is possible to interpret imprints on endocasts and the evolutionary polarity of some traits even in the absence of landmarks such as the fronto-orbital sulcus. In Australopithecus sediba the main derived feature of the endocast corresponds to the ventrolateral bulge in the left inferior frontal gyrus, and not to the sulcal pattern itself (Carlson et al., 2011, Science). However, the discussion around the taxonomic status of this taxon confirms the urgent need for reconsidering specimens from that time period and clarifying the mosaic-like or concerted evolution of the derived Homo-like traits within our lineage. Regarding the subjective nature of this approach, we invite readers to examine the specimen on MorphoSource (https://www.morphosource.org/concern/media/000497752?locale=en) and to request access from the National Museums of Kenya to the physical or virtual specimen in order to falsify our hypothesis.
  
  According to my arguments above, I think that this manuscript might revive interesting discussions about this topic but it is not likely to settle them because the data presented are not strong enough to fully support the hypothesis.
  
  We would be more than happy to consider new/other specimens with similar chronological and geographical contexts and investigate further this hypothesis in the future.
  
  Reviewer #3 (Public Review):
  
  The authors provide a detailed analysis of the sulcal and sutural imprints preserved on the natural endocast and associated cranial vault fragments of the KNM-ER3732 early Homo specimen. The analyses indicate a primitive ape-like organization of this specimen's frontal cortex. Given the geological age of around 1.9 million years, this is the earliest well-documented evidence of a primitive brain organization in African Homo.
  
  In the discussion, the authors re-assess one of the central questions regarding the evolution of early Homo: was there species diversity, and if yes, how can we ascertain it? The specimen KNM-ER1470 has assumed a central role in this debate because it purportedly shows a more advanced organization of the frontal cortex compared to other largely coeval specimens (Falk, 1983). However, as outlined in Ponce de León et al. 2021 (Supplementary Materials), the imprints on the ER1470 endocranium are unlikely to represent sulcal structures and are more likely to reflect taphonomic fracturing and distortion. Dean Falk, the author of the 1983 study, basically shares this view (personal communication). Overall, I agree with the authors that the hypothesis to be tested is the following: did early Homo populations with primitive versus derived frontal lobe organizations coexist in Africa, and did they represent distinct species?
  
  I greatly appreciate that the authors make available the 3D surface data of this interesting endocast.
  
  We are grateful to Rev. 3 for their comments and for contextualizing our finding. We would also like to point out that, although the 3D surface can be viewed on MorphoSource, permission from the National Museums of Kenya has to be requested for studying the specimen and getting access to the physical specimen and/or the 3D model.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.05.543693v2
www.biorxiv.org www.biorxiv.org

Molecular portraits of colorectal cancer morphological regions

1
1. Public_Reviews 28 Jun 2023
 
 in eLife
 
 Author Response:
 
 We are grateful to the reviewers for their insightful comments, suggestions, and criticism. In the updated version of the manuscript, all these will be properly reflected. Here we briefly address the main points raised:
 
 Reviewer #1:
 
 1.1. Patient selection and tumor area selection are crucial for this study but not very carefully defined. Why are some core and others not? Figure referral is an issue here (sup figure 6 where all core and non-core samples are supposed to be according to the legend of Fig 4 is likely sup fig 7 but this is then a complete copy paste of Figure 4). In the methods it is stated that the core samples are based on limited contamination of additional morphotypes (<20%) but Fig 4 suggests that all tumours listed have multiple morphotypes.
 
 The tissue samples were obtained from a hospital cohort of patients with stage II-IV colorectal cancer (at diagnostic time), with no particular selection criteria imposed, as this was an exploratory study.
 
 Tumor regions were marked for macro-dissection by an experienced pathologist following the standard practice for whole-tumor transcriptomics studies. The subregions (morphological regions) were marked by the same experienced pathologist for macro-dissection (in an adjacent section) and reassessed later with respect to their “morphological purity”. It is impossible to macro-dissect regions containing a single morphological pattern. Hence, regions that contained a significant amount (>=20%) of other morphologies were considered “non-core”, while the rest were called “core” regions. This distinction applies solely to morphological regions and not to whole-tumor samples.
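 
 The core/non-core rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `classify_region`, the representation of a region as a dictionary of estimated area fractions, and the example fractions are hypothetical and are not taken from the manuscript; only the 20% cut-off comes from the response.

```python
def classify_region(fractions, dominant, threshold=0.20):
    """Label a macro-dissected region as 'core' or 'non-core'.

    fractions: dict mapping morphotype name -> estimated area fraction
               (hypothetical representation; fractions should sum to 1).
    dominant:  the morphotype the region was dissected for.
    threshold: share of other morphologies at or above which the region
               is treated as non-core (20% per the author response).
    """
    if dominant not in fractions:
        raise ValueError(f"dominant morphotype {dominant!r} not in fractions")
    # Everything that is not the target morphology counts as admixture.
    other = sum(v for k, v in fractions.items() if k != dominant)
    return "non-core" if other >= threshold else "core"


# Hypothetical example regions (fractions are made up for illustration).
print(classify_region({"SE": 0.85, "CT": 0.15}, "SE"))
print(classify_region({"SE": 0.80, "CT": 0.20}, "SE"))
```

 Under this sketch the label is a property of a single dissected region, consistent with the statement that the distinction does not apply to whole-tumor samples.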
 
 Indeed, the reference in the caption of Figure 4 should point to Supp. Fig. 7 (which needs to be updated).
 
 1.2. CMS subtype should be performed with single sample predictor rather than CMScaller.
 
 We agree that a single-sample predictor for CMS is needed; however, CMScaller is the de facto classifier for CMS (>130 citations), so we used it to illustrate the practical implications.
 
 1.3. A couple of surprising observations need specification. MUC2 is a strong CMS3 reporter gene yet Mucinous tumours appear to end up in CMS4 rather than 3. Can the authors show that indeed stroma cells are very evident in these samples?
 
 We do not have a direct estimate of the amount of stromal cells, but the high scores of the various fibroblast-related signatures in mucinous regions (Fig. 2B, D) indicate that there is indeed an enrichment in stroma. In the follow-up study we plan to perform specific staining as well as spatial transcriptomics of these regions to further investigate our findings.
 
 1.4. The SE PP and CT are assigned to CMS2, but in Figure 4 this appears a lot more variable than the authors would make the reader believe. The full data are not completely clear (see point 1).
 
 In the paper, we transparently state that PP, SE, and CT were assigned to CMS2 in 62.5%, 41.7% and 41.9% of cases, respectively. These proportions refer to all samples for which CMScaller made a prediction. In Fig. 4, we also show the proportion of cases in which CMScaller did not predict any subtype.
 
 1.5. The tumor response rates are rather weird as this is likely dependent on the complete tumour and not so much the subareas. It is not very well described what we see in this analysis.
 
 We did not compute any response rates; we computed simple prognostic scores as means (weighted, if weights were provided) of the genes in the specific signatures (see Methods). The question addressed was whether these scores were comparable between the whole tumor and the corresponding tumor regions (within the same tumor). Given the observed (relative) variability, the more important follow-up question, which we cannot answer with our limited survival data, is whether a higher score in a region than in the whole tumor is indeed indicative of a higher risk of relapse.
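 
 As a rough illustration of the kind of score described above, the following Python sketch computes a (weighted) mean over the genes of a signature and compares a region with its whole tumor. It is not the authors' implementation: the function name `signature_score`, the gene names, the expression values, the handling of missing genes, and the normalisation by the sum of absolute weights are all assumptions made for the example.

```python
def signature_score(expression, genes, weights=None):
    """Mean (or weighted mean) expression over a gene signature.

    expression: dict mapping gene -> expression value for one sample.
    genes:      list of genes in the signature.
    weights:    optional dict gene -> weight; if given, a weighted mean is
                returned, normalised by the sum of absolute weights
                (an assumption, so that signed weights do not cancel
                the denominator).
    Genes missing from the sample are skipped (also an assumption).
    """
    present = [g for g in genes if g in expression]
    if not present:
        raise ValueError("no signature genes found in sample")
    if weights is None:
        return sum(expression[g] for g in present) / len(present)
    norm = sum(abs(weights[g]) for g in present)
    if norm == 0:
        raise ValueError("all weights are zero")
    return sum(weights[g] * expression[g] for g in present) / norm


# Hypothetical values for one whole-tumor sample and one of its regions.
signature = ["GENE_A", "GENE_B"]
whole_tumor = {"GENE_A": 2.0, "GENE_B": 4.0, "GENE_C": 6.0}
region = {"GENE_A": 3.0, "GENE_B": 5.0, "GENE_C": 1.0}

# Positive difference: the region scores higher than the whole tumor.
diff = signature_score(region, signature) - signature_score(whole_tumor, signature)
print(diff)
```

 Whether such a positive difference corresponds to a higher risk of relapse is exactly the question the response leaves open.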
 
 1.6. Serrated adenomas have previously been aligned with CMS4. Is this different from serrated areas in cancers?
 
 We do not have data from adenomas to compare with the serrated carcinoma regions. But a comparison of (regions of) both traditional serrated and sessile serrated adenomas to serrated carcinoma would be interesting.
 
 1.7. The fact that iCMS2 and iCMS3 align rather well with the current analysis of the distinct regions suggests that the analysis that was reported last year is the proper way to view tumor intrinsic signatures. The authors now propose a rather similar outcome to this issue which does take away a lot of the novelty of the findings of this study.
 
 Our goal was not to propose another stratification paradigm for colorectal cancer, but rather to study the associations between morphology and transcriptome and their implications in practice. As such, our analyses are not limited to molecular subtypes and the respective observations are but a small part of our findings. Indeed, the intrinsic subtypes (iCMS 2/3) are stable and robust, as they are based on the genes expressed in epithelial cells, and they may well prove to be of clinical importance too. However, they do not cover all aspects (e.g. fibroblast subtypes) and, as stated in Joanito et al. Nat Gen 54, pages 963–975 (2022), “iCMS, MSI status and CMS jointly inform the molecular classification of CRC”. Last, in our opinion, the molecular classification of CRC, while a useful point of view in tumour classification, does not cover all the necessary perspectives on tumour heterogeneity.
 
 Reviewer #2:
 
 2.1. Overall, the manuscript provides an interesting histological/morphological framework through which we can consider heterogeneity in colorectal carcinoma and an approach by which we might improve the performance of gene expression-based classifiers in predicting clinical behaviour and/or responses to therapy. Exploration of CRC morphotypes and their differences was quite interesting. However, more work is needed to support the claims made by the authors. While I appreciate that the authors themselves identify limitations of their study within the manuscript, I believe awareness of these limitations is not reflected in some of the claims made in the abstract and at points in the main text when discussing the use of expression-based classifiers.
 
 We will improve the manuscript to stress the exploratory nature of our analyses and their limitations.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.24.525310v3
www.biorxiv.org www.biorxiv.org

New submission 26/06/2023, 14:29:07

1
1. Public_Reviews 27 Jun 2023
 
 in eLife
 
 Author Response
 
 The following is the authors’ response to the original reviews.
 
 This important work reports the identification of a list of proteins that may participate in the clearance of paternal mitochondria during fertilization, which is known to be essential for normal fertilization and embryonic and fetal development. While the main method used is state of the art and the supporting data are solid, the rigor of the biochemical assays and functional validation is inadequate. This work will be of interest to developmental and reproductive biologists working on fertilization. Key revisions (for the authors) include 1) Use a mitochondria-enriched fraction instead of whole sperm for the assays, and add more control samples to monitor what got lost during sperm and oocyte treatments before the coincubation step. 2) Functional validation of the key proteins identified.
 
 We thank the Editors of eLife, as well as the Special Issue Guest Editors and Reviewers, for a favorable assessment and helpful recommendations for key revisions. Provisional revisions included in our revised article are detailed below. We agree with the Editors’ comment about the use of mitochondrion-enriched fractions and additional functional validation of key proteins. In fact, we are developing experimental protocols for oocyte extract coincubation with isolated sperm heads and tails, and eventually with purified mitochondrial sheaths, to separate the ooplasmic sperm nucleus remodeling factors from the mitophagic ones. Such experiments, as well as functional validations using porcine zygotes, are contingent upon the anticipated post-pandemic rebound in the availability of porcine oocytes. These are obtained from ovaries harvested on slaughterhouse floors, which requires a workforce that is currently unavailable; this shortage has hampered our access to this necessary resource.
 
 Reviewer #1 (Peer Review):
 
 Could the authors make clear how much the presented pictures reflect the described localisation? There is no information on the number of spermatozoa and embryos observed nor the fraction of these embryos showing the presented pattern of localisation. This must be included.
 
 Two hundred spermatozoa were counted per replicate of the cell-free system co-incubation and 20 zygotes per replicate, with 3 replicates of immunolabelling for each phase/picture, which were examined to establish the typical localization patterns that were observed. The displayed patterns were observed in 65 to 88% of examined spermatozoa/zygotes, varying with protein, replicate, and phase of immunolabelling. In all cases, the signal displayed is the typical pattern that was seen in most cells. This information has been added to the Materials and Methods section for clarification.
 
 It is not clear if the authors also examined the localization of other proteins and obtained a different pattern than anticipated from the proteomic approach or if they only tested these 6 proteins and got a 100% of correlation.
 
 These are the 6 proteins which were selected based on extensive literature review into known functions of all identified proteins, as well as extensive research into available and reliable antibodies to detect such proteins within our porcine systems. Even so, no particular localization patterns were anticipated; instead, we presented the patterns actually observed and even some patterns which defied our expectations (i.e., the localization of BAG5 in the sperm acrosome).
 
 The authors use "MS" in the text to indicate both "mitochondrial sheath" and "mass spectrometry". This is confusing.
 
 The authors agree and the usage of MS as an acronym for either has been removed entirely to avoid confusion.
 
 In the introduction the author refers to Ankel-Simons and Cummins, 1996 as a reference for the number of sperm mitochondria in mammalian species, this is incorrect since the quoted paper is about the number of mtDNA molecules and mentioned an earlier publication.
 
 This has been revised and the appropriate citation has been used.
 
 Reviewer #2 (Peer Review):
 
 Major:
 
 1) Earlier studies from this group have shown that the porcine cell-free system is useful for observing spermatozoa interacting with ooplasmic proteins in a single trial and can recapitulate the sperm mitophagy events that take place in a zygote after fertilization without affecting the later cell-division process. However, post-fertilization sperm mitophagy is a complex, time-dependent process in which many events occur sequentially and interactively, which means that ooplasmic proteins might be involved in this process without directly interacting with sperm, or may associate with sperm-ooplasmic protein complexes at different time points. Identifying "the candidate players" from the list of 185 proteins is certainly already a great advance in knowledge; however, with the time resolution (4 and 24hr) of the current study and without functional validation experiments at this stage, it is still difficult to postulate the importance of these identified proteins. Functional validation experiments are, in my opinion, critically important for better interpretation of the data.
 
 The authors agree with this reviewer’s sentiments and plan to conduct further functional analysis. This project generated a list of candidate sperm-mitophagy-promoting proteins, and we were further able to show that many of these proteins were detectable both via mass spectrometry and via immunocytochemistry in spermatozoa exposed to our cell-free system. Furthermore, similar localization patterns were found in spermatozoa detected within newly fertilized zygotes. These results boost our confidence in our cell-free system and show that our list of candidate proteins is a useful starting point for future localization and functional analyses. We are certainly aware that we have not captured every protein that may play a role in post-fertilization sperm mitophagy and that the proteins captured are just candidates until proven otherwise. Likewise, we have almost certainly captured some candidate proteins that will not be shown to play a role in post-fertilization sperm mitophagy. It is plausible that at least some of these candidates do play a role in mitophagy, and that others participate, perhaps in roles yet to be described, in other fertilization events, in which we would be extremely interested as well.
 
 2) As shown in Figure 1, whole sperm was used in the co-incubation and the later MS analysis; thus, proteins identified in the current study might be relevant in fertilization processes other than postfertilization sperm mitophagy, as proteins identified in the current study may be associated with other parts of the sperm (e.g. sticky sperm head, e.g. PSMG2 associated with sperm midpieces, tail at 4hr coincubation, but then only associate with sperm head at 24hr co-incubation) rather than sperm midpiece, despite the fact that authors applied immunohistochemistry to show the localization of this protein, but the evidence is indirect, so how authors functionally differentiate these 6 identified proteins from sperm mitophagy process with other processes and to confirm (or to associate) the relevance of these proteins with sperm mitophagy process?
 
 The authors agree that the 6 proteins which were further studied by immunocytochemistry may be playing roles in other processes such as pronuclear formation. We discussed some potential roles, including and beyond post-fertilization mitophagy, in the Supplemental Discussion. After the reviewer comments, we moved the Supplemental Discussion back into the main Discussion section. Thus, this section now considers additional putative pathways in which the said 6 proteins could participate, though we concede that thorough functional studies must still be performed.
 
 3) Class 3 proteins were present in both the gametes or only the primed control spermatozoa, but are decreased in the spermatozoa after co-incubation, which authors interpreted as sperm-borne mitophagy determinants and/or sperm-borne proteolytic substrates of the oocyte autophagic system, this data categorization may need to be revised as sperm-borne proteolytic substrates of the oocyte autophagic system only, not for sperm borne mitophagy determinants. The argument for this disagreement is due to the fact that if the protein is a sperm-borne mitophagy determinant, after coincubation, to execute the mitophagy process, this protein should still be associated with the sperm at least at the early stage (of 4hr) (constant under MS detection when comparing control with 4hr treated) rather than being released from the sperm. Or alternatively, they could result in class 3 proteins (but not all those 6 were in class 3). Nevertheless, if these proteins serve as substrates, they can be used (consumed) and show decreased under MS detection.
 
 This argument for redefining the Class 3 proteins more accurately is understood and we agree. The definition is revised in the paper.
 
 4) Of particular interest among the 6 proteins that were further investigated. Unlike other proteins, MVP was highly significant (p<0.001) after 4hr incubation, but the significance became less after 24hr (p=0.19). Interpretation of this dynamic change in the relevance of the mitophagy process would facilitate the readers to understand the relevance and the role of MVP.
 
 The differences in significance are likely influenced by the abundance of MVP detectable by mass spectrometry. As the time of cell-free system incubation increases, the variability between replicates also seemed to increase, likely due to the sustained proteolytic activity taking place in our system. This work was based on three replicates of mass spectrometry for each time point; additional replicates likely would have reduced the p-value for the 24hr cell-free data set, for MVP and potentially other proteins also. At both time points, MVP was only detectable in spermatozoa after they had been exposed to the cell-free system treatment. This is the criterion that truly interested us, more than the actual differences in content between the timepoints, and it is why MVP was added to our list of candidate proteins.
 
 5) In figure 3, the association of ooplasmic MVP to sperm midpiece is not convincing enough as sperm midpiece and tail often show some levels of non-specific signals under fluorescent microscopy. And the dynamic association of ooplasmic MVP to sperm midpiece in Fig. 3F-G is difficult to reach a conclusion solely based on data presented in the manuscript. Additional negative control of sperm MVP staining from the primed and treated sperm would be helpful. Additionally, a quantitative comparison (15 vs 25hr) of sperm-associated MVP signals from the fertilized embryo or a stack image from different angles would clarify the doubts raised here.
 
 For all images and all replicates, serum controls were also generated. These controls were viewed under a fluorescence microscope, and the light intensity and exposure thresholds for each fluorescence channel were set based on the background intensity of these nonimmune serum-treated control samples. We set our light intensity/acquisition time below the threshold at which non-specific signal began to appear. All the presented patterns are based on this peak intensity threshold, and as such the signal we see should be the true signal. Furthermore, 200 spermatozoa were counted per treatment per replicate of the cell-free system co-incubation and 20 zygotes per replicate, with 3 replicates of immunolabelling for each protein and data point, to establish the typical localization patterns that were observed. The displayed patterns were observed in 65-88% of examined spermatozoa/zygotes. Invariably, the signal displayed in the manuscript is the typical pattern that was seen in a majority of cells. This information has now been added to the Materials & Methods section for clarification.
 
 6) Same concerns for the other 5 proteins (PSMG2, PSMA3, FUNDC2, SAMM50, BAG5) as indicated above.
 
 See response to Question 5.
 
 7) The patterns of these 6 proteins in the immunofluorescence study are confusing, as the pattern varies after co-incubation (treated), and mostly the signal of these proteins observed in the fertilized embryos is not really associated with sperm midpieces. Therefore, the evidence that these proteins are involved in post-fertilization sperm mitophagy is, at this moment, weak based on the data presented. The relevance of these proteins to post-fertilization events or early embryo development is certain, but the evidence is not strong enough to support "sperm mitophagy," in my opinion.
 
 The authors agree that some of these proteins seem to be playing roles beyond postfertilization sperm mitophagy and that there is a need for true functional studies before the authors can state with certainty that these proteins play a role in any of the discussed fertilization events. We state this in the discussion: “Considering the dynamic proteomic remodeling of both the oocyte and spermatozoa which takes place during early fertilization, these 185 proteins which have been identified likely play roles in processes beyond sperm mitophagy.” It should be noted that the authors went into greater detail about potential alternative protein functions based on the present data and literature review in the Supplemental Discussion. Based on this comment and other reviewer comments we have now included the Supplemental Discussion as part of the main Discussion section, and this will hopefully help clarify some of the authors’ thoughts about the 6 candidate proteins which were further analyzed during this study.
 
 Minor:
 
 1) To my understanding, statistical significance (relevance) is normally set at a p-value of either <0.1 or 0.05. The reason for loosening the p-value to 0.2 in the current study needs to be justified, as this is not a common statistical criterion, and the candidates selected under this loosened criterion should be interpreted with care.
 
 The loosening of the significance threshold to 0.2 in this study applied only to our Class 1 proteins. To fall into Class 1, a protein had to be present only in samples that had been exposed to the cell-free system, and for these Class 1 proteins this was the case in all 3 replicates at each stated timepoint. We found this pattern of detection to be important whether the p-value fell under 0.1 or 0.2. As such, we loosened our statistical threshold for our Class 1 proteins. Any proteins added to our candidate list will be subject to further investigation before definitive conclusions can be drawn, and we think that capturing more proteins was more important for the goals of this study than limiting the number of proteins captured, especially for the Class 1 proteins. An explanation of this has been added to the Materials & Methods section “Mass Spectrometry Data Statistical Analysis”.
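 
 The Class 1 criterion described above can be written down as a small Python sketch. It is a hedged illustration only: the function name `is_class1`, the use of a zero intensity to mean "not detected", and the example intensities are assumptions; only the three-replicate requirement and the 0.2 threshold come from the response, and the 0.19 value mirrors the 24hr p-value the reviewer cites for MVP.

```python
def is_class1(control, treated, p_value, alpha=0.2):
    """Check the Class 1 pattern for one protein at one time point.

    control: intensities in primed control spermatozoa, one per replicate.
    treated: intensities after cell-free system co-incubation, one per
             replicate.
    A value of 0 is taken to mean 'not detected' (an assumption about how
    the data are encoded, not something stated in the response).
    """
    absent_in_control = all(x == 0 for x in control)
    present_in_all_treated = all(x > 0 for x in treated)
    return absent_in_control and present_in_all_treated and p_value < alpha


# Hypothetical intensities for three replicates.
print(is_class1([0, 0, 0], [5.1, 3.2, 4.0], p_value=0.19))   # passes at 0.2
print(is_class1([0, 0, 0], [5.1, 3.2, 4.0], p_value=0.19, alpha=0.1))
```

 The second call shows why the looser threshold matters for this class: the same detection pattern would be dropped at a cut-off of 0.1.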
 
 2) First cell cleavage of porcine embryo normally occurs within 48hr post-insemination or activation; therefore, the 4 and the 24hr time points used in the current study require justification included in the discussion or methods and material section.
 
 First cleavage of porcine embryos normally occurs around 24-28 hours post-insemination. Thus, for both the cell-free system and the embryo studies, we were capturing an advanced 1-cell stage zygote/zygote-like system with our 24-hour and 25-hour time points.
 
 3) In figure 2, colors used in different time points and in two different classes represent (sometimes) different protein categories, would be easier for the readers for quick comparisons if the same color could be used to represent the same protein category throughout the graph. (E.g, proteins for early zygote development are shown in red in "A", but blue in "B")
 
 This has been corrected and the color scheme for Figure 2 has been revised for easier comparisons.
 
 Reviewer #3 (Peer Review):
 
 I am not used to seeing a supplementary discussion in a manuscript. I also believe it should be incorporated into normal discussion.
 
 The Supplemental Discussion has been incorporated into the main Discussion now.
 
 It would be very helpful to add a figure that schematically shows the proposed interactome of the identified factors with the sperm mitochondria before and after incubation, and also which factors are not identified in both cases (when comparing to somatic mito- or autophagy). This would make the discussion easier to follow and would summarize and illustrate the importance of the progress that the authors have made with this assay.
 
 We made a diagram that depicts the changes in protein localization patterns over time within our cell-free system. This diagram has been added to the manuscript as Figure 9.
 
 Reviewer #1 (Public Review):
 
 In this manuscript, the authors used an unbiased method to identify proteins from porcine oocyte extracts associated with permeabilised boar spermatozoa in vitro. The identification of the proteins is done by mass spectrometry. A previous publication of this lab validated the cell-free extract purification methods as recapitulating early events after sperm entry in the oocyte. This novel method with mammalian gametes has the advantage that it can be done with many spermatozoa at a time, allowing the identification of proteins associated with many permeabilised boar spermatozoa at once. This allowed the authors to establish a list of proteins either enriched or depleted after incubation with the oocyte extract, or associated with spermatozoa only after incubation for 4h or 24h. The total number of proteins identified in their test is around two hundred, with very few present in the sample only when spermatozoa were incubated with the extracts. This approach and these criteria yield a list of proteins likely associated with spermatozoa remnants after their entry and either removed or recruited for the transformation of spermatozoa-derived structures. Using WB and histochemistry labelling of spermatozoa and early embryos with specific antibodies, the authors confirmed the association/dissociation of 6 proteins suspected to be involved in autophagy.
 
 While this unique approach provides a list of potential proteins involved in sperm mitochondria clearance, it is only a starting point for many future studies. It does not demonstrate that any of these proteins indeed has a role in the processes leading to sperm mitochondria clearance, since the proteins identified may also be involved in other processes going on in the oocyte at this time of early development.
 
 We thank Reviewer 1 for the positive comments. We added a sentence to the Discussion addressing this shortcoming of the present study; further functional validation of candidate mitophagy factors is planned.
 
 Concerning the localisation of the 6 proteins further analysed, the authors must add how much the presented picture represents the observed patterns. They must include the details on the fraction of spermatozoa and embryos displaying the presented pattern.
 
 We now specify that the patterns depicted in the manuscript are typical and representative of data from at least three replicates of immunolabeling in spermatozoa and zygotes. For each replicate, 200 spermatozoa from the cell-free system co-incubation or 20 zygotes were examined. The displayed patterns were observed in 65-88% of the examined spermatozoa/zygotes. Invariably, the signal displayed in the manuscript is the typical pattern that was seen in a majority of cells. This information has now been added to the Materials & Methods section for clarification.
 
 Reviewer #2 (Public Review):
 
 Mitochondria are essential cellular organelles that generate ATP as the energy source for maintaining regular cellular functions. However, the degradation of sperm-borne mitochondria after fertilization is a conserved event, known as mitophagy, that ensures the exclusively maternal inheritance of the mitochondrial DNA genome. Defects in post-fertilization sperm mitophagy will lead to fatal consequences in patients. Therefore, understanding the cellular and molecular regulation of the post-fertilization sperm mitophagy process is critically important. In this study, Zuidema et al. applied mass spectrometry in conjunction with a porcine cell-free system to identify potential autophagic cofactors involved in post-fertilization sperm mitophagy. They identified a list of 185 proteins that might be candidates for mitophagy determinants (or their co-factors). Despite the fact that 6 (out of 185) proteins were further studied, based on their known functions, using a porcine cell-free system in conjunction with immunocytochemistry and Western blotting, to characterize the localization and modification changes of these proteins, no further functional validation experiments were performed. Nevertheless, the data presented in the current study are of great interest and could be important for future studies in this field.
 
 We thank Reviewer 2 for the positive comments. As we explain in our response to the Editors and Reviewer 1, further validation studies will be resumed once the availability of slaughterhouse ovaries for such studies improves. Examples of such functional validation of the pro-mitophagic proteins SQSTM1 and VCP are included in our previous studies (DOI: 10.1073/pnas.1605844113 and DOI: 10.3390/cells10092450) that led to the development of the cell-free system reported here, and are cited in the present study.
 
 Reviewer #3 (Public Review):
 
 In this manuscript, a cytosolic extract of porcine oocytes is prepared. To this end, the authors aspirated follicles from ovaries obtained from the slaughterhouse and first matured the oocytes to the metaphase II stage of meiosis (one polar body). Cumulus cells (hyaluronidase treatment) and the zona pellucida (pronase treatment) were removed, and the resulting naked mature oocytes (1000 per portion) were extracted in a buffer containing a divalent cation chelator, beta-mercaptoethanol, protease inhibitors, and a creatine kinase phosphocreatine cocktail for energy regeneration. The preparation was subsequently frozen and thawed three times in liquid nitrogen and crushed by 16 kG centrifugation. The supernatant (1.5 mL) was harvested, and 10 microliters of it was used for interaction with 10,000 permeabilized boar sperm (10 microliters of extract thus represents the cytosol fraction of 6.67 oocytes). In this assay, the sperm were treated with DTT and lysoPC to prime the sperm's mitochondrial sheath. After incubation and washing, these preps were used for Western blot (see point 2), for fluorescence microscopy, and for proteomic identification of proteins.
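
The figure of 6.67 oocytes per reaction follows from the quantities quoted in this summary, assuming the 1000 oocytes end up evenly distributed in the 1.5 mL of supernatant. A minimal sketch of that arithmetic is given below; the function and variable names are illustrative and do not come from the manuscript.

```python
def oocyte_equivalents(n_oocytes, extract_volume_ul, aliquot_ul):
    """Oocytes' worth of cytosol contained in one aliquot of extract.

    Assumes the extract is homogeneous, so the aliquot holds the same
    fraction of the oocytes as it does of the total volume.
    """
    return n_oocytes * aliquot_ul / extract_volume_ul


# Values quoted in the review: 1000 oocytes, 1.5 mL supernatant, 10 uL aliquots.
per_reaction = oocyte_equivalents(1000, 1500.0, 10.0)

# 10,000 permeabilized sperm are added to each 10 uL aliquot.
sperm_per_oocyte_equivalent = 10_000 / per_reaction
```

Under these assumptions each reaction exposes on the order of 1500 sperm to the cytosol of a single oocyte, which is relevant to the reviewer's later concern about protein amounts.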
 
 Points for consideration:
 
 1) The treatment of sperm cells with DTT and lysoPC will permeabilize sperm cells but will also cause the liberation of soluble proteins as well as proteins that may interact with sperm structures via oxidized cysteine groups (disulfide bridges between proteins that will be reduced by DTT).
 
 This is certainly a possibility. The lysoPC and DTT permeabilization steps were designed to mimic the natural processing (plasma membrane removal and sperm protein disulfide bond reduction) that spermatozoa undergo during fertilization. We do realize that this processing is chemically induced and thus is not a perfect recapitulation of fertilization processes. However, in this study and in previous studies with this system, we were able to show alignment between proteomic interactions taking place in the cell-free system and within the zygotes.
 
 2) Figure 3: Did the authors really make Western blots with the stated amounts of sperm cells and oocyte extracts? The description in the figures is not clear. This point relates to point 1. The proteins should also be detected in the following preparations: (1) the oocyte extract only (done); (2) unextracted nude oocytes, to see which potentially relevant proteins are lost by the extraction procedure (not done); (3) the permeabilized (LPC- and DTT-treated and washed) sperm only (not done); (4) intact sperm (done); (5) the assay product, in which 10,000 permeabilized sperm and the equivalent of 6.67 oocyte extracts were incubated and washed 3 times (or higher amounts after this incubation; not done). Note that the amount of sperm from one assay (10,000) will likely give insufficient protein for proper Western blotting and/or Coomassie staining. In the materials and methods, I cannot find how the permeabilized sperm were subjected to Western blotting after incubation. I only see how 50 oocyte extracts and 100 million sperm were processed separately for Western blot.
 
 The authors did make Western blots with the number of spermatozoa and oocytes stated in the materials and methods, a total protein equivalent of 10 to 20 million spermatozoa (equivalent to ~20-40 µg of total protein load) and 100 MII oocytes (equivalent to ~20 µg of total protein load). These numbers have been corrected in the Materials & Methods. Also, we did find in the Materials & Methods section that the Co-Incubation of Permeabilized Mammalian Spermatozoa with Porcine Oocyte Extracts section refers to using cell-free exposed spermatozoa for electrophoresis; however, for none of the presented Western blot work was this true. Rather, all of the presented Western blots as per their descriptions are utilizing ejaculated or capacitated sperm or oocytes. This line has been removed from the Materials & Methods to reduce confusion.
 
 Regarding preparation (2), we have previously assessed the difference between oocyte extract and intact oocytes in this manner internally and we are certainly losing proteins due to the oocyte extraction process. We make caveats in this vein throughout the article such as: “Furthermore, this cell-free system while useful does not perfectly capture all the events which take place during in vivo fertilization. The cell-free system is intended to mimic early fertilization events but is presumably not the exact same as in vitro fertilization.”
 
 3) Figures 4, 5, 6, 7, and 8 see point 2. I do miss beyond these conditions also condition 1 despite the fact that the imaged ooplasm does show positive staining.
 
 For all the presented Western blots, the tissue type is stated in the image description and the protocol which was used to prepare these samples is stated in the Materials & Methods.
 
 4) These points 1-3 are all required for understanding what is lost in the sperm and oocyte treatments prior to the incubation step as well as the putative origin of proteins that were shown to interact with the mitochondrial sheath of the oocyte extract incubated permeabilized sperm cells after triple washing. Is the origin from sperm only (Figs 5-8) or also from the oocyte? Is the sperm treatment prior to incubation losing factors of interest (denaturation by DTT or dissolving of interacting proteins preincubation Figs 3-8)?
 
 The authors understand that there are proteins and interactions lost on both sides of the cell-free system equation, and we have added a sentence to the Discussion to caveat this limitation in the system.
 
 5) Mass spectrometry of the permeabilized sperm incubated with oocyte extracts and subsequent washing has been chosen to identify proteins involved in the autophagy (or cofactors thereof). The interaction of a number of such factors with the mitochondrial sheath of sperm has been shown in some cases from sperm and others for an oocyte origin. Therefore, it is surprising that the authors have not sub-fractionated the sperm after this incubation to work with a mitochondrial-enriched subfraction. I am very positive about the porcine cell-free assay approach and the results presented here. However, I feel that the shortcomings of the assay are not well discussed (see points 1-5) and some of these points could easily be experimentally implemented in a revised version of this manuscript while others should at least be discussed.
 
 We agree that the use of a mitochondrial-enriched subfraction for further analysis would be interesting and useful. We are actively developing experimental protocols for oocyte extract coincubation with isolated sperm heads and tails, and eventually with purified mitochondrial sheaths. However, such experiments are contingent upon our access to porcine oocytes, which has continued to be a struggle since the COVID-19 pandemic compromised our ability to obtain oocytes in large, cheap, and reliable quantities. This was a continuous problem in preparing materials for this very paper and has continued to be an issue for our laboratory as well as many others at our university and across the country. We continue to make the most of oocytes every time we can get access to them, but the unfortunate reality is that this access has become sparse and unreliable over the past three years.
 
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.23.525177v2
www.biorxiv.org

Neuropeptide Y-expressing dorsal horn inhibitory interneurons gate spinal pain and itch signalling

1
1. Public_Reviews 26 Jun 2023
  
  in eLife
  
  Author Response
  
  The following is the authors’ response to the original reviews.
  
  Reviewer 1 (Recommendations For The Authors):
  
  1) The strikingly different conclusion from the previous Bourane study seems to stem from the experimental approaches. Rather than using genetic crosses that target all neurons from the hindbrain and spinal cord that express Npy at any point in development, Boyle et al target their manipulations specifically to the lumbar region of the superficial dorsal horn in adult mice using direct viral injections. Thus, Boyle et al are almost certainly manipulating far fewer neurons than the original study. How then are their behavioral effects so much greater? At a minimum, the authors need to discuss this discrepancy head on. Better would be a direct molecular/anatomical comparison of the neurons targeted by each approach. This could be done using Npy-Cre mice crossed to a Rosa-LSL-reporter strain and quantifying the overlap with the same markers used here. Perhaps the intersectional approach with Lbx1 resulted in labeling of a different population of neurons than the adult AAV injections? Although likely outside the scope, given that this work directly questions the main conclusion of the Bourane paper, it will be important to see a replication of the original finding of selectivity to mechanical itch.
  
  We agree that our approach should be manipulating a smaller population of neurons, and that it is therefore surprising that we see greater behavioural effects. Please see our response to "Weakness 1" of Reviewer 2 for consideration of this point. We have already provided a direct molecular comparison as requested by the reviewer, and this appears in Figure 1 supplement 1. Here we used tissue from NPY::Cre mice that had been crossed with Ai9 mice (i.e. a Rosa-LSL-reporter) and had received intraspinal injections of AAV.flex.GFP. We then characterised the neurochemistry of tdTomato+ cells that were GFP+ or GFP-negative.
  
  2) The authors state that, "91.6% ± 0.3% of cells classed as Cre-positive cells were also Npy-positive, and these accounted for 62.1% ± 0.6% of Npy-positive cells" If I am reading this correctly, does that mean that 40% of the Npy+ cells are Cre negative? If so, how is this possible?
  
  This interpretation is correct. For quantification of RNAscope data we used a cut-off level of 4 transcripts, and cells with fewer than 4 transcripts were classed as negative. It is likely that some of the NPY cells classified as negative for Cre would have had some Cre mRNA (sufficient to cause recombination), but at a level below this threshold. It is also possible that some NPY+ cells would fail to express Cre, since this is a BAC transgenic mouse, rather than a knock-in.
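
The effect of the cut-off can be illustrated with a minimal sketch. Only the threshold of 4 transcripts comes from the response above; the function names and the toy transcript counts are invented for illustration and are not data from the study.

```python
# Cut-off stated in the response: cells with fewer than 4 transcripts are negative.
TRANSCRIPT_CUTOFF = 4


def is_positive(transcripts, cutoff=TRANSCRIPT_CUTOFF):
    """Classify one cell for one probe from its transcript count."""
    return transcripts >= cutoff


def overlap_percentages(cells):
    """cells: list of (cre_transcripts, npy_transcripts), one pair per cell.

    Returns (% of Cre-positive cells that are Npy-positive,
             % of Npy-positive cells that are Cre-positive).
    """
    cre_pos = sum(1 for cre, _ in cells if is_positive(cre))
    npy_pos = sum(1 for _, npy in cells if is_positive(npy))
    both = sum(1 for cre, npy in cells if is_positive(cre) and is_positive(npy))
    return 100.0 * both / cre_pos, 100.0 * both / npy_pos


# Invented toy counts, NOT data from the study. The cell (3, 9) carries some
# Cre mRNA but falls below the cut-off, so it is scored Npy+/Cre-negative.
toy_cells = [(10, 12), (5, 8), (3, 9), (0, 6), (7, 2), (0, 0)]
cre_also_npy, npy_also_cre = overlap_percentages(toy_cells)
```

In this toy set the two percentages differ because they use different denominators, which is the same reason the 91.6% and 62.1% figures quoted by the reviewer can both hold.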
  
  3) Similarly, the authors state that "great majority of FP-expressing neurons in laminae I-III were immunoreactive (IR) for NPY (78.5% ± 3.6%), and these accounted for 74.6% ± 1.9% of the NPY-IR neurons in this area". So does this mean 20% of the recombination is non-specific/in other cell types that could be involved in pain/itch sensation?
  
  Our finding that 91.6% of cells with Cre mRNA were also positive for Npy mRNA (see above) indicates that Cre expression was largely restricted to NPY cells. The failure to detect NPY peptide in some of these cells probably results from the relatively low level of peptide seen in the cell bodies of peptidergic neurons, which results from the rapid transport of peptides into their axons.
  
  4) Comparing Fig 3B and Fig4B it seems the control baseline von Frey responses are different. In fact, baseline response in Fig4b is quite like the CNO effect in Fig 3B. Unless I'm misunderstanding something, this seems quite odd?
  
  We agree that there is a difference between the baseline responses. We are not aware of any particular reason for this, and we think that it reflects a degree of variability that is seen with the von Frey test. Interestingly, the baseline values for the SNI cohort (Fig 4E) lie between the values in Fig 3B and Fig 4B.
  
  5) In Fig 4E, the behavior of the CNO treated mice is quite variable. Can the authors comment as to how this might be happening? Does the effect correlate with viral transduction?
  
  We did not see any obvious correlation between the extent of viral transduction and the behaviour of individual mice.
  
  6) Fig6, the PDyn-Cre experiment, is a bit of a non sequitur?
  
  Please see our response to "Weakness 2" of Reviewer 2 for consideration of this point.
  
  7) The conclusion is unusually long. I recommend trimming it to make it more concise.
  
  We presume that this refers to the Discussion. However, this was ~1550 words, and we do not feel that that is unusually long.
  
  Reviewer 2 (Public Review):
  
  Weaknesses
  
  1) There is inadequate discussion about previous studies of NPY interneurons. Specifically, the authors should address why a more restricted subset of these neurons (this study) have broader effects than seen previously.
  
  We have expanded the discussion on the discrepancies between our findings and those reported previously. We state at the outset that we are targeting a more restricted population (lines 509-10), and we now go into more detail concerning both similarities and differences between our findings and the reasons that we think may underlie any discrepancies (various changes between lines 522-575).
  
  2) I cannot see the reason for including results from manipulation of Dyn+ interneurons in this paper. First, the title does not reflect roles of spinal Dyn+ population. In addition, without further experiments characterizing relationships between NPY and Dyn interneurons in modulating itch and/or nociception, Dyn datasets seem to deviate from the main theme.
  
  We had previously shown that activating Dyn-INs suppressed pruritogen-evoked itch (Huang et al 2018), but it was important to test whether silencing these cells would have the opposite effect. Our finding of overlap in function (i.e. that both NPY-INs and Dyn-INs suppress itch, and that both innervate GRPR cells) provides strong evidence against the idea that neurochemically-defined interneuron populations have highly specific functions, and we now state this in the Discussion. The anatomical experiments (which follow on from the functional studies) provide important new information concerning synaptic circuitry of the dorsal horn, by showing that NPY-INs preferentially innervate GRPR cells, and provide around twice as many synapses on these cells, compared to the Dyn-INs. Interestingly, this correlates with the relatively large optogenetically-evoked IPSCs that we saw when NPY-INs were activated, compared to those reported by Liu et al (2019) when galanin-expressing cells (which largely correspond to Dyn-INs) were activated. By including these findings in the paper, we are able to make comparisons between these two populations.
  
  3) While the authors provided convincing evidence that GRPR+ neurons serve as a downstream effector of NPY+ neuron evoked itch, the relationship between GRPR and NPY neurons in modulating pain is not examined. Therefore, Fig. 7B is pure speculation and should be removed.
  
  We feel that our recent findings that GRPR neurons correspond to vertical cells, that they respond to noxious stimuli, and that activating them results in pain-related behaviours, make it reasonable to speculate that the NPY/GRPR circuit may also be involved in the anti-nociceptive action of NPY cells. The legend for Fig 7B already refers to this as a "potential circuit", and we have toned down the corresponding part of the discussion to say that our findings "raise the possibility" that this is the case (lines 605-7). We feel that this part of the figure is important, as otherwise our summary diagram ignores some of the main findings of the paper, and we hope that this is now acceptable.
  
  Recommendations For The Authors
  
  1) Fig. 1G: the "misexpression" of tdTomato neurons was much more prominent in deep dorsal horn laminae but not in the superficial ones. Was this representative? Can the authors perform a laminae specific characterization?
  
  We did test for this possibility in 2 NPY::Cre;Ai9 mice that had received intraspinal injections of AAV.flex.GFP, and found that there was a modest difference - 62% of tdTomato+ cells in laminae I-II, but only 39% of those in lamina III, were GFP+. This suggests that "misexpression" may have differed slightly between these regions. However, since the difference was quite modest, and we were only able to analyse tissue from two mice in this way, we did not include these findings in the paper.
  
  2) I have a lot of problems interpreting the c-Fos data in Fig. 2 E and F. For the mCherry- population, how was the quantification performed? From the image, it does not look like 20-30% of cells express c-Fos; at a minimum a clear stain of neurons would be needed. Similarly, the identification of NPY cells is not particularly convincing (e.g., middle arrowhead lower 2 panels in C).
  
  We have provided further details on how the analysis was performed (changes made to lines 1016-29). NeuN staining was used to reveal all neurons, and a modified optical disector method was performed from somatotopically appropriate regions of the dorsal horn. As noted by the Reviewer, NeuN staining was required to allow identification of mCherry-negative cells. However, we have not included the NeuN immunoreactivity in the image, as this would add considerably to the complexity. These images are from single optical sections, and therefore the overall numbers of cells are low (in comparison to what would be seen in a projected image). The intensity of mCherry staining varied between cells. However, for all mCherry-positive cells (including the example referred to by the Reviewer), there was clear staining in the membrane, which could be followed in serial sections.
  
  3) Please add individual data points for all quantifications.
  
  These have been added.
  
  Reviewer 3 Recommendations For The Authors:
  
  1) It is somewhat surprising that there is no effect on CPP after activating spinal NPY neurons in neuropathic mice, given the almost complete rescue of hypersensitivity to baseline values in the nociceptive tests. Based on the methods, it appears that conditioning was carried out already 5 min after CNO injection. Yet, suppression of c-fos activity in excitatory spinal dh neurons was observed 30min after CNO injection. Also, it is not clear to me when CNO was injected prior to the nociceptive or CQ testing?
  
  Have the authors considered that conditioning from 5-35 min after CNO injection might be too short after CNO injection to achieve a profound analgetic effect?
  
  In a previous study (Polgár et al 2023), we had observed the timecourse of CNO-evoked itch and pain behaviours in mice in which GRPR cells expressed hM3Dq. We found that these started within 5 minutes of i.p. CNO injection (e.g. Fig S2 in that paper). In addition, the timecourses of action of gabapentin and CNO (both given i.p.) are likely to be similar, and there was a preference for the chamber paired with gabapentin. We are therefore confident that the conditioning period with CNO was adequate. We now explain this in the Methods section (lines 846-52). The timing of CNO injections for the nociceptive and CQ tests is now described (lines 749-55).
  
  2) The authors claim that tonic pain was not affected based on the conditioned place preference test. Efficacy in withdrawal response tests and in the CPP differ by more than duration of the stimulus. I'd suggest using more cautious wording here.
  
  We agree that caution is needed in interpreting the results of the CPP experiments. We have therefore replaced "does" with "may" in the Results section (line 336) and "did" with "may" in the Discussion (line 620).
  
  3) On page 9 the authors state "...suggesting that they suppress the transmission of pain- and itch-related information in the dorsal horn." However, pain is not affected in the loss of function experiments suggesting some qualitative differences in the role of the NPY neurons in itch and pain. This should also be reflected more clearly in this statement and in the discussion e.g. "suppress itch" and "can suppress pain".
  
  We accept the point made by the Reviewer. We have slightly altered the wording in lines 249-51 and 610 to reflect this.
  
Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.10.528013v2
www.biorxiv.org

New submission 17/06/2023, 18:53:19

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  In this study, the authors study the effect of dynactin disruption on kinetochore fiber (k-fiber) length in spindles of dividing cultured mammalian cells. Dynactin disruption is known to interfere with dynein function and hence spindle pole formation. The main findings are that poles are not required for correct average k-fiber length and that severed k-fibers can regrow to their correct length both in the presence and absence of poles by modulating their dynamic properties at both k-fiber ends. In the presence of poles, regrowth is faster and the variation between k-fiber lengths is smaller. This is a very interesting study with high-quality quantitative imaging data that provides important new insight into potential mechanisms of spindle scaling, extending in an original manner previous work on this topic in cultured cells and in Xenopus egg extract. The Discussion is interesting to read as several possible mechanisms for k-fiber length control are discussed. The technical quality of the study is very high, the experiments are very original, and most conclusions are well supported by the data. Especially, the experiments observing the regrowth of k-fibers after severing and the study of the dynamic properties of these k-fibers provide very novel insight. Addressing the following concerns could potentially improve the manuscript:
  
  We thank the reviewer for their fair, rigorous, and conceptually engaging remarks.
  
  (1) The phenotype generated here by disrupting dynactin via overexpressing p50 appears to be different from that caused by knocking down NuMA or dynein - as previously reported by the Dumont lab (Hueschen et al., 2019). In this study here, unfocused spindles are observed whereas earlier turbulent spindles were observed. This raises the question of whether dynein activity that contributes to pole focusing is really completely inhibited here. These discrepancies in phenotypes seem to deserve an explanation. Is k-fiber length in cultured mammalian cells only maintained in the case of this specific type of inhibition?
  
  We thank the reviewer for the important point about the different phenotypes observed in different dynein inhibition conditions and we refer them to our response to Essential Revision #1. In summary, we believe that different dynein inhibition phenotypes are similar. Unfocused spindles appear turbulent on longer timescales and appear to reach a steady-state on shorter timescales. The amount of pole-unfocusing also seems to correspond to the severity of dynein inhibition (Figure 1—figure supplement 1). We have chosen to study inhibited spindles that were steady-state and unfocused. We have added this discussion in line 129 as well as better characterized our system of dynein inhibition by adding two new figures (Figure 1—figure supplement 1, Figure 1—figure supplement 3).
  
  Furthermore, we address the question of whether dynein might still be responsible for length regulation despite poles being unfocused in line 433 of the Discussion: “recent work has revealed that mammalian spindles can achieve similar architecture whether or not dynein (or its recruiter NuMA) is knocked out (Neahring et al., 2021). This suggests that the severe defects in spindle coordination (Figure 1, Figure 5) and maintenance (Figure 2) observed in p50-unfocused spindles are more likely due to the loss of spindle poles than due to the loss of dynein activity per se.”
  
  We have additionally overexpressed p50 in human RPE1 cells and observed qualitatively similarly unfocused yet generally bi-oriented spindles as in rat kangaroo PtK2 cells, showing that the formation of unfocused spindles in PtK2 is not an artifact unique to that cell line (see newly added Figure 1—figure supplement 3). However, these unfocused RPE1 spindles did not have clear, resolvable k-fibers as in PtK2, so length was not quantified. The only method we are aware of that robustly unfocuses poles in PtK2 spindles is p50 overexpression.
  
  (2) p50 addition and also p150-cc1 addition was often used in Xenopus egg extract in order to inhibit dynein function. Considerably larger concentrations of p50 than p150-cc1 needed to be used. Can the authors estimate the level of overexpression of p50 in the cells they study? It seems that could be possible given that a mCherry fusion protein can be overexpressed. Was it necessary to select cells with a particular level of mCherry-p50 overexpression to observe the reported phenotypes?
  
  We thank the reviewers for the suggestion to quantify p50 expression and have added Figure 1—figure supplement 1. Due to gradual red laser power loss over months, data from a single day were plotted for proper comparison, but trends were always consistent within any given day. As discussed above, we observed that higher levels of mean p50 intensity corresponded to unfocused spindles. We have clarified that we chose to study these highly overexpressing unfocused spindles in the text and methods, and we speculate that level of p50 overexpression correlates with amount of dynein inhibition and subsequent pole-unfocusing. This is also consistent with the higher concentrations of p50 needed to inhibit dynein in Xenopus.
  
  (3) Some comparison to previous experiments using p50 and p150-cc1 addition to Xenopus egg extract spindles could put this study better into the context of the available literature. It seems from previous publications that the p50 addition produced short, unfocused, barrel-shaped spindles, indicating that spindle length is maintained without poles, whereas the p150-cc1 addition produced elongating spindles (e.g. Gaetz & Kapoor, 2004).
  
  We appreciate the reviewer’s discussion of dynein inhibition in the Xenopus context.
  
  While Xenopus has been used to study spindle size regulation, it has not been as useful to study k-fiber length regulation, which we focus on. Xenopus spindles have a different architecture, with k-fibers that are not discrete and continuous like in mammalian spindles. Indeed, while p50 and p150-CC1 overexpression alter spindle length in Xenopus, they do not have the same effect in mammalian spindles. Additionally, p150-CC1 does not robustly unfocus poles in mammalian spindles as it does in Xenopus; instead, it leads to an inconsistent variety of spindle disorganization phenotypes with frequently focused poles in PtK2 (data not shown). We speculate that this variety of spindle phenotypes arises from a different mechanism of dynein inhibition that does not fully target pole-focusing.
  
  However, we agree that referencing prior Xenopus work establishes important context and precedent. In line 95 of the Introduction, we state “…inhibiting dynein unfocuses poles but spindles still form albeit with altered lengths in Drosophila (Goshima et al., 2005) and Xenopus (Gaetz and Kapoor, 2004; Heald et al., 1996; Merdes et al., 1996), and without a clear effect on mammalian spindle length (Guild et al., 2017; Howell et al., 2001),” addressing the different effects of dynein inhibition in Xenopus compared to mammalian spindles. We have also added direct mentions of p50 in Xenopus in line 129 (see Essential Revision #1 response).
  
  Finally, we have added a figure showing overexpression of p50 in human RPE1 cells to show reproducibility of pole unfocusing across other mammalian cell types (see newly added Figure 1—figure supplement 3).
  
  (4) In this context, it seems that some more explanation is required for the observations presented in Fig. 1D and 1E. It appears that spindle length and k-fiber length have been measured quite differently. Not much information is provided for how spindle length was defined and measured (please expand this part of the Methods). Could the two different methods of measurement be the reason for the mean k-fiber length remaining unaltered in dynactin-disrupted spindles, whereas the spindle length increases in these cells? If not, do non-k-fiber microtubules contribute to unfocused spindles being longer or are chromosomes not aligned in the metaphase plate causing the increase in spindle length by misalignment of k-fiber sister pairs?
  
  We thank the reviewers for pointing out the lack of clarity in Figures 1D and 1E. We have expanded and clarified the Methods section describing how spindle axes were measured and how k-fiber lengths were measured, as well as included examples and cartoons to illustrate them (see newly added Figure 1—figure supplement 4).
  
  To clarify, we did not intend to directly measure spindle length, but we did approximate the size of each spindle’s “footprint” in Figure 1D as well as measure individual k-fiber length in Figure 1E. It is now clarified in the Methods line 898 as “Spindle minor and major axes lengths were determined by cropping, rotating, then thresholding spindle images with the Otsu filter using SciKit. Ellipses were fitted to thresholded spindles to approximate the length of their major and minor axes using SciKit’s region properties measurement (Figure 1—figure supplement 4A). In control spindles, the major axis corresponded to spindle length along the pole-to-pole axis, and the minor axis corresponded to spindle width along the metaphase plate axis. However, unfocused spindles were disorganized along both axes to the extent where the minor axis did not always correspond to the metaphase plate axis. Thus, Figure 1D reports “spindle minor axis length” and “spindle major axis length” rather than “spindle width” and “spindle length”. Furthermore, it is worth noting that in unfocused spindles, spindle length is decoupled from k-fiber length because of k-fiber disorganization along both axes. Thus, spindle length was not measured in unfocused spindles...”
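
  The measurement described in the quoted Methods (Otsu thresholding followed by an ellipse fit) can be illustrated with a minimal pure-Python sketch. This is not the authors' script: the function names otsu_threshold and ellipse_axes are hypothetical, the image is a toy array, and the ellipse is assumed to be the one sharing the mask's second central moments (axis length = 4 × the square root of each eigenvalue), which is one common convention for region-property measurements.

```python
import math

def otsu_threshold(pixels, levels=256):
    """Otsu threshold for a flat list of integer intensities in [0, levels)."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_bg, sum_bg = 0, 0
    for t in range(levels):
        w_bg += hist[t]               # pixels at or below t form the background
        if w_bg == 0:
            continue
        w_fg = total - w_bg
        if w_fg == 0:
            break
        sum_bg += t * hist[t]
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:    # keep the first maximum of between-class variance
            best_var, best_t = var_between, t
    return best_t

def ellipse_axes(mask):
    """Major and minor axis lengths of the ellipse matching the mask's second moments."""
    coords = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    n = len(coords)
    mean_r = sum(r for r, _ in coords) / n
    mean_c = sum(c for _, c in coords) / n
    s_rr = sum((r - mean_r) ** 2 for r, _ in coords) / n
    s_cc = sum((c - mean_c) ** 2 for _, c in coords) / n
    s_rc = sum((r - mean_r) * (c - mean_c) for r, c in coords) / n
    # Eigenvalues of the 2x2 covariance matrix of foreground pixel coordinates.
    half_trace = (s_rr + s_cc) / 2
    root = math.sqrt(((s_rr - s_cc) / 2) ** 2 + s_rc ** 2)
    lam_major = half_trace + root
    lam_minor = max(half_trace - root, 0.0)
    return 4 * math.sqrt(lam_major), 4 * math.sqrt(lam_minor)

# Toy "spindle": a bright 4 x 20 block on a dim 20 x 40 background.
image = [[10] * 40 for _ in range(20)]
for r in range(8, 12):
    for c in range(10, 30):
        image[r][c] = 200

t = otsu_threshold([p for row in image for p in row])
mask = [[p > t for p in row] for row in image]
major, minor = ellipse_axes(mask)
print(f"threshold={t}, major axis={major:.2f}, minor axis={minor:.2f}")
```

  Because the fitted axes describe the footprint of the thresholded region rather than any pole-to-pole distance, a sketch like this also shows why the axes are reported as "major" and "minor" rather than as spindle length and width.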
  
  We additionally removed the potentially confusing terminology of “wider” and “longer” in the Results section to make clear that we are approximating spindle size, not spindle length and width, and we now state in line 168, “k-fibers were more spread out in the cell, with spindles covering a larger area compared to control along both its major and minor axes (Figure 1D).”
  
  We believe our clarification and expansion of the Methods section, as well as inclusion of a new supplementary figure and cartoon address the reviewer’s points, and we thank them for pointing out the lack of clarity.
  
  (5) It seems that in the Discussion it is implied that k-fibers can respond to severing in both focused and unfocused spindles by modulating their dynamics at both ends of the k-fibers, but in the Results section the wording is more cautious because the difference in 'flux' between severed and unsevered unfocused spindles is not significant (Fig. 4D, blue data). It appears indeed that there is also a difference in flux between severed and unsevered unfocused spindles, but the number of data points is too small. Depending on how difficult these experiments are, it could be worth increasing the size of the data set to come to a clear conclusion, given that the data shown in Figs. 3 and 4 are quite remarkable and form the core of the study.
  
  We appreciate the reviewer’s close reading and pertinent suggestions.
  
  As detailed in our response to Essential Revision #3, we did not increase the sample size for unfocused spindles since it would not be reasonably feasible to show significant differences in flux. However, we performed more ablations and photomarking in control spindles as detailed in our response to this reviewer’s point 6 below, a different but related point.
  
  (6) Can the authors exclude that the stopping of 'flux' at minus ends after severing is due to some sort of permanent damage induced by ablation? In other words, do severed spindles begin to flux again once they have regrown to their original length?
  
  We thank the reviewer for their important points.
  
  We have addressed this question in the newly added Figure 4—figure supplement 1 as described in our response to Essential Revision #3 to show that flux resumes after length recovery. In summary, we observed no adverse effects of ablation on k-fiber minus-ends. Severed k-fibers have restored lengths and minus-end dynamics several minutes after ablation.
  
  (7) To this reader, the conceptualization of distinguishing between 'global' and 'local' effects/behavior was a little confusing, both in the title and also later in the text. The concept of 'local' regulation of k-fiber length appears to contradict the observation that k-fiber length can be regained after severing by changes in the dynamics at both ends (so at two very different locations) which is a rather remarkable finding. Maybe distinguishing between 'individual' and 'collective' k-fiber behavior could be clearer.
  
  We appreciate the reviewer’s consideration of terminology. We have addressed this by clearly defining our use of ‘local’ to refer to individual k-fibers as a unit where appropriate in the text (lines 271, 449). We chose these terms since they can help describe individual versus collective properties, while simultaneously emphasizing the aspects of global architecture and spatial organization in the spindle.
  
  (8) Can the authors exclude that some of the differences between unfocused and focused spindles could be due to altered dynein activity at kinetochores? Or due to the dynein-dependent accumulation of certain spindle proteins along microtubules towards the minus ends of k-fibers or other spindle microtubules, instead of being due to only the presence versus absence of poles? Could this be tested by ablating both poles? If this is too challenging, a discussion of these possibilities could be justified.
  
  We appreciate the reviewer’s consideration of kinetochore activity as well as other methods of removing poles. However, p50 overexpression is currently the only method to robustly unfocus spindles in PtK2 cells – ablating poles or removing pole-associated structures such as centrosomes does not abolish pole-focusing in this system (Khodjakov et al., 2000). Furthermore, we now discuss the possibility that altered dynein activity (such as activity at kinetochores) may give rise to the phenotypes we describe in our work in line 433: “…recent work has revealed that mammalian spindles can achieve similar architecture whether or not dynein (or its recruiter NuMA) is knocked out (Neahring et al., 2021). This suggests that the severe defects in spindle coordination (Figure 1, Figure 5) and maintenance (Figure 2) observed in p50-unfocused spindles are more likely due to the loss of spindle poles than due to the loss of dynein activity per se. Though we cannot exclude it, this also suggests that the findings we make in unfocused spindles are not due to changes in activity of the dynein population at kinetochores.”
  
  Reviewer #2 (Public Review):
  
  The mitotic spindle of eukaryotic cells is a microtubule-based assembly responsible for chromosome segregation during cell division. For a given cell type, the steady-state size and shape of this structure are remarkably consistent. How this morphologic consistency is achieved, particularly when one considers the complex interplay between dynamic microtubules, spatial and temporal regulation of microtubule nucleation, and the activities of several microtubule-based motor proteins, remains a fundamental unanswered question in cell biology. In this work by Richter et al., the authors use biochemical and biophysical perturbations to explore the feedback between mitotic spindle shape and the dynamics of one of its main structural elements, kinetochore fibers (k-fibers) - bundles of microtubules that extend from kinetochores to spindle poles. Overexpression of the p50 dynactin subunit in mammalian tissue culture cells (Ptk2) was used to inhibit the microtubule motor cytoplasmic dynein resulting in misshapen spindles with unfocused poles. Measurements of k-fiber lengths in control and unfocused conditions showed that although mean k-fiber length was not statistically different, the variation of length was significantly higher in unfocused spindles, suggesting that k-fiber length is set locally, occurring in the absence of focused poles. With a clever combination of live-cell imaging with photoablation and/or photobleaching of fluorescently-labeled k-fibers, the authors went on to explore the mechanistic bases of this length regulation. K-fiber regrowth following ablation occurred in both conditions, albeit more slowly in unfocused spindles. Paired ablation and localized photobleaching on the same k-fiber revealed that microtubule dynamics, specifically those at the plus-end, can be tuned at the level of individual k-fiber. Lastly, the authors show that chromosome segregation is severely impaired when cells with unfocused spindles are forced to enter mitosis. 
  
  The work's biggest strength is the application of an innovative experimental approach to address thoughtful and well-articulated hypotheses and predictions. Conclusions stemming from the experiments are generally well-supported, though the experiments addressing the "tuning" of k-fiber dynamics could be bolstered by additional data points and perhaps better presented. The manuscript would also benefit from the inclusion of some investigation of spatial differences in the observed effects as well as the molecular and biophysical basis of the observed feedback between k-fiber length and focused poles.
  
  We appreciate the reviewer providing pertinent, rigorous, and intellectually astute suggestions.
  
  Comments/Concerns/Questions:
  
  1) In the discussion, the authors acknowledge that the changes in spindle morphology resulting from p50 overexpression are likely also causing changes in the well-characterized RanGTP/SAF gradients that radiate from chromosome surfaces. Why did the authors not include an analysis of k-fiber length as a function of positioning within the spindle? The inclusion of this data would not require more experimentation and could be added as a plot showing k-fiber length versus distance from the geometric center of the spindle (defined by the intersection of the major and minor axes perhaps?).
  
  We thank the reviewer for this pertinent suggestion and refer them to our response to Essential Revision #2. Briefly, we have added the recommended analyses to Figure 1—figure supplement 6 by correlating k-fiber length to position along the spindle’s longitudinal and latitudinal axes.
  
  2) The authors also acknowledge the established relationship between MT length and MT end dynamics, yet in their ablation studies, the average initial k-fiber length at ablation in control spindles was higher than that for k-fibers in unfocused spindles. It seems that this difference makes the interpretation of the data, particularly the conclusion that fiber growth rates differ due to the absence of focused poles, a bit tenuous. To address this, the authors should consider including plots of grow-back rates versus k-fiber length (again, this should not require additional experiments, just more analysis).
  
  We thank the reviewer for their critical thinking about experiments. We would like to clarify to the reviewer that initial k-fiber lengths within unfocused spindles preceding ablation were not actually shorter on average compared to the average length of control k-fibers from Figure 1E (Figure 2—figure supplement 1). We apologize that this unexpected artifact was not clear in the text and have now reworded line 232 to be more straightforward: “Mean k-fiber lengths in unfocused spindles before ablation appeared to be shorter (Figure 2D); however, this was due to not capturing the full length of k-fibers in a single z-plane while imaging ablated k-fibers. Indeed, length analysis of full z-stacks from unfocused spindles before ablation yielded an indistinguishable mean k-fiber length compared to control k-fibers in Figure 1E (Figure 2—figure supplement 1). Thus, ablated k-fibers were compared to their unablated neighbors as internal controls.”
  
  We believe that this language clearly calls out the perceived inconsistency, and that our use of internal controls overcomes this confounding factor to make meaningful conclusions. We address the relationship of k-fiber length and growth rate in our response to Essential Revision #2. We are not including it in the manuscript based on our inability to make any meaningful conclusion to either support or exclude the possibility of length-dependent growth rates.
  
  3) As presented, the data shown in Figure 4 is confusing and does not seem very compelling. The relationship between the kymographs and time series is unclear as is the relationship between the dashed lines in the kymographs and the triangles and the plots in the 4B time series and 4C, respectively. Furthermore, it's not always clear what the triangles are pointing to (e.g. in the unfocused condition time series). The authors might want to consider reworking this figure and providing more measurements of flux following ablation in both the control and unfocused conditions. Lastly, the authors should clarify what negative displacement means.
  
  We apologize for the unclear figure annotations and thank the reviewers for their suggestions. As discussed in our response to Essential Revision #3, we believe we have improved the clarity and presentation of figures and kymographs. More measurements of flux after ablation in unfocused spindles were not feasible as discussed; however, we have performed these measurements in control spindles and added Figure 4—figure supplement 1 to strengthen conclusions about turning flux off/on after ablation.
  
  We have additionally clarified axis titles by replacing “negative displacement” with the more intuitive descriptor “photomark position relative to minus-end” and clearly defining it in the figure legends in line 565 as follows: “Figure 3 […] (D) Minus-end dynamics, where photomark position over time describes how the mark approaches the k-fiber’s minus-end over time in control and unfocused k-fibers.”
  
  We thank the reviewers for their suggestions to improve clarity and bolster our conclusions.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.11.26.517738v1
www.medrxiv.org www.medrxiv.org

New submission 17/06/2023, 18:49:55

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  We thank the Editor for his assessment. We agree that the data we present in this manuscript can be a starting point for more in-depth analysis. We are currently developing a mathematical model of HIV transmission dynamics; we plan to use the data that we present in this paper as parameter values.
  
  Reviewer #1 (Public Review):
  
  One aim of this paper was to study historical migration from Botswana during the time of the development of the HIV epidemic. The second aim was to test whether the migration networks impacted the development of the epidemic. The first aim was achieved: this paper used historical census data in a clear way to describe the characteristics of migration in the country at four points in time, from 1981 to 2011. Very detailed data are presented in clear ways, using network chord diagrams, sharing age- and sex-specific migration rates, and urban-rural classifications. However, data was not presented to achieve the second aim. The authors reviewed some important literature about migration and HIV. They suggested that the migration patterns, such as from specific mining towns and mostly between districts, could have been important in supporting the generalized spread of HIV. But without evidence linking HIV prevalence over time in the linked districts in Botswana, this aim was not supported.
  
  We have now made it clear that we are not testing whether the migration networks impacted the development of Botswana’s HIV epidemic: this is what the Reviewer describes as the second aim of our paper. We have only one aim: to test the hypothesis that, during the development of Botswana’s HIV epidemic, the population was extremely mobile and highly connected through migratory flows and counter-flows. This is based on the fact that these conditions are necessary for the development of a generalized HIV epidemic. However – previous to our analysis – these conditions have not been shown to occur during the development of a generalized HIV epidemic. Given that our results support our mobility hypothesis (i.e., that the population was very mobile and essentially all the districts were connected throughout the country), in the discussion (lines 338-362) we describe how the migration networks that we have identified may have impacted the development of the generalized hyperendemic HIV epidemic in Botswana. We have also clarified that our study has only one hypothesis that we are testing by referring to this single hypothesis as the mobility hypothesis (Abstract: lines 25-29).
  
  One other limitation of the paper was that very little context, outside of migration rates, was provided. Is there any additional information about economic growth, or political events for example, that could clarify or add context to these migration flows? As it stands now, these analyses are quite basic and don't take into account underlying demographic, economic, or political trends.
  
  In response to this concern we have expanded the text in the introduction to provide more context regarding political, demographic and economic factors (Introduction: lines 66-75). We have also expanded our discussion of the implications of our results (and of additional results that we have included: lines 263-283) for understanding the role of internal migration in urbanization in Botswana (Discussion: lines 379-420); urbanization occurred simultaneously with the development of Botswana’s generalized hyperendemic HIV epidemic.
  
  The data presented in this paper has potential impact. As the paper stands now, it could be quite useful for future work when linked to additional data sources on HIV prevalence over time (or other questions that could have been influenced by migration patterns).
  
  We thank this Reviewer for their helpful comments.
  
  Reviewer #2 (Public Review):
  
  To provide context into the HIV epidemic in Botswana over the latter half of the 20th century and the beginning of the 21st, the authors have analyzed micro census data to examine patterns of migration. They use this dataset to show how patterns between urban and rural areas have changed over several decades, and the demographic characteristics of migrants. The dataset used for this study is a very reliable source, and the insights in terms of migration patterns are interesting. The primary weakness of the analyses regards the link to HIV transmission: micro-census data only examine mobility that leads to individuals changing residence for longer periods of time, without accounting for shorter-term trips that may also lead to HIV transmission, such as seasonal migration or short trips. This is likely less of an issue with HIV than other diseases, however, due to its transmission often involving new sexual partners, which will generally be less likely to occur during short trips. Broadly, however, this is an interesting report on the migration patterns during a critical period for HIV transmission nationwide.
  
  We thank the Reviewer for their comments.
  
  In our current manuscript, we have discussed the potential impact of mobility on Botswana’s HIV epidemic, and focused on migration, i.e., one directional movement in terms of a permanent re-location of residency. This type of migration, by changing an individual’s sexual network and social environment, has been shown to increase the risk of acquiring HIV for both women and men. Short-term mobility (e.g., short-term circular migration, where the trip can range in duration from overnight to an entire season) can also affect HIV transmission dynamics. Circular migrants have been shown to have an increased risk of both acquiring and transmitting HIV. The greater the number of trips and/or the duration of the trip, the greater the risk. We note that both migration and short-term mobility are important, and their relative importance to each other is likely to evolve over time as a generalized HIV epidemic diffuses through the population. Their relative importance is also likely to vary amongst countries in sub-Saharan Africa.
  
  We have added all of the previous paragraph, with citations, to the text (Discussion: lines 364-377).
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2023.02.01.23285339v1
www.biorxiv.org www.biorxiv.org

New submission 17/06/2023, 18:21:03

1
1. Public_Reviews 22 Jun 2023
 
 in eLife
 
 Author Response
 
 Reviewer #2 (Public Review):
 
 Root growth is driven by cell elongation, and its local control allows roots to navigate the complex soil environment. Cell growth is driven by the relaxation of the cell wall, a process requiring a drop in pH. Auxin is a key regulator of root development that inhibits root growth. Auxin effects on proton dynamics are complex: it can promote both acidification and alkalinization of the extracellular space through different signaling modules, some only recently uncovered. Serre et al. report on using a new dye to monitor extracellular pH in the region surrounding the Arabidopsis thaliana root. Their manuscript aims to clarify the relationships between pH around the root, proton flux, auxin, cell elongation, and root growth with this tool. They show a typical zonation of pH values along the root: a more acidic domain corresponding to the transit-amplifying compartment, followed by a more alkaline one at the transition and early elongation zones and a more acidic one in the late elongation/root hair zone. This zonation is in agreement with previous reports obtained by other methods. A particularly puzzling aspect is the origin of the more alkaline domain. Serre et al. present evidence supporting the involvement of the AUX1-AFB1-CNGC14 module for the emergence of this more alkaline domain and how it can contribute to the ability of the root to navigate its environment.
 
 Serre et al. show that the more alkaline domain in the transition zone is not directly determined by the activity or localization of the AHA proton pumps but rather by the auxin influx carrier AUX1. They show that the components of the rapid auxin response pathway, in particular, the auxin co-receptor AFB1 and the calcium channel CNGC14, contribute to the emergence of this more alkaline domain. Finally, they show that mutants in these two genes, impaired in the rapid auxin response pathway, show less efficient navigation of the root tip.
 
 The manuscript is clear and well-written. The logic is sound, and the conclusions are supported by the data.
 
 The new dye appears as a promising tool for monitoring the pH in the rhizosphere with advantages over the previous ones. Yet, as pointed out by the authors in the discussion, it reports on pH at the organ scale in the region around the root, not in the apoplast or the cell wall, which can eventually complexify the elaboration of a mechanistic model joining auxin, proton efflux, cell wall properties, cell elongation, and root growth. Although several of the findings confirm previous reports, the manuscript brings novelty by demonstrating the involvement of the rapid auxin response. I am overall supportive of the manuscript. Yet, several points should be addressed:
 
 The presentation of the more acidic and alkaline domains could be easier to visualize.
 
 The authors refer to acidic and alkaline domains but do not report on absolute pH values; they monitor the emission ratio of the dye. They justify why to use relative pH value in the discussion and refer there to internal controls that are not clearly defined. In my opinion, the wording should be more consistent across the text and figures and refer to more acidic and more alkaline domains rather than acidic (pH<7) and alkaline (pH>7) domains.
 
 The data related to the unaltered distribution of AHA using antibody staining should be backed up.
 
 The way the pH profile and the statistical analyses are presented should be improved.
 
 The authors should test the effect of extracellular auxin perception (tmk, abp) mutants on pH zonation.
 
 The conclusions could be strengthened by moving several pieces of data currently in supplemental material to the main text.
 
 We agree with the comment on the definition of ‘acidic’ and ‘alkaline’ domains; we altered the text and explained in the first part of the results that we observe ‘relatively alkaline’ and ‘relatively acidic’ domains in comparison to the medium pH.
 
 We defined the ‘internal controls’ in the text – by this we mean mock treated or wild type plants imaged together with the treated or mutant plants.
 
 To address the role of the apoplastic auxin pathway in the root surface pH, we analyzed the tmk1, tmk4 and abp1 mutants. Surprisingly, all three mutants appear indistinguishable from the controls, showing the crucial importance of the cytoplasmic AFB1 auxin perception pathway. We have included the data as Fig.S4-1.
 
 AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.11.23.517700v1
www.biorxiv.org www.biorxiv.org

New submission 17/06/2023, 18:16:27

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This paper studies color vision in anemonefish. The central conclusion of the paper is that anemonefish use signals from their UV cones to discriminate colors that would not otherwise be distinguishable; this differs from other fish in which UV cones extend the range of wavelengths of sensitivity but do not add a dimension to color vision. The work fits into a rich history of studies investigating how color vision fits into an animal's ecological niche. My primary concerns regard the microspectrophotometry data from single cones and some aspects of the presentation of the behavioral data.
  
  Microspectrophotometry
  
  The spectral properties of the cone types are a key issue for interpreting the results. These were measured using MSP, and fits are shown in Figure 2. The raw data shown in Fig. S1 appears more complicated than indicated in the main text. The templates miss the measurements across broad wavelength bands in each cone type. Particularly concerning is the high UV absorbance across cone types and the long-wavelength absorbance in the UV cone. It is not clear how this picture supports the relatively simple description of cone types and spectral sensitivities given in the main text and which forms the basis of the modeling.
  
  Microspectrophotometry is an inherently noise-prone measurement technique, particularly for very small photoreceptor outer segments such as those of single cones, which are also difficult to detect as intact, isolated (nonoverlapping) cells. As such, the absorbance curve fitting and derived lambda max (λmax) values should be treated as estimates. The accuracy of these estimates is adequate for this type of study, and visual modelling results have been shown to be robust against small errors (±10 nm λmax) in photoreceptor sensitivity for multiple species [see Lind, O. & Kelber, A. (2009). Vis Res. 49(15), 1939-1947; and Bitton, PP. et al. (2017). PLOS ONE, 12: e0169810]. We consider it highly unlikely that small shifts in cone λmax from measurement error would make a meaningful difference to the colour discrimination thresholds.
  
  It should be noted that the raw data shown in the original Supplementary Figure 1 included all scans overlain with an average absorbance curve for presentation purposes; however, the actual lambda max values for different cone types were measured and then averaged among individual scans fitted with photopigment absorbance curve templates. For clarity and transparency, we have now provided three multi-panel plots (see Figure 1 – figure supplements 1-3) showing the individual pre- and post-bleach scans of absorbance spectra, fitted absorbance curve templates, and R² values from the best visual pigment template fit.
  
  It is worth noting that most of the cone absorbance spectra found in our study closely resemble, in λmax and quality, those measured in another anemonefish species (Amphiprion akindynos) [see Supplementary Figure 1 in Stieb S. et al. (2019). Sci Rep. 9, 16459]. These cone λmax values can also be reconciled with previous estimates of opsin λmax based on amino acid sequences and cone opsin expression in the A. ocellaris retina characterised in Mitchell LJ et al. (2021). GBE, 13: evab184.
  
  Evidence that the unusual long-wavelength absorbance detected in a couple of the single cone (pre-bleach) measurements was not visual pigment in origin comes from post-bleach scans, in which it persisted (i.e., did not show a photobleaching response); it was likely instead due to contaminants (e.g., blood, RPE pigment). UV absorbance in some of the double cone measurements (above that expected of the pre-bleach beta peak from chromophore spectral absorption) can be attributed to noise from scans, as is quite typical of MSP, and/or partial (accidental) bleaching from stray light sources. Although utmost care was taken to minimise contamination and unintended bleaching, they are sometimes unavoidable.
  
  We refer the Reviewer to multiple published studies for further examples of typical MSP measurements that share similar levels of noise to ours e.g., see Figure 1 in Knott B. et al. (2013). JEB, 216:4454-4461; Figure 3 in Schott, RK et al. (2015). PNAS, 113(2): 356-361; Figure 2 in Dalton BE et al. (2014). Proc R Soc B. 281; Figure 5 in Tosetto, JE et al. (2021). Brain Behav Evol. 96: 103-123.
  
  Presentation
  
  The results are not presented in a straightforward way - at least for this reviewer. What is missing for me is a clear link between the psychometric curves in Figure 3A and the discrimination thresholds indicated in Figure 3B and Figure 4. Figure 3A is only discussed in the text on line 289 - after Figure 4 has been introduced and discussed. It would have been very helpful for me if the psychometric curves were first introduced and described, then the relation to Figure 3B was clearly indicated (perhaps with a single psychometric curve as an example). Similarly for Figure 4 the relationship between specific psychometric curves and the threshold plotted would be quite helpful. Currently it takes a careful reading to understand why being below the dashed line in Figure 4 is important.
  
  We have made the following changes: we now introduce the psychometric curves earlier in the results (lines 236-249) and have moved the psychometric function comparison before the mention of Figure 4. Additionally, to make the association between the plotted colour loci and psychometric curves clearer, we have added a smaller psychometric curve plot adjacent to the colour space (in Figure 3B), using red as an example, with an averaged psychometric curve overlying the individual fish curves. The figure caption (lines 250-274) explains that the plotted colour loci and given thresholds are mean values calculated from the individual fish behavioural data.
  
  We have also added a brief reminder that the theoretical limit of colour discrimination is predicted by the RNL model as 1∆S, where in our task fish should be just able to distinguish targets from grey distractors (see lines 222-224). To clarify, the plotted values in Figure 4B are both the individual fish thresholds (points) and average threshold (black bar) per colour set. The individual threshold values are taken at a correct choice probability of 50% from fitted psychometric curves of fish behavioural performance (shown in Figure 3A).
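
  The link between a fitted psychometric curve and a single threshold value can be sketched in a few lines of Python. This is a minimal illustration only, not the analysis code used in the study: the logistic form, the `guess` (chance-level) parameter, the coarse grid search and all function names are assumptions introduced here for clarity.

```python
import math

def psychometric(delta_s, midpoint, slope, guess=0.0):
    """Logistic curve rising from `guess` towards 1 as delta_s increases."""
    return guess + (1.0 - guess) / (1.0 + math.exp(-slope * (delta_s - midpoint)))

def fit_curve(delta_s_values, prop_correct, guess=0.0):
    """Coarse least-squares grid search over (midpoint, slope).

    Hypothetical helper: a real analysis would use a proper optimiser
    and a binomial likelihood rather than squared error.
    """
    best = None
    for i in range(61):              # candidate midpoints 0.00 .. 3.00
        midpoint = i * 0.05
        for j in range(1, 21):       # candidate slopes 0.5 .. 10.0
            slope = j * 0.5
            sse = sum((psychometric(x, midpoint, slope, guess) - p) ** 2
                      for x, p in zip(delta_s_values, prop_correct))
            if best is None or sse < best[0]:
                best = (sse, midpoint, slope)
    return best[1], best[2]

def threshold_at(p_correct, midpoint, slope, guess=0.0):
    """Invert the curve: the delta_s at which P(correct) equals p_correct."""
    if not guess < p_correct < 1.0:
        raise ValueError("p_correct must lie strictly between guess and 1")
    odds = (p_correct - guess) / (1.0 - p_correct)
    return midpoint + math.log(odds) / slope
```

  Under these assumptions, one fish's threshold for one colour set is obtained by fitting the curve to its proportion of correct choices at each ∆S step and then calling `threshold_at(0.5, midpoint, slope)`; averaging those values across fish gives the mean threshold plotted per colour set.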
  
  RNL model
  
  The data is fit and interpreted in the context of the receptor noise limited model. The paragraph in the discussion about complementary color pairs suggests that this model is incorrect (text around line 332). Consideration of how the results depend on the RNL model is important, especially given the interpretation here.
  
  The inability of the RNL model to account for the observed asymmetry between color discrimination thresholds implies that they cannot be solely attributed to photoreceptor noise. We can therefore infer from the asymmetry that thresholds are set by a higher-level process, whether that involves post-receptor processes within the inner retina or in the brain remains to be investigated. As explained in lines 396-397 one possibility is that activation of the UV receptor suppresses noise in the visual pathway or enhances the saliency of colors for anemonefish. The high sensitivity to violet-green, which was found in all six of the fish tested, is consistent with the heightened saliency of this color (lines 397-399).
  
  Figure 3B
  
  This is the key figure in the paper. But several issues make seeing the data in this figure difficult. First, the important part of the figure is buried near the origin and hard to see. Can you show a surface that connects the thresholds in the different chromatic directions, or otherwise highlight the regions of discriminable and not discriminable colors?
  
  See previous comment. In short, we have taken the advice of the Reviewer and added highlighted areas around the regions of discriminable colors in Figure 3B to help visually separate them from the non-discriminable regions of colors (from grey). Additionally, we have added an inset showing an enlarged image of the area surrounding the centre of colour space.
  
  Reviewer #2 (Public Review):
  
  Mitchell and colleagues examined the contribution of a UV-sensitive cone photoreceptor to chromatic detection in Amphiprion ocellaris, a type of anemonefish. First, they used biophysical measurements to characterize the response properties of the retinal receptors, which come in four spectrally-distinct subtypes: UV, M1, M2, and L. They then used these spectral sensitivities to construct a 4-dimensional (tetrahedral) color space in which stimuli with known spectral power distributions can be represented according to the responses they elicit in the four cone types. A novel five-LED display was used to test the fish's ability to detect "chromatic" modulations in this color space against a background of random-intensity, "achromatic" distractors that produce roughly equal relative responses in the four cone types. A subset of stimuli, defined by their high positive UV contrast, were more readily detected than other colors that contained less UV information. A well-established model was used to link calculated receptor responses to behavioral thresholds. This framework also enabled statistical comparisons between models with varying number of cone types contributing to discrimination performance, allowing inferences to be drawn about the dimensionality of color vision in anemonefish.
  
  The authors make a compelling case for how UV light in the anemonefish habitat is likely an important ecological source of information for guiding their behavior. The authors are to be commended for developing an elegant behavioral paradigm to assess visual performance and for incorporating a novel display device especially suited to addressing hypotheses about the role of UV light in color perception. While the data are suggestive of behavioral tetrachromacy in anemonefish, there are some aspects of the study that warrant additional consideration:
  
  1) One challenge faced by many biological imaging systems is longitudinal chromatic aberration (LCA) - that is, the focal power of the system depends on wavelength. In general, focal power increases with decreasing wavelength, such that shorter wavelengths tend to focus in front of longer wavelengths. In the human eye, at least, this focal power changes nonlinearly with wavelength, with the steepest changes occurring in the shorter part of the visible spectrum (Atchison & Smith, 2005). In the fish eye, where the visible spectrum extends to even shorter wavelengths, it seems plausible that a considerable amount of LCA may exist, which could in turn cause UV-enriched stimuli to be more salient (relative to the distractor pixels) due to differences in perceived focus rather than due solely to differences in their respective spectral compositions. Such a mechanism has been proposed by Stubbs & Stubbs (2016) as a means for supporting "color vision" in monochromatic cephalopods (but see Gagnon et al. 2016). It would be worth discussing what is known about the dispersive properties of the crystalline lens in A. ocellaris (or similar species), and whether optical factors could produce sufficient cues in the retinal image that might explain aspects of the behavioral data presented in the current study.
  
  This is an interesting point, and we appreciate the reviewer’s thoughtful comment regarding this topic especially as LCA increases exponentially in the UV. Although we certainly cannot disprove such a mechanism in the present study, we are highly sceptical that LCA could be used by reef fish and is involved in the heightened saliency of UV stimuli. Previous work has found that LCA is mostly corrected for in the teleost retina of both marine and freshwater species by graded, multifocal lenses that focus different wavelengths at the same depth as their maximally sensitive cone photoreceptors [e.g., for evidence in African cichlids see Kröger, R. H. H. et al. (1999). J Comp Physiol. A, 184, 361-369; Malkki, P. E. & Kröger, R. H. H. (2005). J Opt. A, 7, 691-700; and for various reef fishes see Karpestam, B. et al. (2007). J Exp Biol., 210, 16: 2923-2931]. In essence, LCA is corrected in the eyes of many teleosts by accurately tuning longitudinal spherical aberration through having a graded density lens. We draw particular attention to the latter reference which comparatively examined the optical properties of reef fish lenses, including diurnal, planktivorous damselfishes (from the same family as anemonefishes, Pomacentridae). They found that not only were the lenses of these species highly UV-transmissive (as we show in anemonefish), but all were multifocal and capable of focusing both visible (non-UV) and UV wavelengths. Considering the coastal cephalopod species examined thus far, all of them contain only one type of visual pigment which is packed in their long photoreceptor (150-450µm long outer segment) across an entire retina (Chung and Marshall 2016, Proceeding B). 
Theoretically, given these long photoreceptors, the LCA and the resulting differences in focal length across different patches of photoreceptors, or across different depths of the outer segment, might provide cues for colour discrimination, even though no behavioural evidence yet exists to support this hypothesis. Unlike the cephalopod case, the four spectrally distinct cone types arranged in a mosaic pattern in the anemonefish retina, along with their very short outer segments (5-10µm), likely make LCA less effective in this retinal design.
  
  We have added a short paragraph (Lines 400-412) discussing the possibility of an optical mechanism contributing to heightened UV saliency with a particular focus on LCA and our thoughts on why we consider it an unlikely mechanism in anemonefish.
  
  2) The authors provide a quantitative description of anemonefish visual performance within the context of a well-developed receptor-based framework. However, it was less clear to me what inferences (if any) can be drawn from these data about the post-receptoral mechanisms that support tetrachromatic color vision in these organisms. Would specific cone-opponent processes account for instances where behavioral data diverged from predictions generated with the "receptor noise limited" model described in the text? The general reader may benefit from more discussion centered on what is known (or unknown) about the organization of cone-opponent processing in anemonefish and related species.
  
  In short, we do not know the specific opponent interactions of anemonefish cones. The RNL model assumes all possible opponent interactions in its calculations. From our results, very little can be said about the post-receptor mechanisms involved in their putative tetrachromatic vision. We would like to avoid overreaching beyond what our data can show. A future directions section has now been added to the discussion (lines 467-497), which briefly mentions the known UV opponency in larval zebrafish and that future investigation in anemonefish should attempt to disentangle the specific opponent (chromatic) and non-opponent (achromatic) circuits in the anemonefish retina.
  
  Reviewer #3 (Public Review):
  
  The comments below focus mainly on ways that the data and analysis as currently present do not to this reviewer compel the conclusions the authors wish to draw. It is possible that further analysis and/or clarification in the presentation would more persuasively bolster the authors' position. It also seems possible that a presentation with more limited conclusions but clarity on exactly what has been demonstrated and where additional future work is needed would make a strong contribution to the literature.
  
  Fig 3A. It might be worth emphasizing a bit more explicitly that the x-axis (delta S) is the result of a model fit to the data being shown, since this then means that if RNL model fit the data perfectly, all of the thresholds would fall at deltaS = 1. They don't, so I would like to see some evaluation from the authors' experience with this model as to whether they think the deviations (looks like the delta S range is ~0.4 to ~1.6 in Figure 4B) represent important deviations of the data from the model, the non-significant ANOVA notwithstanding. For example, Figure 4B suggests that the sign of the fit deviations is driven by the sign of the UV contrast and that this is systematic, something that would not be picked up by the ANOVA. Quite a bit is made of the deviations below, but that the model doesn't fully account for the data should be brought out here I think. As the authors note elsewhere, deviations of the data from the RNL model indicate that factors other than receptor noise are at play, and reminding the reader of this here at the first point it becomes clear would be helpful.
  
  We have now stated more explicitly in the figure caption for Figure 3A, that the delta S values presented were calculated by fitting fish behavioral data to the RNL model. To test the overall effect that the sign of the UV contrast had on the discrimination threshold, we have now included ‘contrast’ (positive or negative) as another fixed effect in the linear mixed effects model. We have now included details of this test in the results which shows the systematic effect (lines 338-340). Additionally, as suggested we now briefly introduce in the results the idea that factors other than receptor noise are causing the observed deviations in data from the RNL model.
  
  Line 217 ff (Figure 4, Supplemental Figure 4). If I'm understanding what the ANOVA is telling us, it is that the predictions deviate significantly from the data across color directions and fish (I think these are the two factors based on line 649), relative to the inter-fish variability, for the trichromatic models but not the tetrachromatic model. If that's not correct, please interpret this comment to mean that more explanation of the logic of the test would be helpful.
  
  The interpretation of the ANOVA by the Reviewer is mostly correct. We had the variables color set and Fish ID, with threshold delta S as the dependent variable. This showed that deviations from the predicted threshold were significant relative to the inter-fish variability for the trichromatic models. Missing details describing the ANOVA have now been added to the methods (lines 789-798).
  
  Assuming that the above is right about the nature of the test, then I don't think the fact that the tetrachromatic model has an additional parameter (noise level for the added receptor type) is being taken into account in the model comparison. That is, the trichromatic models are all subsets of the tetrachromatic model, and must necessarily fit the data worse. What we want to know is whether the tetrachromatic model is fitting better because its extra parameter is allowing it to account for measurement noise (overfitting), or whether it is really doing a better job accounting for systematic features of the data. This comparison requires some method of taking the different number of parameters into account, and I don't think the ANOVA is doing that work. If the models being compared were nested linear models, then an F-ratio test could be deployed, but even this doesn't seem like what is being done. And the RNL model is not linear in its parameters, so I don't think that would be the right model comparison test in any case.
  
  Typical model comparison approaches would include a likelihood ratio test, AIC/BIC sorts of comparisons, or a cross-validation approach.
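
  For readers unfamiliar with these criteria, the following minimal Python sketch shows how a parameter-count penalty enters the comparison. It is illustrative only and is not part of the original review or the authors' analysis: it assumes independent Gaussian residuals with the variance set to its maximum-likelihood estimate, and all function names are hypothetical.

```python
import math

def gaussian_log_likelihood(residuals):
    """Maximised Gaussian log-likelihood given model residuals.

    Assumes independent errors; the variance is the ML estimate RSS / n.
    """
    n = len(residuals)
    sigma2 = sum(r * r for r in residuals) / n
    return -0.5 * n * (math.log(2.0 * math.pi * sigma2) + 1.0)

def aic(log_lik, n_params):
    """Akaike information criterion: lower is better."""
    return 2.0 * n_params - 2.0 * log_lik

def bic(log_lik, n_params, n_obs):
    """Bayesian information criterion: penalises parameters more as n grows."""
    return n_params * math.log(n_obs) - 2.0 * log_lik

def likelihood_ratio_statistic(log_lik_reduced, log_lik_full):
    """Test statistic for nested models (reduced model nested in full model).

    Under the usual regularity conditions this is compared against a
    chi-squared distribution with degrees of freedom equal to the
    difference in parameter counts.
    """
    return 2.0 * (log_lik_full - log_lik_reduced)
```

  In this framing, a tetrachromatic model with one extra noise parameter would be preferred only if its gain in log-likelihood outweighs the added penalty term. The parameter count passed to `aic` or `bic` should include the residual variance if it was estimated from the data.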
  
  If the authors feel their current method does persuasively handle the model comparison, how it does so needs to be brought out more carefully in the manuscript, since one of the central conclusions of the work hinges at least in part on the appropriateness of such a statistical comparison.
  
  Our visual model comparisons were aimed at assessing whether a trichromatic or tetrachromatic model best fit the colour discrimination data. The trichromatic and tetrachromatic models assume two and three opponency pathways, respectively. If the fish were trichromatic rather than tetrachromatic, then we would expect the RNL model to fit the data better with two opponency mechanisms (rather than three). We made this assessment because of the possibility that not all the cones contribute to colour vision and that some could be used exclusively for achromatic tasks (e.g., luminance vision or motion detection). However, given our finding that the data best fit the tetrachromatic model (i.e., the behavioural discrimination thresholds more closely matched the theoretical prediction of 1∆S), it is likely that anemonefish used all four cones for colour vision.
  
  We have also now repeated our analysis using unweighted delta S values which are calculated using general n-dimensional models of colour vision (using the PAVO2 package). These models essentially follow the same initial steps followed by the RNL model (and many others) but omit the receptor noise correction stage. After comparing (using ANOVA, see lines 303-311) the predicted thresholds with the data in this non-RNL space, it was found that again the tetrachromatic model predictions did not deviate significantly from the data relative to individual fish performance; however, we also found that the trichromatic model without M2 cone input no longer differed from the predicted values. In this case, it seems that the extra noise parameter did contribute to the difference in fit. Whether this is a biologically meaningful comparison (as all photoreceptors contain noise) is an open question. We have added a short statement explicitly framing our interpretation of anemonefish having a 3-D colour space as being in accordance with the closeness of RNL model predictions (lines 370-371, 506-508).
  
  Also on the general point on conclusions drawn from the model fits, it seems important to note that rejecting a trichromatic version of the RNL model is not the same as rejecting all trichromatic models. For example, a trichromatic model that postulates limiting noise added after a set of opponent transformations will make predictions that are not nested within those of RNL trichromatic models. This point seems particularly important given the systematic failures of even the tetrachromatic version of the RNL model.
  
  This is a good point. We have limited our conclusions to specifically address trichromatic models generated within the framework of the RNL model by adding in the conclusion section that fish psychophysical thresholds were best explained by the RNL model when all four cone types contributed to colour vision (see lines 370-371, 506-508). In this same sentence, we have also added in parentheses that “suggesting (but not proving) tetrachromacy” (line 508). We have also edited the abstract to state that our results were “…best described by a tetrachromatic model using all four cone types…”, rather than stating we have shown tetrachromacy (lines 36-37).
  
  More generally, attempts to decide whether some human observers exhibit tetrachromacy have taught us how hard this is to do. Two issues, beyond the above, are the following. 1) If the properties of a trichromatic visual system vary across the retina, then by imaging stimuli on different parts of the visual field an observer can in principle make tetrachromatic discriminations even though the visual system is locally trichromatic at each retinal location. 2) When trying to show that there is no direction in a tetrachromatic receptor space to which the observer is blind, a lot of color directions need to be sampled. Here, 9 directions are studied. Is that enough? How would we know? The following paper may be of interest in this regard: Horiguchi, Hiroshi, Jonathan Winawer, Robert F. Dougherty, and Brian A. Wandell. "Human trichromacy revisited." Proceedings of the National Academy of Sciences 110, no. 3 (2013): E260-E269. Although I'm not suggesting that the authors conduct additional experiments to try to address these points, I do think they need to be discussed.
  
  We agree with the Reviewer that colour discriminability achieved by tetrachromatic vision could in theory be achieved by the combined effect of localised, distinct forms of trichromacy. Evidence in other fishes suggests that such multiple forms of trichromacy across the retina likely exist in many species. However, the behavioural effects of this retinal setup remain to be studied, likely because doing so is extremely difficult. We have added a new section titled “future directions” (Lines 474-489), in which we discuss the possibility that distinct forms of trichromacy in the anemonefish retina could in theory achieve colour discrimination on par with tetrachromatic vision. We also give suggestions on how this could be investigated.
  
  Although we tried to include as many colour directions as practically possible in our experiment, we have certainly not provided an exhaustive range that completely encompasses anemonefish colour space. Whether 9 colour directions are adequate to assess the dimensionality of their color vision is difficult to say. As addressed in the previous comment, we now acknowledge this limitation by refining our conclusion, saying that our results do not prove tetrachromacy.
  
  Line 277 ff. After reading through the paper several times, I remain unsure about what the authors regard as their compelling evidence that the UV cone has a higher sensitivity or makes an omnibus higher contribution to sensitivity than other cones (as stated in various forms in the title, Lines 37-41, 56-57, 125, 313, 352 and perhaps elsewhere).
  
  At first, I thought the key point was that the receptor noise inferred via the RNL model was slightly lower (0.11) for the UV cone than for the double cones (0.14). And this is the argument made explicitly at line 326 of the discussion. But if this is the argument, what needs to be shown is that the data reject a tetrachromatic version of the RNL model where the noise value of all the cones is locked to be the same (or something similar), with the analysis taking into account the fewer parametric degrees of freedom where the noise parameters are so constrained. That is, a careful model comparison analysis would be needed. Such an analysis is not presented that I see, and I need more convincing that the difference between 0.11 and 0.14 is a real effect driven by the data. Also, I am not sanguine that the parameters of a model that in some systematic ways fails to fit the data should be taken as characterizing properties of the receptors themselves (as sometimes seems to be stated as the conclusion we should draw).
  
  We have performed various modelling scenarios where receptor noise was adjusted for each channel; however, the UV channel was consistently found to be more sensitive than the other channels. In (the original) Supplementary Figure 6 (now Figure 4 – figure supplements 1 and 2), we show predicted dS values calculated using receptor noise levels in the exact manner that the Reviewer suggests by ranging from 0.05 to 0.15, and most importantly, included scenarios where receptor noise was held equal across cone types and others where it was varied between single cones and double cones. None of the models adjusted the data so that sensitivity was equal across all four channels, which means that by an unknown mechanism, the UV channel is more sensitive, but this is unrelated to noise levels. Our best-fit receptor noise values of 0.11 (for single cones) and 0.14 (for double cones) are estimate values and should be treated as such till actual receptor noise measurements are made.
  
  Then, I thought maybe the argument is not that the noise levels differ, but rather that the failures of the model are in the direction of thresholds being under predicted for discriminations that involve UV cone signals. That's what seems to be being argued here at lines 277 ff, and then again at lines 328 ff of the discussion. But then the argument as I read it in more detail in both places switches from being about the UV cones per se to being about positive versus negative UV contrast. That's fine, but it's distinct from an argument that favors omnibus enhanced UV sensitivity, since both the UV increments and decrements are conveyed by the UV cone; it's an argument for differential sensitivity for increments versus decrements in UV mediated discriminations. The authors get to this on line 334 of the discussion, but if the point is an increment/decrement asymmetry the title and many of the terser earlier assertions should be reworked to be consistent with what is shown.
  
  To clarify our argument, we found that the colour discrimination thresholds were systematically lower than predicted by the RNL model for colours which elicited higher UV cone stimulation relative to other cone types. We refer to these colours as UV positive, based on the sign of their contrast against grey distractors produced by a higher UV/V LED channel (i.e., in a positive direction). In contrast, colours with UV negative chromatic contrast had lower UV cone stimulation relative to the other cone types. Therefore, our interpretation of the importance of UV cone signals for colour discrimination is congruent with the results. In the discussion, we suggest a possibility that activation of the UV receptor suppresses noise downstream in the visual pathway or enhances the saliency of colours (see lines 397-398). This activation of the UV receptor would, of course, be at its highest for colours with positive UV chromatic contrast.
  
  Note that we have added to the discussion the possibility that colour preferences or a difference in attentiveness might have contributed to differences in discrimination thresholds (see discussion lines 412-413, 427-428, 433-435, 456-466, and 469-473). However, we consider it a less likely explanation due to a couple of reasons, including 1) a lack of difference in responsiveness across colour sets in their timing to peck the target, and 2) any non-learnt bias would have likely been overridden or at least weakened by training prior to the experiment where colours were rewarded equally (see lines 462-466).
  
  We have edited the results (lines 334-352) to make our point clearer and by changing the subtitle to be more explicit: “Lower discrimination thresholds induced by positive UV contrast”. The subsection begins by explaining the different types of UV chromatic contrast by elevation angle and, finally, how this division among colour sets was a major determinant of colour discrimination thresholds.
  
  Perhaps the argument with respect to model deviations and UV contrast independent of sign could be elaborated to show more systematically that the way the covariation with the contrasts of the other cone stimulations in the stimulus set goes, the data do favor deviations from the RNL in the direction of enhanced sensitivity to UV cone signals, but if this is the intent I think the authors need to think more about how to present the data in a manner that makes it more compelling than currently, and walk the reader carefully through the argument.
  
  We have added to the results the linear mixed-effects model output with ‘contrast’ (positive/negative) added as a fixed effect. This analysis shows that the sign of UV contrast was a strong predictor of threshold (see our responses to previous comments and lines 399-401, 790-799).
  
  On this point, if the authors decide to stick with the enhanced UV sensitivity argument in the revision, a bit more care about what is meant by "the UV cone has a comparatively high sensitivity (line 313 and throughout)" needs more unpacking. If it is that these cones have lower inferred noise (in the context of a model that doesn't account for at least some aspects of the data), is this because of properties of the UV cones, or the way that post-receptoral processing handles the signals from these cones mimicking a cone effect in the model. And if it is thought that it is because of properties of the cones, some discussion of what those properties might be would be helpful. As I understand the RNL model, relative numbers of cones of each type are taken into account, so it isn't that. But could it be something as simple as higher photopigment density or larger entrance aperture (thus more quantum catches and higher SNR)?
  
  It is unknown what aspect of the cone morphology or physiology sets the activation or inactivation threshold. Electrophysiological data collected from the UV cones of other fish species e.g., in goldfish and zebrafish [see Hawryshyn & Beauchamp (1985). 25, Vis Res.; and Yoshimatsu et al. (2020). 107, Neuron.] show that they have exceptionally high sensitivity. What has not been shown is that having a UV cone can improve colour discrimination.
  
  Previous quantitative cone opsin gene expression analysis showed that the single cone opsins (SWS1 and SWS2B) are expressed at lower levels than all double cone opsin genes. This difference in expression combined with the smaller size of single cone outer segments than the double cones make it unlikely that a larger photoreceptor size, higher volume or packing density of visual pigment is responsible. Contrary to our findings, these aspects of the different cone types (if they had an effect) would instead predict that double cones have a higher SNR, and non-UV colours would be more discriminable. We have now added these details to the discussion (see lines 391-397).
  
  Line 288 ff. The fact that the slopes of the psychometric functions differed across color directions is, I think, a failure of the RNL model to describe this aspect of the data, and tells us that a simple summary of what happens for thresholds at delta S = 1 does not generalize across color directions for other performance levels. Since one of the directions where the slope is shallower is the UV direction, this fact would seem to place serious limits on the claim that discrimination in the UV direction is enhanced relative to other directions, but it goes by here without comment along those lines. Some comment here, both about implications for fit of RNL model and about implications for generalizations about efficacy of UV receptor mediated discrimination and UV increment/decrement asymmetries, seems important.
  
  The variation in the psychometric functions is difficult to interpret and cannot be explained by the RNL model. What the RNL model predicts is delta S based on low level factors (namely receptor noise). In the discussion, we completely agree with the notion that the asymmetry in thresholds from predicted values, and the variation in psychometric slopes cannot be explained by the RNL model, e.g., this is heavily implied by “colour discrimination thresholds cannot be directly attributed to noise in the early stages of the visual pathway…” (lines 388-390). To clarify the inability of the RNL model to account for this aspect of the data, we have included a statement (see line 390).
  
  It is a good point that this could be an indication of heterogeneity in colour space. Heterogeneity in discrimination thresholds across animal colour space (both surrounding the threshold area and for more saturated regions) has been explored in detail using trichromatic triggerfish by Green N. F. et al. (2022). JEB, 7(225):jeb243533. We have added this idea to the discussion (see lines 490-498). For UV, it seems that two of the five fish (#34 and 20) had noticeably shallower curves than the others tested for UV (fish #19, 33, 36). Both also varied more in their ability to distinguish targets, as shown by their wider confidence intervals. One of these two fish (#34) was retested for UV at the end of the experiment, and in the secondary assessment had a steeper psychometric curve more in line with the other fish in the experiment (see Figure 3 – figure supplement 1 and added lines 247-250). Based on this discrepancy in performance between assessments, it is also possible that individual learning effects had a role in impacting the shape of the psychometric curve. Note, this had minimal effect on colour discrimination thresholds and any differences were in the direction of change observed across colour sets in the experiment (i.e., lower dS for UV positive directions).
  
  Line 357 ff. Up until this point, all of the discussion of differences in threshold across stimulus sets has been in terms of sensitivity. Here the authors (correctly) raise the possibility that a difference in "preference" across stimulus sets could drive the difference in thresholds as measured. Although the discussion is interesting and germane, it does to some extent further undercut the security of conclusions about differential sensitivity across color directions relative to the RNL model predictions, and that should be brought out for the reader here. The authors might also discuss how a future experiment might differentiate between a preference explanation and a sensitivity explanation of threshold differences.
  
  We have now added a paragraph (see lines 469-473) discussing that future work should test for color preferences and suggest how this could be done using a similar foraging task. We also include our thoughts immediately prior on why it is unlikely that a colour preference was a major contribution towards the results. In short, we consider it unlikely as fish showed no evidence of reduced latency for pecking at targets across the colour sets and because the training regime prior to the experiment equally rewarded fish for all colours and would likely have overridden a strong preference (at least in this specific foraging context).
  
  RNL model. The paper cites a lot of earlier work that used the RNL model, but I think many readers will not be familiar with it. A bit more descriptive prose would be helpful, and particularly noting that in the full dimensional receptor space, if the limiting noise at the photoreceptors is Gaussian, then the isothreshold contour will be a hyper-ellipsoid with its axes aligned with the receptor directions.
  
  We have added an explanation of the RNL model (see lines 141-151), particularly of its assumptions that it only receives chromatic input and that discrimination is limited by noise arising in the photoreceptors and not by any specific opponent mechanisms. We also mention the expected hyper-ellipsoid shape of isothreshold contours if receptor noise is Gaussian. Note that, while we appreciate the importance of the reader understanding the basic functionality of the model, we wanted to avoid overloading the introduction with details on the RNL model, which is not the focus of the paper. The RNL model has been well established in the field of visual ecology and animal vision research for well over a decade and has been thoroughly dissected by previous methodological reviews. We refer to one of these more recent reviews by Olsson et al. (2018) Behav Ecol. 29(2):273-282, and direct the reader to the methods section for further details on the RNL model.
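  
  As a rough illustration of the receptor-noise-limited distance discussed above, the sketch below computes a chromatic distance for any number of receptor channels. It assumes independent Gaussian noise with standard deviation e_i in each channel and discounts any signal change common to all receptors (the achromatic component). The function name, the noise values and the example stimulus are illustrative assumptions and are not taken from the manuscript.

```python
import math

def rnl_distance(delta_f, noise_sd):
    """Receptor-noise-limited distance between two stimuli.

    delta_f  : per-receptor differences in (log) receptor signal
    noise_sd : per-receptor noise standard deviations e_i
    A shift common to all receptors (achromatic) is discounted by
    minimising the noise-weighted distance over that shift.
    """
    if len(delta_f) != len(noise_sd):
        raise ValueError("one noise value per receptor channel is required")
    weights = [1.0 / (e * e) for e in noise_sd]
    # Noise-weighted mean shift: the achromatic component to discount.
    common = sum(w * d for w, d in zip(weights, delta_f)) / sum(weights)
    squared = sum(w * (d - common) ** 2 for w, d in zip(weights, delta_f))
    return math.sqrt(squared)

# Hypothetical tetrachromat: four cone classes with assumed noise levels.
example = rnl_distance([0.02, -0.01, 0.03, 0.00], [0.10, 0.07, 0.05, 0.05])
```

  With two receptors this expression reduces to the square root of (Δf1 − Δf2)² / (e1² + e2²). Before the common shift is removed, the noise-weighted sum is an ellipsoid in receptor space whose axes are aligned with the receptor directions, which is the geometry the reviewer describes.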
  
  Use of cone isolating stimuli? For showing that all four cone classes contribute to what the authors call color discrimination, a more direct approach would seem to be to use stimuli that target stimulation of only one class of cone at a time. This might require a modified design in which the distractors and target were shown against a uniform background and approximately matched in their estimated effect on a putative achromatic mechanism. Did the authors consider this approach, and more generally could they discuss what they see as its advantages and disadvantages for future work.
  
  The Reviewer is correct that targeted stimulation of isolated cones would be the optimal approach to demonstrating tetrachromatic colour vision. However, the extreme spectral overlap in the absorption curves of anemonefish cones, particularly in the mid-wavelength region, makes this problematic with the current LED display. We added to the discussion ways that this could be studied in the future (see lines 474-489). This might be possible (but still challenging) using a monochromator, but such technology severely limits the diversity of stimuli which can be created and usually restricts experiments to a simple paired choice design (or grey card experiment). The traditional paired choice experiment requires animals to be trained to distinguish a specific colour, while the Ishihara-like task trains animals to distinguish targets using an odd-one-out approach. This latter approach is highly efficient, as it does not require retraining when testing a new colour (i.e., fish learnt the task, not a specific colour). Here, we wanted to assess colour discrimination in multiple directions to compare performance, and the flexible LED display combined with a generalisable task was important.
  
  The above assumes that anemonefish do not use multiple trichromatic systems. If they do, standard experimental stimuli (e.g., a monochromator, an LED display) would be unsuitable, as they illuminate the whole retina. To definitively test the range of opponent interactions, it would be necessary to make electrophysiological measurements targeting the transmitting neurons using a retinal multielectrode array (MEA) approach or by in-vivo calcium imaging (lines 484-486).
  
  We understand that our results are not a direct test of the dimensionality of anemonefish colour vision and should not be interpreted as such, as we do not have direct evidence of tetrachromacy. To recognize this limitation of our data, we have drawn back some of our conclusive statements that claimed to have demonstrated tetrachromacy.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.12.01.518784v1
www.biorxiv.org www.biorxiv.org

New submission 17/06/2023, 17:49:48

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Precise regulation of gamete fusion ensures that offspring will have the same ploidy as the parents. However, breaking this regulation can be useful for plant breeding. Haploid induction followed by chemical-induced genome doubling can be used to fix desirable genotypes, while triparental hybrids where two sperm cells with two different genotypes fertilize an egg cell can be advantageous for bypassing hybridization barriers to create interspecies hybrids with increased fitness. This manuscript follows up on a previous study from the same research group that used a clever high throughput polyspermy detection assay (HIPOD) to show that wild-type Arabidopsis naturally forms triparental hybrids at very low frequencies (less than 0.05% of progeny) and that these triparental hybrids can bypass dosage barriers in the endosperm (Nakel, et al., 2017). Mao and co-authors hypothesized that mutants that conferred polytubey, the attraction of multiple pollen tubes by mutant female gametophytes, would also increase the rate of triparental hybrids. They used a double mutant in the endopeptidase genes ECS1 and ECS2 which had previously been reported to induce supernumerary pollen tube attraction to test this hypothesis with their two-component HIPOD system in which one pollen donor constitutively expresses the mGAL4-VP16 transcription factor while the second pollen donor carries an herbicide resistance gene regulated by the GAL4-responsive UAS promoter. Triparental hybrids are detected as herbicide-resistant progeny from wild-type Arabidopsis flowers that have been pollinated by the two paternal genotypes. The authors convincingly show that the ecs1 ecs2-1 double mutant more than doubled the frequency of triparental, triploid hybrids in HIPOD crosses. 
They next tested the hypothesis that this increase in triparental hybrids was due to a gametophytic effect by using an ecs1-/- ecs2-1/ECS2 maternal parent in the HIPOD assay and testing whether the ecs2-1 mutant allele was preferentially inherited in triparental hybrids. The mutant allele was inherited at a much higher rate than expected, confirming their hypothesis.
  
  The triparental hybrid results with the ecs1 ecs2 mutant were not that surprising since the presence of extra sperm cells gives more opportunities for triparental hybrids to form, especially if gamete fusion is misregulated. However, an unexpected result came when the authors used aniline blue staining to analyze the ecs1 ecs2 polytubey phenotype. They confirmed that the double mutant had increased levels of polytubey compared to wild-type ovules, but they also noticed that 13% of seeds were not developing normally. This phenotype was confirmed with a second ecs2 allele and was complemented with both ECS1 and ECS2 transgenes under their native promoters. Microscopic analysis revealed normal gametophyte morphology before fertilization, but 8% of pollinated ovules failed to develop an embryo and 7% failed to develop endosperm, suggesting single fertilization events. In a logical set of experiments, they followed up on this result by crossing ecs1 ecs2 with pollen carrying a fluorescent reporter that would be expressed in developing embryos and endosperm. In this experiment, they were again surprised. Some of the wild-type-looking seeds lacked a paternal contribution (i.e. no fluorescent signal from the paternal reporter construct) in the embryo. This prompted them to look more closely at the progeny, upon which they detected small plants that were haploid. They confirmed the haploid nature by chromosome spreads. Finally, they used interaccession crosses between ecs1 ecs2 (Col-0) and Landsberg to verify that haploid progeny only carried maternal alleles of markers on all five chromosomes, indicating that the ecs1 ecs2 genotype can induce maternal haploids.
  
  This interesting study highlights the importance of following up on unexpected results. The conclusions are well-supported by the data and quite exciting. Paternal haploid inducers have been discovered in several species, but this is one of only two examples of maternal haploid induction. While the percentage of maternal haploids is very low, this phenomenon could be useful for plant breeding.
  
  Weaknesses
  
  The data in the manuscript is intriguing, but the question of how the same mutant combination promotes the formation of both triploid and haploid progeny remains unanswered and is not thoroughly discussed, nor is any model suggested for how the ECS1/2 peptidases could play a role in regulating gamete fusion and/or repressing parthenogenesis. A second unanswered question is whether the maternal haploids are a result of failed plasmogamy or karyogamy between the egg and sperm leading to parthenogenesis or a result of paternal genome elimination after plasmogamy. In figure 3B, the authors attempted to test whether plasmogamy occurs between the male and female gametes in ecs1 ecs2 ovules by crosses with pollen that expresses a mitochondrial marker under control of the pRPS5a promoter which is active in sperm cells as well as embryos and endosperm of fertilized ovules. This experiment allowed them to detect sperm cells that had not fused with the egg and central cell at 2 days after pollination. They also counted the percentage of seeds that expressed the mitochondrial marker in both embryo and endosperm at 2 DAP and found that ecs1 ecs2 mutants had a 20% reduction of visible mitochondria in embryo sacs compared to wildtype. They conclude that the result indicates a potential plasmogamy defect. However, the dependability of this marker is questionable since only ~55% of wild-type seeds had detectable signal in the embryo and endosperm. The authors imply that this experiment could be used to test plasmogamy, but it is not clear how any conclusions related to the abnormal seed phenotype could be drawn from examining the rate of signal in both the embryo and endosperm. Since the mitochondrial marker was not expressed from a sperm-specific promoter, the fluorescent signal at 2DAP is likely due to new gene expression from pRPS5a in the fertilized embryo and endosperm, not an indication of the presence of sperm-derived mitochondria. 
Perhaps an earlier timepoint could be used as well as a sperm-specific promoter instead of pRPS5a to answer the question of whether plasmogamy is happening in the ecs1 ecs2 ovules.
  
  Thanks for the suggestion. We now provide two additional data sets as evidence that ecs1 ecs2 mutant plants indeed exhibit single fertilization that leads to fertilization recovery.
  
  We determined fertilization failure by checking the decondensation of HTR10-RFP-labelled sperm nuclei at 8-10 HAP (Figure 3B) and the frequency of heterofertilization through a dual pollination experiment (Figure 3C-E) (see above).
  
  Reviewer #2 (Public Review):
  
  The manuscript reports the triploid and haploid productions using an ecs1ecs2 mutant as the maternal donor, in addition to the evaluation of the sexual process observed in the mutant. The indicated data show exquisite quality. To improve the content, I recommend carefully reconsidering the descriptions because some of the insights would cause a stir in the controversy regarding ECS1&2 functions in plant reproduction.
  
  Strengths
  
  Triploid production by a combination of ecs1ecs2 mutant and HIPOD system has potential as a future plant breeding tool. Moreover, it's intriguing that both triploid and haploid productions were achieved using the same mutant as a maternal donor. I think authors can claim the value of their results more by adding descriptions about the usefulness of the aneuploid plants in plant breeding history.
  
  The evidence of the persistent synergid nucleus (Figure 3A) is a critical insight reported by this study. As Maruyama et al. (2013) reported by live cell imaging, synergid-endosperm fusion had occurred at the two endosperm nuclei stage. It would be valuable to support this observation by citing Maruyama's previous observation.
  
  Weakness
  
  As the authors suggested, the higher triploid frequency observed in ecs1ecs2 than in WT was likely caused by the increased polyspermy. However, it could also be that a reduction in normal seed number in ecs1ecs2 (whether due to failure of fertilization or embryo development arrest) accounts for the increased frequency of triploids compared to WT.
  
  The results in Figure 3C-E suggested single fertilization of both egg and central cells at similar frequencies. This is an exciting result, but it is still possible that the fertilized egg or central cell degenerated after fertilization, resulting in the disappearance of paternally inherited fluorescence. Evaluation of fertilization patterns at 7-10 HAP in the ecs1ecs2 mutant may provide more reliable insight, although unfused sperm cells were evaluated at 1 DAP (Figure 3-figure supplement 1B). The fertilization states can be distinguished depending on the HTR10-RFP sperm nuclei morphology and their positions, as reported by Takahashi et al. (2018).
  
  Thank you for your suggestion. We added the requested experiment (see Figure 3B in the revised manuscript). In addition, we conducted a dual pollination experiment, which provides evidence for the activation of the fertilization recovery machinery (Figure 3C-E) (see above).
  
  Several recent studies have reported exciting insights on ECS1&2 functions; however, various results from different laboratories have raised controversy. Nevertheless, the commonly found feature is the repression of polytubey. For readers, it would be helpful to organize the explanation of which insights are concordant and which differ.
  
  Thank you for your suggestion. We now indicate, using terms such as "in line with" or "in contrast to", where our data confirm or contradict previous data.
  
  In addition, a drawing that explains the time course in the process from pollination to seed development (up to 6DAP) based on WT would help to understand which point is evaluated in each data.
  
  Thank you for your suggestion. We added a model figure (Figure 4E) at the end of the manuscript that brings the concepts together and facilitates understanding.
  
  Reviewer #3 (Public Review):
  
  In this manuscript, Mao et al. reported that the two proteases ECS1 and ECS2 participate in both polyspermy block and gamete fusion in Arabidopsis thaliana. The authors could observe polytubey phenotype which has been reported previously and obtain both triparental plants and haploids in ecs1 ecs2 mutants. Therefore, they proposed that the triparental plants resulted from the polytubey block defect, whereas the haploids were caused by the gamete fusion defect. Together with two other previous reports, I think it is very interesting to see these two proteases participating in so many different but connected processes. Although they did not provide the molecular mechanism of how ECS participated in polyspermy block and gamete fusion, their findings provide more options for and thus promote plant breeding. The work may have a wide application in the future and will be of broad interest to cell biologists working on gamete fusion and plant breeders.
  
  We thank the reviewer for their positive comments.
  
  Although most of the conclusions in this paper are well supported by the data, it could be improved with a minor revision including providing clearer data analysis and descriptions, images with higher resolution, and more discussions.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.01.20.476184v1
www.biorxiv.org www.biorxiv.org

New submission 17/06/2023, 17:20:32

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  In the discussion, the authors suggest that the binding of CHAPS could be an inspiration to develop compounds, targeting, for instance, mammalian receptors, that would bind to both the orthosteric site and a potential groove underneath loop C (where the sterol moiety of CHAPS binds in Alpo4). A figure (SI4) shows a few homologues in surface representation, giving an idea of whether this groove is generally present in the family.
  
  Seeing this figure, I wondered if it would be relevant to compare several conformations of one or a few chosen homologues. Given that gating always impacts the quaternary assembly, is this groove more pronounced in say the inhibited state of a given homologue than in its agonist-bound state?
  
  The width of the groove in 7 does change as the channel transition from apo to open state. This is now demonstrated with an additional Figure 3 – figure supplement 1b and the discussion was adjusted accordingly p 18, line 379:
  
  “The sterol group connected by a linker binds in between subunits and induces conformational changes which also change the width of the groove in Alpo4 (Figure 3f, g); therefore, it likely plays an active role in the observed quaternary twist. The changes in the groove shape are not specific to Alpo4 but are also observed, for example, in the nicotinic α7 receptor (Figure 3 – supplement 1b), suggesting that the groove can be targeted for allosteric modulation of the channel.”
  
  A related thought was that some of the protein binders affecting pLGIC function (toxins, VHH) contact two subunits and wrap around/below loop C. Do these have binding sites that overlap with the groove?
  
  We inspected the structures of pLGIC homologs with bound α-bungarotoxin (6UWZ, 4HQP, 7Z14, and 7KOO) and two with bound VHHs (6SSI and 6HJY). The toxins were bound in similar conformations, but the VHHs were not. Examples of the complexes are now shown as Supplementary Figure 13a (see above). In the case of ELIC, the nanobody Nb72 was bound on top of the sterol-binding cavity, but it did not interact with the interior of the cavity. This is now explained on p 17 from line 374:
  
  “When binding sites of larger known binders, including VHH47,48 and α-bungarotoxin10,49, were examined (Figure 3 – supplement 1a), a nanobody bound to ELIC in the site covering the sterol-binding groove was identified; however, its interactions with ELIC did not overlap significantly with the interior of the sterol-binding groove. This suggests that the latter is a novel target location for binders.”
  
  Very interestingly, the binding of CHAPS stabilizes a conformation that differs from the apo one. It includes a twist of the ECDs but does not lead to a significant opening of the M2 bundle. The authors note that the direction of the twist is opposite to that often associated with the binding of ligands in homologues. This reversal is quite a feature, which deserves to be shown in a supplementary movie (e.g., an overlay of the Alpo apo>CHAPS transition with the nico>apo transition of α7).
  
  We have re-examined the rotation and compared it to the conformational changes in α7 nACh and 5-HT3 receptors. Upon closer examination, it became clear that relative rotation of the ECD and the TMD provides a very simplistic view of the quaternary conformational changes, which are more complex 3D rearrangements than a simple relative domain rotation. Careful alignment of the structures to the extracellular side of the trans-membrane pore showed that in both channels the resting-to-open state transition is associated with clockwise rotation, but the resting-to-desensitized state transition in 5-HT3 involves a counterclockwise rotation. Thus, 1) the direction of rotation is not a ‘universal’ feature of pLGICs and 2) the clockwise rotation is the direction of channel activation for the α7 nACh and 5-HT3 receptors and shares similarities with rearrangements observed in Alpo4. However, the relative movement of the ECDs differs between Alpo upon CHAPS binding and the α7 nACh and 5-HT3 receptors upon activation. To demonstrate this, we added Video 2, which shows quaternary changes for all 3 channels, and the text has been modified as follows on page 11 line 208:
  
  “Quaternary changes in Alpo4 induced upon CHAPS binding and those associated with the activation of related α7 nACh and 5-HT3 receptors induced rotation of ECD relative to TMD in the same direction, however, the shifts of principal relative to complementary subunits were different (Video 2). In Alpo4, the complementary subunit slides upward whereas in the two other channels it consistently shifts towards the principal subunit and tilts relative to the TMD. The tilt is less pronounced in Alpo4 which is probably why it does not lead to the pore dilation.”
  
  We are grateful to the reviewer for drawing our attention to this point, which permitted us to correct initially inaccurate statements.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.01.06.523009v1
www.biorxiv.org www.biorxiv.org

New submission 17/06/2023, 17:12:30

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  Here, a simple model of cerebellar computation is used to study the dependence of task performance on input type: it is demonstrated that task performance and optimal representations are highly dependent on task and stimulus type. This challenges many standard models which use simple random stimuli and concludes that the granular layer is required to provide a sparse representation. This is a useful contribution to our understanding of cerebellar circuits, though, in common with many models of this type, the neural dynamics and circuit architecture are not very specific to the cerebellum, the model includes the feedforward structure and the high dimension of the granule layer, but little else. This paper has the virtue of including tasks that are more realistic, but by the paper’s own admission, the same model can be applied to the electrosensory lateral line lobe and it could, though it is not mentioned in the paper, be applied to the dentate gyrus and large pyramidal cells of CA3. The discussion does not include specific elements related to, for example, the dynamics of the Purkinje cells or the role of Golgi cells, and, in a way, the demonstration that the model can encompass different tasks and stimuli types is an indication of how abstract the model is. Nonetheless, it is useful and interesting to see a generalization of what has become a standard paradigm for discussing cerebellar function.
  
  We appreciate the Reviewer’s positive comments. Regarding the simplifications of our model, we agree that we have taken a modeling approach that abstracts away certain details to permit comparisons across systems. We now include an in-depth discussion of our simplifying assumptions (Assumptions & Extensions section in the Discussion) and have further noted the possibility that other biophysical mechanisms we have not accounted for may also underlie differences across systems.
  
  Our results predict that qualitative differences in the coding levels of cerebellum-like systems, across brain regions or across species, reflect an optimization to distinct tasks (Figure 7). However, it is also possible that differences in coding level arise from other physiological differences between systems.
  
  Reviewer #3 (Public Review):
  
  1) The paper by Xie et al. is a modelling study of the mossy fiber-to-granule cell-to-Purkinje cell network, reporting that the optimal type of representations in the cerebellar granule cell layer depends on the type of task. The paper stresses that the findings indicate a higher overall bias towards dense representations than stated in the literature, but it appears the authors have missed parts of the literature that already reported on this. While the modelling and analysis appear mathematically solid, the model is lacking many known constraints of the cerebellar circuitry, which makes the applicability of the findings to the biological counterpart somewhat limited.
  
  We thank the Reviewer for suggesting additional references to include in our manuscript, and for encouraging us to extend our model toward greater biological plausibility and more critically discuss simplifying assumptions we have made. We respond to both the comment about previous literature and about applicability to cerebellar circuitry in detail below.
  
  2) I have some concerns with the novelty of the main conclusion, here from the abstract: ‘Here, we generalize theories of cerebellar learning to determine the optimal granule cell representation for tasks beyond random stimulus discrimination, including continuous input-output transformations as required for smooth motor control. We show that for such tasks, the optimal granule cell representation is substantially denser than predicted by classic theories.’ Stated like this, this has in principle already been shown, i.e. for example: Spanne and Jörntell (2013) Processing of multi-dimensional sensorimotor information in the spinal and cerebellar neuronal circuitry: a new hypothesis. PLoS Comput Biol. 9(3):e1002979. Indeed, even the 2 DoF arm movement control that is used in the present paper as an application, was used in this previous paper, with similar conclusions with respect to the advantage of continuous input-output transformations and dense coding. Thus, already from the beginning of this paper, the novelty aspect of this paper is questionable. Even the conclusion in the last paragraph of the Introduction: ‘We show that, when learning input-output mappings for motor control tasks, the optimal granule cell representation is much denser than predicted by previous analyses.’ was in principle already shown by this previous paper.
  
  We thank the Reviewer for drawing our attention to Spanne and Jörntell (2013). Our study shares certain similarities with this work, including the consideration of tasks with smooth input-output mappings, such as learning the dynamics of a two-joint arm. However, our study differs substantially, most notably in that we focus on parametrically varying the degree of sparsity in the granule cell layer to determine the circumstances under which dense versus sparse coding is optimal. To the best of our ability, we can find no result in Spanne and Jörntell (2013) that indicates the performance of a network as a function of average coding level. Instead, Spanne and Jörntell (2013) propose that inhibition from Golgi cells produces heterogeneity in coding level which can improve performance, which is an interesting but complementary finding to ours. We therefore do not believe that the quantitative computations of optimal coding level that we present are redundant with the results of this previous study. We also note that a key contribution of our study is a mathematical analysis of the inductive bias of networks with different coding levels, which supports our conclusions.
  
  We have included a discussion of Spanne and Jörntell (2013) and (2015) in the revised version of our manuscript:
  
  "Other studies have considered tasks with smooth input-output mappings and low-dimensional inputs, finding that heterogeneous Golgi cell inhibition can improve performance by diversifying individual granule cell thresholds (Spanne and Jörntell, 2013). Extending our model to include heterogeneous thresholds is an interesting direction for future work. Another proposal states that dense coding may improve generalization (Spanne and Jörntell, 2015). Our theory reveals that whether or not dense coding is beneficial depends on the task."
  
  3) However, the present paper does add several more specific investigations/characterizations that were not previously explored. Many of the main figures report interesting new model results. However, the model is implemented in a highly generic fashion. Consequently, the model relates better to general neural network theory than to specific interpretations of the function of the cerebellar neuronal circuitry. One good example is the findings reported in Figure 2. These represent an interesting extension to the main conclusion, but they are also partly based on arbitrariness as the type of mossy fiber input described in the random categorization task has not been observed in the mammalian cerebellum under behavior in vivo, whereas in contrast, the type of input for the motor control task does resemble mossy fiber input recorded under behavior (van Kan et al 1993).
  
  We agree that the tasks we consider in Figure 2 are simplified compared to those that we consider elsewhere in the paper. The choice of random mossy fiber input was made to provide a comparison to previous modeling studies that also use random input as a benchmark (Marr 1969, Albus 1971, Brunel 2004, Babadi and Sompolinsky 2014, Billings 2014, Litwin-Kumar et al., 2017). This baseline permits us to specifically evaluate the effects of low-dimensional inputs (Figure 2) and richer input-output mappings (Figure 2, Figure 7). We agree with the Reviewer that the random and uncorrelated mossy fiber activity that has been extensively used in previous studies is almost certainly an unrealistic idealization of in vivo neural activity. This is a motivating factor for our study, which relaxes this assumption and examines the consequences. To provide additional context, we have updated the following paragraph in the main text Results section:
  
  "A typical assumption in computational theories of the cerebellar cortex is that inputs are randomly distributed in a high-dimensional space (Marr, 1969; Albus, 1971; Brunel et al., 2004; Babadi and Sompolinsky, 2014; Billings et al., 2014; Litwin-Kumar et al., 2017). While this may be a reasonable simplification in some cases, many tasks, including cerebellum-dependent tasks, are likely best-described as being encoded by a low-dimensional set of variables. For example, the cerebellum is often hypothesized to learn a forward model for motor control (Wolpert et al., 1998), which uses sensory input and motor efference to predict an effector’s future state. Mossy fiber activity recorded in monkeys correlates with position and velocity during natural movement (van Kan et al., 1993). Sources of motor efference copies include motor cortex, whose population activity lies on a low-dimensional manifold (Wagner et al., 2019; Huang et al., 2013; Churchland et al., 2010; Yu et al., 2009). We begin by modeling the low dimensionality of inputs and later consider more specific tasks."
  
  4) The overall conclusion states: ‘Our results....suggest that optimal cerebellar representations are task-dependent.’ This is not a particularly strong or specific conclusion. One could interpret this statement as simply saying: ‘if I construct an arbitrary neural network, with arbitrary intrinsic properties in neurons and synapses, I can get outputs that depend on the intensity of the input that I provide to that network.’ Further, the last sentence of the Introduction states: ‘More broadly, we show that the sparsity of a neural code has a task-dependent influence on learning...’ This is very general and unspecific, and would likely not come as a surprise to anyone interested in the analysis of neural networks. It doesn’t pinpoint any specific biological problem but just says that if I change the density of the input to a [generic] network, then the learning will be impacted in one way or another.
  
  We agree with the Reviewer that our conclusions are quite general, and we have removed the final sentence as we agree it was unspecific. However, we disagree with the Reviewer’s paraphrasing of our results.
  
  First, we do not select arbitrary intrinsic properties of neurons and synapses. Rather, we construct a simplified model with a key quantity, the neuronal threshold, that we vary parametrically in order to assess the effect of the resulting changes in the representation on performance. Second, we do not vary the intensity/density of inputs provided to the network – this is fixed throughout our study for all key comparisons we perform. Instead, we vary the density (coding level) of the expansion layer representation and quantify its effect on inductive bias and generalization. Finally, our study’s key contribution is an explanation of the heterogeneity in average coding level observed across behaviors and cerebellum-like systems. We go beyond the empirical statement that there is a dependence of performance on the parameter that we vary by developing an analytical theory. Our theory describes the performance of the class of networks that we study and the properties of learning tasks that determine the optimal expansion layer representation.
  
  To clarify our main contributions, we have updated the final paragraph of the Introduction. We have also removed the sentence that the Reviewer objects to, as it was less specific than the other points we make here.
  
  "We propose that these differences can be explained by the capacity of representations with different levels of sparsity to support learning of different tasks. We show that the optimal level of sparsity depends on the structure of the input-output relationship of a task. When learning input-output mappings for motor control tasks, the optimal granule cell representation is much denser than predicted by previous analyses. To explain this result, we develop an analytic theory that predicts the performance of cerebellum-like circuits for arbitrary learning tasks. The theory describes how properties of cerebellar architecture and activity control these networks’ inductive bias: the tendency of a network toward learning particular types of input-output mappings (Sollich, 1998; Jacot et al., 2018; Bordelon et al., 2020; Canatar et al., 2021; Simon et al., 2021). The theory shows that inductive bias, rather than the dimension of the representation alone, is necessary to explain learning performance across tasks. It also suggests that cerebellar regions specialized for different functions may adjust the sparsity of their granule cell representations depending on the task."
  
  5) The interpretation of the distribution of the mossy fiber inputs to the granule cells, which would have a crucial impact on the results of a study like this, is likely incorrect. First, unlike the papers that the authors cite, there are many studies indicating that there is a topographic organization in the mossy fiber termination, such that mossy fibers from the same inputs, representing similar types of information, are regionally co-localized in the granule cell layer. Hence, there is no support for the model assumption that there is a predominantly random termination of mossy fibers of different origins. This risks invalidating the comparisons that the authors are making, i.e. such as in Figure 3. This is a list of example papers, there are more: van Kan, Gibson and Houk (1993) Movement-related inputs to intermediate cerebellum of the monkey. Journal of Neurophysiology. Garwicz et al (1998) Cutaneous receptive fields and topography of mossy fibres and climbing fibres projecting to cat cerebellar C3 zone. The Journal of Physiology. Brown and Bower (2001) Congruence of mossy fiber and climbing fiber tactile projections in the lateral hemispheres of the rat cerebellum. The Journal of Comparative Neurology. Na, Sugihara, Shinoda (2019) The entire trajectories of single pontocerebellar axons and their lobular and longitudinal terminal distribution patterns in multiple aldolase C-positive compartments of the rat cerebellar cortex. The Journal of Comparative Neurology.
  
  6) The nature of the mossy fiber-granule cell recording is also reviewed here: Gilbert and Miall (2022) How and Why the Cerebellum Recodes Input Signals: An Alternative to Machine Learning. The Neuroscientist. Further, considering the re-coding idea, the following paper shows that detailed information, as it is provided by mossy fibers, is transmitted through the granule cells without any evidence of re-coding: Jörntell and Ekerot (2006) Journal of Neuroscience; and this paper shows that these granule inputs are powerfully transmitted to the molecular layer even in a decerebrated animal (i.e. where only the ascending sensory pathways remain): Jörntell and Ekerot (2002) Neuron.
  
  We agree that there is strong evidence for a topographic organization in mossy fiber to granule cell connectivity at the microzonal level. We thank the Reviewer for pointing us to specific examples. We acknowledge that our simplified model does not capture the structure of connectivity observed in these studies.
  
  However, the focus of our model is on cerebellar neurons presynaptic to a single Purkinje cell. Random or disordered distribution of inputs at this local scale is compatible with topographic organization at the microzonal scale. Furthermore, while there is evidence of structured connections at the local scale, models with random connectivity are able to reproduce the dimensionality of granule cell activity within a small margin of error (Nguyen et al., 2022). Finally, our finding that dense codes are optimal for learning slowly varying tasks is consistent with evidence for the lack of re-coding: for such tasks, re-coding may be absent because it is not required.
  
  We have dedicated a section to this issue in the Assumptions and Extensions portion of our Discussion:
  
  "Another key assumption concerning the granule cells is that they sample mossy fiber inputs randomly, as is typically assumed in Marr-Albus models (Marr, 1969; Albus, 1971; Litwin-Kumar et al., 2017; Cayco-Gajic et al., 2017). Other studies instead argue that granule cells sample from mossy fibers with highly similar receptive fields (Garwicz et al., 1998; Brown and Bower, 2001; Jörntell and Ekerot, 2006) defined by the tuning of mossy fiber and climbing fiber inputs to cerebellar microzones (Apps et al., 2018). This has led to an alternative hypothesis that granule cells serve to relay similarly tuned mossy fiber inputs and enhance their signal-to-noise ratio (Jörntell and Ekerot, 2006; Gilbert and Chris Miall, 2022) rather than to re-encode inputs. Another hypothesis is that granule cells enable Purkinje cells to learn piece-wise linear approximations of nonlinear functions (Spanne and Jörntell, 2013). However, several recent studies support the existence of heterogeneous connectivity and selectivity of granule cells to multiple distinct inputs at the local scale (Huang et al., 2013; Ishikawa et al., 2015). Furthermore, the deviation of the predicted dimension in models constrained by electron-microscopy data as compared to randomly wired models is modest (Nguyen et al., 2022). Thus, topographically organized connectivity at the macroscopic scale may coexist with disordered connectivity at the local scale, allowing granule cells presynaptic to an individual Purkinje cell to sample heterogeneous combinations of the subset of sensorimotor signals relevant to the tasks that Purkinje cell participates in. Finally, we note that the optimality of dense codes for learning slowly varying tasks in our theory suggests that observations of a lack of mixing (Jörntell and Ekerot, 2002) for such tasks are compatible with Marr-Albus models, as in this case nonlinear mixing is not required."
  
  7) I could not find any description of the neuron model used in this paper, so I assume that the neurons are just modelled as linear summators with a threshold (in fact, Figure 5 mentions inhibition, but this appears to be just one big lump inhibition, which basically is an incorrect implementation). In reality, granule cells of course do have specific properties that can impact the input-output transformation, PARTICULARLY with respect to the comparison of sparse versus dense coding, because the low-pass filtering of input that occurs in granule cells (and other neurons) as well as their spike firing stochasticity (Saarinen et al (2008). Stochastic differential equation model for cerebellar granule cell excitability. PLoS Comput. Biol. 4:e1000004) will profoundly complicate these comparisons and make them less straight forward than what is portrayed in this paper. There are also several other factors that would be present in the biological setting but are lacking here, which makes it doubtful how much information in relation to the biological performance that this modelling study provides: What are the types of activity patterns of the inputs? What are the learning rules? What is the topography? What is the impact of Purkinje cell outputs downstream, as the Purkinje cell output does not have any direct action, it acts on the deep cerebellar nuclear neurons, which in turn act on a complex sensorimotor circuitry to exert their effect, hence predictive coding could only become interpretable after the PC output has been added to the activity in those circuits. Where is the differentiated Golgi cell inhibition?
  
  Thank you for these critiques. We have made numerous edits to improve the presentation of the details of our model in the main text of the manuscript. Indeed, granule cells in the main text are modeled as linear sums of mossy fiber inputs with a threshold-linear activation function. A more detailed description of the model for granule cells can now be found in Equation 1 in the Results section:
  
  "The activity of neurons in the expansion layer is given by:
  
  h = φ(J^eff x − θ),   (1)
  
  where φ is a rectified linear activation function φ(u) = max(u,0) applied element-wise. Our results also hold for other threshold-polynomial activation functions. The scalar threshold θ is shared across neurons and controls the coding level, which we denote by f, defined as the average fraction of neurons in the expansion layer that are active."
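
  As a rough illustration of Equation 1, the sketch below computes the expansion layer activity and the resulting coding level for two thresholds. It is not the authors' code: the layer sizes, the Gaussian weights and inputs, and the function names (`expansion_layer`, `coding_level`) are assumptions chosen only to make the example self-contained.

```python
import random

def relu(u):
    """Rectified linear activation phi(u) = max(u, 0)."""
    return max(u, 0.0)

def expansion_layer(J_eff, x, theta):
    """h = phi(J_eff x - theta), with phi applied element-wise."""
    return [relu(sum(w * xi for w, xi in zip(row, x)) - theta) for row in J_eff]

def coding_level(J_eff, inputs, theta):
    """Average fraction of expansion-layer neurons that are active (h > 0)."""
    active = 0
    total = 0
    for x in inputs:
        h = expansion_layer(J_eff, x, theta)
        active += sum(1 for v in h if v > 0)
        total += len(h)
    return active / total

rng = random.Random(0)
N, M, P = 10, 200, 100  # hypothetical sizes: inputs, expansion neurons, patterns
# Assumed Gaussian weights and inputs, scaled so pre-activations have unit variance.
J_eff = [[rng.gauss(0, 1) / N ** 0.5 for _ in range(N)] for _ in range(M)]
inputs = [[rng.gauss(0, 1) for _ in range(N)] for _ in range(P)]

f_low_threshold = coding_level(J_eff, inputs, theta=-1.0)   # denser code
f_high_threshold = coding_level(J_eff, inputs, theta=1.0)   # sparser code
```

  Because the threshold is shared across neurons, raising θ can only switch neurons off, so the coding level falls monotonically as θ increases.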
  
  Most of our analyses use the firing rate model we describe above, but several Supplemental Figures show extensions to this model. As we mention in the Discussion, our results do not depend on the specific choice of nonlinearity (Figure 2-figure supplement 2). We have also considered the possibility that the stochastic nature of granule cell spikes could impact our measures of coding level. In Figure 7-figure supplement 1 we test the robustness of our main conclusion using a spiking model where we model granule cell spikes with Poisson statistics. When measuring coding level in a population of spiking neurons, a key question is at what time window the Purkinje cell integrates spikes. For several choices of integration time windows, we show that dense coding remains optimal for learning smooth tasks. However, we agree with the Reviewer that there are other biological details our model does not address. For example, our spiking model does not capture some of the properties the Saarinen et al. (2008) model captures, including random sub-threshold oscillations and clusters of spikes. Modeling biophysical phenomena at this scale is beyond the scope of our study. We have added this reference to the relevant section of the Discussion:
  
  "We also note that coding level is most easily defined when neurons are modeled as rate, rather than spiking units. To investigate the consistency of our results under a spiking code, we implemented a model in which granule cell spiking exhibits Poisson variability and quantify coding level as the fraction of neurons that have nonzero spike counts (Figure 7-figure supplement 1; Figure 7C). In general, increased spike count leads to improved performance as noise associated with spiking variability is reduced. Granule cells have been shown to exhibit reliable burst responses to mossy fiber stimulation (Chadderton et al., 2004), motivating models using deterministic responses or sub-Poisson spiking variability. However, further work is needed to quantitatively compare variability in model and experiment and to account for more complex biophysical properties of granule cells (Saarinen et al., 2008)."
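
  The spiking definition of coding level quoted above can be sketched as follows. This is an illustrative stand-in rather than the model used for Figure 7-figure supplement 1: the firing rates, the integration windows, and the helper names are hypothetical, and Poisson counts are drawn with Knuth's multiplication method, which is only practical for small expected counts.

```python
import math
import random

def poisson_sample(lam, rng):
    """Draw one Poisson(lam) count using Knuth's multiplication method."""
    if lam <= 0:
        return 0
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1

def spiking_coding_level(rates, window, rng):
    """Fraction of neurons with a nonzero spike count within the integration window."""
    counts = [poisson_sample(r * window, rng) for r in rates]
    return sum(1 for c in counts if c > 0) / len(counts)

rng = random.Random(0)
rates = [5.0] * 2000  # hypothetical firing rates in Hz, identical for simplicity

f_short = spiking_coding_level(rates, window=0.01, rng=rng)  # 10 ms window (assumed)
f_long = spiking_coding_level(rates, window=1.0, rng=rng)    # 1 s window (assumed)
f_silent = spiking_coding_level([0.0] * 10, window=1.0, rng=rng)
```

  For a neuron firing at rate r, the probability of at least one spike in a window of length T is 1 − exp(−rT), so the measured coding level depends on the window over which spikes are integrated.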
  
  A second concern the Reviewer raises is our implementation of Golgi cell inhibition as a homogeneous rather than heterogeneous input onto granule cells. In simplified models, adding heterogeneous inhibition does not dramatically change the qualitative properties of the expansion layer representation, in particular the dimensionality of the representation (Billings et al., 2014, Cayco-Gajic et al., 2017, Litwin-Kumar et al., 2017). We have added a section about inhibition to our Discussion:
  
  "We also have not explicitly modeled inhibitory input provided by Golgi cells, instead assuming such input can be modeled as a change in effective threshold, as in previous studies (Billings et al., 2014; Cayco-Gajic et al., 2017; Litwin-Kumar et al., 2017). This is appropriate when considering the dimension of the granule cell representation (Litwin-Kumar et al., 2017), but more work is needed to extend our model to the case of heterogeneous inhibition."
  
  Regarding the mossy fiber inputs, as we state in response to paragraph 3, we agree with the Reviewer that the random and uncorrelated mossy fiber activity that has been used in previous studies is an unrealistic idealization of in vivo neural activity. One of the motivations for our model was to relax this assumption and examine the consequences: we introduce correlations in the mossy fiber activity by projecting low-dimensional patterns into the mossy fiber layer (Figure 1B):
  
  "A typical assumption in computational theories of the cerebellar cortex is that inputs are randomly distributed in a high-dimensional space (Marr, 1969; Albus, 1971; Brunel et al., 2004; Babadi and Sompolinsky, 2014; Billings et al., 2014; Litwin-Kumar et al., 2017). While this may be a reasonable simplification in some cases, many tasks, including cerebellum-dependent tasks, are likely best-described as being encoded by a low-dimensional set of variables. For example, the cerebellum is often hypothesized to learn a forward model for motor control (Wolpert et al., 1998), which uses sensory input and motor efference to predict an effector’s future state. Mossy fiber activity recorded in monkeys correlates with position and velocity during natural movement (van Kan et al., 1993). Sources of motor efference copies include motor cortex, whose population activity lies on a low-dimensional manifold (Wagner et al., 2019; Huang et al., 2013; Churchland et al., 2010; Yu et al., 2009). We begin by modeling the low dimensionality of inputs and later consider more specific tasks.
  
  We therefore assume that the inputs to our model lie on a D-dimensional subspace embedded in the N-dimensional input space, where D is typically much smaller than N (Figure 1B). We refer to this subspace as the “task subspace” (Figure 1C)."
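
  One way to picture the task subspace is to draw D task variables and map them linearly into the N-dimensional input space. The sketch below does this with a random embedding matrix. The random embedding, the sizes, and the names `make_embedding` and `embed` are assumptions made for illustration and are not taken from the manuscript.

```python
import random

def make_embedding(n_inputs, d_task, rng):
    """Random N x D matrix whose columns span the task subspace (illustrative choice)."""
    return [[rng.gauss(0, 1) / d_task ** 0.5 for _ in range(d_task)]
            for _ in range(n_inputs)]

def embed(A, z):
    """Map D task variables z to an N-dimensional input pattern x = A z."""
    return [sum(a * zk for a, zk in zip(row, z)) for row in A]

rng = random.Random(1)
N, D, P = 50, 3, 20  # hypothetical sizes, with D much smaller than N
A = make_embedding(N, D, rng)
task_variables = [[rng.gauss(0, 1) for _ in range(D)] for _ in range(P)]
inputs = [embed(A, z) for z in task_variables]

# Linearity check: embedding the sum of two task-variable vectors gives the
# sum of their embeddings, so every input stays in the span of A's D columns.
z_sum = [a + b for a, b in zip(task_variables[0], task_variables[1])]
x_sum = embed(A, z_sum)
x_expected = [a + b for a, b in zip(inputs[0], inputs[1])]
max_dev = max(abs(a - b) for a, b in zip(x_sum, x_expected))
```

  Although each pattern has N components, all patterns are linear combinations of the same D columns, which is what makes the inputs correlated rather than randomly distributed in the full space.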
  
  The Reviewer also mentions the learning rule at granule cell to Purkinje cell synapses. We agree that considering online, climbing-fiber-dependent learning is an important generalization. We therefore added a new supplemental figure investigating whether we would still see a difference in optimal coding levels across tasks if online learning were used instead of the least squares solution (Figure 7-figure supplement 2). Indeed, we observed a similar task dependence as we saw in Figure 2F. We have added a new paragraph in the Discussion under Assumptions and Extensions describing our rationale and approach in detail:
  
  "For the Purkinje cells, our model assumes that their responses to granule cell input can be modeled as an optimal linear readout. Our model therefore provides an upper bound to linear readout performance, a standard benchmark for the quality of a neural representation that does not require assumptions on the nature of climbing fiber-mediated plasticity, which is still debated. Electrophysiological studies have argued in favor of a linear approximation (Brunel et al., 2004). To improve the biological applicability of our model, we implemented an online climbing fiber-mediated learning rule and found that optimal coding levels are still task-dependent (Figure 7-figure supplement 2). We also note that although we model several timing-dependent tasks (Figure 7), our learning rule does not exploit temporal information, and we assume that temporal dynamics of granule cell responses are largely inherited from mossy fibers. Integrating temporal information into our model is an interesting direction for future investigation."
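
  To make the idea of an online, error-driven readout concrete, here is a minimal sketch that uses a plain delta rule (least mean squares) as a stand-in. The manuscript's climbing fiber-mediated rule is not specified in this response, so the update rule, learning rate, number of epochs, and Gaussian "granule cell" patterns below are all assumptions.

```python
import random

def predict(w, h):
    """Linear readout of expansion-layer activity h with weights w."""
    return sum(wi * hi for wi, hi in zip(w, h))

def mse(w, patterns, targets):
    """Mean squared error of the linear readout over all patterns."""
    return sum((t - predict(w, h)) ** 2 for h, t in zip(patterns, targets)) / len(targets)

def online_delta_rule(patterns, targets, lr=0.02, epochs=50, seed=0):
    """Update weights one pattern at a time, in proportion to the output error."""
    rng = random.Random(seed)
    w = [0.0] * len(patterns[0])
    order = list(range(len(patterns)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            # Error signal: a hypothetical stand-in for climbing fiber input.
            err = targets[i] - predict(w, patterns[i])
            w = [wi + lr * err * hi for wi, hi in zip(w, patterns[i])]
    return w

rng = random.Random(2)
M, P = 5, 40  # hypothetical numbers of readout inputs and training patterns
w_true = [rng.gauss(0, 1) for _ in range(M)]
patterns = [[rng.gauss(0, 1) for _ in range(M)] for _ in range(P)]
targets = [predict(w_true, h) for h in patterns]  # noise-free, linearly realizable

error_before = mse([0.0] * M, patterns, targets)
w_learned = online_delta_rule(patterns, targets)
error_after = mse(w_learned, patterns, targets)
```

  On a noise-free, linearly realizable target such as this one, the online rule approaches the least-squares solution, which is the upper bound on linear readout performance described in the quoted paragraph.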
  
  Finally, regarding the function of the Purkinje cell, our model defines a learning task as a mapping from inputs to target activity in the Purkinje cell and is thus agnostic to the cell’s downstream effects. We clarify this point when introducing the definition of a learning task:
  
  "In our model, a learning task is defined by a mapping from task variables x to an output f(x), representing a target change in activity of a readout neuron, for example a Purkinje cell. The limited scope of this definition implies our results should not strongly depend on the influence of the readout neuron on downstream circuits."
  
  8) The problem of these, in my impression, generic, arbitrary settings of the neurons and the network in the model becomes obvious here: ‘In contrast to the dense activity in cerebellar granule cells, odor responses in Kenyon cells, the analogs of granule cells in the Drosophila mushroom body, are sparse...’ How can this system be interpreted as an analogy to granule cells in the mammalian cerebellum when the model does not address the specifics lined up above? I.e. the ‘inductive bias’ that the authors speak of, defined as ‘the tendency of a network toward learning particular types of input-output mappings’, would be highly dependent on the specifics of the network model.
  
  We agree with the Reviewer that our model makes several simplifying assumptions for mathematical tractability. However, we note that our study is not the first to draw analogies between cerebellum-like systems, including the mushroom body (Bell et al., 2008; Farris, 2011). All the systems we study feature a sparsely connected, expanded granule-like layer that sends parallel fiber axons onto densely connected downstream neurons known to exhibit powerful synaptic plasticity, thus motivating the key architectural assumptions of our model. We have constrained anatomical parameters of the model using data as available (Table 1). However, we agree with the Reviewer that when making comparisons across species there is always a possibility that differences are due to physiological mechanisms we have not fully understood or captured with a model. As such, we can only present a hypothesis for these differences. We have modified our Discussion section on this topic to clearly state this.
  
  "Our results predict that qualitative differences in the coding levels of cerebellum-like systems, across brain regions or across species, reflect an optimization to distinct tasks (Figure 7). However, it is also possible that differences in coding level arise from other physiological differences between systems."
  
  9) More detailed comments: Abstract: ‘In these models [Marr-Albus], granule cells form a sparse, combinatorial encoding of diverse sensorimotor inputs. Such sparse representations are optimal for learning to discriminate random stimuli.’ Yes, I would agree with the first part, but I contest the second part of this statement. I think what is true for sparse coding is that the learning of random stimuli will be faster, as in a perceptron, but not necessarily better. As the sparsification essentially removes information, it could be argued that the quality of the learning is poorer. So from that perspective, it is not optimal. The authors need to specify from what perspective they consider sparse representations optimal for learning.
  
  This is an important point that we would like to clarify. It is not the case that sparse coding simply speeds up learning. In our study and many related works (Barak et al. 2013; Babadi and Sompolinsky 2014; Litwin-Kumar et al. 2017), learning performance is measured based on the generalization ability of the network – the ability to predict correct labels for previously unseen inputs. As our study and previous studies show, sparse codes are optimal in the sense that they minimize generalization error, independent of any effect on learning speed. To communicate this more effectively, we have added the following sentence to the first paragraph of the Introduction:
  
  "Sparsity affects both learning speed (Cayco-Gajic et al., 2017), and generalization, the ability to predict correct labels for previously unseen inputs (Barak et al., 2013; Babadi and Sompolinsky, 2014; Litwin-Kumar et al., 2017)."
  
  10) Introduction: ‘Indeed, several recent studies have reported dense activity in cerebellar granule cells in response to sensory stimulation or during motor control tasks (Knogler et al., 2017; Wagner et al., 2017; Giovannucci et al., 2017; Badura and De Zeeuw, 2017; Wagner et al., 2019), at odds with classic theories (Marr, 1969; Albus, 1971).’ In fact, this was precisely the issue that was addressed already by Jörntell and Ekerot (2006) Journal of Neuroscience. The conclusion was that these actual recordings of granule cells in vivo provided essentially no support for the assumptions in the Marr-Albus theories.
  
  In our reading, the main finding of Jörntell and Ekerot (2006) is that individual granule cells are activated by mossy fibers with overlapping receptive fields driven by a single type of somatosensory input. However, there is also evidence of nonlinear mixed selectivity in granule cells in support of the re-coding hypothesis (Huang et al., 2013; Ishikawa et al., 2015). Jörntell and Ekerot (2006) also suggest that the granule cell layer shares similar topographic organization as mossy fibers, organized into microzones. The existence of topographic organization does not invalidate Marr-Albus theories. As we have suggested earlier, a local combinatorial expansion can coexist with a global topographic organization.
  
  We have described these considerations in the Assumptions and Extensions portion of the Discussion:
  
  "Another key assumption concerning the granule cells is that they sample mossy fiber inputs randomly, as is typically assumed in Marr-Albus models (Marr, 1969; Albus, 1971; Litwin-Kumar et al., 2017; Cayco-Gajic et al., 2017). Other studies instead argue that granule cells sample from mossy fibers with highly similar receptive fields (Garwicz et al., 1998; Brown and Bower, 2001; Jörntell and Ekerot, 2006) defined by the tuning of mossy fiber and climbing fiber inputs to cerebellar microzones (Apps et al., 2018). This has led to an alternative hypothesis that granule cells serve to relay similarly tuned mossy fiber inputs and enhance their signal-to-noise ratio (Jörntell and Ekerot, 2006; Gilbert and Chris Miall, 2022) rather than to re-encode inputs. Another hypothesis is that granule cells enable Purkinje cells to learn piece-wise linear approximations of nonlinear functions (Spanne and Jörntell, 2013). However, several recent studies support the existence of heterogeneous connectivity and selectivity of granule cells to multiple distinct inputs at the local scale (Huang et al., 2013; Ishikawa et al., 2015). Furthermore, the deviation of the predicted dimension in models constrained by electron-microscopy data as compared to randomly wired models is modest (Nguyen et al., 2022). Thus, topographically organized connectivity at the macroscopic scale may coexist with disordered connectivity at the local scale, allowing granule cells presynaptic to an individual Purkinje cell to sample heterogeneous combinations of the subset of sensorimotor signals relevant to the tasks that Purkinje cell participates in. Finally, we note that the optimality of dense codes for learning slowly varying tasks in our theory suggests that observations of a lack of mixing (Jörntell and Ekerot, 2002) for such tasks are compatible with Marr-Albus models, as in this case nonlinear mixing is not required."
  
  We have also included the Jörntell and Ekerot (2006) study as a citation in the Introduction:
  
  "Indeed, several recent studies have reported dense activity in cerebellar granule cells in response to sensory stimulation or during motor control tasks (Jörntell and Ekerot, 2006; Knogler et al., 2017; Wagner et al., 2017; Giovannucci et al., 2017; Badura and De Zeeuw, 2017; Wagner et al., 2019), at odds with classic theories (Marr, 1969; Albus, 1971)."
  
  11) Results: 1st para: There is no information about how the granule cells are modelled.
  
  We agree that this information should have been more readily available. We now describe the model more completely in the main text. Our model for granule cells can be found in Equation 1 in the Results section and also in the Methods (Network Model):
  
  "The activity of neurons in the expansion layer is given by: h = φ(J^eff x − θ),   (1)
  
  where φ is a rectified linear activation function φ(u) = max(u,0) applied element-wise. Our results also hold for other threshold-polynomial activation functions. The scalar threshold θ is shared across neurons and controls the coding level, which we denote by f, defined as the average fraction of neurons in the expansion layer that are active."
  
  12) 2nd para: ‘A typical assumption in computational theories of the cerebellar cortex is that inputs are randomly distributed in a high-dimensional space.’ Yes, I agree, and this is in fact in conflict with the known topographical organization in the cerebellar cortex (see broader comment above). Mossy fiber inputs coding for closely related inputs are co-localized in the cerebellar cortex. I think for this model to be of interest from the point of view of the mammalian cerebellar cortex, it would need to pay more attention to this organizational feature.
  
  As we discuss in our response to paragraphs 5 and 6, we see the random distribution assumption at the local scale (inputs presynaptic to a single Purkinje cell) as being compatible with topographic organization occurring at the microzone scale. Furthermore, as discussed earlier, we specifically model low-dimensional input as opposed to the random and high-dimensional inputs typically studied in prior models.
  
  "A typical assumption in computational theories of the cerebellar cortex is that inputs are randomly distributed in a high-dimensional space (Marr, 1969; Albus, 1971; Brunel et al., 2004; Babadi and Sompolinsky, 2014; Billings et al., 2014; Litwin-Kumar et al., 2017). While this may be a reasonable simplification in some cases, many tasks, including cerebellum-dependent tasks, are likely best-described as being encoded by a low-dimensional set of variables. For example, the cerebellum is often hypothesized to learn a forward model for motor control (Wolpert et al., 1998), which uses sensory input and motor efference to predict an effector’s future state. Mossy fiber activity recorded in monkeys correlates with position and velocity during natural movement (van Kan et al., 1993). Sources of motor efference copies include motor cortex, whose population activity lies on a low-dimensional manifold (Wagner et al., 2019; Huang et al., 2013; Churchland et al., 2010; Yu et al., 2009). We begin by modeling the low dimensionality of inputs and later consider more specific tasks. We therefore assume that the inputs to our model lie on a D-dimensional subspace embedded in the N-dimensional input space, where D is typically much smaller than N (Figure 1B). We refer to this subspace as the “task subspace” (Figure 1C)."
  
  References
  
  Albus, J.S. (1971). A theory of cerebellar function. Mathematical Biosciences 10, 25–61.
  
  Apps, R., et al. (2018). Cerebellar Modules and Their Role as Operational Cerebellar Processing Units. Cerebellum 17, 654–682.
  
  Babadi, B. and Sompolinsky, H. (2014). Sparseness and expansion in sensory representations. Neuron 83, 1213–1226.
  
  Badura, A. and De Zeeuw, C.I. (2017). Cerebellar granule cells: dense, rich and evolving representations. Current Biology 27, R415–R418.
  
  Barak, O., Rigotti, M., and Fusi, S. (2013). The sparseness of mixed selectivity neurons controls the generalization–discrimination trade-off. Journal of Neuroscience 33, 3844–3856.
  
  Bell, C.C., Han, V., and Sawtell, N.B. (2008). Cerebellum-like structures and their implications for cerebellar function. Annual Review of Neuroscience 31, 1–24.
  
  Billings, G., Piasini, E., Lőrincz, A., Nusser, Z., and Silver, R.A. (2014). Network structure within the cerebellar input layer enables lossless sparse encoding. Neuron 83, 960–974.
  
  Bordelon, B., Canatar, A., and Pehlevan, C. (2020). Spectrum dependent learning curves in kernel regression and wide neural networks. International Conference on Machine Learning 1024–1034.
  
  Brown, I.E. and Bower, J.M. (2001). Congruence of mossy fiber and climbing fiber tactile projections in the lateral hemispheres of the rat cerebellum. Journal of Comparative Neurology 429, 59–70.
  
  Brunel, N., Hakim, V., Isope, P., Nadal, J.P., and Barbour, B. (2004). Optimal information storage and the distribution of synaptic weights: perceptron versus Purkinje cell. Neuron 43, 745–757.
  
  Canatar, A., Bordelon, B., and Pehlevan, C. (2021). Spectral bias and task-model alignment explain generalization in kernel regression and infinitely wide neural networks. Nature Communications 12, 1–12.
  
  Cayco-Gajic, N.A., Clopath, C., and Silver, R.A. (2017). Sparse synaptic connectivity is required for decorrelation and pattern separation in feedforward networks. Nature Communications 8, 1–11.
  
  Chadderton, P., Margrie, T.W., and Häusser, M. (2004). Integration of quanta in cerebellar granule cells during sensory processing. Nature 428, 856–860.
  
  Churchland, M.M., et al. (2010). Stimulus onset quenches neural variability: a widespread cortical phenomenon. Nature Neuroscience 13, 369–378.
  
  Farris, S.M. (2011). Are mushroom bodies cerebellum-like structures? Arthropod Structure & Development 40, 368–379.
  
  Garwicz, M., Jörntell, H., and Ekerot, C.F. (1998). Cutaneous receptive fields and topography of mossy fibres and climbing fibres projecting to cat cerebellar C3 zone. The Journal of Physiology 512 (Pt 1), 277–293.
  
  Gilbert, M. and Chris Miall, R. (2022). How and Why the Cerebellum Recodes Input Signals: An Alternative to Machine Learning. The Neuroscientist 28, 206–221.
  
  Giovannucci, A., et al. (2017). Cerebellar granule cells acquire a widespread predictive feedback signal during motor learning. Nature Neuroscience 20, 727–734.
  
  Huang, C.C., et al. (2013). Convergence of pontine and proprioceptive streams onto multimodal cerebellar granule cells. eLife 2, e00400.
  
  Ishikawa, T., Shimuta, M., and Häusser, M. (2015). Multimodal sensory integration in single cerebellar granule cells in vivo. eLife 4, e12916.
  
  Jacot, A., Gabriel, F., and Hongler, C. (2018). Neural tangent kernel: Convergence and generalization in neural networks. Advances in Neural Information Processing Systems 31.
  
  Jo¨rntell, H. and Ekerot, C.F. (2002). Reciprocal Bidirectional Plasticity of Parallel Fiber Receptive Fields in Cerebellar Purkinje Cells and Their Afferent Interneurons. Neuron 34, 797–806.
  
  Jorntell, H. and Ekerot, C.F. (2006). Properties of Somatosensory Synaptic Integration in Cerebellar Granule Cells In Vivo. Journal of Neuroscience 26, 11786–11797.
  
  Knogler, L.D., Markov, D.A., Dragomir, E.I., Štih, V., and Portugues, R. (2017). Sensorimotor representations in cerebellar granule cells in larval zebrafish are dense, spatially organized, and non-temporally patterned. Current Biology 27, 1288–1302.
  
  Litwin-Kumar, A., Harris, K.D., Axel, R., Sompolinsky, H., and Abbott, L.F. (2017). Optimal degrees of synaptic connectivity. Neuron 93, 1153–1164.
  
  Marr, D. (1969). A theory of cerebellar cortex. Journal of Physiology 202, 437–470.
  
  Nguyen, T.M., et al. (2022). Structured cerebellar connectivity supports resilient pattern separation. Nature 1–7.
  
  Saarinen, A., Linne, M.L., and Yli-Harja, O. (2008). Stochastic Differential Equation Model for Cerebellar Granule Cell Excitability. PLOS Computational Biology 4, e1000004.
  
  Simon, J.B., Dickens, M., and DeWeese, M.R. (2021). A theory of the inductive bias and generalization of kernel regression and wide neural networks. arXiv: 2110.03922.
  
  Sollich, P. (1998). Learning curves for Gaussian processes. Advances in Neural Information Processing Systems 11.
  
  Spanne, A. and Jörntell, H. (2013). Processing of Multi-dimensional Sensorimotor Information in the Spinal and Cerebellar Neuronal Circuitry: A New Hypothesis. PLOS Computational Biology 9, e1002979.
  
  Spanne, A. and Jörntell, H. (2015). Questioning the role of sparse coding in the brain. Trends in Neurosciences 38, 417–427.
  
  van Kan, P.L., Gibson, A.R., and Houk, J.C. (1993). Movement-related inputs to intermediate cerebellum of the monkey. Journal of Neurophysiology 69, 74–94.
  
  Wagner, M.J., Kim, T.H., Savall, J., Schnitzer, M.J., and Luo, L. (2017). Cerebellar granule cells encode the expectation of reward. Nature 544, 96–100.
  
  Wagner, M.J., et al. (2019). Shared cortex-cerebellum dynamics in the execution and learning of a motor task. Cell 177, 669–682.e24.
  
  Wolpert, D.M., Miall, R.C., and Kawato, M. (1998). Internal models in the cerebellum. Trends in Cognitive Sciences 2, 338–347.
  
  Yu, B.M., et al. (2009). Gaussian-process factor analysis for low-dimensional single-trial analysis of neural population activity. Journal of Neurophysiology 102, 614–635.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.08.15.504040v1
www.biorxiv.org

New submission 17/06/2023, 17:08:06

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  This study combines in vitro somatic and dendritic recordings and computational modeling to study how cholinergic agonists modulate the response of CA1 pyramidal neurons to triangular current injections. The authors have previously used a similar approach (Upchurch, 2022, JNeuroscience) to show that CA1 neurons exhibit asymmetric AP firing (more firing on the upward ramp) in response to such current injections and that this effect is due to Na channel inactivation. The present work builds on these results by showing that cholinergic modulation changes this response, i.e., there is more firing on the downward part of the ramp. This change appears to require an intracellular Ca2+ concentration increase (mediated via IP3 and voltage-gated Ca2+ channels), which activates TRPM4 channels. In this scheme, cholinergic activity increases IP3, and the depolarizing current injection opens voltage-gated Ca2+ channels. This study will be of some interest to cellular neurophysiology experts working on the hippocampus.
  
  1) This study claims that the triangular current injections recapitulate hippocampal place cell activity. However, it has been shown recently that the asymmetric firing of CA1 place cells is due to synaptic weight changes resulting from synaptic plasticity (e.g., Bittner et al., 2017). This suggests that the asymmetric firing of place cells is primarily the result of asymmetric synaptic input. Therefore, the authors should test whether carbachol similarly affects a synaptically driven membrane potential ramp. If this is not the case, the strong claim that this work has implications for place cell firing is not justified, in my opinion.
  
  We have added the results showing the effects of cholinergic modulation on a synaptically-driven membrane potential ramp, obtained by electrically stimulating the Schaffer collaterals with a stimulation frequency that was adjusted according to a linear, symmetric ramp (see also Hsu et al., Neuron 99, 147-162, 2018). These results have been added to the manuscript in the Results section for new Figure 2 (lines 169-197) and in the Methods section (lines 716-726).
  
  2) Along the same lines, it has been shown before that the precision of spike timing depends on the stimulation pattern in vitro (Mainen and Sejnowski, 1995). Constant stimuli led to imprecise AP firing trains, whereas current injections that included fluctuations resembling synaptic input generated spike trains that were more reliable and reproducible in terms of timing. This study concluded that a low intrinsic noise level in spike generation was essential in generating informative spike sequences. Following this pivotal work, the authors could add noise to their current stimulus and observe the effect on the AP firing patterns. If this is not possible, the authors should at least report the sweep-to-sweep variability for the data shown, e.g., in panels 1A2, 1B2, 1D2, and 1E2.
  
  We thank the reviewer for this suggestion to acknowledge the variability in the data across trials and we have added the Mainen and Sejnowski, 1995 citation to the manuscript (see Results lines 128-134). We addressed sweep-to-sweep variability among the various trials.
  
  3) In most of the data presented in this manuscript, Carbachol appears to induce a 3 mV hyperpolarization and increase input resistance. As a result, the amount of current injected during Carbachol is drastically lower than during the controls. This should be emphasized more, and the input resistance should be quantified for each experimental condition. It should also be discussed whether this change in input resistance can account for the changes in the firing pattern observed. Finally, it should be clearly stated how the amount of the current injected was chosen for each cell, and data from a range of injected current ramps should be shown for each cell.
  
  We thank the reviewers for this comment, which made us realize that our initial presentation was not clear, in particular with regard to the traces that were chosen as examples in the initial submission of the paper. We now clarify on page 5 (lines 113-125) of the manuscript as follows:
  
  “In some trials, under control conditions, we applied a baseline depolarization prior to the ramp, in order to capture the variability observed in vivo (Harvey et al Nature 461:941–946, 2009; Epsztein et al. Neuron 70:109–120, 2011). Application of the cholinergic agonist carbachol (CCh, 2 µM) caused a depolarization of 2-6 mV. We compensated for this depolarization by injecting tonic hyperpolarizing current to reestablish the original membrane potential (see also Losonczy, et al., Nature 452, 436-442, 2008), as indicated by an offset from the 0 pA current level in the traces of the injected current ramps. The amplitude of background fluctuations in the resting membrane potential increased from a few tenths of a mV in control to 2-4 mV in CCh. Moreover, the threshold for action potential generation became more hyperpolarized. For all these reasons, we were not able to consistently vary the membrane potential using baseline depolarizations in the presence of CCh, because baseline depolarization alone frequently evoked spiking.”
  
  For this reason, many of the carbachol example traces in the initial submission had more hyperpolarized Vm than their control counterparts. Acetylcholine also caused a depolarization in a dose-dependent manner that was compensated for in the same way. In this new version of the manuscript, we systematically report the effects of cholinergic agonists on membrane potential and neuronal excitability. Further, we show example traces with resting membrane potentials within 1 mV for each pharmacological comparison, therefore removing this variable and hopefully making results clearer. We also now state how the amount of injected current was chosen for each condition, and that the amount of injected current was generally lower in the presence of cholinergic agonists. Both the tonic hyperpolarizing current and the amplitude of the injected ramp for each example can now be appreciated in each figure.
  
  Finally, the reviewers’ comment also made us realize that, in principle, the center of mass of firing could be systematically skewed by the initial membrane potential, the amplitude of the current ramp injection and/or the input resistance. For this reason, we added a supplementary figure (1-2) where the adaptation index was plotted as a function of each of these variables. In all cases, it is apparent that the main factor determining whether the center of mass of firing is shifted earlier or later in the ramp is the presence or absence of carbachol rather than initial membrane potential, current injection amplitude, or input resistance.
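
  To make the two quantities in this response concrete, the following is a minimal sketch of how an adaptation index and a center of mass of firing could be computed from spike times recorded during a symmetric ramp. The definitions used here are assumptions for illustration only: the index is taken as (spikes on the up-ramp minus spikes on the down-ramp) divided by total spikes, and the center of mass as the mean spike time within the ramp. The response itself does not spell out either formula, and the function names and spike times are hypothetical.

```python
import math


def adaptation_index(spike_times, ramp_start, ramp_peak, ramp_end):
    """Assumed definition: (up-ramp spikes - down-ramp spikes) / total spikes.

    Positive values mean more firing on the up-ramp (earlier in the ramp),
    negative values mean more firing on the down-ramp (later in the ramp).
    A spike exactly at the peak is counted on the down-ramp.
    """
    up = sum(1 for t in spike_times if ramp_start <= t < ramp_peak)
    down = sum(1 for t in spike_times if ramp_peak <= t <= ramp_end)
    total = up + down
    if total == 0:
        return math.nan  # no spikes during the ramp: index undefined
    return (up - down) / total


def center_of_mass(spike_times, ramp_start, ramp_end):
    """Assumed definition: mean time of the spikes that fall within the ramp."""
    in_ramp = [t for t in spike_times if ramp_start <= t <= ramp_end]
    if not in_ramp:
        return math.nan
    return sum(in_ramp) / len(in_ramp)


# Hypothetical spike times (seconds) for a 2 s ramp peaking at 1 s.
control_spikes = [0.4, 0.6, 0.8, 0.9, 1.1, 1.3]
carbachol_spikes = [0.8, 0.95, 1.1, 1.2, 1.4, 1.6]

for label, spikes in [("control", control_spikes), ("carbachol", carbachol_spikes)]:
    idx = adaptation_index(spikes, 0.0, 1.0, 2.0)
    com = center_of_mass(spikes, 0.0, 2.0)
    print(f"{label}: adaptation index = {idx:.2f}, center of mass = {com:.2f} s")
```

  Under these assumed definitions, a shift of firing toward the down-ramp appears as a negative index and a center of mass later than the ramp peak, which is the direction of change the response attributes to carbachol.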
  
  4) It remains unclear how the current result that TRPM4 channels can mediate the firing pattern change relates to the previous finding that the current injection evoked CA1 neuronal firing pattern is due to long-term Na channel inactivation.
  
  We thank the reviewers for this suggestion, which helps to clarify our initial results. New Figure 8 addresses the connection between long-term inactivation of Na+ channels and the activation of TRPM4 channels, as characterized by the model (see Results lines 375-391). Furthermore, the model was instrumental in assessing how the Ca2+ and voltage-dependence of TRPM4 channels synergize to contribute to the shift in the center of mass of firing (Figure 9). Figure 9 illustrates the positive feedback loop between Ca2+ entry and the additional depolarization produced by Ca2+ activation of TRPM4 channels that can potentially accelerate firing (see Results lines 392-427).
  
  5) Figure 8: Panel C is supposed to confirm the prediction from the model that the carbachol-mediated change of firing activity is related to intracellular Ca2+ domains. However, the example cell shown is depolarized to -52 mV, and there is no hyperpolarization following Carbachol. Is this an effect of the high concentration of BAPTA? Again, what was the current injected under this experimental condition?
  
  Again, we thank the reviewer for pointing out the lack of clarity in the presentation of our results. We have now rewritten the results section for former Figure 8 (now Figure 10) to more clearly present these findings. The reviewer is correct that with the combination of 30 mM BAPTA + 10 nM free Ca2+ added to the intracellular solution (panel C of current Figure 10) the addition of carbachol did not change the membrane potential, as there were no changes in the holding current. Also, the amplitude of the ramp is comparable in control conditions and in the presence of carbachol under these conditions.
  
  We have now added all these details in the Results section for figure 10C.
  
  Reviewer #2 (Public Review):
  
  The manuscript focuses on the cholinergic modulation of TRPM4 channels in the CA1 pyramidal neurons. The authors presented solid convincing evidence that TRPM4 but not TRPC channels are the Ca2+-activated nonselective cation channel in CA1 pyramidal neurons being modulated by activation of muscarinic receptors. Using bi-directional ramp protocol, the authors revealed that ACh modulation could lead to forward shifts in place field center of mass, whereas decreased ACh modulation could contribute to backward shifts. This represents a significant molecular/cellular finding that links neuromodulation of intrinsic properties to place field shifts, a phenomenon seen in vivo. The authors used a computational approach to model this CA1 neuron spiking to further reveal the mechanism.
  
  To further improve the manuscript, I have the following suggestions/questions:
  
  1) The triangular ramp stimulation (introduced by the same group; Upchurch et al., 2022) makes it possible to emulate the hill-shaped depolarization during place field firing. However, one concern is the time scale/duration of the ramp (2 sec) compared to the physiological pattern (100ms~200ms in the in vivo recording in freely moving rat, Epsztein et al., 2011). Using a longer ramp to generate more spikes for calculating the adaptation index is understandable. However, considering the Ca entry/accumulation during prolonged depolarization, repeating one set of experiments with a shorter ramp is crucial to verify the major findings.
  
  When determining the duration of the current injections for our ramps, we relied on the data recorded in vivo in freely moving rats (Epsztein et al. Neuron 70:109–120, 2011) or in head-fixed mice running on a spherical treadmill immersed in virtual reality (Harvey et al Nature 461:941–946, 2009). In those papers, the voltage deflections are shown as a function of time, and gray bars or boxes represent the time the animals spend traversing the place field. We interpret those figures as showing that the hill-shaped depolarizations have variable durations, on the order of 1-20 s; we therefore think that our experiments with 2 and 10 second-long ramps cover a fair range of these durations. The place fields in Epsztein et al., 2011 were 4 cm long, and the authors give an example in Figure 3, in which the 2 meter track is traversed 1.5 times in 3 minutes. At that rate, the rat spent on average 2.4 seconds in each place field. We interpret the numerous shorter epochs of firing on the order of 100-200 ms shown in Figure 2 in Epsztein et al. as the result of ongoing theta modulation within one overall depolarization during a single place field traversal. The following quote from that paper supports our interpretation: “Some (Figure 2E, trace 1), but not all (trace 2), passes revealed spiking associated with a series of large (to ~-25 mV), long-lasting (~100 ms) depolarizations (Kandel and Spencer, 1961; Wong and Prince, 1978; Traub and Llinás, 1979; Takahashi and Magee, 2009) occurring rhythmically at ~4–5 Hz (theta frequency).” We thank the reviewer for pointing out these traces; our results are more directly applicable to the traces without theta modulation. Adding theta modulation is beyond the scope of this study but will be considered in future studies. Our average results in Figure 1 show that carbachol similarly affects 2 s and 10 s ramps, therefore we decided to present only the data on 2 second ramps for all the subsequent figures (see Results lines 156-157).
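
  The dwell-time figure quoted above follows from simple arithmetic, which can be sketched as below. The numbers are the ones stated in the response (a 2 m track traversed 1.5 times in 3 minutes, and 4 cm place fields); the function name is hypothetical, and constant running speed is assumed purely for illustration.

```python
def mean_time_in_field(track_length_m, traversals, duration_s, field_length_m):
    """Average time spent inside one place field, assuming constant speed."""
    distance_m = track_length_m * traversals   # total distance covered
    speed_m_per_s = distance_m / duration_s    # average running speed
    return field_length_m / speed_m_per_s      # time needed to cross one field


# Values quoted in the response: 2 m track, 1.5 traversals, 3 min, 4 cm field.
dwell_s = mean_time_in_field(2.0, 1.5, 180.0, 0.04)
print(f"average time in place field: {dwell_s:.1f} s")  # prints 2.4 s
```

  Because the result scales linearly with field length and inversely with running speed, slower passes or longer fields would push the depolarization toward the upper end of the 1-20 s range mentioned in the response.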
  
  2) Strictly speaking, the term "Ca2+-induced Ca2+ release (CICR)" is only used in ER Ca2+ release via ryanodine receptors (RyR) rather than IP3Rs. The authors should be careful since it is used in the abstract (Line 36). In addition, pharmacological inhibition experiments should be incorporated to further dissect the role of RyR-induced CICR.
  
  We thank the reviewer for pointing out the possible confusion regarding the use of the term Ca2+-induced Ca2+ release (CICR) and we removed it from the text. Further, for this resubmission, we have pharmacologically dissected the role of IP3 vs ryanodine receptors in the cholinergic shift in the center of mass of firing due to the activation of TRPM4 channels, as suggested by the reviewer (see new Figure 6). To our surprise, neither the IP3R antagonist, Xestospongin C (1-2 µM), nor the RyR antagonist ryanodine (40 µM) was effective in preventing the cholinergic shift of the center of mass of firing when added to the intracellular solution (see Results lines 310-340).
  
  3) Applying strong buffering BAPTA not only removed the IP3R-TRPM nanodomain but also hindered Ca entry via VGCC. To validate the role of ER Ca2+ release in regulating TRPM, depletion of ER Ca2+ pool with SERCA inhibitor (e.g. thapsigargin) would be a more direct way to test the model (also make sure to add TRPC inhibitor to avoid the store-operated Ca2+ entry).
  
  We agree with the reviewer that 30 mM BAPTA also disrupts intracellular Ca2+ elevation via voltage-dependent Ca2+ channels on the neuronal membrane. Given that our experiments excluded a role of Ca2+ release from the intracellular stores (see below), our new model includes a nanodomain where, during cholinergic activation, the Ca2+ entry through VGCC is amplified to reach micromolar concentrations, through a currently unknown mechanism. As pointed out by the reviewer, the experimental results with 30 mM BAPTA support the existence of a nanodomain for the activation of TRPM4 channels, regardless of the nature of the calcium source.
  
  We have also addressed the role of ER Ca2+ release in our experiments.
  
  4) How does the TRPM current overcome the long-term inactivation of Nav? A channel state model should be added to the manuscript to make it easier to understand.
  
  Figure 11C now shows the Markov model of the NaV channel and new Figure 8 is devoted to explaining the mechanism by which current through the TRPM4 channels overcomes the long-term inactivation of the NaV channel.
  
  Reviewer #3 (Public Review):
  
  Combining slice physiology and simulation, Combe and colleagues discovered that TRPM4 channels activated by Ca2+ in nanodomains mediate ICAN currents in CA1 pyramidal neurons that drive the cholinergic modulation of firing rate. The finding is novel and interesting.
  
  Strengths:
  
  1) Identification of TRPM4 channels as the carrier of ICAN currents with independent pharmacological inhibitors and other supporting evidence.
  
  2) Physiological and simulational verification of physically closely located Ca2+ source and TRPM4 channels required for ICAN activation.
  
  Weaknesses:
  
  1) The conclusion of the cholinergic role in down-ramp or backward firing shifts is not convincing.
  
  We agree with the reviewer that our interpretation is somewhat speculative, and we have now included disclaimers throughout the manuscript as well as placed most of these interpretations in a portion of the discussion titled “Ideas and speculations: Implications of our results for place fields in intact rodents”. In addition, we added the word “potential” in the title.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.24.513511v2
www.biorxiv.org

New submission 04/12/2022, 11:23:18

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  The manuscript by Masschelin et al. describes how Vitamin B2 deficiency affects body composition, energy expenditure, and glucose metabolism. B2 deficient mice have lower O2 consumption and locomotor activity, with no difference in food intake. These mice also have lower liver FAD levels, which is expected given that B2 is a necessary cofactor for this coenzyme. Additionally, these mice have lower blood glucose levels following pyruvate injection, implying a lower capacity for gluconeogenesis. Using PPAR KO mice, they show that this effect on pyruvate tolerance is due to PPARα activation, though there is still a minor difference between wildtype and KO mice. Importantly, they show that fenofibrate PPAR agonism can improve glucose output following pyruvate injection in the absence of B2. The authors also perform robust metabolomics in each experimental condition and phenotype the mice well.
  
  Thank you for the positive input.
  
  1) The authors have yet to explore other explanations of differences in glucose metabolism under B2D +/- Fenofibrate. The canonical targets of PPARα are involved in fatty acid oxidation, ketogenesis, and VLDL/HDL metabolism, in addition to gluconeogenesis (Bougarne et al. 2018). Gluconeogenesis is more of a fasting response due to CREB, FOXO1/PGC1a activation rather than PPAR. In response to B2D, the PPARα KO mice have increased plasma TGs, which may suggest a difference in VLDL TG secretion (Suppl. S3). Perhaps lipid metabolism is more directly affected, and changes in glucose metabolism are secondary to that of triglyceride metabolism. Regarding ketogenesis, the fenofibrate + B2D-fed mice have decreased plasma beta-hydroxybutyrate, suggesting decreased ketogenesis, which is a more canonical PPARα pathway (Suppl. S3). Testing each of these processes would help control that this mechanism is specific to gluconeogenesis and not secondary to something else.
  
  We value this reviewer’s comment. To address this point, we considered other mechanisms in our revised Discussion. In future studies, we plan to further explore these metabolic effects and to use ATAC-Seq to understand the transcription factors responsive to B2D. We anticipate these studies will take additional years to complete. Nonetheless, the present studies set the foundation for future work to investigate how FAD influences transcriptional regulation of metabolism.
  
  2) Is the effect on ISR dependent on PPARα? Is the mechanism of Fenofibrate on the liver, or on another cell type? In Figure 1, the authors state that Riboflavin deficiency alters body composition and energy expenditure, and then focus on the liver. However, FAD levels are also increased in the heart and kidneys in addition to the liver. These tissues also respond to PPARα agonism, in addition to the muscle which plays a role in regulating glucose metabolism (B2D mice also have a higher lean mass (Fig 1e)). Additionally, the authors haven't shown specifically if the effects of Fenofibrate on electron transport and the ISR are dependent on the presence of PPARα (Figure 5, 6).
  
  We agree that knowing whether the effects of Fenofibrate on the ISR require liver PPARA is a critical issue, which will require dedicated studies for a thorough and meaningful conclusion. In new experiments, we knocked down Ppara in the liver using AAV8-Cre administration to Pparaflox/flox mice. Our data show liver-specific Ppara knockdown recapitulates whole-body B2D effects on pyruvate tolerance and hepatic steatosis (Figure 3I). These results agree with findings in whole-body Ppara knockout mice (Supplemental Figure 4), reinforcing the idea that the direct impact of B2D mainly occurs via PPARA activity in the liver. We acknowledge in the discussion ATF4 and ISR activation may contribute to PPARA-independent responses to B2D (Biochem J 443:165–71, 2012; Gut 65:1202-1214, 2016).
  
  An assessment of genetic requirements will require a large, rigorous set of experiments to identify the rate-limiting responses for fenofibrate activities during B2D, which we plan to do in the future. For this report, we decided to focus exclusively on tissue-specific knockout of Ppara. We will establish evidence for ISR responses to B2D in a separate study based on the feedback received here.
  
  Reviewer #2 (Public Review):
  
  The objective of this work by Masschelin et al. is to investigate the physiological relevance of flavin adenine dinucleotide (FAD). In particular, FAD supports the activity of flavoproteins involved in the production of cellular energy. Mutations in genes encoding flavoproteins often are associated with inborn errors of metabolism (IEMs), thus the clinical interest in investigating in more depth the physiological role of FAD. In this study, the authors first subjected male mice to a vitamin B2 deficient diet (B2D), demonstrating that loss of B2 replicates the phenotypes often observed with IEMs, including loss of body weight, hypoglycemia, and fatty liver. Using a combination of metabolomic phenotyping, transcriptomic analyses, and pharmacology (treatment with Fenofibrate, a PPARa agonist), the authors then reach the general conclusion that activation of the nuclear receptor PPARa can rescue the B2D phenotypes, thus revealing that PPARa directly controls the metabolic responses to FAD availability. Although the phenotypic analysis of the mice subjected to B2D increases our knowledge of the physiological impact of depleting the FAD pools on global energy metabolism, not all conclusions and statements made by the authors are totally supported by the data. In particular, the study is overall too descriptive and lacks mechanistic insights. While PPARa is likely an important player in the metabolic response to FAD availability, the molecular details on how FAD controls the activity of PPARa either directly or indirectly are entirely missing. Therefore, the authors are encouraged to directly assess whether B2D directly influences PPARa activity on the genes identified in the study, perform rescue experiments in the liver of PPARa KO mice and explore the possibility that other factors (including nuclear receptors) also participate in the response to B2 deficiency and diminished FAD pools.
  
  We appreciate the input from Reviewer 2. The direct and indirect effects of B2D on PPARA activity are likely not trivial. However, we performed experiments to determine how FAD depletion affects PPARA transcriptional activity using the riboflavin analog and competitive inhibitor lumiflavin (Figure 3L). We found lumiflavin reduced PPRE-luciferase activity in the presence of PPARA agonist. Although the assay is a synthetic reporter expressed in vitro, the experiment provides evidence of how B2D influences PPARA transcriptional activity. And, yes, we agree that our manuscript does not completely reconcile the factor(s) explaining the effects of B2D on gene expression, and we have expanded the discussion to comment on this point. In future studies, we intend to identify which transcription factor(s) regulate the liver responses to B2D and to further elucidate the underlying molecular mechanisms.
  
  AuthorResponse

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.10.16.512439v1
www.biorxiv.org

New submission 30/01/2023, 16:02:44

1
1. Public_Reviews 22 Jun 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 In this manuscript, Scagliotti and colleagues investigate the role of Dlk1 in regulating pituitary size in multiple mouse models with different Dlk1 gene dosages in order to understand the mechanisms of organ size control. They find that overexpression of Dlk1 leads to pituitary overgrowth and loss of Dlk1 causes undergrowth. The authors find two compartments of Dlk1 expression in the pituitary, in the marginal zone stem cell compartment and the parenchymal differentiated cell compartment, and by combining genetic mouse models show that a specific interaction of Dlk1 expression in both regions is necessary to affect pituitary organ size. They present data to suggest that Dlk1 may repress Wnt signaling during development to control a shift from progenitor proliferation to differentiation. The data are meticulous, high quality, and clear.
 
 I have some questions about the interpretation of their data regarding the mechanism of Dlk1 regulation of pituitary organ size, as I believe there could be potential alternative explanations for their observations:
 
 I was wondering about the cause of the enlargement of the pituitary gland in Fig 1E, and whether it is caused by an increased number of cells (hyperplasia), an increased cell size (hypertrophy), or both. Line 104 states it is hyperplasia, and that cell size was not affected in WT-TG ('not shown', line 121). However, line 444 says the TG is hypertrophic. It would be good if the authors could elaborate on this and show or state how cell size was determined. Figs 5/6 show that WT-Tg proliferation is generally similar to WT, which suggests the increased size is not hyperplasia. It would be good to know whether this is correct. Some previous studies have shown that in pregnancy, lactotroph hypertrophy can be responsible for pituitary enlargement without hyperplasia (Castrique 2010, Hodson 2012).
 
 We have now clarified this point throughout the manuscript. We had previously counted cells per field in the analysis shown in Figure 1D as a proxy for cell number (these did not significantly differ by genotype). We have now performed a more robust examination. Cell number was determined using a well-established stereological technique: for each animal, the maximal cross-sectional area (CSA) was determined from the volumetric analysis. At this level, three independent sections were used to measure anterior pituitary CSA and count haematoxylin-stained nuclei, giving a mean cells/CSA measurement per individual. This number was multiplied by the AP volume to give an estimate of cell number.
 
 This analysis was performed on mice from the new cohort of animals containing litter matched adults of all 4 genotypes, and shown in Figure 4E. WT-TG animals had a significant increase in cell number compared to WT littermates (p = 0.0443), therefore pituitary expansion occurs by hyperplasia.
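
 The estimate described in this response (mean nuclei per cross-sectional area across three sections, multiplied by anterior pituitary volume) can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function name and all numbers are hypothetical, and because the response does not state section thickness or units, the output should be read as a relative estimate for comparing genotypes rather than an absolute cell count.

```python
from statistics import mean


def estimate_cell_number(nuclei_counts, section_areas, ap_volume):
    """Mean nuclei-per-area density across sections, scaled by AP volume.

    nuclei_counts : nuclei counted in each section at the maximal CSA level
    section_areas : anterior pituitary cross-sectional area of each section
    ap_volume     : anterior pituitary volume from the volumetric analysis
    """
    if len(nuclei_counts) != len(section_areas) or not nuclei_counts:
        raise ValueError("need one area per counted section, and at least one section")
    densities = [n / a for n, a in zip(nuclei_counts, section_areas)]
    return mean(densities) * ap_volume


# Hypothetical values for one animal: three sections, arbitrary but consistent units.
counts = [1200, 1150, 1250]
areas = [1.0, 1.0, 1.0]
volume = 2.0

print(estimate_cell_number(counts, areas, volume))
```

 Averaging the per-section density before scaling means that a single section with an unusually large or small area does not dominate the estimate for that animal.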
 
 Related to the organ size question above, I had a question about the cell number and proportions in Fig 1D/E/F, which shows the maintenance of endocrine cell proportions and an increase in the volume of ~30% in WT-Tg. For the cell proportions to be maintained, I thought the increase in volume per cell type (Fig 1G) would therefore have to also increase proportionally in every cell type, while 1G appears to show an increase in GH (sig) and PRL/TSH cells (ns). It would be good if the authors could discuss this briefly.
 
 We agree and indeed we see this trend across all cell types. When the data in Figure 1G is compared by 2-Way ANOVA we see a significant effect by cell type (p< 0.0001) and by genotype (p = 0.0009). However, for other hormone-producing cells the effect size does not overcome the variation in a smaller cell population, so the difference between genotypes does not pass multiple significance testing with the relatively small sample size used. We have modified the legend to Figure 1G to make the ANOVA result clearer.
 
 This study is impactful and will be of interest to several research communities, including those interested in pituitary development and function, organ size control, and gene imprinting mechanisms.
 
 Reviewer #2 (Public Review):
 
 Scagliotti et al address how organ size is regulated by imprinted genes. Using a series of mouse models to modulate the dosage of the paternally expressed gene, Dlk1, the authors demonstrate that DLK1 is important for the maintenance of the stem cell compartment leading to the growth of the pituitary gland and the expansion of growth hormone-producing cells. The authors show that overexpression of Dlk1 leads to pituitary hyperplasia while deletion of the paternal allele leads to reduced pituitary size. Reduced pituitary size is accompanied by reduced cell proliferation in the cleft at e13.5 and an increase in the number of POU1F1+ cells, suggesting that loss of Dlk1 alters the balance between the number of cells remaining in the replicating stem cell pool and those differentiating into the POU1F1 lineage. An elegant caveat of this paper is the rescue of Dlk1 expression in the population of cells expressing Pou1f1 but not in SOX2+ stem cells. Expression of Dlk1 only in POU1F1+ cells is not sufficient to rescue pituitary size. The authors suggest that this is because DLK1 must be present in stem cells which then activate paracrine WNT signaling to promote cell proliferation in POU1F1+ cells.
 
 Strengths:
 
 This is an important study that provides a mechanistic understanding of how the imprinted gene, Dlk1, regulates organ size. The study employs an elegant experimental design to address the dosage requirement for Dlk1 in regulating pituitary gland size. Rescuing Dlk1 in the POU1F1+ cells, but not the marginal zone SOX2+ cells provides intriguing results about a possible role for DLK1 in paracrine signaling between these different pituitary cell types. The study uses publicly available scRNAseq and ChIPseq data to further support their findings and identify Dlk1 as a likely target of POU1F1.
 
 Weaknesses:
 
 The study only analyzes females for the adult time point. For embryonic and postnatal time points sexes are pooled. Gender differences in pituitary gene expression embryonically or postnatally could potentially affect experimental outcomes.
 
 We have now added adult data for both sexes.
 
 The authors employ a mouse model that rescues Dlk1 expression starting at e15.5 in POU1F1+ parenchymal cells but not in marginal zone stem cells. Rescuing Dlk1 expression in a specific population of cells is one of the strengths of this study. Based on this information and the fact that overexpression of Dlk1 leads to increased pituitary size, the authors suggest that DLK1+ marginal zone stem cells and DLK1+ parenchymal cells may interact to promote postnatal proliferation. However, the ability to more carefully parse out the complex spatial and temporal contributions of DLK1 to pituitary size would be enhanced by the addition of a mouse model that rescues Dlk1 expression only in SOX2+ cells and a model that rescues expression in both stem cells and POU1F1+ cells.
 
 We agree that the addition of a model where Dlk1 is only expressed in SOX2+ cells would add significant mechanistic insight. To our knowledge an inducible gain-of-function Dlk1 model does not yet exist. Moreover, use of a SOX2-Cre driver would also increase Dlk1 expression in the hypothalamus as well as Rathke’s pouch, further complicating the analysis.
 
URL

biorxiv.org/content/10.1101/2022.11.01.514745v1
www.biorxiv.org www.biorxiv.org

Rapid, automated and experimenter-free assessment of cognitive flexibility reveals learning impairments following recovery from activity-based anorexia in female rats

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  In this manuscript, Huang et al., assess cognitive flexibility in rats trained on an animal model of anorexia nervosa known as activity-based anorexia (ABA). For the first time, they do this in a way that is fully automated and free from experimenter interference, as apparently experimenter interference can affect both the development of ABA as well as the effect on behaviour. They show that animals that are more cognitively flexible (i.e. animals that had received reversal training) were better able to resist weight loss upon exposure to ABA, whereas animals exposed to ABA first show poorer cognitive flexibility (reversal performance).
  
  Strengths:
  
  The development of a fully-automated, experimenter-free behavioural assessment paradigm that is capable of identifying individual rats and therefore tracking their performance.
  
  The bidirectional nature of the study - i.e. the fact that animals were tested for cognitive flexibility both before and after exposure to ABA, so that direction of causality could be established.
  
  The analyses are rigorous and the sample sizes sufficient.
  
  The use of touchscreens increases the translational potential of the findings.
  
  Weaknesses
  
  Some descriptions of methods and results are confusing or insufficiently detailed.
  
  We have been through all methods and results to include additional details as requested by this reviewer below.
  
  It seems to me that performance on the pairwise discrimination task cannot be directly (statistically) compared to performance on reversal (as in Figure 4E), as these are tapping into fundamentally different cognitive processes (discrimination versus reversal learning). I think comparing groups on each assessment is valid, however.
  
  We agree that discrimination and reversal are different cognitive processes, and statistical comparisons between these two components of the task were only made when examining the speed of learning in the validation of the novel testing system. Moreover, our inclusion of the pink and purple bars on graphs such as Figure 4C & 4E represent “main effects of ABA exposure”, regardless of learning phase (PD or reversal) rather than, as you describe, comparing PD to R1. Perhaps this comparison wasn’t clear, so we have amended the text to say ‘main effect of ABA exposure p=.0017’ rather than just “exposure”.
  
  Not necessarily a 'weakness' but I would have loved to see some assessment of the alterations in neural mechanisms underlying these effects, and/or some different behavioural assessments in addition to those used here. In particular, the authors mention in the discussion that this manipulation can affect cholinergic functioning in the dorsal striatum. We (Bradfield et al., Neuron, 2013) and a number of others have now demonstrated that cholinergic dysfunction in the dorsomedial striatum impairs a different kind of reversal learning that is based on alterations in outcome identity and thus relies on a different cognitive process (i.e. 'state' rather than 'reward' prediction error). It would be interesting perhaps in the future to see if the ABA manipulation also alters performance on this alternative 'cognitive flexibility' task.
  
  This is an excellent suggestion and we have already begun exploring this in other ongoing work in the laboratory. Due to ‘compulsive’ wheel running being a hallmark of ABA, we are interested in determining if this also translates to a goal-directed action impairment using the well-established outcome-specific devaluation task. Perhaps with ABA it may be more relevant to investigate outcome-reversals rather than stimulus-reversals, and if this is the case, it would further support the use of the ABA model for investigating cognitive dysfunction relevant to AN. We have included an additional section in the discussion text relating to our hypotheses regarding outcome-specific reversal learning in the ABA model.
  
  Nevertheless, I certainly think the manuscript provides a solid appraisal of cognitive flexibility using more traditional tasks, and that the authors have achieved their aims. I think the work here will be of importance, certainly to other researchers using the ABA model, but perhaps also of translational importance in the future, as the causal relationship between ABA and cognitive inflexibility is near impossible to establish using human studies, but here evidence points strongly towards this being the case.
  
  Reviewer #2 (Public Review):
  
  Huang and colleagues present data from experiments assessing the role of cognitive inflexibility in the vulnerability to weight loss in the activity-based anorexia paradigm in rats. The experiments employ a novel in-home cage touchscreen system. The home cage touch screen system allows reduced testing time and increased throughput compared with the more widely used systems resulting in the ability to assess ABA following testing cognitive flexibility in relatively young female rats. The data demonstrate that, contrary to expectations, cognitive inflexibility does not predispose to greater ABA weight loss, but instead, rats that performed better in the reversal learning task lost more weight in the ABA paradigm. Prior ABA exposure resulted in poorer learning of the task and reversal. An additional experiment demonstrated that rats that had been trained in reversal learning resisted weight loss in the ABA paradigm. The findings are important and are clearly presented. They have implications for anorexia nervosa both in terms of potentially identifying those at risk and in understanding the high rates of relapse.
  
  Thanks for a great summary of the manuscript.
  
  Reviewer #3 (Public Review):
  
  Activity-based anorexia (ABA), which combines access to a running wheel and restricted access to food, is a most common paradigm used to study anorexic behavior in rodents. And yet, the field has been plagued by persistent questions about its validity as a model of anorexia nervosa (AN) in humans. This group's previous studies supported the idea that the ABA paradigm captures cognitive inflexibility seen in AN. Here they describe a fully automated touchscreen cognitive testing system for rats that makes it possible to ask whether cognitive inflexibility predisposes individuals to severe weight loss in the ABA paradigm. They observed that cognitive inflexibility was predictive of resistance to weight loss in the ABA, the opposite of what was predicted. They also reported reciprocal effects of ABA and cognitive testing on subsequent performance in the other paradigm. Prior exposure to the ABA decreased subsequent cognitive performance, while prior exposure to the cognitive task promoted resistance to the ABA. Based on these findings, the authors argue that the ABA model can be used to identify novel therapeutic targets for AN.
  
  The strength of this manuscript is primarily as a methods paper describing a novel automated cognitive behavioral testing system that obviates the need for experimentalist handling and single housing, which can interfere with behavioral testing, and accelerate learning on the task. Together, these features make it feasible to perform longitudinal studies to ask whether cognitive performance is predictive of behavior in a second paradigm during adolescence, a peak period of vulnerability for many psychiatric disorders. The authors also used machine learning tools to identify specific behaviors during the cognitive task that predicted later susceptibility to the ABA paradigm. While the benefits of this system are clear, the rigor and reproducibility of experiments using this paradigm would be enhanced if the authors provided clear guidelines about which parameters and analyses are most useful. In their absence, the large amount of data generated can promote p-hacking.
  
  The authors use their automated behavioral testing paradigm to ask whether cognitive inflexibility is a cause or consequence of susceptibility to ABA, an issue that cannot be addressed in AN. They provide compelling evidence that there are reciprocal effects of the two behavioral paradigms, but do not perform the controls needed to evaluate the significance of these observations. For example, the learning task involves sucrose consumption and food restriction, conditions that can independently affect susceptibility to the ABA. Similarly, the ABA paradigm involves exercise and restricted access to food, which can both affect learning.
  
  In the Discussion, the authors hypothesize that the ABA paradigm produces cognitive inflexibility and argue that uncovering the underlying mechanism can be used to identify new therapeutic targets for AN. The rationale for their claim of translational relevance is undermined by the fact that the biggest effect of the ABA paradigm is seen in the pair discrimination task, and not reversal learning. This pattern does not fit clinical observations in AN.
  
  In summary, the significance of this manuscript lies in the development of a new system to test cognitive function in rats that can be combined with other paradigms to explore questions of causality. While the authors clearly demonstrate that cognitive flexibility does not promote susceptibility to ABA, the experiments presented do not provide a compelling case that their model captures important features of the pathophysiology of AN.
  
  We thank the reviewer for this detailed review and note that we have now both explicitly defined the most useful parameters for analyses from the novel touchscreen system as well as removed some comparisons that could be considered superfluous. We argue that the additional information provided by the machine learning analyses is, at this stage, exploratory, and rather than revealing independent descriptions of behavioural change in ABA exposed versus naïve rats, this information will aid in the generation of hypotheses to be tested in future studies. Therefore, the figures pertaining to these analyses have now been provided as supplements to Figures 3 & 4 (Figure 3-figure supplement 3; Figure 4-figure supplements 3&4). We have also clarified our intention to explore possible behavioural differences using this technique in the methods and discussion.
  
  We have also completed the essential control experiment, defined in the “essential revisions” section of this review, whereby we show only moderate impairments in reversal learning following a matched period of food restriction without rapid weight loss, suggesting that the substantial impairment seen following ABA exposure was not due to food restriction alone (see updated Figure 4 and supplements).
  
  However, we do not agree with this reviewer “that the biggest effect of the ABA paradigm is seen in the pair discrimination task” and point to the outcomes of both reciprocal experiments.
  
  In the first experiment, rats that went on to be susceptible or resistant to ABA did not differ on pairwise discrimination learning but specifically on performance at the reversal of reward contingencies (Figure 3B & E). Although this result was not in the hypothesised direction, this suggests that reversal learning specifically and not pairwise discrimination can differentiate those rats that go on to be susceptible to weight loss. We have included additional discussion in the text related to this finding (see lines 490-497).
  
  In the second experiment, it is clear from the number of ABA exposed rats that were unable to learn the reversal component, even after being able to learn pairwise discrimination, that flexible learning is more impaired by ABA. While it is true that ABA exposed rats that were successful in learning the reversal task were slower to learn the pairwise discrimination component than naïve rats (Figure 4E), this was not related to their ability to learn the reversal task overall: their learning rates in pairwise discrimination were equivalent to those of ABA exposed rats that failed to learn the reversal component (Figure 4G-I). The absence of significant differences between ABA exposed and naïve animals in Figure 4F relates to the fact that a large proportion of ABA exposed animals never reached performance criterion in the reversal phase of the task and therefore data from these animals could not be included in the figure. This is where the number of trials completed within each session becomes important for interpretation (i.e. Figure 4-figure supplement 1M-O), whereby ABA exposure caused impaired responding specifically within the reversal phase of the task. The results text has been updated to better reflect this critical point.
  
  Overall, this suggests that the impairment in cognitive flexibility caused by ABA exposure was related both to an associative learning impairment (slower to learn PD than naïve animals) and an impairment in the integration of new and existing learning (failure to learn R1 in a large proportion of animals).
  
URL

biorxiv.org/content/10.1101/2022.11.15.516539v1
www.biorxiv.org www.biorxiv.org

New submission 16/03/2023, 16:16:58

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Weaknesses
  
  1) I was curious as to how novel this setup is. Although I do not do head-fixed research myself, I thought there were already some open-source, relatively cheap systems available. I'm not sure how the current setup differs from those already available. Personally, even if this system involves only the wheel turning, as this is a truly operant response, that is novel enough for my liking.
  
  The novelty of the system stems from the synergistic combination of functionality, the low-cost open source nature of the design, and the breadth of behavioral procedures the system is able to support. The use of a wheel as an operant response was adapted from the International Brain Laboratory rig which has been used extensively for visual discrimination tasks. We adapted this wheel design to make the response closer to lever pressing through the use of the wheel brake, which ensures that subjects have to rotate the wheel in discrete rotational bouts rather than continuously spinning the wheel and potentially disengaging and allowing the wheel to rotate independently. There are no examples of systems capable of delivering 5+ solutions within a behavioral session or conducting valence testing with a modification of real-time place preference without the cost and complexity associated with virtual reality. We believe that this combination of factors, together with the flexibility and scalability of the system, makes OHRBETS a novel and useful system for diverse motivation and consumption behaviors in head-fixed mice.
  
  2) It would be useful to have a bit more detail in the manuscript (not just on the GitHub link - in supplemental material perhaps?) on how to build such a system, just to get a sense of how difficult building such a system might be and how many components it has.
  
  With this submission we have included detailed assembly instructions as a supplement to the main manuscript and added reference to the file within the methods section. We have also added details, including time estimates, to the methods section.
  
  3) I wasn't sure how to feel about the comparisons across experimental set-ups in Figures 2 and 3. Usually, these sorts of comparisons are not considered statistically valid due to the many variables that differ between set-ups. However, I do see that the intent here is a bit different - i.e. is to show that despite all these alterations in variables the behavioural outputs are still highly correlated. However, without commenting on this intent, I did find these comparisons a little jarring to read.
  
  Thank you for highlighting this. We have added in a justification for why we measured the consistency in behavior measured with each head-fixed system.
  
  4) The only dataset I was not wholly convinced by was that in Figure 3 (real-time place preference and aversion). I think the authors have done the best job that they can of replicating such a procedure in a head-fixed mouse, but the head-fixed version is going to necessarily differ from the freely moving version in a fundamental way when the contextual cues and spatial navigation form part of the RTPT task. Giving a discrete cue, such as a tone, just is not a sufficient substitute for contextual cues, and I think the two types of task would engage fundamentally different brain cells and circuits (e.g. only the free-moving version is likely to engage place cells in the hippocampus).
  
  To avoid confusion regarding the place component of the real-time place preference assay name, we have renamed the head-fixed assay for assessing valence to Wheel-Time Preference (WTP). We have also added a full paragraph to the discussion where we outline the differences in the task requirements and relevant neuronal circuits between the freely-moving RTPP and head-fixed WTP. We understand that the head-fixed task is not a perfect analog of the RTPP task; however, based on the similarity in the resulting time spent in the stimulation chamber/zone, we believe that the WTP is able to replicate the valence assessment that many in the field use RTPP to measure. We believe that the WTP with OHRBETS opens up new possibilities for assessing preference in head-fixed mice and this justifies keeping the figure within the main manuscript.
  
  To thoroughly address the potential confound of spatial information during the multi-spout experiment, we have added an additional supplemental figure (Figure 4- figure supplement 5) that depicts the proportion of trials with licking and added a paragraph to the discussion centered on the potential confound associated with learning the solution identity.
  
  5) Personally, I found having the statistics in a separate file confusing.
  
  Thank you for raising this concern. With our initial submission, we were concerned that including all of the statistics within the main text would make the paper difficult to read due to the extensive amount of statistics. With this submission, in addition to the statistics table, we have included statistics within the figure legends and main text where applicable.
  
  6) Line 589-594. Suggesting the medial/lateral shell recording results mean that the medial shell "tracks value, and the range of values during the multi-spout consumption of gradients of NaCl is greater than the range of values during multi-spout consumption of gradients of sucrose" seems to engage in circular logic to me. That is, the authors should use behavioural data to infer what the animal is experiencing and whether it is a change in value, and/or a greater change in value during NaCl vs. sucrose consumption, and only then should they make an inference about what the larger medial shell response means.
  
  Thank you for identifying this potential site of confusion. To address this concern we have modified the language to better communicate our interpretation of the data.
  
  “If we assume that the range of values is greater during multi-spout consumption of gradients of NaCl compared to gradients of sucrose, as indicated by a greater range in licking behavior (Figure 8- Figure Supplement 4), then the greater range of dopamine release in the NacShM could imply that dopamine release in this structure tracks value.”
  
URL

biorxiv.org/content/10.1101/2023.01.13.523828v1
www.biorxiv.org www.biorxiv.org

New submission 15/12/2022, 17:03:27

1
1. Public_Reviews 22 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  Wang, Y. et al. investigated the role of TPL2 signaling in acute and chronic neuroinflammatory conditions using small molecule inhibitors and a TPL2 kinase-dead mutant mouse line. They find that TPL2 is upregulated by various brain-resident cells, including microglia, astrocytes, and endothelial cells, during neurodegenerative disease progression and following peripheral LPS injection. They show that upon pharmacological and genetic inhibition during acute LPS stimulation, pro-inflammatory cytokine concentration, microgliosis, and neuronal loss can be reversed. In chronic neuroinflammation, as seen in a tauopathy mouse model, the loss of TPL2 rescues reactive gliosis, immune cell infiltration, neurodegeneration, and cognitive health. Interestingly, TPL2 loss of function was not significantly beneficial in models of nerve injury and stroke. By analyzing their multiple sequencing datasets and those of other research teams, the authors find that TPL2 aids to upregulate transcripts for the DAM signature, immediate early genes, and astrocyte reactivity. These data build together to further emphasize the intricacy and importance of the immune component in neurodegeneration and other neuroinflammatory conditions.
  
  The conclusions of this paper are mostly well supported by their data, but further confirmation of sequencing results and microglia intrinsic mechanisms need to be expanded.
  
  1) In the discussion section, it will be important to highlight that TPL2 could also be directly contributing to tauopathy disease progression through its actions in brain-resident endothelial cells. They spend a lot of time characterizing the effects of TPL2 on in vitro microglial responses and do not adequately discuss the potential that their disease phenotypes in the tauopathy model have more to do with TPL2's ability to regulate BBB permeability or facets of endothelial biology. It will be important to highlight that there are various discrete cellular mechanisms (e.g. functions for TPL2 in microglia, endothelial cells, astrocytes, peripheral immune cells, etc.) that could be underlying the disease readouts seen in their global TPL2 kinase-dead mice. They should discuss this in the context of previous literature demonstrating roles for TPL2 in other non-microglial cell types (e.g. Nanou et al PMID: 34038728).
  
  Thank you for this comment. We agree that while TPL2 is most highly expressed in microglia in the brain, TPL2 expression in endothelial cells and other cell types could potentially contribute to the disease. We have added discussion of this to the manuscript including discussion of the Nanou et al paper which raises the possibility that the TPL2-dependent infiltration of peripheral immune cells in TauP301S mice could be due to regulation of the BBB by TPL2 activity in endothelial cells. We also discuss potential roles for TPL2 in the various other cell types. In addition, we have now added characterization of cell-autonomous TPL2-dependent phenotypes in cultured astrocytes and have provided additional analysis of TPL2-dependent changes in endothelial cells in the scRNAseq experiment in TauP301S mice.
  
  2) Hippocampal single-cell RNA sequencing led the authors to report that TPL2KD in the PS19 model of tauopathy reduced the number of T-cell and dendritic cell (DC) infiltrates into the brain. The authors should corroborate this finding with immunohistochemistry or flow cytometry to confirm the presence of changing CD4+, CD8+, and DC populations. Most notably, it is critical for them to enumerate the cell numbers in an effort to validate that there are indeed empirical, and not just proportional, reductions in these cell populations.
  
  Thank you for the suggestion. We have performed immunohistochemistry to examine T cells in fixed brain tissue sections. We have included the data for T cell staining in Figure 5-figure supplement 2. We focused the IHC analysis on staining for CD8+ T cells based on the substantially greater abundance of CD8+ T cells compared to CD4+ T cells or DC in the single cell data (Figure 5C, Figure 5-figure supplement 5) and the availability of an antibody that worked well in our hands. These results corroborate the single cell data by empirically showing significantly increased numbers of T cells in TauP301S mice and significantly reduced numbers in the TauP301S x TPL2KD mice (Figure 5-figure supplement 2).
  
  3) The authors concluded from Figure 3 that TPL2 plays a key role in in vivo microglia and astrocyte activation. Adding in an in vitro study, like those done in Figures 1, 2, and S4, that looks at a cell-autonomous role for TPL2 in astrocyte reactivity would strengthen this claim and rule out a microglial-independent pathway of TPL2 inflammation.
  
  Thank you for the suggestion. To investigate the potential cell-autonomous role of TPL2 in astrocytes, we cultured primary mouse astrocytes and stimulated them with either LPS or cytokines, in the absence or presence of TPL2 inhibitor, and measured stimulation-induced changes in cytokine release and gene expression. Data are included in Figure 3-figure supplement 1 and the results are discussed in the manuscript. In contrast to the broader TPL2-dependence of cytokine release by cultured microglia, only a much more restricted set of cytokines exhibited TPL2-dependence in cultured astrocytes. Furthermore, RT-qPCR analysis of TPL2-dependent activated astrocyte genes identified in the LPS in vivo study found much less TPL2-dependent activation in cultured astrocytes. We discuss that these results suggest that the TPL2-dependent astrocyte activation observed in vivo was probably largely contributed to indirectly by the function of TPL2 in microglia, but there was also potentially some contribution of cell-autonomous function of TPL2 in astrocytes.
  
  4) Although the TPL2KD mouse line is a valuable tool to impair TPL2's function while retaining its expression, the researchers failed to comment on the potential effects a global mutation in TPL2 could have in their model systems. Peripheral immunological challenges, like their IP injections of LPS, could behave differently and affect the nervous system in a microglia-independent pathway if monocyte/macrophage signaling is also impaired.
  
  We agree that during peripheral immunological challenges TPL2 could affect the nervous system in a microglia-independent manner. We have added this point to the discussion.
  
  5) Oligodendrocytes and OPCs have comparable numbers of DEGs to astrocytes (Figure S11a). What is changing within their transcriptional profile?
  
  In this manuscript we focused on TPL2-dependent DEGs in the tauopathy model, which were all in microglia. We agree the TPL2-independent changes in the TauP301S mice in other cell types are also interesting. This data set has been uploaded to a public data repository (GSE180041) and analysis of the changes in oligodendrocytes has been performed from this data set, as well as other disease models, in a recent publication: “Disease-associated oligodendrocyte responses across neurodegenerative diseases” (PMID: 36001972).
  
URL

biorxiv.org/content/10.1101/2022.10.13.512193v1
www.medrxiv.org www.medrxiv.org

New submission 22/06/2023, 10:02:40

1
1. Public_Reviews 22 Jun 2023
 
 in eLife
 
 Author Response
 
 Reviewer #1 (Public Review):
 
 Strengths
 
 This paper is well situated theoretically within the habit learning/OCD literature. Daily training in a motor-learning task, delivered via smartphone, was innovative, ecologically valid and more likely to assay habitual behaviors specifically. Daily training is also more similar to studies with non-humans, making a better link with that literature. The use of a sequential-learning task (cf. tasks that require a single response) is also more ecologically valid. The in-laboratory tests (after the 1 month of training) allowed the researchers to test if the OCD group preferred familiar, but more difficult, sequences over newer, simpler sequences.
 
 The authors achieved their aims in that two groups of participants (patients with OCD and controls) engaged with the task over the course of 30 days. The repeated nature of the task meant that 'overtraining' was almost certainly established, and automaticity was demonstrated. This allowed the authors to test their hypotheses about habit learning. The results are supportive of the authors' conclusions.
 
 We truly appreciate the positive assessment of referee 1, particularly the consideration that our study is theoretically strong and that ‘the results are supportive of the authors' conclusions’. This is an important external endorsement of our conclusions, contrasting somewhat with the views of referee 2.
 
 Weaknesses
 
 The sample size was relatively small. Some potentially interesting individual differences within the OCD group could have been examined more thoroughly with a bigger sample (e.g., preference for familiar sequences). A larger sample may have allowed the statistical testing of any effects due to medication status.
 
 The authors were not able to test one criterion of habits, namely resistance to devaluation, due to the nature of the task.
 
 We agree with the reviewer that the proof of principle established in our study opens new avenues for research into the psychological and behavioral determinants of the heterogeneity of this clinical population. However, considering the study timeline and the pandemic constraints, a bigger sample was not possible. Our sample can indeed be considered small if one compares it with current online studies, which do not require in-person/laboratory testing, thus being much easier to recruit and conduct. However, given the nature of our protocol (with 2 demanding test phases, 1-month engagement per participant and the inclusion of OCD patients without comorbidities only) and the fact that this study also involved laboratory testing, we consider our sample size reasonable and comparable to other laboratory studies (typically comprising on average between 30-50 participants in each group).
 
 This article is likely to be impactful -- the delivery of a task across 30 days to a patient group is innovative and represents a new approach for the study of habit learning that is superior to an in-laboratory approach.
 
 An interesting aspect of this manuscript is that it prompts a comparison with previous studies of goal-directed/habitual responding in OCD that used devaluation protocols, and which may have had their effects due to deficits in goal-directed behavior and not enhanced habit learning per se.
 
 Thank you for acknowledging the impact of our study, in particular the unique ability of our task to interrogate the habit system.
 
 Reviewer #2 (Public Review):
 
 In this study, the researchers employed a recently developed smartphone application to provide 30 days of training on action sequences to both OCD patients and healthy volunteers. The study tested learning and automaticity-related measures and investigated the effects of several factors on these measures. Upon training completion, the researchers conducted two preference tests comparing learned and unlearned action sequences under different conditions. While the study provides some interesting findings, I have a few substantial concerns:
 
 1) Throughout the entire paper, the authors' interpretations and claims revolve around the domain of habits and goal-directed behavior, despite the methods and evidence clearly focusing on motor sequence learning/procedural learning/skill learning. There is no evidence to support this framing and interpretation and thus I find them overreaching and hyperbolic, and I think they should be avoided. Although skills and habits share many characteristics, they are meaningfully distinguishable and should not be conflated or mixed up. Furthermore, if anything, the evidence in this study suggests that participants attained procedural learning, but these actions did not become habitual, as they remained deliberate actions that were not chosen to be performed when they were not in line with participants' current goals.
 
 We acknowledge that research on habit learning is a topic of current controversy, especially when it comes to how to induce and measure habits in humans. Therefore, within this context, referee 2’s criticism could be expected. Across distinct fields of research, different methodologies have been used to measure habits, which represent relatively stereotyped and autonomous behavioral sequences enacted in response to a specific stimulus without consideration, at the time of initiation of the sequence, of the value of the outcome or any representation of the relationship that exists between the response and the outcome. Hence these are stimulus-bound responses which may or may not require the implementation of a skill during subsequent performance. Behavioral neuroscientists define habits similarly, as stimulus-response associations which are independent of reward or outcome, and use devaluation or contingency degradation strategies to probe habits (Dickinson and Weiskrantz, 1985; Tricomi et al., 2009). Others conceptualize habits as a form of procedural memory, along with skills, and use motor sequence learning paradigms to investigate and dissect different components of habit learning such as action selection, execution and consolidation (Abrahamse et al., 2013; Doyon et al., 2003; Squire et al., 1993). It is also generally agreed that the autonomous nature of habits and the fluid proficiency of skills are both usually achieved with many hours of training or practice, respectively (Haith and Krakauer, 2018).
 
 We consider that Balleine and Dezfouli (2019) made an excellent attempt to bring all these different criteria within a single framework, which we have followed. We also consider that our discussion in fact followed a rather cautious approach to interpretation solely in terms of goal-directed versus habitual control.
 
 Referee 2 does not actually specify criteria by which they define habits and skills, except for asserting that skilled behavior is goal-directed, without mentioning what the actual goal of the implementation of such skill is in the present study: the fulfillment of a habit? We assume that their definition of habit hinges on the effects of devaluation as a single criterion of habit, which according to Balleine and Dezfouli (2019) is only 1 of their 4 listed criteria. We carefully addressed this specific criterion in our manuscript: “We were not, however, able to test the fourth criterion, of resistance to devaluation. Therefore, we are unable to firmly conclude that the action sequences are habits rather than, for example, goal-directed skills. Regardless of whether the trained action sequences can be defined as habits or goal-directed motor skills, it has to be considered…”. Therefore, we took due care in our conclusions concerning habits and thus found the referee’s comment misleading and unfair.
 
 We note that our trained motor sequences did in fact fulfil the other 3 criteria listed by Balleine and Dezfouli (2019), unlike many studies employing only devaluation (e.g. Tricomi et al 2009; Gillan et al 2011). Moreover, we cited a recent study using very similar methodology where the devaluation test was applied and shown to support the habit hypothesis (Gera et al., 2022).
 
 Whether the initiation of the trained motor sequences in experiment 3 (arbitration) is underpinned by an action-outcome association (or not) has no bearing on whether those sequences were under stimulus-response control after training (experiment 1). Transitions between habitual and goal-directed control over behavior are quite well established in the experimental literature, especially when choice opportunities become available (Bouton, 2021; Frölich et al., 2023), or when a new goal-directed schema is recruited to fulfill a habit (Fouyssac et al., 2022). This switching between habits and goal-directed responding may reflect the coordination of these systems in producing effective behavior in the real world.
 
 Fouyssac M, Peña-Oliver Y, Puaud M, Lim NTY, Giuliano C, Everitt BJ, Belin D. (2021). Negative Urgency Exacerbates Relapse to Cocaine Seeking After Abstinence. Biological Psychiatry. doi: 10.1016/j.biopsych.2021.10.009
 
 Frölich S, Esmeyer M, Endrass T, Smolka MN and Kiebel SJ (2023) Interaction between habits as action sequences and goal-directed behavior under time pressure. Front. Neurosci. 16:996957. doi: 10.3389/fnins.2022.996957
 
 Bouton ME. 2021. Context, attention, and the switch between habit and goal-direction in behavior. Learn Behav 49:349– 362. doi:10.3758/s13420-021-00488-z
 
 2) Some methodological aspects need more detail and clarification.
 
 3) There are concerns regarding some of the analyses, which require addressing.
 
 We thank referee 2 for their detailed review of the methods and analyses of our study and for the helpful feedback, which clearly helps improve our manuscript. We will clarify the methodological aspects in detail and conduct the suggested analysis. Please see below our answers to the specific points raised.
 
 Introduction:
 
 4) It is stated that "extensive training of sequential actions would more rapidly engage the 'habit system' as compared to single-action instrumental learning". In an attempt to describe the rationale for this statement the authors describe the concept of action chunking, its benefits and relevance to habits but there is no explanation for why sequential actions would engage the habit system more rapidly than a single-action. Clarifying this would be helpful.
 
 We agree that there is no evidence that action sequences become habitual more readily than single actions, although action sequences clearly allow ‘chunking’ and thus likely engage neural networks including the putamen which are implicated in habit learning as well as skill. In our revised manuscript we will instead state: “we have recently postulated that extensive training of sequential actions could be a means for rapidly engaging the ‘habit system’ (Robbins et al., 2019)”.
 
 5) In the Hypothesis section the authors state: “we expected that OCD patients... show enhanced habit attainment through a greater preference for performing familiar app sequences when given the choice to select any other, easier sequence”. I find it particularly difficult to interpret preference for familiar sequences as enhanced habit attainment.
 
 We agree that choice of the familiar response sequence should not be a necessary criterion for habitual control although choice for a familiar sequence is, in fact, not inconsistent with this hypothesis. In a recent study, Zmigrod et al (2022) found that 'aversion to novelty' was a relevant factor in the subjective measurement of habitual tendencies. It should also be noted that this preference was present in patients with OCD. If one assumes instead, like the referee, that the familiar sequence is goal-directed, then it contravenes the well-known 'egodystonia' of OCD which suggests that such tendencies are not goal-directed.
 
 To clarify our hypothesis, we will amend the sentence to the following: “Finally, we expected that OCD patients would generally report greater habits, as well as attribute higher intrinsic value to the familiar app sequences manifested by a greater preference for performing them when given the choice to select any other, easier sequence”.
 
 A few notes on the task description and other task components:
 
 6) It would be useful to give more details on the task. This includes more details on the time/condition of the gradual removal of visual and auditory stimuli and also on the within practice dynamic structure (i.e., different levels appear in the video).
 
 These details will be included in the revised manuscript. Thank you for pointing out the need for further clarification of the task design.
 
 7) Some more information on engagement-related exclusion criteria would be useful (what happened if participants did not use the app for more than one day, how many times were allowed to skip a day etc.).
 
 This additional information will be added to the revised manuscript. If participants omitted to train for more than 2 days, the researcher would send a reminder to the participant to request to catch up. If the participant would not react accordingly and a third day would be skipped, then the researcher would call to understand the reasons for the lack of engagement and gauge motivation. The participant would be excluded if more than 5 sequential days of training were missed. Only 2 participants were excluded given their lack of engagement.
 
 8) According to the (very useful) video demonstrating the task and the paper describing the task in detail (Banca et al., 2020), the task seems to include other relevant components that were not mentioned in this paper. I refer to the daily speed test, the daily random switch test, and daily ratings of each sequence's enjoyment and confidence of knowledge.
 
 If these components were not included in this procedure, then the deviations from the procedure described in the video and Banca et al. (2020) should be explicitly mentioned. If these components were included, at least some of them may be relevant, at least in part, to automaticity, habitual action control, formulation of participants' enjoyment from the app etc. I think these components should be mentioned and analyzed (or at least provide an explanation for why it has been decided not to analyze them).
 
 This is also true for the reward removal (extinction) from the 21st day onwards which is potentially of particular relevance for the research questions.
 
 The task procedure was indeed the same as detailed in Banca et al., 2020. We did not include these extra components in this current manuscript for reasons of succinctness and because the manuscript was already rather longer than a common research article, given that we present three different, though highly inter-dependent, experiments in order to answer key interrelated questions in an optimal manner. However, since referee 2 considers this additional analysis to be important, we will be happy to include it in the supplementary material of the revised manuscript.
 
 Training engagement analysis:
 
 9) I find referring to the number of trials including successful and unsuccessful trials as representing participants "commitment to training" (e.g. in Figure legend 2b) potentially inadequate. Given that participants need at least 20 successful trials to complete each practice, more errors would lead to more trials. Therefore, I think this measure may mostly represent weaker performance (of the OCD patients as shown in Figure 2b). Therefore, I find the number of performed practice runs, as used in Figure 2a (which should be perfectly aligned with the number of successful trials), a "clean" and proper measure of engagement/commitment to training.
 
 We acknowledge the referee’s concern on this matter and agree to replace the y-axis variable of Figure 2b with the number of performed practices (thus aligning with Figure 2a). This amendment will remove any potential effect of weaker performance on the engagement measurement and will provide clearer results.
 
 10) Also, to provide stronger support for the claim about different diurnal training patterns (as presented in Figure 2c and the text) between patients and healthy individuals, it would be beneficial to conduct a statistical test comparing the two distributions. If the results of this test are not significant, I suggest emphasizing that this is a descriptive finding.
 
 We will conduct the statistical test and report accordingly.
 
 Learning results:
 
 11) When describing the Learning results (p10) I think it would be useful to provide the descriptive stats for the MT0 parameter (as done above for the other two parameters).
 
 Thank you for pointing this out. The descriptive stats for MT0 will be added to the revised version of the manuscript.
 
 12) Sensitivity of sequence duration and IKI consistency (C) to reward:
 
 I think it is important to add details on how incorrect trials were handled when calculating ∆MT (or C) and ∆R, specifically in cases where the trial preceding a successful trial was unsuccessful. If incorrect trials were simply ignored, this may not adequately represent trial-by-trial changes, particularly when testing the effect of a trial's outcome on performance change in the next trial.
 
 This is an important question. Our analysis protocol was designed to ensure that incorrect trials do not contaminate or confound the results. To estimate the trial-to-trial difference in ∆MT (or C) and ∆R, we exclusively included pairs of contiguous trials where participants achieved correct performance and received feedback scores for both trials. For example, if a participant made a performance error on trial 23, we did not include ∆R or ∆MT estimates for the pairs of trials 23-22 and 24-23. Instead of excluding incorrect trials from our analyses, we retained them in our time series but assigned them a NaN (not a number) value in Matlab. As a result, ∆R and ∆MT were not defined for those two pairs of trials. The same applies to C. This approach ensured that our analyses are not confounded by incremental or decremental feedback scores between noncontiguous trials. In the past, when assessing the timing of correct actions during skilled sequence performance, we also considered only events that were preceded and followed by correct actions. This prevented effects such as post-error slowing from contaminating our results (Herrojo Ruiz et al., 2009, 2019). Therefore, we do not believe that any further reanalysis is required.
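 The NaN-masking step described above can be illustrated with a minimal sketch. This is not the authors' Matlab code: the function name, variable names and the toy movement times are hypothetical, and Python is used purely for illustration. The point it shows is that keeping an incorrect trial in the series as NaN automatically leaves both neighbouring trial pairs undefined.

```python
import math

def paired_differences(values, correct):
    """Trial-to-trial differences, defined only for contiguous correct pairs."""
    # Incorrect trials stay in the series but are masked as NaN (hypothetical helper)
    masked = [v if ok else math.nan for v, ok in zip(values, correct)]
    # Any difference that involves a NaN is itself NaN
    return [b - a for a, b in zip(masked, masked[1:])]

# Hypothetical movement times in seconds; the trial at index 2 is an error
mt = [2.0, 1.5, 3.0, 1.0, 0.5]
correct = [True, True, False, True, True]

delta_mt = paired_differences(mt, correct)
# Only differences between two contiguous correct trials survive
valid = [d for d in delta_mt if not math.isnan(d)]
```

 In this toy series, the two differences that touch the error trial are undefined, and no difference is ever computed across the gap between noncontiguous correct trials.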
 
 Ruiz MH, Jabusch HC, Altenmüller E. Detecting wrong notes in advance: neuronal correlates of error monitoring in pianists. Cerebral cortex. 2009 Nov 1;19(11):2625-39.
 
 Bury G, García-Huéscar M, Bhattacharya J, Ruiz MH. Cardiac afferent activity modulates early neural signature of error detection during skilled performance. NeuroImage. 2019 Oct 1;199:704-17.
 
 13) I have a serious concern with respect to how the sensitivity of sequence duration to reward is framed and analyzed. Since reward is proportional to performance, a reduction in reward essentially indicates a trial with poor performance, and thus even regression to the mean (along with a floor effect in performance [asymptote]) could explain the observed effects. It is possible that even occasional poor performance could lead to a participant demonstrating this effect, potentially regardless of the reward. Accordingly, the reduced improvement in performance following a reward decrease as a function of training length described in Figure 5b legend may reflect training-induced increased performance that leaves less room for improvement after poor trials, which are no longer as poor as before. To address this concern, controlling for performance (e.g., by taking into consideration the baseline MT for the previous trial) may be helpful. If the authors can conduct such an analysis and still show the observed effect, it would establish the validity of their findings.
 
 Thank you for raising this point. Figure 5b illustrates two distinct effects of reward changes on behavioral adaptation, which are expected based on previous research.
 
 I. Practice effects: Firstly, we observe that as participants progress across bins of practice, the degree of improvement in behavior (reflected by faster movement time, MT) following a decrease in reward (∆R−) diminishes, consistent with our expectations based on previous work. Conversely, we found that ∆MT does not change across bins of practices following an increase in reward (∆R+). We appreciate the reviewer's suggestion regarding controlling for the reference movement time (MT) in the previous trial when examining the practice effect in the p(∆T|∆R−) and p(∆T|∆R+) distributions. In the revised manuscript, we will conduct the proposed control analysis to better understand whether the sensitivity of MT to score decrements changes across practice when normalising MT to the reference level on each trial. But see below for a preliminary control analysis.
 
 II. Asymmetry of the effect of ∆R− and ∆R+ on performance: Figure 5b also depicts the distinct impact of score increments and decrements on behavioural changes. When aggregating data across practice bins, we consistently observed that the centre of the p(∆T|∆R−) distribution was smaller (more negative) than that of p(∆T|∆R+). This suggests that participants exhibited a greater acceleration following a drop in scores compared to a relative score increase, and this effect persisted throughout the practice sessions. Importantly, this enhanced sensitivity to losses or negative feedback (or relative drops in scores) aligns with previous research findings (Galea et al., 2015; Pekny et al., 2014; van Mastrigt et al., 2020).
 
 We have conducted a preliminary control analysis to exclude the potential impact that reference movement time (MT) values could have on our analysis. We have assessed the asymmetry between behavioural responses to ∆R− and ∆R+ using the following analysis: We estimated the proportion of trials in which participants exhibited speed-up (∆T < 0) or slow-down (∆T > 0) behaviour following ∆R− and ∆R+ across different practice bins (bins 1 to 4). By discretising the series of behavioural changes (∆T) into binary values (+1 for slowing down, -1 for speeding up), we can assess the type of changes (speed-up, slow-down) without the absolute ∆T or T values contributing to our results. We obtained several key findings:
 
 • Consistent with expectations (sanity check), participants exhibited more instances of speeding up than slowing down across all reward conditions.
 
 • Participants demonstrated a higher frequency of speeding up following ∆R− compared to ∆R+, and this asymmetry persisted throughout the practice sessions (greater proportion of -1 events than +1 events). 53% of events were speed-up events in the p(∆T|∆R+) distribution for the first bin of practices, and 55% for the last bin. Regarding p(∆T|∆R−), there were 63% speed-up events throughout each bin of practices, with this proportion exhibiting no change over time.
 
 • Accordingly, the asymmetry of reward changes on behavioural adaptations, as revealed by this analysis, remained consistent across the practice bins.
 
 Thus, these preliminary findings provide an initial response to referee 2 and offer valuable insights into the asymmetrical effects of positive/negative reward changes on behavioural adaptations. We plan to include these results in the revised manuscript, as well as the full control analysis suggested by the referee. We will further expand upon their interpretation and implications.
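 The sign-based control analysis outlined in this response could be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the authors' analysis code: the function name, the toy data, and in particular the alignment (a reward change arriving at trial t is paired with the change in movement time from trial t to t+1) are assumptions made here. Only the sign of ∆T is used, so absolute movement times do not enter the result.

```python
import math

def speedup_proportions(mt, reward):
    """Proportion of speed-up events after reward decrements and increments."""
    counts = {"dR-": [0, 0], "dR+": [0, 0]}  # [speed-ups, total events]
    for t in range(1, len(mt) - 1):
        d_r = reward[t] - reward[t - 1]  # reward change arriving at trial t
        d_t = mt[t + 1] - mt[t]          # behavioural change on the next trial
        # Skip undefined pairs (NaN-masked error trials) and zero changes
        if math.isnan(d_r) or math.isnan(d_t) or d_r == 0 or d_t == 0:
            continue
        key = "dR-" if d_r < 0 else "dR+"
        counts[key][0] += int(d_t < 0)   # sign only: speed-up counts as 1
        counts[key][1] += 1
    return {k: (s / n if n else math.nan) for k, (s, n) in counts.items()}

# Hypothetical scores and movement times (s) for six consecutive correct trials
reward = [50, 40, 60, 55, 70, 65]
mt = [2.0, 2.2, 1.9, 2.0, 1.8, 1.9]

props = speedup_proportions(mt, reward)
```

 Applied separately to each bin of practices, such proportions would allow the asymmetry between ∆R− and ∆R+ to be tracked across training without reference to the baseline movement time.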
 
 14) Another way to support the claim of reward change directionality effects on performance (rather than performance on performance), at least to some extent, would be to analyze the data from the last 10 days of the training, during which no rewards were given (pretending for analysis purposes that the reward was calculated and presented to participants). If the effect persists, it is less likely that the effect in question can be attributed to the reward dynamics.
 
 The reviewer’s concern is addressed in the previous question. Also, this analysis would not be possible because our Gaussian fit analyses use the time series of continuous reward scores, in which ∆R− or ∆R+ are embedded. These events cannot be analyzed once reward feedback is removed because we do not have behavioral events following ∆R− or ∆R+ anymore.
 
 15) This concern is also relevant and should be considered with respect to the sensitivity of IKI consistency (C) to reward. While the relationship between previous reward/performance and future performance in terms of C is of a different structure, the similar potential confounding effects could still be present.
 
 We will conduct this analysis for the revised manuscript, similarly to the control analysis suggested by referee 2 on MT. Our preliminary control analysis, as explained above, suggests that the fundamental asymmetry in the effect of ∆R− and ∆R+ on behavioral changes persists when excluding the impact of reference performance values in our Gaussian fit analysis.
 
 16) Another related question (which is also of general interest) is whether the preferred app sequence (as indicated by the participants for Phase B) was consistently the one that yielded more reward? Was the continuous sequence the preferred one? This might tell something about the effectiveness of the reward in the task.
 
 We have now conducted this analysis. There is in fact no evidence to conclude that the continuously rewarded sequence was the preferred one. The result shows that 54.5% of HV and 29% of the OCD sample considered the continuous sequence to be their preferred one. Of note, this preference may not necessarily be linked to the trial-by-trial reward sensitive analysis. The latter assesses how learning may be affected by reward. The overall preference may be influenced by many other factors, such as, for example, the aesthetic appeal of particular combinations of finger movements.
 
 Regarding both experiments 2 and 3:
 
 17) The change in context in experiments 2 and 3 is substantial and includes many different components. These changes should be mentioned in more detail in the Results section before describing the results of experiments 2 and 3.
 
 Following the referee’s advice, we will move these details (currently written in the Methods section) to the Results section, when we introduce Phase B and before describing the results of experiments 2 and 3.
 
 Experiment 2:
 
 18) In Experiment 2, the authors sometimes refer to the "explicit preference task" as testing for habitual and goal-seeking sequences. However, I do not think there is any justification for interpreting it as such. The other framings used by the authors - testing whether trained action sequences gain intrinsic/rewarding properties or value, and preference for familiar versus novel action sequences - are more suitable and justified. In support of the point I raised here, assigning intrinsic rewarding properties to the learned sequences and thereby preferring these sequences can be conceptually aligned with goal-directed behavior just as much as it could be with habit.
 
 We clearly defined the theoretical framing of experiment 2 as a test of whether trained action sequences gain intrinsic value and we are pleased to hear that the referee agrees with this framing. If the referee is referring to the paragraph below (in the Discussion), we actually do acknowledge within this paragraph that a preference for the trained sequences can either be conceptually aligned with a habit OR a goal-directed behavior.
 
 “On the other hand, we are describing here two potential sources of evidence in favor of enhanced habit formation in OCD. First, OCD patients show a bias towards the previously trained, apparently disadvantageous, action sequences. In terms of the discussion above, this could possibly be reinterpreted as a narrowing of goals in OCD (Robbins et al., 2019) underlying compulsive behavior, in favor of its intrinsic outcomes”
 
 This narrowing of goals model of OCD refers to a hypothetically transitional stage of compulsion development driven by behavior having an abnormally strong, goal-directed nature, typically linked to specific values and concerns.
 
 If the referee is referring to the penultimate sentence of the Hypothesis section, this has been amended in response to Q5. We cannot find any other possible instances in this manuscript stating that experiment 2 is a test of habitual or goal-directed behavior.
 
 Experiment 3:
 
 19) Similar to Experiment 2, I find the framing of arbitration between goal-directed/habitual behavior in Experiment 3 inadequate and unjustified. The results of the experiment suggest that participants were primarily goal-directed and there is no evidence to support the idea that this reevaluation led participants to switch from habitual to goal-directed behavior.
 
 Also, given the explicit choice of the sequence to perform participants had to make prior to performing it, it is reasonable to assume that this experiment mainly tested bias towards familiar sequence/stimulus and/or towards intrinsic reward associated with the sequence in value-based decision making.
 
 This comment is aligned with (and follows) the referee’s criticism of experiment 1 not achieving automatic and habitual actions. We have addressed this matter above, in response 1 to Referee 2.
 
 Mobile-app performance effect on symptomatology: exploratory analyses:
 
 20) Maybe it would be worth testing if the patients with improved symptomatology (that attribute some of their symptom improvement to the app) also chose to play more during the training stage.
 
 We have conducted an analysis to address this relevant question. There is no correlation between the YBOCS score change and the number of total practices, meaning that the patients whose symptomatology improved post-training did not necessarily choose to play the app more during the training stage (rs = 0.25, p = 0.15). Additionally, we have statistically compared the improvers (patients with reduced YBOCS scores post-training) and the non-improvers (patients with unchanged or increased YBOCS scores post-training) in their number of completed app practices during the training phase and no differences were observed (U = 169, p = 0.19).
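 For readers unfamiliar with the two rank-based statistics reported here, a minimal standard-library sketch of how a Spearman coefficient and a Mann-Whitney U statistic are computed is given below. All names and numbers are hypothetical toy values, not study data, and p-values are not computed; in practice one would more likely rely on an established statistics package (for example `scipy.stats.spearmanr` and `scipy.stats.mannwhitneyu`).

```python
def average_ranks(xs):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with xs[order[i]]
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation of the rank-transformed data."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)

def mann_whitney_u(a, b):
    """U statistic for group a: pairs with a > b, ties counting one half."""
    return sum((x > y) + 0.5 * (x == y) for x in a for y in b)

# Hypothetical toy values: YBOCS change (post minus pre) and total practices
ybocs_change = [-6, -3, 0, 2, -1]
practices = [900, 700, 500, 650, 800]

rho = spearman_rho(ybocs_change, practices)
# Improvers: reduced score; non-improvers: unchanged or increased score
improvers = [p for c, p in zip(ybocs_change, practices) if c < 0]
non_improvers = [p for c, p in zip(ybocs_change, practices) if c >= 0]
u_stat = mann_whitney_u(improvers, non_improvers)
```

 The split into improvers and non-improvers follows the definition given in the response; everything else in the sketch is illustrative.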
 
 Discussion:
 
 21) Based on my earlier comments highlighting the inadequacy and mis-framing of the work in terms of habit and goal-directed behavior, I suggest that the discussion section be substantially revised to reflect these concerns.
 
 We do not agree that the work is either "inadequate or mis-framed" and will not therefore be substantially revising the Discussion. We will however clarify further the interpretation we have made and make explicit the alternative viewpoint of the referee. For example, we will retitle experiment 3 as “Re-evaluation of the learned action sequence: possible test of goal/habit arbitration” to acknowledge the referee’s viewpoint as well as our own interpretation.
 
 22) In the sentence "Nevertheless, OCD patients disadvantageously preferred the previously trained/familiar action sequence under certain conditions" the term "disadvantageously" is not necessarily accurate. While there was potentially more effort required, considering the possible presence of intrinsic reward and chunking, this preference may not necessarily be disadvantageous. Therefore, a more cautious and accurate phrasing that better reflects the associated results would be useful.
 
 We recognize that the term "disadvantageously" may be semantically ambiguous for some readers and therefore we will remove it.
 
 Materials and Methods:
 
 23) The authors mention: "The novel sequence (in condition 3) was a 6-move sequence of similar complexity and difficulty as the app sequences, but only learned on the day, before starting this task (therefore, not overtrained)." - for the sake of completeness, more details on the pre-training done on that day would be useful.
 
 Details of the learning procedure of the novel sequence (in condition 3, experiment 3) will be provided in the methods of the revised version of the manuscript.
 
 Minor comments:
 
 24) In the section discussing the sensitivity of sequence duration to reward, the authors state that they only analyzed continuous reward trials because "a larger number of trials in each subsample were available to fit the Gaussian distributions, due to feedback being provided on all trials." However, feedback was also provided on all trials in the variable reward condition, even though the reward was not necessarily aligned with participants' performance. Therefore, it may be beneficial to rephrase this statement for clarity.
 
 We will follow this referee’s advice and will rephrase the sentence for clarity.
 
 25) With regard to experiment 2 (Preference for familiar versus novel action sequences) in the following statement "A positive correlation between COHS and the app sequence choice (Pearson r = 0.36, p = 0.005) further showed that those participants with greater habitual tendencies had a greater propensity to prefer the trained app sequence under this condition." I find the use of the word "further" here potentially misleading.
 
 The word "further" will be removed.
 
URL

medrxiv.org/content/10.1101/2023.02.23.23286338v3
www.biorxiv.org www.biorxiv.org

New submission 21/06/2023, 09:56:31

1
1. Public_Reviews 21 Jun 2023
  
  in eLife
  
  Author Response
  
  eLife assessment
  
  This study assesses homeostatic plasticity mechanisms driven by inhibitory GABAergic synapses in cultured cortical neurons. The authors report that up- or down-regulation of GABAergic synaptic strength, rather than excitatory glutamatergic synaptic strength, is critical for homeostatic regulation of neuronal firing rates. The reviewers noted that the findings are potentially important, but they also raised questions. In particular, the evidence supporting the findings is currently incomplete and demonstration of independent regulation of mEPSCs and mIPSCs is a necessary experiment to support the major claims of the study.
  
  We appreciate the detailed, thoughtful assessment of our paper by the reviewers and editors and will submit a revised version in the future that addresses the reviewers’ comments as detailed below in response to each concern. We will include a more open discussion of alternative possibilities. Further, we will repeat the optogenetic experiments assessing AMPAergic scaling in our mouse cortical cultures in order to demonstrate independent regulation of mEPSCs and mIPSCs as suggested.
  
  Reviewer #1 (Public Review):
  
  In the manuscript titled "GABAergic synaptic scaling is triggered by changes in spiking activity rather than transmitter receptor activation," the authors present an investigation of the role of GABAergic synaptic scaling in the maintenance of spike rates in networks of cultured neurons. Their main findings suggest that GABAergic scaling exhibits features consistent with a key homeostatic mechanism that contributes to the stability of neuronal firing rates. Their data demonstrate that GABAergic scaling is multiplicative and emerges when postsynaptic spike rates are altered. Finally, their data suggest that, in contrast to their prior data on glutamatergic scaling, GABAergic scaling is driven by spike rates. The authors set the paper up as an argument that GABAergic scaling, rather than glutamatergic scaling, serves as the critical homeostatic mechanism for spike rate regulation.
  
  While the paper is ambitious in its rhetorical scope and certainly presents intriguing findings, there are several serious concerns that need to be addressed to substantiate the interpretations of the data. For example, the CTZ data do not support the interpretations and conclusions drawn by the authors. Summarily, the authors argue that GABAergic scaling is measuring spiking (at the time scale of the homeostatic response, which they suggest is a key feature of a homeostat) yet their data in figure 5B show more convincingly that CTZ does not influence spiking levels - only one out of four time points is marginally significant (also, I suspect that the bootstrapping method mentioned in line 454-459 was conducted as a pairwise comparison of distributions. There is no mention of multiple comparisons corrections, and I have to assume that the significance at 3h would disappear with correction).
  
  We certainly understand the criticism here (similar to reviewer 2’s third point). In our resubmission we will do a better job discussing these complications, which we now summarize. First, we are presenting our entire dataset to be as transparent as possible. Unlike most synaptic scaling studies (including our own) that apply drugs to alter activity and assess mPSC amplitude at the final time point, here we are actually showing CTZ’s effect on spiking activity within the culture over time. This is critical because it has informed us of the drug’s true effect on spiking, the variability that is associated with these perturbations, and the ability and timing of the cultured network to homeostatically recover initial levels. This was important because it revealed that the drugs do not always influence activity in the way we assume, and this provides greater context to our results. Second, we are showing all of our data, and presenting it using estimation statistics, which go beyond the dichotomy of a simple p value yes or no (Ho J, Tumkaya T, Aryal S, Choi H, Claridge-Chang A. 2019. Moving beyond P values: data analysis with estimation graphics. Nat Methods 16: 565-66). Estimation statistics have become a more standard statistical approach in the last 15 years and are the preferred method for the Society for Neuroscience’s eNeuro Journal. This method shows the effect size and the confidence interval of the distribution. For the 3 hr time point in Fig. 5B the CTZ/ethanol vs. ethanol data points exhibit very little overlap, the effect size demonstrates a near doubling of spike frequency, and the confidence interval shows a clear separation from 0. This was a pairwise comparison as we compared values at each time point after the addition of ethanol or ethanol/CTZ. Third, the plots illustrate an upward trend in spike frequency at 1 and 6 hrs, but there is also clear variability.
It is important to note that while these recordings help us to understand effects on spiking across the cultured network, they cannot directly speak to spiking activity in the principal neurons that we target. This complication along with the variability inherent in these cultures could make simple comparisons difficult to interpret. Regardless, we do see some increase in spiking with CTZ and we clearly see increases in mIPSC amplitude, thus providing some support for the idea that spiking could be a critical player in terms of GABAergic scaling, particularly when put in the context of our other findings. However, it is important to recognize that something other than total spike rate may contribute to GABAergic scaling, such as the pattern of spiking that produces a particular calcium transient, and this will be discussed in the resubmission.
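For readers unfamiliar with estimation statistics, the following is a minimal sketch of the general idea: report the mean difference between two groups together with a bootstrap confidence interval. It is not the authors' analysis code. The function name `bootstrap_mean_diff`, the plain percentile interval, and the spike-frequency values are all assumptions made for illustration, and no multiple-comparisons correction is applied.

```python
import random
import statistics


def bootstrap_mean_diff(control, treated, n_boot=5000, alpha=0.05, seed=0):
    """Return (observed difference, CI low, CI high) for mean(treated) - mean(control).

    Uses a simple percentile bootstrap: each group is resampled with
    replacement and the difference of means is recomputed n_boot times.
    """
    rng = random.Random(seed)
    observed = statistics.fmean(treated) - statistics.fmean(control)
    diffs = []
    for _ in range(n_boot):
        c = rng.choices(control, k=len(control))
        t = rng.choices(treated, k=len(treated))
        diffs.append(statistics.fmean(t) - statistics.fmean(c))
    diffs.sort()
    lo = diffs[int((alpha / 2) * n_boot)]
    hi = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return observed, lo, hi


# Hypothetical spike frequencies (Hz), one value per culture; NOT data from the study.
ethanol = [0.8, 1.1, 0.9, 1.3, 1.0, 0.7]
ctz = [1.9, 2.3, 1.6, 2.8, 2.0, 1.7]

diff, lo, hi = bootstrap_mean_diff(ethanol, ctz)
print(f"mean difference = {diff:.2f} Hz, 95% CI [{lo:.2f}, {hi:.2f}]")
```

An interval that excludes 0 corresponds to the "clear separation from 0" described above. When several time points are compared, the per-comparison interval would need to be widened (for example by lowering `alpha`) to address the multiple-comparisons concern raised by the reviewer.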
  
  Then, the fact that TTX applied on top of CTZ drives an increase in mIPSC amplitude is interpreted as a conclusive demonstration that GABAergic scaling is sensing spiking. It is inevitable, however, that TTX will also severely reduce AMPAR activation - a very plausible alternative explanation is that the augmentation of AMPAR activation caused by CTZ is not sufficient to overcome the dramatic impact of TTX. All together, these data do not provide substantial evidence for the conclusion drawn by the authors.
  
  We understand this point when considering the CTZ/TTX experiments by themselves. However, spiking appears to be a more straightforward trigger when the CTZ/TTX results are coupled with the prevention of GABAergic downscaling by optogenetic restoration of spiking in the presence of AMPAR antagonists. Further, an important point here is that our results with TTX vs. TTX + CTZ are different for GABAergic scaling (no difference) and AMPAergic scaling (CTZ diminished upward scaling) suggesting different triggers for the two forms of scaling. We will make this more clear in our resubmission.
  
  Specific points:
  
  The logic of the basis for the argument is somewhat flawed: A homeostat does not require a multiplicative mechanism, nor does it even need to be synaptic. Membrane excitability is a locus of homeostatic regulation of firing, for example. In addition, synapse-specific modulation can also be homeostatic. The only requirement of the homeostat is that its deployment subserves the stabilization of a biological parameter (e.g., firing rate).
  
  We agree with the reviewer and should not have suggested that this was a necessary requirement for a spike rate homeostat. What we should have said was that historically this definition has been attributed to AMPAergic scaling, which is thought to be a spike rate homeostat. We will correct this in the resubmission.
  
  Line 63 parenthetically references an important, but contradictory study as a brief "however". Given the tone of the writing, it would be more balanced to give this study at least a full sentence of exposition.
  
  Agreed, we will do this.
  
  The authors state (line 11) that expression of a hyperpolarizing conductance did not trigger scaling. More recent work ('Homeostatic synaptic scaling establishes the specificity of an associative memory') does this via expression of DREADDs and finds robust scaling.
  
  The purpose of citing this study was to argue that the spike rate homeostat hypothesis doesn’t make sense for AMPAergic scaling based on a study that hyperpolarized an individual cell while leaving the rest of the network unaltered and therefore leaving network activity and neurotransmission largely normal. In this case scaling was not triggered, suggesting reduced spike rate within an individual cell was insufficient to trigger scaling. The study that the reviewer refers to hyperpolarizes a majority of cells in the network and therefore will also alter neurotransmission throughout the network, which does not separate the importance of spiking and receptor activation as in the above-mentioned study. We will make this point more clearly in the resubmission.
  
  Supplemental figure 1 looks largely linear to me? Out of curiosity, wouldn't you expect the left end to be aberrant because scaling up should theoretically increase the strength of some synapses that would have been previously below threshold for detection?
  
  We agree that the scaling ratio plot is largely linear. To be clear, the linearity of the ratio plot was interesting but our main point here was that this line had a positive slope, meaning ratios (CNQX mPSC amplitudes/control mPSC amplitudes) got bigger for the larger CNQX-treated mPSCs. Alternatively, a multiplicative relationship where mPSCs are all increased by a single factor (e.g. 2X) would be a flat line with 0 slope at the multiplicative value (e.g. 2). In terms of the left side of the plot, we do see values that rise abruptly from 1 - this is partially obstructed by the Y axis in this figure and we will adjust this. This left part of the plot is likely due to the CNQX-induced increases in mPSC amplitudes of minis that were below our detection threshold of 5 pA. Therefore, minis that were 4 pA could now be 5 pA after CNQX treatment and these are then divided by the smallest control mPSCs, which are 5 pA (ratio of 1). We will try to do a better job describing this in the resubmission.
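The logic of the ratio plot described above can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the function name `ratio_plot`, the requirement of equal event counts, and all amplitude values are assumptions. Both samples are rank-ordered, and each treated amplitude is divided by the control amplitude of the same rank.

```python
def ratio_plot(control, treated):
    """Rank-order both samples and return (treated amplitude, treated/control ratio) pairs.

    Assumes both samples contain the same number of events; real data sets
    would first need to be subsampled or interpolated to equal length.
    """
    if len(control) != len(treated):
        raise ValueError("samples must contain the same number of events")
    c = sorted(control)
    t = sorted(treated)
    return [(t[i], t[i] / c[i]) for i in range(len(c))]


# Hypothetical mPSC amplitudes in pA; NOT data from the study.
control = [5.0, 8.0, 12.0, 20.0, 35.0]

# Purely multiplicative scaling: every event is doubled.
uniform = [2.0 * a for a in control]

# Non-uniform scaling: larger events are increased by a larger factor.
graded = [a * (1.0 + a / 35.0) for a in control]

flat = [ratio for _, ratio in ratio_plot(control, uniform)]
rising = [ratio for _, ratio in ratio_plot(control, graded)]

print("multiplicative:", flat)
print("non-uniform:   ", [round(r, 2) for r in rising])
```

Under purely multiplicative scaling the ratios sit on a flat line at the scaling factor, whereas a positive slope indicates that larger events were scaled by a larger factor. The sketch ignores the detection threshold, which is what distorts the left end of a real ratio plot.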
  
  Given that figure 2B also shows warping at the tail ends of similar distributions, how is this to be interpreted?
  
  The left side of the ratio plot shows evidence consistent with the idea that mIPSCs are dropping into the noise after CNQX treatment (similar to above argument), while most of the distribution suggests mIPSCs are reduced to 50% by CNQX treatment. On the right side of the ratio plot the values appear to mostly increase. We are not sure why this is happening, but it looks like some mIPSCs are not purely multiplicative at 0.5, particularly in TTX. It is also important to point out that this is a relatively small percent of the total population and the biggest mPSCs can vary to a great degree from one cell to the next. We will discuss this in the resubmission.
  
  The readability of the figures is poor. Some of them have inconsistent boundary boxes, bizarre axes, text that appears skewed as if the figures were quickly thrown together and stretched to fit.
  
  We will address these issues in the resubmission.
  
  I'm concerned about the optogenetic restoration of activity experiment. Cortical pyramidal neuron mean firing rates are log normally distributed and span multiple orders of magnitude. The stimulation experiments can only address the total firing at a network-level - given that a network level "mean" is meaningless in a lognormal distribution, how are we to think about the effect of this manipulation when it comes to individual neurons homeostatically stabilizing their own activities? In essence, the argument is made at the single-neuron level, but the experiment is conducted with a network-level resolution.
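The reviewer's point about lognormal firing rates can be illustrated with a short simulation. This is only a sketch under assumed parameters: the log-scale mean of 0 and standard deviation of 1.5 are arbitrary choices, not values estimated from any recording.

```python
import random
import statistics

rng = random.Random(1)

# Simulated firing rates (Hz) for 10,000 hypothetical neurons.
# The lognormal parameters are assumptions chosen only for illustration.
rates = [rng.lognormvariate(0.0, 1.5) for _ in range(10_000)]

mean_rate = statistics.fmean(rates)
median_rate = statistics.median(rates)

# Fraction of neurons that fire more slowly than the population mean.
frac_below_mean = sum(r < mean_rate for r in rates) / len(rates)

print(f"mean = {mean_rate:.2f} Hz, median = {median_rate:.2f} Hz")
print(f"fraction of neurons below the mean = {frac_below_mean:.2f}")
```

In such a distribution the mean is pulled upward by a minority of high-rate neurons, so most simulated cells fire well below it. A network-level average therefore says little about whether any individual neuron has returned to its own baseline.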
  
  As described above, we do not have the capacity to know what the actual firing rate of a particular neuron was before and after introducing a drug and so we cannot absolutely say that we have restored the original firing rates of neurons. However, there is reason to believe that this is achieved to some extent. Our optogenetic stimulation is only 50-100 ms long activating a subset of neurons. This is sufficient to provide a synaptic barrage that then triggers a full blown network burst where the majority of spikes occur, but this is after the light is off. In other words, the optogenetic light pulse only initiates what becomes a normal network burst that fortunately allows the individual cells to express their relatively normal (pre-drug) activity pattern. In our previous study we show that this is the case for individual units - the spiking of an individual unit during a burst is similar before and after CNQX/optostim (see Figure 4b and Suppl. Fig 4 in Fong et al. 2015 Nat. Comm.). We are not claiming that we have restored spiking to exactly the pre-drug state, but bring it back toward those levels and we see this is associated with a return of the mIPSC amplitude to near control levels. We will include a description of this in the resubmission.
  
  Line 198-99: multiplicativity is not a requirement of a homeostatic mechanism.
  
  Line 264-265 - again, neither multiplicativity nor synaptic mechanisms are fundamentally any more necessary for a homeostatic locus than anything else that can modulate firing rate via negative feedback.
  
  Agreed, see above discussion of homeostat requirement. Will adjust these statements in our resubmission.
  
  277: do you mean AMPAR?
  
  We were not clear enough here. We actually do mean GABAR. The idea is that CTZ increases network activity and thus increases both AMPAergic and GABAergic transmission. We will clarify this in the resubmission.
  
  Example: Figure 1A is frustratingly unreadable. The axes on the raster insets are microscopic, the arrows are strangely large, and it seems unnecessary to fill so much real estate with 4 rasters. Only one is necessary to show the concept of a network burst. The effect of time+CNQX on the frequency of burst is shown in B and C.
  
  Example: Figure 2 appears warped and hastily assembled. Statistical indications are shown within and outside of bounding boxes. Axes are not aligned. Labels are not aligned. Font sizes are not equal on equivalent axes.
  
  We will adjust these issues in the resubmission.
  
  The discussion should include mention of the limitations and/or constraints of drawing general conclusions from cell culture.
  
  We agree and will adjust the discussion. Also, this is why we cited studies that argue GABAergic neurons have a particularly important role in homeostatic regulation of firing following sensory deprivations in vivo.
  
  The discussion should include mention of the role of developmental age in the expression of specific mechanisms. It is highly likely that what is studied at ~P14 is specific to early postnatal development.
  
  We will discuss caveats of cortical cultures at DIV 14-20.
  
  It is essential to ensure that the data presented in the paper adequately supports the conclusions drawn. A more cautious approach in interpreting the results may lead to a stronger argument and a more robust understanding of the underlying mechanisms at play.
  
  Agreed.
  
  Reviewer #2 (Public Review):
  
  Synaptic scaling has long been proposed as a homeostatic mechanism for the regulation of the activity of individual neurons and networks. The question of whether homeostasis is controlled by neuronal spiking or by the activation of specific receptor populations in individual synapses has remained open. In a previous work, the Wenner group had shown that upscaling of glutamatergic transmission is triggered by direct blockade of glutamate receptors rather than by the concomitant reduction in firing rate (Nat Comm 2015). In this manuscript they investigate the mechanisms regulating scaling of GABA-mediated responses in cortical cell cultures using whole-cell recordings to detect GABAergic currents and multielectrode arrays to monitor global firing activity, and find that spiking plays a fundamental role in scaling.
  
  Initially, the authors show that chronic blockade (24 h) of glutamatergic transmission by CNQX first reduces spontaneous spiking (at 2 h), but later (24 h) firing grows back towards higher frequencies, suggesting a compensatory mechanism. Then it is shown that either chronic CNQX treatment or TTX causes a reduction in the amplitude of GABAergic mIPSCs. Effects of CNQX on IPSCs are then reverted by replacing spontaneous network firing by chronic optogenetic stimulation of the entire culture, also indicating that GABAergic transmission is homeostatically regulated by global firing. Enhancing glutamatergic transmission with CTZ increases mIPSC amplitude, while addition of TTX in the presence of CTZ causes the opposite effect. Finally, increasing spiking activity using bicuculline also increases mIPSC amplitude, and the authors conclude that spiking activity rather than neurotransmission controls homeostatic GABA scaling. The manuscript shows interesting properties in the regulation of global GABAergic transmission and highlights the important role of spiking activity in triggering GABA scaling. However, it is strongly recommended to address some caveats in order to better support the conclusions presented in the manuscript.
  
  Major points:
  
  1) The reason why CNQX does not completely eliminate spiking is unclear (Fig. 1). What is the circuit mechanism by which spiking continues, although at lower frequency, in the absence of AMPA-mediated transmission, and what is the mechanism by which spiking frequency grows back after 24h (still in the absence of AMPA transmission)?
  
  Is it possible that NMDA-mediated transmission takes over and triggers a different type of network plasticity?
  
  The bursting in AMPAR blockade is due to the remaining NMDA receptor mediated transmission. We showed this in our previous study in Suppl. Figure 2 and 6 of Fong et al., 2015 Nat. Comm. Our ability to optically induce normal looking bursts of spikes was also dependent on NMDAR activation. Further, in Dr Fong’s PhD dissertation it was shown that the bursting activity was abolished when AMPA and NMDA receptors were both blocked. There are likely many factors that contribute to the recovery of activity, and certainly one of them is likely to be the weakening of inhibitory GABAergic currents. These points will be discussed in the resubmission.
  
  2) A possible activation of NMDARs should be considered. One would think that experiments involving chronic glutamatergic blockade could have been conducted in the presence of NMDAR blockers. Why was this not the case?
  
  Unfortunately, it was not possible to optogenetically restore normal bursting in the presence of NMDAR blockade (even when AMPAergic transmission was intact), as NMDARs appeared to be critical for the optical restoration of the normal duration of the burst (see Suppl. Figure 6 Fong et al., 2015 Nat. Comm). The reviewer raises an excellent point about a possible NMDAR contribution to altered synaptic strength, however. It is likely that NMDAR signaling is reduced in the presence of CNQX since burst frequency was reduced along with AMPAR-mediated depolarizations. We cannot rule out the possibility that NMDAR signaling could contribute to the alterations in GABAergic mIPSCs and will discuss this in the resubmission. However, previous work suggests that a 24/48 hour block of NMDARs (APV) did not trigger AMPAergic scaling in cortical or hippocampal cultures (see Figure 1 Turrigiano et al., 1998 Nature and Suppl. Figure 4 Sutton et al., 2006 Cell); moreover, our previous study showed that restoring NMDAergic transmission optogenetically, at least to some point, had no influence on AMPAergic scaling (Fong et al., 2015, Nat. Comm.). Regardless, we cannot rule out a role for NMDAergic transmission in GABAergic scaling and this discussion will be included in the resubmission.
  
  Also, experiments with global ChR2 stimulation with coincident pre and postsynaptic firing might also activate NMDARs and result in additional effects that should be taken into consideration for the global scaling mechanism.
  
  To be clear, our optical stimulation was turned off before the vast majority of spiking that occurred in the bursts, which played out in a relatively natural manner (see lower panel of Figure 3B optogenetic stimulation – short duration only at onset of burst – we will make this clearer in resubmission). Therefore, we were unlikely to trigger significant synchronous activation that does not normally occur in network bursts.
  
  3) Cultures exposed to CTZ to enhance AMPA receptors generated variable results (Fig. 5), somewhat increasing spiking activity in a non-significant manner but, at the same time, strengthening mIPSC amplitude. This result seems to suggest that spiking might be involved in GABAergic scaling, but it does not seem to prove it. Then, addition of TTX that blocked spiking reduced mIPSC amplitude. It was concluded here that the ability of CTZ to enhance GABAergic currents was primarily due to spiking, rather than the increase in AMPA-mediated currents. However, in addition to blocking action potentials, TTX would also prevent activation of AMPARs in the presence of CTZ due to the lack of glutamatergic release. Therefore, under these conditions, an effect of glutamatergic activation on GABAergic scaling cannot be ruled out.
  
  These concerns were very similar to reviewer 1’s first comments. We will address these issues in the resubmission, but to briefly repeat our responses: We are going a step beyond most scaling studies by assessing MEA-wide firing rate, but this still provides an incomplete picture of the particular cells that we target for patch recordings in terms of their firing before and after a drug. Further, we see considerable variability in effect on firing rate from culture to culture, which we will better recognize in the resubmission. Finally, while the CTZ results are not conclusive, taken together with the optogenetic results we think our results are most consistent with the idea that GABAergic scaling is a strong candidate as a spike rate homeostat.
  
  4) The sample size is not mentioned in any figure. How many cells/culture dishes were used in each condition?
  
  The individual dots represent either individual cells for mIPSC amplitude or individual cultures in MEA experiments. Number of cultures for figures were: Figure 2 – con = 10, TTX = 3, CNQX = 6, Figure 4 – CNQX = 4, con = 10, CNQX/photostim = 6, Figure 5 – ethanol = 3, CTZ = 3, CTZ + TTX =3, Figure 6 – con = 10, bicuculline = 4. We will include the number of cultures for mIPSC amplitude experiments in the figure legends upon resubmission.
  
  5) Cortical cultures may typically contain about 5-10% GABAergic interneurons and 90-95 % pyramidal cells. One would think that scaling mechanisms occurring in pyramidal cells and interneurons could be distinct, with different impact on the network. Although for whole-cell recordings the authors selected pyramidal looking cells, which might bias recordings towards excitatory neurons, naked eye selection of recording cells is quite difficult in primary cultures. Some of the variability in mIPSC amplitude values (Fig. 2A for example) might be attributed to the cell type? One could use cultures where interneurons are fluorescently labeled to obtain an accurate representation. The issue of the possible differential effects of scaling in pyramidal cells vs. interneurons and the consequences in the network should be discussed.
  
  We will include this discussion in the resubmission. Briefly, we chose large cells, which will be predominantly glutamatergic neurons as suggested by the reviewer. Ultimately, even among glutamatergic principal cells there may be variability in the response to drug application. All of these issues could contribute to variability and we will expand our description of the variability in our results, including that based on cellular heterogeneity.
  
  Reviewer #3 (Public Review):
  
  This paper concerns whether scaling (or homeostatic synaptic plasticity; HSP) occurs similarly at GABA and Glu synapses and comes to the surprising conclusion that these are regulated separately. This is surprising because these were thought to be co-regulated during HSP and in fact, the major mechanisms thought to underlie downscaling (TTX or CNQX driven), retinoic acid and TNF, have been shown to regulate both GABARs and AMPARs directly. (As a side note, it is unclear that the manipulations used in Joseph and Turrigiano represent HSP, and so might not be relevant). Thus the main result, that GABA HSP is dissociable from Glu HSP, is novel and exciting. This suggests either different mechanisms underlie the two processes, or that under certain conditions, another mechanism is engaged that scales one type of synapse and not the other.
  
  However, strong claims require strong evidence, and the results presented here only address GABA HSP, relying on previous work from this lab on Glu HSP (Fong, et al., 2015). But the previous experiments were done in rat cultures, while these experiments are done in mice and at somewhat different ages (DIV). Even identical culture systems can drift over time (possibly due to changes in the components of B27 or other media and supplements). Therefore it is necessary to demonstrate in the same system the dissociation. To be convincing, they need to show the mEPSCs for Fig 4, clearly showing the dissociation. Doing the same for Fig 5 would be great, but I think Fig 4 is the key.
  
  We understand the concern of the reviewer as we do see significant variability within our cultures and they were plated in different places, by different people, in different species (rat vs mouse). Therefore, in the resubmission to strengthen the conclusions we will repeat our optogenetic studies restoring activity in the presence of AMPAergic blockade in our mouse cortical cultures and measuring AMPA mEPSCs to assess scaling.
  
  The paper also suggests that only receptor function or spiking could control HSP, and therefore if it is not receptor function then it must be spiking. This seems like a false dichotomy; there are of course other options. Details in the data may suggest that spiking is not the (or the only) homeostat, as TTX and CNQX causes identical changes in mIPSC amplitude but have different effects on spiking. Further, in Fig 5, CTZ had a minimal effect on spiking but a large effect on mIPSCs. Similar issues appear in Fig 6, where the induction of increased spiking is highly variable, with many cells showing control levels or lower spiking rates. Yet the synaptic changes are robust, across all cells. Overall, this is not persuasive that spiking is necessarily the homeostat for GABA synapses.
  
  Together our results argue against AMPAR or GABAR activation as a trigger for GABAergic scaling and that this is different than our results for AMPAergic scaling. These points alone are important to recognize. While changes in spiking do not perfectly follow the changes in GABAergic scaling they do always trend in the right direction. As mentioned above, total spiking activity is only one measure of spiking. It is possible that these drugs alter the pattern of spiking that translates into an altered calcium transient that is important for triggering the plasticity. Again, it is important to note that we are going a step beyond most homeostatic plasticity studies that add a drug and simply assume it is having an effect on spiking (e.g. CNQX was initially thought to completely abolish spiking, but clearly does not). Based on the variability that we observe and the nature of our MEA recordings we cannot precisely determine how the total activity or pattern of activity changes with drug application in the specific cells that we target for whole cell recordings. However, we believe our results are more consistent with our proposal that GABAergic scaling is a strong candidate as a spike rate homeostat. Regardless, in the resubmission we will include a broader discussion about these possibilities, and the reality that there could be multiple homeostatic mechanisms that act to recover spiking activity.
  
  The paper also suggests that the timing of the GABA changes coincides with the spiking changes, but while they have the time course of the spiking changes and recovery, they only have the 24h time point for synaptic changes. It is impossible to conclude how the time courses align without more data.
  
  We can only say that by the 24 hour CNQX time point, when overall spiking is recovered, that GABAergic scaling has already occurred. We will state this more clearly in the resubmission.
  
URL

biorxiv.org/content/10.1101/2023.03.08.531789v2
www.biorxiv.org www.biorxiv.org

Parallel processing of quickly and slowly mobilized reserve vesicles in hippocampal synapses

1
1. Public_Reviews 20 Jun 2023
  
  in eLife
  
  Author Response:
  
  We are grateful to the editors for getting our study reviewed, and are pleased that the reviewers found value in our findings. We plan to submit a revision that we believe can resolve much of the remaining doubt about the major conclusions.
  
  Our current understanding is that much of the uncertainty stems from extensive diversity among synapses. The FM-dye de-staining technique does have single synapse resolution, so it should be possible to develop new kinds of analysis that can make each of our points at the level of individual synapses. For a preview, see Figure 2D (explained in lines 126-141), and Figure 2-Figure supplement 5 of the current version.
  
URL

biorxiv.org/content/10.1101/2020.08.14.251975v4
www.biorxiv.org www.biorxiv.org

New submission 20/06/2023, 09:42:21

1
1. Public_Reviews 20 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #1 (Public Review):
  
  While the CTD human brain organoids show a decrease in Cr (in absence of Cr in the culture medium) as compared to control organoids (4 times less), they are not devoid of Cr. Do these organoids express the two enzymes allowing Cr synthesis (AGAT and GAMT), and in which brain cell types? If yes, how to explain the decrease in Cr in the CTD organoids?
  
  There is a lack of functional CRT in the CTD human brain organoids. The basal level of creatine in CTD human brain organoids is significantly lower than in healthy human brain organoids. Intracerebral creatine synthesis depends on the distinct expression of the AGAT and GAMT enzymes and relies on functional CRT for the transport of the GAA intermediate. The literature indicates that the two enzymes are rarely co-expressed (Braissant et al., 2001, PMID: 11165387), meaning that the GAA intermediate needs to be transported by CRT to neurons for complete creatine synthesis. Although we detected slight mRNA expression of the AGAT and GAMT enzymes, creatine synthesis is not effective because the GAA intermediate cannot be transported into cells expressing GAMT, owing to the non-functional creatine transporter in the CTD human brain organoids.
  
  The rescue experiment, re-establishing a functional Cr transporter (CRT or SLC6A8) in the CTD human brain organoids, is very interesting, as this may help the design and development of new treatments for CTD. However, authors claim that the functional CRT expressed in the rescued CTD organoids was expressed in each cell. This may be a difficulty in the development of new CTD treatments, as CRT should be expressed in neurons and oligodendrocytes, but not in astrocytes. Authors may want to comment on this point.
  
  As shown in Figure S2C, the whole brain organoid in the rescue experiment shows expression of the GFP protein, and thus also of the co-expressed wild-type CRT. In these experiments we did not perform a detailed cellular characterization of the rescued organoids; this may be a task for our next experiments, to characterize precisely the cell-specific CRT expression and function in the rescued brain organoids. Accordingly, in the revised version of the manuscript we will correct the statement on page 6: “SLC6A8 expressing brain organoids showed GFP fluorescence in the whole area of the organoid (Fig S2C).”
  
URL

biorxiv.org/content/10.1101/2023.06.01.543271v1
www.biorxiv.org www.biorxiv.org

New submission 20/06/2023, 09:37:56

1
1. Public_Reviews 20 Jun 2023
  
  in eLife
  
  Author Response
  
  Reviewer #2 (Public Review):
  
  The current work was basically a follow-up of a previous study in juvenile mice, and the results were also very similar to the juvenile results (Sommeijer et al., 2017). One possible interpretation of the results is that the lack of OD plasticity in adult V1 and dLGN was caused by an early blockade of the development of the inhibitory circuit in dLGN, which retains the dLGN in an immature stage till adulthood. The authors indeed claimed in the discussion that the 2-day OD shift is intact in juvenile dLGN and V1 in KO mice, and provided evidence in a supplementary figure that the numbers of GABAergic and cholinergic synapses are similar between WT and KO mice. However, the 7-day OD shift is indeed defective in juvenile V1 and dLGN in KO mice (Sommeijer et al., 2017), and it is possible that this early functional deficit didn't induce a structural remodeling in adulthood. To better support the authors' claim that the lack of adult V1 OD plasticity is specifically due to reduced dLGN synaptic inhibition, the authors should generate conditional KO mice in which dLGN synaptic inhibition is disrupted only in adulthood.
  
  In order to address this point it is important to discuss the plasticity deficits in dLGN and V1 separately.
  
  Concerning V1 plasticity: We have previously shown that brief MD during the standard critical period induces an OD shift in V1 of mice lacking thalamic synaptic inhibition in dLGN (Sommeijer et al, 2017). OD plasticity induced by brief MD is a hallmark of critical period plasticity in V1, and it thus seems unlikely that critical period onset in V1 is defective or that development of V1 is halted in an immature state that does not support OD plasticity in thalamus-specific GABRA1 deficient mice.
  
  The observed plasticity deficit during the critical period was limited to the second stage of the OD shift in V1, which requires long-term monocular deprivation. The straightforward explanation for this result and our current findings is that both during the critical period and in adulthood, the second stage of OD plasticity in V1 induced by long-term monocular deprivation requires thalamic plasticity or inhibition. The proposed alternative, that lack of thalamic synaptic inhibition during development results in a possible lack of structural change in V1 that would cause a lifelong deficiency selectively affecting OD plasticity induced by long-term monocular deprivation, is not impossible but requires many more assumptions.
  
  Concerning dLGN plasticity: The simplest explanation for the observed lack of OD plasticity in dLGN is that it is a direct consequence of the absence of synaptic inhibition in the KO mice. However, an alternative explanation could indeed be that dLGN is kept in an immature (pre-critical period-like) state due to the developmental absence of synaptic inhibition. This situation would be analogous to that in V1 of GAD65 deficient mice (which have reduced GABA release), in which OD plasticity cannot be induced by brief monocular deprivation during the critical period or in adulthood (Fagiolini and Hensch, 2000). Because this deficit can be reversed by treating the mice with benzodiazepines (positive allosteric modulators of GABA receptors) at any age, it is thought that development of V1 in GAD65 mice is halted in a pre-critical period-like state until inhibition is strengthened. We cannot exclude that something similar occurs in dLGN of mice lacking thalamic synaptic inhibition, although we did not observe any changes in hallmarks of dLGN maturity, such as reduced receptive field size (Fig. 1C), and increased cholinergic and inhibitory bouton densities (Suppl. Fig. 1).
  
  However, if the analogy with the developmental deficit in V1 of GAD65 deficient mice is valid, the reduced plasticity is still likely to be a direct consequence of reduced inhibition. In GAD65 deficient mice, long-term monocular deprivation during the critical period causes a full OD shift, showing that no additional deficits (besides reduced inhibition) limit OD plasticity in V1 of these mice (Fagiolini and Hensch, 2000). And, as already mentioned, increasing inhibition rescues OD plasticity in GAD65 KO mice. Thus, the immature state of V1 in these mice is probably a situation in which inhibitory tone is too low to support efficient OD plasticity. In dLGN, knocking out GABRA1 at a later age could therefore also create a situation in which inhibition is too low to support thalamic OD plasticity, which is not different from the situation in which the gene is inactivated at birth. Only if lack of synaptic inhibition in thalamus affects another, unknown developmental process that is important later in life for supporting OD plasticity in dLGN would the proposed experiment result in a different outcome. We are not convinced that this scenario is likely enough to justify repeating most of this study using mice in which GABRA1 is inactivated in dLGN through bilateral AAV-cre injections.
  
  Independently of the exact cause of the plasticity deficit in dLGN, our results make clear that a cortical plasticity deficit in adulthood can have a thalamic origin, which we believe is an important insight that is highly relevant.
  
  2) The authors found that in juveniles, dLGN OD shift is dependent on V1 feedback, but not in adults. However, a recent work showed that the effects of V1 silencing on dLGN OD plasticity could differ with various starting points and duration of the V1 silencing and MD (Li et al., 2023). Could the authors provide more details of the MD and V1 silencing for an in-depth discussion?
  
  We would be happy to include some discussion about this interesting new paper in a revised manuscript. Some of the results may appear to contradict our findings. Most strikingly, the study by Li et al. does not find evidence for OD plasticity in dLGN of 60-day-old mice after 7 days of monocular deprivation. This seems to be at odds with the current work and with that of Jaepel et al. (2017) and Huh et al. (2020). However, in the study by Li et al. (2022), only binocular neurons that responded to both contralateral and ipsilateral stimuli were included when measuring OD. This has important consequences for assessing OD and its plasticity. To illustrate: if dLGN neurons are monocularly responsive to the contralateral eye and become binocular after deprivation of the contralateral eye, they are excluded from analysis before deprivation but included after. This would cause an underestimation of the size of the OD shift. In our experiments, all dLGN neurons with receptive fields within 30° of the center of the visual field were included in the analysis, potentially explaining the different outcomes of the studies.
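
  The selection effect described above can be illustrated with a small numerical sketch. It is not the analysis used in either study: the response values are invented for illustration, and the ocular dominance index is assumed to take the commonly used form (contra - ipsi) / (contra + ipsi). The function names `odi` and `mean_odi` are hypothetical.

```python
def odi(contra, ipsi):
    """Assumed ODI definition: +1 is purely contralateral, -1 purely ipsilateral."""
    return (contra - ipsi) / (contra + ipsi)


def mean_odi(neurons, binocular_only=False):
    """Average ODI over neurons given as (contra, ipsi) response pairs.

    With binocular_only=True, neurons that respond to only one eye are
    dropped, mimicking an analysis restricted to binocular cells.
    """
    selected = [
        (c, i) for c, i in neurons
        if not binocular_only or (c > 0 and i > 0)
    ]
    return sum(odi(c, i) for c, i in selected) / len(selected)


# Invented responses of the same four neurons before and after deprivation
# of the contralateral eye. Two neurons start out monocular (ipsi = 0).
before = [(1.0, 0.0), (1.0, 0.0), (0.8, 0.2), (0.6, 0.4)]
after = [(0.6, 0.4), (0.6, 0.4), (0.5, 0.5), (0.4, 0.6)]

shift_all = mean_odi(before) - mean_odi(after)
shift_bino = mean_odi(before, binocular_only=True) - mean_odi(after, binocular_only=True)

print(f"OD shift, all neurons:       {shift_all:.2f}")
print(f"OD shift, binocular neurons: {shift_bino:.2f}")
```

  In this toy population the two initially monocular neurons carry much of the shift, so dropping them from the pre-deprivation sample yields a smaller apparent shift than the analysis that includes all neurons.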
  
  Also, Li et al observed that an OD shift in dLGN was still present after silencing V1 at p24. This observation is not necessarily at odds with our observation that the OD shift reduces at p30 upon silencing V1, as we find that the ODI does not return to normal but remains slightly lower (though not significantly so). Moreover, the age and the duration of deprivation were different and as mentioned before, analysis was performed differently.
  
  Interestingly, an excitotoxic lesion of V1 was found to alter OD in dLGN during development and affect OD plasticity in dLGN at various ages in the work of Li et al. This suggests that continuous crosstalk between thalamus and cortex during development guides plasticity, possibly optimizing thalamocortical and corticothalamic connections. The continued absence of corticothalamic feedback is likely to have a much larger impact on dLGN plasticity than the acute silencing we performed.
  
  Fagiolini M, Hensch TK. Inhibitory threshold for critical-period activation in primary visual cortex. Nature. 2000 Mar 9;404(6774):183-6.
  
  Huh CYL, Abdelaal K, Salinas KJ, Gu D, Zeitoun J, Figueroa Velez DX, Peach JP, Fowlkes CC, Gandhi SP. Long-term Monocular Deprivation during Juvenile Critical Period Disrupts Binocular Integration in Mouse Visual Thalamus. J Neurosci. 2020 Jan 15;40(3):585-604. doi: 10.1523/JNEUROSCI.1626-19.2019
  
  Jaepel J, Hübener M, Bonhoeffer T, Rose T. Lateral geniculate neurons projecting to primary visual cortex show ocular dominance plasticity in adult mice. Nat Neurosci. 2017 Dec;20(12):1708-1714
  
  Li N, Liu Q, Zhang Y, Yang Z, Shi X, Gu Y. Cortical feedback modulates distinct critical period development in mouse visual thalamus. iScience. 2022 Dec 7;26(1):105752.
  
  Sommeijer JP, Ahmadlou M, Saiepour MH, Seignette K, Min R, Heimel JA, Levelt CN. Thalamic inhibition regulates critical-period plasticity in visual cortex and thalamus. Nat Neurosci. 2017 Dec;20(12):1715-1721.
  
  AuthorResponse
Visit annotations in context

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.03.23.533941v2
www.biorxiv.org www.biorxiv.org

New submission 20/06/2023, 08:41:30

1
1. Public_Reviews 20 Jun 2023
  
  in eLife
  
  Author Response
  
  We sincerely appreciate the reviewers for investing their valuable time in assessing our manuscript. We understand the considerable effort involved in the review process, and we will make use of these suggestions in order to make the revised manuscript more complete in terms of explanation, discussion, additional simulations, experiments and analyses.
  
  -Specifically, we will experimentally and computationally investigate how activation via anti-CD3 antibodies relates to our mechanism.
  
  -We will also utilize a weaker pMHC binder in the pMHC-mediated T cell activation experiments.
  
  -We will improve the description of the function of the FG loop and the role of the connecting peptide (CP).
  
  -Furthermore, we will improve our description of and justification for the simulation methodology. We want to emphasize that all potentials have been described, and we will draw attention to these methodological descriptions where needed.
  
  The reviewers also suggested a number of additional simulations that are probably beyond our current capability. These include:
  
  -simulations of TCR in complex with a weaker agonist
  
  -simulations of the proline and alanine TCR mutants in complex with a pMHC.
  
  While we agree that such simulations would provide new insights into the mechanism of TCR triggering, they simply are not feasible at this time. We will give a more detailed explanation for these arguments in the revised manuscript.
  
  Below, please find our point-by-point planned action items:
  
  Reviewer #1 (Public Review):
  
  The manuscript entitled: "TCR-pMHC complex formation triggers CD3 dynamics" by Van Eerden et al. mainly uses coarse-grained molecular dynamics to probe the dynamic changes, in terms of CD3ε spatial arrangements around 226 TCRs, before and after the engagements of MCC/I-Ek. The broader distributions of CD3ε iso-occupancies after pMHC binding correlate with the decreases of TCR-CD3 contacts and extensions of TCR conformations. Given the observed release of motion restrictions upon antigen recognition, the authors proposed a "drawbridge" model to describe the initial triggering processes from pMHC association to TCR straightening, FG-loop getaway, and CD3 enhanced mobility. In addition, the authors briefly investigated the functional effects of the rigidified connecting peptide (CP) in T-cell activation using in silico and in vitro mutagenesis. The manuscript raises an important and exciting hypothesis about the allostery of TCR-CD3 during TCR triggering; however, because the current computational and experimental evidence is not yet convincing enough to support the conclusions, I would like to see additional work before supporting the publication of this manuscript in eLife.
  
  See details below:
  
  1) As mentioned by the authors, TCR triggering and T cell activation have been illustrated by a number of models, such as mechanosensing and kinetic proofreading, "in which TCRs discriminate agonistic from antagonistic pMHCs." However, the critical feature of antigen discrimination is lacking in the drawbridge model. So far, the CD3ε movements qualitatively distinguish on and off states. The simulation of the antagonist or weaker binder would strengthen the manuscript by demonstrating the relevance of CD3ε mobility in the triggering mechanism. 226 TCR associated with K99E/I-Ek has been resolved in Ref (DOI: 10.4049/jimmunol.1100197), which can potentially serve as the "intermediate" system to formulate the gradual increase of CD3ε dynamics.
  
  Planned actions:
  
  -Explain why the current study cannot easily address pMHC discrimination
  
  -Explain why simulation of antagonist or weaker binding pMHC is technically difficult
  
  2) The linkage between conserved motifs in CP and CDε mobility is less apparent to this reviewer. The notion of the rigidified hinge (PP) requires further clarification. Computationally, the details of fine-grained simulations are required to justify the origin of the apparent mobility increase in PP. The direct comparison between Fig. 2 and Fig. 7 can help assess the relevance of CP through the alignment by FG-loop at a fixed direction in polar coordinates. Experimentally, anti-CD3 positive experiments and, ideally, another antagonist on 3A9 TCRs can strengthen the current functional assay. The baseline level of TCR expression (after positive selection) and 0h activation (Fig. S8) is missing.
  
  Planned actions:
  
  -Provide additional analysis of the role of CP as a hinge
  
  -Better clarify the FG simulation methodology
  
  -Align the CG and the FG polar plots
  
  -Perform experiments with anti-CD3 antibody 2C11
  
  -Perform additional experiment using weaker agonist (HEL peptide mutant)
  
  -Measure baseline-level TCR expression
  
  -Perform T cell activation experiments at t=0 h
  
  3) Regarding the section "The TCRβ FG loop acts as a gatekeeper," besides contact analysis, additional motion analysis, such as RMSF or PCA, can further establish the importance of FG loops.
  
  Planned actions:
  
  -Perform additional analyses of FG loop dynamics
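
  For reference, the RMSF measure mentioned by the reviewer can be sketched as follows. This is a generic, minimal illustration and not the analysis planned by the authors: it assumes the frames have already been aligned to a common reference, and the function name `rmsf` and the two-frame toy trajectory are hypothetical.

```python
import math


def rmsf(trajectory):
    """Root-mean-square fluctuation per atom.

    trajectory: list of frames, each frame a list of (x, y, z) tuples,
    one per atom. Frames are assumed to be pre-aligned, so fluctuations
    reflect internal motion rather than overall rotation or translation.
    """
    n_frames = len(trajectory)
    n_atoms = len(trajectory[0])
    values = []
    for a in range(n_atoms):
        # Time-averaged position of this atom.
        mean = [sum(frame[a][k] for frame in trajectory) / n_frames for k in range(3)]
        # Mean squared deviation from the average position.
        msd = sum(
            sum((frame[a][k] - mean[k]) ** 2 for k in range(3))
            for frame in trajectory
        ) / n_frames
        values.append(math.sqrt(msd))
    return values


# Toy trajectory: atom 0 is static, atom 1 moves along x between frames.
toy = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0)],
]
values = rmsf(toy)
print(values)
```

  Applied per residue, a profile of this kind would show whether the FG loop is more or less mobile than the rest of the constant domain in each simulated system.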
  
  4) The discussion on anti-CD3 antibody effects and their potential contribution to CD3 mobility is highly recommended.
  
  Planned actions:
  
  -We will add the discussion of anti-CD3 antibody effects
  
  Reviewer #2 (Public Review):
  
  In this research article a new allosteric mechanism for T cell receptor (TCR) triggering upon peptide-MHC complex binding is presented in which conformational change in the TCR facilitates activation by controlling CD3 dynamics around the TCR. The authors find that the Cb FG loop acts as a gatekeeper and Cb connecting peptide acts as a hinge to control TCR flexibility.
  
  As an initial result, the authors set up two sets of simulations - TCR-CD3 and pMHC-TCR-CD3 in POPC bilayers - and identified that the CD3e chains exhibit a wider range of mobility in the pMHC-TCR-CD3 system as compared to the TCR-CD3 system. Next, they examined the contacts between all subunits during the course of both simulations and established that CD3g and CD3eg made far fewer contacts with TCRb in the pMHC-TCR-CD3 simulations. Next, they identified that the TCR is extended in the pMHC-TCR-CD3 simulations, with a larger tilt angle of 150°, and that the FG loop acts as a gatekeeper that allows CD3 movements upon pMHC binding. Finally, mutations in the Cb connecting peptide region indicated that a rigidified TCR leads to a hypersensitive TCR, as shown both by simulations and by in vitro experiments.
  
  The following major concerns must be addressed.
  
  Major concerns:
  
  1) The simulations were performed with intracellular regions unfolded and free from the membrane. A more complete system should have the intracellular regions embedded in the membrane. An NMR structure of CD3e is available (Xu et al., Cell, 2008) and could have been modeled into the TCR-CD3 system before the simulation. Prakaash et al. (PLoS Comput Biol, 2021) studied cytoplasmic domain motions in their simulation experiments.
  
  Planned actions:
  
  -Explain why we cannot perform adequate additional simulations of ITAMs
  
  2) Comparing Fig. 2C and Fig.7C, the movement of CD3eg is more restricted in Fig.7C. Is this because the simulation time is lower in the mutation experiments?
  
  Planned actions:
  
  -Explain the differences between the CG and FG polar plots
  
  3) Only TCR-CD3 simulations were performed for the PP and AA mutants. A simulation with pMHC (pMHC-TCRmutants-CD3) should be performed to show increased CD3 mobility.
  
  Planned actions:
  
  -Explain why TCR-CD3-pMHC simulations of the mutants are not feasible at this time
  
  4) Using CD3e antibody, OKT3, for activation instead of pMHC as a separate experiment would add more value to this study. They can look at CD3 mobility and TCR elongation in the system with OKT3 antibody and compare it to the CD3 mobility and TCR elongation with the pMHC system. They can also use OKT3 with AA and PP mutants and perform both simulation and in vitro activation experiments.
  
  Planned actions:
  
  -Perform anti-CD3 (2C11) experiments
  
  -Perform CG simulation of TCR with CD3 Fab fragment
  
  -Explain why we cannot perform FG simulations of TCR mutants with CD3
  
  5) The activation experimental data is rather underwhelming. The difference between WT and PP in 2hr experiment at 0.016 ug/mL looks exceedingly low. A stronger TCR-pMHC system should be considered for the in vitro activation experiments.
  
  Planned actions:
  
  -Explain that this is a dilution curve, which is why at lower concentrations the effect is smaller, but at higher concentrations the effect is clear
  
  6) There is some concern that the scientific work lacks solid experimental functional data and lacks novelty, given earlier TCR-CD3 simulation studies (Pandey et al., 2021, eLife) that already reported flexibility and elongation of the complex.
  
  Planned actions:
  
  -Explain the similarities and differences between this and Pandey’s work; clarify how our study contributes novel findings
  
  Reviewer #3 (Public Review):
  
  The authors first explore structural differences of unbound TCR-CD3 complexes and pMHC-bound TCR-CD3 complexes with coarse-grained simulations. In the simulations with pMHC-bound complexes, the transmembrane (TM) domains of the TCR-CD3 complex and of pMHC are embedded in two opposing membrane patches. In the pMHC membrane patch, a pore is created and stabilised in the simulation setup with the aim to allow water transport in and out of the compartment between the membranes. The authors report a more upright conformation of the TCR extracellular (EC) domain in the simulations in which this EC domain is bound to pMHC, compared to simulations with unbound TCR, and postulate an allosteric signalling model based on these apparent conformational changes and associated changes in TCR-CD3 quaternary arrangements. Subsequently, the authors identify a GxxG motif in the TCRbeta connecting peptide between EC domain and TM domain as putative hinge in allosteric signalling, and explore the effect of double proline and double alanine substitutions in atomistic simulations and experiments.
  
  While these simulation and experimental setups and the addressed questions are of interest in the field, the following weaknesses prevail in my overall assessment of the work:
  
  (1) I am not convinced that the reported coarse-grained simulation results are sound or allow the conclusions stated in the work to be drawn. In the simulations with a pMHC-bound TCR-CD3 complex, the intermembrane distance in the setup shown in Figure S1 appears excessively large and likely leads to a rather strong force in the membrane-vertical direction and to the reported upright conformation of the TCR EC domain. This upright conformation thus appears to be a consequence of force from the simulation setup, rather than a consequence of pMHC binding alone as suggested by the authors. While the membrane pore in principle allows water exchange, the relaxation of the intermembrane distance resulting from this water exchange in the 10 microsecond long simulation trajectories is not (but needs to be) addressed. This relaxation eventually would lead to an equilibrated membrane separation, in which essentially no force is exerted on the TCR-pMHC EC complex. However, I suspect that this computationally demanding equilibration is not achieved in the simulations, with the consequence that forces on the TCR-pMHC EC complex in the membrane-vertical direction remain.
  
  In addition, I am not convinced that the Martini force field of the coarse-grained simulations allows a reliable assessment of the quaternary interactions between the TCR and CD3 EC domains. Getting protein structures and interactions right in coarse-grained simulations is notoriously difficult. In simulations with the coarse-grained Martini force field, secondary protein structures are constrained as a standard procedure, and the authors also use a recommended Go-potential procedure, likely to stabilise tertiary protein structures. The quaternary interactions between the TCR EC domain and the pMHC EC domain are modelled by rather strong harmonic constraints to prevent dissociation. While the treatment of the quaternary interactions between the TCR EC domain and the CD3 EC domains in the simulations is not (but needs to be) addressed in detail, I suspect that there are no additional, or only weak constraints to stabilise these interactions. In any case, I think that a faithful representation of these quaternary interactions is beyond the reach of the Martini force field, as is the reported diffusion of the CD3 EC domains around the TCR EC domain, which plays a central role in the allosteric mechanism proposed by the authors (see Fig 2 and 5).
  
  Planned actions:
  
  -We will provide further description and justification for the CG simulations
  
  (2) The allosteric model suggested by the authors is motivated in an introduction that appears to omit central controversial aspects in the field, as well as experimental evidence that is not compatible with allosteric conformational changes in the TCR. These aspects are:
  
  The mechanosensor model is controversial. In original versions of this model, a transversal force has been postulated to be required for T cell activation. However, more recent single-molecule force-sensor experiments reported in J Goehring et al., Nat Commun 12, 1 (2021) provide no evidence for a scenario in which transversal forces beyond 2 pN are associated with T cell activation.
  
  The role of catch bonds is controversial. Evidence for TCR catch bonds has been mainly obtained in experimental setups using the biomembrane force probe, in which force is applied to TCRs on the surface of T cells, but is not reproduced in experimental setups using isolated TCRs, see e.g. L Limozin et al., PNAS 116, 16943 (2019)
  
  Ref. 1 of the manuscript prominently discusses the kinetic segregation model of T cell activation, which is not (but needs to be) addressed in the introduction. In this model, exclusion of CD45 from close-contact zones around pMHC-bound TCRs triggers T cell activation. The model is supported by evidence from diverse experiments, see for example M Aramesh et al., PNAS 118, e2107535118 (2021) and Ref. 1. At least part of this evidence is not compatible with mechanosensing or allosteric models of T cell activation.
  
  Planned actions:
  
  -We will add the requested literature references and include a better description of the kinetic segregation model
  
  AuthorResponse
Visit annotations in context

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.07.27.501668v4
www.biorxiv.org www.biorxiv.org

New submission 19/06/2023, 10:05:57

1
1. Public_Reviews 19 Jun 2023
  
  in eLife
  
  Author Response:
  
  The major criticism from the reviewers is that factors other than high-impact rare variants – such as environmental factors or epistasis – could have produced the complex tail architecture that we test for and detect. While we did explain this point in the Discussion, we agree with the reviewers that this should have been emphasized more and earlier in the manuscript.
  
  Regarding suggestions for more complex simulations and methods, we absolutely agree that much more work is needed here to produce optimised inference of all the causes of complex tail architecture. We are performing multiple projects at various stages of completion that we hope will contribute to this, but we felt that this was a good stopping-point in this project to publish what we had completed so far, in order to: (1) introduce the idea of inferring complex genetic architecture from siblings without requiring genetic data, (2) outline an initial theoretical framework for inferring complex tail architecture from sibling data, (3) provide simple tests powered to identify enrichments of de novo or ‘Mendelian’ variants in the tails (albeit tests that make several strong simplifying assumptions), (4) enable others interested in the topic to build upon this work now. However, we plan to expand our simulations and analyses in a revised manuscript based on reviewer feedback.
  
  We thank the reviewers for their comments about the value of our work, its mathematical robustness and the promise of our method.
  
  AuthorResponse
Visit annotations in context

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.02.19.529159v1
