4,993 Matching Annotations
  1. Mar 2021
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Bothe and colleagues studied the effect of repeated glucocorticoid exposure on DNA accessibility and gene expression in A549 and U2OS-GR cells and show that most of the glucocorticoid receptor (GR) induced changes are reversible in both cell lines and after long-term (20 hrs) and short-time (4 hrs) dexamethasone (Dex) or cortisone treatment. They identified a single gene that seem to have persisting memory of previous Dex exposure, namely ZBTB16.

      Major Comments

      1. The authors used the cancer cell lines A549 and U2OS-GR as model systems the latter additionally overexpresses GR. In order to make the work more translatable an in-vivo model comparing the effect of long-term, short-term and repeated glucocorticoid (GC) treatment on DNA accessibility and gene expression is necessary. The authors should clearly emphasizes this limitation of their study in the discussion or add in-vivo data (e.g. qPCRs) to strengthen the translatability.
      2. The authors draw conclusions of the association of DNA accessibility, H3K27ac, P300 and GR occupancy from independent heatmaps. This cannot be easily done from the current way the data is presented. A direct link between accessibility, H3K27ac and mRNA expression of the associated gene for example is missing. a.) The authors show several heatmaps to indicate changes in accessibility, H3K27ac and P300 upon Dex treatment as well as GR binding patterns in Fig. 1 and S1. Those are sorted by decreasing signal strength (I assume). To make those results more comparable, I suggest to sort them all in the same way (e.g. by descending ATAC-Seq signal or fold-change). b.) In line with a.), it is unclear to the reader if those sides opening /closing are the same sides showing increased/decreased H3K27ac or P300 occupancy and if those sides bind GR. Integrating this data together with mRNA e.g as correlation plots would strengthen the author's argument that accessibility, H3K27ac and mRNA changes are indeed correlated. What about the GR binding sites that do not change accessibility or H3K27ac? What makes those different? Therefore, the statement "Furthermore, closing peaks, which show GC-induced loss of H3K27ac levels and lack GR occupancy (Fig. S1c-f), were enriched near repressed genes" on page 10 as well as the statement "suggesting that transcriptional repression by GR does not require nearby GR binding." in the abstract and discussion cannot be made from how the data is presented. c.) Several recent studies have shown that GR's effect on gene expression and chromatin modification at enhancers might be locus-/context-specific ("tethering", competition, composite DNA binding) and/or recruitment of different co-regulators (see Sacta et al. 2018 (doi: 10.7554/eLife.34864), Gupte et al. 2013 (doi.org/10.1073/pnas.1309898110) and many more). Defining the GR-bound or opening/closing sides in terms of changing H3K27ac (or having H3K27ac or not) more closely would help to link those to gene expression changes e.g. in violin plots. Furthermore, the authors could include a motif analysis to see if the different enhancer behaviours can be explained by differences in the GR motif sequence or co-occurring motifs. Thereby more closely defining the mechanism of chromatin closure a sites that lack GR binding e.g. by displacement of other transcription factors as described for p65 in macrophages (Oh et al. 2017 (doi.org/10.1016/j.immuni.2017.07.012)). In general a more detailed analysis of the data is required before the authors could state "Instead, our data support a 'squelching model' whereby repression is driven by a redistribution of cofactors away from enhancers near repressed genes that become less accessible upon GC treatment yet lack GR occupancy." on page 10. The results might also be explained by competitive transcription factor binding, tethering or selective co-regulator recruitment (e.g. HDACs).
      3. The authors use U2OS-GRa cells as a second cell line. Those cells overexpress rat GRa (see DOI: 10.1128/mcb.17.6.3181) in a cell line that usually does not express GR. I am wondering to what extend the overexpression reflects residence times and GR binding kinetics of cells endogenously expressing GR (mostly to at a lower protein level). At least the number of GR binding sites as well as the number of opening chromatin sites is much higher in U2OS-GR cells the A549 cells. The authors should discuss this point with respect to the observed preservation of some GR-binding sites U2OS-GR cells after Dex treatment and washout.
      4. In figure 1 and S1, the authors show coverage plots on top of the heatmaps to show the mean signal in ATAC-Seq, GR, H3K27ac or GR signal between the different subset. These plots are statistically inappropriate as a significant portion of the enhancers does not have a signal and a few enhancers show a very strong signal (at least for H3K27ac, P300 and GR) which skews the mean. Plotting the signal distribution or the distribution of the Dex-dependent change in signal (fold-change, e.g. as violin plots) more accurately reflects the diversity in the signal response.
      5. ChIP qPCRs against histone marks in figures 5B and S2C are not normalized for histone H3, but the author's clearly see changes in nucleosomal occupancy at those sides by ATAC-Seq. Additional normalization by total H3 is highly recommended.
      6. Figures 1C, 2D, 4A/B, 5B/C/E, 6C/F, S2C/E and S3A-D lack statistics.
      7. In figure 6, the authors compare the ZBTB16 locus with FKBP5, a locus that as by the data presented is very different from the ZBTB16 locus in terms of expression level (Fig 6C/F) and H3K27me3 occupancy (Fig. 5B). The authors should compare ZBTB16 to a locus with similar expression level and H3K27me3 deposition. Especially the co-occurrence of H3K27me3 and H3K4me3 (Fig. 5B) at the ZBTB16 promoter indicates its poised chromatin state whereas the FKBP5 promoter is marked by an active chromatin state.
      8. ZBTB16 itself is a transcriptional regulator, but its elevated expression upon repeated Dex treatment does not affect other genes. How do the authors explain this observation? Is ZBTB16 elevated on the protein level as well?

      Minor Comments

      1. The authors nicely explained the data analysis of their ATAC-Seq data, I recommend to include some more information on if and how the ChIP-Seq data was normalized (library size, scaling factors or spike-ins) even if most of the data sets are published.
      2. In figures 1F and S1F, the authors show the association of opening/closing an non-changing sites and GR peaks with genes that are up/down-regulated or unchanged upon Dex treatment. This gene-centric analysis is skewed by the different sizes of up-/down regulated gene sets and opening/closing chromatin (especially for the U2OS-GR cells that have 15.6x more opening sites then closing sites). Could the authors also include a peak-centric view showing how many closing/opening and non-changing sites are associated with down/up-regulated or unchanged genes? How good is the association (correlation)?
      3. In the figures 1F and S1F it is unclear how the authors handled genes with associated peaks (within +/-50kb) that show different characteristics e.g. a gene with a peak that gains and another peak that loses accessibility. How do the authors account for >1 opening or closing peaks per gene? In relation to this. Do opening/closing sites cluster around up/down-regulated genes? What is the stoichiometry as 1.6x more closing sites (then opening sites) relate to 1/3 of repressed when compared to activated genes?
      4. The authors claim on p10 that "We could validate several examples of opening and closing sites and noticed that opening sites are often GR-occupied whereas closing sites are not occupied by GR". As most of the ChIP-Seq experiments were performed on formaldehyde-only fixed cells, the authors might miss "tethered" sides, which are mostly linked to gene repression. You might rephrase this part to most closing sites lack direct DNA binding.
      5. The P300 ChIP-Seq in Fig S1B shows less sides with P300 occupancy then sides with H3K27ac. Is this a ChIP quality issue or do other factors mediated changes in H3K27ac? Similar to mayor comment 1a, are the P300 sites on the top the same sites as the top H3K27ac sites?
      6. Please indicate the primer position of qPCR primers if the genome browser tracks are displayed. That makes the comparison of sequencing and qPCR results easier.
      7. The authors nicely show that GR binding sites with persisting accessibility after Dex treatment and washout in U2OS-GR cells show residual GR binding and are bound by GR at Dex concentrations of 0.1nM. Could the authors specify if differences in the GR motif exist between those and the non-persisting sites?
      8. The authors focus on ZBTB16, FKBP5 and GILZ to show the priming effect of glucocorticoid treatment on ZBTB16 (Fig. 4), but GILZ was not included in the initial ATAC-Seq (Fig. 1) and ATAC-Seq washout (Fig. 2) experiments. For better comparison, I recommend adding qPCR results on GILZ in figures 1 and 2.
      9. The authors indicate that the washout of Dex does restore gene expression in A549 cells to pre-Dex levels (Fig. 4). These cells did not show any persisting GR binding, so. How does the gene expression in U2OS cells behave? E.g. for the genes displayed in Fig. S2C.
      10. In Fig. S3C, the authors observe that Gilz expression in U2OS-GR cells is similarly induced upon 1st and 2nd stimulation with Dex using 4hrs treatment. How does this relate to the preserved Dex response after 20hrs treatment and washout (Fig. S2C)? Was the expression of GILZ altered after 20hrs (see comment 9)? Are H3K27ac and GR signal after 4hrs Dex stimulation and washout comparable as well? Please comment on the differences observed between the 20hrs and 4hrs experiments.
      11. The GR enhancer of ZBTB16 seems to be simultaneously marked H3K27ac and H3K27me3 (Fig. 5A). Please comment. Is this an artefact of bulk ChIP-Seq? Is this due to the different timings (H3K27me3 after 1h and H3K27ac after 3hrs)? Can both marks co-exists or do they reflect allelic differences?
      12. Please comment on the observed differences in H3K27me3 response to Dex between ChIP-Seq (Fig. 5A) and the ChIP qPCR (Fig. 5B). Is this a timing issue?
      13. Please indicate the number of replicates for the ChIP-Seq experiments in the figure legends.
      14. The statement "Upon hormone treatment, both the number of transcripts per cell and the number of transcriptional foci increases." on page 13 is confusing. Most cells only have two alleles (max. two transcription foci). Is ZBTB16 duplicated in A549 cells?
      15. ZBTB16 is marked by H3K27me3 (Fig. 5A/B). How many GR binding sites do overlap H3K27me3 in A549 cells? How many genes associated with GR/H3K27me3 sites are expressed in A549 cells? Is ZBTB16 the only one?
      16. Is ZBTB16 a GR target gene that is regulated by GR tissue-independently (like GILZ and FKBP5)?

      Significance

      The work is of significant interest as glucocorticoids (GCs) are physiologically secreted with circadian and ultradian rhythms, but widely prescribed with repeated dosing during the day (in order to maintain high GC levels) in patients during chemotherapy (doi: 10.1016/j.critrevonc.2018.04.002, doi: 10.1186/1471-2407-8-84 ) or anti-inflammatory therapy in rheumatoid arthritis (doi: 10.1186/ar4686, doi:10.1093/rheumatology/kes086) for example. Therefore the assessment of long-term versus short-term as well as the effect of repeated GC exposure on various cell types is of high interest to understand adverse effects of GC therapy. However, the choice of cell lines as model system dampens the overall translatability of the findings, as does the choice of those cell lines. Alveolar epithelial cells (A549) are not classically known as a cell type affected by GC side effects in therapy. However, GR is widely known to regulate tissue-specific gene programs (doi: 10.1038/emboj.2013.106, doi: 10.1016/j.steroids.2016.05.003, doi: 10.1016/j.molcel.2011.06.016). Hepatic, skeletal muscle cells or fat cells would reflect those tissues more accurately. Obtaining in-vivo data is hampered by the cofounding effect of endogenous glucocorticoids and their circadian expression (doi: 10.1016/j.molcel.2019.10.007 ), but primary cells would overcome those limitations and still be a closer model system the cancer cell lines.

      That the glucocorticoid receptor mostly binds accessible genomic regions and changes the DNA accessibility of a subset of binding sites after short-term treatment with Dex was previously described (doi: 10.7554/eLife.35073, doi: 10.1093/nar/gkx1044, doi: 10.1038/ng.759 ), but the reversibility of these effects were not studied before. Therefore, this study adds an interesting conceptual finding.

      The observation that ZBTB16 expression can be boosted by repeated Dex treatment is interesting and seems to be tissue independent. Again, in-vivo or patient data confirming this observation would strengthen the conclusions from this paper and exclude an artefact from immortalized (cancer) cell lines. The impact of this observation depends on ZBTB16 function and if ZBTB16 is elevated on the protein level as well.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Bothe et al investigated whether GR induced chromatin changes could be somehow preserved after inactivation of the receptor. They performed ATAC-seq to examine the status of chromatin accessibility under several treatment conditions in two different human cell lines. Their main finding is that GR changes to chromatin are universally reversable, with the exception of a tissue-specific single locus (ZBTB16). Additionally, the authors claim their data support a squelching mechanism for transcriptional repression by GR.

      Major comments:

      Are the key conclusions convincing? Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Are the data and the methods presented in such a way that they can be reproduced? Are the experiments adequately replicated and statistical analysis adequate?

      The manuscript is very well written. The data is clearly presented. The methods are explained in sufficient detail with a few exceptions mentioned below, and statistical analysis are adequate. There are some concerns and suggestions about the experimental design and data presentation.

      • Drug treatments. It is not clear whether the cells were previously grown on charcoal-stripped serum before hormone treatments. From methods, it seems they were grown in 5% FBS and directly treated with the hormones. Also, what "hormone-free medium" mean? Is it charcoal stripped Serum or not Serum at all? Replicates for these data sets? The ATAC and Chip-Seq should have at least 2. The concordance of the ATAC-seq and Chip-seq replicates should be described and shown in supplemental figures. Fig1A - The ATAC-seq HM should be clustered to show which peaks in opening/closing and unchanged peaks also have called GR chip peaks. Showing browser shots as in Fig1B is cherry picking data and can be put in a supplementary figure as an example. This is a main point of emphasis of the manuscript so show the data. The atac peaks that do overlap with GR chip peaks should be sorted by GR peak intensity. The QPCR is then only needed to confirm the quantitative changes.

      To show both the ATAC sites and H3K27ac sites are specific to hormone treatment, a random set of 15K peaks not in this peak set also should be shown in HMs and should not change with the treatments. Why does the H3K27ac go down in the 6768 non changing sites with dex?

      The D & E parts of Fig1 can then be eliminated to become parts of Fig1A. Its not clear in the text that the HMs in Fig1 are all sorted in the same way.

      • Fig. 1b (and d). The ChIP data is from 3h-hormone treatment while the ATAC-seq data is from a 20h hormone treatment. It seems a bit misleading to directly compare GR occupancy with the state of the chromatin at different time windows. Shouldn't the authors show their ATAC-seq 4h treatment data (shown in Fig S1) here instead?
      • Fig. 1f. The authors sate "downregulated genes only show a modest enrichment of GR peaks". However, there is a significant enrichment of GR-peaks in repressive genes compared to non-regulated genes. It would be interesting to see how some of these peaks look in a browser shot. While the general conclusion "transcriptional repression, in general, does not require nearby GR binding", seems valid, the observation that many GR peaks appear directly bound to nearby repressed genes ought to be more emphatically recognized in the text.
      • Concept of naïve cells (Fig. 3A). If cells are normally grown in serum-containing media, which is known to have some level of steroids, can the cells described here as "Basal expression" be truly free of a primed state? In the first part of the experimental design (+/- 4h hormone), which type of media is present here? Is it 5% FBS? A concern is that the authors may require the assumption that the (4h + 24h) period a is sufficient to erase all memory of the cells, which is exactly what they are trying to test.

      The transcriptional memory is a second major emphasis of the paper.

      The RNA primers (Table 1) span within an exon or across 2 exons to best measure mRNA levels. The QPCR primers should span exon intron boundaries to better reflect transcriptional activity (prior to mRNA splicing) at the collection time point.

      It would be interesting to do a time course of the hormone-free period of the washout to determine the memory of the chromatin environment that results in the enhanced transcriptional response instead of just 24 and 48 hrs in A549 cells.

      Fig 5A appears to show H3K27ac overlaying H3K27me marks near the promoter of ZBTB16 and at the GR sites within the gene locus with no reduction in H3K27me levels. This seems counterintuitive and should be explained or addressed especially since the authors use quantitative comparisons of H3K27ac levels with and without treatment in other figures.

      Showing the changes of ZBTB16 upon 2nd stimulation via FISH is not terribly surprising and is even the most expected reason for higher RNA levels. Why does it only occur at that gene is a better question and is touched on in the discussion. It is more likely that this gene has a very low level of pre-hormone transcription compared to FKBP5 (see Fig 3e and the FISH images). ZBTB16 is in the lower 3rd of basemean RNA levels of GR responsive genes according to the RNAseq data. Selection of 1 or 2 other genes with similar basemean levels of RNA (from the RNA-Seq data) would make the data more

      Minor comments:

      Specific experimental issues that are easily addressable. Are prior studies referenced appropriately? Are the text and Figures clear and accurate? Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      • In the Intro (paragraph two), the authors explain the different mechanisms by which GR might repress genes. One alternative the authors appear to have missed is the possibility of direct binding to GREs while, for example, recruiting a selective corepressor such as GRIP1 (Syed et al., 2020). There are many recent critics to the notion that transrepression via tethering is responsible for GR repressive actions at all (Escoter-Torres et al., 2020; Hudson et al., 2018; Weikum et al., 2017).
      • When the authors introduce the concept of tethering to AP-1, they go way back to the first description of tethering. However, one of the references (Ref 20) actually goes against the tethering model as they did not detect protein-protein interactions between AP-1 and GR, and also, they conclude that repression requires the DNA-binding domain. -Figure 2. The authors state "This suggests that the few sites with persistent opening are likely a simple consequence of an incomplete hormone washout and associated residual GR binding". The authors should check the subcellular distribution of GR after their washout protocol. If the washout is not completed, GR should still be in the nuclear compartment.
      • The first part of the manuscript (Repression through "squelching") seems a bit disconnected from the rest of the results (reversibility in accessibility). The abstract is structured in a way that this disconnection seems much less obvious. Perhaps the authors could try to present their squelching part in the middle of the manuscript, following the flow of the abstract? This is just a suggestion.
      • Figures have CAPS panel letters (A,B,C, etc) while the text calls for lower case letter (a,b,c...)

      Escoter-Torres, L., Greulich, F., Quagliarini, F., Wierer, M., and Uhlenhaut, N.H. (2020). Anti-inflammatory functions of the glucocorticoid receptor require DNA binding. Nucleic Acids Res 48, 8393-8407. Hudson, W.H., Vera, I.M.S., Nwachukwu, J.C., Weikum, E.R., Herbst, A.G., Yang, Q., Bain, D.L., Nettles, K.W., Kojetin, D.J., and Ortlund, E.A. (2018). Cryptic glucocorticoid receptor-binding sites pervade genomic NF-kappaB response elements. Nat Commun 9, 1337. Syed, A.P., Greulich, F., Ansari, S.A., and Uhlenhaut, N.H. (2020). Anti-inflammatory glucocorticoid action: genomic insights and emerging concepts. Curr Opin Pharmacol 53, 35-44. Weikum, E.R., de Vera, I.M.S., Nwachukwu, J.C., Hudson, W.H., Nettles, K.W., Kojetin, D.J., and Ortlund, E.A. (2017). Tethering not required: the glucocorticoid receptor binds directly to activator protein-1 recognition motifs to repress inflammatory genes. Nucleic Acids Res 45, 8596-8608.

      Significance

      The study tackles two important questions. One is regarding the effects of inducible transcription factors on chromatin structure after inactivation. The second is on the mechanisms behind transcriptional repression.

      The effect of GR inactivation on chromatin accessibility has already been addressed in previous work for a single locus (Refs 38) or genome wide (Ref 33). However, the 24h temporal windows have not been addressed before. In this sense, the manuscript sheds some new light into the matter. Even though the authors conclude that accessibility is globally reversable, they only studied in detail the mechanism behind a single-locus exception.

      Regarding the mechanisms behind transcriptional repression, the authors present data supporting the squelching mechanism, which is still highly controversial.

      The manuscript will be of interest to the molecular and cell biology communities, especially those working on chromatin structure, transcription factors, gene regulation, and nuclear receptors. Overall, this is an interesting paper with somewhat limited novel findings that is suitable for publication after addressing the above comments. The rigor of the findings needs to be better described via replicates and if they have not been done, it should be a major requirement of revision.

      The reviewers specialize in transcription factor dynamics.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank both reviewers for their comments on our manuscript. Our responses to their specific comments and plan to modify the manuscript are described bellow.

      Response to reviewer #1

      • *

      > Figure 1C is difficult to interpret. Am I supposed to see anything in particular in the two insets? Please provide a descriptive interpretation of the inset and let the reader know if anything in particular is to be noted.

      We agree that it is a bit difficult to interpret although the goal of these images is to show that spermatogenesis appears globally not disturbed until the histone-to-protamine transition in distorter males. We will change the legend of the figure to clarify this particular point.

      > Figure 1D. Elsewhere, in supplemental S1, they have an image for SD5/CyO. This should be provided here as a control. As presented, the genetics don't prove that the interaction between SD5 and Gla is the cause of the phenotype. As presented in the figure, the effect could be caused by SD5 alone, independent of Gla. In S1, this is not the case - SD5 with CyO doesn't produce the phenotype. Likewise, I think they should provide the SD5/CyO image in S1A in Figure 1C.

      We can add the images of the SD5/CyO genotype (currently in FigS1) in Fig1C (whole testis) and in Fig1D (single cyst). We also suggest to present in this figure the other distorter genotype cn bw/CyO (which is currently in Fig S1). However, because the modified figure is going to be too big, we also suggest to split Figure 1 in two Figures with Figure 2 presenting FISH results including all controls. In this case, we will remove the supplemental figure 1.

      > Figure 1E. These images are the formal proof (especially for Gla/SD5 genotype) that the large Rsp array is on the chromosomes that seem destined for removal from the cyst. However, there is no control. The authors should provide FISH results for the genotypes Gla/CyO and, ideally, also cn1 bw1/Cyo.

      We agree with reviewer #1 and will provide images of the Gla/CyO control and cn1 bw1/Cyo in a new Figure 2 as explained above.

      > Figure 2. Keeping consistent with other figures, can the Gla/SD5 panel be in the middle?

      Yes. We swapped the Gla/SD5 and cn bw/SD-Mad panels.

      Also, shouldn't there be SD5/CyO in Figures 2, 3 and 4, to demonstrate that the phenotypes are the result of the interaction rather than just SD5? I am OK with providing just the cn 1 bw1/SD-Mad here alone, since it is simply contrasted with Gla/SD5.

      We agree that it would be better to show also the SD5/CyO controls. However, we chose to show only one control (Gla/CyO) to make the figures easier to read. We thus suggest to provide all images of the SD5/CyO genotype in supplemental figures.

      > Figure 3A. In the scheme, can you provide greater detail as to where F-Actin is expected?

      The scheme was modified to clarify this point.

      > Figure 3B. It is stated that there is a size difference in the nuclei for IC stage and greater variation in ProtB-GFP staining within bundles. Can there be an effort to quantify these observations?

      It would be difficult to quantify ProtB-GFP signal intensity and nuclear size in IC stage cyst because nuclei are very close to each other. The best way would be to squash testes to spread spermatid nuclei but there might be a bias on nuclear size/shape due to the squashing procedure. In addition, on squashed preparations, it is difficult to be sure that the nuclei analyzed and compared belong to the same cyst. We agree that quantification would help to describe the phenotype better but we think that the best read-out of the different SD phenotypes is the quantification of number of abnormally compacted nuclei in seminal vesicles which is provided later in the manuscript.

      > Figure S4. There doesn't appear to be the same phenotype for Gla/SD-Mad (DAPI, ProtB-GFP) in post-IC stage bundles compared to what is seen in 3C for Gla/SD-5. In particular, in figure 3, the defective nuclei seem to be trailing, but in S4, while the bundle appears disorganized, there doesn't appear to be the trailing nuclei. Is this difference real or is it just the result of a single picture contrast? Some clarification could be helpful.

      Actually, the images that were shown on Figure 3C for Gla/SD5 post-IC probably show the SD5 nuclei of one cyst (the normal one) and the Rsp nuclei being eliminated from another cyst (these are trailing behind nuclei which are too far to be included in the same image). We thus changed the images for Gla/SD5 for an image which looks like the one shown for Gla/SD-5 genotype for clarity.

      We did not mention this observation in the manuscript but we actually see cysts in which abnormally-shaped nuclei are trailing behind the normal nuclei and sometimes IC cones around the abnormally shaped nuclei seem to be stuck close to the normal nuclei which are already individualized. It might be possible that IC progression around abnormal nuclei is slowed down compared to normal nuclei. The difference could not reflect different phenotypes but more likely different states of a dynamic process.


      Response to reviewer #2


      Reviewer #2 had no specific comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      SD is a multi-component system, where two major factors Sd (a truncation allele of RanGAP that mislocalizes) and Rsp, a satellite DNA (whose copy number determines sensitivity to RanGAP distorting allele).

      This study by Herbette et al. provide cytological characterization of Drosophila SD (segregation distortor), a male meiotic drive system, focusing on the process of histone-to-protamine transition. By thoroughly studying multiple alleles of SD, they find that the mechanisms by which SD accomplishes segregation distortion are not uniform. In some cases, spermatogenesis is perturbed at the level of protamine incorporation and in other cases, mature sperm can be generated yet they exhibit distorted segregation.

      In one combination Gla/SD5, histone elimination is delayed (never complete), whereas cn bw/SD-Mad exhibit normal timing in histone elimination/protamine incorporation, although these two combinations result in similar, severe degree of distortion. They further show that DNA compaction is incomplete in these SD alleles (again more severe in Gla/SD5 condition) by using dsDNA antibody. Interestingly, defective spermatids in Gla/SD5 combination never progress to sperm maturation and enter seminal vesicle, defective spermatids in cn bw/SD-Mad combination are capable of entering seminal vesicle, but likely fail to fertilize or develop after fertilization, resulting in distortion.

      This is a well-done study and provides important insights into the mechanisms of segregation distortion in the Drosophila melanogaster SD system. The quality of data is high, and I don't have any major concerns on this manuscript. Of course, the exact mechanisms of how SDs drive (i.e. why Rsp(S) alleles fail to condense properly, and how it is related to the Rsp copy number) remains unclear, this study provides a significant step forward to tackle this fascinating phenomenon of segregation distortion.

      Significance

      This study provides important insights into the underlying mechanism of segregation distortion during D. melanogaster spermatogenesis. Segregation distortion is a fascinating phenomenon of significant interest in evolutionary biology. Thorough cytological characterization of spermatogenesis phenotype that leads to segregation distortion provides much needed information, and this study is a significant step forward to understand how meiotic drivers might exploit the system to distort segregation for their advantage.

      Referees cross-commenting

      I think I and reviewer #1 seems to be in good agreement. I don't have anything in particular to add. This is a nice paper.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a very nice paper that combine cytology and genetics to provide insight into the mechanism of segregation distortion in the Drosophila SD system. The conclusions are well supported with multiple different experiments from different in angles. By using different genetic backgrounds - their conclusion that Rsp abundance dictates distinct outcomes is well supported. My primary suggestion is that they include a few more controls and provide some additional quantitative analysis. In some cases, quantitative conclusions are made without sufficient support.

      Specific Comments.

      Figure 1C is difficult to interpret. Am I supposed to see anything in particular in the two insets? Please provide a descriptive interpretation of the inset and let the reader know if anything in particular is to be noted.

      Figure 1D. Elsewhere, in supplemental S1, they have an image for SD5/CyO. This should be provided here as a control. As presented, the genetics don't prove that the interaction between SD5 and Gla is the cause of the phenotype. As presented in the figure, the effect could be caused by SD5 alone, independent of Gla. In S1, this is not the case - SD5 with Cyo doesn't produce the phenotype. Likewise, I think they should provide the SD5/CyO image in S1A in Figure 1C.

      Figure 1E. These images are the formal proof (especially for Gla/SD5 genotype) that the large Rsp array is on the chromosomes that seem destined for removal from the cyst. However, there is no control. The authors should provide FISH results for the genotypes Gla/CyO and, ideally, also cn1 bw1/Cyo.

      Figure 2. Keeping consistent with other figures, can the Gla/SD5 panel be in the middle? Also, shouldn't there be SD5/CyO in Figures 2, 3 and 4, to demonstrate that the phenotypes are the result of the interaction rather than just SD5? I am OK with providing just the cn 1 bw1/SD-Mad here alone, since it is simply contrasted with Gla/SD5.

      Figure 3A. In the scheme, can you provide greater detail as to where F-Actin is expected?

      Figure 3B. It is stated that there is a size difference in the nuclei for IC stage and greater variation in ProtoB-GFP staining within bundles. Can there be an effort to quantify these observations?

      Figure S4. There doesn't appear to be the same phenotype for Gla/SD-Mad (DAPI, protoB-GFP) in post-IC stage bundles compared to what is seen in 3C for Gla/SD-5. In particular, in figure 3, the defective nuclei seem to be trailing, but in S4, while the bundle appears disorganized, there doesn't appear to be the trailing nuclei. Is this difference real or is it just the result of a single picture contrast? Some clarification could be helpful.

      I think this is a nice paper and I enjoyed reading it very much. The combination of the genetics (different RSP alleles from nature and the different X chromosomes) with the cytology provide a very reasonable explanation for why different genotypes seem to yield different effects. It provides some reconciliation among previous studies.

      Significance

      I think it is very significant.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are grateful for the careful read and constructive comments provided by the 3 reviewers assigned to our manuscript. Each reviewer provided thoughtful and clearly structured comments that helped us to better clarify points or summarize results in the manuscript that they indicated were not presented clearly or completely. We have revised the manuscript to address the points raised by the reviewers, incorporating edits and additional text throughout the manuscript, figure legends, and supplemental materials. We feel the revised version of the manuscript is much improved as a result of the revisions in response to the reviewers.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors demonstrate a powerful method utilizing mNGS of individual mosquitoes utilizing reference-free analysis. This allows researchers to combine the resulting datasets of mosquito identification, blood-meal source, microbiome, viral sequencing, etc. Such knowledge could be a useful tool in detecting and responding to transmission of mosquito-borne diseases that affect human or animal populations, even though the technology is currently likely too expensive for widespread use (as acknowledged by the authors).

      Major Comments:

      No major revisions requested.

      The authors provide their detailed methodology, including code, allowing for replication by other groups.

      Minor Comments:

      The authors' discussion of using this technique in order to detect pathogens should be qualified regarding detection vs possible transmission. Detecting a virus in an engorged mosquito does not necessarily mean that said mosquito can transmit the virus, but may have simply acquired it from a recent blood meal. The same can be said of detecting a plant pathogen following a recent sugar meal.

      From the methods, it seems that mosquitoes were not washed prior to processing. This may make it difficult to discriminate between internal and external microbiota as well as lead to cross-contamination of surface microbiota between mosquitoes collected in the same trap.

      Significance

      This work currently would be of interest to other research groups examining the co-occurence of pathogens, other microbiota, and blood meals for field collected mosquitoes. While of great potential application to public health surveillance, the current cost is likely prohibitive.

      My field of expertise is virology and vector biology with minimal background in NGS.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors utilized unbiased meta-transcriptomic in sequencing 148 diverse wild-caught mosquitoes (Aedes, Culex, and Culiseta ​mosquito species) collected in California, with main aim of detecting sequences of eukaryotic, prokaryotic and viral origin. Their results show that majority of their sequenced data assembled into contigs corresponding to viral genomes. In their data, 7.4 million viral reads clustered as +ssRNA viruses including ​Solemoviridae, Luteoviridae, Tombusviridae, Narnaviridae, Flaviviridae, Virgaviridae, and Filovirida​ whereas 2.25 million viral reads identified as -ssRNA viruses comprising of ​Peribunayviridae, Phasmaviridae, Phenuiviridae, Orthomyxoviridae, Chuviridae, Rhabdoviridae, and Ximnoviridae​. With 0.94 million viral reads, dsRNA viruses formed the third most abundant virus category with viruses under families ​Chrysoviridae, Totiviridae, Partitiviridae, and Reoviridae. Under the prokaryotic taxa, Wolbachia​ species was the dominant group, followed by other lower abundance bacterial taxa that includes Alphaproteobacteria, Gammaproteobacteria, Terrabacteria group, and Spirochaetes. Trypanosomatidae was the most dominant eukaryotic taxa, followed up by reads from ​Bilateria​ and Ecdysozoa taxa. Ultimately, this study demonstrates that single mosquito meta-transcriptomic analysis has potential in identifying vectors of human health significance, potent emerging pathogens being transmitted by them and their reservoirs all in one assay.

      Major comments:

      1.Are the key conclusions convincing? The conclusions are accurate.

      2.Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? None. The study's results, discussion and conclusion are appropriate.

      3.Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      As much as the authors describe the use of mNGS as a tool in validating mosquito species and providing an unbiased look at the vector-associated pathogens, it is still prudent for them to use qPCR to validate the obtained RNASeq data (e.g. validation of the viral sequences).

      4.Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. The outlined methodology is realistic.

      5.Are the data and the methods presented in such a way that they can be reproduced? The methodology is reproducible.

      6.Are the experiments adequately replicated and statistical analysis adequate? Yes

      Minor comments:

      1.Specific experimental issues that are easily addressable. qPCR validation the obtained RNASeq data should be conducted.

      2.Are prior studies referenced appropriately? The recently publications about mosquito microbiome/virome should be added. (eg.  doi: 10.1128/mSystems.00640-20.)

      3.Are the text and figures clear and accurate? The resolution for Fig 4, Fig 6, SFig 2, SFig 4, and SFig 5 is poor. The author should update them.

      4.Do you have suggestions that would help the authors improve the presentation of their data and conclusions? (1)in the method section, the mosquito has been washed to avoid the contamination from the environment before RNA extraction? (2)most part of non-host reads are matched to the viruses (10.5M), however only few of them were belong to the prokaryotes, does it means mosquito carries more viruses than prokaryotes. (3)none of the mosquito-borne virus known to occur in California (eg. WNV, SLEV, WEEV, ) has been found in Table 1 for the virus detected with complete genome in this study. In contigs level, did the author detected any mosquito-borne virus known to occur in California. Since the mNGS is very sensitive and this study include large sample numbers, why no known mosquito-borne virus was detected in their study should be discussed.

      Significance

      1.Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. With the existential threat of emerging novel pathogens of global health concern, efficient and rapid public health surveillance strategies are crucial in monitoring and possibly averting such eventual calamities. Specifically, mosquitoes are widely diverse and are known to harbor and transmit various pathogenic agents to humans and animals. Thus, this rapid identification of relevant vector species, pathogens and their reservoirs in one assay is a promising and convenient aspect of surveillance in the public health sector.

      2.Place the work in the context of the existing literature (provide references, where appropriate). Shi et al reported the first single mosquito viral metagenomics study, in which her and the team demonstrated the feasibility of using single mosquito for viral metagenomics, a methodology that has potential to provide much more precise virome profiles of mosquito populations. In the present study, the authors have gone a step higher by aiming to combine three objective points in single mosquito meta-transcriptomic, as described in brief in their abstract and the comprehensive methodology outline.

      Reference: Shi, C., Beller, L., Deboutte, W. et al. Stable distinct core eukaryotic viromes in different mosquito species from Guadeloupe, using single mosquito viral metagenomics. Microbiome 7, 121 (2019). https://doi.org/10.1186/s40168-019-0734-2

      3.State what audience might be interested in and influenced by the reported findings. The methodology and findings described in this manuscript are important in advancing the public health field of vector surveillance. The identification of relevant vector species, pathogens and their reservoirs in one assay is a promising and convenient aspect of surveillance.

      4.Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I am an Associate Professor at a research institute. My lab research work focuses on Arbovirology studies, more specifically vector surveillance of known and novel viruses associated with mosquitoes and ticks, mosquito-transcriptomic studies, mosquito viruses tropism studies and other related mosquito-virus interaction studies.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a very interesting and well designed study on mNGS of mosquitoes. The authors demonstrate that they can distill valuable information on the vector species, the source of the blood meals and the microbiome/virome using a simple experimental approach and using single mosquitoes. A highlight of the work is that the paper is very comprehensive with an overwhelming dataset and thoughtful analysis. It is a showcase how sequencing data from a relative compact number of mosquitoes specimens can be used to conduct sophisticated computational analysis leading to meaningful conclusions. The authors make a strong case for the power of mNGS of mosquitoes that may be applicable to other (invertebrate) species. Especially the phylogenetic analysis based on SNP distance without have reference genomes and the grouping of contigs by means of co-occurence in datasets is original. We feel that the work deserves to be published.

      Significance

      We have a number of comments that the authors may consider in further improving the quality of their manuscript:

      What is the impact of this paper?

      I think it is possible that the paper will have a decent impact on the mosquito arbovirus field, because it adequately shows the possibilities that individual mosquito sequencing can bring (e.g. co-occurrence analysis). It may shift the balance to doing more individual mosquito sequencing instead of pools. The paper is also very extensive in the analyses that it does on this very rich data set. Below, some suggestions are given for additional analysis, which should be interpreted as a compliment to the interesting data set acquired. It should however be noted that the ideas and approaches taken are not entirely new. Sequencing individual mosquitoes, co-occurrence analysis and metagenomic sequencing have been done before, although not to this extent and not in this field. Several novel possibilities:

      1. An unbiased way to check if you have the correct mosquito species and the ability to detect subspecies. Using the genetic distance of the transcriptomes they have likely corrected the missed identification in some samples, where these calls had a logical mistake made. The fact that subspecies overlapped with the sites of capture is very interesting and confirms the relevance of looking at the genetic distance also within species.
      2. Blood-meal analysis from sequence data. Here they can get to species level for 10 out of 40 blood-engorged mosquitoes. The idea is interesting, as you would be able to get a lot more information if you can determine blood-meal origin from RNA-seq data (as shown in this paper). However, I feel that in the current paper (and this may be intentional) they do not properly show that RNA-seq is an adequate alternative to DNA sequencing of the blood. To convince me, I would have liked to have these results compared to DNA sequencing and see how much overlap there is. I understand however that the choice was made not to do this, but I do have a small note for the information given now. It was mentioned that 1 contig with an LCA of vertebrates is enough for a 'blood-meal origin' call. I am however left to wonder how reliable is 1 read? Are there really no contigs with an LCA in vertebrates in the non blood-fed mosquitoes? Also, what do we think happened in the mosquitoes that were visibly bloodfed but nothing was found; any speculation?
      3. The study of co-occurrence, although not novel, is a nice addition to the mosquito virome/microbiome determination field. Identifying novel segments and missed segments of viruses is very nice. I do however wonder: did it ever occur that co-occurrence finds a 'linked' fragment that was clearly wrong? Were some post-analyses done to check if the results make sense? It seems, especially because the paper elaborates on examples, that you need some follow-up. This is not problematic, but a nice addition to the paper would be (as is also described below) to mention which segments were added to viral genomes by co-occurrence and if some checks were done to verify these hits.
      4. Being able to say something about differences in viruses within the same mosquito species is super interesting. Pools do not give the possibility to say something about profiles and prevalence and the large size (148 mosquitoes) allows to find interesting correlations.

      What parts do you think are problematic?

      1. We question the validity 'blood-meal calls' as outlined above.
      2. In this study they use % of non-host reads as a measure for the abundance of a pathogen (see e.g. Figure 3). I don't understand this at all... If you have more pathogens, then the amount of non-host reads would have to go up right? It seems to assume that the amount of non-host reads you have is similar in all samples? It becomes even more problematic when the trend is mentioned that having a higher % of non-host reads for Wolbachia is related to a lower % of non-host reads for viruses. This seems to be trivial as the amount of non-host reads goes up with increased Wolbachia infection, and therefore the % of non-host reads for viruses goes down due to the larger denominator. A different number than 'non-host reads' should be taken to normalise the data and say something about abundance. E.g. host reads or spiked RNA?

      What are the most relevant questions you are left with?

      1. I am curious about the limited overlap with Sadeghi et al., 2018, who sequenced so many Culex mosquitoes in California. I would suggest to say a little but more about these discrepancies and their potential causes in the discussion.
      2. What do the authors think are in those 'dark reads'? Is the amount of dark reads the same across the different samples? Similarly, are the 'tetrapoda' reads reduced/absent in mosquitoes with a reference genome available?
      3. In the first part of the results, mention is made to being able to characterize to kingdom level 77% of the 13 million non-host reads (also see comment on non-host reads below). I am however puzzled with the description in the text and supplemental figure 3: which 3 million contigs were not able to be characterized? Where in supplemental figure 3 are they? This is especially puzzling as the main text mentions that 11 million non-host reads are from complete viral genomes, 0.9 million to eukaryotic taxa and 0.7 million to prokaryotic taxa?
      4. There seem to be 131 bars, corresponding to individual mosquitoes, in figure 3? Where are the remaining 17?

      What are your tips (in addition to responses to above questions)?

      1. I think the definition of 'non-host reads' needs to be clearly made and used consistently across the document. At the end of the paragraph 'Comprehensive and quantitative analysis of non-host sequences detected in single mosquitoes' the concept of "...13 million non-host reads..." is introduced. At first glance of supplemental figure 3 it seems that "non-host reads" could also be defined as the 16.7 aligned reads that are left after putative host sequences are removed. Although it is true that the derivation of 13 million is explained in the figure text of supplemental figure 3, it may be easier for the reader (as it cost me some time) to explain this in the main text. In addition, is the definition of 'non-host reads' (corresponding to 13-million reads) corresponding to "classified non-host reads" in the following excerpt: "For every sample, "classified non-host reads" refer to those reads mapping to contigs that pass the above filtering, Hexapoda exclusion, and decontamination steps. "Non-host reads" refers to the classified non-host reads plus the reads passing host filtering which failed to assemble into contigs or assembled into a contig with only two reads."? This caused some confusion.
      2. I believe it would be a valuable addition to add a table for the viruses which includes: 1) How it was determined that the complete genome is there, 2) The percentage overlap for those segments that were identified with blast and 3) Which viruses were already known.
      3. Have the numbers of the caught mosquitoes somewhere written out in the materials and methods.
      4. Pg2 L1-3: "Metagenomic sequencing..... a single assay." Perhaps a bit early for this statement. Would suggest to place it two paragraphs later before:"Here, we analyzed...."
      5. Figure S4 is too pixelated to read. Perhaps due to pdf conversion, but please do check before submission.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Many flatworm species reproduce asexually, by fission, and the process relies on the activity of stem cells (neoblast), which drive regeneration. The question that this work tries to address is what is the dynamics of stem cells in this process, including how many stem cells contribute to regeneration, what are the mutation rates and selection mechanisms, if any. Towards this, the authors tracked one specimen of planarian Girardia tigrina for more than ten rounds of fission, and re-sequenced its genome at multiple time points and applied methods of population genetics to analyze and model the data. The main conclusion of the work is that there is high somatic mutation rate, rapid loss of heterozygosity, and a small size of the stem cell population that contributes to regeneration after fission.

      Reviewer #1 (Significance (Required)):

      The work has value, since it provides a framework to address the evolutionary aspects of stem cell dynamics in flatworms. However, as the authors point multiple times, there are many unknown biological parameters, such as, for example, the ratio of cell to organism regeneration (g), and simplifications, which can significantly influence the results. For this reason, the authors provide a range of estimates for somatic mutation rates and the effective stem cell population size, rather than some final conclusions. As the authors point out, further work will be needed to refine the model but generating new data for that is beyond the scope of this manuscript. As such, I find this manuscript is an important initial contribution to the field of stem cell population dynamics in flatworms, and its methods, results and conclusions convincing. I don't have further suggestions for improving this manuscript.

      Thank you very much for your positive assessment of our work.

      **Referees cross-commenting**

      I agree with the suggestion of Reviewer #2 that repeating the analysis on additional contings, instead of focusing only on one longest contig in the assembly, will be useful.

      We will process and analyze a few additional contigs to evaluate genomic variation in transmission of somatic variants in this system.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The authors aimed to use population genetics to determine the number of stem cells active in each cycle of regeneration or the equality of their relative contributions in planarians. They approached this by establishing a population with serial fission from one wild isolate of Girardia cf. tigrina collected in Italy. They used next generation sequencing to sample variants of regenerated worms at different generations of fissioning. They estimated the effective population size of stem cells to be a few hundreds, besides calculation of nucleotide diversity and somatic mutation rate. They propose small effective number of propagating stem cells might contribute to reducing reproductive conflicts in clonal organisms.

      **Major comments:**

      • Are the key conclusions convincing?

      The mutation rate is reasonable. The effective stem cell population size and the genetic diversity may vary between different species. A small effective stem cell population size is not counter intuitive.

      Generally, the work is interesting and deserves to be published.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The current analysis is based on many assumptions, one single set of experiments and a genome that is not well assembled. The authors have been careful with their language and documented the limitations in discussion.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I will feel more comfortable if the authors can repeat the analysis with two more random long contigs to have a better idea if the localization of markers impacts the conclusion. The concern is if different parts of the genome behave differently and if the Girardia genome is highly repetitive. As the pipeline of analysis is established, I expect this can be completed in a month with no experimental cost.

      We will process and analyze a few additional contigs to evaluate genomic variation in transmission of somatic variants in this system.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes

      • Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      **Minor comments:**

      • Specific experimental issues that are easily addressable.

      Yes.

      • Are prior studies referenced appropriately?

      Yes.

      • Are the text and figures clear and accurate?

      Yes.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Yes. Please also see the significance section.

      Specifically, my concerns are about writing and the context of current study.

      "Small effective number of propagating stem cells might contribute to reducing reproductive conflicts in clonal organisms." is confusing. Not a good abstract ending sentence for the work presented. Reproductive conflicts need clarification. How the work relates to that concept needs support. I recommend keeping the interpretations simple and focused on the data.

      Please see response under “Significance”.

      Reviewer #2 (Significance (Required)):

      Both questions the authors attempt to address, the genetic diversity of clonal animals and the number of stem cells contributing to regeneration, are interesting and important. The combination of these two is a bit odd in the manuscript. In other words, the population genetics approach did not address the cell biology question:how many or what proportion of stem cells are active in each cycle of regeneration. I would recommend the authors to focus the writing on one question only: the genetic diversity and evolution of a clonal species, which is driven by stem cell genome evolution and the process of regeneration. The cell biology question, phrased by the author in the abstract and introduction, need to be resolved by cell biologists. I understand the appeal to put the current study in the context of regeneration research. A balance should be achieved. Currently, the second sentence of the abstract and the first paragraph of introduction are odd and misleading. The first paragraph of the introduction can be a second paragraph to introduce the planarian system for the study.

      We will restructure the manuscript to clearly separate the findings that arose directly from experimental (sequencing) data i.e. magnitude and inheritance pattern of somatic variation, and the findings that were inferred from our approximate population genetic model and depend on the unknown parameter g i.e. the effective number of stem cells and the somatic mutation rate. We will emphasize the distinction. The statements that are tangentially relevant and are not directly supported by our analyses will be modified or removed.

      In the context of genetic diversity of clonal species, many studies shall be referenced. It is interesting as well to draw comparisons with other species. Asexual planarians are unique and interesting in that space.

      Thus said, the attempt to examine stem cell population genetics is especially interesting and important as the fissiparous planarians do not undergo bottleneck selection by zygotes. In the context of recent progress studying planarian genetic diversity (Nishimura, O. et al. 2015, Guo, L. et al. 2016), Asgharian H. et al.'s work is timely and an important contribution to planarian researchers and evolutionary biologists. The question has general interest to cancer biologists as well. The manuscript does not have the data and is not written in a way to reach such broader audiences yet. A community is growing to address these questions.

      We agree with the reviewer’s point about the pioneering works of Nishimura et al. 2015 and Guo et al. 2016. Both papers were indeed cited in our manuscript. We will cite more studies pertaining to the question of somatic genetic diversity in planarians.

      The study of planarian genetic diversity has just started with two publications (Nishimura, O. et al. 2015, Guo, L. et al. 2016). It is reasonable to have lots of limitations and assumptions in the manuscript. The work is an interesting piece to be published, assuming the major points listed in the review is addressed. The reported findings will be part of the early literature and inspiration for planarian researchers and evolutionary biologists. I expect many more future manuscripts will be published, either to reexamine the reported findings or to push our understanding of the question deeper.

      Thank you very much for this assessment. We fully agree.

      My expertise is with planarian biology, genome, genetics, and diversity. I do not have sufficient expertise to evaluate the equations used in the study.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The authors aimed to use population genetics to determine the number of stem cells active in each cycle of regeneration or the equality of their relative contributions in planarians. They approached this by establishing a population with serial fission from one wild isolate of Girardia cf. tigrina collected in Italy. They used next generation sequencing to sample variants of regenerated worms at different generations of fissioning. They estimated the effective population size of stem cells to be a few hundreds, besides calculation of nucleotide diversity and somatic mutation rate. They propose small effective number of propagating stem cells might contribute to reducing reproductive conflicts in clonal organisms.

      Major comments:

      • Are the key conclusions convincing?

      The mutation rate is reasonable. The effective stem cell population size and the genetic diversity may vary between different species. A small effective stem cell population size is not counter intuitive.

      Generally, the work is interesting and deserves to be published.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The current analysis is based on many assumptions, one single set of experiments and a genome that is not well assembled. The authors have been careful with their language and documented the limitations in discussion.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I will feel more comfortable if the authors can repeat the analysis with two more random long contigs to have a better idea if the localization of markers impacts the conclusion. The concern is if different parts of the genome behave differently and if the Girardia genome is highly repetitive. As the pipeline of analysis is established, I expect this can be completed in a month with no experimental cost.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes

      • Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Yes.

      • Are prior studies referenced appropriately?

      Yes.

      • Are the text and figures clear and accurate?

      Yes.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Yes. Please also see the significance section. Specifically, my concerns are about writing and the context of current study.

      "Small effective number of propagating stem cells might contribute to reducing reproductive conflicts in clonal organisms." is confusing. Not a good abstract ending sentence for the work presented. Reproductive conflicts need clarification. How the work relates to that concept needs support. I recommend keeping the interpretations simple and focused on the data.

      Significance

      Both questions the authors attempt to address, the genetic diversity of clonal animals and the number of stem cells contributing to regeneration, are interesting and important. The combination of these two is a bit odd in the manuscript. In other words, the population genetics approach did not address the cell biology question:how many or what proportion of stem cells are active in each cycle of regeneration. I would recommend the authors to focus the writing on one question only: the genetic diversity and evolution of a clonal species, which is driven by stem cell genome evolution and the process of regeneration. The cell biology question, phrased by the author in the abstract and introduction, need to be resolved by cell biologists. I understand the appeal to put the current study in the context of regeneration research. A balance should be achieved. Currently, the second sentence of the abstract and the first paragraph of introduction are odd and misleading. The first paragraph of the introduction can be a second paragraph to introduce the planarian system for the study.

      In the context of genetic diversity of clonal species, many studies shall be referenced. It is interesting as well to draw comparisons with other species. Asexual planarians are unique and interesting in that space.

      Thus said, the attempt to examine stem cell population genetics is especially interesting and important as the fissiparous planarians do not undergo bottleneck selection by zygotes. In the context of recent progress studying planarian genetic diversity (Nishimura, O. et al. 2015, Guo, L. et al. 2016), Asgharian H. et al.'s work is timely and an important contribution to planarian researchers and evolutionary biologists. The question has general interest to cancer biologists as well. The manuscript does not have the data and is not written in a way to reach such broader audiences yet. A community is growing to address these questions.

      The study of planarian genetic diversity has just started with two publications (Nishimura, O. et al. 2015, Guo, L. et al. 2016). It is reasonable to have lots of limitations and assumptions in the manuscript. The work is an interesting piece to be published, assuming the major points listed in the review is addressed. The reported findings will be part of the early literature and inspiration for planarian researchers and evolutionary biologists. I expect many more future manuscripts will be published, either to reexamine the reported findings or to push our understanding of the question deeper.

      My expertise is with planarian biology, genome, genetics, and diversity. I do not have sufficient expertise to evaluate the equations used in the study.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Many flatworm species reproduce asexually, by fission, and the process relies on the activity of stem cells (neoblast), which drive regeneration. The question that this work tries to address is what is the dynamics of stem cells in this process, including how many stem cells contribute to regeneration, what are the mutation rates and selection mechanisms, if any. Towards this, the authors tracked one specimen of planarian Girardia tigrina for more than ten rounds of fission, and re-sequenced its genome at multiple time points and applied methods of population genetics to analyze and model the data. The main conclusion of the work is that there is high somatic mutation rate, rapid loss of heterozygosity, and a small size of the stem cell population that contributes to regeneration after fission.

      Significance

      The work has value, since it provides a framework to address the evolutionary aspects of stem cell dynamics in flatworms. However, as the authors point multiple times, there are many unknown biological parameters, such as, for example, the ratio of cell to organism regeneration (g), and simplifications, which can significantly influence the results. For this reason, the authors provide a range of estimates for somatic mutation rates and the effective stem cell population size, rather than some final conclusions. As the authors point out, further work will be needed to refine the model but generating new data for that is beyond the scope of this manuscript. As such, I find this manuscript is an important initial contribution to the field of stem cell population dynamics in flatworms, and its methods, results and conclusions convincing. I don't have further suggestions for improving this manuscript.

      Referees cross-commenting

      I agree with the suggestion of Reviewer #2 that repeating the analysis on additional contings, instead of focusing only on one longest contig in the assembly, will be useful.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We want to thank the reviewers for their careful evaluation of our work and their helpful suggestions. We provide at the end of this letter a point by point response of how we aim to address their concerns, which can be summarised in the following main points:

      1-We will provide further evidence for the efficiency and dynamics of beta-catenin deletion in adult neural stem cells in vivo (point raised by both reviewers).

      We fully agree that although we tested for the disappearance of beta-catenin transcripts in sorted NSCs after deletion, providing further proof of the absence of beta-catenin protein in these cells will help strengthen our conclusions. For this, we are performing additional stainings for beta-catenin and Wnt/beta-catenin targets, together with neural stem cell markers, to quantify the loss of beta-catenin and Wnt/beta-catenin signalling in NSCs at P90 (30 days after deletion), as well as new P150 samples (90 days after deletion).

      2-We will investigate in further detail the effects (or lack of effect) of beta-catenin deletion on adult neurogenesis.

      The focus of our work is the effect of Wnt/beta-catenin signalling on NSCs. Nevertheless, we agree with reviewer 2 that extending our analysis to later stages in the neurogenic process will be of importance to better contrast our results with previous reports identifying a role for Wnt in neuronal production in the adult hippocampus. We are currently processing new material from mice in which beta-catenin was deleted at P60 and brains collected after 3 months to evaluate the long-term effects of beta-catenin deletion on the neurogenic output of NSCs. We will also perform stainings of Wnt-responsive neuronal genes, such as NeuroD1 and Prox1, at P90 and P150 in both control and beta-catenin cKO mice.

      3- We are aiming to confirm that the in vitro effects of CHIR99021 on NSCs are mediated by beta-catenin. We already provide evidence that stimulation with Wnt3a has the same effect as inhibition of GSK3beta by CHIR99021. To further prove the link of the observed effects to Wnt-beta-catenin signalling, we will repeat some of our key experiments using beta-catenin floxed cells (both induction of neuronal differentiation and re-activation from quiescence) as reviewer 1 suggests.

      Reviewer #1

      Overall, the results are reliable and important for the field. However, several points need to be addressed and clarified to support their conclusion. I am hopeful that the authors find my comments helpful and constructive.

      Many thanks for your insightful comments, we believe they will indeed help us improve our manuscript.

        • Validation of cKO in vivo.*
      • Although the authors validated cKO of beta-catenin in vivo using FACS/qPCR at the transcript level, it would be important to check when and to what extent beta-catenin proteins are downregulated in qNSC/activeNSCs in vivo. This will be easily assessed by immunohistochemistry. In the same line, although the authors confirmed the reduction of beta-catenin signaling using beta-gal signaling in cKO mice, it would be important to check if this can be cross-checked by staining the nuclear localization of beta-catenin. This confirmation would strength the authors statement and clear that some remained beta-catenin at the plasma membrane may not be compensating their function.*

      • Independent of the confirmation of beta-catenin cKO, it would be important to check if the downstream targets of Wnt/beta-catenin signals (ex. Expression of Axin2) were also attenuated. This point should be addressed both in vivo and in vitro. *

      We are performing immunohistochemistry and quantification of beta-catenin in control and cKO brain samples, as suggested by the reviewer. Unfortunately, we have not yet found an antibody and labelling protocol that allows us to detect nuclear beta-catenin, even in control samples, so with our current antibody, we won’t be able to show a reduction in nuclear localization of beta-catenin in the cKO samples. We are testing alternative beta-catenin antibodies that could help us overcome this limitation. As the reviewer mentions, we do see a reduction in reporter expression in BATGAL mice upon deletion of beta-catenin. In order to further demonstrate effective Wnt signalling attenuation in our mutant mice we are testing antibodies for Wnt targets such as Axin2, CcnD1 and NeuroD1.

      • Wnt/beta-catenin signals in qNSC and active NSC in vitro.*

      The authors indicated that the depletion of beta-catenin had no effect on qNSCs and active NSCs in vitro. However, it is not clear whether Wnt/beta-catenin signaling is activated in their culture conditions. If there are no inputs of Wnt signaling in cultured cells, the depletion of beta-catenin will not lead any impacts. Therefore, it would be critical to check if the Wnt-signaling is activated in control cells in their culture condition, and if the downstream targets of Wnt-signaling are downregulated in cKO qNSCs/active NSCs.

      We agree that this is an important conceptual point that needs to be clarified. From our data (see Figure S3C), we can see that deletion of beta-catenin in NSCs in vitro blocks their response to Wnt stimulation (with CHIR99021) but it did not lower the levels of Axin2. From this, we can deduce that Wnt signalling is indeed not significantly activated in proliferating NSCs in vitro, despite the expression of Wnt ligands by these cells (Figure 3). We will perform further analysis of Wnt target genes in control and cKO NSCs in vitro to confirm this observation. Of note, the lack of Wnt signalling activity in NSCs would further support our claim that Wnt is dispensable for their proliferation and maintenance. We will make this point clearer in the manuscript.

      • ChIR treatment on cKO cells.*

      The authors only use WT cells for ChIR treatment. To investigate whether the effect of ChIR come through the beta-catenin signaling pathway, why don't they use cKO NSCs for ChIR treatment (Fig5-7)?

      This is a great suggestion and we are performing these experiments with control and cKO NSCs.

      Different Wnt signaling levels between in vivo and in vitro.

      The authors indicated that different levels of Wnt signaling could results in different outcomes based on in vitro observation. What are the levels of Wnt signaling in vivo compared to in vitro ChIR treatment? Activation of Wnt/beta-catenin in vivo is much weaker than in vitro CHIR treatment, therefore the contribution of Wnt signaling at endogenous levels is negligible? This may help to explain why Wnt/beta-catenin is dispensable in vivo, at least in young state. This can be addressed by probing the levels of downstream targets.

      Levels of Wnt signalling are indeed central to our conclusions and we agree that a comparison of Wnt/beta-catenin signalling levels between our in vitro interventions and the in vivo situation would be important. However, we find that directly comparing the levels of downstream Wnt targets between the two systems might prove challenging due to differences in methodology (immunolabeling is not a reliably quantitative method, especially when performed on such different sample types, with different fixation conditions, etc). We will nevertheless attempt such quantifications using immunolabelings for CcnD1, Axin2 and NeuroD1 both in vivo and in vitro. We also want to point out that CHIR is not the only way in which we have stimulated Wnt signalling in NSCs in vitro. In Figure S5, we demonstrate that treatment with Wnt3a can reactivate quiescent neural stem cell in a dose-dependent manner, showing that the effect of Wnt signalling on NSCs can be achieved also with a more physiological intervention.

      Reviewer #2

      A major challenge is to separate cell adhesion functions of beta-catenin from its function in the canonical Wnt/beta-catenin signaling pathway. The authors tested two different conditional bcat alleles (bcatdel ex2-6 ; bcatdel ex3-6) to delete bcat from stem cells. It is a bit unfortunate that the authors chose to test two conditional alleles that would affect cell adhesion and transcriptional activity instead of the Ctnnb1dm allele (Draganova et al. 2015, Stem Cells), which would have been a cleaner way to specifically address the contribution of beta-catenin transcriptional activity in adult hippocampal neural stem cells. Was there a specific reason not to use the Ctnnb1dm conditional mice? Please comment / discuss.

      We agree with the reviewer that the Ctnnb1dm allele would better differentiate between cell adhesion and transcriptional effects of beta-catenin deletion. However, as we see no effect of beta-catenin deletion, we did not find it necessary to further dissect the differential contribution of cell adhesion and the Wnt/beta-catenin pathway in this particular case. We will add a comment on this point to the discussion.

      The authors control for downregulation of beta-catenin signaling activity in the bcatdel ex2-6 through the analysis of the BATGAL reporter. 30 days after recombination, they observe a drop in reporter activity (from 31% to 13%). While this drop shows that at the time of analysis beta-catenin signaling activity was reduced, the lack of complete downregulation of reporter activity raises the issue whether long-term stability of the b-catenin protein may be a confounding factor at this time-point. In particular effects of b-catenin on the DCX population, which to a significant extent is generated several days to weeks before the time-point of analysis, may not be revealed. Data on the time-course of downregulation of the BATGAL reporter could help for the interpretation of the data as would analysis of beta-catenin protein levels in recombined cells. In addition, analysis of bcatdel ex2-6 at a later time-point after recombination, at which beta-catenin signaling activity is further downregulated, would strengthen the surprising finding that loss of beta-catenin signaling activity does not hamper neuronal differentiation in the adult hippocampus.

      We will monitor the disappearance of beta-catenin using immunohistochemistry for beta-catenin and downstream targets of Wnt in control and cKO brains, both at P90 and at a longer time after deletion (P150), as the reviewer suggests. Of note, when we deleted beta-catenin in vitro in NSCs, we could confirm the disappearance of the protein by 48 hours, and therefore beta-catenin stability cannot explain the lack of effect of the deletion (Figure S3B).

      Was quantification performed only in recombined (i.e., reporter positive) cells or in recombined and non-recombined cells? I could not locate that information. Given the evidence for feed-back regulation from intermediate precursor cells / immature neurons to stem cells (e.g. Lavado et al. 2010, Plos Biology), it is important to separately evaluate the development of recombined and non-recombined cells to evaluate the behavior of beta-catenin signaling deficient stem cells.

      The quantifications were always performed in YFP+ recombined cells. The efficiency of recombination was very high (from 83 to 97%), therefore allowing no room for confounding effects of unrecombined cells. We will convey this information in a clearer way in our revised manuscript.

      Reports from (Kuwabara et al. 2009, Nat Neurosci), (Gao et al. 2009, Nat Neurosci) and (Karalay et al. 2011, PNAS) suggest that beta-catenin signaling activity drives dentate granule neuron identity through regulating the expression of Neurod1 and Prox1. Given that in these studies neither loss of Neurod1 nor of Prox1 affects neuronal fate commitment but long-term survival and that the studies by (Gao et al. 2007, J Neurosci) and (Heppt et al. 2020, EMBO J) suggest that loss-of-beta-catenin affects neuronal survival, it may be interesting to evaluate a) whether a dentate granule neuron identity, b) long-term survival of adult generated neurons are affected. At the minimum these studies should be more extensively discussed.

      As mentioned in our response summary, our main aim is to test the effects of Wnt/beta-catenin signalling on NSCs. Nevertheless, these are excellent suggestions and we are currently performing immunohistochemistry for NeuroD1 and Prox1 to test whether they are downregulated in cKO brain samples. We have also performed a longer deletion of beta-catenin (deletion at P60 and analysis at P150) to test whether neurogenesis is affected in the cKO mice in the longer term.

      It has been suggested that the neural stem cell population in the adult hippocampus may be heterogenous with one population being responsible for baseline neurogenesis and being resistant to age-associated depletion and a second population driving high levels of neurogenesis in young adults (see also Urban, Bloomfield and Guillemot 2019, Neuron). The observation that beta-catenin signaling is only active in a small fraction of stem cells and their progeny raises the question whether it fulfills only a function in a specific subpopulation. Such possibility should at least be discussed.

      This is a very interesting point, which we will include in the discussion of our revised manuscript. We are also performing immunohistochemistry for Id4 together with beta-catenin or downstream targets of Wnt and NSC markers to determine whether the resting population (described in Urban et al. 2016 and Harris et al. 2021), which has low levels of Id4 is more responsive to Wnt than the dormant population.

      The recently published studies by (Rosenbloom et al. 2020, PNAS) and (Heppt et al. 2020, EMBO J) strongly suggest that beta-catenin signaling dynamics are critical for the regulation / modulation of adult hippocampal neurogenesis. The aspect of beta-catenin signaling dynamics should be discussed.

      We will include a discussion of beta-catenin signalling dynamics in the revised version of the manuscript.

      **Significance:**

      Adult neurogenesis is considered an important factor in hippocampal plasticity and its disturbance is thought to contribute to the pathogenesis in several psychiatric and degenerative diseases. Wnt/beta-catenin signaling is considered central to the regulation of adult hippocampal neurogenesis. In this regard, the manuscript describes the potentially very important and surprising finding that deletion of beta-catenin from neural stem cells does not generate major neurogenesis phenotypes. The concern with the present manuscript is, that the lack of phenotype requires additional analyses to exclude that phenotypes develop with a delay because of long-term stability of the beta-catenin protein.

      We believe the revisions outlined above will address these concerns.

      The significance of the manuscript and its interest to a wider audience would in addition be greatly enhanced, if the authors could provide some mechanistic data that would explain the discrepancies between published functions of Wnt/beta-catenin-signaling dependent regulation of neurogenesis and their own findings. The manuscript would also gain significance if the authors would provide solid data for their interesting hypothesis that beta-catenin-signaling contributes to the regulation of adult hippocampal neurogenesis in response to extrinsic stimuli. In this regard one potential approach would be to analyse whether extrinsic stimuli such as running would be able stimulate the activation of stem cells.

      Both finding a mechanism to explain the observed discrepancies and demonstrating that Wnt has a role in the response of NSCs to extrinsic stimuli are excellent follow-up suggestions to our work and we thank the reviewer for these recommendations. However, addressing these points would take many months (if not years) and is not necessary to support the current conclusions of our work. We therefore believe they are out of the scope of this current manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Wnt/beta-catenin signaling is considered central to the regulation of adult hippocampal neurogenesis. In this manuscript Austin and colleagues interrogate the function of beta-catenin-dependent signaling using in vivo beta-catenin conditional knockout and gain-of-function approaches combined with in vitro pharmacological and genetic approaches. The authors confirm previous reports of Wnt/beta-catenin signaling in adult hippocampal neurogenesis and report the surprising findings that • Deletion of beta-catenin from stem cells does not affect stem cell numbers and their activation / proliferation in vivo and in vitro • Deletion of beta-catenin from stem cells does not affect neuronal differentiation in vivo and in vitro Moreover, the authors show that expression of a stabilized form of beta-catenin affects stem cell positioning in vivo and that the effects of treatment of cultured hippocampal stem/progenitor cells with a pharmacological stimulator of Wnt/beta-catenin signaling are dose and time-dependent. The authors discuss that their findings suggest that Wnt/beta-catenin signaling is dispensable for neural stem cell homeostasis and that Wnt/beta-catenin signaling may have a function in the response of stem cells to external stimuli.

      Comments:

      A major challenge is to separate cell adhesion functions of beta-catenin from its function in the canonical Wnt/beta-catenin signaling pathway. The authors tested two different conditional bcat alleles (bcatdel ex2-6 ; bcatdel ex3-6) to delete bcat from stem cells. It is a bit unfortunate that the authors chose to test two conditional alleles that would affect cell adhesion and transcriptional activity instead of the Ctnnb1dm allele (Draganova et al. 2015, Stem Cells), which would have been a cleaner way to specifically address the contribution of beta-catenin transcriptional activity in adult hippocampal neural stem cells. Was there a specific reason not to use the Ctnnb1dm conditional mice? Please comment / discuss.

      The authors control for downregulation of beta-catenin signaling activity in the bcatdel ex2-6 through the analysis of the BATGAL reporter. 30 days after recombination, they observe a drop in reporter activity (from 31% to 13%). While this drop shows that at the time of analysis beta-catenin signaling activity was reduced, the lack of complete downregulation of reporter activity raises the issue whether long-term stability of the b-catenin protein may be a confounding factor at this time-point. In particular effects of b-catenin on the DCX population, which to a significant extent is generated several days to weeks before the time-point of analysis, may not be revealed. Data on the time-course of downregulation of the BATGAL reporter could help for the interpretation of the data as would analysis of beta-catenin protein levels in recombined cells. In addition, analysis of bcatdel ex2-6 at a later time-point after recombination, at which beta-catenin signaling activity is further downregulated, would strengthen the surprising finding that loss of beta-catenin signaling activity does not hamper neuronal differentiation in the adult hippocampus.

      Was quantification performed only in recombined (i.e., reporter positive) cells or in recombined and non-recombined cells? I could not locate that information. Given the evidence for feed-back regulation from intermediate precursor cells / immature neurons to stem cells (e.g. Lavado et al. 2010, Plos Biology), it is important to separately evaluate the development of recombined and non-recombined cells to evaluate the behavior of beta-catenin signaling deficient stem cells.

      Reports from (Kuwabara et al. 2009, Nat Neurosci), (Gao et al. 2009, Nat Neurosci) and (Karalay et al. 2011, PNAS) suggest that beta-catenin signaling activity drives dentate granule neuron identity through regulating the expression of Neurod1 and Prox1. Given that in these studies neither loss of Neurod1 nor of Prox1 affects neuronal fate commitment but long-term survival and that the studies by (Gao et al. 2007, J Neurosci) and (Heppt et al. 2020, EMBO J) suggest that loss-of-beta-catenin affects neuronal survival, it may be interesting to evaluate a) whether a dentate granule neuron identity, b) long-term survival of adult generated neurons are affected. At the minimum these studies should be more extensively discussed.

      It has been suggested that the neural stem cell population in the adult hippocampus may be heterogenous with one population being responsible for baseline neurogenesis and being resistant to age-associated depletion and a second population driving high levels of neurogenesis in young adults (see also Urban, Bloomfield and Guillemot 2019, Neuron). The observation that beta-catenin signaling is only active in a small fraction of stem cells and their progeny raises the question whether it fulfills only a function in a specific subpopulation. Such possibility should at least be discussed.

      The recently published studies by (Rosenbloom et al. 2020, PNAS) and (Heppt et al. 2020, EMBO J) strongly suggest that beta-catenin signaling dynamics are critical for the regulation / modulation of adult hippocampal neurogenesis. The aspect of beta-catenin signaling dynamics should be discussed.

      Significance

      Significance:

      Adult neurogenesis is considered an important factor in hippocampal plasticity and its disturbance is thought to contribute to the pathogenesis in several psychiatric and degenerative diseases. Wnt/beta-catenin signaling is considered central to the regulation of adult hippocampal neurogenesis. In this regard, the manuscript describes the potentially very important and surprising finding that deletion of beta-catenin from neural stem cells does not generate major neurogenesis phenotypes. The concern with the present manuscript is, that the lack of phenotype requires additional analyses to exclude that phenotypes develop with a delay because of long-term stability of the beta-catenin protein.

      The significance of the manuscript and its interest to a wider audience would in addition be greatly enhanced, if the authors could provide some mechanistic data that would explain the discrepancies between published functions of Wnt/beta-catenin-signaling dependent regulation of neurogenesis and their own findings. The manuscript would also gain significance if the authors would provide solid data for their interesting hypothesis that beta-catenin-signaling contributes to the regulation of adult hippocampal neurogenesis in response to extrinsic stimuli. In this regard one potential approach would be to analyse whether extrinsic stimuli such as running would be able stimulate the activation of stem cells.

      Expertise:

      Adult neurogenesis, stem cell biology, signaling

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Wnt/beta-catenin signaling has been studies in the context of adult neurogenesis for decades. It has been shown that modulation of Wnt signaling regulates adult neurogenesis, but the consequences were not always consistent. In this study, the authors developed conditional knockout mouse lines to test whether beta-catenin is essential for the regulation of adult neurogenesis.

      First, using a published single cell seq-data and a reporter TG mouse system, they validated the expression of Wnt-pathway molecules in qNSCs and active NSCs. Then, beta-catenin conditional cKO mice were analyzed. The authors did not find any changes in total number of NSCs, the activation of NSCs, and the number of IPCs as well as neuroblasts. Subsequently, using in vitro culture system, the authors addressed if the proliferation and differentiation are affected in vitro conditions. Both proliferation and activation from the quiescent state were not affected in cKO NSCs. Finally, they demonstrated that an artificial stimulation of Wnt signaling by CHIR can induce differentiation or proliferation depending on cellular states and doses, thus NSCs can respond to Wnt signaling. Based on these data, they concluded that beta-catenin is dispensable for the maintenance/activation of NSCs in vivo, although NSCs can respond to Wnt/beta-catenin signaling. Overall, the results are reliable and important for the field. However, several points need to be addressed and clarified to support their conclusion. I am hopeful that the authors find my comments helpful and constructive.

      1. Validation of cKO in vivo. Although the authors validated cKO of beta-catenin in vivo using FACS/qPCR at the transcript level, it would be important to check when and to what extent beta-catenin proteins are downregulated in qNSC/activeNSCs in vivo. This will be easily assessed by immunohistochemistry. In the same line, although the authors confirmed the reduction of beta-catenin signaling using beta-gal signaling in cKO mice, it would be important to check if this can be cross-checked by staining the nuclear localization of beta-catenin. This confirmation would strength the authors statement and clear that some remained beta-catenin at the plasma membrane may not be compensating their function. Independent of the confirmation of beta-catenin cKO, it would be important to check if the downstream targets of Wnt/beta-catenin signals (ex. Expression of Axin2) were also attenuated. This point should be addressed both in vivo and in vitro.
      2. Wnt/beta-catenin signals in qNSC and active NSC in vitro The authors indicated that the depletion of beta-catenin had no effect on qNSCs and active NSCs in vitro. However, it is not clear whether Wnt/beta-catenin signaling is activated in their culture conditions. If there are no inputs of Wnt signaling in cultured cells, the depletion of beta-catenin will not lead any impacts. Therefore, it would be critical to check if the Wnt-signaling is activated in control cells in their culture condition, and if the downstream targets of Wnt-signaling are downregulated in cKO qNSCs/active NSCs.
      3. ChIR treatment on cKO cells The authors only use WT cells for ChIR treatment. To investigate whether the effect of ChIR come through the beta-catenin signaling pathway, why don't they use cKO NSCs for ChIR treatment (Fig5-7)?
      4. Different Wnt signaling levels between in vivo and in vitro<br> The authors indicated that different levels of Wnt signaling could results in different outcomes based on in vitro observation. What are the levels of Wnt signaling in vivo compared to in vitro ChIR treatment? Activation of Wnt/beta-catenin in vivo is much weaker than in vitro CHIR treatment, therefore the contribution of Wnt signaling at endogenous levels is negligible? This may help to explain why Wnt/beta-catenin is dispensable in vivo, at least in young state. This can be addressed by probing the levels of downstream targets.

      Significance

      Significant.

      A genetic approach to address the role of Wnt/Beta-catenin signaling is critical for the field. The audience would be interested if this study make it clear previously reported discrepancy.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers

      We first thank Review Commons for recruiting such knowledgeable reviewers to comment on our manuscript. We appreciate their diverse set of useful and constructive comments, which should help us improve the manuscript substantially. Please see our response to each reviewer’s comments below.

      Reviewer #1:

      **Summary:** The authors describe a useful modified fluctuation assay that couples conventional mutation rate analysis with mutational spectrum characterization of forward mutations at the S. cerevisiae CAN1 locus. They nicely showed that wild yeast isolates display a wide range of mutation rates with strains AAR and AEQ displaying rates ~10-fold higher than the control lab strain. These two strains also showed a bias for C>A mutations, and were the only strains analyzed that had a mutation spectrum statistically different from the lab control. Together, these data provide a compelling proof-of-principle of the applicability of the modified fluctuation analysis approach described in this manuscript. Overall, the manuscript is very well written, and the work reported in it does represent a valuable contribution to the field. However, two primary shortcomings were identified that can be addressed to strengthen the conclusions prior to publication. Both points described below pertain to the analysis of the possible C>A specific mutator phenotype in strains AAR and AEQ.

      Response:

      We thank the reviewer for this positive response. We have made a plan, detailed below, to address the shortcomings the reviewer has highlighted.

      **Major comments:**

      1. The work presented in the manuscript does suggest that these two haploids are likely to display the C>A mutator phenotype. Yet, the authors fell short of providing a full and unambiguous demonstration that would elevate the significance of their discovery. They could have directly tested the predicted C>A specific mutator phenotype by conducting additional experiments, one of which is relatively simple. Specifically, they could have performed a simple reversion-based mutation assay to validate the reported C>A mutator phenotype displayed by AAR and AEQ. For example, into AAR, AEQ, and a wild type control, the authors could introduce an engineered auxotrophic marker allele (e.g., ura3 mutation) caused by an A to C substitution, which upon mutation back to A restores prototrophic growth in minimal media (ie. reversion from ura3-C to URA3-A). Such specific reversible allele should be relatively easy to integrate into the AAR and AEQ genomes, as well as in the control strain. Based on the authors' prediction, AAR and AEQ should display a very large increase (far higher than 10 fold) in the reversion rate when compared to a control haploid. To demonstrate the specificity of the mutation spectrum, the authors could test the reversion rates of a different engineered allele requiring a reversion mutation in the opposite direction (ie. reversion from ura3-A to URA3-C). If the AAR and AEQ mutator is specific C>A, one would predict that all three strains should have similar mutation rates for a reversion in the A>C direction. This additional genetic work would thoroughly validate the central discovery and would reinforce the usefulness of the method described in the manuscript.

      Alternatively, a conventional mutation accumulation and whole genome re-sequencing experiment with parallel lines of AAR, AEQ and a control strain would also very effectively validate the C>A mutator prediction, and it would also answer the authors' discussion point about specificity to the CAN1 locus. However, it would be more costly and much more time consuming.

      Response:

      We thank the reviewer for these detailed, clear suggestions regarding additional methodology for further validating our results. We appreciate that parallel independent validations always add credibility to unexpected results like the ones presented in our manuscript. We’ve been considering these suggestions seriously, but our concern is that it is much less straightforward to engineer the genomes of these wild yeast than one might expect based on experiments with standard laboratory strains. Unforeseen roadblocks related to the biology of AAR and AEQ could end up making the URA3 reversion assay take even longer than an MA study. As we understand it, the two main concerns that might necessitate this additional undertaking are that either our novel assay for ascertaining mutations in CAN1 doesn’t work properly, or that the mosaic beer strains mutate significantly differently outside CAN1. Below we describe revisions to the text that we think will clearly represent these caveats and the relatively modest uncertainty associated with them.

      To further justify the soundness of our claim that AAR and AEQ have distinctive mutation rates and spectra, we plan to add additional discussion of the validation approaches that are presented in the manuscript to verify the accuracy of our pipeline. Although the ability of fluctuation assays to estimate mutation rates is well established, the identification of the spectra using our next-generation-sequencing-based pipeline is novel, so we used Sanger sequencing to validate the exact de novo mutations it ascertained in a select control strain. Our Sanger sequencing test found our assay to have an undetectably low false positive rate and a false negative rate that was much too low to account for the differences we measured between AAR, AEQ, and the standard lab strains. The fact that we also observed similar mutation spectra from control lab strains used in previous CAN1-based studies further demonstrates the reliability of our method, and it is notable that most natural isolates were measured to have very similar mutation spectra to lab strains (Figure 4 and Supplementary Figure S8-S9). We agree that further validation would be needed to read much into the more subtle differences in mutation rates and spectra that we saw hints of between other strains, and for that reason, we focused this paper on the differences that well exceed what we measured to be our measurement pipeline’s margin of error.

      It is true that the genome-wide mutation rate might differ somewhat from the mutation rate at the CAN1-locus, but the mutation spectrum at the CAN1 locus measured in a previous study (Lang and Murray, 2008) was very similar to the genome-wide mutation spectra obtained from MA studies (Sharp et al., 2018), with just a small overall increase of mutations with C/G nucleotides (the second to last paragraph on page 17 and Supplementary Figure S13). Moreover, we have avoided making any claims of seeing distinct mutation rates or spectra based on “apples-to-oranges” comparisons between mutation spectra measured at CAN1 and spectra measured across the whole genome.

      We also note that the enrichment of C>A mutations in AEQ and AAR is not only observed from our de novo mutation data in CAN1, but also seen in rare natural polymorphisms genome-wide (Figure 1B, 5A,B). Rare natural polymorphisms are recent mutations that occurred during the history of the strain, and the fact that they disproportionately enrich in C>A mutations in these strains indirectly shows that the C>A enrichment occurs not only at CAN1, as measured in our experiment, but has also been occurring during natural mutation accumulation genome-wide.

      The second concern is in regard to the relatively extensive conclusions drawn about the possible evolutionary significance of the possible C>A mutator in AAR and AEQ. The authors should be more cautious and conservative in the proposed interpretation. As the authors note:

      'Three of the four C>A-enriched mosaic beer strains, AAR, AEQ, and SACE_YAG, are all haploid derivatives of the [highly heterozygous] diploid Saccharomyces cerevisiae var diastaticus strain CBS1782, which was isolated in 1952 from super-attenuated beer.'

      From this statement, and because the paper cited provided few details on the isolation of CBS1782, it is presumed that these haploid derivatives were most likely isolated as recombinant spores. Furthermore, it is unclear when this isolation occurred, and for how many generations strains AAR and AEQ have been propagated in a haploid state.

      Herein lies a critical point: AAR and AEQ were recently derived from a diploid background with a "high level of heterozygosity". In a heterozygous diploid context, deleterious point mutations (and any resulting mutator phenotypes) would likely be masked by the presence of wild-type alleles. Now, as haploids, they express a novel genotype (i.e., combination of defective or incompatible parental alleles), which manifests as a mutator phenotype. In this respect, AAR and AEQ appear analogous to the spore derivatives of the incompatible cMLH1-kPMS1 isolate referred to in the manuscript as a notable exception. The analysis of strains harboring incompatible MLH1-PMS1 mutations by Raghavan et al. demonstrated that the heterozygous diploid parents were not themselves mutators, but that haploid spores which had inherited the pair of incompatible alleles displayed mutator phenotype. Collectively, while it can certainly be argued that the strains AAR and AEQ (like the MLH1/PMS1 incompatible strains) are mutators now, this fact alone does not support the conclusion that they have adapted to survive the expression of an extant mutator phenotype. This premise could be tested by analyzing the mutation rates/spectra of four new spores derived from a single tetrad of CBS 1782. Do the four sibling spores display similar or different mutational rates and spectra? If all four spores from a single tetrad exhibit the 10-fold increase in CAN1 mutation rate and the C>A transversion bias, then it can be inferred that the diploid parent is also a mutator in the same manner. Further direct analysis of mutation rates and spectrum in the parent diploid CBS 1782 would complete the work. This finding would be quite significant, and would provide strong evidence that wild strains can in fact tolerate the expression of a chronic mutator allele.

      Response:

      We thank the reviewer for suggesting additional study of the ancestral diploid strain CBS 1782, and we agree this could add a lot to the manuscript, especially given the high level of heterozygosity in the diploid and the link to the previous MLH1-PMS1 incompatibility story. We have obtained a sample of CBS 1782 and plan to knock out its HO locus using CRISPR, perform tetrad dissection of spores freshly derived from the diploid, and then measure mutation rates and spectra in all four segregants derived from a single tetrad (provided that all four spores end up growing). We plan to collect and sequence about 50 mutations to get qualitative results on the mutation rates and spectra of these segregants. We also plan to sequence the whole genome of the strain CBS 1782 and examine polymorphisms together with the 1011 strains to check for any signal of C>A enrichment. We recognize that our pipeline as currently implemented will not let us directly measure the mutation spectrum of the diploid, which is inaccessible to our pipeline given its two functional copies of CAN1 and the recessive nature of canavanine resistance. That being said, the elevation of the C>A fraction in natural polymorphisms found in AAR and AEQ provides evidence for prolonged activity of the mutator phenotype in the wild and/or in the domesticated environment from which CBS 1782 was derived. However, we acknowledge we have limited information about how these haploids were propagated before they were banked.

      **Minor comments:** A final, relatively minor point. That the new haploids AAR and AEQ show distinct mutation rates and spectra opens the door to an interesting line of inquiry, which may help to identify the causative mutator allele in a manner more efficient than searching for missense mutations. It is stated, and it is understandable, that the identification of the possible causal mutations is beyond the scope of the present manuscript. In this spirit, it would be much more appropriate to restrict such considerations to the Discussion section. Specifically, while the authors make a plausible case for OGG1 being a candidate gene responsible for the C>A mutator phenotype, no experimental demonstration was attempted. As such, that text segment should be moved from the Results to the Discussion section.

      Response:

      We agree with the reviewer of lacking genetic evidence on OGG1 in the current manuscript and we will move that section from the results to the discussion. Future work is underway to test and identify the causal loci for the mutator phenotype.

      Reviewer #1 (Significance (Required)): As stated in the summary section above, the manuscript by Jiang et al represents a substantial contribution to the fields of genome stability and genome evolution. The method described is likely to be useful beyond budding yeast. The work will be appreciated by a broad audience of geneticists. The additional work and text modifications proposed above would likely further elevate the impact of this work.

      Response:

      We are very grateful for this generous assessment and we likewise hope our planned revisions will further elevate the paper’s potential impact.

      Reviewer #2:

      Mutation is a fundamental force in organismal evolution, and therefore understanding the evolution of mutational mechanisms are important in evolutionary studies. In this manuscript, the authors used strains of S. cerevisiae as a model system to study the variations of rates and spectra in mutations with bioinformatic and experimental approaches. First, the authors analyzed the polymorphism data from 1011 strains by PCA analysis and show the variations in spectra. Second, the authors used fluctuation test combined with deep sequencing of the resistance gene to identify mutation rates and spectra in 18 strains, which show ~10-fold mutation rate variations and increased C-to-A mutations in two strains.

      For the second part, the experimental procedures and statistical analysis are mostly solid. For the first part, as what authors said in the introduction, polymorphism is not equal to the mutation spectra. I think the authors did a good job by being cautious in the wording and having no over-inference after the analysis. It is thus inevitable that the conclusion of this part sounds mostly descriptive. The overall writing is very clear. I will recommend the publication in field-specific journals.

      Response:

      We thank the reviewer for these positive comments. We will address each minor point below.

      **Minor comments:** P9 - It is very hard to not wonder how the 16 strains were picked in the fluctuation tests. Some comments on that will be appreciated. E.g., was that informed by the results of Fig 1?

      Response:

      We actually did not pick strains based on the results of Figure 1, one reason being that the CAN1 reporter method only works on haploid strains with a canavanine sensitivity phenotype. We also restricted our analysis to strains without known aneuploidies to maximize our ability to accurately measure the spectra of the strains’ polymorphisms. When possible, given these constraints, we included at least two randomly selected strains from each clade of the 1011 collection whenever possible. These constraints are currently explained on the second to last paragraph on page 9, and will be explained in more detail in revision.

      P17- In the paragraph "natural selection might contribute ..." , is there any example of "certain mutation types are more often beneficial than others"?

      Response:

      One example of this is that transitions are more often synonymous than transversions are (Freeland and Hurst, 1998), and mutations that create or destroy CpG sites are more likely to alter gene regulation than other mutation types are (in species other than yeast where CpGs are methylated). We recognize that these effects are likely not large, which is one reason we don’t think natural selection is a great explanation for mutation spectrum difference among groups.We will mention these examples explicitly in the revised text.

      P20 - Extra ')' in the sentence "Adjacent indels were merged if their frequencies differed by less than 10%)."

      Response:

      We will fix this in revision.

      In the discussion, it might be good to add a paragraph to compare the rate and spectra reported here and the ones found by MA and then NGS approach(e.g., Zhu et al. 2014).

      Response:

      We’ll be sure to add a reference to the Zhu et al. (2014) spectrum in the discussion, extending our existing comparison of mutation spectra previously reported using CAN1 (Lang and Murray, 2008) and the MA approach (Sharp et al., 2018) (currently discussed on the second to last paragraph on page 17, Supplementary Figure S13). Our CAN1 method also obtains results that are consistent with the Lang et al 2008 study on the same control strain (the last paragraph on page 11).

      Reviewer #2 (Significance (Required)): The significance of this manuscript will be relatively specific to evolutionary biologists and geneticists, especially those who use yeasts as a model system. For example, I expect the variation of mutation rates and spectra found in this manuscript will impact the following population-genetic analysis in this collection of 1011 strains and motivate more studies on the molecular machineries which affect mutation rates and spectra.

      In addition, in terms of methodological novelty, adding a novel step of reporter-gene sequencing is a reasonable way to get some information on mutation spectra as it is less labor-intensive than NGS of MAs. Other statistical or experimental procedures in this manuscript mostly follow the approaches which have been developed in previous literature and thus show not much novelty.

      Response:

      We thank the reviewer for this positive assessment. Since evolutionary biology, population genetics, and model organism genetics are three of eLife’s major focus areas, we are hoping to communicate our results to this journal’s broad audience rather than restrict ourselves to a journal focusing too narrowly on just one of these focus areas.

      Reviewer #3:

      **Summary** The authors show that certain yeast strains have altered mutation rates/bias. The study is well motivated, genetic variation in mutation rates are not easily uncovered, and capitalizes on yeast and a high-throughput mutation rate/bias method that validates findings of C>A bias from yeast polymorphism data. The results are solid and clearly presented and I have no major concerns.

      Response:

      We are very grateful for this positive response. Please find our response to each minor comment below.

      **Major comments** None

      **Minor comments** Should have comma: "In addition, environmental ..."

      Response:

      We will fix this in revision.

      Using S. paradoxus to classify derived vs ancestral alleles may not work as well as allele frequency. A 1/100 rare variant is 100x more likely derived than common variant. But with S. paradoxus divergence of say 5%, 5% polymorphic sites are misclassified or NA. Of course, since you used both, this is not a concern. But the number of variants included/excluded in each analysis should be reported. Also, I was a bit surprised that the rare variants are more noisy since most variants are rare.

      Response:

      We agree that the heuristic of classifying rare alleles as derived will do the right thing the majority of the time, but this could potentially create artifactual differences between the mutation spectra of different populations because the exact ratio of rare derived alleles to common derived alleles depends on the population’s demographic history and true site frequency spectrum. If two populations had the same mutation spectrum but very different proportions of variants that are polarized incorrectly, this could create the appearance of a mutation spectrum difference where none exists. In the revision, we will be sure to report the total number of variants filtered because of the variation present in S. paradoxus.

      The reviewer is right to point out that rare variants are generally more abundant than common variants, but this pattern is much more pronounced in a species like humans that has undergone recent population expansion than it appears to be in S. cerevisiae, which appears to have a higher proportion of older, shared variation. We hope this clarifies why the rare variant mutation spectrum PCA appears noisier than the plot made from variation across more frequency categories.

      In regards to variation in mutation rate based on canavanine resistanct. There is a caveat that some strains may be more canavanine resistant - due to differences in transporter abundanced or some other aspect of metabolism. Thus, the same mutation would survive and grow (barely) in one strain background, but not another. This caveat is very unlikely to have much of an impact but it would be worth discussing.

      Response:

      Thanks for pointing this out. We also considered the possibility that our mutation rate estimates could be confounded by slight differences in canavanine resistance between strains, and will address this point in the discussion.

      The explanation for synonymous mutations is hitchhikers or errors. However, they could also disrupt translation, here's one possibility PMC4552401.

      Response:

      Thanks for pointing this out. We will expand our statement on the possible significance of synonymous mutations to include modification of transcription and translation efficiency.

      Are there CAN allele differences between strains? If there are some, it might be worth mentioning why you do/don't think this influences the mutation rate. E.g. CGG is one step from stop but CGT is not.

      Response:

      The reviewer makes a good point that there are segregating differences among these strains in the sequence of CAN1. We plan to add an analysis where we calculate the number of opportunities for missense mutations and nonsense in each strain, as a function of its CAN1 sequence, to put a bound on the amount that these differences could affect our estimates of mutation rates in each strain.

      For the allele counts in Figure 5B. 2 indicates a variant is present in one strain so there are only 9 mutations present in AAR and not found in ANY other strain or just not found in the four listed? Likewise AAR has 36 for count 4, meaning that there are 36 variants present in AAR and one other strain, where other strains are just the 4 shown in the table, or other strains being any of the 1011?

      Response:

      The allele count in Figure 5B represents the number of times the derived allele is present in the whole population. In this case, the whole population refers to the 1011 strains minus 336 strains that are so closely related to other strains in the panel that they are effectively duplicates. An allele of count 2 might be homozygous in AAR and absent from all other strains, or present as one heterozygous copy in AAR as well as one heterozygous copy in another strain. We will explain this more clearly in the revised manuscript.

      "To our knowledge, this is one of the first" This is an odd way to put it and could be rephrased. As it stand you are either the first and not knowledgeable or knowledgeable and not the first.

      Response:

      Thanks. We will revise this to state that to our knowledge, we are the first to report such a discovery.

      "humans, great apes, .." Could you put the citations in the discussion too. I was a little surprised there was no mention of C>A bias as it relates to studies in bacteria and cancer, where there has been a lot of work on mutational spectra. A comment on this literature or whether the C>A biases are not found elsewhere would be nice.

      Response:

      We will add citations and discussion of bacteria and cancer in the revised manuscript. The reviewer is right to point out that C>A mutations do come up in cancer signatures, for example in familial adenomatous polyposis disorders where excision repair of 8-oxoguanine is compromised.

      Reviewer #3 (Significance (Required)):

      I am an evolutionary geneticist with expertise in genomics and bioinformatics. In addition to reviewing papers I also regularly handle papers as an editor. The manuscript provides rare insight into population variation in mutation rates. While differences in mutational biases are well known between species and in some cases within a species, we typically don't know what causes this biases. Environmental factors are often thought to be involved; this work clearly shows that genetic (mutator strains) exist and impact polymorphism in yeast. The manuscript does a nice job in the introduction of explaining the background on mutation rate research and motivation for the work. It also clear explains the advantage of an experimental highthroughput mutation rate/spectra approach. Thus, I believe this new angle on a long-standing problem will be of interest to the community of evolutionary geneticists outside of yeast researchers.

      Response:

      We appreciate this very generous assessment, thank you!

      Reference

      Freeland, S. J. and Hurst, L. D. (1998) ‘The genetic code is one in a million’, Journal of molecular evolution, 47(3), pp. 238–248.

      Lang, G. I. and Murray, A. W. (2008) ‘Estimating the Per-Base-Pair Mutation Rate in the Yeast Saccharomyces cerevisiae’, Genetics, 178(1), pp. 67–82.

      Sharp, N. P. et al. (2018) ‘The genome-wide rate and spectrum of spontaneous mutations differ between haploid and diploid yeast’, Proceedings of the National Academy of Sciences of the United States of America, 115(22), pp. E5046–E5055.

      Zhu, Y. O. et al. (2014) ‘Precise estimates of mutation rate and spectrum in yeast’, Proceedings of the National Academy of Sciences of the United States of America, 111(22), pp. E2310–8.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The authors show that certain yeast strains have altered mutation rates/bias. The study is well motivated, genetic variation in mutation rates are not easily uncovered, and capitalizes on yeast and a high-throughput mutation rate/bias method that validates findings of C>A bias from yeast polymorphism data. The results are solid and clearly presented and I have no major concerns.

      Major comments

      None

      Minor comments

      Should have comma: "In addition, environmental ..."

      Using S. paradoxus to classify derived vs ancestral alleles may not work as well as allele frequency. A 1/100 rare variant is 100x more likely derived than common variant. But with S. paradoxus divergence of say 5%, 5% polymorphic sites are misclassified or NA. Of course, since you used both, this is not a concern. But the number of variants included/excluded in each analysis should be reported. Also, I was a bit surprised that the rare variants are more noisy since most variants are rare.

      In regards to variation in mutation rate based on canavanine resistanct. There is a caveat that some strains may be more canavanine resistant - due to differences in transporter abundanced or some other aspect of metabolism. Thus, the same mutation would survive and grow (barely) in one strain background, but not another. This caveat is very unlikely to have much of an impact but it would be worth discussing.

      The explanation for synonymous mutations is hitchhikers or errors. However, they could also disrupt translation, here's one possibility PMC4552401.

      Are there CAN allele differences between strains? If there are some, it might be worth mentioning why you do/don't think this influences the mutation rate. E.g. CGG is one step from stop but CGT is not.

      For the allele counts in Figure 5B. 2 indicates a variant is present in one strain so there are only 9 mutations present in AAR and not found in ANY other strain or just not found in the four listed? Likewise AAR has 36 for count 4, meaning that there are 36 variants present in AAR and one other strain, where other strains are just the 4 shown in the table, or other strains being any of the 1011?

      "To our knowledge, this is one of the first" This is an odd way to put it and could be rephrased. As it stand you are either the first and not knowledgeable or knowledgeable and not the first.

      "humans, great apes, .." Could you put the citations in the discussion too. I was a little surprised there was no mention of C>A bias as it relates to studies in bacteria and cancer, where there has been a lot of work on mutational spectra. A comment on this literature or whether the C>A biases are not found elsewhere would be nice.

      Significance

      I am an evolutionary geneticist with expertise in genomics and bioinformatics. In addition to reviewing papers I also regularly handle papers as an editor. The manuscript provides rare insight into population variation in mutation rates. While differences in mutational biases are well known between species and in some cases within a species, we typically don't know what causes this biases. Environmental factors are often thought to be involved; this work clearly shows that genetic (mutator strains) exist and impact polymorphism in yeast. The manuscript does a nice job in the introduction of explaining the background on mutation rate research and motivation for the work. It also clear explains the advantage of an experimental highthroughput mutation rate/spectra approach. Thus, I believe this new angle on a long-standing problem will be of interest to the community of evolutionary geneticists outside of yeast researchers.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Mutation is a fundamental force in organismal evolution, and therefore understanding the evolution of mutational mechanisms are important in evolutionary studies. In this manuscript, the authors used strains of S. cerevisiae as a model system to study the variations of rates and spectra in mutations with bioinformatic and experimental approaches. First, the authors analyzed the polymorphism data from 1011 strains by PCA analysis and show the variations in spectra. Second, the authors used fluctuation test combined with deep sequencing of the resistance gene to identify mutation rates and spectra in 18 strains, which show ~10-fold mutation rate variations and increased C-to-A mutations in two strains.

      For the second part, the experimental procedures and statistical analysis are mostly solid. For the first part, as what authors said in the introduction, polymorphism is not equal to the mutation spectra. I think the authors did a good job by being cautious in the wording and having no over-inference after the analysis. It is thus inevitable that the conclusion of this part sounds mostly descriptive. The overall writing is very clear. I will recommend the publication in field-specific journals.

      Minor comments:

      P9 - It is very hard to not wonder how the 16 strains were picked in the fluctuation tests. Some comments on that will be appreciated. E.g., was that informed by the results of Fig 1?

      P17- In the paragraph "natural selection might contribute ..." , is there any example of "certain mutation types are more often beneficial than others"?

      P20 - Extra ')' in the sentence "Adjacent indels were merged if their frequencies differed by less than 10%)." In the discussion, it might be good to add a paragraph to compare the rate and spectra reported here and the ones found by MA and then NGS approach(e.g., Zhu et al. 2014).

      Significance

      The significance of this manuscript will be relatively specific to evolutionary biologists and geneticists, especially those who use yeasts as a model system. For example, I expect the variation of mutation rates and spectra found in this manuscript will impact the following population-genetic analysis in this collection of 1011 strains and motivate more studies on the molecular machineries which affect mutation rates and spectra.

      In addition, in terms of methodological novelty, adding a novel step of reporter-gene sequencing is a reasonable way to get some information on mutation spectra as it is less labor-intensive than NGS of MAs. Other statistical or experimental procedures in this manuscript mostly follow the approaches which have been developed in previous literature and thus show not much novelty.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors describe a useful modified fluctuation assay that couples conventional mutation rate analysis with mutational spectrum characterization of forward mutations at the S. cerevisiae CAN1 locus. They nicely showed that wild yeast isolates display a wide range of mutation rates with strains AAR and AEQ displaying rates ~10-fold higher than the control lab strain. These two strains also showed a bias for C>A mutations, and were the only strains analyzed that had a mutation spectrum statistically different from the lab control. Together, these data provide a compelling proof-of-principle of the applicability of the modified fluctuation analysis approach described in this manuscript. Overall, the manuscript is very well written, and the work reported in it does represent a valuable contribution to the field. However, two primary shortcomings were identified that can be addressed to strengthen the conclusions prior to publication. Both points described below pertain to the analysis of the possible C>A specific mutator phenotype in strains AAR and AEQ.

      Major comments:

      1. The work presented in the manuscript does suggest that these two haploids are likely to display the C>A mutator phenotype. Yet, the authors fell short of providing a full and unambiguous demonstration that would elevate the significance of their discovery. They could have directly tested the predicted C>A specific mutator phenotype by conducting additional experiments, one of which is relatively simple. Specifically, they could have performed a simple reversion-based mutation assay to validate the reported C>A mutator phenotype displayed by AAR and AEQ. For example, into AAR, AEQ, and a wild type control, the authors could introduce an engineered auxotrophic marker allele (e.g., ura3 mutation) caused by an A to C substitution, which upon mutation back to A restores prototrophic growth in minimal media (ie. reversion from ura3-C to URA3-A). Such specific reversible allele should be relatively easy to integrate into the AAR and AEQ genomes, as well as in the control strain. Based on the authors' prediction, AAR and AEQ should display a very large increase (far higher than 10 fold) in the reversion rate when compared to a control haploid. To demonstrate the specificity of the mutation spectrum, the authors could test the reversion rates of a different engineered allele requiring a reversion mutation in the opposite direction (ie. reversion from ura3-A to URA3-C). If the AAR and AEQ mutator is specific C>A, one would predict that all three strains should have similar mutation rates for a reversion in the A>C direction. This additional genetic work would thoroughly validate the central discovery and would reinforce the usefulness of the method described in the manuscript.

      Alternatively, a conventional mutation accumulation and whole genome re-sequencing experiment with parallel lines of AAR, AEQ and a control strain would also very effectively validate the C>A mutator prediction, and it would also answer the authors' discussion point about specificity to the CAN1 locus. However, it would be more costly and much more time consuming.

      1. The second concern is in regard to the relatively extensive conclusions drawn about the possible evolutionary significance of the possible C>A mutator in AAR and AEQ. The authors should be more cautious and conservative in the proposed interpretation. As the authors note:

      'Three of the four C>A-enriched mosaic beer strains, AAR, AEQ, and SACE_YAG, are all haploid derivatives of the [highly heterozygous] diploid Saccharomyces cerevisiae var diastaticus strain CBS1782, which was isolated in 1952 from super-attenuated beer.'

      From this statement, and because the paper cited provided few details on the isolation of CBS1782, it is presumed that these haploid derivatives were most likely isolated as recombinant spores. Furthermore, it is unclear when this isolation occurred, and for how many generations strains AAR and AEQ have been propagated in a haploid state.

      Herein lies a critical point: AAR and AEQ were recently derived from a diploid background with a "high level of heterozygosity". In a heterozygous diploid context, deleterious point mutations (and any resulting mutator phenotypes) would likely be masked by the presence of wild-type alleles. Now, as haploids, they express a novel genotype (i.e., combination of defective or incompatible parental alleles), which manifests as a mutator phenotype. In this respect, AAR and AEQ appear analogous to the spore derivatives of the incompatible cMLH1-kPMS1 isolate referred to in the manuscript as a notable exception. The analysis of strains harboring incompatible MLH1-PMS1 mutations by Raghavan et al. demonstrated that the heterozygous diploid parents were not themselves mutators, but that haploid spores which had inherited the pair of incompatible alleles displayed mutator phenotype. Collectively, while it can certainly be argued that the strains AAR and AEQ (like the MLH1/PMS1 incompatible strains) are mutators now, this fact alone does not support the conclusion that they have adapted to survive the expression of an extant mutator phenotype. This premise could be tested by analyzing the mutation rates/spectra of four new spores derived from a single tetrad of CBS 1782. Do the four sibling spores display similar or different mutational rates and spectra? If all four spores from a single tetrad exhibit the 10-fold increase in CAN1 mutation rate and the C>A transversion bias, then it can be inferred that the diploid parent is also a mutator in the same manner. Further direct analysis of mutation rates and spectrum in the parent diploid CBS 1782 would complete the work. This finding would be quite significant, and would provide strong evidence that wild strains can in fact tolerate the expression of a chronic mutator allele.

      Minor comments:

      A final, relatively minor point. That the new haploids AAR and AEQ show distinct mutation rates and spectra opens the door to an interesting line of inquiry, which may help to identify the causative mutator allele in a manner more efficient than searching for missense mutations. It is stated, and it is understandable, that the identification of the possible causal mutations is beyond the scope of the present manuscript. In this spirit, it would be much more appropriate to restrict such considerations to the Discussion section. Specifically, while the authors make a plausible case for OGG1 being a candidate gene responsible for the C>A mutator phenotype, no experimental demonstration was attempted. As such, that text segment should be moved from the Results to the Discussion section.

      Significance

      As stated in the summary section above, the manuscript by Jiang et al represents a substantial contribution to the fields of genome stability and genome evolution. The method described is likely to be useful beyond budding yeast. The work will be appreciated by a broad audience of geneticists. The additional work and text modifications proposed above would likely further elevate the impact of this work.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Answers to the reviewers’ comments

      We deeply appreciate the reviewers for their thoughtful, critical and constructive comments, which have undoubtedly provided us with valuable opportunities to improve our manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Extravasation of lymphocytes from HEV in the lymph nodes is mediated by the interaction between lymphocyte L-selectin and PNAd-carrying sulfated sugars expressed by HEVs. Multiple steps of lymphocyte migration interacting with ECs at the luminal side of HEVs have been studied intensively; however, post-luminal migration steps are unclear. In this study, using intravital confocal microscopy of peripheral lymph nodes (pLNs), the authors found that GlcNAc6ST1 deficiency, required for sulfation of PNAd, delays trans-fibroblastic reticular cell (FRC) migration of lymphocytes, and hot spots of trans-HEV EC migration and trans-FRC migration. Interestingly, hot spots of trans-FRC migration are often associated with dendritic cells (DCs). Thus, the authors concluded that FRCs delicately regulate the transmigration of T and B cells across the HEV wall, which could be mediated by perivascular DCs.

      **Main comments**

      1. This study focused on pLNs, which are quite different from mesenteric lymph nodes (mLNs) in many ways. The authors should include mLNs in their study to make the general statement with regard to the T/B cell entry into lymph nodes. In addition, it will be more significant if this study includes challenged pLNs.

      We thank the reviewer for raising the important point. We agree that mesenteric lymph nodes are quite different from peripheral lymph node that this study focuses on. Therefore, we specified the popliteal or peripheral lymph node in the revised manuscript as follows.

      In the Abstract (page 2), “… Herein, we performed intravital imaging to investigate post-luminal T and B cell migration in popliteal lymph node, consisting of trans-EC migration, crawling in the perivascular channel (a narrow space between ECs and FRCs) and trans-FRC migration. … These results suggest that HEV ECs and FRCs with perivascular DCs delicately regulate T and B cell entry into peripheral lymph nodes.”

      In the Introduction (page 4), “Herein, we clearly visualized the multiple steps of post-luminal T and B cell migration in popliteal lymph node, including trans-EC migration, intra-PVC crawling and trans-FRC migration, using intravital confocal microscopy and fluorescent labelling of ECs and FRCs with different colours.

      In the Discussion (page 21), “… These results imply that pericyte-like FRCs, the second cellular barrier of HEVs, regulate the entry of T and B cells to maintain peripheral lymph node homeostasis more precisely and restrictively than we previously thought.”

      In addition, we discussed the difference in lymphocyte migration across HEVs between peripheral lymph node, mesenteric lymph node, and peyer’s patches in the Discussion of the revised manuscript. We also discussed inflamed lymph nodes in the Discussion as follows.

      In the Discussion (page 20), “… Although this work focused on peripheral lymph node, the other lymphoid organs have different lymphocyte homing efficiency61 due to organ-specific gene expression on HEVs62. B cells home better to mesenteric lymph nodes and peyer’s patches than peripheral lymph nodes61 by CD22-binding glycans expressed preferentially on the HEVs of mesenteric lymph nodes and peyer’s patches62.

      Inflamed peripheral lymph node become larger by recruiting more lymphocytes and even L-selectin-negative leukocytes that are excluded in the steady state63,64. Inflamed HEV ECs show different gene expression, such as downregulation of GLYCAM1 and GlcNAc6ST-160. In addition, inflamed HEV integrity may be loosen due to markedly increased leukocyte influx although the HEV FRCs can prevent bleeding by interacting with platelet CLEC-248. CD11c+ DCs are associated with inflamed HEV EC proliferation that is functionally associated with increased leukocyte entry65. The stepwise migration of lymphocyte across inflamed HEVs and their hot spots with perivascular CD11+ DCs will be interesting topic for future study.”

      The finding that GlcNAc6ST1 deficiency delays lymphocyte trans-FRC migration but not trans-HEV EC migration is surprising. However, the reason this occurs is neither shown nor discussed. Is GlcNAc6ST1 also expressed in FRCs? Or does GlcNAc6ST1 expression on HEV license lymphocytes to transmigrate across FRCs?

      This is valid point to be addressed. GlcNAc6ST-1 is predominantly involved in PNAd expression on the abluminal side rather than on the luminal side. Therefore, our results that GlcNAc6ST-1 deficiency increased the time required for trans-FRC migration but not that for trans-EC migration, could be attributable to deficiency of GlcNAc6ST-1-synthesizing L-selectin ligands in the abluminal side of HEV.

      In addition to PNAd expression in the luminal and abluminal sides of endothelial cells in HEV, PNAd expression has been observed in reticular network close to HEV as following figures. We believe that PNAds are expressed in FRCs close to HEV and can affect lymphocyte migration such as trans-FRC migration and parenchymal migration. By looking at the data (Table S1, Rodda et al., Immunity 2008), GlcNAc6ST-1 (Chst2) is expressed in T-cell-zone reticular cells while GlcNAc6ST-2 (Chst4) is absent. Therefore, it is presumable that FRC-expressed GlcNAc6ST1 may regulate trans-FRC migration in some extent.

      Figures. PNAD expression on HEVs (arrows) and reticular network (arrow heads) close to the HEVs

      We included these points in the Discussion of the revised manuscript (page 15) as follows.

      “… GlcNAc6ST-1 is predominantly involved in PNAd expression on the abluminal side rather than on the luminal side, although GlcNAc6ST-1 deficiency also modestly affects the luminal migration of lymphocytes by increasing the rolling velocity9. GlcNAc6ST-1 deficiency increased the time required for trans-FRC migration but not that for trans-EC migration. This could be attributable to deficiency of GlcNAc6ST-1-synthesizing L-selectin ligands in the abluminal side of HEV. In addition to the abluminal side of HEV endothelial cells, FRCs also express GlcNAc6ST-1, but not GlcNAc6ST-227, implying that FRC-expressed GlcNAc6ST-1 may regulate trans-FRC migration in some extent. … Thus, PNAds expressed at the endothelial junction and on the abluminal side of HEVs facilitate the efficient transmigration of lymphocytes across the HEV wall but do not slow transmigration in the perivascular region. GlcNAc6ST-1 deficiency and MECA79 antibody also decreased the parenchymal B and T cell velocities immediately after extravasation, respectively, probably because of blockade of parenchymal expression of PNAd in close proximity to HEV6,21,28.”

      Because of the adoptive transfusion experiment, the actual number of transmigrating lymphocytes in Fig. 3F is underestimated.

      We agree with the reviewer’s comment. We corrected the y-axis label in Fig. 3F from ‘average number of cells transmigrating at one site’ to ‘average number of labeled cells transmigrating at one site.’

      Whether DCs covering FRCs have a role for lymphocyte trans-migration is not shown.

      We leaved this work as future research and discussed about the potential mechanisms in the Discussion (page 17-18) that the DC may regulate lymphocyte entering by interacting FRC with LTβR or CLEC-2 signaling. We also included ‘Martinez et al Cell Rep 2019 (ref.51)’ in the discussion of the revised manuscript (page 18). In addition, we also discussed about better characterization of the CD11c+ DC in the Discussion of the revised manuscript (page 19) as follows.

      In the Discussion (page 18), “The podoplanin of FRCs also controls FRC contractility49,50 and ECM production51 by interacting with the CLEC-2 of DCs in inflamed lymph nodes. In the steady state, resident DCs in lymph nodes express CLEC-252. Thus, it is conceivable that CLEC-2+ resident DCs may control the contractility of FRCs and remodel ECM surrounding HEVs to facilitate the trans-FRC migration of T and B cells. Thus, the CLEC-2/podoplanin signalling may represent a key molecular mechanism underlying our discovery that trans-FRC migration hot spots preferentially occur at FRCs covered by CD11c+ DCs.”

      In the Discussion (page 19), “… In addition, better characterization of the CD11c+ DCs located in the hot spots of HEVs is required to differentiate them from the other CD11c+ DCs observed in the non-hot-spot regions of HEVs. Some T-cell-zone resident macrophages can also express CD11c54. Imaging of a triple-transgenic mouse with Zbtb46-cre;tdTomato and CD11b-GFP will be able to differentiate 3 types of DCs and macrophages potentially associated with the hot spots: Zbtb46+CD11b- cDC1, Zbtb46+CD11b+ cDC2, and Zbtb46-CD11b+ macrophage54,55.”

      In Fig. 1, time required for trans HEV EC migration and trans-FRC migration of T cells is shorter than that of B cells; however, this finding is not observed in Fig. 2C and E.

      Although the statistical comparison between T and B cells are not shown in Fig. 2C-F and S5., there are actually significant difference between T and B cells, which are similar results as Fig. 1 except for the dwell time in PVC. P values between T and B cells in wildtype mice are 0.0003, In the Result (page 6), “… The mean velocity of T cells (5.3 ± 1.7 μm/min) was significantly higher than that of B cells (4.1 ± 1.4 μm/min) during intra-PVC migration (Fig. 1E), while the dwell time and total path length in the PVC were not significantly different between T and B cells (Fig. 1, H and I). Similar results were obtained when both cells were imaged simultaneously, except that B cells had significant longer dwell time than T cells (Fig. 2C-F and Fig. S5). Interestingly, more than half of the T and B cells crawled from 50 μm to 350 μm inside the PVC (Fig. 1I), …”

      In the legend of Fig. 2, “… P values between T and B cells in wild-type mice were 0.0003 (C), …”

      In the legend of Fig. S5, “… P values between T and B cells in wild-type mice were 0.0240 (A), 0.3614 (B), 0.7518 (C) and 0.1337 (D). …”

      **Minor comments**

      1. Please provide evidence for GlcNAc6ST1 deficiency in HEV and surrounding tissues.

      Previous studies (Uchimura et al., JBC 2004, Nat. Immunol. 2005; ref9 and 10, respectively, in the manuscript) confirmed systemic deficiency of GlcNAc6ST-1 in peripheral lymph nodes of the GlcNAc6ST-1 KO mice.

      Images for delayed trans-FRC migration in GlcNAc6ST1 KO mice relative to WT are not convincing (Fig. 2G and H).

      We think the reason why the images look unconvincing is probably because it is not easy to quickly determine the images corresponding to the trans-FRC migration in the image sequence. To make the transmigration images easier to recognize, we added arrow heads indicating the transmigration site in Fig. 2G and 2H, and Fig. S4 as follows.

      Provide actual time periods required for Fig. 3F and G. Lack of isotype control IgG experiment in Fig. S3.

      We added the time periods (3 hours) in the figure legend as follows.

      “… (F) Average numbers of labeled T and B cells transmigrating at one site for 3 hours. (G) Ratio of hot spots to total transmigration sites for 3 hours. …”

      The purpose of Fig. S3 was to confirm that the anti-ER-TR7 antibody injection for labeling FRC do not alter normal T cell motility, rather than to confirm the function of ER-TR7. Therefore, we used non-injected group as control rather than control antibody injection group.

      Line 12 on page 11, "the ratio of hot spots to the total “observed” transmigration sites..." is not appropriate. The ratio must be calculated by hot spots to the total "potential" transmigration sites, although it is challenging to find total potential sites.

      We corrected the expression from ‘the total observed transmigration sites’ to ‘the total potential transmigration sites’.

      Please correct typos of angiomoduin to angiomodulin (page 16), ET-TR7 to ER-TR7 (page 17), Anti-CD3 to anti-CD3 (page 22), half the dose to half dose (page 22), the Multiple step to the multiple step (page 23).

      We thank the reviewer for finding those errors. We corrected them and performed proofreading repeatedly to correct typos and grammatic errors.

      Please provide an additional explanation of why actin-DsRed in HEVs is more strongly expressed than surrounding tissues such as FRCs in Fig. 1 although actin-DsRed should be expressed in all cell types in mice.

      We were also surprised when we found that HEV ECs expressed red fluorescence more strongly compared to surrounding tissues. Although the other cells such as FRCs and endogenous lymphocytes also express DsRed under control of a promotor gene, beta-actin, we believe that HEV ECs express more strongly, which is sufficient to image only HEV-EC by adjusting an image contrast. We revised the explanation of this point in the Methods (page 21) as follows.

      “HEV ECs of actin-DsRed mouse popliteal lymph node expressed red fluorescence much stronger than the surrounding stromal cells and endogenous lymphocytes, which was sufficient to image only HEV ECs by adjusting an image contrast (Fig. 1, A and B).”

      Reviewer #1 (Significance (Required)):

      The study focused on lymphocytes post extravasation of HEV, which is an understudied question, using intravital imaging. The in vivo imaging study was deliberately and beautifully performed, and the finding is insightful for understanding lymphocyte trafficking in lymph nodes. However, additional experimental should be performed to address some weaknesses listed in our comments.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The present study by K. Choe meticulously monitored the stepwise transmigration behavior of T cells and B cells, respectively, through the high endothelial venules of the mouse popliteal lymph node using the laser scanning confocal microscopy. In particular, the study focused on the post-luminal migration of T and B cells and reported the following. (1) Mice deficient in GlcNAc6ST-1 which is necessary for PNAd expression on the abluminal side of HEV showed significantly reduced abluminal migration of both T and B cells, (2) the footpad injection of the ER-TR7 antibody did not affect T cell transmigration across HEVs but marginally increased the parenchymal T cell velocity when compared with injection of control antibody, (3) T cells and B cells tended to share FRC migration hot spots but this was not the case with trans-EC migration hot spot, (4) the trans-FRC migration was observed at the FRCs closely associated with CD11c+ dendritic cells in HEV.

      While the present study is obviously the product of very meticulous and time-consuming work, it basically describes only a phenomenology, just reporting the lymphocyte behavior within and outside lymph node HEVs, without sufficiently analyzing the mechanistic aspect of the individual event they observed. The only antibody blocking experiments they performed to obtain mechanistic insights was by the use of commercially available monoclonal antibodies, all of which unfortunately contained a preservative, sodium azide, which potently blocks lymphocyte migration in vivo (Freitas AA & Bognacki J, Immunol 36:247, 1979). Therefore, the results of these antibody blocking experiments cannot be taken at face value.

      We thank the reviewer for raising the important point. Freitas et al used pre-treated lymphocytes with sodium azide in vitro for 1 hour while we injected the antibody into the footpad of recipient mouse 3 hours before lymphocyte injection via tail vein and imaging. Sodium azide might be highly diluted in vivo condition. In addition, Fig. S3 shows no significant difference in T cell migration in HEV between anti-ER-TR7 antibody-injected and non-injected groups although the anti-ER-TR7 antibody also contains sodium azide. We believe that the effect of sodium azide on our convincing results of the PNAd-blocking antibody compared to the control antibody (Fig. S8) may be insignificant. The potential side effect of sodium azide was mentioned in the Methods of the revised manuscript (page 22) as follows.

      “All antibodies we used contains sodium azide that has potential side effects on lymphocyte migration in lymph node57. However, Fig. S3 shows no significant difference in T cell migration in HEV between anti-ER-TR7-injected and non-injected groups.”

      Reviewer #2 (Significance (Required)):

      Real time imaging experiments were performed very carefully. However, as mentioned above, authors used sodium azide-containing antibodies for blocking experiments, and hence, these experiments cannot be interpreted properly.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This study presents a detailed investigation of T and B cell entry into lymph nodes (LN) via HEV. Substantial high quality intravital imaging is used to examine trans-EC and trans-FRC migration and define the role of PNAds in this process. The authors find that T and B cells use 'hot spots' to cross EC and FRC barriers, which supports prior similar observations by others. They also show that where T and B cells cross EC and FRC layers can differ, with regions of shared trans-FRC migration but more distinct EC crossing sites. This may relate to differences in the structure of these cellular layers, but provides novel insight into the mechanisms of cell entry into LNs via HEV. Assessment of the dependence on PNAd using antibodies or GlcNAc6ST-1 KO mice revealed perivascular and parenchymal cell behavior is also influenced by these signals. Lastly, examination of DCs that sit on the perivascular FRCs suggested that cells may prefer to cross at sites co-localized by DCs, although the reasons for this are not explored.

      This is a well performed study, with high quality imaging data and analysis. The results are convincing, with sufficient numbers of mice and adequate statistical analysis. There are a number of minor grammatical errors throughout the text, which should be easy to fix.

      We thank the reviewer for the positive evaluation. We carefully performed proofreading repeatedly to correct typos and grammatical errors.

      Reviewer #3 (Significance (Required)):

      Although 'hot spots' have been proposed by others, this detailed analysis provides new knowledge of how lymphocytes can cross the HEV and FRC barriers to enter LNs. This is an important study to advance our understanding of cell recruitment to lymph nodes. The role of perivascular and parenchymal PNAd signals observed here should also be of interest to immunologists to help define the signals required for immune cell motility in tissues.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      The authors have used a combination of intravital confocal imaging and transgenic models to study the migration of T and B cells through the HEVs. They move on from Moscacci et al. and Park et al., studies on lymphocyte migration. This study focuses on visualization and molecular mechanism of post-trans-EC migration, including the intra-PVC and trans-FRC migration of T and B cells in HEVs. They have been able to show how lymphocytes migrate through the HEV into the parenchyma. Using the GlcNAc6sT-1 (catalyst for sulfation of PNAds) KO model (and MECA control for PNAds blocking) they identify the role of L-selectin/PNAd for lymphocyte transmigration. The identification of hot spots of T and B cell transmigration in HEVs is novel and extremely interesting for the field however the data shown is not entirely convincing in their current form. The hot spots were defined as areas where the lymphocytes migrate through the HEV epithelial cells and pericyte (FRC) regions. These are areas where migration was greatly shared T and B cells. Using the CD11c-YFP mouse model they identified CD11c+ cells in proximity to the FRCs located at the migration hotspots which can drive further speculation regarding the mechanism by which these areas of the HEVs are more permissive.

      **Major comments**

      1) Intravital imaging of T and B cell transmigration across HEVS composed of ECs and FRCs

      • Figure 1: The authors mention that they performed similar experiments for B cells. Authors should show comparative data for T cells and B cells.

      • Panel S1B should be provided for both T and B cells in figure 1.

      We added the image sequence of B cell migration and the panels (Fig S1B of previous manuscript) showing intra-PVC segments of T or B cells in Fig. 1C of the revised manuscript as follows.

      2) T and B cells preferentially share hotspots for trans-FRC migration not EC-migration

      • Figure 4: This data is important to the storyline but as presented it is difficult to understand. Results are overstated in the text however it is difficult to see where these conclusions come from based on the figure. In Figure 4B the authors should show percentages on the Venn diagram or remove it entirely. In Figure 4C the authors should add labels to their y-axis and separate the data in order to assist with the storyline and convince of the presence of hot spots.

      We agree with the reviewer’s opinion. We removed the Venn diagram, separated the Fig. 4C into 4B and 4C, and added y-axis labels in the figures. In addition, we revised the figure legends and the text in the Results to make it easier to understand as follows.

      In the figure legend, “…(B-C) The round and diamond symbols represent predicted and observed values, respectively, for the percentage of T cell hot spots in B cell hot spots (B), for the percentage of B cell hot spots in T cell hot spots (C). …”

      In the Results (page12), “Simultaneously imaging T and B cells showed that some T and B cells transmigrated across FRCs at the same site (Fig. 4A and Movie S8). To investigate whether T and B cells share their hot spots preferentially or accidentally, we compared the percentage of T cell hot spots in total B cell hot spots (diamond symbols in Fig. 4B) with its predicted value that is the possibility of accidently sharing T and B cell hot spots (round symbols in Fig. 4B). The predicted value can be calculated as the percentage of T cell hot spots in total transmigration sites. To note, the percentage of hot spots in total sites for trans-FRC migration was higher than that for trans-EC migration (Fig. 3G and round symbols in Fig. 4B) maybe because the number of trans-FRC migration sites was less than that of trans-EC migration sites. It implies that the possibility of accidently sharing T and B cell hot spots for trans-FRC migration is higher than that for trans-EC migration. However, surprisingly, the percentage of T cell hot spots in B cell hot spots was significantly higher than its predicted value of accidently sharing hot spots for trans-FRC migration (Fig. 4B). Similarly, the percentage of B cell hot spots in T cell hot spots was also significantly higher than its predicted value for trans-FRC migration (Fig. 4C). These results imply that T and B cells preferentially share trans-FRC migration hot spots beyond the prediction for accidently sharing. However, there were no significant differences between observed and predicted values for trans-EC migration (Fig. 4B and 4C), which implies T and B cells just accidently share their trans-EC migration hot spots.”

      3) T and B cells prefer to transmigrate across FRCs covered by perivascular CD11c+ DCs

      • DCs drive changes to FRC phenotype and contractility. The interaction between CLEC-2 (on DCs and platelets) is important for driving permeability of the HEVs. The authors use the CD11c-YFP mouse model in Figure 5 (and the supporting figures) to show the proximity of the CD11c+ cells and FRCs. Data from Baratin et al., (Immunity, 2017) suggest that CD11c+ cells in the parenchyma are also T cell zone macrophages (TZMs) that were previously characterized as DCs. Macrophages have previously been shown important for perivascular transmigration of neutrophils during bacterial skin infection (Abtin et al.2014- Nat Immun). CD11c-YFP alone does not show the cells proximal to FRCs are DCs so the authors should try to stain them with CLEC-2 or use the CLEC9a-cre mouse model to better characterise these cells.

      We thank the reviewer for raising important point. We agree that the perivascular CD11c+ cells could be T-cell-zone macrophages (TZMs). Better characterization of the CD11c+ cells located in the hot spots of HEVs is required to determine if they are DCs or macrophages, and also to differentiate them from the other CD11c+ cells observed in the non-hot-spot regions of the HEVs. To differentiate DCs from TZMs, Zbtb46-GFP mouse can be used for imaging because Zbtb46-GFP are highly expressed in conventional DCs (cDCs) but not monocytes, macrophages, or other lymphoid or myeloid lineages (Satpathy et al, JEM 2012). However, endothelial cells also express Zbtb46-GFP. To visualize only DCs in HEVs, we need to make a chimeric mouse by adoptive transfer of Zbtb46-GFP bone-marrow cells into irradiated wild-type mouse. Furthermore, using a triple transgenic mouse with Zbtb46-cre;tdTomato and CD11b-GFP will be able to differentiate 3 types of DCs and TZMs potentially associated with the hot spots: Zbtb46+CD11b- cDC1 (red), Zbtb46+CD11b+ cDC2 (yellow), and Zbtb46-CD11b+ macrophage (green). However, since generation or obtaining of those transgenic mice models including CLEC9a-cre mouse will take long time, we will leave this work as future research and discussed this point in the Discussion of the revised manuscript as follows. In addition, we think that it will be difficult to differentiate the CLEC2 of perivascular DCs from that of platelets by in vivo labeling by injection of anti-CLEC2 antibody conjugated with a fluorescent dye because the CLEC2 of platelets maintains HEV integrity with interacting of FRC podoplanin (Herzog et al, Nature 2013).

      In the Discussion (page 19), “… In addition, better characterization of the CD11c+ DCs located in the hot spots of HEVs is required to differentiate them from the other CD11c+ DCs observed in the non-hot-spot regions of HEVs. Some T-cell-zone resident macrophages can also express CD11c54. Imaging of a triple-transgenic mouse with Zbtb46-cre;tdTomato and CD11b-GFP will be able to differentiate 3 types of DCs and macrophages potentially associated with the hot spots: Zbtb46+CD11b- cDC1 (red), Zbtb46+CD11b+ cDC2 (yellow), and Zbtb46-CD11b+ macrophage (green)54,55.”

      **Minor comments**

      1) Intravital imaging of T and B cell transmigration across HEVS composed of ECs and FRCs

      • The velocity differences observed could be due to location of HEV in the parenchyma. Furthermore, FRC plasticity can cause differences in secretion of chemokine gradients based on the location of cells and their niche (Rhoda et al., Immunity 2018). HEVs regulation of lymphocyte entry can be influenced by their niche (Veerman et al., Cell Reports 2019). The authors should comment on the HEV position relative to B cell areas.

      We included this point with the references (Rhoda et al, immunity 2018, ref 27; Veerman et al., Cell Rep. 2019, ref 60) in the Discussion of the revised manuscript (page 19-20) as follows.

      “Compared to T cell, B cells took a longer time to pass EC and FRC layers in HEV and had lower velocity in PVC and parenchyma just after extravasation. Furthermore, the adhesion rate of B cells to HEV EC in luminal side is lower than that of T cells5. These could be attributed to lower expression of L-selectin and CCR7 on B cells than T cells18,59. The difference in homing efficiency between T and B cells may vary depending on the HEV location due to the heterogeneous expression of chemokines and integrins on HEV EC and surrounding FRCs in peripheral lymph node27,60. The HEVs imaged in this work were located around 40-70 μm depth from the capsule where might be close to B cell follicles. B cell homing efficiency in the deeper paracortical T cell zone could be different from our data probably due to less CXCL13 that is chemoattractant for B cells highly expressed in follicles. …”

      • Images shown in Fig1A is the same as Fig S1A/B. I presume this is an error.

      Fig. 1A and Fig. S1A correspond to a 20-um-thick maximum intensity projection and single z-frame without projection, respectively. To avoid the confusion, we changed Fig.1A to the single z-frame (Fig S1A) and remove the 20-um thick maximum projection.

      • Figure S3: Data for Ab treated appears to be identical to what is shown for T cells in Fig 1. I presume this is an error and the correct control will be shown.

      We used the data of Fig. 1D-1I as the Ab-injected group in Fig. S3. We are sorry for the lack of clear explanation about this. We included the explanation in the figure legend as follows.

      In the legend of Fig. S3, “(A-E) There is no significant difference between antibody-injected group (Ab) and non-injected group (Non) in T cell migration from trans-EC migration to trans-FRC migration. Non-injected means that no substance is injected into a footpad of mouse. We used the data of Fig. 1D-1I as the antibody-injected group. …”

      2) Non-redundant role of L-selectin/PNAd interactions in post-luminal migration of T and B cells in HEV

      • Could the authors clarify the number of mice used for this analysis (same applies to figure 1)

      In the legends of Fig. 1-2, S6 and S8, there is the number of mice we used. In Fig. 1, “Four and 3 mice were used for the analysis of T and B cells, respectively.” In Fig. 2, “Four mice were analysed for each group.” In Fig. S6, “Three mice were analysed for each group.” In Fig. S8, “Five and 4 mice were analysed for the control Ab and MECA79 groups, respectively.”

      In addition, we added the number of mice in the legend of Fig. S7. In Fig. S7, “The images are representative of 4 popliteal lymph nodes of 2 mice and 2 popliteal lymph nodes of a mouse for MECA79 and control IgM antibody, respectively.”

      • Figure S6: further to percentages of T cell populations the authors should also provide the number of T cells (CD4, CD8, CM and naive) for both wildtype and KO.

      We included the analyzed cell number by FACS in Fig. S6 and revised the figure legend as follows.

      In the Fig. S6, “… (B) Analyzed cell numbers by FACS for 3 control and 3 KO mice. (C) Percentage of each type of T cells in DsRed+ T cells. No difference in the percentage of homing central memory, Naïve CD4 and CD8 T cells between wild-type and KO mice. …”

      **Methods** for the flow cytometry analysis could the details of how samples were processed (or reference) be provided.

      We added the details in the Methods (page 24) as follows.

      “Popliteal and inguinal lymph nodes were harvested and single-cell suspensions were prepared by mechanical dissociation on a cell strainer (RPMI-1640 with 10% FBS). Cell suspensions were centrifuged at 300g for 5 min. Erythrocytes in lymph nodes were lysed with ACK lysis buffer for 5 min at RT. Cell suspensions were washed and filtered through 40um filters. Non-specific staining was reduced by using Fc receptor block (anti-CD16/CD32). Cells were incubated for 30 min with varying combinations of the following fluorophore-conjugated monoclonal antibodies: anti-CD3e (clone 145-2C11, BD pharmigen), anti-CD4 (clone GK1.5, BD Pharmingen), anti-CD8 (clone 53-6.7, eBioscience), anti-CD44 (clone IM7, Biolegend) and anti-CD62L (clone MEL-14, eBioscience) antibodies (diluted at a ratio of 1:200) in FACS buffer (5% bovine serum in PBS). After several washes, cells were analyzed by FACS Canto II (BD Biosciences) and the acquired data were further evaluated by using FlowJo software (Treestar).

      **References:** The discussion covers key references in the field, but more recent studies should be included. Some examples have been suggested in the comments sections. Key references missing that can help discussion/interpretation of the data include: 1) Veerman et al 2019, Cell reports. The data in that paper shows the heterogeneity of the HEV and different regulation of genes that control lymphocyte entry. This can also be linked to the comments above regarding section 1 and 2. 2) Rhodda et al 2018, Immunity that focuses on niche-associated heterogeneity of lymph node stromal cells. The authors should also include Webster et al., 2006, JEM which describes the role of DCs in regulating vascular growth in the lymph node.

      We thank the reviewer for suggesting good references to discuss. We included the references #1 and #2 in the revised manuscript as we responded to the minor comment #1. We also cited Webster et al., JEM 2006 (as ref 65) in the Discussion of the revised manuscript (page 20) as follows.

      “Inflamed peripheral lymph node become larger by recruiting more lymphocytes and even L-selectin-negative leukocytes that are excluded in the steady state63,64. Inflamed HEV ECs show different gene expression, such as downregulation of GLYCAM1 and GlcNAc6ST-160. In addition, inflamed HEV integrity may be loosen due to markedly increased leukocyte influx although the HEV FRCs can prevent bleeding by interacting with platelet CLEC-248. CD11c+ DCs are associated with inflamed HEV EC proliferation that is functionally associated with increased leukocyte entry65. The stepwise migration of lymphocyte across inflamed HEVs and their hot spots with perivascular CD11+ DCs will be interesting topic for future study.”

      Reviewer #4 (Significance (Required)):

      This paper asks important questions and can make a significant contribution to the field if all revisions are addressed. The authors identified PNAd as an important factor for T cell migration. Further to previous studies in the field suggesting non-random transmigration sites. The authors used intra-vital confocal imaging to identify how lymphocytes cross the epithelial cells and FRCs of the HEVs to migrate to the parenchyma. The authors identify hotspots used by lymphocytes to transmigrate. Finally, the authors show that CD11c+ cells are proximal to FRCs hotspots and might have a role in driving lymphocyte transmigration.

      Audience: Lymphocyte/immune cell biology, stomal immunology, FRC and lymph node inflammation. My expertise: Stomal immunology, immunology, innate immunity

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The authors have used a combination of intravital confocal imaging and transgenic models to study the migration of T and B cells through the HEVs. They move on from Moscacci et al. and Park et al., studies on lymphocyte migration. This study focuses on visualization and molecular mechanism of post-trans-EC migration, including the intra-PVC and trans-FRC migration of T and B cells in HEVs. They have been able to show how lymphocytes migrate through the HEV into the parenchyma. Using the GlcNAc6sT-1 (catalyst for sulfation of PNAds) KO model (and MECA control for PNAds blocking) they identify the role of L-selectin/PNAd for lymphocyte transmigration. The identification of hot spots of T and B cell transmigration in HEVs is novel and extremely interesting for the field however the data shown is not entirely convincing in their current form. The hot spots were defined as areas where the lymphocytes migrate through the HEV epithelial cells and pericyte (FRC) regions. These are areas where migration was greatly shared T and B cells. Using the CD11c-YFP mouse model they identified CD11c+ cells in proximity to the FRCs located at the migration hotspots which can drive further speculation regarding the mechanism by which these areas of the HEVs are more permissive.

      Major comments:

      1) Intravital imaging of T and B cell transmigration across HEVS composed of ECs and FRCs

      • Figure 1: The authors mention that they performed similar experiments for B cells. Authors should show comparative data for T cells and B cells.
      • Panel S1B should be provided for both T and B cells in figure 1.

      2) T and B cells preferentially share hotspots for trans-FRC migration not EC- migration

      • Figure 4: This data is important to the storyline but as presented it is difficult to understand. Results are overstated in the text however it is difficult to see where these conclusions come from based on the figure. In Figure 4B the authors should show percentages on the Venn diagram or remove it entirely. In Figure 4C the authors should add labels to their y-axis and separate the data in order to assist with the storyline and convince of the presence of hot spots.

      3) T and B cells prefer to transmigrate across FRCs covered by perivascular CD11c+ DCs

      • DCs drive changes to FRC phenotype and contractility. The interaction between CLEC-2 (on DCs and platelets) is important for driving permeability of the HEVs. The authors use the CD11c-YFP mouse model in Figure 5 (and the supporting figures) to show the proximity of the CD11c+ cells and FRCs. Data from Beratin et al., (Immunity, 2017) suggest that CD11c+ cells in the parenchyma are also T cell zone macrophages (TZMs) that were previously characterised as DCs. Macrophages have previously been shown important for perivascular transmigration of neutrophils during bacterial skin infection (Abtin et al.2014- Nat Immun). CD11c-YFP alone does not show the cells proximal to FRCs are DCs so the authors should try to stain them with CLEC-2 or use the CLEC9a-cre mouse model to better characterise these cells.

      Minor comments:

      1) Intravital imaging of T and B cell transmigration across HEVS composed of ECs and FRCs

      • The velocity differences observed could be due to location of HEV in the parenchyma. Furthermore FRC plasticity can cause differences in secretion of chemokine gradients based on the location of cells and their niche (Rhoda et al., Immunity 2018).HEVs regulation of lymphocyte entry can be influenced by their niche (Veerman et al., Cell Reports 2019).The authors should comment on the HEV position relative to B cell areas.
      • Images shown in Fig1A is the same as Fig S1A/B. I presume this is an error.
      • Figure S3: Data for Ab treated appears to be identical to what is shown for T cells in Fig 1. I presume this is an error and the correct control will be shown.

      2) Non-redundant role of L-selectin/PNAd interactions in post-luminal migration of T and B cells in HEV

      • Could the authors clarify the number of mice used for this analysis (same applies to figure 1)
      • Figure S6: further to percentages of T cell populations the authors should also provide the number of T cells (CD4, CD8, CM and naive) for both wildtype and KO.

      Methods:

      for the flow cytometry analysis could the details of how samples were processed (or reference) be provided.

      References:

      The discussion covers key references in the field but more recent studies should be included. Some examples have been suggested in the comments sections.Key references missing that can help discussion/interpretation of the data include: 1) Veerman et al 2019, Cell reports. The data in that paper shows the heterogeneity of the HEV and different regulation of genes that control lymphocyte entry. This can also be linked to the comments above regarding section 1 and 2. 2) Rhodda et al 2018, Immunity that focuses on niche-associated heterogeneity of lymph node stromal cells. The authors should also include Webster et al., 2006, JEM which describes the role of DCs in regulating vascular growth in the lymph node.

      Significance

      This paper asks important questions and can make a significant contribution to the field if all revisions are addressed. The authors identified PNAd as an important factor for T cell migration. Further to previous studies in the field suggesting non-random transmigration sites. The authors used intra-vital confocal imaging to identify how lymphocytes cross the epithelial cells and FRCs of the HEVs to migrate to the parenchyma. The authors identify hotspots used by lymphocytes to transmigrate. Finally the authors show that CD11c+ cells are proximal to FRCs hotspots and might have a role in driving lymphocyte transmigration.

      Audience: Lymphocyte/immune cell biology, stomal immunology, FRC and lymph node inflammation.

      My expertise: Stomal immunology, immunology, innate immunity

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study presents a detailed investigation of T and B cell entry into lymph nodes (LN) via HEV. Substantial high quality intravital imaging is used to examine trans-EC and trans-FRC migration and define the role of PNAds in this process. The authors find that T and B cells use 'hot spots' to cross EC and FRC barriers, which supports prior similar observations by others. They also show that where T and B cells cross EC and FRC layers can differ, with regions of shared trans-FRC migration but more distinct EC crossing sites. This may relate to differences in the structure of these cellular layers, but provides novel insight into the mechanisms of cell entry into LNs via HEV. Assessment of the dependence on PNAd using antibodies or GlcNAc6ST-1 KO mice revealed perivascular and parenchymal cell behaviour is also influenced by these signals. Lastly, examination of DCs that sit on the perivascular FRCs suggested that cells may prefer to cross at sites co-localised by DCs, although the reasons for this are not explored.

      This is a well performed study, with high quality imaging data and analysis. The results are convincing, with sufficient numbers of mice and adequate statistical analysis. There are a number of minor grammatical errors throughout the text, which should be easy to fix.

      Significance

      Although 'hot spots' have been proposed by others, this detailed analysis provides new knowledge of how lymphocytes can cross the HEV and FRC barriers to enter LNs. This is an important study to advance our understanding of cell recruitment to lymph nodes. The role of perivascular and parenchymal PNAd signals observed here should also be of interest to immunologists to help define the signals required for immune cell motility in tissues.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The present study by K. Choe meticulously monitored the stepwise transmigration behavior of T cells and B cells, respectively, through the high endothelial venules of the mouse popliteal lymph node using the laser scanning confocal microscopy. In particular, the study focused on the post-luminal migration of T and B cells and reported the following. (1) Mice deficient in GlcNAc6ST-1 which is necessary for PNAd expression on the abluminal side of HEV showed significantly reduced abluminal migration of both T and B cells, (2) the footpad injection of the ER-TR7 antibody did not affect T cell transmigration across HEVs but marginally increased the parenchymal T cell velocity when compared with injection of control antibody, (3) T cells and B cells tended to share FRC migration hot spots but this was not the case with trans-EC migration hot spot, (4) the trans-FRC migration was observed at the FRCs closely associated with CD11c+ dendritic cells in HEV.

      While the present study is obviously the product of very meticulous and time-consuming work, it basically describes only a phenomenology, just reporting the lymphocyte behavior within and outside lymph node HEVs, without sufficiently analyzing the mechanistic aspect of the individual event they observed. The only antibody blocking experiments they performed to obtain mechanistic insights was by the use of commercially available monoclonal antibodies, all of which unfortunately contained a preservative, sodium azide, which potently blocks lymphocyte migration in vivo (Freitas AA & Bognacki J, Immunol 36:247, 1979). Therefore, the results of these antibody blocking experiments cannot be taken at face value.

      Significance

      Real time imaging experiments were performed very carefully. However, as mentioned above, authors used sodium azide-containing antibodies for blocking experiments, and hence, these experiments cannot be interpreted properly.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Extravasation of lymphocytes from HEV in the lymph nodes is mediated by the interaction between lymphocyte L-selectin and PNAd-carrying sulfated sugars expressed by HEVs. Multiple steps of lymphocyte migration interacting with ECs at the luminal side of HEVs have been studied intensively; however, post-luminal migration steps are unclear. In this study, using intravital confocal microscopy of peripheral lymph nodes (pLNs), the authors found that GlcNAc6ST1 deficiency, required for sulfation of PNAd, delays trans-fibroblastic reticular cell (FRC) migration of lymphocytes, and hot spots of trans-HEV EC migration and trans-FRC migration. Interestingly, hot spots of trans-FRC migration are often associated with dendritic cells (DCs). Thus, the authors concluded that FRCs delicately regulate the transmigration of T and B cells across the HEV wall, which could be mediated by perivascular DCs.

      Main comments:

      1. This study focused on pLNs, which are quite different from mesenteric lymph nodes (mLNs) in many ways. The authors should include mLNs in their study to make the general statement with regard to the T/B cell entry into lymph nodes. In addition, it will be more significant if this study includes challenged pLNs.
      2. The finding that GlcNAc6ST1 deficiency delays lymphocyte trans-FRC migration but not trans-HEV EC migration is surprising. However, the reason this occurs is neither shown nor discussed. Is GlcNAc6ST1 also expressed in FRCs? Or does GlcNAc6ST1 expression on HEV license lymphocytes to transmigrate across FRCs?
      3. Because of the adoptive transfusion experiment, the actual number of transmigrating lymphocytes in Fig. 3F is underestimated.
      4. Whether DCs covering FRCs have a role for lymphocyte trans-migration is not shown.
      5. In Fig. 1, time required for trans HEV EC migration and trans-FRC migration of T cells is shorter than that of B cells; however, this finding is not observed in Fig. 2C and E.

      Minor comments:

      1. Please provide evidence for GlcNAc6ST1 deficiency in HEV and surrounding tissues.
      2. Images for delayed trans-FRC migration in GlcNAc6ST1 KO mice relative to WT are not convincing (Fig. 2G and H).
      3. Provide actual time periods required for Fig. 3F and G. Lack of isotype control IgG experiment in Fig. S3.
      4. Line 12 on page 11, "the ratio of hot spots to the total;observed' transmigration sites..." is not appropriate. The ratio must be calculated by hot spots to the total "potential" transmigration sites, although it is challenging to find total potential sites.
      5. Please correct typos of angiomoduin to angiomodulin (page 16), ET-TR7 to ER-TR7 (page 17), Anti-CD3 to anti-CD3 (page 22), half the dose to half dose (page 22), the Multiple step to the multiple step (page 23).
      6. Please provide an additional explanation of why actin-DsRed in HEVs is more strongly expressed than surrounding tissues such as FRCs in Fig. 1 although actin-DsRed should be expressed in all cell types in mice.

      Significance

      The study focused on lymphocytes post extravasation of HEV, which is an understudied question, using intravital imaging. The in vivo imaging study was deliberately and beautifully performed, and the finding is insightful for understanding lymphocyte trafficking in lymph nodes. However, additional experimental should be performed to address some weaknesses listed in our comments.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Our response to reviewers has been provided as a formatted typeset pdf file. This includes the original review comments (bolded) and our responses. In particular, our responses include several figures. Our intention is to include the full set of reviews and responses as supplementary information in our manuscript once published at a journal - we would also be happy to have this document uploaded to biorXiv for readers as well.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Kannan et al start with the good idea of using Shannon entropy as a way to temporally classify the development of cells, quantifying their maturation status by implementing it on single cell gene expression as measured by scRNAseq. The idea behind is that as cells develop, genes are silenced and hence the overall GeX entropy goes down. This approach would allow a robust method to compare heterogeneous datasets, an important problem that current scRNAseq analysis methods (such as Monocle) using dimensionality reduction are unable to robustly perform this task. Unfortunately the analysis and calculation of the entropy and also the results obtained do not generate convincing proof that Entropy is actually a good metric for comparing development in diverse datasets/cell types.

      Major Comments:

      -The calculation of the entropy is not clear enough (or not performed correctly).Shouldn't Pi be the GeX distribution of Gene i across all cells? The authors seem to have calculated Pi as the probability of expression in one cell then summed across. Unless I am wrong, this does not make sense and invalidates all the analysis.

      -Entropy score correlated only moderately with pseudotimes for the three methods. This is a major problem that needs to be explained. One would expect entropy to give a higher correlation if it is a robust measure of development.

      -One of the main purposes of the approach is to classify maturation of in vitro datasets, but basically no entropy changes are found. They are minimal in figures 5c. Following with this, the developmental times of the datasets as shown by color codes do not match the changes in entropy (see Figs 4b, 5a/b.

      Minor Comments:

      -Also Pi being a probability, how was the normalization performed so that the sum of the probability is 1. Given the variability in gene expression, scRNAseq platforms and number of cells it would be good to have a metric estimating the quality of the distribution. -why is the entropy not compared between the Kannan dataset and Wang and Yao? This would prove that indeed entropy is a good measure as opposed to UMAP+monocle.

      Fig 3 should be in the supplement.

      Significance

      The idea behind this study is of potential significance as well stated by the authors, but the implementation of these ideas lacks scientific rigor. Entropy analysis needs to be repeated or clarified/better explained.

      Referees cross-commenting

      After reading the other reviewers comments showing the relevance of the approach developed by the authors, I do feel that with some clarification/discussion regarding the technical questions of the analysis solving the doubts I expressed, the manuscript could be of interest.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript does a fairly exhaustive job of comparing and bench-marking different single cell/nucleus RNA-seq on in vivo cardiomyocytes and in vitro cardiomyocyte differentiation protocols. The analyses is clearly described.

      Minor comments, questions and clarifications sought:

      It may be useful to emphasize that matching the entropy score of in vivo cardiomyocytes (or a given CM developmental state) is not a sufficient indication of matching the expression patterns of the in vivo counterpart. Compare entropy scores from cardiomyocytes from snRNA-seq on post mortem tissue (Litviňuková, et al. Nature volume 588, pages466-472(2020)) There are differences in cardiomyocytes obtained from different regions of the human heart (atrial vs. ventricular, left vs. right, etc.). It will be informative to compare the many in vitro differentiation datasets (and protocols) that may give result in atrial-like or ventricular-like CM to their in vivo counterparts. This question pertains to in vitro CM differentiation: Is entropy score sensitive to cell-types that differentiate into alternative lineages during in vitro differentiation (issue of purity)? Different cell lineages may have different maturation rates and if they are not excluded, the non-cardiomyocyte cells could contribute to noisy measurements. If the entropy score is calculated after a first round of clustering, on identified CM among the population (as opposed to cardiac progenitor cells, for example), I would be more confident of the entropy score.

      This also pertains to in vitro CM differentiation: Even within the cardiomyocyte lineage, there may be different rates of development that ultimately lead to the same end point. Therefore there may be the need to coarse-grain the developmental time-points to account for the precocious ones and the 'late bloomers'. It may be useful to anchor the developmental trajectory based on entropy score to biological milestones (such as when the CM's start beating in plates). Can the authors comment on this, please?

      CM's are interesting in that they are post-mitotic and as such, will attain a level or maturity at the end of the maturation process. I can imagine this not being the case for cells that continue to cycle and divide. It would be interesting to compare the change in entropy score for such cells. How about cells that differentiate when activated by an external stimulus (e.g., immune cells)? As long as a cell has high transcriptional variability or is transcriptionally active (e.g., as stress response) it may still show high entropy score. How would one interpret Entropy scores in such situations?

      The authors note "higher mtGENE in differentiated cells and later time points."- Fig 2a. Could this be related to difficulty in dissociation, as part of stress response? The authors note "In particular, 10x Chromium and STRT-seq datasets appeared to have systematically higher percentages of ribosomal protein-coding genes than other protocols." Could this simply be due to higher transcript capture rate of these protocols? These protocols/techniques may not be statistically sampling a cell's transcripts at the same rate as the techniques with "lower" capture efficiency.

      Can entropy score be used in the context of activation (under external stimulus) or deactivation (when the external stimulus is removed)?

      What do the black dots represent in Fig 2c?

      Significance

      The manuscript, "Transcriptomic entropy benchmarks stem cell-derived cardiomyocyte maturation against endogenous tissue at single cell level" by Kannan et al. introduces an interesting phenomenon, transcriptional entropy to track the rate of maturation in an important in PSC-derived cardiomyocytes. The need for cardiomyocyte in translational and clinical research along with the difficulty in getting live, mature cardiomyocytes from humans and make it imperative that in vitro systems are sought. Being able to characterize the rate of differentiation and maturation in these in vitro systems is also valuable and in that respect, the manuscript does a fairly exhaustive job of comparing and benchmarking different cardiomyocyte differentiation protocols that have been profiled by sc/snRNA-seq to date. Most importantly, comparing entropy scores between in vitro and in vivo counterparts is a simple and elegant way to anchor in vitro differentiation to pre- and post-natal development. Another interesting aspect of transcriptional entropy measure in a single cell is that it is independent of neighboring cells, and is therefore a conceptually different and novel way to characterize single cell data that, to date, have been analyzed by techniques that group cells by each cell's similarity to others. The study is well conceived and systematically explored. The manuscript is also well written. I recommend that the manuscript be accepted for publication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Kannan et al. have developed an approach based on the quantification of gene distributions to assess pluripotent stem cell (PSC)-derived cell and tissue maturation. Methodologically, they combined single cell RNA-seq (scRNA-seq) with bioinformatic and statistical approaches to calculate transcriptomic entropy scores to benchmark cellular maturation. Their findings address unresolved issues regarding the developmental state of isolated cells and current problems associated with cell population heterogeneity. As model systems, the authors focused on cardiomyocytes (CMs) from mouse heart and on CMs generated through in vitro differentiated of PSCs from human. The authors examine a spectrum of CMs from mouse heart as a function of developmental time and provide evidence showing that scRNA-seq captures maturation related changes. Using a modification of the Shannon entropy of scRNA-seq and CMs isolated from embryonic, fetal, neonatal and early adult mouse hearts, they show that transcriptomic entropy scores decrease with developmental time. The authors then extend their results to human cells and perform a meta-analysis of publicly available scRNA-seq datasets. When cross-study comparisons were performed, meaningful comparisons could only be generated after gene and cell filtration. The output of the resulting workflow and computed entropy scores show good concordance among cells generated using different in vitro differentiation and different isolation techniques, and between stage-matched mouse and human tissues. The authors go on to show that in vitro derived CMs or reprogrammed CMs (from fibroblasts) undergo an apparent developmental block to maturation in vitro. The relevance of their approach to other cell systems was demonstrated using datasets from pancreatic beta cells and hepatocytes. In summary, the calculated entropy scores recapitulate known CM maturation gene expression profiles, making this approach invaluable for future comparisons between engineered and in vivo derived tissues.

      Comments:

      The key conclusions of the manuscript by Kannan et al. are supported by an examination of multiple datasets and the use of extensive and complementary bioinformatic and statistical analyses. The authors utilized a digestion and cell sorting approach that permits the isolation of viable CMs from mouse heart. The choice of scRNA-seq approaches eliminated cell type heterogeneity (either physically or bioinformatically) from otherwise complex cell populations. The authors then employed a variety of analytical approaches to identify limitations to cross-data comparisons and to define the maturation state of the cells. By minimizing protocol-related biases, resolving mismapping of mitochondrial reads to pseudogenes, taking into account variations in study sensitivity, and excluding datasets of relative poor quality, they were able to develop an informative workflow to generate meaningful entropy scores to benchmark maturation in cross-study and cross-species comparisons. These comparisons were validated using reprogrammed fibroblasts, hepatocytes and pancreatic beta cells. Overall, the experiments were well designed, the experimental and bioinformatic limitations addressed, and the conclusions supported by robust datasets, entropy scores, bioinformatics and statistics. This leads me to conclude that their validated approach will be of significant value to other researchers who need to benchmark cell maturation using a quantitative, transcriptome-based approach.

      A few experimental additions or discussion points would have strengthened the overall impact of this study.

      First, the process of cell dissociation coupled with cell sorting may be associated with a time lag in sample preparation that might be expected to affect RNA stability. If comparisons were performed between scRNA-seq and bulk RNA-seq, would the entropy scores have been equally informative or would differences have been observed from RNA instability that may have affected the entropy scores? While this test would be difficult with in vivo acquired cells, such a comparison could have been made using purified (but not sorted) hPSC-CMs. An answer to this question might be valuable to investigators who wish to use your approach to examine existing bulk RNA-seq datasets. Basically, is the workflow only applicable for scRNA-seq data where problems of cell heterogeneity can be eliminated, even though you provide evidence on how to exclude non-CMs from your datasets using transcriptome profiles?

      Second, would mouse strain differences or sex differences cause a shift in the entropy scores or pseudotime analyses, even if only marginally? Not all mouse models develop at the same rate and sex is known to affect both murine fetal and infant growth.

      Third, when performing the entropy scores and pseudotime analyses, were there specific transcripts or groups of transcripts that were more informative of specific stages of maturation? You mention that ~81.5% were identified as differentially expressed by all methods and some transcript profiles are shown in Figure 4e, but were any informative genes or gene sets (i.e., markers) more useful for assessing maturation that would not require scRNA-seq? This information (which could be added in the supplement) might make your approach more accessible to the broader research community (i.e., the identification of new and informative markers of CM development or differentiation). Alternatively, it may be that scRNA-seq is required. If so, then this should be discussed. Finally, could you comment further on the application of entropy scores to study maturation and how your approach may be of value to the research community? A number of situations beyond comparisons of engineered and in vivo tissues, and somatic cell reprogramming protocols might include an evaluation of PSC-CMs for pharmaceutical and toxicity testing, and the prediction of pathways that may be essential for maturation of cells either through a gene regulatory network or through individual signaling pathways. While these experiments and discussion points are not necessary to support your conclusions, an evaluation of these points and limitations in the Discussion may broaden the paper's impact and significance.

      As minor critiques, there are a few typos (e.g., celltypes [cell types]), redundancies (e.g., ...transcript and protein level expression [...transcript and protein levels.]), and some improvements to the figures that could be made. For the latter, the font sizes are often too small (Figs 1, 3, 4, 5), as are some of the timepoints listed on the x axis (Fig 3a,d, 4b). Otherwise, the figures are visually informative, and the supplemental data are necessary to the assessment of the procedure.

      Significance

      The approach describe by Kannan et al. represents a significant advance over existing strategies to benchmark maturation states of PSC derivatives. Gene expression studies1 and transcriptome-based studies2-4 have been useful to estimate the developmental state of mouse and human PSC-CMs; however, most published studies have relied either on an assessment of a few markers or on data from a limited number of in vivo derived samples. These earlier studies were further limited by the confounding problem of heterogeneous cell populations. Omics based quantitative approaches have been proposed for improved maturation benchmarking and have proved valuable to study the differentiation of stem cells to progenitors and to committed lineages. 5-9 In this paper, Kannen et al. have improved upon these approaches and report the use of entropy scores to benchmark in vitro PSC-CM maturation against a gold standard of in vivo counterparts. The result is a reference resource that captures transcriptomic profiles from mouse CMs across a broad range of developmental states that will be particularly valuable to the cardiac field. By extending the assessments to include meta-analyses and cross-species comparisons (mouse versus human), they have established a workflow that results in a meaningful benchmark a cell's maturation state. Kannan et al., thus, have developed a quantitative and reproducible approach (entropy score) that simultaneously resolves issues of cell heterogeneity and estimates then in vivo maturation state of in vitro derived cells. This quantitative approach is likely to advance studies designed to assess drug and toxicity testing of more "adult-like" CMs, and adoption of this approach by the broader stem cell community will likely prove invaluable for the assessment of engineered tissues made from complex cell populations and for applications to regenerative medicine.

      Keywords: Reviewer's field of expertise Cardiovascular Physiology, Stem Cell Biology, Omics

      References:

      1. AC Fijnvandraat, et al., Cardiomyocytes derived from embryonic stem cells resemble cardiomyocytes of the embryonic heart tube. Cardiovascular Research 58, 399-409 (2003).
      2. E Poon, et al., Transcriptome-guided functional analyses reveal novel biological properties and regulatory hierarchy of human embryonic stem cell-derived ventricular cardiomyocytes crucial for maturation. PLoS ONE 8, e77784 (2013).
      3. CW van den Berg, et al., Transcriptome of human foetal heart compared with cardiomyocytes from pluripotent stem cells. Development (Cambridge, England) 142, 3231-3238 (2015).
      4. H Uosaki, et al., Transcriptional Landscape of Cardiomyocyte Maturation. Cell Reports 13, 1705-1716 (2015).
      5. D Grun, et al., De Novo Prediction of Stem Cell Identity using Resource De Novo Prediction of Stem Cell Identity using Single-Cell Transcriptome Data. Cell Stem Cell 19, 266-277 (2016).
      6. W Chen, AE Teschendorff, Estimating Differentiation Potency of Single Cells Using Single- Cell Entropy (SCENT). Comput. Methods for Single-Cell Data Analysis 1935, 125-139 (2019).
      7. M Guo, EL Bao, M Wagner, JA Whitsett, Y Xu, SLICE : determining cell differentiation and lineage based on single cell entropy. Nucleic Acids Res. 45, 1-14 (2017).
      8. AE Teschendorff, T Enver, Single-cell entropy for accurate estimation of differentiation potency from a cell's transcriptome. Nat. Commun. 8, 1-15 (2017).
      9. GS Gulati, et al., Single-cell transcriptional diversity is a hallmark of developmental potential. Science 367, 405-411 (2020).
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      To whom it may concern:

      We are thank the reviewers for their kind assessment of our work and its potential impact. Here we have outlined key points that we plan to address during revisions.

      1. The erect wing story could be investigated a bit further. We agree the erect wing phenotype is intriguing, and will try to improve our understanding. We plan to use fat-body specific c564-Gal4 or BaraA-Gal4 to express UAS-BaraA and attempt to rescue the phenotype. In this way, we will also give insight into whether erect wing can be rescued by immune-tissue or BaraA-endogenous tissue effects. We will note that the cause of erect wing may be due to a lack of BaraA during development and/or during the immune response, which will require careful investigations in the future.
      2. The in vitro antifungal data are modest. We agree. We will perform additional experiments to further corroborate these data to increase confidence in the trends observed.
      3. The nature of the genetic backgro__unds is not clear.__ We will do our best to explain the genetic background complications in the main text. We use w; **∆BaraA flies as an independent means of confirming isogenic data (and vice versa). We had to backcross the ∆BaraA mutation with an arbitrary genetic background prior to experiments to remove an off-site mutation that we detected in the antifungal gene Daisho2 (formerly IM14). As such, there is no appropriate wild-type control for these flies as the background is mixed. We include OR-R as a generic wild-type representative. OR-R flies survive bacterial infection like w; **∆BaraA in multiple assays, and so we feel that different immune competences of the genetic backgrounds is unlikely to explain major susceptibilities to fungal infection. We have additional data for bassiana R444 infection (Fig. 4C-D) with a second wild-type that we can include if desired, which shows similar trends when compared to w; **∆BaraA. We will also perform additional experiments with newly-generated isogenic flies to increase confidence in the trends, and to better inform on interactions between BaraA and other immune effectors. For other minor points, we will be happy to make suggested changes to improve clarity of the figures or methodology.

      Best regards,

      Mark Hanson and Bruno Lemaitre

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Hanson et al. have set out to investigate the BaraA gene, and show that the gene encodes for several immune induced molecule (IM) peptides, namely IM10, IM12 (and its sub-peptide IM6), IM13 (and its sub-peptides IM5 and IM8), IM22, and IM24. Flies lacking BaraA are viable but susceptible to specific infections, notably by the entomopathogenic fungus Beauveria bassiana. Furthermore, they show that BaraA is antimicrobial and, when combined with the antifungal Pimaricin, it inhibits fungal growth. In principle, this is a nicely written paper with interesting findings. The authors show induction of BaraA with different micro-organisms and where BaraA is expressed, using a BaraA reporter. The exploration of the genomic area, showing the duplication of the BaraA locus is really nice work. Also, the survival experiments show quite clear phenotypes and therefore effects for BaraA.

      Major comments:

      Line 153, related results: Fold induction of BaraA is greater with E. coli (~50) than with C. albicans (~20) or M. luteus (~6) - any comments on this? Also, infection times with these microbes are different - some comments about BaraA kinetics? Based on Fig 1B, BaraA looks to be highly induced by E. coli, although in Fig 1C, after 60h, reporter induction by E. coli is much less than with M. luteus. Some clarification about the kinetics of BaraA in these different infection models is needed.

      Erect wing phenotype in males: This is a bit surprising finding/interpretation. I have also seen erect wings in E. faecalis-infected flies, but I am not sure now in which flies I saw this; I have never tried to quantify this nor made any notes about females/males in this context. I normally use Myd88 RNAi (VDRC #25399) as a control in my experiments, and if they were the ones showing the erect wing phenotype in a prevalent manner, they would also lack BaraA (which is dependent on the Toll pathway function). At the time of doing my experiments, I just interpreted this in the way that the flies looked "sick", they were lifting their wings up and walking around rather than flying. When monitoring my survival experiments, I assumed that the ones with wings up were the ones dying next (the sickest). What is your interpretation; are the flies still ok or very sick, when this erect wing starts to appear?

      Minor comments:

      Wording: In the intro, line 78: "Many of the genes that encode these components of the immune peptidic secretome have remained largely unexplored." - I would say "had remained" until recently - especially the quite recent Bomanin work and work with Daisho1 & 2 have brought about a lot of new information about this "immune peptidic secretome".

      Fig 1A: What is BaraA called in DeGregorio et al? Can't find it (easily) from their lists. Please write BaraA into the Fig 1A graph, to make it clearer. Also, write somewhere in the text or Figure legend what the gene is called in DeGregorio et al (CG33470? CG18278? something else?)

      Line 238 Reference to Supplementary data file 1: In the supplementary data files I downloaded, I can't see the files numbered as data files 1 and so on. Instead, there are folders (Fly stocks, NF-kappaB sites, Primers used) and the files have names. Please clarify that the supplement names match the text.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. State what audience might be interested in and influenced by the reported findings.

      I think the significance of this work is great for Drosophila immunity researchers. The nature and mode of action of many of the Toll pathway -induced peptides is not known, so more information on them is much appreciated by the field. Also, studying molecules with potential antimicrobial activities is also potentially interesting for wider audience.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      The main Drosophila immunity pathways are the Toll and the Imd pathway, and when activated, several immune effector genes are induced. In 2015, a group of Toll pathway target genes was identified by mass spectrometry, that the authors here call "the immune peptidic secretome". (Clemmons AW et al., PLoS Pathogens 2015). Many of these peptide genes have been uncharacterized, although emerging studies have shed light to these findings in the past three years (Lindsay SA et al J. Inn. Imm. 2018; Cohen LB et al. Front.Imm. 2020). This research brings about new information on yet uncharacterized peptides in this group.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Drosophila melanogaster, innate immunity, humoral immunity, cellular immunity Toll pathway, Imd pathway, immune-induced molecules
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Hanson et al. investigated an antifungal gene they named BaraA, which codes for a protein that is proteolytically processed into 8 smaller peptides. BaraA expression is induced by Toll pathway signaling with minor input from the Imd pathway. It is expressed in the fat body upon immune challenge and expressed in other tissues such as head and eyes. Overexpression of BaraA increased the survival of animals defective in both IMD and Toll pathways. In vitro, the combination of the three major BaraA peptides displayed modest inhibitory effect on fungal pathogens when combined with the antifungal drug Pimaricin, however BaraA peptides alone showed little or no antifungal activity. BaraA deficient mutants showed little to no significant difference in bacterial resistance but appeared to show susceptibility to fungal infection; this fungal susceptibility was independent of the Bomanins. Male BaraA mutants also displayed an erected-wings phenotype when subjected to infection.

      There are 3 key findings:

      • BaraA overexpression conferred protection against fungal infection.
      • BaraA-derived peptides displayed antifungal activity in vitro in conjunction with Pimaricin, in vitro
      • Loss of BaraA decreased fungal resistance.

      Major Concerns:

      The results from the overexpression experiments were clear. However, the second and third findings were less convincing.

      • The cocktail of IM10-like BaraA peptides showed significant synergy with Pimaricin in killing C. albicans at only one dose out of the five tested, and this combination has modest (19-29%) inhibition on hyphae growth of B. bassiana. The in vitro antifungal experiments might be more compelling if other fungi were examined and/or combinations with other antifungals were investigated, where synergy might be more robust.
      • The most problematic issue with this data is the control of genetic background in the study of the BaraA mutant strains. Much of the survival data compares mutant strains (BaraA and/or Bom∆) with Oregon-R as a wildtype. As best we can tell, the BaraA and Bom strains are not in the genetic background and neither is particularly similar to OR-R. If the authors can justify the use of OR-R as the wildtype control for these experiments, they should do so explicitly. Otherwise, these experiments are very difficult to interpret. This issue is highlighted by other data, where genetic background is carefully controlled, in the iso-w background, and the survival phenotypes are much more mild, and do reach significance is some infections, by log-rank analysis. All experiments should be performed in this controlled background to enable firm conclusions and interpretations.

      Minor comments:

      • Figure 1A mined data from a previous published study, which is acceptable, but this data presentation lacks proper description of the methodology, reproducibility, and statistics.
      • The authors need to clarify the condition of the flies in Figures 1D to G (as well as S1C and D). Infected? Baseline? It is not clear.
      • There is no visualization of the genomic location of the BaraA deletion, which should be added to figure 2C.
      • The authors should include the full genotype information for the Bloomington stocks, since the BL numbers may change over time.
      • In Figure 2C, the authors should include some information about which lines possess the single BaraA locus and which lines have the duplication event.
      • The author should elaborate on what is known about Dso2 and how the aberrant Dso2 locus might affect their assays. The info here is incomplete and confusing.
      • Does the Ecc15 strain used in the paper innately resist Ampicillin? If yes, then the result of Ecc15 resisting the combination of IM cocktail and Ampicillin does not reveal much.
      • It is unclear what the concentration of pimaricin was used for Figure 3E.
      • The authors should include a clear genetic explanation for their conclusion that BaraA and Bomanins function independently. The text describing this double mutant analysis could be more informative.
      • BaraA overexpression significantly improved female survival against M. luteus (Figure S4C, p=0.006), this is interesting but not mentioned in the text.
      • The author should be clear and consistent about the pathogen source (lab grown vs. commercial) and method of infection (natural infection vs. septic injury). The authors should explain the difference in virulence between different infection models and methods.
      • The sex-specific erected wings phenotype is interesting, but does not contribute to the overall significance of the manuscript. The authors should consider moving Figure 6 to the supplement.

      Significance

      This work is a potential step in characterizing the immune effectors downstream of the Toll pathway that contribute to the Drosophila defense against fungal pathogens. These effectors so far have not been characterized and understood. We are familiar with the Toll pathway and its effectors, but in no way are experts.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors use the fruitfly Drosophila melanogaster as a model to study innate immunity. In this manuscript, they study the effects of a set of antimicrobial peptides (AMPs) that are produced by furin cleavage of a larger precursor (Baramicin A, BaraA). Bara A is immune-induced in a Toll-dependent manner and has antifungal activity. Somewhat in line with expression in non-immune tissues, BaraA mutants show ab erect-wing phenotype in males.

      Major comments:

      The experiments are well-presented in a reproducible and statistically sound way. In particular care is taken to control effects of the genetic background. The immune phenotype of BaraA mutants is somewhat subtle but convincing and in line with recent findings by the same authors that some of the recently created CRISPR/Cas mutants in antimicrobial peptides have broader effects while others target intruders in a more specific manner or in combination with other AMPs. These are very relevant studies, which provide a balanced view of innate immunity in particular AMP action. I have one comment about the (BarA dependent, male-specific) erect wing phenotype: this is an interesting observation, which could stimulate work by others, I guess this is one reason why it was included in the manuscript. On its own it stands out a bit in the manuscript since in contrast to other parts, where insight into the underlying mechanisms is provided, this is not the case for the erect wing phenotype. The authors speculate about the non-immune expression, which may be responsible. One might use tissue-specific knockdown or rescue to check up on this (wing muscle or nervous system). This would be cost effective but delay publication for a few months. It depends a bit on the respective journal policy and the plans for further investment of the groups involved whether the phenotype is considered part of BaraA pleiotropism (which I could buy) or is considered too descriptive and should be used later for a later publication. Along similar lines, while sex-specific immune phenotypes are highly interesting, they open up many discussions about the underlying causes, both proximal and ultimate.

      Minor comments:

      The experiments look sound and previous work is mentioned sufficiently. The experimental design and results are easy to follow. I have mentioned some concerns about the erect wing phenotype (see above). Is there any evidence for metabolic regulation of BaraA (TF binding sites for example) in particular in the promoter fragment used for the reporter line? Did any of the fat body drivers show the same effect as the ubiquitous actin driver (this would increase specificity).<br> Why was pimaricin used, it seems presently as a representative of membrane-active antifungal drugs, which BaraA peptides are likely not. Still, using combinations with other insect (Drosophila) antifungal AMPs would be more physiological, maybe this was tried and did not work, but should still be discussed. Or do the authors want to imply that physiologically the Daisho peptides or Bomanins have this effect? Perhaps elaborate on this.

      In Fig. 1: part H is missing although mentioned in the legend.

      In the abstract:

      it should be more clearly mentioned that the erect wing phenotype was observed in the mutants. line 27 and 28, replace one "characterized" line 28: contribute line 33: entomopathogenic

      other places:

      line 68: AMPs

      Significance

      Significance:

      The evolutionary relevance and therapeutic potential of AMP synergism is an emerging topic both within insect immunity, innate immunity in general and its use in patient treatment [1, 2]. The latter aspect may be interesting to justify the use of pimaricin. Thus, the work presented here in combination with previous work from the authors leads to a more balanced view of the action of insect AMPs (the authors call that the logic of the Drosophila effector response) with implications for human innate immunity and perhaps even therapy of diseases. Therefore, the data will be interesting for a broad audience. The use of models such as Drosophila, which can be manipulated in a targeted manner has provided insights that are beyond the study of single AMPs in vitro. Still, using overexpression as done in some cases here should be interpreted cautiously and should - if available - compared to data on in vivo concentration of AMPs (the authors try to derive estimates from the MS data), which may be difficult in case there are large local differences.

      My own field:

      primarily insect immunity with a background in mammalian immunity, although I am not able to keep up with all recent development in mammalian immunity.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Trypanosoma brucei causes African sleeping sickness and related cattle disease, both diseases that urgently need new therapeutics. One reason for the lack of a drug or a vaccine is the parasite's way to escape the immune system: their cell surface is covered by the variant surface glycoprotein (VSG) of which many variants exist, but only one is expressed. The switching between the different VSG forms is called antigenic variation and involves a not fully understood epigenetic mechanism. It is essential for the parasite's survival that the VSG surface coat is very dense at any given time: antibodies of the host should not be able to recognise any invariant proteins on the cell surface that are 'hidden' in between the VSG molecules. Consequently, the VSG protein is the most abundant protein in the cell (10% of total). This high protein abundance is achieved by both transcriptional and posttranscriptional mechanisms. One major posttranscriptional mechanism is the stabilisation of the VSG mRNA. Two cis-elements in the VSG mRNA 3´UTR have been known for a long time to be essential for this stability (an 8-mer and a 16-mer). However, nothing was known about the underlying mechanism of VSG mRNA stabilisation. In this work, the authors have addressed this problem. They have purified the VSG mRNA from trypanosomes in two very different ways and, in both approaches, they found the cyclin F-box protein 2 (CFB2) to co-purify. They have defined the full complex that binds to the VSG mRNA. Most importantly, the authors could clearly show the very specific effect on VSG mRNA stability when CFB2 was RNAi depleted. Moreover, CFB2 RNAi mostly phenocopied the phenotype that was previously described for VSG RNAi. The CFB2 protein is present in a very low copy number and the authors provide data suggesting that it may be tightly autoregulated by interaction with SKP1. The authors further show that the regulation of VSG mRNA stability by CFB2 depends on the 16-mer cis-element, but not on the 8-mer. The data are, throughout, very convincing, experiments are done with all the essential controls and the data are well presented. The conclusions are supported by the data. The authors have, beyond any doubt, finally identified the major posttranscriptional regulator protein that is responsible for VSG mRNA stability, a milestone in the field, and provide a mechanism on how it could work and be autoregulated. I only have one major point (and a few very minor points)

      My main criticism is on the introduction: major information is missing here or presented far too short. People from outside of the trypanosome field will find the paper almost impossible to understand. It is important to explain the life cycle and its stages (as these are mentioned later) as well as the parasites special transcription of mRNAs by PolI and PolII in more detail. Trypanosome translation initiation factors and PABPs should be introduced. Nomenclature of the VSG is also a confusing throughout. Why switching to VSG4 in Figure 8 for example. Also, it would be beneficial to phrase the question better and stress the importance of why this needs to be answered to understand the basic biology of the parasite.

      R: We have extended the Introduction section as suggested. The reason for the switch is now explained.

      **Minor stuff:** Line 76: ' supporting direct binding to mRNA in vivo' Is this true? I thought the poly(A) oligos can also purify protein complexes? (but I may be wrong)

      1. Yes, but probably not very much when the complexes have been washed with lithium chloride and urea. In any case, the readers can find in the supplementary Table 1 the false discovery rate (FDR) values obtained for each identified protein for both purifications (oligodT and VSG/Tub antisense) taking into consideration the data from the control experiments.

        Line 104: 'Kinetoplastid specific'. Better 'Trypanosome specific' if its absent in Leishmania? The correlation between presence of antigenic variation and number of CFB could be worked out a little better, perhaps presented in a main Figure.

      2. CFB genes are absent in Leishmania; thus, we have edited it as suggested. Since we do not actually know whether it has any biological meaning, we have also removed the association between the presence of multiple copies of CFB genes and antigenic variation.

        Line 161: Tb927.8.1945, ad: 'encoding a hypothetical protein of unknown function'.

      3. Done. Line 202: MG132, better: 'the proteasome inhibitor MG132' R. Done. Line 310-311: no, best to delete this sentence.

      We prefer to leave it in.

      Reviewer #1 (Significance (Required)):

      There is no doubt about this being a truly significant contribution to the trypanosome field. Method-wise, it is also a nice example of how mRNA binding proteins can be identified and validated and there are clear mechanistic insights here into the regulation of the VSG mRNA. This is not frequently found, in any organism. I believe that this work will be publishable in any parasitology journal, and, once the introduction has been changed (see above) also in any RNA journal.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The current study describes the isolation and characterisation of Variant Surface Glycoprotein (VSG) mRNA-bound proteins in the bloodstream form African trypanosome. CFB2 is identified as a VSG mRNA positive regulator which depends upon a conserved 16mer in the VSG mRNA 3'-UTR.

      1. The authors state in their abstract that "CFB2 is essential for VSG mRNA stability". They also "describe cis-acting elements within the VSG 3'-untranslated region that regulate the interaction". Expression of a GFP reporter appears to be reduced by only ~3-fold in bloodstream-form cells when the relevant cis-acting element (the 16mer) is removed, however (Fig. 8B). This would suggest that the mRNA lacking the 16mer could still be relatively stable ("VSG mRNA is extremely stable, having a half-life of 1-2h compared with less than 20 min for most other mRNAs").

      Was half-life measured for an mRNA lacking a 16-mer or for VSG mRNA in cells lacking CFB2?

      1. Yes, this was previously published, references are in the Introduction. The presence of the 16-mer in VSG is essential for survival in the T. brucei bloodstream stage (PMID: 28906055). Could CFB2 impact mRNA maturation rather than stability?

      2. The reporter experiments rule this out since in Kinetoplastids, the 3'-UTR sequence has no role in controlling polyadenylation, beyond a preference for sites with several A residues. This is now explicitly stated. Also, which data demonstrate an altered interaction between CFB2 and the mRNA lacking a 16mer? The authors could consider adjusting these statements and also the quantitative impact that CFB2 has on VSG mRNA stability, as well as evidence supporting differing interactions between CFB2 and mRNAs containing or lacking the 16mer.

      We do now show new data demonstrating that binding of CFB2 to the reporters depends on the VSG 3'-UTR and is unaffected by the 8-mer mutation. Unfortunately, the GFP-VSGm16mer mRNA was too low in abundance to quantitate, even by qPCR. The 16-mer and 8-mer are the only sequences in the 3'-UTR that are conserved in different VSG mRNAs. Binding to the upstream UC-rich region remains a theoretical possibility but it seems very unlikely since this region is variable and such sequences are present in numerous other 3'-UTRs (for example, the alpha tubulin 3'-UTR, which is the first we looked at, includes the sequence CCUUCCUUCCCCUU). Our preliminary results suggest indeed that region is not involved (Suppl. Fig 13E). And in that case, why would mutating the 16-mer affect the response to CFB2 expression? We cannot rule out the possibility that CFB2 binds to m6A - it's a chicken-and-egg problem, because mutation of the 16-mer eliminates the methylation. However, this too seems unlikely since m6A is by no means restricted to VSG (https://doi.org/10.1101/2020.01.30.925776; PMID: 30573362). To find out it would be necessary to identify the m6A "writers", and reduce their expression; this is well beyond the scope of this manuscript and is being actively pursued in another lab. An alternative would be to express soluble CFB2 for in vitro binding studies, but so far this has not been possible despite several attempts.

      In relation to point 1 above, Fig. 2A and Fig. 3D show CFB2 binding to the VSG 3'-UTR, to the 16mer in the latter case. This interaction could be presented as a 'model' whereas it seems too speculative to be included in the current data-Figures. Indeed, the authors "suggest that CFB2 recognizes the 16mer" in their Discussion and do also consider alternatives. A caveat has been added to the Figure 3D legend.

      Given the emphasis on the experimental approach and "the potential to supply detailed biological insight into mRNA metabolism in any eukaryote" (end of abstract), can the authors explain how their method improves upon / differs from the approach of Theil et al., 2019 and other similar approaches?

      Our approach is slightly different to the one described by Theil et al. (antisense oligo length, incubation temperature) and a detailed description of our protocol can be found in the Methods section. We have stressed the method because there is only one previous successful example attempting the purification of the protein bound to a native mRNA. Our intention is not to compare approaches but to encourage researchers willing to perform these experiments in a variety of other organisms.

      **Other points:** i. Fig. 2B: Why does N-GFP- SBP migrate more slowly in the Tet+ eluate? Also why does the slower-migrating form of the protein appear to dominate in Fig. 2C?

      1. N-GFP-SBP protein migrates as a single band. In Fig. 2C, the membrane was first probed with anti-RBP10 and then with anti-GFP antibodies. What is observed in the input and flow-through (I/FT) is RBP10 signal and not GFP. The concentration of N-GFP-SBP in the eluate is much higher than in the I/FT (it is the only protein visualized upon Ponceau staining in eluates). That causes the band to appear in the eluate as “ghost band” (ECL reagent is consumed in the middle region of the band) while in the I/FT, the concentration is still not enough to give a signal. The same occurs in Fig. 2B. The faint bands that are seen in the I/FT in Fig. 2B are likely products of cross-reactivity.

        ii. Fig. 3D: What's the evidence that SKP1 interacts with VSG-mRNA-bound CFB2? Is this protein enriched in the data shown in Fig. 1C and can the relevant data-point be labelled?

      2. Our interactome capture results (PMID: 26784394) suggest that in bloodstream form, Skp1 (Tb927.11.6130) do not bind poly(A) RNA directly; thus, it is not enriched in the VSG mRNA-bound proteome. What we know is that Skp1 interacts, in a Y2H setting, with CFB2 and that mutations in the CFB2 F-box domain abolish this interaction. The data we have presented suggest the interaction with Skp1 regulates CFB2 levels. We actually do not know whether Skp1 binds to free or to VSG-mRNA-bound CFB2.

        iii. There are four other highly abundant mRNAs in Fig. 4C. Are these related to VSG expression?

      They are tubulins, EF1, HSP83 and HSP70.

      iv. Lines 85-88: Suggest citing the studies used to prioritise RBPs, expressed only in the bloodstream form, that increase mRNA stability or translation when "tethered" to an mRNA. R. References have been added.

      Is CFB2 expressed only in the bloodstream form?

      Yes, this is described in more detail later.

      v. We spotted a number of other potential corrections, including: Lines 161 and 171; should '4E' be '4C'? Line 202; explain MG132. Define RPM, ns, BS, ++ etc in the Figures. Yeast-2-hybrid and CAT may be standard assays, but we suggest briefly describing them in the Methods section. Done.

      Reviewer #2 (Significance (Required)):

      Post-transcriptional control of gene expression by mRNA binding proteins (RBPs) is an area of major current research interest and activity. Much remains unknown regarding control of mRNA stability, nuclear export or translation and there are many uncharacterised or only partially characterised RBPs in eukaryotic cells. Trypanosomes present an important model in this context since global polycistronic transcription places a major emphasis on post-transcriptional controls. They are also important parasites. The variant surface glycoprotein is a key virulence factor and one of the few genes that is under transcriptional control in African trypanosomes, yet RBPs are thought to be important for generating/maintaining the highly abundant VSG mRNA in bloodstream form cells (and for low abundance in the insect stage), possibly via interaction with the highly conserved regulatory elements in the 3'-UTR.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The current study describes the isolation and characterisation of Variant Surface Glycoprotein (VSG) mRNA-bound proteins in the bloodstream form African trypanosome. CFB2 is identified as a VSG mRNA positive regulator which depends upon a conserved 16mer in the VSG mRNA 3'-UTR.

      1. The authors state in their abstract that "CFB2 is essential for VSG mRNA stability". They also "describe cis-acting elements within the VSG 3'-untranslated region that regulate the interaction". Expression of a GFP reporter appears to be reduced by only ~3-fold in bloodstream-form cells when the relevant cis-acting element (the 16mer) is removed, however (Fig. 8B). This would suggest that the mRNA lacking the 16mer could still be relatively stable ("VSG mRNA is extremely stable, having a half-life of 1-2h compared with less than 20 min for most other mRNAs"). Was half-life measured for an mRNA lacking a 16-mer or for VSG mRNA in cells lacking CFB2? Could CFB2 impact mRNA maturation rather than stability? Also, which data demonstrate an altered interaction between CFB2 and the mRNA lacking a 16mer? The authors could consider adjusting these statements and also the quantitative impact that CFB2 has on VSG mRNA stability, as well as evidence supporting differing interactions between CFB2 and mRNAs containing or lacking the 16mer.
      2. In relation to point 1 above, Fig. 2A and Fig. 3D show CFB2 binding to the VSG 3'-UTR, to the 16mer in the latter case. This interaction could be presented as a 'model' whereas it seems too speculative to be included in the current data-Figures. Indeed, the authors "suggest that CFB2 recognizes the 16mer" in their Discussion and do also consider alternatives.
      3. Given the emphasis on the experimental approach and "the potential to supply detailed biological insight into mRNA metabolism in any eukaryote" (end of abstract), can the authors explain how their method improves upon / differs from the approach of Theil et al., 2019 and other similar approaches?

      Other points:

      i. Fig. 2B: Why does N-GFP- SBP migrate more slowly in the Tet+ eluate? Also why does the slower-migrating form of the protein appear to dominate in Fig. 2C?

      ii. Fig. 3D: What's the evidence that SKP1 interacts with VSG-mRNA-bound CFB2? Is this protein enriched in the data shown in Fig. 1C and can the relevant data-point be labelled?

      iii. There are four other highly abundant mRNAs in Fig. 4C. Are these related to VSG expression?

      iv. Lines 85-88: Suggest citing the studies used to prioritise RBPs, expressed only in the bloodstream form, that increase mRNA stability or translation when "tethered" to an mRNA. Is CFB2 expressed only in the bloodstream form?

      v. We spotted a number of other potential corrections, including: Lines 161 and 171; should '4E' be '4C'? Line 202; explain MG132. Define RPM, ns, BS, ++ etc in the Figures. Yeast-2-hybrid and CAT may be standard assays, but we suggest briefly describing them in the Methods section.

      Significance

      Post-transcriptional control of gene expression by mRNA binding proteins (RBPs) is an area of major current research interest and activity. Much remains unknown regarding control of mRNA stability, nuclear export or translation and there are many uncharacterised or only partially characterised RBPs in eukaryotic cells. Trypanosomes present an important model in this context since global polycistronic transcription places a major emphasis on post-transcriptional controls. They are also important parasites. The variant surface glycoprotein is a key virulence factor and one of the few genes that is under transcriptional control in African trypanosomes, yet RBPs are thought to be important for generating/maintaining the highly abundant VSG mRNA in bloodstream form cells (and for low abundance in the insect stage), possibly via interaction with the highly conserved regulatory elements in the 3'-UTR.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Trypanosoma brucei causes African sleeping sickness and related cattle disease, both diseases that urgently need new therapeutics. One reason for the lack of a drug or a vaccine is the parasite's way to escape the immune system: their cell surface is covered by the variant surface glycoprotein (VSG) of which many variants exist, but only one is expressed. The switching between the different VSG forms is called antigenic variation and involves a not fully understood epigenetic mechanism. It is essential for the parasite's survival that the VSG surface coat is very dense at any given time: antibodies of the host should not be able to recognise any invariant proteins on the cell surface that are 'hidden' in between the VSG molecules. Consequently, the VSG protein is the most abundant protein in the cell (10% of total). This high protein abundance is achieved by both transcriptional and posttranscriptional mechanisms. One major posttranscriptional mechanism is the stabilisation of the VSG mRNA. Two cis-elements in the VSG mRNA 3´UTR have been known for a long time to be essential for this stability (an 8-mer and a 16-mer). However, nothing was known about the underlying mechanism of VSG mRNA stabilisation. In this work, the authors have addressed this problem. They have purified the VSG mRNA from trypanosomes in two very different ways and, in both approaches, they found the cyclin F-box protein 2 (CFB2) to co-purify. They have defined the full complex that binds to the VSG mRNA. Most importantly, the authors could clearly show the very specific effect on VSG mRNA stability when CFB2 was RNAi depleted. Moreover, CFB2 RNAi mostly phenocopied the phenotype that was previously described for VSG RNAi. The CFB2 protein is present in a very low copy number and the authors provide data suggesting that it may be tightly autoregulated by interaction with SKP1. The authors further show that the regulation of VSG mRNA stability by CFB2 depends on the 16-mer cis-element, but not on the 8-mer.

      The data are, throughout, very convincing, experiments are done with all the essential controls and the data are well presented. The conclusions are supported by the data. The authors have, beyond any doubt, finally identified the major posttranscriptional regulator protein that is responsible for VSG mRNA stability, a milestone in the field, and provide a mechanism on how it could work and be autoregulated. I only have one major point (and a few very minor points)

      My main criticism is on the introduction: major information is missing here or presented far too short. People from outside of the trypanosome field will find the paper almost impossible to understand. It is important to explain the life cycle and its stages (as these are mentioned later) as well as the parasites special transcription of mRNAs by PolI and PolII in more detail. Trypanosome translation initiation factors and PABPs should be introduced. Nomenclature of the VSG is also a confusing throughout. Why switching to VSG4 in Figure 8 for example. Also, it would be beneficial to phrase the question better and stress the importance of why this needs to be answered to understand the basic biology of the parasite.

      Minor stuff:

      Line 76: ' supporting direct binding to mRNA in vivo' Is this true? I thought the poly(A) oligos can also purify protein complexes? (but I may be wrong)

      Line 104: 'Kinetoplastid specific'. Better 'Trypanosome specific' if its absent in Leishmania? The correlation between presence of antigenic variation and number of CFB could be worked out a little better, perhaps presented in a main Figure.

      Line 161: Tb927.8.1945, ad: 'encoding a hypothetical protein of unknown function'.

      Line 188, 216, 246: typos/grammar, also: 8mer or 8-mer (decide for one)

      Line 202: MG132, better: 'the proteasome inhibitor MG132'

      Line 310-311: no, best to delete this sentence.

      Significance

      There is no doubt about this being a truly significant contribution to the trypanosome field. Method-wise, it is also a nice example of how mRNA binding proteins can be identified and validated and there are clear mechanistic insights here into the regulation of the VSG mRNA. This is not frequently found, in any organism. I believe that this work will be publishable in any parasitology journal, and, once the introduction has been changed (see above) also in any RNA journal.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We wish to thank the reviewers for their detailed and constructive comments on our manuscript. This valuable feedback has resulted in substantial improvements to our paper. A detailed list addressing the reviewers’ comments and the changes to our manuscript since the first submission is outlined below:

      Reviewer #1:

      The manuscript by Martens et al investigates the mechanisms of Bnip3-mediated cell damage during hypoxia. The Authors show that modulation of prostaglandin (PG) E1 signaling with misoprostol prevents cardiac dysfunction, mitochondrial impairment and cell death induced by hypoxia. In addition, they show that the effect of misoprostol is dependent on PKA-mediated Thr181 phosphorylation. The Authors also suggest that there is a possible interaction between Bnip3 and 14-3-3beta that prevents ER Ca2+ release and mitochondrial Ca2+ overload. The Authors conclude that Bnip3 phosphorylation plays a key role in the regulation of cardiac and metabolic dysfunction and identify misoprostol treatment as a therapeutic intervention to prevent hypoxia-induced cardiac injury.

      **Major Comments:**

      1.Results presented in Figs. 1, 2 and 3 pertaining to hypoxia-induced changes in Bnip3 expression, changes in mitochondrial function, cell death, Bnip3-dependent Ca2+ transfer from ER to mitochondria and the effect of misoprostol have partially been demonstrated in a previous publication from the same group (PMID: 30275982).

      Thank you for noting our previous work using predominantly the HCT-116 cell line and a rat model of neonatal hypoxia published in the journal Cell Death Discovery. The work in the current manuscript builds on our previous papers, and extends these findings utilizing a neonatal mouse model, primary neonatal ventricular myocytes, human iPSC-derived cardiomyocytes, and H9c2 cells. These models not only demonstrate the robust nature of the effects of misoprostol treatment on the hypoxic neonatal cardiomyocytes, but they also allowed us to utilize powerful genetic models, such as the Bnip3 knockout mouse, and knockout mouse embryonic fibroblasts (MEFs), which phenocopy many of the effects of misoprostol treatment. These findings strongly implicate Bnip3 as a primary target of misoprostol treatment in the hypoxic neonatal heart in rodents.

      In addition, Figure 1 contains very important in vivo endpoints that we have not previously utilized, including echocardiography to assess neonatal cardiac function, transmission electron microscopy to assess mitochondrial ultrastructure, cardiac ATP and lactate levels, an array of gene expression, and HMGB1 immunofluorescence (Fig.1 I in the current version of the manuscript) implicating a necro-inflammatory phenotype that is modulated by misoprostol treatment. Moreover, in Figures 2 and 3 we confirm previous observations related to hypoxia- and Bnip3-mediated mitochondrial function, and calcium signaling, but also extend these observations to include the impact of hypoxia, Bnip3 and misoprostol treatment on mitochondrial morphology and necro-inflammatory markers, in additional to utilizing human cardiomyocytes and knockout MEFs. Finally, Figures 4-7 of our manuscript describes a novel mechanism by which misoprostol treatment can therapeutically target Bnip3 function both in vivo and in cell models (see below).

      Although based on our previous observations, we feel the work in the present manuscript is highly novel and original, and adds substantially to our knowledge of both Bnip3 function, and neonatal hypoxic injury, which currently represents a world-wide health crisis that is underrepresented in the biomedical literature.

      2.The very same publication from 2018 shows that misoprostol treatment of pups exposed to hypoxia for 7 days is able to prevent the increase in Bnip3 protein levels. Yet, in the present manuscript misoprostol treatment had no effect on Bnip3 protein levels in the same model (Fig. 1E). This raises some concerns regarding the solidity and soundness of the results presented. Thank you for noting this in our previous work. In our 2018 paper, we treated hypoxic neonatal rats with misoprostol and observed a complete repression of Bnip3 expression in the gut and hippocampus, but only a partial repression in the heart. This observation prompted us to explore other mechanisms by which misoprostol could inhibit Bnip3 function. We have increased our sample size for the data in Figure 1 J and K to be more statistically conclusive. The evidence is now stronger that misoprostol only the partial represses Bnip3 expression in the neonatal mouse heart. In addition, we provided a representative western blot (Fig. 1 J), which is consistent with the result in Figure 3C in MEF cells.

      3.The validation of the custom antibody against p-Thr181 needs to be shown. Fig. 4E shows that p-Bnip3 band is quite strong in H9c2 cells, despite total endogenous Bnip3 levels are barely detectable. In addition, phosphorylation of the Bnip3 Thr181 residue in cells and/or in vivo should be confirmed by mass spectrometry.

      Our plan in the next revision, we will be to provide additional validation of the p-Thr181 antibody, as we have done previously (PMID: 33044904). In addition, we will re-run the western blots noted above to improve their quality, as the difference between total Bnip3 and p-Bnip3 in H9c2 cells is likely due to different exposures of the two blots. Confirmation of Bnip3 phosphorylation using mass spectrometry in extracts from intact cells was previously published (PMID: 26102349), however, the nature of the signaling pathways leading to phosphorylation was not determine, nor was the mechanism of Bnip3 inhibition previously determined.

      4.Fig. 4L shows that misoprostol treatment of H9c2 cells leads to an increase in Bnip3 phosphorylation, but this does not seem to be the case in normoxic conditions in vivo (Fig. 4N). Moreover, shouldn't this presumable increase in phosphorylation induced by misoprostol in normoxic conditions lead to Bnip3 accumulation in the cytosol thereby reducing its colocalization with mitochondria (Fig. 6B)? The results obtained with the colocalization method should be corroborated using different methods, such as cell fractionation.

      In the previous version of the manuscript, we reported that misoprostol treatment increases Bnip3 phosphorylation in H9c2 cells following acute exposure (Supplement 5 B). In the updated version of the manuscript, we confirmed this in vitro observation in PVNC’s, demonstrating that in culture, acute misoprostol drug treatments during normoxia result in Bnip3 phosphorylation (Supplement 5 C). However, when we increased our N in the revised manuscript this difference remained not statistically sustained after 7 days of misoprostol treatment in vivo (Fig. 4 M, N). Importantly though, our observation that hypoxia exposure resulted in reduced Bnip3 phosphorylation, and that misoprostol drug treatment was sufficient to restore it is particularly novel, and ties together with our new colocalization data (Fig. 4 M, N). What is very intriguing about the data in Figure 6B is that myc-Bnip3 did not colocalize with the mitochondrial matrix-targeted mito-Emerald under normoxic conditions, and thus was not impacted by misoprostol treatment in normoxia. However, in hypoxic cells the colocalization coefficient between myc-Bnip3 and mito-Emerald increased and was abrogated by misoprostol treatment. This observation suggests that Bnip3 is actively translocated deeper into the mitochondria ultrastructure during hypoxic stress, and that misoprostol treatment can prevent this phenomenon. This observation is consistent with the pBnip3 data shown in Figure 4M. In the most recent version of the manuscript, we have performed additional confocal experiments to substantiate this novel observation. New data clearly demonstrates that hypoxia exposure in vivo increases the colocalization of Bnip3 with the inner mitochondrial membrane protein Opa1 (Fig. 6 C). However, when mice are treated with misoprostol, the colocalization with Opa1 is reduced and the colocalization of Bnip3 with 14-3-3b increases (Fig. 6 M). We have also shown in H9c2 cells that expression of Opa1 prevents Bnip3-induced mitochondrial fission (Fig. 3 J), and that when both Bnip3 and 14-3-3b are ectopically expressed, misoprostol treatment can increase their colocalization (Fig. 6 N, O), suggesting that this is regulated by post-translational modification and not alterations in Bnip3 expression due to hypoxia. Our plan is to include additional fractionation experiments in the next revision of the manuscript; however, this approach may not be as sensitive as confocal microscopy.

      5.In relation to Fig. 4 M, N (page 19), the Authors concluded that the reduction in Bnip3 phosphorylation suggests an increase in Bnip3 activity in the hypoxic neonatal hearts. Nevertheless, this has not been demonstrated.

      Thank you for pointing this out. At this time, we do not have data to suggest that a reduction in Bnip3 phosphorylation increases its activity in vivo. In the revised manuscript, we have new confocal-based colocalization experiments using fixed sections from hypoxic and misoprostol treated hearts that provide insightful information into the subcellular localization of Bnip3 (Please see above; Fig. 6 C, M). Importantly, based on our data in figure 5, particularly Fig.5F using Bnip3-null MEFs, the protective effect of misoprostol is completely prevented by reconstitution of the T181A mutant, but not wild-type Bnip3, suggesting phosphorylation at T181 is an important mechanism by which misoprostol inhibits Bnip3-induced mitochondrial depolarization. Finally, we have also been careful not to overstate our conclusions is the most recent version of the manuscript, have been more specific with our language, and have avoided vague terms like ‘activity’.

      6.Along that line, the Authors concluded that misoprostol-induced cytoprotection is dependent on PKA Thr181 phosphorylation. Nevertheless, this dependence has not been convincingly demonstrated in hypoxic cells and in vivo.

      The new data described outline above, in point #4 and #5, have provided assurances regarding the role of Bnip3 phosphorylation on its subcellular location in vivo and in cultured cells. To further address the dependency of PKA on T181 phosphorylation, we have performed experiments using the PKA inhibitor, H89, in cellular experiments and evaluate whether the protective effect of misoprostol is lost in the presence of this inhibitor. This new data has been added to the most recent version of the manuscript (Fig. 4 G).

      7.Previous studies showed that Bnip3 induces mitochondrial fragmentation and mitophagy (PMID: 16645637, 20436456). What is the hypothesis for the inhibition of mitochondrial fragmentation induced by misoprostol in the present study? Does it prevent Bnip3 interaction with Opa1 or is this event downstream of ER Ca2+ release and mitochondrial Ca2+ overload? Does misoprostol affect mitophagy?

      We have added new experimental data to address the hypothesis that Bnip3 colocalizes with Opa1 to induce mitochondrial fragmentation (as noted by the reviewer, they were previously shown to physically interact), and that this is inhibited by misoprostol treatment (Fig. 6 C). We have also added new data to the supplemental material demonstrating that misoprostol inhibits hypoxia- and Bnip3-induced mitophagy. Based on our data, we proposal that misoprostol inhibits both Bnip3-induced ER-calcium release and Opa1-dependent mitochondrial fusion. This is based on our data that misoprostol prevents Bnip3 accumulation at both the ER and mitochondria, respectively.

      8.The link between Bnip3 interaction with 14-3-3 and Bnip3 Thr181 phosphorylation, if there is any, is not clear. The Authors mention that Thr181 lies within the 14-3-3 binding domain. Is Thr181 phosphorylation required for 14-3-3 binding or are these events unrelated? What is the significance of these events in hypoxia, does 14-3-3 binding to Bnip3 occur in vivo? Is Bnip3 localization affected by hypoxia, 14-3-3 binding and/or misoprostol treatment in vivo?

      Previously, we described the role of phosphorylation of Bnip3L (Nix) at Ser-212 how this regulates the interaction with 14-3-3 (PMID: 33044904). This phosphorylation site is conserved in Bnip3 as T181. Interestingly, phosphorylation of Nix by PKA was not required for interested with 14-3-3b, but the interaction between Nix and 14-3-3b was enhanced by phosphorylation. Our plan is to perform similar experiments with Bnip3 and 14-3-3b to determine if this mechanism is conserved. However, as noted above we have new in vivo data showing that misoprostol increased the colocalization of Bnip3 and 14-3-3b in the hypoxic heart (Fig. 6 M).

      9.Fig. 6P shows the presence of myc-tag after IP for HA-tag, even when HA-14-3-3 was not expressed (middle lane). How is this possible?

      This appears to be a small amount of non-specific interaction between the HA antibody and myc-Bnip3. This is relatively small compared to the band in lane 3, which demonstrates specificity, and the importance of including this control condition. Our plan is to re-run this CO-IP to improve the western blot quality.

      **Minor Comments:**

      1.Please co-stain with the cardiomyocyte marker in Fig. 2A (such as alpha-actinin).

      Yes, good suggestion.

      2.The Methods are not sufficiently detailed. For instance, it is not clear what is the Ca2+ concentration used for Ca2+ pulses in the CRC experiment. The fact that cardiac mitochondria are able to uptake only two Ca2+ pulses raises some concerns regarding the quality of mitochondrial preparation. What is the reason for isolating mitoplasts instead of intact mitochondria?

      We have provided more detail in the revised manuscript.

      3.TMRM fluorescence should be measured before and after FCCP administration, to account for the difference in plasma membrane potential (the results should be expressed as F/FFCCP).

      We can provide some additional control experiments in the revised manuscript, if necessary.

      4.Measurement of extracellular acidification is mentioned in the methods, but the relative results are not shown.

      Thank you, this has been removed.

      5.RNAi experiments targeting Bnip3 are also mentioned in the methods, but the results are not described.

      Thank you. This has been fixed.

      Reviewer #1 (Significance (Required)):

      Previous studies have demonstrated that Bnip3 is upregulated by hypoxia and plays a key role in inducing mitochondrial dysfunction and PTP opening that eventually results in cell death (PMID: 12169648, 10922063). Along that line, misoprostol has been shown to prevent damaging effects of hypoxia by repressing Bnip3 and promoting the expression of pro-survival alternative splicing isoforms (PMID: 30275982). Indeed, the same study showed that misoprostol treatment prevents loss of mitochondrial membrane potential, ROS formation and impairment in mitochondrial oxygen consumption caused by hypoxia in primary neonatal cardiomyocytes. The present manuscript recapitulates these previously published findings. The truly novel findings concern the identification of Bnip3 residue Thr181 as target for PKA phosphorylation and the possible interaction of Bnip3 with 14-3-3. However, the role and/or involvement of these events has not been thoroughly investigated in relation to hypoxia and misoprostol treatment in cells or in vivo.

      Thank you for noting our previous work and identifying the novelty in our present work. As stated above, for Reviewer #1 comments 4, 6, and 8. We have provided additional mechanistic and in vivo data to more fully describe the role of T181 phosphorylation and the interaction with 14-3-3 chaperones in the revised manuscript.

      Reviewer #2:

      Systemic hypoxia, a major complication associated with reduced gestational time, affects more 60% of preterm infants and is a known driver of hypoxia-induced Bcl-2-like 19kDa-interacting protein 3 (Bnip3) expression in neonatal heart. At the level of the cardiomyocyte, Bnip3 activity plays a prominent role in the evolution of necrotic cell death, disrupting subcellular calcium homeostasis and initiating mitochondrial permeability transition (MPT). Emerging evidence suggests both a cardioprotective role for protein kinase A (PKA) through stimulatory prostaglandin (PG) E1 signalling during prolonged periods of hypoxia, and a cytoprotective role for Bnip3 phosphorylation, indicating that post-translational modifications of Bnip3 may be a point of convergence for these two protective pathways. Using a combination of in vivo and multiple cell models, including human iPSC-derived cardiomyocytes, the authors tested if the PGE1 analogue misoprostol is cardioprotective during neonatal hypoxic injury by altering the phosphorylation status of Bnip3. Here we report that hypoxia exposure significantly increases Bnip3 expression, mitochondrial-fragmentation, -ROS, -calcium accumulation and -permeability transition, while reducing mitochondrial membrane potential, all of which were restored to control levels with addition of misoprostol, despite elevated Bnip3 protein expression. Through both gain- and loss-of function genetic studies, the authors show that misoprostol-induced protection directly affects Bnip3, preventing mitochondrial perturbations. They demonstrate that this is a result of PG EP4 receptor signalling, PKA activation, and direct Bnip3 phosphorylation at threonine-181. Furthermore, when this PKA phosphorylation site within Bnip3 is neutralized, the protective misoprostol effect is lost. They also provide evidence that misoprostol traffics Bnip3 away from the ER through a physical interaction with 14-3-3β, thereby preventing aberrant ER calcium release and MPT. In vivo studies further demonstrate that misoprostol treatment increases Bnip3 phosphorylation at threonine-181 in the mouse heart, while both misoprostol treatment and genetic ablation of Bnip3 prevented hypoxia-induced reductions in contractile function. Taken together, these results demonstrate a foundational role for Bnip3 phosphorylation in the molecular regulation of cardiomyocyte contractile and metabolic dysfunction and identifies EP4 signaling as a potential pharmacological mechanism to prevent hypoxia-induced neonatal cardiac injury. While this work is interesting, a number of issues remain.

      1.English expression needs some attention. For example, the first sentence of the abstract - "more than 60%...."; Page 20, line 9 "We observed that misoprostol's ability to to". Many sections should be broken into 2 or 3 sentences.

      Thank you, we have made these changes and have fully proof-read our manuscript.

      2.Evidence from In vivo studies such as those described in section 3.7 is minimal. Much more in vivo evidence is needed. It is unclear the authors established this in vivo model of hypoxia - supposedly gestational hypoxia should be considered. Consider citing these reviews on maternal over- and under-nutrition for postnatal heath (PMID 33181042; 22982026).

      Thank you, we will cite these papers and have clarified our in vivo model, related to comparable human gestation time, in the Methods section. We have also revised the Introduction and Discussion to be more consistent with our in vivo model. In addition, and noted above, we have preformed addition in vivo experiments in both the hypoxia/misoprostol model, and in Bnip3 KO mice to more fully support our conclusions. Additional HMGB1 immunofluorescence has already been added to Figures 1 and 7, and we have include additional confocal-based colocalization experiments from fixed tissues (described in more detail above; Fig. 7 D).

      3.Which one does misoprostol exactly execute its action? Phosphorylation through PKA or trafficking Bnip3 away from the ER through a physical interaction with 14-3-3β?

      This is a very good question. Based on our previous work on Bnip3L (Nix; PMID: 33044904,) PKA-induced phosphorylation of the transmembrane domain increased the physical interaction with 14-3-3b, which acts as a chaperone to translocate Nix away from the mitochondria and ER/SR. We will preform similar experiments with Bnip3 and 14-3-3b for the next revision to provide additional support for this conclusion.

      4.More in vivo proof of concept studies are needed to validate the signaling mechanism - this is an invitro-based study (hypoxia challenge occurs in vitro).

      We have already included additional in vivo immunofluorescence to Figure 1 and 7, and have performed additional colocalization experiments to validate the signaling pathway in this revision. Many of these experiments are described above.

      5.Quality of figures is somewhat poor.

      Our images are the highest possible resolution within the confines of the figure size limit. Perhaps the reviewer received a web-optimized version of the figures for review.

      Reviewer #2 (Significance (Required)):

      Relatively high - although in vivo evidence is needed.

      Thank you. This is provided in the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Systemic hypoxia, a major complication associated with reduced gestational time, affects more 60% of preterm infants and is a known driver of hypoxia-induced Bcl-2-like 19kDa-interacting protein 3 (Bnip3) expression in neonatal heart. At the level of the cardiomyocyte, Bnip3 activity plays a prominent role in the evolution of necrotic cell death, disrupting subcellular calcium homeostasis and initiating mitochondrial permeability transition (MPT). Emerging evidence suggests both a cardioprotective role for protein kinase A (PKA) through stimulatory prostaglandin (PG) E1 signalling during prolonged periods of hypoxia, and a cytoprotective role for Bnip3 phosphorylation, indicating that post-translational modifications of Bnip3 may be a point of convergence for these two protective pathways. Using a combination of in vivo and multiple cell models, including human iPSC-derived cardiomyocytes, the authors tested if the PGE1 analogue misoprostol is cardioprotective during neonatal hypoxic injury by altering the phosphorylation status of Bnip3. Here we report that hypoxia exposure significantly increases Bnip3 expression, mitochondrial-fragmentation, -ROS, -calcium accumulation and -permeability transition, while reducing mitochondrial membrane potential, all of which were restored to control levels with addition of misoprostol, despite elevated Bnip3 protein expression. Through both gain- and loss-of function genetic studies, the authors show that misoprostol-induced protection directly affects Bnip3, preventing mitochondrial perturbations. They demonstrate that this is a result of PG EP4 receptor signalling, PKA activation, and direct Bnip3 phosphorylation at threonine-181. Furthermore, when this PKA phosphorylation site within Bnip3 is neutralized, the protective misoprostol effect is lost. They also provide evidence that misoprostol traffics Bnip3 away from the ER through a physical interaction with 14-3-3β, thereby preventing aberrant ER calcium release and MPT. In vivo studies further demonstrate that misoprostol treatment increases Bnip3 phosphorylation at threonine-181 in the mouse heart, while both misoprostol treatment and genetic ablation of Bnip3 prevented hypoxia-induced reductions in contractile function. Taken together, these results demonstrate a foundational role for Bnip3 phosphorylation in the molecular regulation of cardiomyocyte contractile and metabolic dysfunction and identifies EP4 signaling as a potential pharmacological mechanism to prevent hypoxia-induced neonatal cardiac injury. While this work is interesting, a number of issues remain.

      1.English expression needs some attention. For example, the first sentence of the abstract - "more than 60%...."; Page 20, line 9 "We observed that misoprostol's ability to to". Many sections should be broken into 2 or 3 sentences.

      2.Evidence from In vivo studies such as those described in section 3.7 is minimal. Much more in vivo evidence is needed. It is unclear the authors established this in vivo model of hypoxia - supposedly gestational hypoxia should be considered. Consider citing these reviews on maternal over- and under-nutrition for postnatal heath (PMID 33181042; 22982026) .

      3.Which one does misoprostol exactly execute its action? Phosphorylation through PKA or trafficking Bnip3 away from the ER through a physical interaction with 14-3-3β?

      4.More in vivo proof of concept studies are needed to validate the signaling mechanism - this is an invitro-based study (hypoxia challenge occurs in vitro).

      5.Quality of figures is somewhat poor.

      Significance

      relatively high - although in vivo evidence is needed

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Martens et al investigates the mechanisms of Bnip3-mediated cell damage during hypoxia. The Authors show that modulation of prostaglandin (PG) E1 signaling with misoprostol prevents cardiac dysfunction, mitochondrial impairment and cell death induced by hypoxia. In addition, they show that the effect of misoprostol is dependent on PKA-mediated Thr181 phosphorylation. The Authors also suggest that there is a possible interaction between Bnip3 and 14-3-3beta that prevents ER Ca2+ release and mitochondrial Ca2+ overload. The Authors conclude that Bnip3 phosphorylation plays a key role in the regulation of cardiac and metabolic dysfunction and identify misoprostol treatment as a therapeutic intervention to prevent hypoxia-induced cardiac injury.

      Major Comments:

      1.Results presented in Figs. 1, 2 and 3 pertaining to hypoxia-induced changes in Bnip3 expression, changes in mitochondrial function, cell death, Bnip3-dependent Ca2+ transfer from ER to mitochondria and the effect of misoprostol have partially been demonstrated in a previous publication from the same group (PMID: 30275982).

      2.The very same publication from 2018 shows that misoprostol treatment of pups exposed to hypoxia for 7 days is able to prevent the increase in Bnip3 protein levels. Yet, in the present manuscript misoprostol treatment had no effect on Bnip3 protein levels in the same model (Fig. 1E). This raises some concerns regarding the solidity and soundness of the results presented.

      3.The validation of the custom antibody against p-Thr181 needs to be shown. Fig. 4E shows that p-Bnip3 band is quite strong in H9c2 cells, despite total endogenous Bnip3 levels are barely detectable. In addition, phosphorylation of the Bnip3 Thr181 residue in cells and/or in vivo should be confirmed by mass spectrometry.

      4.Fig. 4L shows that misoprostol treatment of H9c2 cells leads to an increase in Bnip3 phosphorylation, but this does not seem to be the case in normoxic conditions in vivo (Fig. 4N). Moreover, shouldn't this presumable increase in phosphorylation induced by misoprostol in normoxic conditions lead to Bnip3 accumulation in the cytosol thereby reducing its colocalization with mitochondria (Fig. 6B)? The results obtained with the colocalization method should be corroborated using different methods, such as cell fractionation.

      5.In relation to Fig. 4 M, N (page 19), the Authors concluded that the reduction in Bnip3 phosphorylation suggests an increase in Bnip3 activity in the hypoxic neonatal hearts. Nevertheless, this has not been demonstrated.

      6.Along that line, the Authors concluded that misoprostol-induced cytoprotection is dependent on PKA Thr181 phosphorylation. Nevertheless, this dependence has not been convincingly demonstrated in hypoxic cells and in vivo.

      7.Previous studies showed that Bnip3 induces mitochondrial fragmentation and mitophagy (PMID: 16645637, 20436456). What is the hypothesis for the inhibition of mitochondrial fragmentation induced by misoprostol in the present study? Does it prevent Bnip3 interaction with Opa1 or is this event downstream of ER Ca2+ release and mitochondrial Ca2+ overload? Does misoprostol affect mitophagy?

      8.The link between Bnip3 interaction with 14-3-3 and Bnip3 Thr181 phosphorylation, if there is any, is not clear. The Authors mention that Thr181 lies within the 14-3-3 binding domain. Is Thr181 phosphorylation required for 14-3-3 binding or are these events unrelated? What is the significance of these events in hypoxia, does 14-3-3 binding to Bnip3 occur in vivo? Is Bnip3 localization affected by hypoxia, 14-3-3 binding and/or misoprostol treatment in vivo?

      9.Fig. 6P shows the presence of myc-tag after IP for HA-tag, even when HA-14-3-3 was not expressed (middle lane). How is this possible?

      Minor Comments:

      1.Please co-stain with the cardiomyocyte marker in Fig. 2A (such as alpha-actinin).

      2.The Methods are not sufficiently detailed. For instance, it is not clear what is the Ca2+ concentration used for Ca2+ pulses in the CRC experiment. The fact that cardiac mitochondria are able to uptake only two Ca2+ pulses raises some concerns regarding the quality of mitochondrial preparation. What is the reason for isolating mitoplasts instead of intact mitochondria?

      3.TMRM fluorescence should be measured before and after FCCP administration, to account for the difference in plasma membrane potential (the results should be expressed as F/FFCCP).

      4.Measurement of extracellular acidification is mentioned in the methods, but the relative results are not shown.

      5.RNAi experiments targeting Bnip3 are also mentioned in the methods, but the results are not described.

      Significance

      Previous studies have demonstrated that Bnip3 is upregulated by hypoxia and plays a key role in inducing mitochondrial dysfunction and PTP opening that eventually results in cell death (PMID: 12169648, 10922063). Along that line, misoprostol has been shown to prevent damaging effects of hypoxia by repressing Bnip3 and promoting the expression of pro-survival alternative splicing isoforms (PMID: 30275982). Indeed, the same study showed that misoprostol treatment prevents loss of mitochondrial membrane potential, ROS formation and impairment in mitochondrial oxygen consumption caused by hypoxia in primary neonatal cardiomyocytes. The present manuscript recapitulates these previously published findings. The truly novel findings concern the identification of Bnip3 residue Thr181 as target for PKA phosphorylation and the possible interaction of Bnip3 with 14-3-3. However, the role and/or involvement of these events has not been thoroughly investigated in relation to hypoxia and misoprostol treatment in cells or in vivo.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank both reviewers for their insightful comments and suggestions. We propose to address these as described below.

      Reviewer 1

      **Major points:**

      Point 1

      1. A logical question comes up and I do not think the authors addressed, in a human body what happens to the extracted drugs after loading on HDLs? This requires some mentioning in the discussion.

      1. This is indeed a good question. We have now added in the discussion what may happen to the HDL-extracted drugs in a whole organism. It reads as follows: The likely fate of HDL-extracted drugs in humans is that they are carried to the liver by HDLs. Scavenger receptors such as SR-BI expressed by hepatocytes can then bind HDLs carrying the extracted drugs allowing the drugs to be taken up by the cells. In hepatocytes, the drugs may be inactivated and excreted in the bile (https://doi.org/10.1016/j.cld.2016.08.001, https://doi.org/10.1161/CIRCRESAHA.119.312617). Point 2

      2. Is the effect specific to the fully mature HDL molecule or do apo-lipoproteins that compose HDLs have similar effects?

      1. This is an interesting question. Apo-AI is the characteristic and most abundant apolipoprotein found in HDLs. It is however not trivial to compare the activities of ApoAI and HDLs because of the difficulty of producing large amounts of ApoAI. In the present paper, the lowest concentration of HDLs that induces drug efflux is 0.125 mM. As there are about 3 molecules of Apo-AI per HDL molecule, we should use 0.375 (3 x 0.125) mM Apo-AI to see if the Apo-AI content of these HDLs can mediate or mimic the drug efflux capacity of the lipoproteins. About 100 mg of recombinant Apo-AI would be required to make 10 ml of a ~0.3 mM Apo-AI cell culture solution. This is an enormous task requiring substantial time and money investment. We are therefore not in a position to perform this experiment that would be of interest but which is not central for supporting the main message of our manuscript. Point 3

      2. What are non-SERCA-mediated effects of TG?

      1. The SERCA-independent toxic effects of TG have been shown to be a consequence of mitochondrial dysfunction resulting from the ability of TG to induce mitochondrial permeability transition (DOI: 10.1046/j.1432-1327.1999.00724.x). This is now mentioned in the discussion. Point 4

      2. Why don't HDLs protect cells from low dose TG despite its removal?

      1. Our data indicate indeed that HDLs do not affect the ability of TG to inhibit SERCA and the low ER stress response that ensues. This can be explained by the fact that very low concentrations of TG inhibit SERCA in an irreversible manner (Ki values of 0.2, 1.3, and 12 nM for SERCA1b, SERCA2b, and SERCA3a, respectively) (DOI:https://doi.org/10.1074/jbc.M510978200). Hence, even though HDLs can remove a substantial amount of TG from cells, the concentration of TG that remains in cells is presumably still sufficient to fully inhibits the SERCA pumps. This explanation is now included in the discussion. Point 5

      Line 144. No information on the siRNA was given (refer to the materials section to guide the reader).

      The siPOOLs we have used correspond, for each targeted gene, to a pool of 30 optimally-designed proprietary siRNAs from Biotech. The company does not disclose the sequences of these siRNAs.

      Minor comments:

      Point 6

      1. There needs to be an abbreviation section. Make sure that you only abbreviate the terms that are used more than once in the text.

      1. An abbreviation list is now provided. Point 7

      2. Lines 104, 277, 283 and anywhere else: use TG instead of thapsigargin.

      1. Thank you for noting this. This has now been done. Point 8

      2. Line 262: you don't have to redefine SERCA.

      1. Done Point 9

      2. I suggest adding structures of the used drugs.

      1. The structures of the drugs used in this work are now presented in Figure S9. Point 10

      2. I suggest using a table for the RT-PCR primers. Protein Direction Number Sequence Description NCBI entryh-SERCA2 Fwd #1612 5'ATG GGG CTC CAA CGA GTT AC nucleotides 648-667 of human SERCA2, variant a NM_001681.4

      1. Thank you for this suggestion that we have now followed and that indeed facilitates the reading of the RT-PCR method section. Point 11

      2. Line 93: DMEM (Gibco; ref 61965-059;) the lot number is missing.

      1. The lot number is now indicated. Point 12

      2. Line 102: 500'000 (and all other thousand numbers) the apostrophe's place is strange.

      1. We have now removed the apostrophe in numbers. Point 13

      2. Line 381: cholesterol carriers.

      1. This typo has now been corrected Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Major concerns**

      Point 14

        1. Figure 2, The authors should perform western blot to evaluate the protein expression levels (not only mRNA levels by Q-PCR)
      1. We have performed these experiments in the past in MIN6 cells (Pétremand et al. Diabetes 2012 May; 61(5): 1100-1111; Figure 2). This earlier work showed that HDLs reduce the induction of TG-induced ER stress markers at the protein (CHOP and BiP) and functionality (IRE1 activity on XBP1 splicing). We will repeat these experiments in DLD1 cells as per the reviewer’s suggestion. Point 15.
      1. Could the authors evaluate whether HDL treatment reduces the amount of SERCA (mRNA/protein) in their cells? The loss of SERCA could explain the reduced accumulation of the BODIPY-TG in the cell?

      We would argue that it is unlikely that a reduction in SERCA expression from cells has any significant impact on TG cell loading as the cell-associated drug is certainly in vast excess compared to the number of SERCA molecules in cells. We will nevertheless perform the requested experiment using DLD-1 cells and assess whether HDLs modulate their SERCA2 expression.

      Point 16.

      1. To generalize their observation, It would have been interesting to test more lipophilic/hydrophilic drugs to quantitatively validate that HDLs are selective of lipophilic drugs.

      We will test 2 new lipophilic (letermovir and lumefantrine) and 2 new hydrophilic drugs (levetiracetam and cefepime) for their ability to be extracted by HDLs (experiment set-up as in Figure 4).

      Point 17.

      1. The ABC transporter part in this manuscript has to be improved with the down-regulation of extinction of ABCA1 and ABCG1 to determine in a comprehensive manner the effect of these transporters in the pro-survival role of HDL.

      We will invalidate the genes encoding ABCA1, ABCB1, ABCG1, and ABCG2 using the CRISPR/Cas9 technology and test the ability of the invalidated cells to promote efflux of thapsigargin to HDLs (experiment set-up as in Figure 6) and to protect them from the drug (experiment set-up as in Figure 6). The choice of the cell lines to be used for the invalidation depends on what ABC transporters they express. No single cell line expresses all four ABC transporters to high levels. The following cell lines will be used because, according to the literature or to the Human Protein Atlas (https://www.proteinatlas.org/), they display strong expression of the indicated transporters: for ABCA1: HCT116; for ABCB1: HEK293T; for ABCG1 and ABCG2: MCF7. For consistency with the experiments already performed in the manuscript, the invalidation will also be performed in the DLD1 cell line.

      **Minor point:** Point 18.

        1. ABCB1 blot in figure 7B is not convincing and should be improved.
      1. We will redo this WB to improve the quality of the blot.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Christian Widmann and colleagues describe how HDLs can protect cells by promoting the extraction of lipophilic drugs such as thapsigargin (TG). The authors observe that HDLs do not affect the ability of TG to inhibit SERCA but instead decrease lipophilic drug content inside cells and therefore protect cells against their lethal effects. Using some compounds (probably not enough to conclude), the authors claim that HDLs can promote the exclusion of lipophilic drugs while hydrophilic drugs or compounds like doxorubicin hydrochloride, an anticancer drug, or Rhodamine 123, were not extracted from cells. Finally using small interfering RNA, the authors reveal that ABCB1 mediates some of the drug effluxes to HDLs. This study is sound and well-written. Although of interest from a therapeutic standpoint, this manuscript should address some questions to strengthen these data.

      Major concerns

      1. Figure 2, The authors should perform western blot to evaluate the protein expression levels (not only mRNA levels by Q-PCR)
      2. Could the authors evaluate whether HDL treatment reduces the amount of SERCA (mRNA/protein) in their cells? The loss of SERCA could explain the reduced accumulation of the BODIPY-TG in the cell?
      3. To generalize their observation, It would have been interesting to test more lipophilic/hydrophilic drugs to quantitatively validate that HDLs are selective of lipophilic drugs.
      4. The ABC transporter part in this manuscript has to be improved with the down-regulation of extinction of ABCA1 and ABCG1 to determine in a comprehensive manner the effect of these transporters in the pro-survival role of HDL.

      Minor point:

      1. ABCB1 blot in figure 7B is not convincing and should be improved.

      Significance

      This study can interest a large scientific audience. Some additional experiments have to be performed to render more convincing some part of this study.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      It was my pleasure to evaluate the work submitted to Review Commons. I have reviewed the work and my comments are as follows: This manuscript entitled "HDLs extract lipophilic drugs from cells" by Zheng and colleagues describes a new mechanistic picture of how HDLs protect cells against death. The authors meticulously describe a novel ability of HDLs to extract hydrophobic xenobiotics from cells akin to their cholesterol-extracting function. I would like to thank the authors for a pleasurable read and their well-defined experimental design. This manuscript is of great value and significance to the fields of clinical chemistry and pharmacology. I therefore do think this manuscript merits publication after tending to these major and minor comments.

      Major points:

      • A logical question comes up and I do not think the authors addressed, in a human body what happens to the extracted drugs after loading on HDLs? This requires some mentioning in the discussion.
      • Is the effect specific to the fully mature HDL molecule or do apo-lipoproteins that compose HDLs have similar effects?
      • What are non-SERCA-mediated effects of TG?
      • Why don't HDLs protect cells from low dose TG despite its removal?
      • Line 144. No information on the siRNA was given (refer to the materials section to guide the reader). Minor comments:
      • There needs to be an abbreviation section. Make sure that you only abbreviate the terms that are used more than once in the text.
      • Lines 104, 277, 283 and anywhere else: use TG instead of thapsigargin.
      • Line 262: you don't have to redefine SERCA.
      • I suggest adding structures of the used drugs.
      • I suggest using a table for the RT-PCR primers. Protein Direction Number Sequence Description NCBI entry h-SERCA2 Fwd #1612 5'ATG GGG CTC CAA CGA GTT AC nucleotides 648-667 of human SERCA2, variant a NM_001681.4
      • Line 93: DMEM (Gibco; ref 61965-059;) the lot number is missing.
      • Line 102: 500'000 (and all other thousand numbers) the apostrophe's place is strange.
      • Line 381: cholesterol carriers.

      Significance

      This manuscript is of great value and significance to the fields of clinical chemistry and pharmacology.

      Referees cross-commenting

      I agree with the experiments suggested by reviewer #2

  2. Feb 2021
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Below is our point-by-point response:


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript of Lalanne and coworkers address the cellular responses to varied translation termination factor expression in Bacillus subtilis. The authors set-up a system to fine-tune the expression of release factor RF1, RF2 as well as PrmC that post-translationally modifies RF1/RF2 to maximize their catalytic hydrolysis activity. They then monitor the fitness costs associated with overexpression or depletion of the factor by following the changes in growth rate. The set-up is nicely illustrated in Figure 1. The results in Figure 2 show that overexpression of RF1 and RF2 has relatively modest effect on the growth rate compared to overexpression of PrmC that leads to dramatic growth rate reduction. By contrast, depletion of RF1 has a strong negative influence on fitness, whereas a similar level of depletion of RF2 had little influence on fitness. PrmC overexpression appears to be correlated with the induction of the sigmaB regulon, however, the authors do not manage to ascertain why this is. By contrast, RF2 depletion also results in the induction of the sigmaB regulon and the authors demonstrate convincingly that this is due to a termination defect within the rsbQ-rsbV operon that contains an overlapping start-stop AUGA

      A few points that the authors might consider discussing

      1. The natural abundance of each RF in bacteria in relation to the usage of different stop codons in different organisms.

      Response: We thank the reviewer for their suggestion. A correlation between RF abundance and stop codon usage across bacterial species has been previously reported (Korkmaz et al., 2014; Wei et al., 2016), which is corroborated by our quantification (see below). This correlation provides further evidence that the RF expression may be optimized to meet their demands in translation termination. We now include a new discussion in the main text (p. 9, lines 410-415): "Our data thus corroborate several previous lines of evidence suggesting that RF expression might be precisely tuned. First, it was found that the relative expression between RF1 and RF2 correlates with stop codon usage between different species (Korkmaz et al., 2014; Wei et al., 2016). For instance, B. subtilis has a higher abundance of RF1 and more frequent UAG usage compared to E. coli, suggesting that RF1’s expression setpoint meets translational demand (Methods).”

      Below we include additional analyses that may be of interests to the Reviewer.

      From our ribosome profiling quantification in E. coli, B. subtilis, C. crescentus, and V. natriegens (Lalanne et al., 2018), we can compare the relative usage of the three stop codons (frequency of stop codons weighted by expression) with abundances of RF1 and RF2:

      Despite the limited sample size, we find reasonable agreement with the expected correlation between codon usage and cognate RF abundance. In species with substantial differences between RF1 and RF2 abundances (E. coli and B. subtilis), the most heavily used non-UAA stop corresponds to the most highly expressed RF. This argues in favor of expression tuning of these important enzymes and is consistent with the growth optimization we directly observe.

      As a word of caution, although the low usage of UAG in E. coli and low expression of RF1 (reported long ago, e.g., (Adamski et al., 1994)) is well established, it should be noted that strain MG1655’s RF2 factor harbors a debilitating missense A246T mutation near its active site (Dinçbas-Renqvist et al., 2000), which potentially complicates interpretation of the expression of E. coli’s release factors [interestingly, we do not see any difference in RF1 and RF2 expression from ribosome profiling data in strain NCM3722, which contains the RF2 variant without the A246T mutation (JBL, unpublished data)].

      The role of the frameshifting mechanism in RF2 and how then RF1 levels are regulated.

      Response: We thank the reviewer to raising the interesting topic of release factor expression regulation. We have added a section in our discussion to comment on RF2 regulation (p. 9, lines 415-420).

      “Second, the gene encoding RF2 has a broadly conserved UGA-based frameshift event that autoregulates the expression based on its own activity (Baranov et al., 2002; Craigen and Caskey, 1986; Craigen et al., 1985). Interestingly, there are no reports of RF1 autoregulation to our knowledge, and we found that ectopic over- or under-expression does not affect its own promoter activity (Fig. S7). Therefore, a lack of autoregulation does not necessarily indicate that cells are less sensitive to small perturbations on its expression.”

      The statement above includes an additional analysis on RF1 regulation that was motivated by the Reviewer’s comment. In contrast to RF2, no definitive evidence exists on autoregulatory mechanisms for RF1. Following the Reviewer’s comment, we realized that our dataset allowed us to search for evidence of endogenous regulation in B. subtilis: our RF1 expression strain has a markerless deletion of prfA and prmC genes, leaving the surrounding regions, and notably the promoter, intact. As such, possible unbeknownst regulatory mechanisms at the promoter level could be identified in our RNA-seq data under steady-state perturbation of RF1 levels. Quantifying the expression of the 5’ untranslated region and operonic gene ywkF at the ablated prfAlocus (presented in Fig. S7, reproduced below), we find no significant changes in expression across over 30-fold range in RF1 expression, arguing against such transcriptional regulatory mechanisms. Although this does not rule out other regulatory mechanisms at the post-transcriptional level, no such mechanisms have been documented for RF1 to our knowledge.

      The authors observe queuing in front of the relevant stop codons upon RF depletion, however, do not discuss about readthrough events, which are usually competing with termination. Surprisingly, in this context the authors don't discuss the work from Mankin and coworkers showing sequestration of RFs from termination by peptides such as apideacin leads to translational readthrough.

      Response: We concur with the Reviewer about the importance of the recent work from Mankin et al. This paper was referenced in our original submission, but our literature management software improperly formatted its citation. The corrected reference to (Mangano et al., 2020) is now included in the revised manuscript.

      Translational readthrough is indeed clearly visible in our ribosome profiling data from acute CRISPRi knockdown of RF1/PrmC and RF2. Using an approach analogous to Mangano and Florin et al, we quantified readthrough as the ribosome footprint density downstream of the stop codon (+5 to +45 bp) to the density in the gene body for isolated genes (no codirectional genes within 55 bp). We find five-fold increase in the median readthrough for genes that are terminated by the RF under perturbation (shown in a new panel in the main text, Fig. 4b, reproduced below). This new analysis is included in the section regarding translational phenotypes identified from ribosome profiling under RF depletion, p. 7, lines 309-312.

      “The stop-codon-specific queuing is associated with translational readthrough downstream (Fig. 4b), consistent with a recent observation based on inhibition of peptide release by the antimicrobial apidaecin in E. coli (Mangano et al., 2020).”

      This additional analysis, in conjunction with (Mangano et al., 2020), also allows us to calibrate the depletion of RFs in our non steady-state CRISPRi perturbation. Given that apidaecin treatment (shown to lead to a nearly complete depletion of free RF in the cell) causes a >100-fold increase in readthrough, this suggests that our CRISPRi perturbation experiments only led to partial RF depletion at the moment of cell harvesting.

      The efficiency of translation termination is well-known to be dependent on the context of the stop codon. Do the authors also observe such a trend. Especially, UGAC for RF2, one would expect to observe high levels of readthrough upon RF2 depletion.

      Response: Further assessment of the sequence determinants that dictate susceptibility of certain genes and regulatory elements to RF perturbation is of great interest. We now include additional analyses for the effect of stop codon context on readthrough.

      In our RF2 CRISPRi knockdown data, stratifying the translational readthrough (data from Fig. 4b) by stop codon and its next nucleotide, we observe only a modest (≈2×, p“We also observed a trend of tetranucleotide-dependent (UGAN) readthrough for RF2 knockdowns (Methods, Appendix Fig. 2) consistent with previous characterizations (Poole et al., 1995).”

      As an additional point of interest, the importance of the 4th nucleotide in termination has not been studied outside of E. coli. Although indirect, one way to assess the influence of the 4th nucleotide is to determine the aggregated usage of each tetranucleotide stop signal by ribosome profiling. Interestingly, and as pointed out by the Reviewer, whereas E. coli (MG1655) displays a 16× increase in usage between the maximum UGAU (tetranucleotide usage 0.064) and minimum UGAC (tetranucleotide usage 0.004), no such difference is observed in B. subtilis (usage for UGAU and UGAC both at 0.015), suggesting that the immediate sequence context surrounding stop codons could have different consequences in different species.

      Reviewer #2 (Significance (Required)):

      Overall, the experiments are clearly performed and beautifully illustrated. Clearly, a lot of work has gone into this study but the end message that the cell regulates carefully RF concentrations is not surprising. Especially given that RF2 carefully regulates its own levels using an autoregulatory frameshifting mechanism. The major finding that the rsbQ-rsbV operon with the RF2 dependence leading to induction of the sigmaB regulon is in the end rather trivial since these regulators depend on RF2 for termination. Therefore, this manuscript is unlikely to have general interest to people in the translation field (such as myself) but rather those working in the field of synthetic biology.

      Response: We thank the Reviewer for their positive assessment of our presentation and experimental methods, and for their judgment that our work will be of interest to synthetic biologists.

      In our study, we used translation as a well-characterized system to interrogate the cellular response when enzyme concentrations are perturbed. Because the system is so well characterized, it allowed to ask whether the fitness effects are due to perturbations to the translation flux itself, or rather driven by spurious distal connections in the regulatory network. The end message we wish to convey is that enzyme expression is entrenched by spurious regulatory connections, suggesting that predictive bottom-up models of expression-fitness landscapes will require near-exhaustive characterization of parts.

      Although our focus is on the cellular response, there are several interesting findings related to translation. First, we show that even though RF1 and PrmC are not subject to the strict autoregulation as RF2 is, cell growth is similarly or even more sensitive to RF1 and PrmC abundance. Second, among the numerous regulators that depend on RF2 for termination, RbsV/RbsW is exceptionally sensitive to RF2 depletion (Fig. 4e). This result not only points to our incomplete understanding of translation regarding what makes this pair particularly susceptible, and further underscores the spurious nature of the cellular response to perturbations. We have expanded the discussions on the implication of these findings in the revised manuscript.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this paper, the authors use a combination of RNA sequencing, ribosome profiling and measurements of cellular composition and growth rate to gain insight into the multi-scale affects that perturbations to translation termination factors have on general physiological states and reproductive fitness using Bacillus subtilis as their model organism. Specifically, they find that perturbing the expression levels of peptide chain release factors in any direction has a negative effect on growth-rate. This negative effect was not due to a direct impact of the gene on the cell, but instead due to a chain of regulatory interactions that cause the activation of the general stress regulon. This leads to upregulation of a large chunk of the genome and an indirect impact on the expression of all other genes. Critically, the knock-on effects observed for the specific perturbations studied suggest that it may be difficult to predict expression-fitness landscapes of a cell, without carrying out a detailed mapping of all genes and the cell's physiological state.

      Overall, the core findings in the paper are well justified by the data presented and the experiments appear to have been rigorously carried out.

      Response: We thank the reviewer for their positive assessment.

      My only concern is that it is unclear if biological replicates of the ribosome profiling were performed. Also, biological replicates are mentioned for the RNA-seq data, but no data is shown. Even a simple graph demonstrating the expression levels across these would be useful to be assured of no issues in reproducibility given the complex processing of the data involved.

      Response: We now include additional analyses for biological replicates of RNA-seq and ribosome profiling experiments, which show the same high degree of reproducibility as we have demonstrated in previous studies (Johnson et al., 2020; Lalanne et al., 2018; Li et al., 2014).

      With respect to RNA-seq quantification, we compared our 6 wild-type datasets (biological replicates except for different inert inducer concentrations, using the same batch of conditioned MCC medium) against each other in all possible pairs. The data is now included as Appendix Fig. 1a (referred to in the main text, p. 4, line 138), and is reproduced below. Across pairs, the mRNA level quantification shows a median FC1090 (10th and 90th percentile in fold-change) between 0.86 to 1.16, and median R2 of log-transformed data at 0.99. These statistics showcasing reproducibility of our RNA-seq methodology are now included in our description of our RNA-seq approach in the Methods, p. S8, lines 313-320.

      Regarding ribosome profiling quantification, we now include comparisons between pairs of two replicates for wild type cells, and pairs of replicates wild-type with inert fluorescent protein expression, each pair of samples with their own batch of conditioned MCC medium. These samples were taken under different inducer concentrations, which are expected to affect the expression of two genes and not others. As indicated in Appendix Fig. 1b and reproduced below, the Pearson correlation of log-transformed footprint density is respectively of R2=0.98 and 0.99 (genes with >100 reads mapped), with a 10th to 90th percentile of fold-changes between 0.83 to 1.17, and 0.91 to 1.12. These results are described in the Methods, p. S9, lines 339-345.

      Related to this, I see no mention of data availability in the paper. For this study to be useful to others, providing the raw data (unprocessed) would be essential (ideally in a public repository).

      Response: We are sorry that the statement on data availability was buried in the original Methods section that was not a part of the merged PDF file. The raw sequencing data were submitted to Gene Expression Omnibus under the accession number GSE162169. The processed data, including fitness scores, mRNA levels, protein synthesis rates, were included as Supplementary Data Tables 1-9. We now moved the data availability statement to the main document at p. 12, lines 512-516.

      The presentation of the work is excellent, with very clear figures and text that helped guide the reader through the results. There were a few minor comments:

      1. Abstract: "in bacterium Bacillus subtilis" should read "in the bacterium Bacillus subtilis".

      Response: This typo is now corrected.

      Page 4: "found that under numerous ways" should read "found that under the numerous ways".

      Response: This typo is now corrected.

      The authors mention that changes in the expression level of RF1 impacted motility and biofilm genes, but not how this impacts fitness. Would they be able to experimentally identify origin of RF1 growth defects in the same way they did for PrmC? This is not essential for the main findings but would help strengthen the work.

      Response: The cause of the growth defect under RF1 knockdown is indeed interesting. We now present evidence ruling out the hypothesis that the growth defect is caused by the expression decrease for motility and biofilm genes.

      This hypothesis is driven by our result that ablation of SigB regulon rescues the fitness defect during PrmC overexpression (Fig. 3g) and by the observed downregulation of motility and lyt operons and upregulation of the eps operon during RF1 knockdown. To test this hypothesis, we used a strain without sigD (the motility sigma factor), which displays similar expression changes to what we observed in RF1 knockdown (Chai et al., 2009). Comparing the growth rates of wild-type to DsigD, we found only a slight difference (30% growth defect measured upon RF1 knockdown, it appears that transcriptional changes to the motility regulon can only partially explain of the RF1 growth defect. These results are discussed on p. 10, lines 459-463. Further assessment will constitute interesting future research avenues.

      It is difficult to know how generalisable the findings of this work are due to the very limited scope. It could be helpful for the authors in the discussion to consider and comment on how such approaches might be scaled-up to enable broader and more general studies of expression-fitness landscapes and where they will find most use.

      Response: Indeed, the spurious nature of the expression-fitness landscape makes it difficult to generalize the exact mechanisms that we described here to other proteins. However, what is generalizable is our conclusion that such spurious connections limit the feasibility of bottom-up models for predicting fitness landscapes unless one has near-exhaustive characterization of all parts.

      Our approach of mechanistic profiling of cell states under perturbations therefore provides a path forward that can be scaled up by recent developments in multiscale measurements. We now include a discussion for broader and more general studies on p. 11, lines 473-480.

      “Various strategies can now generate expression-fitness landscapes for a large number of genes in parallel, for example using suites of promoters (Keren et al., 2016), genome-scale library of inducible gene expression (Arita et al., 2021), or tunable CRISPR perturbations (Hawkins et al., 2020; Jost et al., 2020; Mathis et al., 2021). Together with the advent of single-cell transcriptomics in bacteria (Blattman et al., 2020; Imdahl et al., 2020; Kuchina et al., 2020), these methods open the possibility of dissecting the molecular underpinnings of expression-fitness landscapes genome-wide, and to comprehensively identify instances of regulatory entrenchment.”

      Reviewer #3 (Significance (Required)):

      This work has a number of contributions. Firstly, it demonstrates how to combine several complementary sequencing approaches to characterize in detail the transcriptional and translational state of a cell, as well as its overall growth rate to generate comprehensive expression-fitness maps. Secondly, it shows how the interwoven nature of cellular regulatory networks and the molecular interactions encoded within the genome can lead to cryptic responses in cellular behavior and fitness at a system-level that can only be understood by taking a detailed "bottom-up" approach. Finally, it suggests that some of these regulatory interactions may in fact "entrench" an organism's evolutionary path, by causing small genetic perturbations to propagate and potentially amplify their negative effect. While the results are compelling and well supported by experiments, the limited scope of the work makes it difficult to know whether this is in fact a rare or common occurrence. However, I do believe there is significance to these findings and that it will likely spur further studies to assess the generality of these findings.

      Overall, I believe the work will have a wide appeal covering areas such as Systems Biology, Gene Regulation, Evolution, Quantitative Biology, Sequencing, High-throughput Technologies.

      Response: We thank the reviewer for their assessment that our work will be of appeal to a broad audience.

      My field of expertise is in the quantitative measurement of core cellular processes (e.g. transcription and translation) using novel sequencing techniques and the application of this knowledge to biological design. As such, I believe I have sufficient expertise to review this paper in detail.

      Response references

      Adamski, F.M., McCaughan, K.K., Jørgensen, F., Kurland, C.G., and Tate, W.P. (1994). The concentration of polypeptide chain release factors 1 and 2 at different growth rates of Escherichia coli. J. Mol. Biol. 238, 302–308.

      Arita, Y., Kim, G., Li, Z., Friesen, H., Turco, G., Wang, R.Y., Climie, D., Usaj, M., Hotz, M., Stoops, E., et al. (2021). A genome-scale yeast library with inducible expression of individual genes. BioRxiv 2020.12.30.424776.

      Baranov, P. V, Gesteland, R.F., and Atkins, J.F. (2002). Release factor 2 frameshifting sites in different bacteria. 3, 373–377.

      Blattman, S.B., Jiang, W., Oikonomou, P., and Tavazoie, S. (2020). Prokaryotic single-cell RNA sequencing by in situ combinatorial indexing. Nat. Microbiol. 5, 1192–1201.

      Chai, Y., Normam, T., Kolter, R., and Losick, R. (2009). An epigenetic switch governing daughter cell separation in Bacillus subtilis. Genes Dev. 7824, 754–765.

      Craigen, W.J., and Caskey, C.T. (1986). Expression of peptide chain release factor 2 requires high-efficiency frameshift. Nature 322, 273–275.

      Craigen, W.J., Cook, R.G., Tate, W.P., and Caskey, C.T. (1985). Bacterial peptide chain release factors: Conserved primary structure and possible frameshift regulation of release factor 2. Proc. Natl. Acad. Sci. U. S. A. 82, 3616–3620.

      Dinçbas-Renqvist, V., Engström, Å., Mora, L., Heurgué-Hamard, V., Buckingham, R., and Ehrenberg, M. (2000). A post-translational modification in the GGQ motif of RF2 from Escherichia coli stimulates termination of translation. EMBO J. 19, 6900–6907.

      Hawkins, J.S., Silvis, M.R., Koo, B.M., Peters, J.M., Osadnik, H., Jost, M., Hearne, C.C., Weissman, J.S., Todor, H., and Gross, C.A. (2020). Mismatch-CRISPRi Reveals the Co-varying Expression-Fitness Relationships of Essential Genes in Escherichia coli and Bacillus subtilis. Cell Syst. 11, 523-535.e9.

      Imdahl, F., Vafadarnejad, E., Homberger, C., Saliba, A.E., and Vogel, J. (2020). Single-cell RNA-sequencing reports growth-condition-specific global transcriptomes of individual bacteria. Nat. Microbiol. 5, 1202–1206.

      Johnson, G.E., Lalanne, J.B., Peters, M.L., and Li, G.W. (2020). Functionally uncoupled transcription–translation in Bacillus subtilis. Nature 585, 124–128.

      Jost, M., Santos, D.A., Saunders, R.A., Horlbeck, M.A., Hawkins, J.S., Scaria, S.M., Norman, T.M., Hussmann, J.A., Liem, C.R., Gross, C.A., et al. (2020). Titrating gene expression using libraries of systematically attenuated CRISPR guide RNAs. Nat. Biotechnol. 38, 355–364.

      Keren, L., Hausser, J., Lotan-Pompan, M., Vainberg Slutskin, I., Alisar, H., Kaminski, S., Weinberger, A., Alon, U., Milo, R., and Segal, E. (2016). Massively Parallel Interrogation of the Effects of Gene Expression Levels on Fitness. Cell 166, 1282-1294.e18.

      Korkmaz, G., Holm, M., Wiens, T., and Sanyal, S. (2014). Comprehensive analysis of stop codon usage in bacteria and its correlation with release factor abundance. J. Biol. Chem. 289, 30334–30342.

      Kuchina, A., Brettner, L.M., Paleologu, L., Roco, C.M., Rosenberg, A.B., Carignano, A., Kibler, R., Hirano, M., William DePaolo, R., and Seelig, G. (2020). Microbial single-cell RNA sequencing by split-pool barcoding. Science (80-. ).

      Lalanne, J.B., Taggart, J.C., Guo, M.S., Herzel, L., Schieler, A., and Li, G.W. (2018). Evolutionary Convergence of Pathway-Specific Enzyme Expression Stoichiometry. Cell 749–761.

      Li, G.W., Burkhardt, D., Gross, C., and Weissman, J.S. (2014). Quantifying absolute protein synthesis rates reveals principles underlying allocation of cellular resources. Cell 157, 624–635.

      Mangano, K., Florin, T., Shao, X., Klepacki, D., Chelysheva, I., Ignatova, Z., Gao, Y., Mankin, A.S., and Vázquez-Laslop, N. (2020). Genome-wide effects of the antimicrobial peptide apidaecin on translation termination in bacteria. Elife 9, 1–24.

      Mathis, A.D., Otto, R.M., and Reynolds, K.A. (2021). A simplified strategy for titrating gene expression reveals new relationships between genotype, environment, and bacterial growth. Nucleic Acids Res. 49, e6.

      Poole, E.S., Brown, C.M., and Tate, W.P. (1995). The identity of the base following the stop codon determines the efficiency of in vivo translational termination in Escherichia coli. EMBO J. 14, 151–158.

      Wei, Y., Wang, J., and Xia, X. (2016). Coevolution between Stop Codon Usage and Release Factors in Bacterial Species. Mol. Biol. Evol. 33, 2357–2367.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this paper, the authors use a combination of RNA sequencing, ribosome profiling and measurements of cellular composition and growth rate to gain insight into the multi-scale affects that perturbations to translation termination factors have on general physiological states and reproductive fitness using Bacillus subtilis as their model organism. Specifically, they find that perturbing the expression levels of peptide chain release factors in any direction has a negative effect on growth-rate. This negative effect was not due to a direct impact of the gene on the cell, but instead due to a chain of regulatory interactions that cause the activation of the general stress regulon. This leads to upregulation of a large chunk of the genome and an indirect impact on the expression of all other genes. Critically, the knock-on effects observed for the specific perturbations studied suggest that it may be difficult to predict expression-fitness landscapes of a cell, without carrying out a detailed mapping of all genes and the cell's physiological state.

      Overall, the core findings in the paper are well justified by the data presented and the experiments appear to have been rigorously carried out. My only concern is that it is unclear if biological replicates of the ribosome profiling were performed. Also, biological replicates are mentioned for the RNA-seq data, but no data is shown. Even a simple graph demonstrating the expression levels across these would be useful to be assured of no issues in reproducibility given the complex processing of the data involved. Related to this, I see no mention of data availability in the paper. For this study to be useful to others, providing the raw data (unprocessed) would be essential (ideally in a public repository).

      The presentation of the work is excellent, with very clear figures and text that helped guide the reader through the results. There were a few minor comments:

      1. Abstract: "in bacterium Bacillus subtilis" should read "in the bacterium Bacillus subtilis".
      2. Page 4: "found that under numerous ways" should read "found that under the numerous ways".
      3. The authors mention that changes in the expression level of RF1 impacted motility and biofilm genes, but not how this impacts fitness. Would they be able to experimentally identify origin of RF1 growth defects in the same way they did for PrmC? This is not essential for the main findings but would help strengthen the work.
      4. It is difficult to know how generalisable the findings of this work are due to the very limited scope. It could be helpful for the authors in the discussion to consider and comment on how such approaches might be scaled-up to enable broader and more general studies of expression-fitness landscapes and where they will find most use.

      Significance

      This work has a number of contributions. Firstly, it demonstrates how to combine several complementary sequencing approaches to characterize in detail the transcriptional and translational state of a cell, as well as its overall growth rate to generate comprehensive expression-fitness maps. Secondly, it shows how the interwoven nature of cellular regulatory networks and the molecular interactions encoded within the genome can lead to cryptic responses in cellular behavior and fitness at a system-level that can only be understood by taking a detailed "bottom-up" approach. Finally, it suggests that some of these regulatory interactions may in fact "entrench" an organism's evolutionary path, by causing small genetic perturbations to propagate and potentially amplify their negative effect. While the results are compelling and well supported by experiments, the limited scope of the work makes it difficult to know whether this is in fact a rare or common occurrence. However, I do believe there is significance to these findings and that it will likely spur further studies to assess the generality of these findings.

      Overall, I believe the work will have a wide appeal covering areas such as Systems Biology, Gene Regulation, Evolution, Quantitative Biology, Sequencing, High-throughput Technologies.

      My field of expertise is in the quantitative measurement of core cellular processes (e.g. transcription and translation) using novel sequencing techniques and the application of this knowledge to biological design. As such, I believe I have sufficient expertise to review this paper in detail.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript of Lalanne and coworkers address the cellular responses to varied translation termination factor expression in Bacillus subtilis. The authors set-up a system to fine-tune the expression of release factor RF1, RF2 as well as PrmC that post-translationally modifies RF1/RF2 to maximize their catalytic hydrolysis activity. They then monitor the fitness costs associated with overexpression or depletion of the factor by following the changes in growth rate. The set-up is nicely illustrated in Figure 1. The results in Figure 2 show that overexpression of RF1 and RF2 has relatively modest effect on the growth rate compared to overexpression of PrmC that leads to dramatic growth rate reduction. By contrast, depletion of RF1 has a strong negative influence on fitness, whereas a similar level of depletion of RF2 had little influence on fitness. PrmC overexpression appears to be correlated with the induction of the sigmaB regulon, however, the authors do not manage to ascertain why this is. By contrast, RF2 depletion also results in the induction of the sigmaB regulon and the authors demonstrate convincingly that this is due to a termination defect within the rsbQ-rsbV operon that contains an overlapping start-stop AUGA

      A few points that the authors might consider discussing

      1. The natural abundance of each RF in bacteria in relation to the usage of different stop codons in different organisms.
      2. The role of the frameshifting mechanism in RF2 and how then RF1 levels are regulated.
      3. The authors observe queuing in front of the relevant stop codons upon RF depletion, however, do not discuss about readthrough events, which are usually competing with termination. Surprisingly, in this context the authors don't discuss the work from Mankin and coworkers showing sequestration of RFs from termination by peptides such as apideacin leads to translational readthrough.
      4. The efficiency of translation termination is well-known to be dependent on the context of the stop codon. Do the authors also observe such a trend. Especially, UGAC for RF2, one would expect to observe high levels of readthrough upon RF2 depletion.

      Significance

      Overall, the experiments are clearly performed and beautifully illustrated. Clearly, a lot of work has gone into this study but the end message that the cell regulates carefully RF concentrations is not surprising. Especially given that RF2 carefully regulates its own levels using an autoregulatory frameshifting mechanism. The major finding that the rsbQ-rsbV operon with the RF2 dependence leading to induction of the sigmaB regulon is in the end rather trivial since these regulators depend on RF2 for termination. Therefore, this manuscript is unlikely to have general interest to people in the translation field (such as myself) but rather those working in the field of synthetic biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      This reviewer did not leave any comments

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer response

      We thank the reviews for the careful reviews, and were delighted to see that they assessed both the quality and significance of the work so highlty.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors investigated the cross-neutralization capacity of serum antibodies to past and future 229E coronaviruses using 229E spikes isolated from five time points and sera from two different periods (1985-1995 and 2020). They demonstrated a general pattern of asymmetric cross-neutralization, with sera cross-reactive to historical but not future strains. Using chimeras, the authors showed this pattern was mostly driven by antibodies to the evolving RBD. The rate of change in the neutralization titer, a possible measure of antigenic evolution, was estimated to be on par with that of flu B viruses. Interesting differences in individuals' cross-neutralization capacity were observed. The main take-away is that reinfection with 229E is enabled by antigenic escape, not "weak" immunity after infection (as proposed by others).

      Thanks for the excellent summary of the paper. We agree with it, although we would note that our work does not exclude “weak immunity” as a possible compounding explanation for re-infection in addition to the antigenic evolution we demonstrate.

      **Major comments:**

      The key conclusions are convincing and justified by the data. The work is clearly presented and presented with sufficient detail for reproducibility. Characteristically and laudably, the authors have made all the code and data publicly available on GitHub.

      Thanks for the favorable summary.

      **Minor comments:**

      p 3: Perhaps it is clearer to write that 229E has been identified/isolated in humans for >50 y? Or do you really mean to imply (by contrast with "circulated") that NL63 emerged very recently?

      This is a good suggestion. We really do not know how long either CoV-229E or CoV-NL63 have been circulating humans, only that CoV-229E was first isolated >50 years ago whereas CoV-NL63 was first identified only in 2003. It is possible both viruses have been circulating for longer than that. We have made the suggested change to clarify.

      p 3: An important citation for the antigenic implications of the ladder-like phylogeny AND phylogenetic clustering by date is the classic paper introducing phylodynamics by Grenfell et al. (2004, Science).

      Thanks for pointing out this citation; we have added it.

      p 4: I might not be like all readers, but I prefer to see a bit in the main text about the source of sera for this kind of study. (I wonder about age, if donors are healthy, etc.)

      This is a good question, and we have expanded on it in both the main text and the methods. Briefly, the sera were all from apparently healthy individuals, and no information about recent respiratory virus infections were available. We have provided the age of the serum donor (at the time of serum collection) above the title of each plot showing person-specific neutralization data.

      p 4: "Our reason for focusing..." stops short. Is the idea that these are probably people who were recently infected?

      This is a good question, and we have elaborated in the revised text. We don’t have any direct information on whether the individuals had recent infections, although that seems plausible. More pragmatically, we reasoned that sera that had reasonably high initial titers would provide better dynamic range to see how titers changed as the virus evolved given our assay has a lower limit of detection.

      p 5: Probably my biggest suggestion for the paper is that it mention another relevant study. In 1980, Anne Underwood demonstrated similar asymmetric cross-immunity among early strains of H3N2 (but using rabbits, not human sera), finding that antibodies raised to one strain reacted more strongly by HAI to past strains than to later strains (doi: 10.1128/IAI.27.2.397-404.1980). This relates to the significance of the paper (next section).

      Thanks, this is a good and relevant citation, and we have added it when we discuss the possible asymmetry of antigenic change with respect to time.

      Obviously, there are citations to update throughout due to the booming SARS-CoV-2 literature.

      We have updated the other citations to keep pace with the fast-changing literature!

      Reviewer #1 (Significance (Required)):

      This study, if anything, undersells itself. Obviously it is a huge contribution to our understanding of how a seasonal coronavirus that bears important phenotypic resemblance to SARS-CoV-2 evolves, but I think it is also providing a foundational piece of evidence--a mechanism--of how rapid viral turnover (by antigenic evolution) occurs. There is no reason to think this should be limited to the coronaviruses, and I suspect the evidence here will go a long way to unifying the evolutionary and epidemiological dynamics of fast-evolving viruses.

      Thanks for the praise of the manuscript. Indeed, we were surprised to find that no similarly designed studies have been done even for influenza virus, and so are now interested in expanding our future work to do that as we fully agree it could provide insight more broadly.

      Asymmetric competition is nearly an ecological requirement for one strain to successfully invade and displace another. It is thought (unsure how widely?) that flu evolves antigenically, with new strains eventually displacing old ones, by mutating at key epitopes in ways that the immune system does not immediately pick up. That is, immune memory is biased to recall responses to conserved epitopes, which on average are probably less neutralizing. This will induce competition between mutant and resident viruses, but it would be symmetric, since infection with either would induce responses to conserved epitopes on the other. But if on infection with the mutant, immune memory sometimes reuses (boosts) antibody responses to target the mutated epitopes, those recycled antibodies might be less effective against the mutant, making the competition asymmetric.

      What this paper and Underwood (1980) suggest is that we can get this asymmetric, antibody-mediated competition fairly easily and without extensive memory. Underwood showed this more powerfully in rabbits, but in this paper too we see an indirect suggestion of asymmetry in relatively inexperienced children (Fig. 3). Mutants (future strains) successfully invade when they can trigger presumably recalled antibodies that are more harmful to the resident (soon historical) strain than the mutant. If this is so easy to do, as judged by the extensive data here, then it could be common.

      I've gone off on a theoretical limb here, but the paper is still important without these considerations. This work will be of interest to evolutionary biologists, epidemiologists, vaccinologists, and everyone else wondering what SARS-CoV-2 will do next and how immunity to antigenically variable pathogens works.

      We completely agree with the ideas mentioned above, and appreciate having it put in this nice context, particularly alongside the Underwood paper (with which we were not previously familiar). That said, we believe that the small number of recent children sera samples in the current study preclude us from drawing strong conclusions about the asymmetry--as the reviewer says, our data provides an indirect suggestion too. So overall we have not tried to expand this angle here because as the reviewer says, the paper is still important without these considerations. However, we are actively working to see if we can design a similar study with more children sera in the future to separately address the questions about asymmetry.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      An important question in coronavirology is what governs their ability to seemingly reinfect people regularly (within 2 or 3 years). While waning protective immunity has been proposed and is of current concern for SARS CoV-2, the role of antigenic drift driven by escape from neutralizing antibodies has not been well characterized. The authors have attempted to look at this through examining historical Spike proteins from HCoV-229E over a period of 30-odd years. The authors show that 229E evolves along a linear trajectory consistent with yearly selection by pre-existing immunity. Taking representative spike proteins from different time points into pseudovirus neut assays, they find that older spike proteins are less sensitive to neut by more recent sera. Conversely, spike proteins from prior to the birth of an individual display markedly less sensitivity to neut that those prevalent during the persons lifetime. Sequence analysis of the spike shows variation accruing in both N-termina regions and the RBD, parts of spike predominantly targeted by nABs. Lastly producing early spikes with chimeric RBDs from late viruses enhances the sensitivity to more recent sera.

      This is a potentially important MS that addresses a pertinent question that is of wide interest for the CoV2 pandemic. While it is limited in addressing the relative contribution of antigenic escape vs waning Ab titers because of the nature of the sample, the MS makes a strong case for Spike evolution being driven by antigenic escape.

      Thanks for the summary. We agree that our paper does not really address waning immunity because we don’t have sequential serum samples from the same individual. However, it does clearly show that antigenic evolution is important independent of waning immunity, because all of the experiments (e.g., Figure 2 and 3) show the same serum sample tested against newer spikes, and neutralization titers definitely decrease as the spike evolves. The reviewer is correct that this doesn’t rule out the possibility of waning immunity as a separate phenomenon, and we have been sure to emphasize that in the revised text.

      Reviewer #2 (Significance (Required)):

      While the Figs 1-3 are clear, the data in Fig 4 is somewhat preliminary. In all likelihood many people are making neutralizing antibodies both against RBD and the N-terminal region and the relative proportion probably underlies the variability in the data in Fig 4B. I think the MS would benefit from the following:

      A comparison of NTD vs RBD vs NTD/RBD chimeras in Fig 4B to give a fuller picture of antigenic escape with statistical support.

      The reviewer is correct that our manuscript does not provide a decisive answer on the relative role of NTD versus RBD targeting antibodies, although the data in Fig. 4B clearly show that RBD antibodies are important for many individuals as simply changing the RBD to that of newer viruses recapitulates the full spike antigenic evolution without any changes in the NTD or elsewhere (e.g., subject SD87_2 or SD85_3 in Fig 4B). However, for some other individuals NTD antibodies may play a role.

      In general, full dissection of the role of RBD versus NTD antibodies is beyond the scope of our study (and in some cases not even possible with the available volumes of the older serum). In any case, the major point of our study—the first experimental demonstration that seasonal coronaviruses undergo antigenic evolution—does not depend on dissecting the relative roles of RBD and NTD antibodies. We have therefore added new text explaining that we cannot fully parse the relative role of antibodies to these domains beyond knowing that RBD antibodies play n important role. We have added text to emphasize that antibodies to other regions including the NTD could also be important.

      A figure to map the polymorphic residues in Fig 4A onto the 229E spike structure to visualise their position and special relatedness, with perhaps a comparison with the latest knowledge of SASR CoV-2 epitopes.

      We agree that visualizing the variable sites on the structure is useful and have added such a visualization as a new panel in Figure 4. This allows us to more clearly show the clustering of variability in the RBD and NTD. This clustering of mutations in those regions is consistent with what is currently being seen with the emergence of SARS-CoV-2 variants with mutations in those regions of spike. However, given the divergence between SARS-CoV-2 and CoV-229E, we are not able to do a more fine-grained comparison of epitope sites as many important sites in the RBD and NTD do not have a clear one-to-one alignment (for instance, the RBD’s don’t even bind the same receptor).

      Additional discussion to reflect the new SARS CoV-2 variants and their potential selection by escape in the light of the authors data.

      We have updated the manuscript to describe the new SARS-CoV-2 variants (which mostly emerged after submission of our original manuscript) and how this emerging antigenic evolution of SARS-CoV-2 is consistent with what we saw in CoV-229E.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      An important question in coronavirology is what governs their ability to seemingly reinfect people regularly (within 2 or 3 years). While waning protective immunity has been proposed and is of current concern for SARS CoV-2, the role of antigenic drift driven by escape from neutralizing antibodies has not been well characterized. The authors have attempted to look at this through examining historical Spike proteins from HCoV-229E over a period of 30-odd years. The authors show that 229E evolves along a linear trajectory consistent with yearly selection by pre-existing immunity. Taking representative spike proteins from different time points into pseudovirus neut assays, they find that older spike proteins are less sensitive to neut by more recent sera. Conversely, spike proteins from prior to the birth of an individual display markedly less sensitivity to neut that those prevalent during the persons lifetime. Sequence analysis of the spike shows variation accruing in both N-termina regions and the RBD, parts of spike predominantly targeted by nABs. Lastly producing early spikes with chimeric RBDs from late viruses enhances the sensitivity to more recent sera.

      This is a potentially important MS that addresses a pertinent question that is of wide interest for the CoV2 pandemic. While it is limited in addressing the relative contribution of antigenic escape vs waning Ab titers because of the nature of the sample, the MS makes a strong case for Spike evolution being driven by antigenic escape.

      Significance

      While the Figs 1-3 are clear, the data in Fig 4 is somewhat preliminary. In all likelihood many people are making neutralizing antibodies both against RBD and the N-terminal region and the relative proportion probably underlies the variability in the data in Fig 4B. I think the MS would benefit from the following:

      • A comparison of NTD vs RBD vs NTD/RBD chimeras in Fig 4B to give a fuller picture of antigenic escape with statistical support.

      • A figure to map the polymorphic residues in Fig 4A onto the 229E spike structure to visualise their position and special relatedness, with perhaps a comparison with the latest knowledge of SASR CoV-2 epitopes.

      • Additional discussion to reflect the new SARS CoV-2 variants and their potential selection by escape in the light of the authors data.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors investigated the cross-neutralization capacity of serum antibodies to past and future 229E coronaviruses using 229E spikes isolated from five time points and sera from two different periods (1985-1995 and 2020). They demonstrated a general pattern of asymmetric cross-neutralization, with sera cross-reactive to historical but not future strains. Using chimeras, the authors showed this pattern was mostly driven by antibodies to the evolving RBD. The rate of change in the neutralization titer, a possible measure of antigenic evolution, was estimated to be on par with that of flu B viruses. Interesting differences in individuals' cross-neutralization capacity were observed. The main take-away is that reinfection with 229E is enabled by antigenic escape, not "weak" immunity after infection (as proposed by others).

      Major comments:

      The key conclusions are convincing and justified by the data. The work is clearly presented and presented with sufficient detail for reproducibility. Characteristically and laudably, the authors have made all the code and data publicly available on GitHub.

      Minor comments:

      p. 3: Perhaps it is clearer to write that 229E has been identified/isolated in humans for >50 y? Or do you really mean to imply (by contrast with "circulated") that NL63 emerged very recently?

      p. 3: An important citation for the antigenic implications of the ladder-like phylogeny AND phylogenetic clustering by date is the classic paper introducing phylodynamics by Grenfell et al. (2004, Science).

      p. 4: I might not be like all readers, but I prefer to see a bit in the main text about the source of sera for this kind of study. (I wonder about age, if donors are healthy, etc.)

      p. 4: "Our reason for focusing..." stops short. Is the idea that these are probably people who were recently infected?

      p. 5: Probably my biggest suggestion for the paper is that it mention another relevant study. In 1980, Anne Underwood demonstrated similar asymmetric cross-immunity among early strains of H3N2 (but using rabbits, not human sera), finding that antibodies raised to one strain reacted more strongly by HAI to past strains than to later strains (doi: 10.1128/IAI.27.2.397-404.1980). This relates to the significance of the paper (next section).

      Obviously, there are citations to update throughout due to the booming SARS-CoV-2 literature.

      Significance

      This study, if anything, undersells itself. Obviously it is a huge contribution to our understanding of how a seasonal coronavirus that bears important phenotypic resemblance to SARS-CoV-2 evolves, but I think it is also providing a foundational piece of evidence--a mechanism--of how rapid viral turnover (by antigenic evolution) occurs. There is no reason to think this should be limited to the coronaviruses, and I suspect the evidence here will go a long way to unifying the evolutionary and epidemiological dynamics of fast-evolving viruses.

      Asymmetric competition is nearly an ecological requirement for one strain to successfully invade and displace another. It is thought (unsure how widely?) that flu evolves antigenically, with new strains eventually displacing old ones, by mutating at key epitopes in ways that the immune system does not immediately pick up. That is, immune memory is biased to recall responses to conserved epitopes, which on average are probably less neutralizing. This will induce competition between mutant and resident viruses, but it would be symmetric, since infection with either would induce responses to conserved epitopes on the other. But if on infection with the mutant, immune memory sometimes reuses (boosts) antibody responses to target the mutated epitopes, those recycled antibodies might be less effective against the mutant, making the competition asymmetric.

      What this paper and Underwood (1980) suggest is that we can get this asymmetric, antibody-mediated competition fairly easily and without extensive memory. Underwood showed this more powerfully in rabbits, but in this paper too we see an indirect suggestion of asymmetry in relatively inexperienced children (Fig. 3). Mutants (future strains) successfully invade when they can trigger presumably recalled antibodies that are more harmful to the resident (soon historical) strain than the mutant. If this is so easy to do, as judged by the extensive data here, then it could be common.

      I've gone off on a theoretical limb here, but the paper is still important without these considerations. This work will be of interest to evolutionary biologists, epidemiologists, vaccinologists, and everyone else wondering what SARS-CoV-2 will do next and how immunity to antigenically variable pathogens works.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This study of Nils Halberg and colleagues aims to characterize tumor-associated immune cell

      infiltrates in a mouse model of diet-induced obesity. Authors compared different syngeneic

      tumor cell lines for mammary adenocarcinoma and pancreatic ductal adenocarcinoma. Tumor

      infiltrating leukocytes were analyzed by a 36-parametric mass cytometry protocol. The authors

      put a lot of efforts in the generation of high-quality data by applying state-of-the-art methods for

      sample barcoding and batch analyses, removal of batch-specific variations and in the

      subsequent pipeline of data analysis. The clinical relevance of the topic addressed is well

      documented in several studies, showing a clear association between obesity and the

      development of several tumors, including those tumors investigated in this study.

      Main findings of this study can be summarized that in the model system used tumor-dependent

      differences in the qualitative and quantitative composition of immune cell infiltrates were

      observed. Unfortunately, the mouse model system used obviously did not reveal convincing

      data whether obesity may modulate the process of tumor infiltration.

      The manuscript is well written, quantity of figures is appropriately and of excellent quality and

      prior studies were referenced appropriately.

      In conclusion, authors made tremendous methodological and technical efforts to generate

      robust and high-quality mass cytometry data, but the overall outcome of the study remains

      limited in respect to shedding some new light how obesity is possibly involved in the qualitative

      and quantitative modulation of tumor-related immune cell infiltration.

      Authors’ Response: We thank the reviewer for their constructive and positive feedback as well

      as appreciation of our rigorous approach. We would however argue that our data significantly

      contributes to the understanding of how obesity affects tumor immunity. We believe that our

      systemic approach across multiple tumor systems highlights that i) it matters what model you

      choose, as each model have a separate response to the obesity challenge ii) for one model, the

      E0771 model, our data reflect obesity-dependent alterations to the CD8+ T-cells population.

      This was corroborated by a parallel publication by Rigel et al., 2020 as highlighted by the 2nd

      reviewer. That being said, we too, were surprised that the pro-inflammatory obese environment

      did not have more pronounced effects on the tumor immune infiltrates across the five models.

      **Major comments:**

      Due to the limited data really showing an association between obesity and immune cell

      infiltration of tumors investigated I would suggest that authors should change emphasis of their

      results more closely related to the findings of tumor-dependent immune cell infiltrations than

      obesity-related associations. So, the title of the study should be appropriately changed since

      "High dimensional immunotyping of the obese tumor micro-environment" rather implies

      analyses of spatial relationships of immune, tumor and fat cells by immunohistological analyses,

      which would indeed help to strengthen the outcome of this mass cytometry study.

      Authors’ Response: We appreciate the constructive suggestion. We did not intend the title to

      infer immunohistochemical analysis and apologize that was the case. We have therefore

      changed the title to “High dimensional immunotyping of tumors grown in obese and non-obese

      mice” in the revised version (line 1).

      Although all the efforts made in mass cytometry data generation are quite commendable in this

      study, basic statistical issues are not clearly addressed regarding the number of biological

      replicates. How many mice were treated per tumor cell line? According to figure 1B nine chow

      and eight HFD animals were used: does this mean that only one or two mice were analyzed per

      cell line, respectively? Please explain how many animals belong to each of the seven mouse

      cohorts.

      Authors’ Response: We agree that this was not clearly defined in the manuscript. We have

      updated Figure 1A and the corresponding legend to make it clearer. The mouse numbers,

      referred to as tumors, are also located in Table 3. In total 69 mice were used, distributed as:

      E0771_1 consists of 4 chow fed mice and 4 HFD mice (N=4/4, where N=chow/HFD, for a total

      of 8 mice)

      E0771_2 is N=5/4. Wnt1 is N=6/6. TeLi is N=5/5. C11_1 is N=5/4. C11_2 is N=5/5. UN-KC is N=5/6.

      Figure 1B shows representative mouse weights only. The female mice are from breast cancer

      cohort E0771_1 and the male mice are from pancreatic cancer cohort C11_1. We chose to only

      show representative data since diet-induced obesity is well established in the C57Bl/6 strain.

      Obviously, cell lines E07771 and C11 were analyzed as duplicates only. Regarding E0771,

      tumor growth was 31 and 23 days, respectively. So, large inter-individual differences in tumor

      growth were obvious and how this is reflected at the level of tumor infiltration? Therefore, please

      explain which criteria were used to decide when the tumors had to be removed. Furthermore,

      please indicate weight, viability and absolute cell number of each tumor sample in a

      supplementary table to get an impression about variability in tumor growth.

      Authors’ Response: The reviewer brings up an important point. The E0771 and C11 cohorts are

      included in the paper as combined cohorts. The individual C11 cohorts had too few tumors

      remaining after removal of samples with too low viability (as discussed below) to analyze

      separately. The E0771 cohorts are presented together as a representation of that tumor model.

      Data analysis for the E0771 cohorts separately shows comparable population abundance

      differences and obesity-dependent changes between chow and HFD tumors. The metacluster

      fold change for non-obese and obese tumors between E0771_1 and E0771_2 correlated with a

      R2 = 0.8586. Presenting the data combined provides a more concise view of the model.

      Removal of E0771 and C11 tumors in each individual cohort were time matched. E0771 tumors

      were continuously measured by caliper and removed before they reached 1 cm3 according to the

      local ethical guidelines. The E0771_2 cohort tumors had to be removed sooner as one tumor

      reached 1 cm3 earlier. We have reported the tumor mass in Figure 1C as that is a more accurate

      measurement of final tumor burden. Pancreatic tumors were removed based on optical imaging

      of luciferase expressing cancer cells and careful monitoring of mouse distress based on the

      grimace scale. The material and methods section has been updated to reflect this (line 556-558).

      Only pancreatic tumors had viability poor enough that they had to be excluded from analysis. A

      cutoff of CD45+ 5000 cells was set and applied to cells remaining following the gating strategy

      shown in Figure 1D. Therefore, CyTOF data for tumors with fewer than 5000 CD45+/Cisplatin

      negative cells were excluded from analysis as indicated with an X in Figure1C. As requested, we

      have included tumor weights and available viability measurements in new Table 5.

      **Minor comments:**

      The generation of orthotopic pancreatic cancer mouse models is technically challenging, and

      needs more complex imaging methods to monitor the growth of the implanted tumor cells.

      Furthermore, orthotopic implantation of tumor cells into the pancreas by surgery can also inflict

      significant physical trauma to the recipient animals. How authors have monitored tumor cell

      implantation?

      Authors’ Response: We agree that tracking tumor growth in the orthotopic pancreas cancer

      model is challenging. As mentioned above, these cells were engineered to express luciferase

      and optical imaging was used to monitor growth of the implanted cells. We did not report these

      numbers as we were unable to convincingly correct for possible light absorption by the

      enhanced adipose tissue mass in the high fat group. As such, these scans were used to

      estimate the end point.

      The number of CD45-positive cells per tumor sample is not given in the manuscript, but this

      information would be important to know, because it can be expected that most of the samples

      showed less than 20.000 cells. This relatively low number of total leukocytes would not allow a

      statistically significant profiling of rare cell subsets, such as DC's or MDSC's. This limitation

      should at least clearly addressed in the discussion section.

      Authors’ Response: The reviewer raises a great point. Since the cells were live cryopreserved

      and thawed before measuring CD45, we did not determine the total immune cell infiltrate. After

      thawing, the CD45+ cells accounted for roughly 1-12% of the total events collected across all

      batches leading to a total number of CD45+ cells ranging between 54,317 and 1,102,767 per

      batch. Numbers for each batch can be found in Table 3. After gating and exclusion of tumors

      with less than 5000 CD45+ cells, the remaining tumor data were equally sampled and 5206

      CD45+ cells were included in further analysis from each tumor. Overall, we were focused on

      broad phenotyping of the immune infiltrate and not on rare subsets. Some subsets had low

      abundance in some tumors and high abundance in others. Because the analysis was performed

      altogether, the overall phenotyping and clustering did not find any truly rare subsets. DCs and

      MDSC were not rare when assessed across the datasets. While we cannot characterize the

      subsets that are small in a specific tumor type, we can be confident in the characterization

      provided by the streamlined analysis of the data as a whole.

      According to table 2 authors have used 36 immune cell-related antigens including casp3, which

      was only used to exclude apoptotic cells from downstream analyses. But as written in the

      results section only 26 phenotyping markers were used to generate the viSNE map shown in

      Figure 3. In Figure 3C-F 30 markers were shown. Please explain this obvious inconsistency of

      markers used.

      Authors’ Response: Thank you for this question. Our goal here was to generate a viSNE map

      that best separated out immune cells by phenotype. Lineage markers and well-established

      phenotyping markers were therefore included to create the well separated viSNE map. It follows

      that some markers were not included: i) Markers that were used to gate the population of

      interest (CD45 and c-Cas3) were excluded from the viSNE input parameters.; ii) Markers that

      had relatively low signal were also excluded such as MHC-1 and CD117. Including negative

      markers is computationally costly, provides limited biological insight, and can produce a worse

      viSNE map by reducing cell separation due to shared lack of signal (Diggins et al., 2015); iii)

      Activation/ exhaustion markers were excluded from the viSNE analysis because the focus of the

      phenotyping was on major cell subsets and not on activation states. The hope was to observe

      differences in exhaustion marker expression between chow and HFD; and iv) CD5 was

      excluded because having two bright T cell markers skewed the map towards a more T cell

      dominant view. Markers with meaningful expression were reintroduced in the MEM analysis

      after the viSNE map was made. Exclusion of markers from viSNE analysis is a generally

      accepted practice and has been applied previously (Wogsland et al., 2017, Cheng et al., 2016,

      Huse et al., 2019, Leelatian et al., 2020, Doxie et al., 2018, Okamato et al., 2021, Henderson et

      al., 2020). The reasoning behind using the 26 phenotyping markers have been included in the

      revised manuscript (line 754 – 757)

      How viability of tumor samples was determined?

      Authors’ Response: Viability was measured at three points using membrane exclusion assays.

      Viability was first measured upon tumor dissociation using trypan blue and a Countess cell

      counter on the single cell suspension before freezing. Values were used to guide cell aliquoting

      for cryopreservation. Viability was again measured with trypan blue upon thaw in order to

      barcode and stain 3 million live cells per sample. Before fixation, cells were again stained for

      viability, this time with cisplatin, to exclude dead cells after data collection with gating. This has

      been added to the methods section (line 559-562)

      Cells were additionally stained for cleaved-Cas3 as an indicator of cells undergoing apoptosis.

      Only pancreatic tumors had viability poor enough that they had to be excluded form analysis.

      Tumors with fewer than 5000 CD45+ Cisplatin negative cells were excluded from analysis as

      indicated with an open X in Figure1C. The tumor count in parentheses in Table 3 indicates the

      tumors that were not excluded.

      Please indicate cell loss caused by cryopreservation of dispersed tumor tissue samples.

      Authors state that mainly neutrophilic granulocytes will be lost during cryopreservation, and that

      this would help to the "definitive identification and characterization of G-MDSC". But there are

      several reports showing that MDSC-subsets also behave very sensitive during cryopreservation

      and that it is recommended to analyze fresh samples if MDSC's are of particular interest (DOI:

      10.1177/1753425912463618; DOI: 10.1177/1753425912463618). This possible limitation

      should be discussed in the manuscript and not only highlighted as advantage on the way to

      identify MDSC-subsets.

      Authors’ Response: We thank the reviewer for this insightful comment. We agree that we likely

      lost some MDSC during the cryopreservation process as shown in the reference above. But

      since no neutrophils survive standard cryopreservation (Graham-Pole et al., 1977), the Ly6G

      positive cells in our analysis are G-MDSC and not neutrophils. We assume that any cell death

      related to cryopreservation would be consistent across samples, so although cell totals may be

      lower than in the tumor, abundance differences and phenotype can still be evaluated. We have

      included a discussion of this in the revised manuscript

      (line 408 – 410).

      In the Figure 1D X-axis named by "193Ir-NA" should be replaced by "193Ir-DNA".

      Authors’ Response: NA is shorthand for nucleic acid since the iridium intercalates into DNA

      and RNA. The figure legend has been updated to make this clear.

      Furthermore, please explain "(T)" in the figure legend. Percentages in the last two dot plots

      related to "all previous gates" are confusing: 20,44% of all DNA-containing single cells were

      finally intact, living CD45+ cells, i.e. almost 80% of cells were excluded because they were dead

      or apoptotic and this corresponds to 57,06% of intact, living CD45-positive cells related to all

      CD45-positive cells? How these percentages are related to the "Percent of CD45/total raw

      events" in the last column of Table 3?

      Authors’ Response: These are great points. Thank you for bringing them to our attention. This

      confusing notation has been removed since Figure 1D is a representative gating strategy. “All

      previous gates” means that the previous gates were all applied to the population showing in that

      plot. CyTOF data requires thorough gating to remove the events that are not representative of

      actual cells so yes, many events were removed before analysis. Even more cells were excluded

      here since our focus was on the CD45+ cells and not the cancer cells. The CD45+ cells

      indicated in Table 3 and visualized in Figure 1E can be calculated by summing the total gated

      CD45+ cells per Figure 1D for each batch and dividing that by the total number of events

      collected per batch. The summed CD45+ values and the total collected events are also in Table

      3.

      Authors claimed that "155Gd_IRF4" was changed to "155Gd", but it is not clear why to mention

      that IRF4 has been NOT used throughout the study? Please provide only those technical

      details, which are necessary to understand what has been done.

      Authors’ Response: We apologize for any confusion. This change was mentioned because

      most cohorts included the IRF4 channel while a two (C11_2 and Wnt_1) did not. The FCS files

      were changed to allow for simultaneous analysis. The IRF4 antibody did not work so there

      shouldn’t be any bleed into other channels in the samples that were stained with IRF4.

      According to general practice, we believe that it is important to make note of any manipulation to

      the FCS files.

      Re Figure 6: please explain the abbreviation "TNBC".

      Authors’ Response: We apologize for not explaining this abbreviation. TNBC is short for triple

      negative breast cancer. This has been corrected in the resubmitted version.

      Experiments done with TKO mice are not described in the Materials and Methods section. In

      particular, it would be important to know the number of replicates and the number of tumors

      grown in this model. It should be also discussed that the growth kinetics of tumors in chow and

      HFD TKO mice seem to be much faster as compared to wild type mice. Principally, the TKO

      model used here is only of limited value to clarify especially the role of CD8 cells since all other

      T- and B- cell subsets including NK cells are also absent in this knockout model and indirect

      effects caused by these cells cannot be excluded.

      Authors’ Response: We deeply apologize that the TKO experiments were not included in the

      Triple knockout (Rag2-/-::CD47-/-::Il2rg-/-; TKO) mice were purchased from Jackson Laboratories (Stock No: 025730).

      We agree with the reviewer it is an important point that the E0771 tumors overall grew faster in the TKO model. Ringel et al. 2020 saw similar results when depleting CD8 T cells in their MC38 model. Comparably, the most striking difference observed was that the tumor growth between obese and non-obese mice disappeared in the TKO mice.

      We have modified the results section to include these points (line 309-310). Reviewer #1 (Significance (Required)):

      material and methods section.

      experiment was performed with N=5/5. The description of the TKO model has been added to

      Orthotopic implantation and

      monitoring of E0771 and C11 cells were performed as with the wild type C57BL/6 mice. Each

      the methods section (line 520- 533) and number of mice used has been added to the figure

      legend.

      did not observe any major growth changes (overall growth rate and growth differences between

      obese and non-obese mice) in the TKO mice compared to the wild type mice.

      In the C11 model, interestingly, we

      We agree that the combined lack of B- and NK- cells in combination to the lack of T-cells

      exclude a direct conclusion on the effect of obesity-dependent alteration in T-cell phenotypes.

      Altogether, this study is a paragon that a single technology-based study alone, even when well-

      designed, is not sufficient to explore complex tumor microenvironment-immune cell interactions

      and that additional information on spatial relationships of cells and possibly single cell-based

      RNAseq techniques are necessary to shed new light on this ambitious topic. But there is no

      doubt that the potential of mass cytometry has been not fully exploited in this study and that a

      more focused view on particular cell types identified so far, such as macrophages or CD8 cells,

      by using as many immunophenotypic and functionally-related parameters as necessary would

      allow a more in depth-phenotyping of particular immune cell compartments.

      The significance of this subject would have been tremendously increased if human samples will

      be analyzed in a future confirmative study.

      Authors’ Response: Again, these are important insights. To what extend we have taken full

      advantage of the suspension mass cytometry technology is of course debatable. When we set

      out to perform these studies, we were compelled to take a broad approach rather than focusing

      on a single cell type for the following reasons: i) we had noticed extreme variability in immune

      targeted analysis through FACS of murine cancer models. Since we set out to demonstrate

      systemic effects of the obese environment rather than model-specific effects, the broad antibody

      panel made the most sense and ii) tumor immune infiltrates are known to be composed of

      multiple cell types and the effect of the obese state would likely affect multiple of these. To not

      bias ourselves this prompted us to design a rather broad immune panel. With the knowledge

      derived from this study and others (For example Rigel et al, Cell 2020 and Chung et all, Cell

      2020), new and more focused panels could be developed and implemented for future studies.

      We agree that the inclusion of human data would be of great value. We were, however, unable

      to obtain suitable human material that could be used for this suspension mass cytometry

      analysis. This was largely due to large inconsistencies in reported patient BMI and inadequate

      tumor freezing conditions.

      Even when I'm not a specialist in tumor biology, based on my expertise in the fields of chronic

      inflammation and cytometry, I'm convinced that the outlined way of generating

      immunophentypic data by single cell-based mass cytometry is of major interest not only for

      tumor biologists, but will be for sure recognized by a broad scientific community interested in the

      generation of single cell-based immunophenotypic data.

      Authors’ Response: Thank you for your helpful and supportive feedback. It is indeed our hope

      and motivation that the immunophenotyping platform presented herein will be broadly applicable

      to other cancer immunologists and fields.

      Reviewer #2

      (Evidence, reproducibility and clarity (Required)):

      Wogsland et al. apply herein mass cytometry (CyTOF) to investigate how obesity affects tumor

      immune infiltrates. They use several models of murine breast and pancreatic cancers and

      analyse their immune landscape thanks to an extended panel of 36 markers. They notably

      describe a decrease in CD8 T cells in one breast cancer model fed with high fat diet inducing

      obesity which favors tumor development.

      Overall, the report is clearly written and follows a very logical plan. Figures are also clear and

      nicely support the text. The mass cytometry approach appears quite original and could be

      relevant for many readers.

      Authors’ Response: We thank the referee for their constructive and positive comments on our work.

      The referee raises the general criticism that our study is descriptive.

      Nevertheless, some concerns have to be made and would need to be acknowledged by

      authors:

      -First of all, the paper appears very descriptive. Except at the end of the last figures, authors

      only establish of catalog of immune cells in different tumors. Even if the trueness of such

      observations is undisputable, their relevance to improve our understanding of tumor biology is

      clearly questionable.

      Authors’ Response:

      While we agree that the majority of the manuscript is focused on establishing a robust immune

      atlas in multiple tumor models grown in obese and non-obese mice, we believe that such work

      has important merit: i) our immune cell atlas of 5 transplant models will be a valuable resource

      for other cancer researchers interested in the immune-oncology field (as also highlighted by the

      first referee); ii) our findings clearly underscore the critical need to apply multiple cell lines in

      experimental setups when studying the interaction between tumors and immune cells –

      particularly in the obese setting; iii) we have implemented an analysis pipeline that is broadly

      applicable for high dimensional mass cytometry data that will be useful for future high-

      dimensional immunotyping efforts, iv) through our unbiased analysis pipeline we did identify

      obesity-dependent alterations to the CD8 cell population in the E0771 model. This finding was

      corroborated by the studies by Ringel et al., 2020. Collectively, we strongly believe that our

      studies will contribute to the advancement of our understanding of tumor biology.

      -Moreover, the major finding claimed in this study (CD8 T cells decrease in tumors from HFD

      mice) has been very recently published paper also providing mechanistic insights (Ringel et al.,

      Cell 2020). Authors could legitimately be disappointed but the interest of their study is sadly

      severely impacted by this prior publication. This key paper should be at least discussed and

      included in references.

      Authors’ Response: The paper by Ringel et al., was published after we originally submitted our

      manuscript for review and was therefore not referenced or discussed (Ringel et al., 2020). In the

      resubmitted version of the manuscript, we have included a thorough discussion of the paper’s

      findings in terms of consistencies and inconsistencies with our conclusions (line 545-463).

      -Finally, even if the initial strategy of integration of breast and pancreatic cancers was

      indubitably a good one, results reported in figures 5 & 6 clearly show a quite specific

      observation in the E0771 model. So in this context, integrating all these datasets do not improve

      the understanding of this phenotype

      Authors’ Response: We thank the referee for bringing up this point. Regardless of the outcome, we would strongly argue that the integrated approach to be advantageous to individual analysis. The integrated approach did not hinder new discoveries in any of the datasets – if anything, the integrated analysis pipeline developed herein would facilitate new discoveries that would be missed by repeating individual analysis. By integrating the datasets, we enabled the robust identification of more cell subsets. In particular cell types that displayed low abundance in some models. Those cells would have likely been hidden or even missed in a larger subset had the models been analyzed separately. As such, we maintain that the integrated approach is the correct and most biological meaningful to follow when given the possibility.

      Besides these quite general comments, few more specific points:

      -In Fig 2, the F4/80 signal appears very weak in all datasets except one (TeLi) with an almost

      flat curve for all the other ones. It asks the question of the reproducibility of the staining that

      could be only partially corrected with batch correction algorithms.

      Authors’ Response: The F4/80 peak in the TeLi cohort is indicative of a large F4/80

      population rather than a sign of signal intensity differences. TeLi tumors had much higher

      abundance of F4/80+ cells than did the other tumor types as can be seen in Figure 4B. For

      each mass cytometry run, we included a control sample to ensure equal staining patterns

      between the antibodies in each run.

      -Obesity is clearly known to be sex/hormone dependent as confirmed by authors themselves in

      their Fig 1B so again the global integration (both sex and 2 organs) strategy is disputable here.

      It is hard to know if there is no effect in the pancreas because of tissue or sex specificities.

      Authors’ Response: Thank you for the feedback. We specifically tried to show the different

      tumors side by side without making too many comparisons across tumor types because of the

      sex and tissue differences (as was noted in the results section of the manuscript, line 267). Both

      breast and pancreatic tumor models are relevant for studying the obesity cancer connection

      which is why we have worked to develop these models with different cancer types. Even with

      the sex and tissue differences, male and female mice became obese on a high fat diet, and

      tumors from both tissues grew larger on a high fat diet

      . It is our hope that this work will pave the

      way for future studies to interrogate these differences.

      -On Fig 1C, red dots are closed or open but explanation of this is lacking.

      Authors’ Response: Thank you for pointing this out. The figure legend has been updated. The

      X indicates a tumor that had too few live CD45+ cells to be included in the data CyTOF

      analysis. We apologize this was not clear.

      -Authors use 36 markers in their CyTOF panel but use only 26 for the dimension reduction

      without clearly explaining this choice. Should be amended. For example, why excluding CD5?

      Authors’ Response: Thank you for bringing this up. We have addressed these concerns in

      response to Reviewer #1 above.

      Reviewer #2 (Significance (Required)):

      Severly impaired by Ringel et al., Cell 2020

      Authors’ Response:

      It is clear that the study by Ringel et al., demonstrate new and important mechanistic insights

      into the connection between obesity, T-cell biology and tumor behavior. Our studies share many

      of the same conclusions on tumor immune cell infiltrate in obesity – particularly the T-cell finding

      in our E0771 model. However, we stipulate that our experimental approach and scientific

      questions differ. Our approach was to generate high-dimensional immune phenotyping atlas

      across multiple models to identify overarching obesity-dependent effects. The manuscript by

      Ringel et al., has a more mechanistic focus. The field would benefit from the additive insights

      from the two papers combined.

      Rebuttal References:

      CHENG, Y., WONG, M. T., VAN DER MAATEN, L. & NEWELL, E. W. 2016. Categorical Analysis of Human T Cell Heterogeneity with One-Dimensional Soli-Expression by Nonlinear Stochastic Embedding. The Journal of Immunology, 196, 924-932.

      DIGGINS, K. E., FERRELL, P. B., JR. & IRISH, J. M. 2015. Methods for discovery and characterization of cell subsets in high dimensional mass cytometry data. Methods, 82, 55-63.

      DOXIE, D. B., GREENPLATE, A. R., GANDELMAN, J. S., DIGGINS, K. E., ROE, C. E., DAHLMAN, K. B., SOSMAN, J. A., KELLEY, M. C. & IRISH, J. M. 2018. BRAF and MEK inhibitor therapy eliminates Nestin-expressing melanoma cells in human tumors. Pigment Cell & Melanoma Research, 31, 708-719.

      GRAHAM-POLE, J., DAVIE, M. & WILLOUGHBY, M. L. 1977. Cryopreservation of human granulocytes in liquid nitrogen. Journal of Clinical Pathology, 30, 758.

      HENDERSON, L. A., HOYT, K. J., LEE, P. Y., RAO, D. A., JONSSON, A. H., NGUYEN, J. P., RUTHERFORD, K., JULÉ, A. M., CHARBONNIER, L.-M., CASE, S., CHANG, M. H., COHEN, E. M., DEDEOGLU, F., FUHLBRIGGE, R. C., HALYABAR, O., HAZEN, M. M., JANSSEN, E., KIM, S., LO, J., LO, M. S., MEIDAN, E., SON, M. B. F., SUNDEL, R. P., STOLL, M. L., NUSBAUM, C., LEDERER, J. A., CHATILA, T. A. & NIGROVIC, P. A. 2020. Th17 reprogramming of T cells in systemic juvenile idiopathic arthritis. JCI Insight, 5.

      HUSE, K., WOGSLAND, C. E., POLIKOWSKY, H. G., DIGGINS, K. E., SMELAND, E. B., MYKLEBUST, J. H. & IRISH, J. M. 2019. Human Germinal Center B Cells Differ from Naïve and Memory B Cells in CD40 Expression and CD40L-Induced Signaling Response. Cytometry Part A, 95, 442-449.

      LEELATIAN, N., SINNAEVE, J., MISTRY, A. M., BARONE, S. M., BROCKMAN, A. A., DIGGINS, K. E., GREENPLATE, A. R., WEAVER, K. D., THOMPSON, R. C., CHAMBLESS, L. B., MOBLEY, B. C., IHRIE, R. A. & IRISH, J. M. 2020. Unsupervised machine learning reveals risk stratifying glioblastoma tumor cells. eLife, 9.

      OKAMATO, Y., GHOSH, T., OKAMOTO, T., SCHUYLER, R. P., SEIFERT, J., CHARRY, L. L., VISSER, A., FESER, M., FLEISCHER, C., PEDRICK, C., AUGUST, J., MOSS, L., BEMIS, E. A., NORRIS, J. M., KUHN, K. A., DEMORUELLE, M. K., DEANE, K. D., GHOSH, D., HOLERS, V. M. & HSIEH, E. W. Y. 2021. Subjects at-risk for future development of rheumatoid arthritis demonstrate a PAD4-and TLR-dependent enhanced histone H3 citrullination and proinflammatory cytokine production in CD14hi monocytes. Journal of Autoimmunity, 117, 102581.

      RINGEL, A. E., DRIJVERS, J. M., BAKER, G. J., CATOZZI, A., GARCÍA-CAÑAVERAS, J. C., GASSAWAY, B. M., MILLER, B. C., JUNEJA, V. R., NGUYEN, T. H., JOSHI, S., YAO, C.-H., YOON, H., SAGE, P. T., LAFLEUR, M. W., TROMBLEY, J. D., JACOBSON, C. A., MALIGA, Z., GYGI, S. P., SORGER, P. K., RABINOWITZ, J. D., SHARPE, A. H. & HAIGIS, M. C. 2020. Obesity Shapes Metabolism in the Tumor Microenvironment to Suppress Anti-Tumor Immunity. Cell, 183, 1848-1866.e26.

      WOGSLAND, C. E., GREENPLATE, A. R., KOLSTAD, A., MYKLEBUST, J. H., IRISH, J. M. & HUSE, K. 2017. Mass Cytometry of Follicular Lymphoma Tumors Reveals Intrinsic Heterogeneity in Proteins Including HLA-DR and a Deficit in Nonmalignant Plasmablast and Germinal Center B-Cell Populations. Cytometry B Clin Cytom, 92, 79-87.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Wogsland et al. apply herein mass cytometry (CyTOF) to investigate how obesity affects tumor immune infiltrates. They use several models of murine breast and pancreatic cancers and analyse their immune landscape thanks to an extended panel of 36 markers. They notably describe a decrease in CD8 T cells in one breast cancer model fed with high fat diet inducing obesity which favors tumor development.

      Overall, the report is clearly written and follows a very logical plan. Figures are also clear and nicely support the text. The mass cytometry approach appears quite original and could be relevant for many readers.

      Nevertheless, some concerns have to be made and would need to be acknowledged by authors:

      -First of all, the paper appears very descriptive. Except at the end of the last figures, authors only establish of catalog of immune cells in different tumors. Even if the trueness of such observations is undisputable, their relevance to improve our understanding of tumor biology is clearly questionable.

      -Moreover, the major finding claimed in this study (CD8 T cells decrease in tumors from HFD mice) has been very recently published paper also providing mechanistic insights (Ringel et al., Cell 2020). Authors could legitimately be disappointed but the interest of their study is sadly severely impacted by this prior publication. This key paper should be at least discussed and included in references.

      -Finally, even if the initial strategy of integration of breast and pancreatic cancers was indubitably a good one, results reported in figures 5 & 6 clearly show a quite specific observation in the E0771 model. So in this context, integrating all these datasets do not improve the understanding of this phenotype

      Besides these quite general comments, few more specific points: -In Fig 2, the F4/80 signal appears very weak in all datasets except one (TeLi) with an almost flat curve for all the other ones. It asks the question of the reproducibility of the staining that could be only partially corrected with batch correction algorithms.

      -Obesity is clearly known to be sex/hormone dependent as confirmed by authors themselves in their Fig 1B so again the global integration (both sex and 2 organs) strategy is disputable here. It is hard to know if there is no effect in the pancreas because of tissue or sex specificities.

      -On Fig 1C, red dots are closed or open but explanation of this is lacking.

      -Authors use 36 markers in their CyTOF panel but use only 26 for the dimension reduction without clearly explaining this choice. Should be amended. For example, why excluding CD5?

      Significance

      Severly impaired by Ringel et al., Cell 2020

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This study of Nils Halberg and colleagues aims to characterize tumor-associated immune cell infiltrates in a mouse model of diet-induced obesity. Authors compared different syngeneic tumor cell lines for mammary adenocarcinoma and pancreatic ductal adenocarcinoma. Tumor infiltrating leukocytes were analyzed by a 36-parametric mass cytometry protocol. The authors put a lot of efforts in the generation of high-quality data by applying state-of-the-art methods for sample barcoding and batch analyses, removal of batch‐specific variations and in the subsequent pipeline of data analysis. The clinical relevance of the topic addressed is well documented in several studies, showing a clear association between obesity and the development of several tumors, including those tumors investigated in this study.

      Main findings of this study can be summarized that in the model system used tumor-dependent differences in the qualitative and quantitative composition of immune cell infiltrates were observed. Unfortunately, the mouse model system used obviously did not reveal convincing data whether obesity may modulate the process of tumor infiltration. The manuscript is well written, quantity of figures is appropriately and of excellent quality and prior studies were referenced appropriately. In conclusion, authors made tremendous methodological and technical efforts to generate robust and high-quality mass cytometry data, but the overall outcome of the study remains limited in respect to shedding some new light how obesity is possibly involved in the qualitative and quantitative modulation of tumor-related immune cell infiltration.

      Major comments:

      Due to the limited data really showing an association between obesity and immune cell infiltration of tumors investigated I would suggest that authors should change emphasis of their results more closely related to the findings of tumor-dependent immune cell infiltrations than obesity-related associations. So, the title of the study should be appropriately changed since "High dimensional immunotyping of the obese tumor micro-environment" rather implies analyses of spatial relationships of immune, tumor and fat cells by immunohistological analyses, which would indeed help to strengthen the outcome of this mass cytometry study. Although all the efforts made in mass cytometry data generation are quite commendable in this study, basic statistical issues are not clearly addressed regarding the number of biological replicates. How many mice were treated per tumor cell line? According to figure 1B nine chow and eight HFD animals were used: does this mean that only one or two mice were analyzed per cell line, respectively? Please explain how many animals belong to each of the seven mouse cohorts. Obviously, cell lines E07771 and C11 were analyzed as duplicates only. Regarding E0771, tumor growth was 31 and 23 days, respectively. So, large inter-individual differences in tumor growth were obvious and how this is reflected at the level of tumor infiltration? Therefore, please explain which criteria were used to decide when the tumors had to be removed. Furthermore, please indicate weight, viability and absolute cell number of each tumor sample in a supplementary table to get an impression about variability in tumor growth.

      Minor comments:

      The generation of orthotopic pancreatic cancer mouse models is technically challenging, and needs more complex imaging methods to monitor the growth of the implanted tumor cells. Furthermore, orthotopic implantation of tumor cells into the pancreas by surgery can also inflict significant physical trauma to the recipient animals. How authors have monitored tumor cell implantation? The number of CD45-positive cells per tumor sample is not given in the manuscript, but this information would be important to know, because it can be expected that most of the samples showed less than 20.000 cells. This relatively low number of total leukocytes would not allow a statistically significant profiling of rare cell subsets, such as DC's or MDSC's. This limitation should at least clearly addressed in the discussion section. According to table 2 authors have used 36 immune cell-related antigens including casp3, which was only used to exclude apoptotic cells from downstream analyses. But as written in the results section only 26 phenotyping markers were used to generate the viSNE map shown in Figure 3. In Figure 3C-F 30 markers were shown. Please explain this obvious inconsistency of markers used. How viability of tumor samples was determined? Please indicate cell loss caused by cryopreservation of dispersed tumor tissue samples. Authors state that mainly neutrophilic granulocytes will be lost during cryopreservation, and that this would help to the "definitive identification and characterization of G-MDSC". But there are several reports showing that MDSC-subsets also behave very sensitive during cryopreservation and that it is recommended to analyze fresh samples if MDSC's are of particular interest (DOI: 10.1177/1753425912463618; DOI: 10.1177/1753425912463618). This possible limitation should be discussed in the manuscript and not only highlighted as advantage on the way to identify MDSC-subsets. In the Figure 1D X-axis named by "193Ir-NA" should be replaced by "193Ir-DNA". Furthermore, please explain "(T)" in the figure legend. Percentages in the last two dotplots related to "all previous gates" are confusing: 20,44% of all DNA-containing single cells were finally intact, living CD45+ cells, i.e. almost 80% of cells were excluded because they were dead or apoptotic and this corresponds to 57,06% of intact, living CD45-positive cells related to all CD45-positive cells? How these percentages are related to the "Percent of CD45/total raw events" in the last column of Table 3 ?

      Authors claimed that "155Gd_IRF4" was changed to "155Gd", but it is not clear why to mention that IRF4 has been NOT used throughout the study? Please provide only those technical details, which are necessary to understand what has been done.

      Re Figure 6: please explain the abbreviation "TNBC". Experiments done with TKO mice are not described in the Materials and Methods section. In particular, it would be important to know the number of replicates and the number of tumors grown in this model. It should be also discussed that the growth kinetics of tumors in chow and HFD TKO mice seem to be much faster as compared to wild type mice. Principally, the TKO model used here is only of limited value to clarify especially the role of CD8 cells since all other T- and B- cell subsets including NK cells are also absent in this knockout model and indirect effects caused by these cells cannot be excluded.

      Significance

      Altogether, this study is a paragon that a single technology-based study alone, even when well-designed, is not sufficient to explore complex tumor microenvironment-immune cell interactions and that additional information on spatial relationships of cells and possibly single cell-based RNAseq techniques are necessary to shed new light on this ambitious topic. But there is no doubt that the potential of mass cytometry has been not fully exploited in this study and that a more focused view on particular cell types identified so far, such as macrophages or CD8 cells, by using as many immunophenotypic and functionally-related parameters as necessary would allow a more in depth-phenotyping of particular immune cell compartments. The significance of this subject would have been tremendously increased if human samples will be analyzed in a future confirmative study.

      Even when I'm not a specialist in tumor biology, based on my expertise in the fields of chronic inflammation and cytometry, I'm convinced that the outlined way of generating immunophentypic data by single cell-based mass cytometry is of major interest not only for tumor biologists, but will be for sure recognized by a broad scientific community interested in the generation of single cell-based immunophenotypic data.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all three reviewers for their very useful and constructive comments. Below is our point-by-point response.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Viais R et al describes a novel role for augmin complex in apoptosis prevention during brain development. Augmin complex recruits g TuRC to microtubule lattices to nucleate microtubule branches. The authors show how -in its absence- neural progenitors have elevated p53 activity and apoptotic rate, with severe consequences on overall brain development. In particular, augmin-deleted neural progenitors display spindle abnormalities and mitotic delay, which induce DNA damage accountable for p53-induced apoptosis.

      One point that I personally found very interesting is the role of augmin-dependent MT nucleation depletion in interphase. The authors mention (line 152) that at stage E13.5, besides the number of neurons being reduced, a few neurons were misplaced in the apical region, indicating a role for augmin-driven MT nucleation in cell migration. Moreover, the authors showed that p53 genetic deletion in the Haus6 cKO rescues the apoptosis phenotype but not the tissue disorganisation, suggesting that augmin-dependent microtubule might play a role in tissue polarity. While this is well presented in the discussion, the title in line 268 narrowly refers to mitotic augmin roles. I would like here to see the authors referring to putative roles for augmin-mediated MT nucleation in interphase, by toning down the title in line 268.

      We note that severe loss of tissue integrity is evident in the p53 KO background. In this background cells are allowed to repeatedly undergo defective cell divisions with aberrant chromosome segregation, producing increasingly abnormal daughter cells that may eventually fail to support epithelial integrity. Regarding possible neuronal migration defects, this has been previously observed in a study by the Hoogenraad group (Cunha-Ferreira et al., Cell Reports, 2018, 24, 791–800) and this is mentioned in our discussion. To account for the possibility that augmin may have roles beyond mitosis, we have changed the heading to a more neutral statement, not specifically referring to proliferation/mitosis:Loss of augmin in p53 KO brains disrupts neuroepithelium integrity”.

      Overall, the text is well written and flows easily. Figures are clear and legends provide sufficient information on experimental conditions, number of replicates and scale bars. I noticed that, although the number of repeats is specified, the number of cells scored per experiment is not always included. In my comments below I highlight cases where this missing information should be added.

      **Specific points:**

      1. In the Cep63 KO (Marjanovic et al, 2015) and the CenpJ KO mice (Insolera et al, 2014), as well as other recently published papers (e.g. Phan TP et al, EMBO Journal, 2020) part of the phenotypical characterisation of the KO mice displays pictures of the overall brain dissected from the mice. Could the author show these images?

      The main difference between the cited studies (including our own on the role of CEP63 in brain development) and our current study is that in the previous studies brains are microcephalic but essentially intact, whereas in our current study brain development was aborted and accompanied by cell death and severe tissue disruption. As a result, these brains are very fragile and difficult/impossible to isolate. An additional challenge is the fact that brain disruption occurs at a very early developmental stage (before E13.5), where dissection is more difficult than at later stages. Indeed, we note that all the brains presented in the above cited studies were from later embryonic stages or newborn/adult mice. Therefore, instead of dissecting brains, we decided to present encephalic coronal and sagittal sections as shown in Fig. 1c, d, e, Fig. S1c, and Fig. 3b, e to show the overall impact of Haus6 cKO and Haus6 cKO p53 KO on embryonic brain morphology at E13.5 and E17.5.

      Fig2d: do the insets correspond to higher magnification images? What is the zoom factor? I could not find it in the legend.

      The zoom factor is 1.4 - we have added this information to the figure legend.

      Fig2E,I and K graphs: how many cells were quantified here over how many experiments? I could not find information in the figure legend.

      We have added the information regarding the number of embryos and counted cells to the figure legends.

      The impact of Haus6 on mitotic spindle needs further clarification:

      o Fig2F: here, the authors show quantification for abnormal and multipolar spindle together. Later on, the abnormal spindle phenotype is no longer discussed (Fig4). I was wondering what is the individual contribution of abnormal and multipolar spindle, separately. Which one of the two is more frequent? Could the authors explain in the text how they define an abnormal spindle? Is it the lack of MT with the condensed chromosome area?

      We agree that our previous classification was somewhat confusing. The spindle defects in Haus6 cKO cells are directly linked to the spindle pole fragmentation phenotype shown in Fig. 2d, e. Association of spindle microtubules with these scattered PCM fragments causes spindles to appear overall disorganized. In some cases, multiple smaller asters are present, which is what we had termed “multipolar”. However, this does not always involve multipolar DNA configurations, which we separately quantify in Fig. 4. To avoid confusion, we now classify spindle morphologies based on tubulin staining simply as “normal” (bipolar configuration, two robust and focused asters) or “disorganized” (lack of bipolar configuration, in some cases multiple smaller asters). We have included a better description of this classification (lines 202-205).

      o Could it be that augmin deletion induce an instability in MTs within the mitotic spindle, leading to the "empty" or with very few MTs spindles? Or could it be that more cold-sensitive MTs are affected by fixation? What is the percentage of the spindle with no MT in control?

      It is possible that augmin-deficient spindles are less well-preserved during fixation due to compromised spindle microtubule stability. Indeed, in tissue culture cells augmin deficient spindle microtubules are more cold-sensitive than controls (Zhu et al., 2008, JCB, 183, 835-848). To address this we will determine the percentage of mitotic control and Haus6 cKO cells lacking microtubule staining.

      o Did the authors quantify anaphase/telophase phenotypes as they did in Fig4f?

      Yes, this quantification was already included in Fig. 4j, where we compared abnormal chromosome configurations between Haus6 cKO and Haus6 cKO p53 KO.

      o How do authors explain PCM fragmentation here? Could this phenotype be due to an initial cytokinesis defect which led the cells to accumulate extra centrosomes? Or could this maybe be a product of aberrant PCM maturation/centrosome duplication? Could the authors add here a line to discuss the possible origin of pole fragmentation?

      The PCM fragmentation phenotype has previously been described after augmin RNAi in cultured cells (Lawo et al., 2009, Curr Biol, 19, 816-826). We refer to this result in the discussion and we have added the above reference, to emphasize this point. The authors showed that this phenotype does not involve amplification of centriole number, but is caused by an imbalance in microtubule-dependent forces acting on the PCM and leading to its fragmentation. Thus, the extra poles were formed by acentriolar PCM fragments. We will clarify this issue by quantifying centriole numbers in mitotic cells (when centriole duplication is complete) in control and Haus6 cKO brains. We expect that this will confirm the data previously obtained in cell lines showing that in most cells the fragmented poles are not due to extra centrioles (see also below).

      Apart from the PCM fragmentation phenotype that does not alter centriole number, previous work in cultured cells also described cytokinesis defects. Failed cytokinesis would indeed lead to increased centriole number. However, it would also increase DNA content, which would be visible by an increase in the size of interphase nuclei (which we observed in Haus6 cKO p53 KO cells and quantified in Fig. 4J) and a larger size of mitotic figures. We now refer to the possibility of cytokinesis defects and cite previous work in lines 272-274. In case we observe cells with increased centriole number, which we will quantify for the revised version of the manuscript (see above), we will also determine if this corelates with an increased size of the corresponding mitotic figures. If so, this would be consistent with failed cytokinesis as cause of extra centrosomes.

      Fig 4 Did the authors quantify centrosome fragmentation and abnormal spindle here? As they characterised them for the Haus6 cKO mouse, it would be preferable to maintain the same characterisation for the Haus6 cKO p53KO.

      We will quantify pole fragmentation and spindle defects also in Haus6 cKO p53 KO as shown for Haus6 cKO in Fig. 2.

      Fig4c and d: how many replicates were done to obtain these graphs? I think the authors forgot to add this information in the figure legend.

      This information has been included in the figure legend.

      Fig4f,g, I and J: how many cells were counted per experiment? I appreciate the authors writing the n of experiments performed.

      We have added this information to the figure legend.

      Fig5d: how many cells were counted per experiment?

      We have added this information to the figure legend.

      Reviewer #1 (Significance (Required)):

      While it was already known that mitotic delay affects the neuronal progenitor pool through activation of p53-dependent apoptosis (Pilaz L-J, Neuron 2016; Mitchell-Dick A, Dev Neurosci 2020), and that this can be triggered by depletion of centrosomal proteins as Cenpj and Cep63, the role of surface-dependent microtubule nucleation was not identified so far. Some insights come from a Haus6-KO mouse model which dies during blastocyst stage after several aberrant mitosis (Watanabe S, Cell Reports, 2016). In parallel, McKinley KL et al showed that Haus8 depletion in human cells (RPE1cells) triggered p53-dependent G1 arrest following mitotic defects (McKinley KL, Developmental Cell, 2017). Building on the Hause6 KO mouse and human cell line data, here Viais R et al discover a novel role for the augmin-mediated MT nucleation in neural progenitor growth and brain development in vivo, through prevention of p53-induced apoptosis.

      Specifically, Viais R et al show that:

      1. Surface-dependent microtubule nucleation depletion severely impacts brain development, disrupting partly or completely forebrain domains and cerebellum;
      2. Surface-dependent microtubule nucleation depletion induce spindle abnormalities, resulting in mitotic delay in apical progenitors;
      3. Mitotic delay results in DNA breaks, p53 activation and p53-induced apoptosis.

        This is a tidy, well-executed study with good quality data. These findings propose a novel mechanism that results essential for neural progenitor and overall brain development.

        In my opinion, a large audience will benefit from these discoveries: from developmental biologists to cell biologists focused on microtubule dynamics, cell cycle, differentiation, stem cells and cell polarity.

      Key works describing my area of expertise: microtubule dynamics, centrosome function, cell cycle regulation and cell polarity.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Viais, Lüders and colleagues here present an analysis of augmin's roles in neural stem cell development. They describe a dramatic impact of the conditional ablation of Haus6 on embryonic brain development in the mouse, with mitotic problems that lead to greatly-increased levels of apoptosis. The rescue of this apoptosis by mutation of the gene that encodes p53 did not restore brain development, which was still aberrant, due to mitotic errors.

      The paper is clearly written, with well-designed and controlled experiments. Its conclusions are well supported by the data presented. I have few comments on the technical aspects of the work- it appears very solid to me.

      **Specific comments**

      1. Clearer explanation of the mouse strains used should be provided. The section describing the generation of the Haus6 conditional on p.5 should specify that this is the same as was already published in the 2016 Watanabe paper (this is in the Materials and Methods, but this should be more clearly specified. More specific details of the p53 knockout mice from Jackson should be included in the Materials and Methods.

      We have included additional information describing the generation of the Haus6 cKO mice in the main text (line 137-140). It is not exactly the same as described in the Watanabe et al. paper. The previously published strain (Watanabe et al., 2016, Cell Reports, 15, 54-60) contained a floxed Haus6 cKO allele with a flanking neomycin cassette. For the current study the neomycin cassette was removed. Details are described in the method section and also shown in Fig. S1a. Specific information regarding the p53 KO strain has been added to the method section.

      Figure 1a contains minimal information on the Haus6 locus. More detail should be included for information, if this Figure is to remain (although reference to the targeting details in the original description would be sufficient). It is unclear what the timeline diagram is to convey and it should be improved or deleted. A similar comment applies for the details in Figure 3a, although the colour scheme for the different genotypes is useful.

      More detailed information on the Haus6 locus is shown in the schematic of Fig. S1a and in the referenced study (Watanabe et al., 2016, Cell Reports, 15, 54-60). Since the targeting of Haus6 exon1 was previously described, we feel that including this information as a supplementary figure and referring to the previous study is appropriate.

      Regarding the schematics in Fig. 1a and Fig. 3a, we have improved these. The timeline shows the time points of Cre expression and of obtaining embryos for analysis.

      The important PCR controls in Figure S1b have an unexplained 1000 bp band that appears only in the floxed heterozygote. It would be helpful if the authors explained this in the relevant Figure legend.

      This band is an artifact and represents heteroduplexes of floxed (1080 bp) and wild type (530 bp) DNA strands due to extended regions of complementary. We have explained this in the figure legend.

      Assuming the putative centrosome 'clusters' in Figure 6c are similar to the fragmented structures seen in thalamus in Figure 2d, a different description should be used to avoid confusion with multiple centrosomes, which is not a phenotype here. It is not clear how the loss of centrosomes from the ventricular surface was scored, whether it was based on total gamma-tubulin signal or individual centrosomes; how fragmented poles would affect that is unclear, so the legend and relevant details should clarify this point.

      The fragmented spindle poles shown in Fig. 2d are different from the centrosome clusters in Fig. 6c. The fragmented poles are fragments of PCM rather than extra centrosomes. Fragmentation is specific to mitosis, involving forces exerted by spindle microtubules (Lawo et al., 2009, Curr Biol, 19, 816-826). In contrast, the centrosome clusters that we observed in Haus6 cKO p53 KO apical progenitors represent centrosomes from multiple cells in interphase, most likely as part of apical membrane patches that have delaminated form the ventricular surface. In the intact epithelium of controls these centrosomes line the ventricular surface. To avoid confusion, we now indicate in the text and legend that these centrosome clusters involve interphase cells.

      Phospho-histone H2AX should be referred to as a marker of activation of the DNA damage response, rather than DNA repair.

      We have changed the text accordingly.

      **Minor points**

      i. Figure 1b should include a scale bar.

      We have added the scale bar.

      ii. The labelling of Figure 1f should be revised.

      The labels have been fixed.

      iii. Figure 2k is not labelled in this Figure.

      This has been fixed.

      iv. Scale bars should be included in the blow-ups in Figure 6c.

      We have added the scale bars.

      Reviewer #2 (Significance (Required)):

      While it is striking that they see complete disruption of brain development, rather than microcephaly, arguably the mechanistic novelty of the findings is moderate, in that the impacts of Haus6 deficiency on mitotic spindle assembly are well established. The authors only allude to potential additional and novel activities of augmin (in neural progenitors, potentially) that might explain this possibly-unexpected outcome of this study. The topic is likely to be of interest to people in the field of mitosis, genome stability and brain development.

      My expertise is cell biology/ mitosis, less so on murine brain development.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Jens Lüders &Co demonstrates the essential role of Augmin-mediated MT is critical for proper brain development in mice. The most striking point is that even p53 is eliminated, the microcephaly phenotypes of Haus6 KOs were not rescued. This could mean that the Augmin-mediated MT process is critical to cellular functions that are independent of p53. The authors claim that there are increased DNA damage and excessive mitotic errors. In these aspects, the current work is fascinating. Nevertheless, what causes massive damage to the neural epithelial tissues in the double mutant is not well explained or examined. Few questions appear in mind before I go into the detail. Are these animals still harbor functional centrosomes and their numerical status?

      This is an important point that was also raised by the other reviewers. Based on previous work in cells lines (Lawo et al., 2009, Curr Biol, 19, 816-826), we do not expect that loss of augmin directly impairs centrosomes. Indeed, the authors showed that centriole number was unaffected. The only centrosome defect that the authors observed was fragmentation of the PCM during mitosis, but this was shown to be due to imbalanced forces exerted by spindle microtubules: fragmentation could be rescued by microtubule depolymerization or depletion of the cortical microtubule tethering factor NUMA (Lawo et al., 2009, Curr Biol, 19, 816-826). That being said, we will examine this issue also in our mouse model by staining and counting of centrioles in mitotic apical progenitors of control and Haus6 cKO embryos.

      The microcephaly part of the introduction needs some more work. In particular, the authors need to explain apical progenitors' depletion, possibly the correct mechanisms in causing microcephaly. By saying cortical progenitors, it becomes vague. Indeed, there would also be cortical progenitors depleted. But, the fundamental mechanisms are the depletion of apical progenitors lined up at VZ's lumen. Two works in this connection generated brain tissues from microcephaly patients carrying mutations in CenpJ and CDK5RAP2 (Gabriel and Lancaster et al). Authors should cite their work and relate their findings to mouse brain data.

      We have introduced text changes in the introduction to indicate the specific role of apical progenitor depletion in microcephaly and the differences in the underlying mechanism between mouse and human organoid models (line 63; lines 86-92). In this context we also cite the Gabriel et al. and Lancaster et al. studies.

      -What makes me worry is, looking at figure 1E, there is pretty much no brain, and of course, authors have analyzed what is left over. How could one distinguish reduced PAX6 area and TUJ1 area is due to the gross defects in brain development. Clearly, Haus6 KO causes a severe defect in brain development. Thus, deriving a conclusion from the damaged brain can be misleading. One way to circumvent this problem is to perform 2D experiments with isolated cell types (let us say NPCs and testing if they can spontaneous differentiate).

      We note that overall brain structures are only lost by E17.5, but brain structures (albeit defective) are still present at E13.5. Indeed, all of our quantifications were done at E13.5 or earlier stages. That being said, we understand the concern that quantifications in defective brain structures may be misleading. However, 2D cultures, for which cells are removed from their tissue context, may have similar issues. For this reason, we plan to provide two different type of analyses. We will measure PAX6 and TUJ1 layers in brains from embryos at E.11.5, since the relevant tissues will be less damaged at this earlier stage. In addition, we will use BrdU injection prior to fixation of embryos. Proliferating apical progenitors will incorporate the label during S phase and subsequently we will determine the relative amounts of BrdU-positive cell types (apical progenitors vs neurons) in control and Haus6 cKO brains. Tissue damage will have less impact in this short-term labelling experiment.

      Figure 2: A nice illustration that Hau6 KO animals harbor many mitotic figures. The quantifications lack how many slices and how many cells were analyzed. Simply n=4 does not say much. 4 animals were considered but how many cells/slices would help identify mitotic cells/animals' distribution. A simple bar diagram does not tell a lot.

      We have added this information to the figure legend.

      As a minor point, how did the authors unambiguously scored prometaphase cells and other mitotic figures? Representative figures will help. Besides, what is the meaning of many prometaphase cells? At least a discussion would help.

      This is a good suggestion and we will provide examples of the mitotic figures that we scored. We now explain the meaning of the increase in prometaphase cells in the description of this result (lines 178-180).

      Can the authors probe centrosomes (not by using gamma-tubulin) and relate their presence or absence to p53 upregulation? This is an important point because a complete loss of centrosome is known to trigger p53 upregulation. This may be different in Haus6 KO. This could mean (i.e, centrosomes are normal in numbers or increase in numbers), p53 upregulation is regardless of centrosomes loss.

      Indeed, we believe that p53 upregulation in Haus6-deficient brains is not caused by loss of centrosomes. Instead, our data suggest, as explained in the discussion, that mitotic delay caused by augmin deficiency is sufficient for p53 upregulation. We will further support this conclusion by counting centrioles in mitotic cells. At this point of the cell cycle centriole duplication is complete and we expect to observe largely normal centriole numbers. In some cells we may observe increased numbers due to cytokinesis failure (see response to reviewer #1).

      I have a hard time to ascertain how the authors scored interphase cells that enriched with p53. Some representative images with identity markers will help.

      Scoring p53-positive interphase cells is relatively straightforward since the p53 signal is nuclear and not observed in mitotic apical progenitors. We have included a magnified region of the tissue shown in Fig. 2j, displaying PAX6/p53-positive nuclei of individual cells.

      Looking at the p53 status in Haus6 KO animals, it is intriguing that p53 upregulation is not unique to centrosome loss. At this point, it becomes essential to thoroughly analyze the centrosome status to cross-check if Haus6 loss abrogates centrosomes; if so, how much.

      Since centrosome number is linked to centriole number, we will address this point by quantifying centriole numbers in mitotic apical progenitors (see above).

      Double KO could subside the cell death, but not tissue growth is impressive. So what is going on there? Is there a premature differentiation that leads to NPCs depletion? I believe the authors should generate 2D experiments with cells derived from these double KO animals compared to Haus6 KO and test if there is a premature differentiation that can lead to malformation of the forebrain. Here staining for the forebrain progenitor markers will additionally help (Perhaps FOXG1).

      As explained in response to reviewer #1, we prefer to analyse this issue in vivo rather than in cells that are removed from their native tissue context, which may affect cell fate decisions. To address whether cells prematurely differentiate, we will use injection of BrdU (incorporated by proliferating apical progenitors) prior to fixation, followed by staining for cell type-specific markers. If there is premature differentiation, this should be visible as an increase in BrdU-positive post-mitotic cells.

      Looking at Figure 6, it becomes clear that the double KOs have severe issues in maintaining the apical progenitors suggesting that they undergo premature differentiation before attaining a sufficient pool of NPCs. Testing this will bridge the paper between descriptive findings to mechanisms.

      This point relates to the reviewer’s previous point: do Haus6 cKO p53 KO apical progenitors prematurely differentiate? We believe that cell loss, tissue disruption, and aborted development may also be explained without premature differentiation. In the absence of p53, repeated abnormal mitoses and the resulting increasingly severe chromosomal aberrations including DNA damage (Fig. 5) may produce cells that eventually won’t be able to proliferate and function properly. However, we will test premature differentiation by BrdU injection and staining with appropriate markers as explained above.

      The discussion section is excellent, but it should add some human relevance. In particular, are there p53 dependent cell deaths that have been described in human tissues. In my opinion, it seems specific in the mouse brain. The discussion can also have statements about why the human brain is so sensitive even for mild mutations. I am not sure if those human mutations can cause similar defects in the mouse brain. Most of the mice based studies have been focusing on eliminating complete genes of interest.

      We have included a section in the discussion to relate our findings to human brain development and the differences with results obtained in mouse models regarding the role of apoptosis (lines 386-391).

      Reviewer #3 (Significance (Required)):

      Overall, this is a very well done work but requires some more experiments for mechanisms understanding. Addressing those will make the paper fit to get published.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Jens Lüders &Co demonstrates the essential role of Augmin-mediated MT is critical for proper brain development in mice. The most striking point is that even p53 is eliminated, the microcephaly phenotypes of Haus6 KOs were not rescued. This could mean that the Augmin-mediated MT process is critical to cellular functions that are independent of p53. The authors claim that there are increased DNA damage and excessive mitotic errors. In these aspects, the current work is fascinating. Nevertheless, what causes massive damage to the neural epithelial tissues in the double mutant is not well explained or examined. Few questions appear in mind before I go into the detail. Are these animals still harbor functional centrosomes and their numerical status? The microcephaly part of the introduction needs some more work. In particular, the authors need to explain apical progenitors' depletion, possibly the correct mechanisms in causing microcephaly. By saying cortical progenitors, it becomes vague. Indeed, there would also be cortical progenitors depleted. But, the fundamental mechanisms are the depletion of apical progenitors lined up at VZ's lumen. Two works in this connection generated brain tissues from microcephaly patients carrying mutations in CenpJ and CDK5RAP2 (Gabriel and Lancaster et al). Authors should cite their work and relate their findings to mouse brain data.

      -What makes me worry is, looking at figure 1E, there is pretty much no brain, and of course, authors have analyzed what is left over. How could one distinguish reduced PAX6 area and TUJ1 area is due to the gross defects in brain development. Clearly, Haus6 KO causes a severe defect in brain development. Thus, deriving a conclusion from the damaged brain can be misleading. One way to circumvent this problem is to perform 2D experiments with isolated cell types (let us say NPCs and testing if they can spontaneous differentiate)

      Figure 2: A nice illustration that Hau6 KO animals harbor many mitotic figures. The quantifications lack how many slices and how many cells were analyzed. Simply n=4 does not say much. 4 animals were considered but how many cells/slices would help identify mitotic cells/animals' distribution. A simple bar diagram does not tell a lot.

      As a minor point, how did the authors unambiguously scored prometaphase cells and other mitotic figures? Representative figures will help. Besides, what is the meaning of many prometaphase cells? At least a discussion would help.

      Can the authors probe centrosomes (not by using gamma-tubulin) and relate their presence or absence to p53 upregulation? This is an important point because a complete loss of centrosome is known to trigger p53 upregulation. This may be different in Haus6 KO. This could mean (i.e, centrosomes are normal in numbers or increase in numbers), p53 upregulation is regardless of centrosomes loss.

      I have a hard time to ascertain how the authors scored interphase cells that enriched with p53. Some representative images with identity markers will help.

      Looking at the p53 status in Haus6 KO animals, it is intriguing that p53 upregulation is not unique to centrosome loss. At this point, it becomes essential to thoroughly analyze the centrosome status to cross-check if Haus6 loss abrogates centrosomes; if so, how much.

      Double KO could subside the cell death, but not tissue growth is impressive. So what is going on there? Is there a premature differentiation that leads to NPCs depletion? I believe the authors should generate 2D experiments with cells derived from these double KO animals compared to Haus6 KO and test if there is a premature differentiation that can lead to malformation of the forebrain. Here staining for the forebrain progenitor markers will additionally help (Perhaps FOXG1).

      Looking at Figure 6, it becomes clear that the double KOs have severe issues in maintaining the apical progenitors suggesting that they undergo premature differentiation before attaining a sufficient pool of NPCs. Testing this will bridge the paper between descriptive findings to mechanisms.

      The discussion section is excellent, but it should add some human relevance. In particular, are there p53 dependent cell deaths that have been described in human tissues. In my opinion, it seems specific in the mouse brain. The discussion can also have statements about why the human brain is so sensitive even for mild mutations. I am not sure if those human mutations can cause similar defects in the mouse brain. Most of the mice based studies have been focusing on eliminating complete genes of interest.

      Significance

      Overall, this is a very well done work but requires some more experiments for mechanisms understanding. Addressing those will make the paper fit to get published.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Viais, Lüders and colleagues here present an analysis of augmin's roles in neural stem cell development. They describe a dramatic impact of the conditional ablation of Haus6 on embryonic brain development in the mouse, with mitotic problems that lead to greatly-increased levels of apoptosis. The rescue of this apoptosis by mutation of the gene that encodes p53 did not restore brain development, which was still aberrant, due to mitotic errors.

      The paper is clearly written, with well-designed and controlled experiments. Its conclusions are well supported by the data presented. I have few comments on the technical aspects of the work- it appears very solid to me.

      Specific comments

      1. Clearer explanation of the mouse strains used should be provided. The section describing the generation of the Haus6 conditional on p.5 should specify that this is the same as was already published in the 2016 Watanabe paper (this is in the Materials and Methods, but this should be more clearly specified. More specific details of the p53 knockout mice from Jackson should be included in the Materials and Methods.
      2. Figure 1a contains minimal information on the Haus6 locus. More detail should be included for information, if this Figure is to remain (although reference to the targeting details in the original description would be sufficient). It is unclear what the timeline diagram is to convey and it should be improved or deleted. A similar comment applies for the details in Figure 3a, although the colour scheme for the different genotypes is useful.
      3. The important PCR controls in Figure S1b have an unexplained 1000 bp band that appears only in the floxed heterozygote. It would be helpful if the authors explained this in the relevant Figure legend.
      4. Assuming the putative centrosome 'clusters' in Figure 6c are similar to the fragmented structures seen in thalamus in Figure 2d, a different description should be used to avoid confusion with multiple centrosomes, which is not a phenotype here. It is not clear how the loss of centrosomes from the ventricular surface was scored, whether it was based on total gamma-tubulin signal or individual centrosomes; how fragmented poles would affect that is unclear, so the legend and relevant details should clarify this point.
      5. Phospho-histone H2AX should be referred to as a marker of activation of the DNA damage response, rather than DNA repair.

      Minor points

      i. Figure 1b should include a scale bar.<br> ii. The labelling of Figure 1f should be revised. iii. Figure 2k is not labelled in this Figure. iv. Scale bars should be included in the blow-ups in Figure 6c.

      Significance

      While it is striking that they see complete disruption of brain development, rather than microcephaly, arguably the mechanistic novelty of the findings is moderate, in that the impacts of Haus6 deficiency on mitotic spindle assembly are well established. The authors only allude to potential additional and novel activities of augmin (in neural progenitors, potentially) that might explain this possibly-unexpected outcome of this study.

      The topic is likely to be of interest to people in the field of mitosis, genome stability and brain development.

      My expertise is cell biology/ mitosis, less so on murine brain development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Viais R et al describes a novel role for augmin complex in apoptosis prevention during brain development. Augmin complex recruits g TuRC to microtubule lattices to nucleate microtubule branches. The authors show how -in its absence- neural progenitors have elevated p53 activity and apoptotic rate, with severe consequences on overall brain development. In particular, augmin-deleted neural progenitors display spindle abnormalities and mitotic delay, which induce DNA damage accountable for p53-induced apoptosis.

      One point that I personally found very interesting is the role of augmin-dependent MT nucleation depletion in interphase. The authors mention (line 152) that at stage E13.5, besides the number of neurons being reduced, a few neurons were misplaced in the apical region, indicating a role for augmin-driven MT nucleation in cell migration. Moreover, the authors showed that p53 genetic deletion in the Haus6 cKO rescues the apoptosis phenotype but not the tissue disorganisation, suggesting that augmin-dependent microtubule might play a role in tissue polarity. While this is well presented in the discussion, the title in line 268 narrowly refers to mitotic augmin roles. I would like here to see the authors referring to putative roles for augmin-mediated MT nucleation in interphase, by toning down the title in line 268.

      Overall, the text is well written and flows easily. Figures are clear and legends provide sufficient information on experimental conditions, number of replicates and scale bars. I noticed that, although the number of repeats is specified, the number of cells scored per experiment is not always included. In my comments below I highlight cases where this missing information should be added.

      Specific points:

      1. In the Cep63 KO (Marjanovic et al, 2015) and the CenpJ KO mice (Insolera et al, 2014), as well as other recently published papers (e.g. Phan TP et al, EMBO Journal, 2020) part of the phenotypical characterisation of the KO mice displays pictures of the overall brain dissected from the mice. Could the author show these images?
      2. Fig2d: do the insets correspond to higher magnification images? What is the zoom factor? I could not find it in the legend.
      3. Fig2E,I and K graphs: how many cells were quantified here over how many experiments? I could not find information in the figure legend.
      4. The impact of Haus6 on mitotic spindle needs further clarification:

      o Fig2F: here, the authors show quantification for abnormal and multipolar spindle together. Later on, the abnormal spindle phenotype is no longer discussed (Fig4). I was wondering what is the individual contribution of abnormal and multipolar spindle, separately. Which one of the two is more frequent? Could the authors explain in the text how they define an abnormal spindle? Is it the lack of MT with the condensed chromosome area?

      o Could it be that augmin deletion induce an instability in MTs within the mitotic spindle, leading to the "empty" or with very few MTs spindles? Or could it be that more cold-sensitive MTs are affected by fixation? What is the percentage of the spindle with no MT in control?

      o Did the authors quantify anaphase/telophase phenotypes as they did in Fig4f?

      o How do authors explain PCM fragmentation here? Could this phenotype be due to an initial cytokinesis defect which led the cells to accumulate extra centrosomes? Or could this maybe be a product of aberrant PCM maturation/centrosome duplication? Could the authors add here a line to discuss the possible origin of pole fragmentation?

      1. Fig 4 Did the authors quantify centrosome fragmentation and abnormal spindle here? As they characterised them for the Haus6 cKO mouse, it would be preferable to maintain the same characterisation for the Haus6 cKO p53KO.
      2. Fig4c and d: how many replicates were done to obtain these graphs? I think the authors forgot to add this information in the figure legend.
      3. Fig4f,g, I and J: how many cells were counted per experiment? I appreciate the authors writing the n of experiments performed.
      4. Fig5d: how many cells were counted per experiment?

      Significance

      While it was already known that mitotic delay affects the neuronal progenitor pool through activation of p53-dependent apoptosis (Pilaz L-J, Neuron 2016; Mitchell-Dick A, Dev Neurosci 2020), and that this can be triggered by depletion of centrosomal proteins as Cenpj and Cep63, the role of surface-dependent microtubule nucleation was not identified so far. Some insights come from a Haus6-KO mouse model which dies during blastocyst stage after several aberrant mitosis (Watanabe S, Cell Reports, 2016). In parallel, McKinley KL et al showed that Haus8 depletion in human cells (RPE1cells) triggered p53-dependent G1 arrest following mitotic defects (McKinley KL, Developmental Cell, 2017). Building on the Hause6 KO mouse and human cell line data, here Viais R et al discover a novel role for the augmin-mediated MT nucleation in neural progenitor growth and brain development in vivo, through prevention of p53-induced apoptosis.

      Specifically, Viais R et al show that:

      1. Surface-dependent microtubule nucleation depletion severely impacts brain development, disrupting partly or completely forebrain domains and cerebellum;
      2. Surface-dependent microtubule nucleation depletion induce spindle abnormalities, resulting in mitotic delay in apical progenitors;
      3. Mitotic delay results in DNA breaks, p53 activation and p53-induced apoptosis.

      This is a tidy, well-executed study with good quality data. These findings propose a novel mechanism that results essential for neural progenitor and overall brain development.

      In my opinion, a large audience will benefit from these discoveries: from developmental biologists to cell biologists focused on microtubule dynamics, cell cycle, differentiation, stem cells and cell polarity.

      Key works describing my area of expertise: microtubule dynamics, centrosome function, cell cycle regulation and cell polarity.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers: Woodcock et al. 2021

      Reviewer 1 (Evidence, reproducibility, and clarity):

      Summary The authors resolved the biosynthesis of trehalose and alpha-glucan in Pseudomonas aeruginosa and the role of these two compounds in osmotic and desiccation stress.

      We thank the reviewer for their positive review of our manuscript. Our responses to their specific queries are interspersed below.

      Major comments:

      • Are the key conclusions convincing? Yes

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? No

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Not necessary, comprehensive coverage of research topic.

        • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. * Not applicable
      • Are the data and the methods presented in such a way that they can be reproduced? Yes

        • Are the experiments adequately replicated and statistical analysis adequate? * Yes, everything is adequate but just one subtle concern: check the significance of the number of digits in the entries listed in Table S3. Revise Table S3.

      Table S3 has been revised as requested. The data in this table is now presented correct to one decimal place.

      Minor comments:

      • Specific experimental issues that are easily addressable. Not applicable (Table S3: see above)

        • Are prior studies referenced appropriately? No. Refs. 18- 32: The subjects of 'trehalose' and 'osmotic stress' have already been addressed in the Pseudomonas field and should be referenced. The authors cite work carried out on trehalose and osmotic stress on phylogenetically distant microorganisms, but do not cite related work from the Pseudomonas field which I consider to be inappropriate. Similarly, trehalose biosynthesis in Pseudomonas* has not only been covered by refs. 47 and 48.

      This is a fair comment. The focus of our introduction came from a desire to concentrate specifically on the metabolism and intracellular function of trehalose/α-glucan in Pseudomonas. In hindsight, we acknowledge that our introduction is a little too narrowly focussed. We have expanded the introduction and discussion sections to include additional discussion of trehalose in Pseudomonas and its regulation in the CF lung.

      • Are the text and figures clear and accurate? Extremely well written manuscript and prepared figures

        • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Revise the list of references and discuss more thoroughly your novel findings in the light of existing knowledge in the Pseudomonas* field.

          Please see previous comment relating to the literature.

      Reviewer 1 (Significance):

      Significance

        • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. Conceptual advance: The authors identified and characterized the enzymatic pathway of trehalose and alpha-glucan biosynthesis in Pseudomonas aeruginosa and its role to cope with osmotic and desiccation stress. The authors' conclusions do not correspond with recently published peers' work; hence they should discuss in more detail why they consider their data to be more accurate to discern the role of trehalose to contain desiccation and osmotic stress in P. aeruginosa*.

          Please see previous comment relating to the literature. In general, the published work to date on trehalose in Pseudomonas spp. does not consider GlgE pathway-mediated link to α-glucan that we characterise in this paper. Our work demonstrates that synthesis and metabolism of the two molecules are implicitly linked in species where the GlgE pathway is present, and they cannot be considered in isolation. For this reason we are very confident that our study represents the most accurate model to date for trehalose and α-glucan metabolism and their associated phenotypes in P. aeruginosa. We have therefore emphasised that the role of trehalose in Pseudomonas spp. should be re-evaluated in light of our findings.

        • Place the work in the context of the existing literature (provide references, where appropriate). Existing literature focusing on trehalose, osmotic stress, desiccation stress in the Pseudomonas *field not cited by the authors:
      • These papers are of variable scientific quality, but the conceptual work by Hallsworth and the work by Behrens on the PA metabolome in CF lungs are worth discussing. All other work provides pieces of information on function and biosynthesis of trehalose up to now known by the Pseudomonas community. The authors resolved the function of the GlgA operon which will be definitely appreciated.

        We thank the reviewer for these helpful suggestions. We have reviewed these papers carefully and have incorporated several, including the papers from Hallsworth and Behrens into the revised manuscript.

      Strengths of the manuscript:

      • Meticulously planned and carefully executed experiments, not a single experimental flaw
      • Very high technical quality of experiments and primary data
      • Comprehensive coverage of the research topic
      • Excellent presentation in text and illustrations Only weakness:

      • Insufficient consideration of peers' published work on trehalose and its role in stress response in P. aeruginosa

        Please see previous comment relating to the literature.

        • State what audience might be interested in and influenced by the reported findings. Scientists working in the fields of glycoconjugate and carbohydrate research, biochemists, microbiologists with interest in metabolic pathways, stress response and/or Pseudomonas. __Reviewer #2 (Evidence, reproducibility, and clarity):*__

      It will be difficult for me to write a review of this paper and for the authors to make sense of my review because the manuscript's pages / lines are not numbered…

      We apologise to the reviewer for this oversight.

      Summary The authors carried out a comprehensive characterization of the metabolism of trehalose in Pseudomonas aeruginosa PA01, using techniques of biochemistry, reverse genetics, and bioinformatics. The main findings include that the disaccharide trehalose is synthesized in this organism from branched chain α-glucans and that the catabolism of trehalose proceeds via another disaccharide, maltose and is fed back into the synthesis of α-glucans. Trehalose and α-glucans have been implicated in conferring resistance to abiotic stresses in other organisms. The authors show that mutants that are blocked in the synthesis of trehalose are sensitive to high salinity but are normal with respect to their sensitivity to desiccation, whereas mutants impaired in the accumulation of α-glucans are sensitive to desiccation without being unduly sensitive to osmotic stress. These results indicate that trehalose and α-glucans have different roles in abiotic stress-tolerance.

      Major points

      This manuscript describes an impressive amount of careful work and presents new insights into the metabolism of trehalose, maltose, and α-glucans. However, the authors should address the following major comments before the paper is accepted.

      We thank the reviewer for their thorough and positive assessment of the manuscript. We address their specific points below.

      • Discussion: the authors state that "trehalose protects Pseudomonas ssp. against osmotic stress, most likely due to its role as a compatible solute." According to Table 2, P. aeruginosa grown in the medium of low osmolarity accumulated 0.13% trehalose per gram dry weight, i.e. ~4 μmol / g dry weight. Assuming that the dry weight / wet weight ratio of P. aeruginosa is the same as that of P. putida, which is ~1/3 (PMID: 6508285), the concentration of trehalose in the cells calculates to be ~2 mM. It is not plausible that trehalose could be significant as compatible solute at this low concentration.
      • One way out could be if the accumulation of this disaccharide were increased by osmotic stress. The authors should also measure the trehalose content of cells grown in medium containing 0.85 M NaCl. In case of positive results in this experiment, it would be interesting to determine the effects of osmotic stress on the levels of trehalose biosynthetic and catabolic enzymes, but this would not be necessary for the acceptance of the paper.

        This is a fair point. To address this, we measured the trehalose and maltose-1-phosphate levels for PA01 grown in the presence of 0.85 M NaCl. We saw a highly significant increase in the abundance of trehalose, compared to growth on standard M9 media. This strongly suggests that trehalose accumulates under conditions of osmotic stress as suggested by the reviewer. These new results have been added to the relevant sections of the manuscript (M&M, results, table 2 and discussion). The student (Danny Ward) who conducted these new experiments has been added to the author list.

      • However, there is also an extensive literature suggesting that trehalose has antioxidant functions e.g. PMID: 29241092 (the first paper that came up in Google search for "trehalose as antioxidant"). The authors should discuss this possible alternate role of trehalose.

        The reviewer is correct that trehalose has well-documented antioxidant functions in various species. We have modified the introduction to address this. To maintain the focus of our manuscript on bacteria we have used a different example to that suggested by the reviewer.

      • It is not described adequately in the Materials and Methods how the cellular contents of trehalose and maltose-1-phosphate (M1P) were determined.

        The Materials and Methods section has been revised to include more details of this method.

      • I found the growth curves in Figure 8, especially in panel B, to be uninterpretable. The authors should spread these data into more panels or use some other method to make them clearer.

        We have expanded the legend for Figure 8 to describe more fully what is going on in this figure. The results in Figure 8 are grouped according to the operons in which each set of genes is located. As such, the graphs contain unequal numbers of curves, with 8B containing the most and 8C only showing data for WT and ΔglgP.

      • The statement "The GlgA and GlgE proteins . . . enable two alternate mechanisms for linear α-glucan biosynthesis", which is echoed a number of times in the manuscript, seems to create the impression that there are two de novo pathways of synthesis of these polysaccharides. However, as shown in Figure 1, the GlgA pathway is the only route to the net synthesis of α-glucans, and GlgE is only part of a recycling pathway. Therefore, it cannot be true that "the vast majority of α-glucan accumulated by P. aeruginosa will be produced by GlgE".

        We have revised this section to further clarify what we mean when we state that the majority of α-glucan accumulated by P. aeruginosa will be produced by GlgE. Our data suggest that there is a big difference between the generation of α-glucan (conducted by both GlgA and GlgE) and its accumulation (flux through GlgA generated α-glucan is high, so only GlgE generated α-glucan can accumulate to generate large polymers).

      • The authors state that "MalQ disproportionates (sic) α-glucan with glucose to produce maltose." Figure 1 shows that GlgE uses an "acceptor", which I assume could be glucose. How is free glucose synthesized? Could cells grown on a non-carbohydrate as sole carbon source make free glucose?

        P. aeruginosa is able to carry out gluconeogenesis, so it can produce glucose from non-carbohydrate carbon sources if necessary.

      Our data show that GlgE acceptor preference gets lower as the acceptor molecule gets shorter. It is possible to detect GlgE activity without an acceptor. In this case we see a lag, implying M1P hydrolyses slowly at first and priming with glucose is also slow. Eventually however, the products get long enough for the reaction to take off. MalQ will work with DP2 or longer as the donor and DP1 or longer as the acceptor, moving one glucose unit at a time.

      • Pedantic point, but "disproportionation" means an oxidation-reduction reaction in which two identical molecules are used to produce two different molecules (https://en.wikipedia.org/wiki/Disproportionation). The reaction catalysed by MalQ does not involve electron transfer. Don't the authors mean that this enzyme is a glycosyl transferase?

        We have checked this, and our use of disproportionation in the manuscript is correct. The definition of disproportionation is any desymmetrizing reaction of the following type: 2 A → A' + A", and is not limited to redox reactions. MalQ carries out a reaction of this type when presented with a maltooligosaccharide.

      • The authors state that TreS had "a very high Km for trehalose (>100 mM)". In view of the low concentration of trehalose (Point 1, above), the physiological relevance of this suggested activity is questionable.

        See response to question 1 above. As trehalose levels are elevated under osmostress conditions this concern becomes less critical. It is of course true that conditions in vitro may not fully reflect cellular conditions and that this activity may be higher in vivo, but this is a general limitation of all protein biochemistry studies. The important point here is that trehalose synthase activity is detected for PA01 TreS.

      • Explain better what "predicted mean log10(CFU) means.

        The predicted mean refers to the value of log10(CFU) predicted by the statistical model we use. We have clarified this in the relevant sections of the manuscript.

      • Can the authors suggest how "α-glucan protects PA01 against desiccation"?

        Without further investigation we can only speculate as to how α-glucan confers desiccation tolerance in PA01. One possibility is that α-glucan functions as a hydrogel, like the exopolysaccharide alginate, trapping water molecules and slowing their evaporation. Alternatively, it may confer a structural role akin to that of trehalose, preventing the loss of cell integrity as water levels decrease. We now address these possibilities in the discussion.

      • Can P. aeruginosa metabolize exogenous trehalose or maltose? If the authors know either way, they should mention it. If they don't know, I am not suggesting that they should test this for this paper, but it would be interesting to know whether these compounds would induce the expression of the trehalose or maltose catabolic enzymes or repress the relevant biosynthetic enzymes. >P. aeruginosa is able to metabolise exogenous maltose and trehalose. While the experiments that the reviewer suggests are certainly interesting, in our view tre/glg gene regulation is beyond the scope of the current manuscript. This field is certainly worth investigating in the future, however.

      Minor points

      • First page under "Results": "phosphomutase" should be "phosphoglucomutase"?

        Changed as requested.

      • Discussion: insert "P. syringae" before "Pto".

        Changed as requested.

      • Materials and Methods: describe how ADP was quantified in the maltokinase assay.

        The materials and methods section has been updated as requested.

      Reviewer 2 (Significance):

      Significance

      Until this work, the biosynthesis of trehalose has been most extensively characterized in Escherichia coli, in which it has been shown that this disaccharide is made by the reaction of glucose-6-phosphate and UDP-glucose to give trehalose-6-phosphate and dephosphorylation to trehalose, catalysed by OtsA and OtsB. The authors discovered a very different pathway in P. aeruginosa in which the synthesis of trehalose goes through α-glucans as intermediates.

      Because trehalose and α-glucans are needed for osmotic stress- and desiccation-tolerance, respectively, this work is of significance to researchers studying abiotic stress resistance.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of manuscript "Trehalose and α-glucan mediate distinct abiotic responses in Pseudomonas aeruginosa" by S. D. Woodcock et al.

      It will be difficult for me to write a review of this paper and for the authors to make sense of my review because the manuscript's pages / lines are not numbered. I will do my best write a review, but for the future, I urge this Journal to print the text on pages in which the lines are numbered or require this of the authors.

      Summary.

      The authors carried out a comprehensive characterization of the metabolism of trehalose in Pseudomonas aeruginosa PA01, using techniques of biochemistry, reverse genetics, and bioinformatics. The main findings include that the disaccharide trehalose is synthesized in this organism from branched chain α-glucans and that the catabolism of trehalose proceeds via another disaccharide, maltose and is fed back into the synthesis of α-glucans. Trehalose and α-glucans have been implicated in conferring resistance to abiotic stresses in other organisms. The authors show that mutants that are blocked in the synthesis of trehalose are sensitive to high salinity but are normal with respect to their sensitivity to desiccation, whereas mutants impaired in the accumulation of α-glucans are sensitive to desiccation without being unduly sensitive to osmotic stress. These results indicate that trehalose and α-glucans have different roles in abiotic stress-tolerance.

      Major points.

      This manuscript describes an impressive amount of careful work and presents new insights into the metabolism of trehalose, maltose, and α-glucans. However, the authors should address the following major comments before the paper is accepted.

      1. Discussion: the authors state that "trehalose protects Pseudomonas ssp. against osmotic stress, most likely due to its role as a compatible solute." According to Table 2, P. aeruginosa grown in the medium of low osmolarity accumulated 0.13% trehalose per gram dry weight, i.e. ~4 μmol / g dry weight. Assuming that the dry weight / wet weight ratio of P. aeruginosa is the same as that of P. putida, which is ~1/3 (PMID: 6508285), the concentration of trehalose in the cells calculates to be ~2 mM. It is not plausible that trehalose could be significant as compatible solute at this low concentration.<br> One way out could be if the accumulation of this disaccharide were increased by osmotic stress. The authors should also measure the trehalose content of cells grown in medium containing 0.85 M NaCl. In case of positive results in this experiment, it would be interesting to determine the effects of osmotic stress on the levels of trehalose biosynthetic and catabolic enzymes, but this would not be necessary for the acceptance of the paper.<br> However, there is also an extensive literature suggesting that trehalose has antioxidant functions e.g. PMID: 29241092 (the first paper that came up in Google search for "trehalose as antioxidant"). The authors should discuss this possible alternate role of trehalose.<br> It is not described adequately in the Materials and Methods how the cellular contents of trehalose and maltose-1-phosphate (M1P) were determined.
      2. I found the growth curves in Figure 8, especially in panel B, to be uniterpretable. The authors should spread these data into more panels or use some other method to make them clearer.
      3. The statement "The GlgA and GlgE proteins . . . enable two alternate mechanisms for linear α-glucan biosynthesis", which is echoed a number of times in the manuscript, seems to create the impression that there are two de novo pathways of synthesis of these polysaccharides. However, as shown in Figure 1, the GlgA pathway is the only route to the net synthesis of α-glucans, and GlgE is only part of a recycling pathway. Therefore, it cannot be true that "the vast majority of α-glucan accumulated by P. aeruginosa will be produced by GlgE".
      4. The authors state that "MalQ disproportionates (sic) α-glucan with glucose to produce maltose." Figure 1 shows that GlgE uses an "acceptor", which I assume could be glucose.<br> How is free glucose synthesized? Could cells grown on a non-carbohydrate as sole carbon source make free glucose? Pedantic point, but "disproportionation" means an oxidation-reduction reaction in which two identical molecules are used to produce two different molecules (https://en.wikipedia.org/wiki/Disproportionation). The reaction catalyzed by MalQ does not involve electron transfer. Don't the authors mean that this enzyme is a glycosyl transferase?
      5. The authors state that TreS had "a very high Km for trehalose (>100 mM)". In view of the low concentration of trehalose (Point 1, above), the physiological relevance of this suggested activity is questionable.
      6. Explain better what "predicted mean log10(CFU) means.
      7. Can the authors suggest how "α-glucan protects PA01 against desiccation"?
      8. Can P. aeruginosa metabolize exogenous trehalose or maltose? If the authors know either way, they should mention it. If they don't know, I am not suggesting that they should test this for this paper, but it would be interesting to know whether these compounds would induce the expression of the trehalose or maltose catabolic enzymes or repress the relevant biosynthetic enzymes.

      Minor points.

      1. First page under "Results": "phosphomutase" should be "phosphoglucomutase"?
      2. Discussion: insert "P. syringae" before "Pto".
      3. Materials and Methods: describe how ADP was quantified in the maltokinase assay.

      Significance

      Significance.

      Until this work, the biosynthesis of trehalose has been most extensively characterized in Escherichia coli, in which it has been shown that this disaccharide is made by the reaction of glucose-6-phosphate and UDP-glucose to give trehalose-6-phosphate and dephosphorylation to trehalose, catalyzed by OtsA and OtsB. The authors discovered a very different pathway in P. aeruginosa in which the synthesis of trehalose goes through α-glucans as intermediates.<br> Because trehalose and α-glucans are needed for osmotic stress- and desiccation-tolerance, respectively, this work is of significance to researchers studying abiotic stress resistance.

      The Reviewers' guidelines stipulate that Reviewers should define their fields of expertise.

      My credentials are: a) I have been solicited to review this paper, and b) I have publications in osmotic stress adaptation and trehalose biosynthesis in Enterobacteriaceae.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2.

      The authors resolved the biosynthesis of trehalose and alpha-glucan in Pseudomonas aeruginosa and the role of these two compounds in osmotic and desiccation stress.

      Major comments:

      • Are the key conclusions convincing?

      yes

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      no

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Not necessary, comprehensive coverage of research Topic

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Not applicable

      • Are the data and the methods presented in such a way that they can be reproduced?

      yes

      • Are the experiments adequately replicated and statistical analysis adequate?

      Yes, everything is adequate but just one subtle concern: check the significance of the number of digits in the entries listed in Table S3. Revise Table S3.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      not applicable (Table S3: see above)

      • Are prior studies referenced appropriately?

      No. Refs. 18- 32: The subjects of 'trehalose' and 'osmotic stress' have already been addressed in the Pseudomonas field and should be referenced. The authors cite work carried out on trehalose and osmotic stress on phylogenetically distant microorganisms, but do not cite related work from the Pseudomonas field which I consider to be inappropriate. Similarly, trehalose biosynthesis in Pseudomonas has not only been covered by refs. 47 and 48.

      • Are the text and figures clear and accurate?

      Extremely well written manuscript and prepared figures

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Revise the list of references and discuss more thoroughly your novel findings in the light of existing knowledge in the Pseudomonas field

      Significance

      2. Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Conceptual advance: The authors identified and characterized the enzymatic pathway of trehalose and alpha-glucan biosynthesis in Pseudomonas aeruginosa and its role to cope with osmotic and desiccation stress. The authors' conclusions do not correspond with recently published peers' work, hence they should discuss in more detail why they consider their data to be more accurate to discern the role of trehalose to contain desiccation and osmotic strass in P. aeruginosa.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      Existing literature focusing on trehalose, osmotic stress, desiccation stress in the Pseudomonas field not cited by the authors

      Pazos-Rojas LA, Muñoz-Arenas LC, Rodríguez-Andrade O, López-Cruz LE, López- Ortega O, Lopes-Olivares F, Luna-Suarez S, Baez A, Morales-García YE, Quintero- Hernández V, Villalobos-López MA, De la Torre J, Muñoz-Rojas J. Desiccation- induced viable but nonculturable state in Pseudomonas putida KT2440, a survival strategy. PLoS One. 2019 Jul 19;14(7):e0219554. doi:10.1371/journal.pone.0219554.

      Wang T, Jia S, Dai K, Liu H, Wang R. Cloning and expression of a trehalose synthase from Pseudomonas putida KT2440 for the scale-up production of trehalose from maltose. Can J Microbiol. 2014 Sep;60(9):599-604. doi: 10.1139/cjm-2014-0330.

      Harty CE, Martins D, Doing G, Mould DL, Clay ME, Occhipinti P, Nguyen D, Hogan DA. Ethanol Stimulates Trehalose Production through a SpoT-DksA-AlgU-Dependent Pathway in Pseudomonas aeruginosa. J Bacteriol. 2019 May 22;201(12):e00794-18. doi: 10.1128/JB.00794-18.

      Cross M, Biberacher S, Park SY, Rajan S, Korhonen P, Gasser RB, Kim JS, Coster MJ, Hofmann A. Trehalose 6-phosphate phosphatases of Pseudomonas aeruginosa. FASEB J. 2018 Oct;32(10):5470-5482. doi: 10.1096/fj.201800500R.

      Wang T, Jia S, Dai K, Liu H, Wang R. Cloning and expression of a trehalose synthase from Pseudomonas putida KT2440 for the scale-up production of trehalose from maltose. Can J Microbiol. 2014 Sep;60(9):599-604. doi: 10.1139/cjm-2014-0330.

      Behrends V, Ryall B, Zlosnik JE, Speert DP, Bundy JG, Williams HD. Metabolic adaptations of Pseudomonas aeruginosa during cystic fibrosis chronic lung infections. Environ Microbiol. 2013 Feb;15(2):398-408. doi: 10.1111/j.1462-2920.2012.02840.x

      Behrends V, Ryall B, Wang X, Bundy JG, Williams HD. Metabolic profiling of Pseudomonas aeruginosa demonstrates that the anti-sigma factor MucA modulates osmotic stress tolerance. Mol Biosyst. 2010 Mar;6(3):562-9. doi: 10.1039/b918710c.

      Matthijs S, Koedam N, Cornelis P, De Greve H. The trehalose operon of Pseudomonas fluorescens ATCC 17400. Res Microbiol. 2000 Dec;151(10):845-51. doi: 10.1016/s0923-2508(00)01151-7.

      van der Werf MJ, Overkamp KM, Muilwijk B, Koek MM, van der Werff-van der Vat BJ, Jellema RH, Coulier L, Hankemeier T. Comprehensive analysis of the metabolome of Pseudomonas putida S12 grown on different carbon sources. Mol Biosyst. 2008 Apr;4(4):315-27. doi: 10.1039/b717340g.

      Hallsworth JE, Heim S, Timmis KN. Chaotropic solutes cause water stress in Pseudomonas putida. Environ Microbiol. 2003 Dec;5(12):1270-80. doi: 10.1111/j.1462-2920.2003.00478.x.

      Ball P, Hallsworth JE. Water structure and chaotropicity: their uses, abuses and biological implications. Phys Chem Chem Phys. 2015 Apr 7;17(13):8297-305. doi: 10.1039/c4cp04564e

      Cray JA, Russell JT, Timson DJ, Singhal RS, Hallsworth JE. A universal measure of chaotropicity and kosmotropicity. Environ Microbiol. 2013 Jan;15(1):287-96. doi: 10.1111/1462-2920.12018.

      Chin JP, Megaw J, Magill CL, Nowotarski K, Williams JP, Bhaganna P, Linton M, Patterson MF, Underwood GJ, Mswaka AY, Hallsworth JE. Solutes determine the temperature windows for microbial survival and growth. Proc Natl Acad Sci U S A. 2010 Apr 27;107(17):7835-40. doi: 10.1073/pnas.1000557107.

      These papers are of variable scientific quality, but the conceptual work by Hallsworth and the work by Behrens on the PA metabolome in CF lungs are worth discussing. All other work provides pieces of information on function and biosynthesis of trehalose up to now known by the Pseudomonas community. The authors resolved the function of the GlgA operon which will be definitely appreciated.

      Strengths of the manuscript:

      • Meticulously planned and carefully executed experiments, not a single experimental flaw • very high technical quality of experiments and primary data • comprehensive coverage of the research topic • excellent presentation in text and illustrations

      only weakness: • insufficient consideration of peers' published work on trehalose and its role in stress response in P. aeruginosa

      • State what audience might be interested in and influenced by the reported findings.

      Scientists working in the fields of glycoconjugate and carbohydrate research, biochemists, microbiologists with interest in metabolic pathways, stress response and/or Pseudomonas

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Reviewer's expertise: Pseudomonas genomics and physiology, respiratory tract infections, solid background in biochemistry and molecular biology

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reviewer #4

      Evidence, reproducibility and clarity

      We have reviewed "A specific regulator of neuronal V-ATPase in Drosophila melanogaster." by Dulac et al. The authors have identified VhaAC45L as a regulator of neuronal V-ATPase in Drosophila melanogaster. The authors have utilized multiple techniques to determine the localization of VhaAC45L in neurons and specifically in the synapse. The use of multiple approaches including determining RNA levels in different regions of the fly, and using CRISPR-Cas9 technique to insert V5 tag, makes a very convincing argument about the synapse-specific expression of VhaAC45L.

      The combined use of co-immunoprecipitation technique and LC/MS to show that VhaAC45L co-precipitated with V-ATPase complex subunits is convincing that VhaAC45L is a subunit of V-ATPase. To determine the role of VhaAC45L in acidification of synaptic vesicles the authors have utilized pHluorins in combination with multiple RNAi lines. The authors have used a well-designed experiment to prove that VhaAC45L regulates acidification of the synaptic vesicles. Further, larval locomotion and quantal size determination using VhaAC45LRNAi which is known to be altered due to pH gradient of synaptic vesicles shows the functional role of VhaAC45L in synaptic vesicle acidification.

      Minor comments:

      1. For all graphs, please remove gridlines to make data points more visible.
      2. Line 120-123: Authors indicate the VhaAC45LRNAi induced lethal phenotype when expressed in glutamatergic and cholinergic drivers but the figure is missing. Please indicate as "data not shown" if not included in Figure.
      3. A diagram summarizing the role of VhaAC45L in V-ATPase enzymatic complex and specific role is recommended.

      Significance

      V-ATPase play a crucial role at the synapse by being responsible for acidification of the synaptic vesicles and identification of a synaptic vesicle specific regulator of V-APTase is important to understand the complex regulation of synapse function. The authors have used well-designed experiments to convince the localization and function of VhaAC45L in synaptic vesicle acidification.

      Referees cross commenting

      The summary of Reviewer#2 looks good!

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reviewer #3

      Evidence, reproducibility and clarity

      In this study, Dulac and colleagues investigated roles of VhaAC45-like gene, which codes one of the V-ATPase accessory proteins in Drosophila, in synaptic transmission. First, they demonstrated that VhaZC45L transcripts are expressed selectively in neurons and that the gene products are addressed to synaptic areas. Second, they showed that VhaAC45L is co-immunoprecipitated with some subunits of V-ATPases, which is consistent with bio-informatics predictions. They further demonstrated that VhaAC45L-knockdown (KD) resulted in defects in synaptic vesicle acidification as well as a reduction in quantal size of glutamate, indicating that VhaAC45L play a key role in regulating neurotransmitter release by modulating the driving force for transmitter uptake. Last, not least, they demonstrated that VhaAC45L-KD in motoneurons attenuated larvae locomotor performance, indicating its physiological relevance. Overall, this study is rigorously executed and nicely presented, and adds one more component of the V-ATPase that is responsible for neurotransmitter uptake into synaptic vesicles. However, since this study simply confirmed an established notion from other species such as yeast and mammals that AC45 is one of the accessory proteins of the V-ATPase complex, a conceptual novelty beyond the previous knowledge is relatively poor in its present form. Thus, this reviewer would suggest several issues as following to improve the comprehensiveness as well as novelty of the current manuscript.

      1. The reason why the authors focused on VhaAC45-'like' is somewhat obscure, and therefore should be explained. How different VhaAC45 and VhaAC45L are in terms of amino acid sequences, tissue distributions, and KO phenotypes. It seems more comprehensive if the authors provide some experimental evidence on VhaAC45; e.g. whether it is also expressed in neurons or not (Fig. 1), and, if VhaAC45 is neuronal, whether it can rescue the phenotypes of VhaAC45L-KD to certain degree (Figs 4 & 5).
      2. What is the mechanism of Ac45 in regulating V-ATPase activity? In mammals, it has been suggested that Ac45 is essential for proper sorting of the V-ATPase to the destined organelles (e.g. Jansen et al., Mol. Biol. Cell., 2010; Jansen et al., BBA, 2008). In this context, it should be examined whether VhaAC45L-KD would affect the synaptic localization of other V-ATPase subunits.
      3. Given that a rodent brain SV contains a few copies of the V-ATPase on average (Takamori et al., 2006, and some newer papers by others), it is interesting that >80% reduction of Ac45 showed moderate effects on quantal size. If SVs under study also contains 1 or 2 V-ATPase per SV, there must be some SVs lacking VhAC45L upon KD. In this context, it is interesting to see how VhaAC-KD (RNAi1~3) affect the frequencies of minis.
      4. In general, decrease in mini amplitudes is accounted for by changes in postsynaptic sensitivity for neurotransmitters. Although acidification deficits would support that decrease in quantal size is due to the decrease in the driving force for glutamate uptake, it should be examined whether the postsynaptic receptor fields are not affected by VhaAC45L-KD by recording postsynaptic response upon application of non-saturable concentrations of glutamate.
      5. Related to 4, it is also interesting to see if evoked responses are also attenuated as a result of VhaAC45L-KD, which is more physiologically relevant for locomotor activity phenotype than minis.

      Minor points

      1. Quantal size of glutamate is not affected by reduced expression of DVGLUT (Daniels et al., Neuron, 2006), which highly contrasts with VhaAC45L, expression of which defines quantal size. Distinct regulation of quantal size by the transporter and the V-ATPase subunit should be discussed.
      2. For electrophysiological experiments, respective sample traces should be shown in Figure 5.
      3. Only RNAi1 and RNAi2 lines were examined for SV pH estimation and mini analysis. The results from RNAi3 should be presented, or at least mentioned in the text.

      Significance

      As mentioned above, as it stands, the authors merely confirmed the pre-existing bioinformatic knowledge on one of the AC45 homologues in Drosophila. The audience of The EMBO Journal might be interested in how different/similar VhaAC45 and VhaAC45-like are, and their functional relevance. In particular, is VhaAC45 also mandatory for the V-ATPase functioning in neurons? Adding some basic information of VhaAC45, e.g. tissue distribution, KO phenotypes, and ability to rescue the VhaAC45-like-KD phenotypes, will certainly improve the comprehensiveness of this study, and capture audience's attention.

      Referees cross commenting

      I am fine with the summary of Reviewer#2

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reviewer #2

      Evidence, reproducibility and clarity

      In this study and using Drosophila melanogaster as a model system, Dulac et al report the very interesting discovery of a previously characterized neuronal specific regulator of the V-ATPase called VhaAC45L. They combine genetics, IHC, Mass spec and ephys to unravel the expression pattern and function of this protein. They find that it is required to acidify synaptic vesicles in glutamatergic neurons of the Drosophila larval neuromuscular junction, for appropriate synaptic transmission and for larval locomotion. The experiments are very well performed, the data presented very convincing and the paper is well written. Nonetheless, a few additional pieces of evidence and some level of expanded analysis would strengthen the conclusions and increase the depth of the work.

      Major comments:

      1. Figure 1F: the while the localization to the presynaptic terminal is convincing, where exactly the protein is localized to is not studied. The imaging in these experiments could use increased resolution and concomitantly colocalization studies with more specific synaptic vesicle markers.
      2. Figure 3B-G: these experiments should be complemented by a rescue experiment, ideally of the null mutant using a UAS construct and a pan neuronal driver, or - if such animals are viable to the third larval instar stage - a glutamatergic driver. If possible, it would also be good to study the NMJ phenotype of the null mutant rescued to viability using a neuronal driver that does not express in motor neurons (e.g. Chat-G4).
      3. Figure 5: the authors focus on quantal size which measures the postsynaptic response to spontaneous release from the presynaptic terminal. However, it is unclear how this directly relates to the locomotor deficit beyond signaling potential deficits in vesicle loading or fusion. It would be more convincing to also study evoked release, and expand the analysis of presynaptic properties (number of events, amplitude, frequency).
      4. General: showing some level of genetic interaction with V-ATPase subunits in at least some of the assays would be welcome.

      Minor comments:

      Some of the images, especially those in Figure 3, should be larger for ease of visualization.

      Significance

      The discovery of a neuronal specific regulator of the V-ATPase is very interesting. To my knowledge it is the first description of a neuronal specific V-ATPase related protein since the description of Vha100-1 by Hiesinger and colleagues in 2005. The work is therefore of great interest to researchers working on synaptic function in general and on synaptic vesicle biology in particular.

      I note that I do not have in depth expertise in electrophysiology, although I am sufficiently familiar with basic NMJ physiology experiments to render the opinions stated above.

      Referees cross commenting

      There seems to be overall consensus among the reviewers on 3 issues:

      1. A somewhat more precise understanding of the role of vhaAC45L in the synaptic vesicle cycle through better localization studies and some classic assays (like FM dye uptake).
      2. A little more characterization of the transmission defects (e.g. studying evoked responses) would be welcome.
      3. Ascertaining the validity of the alleles with rescue experiments, perhaps in the V5 mutant background to allow localization analysis in a rescued background.

      I think further biochemical analysis is interesting but probably beyond the scope of this initial description and would take too much time.

      The minor issues are easy to address

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reviewer #1

      Evidence, reproducibility and clarity

      Dulac et al. present a first in vivo characterization of the 'accessory' v-ATPase subunit vhaAC45L in Drosophila. The key findings are localization and association of the protein with v-ATPase complexes at synapses and a functional requirement based on lethality and reduced synaptic function. This is certainly a useful contribution to our understanding of neuronal v-ATPase functions in vivo. The main weakness of the study is a lack of depth. The study focuses on localization, co-IP of associated proteins, an analysis of acidification and reduced synaptic function in fly larvae, thus providing a baseline for mechanistic study. However, the mechanism of vhaAC45L is not addressed in this short report. How does is vhaAC45L function different from its homolog vhaAC45? Is it required for v-ATPase assembly? Is it required to localize the full v-ATPase complex (or just V0) to the synapse? Is the defect really due to partial loading of synaptic vesicles or does loss of vhaSC45L also affect endosomal and lysosomal function at synapses? The work as is certainly represents a publishable contribution without answering any of these questions - more as an invite for the community to study the role of vhaAC45L; however, I feel this is a bit of a missed opportunity to put the function of a new potential regulator of specific synaptic v-ATPase functions in the context of the most basic functions obvious in this field.

      My main concerns are:

      1. clearly, vhaAC45L is required for SOME function of v-ATPase in neurons - but it remains entirely unclear which one. It is not even clear what compartments are affected. Reduced quantal size of single vesicle exocytosis events can be a direct or indirect consequence of problems in SV biogenesis and recycling. Is exo- /endocytosis unaffected? (FM1-43 uptake!). What compartments are affects? (markers for synaptic vesicles versus lysosomal compartments!).
      2. molecular function: is vhaAC45L required for v-ATPase assembly? (IP/Pull-downs of v-ATPase complexes in the presence or absence of vhaAC45L with other subunits!).
      3. vha100 was proposed in Drosophila to function on synaptic vesicles and the lysosomal pathway, but, if I remember correctly, here quantal size was normal. I am missing a comparison between the two.
      4. The V5 knock-in is used both as a mutant as well as a tool to analyze protein localization. This is likely okay, but a little concern of course has to be that by creating a mutant protein through stop codon deletion its subcellular localization, turnover, etc. are not normal. Similarly, anti-V5 co-IPs will isolate proteins bound to the mutant variant of vhaAC45L. Minimally, IPs or pull-downs using other members of the V0 complex should be done to understand the role of vhaAC45L in direct comparison with vhaAC45 on complex assembly and possibly targeting to the synapse (or ideally targeting to specific compartments).

      Significance

      There is significance to the reporting of an accessory v-ATPase subunit required for SOME function of the v-ATPase in neurons. There is some lack of significance in the absence of basic mechanistic insight as to what vhaAC45L does to the v-ATPase in neurons.

      Referees cross commenting

      I'm fine with this summary!

    5. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the editors and the four reviewers for their careful consideration of our manuscript. We are very grateful for their positive appreciation of our work and we believe that their suggestions, which have been included in the preliminary revised version of the manuscript whenever possible, have greatly improved the quality of the paper and have helped us deepen our understanding of the results.

      We were happy to note that all the reviewers found value in our work, as stated in their general comments: “This is certainly a useful contribution to our understanding of neuronal V-ATPase functions in vivo” (…)” (Reviewer 1) – “Dulac et al report the very interesting discovery of a previously uncharacterized neuronal specific regulator of the V-ATPase. (…) The experiments are very well performed, the data presented very convincing and the paper is well written.” (Reviewer 2) – “The discovery of a neuronal specific regulator of the V-ATPase is very interesting (…) The work is therefore of great interest to researchers working on synaptic function in general and on synaptic vesicle biology in particular.” (Reviewer 3) – “The authors have used well-designed experiments to convince the localization and function of VhaAC45L in synaptic vesicle acidification.” (Reviewer 4).

      In their remarks, the reviewers suggested additional experiments that could be done to improve our understanding of the role of this new V-ATPase regulator, as well as several minor issues. We have addressed all their comments in our answers below, in which the full text of the reviews is included in blue type, and the responses in black. The line numbers refer to the revised version of the manuscript.

      Reviewer #1

      Dulac et al. present a first in vivo characterization of the 'accessory' v-ATPase subunit vhaAC45L in Drosophila. The key findings are localization and association of the protein with v-ATPase complexes at synapses and a functional requirement based on lethality and reduced synaptic function. This is certainly a useful contribution to our understanding of neuronal v-ATPase functions in vivo. The main weakness of the study is a lack of depth. The study focuses on localization, co-IP of associated proteins, an analysis of acidification and reduced synaptic function in fly larvae, thus providing a baseline for mechanistic study. However, the mechanism of vhaAC45L is not addressed in this short report. How does is vhaAC45L function different from its homolog vhaAC45? Is it required for v-ATPase assembly? Is it required to localize the full v-ATPase complex (or just V0) to the synapse? Is the defect really due to partial loading of synaptic vesicles or does loss of vhaAC45L also affect endosomal and lysosomal function at synapses? The work as is certainly represents a publishable contribution without answering any of these questions - more as an invite for the community to study the role of vhaAC45L; however, I feel this is a bit of a missed opportunity to put the function of a new potential regulator of specific synaptic v-ATPase functions in the context of the most basic functions obvious in this field.

      My main concerns are:

      1. clearly, vhaAC45L is required for SOME function of v-ATPase in neurons - but it remains entirely unclear which one. It is not even clear what compartments are affected. Reduced quantal size of single vesicle exocytosis events can be a direct or indirect consequence of problems in SV biogenesis and recycling.

      Is exo- /endocytosis unaffected? (FM1-43 uptake!).

      We agree that alterations in the synaptic vesicle release/recycling cycle could indeed contribute to the locomotion defect, in addition to the acidification impairment observed in VhaAC45L knockdown larvae. As suggested by the reviewer, we plan to carry out FM-dye assays to measure endocytosis and exocytosis at the neuromuscular junction of control versus VhaAC45L-KD animals. If successful, a new figure will be added to the final version of the paper.

      What compartments are affected? (markers for synaptic vesicles versus lysosomal compartments!).

      Finding out whether VhaAC45L is specifically involved in the acidification of synaptic vesicles, or if it also plays a similar role in other synaptic organelles, in particular lysosomes, would be very interesting indeed. However, we found that it was technically difficult to address this issue in the Drosophila nervous system. A good way would be to check whether the lysosomal pH is affected by VhaAC45L knockdown, as it is the case for synaptic vesicles.

      Unfortunately, because lysosomes are not abundant in neurons, lysosome-specific pH-sensitive probes such as Lysotracker do not yield detectable signals at Drosophila larval synapses. So, whether VhaAC45L is specific for synaptic vesicles or involved in the regulation of V-ATPase activity in all neuronal compartments reminas an open question for now.

      1. molecular function: is vhaAC45L required for v-ATPase assembly? (IP/Pull-downs of v- ATPase complexes in the presence or absence of vhaAC45L with other subunits!).

      In accordance with the reviewer, we are also very much eager to learn more about the precise molecular function of VhaAC45L, and in particular whether it is required or not for assembly of the V-ATPase complex. Pull-downs of V-ATPase proteins in controls versus VhaAC45L-KD could be used to address this question, but this would require a large quantity of antibodies directed against subunits of the V0 and V1 domains, respectively. Unfortunately, there are no such antibodies commercially available against Drosophila V-ATPase proteins. We have tried several antibodies that recognize V-ATPase subunits from other species and were predicted to react against Drosophila homologs, but with no success. The only V-ATPase antibodies currently at our disposal were samples generously sent to us by other laboratories in insufficient quantities for carrying out such experiments. To our regret, therefore, we were not able to answer this question until now because of the lack of appropriate tools.

      1. vha100 was proposed in Drosophila to function on synaptic vesicles and the lysosomal pathway, but, if I remember correctly, here quantal size was normal. I am missing a comparison between the two.

      We thank the reviewer for this comment. A comparison with previously published results on subunit Vha100-1 has now been added (lines 458-469) in the discussion related to this topic in the revised manuscript.

      1. The V5 knock-in is used both as a mutant as well as a tool to analyze protein localization. This is likely okay, but a little concern of course has to be that by creating a mutant protein through stop codon deletion its subcellular localization, turnover, etc. are not normal. Similarly, anti-V5 co-IPs will isolate proteins bound to the mutant variant of vhaAC45L. Minimally, IPs or pull- downs using other members of the V0 complex should be done to understand the role of vhaAC45L in direct comparison with vhaAC45 on complex assembly and possibly targeting to the synapse (or ideally targeting to specific compartments).

      It is indeed a legitimate concern to question the physiological relevance of results obtained by studying V5-tagged VhaAC45L. However, the V5 tag is very small (14 amino acids) and we fused it in place of the stop codon to keep intact the whole sequence of the protein. In addition, we found that the V5 knock-in flies are viable and fertile as homozygous. Given that the null mutants, as well as strong RNAi knockdowns, are lethal at early developmental stage, this suggests that the V5 knock-in has limited negative effects, if any, on VhaAC45L function. This led us to believe that at least a good portion of the V5-tagged protein might be targeted to the right subcellular compartment, and associate to its physiological partners.

      Significance:

      There is significance to the reporting of an accessory v-ATPase subunit required for SOME function of the v-ATPase in neurons. There is some lack of significance in the absence of basic mechanistic insight as to what vhaAC45L does to the v-ATPase in neurons.

      We agree that we did not elucidate here the precise molecular mechanisms by which VhaAC45L contributes to synaptic vesicle acidification. It is rather an initial description of a novel neuronal protein that appears to be essential for proper synaptic functioning, and we provide consistent evidence that its function requires specific interaction with the V-ATPase complex, and in particular with three subunits that reproducibly co-immunoprecipitated with VhaAC45L (namely Vha1C39-1, Vha100-1 and ATP6AP2). Please note that it took many years and many papers before the molecular mechanisms of action of comparable accessory subunits, such as ATP6AP1/AC45 or ATP6AP2, was better understood, and it is still nowadays a matter of investigation. It is therefore very demanding to expect that we describe the exact function of the previously uncharacterized VhaAC45L at all levels in a single first paper.

      Reviewer #2

      In this study and using Drosophila melanogaster as a model system, Dulac et al report the very interesting discovery of a previously uncharacterized neuronal specific regulator of the V-ATPase called VhaAC45L. They combine genetics, IHC, Mass spec and ephys to unravel the expression pattern and function of this protein. They find that it is required to acidify synaptic vesicles in glutamatergic neurons of the Drosophila larval neuromuscular junction, for appropriate synaptic transmission and for larval locomotion. The experiments are very well performed, the data presented very convincing and the paper is well written. Nonetheless, a few additional pieces of evidence and some level of expanded analysis would strengthen the conclusions and increase the depth of the work.

      Major comments:

      1. Figure 1F: the while the localization to the presynaptic terminal is convincing, where exactly the protein is localized to is not studied. The imaging in these experiments could use increased resolution and concomitantly colocalization studies with more specific synaptic vesicle markers.

      We agree that it would be very good to show this additional result. However, confocal microscopy does not provide sufficient resolution to localize the protein at the membrane of individual synaptic vesicles. Another way would be to see if VhaAC45L immunostaining co- localizes with domains enriched in synaptic vesicle markers, but these organelles are rather ubiquitously distributed in the synaptic boutons at the Drosophila neuromuscular junction. To correctly perform this experiment, we would have to do immuno-electron microscopy, a technique we do not master in our laboratory and that we did not plan to implement for the present work.

      1. Figure 3B-G: these experiments should be complemented by a rescue experiment, ideally of the null mutant using a UAS construct and a pan neuronal driver, or - if such animals are viable to the third larval instar stage - a glutamatergic driver. If possible, it would also be good to study the NMJ phenotype of the null mutant rescued to viability using a neuronal driver that does not express in motor neurons (e.g. Chat-G4).

      Although a rescue experiment could potentially add a further evidence that Vha45ACL deficiency is responsible for the synaptic vesicle acidification defect described in Figure 3, we don’t think that it is a requisite here because we obtained similar results by knocking down the gene using two different RNAis. As described in the manuscript, the pan-neuronal expression of Vha45ACL could rescue the embryonic lethality of the null mutant, so it would be theoretically possible to check the acidity level of synaptic vesicles at the neuromuscular junction of the recued larvae. However, this would involve making rather complex genetic constructions to express VMAT-pHluorin in motor neurons in rescued mutant background. In addition, the conclusions we could draw from such experiment would be limited by the lack of comparison. Indeed, in Figure 3 the defect was observed in knockdown context, and the same experiment could not be performed in knockout larvae due to the early lethality. If we could measure the acidity level of rescued null mutants, we would not have any comparison point besides the knockdown experiments. As knockout and knockdown are not likely to produce identical phenotype (especially in terms of magnitude of effect), the ideal would be to compare the rescued phenotype to the null mutant expressing VhaAC45L in all neurons except motoneurons, as suggested by the reviewer. However, such genotype would certainly not be viable, since we observed that expression of VhaAC45L RNAis with a stronger motoneurons driver (D42-Gal4) was sufficient to induce lethality at early developmental stage.

      1. Figure 5: the authors focus on quantal size which measures the postsynaptic response to spontaneous release from the presynaptic terminal. However, it is unclear how this directly relates to the locomotor deficit beyond signaling potential deficits in vesicle loading or fusion. It would be more convincing to also study evoked release, and expand the analysis of presynaptic properties (number of events, amplitude, frequency).

      We fully agree with this comment shared by Reviewers 2 and 3 related to the electrophysiology experiments. Note that these experiments have been carried out in collaboration with another laboratory located in another city. The Covid-19 situation during the past year has prevented, and is still complicating, movements between labs, preventing us from going further with the electrophysiology analyses of VhaAC45L KD. If the situation in the near future allows it, we would very much like to add a more extensive electrophysiological analysis, including in particular the study of evoked release. In the revised manuscript, we have nevertheless completed Figure 5 by adding representative distributions of spontaneous mEPSP amplitudes in control and VhaAC45L knockdown larvae, as well as the results of new analyses showing lack of effects the KD on the mean EPSP frequency.

      1. General: showing some level of genetic interaction with V-ATPase subunits in at least some of the assays would be welcome.

      We are definitely in accordance with the reviewer on that point, but we think that this would involve a lot of work and be beyond the scope of the present initial description. Here we show by proteomic analyses that at least 12 proteins co-precipitate and so potentially interact with VhaAC45L, three of them being previously identified constitutive or accessory V-ATPase subunits. In our opinion, studying the interactions between VhaAC45L and these proteins through genetic and molecular studies will be the subject of future works. As stated by Reviewer 2 in the Referees cross commenting below: “further biochemical analysis is interesting but probably beyond the scope of this initial description and would take too much time”. We fully agree with this statement.

      Minor comments:

      Some of the images, especially those in Figure 3, should be larger for ease of visualization.

      As requested, the images of Figure 3 have been enlarged.

      Significance

      The discovery of a neuronal specific regulator of the V-ATPase is very interesting. To my knowledge it is the first description of a neuronal specific V-ATPase related protein since the description of Vha100-1 by Hiesinger and colleagues in 2005. The work is therefore of great interest to researchers working on synaptic function in general and on synaptic vesicle biology in particular.

      We are grateful to the reviewer for his very positive assessment of our work.

      I note that I do not have in depth expertise in electrophysiology, although I am sufficiently familiar with basic NMJ physiology experiments to render the opinions stated above.

      Reviewer #3

      In this study, Dulac and colleagues investigated roles of VhaAC45-like gene, which codes one of the V-ATPase accessory proteins in Drosophila, in synaptic transmission. First, they demonstrated that VhaZC45L transcripts are expressed selectively in neurons and that the gene products are addressed to synaptic areas. Second, they showed that VhaAC45L is co- immunoprecipitated with some subunits of V-ATPases, which is consistent with bio-informatics predictions. They further demonstrated that VhaAC45L-knockdown (KD) resulted in defects in synaptic vesicle acidification as well as a reduction in quantal size of glutamate, indicating that VhaAC45L play a key role in regulating neurotransmitter release by modulating the driving force for transmitter uptake. Last, not least, they demonstrated that VhaAC45L-KD in motoneurons attenuated larvae locomotor performance, indicating its physiological relevance. Overall, this study is rigorously executed and nicely presented, and adds one more component of the V- ATPase that is responsible for neurotransmitter uptake into synaptic vesicles. However, since this study simply confirmed an established notion from other species such as yeast and mammals that AC45 is one of the accessory proteins of the V-ATPase complex, a conceptual novelty beyond the previous knowledge is relatively poor in its present form. Thus, this reviewer would suggest several issues as following to improve the comprehensiveness as well as novelty of the current manuscript.

      1. The reason why the authors focused on VhaAC45-'like' is somewhat obscure, and therefore should be explained. How different VhaAC45 and VhaAC45L are in terms of amino acid sequences, tissue distributions, and KO phenotypes. It seems more comprehensive if the authors provide some experimental evidence on VhaAC45; e.g. whether it is also expressed in neurons or not (Fig. 1), and, if VhaAC45 is neuronal, whether it can rescue the phenotypes of VhaAC45L- KD to certain degree (Figs 4 & 5).

      Following the reviewer’s request, we have added a sequence alignment of VhaAC45 and VhaAC45L, as well as a graph showing tissue distributions of both genes in Supplementary Figure 1 of the revised manuscript. To our knowledge, there is no published functional study of VhaAC45 in Drosophila, so we can only make assumptions derived from studies on predicted homologs in evolutionarily distant species. For that reason, it is difficult to compare VhaAC45 to VhaAC45L, as it would first require an entire new study of VhaAC45 function in flies. Since our interest is to study neuronal physiology, we focused on VhaAC45L because compelling evidence indicates that this subunit is specific to the nervous system, as described in our manuscript, rather than on VhaAC45 which seems to be expressed in all tissues. In addition, homologs of VhaAC45L have never been functionally characterized to date in any species, making it very interesting to study this new protein in a genetically tractable organism.

      1. What is the mechanism of Ac45 in regulating V-ATPase activity? In mammals, it has been suggested that Ac45 is essential for proper sorting of the V-ATPase to the destined organelles (e.g. Jansen et al., Mol. Biol. Cell., 2010; Jansen et al., BBA, 2008). In this context, it should be examined whether VhaAC45L-KD would affect the synaptic localization of other V-ATPase subunits.

      We thank the reviewer for pointing out these very interesting references. We have indeed tried to determine the relative abundance of two other V-ATPase subunits at the larval neuromuscular junction in control and VhaAC45L knockdown contexts. However, because the tested subunits are not specific to neurons, and are expressed at relatively low levels in synapses, it was not possible for us to properly separate the synaptic signal from the background immunostaining in surrounding muscles. This unfortunately prevented us from performing an accurate and reliable quantification.

      1. Given that a rodent brain SV contains a few copies of the V-ATPase on average (Takamori et al., 2006, and some newer papers by others), it is interesting that >80% reduction of Ac45 showed moderate effects on quantal size. If SVs under study also contains 1 or 2 V-ATPase per SV, there must be some SVs lacking VhAC45L upon KD. In this context, it is interesting to see how VhaAC-KD (RNAi1~3) affect the frequencies of minis.

      The reviewer’s valuable comment prompted us to undertake new analyses on our electrophysiological recordings. We have now added in Figure 5E graphs showing the mean EPSP frequency for larvae expressing VhAC45L RNAi1 and RNAi2, which are the ones that were used in the quantal analysis. Both of these RNAi apparently decreased the frequency compared to controls, but this difference was not statistically significant. As detailed in the Discussion (line 458-469), this may suggest that VhaAC45L does not influence the abundance of the V-ATPase complex at nerve terminals, but rather its efficiency.

      1. In general, decrease in mini amplitudes is accounted for by changes in postsynaptic sensitivity for neurotransmitters. Although acidification deficits would support that decrease in quantal size is due to the decrease in the driving force for glutamate uptake, it should be examined whether the postsynaptic receptor fields are not affected by VhaAC45L-KD by recording postsynaptic response upon application of non-saturable concentrations of glutamate.

      Testing for potential postsynaptic receptor field alteration by glutamate application would be an interesting experiment indeed, but, as we believe, not a critical control for the present manuscript. Because we expressed RNAis presynaptically, any modification in the postsynaptic receptor field would have to be an indirect consequence of VhaAC45L downregulation in motoneurons, and so, likely to be related to the synaptic vesicle acidification defect. It would not change, therefore, our conclusion that VhaAC45L deficiency in motoneurons induces a decrease in quantal size. Because electrophysiology experiments were carried out in collaboration with another laboratory located in another city, the current sanitary context has so far prevented us from performing this test (please refer to our answer to comment 3 of Reviewer 2 for more details).

      1. Related to 4, it is also interesting to see if evoked responses are also attenuated as a result of VhaAC45L-KD, which is more physiologically relevant for locomotor activity phenotype than minis.

      We also agree with this comment, shared by Reviewer 2, to which we already responded above in our answer to comment 3 of Reviewer 2.

      Minor points:

      1. Quantal size of glutamate is not affected by reduced expression of DVGLUT (Daniels et al., Neuron, 2006), which highly contrasts with VhaAC45L, expression of which defines quantal size. Distinct regulation of quantal size by the transporter and the V-ATPase subunit should be discussed.

      As suggested by the reviewer, a discussion of this point has been added (lines 458-469). and Daniels et al. 2006 is now cited in the revised manuscript.

      1. For electrophysiological experiments, respective sample traces should be shown in Figure 5.

      Quantal size is not directly visible in sample traces, so we added instead representative distributions of spontaneous mEPSP amplitudes in control and VhaAC45L knockdown larvae in the new Figure 5C.

      1. <![endif]>Only RNAi1 and RNAi2 lines were examined for SV pH estimation and mini analysis. The results from RNAi3 should be presented, or at least mentioned in the text.

      These experiments were performed using two different RNAi constructs to ensure that similar effects were observed and to exclude the possibility of potential off-targets. Knocking down VhaAC45L in neurons with RNAi1 and 2 was lethal at pupal stages, suggesting that they give similar levels of inactivation. RNAi3 systematically induced lighter phenotypes, producing viable adults, which led us to believe it had a lower efficiency. Because the results on synaptic vesicle acidification and electrophysiology were very consistent with RNAi1 and RNAi2, we considered that it was not necessary to repeat the experiment with RNAi3.

      Significance

      As mentioned above, as it stands, the authors merely confirmed the pre-existing bioinformatic knowledge on one of the AC45 homologues in Drosophila. The audience of The EMBO Journal might be interested in how different/similar VhaAC45 and VhaAC45-like are, and their functional relevance. In particular, is VhaAC45 also mandatory for the V-ATPase functioning in neurons? Adding some basic information of VhaAC45, e.g. tissue distribution, KO phenotypes, and ability to rescue the VhaAC45-like-KD phenotypes, will certainly improve the comprehensiveness of this study, and capture audience's attention.

      As mentioned in our response to point 1 of the reviewer above, we have added more data comparing the structure and distribution of VhaAC45 and VhaAC45L in the revised manuscript. VhaAC45 appears to be ubiquitously expressed whereas VhaAC45L is neuron-specific.

      Comparing VhaAC45 to VhaAC45L would require a completely new study of VhaAC45 function, because it has never been done before in Drosophila to our knowledge. This would require repeating all the experiments with this other gene, probably involving two more years of work, and would make for a much longer and very different manuscript. It is understandable that this cannot be envisaged. Because homologs of VhaAC45L have never been functionally characterized to date in any species, we considered that it was worth studying this new protein on its own.

      Reviewer #4

      We have reviewed "A specific regulator of neuronal V-ATPase in Drosophila melanogaster." by Dulac et al. The authors have identified VhaAC45L as a regulator of neuronal V-ATPase in Drosophila melanogaster. The authors have utilized multiple techniques to determine the localization of VhaAC45L in neurons and specifically in the synapse. The use of multiple approaches including determining RNA levels in different regions of the fly, and using CRISPR- Cas9 technique to insert V5 tag, makes a very convincing argument about the synapse-specific expression of VhaAC45L.

      The combined use of co-immunoprecipitation technique and LC/MS to show that VhaAC45L co- precipitated with V-ATPase complex subunits is convincing that VhaAC45L is a subunit of V- ATPase. To determine the role of VhaAC45L in acidification of synaptic vesicles the authors have utilized pHluorins in combination with multiple RNAi lines. The authors have used a well- designed experiment to prove that VhaAC45L regulates acidification of the synaptic vesicles.

      Further, larval locomotion and quantal size determination using VhaAC45LRNAi which is known to be altered due to pH gradient of synaptic vesicles shows the functional role of VhaAC45L in synaptic vesicle acidification.

      Minor comments:

      1. For all graphs, please remove gridlines to make data points more visible.

      We found that gridlines can be helpful for the readers to assess approximate values on the graphs. So, we have not removed them but rather changed the colour to a light grey so it does not affect any more visibility. We have also placed the points over the error bars in all the graphs, so they become more apparent.

      1. Line 120-123: Authors indicate the VhaAC45LRNAi induced lethal phenotype when expressed in glutamatergic and cholinergic drivers but the figure is missing. Please indicate as "data not shown" if not included in Figure.

      This mention has been added in the manuscript (line 125).

      1. A diagram summarizing the role of VhaAC45L in V-ATPase enzymatic complex and specific role is recommended.

      We believe that it is too early in this first report to draw an accurate diagram summarizing the role of this new protein in the V-ATPase complex.

      Significance

      V-ATPase play a crucial role at the synapse by being responsible for acidification of the synaptic vesicles and identification of a synaptic vesicle specific regulator of V-ATPase is important to understand the complex regulation of synapse function. The authors have used well-designed experiments to convince the localization and function of VhaAC45L in synaptic vesicle acidification.

      We thank the reviewer for his very positive appreciation of our work.

      Referees cross commenting

      (Written by Reviewer 2)

      There seems to be overall consensus among the reviewers on 3 issues:

      1. A somewhat more precise understanding of the role of vhaAC45L in the synaptic vesicle cycle through better localization studies and some classic assays (like FM dye uptake).

      —See our answers to comments 1 of Reviewer 1 and Reviewer 2.

      1. A little more characterization of the transmission defects (e.g. studying evoked responses) would be welcome.

      —See our answers comment 3 of Reviewer 2.

      1. Ascertaining the validity of the alleles with rescue experiments, perhaps in the V5 mutant background to allow localization analysis in a rescued background.

      —See our answers to comment 2 of Reviewer 2.

      I think further biochemical analysis is interesting but probably beyond the scope of this initial description and would take too much time.

      We fully agree with this statement.

      The minor issues are easy to address

      We have addressed all of them in the preliminary revised version of the manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      We would like to thank you for the comments concerning our manuscript. We responded to each question, as described below. All the authors feel that our manuscript has been much improved by your comments.

      Minor comments:

      Q1. Fig 1 any male vs female mice differences in ATF6b expression?

      Response. We performed qPCR using several tissues from male and felame WT mice, and confirmed no significant differences in Atf6b mRNA levels between male and female mice. We put this result in Figure S1 C.

      Q2. Fig 2C. Please show molecular weight markers on blots

      Response. We put molecular weight markers in Fig.2C, as you suggested.

      Q3. Fig 2C. what are the doublet bands on calnexin?

      Response. Calnexin is sometimes shown as double bands in tissues such as kidney, liver and heart by western blotting (Zeng et al., PLoS One. 2009 Aug 26;4(8):e6787). Although the mechanism is unknown, it could be due to the post-translational modification such as phosphorylation (Wong et al, J Biol Chem. 1998 Jul 3;273(27):17227-35) or partial degradation although proteinase inhibitors are added in the lysis buffer. To my knowledge, alternative splicing is not likely to be the case.

      Q4. Fig 3. what are the ERSE sequences? several different binding sites are reported in literature.

      Response. We put the ERSE sequence in Materials and Methods and in the Figure legends for Figure 3 as “CCAATN9CCACG (Yoshida et al., 1998)”.,

      Q5. p8. What is meant by 5' Atf6b lacks 10 and 11?

      Response We corrected to “Atf6b transcript, which lacks exon 10 and 11, in these mice”.

      Discussion: Please clarify if anti-ATF6-beta antibodies were available for these studies.

      Response. We tried different anti-ATF6β antibodies to detect endogenous ATF6β in culture neurons by western blot. We successfully observed both full-length and N-terminal fragment (the active form) using the one from Biolegend (#853202) (Figure 1E in the new version). We replaced the result with FLAG antibody in HEK293T cells in the old version.

      Discussion: It is puzzling that ATF6a induces calreticulin more potently than ATF6b, but the calreticulin defect is selectively dependent on ATF6b. Could authors speculate on this paradox? It would be interesting to expand on differences between ATF6a and ATF6b function and phenotypes in Discussion in mouse and in people.

      Response. In Discussion, we added sentences regarding a bit puzzling role for ATF6β in CRT expression in the CNS, as below. “All the data from RNA-sequence to the promoter analysis suggested that CRT expression was ATF6β-dependent in primary hippocampal neurons. However, overexpression of ATF6α and ATF6β both enhanced CRT promoter activity…”

      And we proposed a new scenario as below,

      “These results may raise a scenario that, in the CNS, expression of molecular chaperones in the ER is generally governed by ATF6α as previously described (Yamamoto et al., 2007) and that ATF6β functions as a booster if their levels are too low. However, expression of CRT is somewhat governed by ATF6β, and ATF6α functions as a booster. The underlying mechanism for this scenario is not clear yet, but neurons may require a high level of CRT expression even under normal condition, as described in Table S2, which may lead to the development of a unique biological system to constitutively produce CRT in neurons. Further studies are required to clarify the molecular basis how this unique system is constructed and regulated.”

      Reviewer 2

      We would like to thank you for the comments concerning our manuscript. We responded to each question, as described below. All the authors feel that our manuscript has been much improved by your comments.

      Major comments:

      Q1. The post-translational processing of ATF6beta must be demonstrated in hippocampal neurons and not in HEK293T cells in Figure 1E. The authors conclude on Page 6, line 18 that "these results suggest that ATF6beta functions in neurons" but it is not obvious how expression in HEK293T cells contributes to this conclusion in any way.

      Response. We performed western blot with different anti-ATF6β antibodies to detect endogenous ATF6β in culture neurons. We successfully observed both full-length and N-terminal fragment (the active form) from the one from Biolegend (#853202). We therefore replaced the result in HEK293T cells with the one in the hippocampal neurons (Figure 1E in new version).

      Q2. The hippocampal neurons are affected by the loss of ATF6β, even though the mice are not exposed to tunicamycin. Could the authors present evidence that there is physiological ER stress in hippocampal neurons? If not, why is ATF6beta required.

      Response Evidence suggests that neuronal activities including excitatory signals can cause physiological ER stress and induce the UPR at the distal dendrites in the hippocampal neurons (Murakami et al., Neuroscience. 2007 Apr 25;146(1):1-8, Saito et al., J Neurochem. 2018 Jan;144(1):35-49). Among the UPR branches, Ire1-XBP1 pathway has been reported to play an important role in this dendritic UPR and expression of BDNF in cell soma (Saito et al., 2018). Although the present study focuses on the role of ATF6β in the pathological ER stress which causes neuronal death, we believe that it will be intriguing to analyze its role of ATF6β in the physiological ER stress and in the local UPR machinery in neurons.

      Q3. In Figure 3, is there a specific reason why the authors do not mutate the ERSEs in the mouse CRT reporter, pCC1 and instead opt to analyze the huCRT reporter? Given that all the other observations in the manuscript are in mouse calreticulin, it is important to show that the ERSEs in the mouse calreticulin promoter are also regulated in an ATF6beta-dependent manner. Similar to the huCRT reporter, it is also crucial to examine if ATF6beta can regulate the mouse CRT promoter. This would provide an explanation for why calreticulin expression is not completely abolished in ATF6beta mutants.

      Response We added the data of the deletion mutant of mouse CRT promoter, pCC3, which has only 415bp, but still keeps both ERSE1 and 2 in it. pCC3 showed similar promoter activity to pCC1 (Figure 3 B) and huCRT (wt) (Figure 3 C) in both of WT and Atf6b-/- neurons. Because pCC5, which has 260bp but does not have ERSEs in it, lost completely CRT promoter activity (Waser et al., 1997), it is most likely that mouse and human CRT promoters are regulated in a similar manner via ERSEs.

      Q4. In Figure 5A and B, the density of Tubulin staining varies from panel to panel, and is much lower in ATF6beta mutants treated with Tg/Tm. Presumably this is because of cell death but this should be clarified in the main text. Additionally, it is unclear if the EthD-1 staining is nuclear localized. It would help if single channel images for Hoechst and EthD-1 were provided to visualize this.

      Response In Figure 5A and B, we added the statement for the reduction of Calcein-AM (A) and βIII tubulin (B) in the main text. We also added single channel images for Hoechst and EthD-1 in Figure S4 to confirm the nuclear localization of EthD-1.

      Q5. The literature reports that BAPTA-AM treatment itself could cause ER stress (e.g. PMID: 12531184). Here, the authors report the opposite effect. How could the authors reconcile the difference? The effects of BAPTA-AM and 2-APB must individually be examined in Figure 6C and not just in combination with Tm.

      Response. We added the data that BAPTA-AM and 2-APB alone did not cause neuronal death at the concentrations used in this study in Figure S6 B and in the main text.

      Q6. The authors allude to "impairment of Ca2+ homeostasis in ATF6beta mutants" in Page 13 Line 2, but do not show any direct evidence in support of it. While treatment with BAPTA-AM and 2-APB is a start in that direction, it certainly does not demonstrate that under homeostatic conditions in vivo or in vitro there is any change in calcium flux in ATF6beta hippocampal neurons. To make the case that there is indeed perturbation of Ca2+ in ATF6beta mutant hippocampal neurons, the authors need to examine calcium flux and measure calcium indicators and how they are affected when ER stress is induced in these mutant cells.

      Response We added the data that the Ca2+ store in the ER was reduced and Ca2+ concentration in the cytosol increased in Atf6b-/- neurons both under normal and ER stress conditions in Figure 4C.

      Q7. The effect of 2-APB and salubrinal alone on hippocampal neurons need to be examined in Figure 9B-D to eliminate the possibility that these drugs are not enhancing cell survival under normal conditions in a parallel manner.

      Response We added the data that 2-APB and salubrinal alone did not cause neuronal death in the hippocampus in our model in Figure S8 C.

      Q8. The rationale for the examination of Fos, Fosb and Bdnf is poorly described (page 14, line 13) and the conclusions from this line of experimentation are rather weak. The results from Figure 9 to some extent serve to confirm in vivo the data seen in Figure 6C but by no means provide a mechanism for why ATF6beta mutants have perturbed calcium homeostasis (page 14, line 22).

      Response We agreed with your comments that the examination of Fos, Fosb and Bdnf is relatively weak. We, therefore, moved these data to supplementary information (Figure S8 A and B).

      Minor comments:

      Q1. Page 8, line 3: Their rationale for why ATF6beta 5'UTR sequences are seen in their RNA seq data is not clearly explained. This must be rewritten for clarity.

      Response In Atf6b-/- mice, exon 10 and 11 were deleted by homologous recombination. Therefore, 5’ part of Atf6b gene including exon 1-9 can be transcribed. We added the statement in Results, as below.

      “this may be due to the presence of the 5’ Atf6b transcript with exon 1-9 in these mice, in which exon 10 and 11 were deleted by homologous recombination.”

      Q2. Page 8, line 5, the authors write that besides Atf6β , CRT was the only UPR-regulated gene downregulated in Atf6β mutant mice. The authors need to state how they defined "UPR-regulated genes". There must be a list, which the authors do not cite.**

      Response. To avoid the possible confusion, we changed the term “UPR-regulated genes” to “ER stress-responsive genes”.

      Q3. Page 9, line 10: A reference is required for ERSEs.

      Response We added the reference for ERSEs, as you suggested.

      Q4. Page 10, line 6: The authors say "ATF6beta specifically induces CRT promoter activity". This is a confusing statement because "induction" is in response to stress, but the context here is homeostatic regulation since there is ostensibly no stress being induced. This distinction should be made and corrected here and throughout the manuscript.

      Response To avoid the confusion, we changed the sentence to “ATF6β specifically enhances CRT promoter activity”.

      Q5. Page 10, line 16: The use of "latter" here is confusing and it would help to restructure this sentence for clarity.

      Response To avoid the confusion, we changed the phrase to “under control condition and after stimulation with Tg (Figure 4A upper row) or Tm (Figure 4A lower row)

      Q6. Figure 9A is missing Y-axis labels.

      Response We changed Figure 9A (Figure S8 A in new version) and Figure Legends to clarify what each axis indicates.

      Reviewer 3

      We would like to thank you for the comments concerning our manuscript. We responded to each question, as described below. All the authors feel that our manuscript has been much improved by your comments.

      Major comments

      Comment #1. The authors show that overexpression of either Atf6a or Atf6b both increase Crt expression in Atf6b knockout cells. While it is clear that deletion of Atf6a does not basally reduce Crt levels, the overexpression experiment does lead to a question as to how Atf6b can specifically be involved in regulating Crt expression. In the discussion, the authors seem to propose that homo- and hetero-dimerization of ATf6a and Atf6b are required for the basal expression of Crt and that Atf6b serves as a 'booster' of ER chaperone expression. They explicitly state that "Atf6a and Atf6b are required to induce CRT expression". However, it remains unclear to me why in this case would Atf6a deletion not impair Crt expression? The authors address this by invoking a mechanism whereby hippocampal neurons are more reliant on Atf6b for Crt expression, but this doesn't really make sense to me. Ultimately, this point underscores the lack of clear mechanistic basis to explain how Atf6b selectively regulates Crt in the hippocampus. This needs to be better resolved through more experimentation. For example, a ChIP experiment monitoring the binding of ATF6b and ATF6a to the Crt promoter in hippocampal and control cells would go a long way towards addressing this issue.**

      Response. In Discussion, we first made the point clearer that CRT expression is ATF6β-dependent, while those of other molecular chaperones in the ER are ATF6α-dependent. Then, we raised a scenario that, in the CNS, expression of molecular chaperones in the ER is generally governed by ATF6α as previously described (Yamamoto et al., 2007) and ATF6β functions as a booster if their levels are too low. However, expression of CRT is somewhat governed by ATF6β, and ATF6α functions as a booster. We also wrote the limitation of the current study and requirement of the further study to clarify the molecular basis of the unique system to ensure CRT expression in neurons.

      Comment #2. The importance of ATF6b for protecting against insults needs to be better described. For example, the authors should show that overexpression of ATF6b protects against ER stress induced neuronal toxicity in cell culture and in vivo kainate induced neuronal toxicity. Similarly, the authors should evaluate how overexpression of ATF6a protects against these insults to better define the specific dependence of hippocampal neurons on ATF6b. The authors do show that overexpression of ATF6b can rescue the reduced Crt observed in Atf6b-deleted neurons, but the protection should similarly be demonstrated.**

      Response. We performed rescuing experiments to see both of ATF6β and ATF6α overexpression improve cell viability of Atf6b-/- neurons under ER stress. Interesting. ATF6β, but not ATF6α, rescued Atf6b-/- neurons. In Discussion, we raised the possible reasons as below.

      “The lack of rescuing effect of ATF6α may be due to the fact that this molecule enhances the expression of different genes including cell death-related molecule CHOP in addition to molecular chaperons in the ER (Yoshida et al., 2000).”

      Comment #3. Similar to #2, the authors should show that the potential for ATF6b (and ATF6a) overexpression to protect against different insults is impaired in Crt+/- neurons. The authors demonstrate that Crt-depletion increases sensitivity to toxic insults. This would go a long way to demonstrate the importance of the proposed ATF6b-CRT signaling axis in regulating neuronal survival in response to pathologic insults.**

      Response. Unfortunately, right now, the breeding of Calr+/- mice is not in good condition. Although we are increasing the number of mice used for breeding, we have to wait pregnancies to get embryos for isolating neurons from hippocampus. Once we get enough number of mice, we would try the rescuing experiment of Calr+/- hippocampal neurons with ATF6β and ATF6α. However, we also think rescuing experiments of Atf6b-/- neurons by ATF6β, ATF6α, and CRT may be enough in this paper.

      Comment #4. When reporting the RNAseq data, the authors should use the q-value (i.e., FDR) instead of the p-value. This will likely affect the number of genes reported in Table 1, but it is the appropriate statistical test for this type of data.**

      Response. As you suggested, we replace Table1 with a new list which was filtered with the q-value. However, some important and consistent information were obtained from the list filtered with the p-value, we keep it as Table S1 in the supplementary information.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, the authors define the functional importance of ATF6b in the hippocampus. They show that ATF6b is highly expressed in the hippocampus relative to other tissues. They demonstrate that deletion or depletion of ATF6b in cultured hippocampal neurons enhances ER stress induced death. Similarly, Atf6b-/- mice show increased sensitivity to kainate induced neuronal death. These results reveal an important role for ATF6b in regulating hippocampal survival in response to pathologic insults. To define a molecular basis for this protection, the authors utilized RNAseq to identify the lectin chaperone calreticulin (Crt) as a gene whose expression is basally reduced in cultured hippocampal neurons where Atf6b is deleted. They show the re-overexpression of Atf6b (or Atf6a) both restore Crt levels in these neurons, underscoring the importance of Atf6 in regulating basal Crt levels. They go on to demonstrate that loss of Atf6b impairs ER stress-dependent increases in Crt, while minimally impacting other Atf6 target genes, again highlighting the importance of Atf6b for Crtregulation. Importantly, overexpression of Crt rescues the increased ER stress-induced toxicity observed in Atf6b knockout neurons, indicating that a primary mechanism by which Atf6b regulates neuronal survival in response to ER stress is through increased Crt expression. Consistent with this, mimicking the 50% reduction in Crt observed in Atf6b knockout neurons using Crt+/- mice showed similar sensitivity to kainate induced neuronal death. Collectively, these results describe an Atf6-Crt axis that is important for regulating neuronal survival in response to pathologic insults.

      Overall the experiments are interesting and provide new insights into the importance of Atf6b in neuronal survival. Notably, the evidence showing that loss of Atf6b increases hippocampal neuron sensitivity to ER stress and kainate induced toxicity are compelling. Any results describing the biological function of Atf6b are interesting, considering how little we know about this ER stress sensing protein. That being said, I have some concerns about the work described that require addressing before publication. Notably, I think more work needs to be done to define the molecular basis for the specific dependence of Crt expression on ATF6b in hippocampal neurons. Further, the authors need to do more experiments to demonstrate the specific importance of ATF6b signaling in the context of ER stress and in vivo neuronal death. I outline these various concerns below:

      Comment #1. The authors show that overexpression of either Atf6a or Atf6b both increase Crt expression in Atf6b knockout cells. While it is clear that deletion of Atf6a does not basally reduce Crt levels, the overexpression experiment does lead to a question as to how Atf6b can specifically be involved in regulating Crt expression. In the discussion, the authors seem to propose that homo- and hetero-dimerization of ATf6a and Atf6b are required for the basal expression of Crt and that Atf6b serves as a 'booster' of ER chaperone expression. They explicitly state that "Atf6a and Atf6b are required to induce CRT expression". However, it remains unclear to me why in this case would Atf6a deletion not impair Crt expression? The authors address this by invoking a mechanism whereby hippocampal neurons are more reliant on Atf6b for Crt expression, but this doesn't really make sense to me. Ultimately, this point underscores the lack of clear mechanistic basis to explain how Atf6b selectively regulates Crt in the hippocampus. This needs to be better resolved through more experimentation. For example, a ChIP experiment monitoring the binding of ATF6b and ATF6a to the Crt promoter in hippocampal and control cells would go a long way towards addressing this issue.

      Comment #2. The importance of ATF6b for protecting against insults needs to be better described. For example, the authors should show that overexpression of ATF6b protects against ER stress induced neuronal toxicity in cell culture and in vivo kainate induced neuronal toxicity. Similarly, the authors should evaluate how overexpression of ATF6a protects against these insults to better define the specific dependence of hippocampal neurons on ATF6b. The authors do show that overexpression of ATF6b can rescue the reduced Crt observed in Atf6b-deleted neurons, but the protection should similarly be demonstrated.

      Comment #3. Similar to #2, the authors should show that the potential for ATF6b (and ATF6a) overexpression to protect against different insults is impaired in Crt+/- neurons. The authors demonstrate that Crt-depletion increases sensitivity to toxic insults. This would go a long way to demonstrate the importance of the proposed ATF6b-CRT signaling axis in regulating neuronal survival in response to pathologic insults.

      Comment #4. When reporting the RNAseq data, the authors should use the q-value (i.e., FDR) instead of the p-value. This will likely affect the number of genes reported in Table 1, but it is the appropriate statistical test for this type of data.

      Significance

      This manuscript provides new context for understanding the functional relationship between Atf6a and the less-studied Atf6b in regulating neuronal survival. As with other studies focused on the relationship between these two ATF6 isoforms, this study demonstrates that these transcriptional programs integrate to coordinate a tissue-specific response to ER stress. Intriguingly, these studies indicate that ATF6b has a specific role in regulating the ER lectin chaperone CRT and that this ATF6b-CRT axis uniquely regulates neuronal survival in response to ER stress. While additional experiments are required to support this claim, the work described herein is a nice addition to our evolving understanding of the importance of ATF6b in regulating ER and cellular physiology during pathologic insults.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Unfolded Protein Response (UPR) refers to homeostatic signaling pathways that play protective roles in various cell types. This work by Nguyen et al focuses on the UPR-mediator ATF6. In mammals, there are two isoforms of ATF6, alpha and beta. Nguyen et al show that the expression of the ATF6beta isoform is higher in hippocampal neurons whereas the ATF6alpha isoform is more evenly distributed across various neuronal subtypes. By performing gene expression profiling in mouse brain samples, they identify the ER chaperone calreticulin (CRT) as being significantly downregulated in ATF6beta null mutants. They further validate this observation by comparing hippocampi from ATF6alpha and ATF6beta null mice, where CRT is lowered in the latter but not the former. They identify and mutate putative ER stress response elements (ERSE) in the CRT promoter region to show that expression of CRT can be regulated by both ATF6alpha and beta. They demonstrate that treatment of hippocampal neurons with ER stress inducing chemicals leads to induction of CRT, which is suppressed in ATF6beta mutants. Such treatment also leads to cell death, which is exacerbated in ATF6beta mutants but rescued by ectopic expression of CRT. They also extend these observations to cell death induced by treatment with the glutamate receptor agonist, kainate, which was also exacerbated in ATF6beta mutants, but was rescued by counter treatment with ER stress inhibitors. Together, their data suggest a protective role for ATF6beta in hippocampal neurons in the context of ER stress.

      Major comments

      The primary advantage of this work is that much of it was done in vivo in mice, providing immediate context for the role of ATF6β under physiological conditions. They identify a specific region of the brain that requires ATF6beta. On the other hand, the ATF6-CRT signaling axis reported here had been established previously, and therefore, this study brings limited conceptual advances regarding the signaling mechanism itself (see Significance section below).

      Overall, the authors' data support their primary claim that ATF6β has a neuroprotective role in the context of ER stress. The data presented are clear and convincing, and their methods appear rigorous. The manuscript could be further improved if the authors could provide sufficient rationale for some of their experiments, which are discussed below.

      1. The post-translational processing of ATF6beta must be demonstrated in hippocampal neurons and not in HEK293T cells in Figure 1E. The authors conclude on Page 6, line 18 that "these results suggest that ATF6beta functions in neurons" but it is not obvious how expression in HEK293T cells contributes to this conclusion in any way.
      2. The hippocampal neurons are affected by the loss of ATF6β, even though the mice are not exposed to tunicamycin. Could the authors present evidence that there is physiological ER stress in hippocampal neurons? If not, why is ATF6beta required.
      3. In Figure 3, is there a specific reason why the authors do not mutate the ERSEs in the mouse CRT reporter, pCC1 and instead opt to analyze the huCRT reporter? Given that all the other observations in the manuscript are in mouse calreticulin, it is important to show that the ERSEs in the mouse calreticulin promoter are also regulated in an ATF6beta-dependent manner. Similar to the huCRT reporter, it is also crucial to examine if ATF6beta can regulate the mouse CRT promoter. This would provide an explanation for why calreticulin expression is not completely abolished in ATF6beta mutants.
      4. In Figure 5A and B, the density of Tubulin staining varies from panel to panel, and is much lower in ATF6beta mutants treated with Tg/Tm. Presumably this is because of cell death but this should be clarified in the main text. Additionally, it is unclear if the EthD-1 staining is nuclear localized. It would help if single channel images for Hoechst and EthD-1 were provided to visualize this.
      5. The literature reports that BAPTA-AM treatment itself could cause ER stress (e.g. PMID: 12531184). Here, the authors report the opposite effect. How could the authors reconcile the difference? The effects of BAPTA-AM and 2-APB must individually be examined in Figure 6C and not just in combination with Tm.
      6. The authors allude to "impairment of Ca2+ homeostasis in ATF6beta mutants" in Page 13 Line 2, but do not show any direct evidence in support of it. While treatment with BAPTA-AM and 2-APB is a start in that direction, it certainly does not demonstrate that under homeostatic conditions in vivo or in vitro there is any change in calcium flux in ATF6beta hippocampal neurons. To make the case that there is indeed perturbation of Ca2+ in ATF6beta mutant hippocampal neurons, the authors need to examine calcium flux and measure calcium indicators and how they are affected when ER stress is induced in these mutant cells.
      7. The effect of 2-APB and salubrinal alone on hippocampal neurons need to be examined in Figure 9B-D to eliminate the possibility that these drugs are not enhancing cell survival under normal conditions in a parallel manner.
      8. The rationale for the examination of Fos, Fosb and Bdnf is poorly described (page 14, line 13) and the conclusions from this line of experimentation are rather weak. The results from Figure 9 to some extent serve to confirm in vivo the data seen in Figure 6C but by no means provide a mechanism for why ATF6beta mutants have perturbed calcium homeostasis (page 14, line 22).

      Minor comments

      1. Page 8, line 3: Their rationale for why ATF6beta 5'UTR sequences are seen in their RNA seq data is not clearly explained. This must be rewritten for clarity.
      2. Page 8, line 5, the authors write that besides Atf6β , CRT was the only UPR-regulated gene downregulated in Atf6β mutant mice. The authors need to state how they defined "UPR-regulated genes". There must be a list, which the authors do not cite.
      3. Page 9, line 10: A reference is required for ERSEs.
      4. Page 10, line 6: The authors say "ATF6beta specifically induces CRT promoter activity". This is a confusing statement because "induction" is in response to stress, but the context here is homeostatic regulation since there is ostensibly no stress being induced. This distinction should be made and corrected here and throughout the manuscript.
      5. Page 10, line 16: The use of "latter" here is confusing and it would help to restructure this sentence for clarity.
      6. Figure 9A is missing Y-axis labels.

      Significance

      The authors summarize their major findings of the study (at the beginning of the Discussion) as ATF6β being required for CRT induction in the hippocampus, and that this ATF6β -CRT axis is important for the survival of hippocampal neurons. The idea that ATF6 induces CRT had been previously shown by others (PMID 9837962), and therefore, this is not the major new discovery of this study. In addition, the ATF6-calreticulin axis having a cell protective role had been reported in other biological contexts (e.g. PMID: 32905769), so that concept is also not a novel concept presented in this work. Similarly, the role of UPR in glutamate receptor agonist-induced neuronal cell death had been shown previously (the authors cite Kitao e tal., 2001; Sokka et al., 2007; Kezuka et al., 2016), so this link is not the major novel discovery revealed by this study. Instead, this study reports that ATF6β KO mice have specific phenotypes in hippocampal neurons, which had not been reported previously. Furthermore, this manuscript reports detailed information regarding Atf6β's downstream target genes in this tissue. In summary, this study's finding that ATF6 regulates CRT is confirmatory, rather than bringing new conceptual advances. The merit of this study is in the identification of the hippocampus as the organ that specifically requires ATF6beta. While the findings here may not appeal to a broader audience interested in UPR signaling mechanisms, it may draw interest from those who study hippocampal neuron physiology.

      For the editor's reference, this reviewer's field of expertise is in UPR signaling mechanisms in animal models

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Nguyen and colleagues provide evidence that ATF6-beta selectively induces calreticulin expression in mouse hippocampal neurons to protect these neurons from ER stress-inducing toxins. This is a well-written and well-organized report that provides functional information about ATF6-beta, a poorly studied homolog of the ATF6-alpha Unfolded Protein Response regulator. The report suggests that ATF6-beta has a previously unknown and important function in helping brain neurons survive ER stress by regulating calreticulin.

      The study shows that addition of BAPTA, 2-APB, or salubrinal significantly improves neuronal survival in ATF6-/- explants and mice brains in response to ER toxins. But, prior study (PMID: 15705855) used salubrinal at much higher concentration 75uM with little effect at the 5uM dose used in the current study. Evidence should be provided that these drugs are specifically inhibiting ER stress or off-target mechanisms should be discussed in their experimental models.

      Minor comments:

      Fig 1 any male vs female mice differences in ATF6b expression?

      Fig 2C. Please show molecular weight markers on blots

      Fig 2C. what are the doublet bands on calnexin?

      Fig 3. what are the ERSE sequences? several different binding sites are reported in literature.

      p8. What is meant by 5' Atf6b lacks 10 and 11?

      Discussion: Please clarify if anti-ATF6-beta antibodies were available for these studies.

      Discussion: It is puzzling that ATF6a induces calreticulin more potently than ATF6b, but the calreticulin defect is selectively dependent on ATF6b. Could authors speculate on this paradox? It would be interesting to expand on differences between ATF6a and ATF6b function and phenotypes in Discussion in mouse and in people.

      Significance

      ATF6-beta is homolog of ATF6-alpha and assumed to function like ATF6-alpha. This report describes a selective function of ATF6-beta in inducing calreticulin in mouse neurons during ER stress. This suggests ATF6-beta has some different functions than ATF6-alpha in the mouse hippocampal neurons.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      RESPONSE TO REVIEWERS

      We thank Review Commons and its three reviewers. Reviewers 2 and 3 provide detailed comments, which we address individually. Reviewer 1, however, gives a general critique of how we have approached asking how genome architecture affects the extent of evolution and the details of evolutionary trajectories. Our interpretation of their comments is that our approach and the one that they advocate represent two philosophically different, but complementary, views about how to study evolution in the laboratory. We begin by discussing this difference and then proceed to a point by point response to the three reviews.

      Reviewer 1

      Philosophical differences with Reviewer 1

      We interpret Reviewer 1’s comments as endorsing a formal, quantitative study of evolution that aims to explain the factors that control the rate at which fitness increases during experimental evolution. This approach derives from classical population genetics and aims to use a mixture of theory and experiment to uncover general principles that would allow rates of evolution and evolutionary trajectories, expressed as population fitness over time, to be predicted from quantitative parameters, such population sizes, mutation rates, distributions of the fitness effects of mutations (including their degree of dominance in diploids), and global descriptions of either general (e.g. diminishing returns) or allele-specific epistasis.

      This approach aims to predict how the average fitness trajectory should be affected by variations in these parameters and describe the variation, at the level of fitness, in the outcomes in a set of parallel experiments. This is an important approach and have previously used it to investigate how the strength of selection influences the advantage of mutators (Thompson, Desai, & Murray, 2006) and to produce and test theory that predicts how mutation rate and population size control the rate of evolution (Desai, Fisher, & Murray, 2007). Like every approach to evolution, this one has limitations: 1) if it doesn’t identify mutations or investigate phenotype other than fitness, it cannot reveal the biological and biochemical basis of adaptation or report on how variations in population genetic parameters (population size, haploids versus diploids, etc.) influence which genes acquire adaptive mutations, and 2) if the details of experiments (e.g. whether populations are clonal or contain standing variation, or which phenotypes are being selected for) have strong effects on the population genetic parameters, these must be measured before theoretical or empirical relationships could be used to predict the mean and variance of fitness trajectories produced by a given selection. A variety of evidence suggests that the second limitation is real. Examples include the absence of a universal finding that diploid populations evolve more slowly than haploids (discussed on Lines 437-442), even within the same experimental organism, and the finding that diminishing returns epistasis applies well to domesticated yeast evolving in a variety of laboratory environments (e. g. papers from the Desai lab, starting with (Kryazhimskiy, Rice, Jerison, & Desai, 2014) but not to the evolutionary repair experiments that we have conducted (Fumasoni & Murray, 2020; Hsieh, Makrantoni, Robertson, Marston, & Murray, 2020; Laan, Koschwanez, & Murray, 2015).

      The second approach to experimental evolution, which we, as molecular geneticists and cell biologists, predominantly take, is to follow the molecular and cell biological details of how organisms adapt to selective pressure. We subject organisms to defined selective forces, identify candidate causative mutations, test them by reconstructing the evolved mutations, individually and in combination, and perform additional experiments to ask how these mutations are increasing fitness. Because these experiments are performed on model organisms and often address phenotypes that have been studied by classical and molecular genetics, we can often say a good deal about the cell biological and biochemical mechanisms that increase fitness and this work can complement and extend what we know from classical and molecular genetics.

      The current manuscript and its predecessor are examples of finding causative mutations and asking how they improve fitness, with the first paper (Fumasoni & Murray, 2020) demonstrating how mutations in three functional modules could overcome most of the fitness cost of removing an important but non-essential protein and the current paper asking how alterations in genome architecture and dynamics (diploidy and eliminating double-strand break-dependent recombination) affect the extent to which populations increase in fitness and which genes and functional modules acquire mutations as they do so.

      By definition, such experiments are anecdotal: they report on how particular genotypes and genome architectures respond to particular selection pressures. Any individual set of experiments can produce conclusions about the effects of variables, such a population size, mutation rate, and genome architecture, on the mutations that increased fitness in response to the specific selection, but they can do more than lead to speculation and inference about what would happen in other experiments: speculation from the results of a single project and inferences from the combined results of multiple projects. Our interpretation is that the evolutionary repair experiments that we have performed, which have perturbed budding, DNA replication, and the linkage between sister chromatids do indeed lead to a common set of inferences: most of the selected mutations reduce or eliminate the function of genes, the interactions between the selected mutations are primarily additive, and the mutations cluster in a few functional modules.

      We believe that the population and molecular genetic approaches to experimental evolution are complementary and that a full understanding of evolution will require combining both of them. We think this will be especially true as we try to use the findings from laboratory studies to improve our understanding of evolution outside the lab, which takes place over longer periods, in more temporally and spatially variable environments, and is subject to variation in multiple population genetic and biological parameters.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): In their previous work the authors examined adaptation in response to replication stress in haploid yeast, via experimental evolution of batch cultures followed by sequencing. Here they extend this approach to include diploid and recombination-deficient strains to explore the role of genome architecture in evolution under replication stress. On the whole, a common set of functional modules are found to evolve under all genetic architectures. The authors discuss the molecular details of adaptation and use their findings to speculate on the determinants of adaptation rate.

      **SECTION A - Evidence, reproducibility and clarity** Experimental evolution can reveal adaptive pathways, but there are some challenges when applying this approach to compare genetic backgrounds or environments. They key challenge is that adaptation potentially depends on both the rate of mutation and the nature of selection. Distinct adaptation patterns between groups could therefore reflect differential mutation, selection, or both. The authors allude to this dichotomy but have very limited data to address it. The closest effort is engineering putatively-adaptive variants into all genetic background including those where they did not arise; the fact that such variants remain beneficial suggests they did not arise in certain backgrounds because of a lower mutation rate, but this is a difficult issue to tackle quantitatively.

      We agree, wholeheartedly, that adaptation depends on the combination of mutation rates and the nature of selection and our goal was to ask how the molecular nature of adaptation depends on genome architecture when three different architectures are subjected to the same selection: constitutive replication stress caused by removing an important component of replisome. We used a haploid strain as a baseline and compared it to two other strains chosen to influence either the effect of mutations (a diploid, where fully recessive mutations that were beneficial in the haploid would become neutral) or the rate of mutations (a recombination-defective strain that would be unable to use ectopic recombination to amplify segments of the genome). In both cases, we expected to see effects that are closer to qualitative than quantitative: the absence of fully recessive mutations in evolved diploids and absence of segmental amplification in the recombination-deficient haploid. We see both effects and they then allow us to ask two other questions: 1) does influencing the effect of a class of mutation (diploids) or preventing a class of mutation (recombination defect) have a major effect on the rate of evolution, and 2) do these differences affect which modules adaptive mutations occur in. As far as we can tell, the answer is no to both questions. We use “as far as we can tell” because our experiments do have limitations. First, the recombination-defective strain has a higher point mutation rate making it impossible to tell how much this elevation, rather than any other factor, accounts for it showing a greater fitness increase than the recombination-proficient haploid. Unfortunately, to our knowledge, it’s impossible to abolish recombination without affecting mutation rates. Second, we only experimentally tested a subset of the inferred causative mutations meaning that for many genes, our assertion that they are adaptive is a statistical inference and their assignment to a particular functional module is based on prior literature rather than our own experiments. In response to this criticism, we have now rephrased some of our sentences (see below).

      From mutation accumulation experiments, where the influence of selection is minimized, there is evidence that genetic architecture affects the rate and spectrum of spontaneous mutations. In this experiment, the allele used to eliminate recombination, rad52, will also increase the mutation rate generally. The diploid strain is also likely to have a distinct mutational profile--as a null expectation diploids should have twice the mutation rate of haploids. Recent evidence indicates the mutation rate difference between haploid and diploid yeast might be less than two-fold, but that there are additional differences in the mutation spectrum, including rates of structural change. The context for this study is therefore three genetic architectures likely to differ in multiple dimensions of their mutation profiles, but mutation rates are not measured directly.

      The reviewer is correct that we did not explicitly measure mutation rates, although the frequency of synonymous mutations (Figure 3-S1B) is a proxy for the point mutation rate as long as the majority of these mutations are assumed to be neutral. By this measure, the mutation rates for ctf4∆ haploids and ct4∆/ctf4∆ diploids, expressed per haploid genome, are close to each other (1.94 for haploids and 1.37 for diploids) but different enough to return p = 0.044 by Welch’s test, whereas the mutation rate for the recombination-deficient, ctf4∆ rad52∆ haploid is 4 to 5-fold higher (7.03). In contrast, we can infer that the ctf4∆ rad52∆ strain has much lower rates of segmental aneuploidy produced by recombination: we see only one such event in this strain in contrast to 16 in the ctf4∆ haploid and 44 in the ctf4∆/ctf4∆ diploid (Supplementary table 4), even though the amplification of the cohesin loader gene, SCC2, confers similar benefits in all three strains.

      The nature of selection on haploids and diploids is expected to differ because of dominance, but ploidy-specific selection is also possible. The authors discuss how recessive beneficial alleles may be less available to diploids, though this can be offset by relatively rapid loss of heterozygosity. However, diploids should also incur more mutations, all else being equal. The rate of beneficial mutation, as opposed to the rate of mutation generally, will depend on the mutational "target size" of fitness, and the authors findings recapitulate other literature (particularly regarding "compensatory" adaptation) that points to faster adaptation in genotypes with lower starting fitness.

      We agree with the reviewer and tried to make the point that which mutations are fixed is primarily determined by the product of the rate at which they occur and the benefit which they confer (lines 193-196). Evidence in budding yeast suggests that in diploid cells, removing one copy of most genes fails to produce a measurable fitness benefit (Deutschbauer et al., 2005), suggesting that losing one copy of many genesis purely recessive. If this was always the case, it would be very hard for such heterozygous, loss-of-function mutations to contribute to evolution in diploids: a mutation that inactivates one copy of a gene would have to rise to high enough frequency by genetic drift that homozygosis of this mutation mitotic recombination would have a significant probability. Instead we find that heterozygous mutations in some genes (inactivation of RAD9, what are likely to be hypomorphic mutations in SLD5) but not others (inactivation of IXR1) confer benefits in diploids that allow their frequency to rise much more rapidly by selection than they would by drift, allowing them to reach frequencies at which mitotic recombination becomes probable.

      There is ample literature on the above topics, particularly discussions of the evolutionary advantages of haploidy versus diploidy. While adaptation to replication stress provides a novel starting point for this investigation, much of the manuscript is devoted to long-standing questions that are not specific to replication stress. Unfortunately, the data the authors collected is not sufficient to shed light on these questions, because mutation and selection cannot be effectively distinguished. The Discussion states that "We find that the genes that acquire adaptive mutations, the frequency at which they are mutated, and the frequency at which these mutations are selected all differ between architectures but that mutations that confer strong benefits always lie in the same three modules" (line 379), but it is not clear that these statements are all supported by the data.

      The reviewer makes two points: we fail to make a significant contribution to long-standing questions about the evolutionary genetics of adaptation and the we make statements that are not supported by our data. On the first we disagree: unlike much of the previous work which compares the effects of mutation rates and population sizes on the rates of evolution, we sequence genomes, identify putative causative mutations, verify that they increase fitness, and test, by reconstruction, how their contribution to fitness is affected by fully characterized genome architectures. We know of no comparable work and we believe that this is a useful contribution to understanding evolution. In addition, some of the literature, for example the discussion of haploidy versus diploidy, has failed to reach a universal conclusion. On the second point, we realized that the statement that the reviewer quotes is stronger than it should be since we do not show “that mutations that confer strong benefits always lie in the same three modules”. What we do show is that mutations in all three modules are found in all three genome architectures (Figure 5), and that combining one mutation from each module (using mutations in genes that are found in that architecture) can reproduce the observed fitness increase in each architecture (Figure 6 B), but the reviewer is correct that we have not demonstrated that every clone from every population has an adaptive mutation in all three modules. We have therefore modified the quoted sentence as follows (altered wording underlined)

      "We find that the genes that acquire adaptive mutations, the frequency at which they are mutated, and the frequency at which these mutations are selected all differ between architectures but that mutations conferring strong benefits can occur in all three modules in each architecture" (Lines 405-408)

      Focusing on the more novel aspect of their experiment-the presence of replication stress-would arguably be a better approach. On this topic the authors have some interesting observations and speculation, but clear predictions are lacking. The introduction section could be redesigned to explicitly state why genome architecture might affect adaptation in response to replication stress in particular, rather than (or in addition to) adaptation generally. If there were no differences in mutation, does the nature of Ctf4 lead to predictions that the molecular basis of compensatory adaptation should differ among genome architectures? Without such predictions it will be difficult for readers to know whether the observation that different genome architectures follow similar adaptive paths is surprising or not.

      We believe that following this suggestion would diminish the paper. We set out to ask how genome architecture affected adaptation to the strong fitness defect produced by removing an important component of an essential process, DNA replication. We chose replication stress as an example of cell biological damage that cells would have to repair with the hope that the results would give general clues about evolutionary repair, rather than hoping that the experiment would inform us about how replication stress altered the types of mutation (e. g. point mutations versus segmental amplification) that were selected As we point out at the beginning of our response, we recognize that the result of any one such experiment must be anecdotal and any attempt to generalize must be described as speculation if it refers only to this one experiment, or inference if it refers to this experiment and other published work. In those cases where we discuss the effect of genome architecture on evolutionary trajectories, we can draw conclusions that apply to our own experiments, but can only speculate on adaptation to different selections. In others, where we see commonalities between our experiments and previous work on evolutionary repair (cite Review), we can make inferences about evolution to adapt to removing important proteins and speculate about other forms of selection. We have revised the discussion to make it clear where we conclude, where we speculate, and where we infer. We suspect that our finding that genome architecture has a larger effect on which genes acquire adaptive mutations than it does on which modules these mutations alter will generalize to other evolutionary repair experiments and may be true even more broadly.

      We deliberately did not make predictions about the effect of genome architecture on the rate at which population fitness increased or the mechanism of adaptation to replication stress because we believed that our ignorance and the diverging results of previous experiments was sufficient to make both exercises worthless. After the fact, we interpret our results to suggest that mutations that reduce the activity of components, such as Sld5, that are stably associated with replication forks should be semi-dominant, but we were not nearly smart enough to make such a specific prediction before the experiment began!

      **Minor comments:** Shifts in ploidy from diploid to haploid are less common than the reverse change, so the observation of such a shift (Fig. 1) should be discussed in more detail.

      We now mention that haploids becoming diploids is more common than the reverse transformation and point out that genome sequencing reveals that these strains are true haploids rather than aneuploids.

      “One diploid population (EVO14) gave rise to a population with a haploid genome content, suggesting a possible haploidization event during evolution. Sequencing revealed no aneuploidies as a potential explanation of this phenomenon. While diploidization has been recurrently observed during experimental evolution with budding yeast (Aleeza C. Gerstein & Otto, 2011; Aleeza C Gerstein, Chun, Grant, & Otto, 2006; Harari, Ram, Rappoport, Hadany, & Kupiec, 2018; Venkataram et al., 2016), reports of spontaneous haploidization events have been instead scarce. Given the difficulties introduced by the change of ploidy over the 1000 generations, we have excluded EVO14 from all our analyses.” (Lines 122-128)

      We believe that the most likely mechanism is that the strain sporulated to produce haploids that were fitter than their diploid parent, but because this event occurred in only one out of eight populations and the proposed explanation is pure speculation we have not included in the revised manuscript.

      Line 88 typo 'stains'.

      Fixed. Thank you.

      Reviewer #1 (Significance (Required)): **SECTION B - Significance** The novel aspect of this study is the combination of replication stress and genome architecture, but here the significance is limited by a lack of clear predictions on how these factors might interact. On the other hand, much of the manuscript is devoted to why adaptation might vary among genome architectures in general, but this long-standing and important question is not particularly well resolved by this experimental approach, which can't disentangle mutation and selection.

      Our belief is that quantitatively predicting how selection will change fitness is nearly impossible because we lack the detailed knowledge of population genetic parameters that apply to our experiments. Prediction is even harder if the goal is to identify which genes will fix adaptive mutations and understand how these mutations alter cellular phenotypes to increase fitness. Thus our approach is almost entirely empirical: we do experiments that alter interesting variables, collect data, and do our best to interpret them and suggest how the conclusions of individual experiments might generalize.

      The authors highlight the dichotomy when discussing the evolution of ploidy: "We suggest that... genome architecture affects two aspects of the mutations that produce adaptation: the frequency at which they occur and the selective advantage they confer" (line 399), but presenting this as a novel inference does not appropriately acknowledge prior research and discussion of these ideas; several relevant papers are cited by the authors in other contexts. It may be possible to recast these findings as a test of the role of genome architecture in adaptation generally, but the authors should clarify the limitations of experimental evolution and more fully consider the theory and data outlined in previous research. In particular, few studies can claim to directly compare mutation rates between genome architectures, and it is not obvious that the present study is an example of such.

      We have the disadvantage that the reviewer doesn’t identify the literature we fail to cite. To us the argument the reviewer quotes is self-evident. As we mention above, our goal was not to test either general or detailed predictions and the level at which we analyzed our experiment, especially demonstrating that mutations were causal and reconstructing them individually and in combination, is missing from previous work. Finally measuring mutation rates is supremely difficult: you either need good ways of following all possible forms of mutation, quantitatively and without selection, or you resort to selecting mutations with a particular phenotype and molecularly characterizing them, knowing that these assays may well give different ratios of the rates of different types of mutation at different loci. We do make and report one measure of mutation rate, the rate of synonymous mutation in protein coding genes, which we discuss above.

      Reviewer expertise: Evoutionary genetics; experimental evolution; mutation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): **Summary** This manuscript investigates the effect of an organism's genotype (or, as the authors call it, an organism's 'genome architecture') on evolutionary trajectories. For this, the authors use Saccharomyces cerevisiae strains that experience some form of replication stress due to specific gene deletions, and that further differ in ploidy and/or the type of gene(s) deleted. They find the same three functional modules (DNA replication, DNA damage checkpoint, sister chromatid cohesion) are affected across the 3 different genotypes tested; although the specific genes that are mutated varies. **Major comments** This is a solid and exceptionally eloquent paper, comprising a large body of work that is in general well presented. That said, I do have some suggestions and questions. At several points in the manuscript, the authors should perhaps be more careful in their wording and avoid to overgeneralize data without providing additional evidence for these claims.

      We thank the reviewer for their constructive review and address their request for more careful wording below.

      • Some key points of the study are not entirely clear to me; possibly because the study builds upon a previous study that was recently published in eLife. Anyhow, I think it would be useful to clarify the following points a bit more:

        • Why exactly was ctf4∆ chosen as a model for replication stress? What is the evidence that ctf4∆ is a good model for replication stress? Without including some evidence for this, it is unclear how well the findings in this study really can be generalized to replication stress (which is what the authors do now).

      We described the reasons for choosing CTF4 deletion to mimic DNA replication stress in our previous eLife paper, to which we refer at. Nevertheless, the reviewer is right in asking us not to assume that the reader will have read our previous work. Briefly: DNA replication stress is a term that is loosely defined as the combination of the defects in DNA metabolism and the cellular response to these defects in cells whose replication has been substantially perturbed (Macheret & Halazonetis, 2015). Established methods in the field to induce DNA replication stress consist of either pharmacological treatments or genetic perturbation. Pharmacological treatments include hydroxyurea, which target the ribonucleotide reductase and hence stalls forks as a result of dNTP depletion (Crabbé et al., 2010), or aphidicolin, which directly inhibits polymerases α, ε and δ (Vesela, Chroma, Turi, & Mistrik, 2017b; Wilhelm et al., 2019). For genetic perturbation, the conditional depletion of replicative polymerases (Zheng, Zhang, Wu, Mieczkowski, & Petes, 2016) is frequently used. These methods are incompatible with experimental evolution, as cells can mutate the targets of replication inhibitors or alter the expression of genes that have been reduced in expression or activity. Removing an important but non-essential component of the replication machinery avoids these problems. We chose CTF4 deletion as a manipulation that affected the coordination of events at the replication fork: in the absence of Ctf4, the polα-primase complex is no longer physically bound to the replicative helicase, and thus the polymerase’s abundance at the replisome decreases (Tanaka et al., 2009). This manipulation achieves the same effects as polymerase depletion and replisome stalling, producing a constitutive DNA replication stress that can only be overcome by mutations in other genes. Multiple studies have shown that ctf4**D cells display replication intermediates commonly associated to DNA replication stress, such as the accumulation of ssDNA gaps and reversed forks (Abe et al., 2018; Fumasoni, Zwicky, Vanoli, Lopes, & Branzei, 2015), fork stalling (Fumasoni & Murray, 2020), checkpoint activation (Poli et al., 2012; Tanaka et al., 2009) and altered chromosome metabolism (Kouprina et al., 1992).

      We now justify our choice of deleting CTF4 at line 74:

      “DNA replication stress is often induced with drugs or by reducing the level of DNA polymerases (Crabbé et al., 2010; Vesela, Chroma, Turi, & Mistrik, 2017a; Wilhelm et al., 2019; Zheng et al., 2016). To avoid evolving drug resistance or increased polymerase expression, which would rapidly overcome DNA replication stress,** we deleted the CTF4 gene, which encodes a non-essential subunit of the DNA replication machinery (the replisome) (Kouprina NYu, Pashina, Nikolaishwili, Tsouladze, & Larionov, 1988). Ctf4 is a homo-trimer that functions as a structural hub within the replisome (Villa et al., 2016; Yuan et al., 2019) by binding to the replicative DNA helicase, primase (the enzyme that makes the RNA primers that initiate DNA replication), and other accessory factors (Gambus et al., 2009; Samora et al., 2016; Simon et al., 2014; Villa et al., 2016). In the absence of Ctf4, the Pol**a-primase and other lagging strand processing factors are poorly recruited to the replisome (Samora et al., 2016; Tanaka et al., 2009; Villa et al., 2016), causing several characteristic features of DNA replication stress, such as accumulation of single strand DNA (ssDNA) gaps (Abe et al., 2018; Fumasoni et al., 2015), reversed and stalled forks (Fumasoni & Murray, 2020; Fumasoni et al., 2015), cell cycle checkpoint activation (Poli et al., 2012; Tanaka et al., 2009) and altered chromosome metabolism (Hanna, Kroll, Lundblad, & Spencer, 2001; Kouprina et al., 1992). As a consequence of these defects, ctf4**D cells have substantially reduced reproductive fitness (Fumasoni & Murray, 2020).**”

      Would the authors expect to see similar routes of adaptation if a 'genomic architecture' with a less severe/other replication defect would have been used? I realize the last question is perhaps difficult to address without actually doing the experiment (which I am not suggesting the authors should do); I just want to point out that perhaps some data should not be over-generalized.

      We share the reviewer’s interest in asking whether different forms of DNA replication stress would lead to the same results described, and we plan to rigorously investigate this question in a separate paper. We note that the careful comparison between different forms of DNA replication stress has never been made and that authors studying this phenomenon often rely on a single perturbation to induce DNA replication stress (Crabbé et al., 2010; Wilhelm et al., 2019; Zheng et al., 2016). We agree that such a comparison will be useful, but we believe (as indicated by the reviewer) it will require an amount of work that goes beyond the scope of our study. To avoid over-generalization, we are using now using “a form of DNA replication stress” in lines 33, 244, 401, 414 and 461, to make it clear that our conclusions (as opposed to inferences and speculations) are restricted to the response to a single example of replication stress.

      Likewise, why was RAD52 selected as the gene to delete to affect homologous recombination? I understand that it is a key gene, but on the flipside, absence of RAD52 affects multiple cellular pathways and (as the authors also observe in their populations) also results in increased mutation rates which might confound some of the results.

      We aimed to observe the largest deficiency in DNA recombination possible and therefore chose to delete RAD52 because of its many roles in different forms of homologous recombination (Pâques & Haber, 1999) . The choice of other genes, such as RAD51, would have inhibited canonical double strand break (DSB) repair, but allowed other mechanisms that can rescue stalled replication forks (Ait Saada, Lambert, & Carr, 2018), such as break induced replication (BIR) or single strand annealing (SSA) (Ira & Haber, 2002).

      Our position regarding the inevitable increase in mutations rates obtained while working with genome maintenance process has been instead elaborated in response to reviewer #1 above.

      A sentence describing our choice to delete RAD52 has now been included at line 86:

      “…as well as from haploids impaired in homologous recombination due to the deletion of RAD52 (Figure 1A), which encodes a conserved enzyme required for pairing homologous DNA sequences during recombination (Pâques & Haber, 1999). Because Rad52 is involved in different forms of homologous recombination, it’s absence produces the most severe recombination defects and thus allows us to achieve the largest recombination defect achievable with a single gene deletion (Symington, 2002)..”

      Related to the first comment, it is also unclear to me how well the system chosen by the authors is representative of the replication stress experienced by tumor cells (as briefly touched upon in the final section of the discussion). Are some of the homologs key oncogenes that drive carcinogenesis?

      We should have been clearer. Our goal was to argue that the lesions and responses produced by replication stress in tumor cells, such as stalled replication forks and checkpoint activation, were similar to those seen in yeast cells lacking Ctf4. We did not mean to imply removing Ctf4 from yeast cells had the same effects on cell proliferation and survival as inactivating tumor suppressors and activating proto-oncogenes have in mammalian cells. Despite the difference between direct (removing Ctf4) and indirect effects on DNA replication (tumor cells), the replication intermediates (ssDNA, stalled and reversed forks), the cell cycle defects (G2/M delay), the genetic instability (increased mutagenesis and chromosome loss) and chromosome dynamics (late replication zones and chromosome bridges) generated by the absence of Ctf4 are similar to those observed in oncogene-induced DNA replication stress in mammalian cells (Kotsantis, Petermann, & Boulton, 2018). We therefore believe our experiments reveal evolutionary responses to a constitutive DNA replication stress that resembles the replication stress seen in cancer cells. Nevertheless, we agree that the comparison with cancer evolution remains speculative and we therefore avoided mentioning cancer in the title our paper or our conclusions, and only discuss it in a speculative section of the discussion.

      We have modified this section of the discussion as follows (line 554):

      “While generated through a different mechanism (unrestrained proliferation, rather than replisome perturbation), oncogene induced DNA replication stress produces cellular consequences (Kotsantis et al., 2018) which are remarkably similar to those seen in the absence of Ctf4, such as the accumulation of ssDNA, stalled and reversed forks (Abe et al., 2018; Fumasoni & Murray, 2020; Fumasoni et al., 2015), genetic instability (Fumasoni et al., 2015; Hanna et al., 2001; Kouprina et al., 1992) and DNA damage response activation (Poli et al., 2012; Tanaka et al., 2009). Based on these similarities we speculate that evolutionary adaptation to DNA replication stress could reduce its negative effects on cellular fitness and thus assist tumor evolution.”

      The authors should consider rephrasing some sentences regarding the occurrence of adaptive mutations. Sentences such as 'which genes are mutated depends on the selective advantage' (p1; lines 15-16); 'genome architecture controls the frequency at which mutations occur' (p15), "genome architecture controls which genes are mutated" (p1, line 20) makes it sound like the initial occurrence of mutations is not random, whereas in reality, the mutational landscape is the result of the combined effect of occurrence and fitness effect of the mutations, with the later rather than the former likely being the main driver behind the observed patterns.

      We thank the reviewer for asking for more precision in the above sentences, whose proposed changes we now list:

      “Mutations in individual genes are selected at different frequencies in different architectures, but the benefits these mutations confer are similar in all three architectures, and combinations of these mutations reproduce the fitness gains of evolved populations.” (Lines 13-15)

      “Genome architecture influences the distribution of adaptive mutants” (Line 277)

      "genome architecture influences the frequency at which mutations occur, the fitness benefit they confer, and the extent of overall adaptation." (Lines 462-463)

      Some important methodological information is missing or unclear in the manuscript:

      The authors should provide more details on how they decided which clones to select for sequencing. Did they select the biggest colonies; were colonies picked randomly, ...

      This following sentence is now reported in the materials and methods section (Line 603)

      “To capture the within-population genetic variability we selected the clones displaying the largest divergence of phenotypes in terms of resistance to genotoxic agents (methyl-methanesulfonate, hydroxyurea and camptothecin).”

      What is the population size during the evolution experiment?

      We now added the following sentence at line 599:

      “In this regime, the effective population size is calculated as N0 x g where N0 is the size of the population bottleneck at transfer and g is the number of generations achieved during a batch growth cycle and corresponds to approximately to 107 cells.”

      Sequencing of populations and clones: coverage should be mentioned

      The following sentence has now been added at line 616:

      “Clones and populations were sequenced at approximately the following depths: 25-30X for haploid clones, 50-60X for diploid clones, 50-60X for haploid populations and 120-130X for diploid populations.”

      Identification of mutations (p19, line 573): Is this really how the authors defined whether a variant is a mutation? Based on the definition given here, DNA mutations that lead to a synonymous mutation in the protein are not considered as mutations?

      We apologize for this typo. We do identify and consider synonymous mutations as evidenced by Figure 3-S1B. Now the sentence at line 626 correctly reports:

      “A variant that occurs between the ancestor and an evolved strain is labeled as a mutation if it either (1) causes a substitution in a coding sequence or (2) occurs in a regulatory region, defined as the 500 bp upstream and downstream of the coding sequence.”

      Perhaps the information can be found elsewhere, but the source data excel files for mutations is incomplete and should at the very least contain information on the type of mutation (eg. T->A), as well as the location of this mutation in the respective gene.

      Perhaps the reviewer is referring to Supplementary table 2, where we list the number of times a gene has been mutated in different populations (and thus summaries different types of mutations affecting the same gene). The information they request is reported in Supplementary table 1 for all the variants detected in populations and clones sequencing.

      **Minor comments** • While the author already cite several significant papers relevant for their manuscript, some other studies could also be included:

      We thank the reviewer for highlighting these references, which are now cited at line 28

      From the text in the abstract, it is unclear what the three genomic architectures (line 13) exactly are, the authors should consider spelling this out.

      In repose o the reviewer request for clarity we now propose the following change in line 13:

      “We asked how these trajectories depend on a population’s genome architecture by comparing the adaptation of haploids to that diploids and recombination deficient haploids.” (Lines 9-11)

      Can the authors speculate on why a homozygous ctf4D/ctf4D rad52D/rad52D would be lethal, and a haploid not?

      See below

      The authors note that a diploid ctf4D/ctf4D strain is less fit than its haploid counterpart. Why do the authors think this is the case?

      In response to the two previous questions, we now propose the following speculations that we include in the text (Line 97):

      “Diploid cells require twice as many forks as haploids and Ctf4-deficient diploids are thus more likely to have forks that cause severe cell-cycle delays or cell lethality. We speculate that this increased probability explains the more prominent fitness defect displayed by diploid cells. Interestingly, homologs of Ctf4 are absent in prokaryotes, where the primase is physically linked to the replicative helicase (Lu, Ratnakar, Mohanty, & Bastia, 1996) and Ctf4 is essential in the cells of eukaryotes with larger genomes such as chickens (Abe et al., 2018) and humans (Yoshizawa-Sugata & Masai, 2009). Rad52 is likely involved in rescuing stalled replication forks by recombination-dependent mechanisms (Fumasoni et al., 2015; Yeeles, Poli, Marians, & Pasero, 2013). We speculate that the absence of Rad52 increases the duration of these stalls and leads some of them to become double-stranded breaks resulting in cell lethality and explaining the decreased fitness of ctf4D rad52D haploid double mutants. In diploids ctf4D rad52D cells, which have twice as many chromosomes, the number of irreparably stalled fork may be sufficient to kill most of the cells in a population, thus explaining the unviability of the strain.”

      The authors passage their cells for 100 cycles and assume that this corresponds to around 1000 generations for each population. However, the fitness differences between the different starting strains (see also Figure 1B) are likely to cause considerable differences in number of generations between the different strains. Do the authors have more precise measurements of number of generations per population? If not, perhaps it should be noted that some lineages may have undergone more doublings than others, and perhaps also discuss if and how this could influence the results?

      In a batch culture regime, where populations are allowed to reach saturation after each dilution, the number of generations at each passage are dictated by the dilution factor (Van den Bergh, Swings, Fauvart, & Michiels, 2018). A dilution of 1:1000 from a saturated culture will allow for approximately 10 generations before populations reach a new saturated phase. As long as saturation is allowed to occur, this number is independent of the fitness of the cultured strains: Slower-dividing strains will simply employ more time to reach saturation after each dilution. At the beginning of the experiment, we had to dilute the ctf4D rad52**D strains being passaged every 48hrs instead of 24hrs. After generation 50, ctf4D rad52**D strains reached saturation within 24hrs and were then diluted daily. The total count considers the number of passages a culture has undergone, and not the number of days of culture, and thus should guarantee approximately the same number of generations in all three genome architectures.

      Panel A of figure 1A is somewhat confusing; as this seems to indicate that the ctf4∆ was introduced after strains were made, for example, haploid recombination deficient (which is not how these strains were constructed). Perhaps a better way of representing would be to have the indication of DNA replication stress pictured inside the yeast cells.

      We have modified Figure 1A to better represent the way the strains were constructed. For space reasons we have not represented a perturbed fork within each cell, but rather above all of them.

      Legend to Figure 1: is fitness expressed relative to haploid or diploid WT cells for the diploid strains?

      We apologize for having missed this detail in the figure legends. Throughout the figures, haploid and diploid cells were competed against reference strains with the same ploidy. We now add this sentence in Figure 1 and in the materials and methods (line 686).

      Figure 3: to improve readability of this figure, the authors could consider placing the legend of the different symbols (#, *,..) in the figure as well and not just in the figure legend.

      We now include the symbols legend in Figure 3.

      Figure 5 shows Indels, but if I am correct, these mutations are not discussed in the text; nor is it mentioned what the authors used as a cut-off to determine indels (the authors use the term 'small indels' without defining it)? For example, the data shown in Figure 3 and Figure 4 only includes SNPs and not indels (correct?) - but the indels should also be taken into account when investigating which modules are hit.

      Gapped alignments of the relatively long 150 paired-end reads in our data set permits the identification of small indels ranging in size from 1–55 bp using VarScan pileup2indels tool (Koboldt et al., 2012). All small indels (and the respective sequence affected) are listed together with SNPs in Supplementary table 1. Figure 3A, Figure 4 and Figure 5B are representation of ‘gene mutations’ which include both SNPs and small InDels. Large chromosomal Insertion and deletions, not detectable by short read gap alignment are instead identified using the VarScan pileup2copynumber tool (Koboldt et al., 2012), and are represented as amplifications or deletions in Figure 3B and 5C.

      The following sentence has been added to the material and methods at line 629:

      “Gapped alignments of the 150 paired-end reads in our data set permits the identification of small indels ranging in size from 1–55 bp using VarScan pileup2indels tool (Koboldt et al., 2012). All small indels (and the respective sequence affected) are listed together with SNPs in Supplementary table 1.”

      The following definition has been added in Figure legends 3A, 4 and 5A and B.

      “Gene mutations (SNPs and small InDels 1-55bp)”

      Figure 5 mentions: # gene mutations. So these are only the mutations in genes, and not in their up- or downstream regulatory regions?

      We use a broader definition of a gene, not restricted to the open reading frame, and including its regulatory regions. The following definition has been added to figure 5’s legend.

      “Frequency of SNPs and small InDels (1-55bp) affecting genes (Open reading frames and associated regulatory regions).”

      Figure 3-S1: labels of C panels are missing.

      Labels are now included in Figure 3-S1

      Figure 3-S1, panel B: why did the authors focus on synonymous mutations?

      The panel B is commented upon in line 186 and contrasted with panel A to argue that the increased number of mutations detected in ctf4∆ rad52∆ strains is due to a higher mutation rate(which is expected to increase synonymous mutations) instead of an higher number of adaptive mutations (which are less likely to be synonymous) being selected.

      Reviewer #2 (Significance (Required)): This is a solid and clearly written study, comprising a large body of work that is generally well presented and that will be of interest to scientists active in the field of (experimental) evolution and replication. However, many aspects studied in this manuscript have already been studied and reported before; including the recent eLife paper by the same group, as well as studies by other labs that have investigated how genome architecture / genotype affects evolutionary trajectories, the effect of ploidy on evolution, .... Because of this, I do feel that the authors should put their findings more in the context of existing literature context, including a general description of which results are truly novel, which confirm previous findings and which results seem to go against previous reports. This is already so at some points in the text, but I feel this could be done even more.

      We now rephrase the following paragraphs in our discussion to better highlight the main conclusions in contrast to the existing literature:

      “Engineering one mutation in each module into an ancestral strain lacking Ctf4 is enough to produce the evolved fitness increase in all three genomic architectures. Furthermore, engineering mutations in individual genes confer benefits in all three architectures (Fig. 6A) ,even in those where the mutations in these genes was rare, and combining these mutations recapitulated the evolved fitness increase in all three architectures (Fig. 6B). Altogether our results demonstrate the existence of a common pathway for yeast cells to adapt to a form of constitutive DNA replication stress.” (Lines 409-414)

      “Our results thus go against the trend of slower adaptation in diploids as compared to haploids reported by the majority other studies (A. C. Gerstein, Cleathero, Mandegar, & Otto, 2011; Marad, Buskirk, & Lang, 2018; Zeyl, Vanderford, & Carter, 2003). This effect is not limited to populations experiencing DNA replication stress (Figure 2A) but is also present in control wild-type populations (Figure 2B). Our results support the idea that the details of genotypes, selections, and experimental protocols can determine the effect of ploidy on adaptation.” (Lines 437-442)

      “Our results therefore agree with previous reports observing declining adaptability across strains with different initial fitness but largely fail to observe diminishing return epistasis as a potential justification of this phenomenon. Our experiments and two previous evolutionary repair experiments (Hsieh et al., 2020; Laan et al., 2015) both show interactions that are approximately additive between different selected mutations. The reasons for this difference are currently unknown.” (Lines 450-455)

      Additionally, I think the authors should be more careful not to over-generalize their findings, which come from only a few specific genetic manipulations that might not be representative for general replication stress. For example (p15), can the authors really claim that they have unraveled general principles of adaptation to constitutive DNA replication stress? Perhaps a better motivation of the choice of ctf4 as a model mutation for DNA replication stress could also help (see also my earlier comments). A similar comment applies to the molecular mechanisms affecting adaptation in diploid cells - what evidence do the authors have that their findings are not specific to the one specific type of diploid strain they used in their study? Adding a bit more background information or nuance for some of the claims would help tackle this issue.

      We now followed the suggestions made previously by the reviewer to justify our experimental choices better and to use a language that avoids over-generalizations.

      Field of expertise of this reviewer: genetics, evolution, genomics

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): **Summary:** Here the authors carry out an evolution experiment, propagating replicate populations of the budding yeast with the CTF 4 gene deleted in three different genetic backgrounds: haploid , diploid and recombination deficient (RAD52 deletion). The authors find that the rate of evolution depends on the initial fitness of the different genetic backgrounds which is consistent with a repeated finding of evolution experiments: that beneficial mutations tend to have a smaller fitness effect in high fitness genetic backgrounds. Curiously even though the targets of selection tended to be specific to each of the three different genetic backgrounds, genetic reconstruction experiments showed beneficial mutations convert a fitness increase in all genetics backgrounds. The authors go on to provide a plausible explanation for why each of the three genetic backgrounds are predisposed to certain types of beneficial mutations. Overall, these results provide important context and caveats for an emerging consensus that genetic background determines the rate of evolution, a comprehensive molecular breakdown of adaptation to DNA replication stress and a mechanistic explanation for why different beneficial mutations are favoured in diploids, haploids and recombination deficient strains. This is a well-executed study that is beautifully presented and easy to follow. This will be of great interest to those in the experimental evolution community and the data an excellent resource.

      We thank reviewer #3 for emphasizing that reconstructed mutations are beneficial even in architectures where they were not ultimately detected at the end of the experiment. We have now highlighted this point in our conclusions as a response to the reviewer’s #1 and #2 request for more clarity regarding our novel findings.

      “We find that the genes that acquire adaptive mutations, the frequency at which they are mutated, and the frequency at which these mutations are selected all differ between architectures but that mutations that confer strong benefits can occur in all three modules in each architecture. Engineering one mutation in each module into an ancestral strain lacking Ctf4 is enough to produce the evolved fitness increase in all three genomic architectures. Furthermore, reconstruction of a panel of mutations into all three architectures proved they are adaptive even in architectures where the affected genes were not found significantly mutated by the end of the experiment. Altogether our results demonstrate the existence of a common pathway for yeast cells to adapt to a form of constitutive DNA replication stress.” (Lines 405-414)

      **Major comments:**

      • Are the key conclusions convincing? Yes, the convergent evolution analysis, fitness assays, and genetic reconstructions are sufficient to characterise the genetic causes of adaptation in this experiment, and are of the highest standard. The authors do particularly well to fully recover the fitness increases that evolved with their genetic reconstructions, which imparts a completeness to their understanding of what happened in their evolution experiment.
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? No, in nearly all cases the authors make reasonable claims. One exception is on L419 in the discussion, where the authors speculate why some mutations do not follow diminishing returns epistasis, but this idea does not really have any basis (no citation or reasons to suggest that DNA repair genes are less connected with other genes in the genome). If the authors cannot support this statement, it should be removed, and instead write that is currently unknown why some individual mutations do not follow the pattern of diminishing returns.

      On reflection, we agree with the reviewer and now state,

      “Our results confirm previous reports observing declining adaptability across strains with different initial fitness but largely fail to observe diminishing return epistasis as a potential justification of this phenomenon. Our experiments and two previous evolutionary repair experiments (Hsieh et al., 2020; Laan et al., 2015) both show interactions that are approximately additive between different selected mutations. The reasons for this difference are currently unknown.

      A hypothesis, which would need experimental validation, could be that the different mutations have different degrees of epistatic interactions with the rest of the genome. Ixr1, whose mutation follows diminishing return epistasis, is a transcription factor that could in principle affect the expression of many other genes implicated in different cellular modules. Sld5, Scc2 and Rad9 instead, whose mutations have the same effect across different genome architectures, having more mechanistic roles in genome maintenance may have strong epistatic interactions only with a restricted number of cellular modules implicated with DNA metabolism.

      • Would additional experiments be essential to support the claims of the paper? No.
        • Are the data and the methods presented in such a way that they can be reproduced? Yes, but some more details are needed for the convergent evolution analysis, see minor comments.
        • Are the experiments adequately replicated and statistical analysis adequate? Yes, but some more statistic reporting in the main text or figure legends would be helpful, for example. L159: Please report the statistical test, test statistic and p value in the text or in the figure legend. Currently significance is indicated, but the methods do not specify the test.

      We apologize for the lack of clarity in the main text. The test used for all fitness analysis was only reported in the materials and methods as follow:

      “The P-values reported in figures are the result of t-tests assuming unequal variances (Welch’s test)”

      We now include the test and the associated p-value in line 184, and write the above sentence in all the relevant figures.

      This should also be done for the GO analysis shown in figure 3A.

      We thank reviewer #3 pointing out this omission. We now include the following section:

      “Gene ontology (GO) enrichment analysis:

      The list of genes with putatively selected mutations (Figure 3A) or homozygous mutations in diploids (Figure 4) were input as ‘multiple proteins’ in the STRING database, which reports on the network of interactions between the input genes (https://string-db.org). The GO term enrichment analysis provided by STRING are reported in Supplementary Table 3 and Supplementary Table 6 respectively. Briefly, the strength of the enrichment is calculated as Log10(O/E), where O is the number of ‘observed’ genes in the provided list (of length N) which belong to the GO-term, and E is the number of ‘expected’ genes we would expect to find matching the GO-term providing a list of the same length N made of randomly picked genes. P-values are computed using a Hypergeometric test and corrected for multiple testing using the Benjamini-Hochberg procedure. The resulting P-values are represented as ‘False discovery rate’ in the supplementary tables and describe the significance of the GO terms enrichment (Franceschini et al., 2013).”

      **Minor comments:**

      • Specific experimental issues that are easily addressable. Not a new experiment, but extra details are required. The authors carried out both clone and whole population sequencing. For their convergent evolution analysis, what is the criteria for a mutation to be included- ie, does it need to be fixed, have attained a certain frequency? This is important- if the criteria were low (say 5%), it would be important to know whether gene A had fixed in 4 populations, while gene B had attained a frequency of 10% in 5 populations. As it stands both would be described as examples of convergent evolution. This can be handled by providing these details in the methods.

      For the population sequencing we disregarded variants found at less than 25% and 35% of the reads in haploid and diploid populations respectively as we observed they were largely the product of alignment errors. All the variants found at frequencies higher than the thresholds indicated were used for the parallel evolution analysis. The frequency at which each individual variant was detected in each population is reported in Supplementary table 1, while the average frequency at which a gene has been found mutated across different populations is reported in Supplementary table 2. The reason why we didn’t solely focus on fixed mutations for our convergent evolution analysis was that from previous work we knew of the existence of clonal interference which kept the frequency of verified adaptive mutations that coexisted in the same population (e.g. ixr1 and sld5) well below 90% (Fumasoni & Murray, 2020).

      For clarity we now add the following sentence in the material and methods:

      “Variants found in less than 25% and 35% of the reads in haploid and diploid populations respectively were discarded, since many of these corresponded to misalignment of repeated regions. For clone sequencing, only variants found in more than 75% of the reads in haploids and 35% of the reads in diploids (to account for heterozygosity) were considered mutations. The frequency of the reads associated with all the variants detected are reported in Supplementary table 1”

      • Are prior studies referenced appropriately? I note that the authors use the term declining adaptability where as other papers use the term diminishing returns epistasis- I am sure the authors have good reasons for their choice of nomenclature but I think it would be helpful for their readers to connect this work to other work by mentioning that declining adaptability is also referred to as diminishing returns.

      We use both terms (for instance in line 446 and line 448) with a different meaning : By ‘declining adaptability’ we refer the phenomenon where more fit strains display lower adaptation rates than less fit ones. By ‘diminishing returns epistasis’ we refer to a possible explanation of such a phenomenon, where adaptive mutations have different fitness effects due to their ‘global’ epistatic interactions with other alleles. It has to be noted that ‘diminishing returns epistasis’ is not the only proposed explanation of the phenomenon of declining adaptability (Couce & Tenaillon, 2015). In our case, we do find evidence of declining adaptability but very limited evidence for diminishing return epistasis (only 1 mutation in 5 has a different fitness effect in different architectures).

      A reference the authors have missed: L419, as well as citing the Desai Lab bioxive paper, they should cite another theory paper that obtained similar conclusions. Lyons, D.M., et al. https://doi.org/10.1038/s41559-020-01286-y.

      We thank the reviewer for the suggested reference, which is now cited at line 450.

      • Are the text and figures clear and accurate? This paper is beautifully written and easy to follow, a lot of thought has gone into the figures which are aesthetically pleasing and easy to navigate.

        • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? No.

        **Typos**

        L32 "do" should be "to" L95 analyzed L219 are the authors referring to ref 15 here? I think so, but please specify

      We thank the reviewer for carefully finding the typos, which are now all corrected.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. This paper is an important conceptual result and an immediate advance for basic research. The authors have done an outstanding job of showing the potential for the clinical translation of this research, especially regarding cancer biology.
      • Place the work in the context of the existing literature (provide references, where appropriate). This study follows up on and builds upon an earlier paper by these same authors published in E-life in 2020. Conceptually this work is most closely related to work in Michael Desai's, Sergey Kryazhimskiy's, Tim Coopers and Chris Marx's labs work looking at diminishing returns epistasis in yeast, and studies contrasting evolution of haploids and diploids led by Greg Lang's and Sarah Otto's labs.
      • State what audience might be interested in and influenced by the reported findings. This work will be of great interest to the Experimental evolution and molecular evolution communities and also of interest to those who study cancer genomics and DNA replication and repair.
      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Microbial experimental evolution.

      REFERENCES CITED IN THE REVIEW RESPONSE

      Abe, T., Kawasumi, R., Giannattasio, M., Dusi, S., Yoshimoto, Y., Miyata, K., … Branzei, D. (2018). AND-1 fork protection function prevents fork resection and is essential for proliferation. Nature Communications, 9(1), 3091. https://doi.org/10.1038/s41467-018-05586-7

      Ait Saada, A., Lambert, S. A. E., & Carr, A. M. (2018, November 1). Preserving replication fork integrity and competence via the homologous recombination pathway. DNA Repair. Elsevier B.V. https://doi.org/10.1016/j.dnarep.2018.08.017

      Couce, A., & Tenaillon, O. A. (2015). The rule of declining adaptability in microbial evolution experiments. Frontiers in Genetics, 6(MAR), 99. https://doi.org/10.3389/fgene.2015.00099

      Crabbé, L., Thomas, A., Pantesco, V., De Vos, J., Pasero, P., & Lengronne, A. (2010). Analysis of replication profiles reveals key role of RFC-Ctf18 in yeast replication stress response. Nature Structural & Molecular Biology, 17(11), 1391–1397. https://doi.org/10.1038/nsmb.1932

      Desai, M. M., Fisher, D. S., & Murray, A. W. (2007). The speed of evolution and maintenance of variation in asexual populations. Current Biology : CB, 17(5), 385–394. https://doi.org/10.1016/j.cub.2007.01.072

      Deutschbauer, A. M., Jaramillo, D. F., Proctor, M., Kumm, J., Hillenmeyer, M. E., Davis, R. W., … Giaever, G. (2005). Mechanisms of Haploinsufficiency Revealed by Genome-Wide Profiling in Yeast. Genetics, 169(4), 1915–1925. https://doi.org/10.1534/GENETICS.104.036871

      Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., … Jensen, L. J. (2013). STRING v9.1: Protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Research, 41(D1), D808. https://doi.org/10.1093/nar/gks1094

      Fumasoni, M., & Murray, A. W. (2020). The evolutionary plasticity of chromosome metabolism allows adaptation to constitutive DNA replication stress. ELife, 9. https://doi.org/10.7554/eLife.51963

      Fumasoni, M., Zwicky, K., Vanoli, F., Lopes, M., & Branzei, D. (2015). Error-Free DNA Damage Tolerance and Sister Chromatid Proximity during DNA Replication Rely on the Polα/Primase/Ctf4 Complex. Molecular Cell, 57(5), 812–823. https://doi.org/10.1016/j.molcel.2014.12.038

      Gambus, A., van Deursen, F., Polychronopoulos, D., Foltman, M., Jones, R. C., Edmondson, R. D., … Labib, K. (2009). A key role for Ctf4 in coupling the MCM2-7 helicase to DNA polymerase alpha within the eukaryotic replisome. EMBO J., 28(19), 2992–3004. https://doi.org/10.1038/emboj.2009.226

      Gerstein, A. C., Cleathero, L. A., Mandegar, M. A., & Otto, S. P. (2011). Haploids adapt faster than diploids across a range of environments. Journal of Evolutionary Biology, 24(3), 531–540. https://doi.org/10.1111/j.1420-9101.2010.02188.x

      Gerstein, Aleeza C., & Otto, S. P. (2011). Cryptic fitness advantage: Diploids invade haploid populations despite lacking any apparent advantage as measured by standard fitness assays. PLoS ONE, 6(12), 26599. https://doi.org/10.1371/journal.pone.0026599

      Gerstein, Aleeza C, Chun, H.-J. E., Grant, A., & Otto, S. P. (2006). Genomic Convergence toward Diploidy in Saccharomyces cerevisiae. PLoS Genetics, 2(9), e145. https://doi.org/10.1371/journal.pgen.0020145

      Hanna, J. S., Kroll, E. S., Lundblad, V., & Spencer, F. a. (2001). Saccharomyces cerevisiae CTF18 and CTF4 are required for sister chromatid cohesion. Mol Cell Biol., 21(9), 3144–3158. https://doi.org/10.1128/MCB.21.9.3144-3158.2001

      Harari, Y., Ram, Y., Rappoport, N., Hadany, L., & Kupiec, M. (2018). Spontaneous Changes in Ploidy Are Common in Yeast. Current Biology, 28(6), 825-835.e4. https://doi.org/10.1016/j.cub.2018.01.062

      Hsieh, Y. Y. P., Makrantoni, V., Robertson, D., Marston, A. L., & Murray, A. W. (2020). Evolutionary repair: Changes in multiple functional modules allow meiotic cohesin to support mitosis. PLoS Biology, 18(3), e3000635. https://doi.org/10.1371/journal.pbio.3000635

      Ira, G., & Haber, J. E. (2002). Characterization of RAD51-independent break-induced replication that acts preferentially with short homologous sequences. Molecular and Cellular Biology, 22(18), 6384–6392. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/12192038

      Koboldt, D. C., Zhang, Q., Larson, D. E., Shen, D., McLellan, M. D., Lin, L., … Wilson, R. K. (2012). VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Research, 22(3), 568–576. https://doi.org/10.1101/gr.129684.111

      Kotsantis, P., Petermann, E., & Boulton, S. J. (2018, May 1). Mechanisms of oncogene-induced replication stress: Jigsaw falling into place. Cancer Discovery. American Association for Cancer Research Inc. https://doi.org/10.1158/2159-8290.CD-17-1461

      Kouprina, N., Kroll, E., Bannikov, V., Bliskovsky, V., Gizatullin, R., Kirillov, A., … et al. (1992). CTF4 (CHL15) mutants exhibit defective DNA metabolism in the yeast Saccharomyces cerevisiae. Mol Cell Biol., 12(12), 5736–5747. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/1341195

      Kouprina NYu, Pashina, O. B., Nikolaishwili, N. T., Tsouladze, A. M., & Larionov, V. L. (1988). Genetic control of chromosome stability in the yeast Saccharomyces cerevisiae. Yeast (Chichester, England), 4(4), 257–269. https://doi.org/10.1002/yea.320040404

      Kryazhimskiy, S., Rice, D. P., Jerison, E. R., & Desai, M. M. (2014). Microbial evolution. Global epistasis makes adaptation predictable despite sequence-level stochasticity. Science (New York, N.Y.), 344(6191), 1519–1522. https://doi.org/10.1126/science.1250939

      Laan, L., Koschwanez, J. H., & Murray, A. W. (2015). Evolutionary adaptation after crippling cell polarization follows reproducible trajectories. ELife, 4, e09638. https://doi.org/10.7554/eLife.09638

      Lu, Y. B., Ratnakar, P. V, Mohanty, B. K., & Bastia, D. (1996). Direct physical interaction between DnaG primase and DnaB helicase of Escherichia coli is necessary for optimal synthesis of primer RNA. Proc. Natl. Acad. Sci. USA, 93(23), 12902–12907. https://doi.org/10.1073/pnas.93.23.12902

      Macheret, M., & Halazonetis, T. D. (2015). DNA replication stress as a hallmark of cancer. Annu. Rev. Pathol. Mech. Dis., 10, 425–448. https://doi.org/10.1146/annurev-pathol-012414-040424

      Marad, D. A., Buskirk, S. W., & Lang, G. I. (2018). Altered access to beneficial mutations slows adaptation and biases fixed mutations in diploids. Nature Ecology & Evolution, 2(5), 882–889. https://doi.org/10.1038/s41559-018-0503-9

      Pâques, F., & Haber, J. E. (1999). Multiple pathways of recombination induced by double-strand breaks in Saccharomyces cerevisiae. Microbiology and Molecular Biology Reviews : MMBR, 63(2), 349–404. https://doi.org/

      Poli, J., Tsaponina, O., Crabbé, L., Keszthelyi, A., Pantesco, V., Chabes, A., … Rothstein, R. (2012). dNTP pools determine fork progression and origin usage under replication stress. EMBO J., 31(4), 883–894. https://doi.org/10.1038/emboj.2011.470

      Samora, C. P., Saksouk, J., Goswami, P., Wade, B. O., Singleton, M. R., Bates, P. A., … Masai, H. (2016). Ctf4 Links DNA Replication with Sister Chromatid Cohesion Establishment by Recruiting the Chl1 Helicase to the Replisome. Molecular Cell, 63(3), 371–384. https://doi.org/10.1016/j.molcel.2016.05.036

      Simon, A. C., Zhou, J. C., Perera, R. L., van Deursen, F., Evrin, C., Ivanova, M. E., … Pellegrini, L. (2014). A Ctf4 trimer couples the CMG helicase to DNA polymerase alpha in the eukaryotic replisome. Nature, 510(7504), 293–297. https://doi.org/10.1038/nature13234

      Symington, L. S. (2002). Role of RAD52 Epistasis Group Genes in Homologous Recombination and Double-Strand Break Repair. Microbiology and Molecular Biology Reviews, 66(4), 630–670. https://doi.org/10.1128/mmbr.66.4.630-670.2002

      Tanaka, H., Katou, Y., Yagura, M., Saitoh, K., Itoh, T., Araki, H., … Shirahige, K. (2009). Ctf4 coordinates the progression of helicase and DNA polymerase alpha. Genes to Cells, 14(7), 807–820. https://doi.org/10.1111/j.1365-2443.2009.01310.x

      Thompson, D. A., Desai, M. M., & Murray, A. W. (2006). Ploidy controls the success of mutators and nature of mutations during budding yeast evolution. Current Biology : CB, 16(16), 1581–1590. https://doi.org/10.1016/j.cub.2006.06.070

      Van den Bergh, B., Swings, T., Fauvart, M., & Michiels, J. (2018). Experimental Design, Population Dynamics, and Diversity in Microbial Experimental Evolution. Microbiol Mol Biol Rev., 82(3), e00008-18. https://doi.org/10.1128/MMBR.00008-18

      Venkataram, S., Dunn, B., Li, Y., Agarwala, A., Chang, J., Ebel, E. R., … Petrov, D. A. (2016). Development of a Comprehensive Genotype-to-Fitness Map of Adaptation-Driving Mutations in Yeast. Cell, 166(6), 1585-1596.e22. https://doi.org/10.1016/J.CELL.2016.08.002

      Vesela, E., Chroma, K., Turi, Z., & Mistrik, M. (2017a). Common Chemical Inductors of Replication Stress: Focus on Cell‐Based Studies. Biomolecules, 7(1), 19. https://doi.org/10.3390/biom7010019

      Vesela, E., Chroma, K., Turi, Z., & Mistrik, M. (2017b, February 21). Common chemical inductors of replication stress: Focus on cell-based studies. Biomolecules. MDPI AG. https://doi.org/10.3390/biom7010019

      Villa, F., Simon, A. C., Ortiz Bazan, M. A., Kilkenny, M. L., Wirthensohn, D., Wightman, M., … Dutta, A. (2016). Ctf4 Is a Hub in the Eukaryotic Replisome that Links Multiple CIP-Box Proteins to the CMG Helicase. Molecular Cell, 63(3), 385–396. https://doi.org/10.1016/j.molcel.2016.06.009

      Wilhelm, T., Olziersky, A. M., Harry, D., De Sousa, F., Vassal, H., Eskat, A., & Meraldi, P. (2019). Mild replication stress causes chromosome mis-segregation via premature centriole disengagement. Nature Communications, 10(1), 1–14. https://doi.org/10.1038/s41467-019-11584-0

      Yeeles, J. T., Poli, J., Marians, K. J., & Pasero, P. (2013). Rescuing stalled or damaged replication forks. Cold Spring Harbor Perspectives in Biology, 5(5), a012815. https://doi.org/10.1101/cshperspect.a012815

      Yoshizawa-Sugata, N., & Masai, H. (2009). Roles of human AND-1 in chromosome transactions in S phase. J Biol Chem., 284(31), 20718–20728. https://doi.org/10.1074/jbc.M806711200

      Yuan, Z., Georgescu, R., Santos, R. de L. A., Zhang, D., Bai, L., Yao, N. Y., … Li, H. (2019). Ctf4 organizes sister replisomes and Pol α into a replication factory. ELife, 8, e47405. https://doi.org/10.7554/eLife.47405

      Zeyl, C., Vanderford, T., & Carter, M. (2003). An Evolutionary Advantage of Haploidy in Large Yeast Populations. Science, 299(5606), 555–558. https://doi.org/10.1126/SCIENCE.1078417

      Zheng, D.-Q., Zhang, K., Wu, X.-C., Mieczkowski, P. A., & Petes, T. D. (2016). Global analysis of genomic instability caused by DNA replication stress in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A., 113(50), E8114–E8121. https://doi.org/10.1073/pnas.1618129113

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      This reviewer did not leave any comments

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Here the authors carry out an evolution experiment, propagating replicate populations of the budding yeast with the CTF 4 gene deleted in three different genetic backgrounds: haploid , diploid and recombination deficient (RAD52 deletion). The authors find that the rate of evolution depends on the initial fitness of the different genetic backgrounds which is consistent with a repeated finding of evolution experiments: that beneficial mutations tend to have a smaller fitness effect in high fitness genetic backgrounds. Curiously even though the targets of selection tended to be specific to each of the three different genetic backgrounds, genetic reconstruction experiments showed beneficial mutations convert a fitness increase in all genetics backgrounds. The authors go on to provide a plausible explanation for why each of the three genetic backgrounds are predisposed to certain types of beneficial mutations. Overall, these results provide important context and caveats for an emerging consensus that genetic background determines the rate of evolution, a comprehensive molecular breakdown of adaptation to DNA replication stress and a mechanistic explanation for why different beneficial mutations are favoured in diploids, haploids and recombination deficient strains. This is a well-executed study that is beautifully presented and easy to follow. This will be of great interest to those in the experimental evolution community and the data an excellent resource.

      Major comments:

      • Are the key conclusions convincing? Yes, the convergent evolution analysis, fitness assays, and genetic reconstructions are sufficient to characterise the genetic causes of adaptation in this experiment, and are of the highest standard. The authors do particularly well to fully recover the fitness increases that evolved with their genetic reconstructions, which imparts a completeness to their understanding of what happened in their evolution experiment.
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? No, in nearly all cases the authors make reasonable claims. One exception is on L419 in the discussion, where the authors speculate why some mutations do not follow diminishing returns epistasis, but this idea does not really have any basis (no citation or reasons to suggest that DNA repair genes are less connected with other genes in the genome). If the authors cannot support this statement, it should be removed, and instead write that is currently unknown why some individual mutations do not follow the pattern of diminishing returns.
      • Would additional experiments be essential to support the claims of the paper? No.
      • Are the data and the methods presented in such a way that they can be reproduced? Yes, but some more details are needed for the convergent evolution analysis, see minor comments.
      • Are the experiments adequately replicated and statistical analysis adequate? Yes, but some more statistic reporting in the main text or figure legends would be helpful, for example. L159: Please report the statistical test, test statistic and p value in the text or in the figure legend. Currently significance is indicated, but the methods do not specify the test. This should also be done for the GO analysis shown in figure 3A.

      Minor comments:

      • Specific experimental issues that are easily addressable. Not a new experiment, but extra details are required. The authors carried out both clone and whole population sequencing. For their convergent evolution analysis, what is the criteria for a mutation to be included- ie, does it need to be fixed, have attained a certain frequency? This is important- if the criteria were low (say 5%), it would be important to know whether gene A had fixed in 4 populations, while gene B had attained a frequency of 10% in 5 populations. As it stands both would be described as examples of convergent evolution. This can be handled by providing these details in the methods.
      • Are prior studies referenced appropriately? I note that the authors use the term declining adaptability where as other papers use the term diminishing returns epistasis- I am sure the authors have good reasons for their choice of nomenclature but I think it would be helpful for their readers to connect this work to other work by mentioning that declining adaptability is also referred to as diminishing returns.

      A reference the authors have missed: L419, as well as citing the Desai Lab bioxive paper, they should cite another theory paper that obtained similar conclusions. Lyons, D.M., et al. https://doi.org/10.1038/s41559-020-01286-y. .

      • Are the text and figures clear and accurate? This paper is beautifully written and easy to follow, a lot of thought has gone into the figures which are aesthetically pleasing and easy to navigate.
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? No.

      Typos

      L32 "do" should be "to" L95 analyzed<br> L219 are the authors referring to ref 15 here? I think so, but please specify

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. This paper is an important conceptual result and an immediate advance for basic research. The authors have done an outstanding job of showing the potential for the clinical translation of this research, especially regarding cancer biology.
        • Place the work in the context of the existing literature (provide references, where appropriate). This study follows up on and builds upon an earlier paper by these same authors published in E-life in 2020. Conceptually this work is most closely related to work in Michael Desai's, Sergey Kryazhimskiy's, Tim Coopers and Chris Marx's labs work looking at diminishing returns epistasis in yeast, and studies contrasting evolution of haploids and diploids led by Greg Lang's and Sarah Otto's labs.
        • State what audience might be interested in and influenced by the reported findings. This work will be of great interest to the Experimental evolution and molecular evolution communities and also of interest to those who study cancer genomics and DNA replication and repair.
        • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Microbial experimental evolution.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This manuscript investigates the effect of an organism's genotype (or, as the authors call it, an organism's 'genome architecture') on evolutionary trajectories. For this, the authors use Saccharomyces cerevisiae strains that experience some form of replication stress due to specific gene deletions, and that further differ in ploidy and/or the type of gene(s) deleted. They find the same three functional modules (DNA replication, DNA damage checkpoint, sister chromatid cohesion) are affected across the 3 different genotypes tested; although the specific genes that are mutated varies.

      Major comments

      This is a solid and exceptionally eloquent paper, comprising a large body of work that is in general well presented. That said, I do have some suggestions and questions. At several points in the manuscript, the authors should perhaps be more careful in their wording and avoid to overgeneralize data without providing additional evidence for these claims.

      • Some key points of the study are not entirely clear to me; possibly because the study builds upon a previous study that was recently published in eLife. Anyhow, I think it would be useful to clarify the following points a bit more:

      • Why exactly was ctf4∆ chosen as a model for replication stress? What is the evidence that ctf4∆ is a good model for replication stress? Without including some evidence for this, it is unclear how well the findings in this study really can be generalized to replication stress (which is what the authors do now). Would the authors expect to see similar routes of adaptation if a 'genomic architecture' with a less severe/other replication defect would have been used? I realize the last question is perhaps difficult to address without actually doing the experiment (which I am not suggesting the authors should do); I just want to point out that perhaps some data should not be over-generalized.

      • Likewise, why was RAD52 selected as the gene to delete to affect homologous recombination? I understand that it is a key gene, but on the flipside, absence of RAD52 affects multiple cellular pathways and (as the authors also observe in their populations) also results in increased mutation rates which might confound some of the results.

      • Related to the first comment, it is also unclear to me how well the system chosen by the authors is representative of the replication stress experienced by tumor cells (as briefly touched upon in the final section of the discussion). Are some of the homologs key oncogenes that drive carcinogenesis?

      • The authors should consider rephrasing some sentences regarding the occurrence of adaptive mutations. Sentences such as 'which genes are mutated depends on the selective advantage' (p1; lines 15-16); 'genome architecture controls the frequency at which mutations occur' (p15), "genome architecture controls which genes are mutated" (p1, line 20) makes it sound like the initial occurrence of mutations is not random, whereas in reality, the mutational landscape is the result of the combined effect of occurrence and fitness effect of the mutations, with the later rather than the former likely being the main driver behind the observed patterns.
      • Some important methodological information is missing or unclear in the manuscript:

      • The authors should provide more details on how they decided which clones to select for sequencing. Did they select the biggest colonies; were colonies picked randomly, ...

      • What is the population size during the evolution experiment?

      • Sequencing of populations and clones: coverage should be mentioned

      • Identification of mutations (p19, line 573): Is this really how the authors defined whether a variant is a mutation? Based on the definition given here, DNA mutations that lead to a synonymous mutation in the protein are not considered as mutations?

      • Perhaps the information can be found elsewhere, but the source data excel files for mutations is incomplete and should at the very least contain information on the type of mutation (eg. T->A), as well as the location of this mutation in the respective gene.

      Minor comments

      • While the author already cite several significant papers relevant for their manuscript, some other studies could also be included:

      • From the text in the abstract, it is unclear what the three genomic architectures (line 13) exactly are, the authors should consider spelling this out.

      • Can the authors speculate on why a homozygous ctf4/ctf4 rad52/rad52 would be lethal, and a haploid not?

      • The authors note that a diploid ctf4/ctf4 strain is less fit than its haploid counterpart. Why do the authors think this is the case?

      • The authors passage their cells for 100 cycles and assume that this corresponds to around 1000 generations for each population. However, the fitness differences between the different starting strains (see also Figure 1B) are likely to cause considerable differences in number of generations between the different strains. Do the authors have more precise measurements of number of generations per population? If not, perhaps it should be noted that some lineages may have undergone more doublings than others, and perhaps also discuss if and how this could influence the results?

      • Panel A of figure 1A is somewhat confusing; as this seems to indicate that the ctf4∆ was introduced after strains were made, for example, haploid recombination deficient (which is not how these strains were constructed). Perhaps a better way of representing would be to have the indication of DNA replication stress pictured inside the yeast cells.

      • Legend to Figure 1: is fitness expressed relative to haploid or diploid WT cells for the diploid strains?

      • Figure 3: to improve readability of this figure, the authors could consider placing the legend of the different symbols (#, *,..) in the figure as well and not just in the figure legend.

      • Figure 5 shows Indels, but if I am correct, these mutations are not discussed in the text; nor is it mentioned what the authors used as a cut-off to determine indels (the authors use the term 'small indels' without defining it)? For example, the data shown in Figure 3 and Figure 4 only includes SNPs and not indels (correct?) - but the indels should also be taken into account when investigating which modules are hit.

      • Figure 5 mentions: # gene mutations. So these are only the mutations in genes, and not in their up- or downstream regulatory regions?

      • Figure 3-S1: labels of C panels are missing.

      • Figure 3-S1, panel B: why did the authors focus on synonymous mutations?

      Significance

      This is a solid and clearly written study, comprising a large body of work that is generally well presented and that will be of interest to scientists active in the field of (experimental) evolution and replication.

      However, many aspects studied in this manuscript have already been studied and reported before; including the recent eLife paper by the same group, as well as studies by other labs that have investigated how genome architecture / genotype affects evolutionary trajectories, the effect of ploidy on evolution, .... Because of this, I do feel that the authors should put their findings more in the context of existing literature context, including a general description of which results are truly novel, which confirm previous findings and which results seem to go against previous reports. This is already so at some points in the text, but I feel this could be done even more.

      Additionally, I think the authors should be more careful not to over-generalize their findings, which come from only a few specific genetic manipulations that might not be representative for general replication stress. For example (p15), can the authors really claim that they have unraveled general principles of adaptation to constitutive DNA replication stress? Perhaps a better motivation of the choice of ctf4 as a model mutation for DNA replication stress could also help (see also my earlier comments). A similar comment applies to the molecular mechanisms affecting adaptation in diploid cells - what evidence do the authors have that their findings are not specific to the one specific type of diploid strain they used in their study? Adding a bit more background information or nuance for some of the claims would help tackle this issue.

      Field of expertise of this reviewer: genetics, evolution, genomics

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their previous work the authors examined adaptation in response to replication stress in haploid yeast, via experimental evolution of batch cultures followed by sequencing. Here they extend this approach to include diploid and recombination-deficient strains to explore the role of genome architecture in evolution under replication stress. On the whole, a common set of functional modules are found to evolve under all genetic architectures. The authors discuss the molecular details of adaptation and use their findings to speculate on the determinants of adaptation rate.

      SECTION A - Evidence, reproducibility and clarity

      Experimental evolution can reveal adaptive pathways, but there are some challenges when applying this approach to compare genetic backgrounds or environments. They key challenge is that adaptation potentially depends on both the rate of mutation and the nature of selection. Distinct adaptation patterns between groups could therefore reflect differential mutation, selection, or both. The authors allude to this dichotomy but have very limited data to address it. The closest effort is engineering putatively-adaptive variants into all genetic background including those where they did not arise; the fact that such variants remain beneficial suggests they did not arise in certain backgrounds because of a lower mutation rate, but this is a difficult issue to tackle quantitatively.

      From mutation accumulation experiments, where the influence of selection is minimized, there is evidence that genetic architecture affects the rate and spectrum of spontaneous mutations. In this experiment, the allele used to eliminate recombination, rad52, will also increase the mutation rate generally. The diploid strain is also likely to have a distinct mutational profile--as a null expectation diploids should have twice the mutation rate of haploids. Recent evidence indicates the mutation rate difference between haploid and diploid yeast might be less than two-fold, but that there are additional differences in the mutation spectrum, including rates of structural change. The context for this study is therefore three genetic architectures likely to differ in multiple dimensions of their mutation profiles, but mutation rates are not measured directly.

      The nature of selection on haploids and diploids is expected to differ because of dominance, but ploidy-specific selection is also possible. The authors discuss how recessive beneficial alleles may be less available to diploids, though this can be offset by relatively rapid loss of heterozygosity. However, diploids should also incur more mutations, all else being equal. The rate of beneficial mutation, as opposed to the rate of mutation generally, will depend on the mutational "target size" of fitness, and the authors findings recapitulate other literature (particularly regarding "compensatory" adaptation) that points to faster adaptation in genotypes with lower starting fitness.

      There is ample literature on the above topics, particularly discussions of the evolutionary advantages of haploidy versus diploidy. While adaptation to replication stress provides a novel starting point for this investigation, much of the manuscript is devoted to long-standing questions that are not specific to replication stress. Unfortunately, the data the authors collected is not sufficient to shed light on these questions, because mutation and selection cannot be effectively distinguished. The Discussion states that "We find that the genes that acquire adaptive mutations, the frequency at which they are mutated, and the frequency at which these mutations are selected all differ between architectures but that mutations that confer strong benefits always lie in the same three modules" (line 379), but it is not clear that these statements are all supported by the data.

      Focusing on the more novel aspect of their experiment-the presence of replication stress-would arguably be a better approach. On this topic the authors have some interesting observations and speculation, but clear predictions are lacking. The introduction section could be redesigned to explicitly state why genome architecture might affect adaptation in response to replication stress in particular, rather than (or in addition to) adaptation generally. If there were no differences in mutation, does the nature of Ctf4 lead to predictions that the molecular basis of compensatory adaptation should differ among genome architectures? Without such predictions it will be difficult for readers to know whether the observation that different genome architectures follow similar adaptive paths is surprising or not.

      Minor comments:

      Shifts in ploidy from diploid to haploid are less common than the reverse change, so the observation of such a shift (Fig. 1) should be discussed in more detail.

      Line 88 typo 'stains'.

      Significance

      SECTION B - Significance

      The novel aspect of this study is the combination of replication stress and genome architecture, but here the significance is limited by a lack of clear predictions on how these factors might interact. On the other hand, much of the manuscript is devoted to why adaptation might vary among genome architectures in general, but this long-standing and important question is not particularly well resolved by this experimental approach, which can't disentangle mutation and selection.

      The authors highlight the dichotomy when discussing the evolution of ploidy: "We suggest that... genome architecture affects two aspects of the mutations that produce adaptation: the frequency at which they occur and the selective advantage they confer" (line 399), but presenting this as a novel inference does not appropriately acknowledge prior research and discussion of these ideas; several relevant papers are cited by the authors in other contexts. It may be possible to recast these findings as a test of the role of genome architecture in adaptation generally, but the authors should clarify the limitations of experimental evolution and more fully consider the theory and data outlined in previous research. In particular, few studies can claim to directly compare mutation rates between genome architectures, and it is not obvious that the present study is an example of such.

      Reviewer expertise: Evoutionary genetics; experimental evolution; mutation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers

      We thank the reviewers for their careful reading of our manuscript and their valuable suggestions and comments. To address the reviewers’ concerns and improve our manuscript, we will complete the additional experiments and further revise the text as described below.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2. The authors present an in vivo analysis of pdzd8 (CG10362) and a synthetic ER-mitochondria tether in the regulation of locomotor activity, lifespan, and mitochondrial turnover of Drosophila melanogaster, using basic bioinformatics, RNAi, SPLICS, imaging and microscopies observations (i. e. TEM, SIM), fly lines, and a representative AD fly disease model, etc. The research methodologies were detailed in good order. The model system employed was suitable to address the research topic. The manuscript was written in a clear language and statistical analysis were correctly applied.

      **Major comments:**

      *-Are the key conclusions convincing?*

      Yes. The results/conclusions are logical and provide an overview of Pdzd8 in the regulation of mitochondrial quality control and neuronal homeostasis.

      *-Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.*

      No. Experiments were generally well performed, and all the data support the conclusions.

      *-Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.*

      No suggested experiments needed.

      *-Are the data and the methods presented in such a way that they can be reproduced?*

      Yes. The authors have followed proper experimental design and methods have been described in sufficient detail.

      *-Are the experiments adequately replicated and statistical analysis adequate?*

      Yes, they are.

      **Minor comments:**

      *-Specific experimental issues that are easily addressable.*

      No comment.

      *-Are prior studies referenced appropriately?*

      Yes. The relevant literatures have been cited appropriately.

      *-Are the text and figures clear and accurate?*

      1.Please pay attention to the correct spelling of the described protein name (Pdzd8) and gene name (should be in 'italic') throughout the manuscript, i. e. line 36, 98, and 556, etc.

      As this is the first published characterization of the fly homolog of the mammalian Pdzd8 We have decided to name the fly protein pdzd8, using the lower case “p” to distinguish it from the mammalian protein. We have checked and corrected our use of italics for the gene name as noted in track changes.

      2.In figure 1C and its figure legend, please state what the numbers "201" and "195" stand for.

      We have added the text “numbers on bars indicate number of mitochondria analysed” to the figure legend.

      3.Your data needs to be converted the lowercase letter "x" to math symbol "×" when representing times sign, i. e. line 523, 5x, etc.

      Corrected

      *-Do you have suggestions that would help the authors improve the presentation of their data and conclusions?*

      No comment.

      Reviewer #1 (Significance (Required)):

      *-Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.*

      Discoveries from this study include 1) characterization of the tethering protein Pdzd8 in Drosophila melanogaster, and 2) shed light on a possible way on how to enhance mitochondrial quality control and to help promote healthy aging of neurons by manipulating MERCs.

      *-Place the work in the context of the existing literature (provide references, where appropriate).*

      With this manuscript, the authors present a straightforward but sound piece of scientific research, with the intent to illustrate the consequences of neuronal depletion of pdzd8 in Drosophila melanogaster. Since Pdzd8 plays specific functions in ER-mitochondrial tethering complexes and dysregulations of MERCs are damaging to neurons, this protein represents a good potential target. In this context the characterization of Pdzd8 should represent an interesting starting point. To this purpose, the gene was knockdown and the tether construct was recombinantly produced. The fly lines were then subjected to analysis both at the organismal and at the cellular level.

      *-State what audience might be interested in and influenced by the reported findings.* Audience might include those who are in the field of neuroscience and pharmaceutical, and benefit from an awareness of this research.

      *-Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.*

      Key words in my field of expertise: Ageing, neurodegenerative diseases, Alzheimer's disease, mitophagy, NAD+, neuroprotection. My group is investigating the molecular mechanisms of ageing and age-related neurodegeneration (especially AD) using cross-species model systems, ranging from human brain samples, iPSCs, C. elegans, Drosophila melanogaster, and mice, therefore I have sufficient expertise to evaluate this paper.

      **Referees Cross-commenting**

      To this reviewer the key novelty of this paper was the study of the regulation of the mitochondrial-ER contact sites (MERCs) in life and health. The data indicate that MERCs mediated by the tethering protein pdzd8 play a critical role in the regulation of mitochondrial homeostasis, neuronal function, and lifespan. In a transitional perspective, this reviewer would ask to check whether this mechanism conserves in rodents or not (e.g. to to memory in the AD mice and to run lifespan in mitochondrial toxin condition). This may be to much. But will depend on the standard of the journal. We thank the reviewer for their input, evaluation and interest. We too are keen to know whether this mechanism is conserved and hope to investigate this in our ongoing work including characterizing a mouse mutant, but the current work already represents a substantial investment of resources and a worthy study in its own right as the first description of the in vivo role of pdzd8, so we feel it is beyond the scope of the current work.

      Reviewer #2

      (Evidence, reproducibility and clarity (Required)):

      Hewitt et al. describe and characterize for the first time the ortholog of pdzd8 in Drosophila melanogaster. In accordance with pdzd8's previously described function as a member of mitochondrial-ER contact sites (MERCs) the authors show reduced MERCs upon RNAi mediated depletion of pdzd8 via TEM, SIM and a split-GFP based contact site sensor. Pdzd8 depletion results in the increased life span as well as improved locomotor activity in aging flies while increase of MERCs with a synthetic tether accelerates the age-related declines in survival and locomotion. Moreover, pdzd8 depleted flies are more resistant against mitochondrial toxins. The authors correlate these protective effects of pdzd8 knockdown with an increase in mitophagy using a mitophagy sensor and describe a rescue of locomotor defects in an Alzheimer disease fly model by pdzd8 depletion.

      **Major comments:**

      1.The authors quantify the number of MERCs in thin sections of TEM (Fig 1B and C). It would add to the paper if the authors would show a representative reconstruction of the quantified somata, as a 3D reconstruction would visualize ER-Mito contacts more reliable than thin sections.

      We agree that the 3D reconstruction of TEM images would provide a satisfying addition to the current analyses, however such advanced techniques are not readily available. The current samples used to collect these data cannot be used to generate 3D reconstructions. To counter this, we have used three independent methods to analyse the changes in MERCs, all of which show a decrease in MERCs in the flies with less pdzd8 supporting that these observations are reproducible and robust.

      2.The authors quantify MERCs in pdzd8 KD also by SIM (Fig1F, G). However, they quantify the number of MERCs in epidermal cells while they also show SIM images of larval neurons (Fig S1D). For consistency and to support their claim of MERC reduction in neurons, we ask the authors to include the quantification based on larval neurons especially as the authors show that pdzd8 is predominantly expressed in the CNS.

      Unfortunately, the soma of larval neurons have extremely limited cytosol (see fig. S1D) which creates very challenging conditions to discern the spatial separation of ER and mitochondria by light microscopy. While co-localisation of organelle markers in such cells has been reported in the literature, we are extremely concerned that the lack of space within the cytosol renders such analysis unreliable. However, we will attempt to quantify the extent of co-localisation of the ER and mitochondria in these cells. In contrast, epidermal cells are much larger providing greater spatial separation of ER and mitochondria. Notably, we complement the co-localisation analysis of epidermal cells with two additional approaches, TEM analysis and the SPLICS reporter construct, to demonstrate pdzd8-RNAi results in decreased MERCs specifically in neurons.

      3.The authors describe a decreased NMJ volume in Fig 4G. It would improve and complete the functional characterization of pdzd8 in flies if the authors can provide further data whether pdzd8 KD causes a general synaptic defect. Can the authors show morphological synaptic defects in the existing TEM data of the adult brain or provide additional ERG recordings, which would elucidate the functional consequences of pdzd8 depletion in the CNS?

      Our TEM data are not suitable for us to properly analyse defects in synaptic morphology as our images centered around the cell bodies where the organelle morphology was easiest to distinguish and there are very few synapses. While it is not surprising that the knockdown of pdzd8 has some detrimental effects, we chose to focus our efforts on trying to determine the cause of the protective effect on locomotor activity in aged flies rather than to exhaustively characterise the myriad phenomena which may be impacted as a knock-on effect of the disrupted cell biology that we have demonstrated. We hope to further explore the detrimental functional consequences of pdzd8 depletion on such phenomena as neurotransmission in future work.

      1. Hewitt et al. suggest a beneficial effect of increased turnover of mitochondria for healthy aging. To convince readers we would like to ask the following:

      a) This claim is based on their observation of increased mitophagy in pdzd8 depleted flies using one reporter (Fig 5). Can the authors support their data with an alternative method as this is one of the key claims of the manuscript?

      The mitoQC tool is well established in the field and we have found it to perform better but consistent with mito-Keima (Lee et al. 2018 JCB doi: 10.1083/jcb.201801044). We would be happy to consider other assays if the reviewer can suggest an unbiased and established alternative.

      b) An increased turnover of Mitochondria would also suggest that there are more "young" mitochondria present in the pdzd8 KD neurons. Can the authors experimentally address that?

      We understand the reviewer’s point here but due to the continual fission and fusion, as well as piecemeal turnover of mitochondria (see Vincow et al. 2019 Autophagy doi: 10.1080/15548627.2019.1586258), the concept of ‘young’ versus ‘old’ mitochondria is misplaced. The mitochondrial network essentially exists as a milieu of components which are produced and degraded at different rates.

      c)Furthermore, we would like to ask the authors to use also the MERC tether as control in the mitophagy assay. This would allow further conclusions about the role of the mitophagy, its protective effect during aging and the role of MERCs in this process.

      We remind the reviewer that this MERC tether is constructed from an RFP with N- and C-terminal tethering peptides. The presence of this RFP prevents the proper analysis of the mitoQC mCherry signal. However, given the dramatic phenotypes we think that it is unlikely that a decrease in mitophagy alone can explain the detrimental effects of increased tethering.

      1. In Fig6 A,B the authors should include also the pdzd8 KD to support their claim that the rescue of climbing defects correlates with an reduction of MERCs.

      We thank the reviewer for this suggestion and we will perform this experiment.

      Moreover, it would be beneficial for their final conclusion, if the authors could show that increases mitophagy in the background of Ab42 expressing flies.

      We thank the reviewer for this suggestion and we will perform this experiment.

      **Minor comments:**

      1.Can the authors add to the figure legend of Fig 1F how the ER and Mitochondria were labeled?

      We have added the constructs to the figure legend (full genotypes for all figures are given in Table S2).

      2.Error bars should be added in the quantification of MERCs in Fig1C.

      The MERCs are quantified in three brains per genotype but as there were variable numbers of sections suitable for imaging from each brain the total values are combined to give a single percentage.

      3.A reference to Supplementary Fig S1D is missing in the main text.

      This figure is referenced in line 135

      4.Can the authors label the individual genotypes in Fig S3C and 4F?

      Figure labels and legends have been modified to clarify this.

      5.Can the author specify which brain region they imaged in Fig 5C?

      The regions imaged and quantified were chosen for their clear organelle morphology rather than targeting a specific brain region. All images were from the protocerebrum and the methods and figure legends have been updated to note this.

      6.Are the ATP levels normalized to ADP in Fig S3D? Can the authors specify in the figure and figure legend to what ATP was normalized?

      Figure labels and legends have been modified to clarify the ATP levels are normalised to total protein quantification of the samples.

      7.Please sort the supplementary figures in accordance to their reference order in the text.

      We thank the reviewer for checking this. This figure order will be rechecked in the final version as addressing reviewer comments is likely to lead to further changes.

      Reviewer #2 (Significance (Required)):

      The authors present here novel insights about the functional role of a new member of the MERCs, pdzd8, using RNAi mediated depletion and Drosophila melanogaster as a model system. As MERCs receive more attention especially in the context of their potential role in neurological diseases, the author's manuscript will be of high interest to the scientific community. The in vivo model combined with multiple different technical approaches add to the significance of the paper. There are some controls and additional experiments that are required to support the author's main claims and complete the functional characterization of pdzd8 (see major comments).

      Field of expertise: neuroscience, fly genetics, neurodegeneration.

      Reviewer #3

      (Evidence, reproducibility and clarity (Required)):

      This manuscript entitled "Decreasing pdzd8-mediated mitochondrial-ER contacts in neurons improves fitness by increasing mitophagy" by Hewitt and collaborators describes the role of the Drosophila ortholog of PDZD8 in ER-mitochondria contacts in neurons and the physiological consequence of pdzd8 loss. The authors show that ER-mitochondria contacts are reduced in fly neurons expressing a pdzd8-RNAi construct. Decreasing pdzd8 expression in neurons was accompanied by a slowed age-associated decline in locomotor activity, and an increased lifespan. In presence of mitochondrial toxins, neurons deficient for pdzd8 were protected. Finally, the authors showed that pdzd8 silencing increased mitophagy in aged neurons, and protected against neurodegeneration in a model of Alzheimer's disease.

      **Major points:**

      1)There are important controls that are missing. RNAi expression often affects off-target genes which could unfortunately modify the observed phenotypes. The authors should verify that a) the phenotypes observed by RNAi-mediated pdzd8 silencing can be rescued by the expression of an RNAi-insensitive pdzd8 construct (the authors should verify the rescue of the most crucial phenotypes described in the manuscript); b) the RNAi-LacZ-line that they use as control in the paper does not behave differently from a WT line, which could be induced by an off-target effect of the RNAi-LacZ (again with the most crucial phenotypes).

      While the Drosophila community is fortunate to have a plethora of readily available tools for interrogating the function of nearly all genes in the genome – tools which form the foundation of most work in Drosophila labs worldwide – the availability is not limitless. In this instance, the transgenic RNAi line generated as a resource for the community comprises a 500 bp hairpin, computed to be the most selective target for that gene. Being a 500 bp sequence it is unrealistic to be able to establish an RNAi-resistant variant that still faithfully functions as normal. Nevertheless, although imperfect we show in Figure S3B that pdzd8-RNAi rescues the climbing defect produced by overexpressing pdzd8, providing evidence the construct is specifically acting on this sequence.

      Similarly, the availability of ‘control’ RNAi reagents is generous but still limited. This LacZ-RNAi line is one of a few well-established controls that has provided a cornerstone reference for a wealth of studies. Nevertheless, we will provide experimental data that aged climbing of nSyb>LacZ-RNAi is highly comparable to several other well-established control genotypes.

      2) Did the author analyzed their EM data in a blinded-way to minimize subjective bias? This type of analysis is complicated by the manual annotation of ultrastructures, which is by nature subjective. For instance, this reviewer would have annotated the two mitochondria in the middle of Fig 1B, right as "Mitochondria with ER contact", as there is a membrane tube present at the interface of these two organelles.

      The EM data were analysed blinded to the genotypes. This is noted in the methods section.

      3) There is a controversy in the field on the role of PDZD8: some papers show its involvement in ER-mitochondria contacts, others in ER-lysosome contacts. The authors should discuss this point in more details. Moreover, the authors should localize the protein in Drosophila neurons; is the protein associated with mitochondria or endo/lysosomes?

      We recognize that there is some debate in the field over the localization and role of PDZD8. However, since there is currently no antibody against the Drosophila protein and the sequence is sufficiently divergent such that antibodies against the mammalian protein will not recognize the fly protein, we are not well-positioned to determine the localization of Drosophila pdzd8. Consequently, we will expand our discussion to reflect the differing views.

      We can offer instead to quantify the localization of mouse PDZD8 in our newly generated NIH-3T3 Pdzd8-Halo knock in line to help resolve the controversy regarding the location(s) and function(s) of mammalian Pdzd8.

      4) The authors should specify in more details how the different quantifications were performed. For instance Fig 1G: how many samples were quantified (i.e. how many flies, and how many neurons); what is compared? Fields-of-view, neurons, flies...?

      Further details have been added to the figure legends 1G (now H), 4I, 5 and Fig S2.

      **Minor point:**

      1)Could the authors show the SIM images Fig1F together with the binarized images.

      These images have been added to Figure 1 and the legend and text updated accordingly.

      2) It is surprising to see that data otherwise similar are represented with so many different types of graph (For instance Fig 5, bar graph, box-plot, violin plot). Why individual data points are not always present on the graphs?

      The graphs will be redrawn using more consistent representations once the data for the revisions has been gathered.

      3) The way that data are presented is sometimes odd: for instance, line 101, the authors wrote "To establish that MERCs were decreased...". This would imply that they knew the result before performing the experiment. And later, line 103 "Accordingly...".

      These sentences have been rephrased “To determine whether MERCs were decreased..” and “These results showed the…”

      Reviewer #3 (Significance (Required)):

      This study about the role of pdzd8 is timely. The functional description of inter-organelle contacts is a hot topic in cell biology. There are several recent reports describing the identification of pdzd8 role in inter-organelle contact formation. This manuscript provides data on the role of pdzd8 in a whole organism and expands our understanding of this protein.

      My expertise: inter-organelle contacts (human cells)

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript entitled "Decreasing pdzd8-mediated mitochondrial-ER contacts in neurons improves fitness by increasing mitophagy" by Hewitt and collaborators describes the role of the Drosophila ortholog of PDZD8 in ER-mitochondria contacts in neurons and the physiological consequence of pdzd8 loss. The authors show that ER-mitochondria contacts are reduced in fly neurons expressing a pdzd8-RNAi construct. Decreasing pdzd8 expression in neurons was accompanied by a slowed age-associated decline in locomotor activity, and an increased lifespan. In presence of mitochondrial toxins, neurons deficient for pdzd8 were protected. Finally, the authors showed that pdzd8 silencing increased mitophagy in aged neurons, and protected against neurodegeneration in a model of Alzheimer's disease.

      Major points:

      1)There are important controls that are missing. RNAi expression often affects off-target genes which could unfortunately modify the observed phenotypes. The authors should verify that a) the phenotypes observed by RNAi-mediated pdzd8 silencing can be rescued by the expression of an RNAi-insensitive pdzd8 construct (the authors should verify the rescue of the most crucial phenotypes described in the manuscript); b) the RNAi-LacZ-line that they use as control in the paper does not behave differently from a WT line, which could be induced by an off-target effect of the RNAi-LacZ (again with the most crucial phenotypes).

      2)Did the author analyzed their EM data in a blinded-way to minimize subjective bias? This type of analysis is complicated by the manual annotation of ultrastructures, which is by nature subjective. For instance, this reviewer would have annotated the two mitochondria in the middle of Fig 1B, right as "Mitochondria with ER contact", as there is a membrane tube present at the interface of these two organelles.

      3)There is a controversy in the field on the role of PDZD8: some papers show its involvement in ER-mitochondria contacts, others in ER-lysosome contacts. The authors should discuss this point in more details. Moreover, the authors should localize the protein in Drosophila neurons; is the protein associated with mitochondria or endo/lysosomes?

      4)The authors should specify in more details how the different quantifications were performed. For instance Fig 1G: how many samples were quantified (i.e. how many flies, and how many neurons); what is compared? Fields-of-view, neurons, flies...?

      Minor point:

      1)Could the authors show the SIM images Fig1F together with the binarized images.

      2)It is surprising to see that data otherwise similar are represented with so many different types of graph (For instance Fig 5, bar graph, box-plot, violin plot). Why individual data points are not always present on the graphs?

      3)The way that data are presented is sometimes odd: for instance, line 101, the authors wrote "To establish that MERCs were decreased...". This would imply that they knew the result before performing the experiment. And later, line 103 "Accordingly...".

      Significance

      This study about the role of pdzd8 is timely. The functional description of inter-organelle contacts is a hot topic in cell biology. There are several recent reports describing the identification of pdzd8 role in inter-organelle contact formation. This manuscript provides data on the role of pdzd8 in a whole organism and expands our understanding of this protein.

      My expertise: inter-organelle contacts (human cells)

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Hewitt et al. describe and characterize for the first time the ortholog of pdzd8 in Drosophila melanogaster. In accordance with pdzd8's previously described function as a member of mitochondrial-ER contact sites (MERCs) the authors show reduced MERCs upon RNAi mediated depletion of pdzd8 via TEM, SIM and a split-GFP based contact site sensor. Pdzd8 depletion results in the increased life span as well as improved locomotor activity in aging flies while increase of MERCs with a synthetic tether accelerates the age-related declines in survival and locomotion. Moreover, pdzd8 depleted flies are more resistant against mitochondrial toxins. The authors correlate these protective effects of pdzd8 knockdown with an increase in mitophagy using a mitophagy sensor and describe a rescue of locomotor defects in an Alzheimer disease fly model by pdzd8 depletion.

      Major comments:

      1.The authors quantify the number of MERCs in thin sections of TEM (Fig 1B and C). It would add to the paper if the authors would show a representative reconstruction of the quantified somata, as a 3D reconstruction would visualize ER-Mito contacts more reliable than thin sections.

      2.The authors quantify MERCs in pdzd8 KD also by SIM (Fig1F, G). However, they quantify the number of MERCs in epidermal cells while they also show SIM images of larval neurons (Fig S1D). For consistency and to support their claim of MERC reduction in neurons, we ask the authors to include the quantification based on larval neurons especially as the authors show that pdzd8 is predominantly expressed in the CNS.

      3.The authors describe a decreased NMJ volume in Fig 4G. It would improve and complete the functional characterization of pdzd8 in flies if the authors can provide further data whether pdzd8 KD causes a general synaptic defect. Can the authors show morphological synaptic defects in the existing TEM data of the adult brain or provide additional ERG recordings, which would elucidate the functional consequences of pdzd8 depletion in the CNS?

      4.Hewitt et al. suggest a beneficial effect of increased turnover of mitochondria for healthy aging. To convince readers we would like to ask the following:

      a)This claim is based on their observation of increased mitophagy in pdzd8 depleted flies using one reporter (Fig 5). Can the authors support their data with an alternative method as this is one of the key claims of the manuscript?

      b)An increased turnover of Mitochondria would also suggest that there are more "young" mitochondria present in the pdzd8 KD neurons. Can the authors experimentally address that?

      c)Furthermore, we would like to ask the authors to use also the MERC tether as control in the mitophagy assay. This would allow further conclusions about the role of the mitophagy, its protective effect during aging and the role of MERCs in this process.

      5.In Fig6 A,B the authors should include also the pdzd8 KD to support their claim that the rescue of climbing defects correlates with an reduction of MERCs. Moreover, it would be beneficial for their final conclusion, if the authors could show that increases mitophagy in the background of Ab42 expressing flies.

      Minor comments:

      1.Can the authors add to the figure legend of Fig 1F how the ER and Mitochondria were labeled?

      2.Error bars should be added in the quantification of MERCs in Fig1C.

      3.A reference to Supplementary Fig S1D is missing in the main text.

      4.Can the authors label the individual genotypes in Fig S3C and 4F?

      5.Can the author specify which brain region they imaged in Fig 5C?

      6.Are the ATP levels normalized to ADP in Fig S3D? Can the authors specify in the figure and figure legend to what ATP was normalized?

      7.Please sort the supplementary figures in accordance to their reference order in the text.

      Significance

      The authors present here novel insights about the functional role of a new member of the MERCs, pdzd8, using RNAi mediated depletion and Drosophila melanogaster as a model system. As MERCs receive more attention especially in the context of their potential role in neurological diseases, the author's manuscript will be of high interest to the scientific community. The in vivo model combined with multiple different technical approaches add to the significance of the paper. There are some controls and additional experiments that are required to support the author's main claims and complete the functional characterization of pdzd8 (see major comments).

      Field of expertise: neuroscience, fly genetics, neurodegeneration.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2. The authors present an in vivo analysis of pdzd8 (CG10362) and a synthetic ER-mitochondria tether in the regulation of locomotor activity, lifespan, and mitochondrial turnover of Drosophila melanogaster, using basic bioinformatics, RNAi, SPLICS, imaging and microscopies observations (i. e. TEM, SIM), fly lines, and a representative AD fly disease model, etc. The research methodologies were detailed in good order. The model system employed was suitable to address the research topic. The manuscript was written in a clear language and statistical analysis were correctly applied.

      Major comments:

      -Are the key conclusions convincing?

      Yes. The results/conclusions are logical and provide an overview of Pdzd8 in the regulation of mitochondrial quality control and neuronal homeostasis.

      -Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      No. Experiments were generally well performed, and all the data support the conclusions.

      -Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      No suggested experiments needed.

      -Are the data and the methods presented in such a way that they can be reproduced?

      Yes. The authors have followed proper experimental design and methods have been described in sufficient detail.

      -Are the experiments adequately replicated and statistical analysis adequate?

      Yes, they are.

      Minor comments:

      -Specific experimental issues that are easily addressable.

      No comment.

      -Are prior studies referenced appropriately?

      Yes. The relevant literatures have been cited appropriately.

      -Are the text and figures clear and accurate?

      1.Please pay attention to the correct spelling of the described protein name (Pdzd8) and gene name (should be in 'italic') throughout the manuscript, i. e. line 36, 98, and 556, etc.

      2.In figure 1C and its figure legend, please state what the numbers "201" and "195" stand for.

      3.Your data needs to be converted the lowercase letter "x" to math symbol "×" when representing times sign, i. e. line 523, 5x, etc.

      -Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      No comment.

      Significance

      -Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Discoveries from this study include 1) characterization of the tethering protein Pdzd8 in Drosophila melanogaster, and 2) shed light on a possible way on how to enhance mitochondrial quality control and to help promote healthy aging of neurons by manipulating MERCs.

      -Place the work in the context of the existing literature (provide references, where appropriate).

      With this manuscript, the authors present a straightforward but sound piece of scientific research, with the intent to illustrate the consequences of neuronal depletion of pdzd8 in Drosophila melanogaster. Since Pdzd8 plays specific functions in ER-mitochondrial tethering complexes and dysregulations of MERCs are damaging to neurons, this protein represents a good potential target. In this context the characterization of Pdzd8 should represent an interesting starting point. To this purpose, the gene was knockdown and the tether construct was recombinantly produced. The fly lines were then subjected to analysis both at the organismal and at the cellular level.

      -State what audience might be interested in and influenced by the reported findings. Audience might include those who are in the field of neuroscience and pharmaceutical, and benefit from an awareness of this research.

      -Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Key words in my field of expertise: Ageing, neurodegenerative diseases, Alzheimer's disease, mitophagy, NAD+, neuroprotection. My group is investigating the molecular mechanisms of ageing and age-related neurodegeneration (especially AD) using cross-species model systems, ranging from human brain samples, iPSCs, C. elegans, Drosophila melanogaster, and mice, therefore I have sufficient expertise to evaluate this paper.

      Referees Cross-commenting

      To this reviewer the key novelty of this paper was the study of the regulation of the mitochondrial-ER contact sites (MERCs) in life and health. The data indicate that MERCs mediated by the tethering protein pdzd8 play a critical role in the regulation of mitochondrial homeostasis, neuronal function, and lifespan. In a transitional perspective, this reviewer would ask to check whether this mechanism conserves in rodents or not (e.g. to to memory in the AD mice and to run lifespan in mitochondrial toxin condition). This may be to much. But will depend on the standard of the journal.

  3. Jan 2021
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Overall:

      We thank the reviewers for their thoughtful comments and suggestions on how to improve the manuscript. We also thank the reviewers for describing the study as “highly significant,” “rigorous and reliable as described and can be reproduced by others,” and as “relevant to investigators working in the field of rickettsial diseases and to a broader audience studying mechanisms of intracellular parasitism and host responses.”

      In this revised manuscript we have addressed all the minor points raised by the reviewers. In regard to additional experiments, all three reviewers suggested that we perform histology of skin lesions, and in a revised manuscript we propose to thoroughly address this by performing histology at multiple time points in infected wild type and in interferon receptor-deficient mice. We will also attempt to use immunohistochemistry to identify the infected cell types in the skin and in internal organs. We will compare these findings to histology of human eschars. We feel that the reviewer comments support our contention that a manuscript containing these proposed additional experiments will be of strong significance in the field.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      “Rickettsial eschars are hallmarks of less severe spotted fever diseases. The underlying mechanisms involved in the formation of the eschar caused by pathogenic rickettsiae remains unknown. The authors of this manuscript studied this interesting research question by using Ifnar-/-Ifngr-/- mice and Sca2 or OmpB mutant of R. parkeri. R. parkeri probably is the best rickettsial species to study rickettsial eschar due to the clinical features of R. parkeri rickettsioses and the biosafety level required to work with it. The data presented in the manuscript are very promising. The conclusions are supported by the presented results. For the first time, this study recapitulated human eschar-like skin lesion observed in patients with R. parkeri rickettsioses in the mouse models. More interestingly, mice inoculated with Sca2 mutant of R. parkeri i.d. had less disseminated rickettsiae in tissues, which helps us to understand the mechanisms by which pathogenic rickettsiae cause systemic infection after the arthropod bite.”

      **Minor comments: **

      “1) Figure 2D, it looks likely the lethality of mice i.d. infection with R. parkeri is not dose dependent. For example, mice inoculated with 10^4 showed greater lethality compared to 10^7. The authors might want to explain it in the Discussion.”

      The reviewer is correct in observing that the lethality between different doses of R. parkeri in Ifnar-/-Ifngr-/-mice after intradermal infection is not dose dependent with the current number of mice used per group. We do not understand the reason for this, and more broadly we don’t understand the mechanism of lethality. We speculate that there could be a bottleneck; however, answering this question will require future investigations into the mechanisms of lethality that are beyond the scope of this study. To address the reviewer’s point, we now include this statement: “Degrees of lethality between different doses in Ifnar-/-Ifngr-/- mice were not significantly different from one another, and the cause of lethality in this model remains unclear.”

      “2) Line 202, innate immunity in vitro might need to be revised.”

      We agree that the previous description was vague. We changed the description to be more specific and it now reads: “…Sca2 does not significantly enhance the ability of R. parkeri to evade interferon-stimulated genes or inflammasomes in vitro.

      “3) It is unclear what is the unit of the inoculum in animal experiments, PFU?”

      Yes, it is PFU. We have now indicated this in the figure legends.

      “4) Line 36, in the study of "Reference 16", C3H/HeN mice, not B6 mice, were used.”

      We thank the reviewer for noticing this error and we have changed the text to C3H/HeN.

      “5) The conclusion on eschar will be greatly strengthened if histological analysis is included, particularly whether dermis necrosis is present or not.”

      In the revised manuscript we will perform histology on eschars in wild type and Ifnar-/-Ifngr-/- mice over time. We will also use immunohistochemistry to analyze the infected cell types and will compare this to data on human eschars. We agree that this will greatly strengthen our conclusions regarding the similarities between the mouse and human eschars.

      “6) Line 357, it is not clear what "spinfection" means.”

      We have changed this to “infection” for clarity.

      “Reviewer #1 (Significance (Required)): Several approaches employed in the study are new to the field of animal models of the rickettsioses. For example, fluorescent dextran was used to investigating the vascular damage in skin at the inoculation site; body temperature for mice infected with R. parkeri. Overall, the study is highly significant since it has answered the important questions in the research area of spotted fever rickettsioses and employed appropriate approaches. No major concerns were noticed.”

      We thank the reviewer for appreciating the significance of this work.

      **Referees cross commenting** I agree with other reviewers' comments. Thanks for the invite.”

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      “The manuscript utilizes a new model of spotted fever rickettsiosis. Using this model, the authors have determined that knockout of the sca2 or ompB gene attenuates Rickettsia parkeri, and vaccination with the attenuated rickettsiae provides protection against virulent challenge. However, the model is far less than ideal as it has eliminated important effectors of immunity.”

      We thank the reviewer for their comments and we hope to thoroughly address their concerns. In regard to the effects of interferons on long-lasting immunity to R. parkeri, we note to the reviewer that we observed that immunized Ifnar-/-Ifngr-/- mice were completely and robustly protected from rechallenge. No lethality and no loss of body weight or temperature was observed after a rechallenge dose of 10x the LD-50. These data reveal that interferons are dispensable for long-lasting immunity to R. parkeri in inbred mice and are not important effectors of adaptive immunity to R. parkeri. This is thus the first model that can be used to investigate the factors required for adaptive immunity to R. parkeri in mice.

      If the reviewer’s comment is not referring to long-lasting adaptive immunity to R. parkeri but is instead referring to the general concept of using immunocompromised mice as models, we note that immunocompromised mice are used as models for a variety of pathogens, including many Rickettsia species (reviewed in Osterloh, Med Microbiol Immunol 2017), and Ifnar-/-Ifngr-/- mice specifically are used as models for Zika and Dengue virus infections. Unlike many other immunocompromised mice, Ifnar-/-Ifngr-/- mice do not require maintenance on antibiotics and they have no noticeable differences to wild type mice in regard to breeding or growth.

      “Manuscript also fails to recognize that there is a Amblyomma maculatum tick transmitted model of Rickettsia parkeri infection that causes an eschar and disseminated pathology”

      In the previous version of the manuscript in lines 266-269 we cited and acknowledged the reported tick transmission model in non-human primates (Banajee et al., 2015). As also noted by Reviewer 3, our model with needle inoculation is significantly less time consuming and expensive than a tick transmission model. Moreover, needle inoculation makes it feasible to precisely measure the number of bacteria that are administered, which is not true with ticks. Lastly, the tick model was described in non-human primates, which are significantly more expensive than inbred mice and are not amenable to genetic manipulation. Thus, our model provides many significant advantages over the tick model in non-human primates, including cost, time, availability of genetic mutants, and reproducibility.

      “The model that they have used is inadequately characterized. The cutaneous lesion was not evaluated histologically to determine if it features the actual characteristics of an eschar.”

      We thank the reviewer for the suggestion and as a part of our revision plan, we propose to thoroughly analyze the lesion histologically.

      “Although bacteria were found in the liver and spleen, in which macrophages are significant target, there was no evaluation of the vital organs including lung and brain nor demonstration of the target cells or pathologic lesions.”

      In previous work from our lab (Engström et al., 2019), we found that lungs of wild type mice contained similar number of infectious R. parkeri as the spleen and liver after intravenous infection. Thus, in order to be able to process more samples quickly, we did not include lungs in the experiments described here. In unreported data, we also found that organs including the brain, kidneys, and heart had no/little recoverable PFUs. As a part of our revision plan, we propose to perform immunohistochemistry in the spleen, liver, lung, and skin to identify the infected cell types. Identifying the infected cell types will reveal if the same cell types are infected in our mouse model as in humans.

      “Unfortunately, the assay of vascular permeability was applied only to the inoculation site and not to the disseminated visceral organs such as lung and brain.”

      We have performed the vascular permeability assay using internal organs alongside the skin; however, little/no fluorescence was observed in any sample. We were unable to distinguish differences between control groups or between control and experimental groups in organs from mice that were treated and untreated with the fluorescent dextran. Thus, we were unfortunately not able to apply the described vascular damage assay to organs other than the skin. We now indicate this in the revised text.

      Reviewer #2 (Significance (Required)):

      “The authors all have misrepresented the eschar as a critically important lesion whereas the patients usually do not even know i's presence until they began to develop systemic symptoms and it is a detected by a physician examining the patient.”

      We did not intend to suggest that the eschar is either more or less critically important than other features of rickettsial disease. We simply described the eschar as a “hallmark feature” of eschar-associated rickettsiosis. Additionally, as the reviewer notes, patients report systemic symptoms, and our model elicits systemic disease by R. parkeri in mice. Thus, the model we describe recapitulates both an eschar and disseminated disease and is the first mouse model for R. parkeri that exhibits both of the disease manifestations mentioned by the reviewer.

      “On line 30 the authors state that mice are the natural reservoir of Rickettsia parkeri. The references cited describe the failure of acquisition by feeding ticks, meaning that it is not a true reservoir. The reference describing animals with antibodies merely indicates exposure to a spotted fever group Rickettsia not sufficient evidence of a role as a reservoir.”

      We thank the reviewer for making this important distinction and we have altered the text to read: “…small rodents including mice have been found as seropositive for R. parkeri in the wild.”

      “In response to the request for my expertise, I have contributed a large amount of data to understanding mechanisms of immunity to rickettsiae and have developed several useful animal models of Rickettsial diseases. I also have expertise on clinical aspects of spotted fever group rickettsioses, including the eschar.”

      **Referees cross commenting**

      “This is not the first Mouse model of rickettsiosis to contain an eschar. There is a model of Rickettsia parkeri transmitted by Amblyomma maculatum ticks in which eschars occur.”

      As noted above by us and also by Reviewer 3, we cited and discussed the tick transmission model in non-human primates (Banajee et al., 2015) in the Discussion. We also note to the reviewer the many advantages of our i.d. infection model, including how it will make these experiments more widely accessible, more reproducible, less expensive, faster, and enable the infection of mice with various genetic modifications.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      “This manuscript reports novel observations pertinent to development in inbred mice of an eschar lesion and generalized lethal infection following intradermal infection with Rickettsia parkeri, the mice are deficient in two types of interferon receptors. This is a new observation for the murine system and expands the existing repertoire of model infections for tick-borne rickettsiae. This study also reports that Sca2-mediated actin-based motility is required for R. parkeri dissemination and provides indirect evidence that OmpB protein is involved in eschar formation, thus corroborating previous knowledge about these major surface exposed antigens of rickettsiae and host cell interactions and host responses to these organisms.”

      “The study is rigorous and reliable as described and can be reproduced by others given availability of adequate funding, access to similar facilities, strains of mice and rickettsial mutants, and technical personnel with similar skills and training. There are no ethical or technical concerns.”

      We thank the reviewer for their thoughtful comments and for appreciating the advantages of this model.

      “The main limitation of the manuscript is due to the fact that histological and immunohistochemical analysis of the eschar was not performed; therefore, it is not clear if pathological processes and features of this lesion formation are the same or related to the human pathology.”

      We thank the reviewer for this suggestion. As a part of the revision plan, we propose to perform histological and immunohistochemical analysis of the eschar and will compare these findings to reported data from humans. We will also identify the cell types infected in the skin and internal organs in wild type and Ifnar-/-Ifngr-/- mice.

      “Similarly, in an attempt to generalize (as the authors try very hard), it is not clear how these observations will be relevant to rickettsial pathogens which are responsible for more severe forms of rickettsioses (such as R. rickettsii and R. prowazekii) but are not known to cause eschar formation as a part of their clinical manifestations.”

      Our findings with Sca2 are in agreement with findings on R. rickettsii Sca2 in guinea pigs (Kleba et al., 2010), which showed that Sca2 was required for eliciting fever and an antibody response. Our work also expands on these findings by showing that sca2 mutants immunize against rechallenge and by finding reduced bacterial burdens in internal organs after intradermal infection with sca2 mutant bacteria. Thus, we believe that studying R. parkeri genes in Ifnar-/-Ifngr-/- mice can serve as a model to better understand conserved virulence genes in diverse rickettsial pathogens.

      Beyond virulence genes, we note that our model also recapitulates systemic disease including dissemination to internal organs. Thus, it provides a platform to study disease manifestations beyond the eschar that may be relevant to other rickettsial pathogens including R. rickettsii and R. prowazekii.

      Some other virulent rickettsial pathogens cause limited/no disease in WT C57Bl/6 mice, including R. akari, R. conorii, R. typhi, and O. tsutsugamushi (reviewed in Osterloh, Med Microbiol Immunol 2017). Thus, Ifnar-/-Ifngr-/- mice may potentially serve as models for these pathogens. We now include this point in the Discussion.

      “The other deficiency is due to a limited description of the Sca2 and OmpB mutants used in this study. It was necessary to locate and review previous publications by this group in order to understand the experiments conducted here and their interpretation. It would be useful to the readers if this information (a better more complete description of the mutants and their properties) is summarized in this manuscript.”

      We have now provided a more complete description of the mutants in the Introduction and Results.

      Reviewer #3 (Significance (Required)):

      *“The study is relevant to investigators working in the field of rickettsial diseases and to a broader audience studying mechanisms of intracellular parasitism and host responses.

      The study argues that difference(s) in dermal IFN signaling mechanism(s) distinguish human and murine susceptibility to R. parkeri infection. This is a very useful speculation; however, a better and deeper discussion would be helpful to demonstrate the relevance of these observations and their connection(s) to events occurring during the course of human infections. Regrettably, there are almost no citations of classic or current literature addressing these aspects of rickettsial pathogenesis and the role of IFN-dependent mechanisms beyond self-citations. Overall, the discussion includes four relatively short paragraphs, each addressing different directions of possible research, which indicates ample possible utility of this murine model; however, a more coherent and convincing discussion is desirable.”*

      We thank the reviewer for the suggestion, and we have now expanded the Discussion to address the role for IFN-dependent mechanisms in humans and mice during rickettsial infections, including classic and current literature citations.

      **Referees cross commenting**

      “I agree with the Reviewer #2 that per se this is not the first murine model reproducing eschar upon A. maculatum transmission; however, this is the first model that allows to monitor eschar formation using needle inoculation. This model can be widely used; while many labs maybe limited by their facility setup and can't afford/conduct tick transmission experiments. The authors acknowledged existing of the tick transmission model and discuss inclusion of this option in their future experiments.”

      We thank the reviewer for recognizing the many advantages of this model.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript reports novel observations pertinent to development in inbred mice of an eschar lesion and generalized lethal infection following intradermal infection with Rickettsia parkeri, the mice are deficient in two types of interferon receptors. This is a new observation for the murine system and expands the existing repertoire of model infections for tick-borne rickettsiae. This study also reports that Sca2-mediated actin-based motility is required for R. parkeri dissemination, and provides indirect evidence that OmpB protein is involved in eschar formation, thus corroborating previous knowledge about these major surface exposed antigens of rickettsiae and host cell interactions and host responses to these organisms.

      The study is rigorous and reliable as described, and can be reproduced by others given availability of adequate funding, access to similar facilities, strains of mice and rickettsial mutants, and technical personnel with similar skills and training. There are no ethical or technical concerns.

      The main limitation of the manuscript is due to the fact that histological and immunohistochemical analysis of the eschar was not performed; therefore, it is not clear if pathological processes and features of this lesion formation are the same or related to the human pathology. Similarly, in an attempt to generalize (as the authors try very hard), it is not clear how these observations will be relevant to rickettsial pathogens which are responsible for more severe forms of rickettsioses (such as R. rickettsii and R. prowazekii) but are not known to cause eschar formation as a part of their clinical manifestations.

      The other deficiency is due to a limited description of the Sca2 and OmpB mutants used in this study. It was necessary to locate and review previous publications by this group in order to understand the experiments conducted here and their interpretation. It would be useful to the readers if this information (a better more complete description of the mutants and their properties) is summarized in this manuscript.

      Significance

      The study is relevant to investigators working in the field of rickettsial diseases, and to a broader audience studying mechanisms of intracellular parasitism and host responses.

      The study argues that difference(s) in dermal IFN signaling mechanism(s) distinguish human and murine susceptibility to R. parkeri infection. This is a very useful speculation; however, a better and deeper discussion would be helpful to demonstrate the relevance of these observations and their connection(s) to events occurring during the course of human infections. Regrettably, there are almost no citations of classic or current literature addressing these aspects of rickettsial pathogenesis and the role of IFN-dependent mechanisms beyond self-citations. Overall the discussion includes four relatively short paragraphs, each addressing different directions of possible research, which indicates ample possible utility of this murine model; however, a more coherent and convincing discussion is desirable.

      Referees cross commenting

      I agree with the Reviewer #2 that per se this is not the first murine model reproducing eschar upon A. maculatum transmission; however, this is the first model that allows to monitor eschar formation using needle inoculation. This model can be widely used; while many labs maybe limited by their facility setup and can't afford/conduct tick transmission experiments. The authors acknowledged existing of the tick transmission model and discuss inclusion of this option in their future experiments.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript utilizes a new model of spotted fever rickettsiosis. Using this model, the authors have determined that knockout of the sca2 or ompB gene attenuates Rickettsia parkeri, and vaccination with the attenuated rickettsiae provides protection against virulent challenge. However, the model is far less than ideal as it has eliminated important effectors of immunity. Manuscript also fails to recognize that there is a Amblyomma maculatum tick transmitted model of Rickettsia parkeri infection that causes an eschar and disseminated pathology. The model that they have used is inadequately characterized. The cutaneous lesion was not evaluated histologically to determine if it features the actual characteristics of an eschar. Although bacteria were found in the liver and spleen, in which macrophages are significant target, there was no evaluation of the vital organs including lung and brain nor demonstration of the target cells or pathologic lesions. Unfortunately the assay of vascular permeability was applied only to the inoculation site and not to the disseminated visceral organs such as lung and brain.

      Significance

      The authors all have misrepresented the eschar as a critically important lesion whereas the patients usually do not even know i's presence until they began to develop systemic symptoms and it is a detected by a physician examining the patient.

      On line 30 the authors state that mice are the natural reservoir of Rickettsia parkeri. The references cited describe the failure of acquisition by feeding ticks, meaning that it is not a true reservoir. The reference describing animals with antibodies merely indicates exposure to a spotted fever group Rickettsia not sufficient evidence of a role as a reservoir.

      In response to the request for my expertise, I have contributed a large amount of data to understanding mechanisms of immunity to rickettsiae and have developed several useful animal models of Rickettsial diseases. I also have expertise on clinical aspects of spotted fever group rickettsioses, including the eschar.

      Referees cross commenting

      This is not the first Mouse model of rickettsiosis to contain an eschar.There is a model of Rickettsia parkeri transmitted by Amblyomma maculatum ticks in which eschars occur.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Rickettsial eschars are hallmarks of less severe spotted fever diseases. The underlying mechanisms involved in the formation of the eschar caused by pathogenic rickettsiae remains unknown. The authors of this manuscript studied this interesting research question by using Ifnar-/-Ifngr-/- mice and Sca 2 or OmpB mutant of R. parkeri. R. parkeri probably is the best rickettsial species to study rickettsial eschar due to the clinical features of R. parkeri rickettsioses and the biosafety level required to work with it. The data presented in the manuscript are very promising. The conclusions are supported by the presented results. For the first time, this study recapitulated human eschar-like skin lesion observed in patients with R. parkeri rickettsioses in the mouse models. More interestingly, mice inoculated with Sca2 mutant of R. parkeri i.d. had less disseminated rickettsiae in tissues, which helps us to understand the mechanisms by which pathogenic rickettsiae cause systemic infection after the arthropod bite.

      Minor comments:

      1)Figure 2D, it looks likely the lethality of mice i.d. infection with R. parkeri is not dose-dependent. For example, mice inoculated with 10^4 showed greater lethality compared to 10^7. The authors might want to explain it in the "Discussion".

      2)Line 202, innate immunity in vitro might need to be revised.

      3)It is unclear what is the unit of the inoculum in animal experiments, PFU?

      4)Line 36, in the study of "Reference 16", C3H/HeN mice, not B6 mice, were used.

      5)The conclusion on eschar will be greatly strengthened if histological analysis is included, particularly whether dermis necrosis is present or not.

      6)Line 357, it is not clear what "spinfection" means.

      Significance

      Several approaches employed in the study are new to the field of animal models of the rickettsioses. For example, fluorescent dextran was used to investigating the vascular damage in skin at the inoculation site; body temperature for mice infected with R. parkeri. Overall, the study is highly significant since it has answered the important questions in the research area of spotted fever rickettsioses and employed appropriate approaches. No major concerns was noticed.

      Referees cross commenting

      I agree with other reviewers' comments. Thanks for the invite.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for taking the time to carefully evaluate our manuscript. The paper will be significantly improved by their suggestions, and we are grateful for their perspectives.

      To address the reviewers’ concerns, we will complete additional control experiments and revise the manuscript as detailed below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the present work Stumpff, Reinholdt and co-workers investigate the mechanism by which micronuclei contribute to tumorigenesis. Micronuclei are classic markers of genomic instability widely used in the diagnosis of cancer, but whether they work as drivers of the process has recently attracted significant attention due to their link with chromothripsis. Here, the Stumpff/Reinhold labs have explored an interesting model to test some ideas about the role of micronuclei as drivers of tumorigenesis, based on Kif18A/p53 double KO mice. They confirm the formation of micronuclei in these animals, but find no substantial increase in survival and tumor incidence relative to p53 KO animals, despite higher incidence of micronuclei in Kif18A/p53 KO tumors. They conclude that, per se, micronuclei do not have the capacity to form tumors, regardless of p53 status. This was surprising, given the well-established role of p53 in preventing the proliferation of micronucleated cells. To shed light into this apparent paradox, they compared micronuclei from Kif18A KO cells with micronuclei generated by a number of other experimental conditions that promote formation of anaphase lagging chromosomes or generates acentric fragments. They found that micronuclei derived from Kif18A are intrinsically different from micronuclei generated by those other means and essentially showed increased accumulation of lamin B, were more resistant to rupture and preserved the capacity to expand as cells exited mitosis. Of note, they find a correlation between chromosome proximity to the poles/main chromosome mass and the different features that characterize micronuclei from Kif18A KO cells, compared with the other experimental conditions in which late lagging chromosomes are more frequent. Overall, I find this study extremely interesting, well designed and executed in a rigorous way that characterizes the consistent solid work from these laboratories over the years. I have just few minor points that I recommend to be addressed prior to publication. 1-Abstract and main text lines 70 and 100: the authors indicate that Kif18A mutant mice produce micronuclei due to unaligned chromosomes. This is correct, but at the same time misleading. The authors should clarify that although micronuclei derive from compromised congression, I was convinced from previous works (Fonseca et al., JCB, 2019) that it was their asynchronous segregation in anaphase that led to micronuclei formation. As is, a less familiar reader may conceive that misaligned chromosomes directly result in micronuclei, for example by being detached from the main chromosome mass.

      We thank the reviewer for raising this point. We agree that micronuclei form in the absence of KIF18A due to chromosome alignment defects, which reduces interchromosomal compaction and leads to asynchronous arrival of chromosomes at spindle poles during anaphase. As the reviewer suggests, micronuclei form around chromosomes that travel longer distances and arrive late to the poles. We have revised the manuscript to clarify this (Lines 12-13, 72-73, 102).

      2-Page 2, line 59: "cells entering cell division...become fragmented". It is not the cells, but the chromosomes that fragment. Please correct.

      We have revised this wording to indicate it is the chromosomes within micronuclei which fragment (Line 60-63).

      3-Page 4, line 149: "reduced survival in the Kif18A null, p53 mice". P53 what? KO, WT? Please clarify.

      We have revised this wording as suggested, to read: “reduced survival in the Kif18agcd2/gcd2, p53-/- mice,” (Line 158).

      4-Page 5, line 212: the authors refer that micronuclei were scored for absence of lamin A/C, but previously they scored it as "continuous/discontinuous". Please clarify.

      Thank you for raising this question. When we scored lamin A/C, we noted cases where lamin A/C signal was incompletely present (not fully co-localizing with the micronuclear area, as indicated by DAPI). In these infrequent cases, micronuclei were identified as having “discontinuous” lamin A/C signal and were binned with those micronuclei lacking lamin A/C, for purposes of creating a binary readout of the micronuclear envelope: either 1) “intact” (having full, completely continuous lamin A/C signatures) or 2) “ruptured” (lacking a complete micronuclear signal of lamin A/C). We will update the text and the methods to more clearly reflect this categorization (Lines 221-225; 603-607).

      5-Page 6, line 243: "Kif18A is not required for micronuclear envelope rupture". Shouldn't it be micronuclear envelope "integrity"?

      We apologize for the confusion here. The experiment performed was designed to distinguish whether micronuclear envelopes are more stable in KIF18A KO cells or if KIF18A itself is somehow required for the rupture of all micronuclear envelopes to occur. Since nocodazole-induced micronuclei were able to rupture in KIF18A KO cells at similar frequencies to those seen in control cells, the data indicate that KIF18A is not required for the process of micronuclear envelope rupture. We modified the text to improve clarity (lines 252-253).

      6-One of the most interesting results of the paper is the correlation between envelope formation in micronuclei with their respective position relative to the poles/midzone. Could the authors try to investigate causality? For instance, the authors refer to works from other labs in which MT bundles and a midzone Aurora B activity gradient might play a role in the different features associated with micronuclei envelope formation, depending on their origin. Could the authors manipulate this gradient and investigate whether it changes the outcome in terms of nuclear envelope assembly properties on micronuclei? Are there any detectable features in midzone MT organization in Kif18A KO cells that would justify the observed differences?

      We agree that this result is very interesting. However, we feel the proposed experiments would repeat previous work and are somewhat outside the purview of the present study. Elegant experiments to address Aurora’s role in preventing micronucleus formation have already been performed using genetic approaches in Drosophila neuroblasts and small molecule inhibitors in mammalian cells and Drosophila S2 cells (PMIDs: 24925910, 25877868, and 29986897). Interpreting effects of Aurora B inhibition are complicated by the many critical roles Aurora B plays in ensuring proper and faithful chromosome segregation. Thus, experiments to precisely test Aurora’s effect on micronuclear envelope stability require addition of Aurora B inhibitors on a cell-by-cell basis, administered within a narrow window of minutes during anaphase. It would require significant effort to obtain enough cells from different experimental conditions to make a meaningful comparison.

      The suggestion to investigate detectable differences or features in midzone MT organization in KIF18A KO cells is also appreciated. We have not observed gross differences in midzone microtubules in KIF18A KO cells, but we will quantitatively evaluate this and add these results to the revised manuscript.

      Reviewer #1 (Significance (Required)):

      Kif18A plays a key role in chromosome alignment, without apparently affecting kinetochore-microtubule attachments in non-transformed cells. Because they cannot establish a proper metaphase plate Kif18A KO cells enter anaphase with highly asynchronous segregation due to non-uniform chromosome distribution along the spindle axis. Consequently, some "delayed" chromosomes form micronuclei, in cell culture and in vivo. Interestingly, prior art has failed to detect any increased signs of genomic instability in Kif18A KO cells and mice, and, contrary to what would be expected based on current trends, these mice do now show any signs of increased incidence of tumors, in fact they even show some protective effect to induced colitis-associated colorectal cancer. Noteworthy, all previous experimental works pointing to a role of micronuclei as key intermediates of genomic instability in cancer relied on models in which the tumor suppressor protein p53 had been inactivated. In the present work, the authors explore the relationship between micronuclei formation and p53 inactivation by investigating tumor formation in Kif18A/p53 double KO animals (1 or 2 alleles of p53 inactivated).The reported results are timely and will attract the interest of a broad readership, while decisively contributing to shed light into an ongoing debate. I am therefore all in favor for the publication of this work in any journal affiliated with review commons, pending some minor revisions.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Sepaniac and colleagues use in vivo and in vitro approaches to examine why micronuclei generated by lack of KIF18A activity do not promote tumorigenesis. The authors conclude that micronuclei in KIF18A depleted cells form stable micronuclear envelopes, which may be a result from lagging chromosomes being closer to the spindle pole when the micronuclear envelope forms. The authors further conclude that the stability of the micronuclei arising from lack of KIF18A can explain why Kif18a mutant mice do not develop tumors. These results also suggest that the consequences of micronuclei and their possible contribution to tumorigenesis depend on the context of their genesis. While the mouse model data and characterization of the stability of micronuclei generated by different insults support the conclusions, the lagging chromosome positioning data could be improved. Moreover, a number of other issues should be addressed prior to publication.

      **Major issues:**

      1.Line 153-155. The authors conclude that the slight reduction in overall survival is "due to a reduced ability of Kif18a mutants to cope with rapid tumorigenesis," but it is unclear why this would be the case. There is also an increase in micronucleated cells in thymic lymphomas from Kif18a/p53 homozygous mice (Fig. 2B)-could this not contribute? In Fig. 3C, the authors show that micronuclear rupture is similar in both Kif18a WT and mutant mice, so it seems possible that the increase in the frequency of micronuclei (Fig. 2B) coupled with a similar frequency of micronuclear rupture (Fig. 3C) could lead to the reduced survival. Then, in the discussion, the authors downplay this finding by saying (line 371) "loss of Kif18a had modest or no effect on survival of Trp53 homozygotes and heterozygotes." Why then speculate earlier in the text that loss of Kif18a reduces the ability to cope with tumorigenesis?

      We thank the reviewer for pointing out this issue. Our goal here was to try and explain why the Kif18a/p53 mutant homozygotes display a small but significant reduction in survival compared to p53 mutants, while the Kif18a mutation does not impact survival of p53 heterozygotes, which could be considered a more sensitive model for detecting decreased survival. Kif18a homozygous mutants do display a small reduction in survival shortly after birth compared to heterozygote and wild type littermates (PMID: 25824710). Thus, we can’t exclude the possibility that incompletely penetrant, postnatal lethality might be coincident with reduced fitness in surviving mutants, thus naming them more sensitive to loss of p53 loss of function. We have removed this statement form the revised text.

      However, the reviewer’s point that the combination of increased micronuclei in Kif18a/p53 homozygous mutants combined with a similar rupture rate seen in p53 mutants could also underlie or at least contribute to reduced survival is a good one. We have softened our conclusion in the Results section regarding the reduced survival of double homozygous mice (lines 158-164). We also agree that the way in which this point is addressed in the results and discussion sound contradictory. Thus, we have edited the language in the Discussion to improve consistency (lines 393-399).

      2.Related to the point above, the authors show in figure 3 that the micronuclei found in healthy tissues display infrequent membrane rupture (panel B). However, micronuclear membrane rupture in tumor tissues is much more frequent (panel C). How do the authors explain this? Do they hypothesize that the micronuclei in the tumors originate by mechanisms other than the misalignment caused by lack of KIF18A? Does KIF18A depletion cause aneuploidy due to segregation of two sisters to the same pole? If so, one could expect the tumors to be aneuploid (is this the case?) and aneuploidy has been shown by numerous groups to cause genomic instability. Such genomic instability could then explain the difference in membrane rupture.

      We agree that this is an interesting question. We plan to investigate several possible contributors to increased rupture in tumor cells in a separate study. As outlined in the Discussion (lines 443-458), we hypothesize that rupture could increase in tumor tissue due to changes in lamin expression or cytoskeletal forces in these cells. However, as the reviewer notes, differences in aneuploidy could also potentially explain the differences in membrane rupture observed in healthy (non-tumorous) and thymic lymphoma tissues. For example, an increase in chromosome number could lead to lagging chromosomes being positioned closer to the midzone in Kif18a mutant cells or, as the reviewer suggests, the micronuclei could occur in aneuploid tumors due mitotic defects other than misalignment. This may be difficult to determine unequivocally in primary cell or tissue samples. However, we do have a limited quantity of primary thymic lymphoma-derived cells and we will use these to initially investigate aneuploidy in the two genotypes. The results of these studies will be added to the final revised manuscript. In addition, we will incorporate a discussion of how aneuploidy may increase rupture frequency in tumors into the revised manuscript.

      3.The authors conclude that lagging chromosomes in KIF18A KO cells are found closer to the main chromatin mass. The Stumpff lab showed in a 2019 JCB paper that KIF18 KO cells have a chromosome alignment defect and as a result during anaphase the chromosomes can be scattered rather than forming the tight, uniform mass that is observed in WT cells. The scattering of kinetochores resulting from this phenotype could affect the value of "Avg Chromosomes Distances" in Fig 7B and the normalized distance in the KIF18A KO cells. Therefore, live-cell imaging experiments would be helpful to resolve this and possibly strengthen this conclusion. RPE1 cells with fluorescently tagged CENP-A and centrin could be used to ensure that the lagging chromosomes will not rejoin the main nucleus. Moreover, these cells could be used for correlative live-fixed cell experiments in which fixed cell analysis following micronucleus formation could be used to show that chromosomes that lag farther away from the spindle pole are more likely to have defective micronuclear envelopes.

      The reviewer’s concern that the unalignment phenotype, characteristic of KIF18A KO cells, may impact the value of average chromosome distances used to set a threshold for chromosomes meeting our definition of lagging is valid. To address this, we analyzed the standard deviations for chromosome-to-pole distances within half spindles of KIF18A KO and nocodazole-washout treated anaphase cells as a way to compare chromosome scattering in these two conditions. This analysis revealed no significant difference between the standard deviations of chromosome positions in the two groups, suggesting that scattering is similar in nocodazole treated and KIF18A KO cells. We have included these data in the manuscript (Line 351-356, and additional data added to Figure S2C).

      In order to further strengthen this conclusion, we are certainly willing to attempt the live cell imaging experiments suggested by the reviewer. We would like to point out that the frequency of micronucleus formation in the KIF18A KO cells is relatively low compared to the frequency seen after other experimental treatments (~7% of divisions result in a micronucleus). Thus, a large number of individual cells would need to be imaged with relatively high temporal resolution to make conclusions about the effects of chromosome position on micronuclear envelope formation (such analyses are not possible with the live data sets we currently have, where cells were imaged every 2 minutes). This difficulty led us to perform these measurements in synchronized and fixed cells to begin with.

      4.Based on the Fonseca et al. 2019 JCB paper (video 2), micronuclei from KIF18A KO do not exclusively arise from lagging chromosomes. Instead, chromosomes can also escape the main chromatin mass after segregation and subsequently be excluded from the main nucleus. It would be important to know what fraction of the micronuclei in KIF18A KO cells arise via lagging chromosomes. Since Aurora B and/or bundled microtubules at the spindle midzone are believed to prevent proper nuclear envelope formation, chromosomes that properly segregate but later become separated from the main nucleus would be more likely to form proper micronuclear envelopes than those arising from lagging chromosomes. The correlative microscopy experiment suggested in the previous point could allow differentiation between these two routes to micronucleus formation.

      The reviewer is correct that we did occasionally see chromosomes escape the main chromatin mass after segregation in the Fonseca et al., 2019 study referenced. We did not quantify the frequency of these events in that study, but they were rare. To address this quantitatively, we have measured the incidence of micronuclear formation around lagging chromosomes and chromosomes that escape the main chromatin mass after segregation in videos of KIF18A KO cells. We find that when micronuclei form in these cells, they form around lagging chromosomes 98% (46 out of 47 events) of the time. These data were derived from 4 live cell imaging experiments. This information has been added to the Results section (line 328-330).

      **Minor issues:**

      1.Some parts of the manuscript are excessively wordy and some sentences are unclear or convoluted (e.g., lines 148-153 and 238-239).

      Thank you for this feedback. We have revised the text in these two locations to improve clarity (lines 159-162 and 247-248 in the revised manuscript).

      2.Lines 59-61. This sentence is formulated incorrectly. First of all, the subject of the sentence is "cells" and the verb is "can become fragmented." However, the authors mean that the DNA in the micronucleus can become fragmented (not the cells). Moreover, the way the sentence is currently formulated seems to suggested that the fragmentation occurs during cell division. However, this is not the case. Please, revise the text to make it more accurate.

      We appreciate this point and have revised this text to reflect more precise language to describe this model. It is certainly the micronucleated chromatin which may become fragmented, and this fragmentation occurs as a result of replication stress, including replication fork collapse, after an existing micronucleated cell enters a subsequent round of S or G2 phase (PMIDs: 22258507, 26017310).

      3.Lines 114-115. Please, provide references in support of this statement.

      The statement in question: “This arrest was at least partially dependent on p53, consistent with other reports of cell cycle arrest following micronucleation,” shares the same references as the sentence that follows it (Sablina 1998, Thompson and Compton, 2010; Fonseca et al., 2019). We have updated the references to appear after the first statement to make this clear.

      4.Line 153. The authors refer to Fig. 1C, but I think they mean Fig. 1B.

      Thank you, we have updated the text to read Fig 1B.

      5.Line 324. the authors find that RPE1 KIF18A KO cells have lagging chromosomes in ana/telophase 9% of the time, then say that this shows that lagging chromosomes are rare in KIF18A KO cells. However, this is a large increase compared to normal RPE1 cells, which only have 1-2% frequency of lagging chromosomes. So, they should revise the text here to say that the rates of lagging chromosomes from KIF18A KO are lower compared to the rates induced by nocodazole washout.

      This is an important distinction. We have removed this confusing statement from the revised text (lines 336-338).

      6.Line 383. The references listed here should be moved earlier and specifically after the statement summarizing the results of the studies instead of being listed after the authors' conclusion/interpretation of the data. The same issue was noted in other parts of the manuscript.

      We have corrected this error (Lines 402-408). Before final submission, we will further amend the style of the manuscript throughout to cite relevant papers after the statement summarizing the results of those studies, rather than after our interpretation of the studies.

      7.Figure 1A. In the text, the authors say they cross a Kif18a heterozygous mutant mouse with a p53 heterozygous mutant mouse, but the two mice in this figure are already heterozygous for both. Please, revise the text or depict the previous additional cross necessary to obtain the double heterozygous.

      We thank the reviewer for catching this discrepancy. We have revised the text to describe the crosses necessary to obtain the double heterozygous mice shown in the figure (lines 121-123). The gcd2 mutation in Kif18a was named due to the “germ cell depleted” phenotype it causes. These homozygous mice are therefore infertile (Czechanski et al., 2015). For this reason, heterozygous mice for each gene were crossed to achieve the necessary homozygous progeny.

      8.Figure 3A. Arrows or dotted circles outlining the micronuclei in the insets of the middle and bottom rows would be helpful since the DAPI signal in the micronuclei is low and somewhat difficult to see.

      We have updated these figures as suggested to more clearly indicate the micronuclear area.

      9.Figure 3B. Error bars should be added to the graph. Moreover, the authors noted that the differences are not significant. However, this seems surprising, given that in some cases there is a three- to five-fold difference between certain pairs. Indeed, a chi-square test using the numbers from table S1 indicated p values We appreciate this feedback on the statistical tests and comparisons among these data. The main point of these analyses is to demonstrate that tissues other than blood form micronuclei in vivo in the absence of Kif18a function and that the majority of these micronuclear envelopes are completely surrounded by Lamin A/C. The data presented in Figure 3B were obtained by counting several tissue types from a single mouse of each genotype. Thus, we do not believe that error bars are appropriate in this context. To avoid confusion, we have also removed the statistical bars which had indicated no significant differences in rupture frequency among the genotypes in each sampled tissue, as these are also probably inappropriate.

      We understand the reviewer’s point that some pairwise comparisons of the data in Table S1 indicate that they are significantly different. We originally used a Chi-square test to compare the data from all three genotypes for each tissue. Because these data did not rise to the threshold of significance necessary to reject the null hypothesis across all three genotypes within each individual tissue type, we did not think performing pairwise comparisons between only two of those genotypes was appropriate (Whitlock and Schluter, The Analysis of Biological Data, 2009). Specifically, analyses of rupture frequency for spleen, liver, and thymus tissue gave p-values above 0.05 (spleen, p = 0.35; liver, p = 0.056; thymus, p = 0.052). Thus, we did not proceed with pairwise comparisons. In contrast, the analyses of p53 effects on micronucleus levels in peripheral blood in Fig 1D utilized samples from 8 individual mice for each genotype, and are therefore more amenable to statistical comparisons. If the reviewer believes any of the details of this approach are incorrect, we are happy to revise the analyses.

      10.Figure 5G. When referring to this figure (lines 292-294), the authors talk about correlation. However, the points in this graph seem to be scattered a bit randomly.

      To address this concern, we performed a Pearson’s correlation test on the data in Figure 5G. As suspected by the reviewer, this analysis did not indicate a significant correlation, and we have removed this plot from the manuscript.

      11.Figure 6B-D. The Y-axis titles of the three graphs are a bit confusing. Please, consider revising.

      We have updated the Y-axis titles for these graphs to more accurately represent what is displayed on each plot.

      12.In Figure 7 and the text, the authors use the terms "late-lagging" and "lagging" chromosomes interchangeably, which is somewhat confusing in this context because lagging chromosome distance from the main chromosome mass is thought to contribute to defective assembly of micronuclear envelopes. It is not clear whether the authors intend to indicate, with this term, that the lagging chromosome is farther away from the main chromosome mass or that the lagging chromosome is in a "late" anaphase cell. Because this is confusing, I suggest just using the term "lagging chromosome" consistently. It could be useful to include representative images of lagging chromosomes located at different distances from the main chromosome mass. And certainly, the authors should include an example of a lagging chromosome in the KIF18A KO cells.

      We agree with the reviewer’s concern regarding confusion of these terms. We have updated the text to use the term “lagging chromosome” consistently, as the reviewer suggests. We have also updated Figure 7A to include a representative image of a lagging chromosome in a KIF18A KO cell.

      13.Figure S2A. The example in the bottom right image looks more like a chromosome bridge than a lagging chromosomes. Kinetochore staining is necessary to unequivocally identify lagging chromosomes.

      We agree with the reviewer that kinetochore staining is necessary to precisely identify lagging chromosomes. We had used these images to quickly and crudely assess the presence and frequency of potentially lagging chromosomes, observed in late-anaphase cells by eye, and for subsequent experiments where lagging chromosomes were measured, repeated these experiments with proper staining of poles and kinetochores to make precise, quantifiable assessments. Reviewer #2 (Significance (Required)):

      Based on the previous knowledge on the factors that cause abnormal assembly of the micronuclear membrane, the results presented in this study were somewhat predictable. However, these findings will add to the knowledge of how micronuclei form and the potential factors that lead to micronuclear membrane rupture. Previous studies investigating micronucleus behavior have focused on micronuclei arising via merotelic kinetochore mis-attachments. These mis-attachments lead to formation of micronuclei close to the spindle midzone. In the present study, instead, the micronuclei arising from lack of KIF18A activity form farther away from the spindle midzone. The results presented here suggest that the positioning of these micronuclei farther away from the midzone enables assembly of a more stable micronuclear membrane that will be less likely to rupture during the following cell cycle. A recent study showed that the microtubule bundles in the spindle midzone interfere with micronuclear membrane assembly. Based on this, it is not surprising that micronuclei forming away from the spindle midzone (like those resulting from lack of KIF18A activity) assemble more normal membranes. Although somewhat expected, this study provides the actual data in support of this phenomenon. This study will be of interest to cell biologists interested in cell division and genomic instability. My research has focused on cell division, aneuploidy, and chromosomal instability for nearly thirty years. Therefore, I believe I am fully qualified to evaluate this manuscript.

      **Referees cross-commenting**

      My areas of expertise do not include nuclear membrane structure and function. Therefore, I encourage the authors to consider the comments of reviewer #3 for issues related to reliable quantification of micronuclear membrane rupture.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary** Sepaniac et al demonstrate that loss of KIF18a, a motor protein required for proper chromosome congression and chromatin compaction during mitosis, is insufficient to drive tumor development in mice although it does increase the frequency of micronuclei (MN), nuclear compartments that form around broken or missegregated chromosomes, in both normal and tumor tissue. MN are thought to increase genome instability and metastasis by undergoing DNA damage and activating innate immune signaling after irreparable nuclear membrane rupture. The authors use a non-transformed human cell line, hTERT-RPE-1, with KIF18a knocked out to demonstrate that MN formed as a result of KIF18a loss have more stable nuclear membranes than MN generated by other methods. They go on to correlate this increased stability with increased chromosome proximity to the main chromatin mass during nuclear envelope assembly and increased chromatin decompaction by a combination of fixed and live cell imaging.

      **Major Comments**

      1.This study relies heavily on the use of lamin A loss or discontinuity to identify ruptured micronuclei. Although the authors validate this marker against "leakage" of the soluble nuclear protein mCherry-NLS, there are several lines of evidence suggesting that lamin A loss or disruption is not a reliable reporter. In figure S3C, the top two panels of intact MN in the KIF18KO appear ruptured based on the gH2AX labeling, yet have significant levels of lamin A and are labeled as intact. In figure 4D, the rate of MN rupture after nocodazole release (60% ruptured in 2 hours) is much faster than that reported in other papers (40-60% in 16-18 hours, Liu et al; 60% in 16 hours; Hatch et al). In addition, images in Hatch et al, 2013 show lamin A localizing to both intact and ruptured MN and anecdotal information in the field suggests that lamin A localization is not a reliable reporter.

      These discrepancies may be due to how the authors' define "mCherry-NLS leakage", which needs to be defined in the methods as previous studies have demonstrated that MN frequently have delayed or reduced nuclear import even though the membrane is intact. Regardless, the authors need to provide compelling independent evidence that lamin A loss and disruption faithfully recognize ruptured MN by either validating this marker against additional rupture reporters, such as Lap2, LBR, or emerin accumulation, or by repeating key experiments in cells expressing mCherry-NLS.

      Our decision to use lamin A/C as a reporter was based on its use as a marker for micronuclear envelope presence in prior studies (Hatch, 2013; Liu, 2018). We were unaware of anecdotal information in the field that suggests that lamin A localization may not be a reliable reporter.

      However, we think we understand the reviewer’s point to be that although it is clear from prior studies that gaps in the nuclear lamina are a known predictor of micronuclear rupture, these gaps can persist for some time before rupture has actually occurred. We agree that this is an important distinction and thank the reviewer for raising these questions.

      As the reviewer notes, we performed control experiments to address this issue and validate the use of lamin A/C as a marker of micronuclear envelope rupture. Our approach involved correlating lamin staining with the localization of mCherry-NLS signal to the micronucleus (Figure S1). We found that these signals correlated well. As the reviewer points out, this analysis in fixed cells could be misleading in cases where nuclear import is reduced, but the micronuclear envelope is intact. If this were a significant contributor, we may have expected to see greater instances of micronuclei that exhibit continuous lamin A/C signal but lack nuclear localization of mCherry-NLS. However, we found this combination was rare among the KIF18A and RPE1 nocodazole washout treated cells (2%, or 1 of 46 micronuclei had continuous lamin A/C while lacking mCherry-NLS). We admit that this assumption may be oversimplified though.

      The reviewer’s point about the timing of nocodazole treatment and washout something we have definitely considered. We note that prior studies have used differing time points after nocodazole treatment and release. For Hatch et al., 2013: U2OS cells were treated for 6 hours with nocodazole and then subjected to mitotic shakeoff, 48% of micronuclei were ruptured after 6 hours and ~60% were ruptured after 16 hours. Similarly, in Liu et al., 2018 60% of micronuclei were ruptured 16 hours post mitotic shake off and nocodazole release. While these results suggest that rupture increases with time after mitosis, it isn’t clear how early rupture may occur. In other words, does it take several hours in G2 before nearly half of micronuclei rupture or do many of these rupture shortly after cell division?

      We note that other explanations could also potentially contribute to the differences in rupture rates reported in our study compared to those in previous publications. For example, we used a short nocodazole treatment (2 hrs) compared to the longer treatments (6 hrs) used in previous studies. We did this originally in order to produce a similar percentage of micronucleated cells as is seen in KIF18A KO cell populations. However, the difference in nocadozole treatment length could potentially influence the types and frequencies of kinetochore microtubule attachments formed. For example, if centrosomes stay closer together in mitotic cells after short nocodazole treatments, this could increase the number of abnormal attachments (e.g. PMID: 22130796). Such an effect would be expected to increase the frequency of lagging chromosomes and/or potentially produce more lagging chromosomes within the anaphase midzone.

      The best way to address this issue would be to repeat our analyses of mcherry-NLS in live cells to track the formation and rupture of micronuclei. We did attempt these live imaging experiments previously and have found this experiment challenging due to: 1) the low frequency of micronuclear formation in KIF18A KO cell population; 2) a low transfection/expression efficiency for the mCherry-NLS plasmid in RPE1 cells, and 3) photobleaching of the mCherry-NLS plasmid. For these reasons, we transitioned into fixed cell experiments for the mCherry-NLS reporter. However, we propose to troubleshoot this assay and attempt to obtain the data necessary to determine when rupture is occurring. In addition, we will use additional markers to investigate micronuclear envelope stability, as the reviewer has suggested.

      Regardless of the outcome of these experiments, we have measured a clear difference between the lamin deposition within micronuclear envelopes of KIF18A KO cells compared to those formed following other insults. Lamin recruitment is well established as a predictor of nuclear envelope stability. If necessary, we could alter the text to indicate that the presence of lamin A/C and B within micronuclear envelopes of KIF18A KO cells are indicative of nuclear envelope stability, and that this is distinct from the lamin profiles of micronuclei in cells subjected to nocodazole-washout.

      2.Micronuclei in tumor sections and other dense tissues can appear very similar to other types of chromatin, including blebs from adjacent nuclei and dead cells. To verify that the quantified structures are bona fide micronuclei, the authors need to include a marker for the cell boundary. This is especially critical in the lamin a stained tumor sections with heterogenous lamin A protein expression.

      We appreciate the point this reviewer raises and we carefully considered accurate identification of micronuclei in tissues. Three optical sections were collected from each sample. During analyses, we scrolled through the ~2-micron thick sections to exclude chromatin bodies connected to an out-of-plane nucleus or nuclear bleb. We have a limited number of sectioned and preserved thymic lymphoma tissues remaining. We will use these samples to reassess micronuclear frequency in the presence of a cell boundary marker.

      3.Figure 4 compares MN rupture frequency between cells treated with different inducers of micronuclei - KIF18A KO, nocodazole release, and irradiation. These treatments have different effects on the cell cycle: KIF18A causes minor delays, nocodazole arrests cells in mitosis, and g-IR likely causes delays in S and G2. Since MN rupture frequency increases with the duration of interphase, the authors need to assess rupture frequency at similar time points after mitosis for all three conditions. One way to accomplish this would be to repeat this experiment and analyze cells collected by mitotic cells by shake-off prior to fixation and labeling.

      We appreciate this point regarding differences in mitotic timing. Since micronuclear rupture frequency increases with time in interphase, we would expect the MN in KIF18A KO cells to exhibit the highest level of rupture if cell cycle timing were the primary variable affecting stability in our experiments. KIF18A KO cells are asynchronously dividing, and the micronuclei examined in populations of those cells could have been generated at any time. We do not have the same type of temporal control of these events as we do with drug treatment. In contrast, the vast majority of the MN in nocodazole washout cells would not have been in interphase for more than 1.5 hours in our experiments, yet showed increased lamin A/C defects. RPE1 cells treated with MAD2 siRNA knockdown, which do not experience mitotic delays (PMID: 9606211; 15239953), also showed greater frequencies of micronuclear envelopes which lacked lamin A/C compared to those arising in KIF18A KO cells.

      To further address this question, we could attempt a mitotic shake-off assay, however, we believe that the formation of micronuclei, as a percentage in the population of KIF18A KO cells, will be limiting in these experiments.

      As an alternative, we propose to use live cell imaging to follow micronuclear formation and rupture, as described above in reference to point 1.

      **Minor Comments**

      1.In figure 6A, it is unclear when the videos start and how micronuclei are selected for analysis. Do the micronuclei have to be continuously visible from the time they missegregate? Do the videos all start at the same time point during mitosis or is it contingent on when the MN appears separated from the main nucleus? One concern is that a consistent delay in micronucleus appearance in the nocodazole treated cells could artificially decrease the amount of MN expansion observed.

      We thank the reviewer for these questions. The individual micronuclei did not need to be continuously visible from the time that they missegregated, though the majority were. When a micronucleus was not sufficiently in the plane of focus for an accurate area measurement, the individual measurement at that time point was not collected. In cases where one or more frames which were not measurable, a micronucleus was only included in the final data set if it was 1) the only micronucleus present in the daughter cell or 2) easily identifiable to be the same micronucleus. Measurements were taken until the micronuclear area reached an equilibrium for several frames. Final fold change in area was established by dividing final area measurements by initial measurements.

      The initial measurement for each micronucleus taken from the videos all start at the same relative point during mitosis, which is just after chromosome segregation has occurred.

      2.In figure 7A, it is difficult to identify the "lagging" chromosome in the top panel. It would be helpful to label the chromosome that becomes the MN, or ideally, to include a video or still images to demonstrate how micronuclei form in the KIF18A KO cells.

      We have updated the images in Figure 7A to include an example of a lagging chromosome in a KIF18A KO cell. We will also include a more explicit reference to our previous study (Fonseca et al., 2019), which described how micronuclei form around lagging chromosomes in KIF18A KO RPE1 cells.

      3.The two image panels in figure 7A are imaged at significantly different times during anaphase (early anaphase on bottom versus late anaphase/telophase on top). A better comparison would be between two cells at the same time point in anaphase.

      We have updated the images in Figure 7A to compare cells at similar stages of anaphase. In our quantification of lagging chromosomes, we also accounted for anaphase-timing differences by normalizing all measurements within each half-spindle.

      Reviewer #3 (Significance (Required)):

      In this study, the authors identify chromatin decondensation in micronuclei as a new predictor of membrane stability. Although these results are correlative, if their micronucleus rupture results can be validated as described in major comment 1, this study would advance our understanding of the micronucleus rupture mechanism by linking mitotic spindle location, chromatin decondensation, and lamin B1 protein recruitment. This would provide needed support to a current model in the field that micronucleus stability is largely determined during nuclear envelope assembly. In addition, if KIF18a loss generates stable micronuclei at high frequency, it will become a critical system for testing MN rupture hypotheses in the field. Thus, this work would be of significant interest to cell biologists working on nuclear envelope structure and function, chromosome organization, and mitosis. I include myself in this group as a cell biologist studying nuclear envelope structure and function with an expertise in membrane dynamics. The authors also find that mice mutant for KIF18a have increased micronucleation in normal tissues but not increased tumor initiation. They hypothesize that this is due to the low rupture frequency of KIF18a-induced MN, however their data cannot reject the null hypothesis that the small increase in MN they see in KIF18a mutant mice would be insufficient to induce tumorigenesis even if rupture frequency was high. Thus the significance of their finding that micronucleation is not sufficient for cancer progression is unclear. However, the thorough analysis of micronucleation and rupture in several healthy tissues as well as a tumor model in KIF18 mutant mice would be of interest to both pathologists and cancer researchers focused on mechanisms of genome instability. These types of experiments are critical to determine how micronuclei contribute to cancer progression and the quantifications presented in this paper are truly impressive.

      We appreciate this reviewer’s enthusiasm for our work and acknowledge that we cannot definitively conclude that micronuclear envelope stability explains why Kif18a mutant mice do not form tumors. However, it is interesting to note that the micronuclear loads measured using a peripheral erythrocyte assay are similar in Kif18agcd2/gcd2 mutant mice (0.6% micronucleated erythrocytes, of total erythrocytes) and ATMtm1 Awb/tm1 Awb mutant mice (0.6% of micronucleated erythrocytes, of total) (Fonseca et al., 2019). Yet, the tumor frequency in these two models is dramatically different: Kif18agcd2/gcd2 mutant mice do not spontaneously form tumors – while the majority of ATMtm1 Awb/tm1 Awb mutant mice do develop thymic lymphoma tumors between 2 and 4 months (Barlow, 1996). It is not clear how much micronuclei contribute to tumorigenesis in the ATM mutant model, but this comparison does suggest that the increase in MN seen in Kif18a mutants may be physiologically relevant. We have added this information to the revised text (lines 125-130).

      **Referees cross-commenting**

      I agree with the concerns raised by the other 2 reviewers, especially their comments about the need to clarify the mechanism of chromosome lagging versus chromosome congression and compaction. I think that all of these suggestions, though, are contingent on them being able to reproduce their micronucleus rupture results with a better marker of nucleus integrity. I strongly believe that additional validation of lamin A as a micronucleus rupture marker will demonstrate that it is unreliable, based both on our own observations in RPE-1 cells and the images they show

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Sepaniac et al demonstrate that loss of KIF18a, a motor protein required for proper chromosome congression and chromatin compaction during mitosis, is insufficient to drive tumor development in mice although it does increase the frequency of micronuclei (MN), nuclear compartments that form around broken or missegregated chromosomes, in both normal and tumor tissue. MN are thought to increase genome instability and metastasis by undergoing DNA damage and activating innate immune signaling after irreparable nuclear membrane rupture. The authors use a non-transformed human cell line, hTERT-RPE-1, with KIF18a knocked out to demonstrate that MN formed as a result of KIF18a loss have more stable nuclear membranes than MN generated by other methods. They go on to correlate this increased stability with increased chromosome proximity to the main chromatin mass during nuclear envelope assembly and increased chromatin decompaction by a combination of fixed and live cell imaging.

      Major Comments

      1.This study relies heavily on the use of lamin A loss or discontinuity to identify ruptured micronuclei. Although the authors validate this marker against "leakage" of the soluble nuclear protein mCherry-NLS, there are several lines of evidence suggesting that lamin A loss or disruption is not a reliable reporter. In figure S3C, the top two panels of intact MN in the KIF18KO appear ruptured based on the gH2AX labeling, yet have significant levels of lamin A and are labeled as intact. In figure 4D, the rate of MN rupture after nocodazole release (60% ruptured in 2 hours) is much faster than that reported in other papers (40-60% in 16-18 hours, Liu et al; 60% in 16 hours; Hatch et al). In addition, images in Hatch et al, 2013 show lamin A localizing to both intact and ruptured MN and anecdotal information in the field suggests that lamin A localization is not a reliable reporter.

      These discrepancies may be due to how the authors' define "mCherry-NLS leakage", which needs to be defined in the methods as previous studies have demonstrated that MN frequently have delayed or reduced nuclear import even though the membrane is intact. Regardless, the authors need to provide compelling independent evidence that lamin A loss and disruption faithfully recognize ruptured MN by either validating this marker against additional rupture reporters, such as Lap2, LBR, or emerin accumulation, or by repeating key experiments in cells expressing mCherry-NLS.

      2.Micronuclei in tumor sections and other dense tissues can appear very similar to other types of chromatin, including blebs from adjacent nuclei and dead cells. To verify that the quantified structures are bona fide micronuclei, the authors need to include a marker for the cell boundary. This is especially critical in the lamin a stained tumor sections with heterogenous lamin A protein expression.

      3.Figure 4 compares MN rupture frequency between cells treated with different inducers of micronuclei - KIF18A KO, nocodazole release, and irradiation. These treatments have different effects on the cell cycle: KIF18A causes minor delays, nocodazole arrests cells in mitosis, and g-IR likely causes delays in S and G2. Since MN rupture frequency increases with the duration of interphase, the authors need to assess rupture frequency at similar time points after mitosis for all three conditions. One way to accomplish this would be to repeat this experiment and analyze cells collected by mitotic cells by shake-off prior to fixation and labeling.

      Minor Comments

      1.In figure 6A, it is unclear when the videos start and how micronuclei are selected for analysis. Do the micronuclei have to be continuously visible from the time they missegregate? Do the videos all start at the same time point during mitosis or is it contingent on when the MN appears separated from the main nucleus? One concern is that a consistent delay in micronucleus appearance in the nocodazole treated cells could artificially decrease the amount of MN expansion observed.

      2.In figure 7A, it is difficult to identify the "lagging" chromosome in the top panel. It would be helpful to label the chromosome that becomes the MN, or ideally, to include a video or still images to demonstrate how micronuclei form in the KIF18A KO cells.

      3.The two image panels in figure 7A are imaged at significantly different times during anaphase (early anaphase on bottom versus late anaphase/telophase on top). A better comparison would be between two cells at the same time point in anaphase.

      Significance

      In this study, the authors identify chromatin decondensation in micronuclei as a new predictor of membrane stability. Although these results are correlative, if their micronucleus rupture results can be validated as described in major comment 1, this study would advance our understanding of the micronucleus rupture mechanism by linking mitotic spindle location, chromatin decondensation, and lamin B1 protein recruitment. This would provide needed support to a current model in the field that micronucleus stability is largely determined during nuclear envelope assembly. In addition, if KIF18a loss generates stable micronuclei at high frequency, it will become a critical system for testing MN rupture hypotheses in the field. Thus, this work would be of significant interest to cell biologists working on nuclear envelope structure and function, chromosome organization, and mitosis. I include myself in this group as a cell biologist studying nuclear envelope structure and function with an expertise in membrane dynamics.

      The authors also find that mice mutant for KIF18a have increased micronucleation in normal tissues but not increased tumor initiation. They hypothesize that this is due to the low rupture frequency of KIF18a-induced MN, however their data cannot reject the null hypothesis that the small increase in MN they see in KIF18a mutant mice would be insufficient to induce tumorigenesis even if rupture frequency was high. Thus the significance of their finding that micronucleation is not sufficient for cancer progression is unclear. However, the thorough analysis of micronucleation and rupture in several healthy tissues as well as a tumor model in KIF18 mutant mice would be of interest to both pathologists and cancer researchers focused on mechanisms of genome instability. These types of experiments are critical to determine how micronuclei contribute to cancer progression and the quantifications presented in this paper are truly impressive.

      Referees cross-commenting

      I agree with the concerns raised by the other 2 reviewers, especially their comments about the need to clarify the mechanism of chromosome lagging versus chromosome congression and compaction.

      I think that all of these suggestions, though, are contingent on them being able to reproduce their micronucleus rupture results with a better marker of nucleus integrity. I strongly believe that additional validation of lamin A as a micronucleus rupture marker will demonstrate that it is unreliable, based both on our own observations in RPE-1 cells and the images they show

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Sepaniac and colleagues use in vivo and in vitro approaches to examine why micronuclei generated by lack of KIF18A activity do not promote tumorigenesis. The authors conclude that micronuclei in KIF18A depleted cells form stable micronuclear envelopes, which may be a result from lagging chromosomes being closer to the spindle pole when the micronuclear envelope forms. The authors further conclude that the stability of the micronuclei arising from lack of KIF18A can explain why Kif18a mutant mice do not develop tumors. These results also suggest that the consequences of micronuclei and their possible contribution to tumorigenesis depend on the context of their genesis. While the mouse model data and characterization of the stability of micronuclei generated by different insults support the conclusions, the lagging chromosome positioning data could be improved. Moreover, a number of other issues should be addressed prior to publication.

      Major issues:

      1.Line 153-155. The authors conclude that the slight reduction in overall survival is "due to a reduced ability of Kif18a mutants to cope with rapid tumorigenesis," but it is unclear why this would be the case. There is also an increase in micronucleated cells in thymic lymphomas from Kif18a/p53 homozygous mice (Fig. 2B)-could this not contribute? In Fig. 3C, the authors show that micronuclear rupture is similar in both Kif18a WT and mutant mice, so it seems possible that the increase in the frequency of micronuclei (Fig. 2B) coupled with a similar frequency of micronuclear rupture (Fig. 3C) could lead to the reduced survival. Then, in the discussion, the authors downplay this finding by saying (line 371) "loss of Kif18a had modest or no effect on survival of Trp53 homozygotes and heterozygotes." Why then speculate earlier in the text that loss of Kif18a reduces the ability to cope with tumorigenesis?

      2.Related to the point above, the authors show in figure 3 that the micronuclei found in healthy tissues display infrequent membrane rupture (panel B). However, micronuclear membrane rupture in tumor tissues is much more frequent (panel C). How do the authors explain this? Do they hypothesize that the micronuclei in the tumors originate by mechanisms other than the misalignment caused by lack of KIF18A? Does KIF18A depletion cause aneuploidy due to segregation of two sisters to the same pole? If so, one could expect the tumors to be aneuploid (is this the case?) and aneuploidy has been shown by numerous groups to cause genomic instability. Such genomic instability could then explain the difference in membrane rupture.

      3.The authors conclude that lagging chromosomes in KIF18A KO cells are found closer to the main chromatin mass. The Stumpff lab showed in a 2019 JCB paper that KIF18 KO cells have a chromosome alignment defect and as a result during anaphase the chromosomes can be scattered rather than forming the tight, uniform mass that is observed in WT cells. The scattering of kinetochores resulting from this phenotype could affect the value of "Avg Chromosomes Distances" in Fig 7B and the normalized distance in the KIF18A KO cells. Therefore, live-cell imaging experiments would be helpful to resolve this and possibly strengthen this conclusion. RPE1 cells with fluorescently tagged CENP-A and centrin could be used to ensure that the lagging chromosomes will not rejoin the main nucleus. Moreover, these cells could be used for correlative live-fixed cell experiments in which fixed cell analysis following micronucleus formation could be used to show that chromosomes that lag farther away from the spindle pole are more likely to have defective micronuclear envelopes.

      4.Based on the Fonseca et al. 2019 JCB paper (video 2), micronuclei from KIF18A KO do not exclusively arise from lagging chromosomes. Instead, chromosomes can also escape the main chromatin mass after segregation and subsequently be excluded from the main nucleus. It would be important to know what fraction of the micronuclei in KIF18A KO cells arise via lagging chromosomes. Since Aurora B and/or bundled microtubules at the spindle midzone are believed to prevent proper nuclear envelope formation, chromosomes that properly segregate but later become separated from the main nucleus would be more likely to form proper micronuclear envelopes than those arising from lagging chromosomes. The correlative microscopy experiment suggested in the previous point could allow differentiation between these two routes to micronucleus formation.

      Minor issues:

      1.Some parts of the manuscript are excessively wordy and some sentences are unclear or convoluted (e.g., lines 148-153 and 238-239).

      2.Lines 59-61. This sentence is formulated incorrectly. First of all, the subject of the sentence is "cells" and the verb is "can become fragmented." However, the authors mean that the DNA in the micronucleus can become fragmented (not the cells). Moreover, the way the sentence is currently formulated seems to suggested that the fragmentation occurs during cell division. However, this is not the case. Please, revise the text to make it more accurate.

      3.Lines 114-115. Please, provide references in support of this statement.

      4.Line 153. The authors refer to Fig. 1C, but I think they mean Fig. 1B.

      5.Line 324. the authors find that RPE1 KIF18A KO cells have lagging chromosomes in ana/telophase 9% of the time, then say that this shows that lagging chromosomes are rare in KIF18A KO cells. However, this is a large increase compared to normal RPE1 cells, which only have 1-2% frequency of lagging chromosomes. So, they should revise the text here to say that the rates of lagging chromosomes from KIF18A KO are lower compared to the rates induced by nocodazole washout.

      6.Line 383. The references listed here should be moved earlier and specifically after the statement summarizing the results of the studies instead of being listed after the authors' conclusion/interpretation of the data. The same issue was noted in other parts of the manuscript.

      7.Figure 1A. In the text, the authors say they cross a Kif18a heterozygous mutant mouse with a p53 heterozygous mutant mouse, but the two mice in this figure are already heterozygous for both. Please, revise the text or depict the previous additional cross necessary to obtain the double heterozygous.

      8.Figure 3A. Arrows or dotted circles outlining the micronuclei in the insets of the middle and bottom rows would be helpful since the DAPI signal in the micronuclei is low and somewhat difficult to see.

      9.Figure 3B. Error bars should be added to the graph. Moreover, the authors noted that the differences are not significant. However, this seems surprising, given that in some cases there is a three- to five-fold difference between certain pairs. Indeed, a chi-square test using the numbers from table S1 indicated p values <0.05 for several pairwise comparisons.

      10.Figure 5G. When referring to this figure (lines 292-294), the authors talk about correlation. However, the points in this graph seem to be scattered a bit randomly.

      11.Figure 6B-D. The Y-axis titles of the three graphs are a bit confusing. Please, consider revising.

      12.In Figure 7 and the text, the authors use the terms "late-lagging" and "lagging" chromosomes interchangeably, which is somewhat confusing in this context because lagging chromosome distance from the main chromosome mass is thought to contribute to defective assembly of micronuclear envelopes. It is not clear whether the authors intend to indicate, with this term, that the lagging chromosome is farther away from the main chromosome mass or that the lagging chromosome is in a "late" anaphase cell. Because this is confusing, I suggest just using the term "lagging chromosome" consistently. It could be useful to include representative images of lagging chromosomes located at different distances from the main chromosome mass. And certainly, the authors should include an example of a lagging chromosome in the KIF18A KO cells.

      13.Figure S2A. The example in the bottom right image looks more like a chromosome bridge than a lagging chromosomes. Kinetochore staining is necessary to unequivocally identify lagging chromosomes.

      Significance

      Based on the previous knowledge on the factors that cause abnormal assembly of the micronuclear membrane, the results presented in this study were somewhat predictable. However, these findings will add to the knowledge of how micronuclei form and the potential factors that lead to micronuclear membrane rupture. Previous studies investigating micronucleus behavior have focused on micronuclei arising via merotelic kinetochore mis-attachments. These mis-attachments lead to formation of micronuclei close to the spindle midzone. In the present study, instead, the micronuclei arising from lack of KIF18A activity form farther away from the spindle midzone. The results presented here suggest that the positioning of these micronuclei farther away from the midzone enables assembly of a more stable micronuclear membrane that will be less likely to rupture during the following cell cycle. A recent study showed that the microtubule bundles in the spindle midzone interfere with micronuclear membrane assembly. Based on this, it is not surprising that micronuclei forming away from the spindle midzone (like those resulting from lack of KIF18A activity) assemble more normal membranes. Although somewhat expected, this study provides the actual data in support of this phenomenon. This study will be of interest to cell biologists interested in cell division and genomic instability. My research has focused on cell division, aneuploidy, and chromosomal instability for nearly thirty years. Therefore, I believe I am fully qualified to evaluate this manuscript.

      Referees cross-commenting

      My areas of expertise do not include nuclear membrane structure and function. Therefore, I encourage the authors to consider the comments of reviewer #3 for issues related to reliable quantification of micronuclear membrane rupture.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the present work Stumpff, Reinholdt and co-workers investigate the mechanism by which micronuclei contribute to tumorigenesis. Micronuclei are classic markers of genomic instability widely used in the diagnosis of cancer, but whether they work as drivers of the process has recently attracted significant attention due to their link with chromothripsis. Here, the Stumpff/Reinhold labs have explored an interesting model to test some ideas about the role of micronuclei as drivers of tumorigenesis, based on Kif18A/p53 double KO mice. They confirm the formation of micronuclei in these animals, but find no substantial increase in survival and tumor incidence relative to p53 KO animals, despite higher incidence of micronuclei in Kif18A/p53 KO tumors. They conclude that, per se, micronuclei do not have the capacity to form tumors, regardless of p53 status. This was surprising, given the well-established role of p53 in preventing the proliferation of micronucleated cells. To shed light into this apparent paradox, they compared micronuclei from Kif18A KO cells with micronuclei generated by a number of other experimental conditions that promote formation of anaphase lagging chromosomes or generates acentric fragments. They found that micronuclei derived from Kif18A are intrinsically different from micronuclei generated by those other means and essentially showed increased accumulation of lamin B, were more resistant to rupture and preserved the capacity to expand as cells exited mitosis. Of note, they find a correlation between chromosome proximity to the poles/main chromosome mass and the different features that characterize micronuclei from Kif18A KO cells, compared with the other experimental conditions in which late lagging chromosomes are more frequent. Overall, I find this study extremely interesting, well designed and executed in a rigorous way that characterizes the consistent solid work from these laboratories over the years. I have just few minor points that I recommend to be addressed prior to publication.

      1-Abstract and main text lines 70 and 100: the authors indicate that Kif18A mutant mice produce micronuclei due to unaligned chromosomes. This is correct, but at the same time misleading. The authors should clarify that although micronuclei derive from compromised congression, I was convinced from previous works (Fonseca et al., JCB, 2019) that it was their asynchronous segregation in anaphase that led to micronuclei formation. As is, a less familiar reader may conceive that misaligned chromosomes directly result in micronuclei, for example by being detached from the main chromosome mass.

      2-Page 2, line 59: "cells entering cell division...become fragmented". It is not the cells, but the chromosomes that fragment. Please correct.

      3-Page 4, line 149: "reduced survival in the Kif18A null, p53 mice". P53 what? KO, WT? Please clarify.

      4-Page 5, line 212: the authors refer that micronuclei were scored for absence of lamin A/C, but previously they scored it as "continuous/discontinuous". Please clarify.

      5-Page 6, line 243: "Kif18A is not required for micronuclear envelope rupture". Shouldn't it be micronuclear envelope "integrity"?

      6-One of the most interesting results of the paper is the correlation between envelope formation in micronuclei with their respective position relative to the poles/midzone. Could the authors try to investigate causality? For instance, the authors refer to works from other labs in which MT bundles and a midzone Aurora B activity gradient might play a role in the different features associated with micronuclei envelope formation, depending on their origin. Could the authors manipulate this gradient and investigate whether it changes the outcome in terms of nuclear envelope assembly properties on micronuclei? Are there any detectable features in midzone MT organization in Kif18A KO cells that would justify the observed differences?

      Significance

      Kif18A plays a key role in chromosome alignment, without apparently affecting kinetochore-microtubule attachments in non-transformed cells. Because they cannot establish a proper metaphase plate Kif18A KO cells enter anaphase with highly asynchronous segregation due to non-uniform chromosome distribution along the spindle axis. Consequently, some "delayed" chromosomes form micronuclei, in cell culture and in vivo. Interestingly, prior art has failed to detect any increased signs of genomic instability in Kif18A KO cells and mice, and, contrary to what would be expected based on current trends, these mice do now show any signs of increased incidence of tumors, in fact they even show some protective effect to induced colitis-associated colorectal cancer. Noteworthy, all previous experimental works pointing to a role of micronuclei as key intermediates of genomic instability in cancer relied on models in which the tumor suppressor protein p53 had been inactivated. In the present work, the authors explore the relationship between micronuclei formation and p53 inactivation by investigating tumor formation in Kif18A/p53 double KO animals (1 or 2 alleles of p53 inactivated).The reported results are timely and will attract the interest of a broad readership, while decisively contributing to shed light into an ongoing debate. I am therefore all in favor for the publication of this work in any journal affiliated with review commons, pending some minor revisions.

  4. Dec 2020
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their constructive suggestions, which have substantially improved this work. We have comprehensively revised the manuscript, and detail individual responses below:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The study by Forbes et al describes and characterizes a 2nd generation peptide-based inhibitor of the MYB:CBP interaction, termed CRYBMIM, which they use to study MYB:cofactor interactions in leukemia cells. The CRYBMIM has improved properties relative to the MYBMIM peptide, and display more potency in biochemical and cell-based assays. Using a combination of epigenomics and biochemical screens, the authors define a list of candidate MYB cofactors whose functional significance as AML dependencies is supported by analysis of the DepMap database. Using genomewide profiling of TF and CBP occupancy, the authors provide evidence that CRYBMIM treatment reprograms the interactome of MYB in a manner that disproportionately changes specific cis-elements over others. Stated differently, the overall occupancy pattern of many TFs/cofactors shows gains and losses at specific cis elements, resulting in a complex modulation of MYB function and changes in transcription in leukemia cells. Overall, this is a strong, well-written study, with clear experimental results and relatively straightforward conclusions. The therapeutic potential of modulating MYB in cancer is enormous, and hence I believe this study will attract a broad interest in the cancer field and will likely be highly cited. I list below a few control experiments that would clarify the specificity of CRYBMIM. 1) Does CRYBMIM bind to other KIX domains, such as of MED15. It would be important to evaluate the specificity of this peptide for whether it binds to other KIX domains.

      Response: We analyzed all known human KIX domain sequences, and found that the most similar one to CBP/P300 is MED15 (38% identity), as shown in revised Supp. Fig. 2D. The sequence similarity of the remaining human KIX domains is substantially lower. To determine the specificity of CRYBMIM in binding the CBP/P300 versus MED15, we exposed human AML cell extracts to biotinylated CRYBMIM immobilized on streptavidin beads versus beads alone. Whereas CRYBMIM binds efficiently to CBP/P300, it does not exhibit any measurable binding to MED15 (even though MED15 is highly expressed), as shown in revised Supp. Fig. 2E, and reproduced for convenience below. While this does not exclude the possibility that CRYBMIM binds to other proteins, the biochemical specificity observed here, combined with the genetic requirement of CBP for cellular effects of CRYBMIM as shown by a genome-wide CRISPR screen (Fig. 1B and below), indicate that CRYBMIM is a specific ligand of CBP/P300. The manuscript has been revised on page 6 and 4-5 accordingly.

      2) Similarly, it would be useful to perform a mass spec analysis to all nuclear factors that associate with streptavidin-immobilized CRYBMIM. This again would be help the reader to understand the specificity of this peptide.

      Response: We agree with the reviewer that macromolecular ligands like CRYBMIM may interact with cellular proteins in complex ways. To define specific effects, we utilized four orthogonal strategies, explained below.

      First, we purified the CBP-containing nuclear complex using immunoprecipitation and determined its composition by mass spectrometry proteomics. This analysis revealed 833 proteins that are specifically associated with CBP (revised Table S6). Although technically feasible, the fact that CBP is associated with hundreds of proteins would make the experiment suggested by the reviewer difficult to interpret, because it would be a major challenge to distinguish proteins bound directly by the peptide versus proteins purified indirectly by virtue of the fact that CRYBMIM binds to CBP/P300, which in turn binds to many other proteins. While we recently developed improved methods for cross-linking mass spectrometry proteomics that permit the identification of direct protein-protein interactions (Ser, Cifani, Kentsis 2019, https://doi.org/10.1021/acs.jproteome.9b00085), we believe that these experiments are beyond the scope of the current manuscript, which already includes 40 new figure panels as part of this revision.

      In lieu of this experiment, we purified the CBP-containing nuclear complex after treatment with CRYBMIM or control using immunoprecipitation and determined its composition by targeting Western blotting. This analysis revealed RUNX1, LYL1 and SATB1 are specifically associated with CBP (revised Fig. 14B), among which RUNX1 is specifically remodeled in the MYB:CBP/P300 complex upon CRYBMIM binding. This transcriptional factor recruitment and remodeling support the idea of CRYBMIM’s specificity for the MYB:CBP/P300 complex.

      Second, to define the specificity of CRYBMIM, we used glycine mutants of CRYBMIM and its parent MYBMIM, CG3 and TG3, respectively, in which residues that form key salt bridge and hydrophobic interactions with KIX are replaced with glycines, but otherwise retain all other features of the active probes. Both CG3 and TG3 exhibit significantly reduced effects on the viability of AML cell lines, consistent with the specific effects of CRYBMIM (Fig. 3D).

      To confirm that this is due to CBP binding, we purified cellular CBP/P300 by binding to biotinylated CRYBMIM, and observed that it can be efficiently competed by excess of free CRYBMIM, but not TAT (Fig. 2E).

      Finally, to establish definitively that cellular CBP is responsible for CRYBMIM effects, we generated isogenic cell lines that are either deficient or proficient for CBP using CRISPR genome editing. This experiment demonstrated that CBP deficiency confers significant resistance to CRYBMIM, indicating that CBP is required for CRYBMIM-mediated effects (revised Figure 4), and reproduced below. We revised the manuscript on pages 21, 8, 6 and 9 accordingly.

      3) The major limitation of this study which modestly lessens my enthusiasm of this work is that the mechanistic model of MYB-sequestered TFs proposed here is based on a face-value interpretation of IP-MS data coupled with ChIP-seq data. Normally, I would expect such a mechanism to be supported with some additional focused biochemical experiments of specific interactions, to complement all of the omics approaches. For example, can the authors evaluate and/or validate further how MYB physically interacts with LYL1, CEBPA, SPI1, or RUNX1. Are these interactions direct or indirect? Which domains of these proteins are involved? Does CRYBMIM treatment modulate the ability of these proteins to associate with one another in a co-IP? Do these interactions occur in normal hematopoietic cells? A claim is made throughout this study that these are aberrant TF complexes, but I believe more evidence is required to support this claim.

      Response: We appreciate the reviewer’s comment and totally agree with this point. To examine how MYB aberrantly assembles transcription factors in AML, we performed MYB co-immunoprecipitation (co-IP) in a panel of seven genetically diverse AML cell lines with varying susceptibility to CRYBMIM, chosen to represent the common and refractory forms of human AML. Here, we confirmed co-assembly of CBP/P300, LYL1, E2A, LMO2 in all AML cell lines tested, and cell type-specific co-assembly of SATB1 and CEBPA, as shown in revised Fig. 8A, which are in agreement with the IP-MS and ChIP-seq results. We further corroborated these findings by co-IP studies of CBP/P300, as shown in the revised Fig. 8B. We performed similar co-IP experiments in normal hematopoietic progenitor cells, and found most of the co-assembled factors in AML cells were not observed in normal cells except for CBP/P300 and LYL1, as shown in the revised Figure 9E. Combined with the apparently aberrant expression of E2A and SATB1 in AML cells but not normal blood cells, this leads us to conclude that MYB assembles aberrant transcription factor complexes in AML cells. These complexes can be remodeled by peptidomimetic inhibitors, leading to their redistribution on chromatin, suppression of oncogenic gene expression and induction of cellular differentiation. We confirmed this mechanism by direct biochemical experiments in AML cells, demonstrating disassembly and remodeling of CBP/P300 complexes, as shown in the revised Figure 14. At least some of these interactions are direct, given the known direct binding between MYB and CEBPA (Oelgeschläger, Nuchprayoon, Lüscher, Friedman 1996, https://doi.org/10.1128/mcb.16.9.4717). We revised the manuscript text on pages 13, 15 and 21 accordingly.

      Reviewer #1 (Significance (Required)):

      Overall, this is a strong, well-written study, with clear experimental results and relatively straightforward conclusions. The therapeutic potential of modulating MYB in cancer is enormous, and hence I believe this study will attract a broad interest in the cancer field and will likely be highly cited.

      Response: We appreciate this sentiment and completely agree with the reviewer. The phenomenon reported in this work represents the first of its kind demonstration of the aberrant organization of transcription factor control complexes in cancer, and its pharmacologic modulation. We believe that this concept will serve as a transformative paradigm for understanding oncogenic gene control and the development of effective therapies for its definitive treatment.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript reports the generation of a new and improved peptide mimetic inhibitor of the interaction between MYB and CBP/P300. The original MYBMIM inhibitor of this interaction, reported recently by the same laboratory, was modified by addition and substitution of peptide sequences from CREB, thus improving the affinity of the resulting CRYBMIM peptide to CBP/P300. The improved inhibitor profile results in increased anti-AML efficacy of CRYBMIM over MYBMIM. The authors go on to examine the mechanism underlying the anti-AML activity of CRYBMIM by integrating gene expression analysis, chromatin immunoprecipitation sequencing and mass spectrometric protein complex identification in human AML cells. I have some minor questions the authors may wish to comment on:

      1) The relocation of MYB, along with CBP/P300, to genes controlling myeloid differentiation (clusters 4 and 9) upon CRYBMIM treatment is reminiscent of the increased binding of MYB to myeloid pro-differentiation genes in AML cells following RUVBL2 silencing, recently reported in Armenteros-Monterroso et al. 2019 Leukemia 33:2817. Do the authors know if there is any overlap between genes in either of the clusters and the list reported in the latter study?

      Response: We thank the reviewer for making this suggestion. We also observe both RUVBL2 and RUVBL1 in the protein complex specifically associated with MYB (Fig. 7A and B). We compared the gene expression changes induced by CRYBMIM with those reported by Armenteros-Monterroso et al in 2019 (https://doi.org/10.1038/s41375-019-0495-8), and found that 37% of upregulated genes by RUVBL2 silencing were shared with genes induced by CRYBMIM treatment. In addition, upregulated genes in cluster 4 and 9 included myeloid differentiation-related genes, such as JUN, FOS and FOSB, which were also induced RUVBL2 silencing. We revised the manuscript to reflect this association on page 12.

      2) Could the authors comment on a possible mechanism to explain the co-localization of MYB and CBP/P300 to the loci in clusters 4 and 9 following CRYBMIM treatment? Is it possible that CBP/P300 is recruited by other transcription factors to these loci, independently of binding to MYB? Or is the binding of CBP/P300 to MYB at these loci somehow more resistant to disruption by CRYBMIM?

      Response: The reviewer has focused on an interesting point. At least for cluster 9, these genes exhibit gain of CBP/P300 in association with RUNX1 (Figure 12A), which we confirm by direct biochemical studies of MYB and CBP/P300 complexes immunoprecipitated from AML cells (revised Figure 14B-C). These experiments show that CRYBMIM treatment disrupts the MYB:CBP/P300 complexes, leading to the increased assembly of CBP/P300 with RUNX1. These findings are consistent with a dynamic competition mechanism that governs availability of CBP/P300 to transcriptional co-activation, in which distinct transcription factors compete for limiting amounts of CBP/P300. This possible mechanism is discussed in the revised manuscript (page 18-19 and 21).

      3) In the first paragraph of page 9, the text states: "Previously, we found that MYBMIM can suppress MYB:CBP/P300-dependent gene expression, leading to AML cell apoptosis that required MYB-mediated suppression of BCL2 (Ramaswamy et al., 2018)." I think this is a typo, since in this study, MYBMIM treatment results in loss of MYB binding to the BCL2 gene and consequent reduction in BCL2 expression. Do the authors mean 'MYBMIM-mediated suppression of BCl2' or 'loss of MYB-mediated activation of BCL2'?

      Response: We thank the reviewer and have corrected this typographic error in the text.

      4) The authors explain the failure of excess CREBMIM to displace CBP/P300 from immobilised CREBMIM (Figure 1E-F) by the nature of the CREB:CBP/P300 interaction. Does this imply that CREBMIM is unable to disrupt the interaction between CREB and CBP/P300 in living cells and that the CBP/P300 purified from native MV4;11 lysates by immobilised CREBMIM was from a pool not associated with CREB?

      Response: We thank the reviewer for making this point. Indeed, we reproducibly observe that CRYBMIM binding to CBP can be competed with excess free CRYBMIM, but CREBMIM binding cannot be competed by excess CREBMIM. This may be due to the different stabilities of the CBP complexes that are available for binding in cells. Alternatively, it is also possible that CREB binding to CBP, as reflected by CREBMIM, has a relatively slow dissociation rate, as compared to MYB, as reflected by CRYBMIM. We have begun to purify cellular CBP complexes (revised Fig 8. and response to comment 2 for Reviewer 1), and aim to define their determinants in future studies, as enabled by the introduction of CRYBMIM, CREBMIM and MLLMIM probes in the current work.

      Reviewer #2 (Significance (Required)):

      Based on this integrative analysis, the authors propose a convincing hypothesis, involving the assembly of aberrant transcription factor complexes and sequestration of P300/CBP from genes involved in normal myeloid development, for the oncogenic activity of MYB in AML. As well as the obvious therapeutic potential of the CRYBMIM inhibitor itself, the data reported here reveal multiple avenues for future investigation into novel anti-AML therapeutic strategies. This is an innovative and important study.

      This study will be of interest to scientists and clinicians involved in leukaemia research as well as cancer biology in general.

      My field of expertise: leukaemia biology, leukaemia models, aberrant transcription factor activity in leukaemia

      Response: We appreciate and agree with this assessment.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes an improved MYB-mimetic peptide (cf the group's earlier work published in Nature Communications, 2018) and its effects on AML cell lines. It also describes - and this constitutes the majority of the paper - the dynamics of chromatin occupancy by MYB and other associated transcription factors upon disruption of the MYB-CBP/P300 interaction. The authors suggest this represents a shift from an oncogenic program to a myeloid differentiation program. \*Major comments:***

      Regarding the improved affinity, and biological activity, of CRYBMIM:

      1.Improved affinity of CRYBMIM cf MYBMIM: clearly, it is improved, but not by a lot. By MST the increased affinity is about 3x. In terms of effects on AML cell viability: there is no direct comparison, and this should be included. In the group's previous paper there is no direct estimate for MYBMIM but it looks like the IC50 is between 10 and 20 micromolar so the effect is again around 2.5 fold. Also, the effects of the amino acid substitutions in CG3 are also very small (2.4x) given that 3 critical residues are altered. This is quite concerning.

      Response: As pointed out by the reviewer, CRYBMIM exhibits several fold increase in binding affinity, as measured using purified proteins in vitro. Similar increase in cellular potency is observed after short-term treatment of AML cells, as shown in revised Figure 3C, and reproduced below. However, increasing the duration of treatment to several days leads to substantial improvement in apparent cellular potency (Figure 3G). For example, while MYBMIM induces approximately 100-fold reduction in cell viability of MV411 cells, CRYBMIM induces more than 1,000-fold reduction. Similarly, whereas MYBMIM exhibited relatively modest effects on OCIAML3 and SKM1 cells, CRYBMIM induces more than 1,000-fold reduction in cell viability. As we show in the revised manuscript, this appears to be due to the combination of increased biochemical affinity and specific proteolysis of MYB, which cooperate to induce extensive remodeling of MYB transcriptional complexes and gene expression (revised Figure 11). In all, this exemplifies how pharmacologic modulators of protein interactions can achieve significantly improved biological potency from relatively modest affinity effects, a concept that recently has been successfully used to develop a variety of PROTACs that leverage this “event-driven” as opposed to occupancy-driven pharmacology. The manuscript has been revised on page 8 and 18 to clarify this point.

      2.Does CRYBMIM really "spare" normal hematopoietic cells? Not according to Fig 2E, where there is only a 2-fold difference in IC50.

      Response: To better define the relative toxicity of CRYBMIM and MYBMIM, we examined their effects on the growth and survival of normal hematopoietic progenitor cells as compared to AML cells using colony forming assays in methylcellulose under more physiologic conditions in the presence of human hematopoietic cytokines (revised Figure 3E, and reproduced below). While CRYBMIM significantly reduced the clonogenic capacity, growth and survival of MV411 AML cells, there were no significant effects on the total clonogenic activity of normal CD34+ human umbilical cord blood progenitor cells under these conditions. At the highest dose, CRYBMIM induced modest reduction in CFU-MG colony formation, and modest increase in BFU-E colony formation of normal hematopoietic progenitor cells. We revised the manuscript to indicate that CRYBMIM “relatively spares” normal blood progenitor cells on page 8.

      Response: We appreciate the attention to this issue. In the original manuscript, we showed dose-response curves of cord blood progenitor cells cultured in suspension supplemented with fetal bovine serum, a system that is known to induce in appropriate hematopoietic cell differentiation (https://doi.org/10.1016/j.molmed.2017.07.003). In the revised manuscript, we show results of colony formation assays of hematopoietic progenitor cells cultured in serum-free, semi-solid conditions supplemented with human hematopoietic cytokines (revised Figure 3E and 3F). This is a more physiologic system which more faithfully maintains normal hematopoietic cell differentiation, as compared to the cellular differentiation induced by fetal bovine serum-containing media lacking hematopoietic growth factors, as used in the experiments in our original manuscript. To establish a positive control, in addition to treating AML cells under the same condition, we used doxorubicin, which is part of current treatment of patients with AML, and which in our experiments, exhibits significant and pronounced reduction in the clonogenic capacity, growth and survival of normal blood progenitor cells (revised Figure S3B). The manuscript has been revised on page 8 accordingly.

      1. Fig 2F doesn't include any lines that express very low or undetectable levels of MYB. Some of these should be included to further examine specificity.

      Response: We have now tested CRYBMIM against a large panel of non-hematopoietic tumor and non-tumor cell lines, with varying degrees of MYB expression. Some of those cells exhibit high level of MYB gene expression and MYB genetic dependency, which is at least in part correlated with susceptibility to CRYBMIM. (revised Figure S4, and reproduced below). The manuscript has been revised on page 8 accordingly.

      Effects on gene expression and MYB binding:

      Data on MYB target gene expression and apoptosis/differentiation, and the conclusions drawn per se are sound, but:

      5.Fig S3 seems to show that MYB protein is lost on treatment with CRYBMIM. This isn't even mentioned in the text but raises a whole range of major questions eg why is this the case? Is this what is responsible for the loss of MYB-p300 interaction and/or biological effects on AML cells? Is this what is responsible for the effects on MYB target gene expression in Fig 3 and MYB binding to chromatin in Fig 4? This must be addressed.

      Response: We have revised the manuscript to include this discussion, and performed additional experiments to define this phenomenon. We confirmed rapid reduction in MYB protein levels upon CRYBMIM treatment on the time-scale of one to four hours in diverse AML cell lines (revised Figure 11), with the rate of MYB protein loss correlating to the cellular susceptibility to CRYBMIM (revised Figure 11, and reproduced below). The manuscript has been revised on page 18 accordingly.

      This is consistent with the specific proteolysis of MYB induced by the peptidomimetic remodeling of the MYB:CBP/P300 complex. We confirmed this by combined treatment with the proteosomal/protease inhibitor MG132 (revised Figure 11C, and reproduced below). This effect was specific because overexpression of BCL2, which blocks MYBMIM-induced apoptosis (Ramaswamy et al, Kentsis, https://doi.org/10.1038/s41467-017-02618-6), was unable to rescue CRYBMIM-induced proteolysis of MYB, arguing that MYB proteolysis is a specific effect of CRYBMIM rather than a non-specific consequence of apoptosis. The manuscript has been revised on page 18 accordingly.

      6.Fig 4 and the accompanying text are a bit hard to follow, but if I understood them correctly, I am surprised that the "gained MYB peaks" don't include the MYB binding motif itself? This at least deserves some comment. Also, there doesn't seem to have been any attempt to integrate the ChIP-Seq data with the expression data of Fig 3. This would provide clearer insights into the identities and types of MYB-regulated genes that are directly affected by suppression of CBP/p300 binding to MYB.

      Response: We thank the reviewer for this suggestion. The revised manuscript now includes a comprehensive and integrated analysis of chromatin and gene expression dynamics (revised Figures 13A and 13B). In contrast to the model in which blockade of MYB:CBP/P300 induces loss of gene expression and loss of transcription factor and CBP/P300 chromatin occupancy, we also observed a large number of genes with increased expression and gain of CBP/P300 occupancy (revised Figure 13A-B, and reproduced below). This includes numerous genes that control hematopoietic differentiation, such as FOS, JUN, and ATF3. As a representative example, in the case of FOS, we observed that CRYBMIM-induced accumulation of CBP/P300 was associated with increased binding of RUNX1, and eviction of CEBPA and LYL1 (revised Figure 13C). Thus, the absence of “gained MYB peaks” is due to the redistribution of CBP/P300 with alternative transcription factors, such as RUNX1. In all, these results support the model in which the core regulatory circuitry of AML cells is organized aberrantly by MYB and its associated co-factors including LYL1, CEBPA, E2A, SATB1 and LMO2, which co-operate in the induction and maintenance of oncogenic gene expression, as co-opted by distinct oncogenes in biologically diverse subtypes of AML (revised Figure 14). This involves apparent sequestration of CBP/P300 from genes controlling myeloid cell differentiation. Thus, oncogenic gene expression is associated with the assembly of aberrantly organized MYB transcriptional co-activator complexes, and their dynamic remodeling by selective blockade of protein interactions can induce AML cell differentiation. The manuscript has been revised on page 20-21 accordingly.

      7.The MS studies on MYB-interacting proteins seem very interesting and novel. I am not an expert on MS, though, so I'd suggest this section be reviewed by someone who is. Moreover, I was unable to see the actual data from this study because the material I was provided with didn't include Table S4 and S5.

      Response: We appreciate this point. For this reason, we have deposited all of our mass spectrometry data to be openly available via PRIDE (accession number PXD019708), and also openly provide all of the analyzed data via Zenodo (https://doi.org/10.5281/zenodo.4321824), as additionally provided in the Supplementary Material for this manuscript.

      \Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?* 8.Claims regarding biological activity, specificity and improvements cf MYBMIM should be moderated given the small size of these effects as mentioned above (points 1 and 3).*

      Response: As explained in detail in response to comments 1-3 above (page 12-14 of this response), we have substantially revised the manuscript to incorporate both new experimental results and additional explanations (pages 6-8).

      9.I found the description of the studies related to Figs 5 and 6 somewhat difficult to follow and convoluted. While changes in MYB and CBP/p300 chromatin occupancy clearly occur on M CRYBMIM treatment, it is not clear that the complexes seen on genes prior to treatment represent "aberrant" complexes. These may just be characteristic of undifferentiated (myeloid) cells. The authors appear to argue that because some of the candidate co-factors show "apparently aberrant expression in AML cells" based on comparison of (presumably mRNA) expression data with normal cells, the presence of these factors in the complexes make them "aberrant" (moreover, the "aberrancy score" of Fig 5 C is not defined anywhere, as far as I can see). This inference is drawing a rather long bow, given that the AML-specific factors may not actually be absent from the complexes in normal cells. So this conclusion should be moderated if a more direct MS comparison cannot be provided (for which I understand the technical difficulties).

      Response: We have now measured protein abundance levels of key transcription factors assembled with MYB in AML cells in various normal human hematopoietic cells (revised Figure 9, and reproduced below). We found that most transcription factors that are assembled with MYB in diverse AML cell lines could be detected in one or more normal human blood cells, albeit with variable abundance, with the exception of CEBPA and SATB1 that were measurably expressed exclusively in AML cells (revised Figure 9A). Using unsupervised clustering and principal component analysis, we defined the combinations of transcription factors that are associated with aberrant functions of MYB:CBP/P300, as defined by their susceptibility to peptidomimetic remodeling (revised Figure 9B-D). In addition, we directly examined the physical assembly of MYB with key transcription factors in normal hematopoietic cells using co-immunoprecipitation studies (revised Figure 9E). In agreement with the physical association of MYB seen in AML cell lines, we observed association with CBP/P300 and LYL1 in normal hematopoietic cells. However, we did not observe physical association with E2A and SATB1 in normal cells, which indicates aberrant association of these in AML cell lines. This leads us to propose that these complexes are aberrantly assembled, at least in part due to the inappropriate transcription factor co-expression. The manuscript has been revised on page 15 accordingly.

      \Would additional experiments be essential to support the claims of the paper?*

      Response: As explained in detail in response to comment 5 above (page 16 of this response), we have carried out extensive studies of the specific proteolysis of MYB. We conclude that MYB transcription complexes are regulated both by MYB:CBP/P300 binding and by specific factor proteolysis, and can be induced by its peptidomimetic blockade in AML cells. Such “event-driven” pharmacology is emerging as a powerful tool to modulate protein function in cells, and studies reported in our work should enable its translation into improved therapies for patients, and improved probes for basic science.

      11.Provision of a positive control for the experiment of Fig S2.

      Response: As explained in detail in response to comment 2 above (page 13-14 of this response), we precisely defined the effects of CRYBMIM and MYBMIM on the clonogenic capacity, growth and survival of normal hematopoietic progenitor cells in serum-free, methylcellulose media supplemented with human hematopoietic cytokines. These experiments showed relatively modest effects (9.3 ± 3.8% reduction) of CRYBMIM on normal cells (Figure 3E), as compared to substantial inhibition (54 ± 2.4 % reduction) of the growth and survival of AML cells (Figures 3E). For comparison, doxorubicin led to more than 98 % reduction in clonogenic capacity (revised Figure S3B).

      12.\Are the data and the methods presented in such a way that they can be reproduced?**

      -Mostly yes

      Response: The revised manuscript includes a complete description of all methods, including a detailed supplement, listing technical details, with all analyzed data available openly via Zenodo (https://doi.org/10.5281/zenodo.4321824).

      13.\Are the experiments adequately replicated and statistical analysis adequate?**

      -Mostly yes

      Response: All experiments were performed in at least three replicates, with all quantitative comparisons performed using appropriate statistical tests, as explained in the manuscript.

      **Minor comments:**

      *Specific experimental issues that are easily addressable.*

      -These are mostly indicated above.

      In addition:

      14.Why is BCL2 expression down-regulated by MYBMIM but not CRYMYB?

      Response: We made the same observation, and attribute this difference to the fact that BCL2 expression is regulated by several transcription factors, including CEBPA, which is affected by CRYBMIM but not MYBMIM. Similar to MYBMIM treatment, MYB occupancy at the BCL2 enhancer was reduced upon CRYBMIM treatment. However, new binding sites of other factors, such as CBP/P300 and RUNX1, appeared simultaneously, suggesting that redistribution of transcription factors following CRYBMIM treatment can affect transcriptional regulation of BCL2 expression (revised Figure S9 and shown below).

      *Are prior studies referenced appropriately?

      -Yes *Are the text and figures clear and accurate?*

      15.Generally, although some details are missing eg what aberrancy score in Fig 5C means.

      Response: Thank you for pointing this out. We have revised this figure to clarify this score, which is defined as the ratio of gene expression in AML cells relative to normal hematopoietic progenitor cells (revised Figure 7C).

      16.\Do you have suggestions that would help the authors improve the presentation of their data and conclusions?**

      -The title of this manuscript could and I think should be changed. The term "therapeutic", is not appropriate because no therapeutic agents are described in the m/s nor is any form of AML, even experimentally, treated. Also "CBP" should be replaced with CBP/P300, especially since most evidence suggests that P300 is the likely more important partner of MYB (eg Zhao et al 2011

      Response: We agree and have revised the title to clarify the significance of this work: “Convergent organization of aberrant MYB complexes controls oncogenic gene expression in acute myeloid leukemia.” We have revised the manuscript to indicate CBP/P300.

      17.-It would be worth discussing the core observation that disruption of the MYB-CBP/P300 interaction actually results in changes in MYB DNA binding. That this would occur is not at all obvious, because CBP/p300 doesn't interact with MYB's DNA binding domain nor does it have intrinsic DNA binding activity.

      Response: We thank the reviewer for this comment, and agree that remodeling of the MYB complex must affect the binding of MYB and other cofactors to DNA, at least in part mediated by potential acetylation by CBP/P300 (page 24).

      Reviewer #3 (Significance (Required)):

      **The Nature and Significance of the Advance**

      1) The major significance of this work lies in the chromatin occupancy and MYB complex studies. There are a number of very interesting findings including the apparent redistribution of MYB and/or CBP/P300 upon treatment with CRYBMIM. These suggest a series of changes in factors associated with particular gene sets involved in myeloid differentiation, although as mentioned above particular target genes are not specifically identified. However the pathways corresponding to these are listed in Table S6.

      Response: We have revised the manuscript to include the target genes in revised Supplemental Table 4 as well as DESeq2 tables (deposited in Zenodo, https://doi.org/10.5281/zenodo.4321824).

      2) The new peptide design (CRYBMIM) is interesting but its differences in binding and biological effects of MYBMIM are mostly incremental. See above.

      Response: We respectfully disagree and would like to explain how this work is significant both for conceptual and technical reasons. First, while the biochemical affinity of CRYBMIM is quantitatively increased compared with MYBMIM, this quantitatively increased affinity translates into qualitatively improved biological potency, as a result of “event-driven” pharmacology that characterizes pharmacologic protein interaction modulators (please also see response to Reviewer 3, comment 1, page 6 of this response). MYBMIM suppresses the growth and survival mostly of MLL-rearranged leukemias, whereas CRYBMIM does so for the vast majority (10 out of 11) of studied subtypes of AML. This now enables its therapeutic translation, as we are currently pursuing in collaboration with Novartis. Second, its improved biological activity led to the discovery of the previously unknown and unanticipated CBP/P300 sequestration mechanism of oncogenic gene control. We use this discovery to develop a precise model of aberrant gene control in AML that for the first time unifies previously disparate observations into a general mechanism. This is highly significant because it provides shared molecular dependencies for most subtypes of AML, a long-standing conundrum in cancer biology.

      *Place the work in the context of the existing literature (provide references, where appropriate).*

      -This m/s builds on and extends the report from the same group in Nature Communications (2018), which described the earlier peptide MYBMIM, some effects on MYB target genes and on AML cells. It and the previous paper also draw on the findings regarding the role of the MYB-CBP/P300 interaction in myeloid leukemogenesis (Pattabirman et al 2014) and on previous genome-wide studies of MYB target genes (Zhoa et al 2011; Zuber et al 2011).

      *State what audience might be interested in and influenced by the reported findings.*

      -This m/s will likely be of interest to scientists interested in MYB per se, in AML, in cancer genomics and transcriptional regulation.

      *Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.* -My expertise: AML, experimental hematology, transcription, MYB, cancer genomics

      3) As mentioned above, I feel that additional expertise is required to review the MS studies.

      Response: We have deposited all raw data in PRIDE (accession number PXD019708) and all processed data in Zenodo (https://doi.org/10.5281/zenodo.4321824), making it available for the community for further analysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This manuscript describes an improved MYB-mimetic peptide (cf the group's earlier work published in Nature Communications, 2018) and its effects on AML cell lines. It also describes - and this constitutes the majority of the paper - the dynamics of chromatin occupancy by MYB and other associated transcription factors upon disruption of the MYB-CBP/P300 interaction. The authors suggest this represents a shift from an oncogenic program to a myeloid differentiation program.

      Major comments:

      Regarding the improved affinity, and biological activity, of CRYBMIM:

      1.Improved affinity of CRYBMIM cf MYBMIM: clearly, it is improved, but not by a lot. By MST the increased affinity is about 3x. In terms of effects on AML cell viability: there is no direct comparison, and this should be included. In the group's previous paper there is no direct estimate for MYBMIM but it looks like the IC50 is between 10 and 20 micromolar so the fecct is again around 2.5 fold. Also, the effects of the amino acid substitutions in CG3 are also very small (2.4x) given that 3 critical residues are altered. This is quite concerning.

      2.Does CRYBMIM really "spare" normal hematopoietic cells? Not according to Fig 2E, where there is only a 2-fold difference in IC50.

      3.Fig 2E and Supp Fig S2 appear to be contradictory. The latter shows no effect of 20micromolar CRYBMIM on colony formation by normal CD34+ cells, in complete contrast to killing with IC50 of 12.8 micromolar in Fig 2E. There is no +ve control for Fig S2 ie does the peptide work under colony assay conditions? This MUST be addressed.

      4.Fig 2F doesn't include any lines that express very low or undetectable levels of MYB. Some of these should be included to further examine specificity.2

      Effects on gene expression and MYB binding:

      Data on MYB target gene expression and apoptosis/differentiation, and the conclusions drawn per se are sound, but:

      5.Fig S3 seems to show that MYB protein is lost on treatment with CRYBMIM. This isn't even mentioned in the text but raises a whole range of major questions eg why is this the case? Is this what is responsible for the loss of MYB-p300 interaction and/or biological effects on AML cells? Is this what is responsible for the effects on MYB target gene expression in Fig 3 and MYB binding to chromatin in Fig 4? This must be addressed.

      6.Fig 4 and the accompanying text are a bit hard to follow, but if I understood them correctly, I am surprised that the "gained MYB peaks" don't include the MYB binding motif itself? This at least deserves some comment. Also, there doesn't seem to have been any attempt to integrate the ChIP-Seq data with the expression data of Fig 3. This would provide clearer insights into the identities and types of MYB-regulated genes that are directly affected by suppression of CBP/p300 binding to MYB.

      7.The MS studies on MYB-interacting proteins seem very interesting and novel. I am not an expert on MS, though, so I'd suggest this section be reviewed by someone who is. Moreover, I was unable to see the actual data from this study because the material I was provided with didn't include Table S4 and S5.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      8.Claims regarding biological activity, specificity and improvements cf MYBMIM should be moderated given the small size of these effects as mentioned above (points 1 and 3).

      9.I found the description of the studies related to Figs 5 and 6 somewhat difficult to follow and convoluted. While changes in MYB and CBP/p300 chromatin occupancy clearly occur on M CRYBMIM treatment, it is not clear that the complexes seen on genes prior to treatment represent "aberrant" complexes. These may just be characteristic of undifferentiated (myeloid) cells. The authors appear to argue that because some of the candidate co-factors show "apparently aberrant expression in AML cells" based on comparison of (presumably mRNA) expression data with normal cells, the presence of these factors in the complexes make them "aberrant" (moreover, the "aberrancy score" of Fig 5 C is not defined anywhere, as far as I can see). This inference is drawing a rather long bow, given that the AML-specific factors may not actually be absent from the complexes in normal cells. So this conclusion should be moderated if a more direct MS comparison cannot be provided (for which I understand the technical difficulties).

      Would additional experiments be essential to support the claims of the paper?

      1. Address the issue of the apparent loss of MYB protein upon CRYBMIM treatment. If this is occurring, the whole premise of the subsequent work is undermined.

      12.Provision of a positive control for the experiment of Fig S2.

      Are the data and the methods presented in such a way that they can be reproduced?

      -Mostly yes

      Are the experiments adequately replicated and statistical analysis adequate?

      -Mostly yes

      Minor comments:

      Specific experimental issues that are easily addressable. -These are mostly indicated above.

      In addition: oWhy is BCL2 expression down-regulated by MYBMIM but not CRYMYB?

      *Are prior studies referenced appropriately?

      -Yes

      Are the text and figures clear and accurate?

      -Generally, although some details are missing eg what aberrancy score in Fig 5C means.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      -The title of this manuscript could and I think should be changed. The term "therapeutic", is not appropriate because no therapeutic agents are described in the m/s nor is any form of AML, even experimentally, treated. Also "CBP" should be replaced with CBP/P300, especially since most evidence suggests that P300 is the likely more important partner of MYB (eg Zhao et al 2011

      -It would be worth discussing the core observation that disruption of the MYB-CBP/P300 interaction actually results in changes in MYB DNA binding. That this would occur is not at all obvious, because CBP/p300 doesn't interact with MYB's DNA binding domain nor does it have intrinsic DNA binding activity.

      Significance

      The Nature and Significance of the Advance

      -The major significance of this work lies in the chromatin occupancy and MYB complex studies. There are a number of very interesting findings including the apparent redistribution of MYB and/or CBP/P300 upon treatment with CRYBMIM. These suggest a series of changes in factors associated with particular gene sets involved in myeloid differentiation, although as mentioned above particular target genes are not specifically identified. However the pathways corresponding to these are listed in Table S6.

      -The new peptide design (CRYBMIM) is interesting but its differences in binding and biological effects cf MYBMIM are mostly incremental. See above.

      Place the work in the context of the existing literature (provide references, where appropriate).

      -This m/s builds on and extends the report from the same group in Nature Communications (2018), which described the earlier peptide MYBMIM, some effects on MYB target genes and on AML cells. It and the previous paper also draw on the findings regarding the role of the MYB-CBP/P300 interaction in myeloid leukemogenesis (Pattabirman et al 2014) and on previous genome-wide studies of MYB target genes (Zhoa et al 2011; Zuber et al 2011).

      State what audience might be interested in and influenced by the reported findings.

      -This m/s will likely be of interest to scientists interested in MYB per se, in AML, in cancer genomics and transcriptional regulation.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      -My expertise: AML, experimental hematology, transcription, MYB, cancer genomics

      -As mentioned above, I feel that additional expertise is required to review the MS studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript reports the generation of a new and improved peptide mimetic inhibitor of the interaction between MYB and CBP/P300. The original MYBMIM inhibitor of this interaction, reported recently by the same laboratory, was modified by addition and substitution of peptide sequences from CREB, thus improving the affinity of the resulting CRYBMIM peptide to CBP/P300. The improved inhibitor profile results in increased anti-AML efficacy of CRYBMIM over MYBMIM. The authors go on to examine the mechanism underlying the anti-AML activity of CRYBMIM by integrating gene expression analysis, chromatin immunoprecipitation sequencing and mass spectrometric protein complex identification in human AML cells.

      I have some minor questions the authors may wish to comment on:

      1) The relocation of MYB, along with CBP/P300, to genes controlling myeloid differentiation (clusters 4 and 9) upon CRYBMIM treatment is reminiscent of the increased binding of MYB to myeloid pro-differentiation genes in AML cells following RUVBL2 silencing, recently reported in Armenteros-Monterroso et al. 2019 Leukemia 33:2817. Do the authors know if there is any overlap between genes in either of the clusters and the list reported in the latter study?

      2) Could the authors comment on a possible mechanism to explain the co-localization of MYB and CBP/P300 to the loci in clusters 4 and 9 following CRYBMIM treatment? Is it possible that CBP/P300 is recruited by other transcription factors to these loci, independently of binding to MYB? Or is the binding of CBP/P300 to MYB at these loci somehow more resistant to disruption by CRYBMIM?

      3) In the first paragraph of page 9, the text states: "Previously, we found that MYBMIM can suppress MYB:CBP/P300-dependent gene expression, leading to AML cell apoptosis that required MYB-mediated suppression of BCL2 (Ramaswamy et al., 2018)." I think this is a typo, since in this study, MYBMIM treatment results in loss of MYB binding to the BCL2 gene and consequent reduction in BCL2 expression. Do the authors mean 'MYBMIM-mediated suppression of BCl2' or 'loss of MYB-mediated activation of BCL2'?

      4) The authors explain the failure of excess CREBMIM to displace CBP/P300 from immobilised CREBMIM (Figure 1E-F) by the nature of the CREB:CBP/P300 interaction. Does this imply that CREBMIM is unable to disrupt the interaction between CREB and CBP/P300 in living cells and that the CBP/P300 purified from native MV4;11 lysates by immobilised CREBMIM was from a pool not associated with CREB?

      Significance

      Based on this integrative analysis, the authors propose a convincing hypothesis, involving the assembly of aberrant transcription factor complexes and sequestration of P300/CBP from genes involved in normal myeloid development, for the oncogenic activity of MYB in AML. As well as the obvious therapeutic potential of the CRYBMIM inhibitor itself, the data reported here reveal multiple avenues for future investigation into novel anti-AML therapeutic strategies. This is an innovative and important study.

      This study will be of interest to scientists and clinicians involved in leukaemia research as well as cancer biology in general.

      My field of expertise: leukaemia biology, leukaemia models, aberrant transcription factor activity in leukaemia

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Forbes et al describes and characterizes a 2nd generation peptide-based inhibitor of the MYB:CBP interaction, termed CRYBMIM, which they use to study MYB:cofactor interactions in leukemia cells. The CRYBMIM has improved properties relative to the MYBMIM peptide, and display more potency in biochemical and cell-based assays. Using a combination of epigenomics and biochemical screens, the authors define a list of candidate MYB cofactors whose functional significance as AML dependencies is supported by analysis of the DepMap database. Using genomewide profiling of TF and CBP occupancy, the authors provide evidence that CRYBMIM treatment reprograms the interactome of MYB in a manner that disproportionately changes specific cis-elements over others. Stated differently, the overall occupancy pattern of many TFs/cofactors shows gains and losses at specific cis elements, resulting in a complex modulation of MYB function and changes in transcription in leukemia cells.

      Overall, this is a strong, well-written study, with clear experimental results and relatively straightforward conclusions. The therapeutic potential of modulating MYB in cancer is enormous, and hence I believe this study will attract a broad interest in the cancer field and will likely be highly cited. I list below a few control experiments that would clarify the specificity of CRYBMIM.

      1) Does CRYBMIM bind to other KIX domains, such as of MED15. It would be important to evaluate the specificity of this peptide for whether it binds to other KIX domains.

      2) Similarly, it would be useful to perform a mass spec analysis to all nuclear factors that associate with streptavidin-immobilized CRYBMIM. This again would be help the reader to understand the specificity of this peptide.

      The major limitation of this study which modestly lessens my enthusiasm of this work is that the mechanistic model of MYB-sequestered TFs proposed here is based on a face-value interpretation of IP-MS data coupled with ChIP-seq data. Normally, I would expect such a mechanism to be supported with some additional focused biochemical experiments of specific interactions, to complement all of the omics approaches. For example, can the authors evaluate and/or validate further how MYB physically interacts with LYL1, CEBPA, SPI1, or RUNX1. Are these interactions direct or indirect? Which domains of these proteins are involved? Does CRYBMIM treatment modulate the ability of these proteins to associate with one another in a co-IP? Do these interactions occur in normal hematopoietic cells? A claim is made throughout this study that these are aberrant TF complexes, but I believe more evidence is required to support this claim.

      Significance

      Overall, this is a strong, well-written study, with clear experimental results and relatively straightforward conclusions. The therapeutic potential of modulating MYB in cancer is enormous, and hence I believe this study will attract a broad interest in the cancer field and will likely be highly cited.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 The authors study allostery with a beautiful genotype-phenotype experiment to study the fitness landscape of an allosteric lac repressor protein. The authors make a mutational library using error prone pcr and measure the impact on antibiotic resistance protein expression at varying levels of ligand, IPTG, expression. After measuring the impact of mutations authors fill-in the missing data using a neural net model. This type of dose response is not standard in the field, but the richness of their data and the discovery of the "band pass" phenomena prove its worth here splendidly. Using this mixed experimental/predicted data the authors explore how each mutation alters the different parameters of a hill equation fit of a dose response curve. Using higher order mutational space the authors look at how mutations can qualitatively switch phenotypes to inverted or band-stop dose-response curves. To validate and further explore a band-stop novel phenotype, the authors focused on a triple mutant and made all combinations of the 3 mutations. The authors find that only one mutation alone alters the dose-response and only in combination does a band-stop behavior present itself. Overall this paper is a fantastic data heavy dive into the allosteric fitness landscape of protein. Overall, the data presented in this paper is thoroughly collected and analyzed making the conclusions well-based. We do not think additional experiments nor substantial changes are needed apart from including basic experimental details and more biophysical rationale/speculation as discussed in further detail below.

      The authors do a genotype-phenotype experiment that requires extensive deep sequencing experiments. However, right now quite a bit of basic statistics on the sequencing is missing. Baseline library quality is somewhat shown in supplementary fig 2 but the figure is hard to interpret. It would be good to have a table that states how many of all possible mutations at different mutation depths (single, double, etc) there are. Similarly, sequencing statistics are missing- it would be useful to know how many reads were acquired and how much sequencing depth that corresponds to. This is particularly important for barcode assignment to phenotype in the long-read sequencing. In addition, a synonymous mutation comparison is mentioned but in my reading that data is not presented in the supplemental figures section.

      We thank the reviewer for this succinct summary of the manuscript and the results. We appreciate the reviewer identifying data of interest that were not included in the original manuscript. We agree that this information is necessary to consider the results. Specific changes are summarized in the comments below.

      The paper is very much written from an "old school" allostery perspective with static end point structures that are mutually exclusive - eg. p5l10 "relative ligand-binding affinity between the two conformations" - however, an ensemble of conformations is likely needed to explain their data. This is especially true for the bandpass and inverted phenotypes they observe. The work by Hilser et al is of particular importance in this area. We would invite the authors to speculate more freely about the molecular origins of their findings.

      We agree with the suggestions to adopt a modern allosteric perspective. We have changed the language throughout the manuscript to align with the ensemble model of allostery. We continue to frame results using the Monad-Wyman-Changeaux model, which reliably predicts LacI activity from biophysical parameters and is not exclusive of more modern models of allostery.

      **Minor** There are a number of small modifications. In general this paper is very technical and could use with some explanation and discussion for relevance to make the manuscript more approachable for a broader audience. P1L23: Ligand binding at one site causes a conformational change that affects the activity of another > not necessarily true - and related to using more "modern" statistical mechanical language for describing allostery.

      We agree with the reviewer’s comment. We have addressed this comment by adopting language in line with more modern view of allostery, for example:

      “With allosteric regulation, ligand binding at one site on a biomolecule changes the activity of another, often distal, site. Switching between active and inactive states provides a sense-and-response function that defines the allosteric phenotype.”

      P2L20: The core experiment of this paper is a selection using a mutational library. In the main body the authors mention the library was created using mutagenic pcr but leave it at that. More details on what sort of mutagenic pcr was used in the main body would be useful. According to the methods error prone pcr was used. Why use er-pcr vs deep point mutational libraries? Presumably to sample higher order phenotype? Rationale should be included. Were there preliminary experiments that helped calibrate the mutation level?

      We agree that justifying the decision to use error-prone PCR for library construction would be helpful. To explain this decision, we have added to the main text to explain this decision and to reflect on the consequences.

      “We used error-prone PCR across the full lacI CDS to investigate the effects of higher-order substitutions spread across the entire LacI sequence and structure.”

      And

      Novel phenotypes emerged at mutational distances greater than one amino acid substitution, highlighting the value in sampling a broader genotype space with higher-order mutations. Furthermore, the untargeted, random mutagenesis approach used here was critical for finding these novel phenotypes, as the genotypes required for these novel phenotypes were unpredictable.”

      P2L20: Baseline library statistics would be great in a table for coverage, diversity, etc especially as this was done by error prone pcr vs a more saturated library generation method. This is present in sup fig2 but it's a bit complicated.

      To more clearly convey the diversity within the library, we have included a heatmap of amino acid substitution counts found within the library (Supplementary Fig. 4). Additionally, we have added Supplementary Table 1, which lists the distribution of mutational distances of LacI variants found within the library, and the corresponding coverage of all possible mutations for each mutational distance.

      P2L26: How were FACS gates drawn? This is in support fig17 - should be pointed to here.

      We agree that a better description of the FACS process would be helpful. To address this we have included Supplementary Fig. 2, showing flow cytometry measurements of the library before and after FACS. Additionally, we have extended the description of the FACS process:

      “The initial library had a bimodal distribution of G__­0, as indicated by flow cytometry results, with a mode at low fluorescence (near G__­0 of wildtype LacI), and mode at higher gene expression. To generate a library in which most of the LacI variants could function as allosteric repressors, we used fluorescence activated cell sorting (FACS) to select the portion of the library with low fluorescence in the absence of ligand, gating at the bifurcation of the two modes (Sony SH800S Cell Sorter, Supplementary Fig. 2).”

      __

      P3L4: Where is the figure/data for the synonymous SNP mutations? This should be in the supplement.

      We agree this data is necessary to support the claim that LacI function was not impacted by synonymous mutations. We have included a new Supplementary Fig. 9, which shows the distribution of Hill equation parameters for LacI variants that code for the wild-type amino acid sequence, but with non-identical coding DNA sequences. Additionally, we included the results of a statistical analysis in the main text, this analysis compared all synonymous sequences in the library:

      “__We compared the distributions of the resulting Hill equation parameters between two sets of variants: 39 variants with exactly the wild-type coding DNA sequence for LacI (but with different DNA barcodes) and 310 variants with synonymous nucleotide changes (i.e. the wild-type amino acid sequence, but a non-wild-type DNA coding sequence). Using the Kolmogorov-Smirnov test, we found no significant differences between the two sets (p-values of 0.71, 0.40, 0.28, and 0.17 for G0, G∞, EC50, and n respectively, Supplementary Fig. 9).” __

      P3L20: The authors use a ML learning deep neural network to predict variant that were not covered in the screen. However, the library generation method is using error prone pcr meaning there could multiple mutations resulting in the same amino acid change. The models performance was determined by looking at withheld data however error prone pcr could result in multiple nonsynomymous mutations of the same amino acid. For testing were mutations truly withheld or was there overlap? Because several mutations are being represented by different codon combinations. Was the withheld data for the machine learning withholding specific substitutions?

      We thank the reviewer for identifying the need to clarify this critical data analysis. Data was held-out at the amino acid level, and so no overlap between the training and testing datasets occurred. We have clarified the description of the method in the main text:

      “We calculated RMSE using only held-out data not used in the model training, and the split between held-out data and training data was chosen so that all variants with a specific amino acid sequence appear in only one of the two sets.”


      In addition, higher order protein interactions are complicated and idiosyncratic. I am surprised how well the neural net performs on higher order substitutions. P4L4: Authors find mutations at the dimer/tetramer interfaces but don't mention whether polymerization is required. is dimerization required for dna binding? Tetramerization?

      We agree with the reviewer that, overall, a description of LacI structure and function would improve messaging the reported results. As such, we have added Supplementary Table 2, which defines the structural features discussed throughout the manuscript. Additionally, we have strived to describe the relevant structural and functional role of specific amino acids that are discussed in the text. Finally, we have also added a paragraph to the main text that summarizes the structure and function of LacI.

      “The LacI protein has 360 amino acids arranged into three structural domains__22–24__. The first 62 N-terminal amino acids form the DNA-binding domain, comprising a helix-turn-helix DNA-binding motif and a hinge that connects the DNA-binding motif and the core domain. The core domain, comprising amino acid positions 63-324, is divided into two structural subdomains: the N-terminal core and the C-terminal core. The full core domain forms the ligand-binding pocket, core-pivot region, and dimer interface. The tetramerization domain comprises the final 30 amino acids and includes a flexible linker and an 18 amino acid α-helix (Fig. 3, Supplementary Table 2). Naturally, LacI functions as a dimer of dimers: Two LacI monomers form a symmetric dimer that further assembles into a tetramer (a dimer of dimers).”

      P4L8: Substitutions near the dimer interface both impact g0 and ec50, which authors say is consistent with a change in the allosteric constant. Can authors explain their thinking more in the paper to make it easier to follow? Are the any mutations in this area that only impact g0 or ec50 alone? Why may these specific residues modify dimerization?

      We agree that a more in-depth discussion on the possible mechanisms behind these phenotypic changes would improve the manuscript. We have added discussion throughout the subsection “Effects of amino acid substitutions on LacI phenotype,” we believe this added discussion improve the manuscript and clarify the relationship between the observed allosteric phenotypes and the molecular mechanisms behind them. W

      Overall, we have made a number of changes in the manuscript that we hope will address these concerns.

      P4L8: The authors discuss the allosteric constant extensively within the paper but do not explain it. It would be helpful to have an explanation of this to improve readability. This explanation should include the statistical mechanical basis of it and some speculation about the ways it manifests biophysically.

      The allosteric constant is a critical concept, and we agree that it must be defined and discussed clearly throughout the manuscript. We have greatly expanded the discussion of the effects of single amino acid substitutions, and in the process we give examples of biochemical changes in the protein, and how they may affect the allosteric constant. We think this added text improves the manuscript and helps clarify the allosteric constant and the biomolecular processes that affect it.

      P4L1-16: Authors see mutations in the dimerization region that impact either G0 and Gsaturated in combination with Ec50 but not g0 and gsaturated together. Maybe we do not fully understand the hill equation but why are there no mutations that impact both g0 and gsaturated seen in support fig 13c? Why would mutations in the same region potentially impacting dimerization impact either g0 or gsaturated? What might be the mechanism behind divergent responses?

      It is important to recognize that the dimer interface does not just support the formation of dimers. There are many points of contact along the dimer interface that change when LacI switches between the active and inactive states. So, the dimer interface also helps regulate the balance between the active and inactive states. Our results show that different substitutions near the dimer interface can push this balance either toward the active or inactive states to varying degrees. We’ve added text throughout the description of single-substitutions effects to give specific examples and added a new paragraph at the end of that section to provide additional discussion and context. With regard to the more specific question of changes to both G0 and Ginf, the models indicate that simultaneous changes to those Hill Equation parameters requires an unusual combination of biophysical changes. To clarify this point, we added a short paragraph to the text:

      “None of the single amino substitutions measured in the library simultaneously decrease __G∞ and increase G0 (Supplementary Fig. 20c). This is not surprising, since substitutions that shift the biophysics to favor the active state tend to decrease G∞ while those that favor the inactive state tend to increase G0, and the biophysical models2,14,15 indicate that only a combination of parameter changes can cause both modifications to the dose-response. The library did, however, contain several multi-substitution variants with simultaneously decrease __G∞ and increase G0. These inverted variants, and their associated substitutions are discussed below.”


      P4L29: for interpretability it would be good to explain what log-additive effect means in the context of allostery.

      We agree that this information would be useful to the reader and have added additional text to explain log-additivity. We thank the reviewer for pointing out this oversight.

      “Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50. That is, the proportional effects of two individual amino acid substitutions on the EC50 can be multiplied together. For example, if substitution A results in a 3-fold change, and substitution B results in a 2-fold change, the double substitution, AB, behaving log-additively, results in a 6-fold change__.”__

      P4L34-P5L19: This section is wonderful. Really cool results and interesting structural overlap! P5L34 Helix 9 of the protein is mentioned but it's functional relevance is not. This is common throughout the paper - it would be useful for there to be an overview somewhere to help the reader contextualize the results with known structural role of these elements.

      We agree with the reviewer that this information would help to contextualize the results. We have made a number of changes to address this. First, we have added Supplementary Table 2, which describes the structural features of LacI. Second, we have added a paragraph overviewing the structure and function of LacI. Third, we have expanded the section “the effects of individual amino acid substitutions on the function of LacI” to discuss the structural or biochemical impact of specific substitutions. We thank the reviewer for this suggestion.

      P5L39: The authors identified a triple mutant with the band-stop phenotype then made all combination of the triple mutant. Of particular interest is R195H/G265D which is nearly the same as the triple mutant. It would be nice if the positions of each of these mutations and have some discussion to begin to rationalize this phenotype, even if to point out how far apart they are and that there is no easy structural rationale!

      We appreciate the reviewer highlighting this area of interest. We have added structural information to Fig. 6, which indicates to position of the amino acid substitutions that result in the band-stop phenotype, as well as a small discussion in the main text:

      “To further investigate the band-stop phenotype, we chose a strong band-stop LacI variant with only three amino acid substitutions (R195H/G265D/A337D). These three positions are distributed distally on the periphery of the C-terminal core domain, and the role that each of these substitutions plays in the emergence of the band-stop phenotype is unclear.”

      P6L9: There should be more discussion of the significance of this work directly compared to what is known. For instance, negative cooperativity is mentioned as an explanation for bi-phasic dose response but this idea is not explained. Why would the relevant free energy changes be more entropic? Another example is the reverse-TetR phenotype observed by Hillen et al.

      We agree that more discussion is necessary to frame the results reported in the manuscript. To address this, we have added additional discussion throughout the manuscript that relates the results to the current understanding of allostery. Also, in the Conclusion, we added specific examples that lead us to link the ideas of bi-phasic dose response, negative cooperativity, and entropy/disorder. We believe these additions have improved the manuscript and we thank the reviewer for this suggestion.

      P6L28: The authors mention that phenotypes exist with genotypes that are discoverable with genotype-phenotype landscapes. This study due to the constraints of error prone pcr were somewhat limited. How big is the phenotypic landscape? Is it worth doing a more systematic study? What is the optimal experimental design: Single mutations, doubles, random - where is there the most information. How far can you drift before your machine learning model breaks down? How robust would it be to indels?

      The reviewer raises some excellent questions here, some of which are appropriate subjects for future work. The optimal experimental design depends on the objective: If the goal is to understand every possible mutation, a systematic site-saturation approach would be more appropriate. However, the landscape of a natural protein is limited by its wild-type DNA coding sequence, and so some substitutions are inaccessible (due to the arrangement of the codon table). The approach we took allowed to us characterize most of the accessible amino acid substitutions, while also allowing us to identify novel functions that would not have been identified with other approaches. We have added a little to the main text to discuss this (below). With regard to the DNN model, in the manuscript (SI Fig. 14), we show how the predictive accuracy degrades with mutational distance from the wild-type. It is possible that the type of DNN that we used could handle indels, since it effectively encodes each variant as a set of step-wise changes from the wild-type. But as with all machine-learning methods, it would require training with a dataset that included indels.

      “Novel phenotypes emerged at mutational distances greater than one amino acid substitution, highlighting the value in sampling a broader genotype space with higher-order mutations. Furthermore, the untargeted, random mutagenesis approach used here was critical for finding these novel phenotypes, as the genotypes required for these novel phenotypes were unpredictable.”

      Figures: Sup figs 3-7: The comparison of library-based results and single mutants is a great example of how to validate genotype-phenotype experiments!

      Thank you.

      Supp fig 5.: Missing figure number.

      We appreciate the reviewer catching this error and have attempted to properly label all figures and tables in this revision. Thank you.

      Supp fig7: G0 appears to have very poor fit between library vs single mutant version. Why might this be? R^2 would likely be better to report here as opposed to RMSE as RMSE is sensitize to the magnitude of the data such that you cannot directly compare RMSE of say 'n' to G0.

      We agree that these are important discussion points and have addressed this concern with an expanded discussion in the main text, as well as the addition of coefficient of correlation (R^2) in the caption for Figure 2 (previously supplementary figure 7). We believe these additions contribute meaningfully to the manuscript, and they address the concerns of the reviewer. The additional text reads:

      “We compared the Hill equation parameters from the library-scale measurement to those same parameters determined from flow cytometry measurements for each of the chemically synthesized LacI variants (Fig. 2). This served as a check of the new library-scale method’s overall ability to measure dose-response curves with quantitative accuracy. The accuracy for each Hill equation parameter in the library-scale measurement was: 4-fold for G0, 1.5-fold for G∞, 1.8-fold for EC50, and ± 0.28 for n. For G0, G∞, and EC50, we calculated the accuracy as: __, where __ is the root-mean-square difference between the logarithm of each parameter from the library-scale and cytometry measurements. For n, we calculated the accuracy simply as the root-mean-square difference between the library-scale and cytometry results. The accuracy for the gene expression levels (G0 and G∞) was better at higher gene expression levels (typical for G∞) than at low gene expression levels (typical for G0), which is expected based on the non-linearity of the fitness impact of tetracycline (Supplementary Figs. 10-11). Measurements of the Hill coefficient, n, had high relative uncertainties for both barcode-sequencing and flow cytometry, and so the parameter n was not used in any quantitative analysis.”

      Sup fig13c: it is somewhat surprising that mutations only appear to effect g0 and not gsaturated. This implies that basal and saturated activity are not coupled. Is this expected? Why or why not?

      This comment is partially addressed with a response above (P4L1-16). Coupled gene expression increases do occur, especially with substitutions at the start codon that result in fewer copies of LacI in the cell. In this instance, both G0 and G∞ are increased. Otherwise, changes to multiple biophysical parameters are required to increase both G0 and G∞.

      Reviewer #1 (Significance (Required)): Allostery is hard to comprehend because it involves many interacting residues propagating information across a protein. The Monod-Wyman-Changeux (MWC) and Koshland, Nemethy, and Filmer (KNF) models have been a long standing framework to explain much of allostery, however recent formulations have focused on the role of the conformational ensemble and a grounding in statistical mechanics. This manuscript focuses on the functional impact of mutations and therefore contribution of the amino acids to regulation. The authors unbiased approach of combining a dose-response curve and mutational library generation let them fit every mutant to a hill equation. This approach let the authors identify the allosteric phenotype of all measured mutations! The authors found inverted phenotypes which happen in homologs of this protein but most interesting is the strange and idiosyncratic 'Band-stop' phenotype. The band-stop phenotype is bi-phasic that will hopefully be followed up with further studies to explain the mechanism. This manuscript is a fascinating exploration of the adaptability of allosteric landscapes with just a handful of mutations. Genotype-phenotype experiments allow sampling immense mutational space to study complex phenotypes such as allostery. However, a challenge with these experiments is that allostery and other complicated phenomena come from immense fitness landscapes altering different parameters of the hill equation. The authors approach of using a simple error prone pcr library combined with many ligand concentrations allowed them to sample a very large space somewhat sparsely. However, they were able to predict this data by training and using a neural net model. I think this is a clever way to fill in the gaps that are inherent to somewhat sparse sampling from error prone pcr. The experimental design of the dose response is especially elegant and a great model for how to do these experiments. With some small improvements for readability, this manuscript will surely find broad interest to the genotype-phenotype, protein science, allostery, structural biology, and biophysics fields. We were prompted to do this by Review Commons and are posting our submitted review here: Willow Coyote-Maestas has relevant expertise in high throughput screening, protein engineering, genotype-phenotype experiments, protein allostery, dating mining, and machine learning. James Fraser has expertise in structural biology, genotype-phenotype experiments, protein allostery, protein dynamics, protein evolution, etc.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): The authors use deep mutational scanning to infer the dose-response curves of ~60,000 variants of the LacI repressor and so provide an unprecedently systematic dataset of how mutations affect an allosteric protein. Overall this is an interesting dataset that highlights the potential of mutational scanning for rapidly identifying diverse variants of proteins with desired or unexpected activities for synthetic biology/bioengineering. The relatively common inverted phenotypes and their sequence diversity is interesting, as is the identification of several hundred genotypes with non-sigmoidal band-stop dose-response curves and their enrichment in specific protein regions. A weakness of the study is that some of the parameter estimates seem to have high uncertainty and this is not clearly presented or the impact on the conclusions analysed. A second shortcoming is that there is little mechanistic insight beyond the enrichments of mutations with different effects in different regions of the protein. But as a first overview of the diversity of mutational effects on the dose-response curve of an allosteric protein, this is an important dataset and analysis. **Comments** **Data quality and reproducibility** "The flow cytometry results confirmed both the qualitative and quantitative accuracy of the new method (Supplementary Figs. 3-7)"

      • There need to be quantitative measures of accuracy in the text here for the different parameters.

      We believe this comment is addressed along with the following two comments.

      • Sup fig 7 panels should be main text panels - they are vital for understanding the data quality In particular, the G0 parameter estimates from the library appear to have a lower bound ie they provide no information below a cytometry Go of ~10^4. This is an important caveat and needs to be highlighted in the main text. The Hill parameter (n) estimate for wt (dark gray) replicate barcodes is extremely variable - why is this?

      • In general there is not a clear enough presentation of the uncertainty and biases in the parameter estimations which seem to be rather different for the 4 parameters. Only the EC50 parameter seems to correlate very well with the independent measurements.

      We thank the reviewer for identifying a need for more information on the accuracy of this method. So, we have moved Supplementary Fig. 7 to the main text (Fig 2 in the revised manuscript) and have added coefficients of correlation to each Hill equation parameter in that figure caption. Furthermore, we have added new data (Supplementary Fig. 11), which shows the uncertainty associated with different gene expression levels. Finally, we have added a discussion on the accuracy of this method for each parameter of the Hill equation to the main text. Estimation of the Hill coefficient (n) from data is often highly uncertain and variable, because that parameter estimate can be highly sensitive to random measurement errors at a single point on the curve. The estimate for the wild type appears to be highly variable because the plot contains 53 replicate measurements. So, the plotted variability represents approximately 2 standard deviations. The spread of wild-type results in the plot is consistent with the stated RMSE for the Hill coefficient. Furthermore, the Hill coefficient is not used in any of the additional quantitative analysis in our manuscript, partially because of its relatively high measurement uncertainty, but also because, based on the biophysical models, it is not as informative of the underlying biophysical changes.

      “We compared the Hill equation parameters from the library-scale measurement to those same parameters determined from flow cytometry measurements for each of the chemically synthesized LacI variants (Fig. 2). This served as a check of the new library-scale method’s overall ability to measure dose-response curves with quantitative accuracy. The accuracy for each Hill equation parameter in the library-scale measurement was: 4-fold for G0, 1.5-fold for G∞, 1.8-fold for EC50, and ± 0.28 for n. For G0, G∞, and EC50, we calculated the accuracy as: "exp" ["RMSE" ("ln" ("x" ))], where "RMSE" ("ln" ("x" )) is the root-mean-square difference between the logarithm of each parameter from the library-scale and cytometry measurements. For n, we calculated the accuracy simply as the root-mean-square difference between the library-scale and cytometry results. The accuracy for the gene expression levels (G0 and G∞) was better at higher gene expression levels (typical for G∞) than at low gene expression levels (typical for G0), which is expected based on the non-linearity of the fitness impact of tetracycline (Supplementary Figs. 10-11). Measurements of the Hill coefficient, n, had high relative uncertainties for both barcode-sequencing and flow cytometry, and so the parameter n was not used in any quantitative analysis.”

      • The genotypes in the mutagenesis library contain a mean of 4.4 aa substitutions and the authors us a neural network to estimate 3 of the Hill equation parameters (with uncertainties) for the 1991/2110 of the single aa mutations. It would be useful to have an independent experimental evaluation of the reliability of these inferred single aa mutational effects by performing facs on a panel of single aa mutants (using single aa mutants in sup fig 3-7, if there are any, or newly constructed mutants).

      We agree that the predictive performance of the DNN requires experimental validation. We evaluated the performance by withholding data from 20% of the library, including nearly 200 variants with single amino acid substitutions, and then compared the predicted effect of those substitutions to the measured effect. The results of this test are reported in Supplementary Fig. 14. Additionally, we have adjusted the main text to more clearly explain the evaluation process.

      “To evaluate the accuracy of the model predictions, we used the root-mean-square error (RMSE) for the model predictions compared with the measurement results. We calculated RMSE using only held-out data not used in the model training, and the split between held-out data and training data was chosen so that all variants with a specific amino acid sequence appear in only one of the two sets.” __ __

      • fig3/"Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50." How additive are the other 2 parameters? this analysis should also be presented in fig 3. If they are not as additive is it simply because of lower accuracy of the measurements? If the mutational effects are largely additive, then a simple linear model (rather than the DNN) could be used to estimate the single mutant effects from the multiple mutant genotypes.

      We agree with the reviewer that exploring the log-additivity of the Hill equation parameters is informative, and have included Supplementary Figure 21, which displays this information. Furthermore, we expanded the discussion of log-additivity on all three parameters in the main text:

      “Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50. That is, the proportional effects of two individual amino acid substitutions on the EC50 can be multiplied together. For example, if substitution A results in a 3-fold change, and substitution B results in a 2-fold change, the double substitution, AB, behaving log-additively, results in a 6-fold change. Only 0.57% (12 of 2101) of double amino acid substitutions in the measured data have EC50 values that differ from the log-additive effects of the single substitutions by more than 2.5-fold (Fig. 4). This result, combined with the wide distribution of residues that affect EC50, reinforces the view that allostery is a distributed biophysical phenomenon controlled by a free energy balance with additive contributions from many residues and interactions, a mechanism proposed previously1,39 and supported by other recent studies17, rather than a process driven by the propagation of local, contiguous structural rearrangements along a defined pathway.

      A similar analysis of log-additivity for G0 and G∞ is complicated by the more limited range of measured values for those parameters, the smaller number of substitutions that cause large shifts in G0 or G∞, and the higher relative measurement uncertainty at low G(L). However, the effects of multiple substitutions on G0 and G∞ are also consistent with log-additivity for almost every measured double substitution variant (Supplementary Fig. 21).”

      **Presentation/clarity of text and figures**

      • The main text implies that the DNN is trained to predict 3 parameters of the Hill equation but not the Hill coefficient (n). This should be clarified / justified in the main text.

      We agree that the decision to exclude the parameter ‘n’ requires explanation in the main text. To address this, we have added to the main text:

      “Measurements of the Hill coefficient, n, had high relative uncertainties for both barcode-sequencing and flow cytometry, and so the parameter n was not used in any quantitative analysis.”

      and

      “We trained the model to predict the Hill equation parameters G0, G∞, and EC50 (Supplementary Fig. 13), the three Hill equation parameters that were determined with relatively low uncertainty by the library-scale measurement.”

      • The DNN needs to be better explained and justified in the main text for a general audience. How do simpler additive models perform for phenotypic prediction / parameter inference?

      We agree with the reviewer that the DNN needs to be justified in the main text. As part of the revision plan, we propose to compare the predictive performance of the DNN to an additive model.

      • Ref 14. analyses a much smaller set of mutants in the same protein but using an explicit biophysical model. It would be helpful to have a more extensive comparison with the approach and conclusions to this previous study.

      Throughout the manuscript, we frame the results and discussion in terms of the referenced biophysical model. Using the model, we describe the biophysical effects that a substitution may have on LacI, based on observed changes to function associated with that substitution. We also comment briefly on the limitations of this model when applied to the extensive dataset presented here.

      “Most of the non-silent substitutions discussed above are more likely to affect the allosteric constant than either the ligand or operator affinities. Within the biophysical model, those affinities are specific to either the active or inactive state of LacI, i.e. they are defined conditionally, assuming that the protein is in the appropriate state. So, almost by definition, substitutions that affect the ligand-binding or operator-binding affinities (as defined in the models) must be at positions that are close to the ligand-binding site or within the DNA-binding domain. Substitutions that modify the ability of the LacI protein to access either the active state or inactive state, by definition, affect the allosteric constant. This includes, for example, substitutions that disrupt dimer formation (dissociated monomers are in the inactive state), substitutions that lock the dimer rigidly into either the active or inactive state, or substitutions that more subtly affect the balance between the active and inactive states. Thus, because there are many more positions far from the ligand- and DNA- binding regions than close to those regions, there are many more opportunities for substitutions to affect the allosteric constant than the other biophysical parameters. Note that this analysis assumes that substitutions don’t perturb the LacI structure too much, so that the active and inactive states remain somehow similar to the wild-type states. Our results suggest that this is not always the case: consider, for example, the substitutions at positions __K84 and M98 discussed above and the substitutions resulting in the inverted and band-stop phenotypes discussed below.”__

      • Enrichments need statistical tests to know how unexpected that results are e.g. p5 line 12 "67% of strongly inverted variants have substitutions near the ligand-binding pocket"

      We agree that this information is necessary to interpret the results. We have included p-values (previously reported only in the Methods section) throughout the main text of the manuscript.

      The publication by Poelwijk et al. was considered extensively when planning this work, and failing to cite that manuscript would have been tremendously unjust. We have included it, as well as a few additional references that have identified and discussed inverted LacI variants. We sincerely thank the reviewer for identifying this oversight.

      • What mechanisms do the authors envisage that could produce the band-stop dose response curves? There is likely previous theoretical work that could be cited here. In general there is little discussion of the biophysical mechanisms that could underlie the various mutational effects.

      We agree with the reviewer, that discussing the biophysical mechanisms that underlie many of the reported mutations is important to understand the results. We have expanded the subsection “Effects of amino acid substitutions on LacI phenotype” to include discussion on several of the key substitutions (or groups of substitutions) and their potential biophysical effects. Additionally, we consider mechanism that may underlie the band-stop sensor, and propose one model that could explain the band-stop phenotype:

      “In particular, the biphasic dose-response of the band-stop variants suggests negative cooperativity: that is, successive ligand binding steps have reduced ligand binding affinity. Negative cooperativity has been shown to be required for biphasic dose-response curves__42,43. The biphasic dose-response and apparent negative cooperativity are also reminiscent of systems where protein disorder and dynamics have been shown to play an important role in allosteric function1, including catabolite activator protein (CAP)44,45 and the Doc/Phd toxin-antitoxin system46. This suggests that entropic changes may also be important for the band-stop phenotype. A potential mechanism is that band-stop LacI variants have two distinct inactive states: an inactive monomeric state and an inactive dimeric state. In the absence of ligand, inactive monomers may dominate the population. Then, at intermediate ligand concentrations, ligand binding stabilizes dimerization of LacI into an active state which can bind to the DNA operator and repress transcription. When a second ligand binds to the dimer, it returns to an inactive dimeric state, similar to wildtype LacI. This mechanism, and other possible mechanisms, do not match the MWC model of allostery or its extensions2,13–15__ and require a more comprehensive study and understanding of the ensemble of states in which these band-stop LacI variants exist.”

      • "This result, combined with the wide distribution of residues that affect EC50, suggests that LacI allostery is controlled by a free energy balance with additive contributions from many residues and interactions." 'additive contributions and interactions' covers all possible models of vastly different complexity i.e. this sentence is rather meaningless.

      We have attempted to contextualize this statement by adding additional discussion and references. We hope these additions give more meaning to this section.

      “__This result, combined with the wide distribution of residues that affect EC50, reinforces the view that allostery is a distributed biophysical phenomenon controlled by a free energy balance with additive contributions from many residues and interactions, a mechanism proposed previously1,39 and supported by other recent studies17, rather than a process driven by the propagation of local, contiguous structural rearrangements along a defined pathway.”__

      • fig 4 c and d compress a lot of information into one figure and I found this figure confusing. It may be clearer to have multiple panels with each panel presenting one aspect. It is also not clear to me what the small circular nodes exactly represent, especially when you have one smaller node connected to two polygonal nodes, and why they don't have the same colour scale as the polygonal nodes.

      We agree with the reviewer that figure 4 (or Figure 5 in the revised manuscript) contains a lot of information. The purpose of this figure is to convey the structural and genetic diversity among the sets of inverted variants and band-stop variants. We designed this figure to convey this point at two levels: a brief overview, where the diversity is apparent by quickly considering the figure, and at a more informative level, with some quantitative data and structurally relevant points highlighted. We have modified the caption slightly, in an effort to improve clarity.

      • line 25 - 'causes a conformational change' -> 'energetic change' (allostery does not always involve conformational change

      We thank the reviewer for this comment and have adopted a more modern language describe allostery throughout the manuscript.

      • sup fig 5 legend misses '5'

      We thank the reviewer for pointing this out, we have attempted to number all figures and tables more carefully.

      • sup fig 7. pls add correlation coefficients to these plots (and move to main text figures).

      We agree that this information is of interest and have included this data as main text Figure 2. In addition, we have included coefficients of correlation in the caption of this figure.

      • Reference 21 is just a title and pubmed link

      We thank the reviewer for identifying this error, we have corrected this in the references.

      • "fitness per hour" -> growth rate

      To ensure that this connection is clearly established, when we introduce fitness for the first time, we clarify that it relates to growth rate:

      “Consequently, in the presence of tetracycline, the LacI dose-response modulates cellular fitness (i.e. growth rate) based on the concentration of the input ligand isopropyl-β-D-thiogalactoside (IPTG).”

      Also, we define ‘fitness’ in the Methods section:

      “The experimental approach for this work was designed to maintain bacterial cultures in exponential growth phase for the full duration of the measurements. So, in all analysis, the Malthusian definition of fitness was used, i.e. fitness is the exponential growth rate__58__.”

      • page 6 line 28 - "discoverable only via large-scale landscape measurements" - directed evolution approaches can also discover such genotypes (see e.g. Poelwijk /Tans paper). Please re-phrase.

      We agree with the reviewer and have adjusted the main text accordingly.

      “__Overall, our findings suggest that a surprising diversity of useful and potentially novel allosteric phenotypes exist with genotypes that are readily discoverable via large-scale landscape measurements.”__

      • pls define jargon the first time it is used e.g. band-stop and band-pass

      We agree that all unconventional terms should be explicitly defined when used, and we have attempted to define the band-pass and band-stop dose-response curves more clearly in the main text:

      “These include examples of LacI variants with band-stop dose-response curves (i.e. variants with high-low-high gene expression; e.g. Fig. 1e, Supplementary Fig. 7), and LacI variants with band-pass dose-response curves (i.e. variants with low-high-low gene expression; e.g. Supplementary Fig. 8).”

      **Methods/data availability/ experimental and analysis reproducibility:** The way that growth rate is calculated on page 17 equation 1- This section is confusing. Please be explicit about how you accounted for the lag phase, what the lag phase was, and total population growth during this time. In addition, please report the growth curves from the wells of the four plates, the final OD600 of the pooled samples, and exact timings of when the samples were removed from 37 degree incubation in a table. These are critical for calculating growth rate in individual clones downstream.

      We thank the reviewer for identifying the need to clarify this section of text. The ‘lag’ in this section referred to a delay before tetracycline began impacting the growth rate of cells. To address this, we have changed ‘lag’ in this context to ‘delay.’ Furthermore, we have attempted to clarify precisely the cause of this delay, and how we accounted for it in calculating growth rates:

      For samples grown with tetracycline, the tetracycline was only added to the culture media for Growth Plates 2‑4. Because of the mode of action of tetracycline (inhibition of translation), there was a delay in its effect on cell fitness: Immediately after diluting cells into Growth Plate 2 (the first plate with tetracycline), the cells still had a normal level of proteins needed for growth and proliferation and they continued to grow at nearly the same rate as without tetracycline. Over time, as the level of proteins required for cell growth decreased due to tetracycline, the growth rate of the cells decreased. Accordingly, the analysis accounts for the variation in cell fitness (growth rate) as a function of time after the cells were exposed to tetracycline. With the assumption that the fitness is approximately proportional to the number of proteins needed for growth, the fitness as a function of time is taken to approach the new value with an exponential decay:

      (3)

      where μitet is the steady-state fitness with tetracycline, and α is a transition rate. The transition rate was kept fixed at α = log(5), determined from a small-scale calibration measurement. Note that at the tetracycline concentration used during the library-scale measurement (20 µg/mL), μitet was greater than zero even at the lowest G(L) levels (Supplementary Fig. 10). From Eq. (3), the number of cells in each Growth Plate for samples grown with tetracycline is:

      • What were the upper and lower bounds of the measurements? (LacI deletion vs Tet deletion / autofluoresence phenotype - true 100% and true 0% activity). Knowing and reporting these bounds will also allow easier comparison between datasets in the future.

      We agree that knowing the limitations of the measurement are important for contextualizing the results. To address this point, we have included Supplementary Fig. 11, which shows the uncertainty of the measurement across gene expression levels.

      Please clarify whether there was only 1 biological replicate (because the plates were pooled before sequencing)? Or if there were replicates present an analysis of reproducibility.

      We thank the reviewer for pointing out the ambiguity in the original manuscript. The library-scale measurement reported here was completed once, the 24 growth conditions were spread across 96 wells, so each condition occupied 4 wells. The 4 wells were combined prior to DNA extraction. We have clarified this process in the methods by removing ‘duplicate’:

      “Growth Plate 2 contained the same IPTG gradient as Growth Plate 1 with the addition of tetracycline (20 µg/mL) to alternating rows in the plate, resulting in 24 chemical environments, with each environment spread across 4 wells.”

      Despite there being only a single library-scale measurement, the accuracy and reliability of the results are supported by many distinct biological replicates within the library (i.e. LacI variants with the same amino acid sequence but with different barcodes, see new Supplementary Fig. 9), as well as over 100 orthogonal dose-response curve measurements completed with flow cytometry (Figure 2). We believe these support the reproducibility of the work and we have included statistical analysis on the accuracy of the library-scale measurement results.

      “To test the accuracy of the new method for library-scale dose-response curve measurements, we independently verified the results for over 100 LacI variants from the library. For each verification measurement, we chemically synthesized the coding DNA sequence for a single variant and inserted it into a plasmid where LacI regulates the expression of a fluorescent protein. We transformed the plasmid into E. coli and measured the resulting dose-response curve with flow cytometry (e.g. Fig. 1e). We compared the Hill equation parameters from the library-scale measurement to those same parameters determined from flow cytometry measurements for each of the chemically synthesized LacI variants (Fig. 2). This served as a check of the new library-scale method’s overall ability to measure dose-response curves with quantitative accuracy. The accuracy for each Hill equation parameter in the library-scale measurement was: 4-fold for G0, 1.5-fold for G∞, 1.8-fold for EC50, and ± 0.28 for n. For G0, G∞, and EC50, we calculated the accuracy as: "exp" ["RMSE" ("ln" ("x" ))], where "RMSE" ("ln" ("x" )) is the root-mean-square difference between the logarithm of each parameter from the library-scale and cytometry measurements. For n, we calculated the accuracy simply as the root-mean-square difference between the library-scale and cytometry results (Supplementary Fig. 7).”

      • Please provide supplementary tables of the data (in addition to the raw sequencing files). Both a table summarising the growth rates, inferred parameter values and uncertainties for genotypes and a second table with the barcode sequence counts across timepoints and associated experimental data.

      We agree that access to this information is critical. Due to the size of the associated data, we have made this data available for download in a public repository. We direct readers to the repository information in the “Data Availability” statement:

      “The raw sequence data for long-read and short-read DNA sequencing have been deposited in the NCBI Sequence Read Archive and are available under the project accession number PRJNA643436. Plasmid sequences have been deposited in the NCBI Genbank under accession codes MT702633, and MT702634, for pTY1 and pVER, respectively.

      The processed data table containing comprehensive data and information for each LacI variant in the library is publicly available via the NIST Science Data Portal, with the identifier ark:/88434/mds2-2259 (https://data.nist.gov/od/id/mds2-2259 or https://doi.org/10.18434/M32259). The data table includes the DNA barcode sequences, the barcode read counts, the time points used for the libarary-scale measurement, fitness estimates for each barcoded variant across the 24 chemical environments, the results of both Bayesian inference models (including posterior medians, covariances, and 0.05, 0.25, 0.75, and 0.95 posterior quantiles), the LacI CDS and amino acid sequence for each barcoded variant (as determined by long-read sequencing), the number of LacI CDS reads in the long-read sequencing dataset for each barcoded variant, and the number of unintended mutations in other regions of the plasmid (from the long-read sequencing data).

      Code Availability

      All custom data analysis code is available at https://github.com/djross22/nist_lacI_landscape_analysis.”

      Reviewer #2 (Significance (Required)): The authors present an unprecedently systematic dataset of how mutations affect an allosteric protein. This illustrates the potential of mutational scanning for rapidly identifying diverse variants of allosteric proteins / regulators with desired or unexpected activities for synthetic biology/bioengineering. Previous studies have identified inverted dose-response curve for a lacI phenotypes https://www.cell.com/fulltext/S0092-8674(11)00710-0 but using directed evolution i.e. they were not comprehensive in nature. The audience of this study would be protein engineers, the allostery field, synthetic biologists and the mutation scanning community and evolutionary biologists interested in fitness landscapes. My relevant expertise is in deep mutational scanning and genotype-phenotype landscapes, including work on allosteric proteins and computational methods. Reviewer #3 (Evidence, reproducibility and clarity (Required)): In this interesting manuscript the authors developed in ingenious high throughput screening approach which utilizes DNA barcoding to select variants of LacI proteins with different allosteric profiles for IPTG control using E. coli fitness (growth rate) in a range of antibiotic concentrations as a readout thus providing a genotype-phenotype map for this enzyme. The authors used library of 10^5-10^ variants of LacI expressed from a plasmid and screened for distinct IPTG activation profiles under different conditions including several antibiotic stressors. As a result they identified various patterns of activation including normal (sigmoidal increase), inverted (decrease) and unusual stop-band where the dependence of growth on [IPTG] is non-monotonic. The study is well-conceived, well executed and provides statistically significant results. The key advance provided by this work is that it allows to identify specific mutations in LacI connected with one of three allosteric profiles. The paper is clearly written all protocols are explained and it can be reproduced in a lab that possesses proper expertise in genetics. Reviewer #3 (Significance (Required)): The significance of this work is that it discovered libraries of LacI variants which give rise to distinct profiles of allosteric control of activation of specific genes (in this case antibiotic resistance) by the Lac mechanism. The barcoding technology allowed to identify specific mutations which are (presumably) causal of changes in the way how allosteric activation of LacI by IPTG works. As such it provides a rich highly resolved dataset of LacI variants for further exploration and analysis. Alongside with these strengths several weaknesses should also be noted:

      1. First and foremost the paper does not provide any molecular-level biophysical insights into the impact of various types of mutations on molecular properties of LacI. Do the mutations change binding affinity to IPTG? Binding side? Communication dynamics? Stability? The diagrams of connectivity for the stop-band mutations (Fig.4) do not provide much help as they do not tell much which molecular properties of LacI are affected by mutations and why certain mutations have specific effect on allostery. A molecular level exploration would make this paper much stronger.

      We address this comment with comment (2), below.

      1. In the same vein a theoretical MD study would be quite illuminating in answering the key unanswered question of this work: Why do mutations have various and pronounced effects of allosteric regulation by LacI?. I think publication of this work should not be conditioned on such study but again adding would make the work much stronger.

      We appreciate the reviewer’s comments and agree that investigating the molecular mechanisms driving the phenotypic changes identified in this work is a compelling proposition. Throughout the manuscript, we identify positions and specific amino acid substitutions that affect the measurable function of LacI, and occasionally discuss the biophysical effects that may underly these changes. We have expanded the discussion to include possible molecular-level effects.

      The dataset reported here identifies many potential candidates for molecular-level study, either computationally or experimentally. However, this manuscript is scoped to report a large-scale method to measure the genotype-phenotype landscape of an allosteric protein, and a limited investigation into the emergence of novel phenotypes that are identified in the landscape.

      1. Lastly a recent study PNAS v.116 pp.11265-74 (2019) explored a library of variants of E. coli Adenylate Kinase and showed the relationship between allosteric effects due to substrate inhibition and stability of the protein. Perhaps a similar relationship can explored in this case of LacI.

      We thank the reviewer for highlighting this publication. We agree with the reviewer that similar effects may play a role in the activity of LacI. Establishing such a relationship would require additional experimentation, and, we think, is outside the scope of the submitted manuscript. Although, we hope follow-up studies using this dataset will investigate this phenomenon and other related mechanisms, that may underlie the band-stop phenotype and other observed effects.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this interesting manuscript the authors developed in ingenious high throughput screening approach which utilizes DNA barcoding to select variants of LacI proteins with different allosteric profiles for IPTG control using E. coli fitness (growth rate) in a range of antibiotic concentrations as a readout thus providing a genotype-phenotype map for this enzyme. The authors used library of 10^5-10^ variants of LacI expressed from a plasmid and screened for distinct IPTG activation profiles under different conditions including several antibiotic stressors. As a result they identified various patterns of activation including normal (sigmoidal increase), inverted (decrease) and unusual stop-band where the dependence of growth on [IPTG] is non-monotonic. The study is well-conceived, well executed and provides statistically significant results. The key advance provided by this work is that it allows to identify specific mutations in LacI connected with one of three allosteric profiles. The paper is clearly written all protocols are explained and it can be reproduced in a lab that possesses proper expertise in genetics.

      Significance

      The significance of this work is that it discovered libraries of LacI variants which give rise to distinct profiles of allosteric control of activation of specific genes (in this case antibiotic resistance) by the Lac mechanism. The barcoding technology allowed to identify specific mutations which are (presumably) causal of changes in the way how allosteric activation of LacI by IPTG works. As such it provides a rich highly resolved dataset of LacI variants for further exploration and analysis.

      Alongside with these strengths several weaknesses should also be noted:

      1. First and foremost the paper does not provide any molecular-level biophysical insights into the impact of various types of mutations on molecular properties of LacI. Do the mutations change binding affinity to IPTG? Binding side? Communication dynamics? Stability? The diagrams of connectivity for the stop-band mutations (Fig.4) do not provide much help as they do not tell much which molecular properties of LacI are affected by mutations and why certain mutations have specific effect on allostery. A molecular level exploration would make this paper much stronger.
      2. In the same vein a theoretical MD study would be quite illuminating in answering the key unanswered question of this work: Why do mutations have various and pronounced effects of allosteric regulation by LacI?. I think publication of this work should not be conditioned on sucgh study but again adding would make the work much stronger.
      3. Lastly a recent study PNAS v.116 pp.11265-74 (2019) explored a library of variants of E. coli Adenylate Kinase and showed the relationship between allosteric effects due to substrate inhibition and stability of the protein. Perhaps a similar relationship can explored in this case of LacI.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors use deep mutational scanning to infer the dose-response curves of ~60,000 variants of the LacI repressor and so provide an unprecedently systematic dataset of how mutations affect an allosteric protein. Overall this is an interesting dataset that highlights the potential of mutational scanning for rapidly identifying diverse variants of proteins with desired or unexpected activities for synthetic biology/bioengineering. The relatively common inverted phenotypes and their sequence diversity is interesting, as is the identification of several hundred genotypes with non-sigmoidal band-stop dose-response curves and their enrichment in specific protein regions. A weakness of the study is that some of the parameter estimates seem to have high uncertainty and this is not clearly presented or the impact on the conclusions analysed. A second shortcoming is that there is little mechanistic insight beyond the enrichments of mutations with different effects in different regions of the protein. But as a first overview of the diversity of mutational effects on the dose-response curve of an allosteric protein, this is an important dataset and analysis.

      Comments

      Data quality and reproducibility

      "The flow cytometry results confirmed both the qualitative and quantitative accuracy of the new method (Supplementary Figs. 3-7)"

      • There need to be quantitative measures of accuracy in the text here for the different parameters.
      • Sup fig 7 panels should be main text panels - they are vital for understanding the data quality In particular, the G0 parameter estimates from the library appear to have a lower bound ie they provide no information below a cytometry Go of ~10^4. This is an important caveat and needs to be highlighted in the main text. The Hill parameter (n) estimate for wt (dark gray) replicate barcodes is extremely variable - why is this?
      • In general there is not a clear enough presentation of the uncertainty and biases in the parameter estimations which seem to be rather different for the 4 parameters. Only the EC50 parameter seems to correlate very well with the independent measurements.
      • The genotypes in the mutagenesis library contain a mean of 4.4 aa substitutions and the authors us a neural network to estimate 3 of the Hill equation parameters (with uncertainties) for the 1991/2110 of the single aa mutations. It would be useful to have an independent experimental evaluation of the reliability of these inferred single aa mutational effects by performing facs on a panel of single aa mutants (using single aa mutants in sup fig 3-7, if there are any, or newly constructed mutants).
      • fig3/"Combining multiple substitutions in a single protein almost always has a log-additive effect on EC50." How additive are the other 2 parameters? this analysis should also be presented in fig 3. If they are not as additive is it simply because of lower accuracy of the measurements? If the mutational effects are largely additive, then a simple linear model (rather than the DNN) could be used to estimate the single mutant effects from the multiple mutant genotypes.

      Presentation/clarity of text and figures

      • The main text implies that the DNN is trained to predict 3 parameters of the Hill equation but not the Hill coefficient (n). This should be clarified / justified in the main text.
      • The DNN needs to be better explained and justified in the main text for a general audience. How do simpler additive models perform for phenotypic prediction / parameter inference?
      • Ref 14. analyses a much smaller set of mutants in the same protein but using an explicit biophysical model. It would be helpful to have a more extensive comparison with the approach and conclusions o this previous study.
      • Enrichments need statistical tests to know how unexpected that results are e.g. p5 line 12 "67% of strongly inverted variants have substitutions near the ligand-binding pocket"
      • missing citation: Poelwijk et al 2011 https://www.cell.com/fulltext/S0092-8674(11)00710-0 previously reported an inverted dose-response curve for a lacI mutant.
      • What mechanisms do the authors envisage that could produce the band-stop dose response curves? There is likely previous theoretical work that could be cited here. In general there is little discussion of the biophysical mechanisms that could underlie the various mutational effects.
      • "This result, combined with the wide distribution of residues that affect EC50, suggests that LacI allostery is controlled by a free energy balance with additive contributions from many residues and interactions." 'additive contributions and interactions' covers all possible models of vastly different complexity i.e. this sentence is rather meaningless.
      • fig 4 c and d compress a lot of information into one figure and I found this figure confusing. It may be clearer to have multiple panels with each panel presenting one aspect. It is also not clear to me what the small circular nodes exactly represent, especially when you have one smaller node connected to two polygonal nodes, and why they don't have the same colour scale as the polygonal nodes.
      • line 25 - 'causes a conformational change' -> 'energetic change' (allostery does not always involve conformational change
      • sup fig 5 legend misses '5'
      • sup fig 7. pls add correlation coefficients to these plots (and move to main text figures).
      • Reference 21 is just a title and pubmed link
      • "fitness per hour" -> growth rate
      • page 6 line 28 - "discoverable only via large-scale landscape measurements" - directed evolution approaches can also discover such genotypes (see e.g. Poelwijk /Tans paper). Please re-phrase.
      • pls define jargon the first time it is used e.g. band-stop and band-pass

      Methods/data availability/ experimental and analysis reproducibility:

      • The way that growth rate is calculated on page 17 equation 1- This section is confusing. Please be explicit about how you accounted for the lag phase, what the lag phase was, and total population growth during this time. In addition, please report the growth curves from the wells of the four plates, the final OD600 of the pooled samples, and exact timings of when the samples were removed from 37 degree incubation in a table. These are critical for calculating growth rate in individual clones downstream.
      • What were the upper and lower bounds of the measurements? (LacI deletion vs Tet deletion / autofluoresence phenotype - true 100% and true 0% activity). Knowing and reporting these bounds will also allow easier comparison between datasets in the future.
      • Please clarify whether there was only 1 biological replicate (because the plates were pooled before sequencing)? Or if there were replicates present an analysis of reproducibility.
      • Please provide supplementary tables of the data (in addition to the raw sequencing files). Both a table summarising the growth rates, inferred parameter values and uncertainties for genotypes and a second table with the barcode sequence counts across timepoints and associated experimental data.

      Significance

      The authors present an unprecedently systematic dataset of how mutations affect an allosteric protein. This illustrates the potential of mutational scanning for rapidly identifying diverse variants of allosteric proteins / regulators with desired or unexpected activities for synthetic biology/bioengineering.

      Previous studies have identified inverted dose-response curve for a lacI phenotypes https://www.cell.com/fulltext/S0092-8674(11)00710-0 but using directed evolution i.e. they were not comprehensive in nature.

      The audience of this study would be protein engineers, the allostery field, synthetic biologists and the mutation scanning community and evolutionary biologists interested in fitness landscapes.

      My relevant expertise is in deep mutational scanning and genotype-phenotype landscapes, including work on allosteric proteins and computational methods.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors study allostery with a beautiful genotype-phenotype experiment to study the fitness landscape of an allosteric lac repressor protein. The authors make a mutational library using error prone pcr and measure the impact on antibiotic resistance protein expression at varying levels of ligand, IPTG, expression. After measuring the impact of mutations authors fill-in the missing data using a neural net model. This type of dose response is not standard in the field, but the richness of their data and the discovery of the "band pass" phenomena prove its worth here splendidly.

      Using this mixed experimental/predicted data the authors explore how each mutation alters the different parameters of a hill equation fit of a dose response curve. Using higher order mutational space the authors look at how mutations can qualitatively switch phenotypes to inverted or band-stop dose-response curves. To validate and further explore a band-stop novel phenotype, the authors focused on a triple mutant and made all combinations of the 3 mutations. The authors find that only one mutation alone alters the dose-response and only in combination does a band-stop behavior present itself. Overall this paper is a fantastic data heavy dive into the allosteric fitness landscape of protein.

      Major

      Overall, the data presented in this paper is thoroughly collected and analyzed making the conclusions well-based. We do not think additional experiments nor substantial changes are needed apart from including basic experimental details and more biophysical rationale/speculation as discussed in further detail below.

      The authors do a genotype-phenotype experiment that requires extensive deep sequencing experiments. However, right now quite a bit of basic statistics on the sequencing is missing. Baseline library quality is somewhat shown in supplementary fig 2 but the figure is hard to interpret. It would be good to have a table that states how many of all possible mutations at different mutation depths (single, double, etc) there are. Similarly, sequencing statistics are missing- it would be useful to know how many reads were acquired and how much sequencing depth that corresponds to. This is particularly important for barcode assignment to phenotype in the long-read sequencing. In addition, a synonymous mutation comparison is mentioned but in my reading that data is not presented in the supplemental figures section.

      The paper is very much written from an "old school" allostery perspective with static end point structures that are mutually exclusive - eg. p5l10 "relative ligand-binding affinity between the two conformations" - however, an ensemble of conformations is likely needed to explain their data. This is especially true for the bandpass and inverted phenotypes they observe. The work by Hilser et al is of particular importance in this area. We would invite the authors to speculate more freely about the molecular origins of their findings.

      Minor

      There are a number of small modifications. In general this paper is very technical and could use with some explanation and discussion for relevance to make the manuscript more approachable for a broader audience.

      P1L23: Ligand binding at one site causes a conformational changes that affects the activity of another > not necessarily true - and related to using more "modern" statistical mechanical language for describing allostery.

      P2L20: The core experiment of this paper is a selection using a mutational library. In the main body the authors mention the library was created using mutagenic pcr but leave it at that. More details on what sort of mutagenic pcr was used in the main body would be useful. According to the methods error prone pcr was used. Why use er-pcr vs deep point mutational libraries? Presumably to sample higher order phenotype? Rationale should be included. Were there preliminary experiments that helped calibrate the mutation level?

      P2L20: Baseline library statistics would be great in a table for coverage, diversity, etc especially as this was done by error prone pcr vs a more saturated library generation method. This is present in sup fig2 but it's a bit complicated.

      P2L26: How were FACS gates drawn? This is in support fig17 - should be pointed to here.

      P3L4: Where is the figure/data for the synonymous SNP mutations? This should be in the supplement.

      P3L20: The authors use a ML learning deep neural network to predict variant that were not covered in the screen. However, the library generation method is using error prone pcr meaning there could multiple mutations resulting in the same amino acid change. The models performance was determined by looking at withheld data however error prone pcr could result in multiple nonsynomymous mutations of the same amino acid. For testing were mutations truly withheld or was there overlap? Because several mutations are being represented by different codon combinations. Was the withheld data for the machine learning withholding specific substitutions?

      In addition, higher order protein interactions are complicated and idiosyncratic. I am surprised how well the neural net performs on higher order substitutions.

      P4L4: Authors find mutations at the dimer/tetramer interfaces but don't mention whether polymerization is required. is dimerization required for dna binding? Tetramerization?

      P4L8: Substitutions near the dimer interface both impact g0 and ec50, which authors say is consistent with a change in the allosteric constant. Can authors explain their thinking more in the paper to make it easier to follow? Are the any mutations in this area that only impact g0 or ec50 alone? Why may these specific residues modify dimerization?

      P4L8: The authors discuss the allosteric constant extensively within the paper but do not explain it. It would be helpful to have an explanation of this to improve readability. This explanation should include the statistical mechanical basis of it and some speculation about the ways it manifests biophysically.

      P4L1-16: Authors see mutations in the dimerization region that impact either G0 and Gsaturated in combination with Ec50 but not g0 and gsaturated together. Maybe we do not fully understand the hill equation but why are there no mutations that impact both g0 and gsaturated seen in support fig 13c? Why would mutations in the same region potentially impacting dimerization impact either g0 or gsaturated? What might be the mechanism behind divergent responses?

      P4L29: for interpretability it would be good to explain what log-additive effect means in the context of allostery.

      P4L34-P5L19: This section is wonderful. Really cool results and interesting structural overlap!

      P5L34 Helix 9 of the protein is mentioned but it's functional relevance is not. This is common throughout the paper - it would be useful for there to be an overview somewhere to help the reader contextualize the results with known structural role of these elements.

      P5L39: The authors identified a triple mutant with the band-stop phenotype then made all combination of the triple mutant. Of particular interest is R195H/G265D which is nearly the same as the triple mutant. It would be nice if the positions of each of these mutations and have some discussion to begin to rationalize this phenotype, even if to point out how far apart they are and that there is no easy structural rationale!

      P6L9: There should be more discussion of the significance of this work directly compared to what is known. For instance negative cooperativity is mentioned as an explanation for bi-phasic dose response but this idea is not explained. Why would the relevant free energy changes be more entropic? Another example is the reverse-TetR phenotype observed by Hillen et al.

      P6L28: The authors mention that phenotypes exist with genotypes that are discoverable with genotype-phenotype landscapes. This study due to the constraints of error prone pcr were somewhat limited. How big is the phenotypic landscape? Is it worth doing a more systematic study? What is the optimal experimental design: Single mutations, doubles, random - where is there the most information. How far can you drift before your machine learning model breaks down? How robust would it be to indels?

      Figures:

      Sup figs 3-7: The comparison of library-based results and single mutants is a great example of how to validate genotype-phenotype experiments!

      Supp fig 5.: Missing figure number.

      Supp fig7: G0 appears to have very poor fit between library vs single mutant version. Why might this be? R^2 would likely be better to report here as opposed to RMSE as RMSE is sensitize to the magnitude of the data such that you cannot directly compare RMSE of say 'n' to G0.

      Sup fig13c: it is somewhat surprising that mutations only appear to effect g0 and not gsaturated. This implies that basal and saturated activity are not coupled. Is this expected? Why or why not?

      Significance

      Allostery is hard to comprehend because it involves many interacting residues propagating information across a protein. The Monod-Wyman-Changeux (MWC) and Koshland, Nemethy, and Filmer (KNF) models have been a long standing framework to explain much of allostery, however recent formulations have focused on the role of the conformational ensemble and a grounding in statistical mechanics. This manuscript focuses on the functional impact of mutations and therefore contribution of the amino acids to regulation. The authors unbiased approach of combining a dose-response curve and mutational library generation let them fit every mutant to a hill equation. This approach let the authors identify the allosteric phenotype of all measured mutations! The authors found inverted phenotypes which happen in homologs of this protein but most interesting is the strange and idiosyncratic 'Band-stop' phenotype. The band-stop phenotype is bi-phasic that will hopefully be followed up with further studies to explain the mechanism. This manuscript is a fascinating exploration of the adaptability of allosteric landscapes with just a handful of mutations.

      Genotype-phenotype experiments allow sampling immense mutational space to study complex phenotypes such as allostery. However, a challenge with these experiments is that allostery and other complicated phenomena come from immense fitness landscapes altering different parameters of the hill equation. The authors approach of using a simple error prone pcr library combined with many ligand concentrations allowed them to sample a very large space somewhat sparsely. However, they were able to predict this data by training and using a neural net model. I think this is a clever way to fill in the gaps that are inherent to somewhat sparse sampling from error prone pcr. The experimental design of the dose response is especially elegant and a great model for how to do these experiments.

      With some small improvements for readability, this manuscript will surely find broad interest to the genotype-phenotype, protein science, allostery, structural biology, and biophysics fields.

      We were prompted to do this by Review Commons and are posting our submitted review here:

      Willow Coyote-Maestas has relevant expertise in high throughput screening, protein engineering, genotype-phenotype experiments, protein allostery, dating mining, and machine learning.

      James Fraser has expertise in structural biology, genotype-phenotype experiments, protein allostery, protein dynamics, protein evolution, etc.

      Referees cross-commenting

      Seems like our biggest issues are: better uncertainty estimates of the parameters and more biophysical/mechanistic explanation/speculation. The uncertainty estimates might be tricky with the deep learning approach. The more biophysical speculation will require some re-writing around an ensemble rather than a static structure perspective.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):**

      This study addresses the role IL-13 in promoting lung damage following migration of the helminth N. brasiliensis larvae from the circulatory system to the lung. The work clearly shows using IL13-/- mice that Nb elicited IL-13 immunity at days 2-6 post-infection reduces pathology. The authors demonstrate an association with reduced eosinophils but no effect on neutrophil numbers.

      Proteomic analysis identifies a number of molecules known to be involved in protecting against type 2 pathologies such as relm-a and SP-D.

      The authors then identify a clear requirement for IL-13 in driving relm-a expression.

      Finally, the authors present a whole lung RNA transcript profile which largely supports their proteomic observations.

      Taken together the work presents a sound case for IL-13 being an important player in protecting against initial lung pathology.

      **Major requests:**

      The paper is really very interesting and important. To an extent it questions existing dogma of IL-13 being a driver of lung inflammation.

      Addressing the following could hopefully be achieved using archived samples or with an acceptable amount of extra experimental work.

      Figure 1: D2 and D6 Lung IL-13 concentrations (ELISA) in WT mice would set the scene for the papers story*

      We agree that showing IL-13 concentrations in the lung would nicely set the stage for the role of IL-13 during __Nippostrongylus__ infection. In the current paper, we showed IL-13 mRNA levels in Figure 3 but in a revised version, we will include D2 and D6 mRNA data in Figure 1. We attempted to quantify IL-13 protein levels in the BAL fluid of infected WT mice on D2 and D6 post-infection. However, IL-13 in the BAL was below the levels of detection for our ELISA assay. Therefore, we would need to measure IL-13 protein in total lung homogenates but we do not have material archived at present. If the editor feels this is a critical piece of data we will perform repeat experiments.

      Figure 2: The authors should add evidence that function/activity of neutrophils/eosinophils is changed/not changed: e.g. granzyme, MBP, EPO release in BAL and/or lung.

      As supported by referee 3, we feel that measuring functional readouts of neutrophils and eosinophils, while interesting, is currently outside of the scope of the paper. Further, with respect to eosinophils, we see a major reduction in total eosinophil numbers in IL-13-deficient mice which would likely result in a reduction in the level of functional molecules such as MBP. Thus, these readouts in the BAL may not be a reliable indicator of cellular function and results difficult to interpret in light of altered cell numbers.

      Additionally, some data showing changes in epithelial stress related cytokines such as IL-23 and IL-33 would be informative (IHC and /or ELISA).

      The reviewer makes a good suggestion that would complement our proteomics/pathway analysis. As described in our comment below regarding Foxa2 pathways, we do have additional data showing epithelial cell defects in the absence of IL-13 and will add this to a revised manuscript. While we do see a trend for a reduction in IL-33 mRNA in infected IL-13-deficient mice, it is difficult to correlate this with functional protein. If requested, we can perform additional analyses to measure IL-23 and/or IL-33 protein levels in archived BAL fluid or by IHC of lung sections.

      *The following will require a new experiment:

      The authors present a strong case for RELMa being associated with/driven by IL-13 responses. The following I feel would prove that IL-13 driven RELMa is important in reducing lung pathology. Can enhanced lung pathology or cell responses associated with pathology be reduced/altered by dosing Nb infected IL13-/- mice with recombinant relma or by restimulating BAL cells (for example) from IL-13-/- mice. This team is well placed to comment on the potential for such an in vivo experiment to be feasible.

      Or could the authors could also test the ability for other candidate molecules to reduce lung pathology? Would for example i/n dosing of IL-13-/- mice with AMCase, BRP39 or SP-D protect against pathology? It would be expected to be the case for SP-D.*

      Our previous study has shown that RELM-__a plays an important yet highly complex role during lung repair (see Sutherland et al. 2018: https://doi.org/10.1371/journal.ppat.1007423____). The suggested experiments would advance our understanding of the function of RELM-a and other effector molecules during type 2 immunity and repair. However, it is unlikely that the impact of IL-13 will be due to a single effector molecule (as supported by Reviewer 3) and thus these types of experiments would shift the focus of the paper from the impact of IL-13 to understanding specific function of type 2 effectors. Since our study deals more broadly with the function of IL-13 rather than the downstream effectors, we hope that this will open up further investigation of these specific molecules to the wider community to take forward.__

      *Reviewer #1 (Significance (Required)):

      The manuscript places IL-13 as an important initiator of early protection from acute lung damage. This is important as it is to an extent a non-canonical role for this cytokine. This is also important as IL-13 can be manipulated therapeutically. To maximise potential application of such drugs requires detailed understanding of the various contextual roles of IL-13. This study provides such evidence.

      The authors identify a range of target mediators.

      This is an important body of work that is useful for understanding how acute lung damage can be regulated.

      This work will be of interest to Type 2 immunologists, any researcher with an interest in pulmonary inflammation as well as mucosal immunity.

      I make these suggestions/comments based on my own background in Type 2 immunity, lung inflammation and parasitic helminth infection and immunity.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Chenery et al report that IL-13 plays a critical role in protecting mice from lung damage caused by the infection of a nematode, Nippostrongylus Brasiliensis, in WT or IL-13 knockout mice (IL-13 eGFP knock-in mice, Neill et al., Nature 2010). Phenotypically, they demonstrated that IL-13 genetic deficiency resulted in more severe lung injuries and haemorrhaging following the larvae migratory infections. Through the proteomic and transcriptomic profiling, they identified gene-expression changes involved in the cellular stress responses, e.g. up-regulating the expression of epithelial-derived type 2 molecules, controlled by IL-13. They also found that type 2 effector molecules including RELM-alpha and surfactant protein D were compromised in IL-13 knockout mice. Thus, they proposed that IL-13 has tissue-protective functions during lung injury and regulates epithelial cell responses during type 2 immunity in this acute setting. Overall, the manuscript was clearly written and a number of findings were interesting and expected compared to the published knowledge. However, this work could be improved and more impactful by further performing the following suggested experiments.

      Major points:

      1. It may not be accurate to claim that "IL-13 played a critical role in limiting tissue injury ... in the lung following infection" since IL-13 participates in both repelling worms and activating tissue reparative responses. It is very hard to distinguish these two kinds of responses with the current experimental settings because the much higher worm burden led to more direct lung damage in IL-13-/- mice than WT counterparts.*

      The reviewer raises an important point that we will need to clarify in a revised manuscript. Based on several studies, the role of IL-13 in mediating __Nippostrongylus expulsion occurs in the small intestine, after the parasites have already cleared the lung tissue. The number of worms in the lung do not differ at the time points we are investigating. We have qRT-PCR data measuring Nippostrongylus__-specific actin levels, which we and others have previously shown to accurately reflect worm numbers. We can therefore demonstrate that the differences in lung damage do not reflect a difference in the number of larvae in the lungs of IL-13 KO mice compared to WT mice. These data will be incorporated into the manuscript to better clarify this point.

      1. It would be more informative if the authors could perform the RNA-seq analysis on the IL-13-responsive cell type such as airway epithelial cells (goblet cell) by comparing WT vs IL13-/- in the context of lung damage caused by Nitrostrongylus Brasiliensis infection.*

      RNA-sequencing of specific cells would indeed be an excellent experiment that would reveal more IL-13-depedendent processes in our model. However, this would be a considerable undertaking at this stage (as reviewer 3 has pointed out). Nonetheless, our extended analysis of the Foxa2 pathway as requested below has highlighted a number of genes regulated by IL-13, which are known to be involved in epithelial cell function.

      We agree with the reviewer that showing additional validation data to support the Foxa2 defect in IL-13-deficient mice would strengthen our paper’s overall message. We have additional qRT-PCR data of IL-13-dependent genes regulated by Foxa2 (__Clca1, Muc5ac, Ccl11, and Foxa3__) that clearly support this epithelial cell-specific defect that we can readily incorporate into the revised paper.

      *Reviewer #2 (Significance (Required)):

      Overall, the manuscript was clearly written and a number of findings were interesting and expected compared to the published knowledge.

      **Referees cross-commenting**

      To Reviewer #1's Review: fair and constructive

      To Reviewer #3's Review: agree in general.*

      * Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this study, Allen, Sutherland and colleagues utilize IL-13 deficient mice to investigate the function of IL-13 in the early response to lung tissue damage induced by helminth infection. They demonstrate that IL-13 deficiency has significant effects on the acute tissue response to helminth infection (at day 2 and 6 post-infection). Particularly, IL-13 deficiency results in increased lung hemorrhaging, and more pronounced lung tissue damage evidenced by increased gaps in the alveolar architecture. They perform proteomic and transcriptomic profiling of the lungs to determine IL-13-induced pathways and demonstrate many protein and gene expression changes in the absence of IL-13. These include dysregulated collagens, reduced epithelial-derived proteins RELMalpha and surfactant protein D, downregulated pathways related to cellular stress, and increased genes associated with the Foxa2 pathway.

      Overall, the key conclusions are convincing, and the study design, methods and data analysis are clear, rigorous and thorough.

      **Minor Comments:**

      1. The authors concluded that lung epithelial cells are more sensitive to IL-13 than IL-4, but the intranasal injection of both proteins showed a similar induction of RELMα - investigation into this difference would be useful. Alternatively, providing an explanation for these different findings could be helpful.*

      Our suggestion that epithelial cells are likely to be more sensitive to IL-13 was based both on our data and the existing literature. We would agree that we do not have definitive evidence for this. Indeed, because the type 2 receptor can respond to both IL-4 and IL-13 this issue is difficult to easily resolve experimentally. We will expand on this in a revised manuscript, making our explanations clearer whilst acknowledging the alternative explanations.

      This is a good suggestion and we have additional flow cytometry data looking at hematopoietic cell expression of RELM-__a from these experiments that we can incorporate into the revised manuscript. We have found that airway macrophages were another source of RELM-a__ in the lung and mirrored the airway epithelial cell responses to both intraperitoneal and intranasal delivery of IL-4 and IL-13.

      We agree that a comparison of IL-13Ra1 versus IL-13 deficiency should be included in the discussion of our manuscript. These authors found epithelial-specific defects in IL-13Ra1-deficient mice such as Clca1 (aka Clca3), RELM-__a, and chitinase-like proteins even under homeostatic conditions, which is highly consistent with our data. This study also found that IL-13Ra1 deficiency led to increased bleomycin-induced pathology and together with our data, offers further insight into the IL-13/IL-13Ra__1 axis during lung injury. We will add these points to our discussion and will attempt to directly compare their gene expression data set with our data to find more overlapping genes between the two mouse strains and disease models.

      This is indeed a very important point we will address in a revised discussion. IL-4R__a-deficient mice did show increased bleeding in the Chen et al. study that was not seen in the IL-13Ra__1 KO suggesting IL-4 alone is sufficient to limit bleeding. This is in contrast to our study where we found increased bleeding in IL-13 KO mice independent of IL-4. However, a major difference between the studies is the background strain of mice used, which was BALB/c in the Chen et al. study versus C57BL/6 mice we used in our study. In addition to differences in IL-4 and IL-13 levels between the strains, we have unpublished observations of major differences in vascular integrity with BALB/c much more prone to bleeding, which is an active area of investigation in the lab. Although we have yet to unravel these differences mechanistically, they could explain differential requirements for IL-4 versus IL-13 to limit bleeding between the two strains.

      Our apologies, we will fix the reference duplication.

      *Reviewer #3 (Significance (Required)):

      This study addresses the specific function of IL-13 in acute helminth infection of the lung, which has not previously been studied, as most studies investigate the combined function of IL-4 and IL-13 through IL-4 receptor KO or Stat6 KO mice.

      It is a thorough, well-conducted and well-organized study with high quality data using 'omics' strategies to profile IL-13-induced genes and proteins. Their data identifies intriguing pathways that are dependent on IL-13, opening new avenues to explore for IL-13-mediated protective roles in acute lung tissue damage. Therefore this study provides conceptual and technical advances. Additionally, since targeting IL-4 and IL-13 are in clinical trials or employed therapeutically for pulmonary disorders, the findings from these studies are clinically relevant. It would however have been useful to validate some of these pathways and demonstrate epithelial-specific outcomes for IL-13-induced tissue protection.

      Previous studies using IL4RKO have shown that IL-4 and IL-13 are necessary to protect from acute tissue damage in helminth infection (Chen, Nature medicine - referred to by authors). Other studies have investigated IL-13 in fibrosis and granulomatous inflammation (papers referenced by authors, and Ramalingam Nature Immunology 2009). Last, one study shows that IL-13Ra1 signaling is important for protection in bleomycin-induced lung injury, findings using a different transgenic mouse, which are relevant for this study and may be useful to discuss (Karo-Atar, Mucosal Immunology 2016).

      As stated above - the data in this manuscript identify intriguing pathways that are dependent on IL-13, opening new, exciting avenues to explore for IL-13-mediated protective roles in acute lung tissue damage. Their data is also unique as it combines proteomics and transcriptomics, and identities previously unappreciated IL-13 regulated pathways such as cellular stress and Foxa2, which would be interesting to investigate further.

      **Referees cross-commenting**

      To Reviewer 1: The suggested data for Figure 1 (IL-13 concentrations) could be useful, but suggested experiments for Figure 2 could be outside the main focus of the paper.

      For the main suggested experiment: treatment of IL-13-/- with RELMalpha, this could be useful, One caveat is that RELMalpha might not be the only effector molecule downstream of IL-13 so the authors may not get a definitive answer. An alternative (although not as RELMalpha-specific) would be to treat IL13KO mice with FcIL-4 or FcIL-13 - the latter that drives RELMalpha, and look at whether FcIL-13 is more protective than FcIL-4.*

      We agree that rescue experiments could provide insights into the relative protective effects of IL-4 versus IL-13. However, it might be challenging to interpret the results in part because of the difficulty in establishing physiologically relevant doses and timing and the fact that IL-4 will also signal through the type 2 receptor. These difficulties are reflected in the interpretation of our current data as discussed above (pt. 1 reviewer 3). Although we have found IL-4 and IL-13 delivery experiments valuable and have used them in many of our papers, we have always been cautious in our interpretation, as we typically use these at super-physiological doses. However, this is an experiment we would consider if the editors felt it essential to the story.

      To Reviewer 2: I agree with points 1 and 3 - especially with point 3, which would give more in-depth understanding into the functional outcomes of the IL-13 -> FoxA2 pathway identified. For point 2, RNA-seq of epithelial cells would be informative, but may be beyond the scope of the project.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Allen, Sutherland and colleagues utilize IL-13 deficient mice to investigate the function of IL-13 in the early response to lung tissue damage induced by helminth infection. They demonstrate that IL-13 deficiency has significant effects on the acute tissue response to helminth infection (at day 2 and 6 post-infection). Particularly, IL-13 deficiency results in increased lung hemorrhaging, and more pronounced lung tissue damage evidenced by increased gaps in the alveolar architecture. They perform proteomic and transcriptomic profiling of the lungs to determine IL-13-induced pathways and demonstrate many protein and gene expression changes in the absence of IL-13. These include dysregulated collagens, reduced epithelial-derived proteins RELMalpha and surfactant protein D, downregulated pathways related to cellular stress, and increased genes associated with the Foxa2 pathway.

      Overall, the key conclusions are convincing, and the study design, methods and data analysis are clear, rigorous and thorough.

      Minor Comments:

      1. The authors concluded that lung epithelial cells are more sensitive to IL-13 than IL-4, but the intranasal injection of both proteins showed a similar induction of RELMα - investigation into this difference would be useful. Alternatively, providing an explanation for these different findings could be helpful.
      2. Providing data by immunofluorescence or flow cytometry of non-epithelial expression of RELMalpha following intranasal versus intraperitoneal injection of IL-4 versus IL-13 would be useful.
      3. Discussion of IL-13Ra1 deficient mice would be useful, in particular the study by Karo-Atar and Munitz in Mucosal Immunology 2016, showing that IL13Ra1 is protective against bleomycin-induced pulmonary injury (PMID: 26153764). Comparing their data with the gene expression datasets from this study would be useful (acknowledging the caveat that IL-4 effects through the type 2 receptor would also be abrogated in these IL13Ra1 mice).
      4. The authors reference Chen et al. Nature Medicine 2012, but do not discuss the finding in this paper that neither IL-4-/- nor IL13Ra1-/- have increased lung hemorrhage. This might be a mouse strain issue and worthwhile discussing.
      5. Reference 32 and 36 (Sutherland PLoS pathogens) are duplicates

      Significance

      This study addresses the specific function of IL-13 in acute helminth infection of the lung, which has not previously been studied, as most studies investigate the combined function of IL-4 and IL-13 through IL-4 receptor KO or Stat6 KO mice.

      It is a thorough, well-conducted and well-organized study with high quality data using 'omics' strategies to profile IL-13-induced genes and proteins. Their data identifies intriguing pathways that are dependent on IL-13, opening new avenues to explore for IL-13-mediated protective roles in acute lung tissue damage. Therefore this study provides conceptual and technical advances. Additionally, since targeting IL-4 and IL-13 are in clinical trials or employed therapeutically for pulmonary disorders, the findings from these studies are clinically relevant. It would however have been useful to validate some of these pathways and demonstrate epithelial-specific outcomes for IL-13-induced tissue protection.

      Previous studies using IL4RKO have shown that IL-4 and IL-13 are necessary to protect from acute tissue damage in helminth infection (Chen, Nature medicine - referred to by authors). Other studies have investigated IL-13 in fibrosis and granulomatous inflammation (papers referenced by authors, and Ramalingam Nature Immunology 2009). Last, one study shows that IL-13Ra1 signaling is important for protection in bleomycin-induced lung injury, findings using a different transgenic mouse, which are relevant for this study and may be useful to discuss (Karo-Atar, Mucosal Immunology 2016).

      As stated above - the data in this manuscript identify intriguing pathways that are dependent on IL-13, opening new, exciting avenues to explore for IL-13-mediated protective roles in acute lung tissue damage. Their data is also unique as it combines proteomics and transcriptomics, and identities previously unappreciated IL-13 regulated pathways such as cellular stress and Foxa2, which would be interesting to investigate further.

      Referees cross-commenting

      To Reviewer 1: The suggested data for Figure 1 (IL-13 concentrations) could be useful, but suggested experiments for Figure 2 could be outside the main focus of the paper.

      For the main suggested experiment: treatment of IL-13-/- with RELMalpha, this could be useful, One caveat is that RELMalpha might not be the only effector molecule downstream of IL-13 so the authors may not get a definitive answer. An alternative (although not as RELMalpha-specific) would be to treat IL13KO mice with FcIL-4 or FcIL-13 - the latter that drives RELMalpha, and look at whether FcIL-13 is more protective than FcIL-4.

      To Reviewer 2: I agree with points 1 and 3 - especially with point 3, which would give more in-depth understanding into the functional outcomes of the IL-13 -> FoxA2 pathway identified. For point 2, RNA-seq of epithelial cells would be informative, but may be beyond the scope of the project.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Chenery et al report that IL-13 plays a critical role in protecting mice from lung damage caused by the infection of a nematode, Nippostrongylus Brasiliensis, in WT or IL-13 knockout mice (IL-13 eGFP knock-in mice, Neill et al., Nature 2010). Phenotypically, they demonstrated that IL-13 genetic deficiency resulted in more severe lung injuries and haemorrhaging following the larvae migratory infections. Through the proteomic and transcriptomic profiling, they identified gene-expression changes involved in the cellular stress responses, e.g. up-regulating the expression of epithelial-derived type 2 molecules, controlled by IL-13. They also found that type 2 effector molecules including RELM-alpha and surfactant protein D were compromised in IL-13 knockout mice. Thus, they proposed that IL-13 has tissue-protective functions during lung injury and regulates epithelial cell responses during type 2 immunity in this acute setting. Overall, the manuscript was clearly written and a number of findings were interesting and expected compared to the published knowledge. However, this work could be improved and more impactful by further performing the following suggested experiments.

      Major points:

      1. It may not be accurate to claim that "IL-13 played a critical role in limiting tissue injury ... in the lung following infection" since IL-13 participates in both repelling worms and activating tissue reparative responses. It is very hard to distinguish these two kinds of responses with the current experimental settings because the much higher worm burden led to more direct lung damage in IL-13-/- mice than WT counterparts.
      2. It would be more informative if the authors could perform the RNA-seq analysis on the IL-13-responsive cell type such as airway epithelial cells (goblet cell) by comparing WT vs IL13-/- in the context of lung damage caused by Nitrostrongylus Brasiliensis infection.
      3. Figure 6C, the transcriptional profiling of mouse lungs revealed that the Foxa2 pathway was significantly up-regulated in the IL-13-/- infected mice. This is an important finding because this pathway plays a critical role in the process of alveolarization and inhibiting goblet cell hyperplasia. In order to validate this finding, some components in this pathway could be further examined.

      Significance

      Overall, the manuscript was clearly written and a number of findings were interesting and expected compared to the published knowledge.

      Referees cross-commenting

      To Reviewer #1's Review: fair and constructive

      To Reviewer #3's Review: agree in general.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study addresses the role IL-13 in promoting lung damage following migration of the helminth N. brasiliensis larvae from the circulatory system to the lung. The work clearly shows using IL13-/- mice that Nb elicited IL-13 immunity at days 2-6 post-infection reduces pathology. The authors demonstrate an association with reduced eosinophils but no effect on neutrophil numbers.

      Proteomic analysis identifies a number of molecules known to be involved in protecting against type 2 pathologies such as relm-a and SP-D.

      The authors then identify a clear requirement for IL-13 in driving relm-a expression.

      Finally, the authors present a whole lung RNA transcript profile which largely supports their proteomic observations.

      Taken together the work presents a sound case for IL-13 being an important player in protecting against initial lung pathology.

      Major requests:

      The paper is really very interesting and important. To an extent it questions existing dogma of IL-13 being a driver of lung inflammation.

      Addressing the following could hopefully be achieved using archived samples or with an acceptable amount of extra experimental work.

      Figure 1: D2 and D6 Lung IL-13 concentrations (ELISA) in WT mice would set the scene for the papers story

      Figure 2: The authors should add evidence that function/activity of neutrophils/eosinophils is changed/not changed: e.g. granzyme, MBP, EPO release in BAL and/or lung. Additionally, some data showing changes in epithelial stress related cytokines such as IL-23 and IL-33 would be informative (IHC and /or ELISA).

      The following will require a new experiment:

      The authors present a strong case for RELMa being associated with/driven by IL-13 responses. The following I feel would prove that IL-13 driven RELMa is important in reducing lung pathology. Can enhanced lung pathology or cell responses associated with pathology be reduced/altered by dosing Nb infected IL13-/- mice with recombinant relma or by restimulating BAL cells (for example) from IL-13-/- mice. This team is well placed to comment on the potential for such an in vivo experiment to be feasible.

      Or could the authors could also test the ability for other candidate molecules to reduce lung pathology? Would for example i/n dosing of IL-13-/- mice with AMCase, BRP39 or SP-D protect against pathology? It would be expected to be the case for SP-D.

      Significance

      The manuscript places IL-13 as an important initiator of early protection from acute lung damage. This is important as it is to an extent a non-canonical role for this cytokine. This is also important as IL-13 can be manipulated therapeutically. To maximise potential application of such drugs requires detailed understanding of the various contextual roles of IL-13. This study provides such evidence.

      The authors identify a range of target mediators.

      This is an important body of work that is useful for understanding how acute lung damage can be regulated.

      This work will be of interest to Type 2 immunologists, any researcher with an interest in pulmonary inflammation as well as mucosal immunity.

      I make these suggestions/comments based on my own background in Type 2 immunity, lung inflammation and parasitic helminth infection and immunity.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for carefully reading our manuscript. We found their comments to be incredibly thoughtful and constructive and greatly appreciate their feedback. We are confident that addressing the reviewers’ concerns will strengthen our manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript entitled 'Combinatorial patterns of graded RhoA activation and uniform F-actin depletion promote tissue curvature' by Denk-Lobnig et al. the authors study the organisation of junctional F-actin during the process of mesoderm invagination during gastrulation in the model Drosophila. Following on from previous work by the same lab that identified and analysed a multicellular myosin II gradient across the mesoderm important for apical constriction and tissue bending, the authors now turn their attention to actin. Using imaging of live and fixed samples, they identify a patterning of F-actin intensity/density at apical junctions that they show is dynamically changing going into mesoderm invagination and is set up by the upstream transcription factors driving this process, Twist and Snail. They go on to show, using genetic perturbations, that both actin and the previously described myosin gradient are downstream of regulation and activation by RhoA, that in turn is controlled by a balance of RhoGEF2 activation and RhoGAP C-GAP inactivation. The authors conclude that the intricate expression patterns of all involved players, that all slightly vary from one another, is what drives the wild-type distinctive cell shape changes in particular rows of cells of the presumptive mesoderm and surrounding epidermis.

      This is a very interesting study analysing complex and large-scale cell and tissue shape changes in the early embryo. Much has been learned over the last decade and more about many of the molecular players and their particular behaviours that drive the process, but how all upstream regulators work together to achieve a coordinated tissue-scale behaviours is still not very well understood, and this study add important insights into this.

      The experiments seem well executed and support the conclusion drawn, but I have a few comments and questions that I feel the authors should address to strengthen their argument.

      We thank the reviewer for their interest in the paper and their helpful comments.

      **General points:**

      1. The authors state early on that they chose to focus on junctional rather than apical medial F-actin, but it is unclear to me really what the rationale behind that is. In much of the authors earlier work, they study the very dynamic behaviour of the apical-medial actomyosin that drives the apical cell area reduction in mesodermal cells required for folding. They have previously analysed F-actin in the constricting cells, but have only focused on the most constricting central cell rows (Coravos, J. S., & Martin, A. C. (2016). Developmental Cell, 1-14). The role of junctional F-actin compared to the apical-medial network on which the myosin works to drive constriction is much less clear, it could stabilize overall cell shape or modulate physical malleability or compliance of cells, or it could more actively be involved in implementing the 'ratchet' that needs to engage to stabilise a shrunken apical surface. I would appreciate more explanation or guidance on why the authors chose not to investigate apical-medial F-actin across the whole mesoderm and adjacent ectoderm, but rather focused in junctional F-actin, especially explaining better throughout what they think the role of the junctional F-actin they measure is.

      We focused on the junctional/lateral F-actin pool because this is where tissue-wide patterns in intensity variation are observed, especially when looking across the mesoderm-ectoderm boundary. Indeed, when we compare the apical-medial F-actin of marginal mesoderm cells to ectoderm cells in cross sections, we find no apparent difference, whereas there is a striking difference in junctional/lateral F-actin density (Fig. 1B, C; Supplemental Fig. 1A, D). We provide some preliminary en face views of the medial-apical surface in our response to Point 2, and we will obtain higher resolution images from live and fixed embryos to better show the network organization. We agree with the reviewer that this requires added justification. Therefore, we will: 1) Provide higher resolution images of apical-medial F-actin comparing different regions of mesoderm and ectoderm, and 2) revise the text to better justify why we chose junctional/lateral F-actin to focus our tissue-level analysis and to elaborate more on what we think the role of junctional/lateral F-actin in this process may be.

      Comparing the F-actin labeling in the above previous paper to the stainings/live images shown here, they look quite different. This is most likely due to the authors here not showing the whole apical area but focusing on junctional, i.e. below the most apical region. It is not completely clear to me from the paper at what level along the apical-basal axis the authors are analysing the junctional F-actin. Supplemental Figure 2 seems to suggest about half-way down the cell, which would be below junctional levels. Could the authors indicate this more clearly, please? Overall, I would appreciate if the authors could supply some more high-resolution images of F-actin from fixed samples (which I assume will give the better resolution) of how F-actin actually looks in the different cells with differing levels. Is there for instance a visible change to F-actin organisation? And could this help explain the observed changes in intensity and their function?

      We apologize for the confusion, we were referring to ‘junctions’ as the lateral contacts between cells, as opposed to the adherens junctions at the apical surface. We have modified the text to use the term ‘lateral’ rather than ‘junctional’ F-actin, so as to avoid this confusion. The difference in cortical F-actin staining is not restricted to a particular apical-basal position, but extends along the length of the lateral domain (Fig. 1B, C). As far as we can tell the actin is bundled and underlies the entire cell circumference. We will: 1) better define the apical-basal position within the cell that we are showing, and 2) show high-resolution en face images of F-actin at different apical-basal positions, across different cell positions, in live and fixed embryos to better justify our focus on lateral F-actin (similar orientation, but higher resolution/quality than preliminary live data below).

      Along the same lines of thought as in point 2): Dehapiot et al. (Dehapiot, B., ... & Lecuit, T. (2020). Assembly of a persistent apical actin network by the formin Frl/Fmnl tunes epithelial cell deformability. Nature Cell Biology, 1-21) have recently shown for the process of germband extension and amnioserosa contraction that two pools of F-actin can be observed, a persistent pool not dependent on Rho[GTP] and a Rho-[GTP] dependent one. Could the authors comment on what they think might occur in the mesoderm, are similar pools present here as well?

      1. As the authoirs state themselves, Rho does not only affect actin via diaphanous, but of course also myosin via Rock. So it would be good to refelect this more in the interpretation and discussion of data, as the causal timeline could be complex.

      We thank the reviewer for reminding us to address this point and to discuss this excellent recent paper. We have not observed a persistent medial actin network in mesoderm cells or ectoderm cells at this stage (i.e. prior to germband extension). It was previously shown in mesoderm cells that pulsed myosin contractions condense the medio-apical F-actin network, but that this is often followed by F-actin network remodeling and that total F-actin levels decrease during apical constriction (Mason et al., 2013). Furthermore, Rho-kinase inhibition in mesoderm cells significantly disrupts this network, but does not inhibit the rapid assembly and disassembly of apical F-actin cables, which could reflect elevated actin turnover (Mason et al., 2013; Jodoin et al., 2015). To address the reviewer’s points, we 1) now include a paragraph in the Discussion to discuss the Dehapiot et al. paper (Comment 3) and the possible roles of various pools of F-actin and Rock/myosin shape the tissue (Comment 4) (lines 404-408), and 2) will image the apical surface of mesoderm and ectoderm at this stage and also germband extension (as a positive control) in order to determine whether there is a persistent network.

      **More specific comments to experiments and figures:**

      1. Reduction of junction function by alpha-catenin-RNAi: how strong is the reduction in catenin? Could they label a-catenin in fixed embryos? The authors conclude the original pre-constriction patterning of F-actin intensity is not dependent on intact junctions, but they show that the increase in F-actin in the mesodermal cells concomitant with apical constriction is in fact impaired in the RNAi. Thus, the authors can also not conclude whether the continued accumulation of myosin and its persistence depend on intact junctions. The initial set-up of the myosin gradient in terms of intensity distribution is unaffected, but clearly dynamics, subcellular pattern, interconnectivity etc. of myosin are affected and thus may well depend on some mechanical feed-back. I find this section of the manuscript slightly overstated and feel the conclusion should be more cautious.

      We thank the reviewer for pointing this out; we completely agree that we should have been more precise with our language. Our main conclusion was that myosin accumulation in a gradient does not require ‘sustained mechanical connectivity’. We felt it was important, given our model of transcriptional patterning, to show that some patterns did not result from mechanics or even apical constriction. Alpha-catenin knock-down provides the cleanest and most severe disruption of adhesion that we can accomplish at this developmental stage. We showed that alpha-catenin-RNAi resulted in: a) almost no intercellular connectivity in myosin structures (Yevick et al., 2019), and b) no apical constriction (this study, Fig. 3B).

      We: 1) revised the text in this section, clarifying that we are only referring to the gradient and that other myosin properties clearly do depend on mechanics, 2) will include data better showing the extent of the alpha-catenin knockdown and its effects on junctions and actomyosin.

      Figure 1 versus Figure 2: Why do the Utrophin-ABD virtual cross-sections look so fuzzy and bad in comparison to phalloidin labelled F-actin in the virtual cross-section in Fig. 1B and C? The labelling shown in 2B and D does not even look very junctional...

      We apologize that we did not explain the difference in visualization methods more clearly. For live images (Figure 2), we used a projection of cross-sections, which includes 20 µm length along the anterior-posterior (AP) axis. This projection method is less dependent on the specific AP position of the cross-section and the specific cells being shown. We did this because the projection helps to visualize the tissue pattern in live images where fluorescence images are noisier than fixed images, which exhibit cleaner labeling (Fig. 1). To address this point, we plan to: 1) Edit the text to make the method of visualization clearer, and 2) fix snail and twist mutant embryos and also provide thin cross-sections analogous to Fig. 1.

      Figure 5 C and D: the control gradients for myosin shown in C and D are completely different, for C the half-way height cell row is deduced as 5 whereas the (in theory identical) control measure in D has row 3 at halfway height! Why is this? Putting all curves together in the same panel would suggest that that C control curve is very similar to RhoGEF2-OE! This can't be right.

      The reason for the different width of the gradients in these controls is the Sqh::GFP copy number. All of our experiments perturbing Rho were carefully controlled so as to ensure the same copy number of the fluorescent marker that we were visualizing. For technical reasons, we were only able to get 1 copy of the Sqh::GFP into the RhoGEF2-OE background. Having two copies of the Sqh::GFP appears to have a slightly activating effect; in fact, the reviewer might have noticed that ventral furrows with 2 copies Sqh::GFP (and a wider gradient) have lower curvature, consistent with our main conclusion (Fig. 7 C). The effects of fluorescently tagged markers were a concern for us and so we were careful to show that the effects of changing RhoA activity on tissue curvature occur regardless of the fluorescent marker (i.e., Sqh::GFP or Utr::GFP, Fig. 7 and Sup. Fig. 7).

      Still in Figure 5: Panels C and D again, but for apical area: are the control and C-GAP-RNAi or RhoGEF2-OE curves significantly different? What statistics were used on this?

      We thank the reviewer for this point. We did not include statistical comparisons of the gradient width originally, because we felt that it does not completely capture the difference between the two curves and that presenting the curves instead lets readers examine the intricacies of the data as a whole. However, to address the reviewer’s point, we will add statistical comparisons for apical area as well as myosin and actin patterns.

      Supplemental Figure 1: Panels in D: I appreciate this control, but would really also like to see the same control at a stage when folding has commenced and stretched cells are present at the margin of the mesoderm. How homogenous does the GAP43 label look in those?

      We will add a more apical projection (with quantification) of this embryo, in which folding has already commenced, to the revised manuscript, so its stage is clearer.

      Supplemental Figure 5: Panel 5 B: the authors conclude that the myosin gradient under RhoGEF2 RNAi is not smaller, but looking at the curves it in fact looks wilder. They also mention that the overall level of myosin in this condition is lower than the control...

      We will include quantification of absolute levels in Supplemental Figure 5 to compare overall levels. We will also statistically compare RhoGEF2 RNAi and control gradients and update our conclusions accordingly.

      Following on from the above, a comment of Figure 7: - The authors use RhoGEF2 RNAi stating that it affects the actin pattern, but the myosin pattern also seems affected. In line 318 the authors state that they use this condition to look at how junctional actin density affects curvature. I find this phrase misleading as It might lead the readers to think that RHoGEF2 RNAi only affects junctional F-actin, although it also affects myosin patterning.

      We thank the reviewer for catching this, that’s a good point. We have revised the text in lines 317-326 to more accurately describe the effect of RhoGEF2-RNAi on actin and myosin patterning, and to connect this to the effect on curvature.

      • Line 311: confusingly, the authors state that an increase in the actomyosin gradient affects curvature. But it is only the myosin gradient that is increased, while the junctional actin gradient is flatter than the control in both C-GAP RNAi and RhoGEF2 OE (the distinction is even made by authors line 243). Could this be clarified?

      We thank the reviewer for pointing out this imprecision on our part and have revised Line 311 to more precisely describe the individual effects on myosin and F-actin pattern changes upon RhoA perturbation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Mesoderm invagination during Drosophila gastrulation has been a paradigm for how regionally restricted gene expression locally activates Rho signalling and for how subsequently activated acto-myosin drives cell shape changes which in turn lead to a change in tissue morphology. Despite the numerous studies on this subject and a good understanding of the overall process, several important aspects have remained elusive so far. Among these is the dynamics of cortical and junctional F-actin and its contribution to the shape changes of cells and tissue. Previous studies have focused on MyoII, the „active" component of the actin cytoskeleton. The dynamics of the „passive" counterpart, namely actin filaments, has been neglected, although it is clear that Rho signalling controls both branches.

      We thank the reviewer for the tough questions. The reviewer raises important points that, even if not all feasible to address experimentally, can be addressed by being more precise with our language__ and conclusions.__

      1. Although I clearly acknowledge the efforts taken by the authors to define a function of cortical (junctional) F-actin in apical constriction and furrow formation, several central aspects of the study are not sufficiently resolved and conclusive. Rho signalling controls MyoII via Rok and F-actin via forming/dia, among other less defined targets. The role of MyoII and cortical contraction could be conclusively sorted out, since inhibition of Rok affects the MyoII branch but not the other branches. A similar approach, i. e. a specific inhibition/depletion without affecting the other branch, has not been taken yet for the F-actin branch. The authors have not resolved this issue. When analysing the mutants, the authors cannot distinguish the effect of Rho signalling on the MyoII and F-actin branch. For this reason the changes in F-actin distribution in the mutants are linked to changes in Myo activity and thus a function cannot be assigned to F-actin. In order to derive a specific role of F-actin distribution for furrow formation, the authors need to find experimental ways to affect F-actin levels without affecting MyoII, for example by analysing mutants for dia or other formins.

      The reviewer’s assertion that Rok and Diaphanous only affect myosin and actin, respectively, is oversimplified. For example, in mammals, Rok regulates the Lim-Kinase/Cofilin pathway and thus F-actin (Geneste et al., JCB, 2002). The ‘F-actin branch’ of the RhoA pathway has been examined in multiple previous studies of mesoderm invagination (Fox and Peifer, 2007; Homem et al., 2008; Mason et al., 2013). We did not include diaphanous mutants in this tissue-level study because diaphanous mutants and actin drugs: a) affect RhoA signaling (Munjal et al., 2015; Coravos et al., 2016; Michaux et al., 2018), b) disrupt adherens junctions and tissue integrity (Homem et al, 2008; Mason et al., 2013), and c) have a preponderance of cellularization defects (Afshar et al., 2000). However, we agree with the reviewer that this could potentially be interesting, and so we 1) will look at the tissue-level pattern in Diaphanous-depleted embryos, 2) will analyze tissue-level actomyosin patterns in Rok inhibitor-injected embryos, and 3) have added a section to the Discussion (lines 418-432) explaining past work in this area and why we did not provide data on diaphanous mutants. A caveat of the proposed experiments is that actin and myosin ‘branches’ may be too interconnected to be conclusively separated.

      The authors employ a discontinuous spatial axis by the cell number. Although there are good arguments to understand and treat the cells as units, there are also good arguments for using a scale with absolute distance. I have doubts that the graded distributions presented by the authors are a result of this scaling with cell units. When looking at panel B of Fig 1 or Fig. 2A,B, for example, a sharp step like distribution is visible at the boundary between mesoderm and ectoderm anlage. In contrast a F-actin intensity distribution is graded after quantification. The graded distribution appears not to be a consequence of averaging because an even sharper step is very obvious in a projection along the embryonic axis as shown in panel B and D of Fig. 2, for example. The difference of a sharp step in the images and graded distribution after quantification with a spatial axis in cell number, is obvious for a-catenin in Fig. 3D and Rho signalling in Fig. 4 B. As the authors base their central conclusion (see headline) on the graded distribution, resolving the issue of spatial scale is a prerequisite of publication.

      We thank the reviewer for their point. It is an excellent idea and we have included representative plots with a continuous spatial scale in addition to our cell-based analysis (see below, each trace is average line intensity for 1 embryo). The spatially resolved analysis shows similar patterns for F-actin, myosin and RhoA pathway components as the cell-based metric and we plan to include this data as Supplemental Fig. 3 and 4 in a revised version of the manuscript.

      The authors put the spatial distribution of Rho signalling and F-actin into the center of their conclusion. They do so by affecting the pattern with mutants in twist/snail and varying upstream factors of Rho signalling. With respect to myo activation this have been done previously although possibly with less detail and it is no new insight that the width of the mesoderm anlage and corresponding Rho signalling domain has a consequence on the shape of the groove and furrow. To maintain the conclusion of the manuscript that spatially graded Rho signalling is contributes to tissue curvature, more convincing ways to change the pattern of Rho signalling are needed. Changing the balance of GEF and GAP shows the importance of Rho signalling and possibly signalling levels but not the contribution of its spatial distribution.

      A strength of our study was that we were able to stably ‘tune’ Rho signaling pattern and then follow tissue shape at later stages to determine the connection between the two. We respectfully disagree with the statement that, “with respect to myosin activation this has been done previously”. In past work, we expanded myosin activation by modifying embryonic cell fate, including changes in dorsal cell fates (Heer et al. 2017; Chanet et al., 2017). Here, we directly manipulate RhoA signaling, maintaining the width of the mesoderm anlage (see images below).

      A central conclusion of our study is that RhoA activation level determines the width of myosin activation within a normally sized mesoderm anlage, which has not been done before. The genetic approach presented in the paper was the best way we found to manipulate the spatial pattern of myosin/actin in a stable manner that lasts through invagination. It is worth noting that this approach allowed us to carefully ‘tune’ the level of RhoA activation so as to avoid elevating RhoA levels to the point that it disrupts signaling polarity within the cell (Mason et al., 2016). In our hands, optogenetic manipulation of RhoA, which requires continuous optical input, was less robust because: a) 2D tissue flow precludes delivering a consistent level of activation to given cells over the time course of invagination, b) tissue folding (i.e. 3D deformation) dramatically alters how much light is delivered to the mesoderm cells.

      To address the reviewer’s point, we: 1) edited the Discussion to explicitly state that we did not alter the pattern of RhoA activation without altering RhoA signaling levels and (lines 339-343), 2) plan to include Snail or Twist stainings showing that the width of the mesoderm anlage is not altered by changes in RhoA signaling so there is no confusion about this point, and 3) plan to include a mechanical model that compares how altering signaling levels vs. altering the spatial distribution of signaling affect fold curvature, respectively.

      Reviewer #2 (Significance (Required)):

      The question of a contribution of F-actin is addressed in this manuscript. The authors quantify F-actin in fixed and living embryos at two prominent steps in ventral furrow formation, (1) shortly prior to onset of apical constrictions and (2) when the groove has formed. They distinguish junctional and „medial" cortical F-actin. They employ a discontinuous spatial axis, the number of cells away from the ventral midline but not an absolute scale (see my notes below). The measurements are applied to wild type and mutant embryos affecting the transcriptional patterning (twist, snail), adherens junctions, and Rho signalling. The authors claim to reveal by their measurements a graded distribution of F-actin intensities with a peak at the ventral midline and a second peak at the boundary between mesoderm and ectoderm with a low point in the stretching cells of the mesectoderm. The authors further claim to reveal a graded distribution of Rho signalling components within the mesoderm anlage. Based on these data the authors conclude that graded Rho signalling and depletion of F-actin promote tissue curvature.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Previous work has shown that mesoderm invagination at the ventral midline of the Drosophila embryo requires precise spatial regulation of actomyosin levels in order to fold the tissue. In this work, Denk-Lobnig and colleagues further investigate the spatial distribution of myosin and F-actin in the mesoderm and how these patterns are established. The authors identify an F-actin pattern at the apical cell junctions that emerges upon folding, with elevated levels in the cells around the ventral midline, a decrease in junctional F-actin in the marginal mesoderm, and then an increase at the mesoderm-ectoderm border. They identify Snail and Twist as regulating different aspects of establishing this F-actin pattern. Additionally, by modulating RhoA activity (downstream of Twist) the authors are able to alter the width of the actomyosin pattern without affecting the width of the mesoderm tissue, which in turn affects the curvature of the tissue fold and the post-fold lumen size.

      The authors have conducted an elegant quantitative analysis of the distribution of actin, myosin and several of their regulators across the tissue. The study makes an attempt at integrating a large amount of information into a model of tissue folding, and the concept of mechanical gradients is exciting and still underexplored. I am concerned that the interpretation of some results focuses on specific details but ignores larger scale effects (e.g. potential effects of some of the manipulations on the ectoderm, and the impact that that could have on tissue folding are largely ignored). The statistical analysis of several results should also be improved. I suggest to address the following points.

      We thank the reviewer for their interest in our work and their important suggestions.

      **MAJOR**

      1. Line 127 and Figure 1E: The authors argue that there is an anticorrelation between F-actin distribution and cell areas. However, an R-squared value of 0.1083 rather suggests little-to-no correlation. The authors should evaluate the statistical significance of that correlation.

      To indicate whether the relationship between F-actin distribution and cell areas is significant, we will report the p-value for the F-test for overall significance for our regression analysis, as well as sample size, of this data in the revised manuscript. The F-statistic for this analysis is __F = 89.2, p-value = 4.7e-20.__

      Figure 5: claims that the width of the actomyosin gradient is affected by the various perturbations should be supported with statistical analysis. For example, the half-maximal gradient position for each individual myosin trace could be calculated (instead of using the mean trace), displayed using a box plot, and tested for significance using the Mann-Whitney U test, as in Figure 7. This is slightly complicated by the fact that the control group in Figure 5C is the same as the control group in Figure 3E, which needs to be carefully considered. Also, similar calculations should be made for the F-actin data in Fig 5E-G since throughout the rest of the paper, the authors refer to the width of the "actomyosin gradient" which implicates both myosin and actin.

      We thank the reviewer for this point We will include statistical comparisons for myosin gradients in the revised manuscript. To allow for multiple comparisons using the same control, we plan to use Kruskal-Wallis testing, which is analogous to one-way ANOVA for non-parametric data, and a post-hoc test to determine which pairs have significantly different distributions.

      We will update the language in the manuscript to distinguish between actin and myosin patterns. As our main conclusion is that F-actin depletion levels are changed by RhoA in marginal mesoderm cells, we will statistically compare this between groups.

      Line 142 and Figure 2B-C: I was confused by the description of the snail phenotype: - a. the claim that in snail mutants actin levels are uniform: based on Figure 2C, I'd say that F-actin levels decrease across the mesoderm moving away from the ventral midline, and that the main issue is with the accumulation of actin in the distal end of the mesoderm. The authors should better justify the claim that F-actin levels are uniform in snail mutants (or remove it). Maybe comparing F-actin levels in the first four or five rows of the mesoderm? - b. how about the increase of F-actin in the distal mesoderm, just adjacent to the ectoderm boundary? Why is it gone in snail mutants?

      1. We agree that the intensity in all embryos appears to decrease on the sides of the embryos when imaged in this way, but it is also clear that there is no abrupt increase in F-actin density going into the ectoderm. In our experience, the edge effect is due to the distance of the side of the embryo from the coverslip rather than actual lower F-actin density. This is suggested by: a) the fact that all snail mutant embryos peak at the center of the image even though they are not all oriented with the ventral side perfectly on top, and b) all embryos exhibit an intensity decrease within the ectoderm toward the edges of the image that are further away from the coverslip (Fig. 2 C, E, F). We will: 1) modify the text to include an explanation, and 2) fix and stain snail and twist mutant cross-sections that will not exhibit this effect of imaging depth, for comparison.
      2. We show in Figure S1C that in wild-type, F-actin does not actually increase in cells at the ectoderm boundary, but merely decreases in lateral mesoderm cells. Thus, it is likely that snail mutant embryos are merely lacking patterning in the mesoderm, where snail is active.
      3. With alpha-catenin-RNAi, F-actin depletion across the mesoderm still occurs, but junctional F-actin levels are not increased around the midline. While some explanations are offered in the text, the reason for this phenotype seems important for the story. The text in lines 204-205 suggests that F-actin that would normally be localized to the apical junctions is instead being drawn into medioapical actomyosin foci. Is this idea supported by evidence that medioapical F-actin in control embryos is lower than in alpha-catenin RNAi?

      We appreciate the reviewer’s suggestion to explain this more thoroughly. We find that in alpha-catenin-RNAi and even arm (β-catenin) mutant embryos, junctional complexes (i.e., E-cadherin) are drawn into the myosin spot through continuous contractile flow (see below and Martin et al., 2010 for arm). To make this clear in the manuscript, we plan to: 1) include data showing the effects of alpha-catenin RNAi on F-actin and E-cadherin localization in fixed embryos, which is now included in Supplemental Figure S3, and 2)

      include live imaging of UtrGFP-labeled alpha-catenin RNAi embryos.

      Figure 6A: there is a correlation between cell position and the productivity of myosin pulses, which the authors attribute to the RhoA gradient. This should be more definitively demonstrated by:

      • a. Plot and calculate the correlation between RhoA levels (measured with the RhoA probe) and the change in cell area caused by a contraction pulse. Is this a significant correlation?

      • b. How does myosin persistence change when RhoA is manipulated, e.g. in RhoA overexpressing embryos or in RhoA RNAi?

      It has already been shown that there is a correlation between myosin amplitude and apical constriction amplitude (Xie et al., 2015).__ Apical myosin and Rho-kinase localization depends entirely on RhoA activity (Mason et al., 2016) and Rho-kinase co-localizes precisely with myosin in both space and time (Vasquez et al., 2014). Changing levels of the RhoA regulator C-GAP has been shown to affect myosin persistence and the productivity of apical constriction, with higher C-GAP causing less productive constriction (Mason et al., 2016). We plan to update the text to connect the observation with what has been shown in previous studies and to make statements regarding causality on the tissue-level more cautious. However, our observation further shows how cytoskeletal activity is patterned across the mesoderm, so we think it has value and that it should be included in this paper. An in depth study of the connection between RhoA regulators and myosin persistence/pulsing is beyond the scope of the present study, especially considering possible COVID-19 restrictions. Making these connections will require substantial effort in the future.__

      **MINOR**

      1. The authors should indicate if the myosin shown in Figure 1A is junctional or medioapical. If it is junctional, does medioapical myosin better match junctional F-actin and cell areas? Similarly, if they are showing medioapical myosin, how does junctional myosin compare to junctional actin? It seems to me that consistently comparing the patterns of junctional F-actin and medioapical myosin (and RhoGEF2, RhoA, and ROCK in Figure 4) could be somewhat misleading, as the pools compared localize in different subcellular compartments.

      The myosin images shown throughout the paper are medioapical myosin. Junctional myosin in mesoderm cells is lower in intensity and cannot easily be seen by live imaging. We agree that it is important for the reader to see all pools of these proteins. Therefore, we will include in a supplemental figure high resolution images of actin and myosin at both apical and subapical positions for midline mesoderm, marginal mesoderm, and ectoderm cells at the time of folding. We will also justify why the analyzed pools were chosen, respectively.

      Most of the intensity traces for myosin and F-actin are presented as normalized intensity, relative to the highest intensity in the trace. However, there are claims throughout the text about the relative levels of myosin (ex. Line 241) and F-actin (conclusions based on Fig. 2B-D) that should be supported by quantification. It seems that changes in intensity for both F-actin and myosin, in addition to shape of the gradient, would contribute to the understanding of actomyosin regulation in this tissue. However, if intensities cannot be directly compared between groups due to variation in imaging settings or staining protocols, there should be no claims made about changes in overall F-actin or myosin intensity.

      We appreciate the point made by the reviewer here. To address this point, we will provide data for absolute levels in relevant cases and be more precise in our conclusions.

      The significance of the correlation in Figure 7E should be quantified.

      We will report the p-value for the F-test for overall significance for our regression analysis of this data. The F-statistic for this analysis is F = __15.6, p-value = 0.00103.__

      Supplemental Figure 2: does the segmentation image match the second Z reslice immediately above? It does not appear so, or perhaps they are just not aligned. Having the two match would be more convincing of the segmentation technique.

      We will ensure that matching images are used for this figure.

      Reviewer #3 (Significance (Required)):

      The authors have conducted an elegant quantitative analysis of the distribution of actin, myosin and several of their regulators across the tissue. The study makes an attempt at integrating a large amount of information into a model of tissue folding, and the concept of mechanical gradients is exciting and still underexplored. I am concerned that the interpretation of some results focuses on specific details but ignores larger scale effects (e.g. potential effects of some of the manipulations on the ectoderm, and the impact that that could have on tissue folding are largely ignored). The statistical analysis of several results should also be improved.

      This is a great point. It is important to note that our conclusions required us to ‘tune’ the expression of GEF and the depletion of GAP with GAL4 drivers to get expression levels that do not dramatically affect RhoA polarity within mesoderm cells, but that alter the tissue level pattern within the mesoderm. Furthermore, we were cautious in making sure that our perturbations that elevate RhoA activation level did not lead to elevated myosin in the ectoderm (Fig. 5A and B). It is worth noting that RhoGEF2 is still full-length in all cases and has all of the normal regulatory domains that allow its activity to be restricted to the mesoderm at this stage. To more explicitly show the effect of our perturbations on ectoderm cells, we plan to include higher resolution images comparing myosin and F-actin organization/levels in the ectoderm for our manipulations of RhoA signaling.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Previous work has shown that mesoderm invagination at the ventral midline of the Drosophila embryo requires precise spatial regulation of actomyosin levels in order to fold the tissue. In this work, Denk-Lobnig and colleagues further investigate the spatial distribution of myosin and F-actin in the mesoderm and how these patterns are established. The authors identify an F-actin pattern at the apical cell junctions that emerges upon folding, with elevated levels in the cells around the ventral midline, a decrease in junctional F-actin in the marginal mesoderm, and then an increase at the mesoderm-ectoderm border. They identify Snail and Twist as regulating different aspects of establishing this F-actin pattern. Additionally, by modulating RhoA activity (downstream of Twist) the authors are able to alter the width of the actomyosin pattern without affecting the width of the mesoderm tissue, which in turn affects the curvature of the tissue fold and the post-fold lumen size.

      The authors have conducted an elegant quantitative analysis of the distribution of actin, myosin and several of their regulators across the tissue. The study makes an attempt at integrating a large amount of information into a model of tissue folding, and the concept of mechanical gradients is exciting and still underexplored. I am concerned that the interpretation of some results focuses on specific details but ignores larger scale effects (e.g. potential effects of some of the manipulations on the ectoderm, and the impact that that could have on tissue folding are largely ignored). The statistical analysis of several results should also be improved. I suggest to address the following points.

      MAJOR

      1. Line 127 and Figure 1E: The authors argue that there is an anticorrelation between F-actin distribution and cell areas. However, an R-squared value of 0.1083 rather suggests little-to-no correlation. The authors should evaluate the statistical significance of that correlation.
      2. Figure 5: claims that the width of the actomyosin gradient is affected by the various perturbations should be supported with statistical analysis. For example, the half-maximal gradient position for each individual myosin trace could be calculated (instead of using the mean trace), displayed using a box plot, and tested for significance using the Mann-Whitney U test, as in Figure 7. This is slightly complicated by the fact that the control group in Figure 5C is the same as the control group in Figure 3E, which needs to be carefully considered. Also, similar calculations should be made for the F-actin data in Fig 5E-G since throughout the rest of the paper, the authors refer to the width of the "actomyosin gradient" which implicates both myosin and actin.
      3. Line 142 and Figure 2B-C: I was confused by the description of the snail phenotype:
        • a. the claim that in snail mutants actin levels are uniform: based on Figure 2C, I'd say that F-actin levels decrease across the mesoderm moving away from the ventral midline, and that the main issue is with the accumulation of actin in the distal end of the mesoderm. The authors should better justify the claim that F-actin levels are uniform in snail mutants (or remove it). Maybe comparing F-actin levels in the first four or five rows of the mesoderm?
        • b. how about the increase of F-actin in the distal mesoderm, just adjacent to the ectoderm boundary? Why is it gone in snail mutants?
      4. With alpha-catenin-RNAi, F-actin depletion across the mesoderm still occurs, but junctional F-actin levels are not increased around the midline. While some explanations are offered in the text, the reason for this phenotype seems important for the story. The text in lines 204-205 suggests that F-actin that would normally be localized to the apical junctions is instead being drawn into medioapical actomyosin foci. Is this idea supported by evidence that medioapical F-actin in control embryos is lower than in alpha-catenin RNAi?
      5. Figure 6A: there is a correlation between cell position and the productivity of myosin pulses, which the authors attribute to the RhoA gradient. This should be more definitively demonstrated by:
        • a. Plot and calculate the correlation between RhoA levels (measured with the RhoA probe) and the change in cell area caused by a contraction pulse. Is this a significant correlation?
        • b. How does myosin persistence change when RhoA is manipulated, e.g. in RhoA overexpressing embryos or in RhoA RNAi?

      MINOR

      1. The authors should indicate if the myosin shown in Figure 1A is junctional or medioapical. If it is junctional, does medioapical myosin better match junctional F-actin and cell areas? Similarly, if they are showing medioapical myosin, how does junctional myosin compare to junctional actin? It seems to me that consistently comparing the patterns of junctional F-actin and medioapical myosin (and RhoGEF2, RhoA, and ROCK in Figure 4) could be somewhat misleading, as the pools compared localize in different subcellular compartments.
      2. Most of the intensity traces for myosin and F-actin are presented as normalized intensity, relative to the highest intensity in the trace. However, there are claims throughout the text about the relative levels of myosin (ex. Line 241) and F-actin (conclusions based on Fig. 2B-D) that should be supported by quantification. It seems that changes in intensity for both F-actin and myosin, in addition to shape of the gradient, would contribute to the understanding of actomyosin regulation in this tissue. However, if intensities cannot be directly compared between groups due to variation in imaging settings or staining protocols, there should be no claims made about changes in overall F-actin or myosin intensity.
      3. The significance of the correlation in Figure 7E should be quantified.
      4. Supplemental Figure 2: does the segmentation image match the second Z reslice immediately above? It does not appear so, or perhaps they are just not aligned. Having the two match would be more convincing of the segmentation technique.

      Significance

      The authors have conducted an elegant quantitative analysis of the distribution of actin, myosin and several of their regulators across the tissue. The study makes an attempt at integrating a large amount of information into a model of tissue folding, and the concept of mechanical gradients is exciting and still underexplored. I am concerned that the interpretation of some results focuses on specific details but ignores larger scale effects (e.g. potential effects of some of the manipulations on the ectoderm, and the impact that that could have on tissue folding are largely ignored). The statistical analysis of several results should also be improved.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Mesoderm invagination during Drosophila gastrulation has been a paradigm for how regionally restricted gene expression locally activates Rho signalling and for how subsequently activated acto-myosin drives cell shape changes which in turn lead to a change in tissue morphology. Despite the numerous studies on this subject and a good understanding of the overall process, several important aspects have remained elusive so far. Among these is the dynamics of cortical and junctional F-actin and its contribution to the shape changes of cells and tissue. Previous studies have focused on MyoII, the „active" component of the actin cytoskeleton. The dynamics of the „passive" counterpart, namely actin filaments, has been neglected, although it is clear that Rho signalling controls both branches.

      1. Although I clearly acknowledge the efforts taken by the authors to define a function of cortical (junctional) F-actin in apical constriction and furrow formation, several central aspects of the study are not sufficiently resolved and conclusive. Rho signalling controls MyoII via Rok and F-actin via forming/dia, among other less defined targets. The role of MyoII and cortical contraction could be conclusively sorted out, since inhibition of Rok affects the MyoII branch but not the other branches. A similar approach, i. e. a specific inhibition/depletion without affecting the other branch, has not been taken yet for the F-actin branch. The authors have not resolved this issue. When analysing the mutants, the authors cannot distinguish the effect of Rho signalling on the MyoII and F-actin branch. For this reason the changes in F-actin distribution in the mutants are linked to changes in Myo activity and thus a function cannot be assigned to F-actin. In order to derive a specific role of F-actin distribution for furrow formation, the authors need to find experimental ways to affect F-actin levels without affecting MyoII, for example by analysing mutants for dia or other formins.
      2. The authors employ a discontinuous spatial axis by the cell number. Although there are good arguments to understand and treat the cells as units, there are also good arguments for using a scale with absolute distance. I have doubts that the graded distributions presented by the authors are a result of this scaling with cell units. When looking at panel B of Fig 1 or Fig. 2A,B, for example, a sharp step like distribution is visible at the boundary between mesoderm and ectoderm anlage. In contrast a F-actin intensity distribution is graded after quantification. The graded distribution appears not to be a consequence of averaging because an even sharper step is very obvious in a projection along the embryonic axis as shown in panel B and D of Fig. 2, for example. The difference of a sharp step in the images and graded distribution after quantification with a spatial axis in cell number, is obvious for a-catenin in Fig. 3D and Rho signalling in Fig. 4 B. As the authors base their central conclusion (see headline) on the graded distribution, resolving the issue of spatial scale is a prerequisite of publication.
      3. The authors put the spatial distribution of Rho signalling and F-actin into the center of their conclusion. They do so by affecting the pattern with mutants in twist/snail and varying upstream factors of Rho signalling. With respect to myo activation this have been done previously although possibly with less detail and it is no new insight that the width of the mesoderm anlage and corresponding Rho signalling domain has a consequence on the shape of the groove and furrow. To maintain the conclusion of the manuscript that spatially graded Rho signalling is contributes to tissue curvature, more convincing ways to change the pattern of Rho signalling are needed. Changing the balance of GEF and GAP shows the importance of Rho signalling and possibly signalling levels but not the contribution of its spatial distribution.

      Significance

      The question of a contribution of F-actin is addressed in this manuscript. The authors quantify F-actin in fixed and living embryos at two prominent steps in ventral furrow formation, (1) shortly prior to onset of apical constrictions and (2) when the groove has formed. They distinguish junctional and „medial" cortical F-actin. They employ a discontinuous spatial axis, the number of cells away from the ventral midline but not an absolute scale (see my notes below). The measurements are applied to wild type and mutant embryos affecting the transcriptional patterning (twist, snail), adherens junctions, and Rho signalling. The authors claim to reveal by their measurements a graded distribution of F-actin intensities with a peak at the ventral midline and a second peak at the boundary between mesoderm and ectoderm with a low point in the stretching cells of the mesectoderm. The authors further claim to reveal a graded distribution of Rho signalling components within the mesoderm anlage. Based on these data the authors conclude that graded Rho signalling and depletion of F-actin promote tissue curvature.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript entitled 'Combinatorial patterns of graded RhoA activation and uniform F-actin depletion promote tissue curvature' by Denk-Lobnig et al. the authors study the organisation of junctional F-actin during the process of mesoderm invagination during gastrulation in the model Drosophila. Following on from previous work by the same lab that identified and analysed a multicellular myosin II gradient across the mesoderm important for apical constriction and tissue bending, the authors now turn their attention to actin. Using imaging of live and fixed samples, they identify a patterning of F-actin intensity/density at apical junctions that they show is dynamically changing going into mesoderm invagination and is set up by the upstream transcription factors driving this process, Twist and Snail. They go on to show, using genetic perturbations, that both actin and the previously described myosin gradient are downstream of regulation and activation by RhoA, that in turn is controlled by a balance of RhoGEF2 activation and RhoGAP C-GAP inactivation. The authors conclude that the intricate expression patterns of all involved players, that all slightly vary from one another, is what drives the wild-type distinctive cell shape changes in particular rows of cells of the presumptive mesoderm and surrounding epidermis.

      This is a very interesting study analysing complex and large-scale cell and tissue shape changes in the early embryo. Much has been learned over the last decade and more about many of the molecular players and their particular behaviours that drive the process, but how all upstream regulators work together to achieve a coordinated tissue-scale behaviours is still not very well understood, and this study add important insights into this.

      The experiments seem well executed and support the conclusion drawn, but I have a few comments and questions that I feel the authors should address to strengthen their argument.

      General points:

      1. The authors state early on that they chose to focus on junctional rather than apical medial F-actin, but it is unclear to me really what the rationale behind that is. In much of the authors earlier work, they study the very dynamic behaviour of the apical-medial actomyosin that drives the apical cell area reduction in mesodermal cells required for folding. They have previously analysed F-actin in the constricting cells, but have only focused on the most constricting central cell rows (Coravos, J. S., & Martin, A. C. (2016). Developmental Cell, 1-14). The role of junctional F-actin compared to the apical-medial network on which the myosin works to drive constriction is much less clear, it could stabilize overall cell shape or modulate physical malleability or compliance of cells, or it could more actively be involved in implementing the 'ratchet' that needs to engage to stabilise a shrunken apical surface. I would appreciate more explanation or guidance on why the authors chose not to investigate apical-medial F-actin across the whole mesoderm and adjacent ectoderm, but rather focused in junctional F-actin, especially explaining better throughout what they think the role of the junctional F-actin they measure is.
      2. Comparing the F-actin labeling in the above previous paper to the stainings/live images shown here, they look quite different. This is most likely due to the authors here not showing the whole apical area but focusing on junctional, i.e. below the most apical region. It is not completely clear to me from the paper at what level along the apical-basal axis the authors are analysing the junctional F-actin. Supplemental Figure 2 seems to suggest about half-way down the cell, which would be below junctional levels. Could the authors indicate this more clearly, please? Overall, I would appreciate if the authors could supply some more high-resolution images of F-actin from fixed samples (which I assume will give the better resolution) of how F-actin actually looks in the different cells with differing levels. Is there for instance a visible change to F-actin organisation? And could this help explain the observed changes in intensity and their function?
      3. Along the same lines of thought as in point 2): Dehapiot et al. (Dehapiot, B., ... & Lecuit, T. (2020). Assembly of a persistent apical actin network by the formin Frl/Fmnl tunes epithelial cell deformability. Nature Cell Biology, 1-21) have recently shown for the process of germband extension and amnioserosa contraction that two pools of F-actin can be observed, a persistent pool not dependent on Rho[GTP] and a Rho-[GTP] dependent one. Could the authors comment on what they think might occur in the mesoderm, are similar pools present here as well?
      4. As the authoirs state themselves, Rho does not only affect actin via diaphanous, but of course also myosin via Rock. So it would be good to refelect this more in the interpretation and discussion of data, as the causal timeline could be complex.

      More specific comments to experiments and figures:

      1. Reduction of junction function by alpha-catenin-RNAi: how strong is the reduction in catenin? Could they label a-catenin in fixed embryos? The authors conclude the original pre-constriction patterning of F-actin intensity is not dependent on intact junctions, but they show that the increase in F-actin in the mesodermal cells concomitant with apical constriction is in fact impaired in the RNAi. Thus, the authors can also not conclude whether the continued accumulation of myosin and its persistence depend on intact junctions. The initial set-up of the myosin gradient in terms of intensity distribution is unaffected, but clearly dynamics, subcellular pattern, interconnectivity etc. of myosin are affected and thus may well depend on some mechanical feed-back. I find this section of the manuscript slightly overstated and feel the conclusion should be more cautious.
      2. Figure 1 versus Figure 2: Why do the Utrophin-ABD virtual cross-sections look so fuzzy and bad in comparison to phalloidin labelled F-actin in the virtual cross-section in Fig. 1B and C? The labelling shown in 2B and D does not even look very junctional...
      3. Figure 5 C and D: the control gradients for myosin shown in C and D are completely different, for C the half-way height cell row is deduced as 5 whereas the (in theory identical) control measure in D has row 3 at halfway height! Why is this? Putting all curves together in the same panel would suggest that that C control curve is very similar to RhoGEF2-OE! This can't be right.
      4. Still in Figure 5: Panels C and D again, but for apical area: are the control and C-GAP-RNAi or RhoGEF2-OE curves significantly different? What statistics were used on this?
      5. Supplemental Figure 1: Panels in D: I appreciate this control, but would really also like to see the same control at a stage when folding has commenced and stretched cells are present at the margin of the mesoderm. How homogenous does the GAP43 label look in those?
      6. Supplemental Figure 5: Panel 5 B: the authors conclude that the myosin gradient under RhoGEF2 RNAi is not smaller, but looking at the curves it in fact looks wilder. They also mention that the overall level of myosin in this condition is lower than the control...
      7. Following on from the above, a comment of Figure 7:
        • The authors use RhoGEF2 RNAi stating that it affects the actin pattern, but the myosin pattern also seems affected. In line 318 the authors state that they use this condition to look at how junctional actin density affects curvature. I find this phrase misleading as It might lead the readers to think that RHoGEF2 RNAi only affects junctional F-actin, although it also affects myosin patterning.
        • Line 311: confusingly, the authors state that an increase in the actomyosin gradient affects curvature.But it is only the myosin gradient that is increased, while the junctional actin gradient is flatter than the control in both C-GAP RNAi and RhoGEF2 OE (the distinction is even made by authors line 243). Could this be clarified?

      Significance

      Morphogenesis of organs, and how these highly coordinated processes are driven by transcriptional events, local control (of for instance cytoskeletal behaviour), is a major field in developmental and cell biology. Advances over the last decade have led to a much better understanding of the role of myosin (in the form of actomyosin) in defining cell and therefore tissue shape in morphogenesis. The role and control fo actin organisation, that the myosin depends upon for its action, is much less understood. Thus this study will add an important piece of understanding of the basic control of morphogenesis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their enthusiastic support for our work and their insightful comments and suggestions which we believe strengthen the manuscript. Below we detail how we propose to respond to each of the specific points raised by each reviewer.

      Reviewer #1__

      1). It is convincingly shown that adding insulator elements (cHS4) reduces crosstalk between the two PAX6 CREs tested (Fig. 3). However, it is unclear if this approach will work for other CREs. This point should be discussed, and perhaps the authors could give some troubleshooting advice (e.g. adding more insulators or trying different insulator elements?).

      We will include these suggestions in the discussion and describe some ongoing efforts to characterise another insulator element in our assay.

      2). All CREs used in proof-of-concept experiments in this work have well known activities in zebrafish embryos. A new/uncharacterized CRE has not been tested yet using this system. It is unclear from the workflow (Fig. 1B) what happens if the CRE does not drive detectable levels of EGFP/mCherry. How does one determine whether lack of reporter expression is due to technical problem (with the transgene or phiC31 integration) or that the CRE is not active in zebrafish? Perhaps adding a PCR-based genotyping step could address this potential problem?

      We will include a PCR-based genotyping assay in the description of the assay pipeline and discuss its utility in assessing successful integration events as suggested by the reviewer.

      3). Other limitations of the system should also be discussed. For example, the system appears to be useful for identifying variant CREs that result in a change (either loss or gain) of temporal or spatial activity, but it is not clear how subtle changes in expression level (either slightly increased or decreased) would be identified or quantified. Perhaps other approaches could be used in combination with this system to fully analyze mutant CRE activity. Another limitation is that this approach is only be applicable to CREs that are active in the first few days of zebrafish embryonic development.

      We will include these suggestions in the discussion and clearly address the limitations of the system

      **Minor points:**

      i) Although it is discussed in the previous work published in PLoS Genetics, it is probably worth mentioning here why the gata2 minimal promoter was chosen for the reporter system.

      The choice of the gata2 promoter in our constructs was based on previously published work from our group. We will re-iterate and reference these studies in the workflow description.

      1. ii) It would be helpful if the cSH4 element is briefly described (e.g. "insulator element") in Fig.1 legend. We will modify the figure legend according to the suggestion.

      3). It is not clear from the manuscript whether the new reagents reported here-including dual reporter vectors and transgenic attB landing site zebrafish strains-will be made available to the scientific community, or how these reagents would be distributed.

      We would include a section describing our plans for distribution of reagents and tools described in the manuscript. All the vectors would be deposited in Addgene for distribution and all the zebrafish lines would be openly shared with the scientific community.

      Reviewer #2:

      1. The dual reporter system uses EGFP and mCherry to report the activities of two different CREs in the same animal. However, EGFP and mCherry have drastically different fluorescence properties which have not been measured particularly well in vivo and especially not in zebrafish. They have different maturation times (mCherry is much quicker). Both are quite stable in vivo, but mCherry is particularly stable in cell culture and in vivo, even resisting lysosomal degradation (EGFP does not - it is acid and protease sensitive) (Katayama et al., 2008; McWilliams et al., 2016). Often, promoter activity assays in zebrafish employ short lived "destabilized" FPs, such as destabilized GFP and destabilized dsRed. With stable FPs, false positives could be reported due to the fluorescent signal remaining for a long period of time after promoter activity has ceased. Replacing the traditional FPs with destabilized versions could be one way to improve the temporal resolution of this assay. This is probably not necessary to do in the present study but might be a worthy future direction.

      We would discuss these points in the possible limitations of our assay and will also endeavour to incorporate these suggestions in future versions of our assays.

      However, no matter which pair of FPs is chosen, there will be differences in signal intensity/brightness and decay rate. Thus, the FP swap experiments should be employed for any experiment claiming a temporal (Fig. 4) or quantitative (Fig. 5) difference between CRE activation or deactivation. If the EGFP/mCherry swap experiments show the same results, the confidence in the assay will be significantly bolstered.

      We estimate the proposed experiments to take about 4 months to allow for molecular cloning of the FP swapped constructs, injection into the "landing" strain, raising to sexual maturity (2.5 mo), screening for founders, and performing the imaging. These are the only two suggested experiments I would need to feel confident in the results and to recommend publication

      We appreciate the reviewer’s suggestion but would point out that we included dye-swaps for the PAX6-CREs described in Figure 3 in this manuscript. The dye-swap experiment for SBE2WT/SBE2Mut were described in our previous work published in Plos Genetics. However, to increase the confidence of the readers in our current system we would include the other suggested dye swaps in the revised version of our manuscript.

      Reviewer #3:

      **Major comments**

      1. First, given the importance of quality landing lines for the methodology, I would like to see more clarity and emphasis on validation of the Shh-SBE2 landing pad in the main text. Based on supplemental tables 1 and 2, this reviewer is somewhat unclear on whether there is one or three lines with Shh-SBE2 based landing pads (one site is mentioned in table 1, but table 3 mentions three F0 lines, and the text is ambiguous). The authors also state that the Shh-SBE2 landing pad is a single copy integration, but the data supporting this conclusion does not appear to be included (linker mediated PCR does not rule out other integrations).

      We will provide a detailed description of the landing lines addressing all the concerns raised by the reviewer.

      It would also be useful to have more clear numbers indicating the reproducibility of the expression pattern in F1 animals. Do 100% of F1 progeny from multiple crosses show the integration show the expression pattern in image 2 A? If there is variability how much, and how many fish were examined? This reviewer also wonders whether appropriate expression of Shh-SBE2 in this landing site is enough to call it neutral. For example, perhaps position effects might be observed with a different weaker CRE in this site? Better documentation will allow for more widespread and appropriate use of the landing pad.

      We will expand the description for the part of the pipeline the reviewer is referring to, providing the details of transgene segregation.

      Similar concerns apply to the integration of test constructs. To evaluate the practicality of the approach, it would be useful to have numbers reporting the frequency of recovering F1 individuals with PhiC mediated integration of the reporter into the desired landing site. It is also important to provide better documentation of the degree of reproducibility in expression patterns between F1 progeny. Numbers of embryos imaged and fraction with the indicated expression pattern are needed for all data in the main text. At minimum, gross expression patterns should be examined in at least 10 F1 larvae. If there is variability between individuals, some image documentation of this in supplementary data would be welcome.

      We will include the suggested information in the results and provide the supplementary data as suggested by the reviewer.

      **Minor comments:**

      i) For figure 1, it may be clearer to present generation of the landing pad lines and screening of CRES using these lines in separated figure panels (B) for generation of landing pads, and (C) for CRE analysis.

      We will modify figure 1 as suggested.

      ii) Landing pads that were less effective might also be moved out of figure 2, to the supplemental material to help improve clarity and to allow for focus on the tools with the most utility

      We will modify figure 2 as suggested.

      iii) Scale bars should be included in all images,

      This will be done for all the images

      iv) In some cases, image labeling somewhat obscures the relevant features

      We will rectify this in the revised version

      v) To help evaluate consistency, in all relevant figures (4, 5, sup fig 3 ect) the number of embryos examined should be included in the legend.

      We will modify the figure legends to include this information

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Bhatia addresses a longstanding need for rigorous methods to directly compare the effectiveness of cis-regulatory elements (CREs) during vertebrate embryogenesis. The manuscript describes a method for simultaneous quantitative assessment of the spatial and temporal activity of wild-type and mutant CREs using live imaging in zebrafish embryos. The approach takes advantage of a predefined neutral docking site, and dual-CRE reporter cassette that can be integrated into this site using PhiC31. Using this method, the authors demonstrate subtle differences in the spatial and temporal dynamics of two shh CREs that have been previously reported to have similar domains of activity, and they demonstrate changes in CRE activity in embryos harboring a disease specific mutation in the SBE2 CRE.

      Major comments

      Overall this manuscript describes a valuable tool and key conclusions regarding its need and utility convincing. However, some additional documentation of methods and key reagents, and numbers would be of value.

      First, given the importance of quality landing lines for the methodology, I would like to see more clarity and emphasis on validation of the Shh-SBE2 landing pad in the main text. Based on supplemental tables 1 and 2, this reviewer is somewhat unclear on whether there is one or three lines with Shh-SBE2 based landing pads (one site is mentioned in table 1, but table 3 mentions three F0 lines, and the text is ambiguous). The authors also state that the Shh-SBE2 landing pad is a single copy integration, but the data supporting this conclusion does not appear to be included (linker mediated PCR does not rule out other integrations). It would also be useful to have more clear numbers indicating the reproducibility of the expression pattern in F1 animals. Do 100% of F1 progeny from multiple crosses show the integration show the expression pattern in image 2 A? If there is variability how much, and how many fish were examined? This reviewer also wonders whether appropriate expression of Shh-SBE2 in this landing site is enough to call it neutral. For example, perhaps position effects might be observed with a different weaker CRE in this site? Better documentation will allow for more widespread and appropriate use of the landing pad.

      Similar concerns apply to the integration of test constructs. To evaluate the practicality of the approach, it would be useful to have numbers reporting the frequency of recovering F1 individuals with PhiC mediated integration of the reporter into the desired landing site. It is also important to provide better documentation of the degree of reproducibility in expression patterns between F1 progeny. Numbers of embryos imaged and fraction with the indicated expression pattern are needed for all data in the main text. At minimum, gross expression patterns should be examined in at least 10 F1 larvae. If there is variability between individuals, some image documentation of this in supplementary data would be welcome.

      Presumably nearly all of this data has already been collected during validation of the tools and just isn't reported clearly, so these updates would not require significant time or cost.

      Minor comments:

      With respect to clarity, while the authors do an excellent job of explaining the rational for their system, the details of execution in the manuscript can be difficult to follow at times, below are minor suggestions to help the reader follow more easily.

      For figure 1, it may be clearer to present generation of the landing pad lines and screening of CRES using these lines in separated figure panels (B) for generation of landing pads, and (C) for CRE analysis.

      Landing pads that were less effective might also be moved out of figure 2, to the supplemental material to help improve clarity and to allow for focus on the tools with the most utility

      Scale bars should be included in all images,

      In some cases, image labeling somewhat obscures the relevant features

      To help evaluate consistency, in all relevant figures (4, 5, sup fig 3 ect) the number of embryos examined should be included in the legend.

      Significance

      This manuscript is significant as if provides useful tools for direct comparison of CRE activity in stable transgenic embryos, where two CREs are integrated into a single genomic location. The method offers an advance in efficiency and rigor compared to past approaches. As a zebrafish researcher, it is easy to recognize the value of having a transgenic line with a validated neutral landing site for transgene analysis, and having a well-designed construct for detailed in vivo comparison of CRE activity.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study presents a dual fluorescent protein (FP) reporter system to determine differential activities of Cis regulator elements (CREs) on transcription factor behavior in an in vivo setting. The strategy uses the PhiC31 system to ensure single copy insertion into a consistent genomic locus and is an important improvement over the authors' previous work using a similar system with random genomic integration and separated FP constructs. Because different genomic loci are more accessible than others, comparing the activities of randomly inserted CREs cannot be quantitative and requires generation and comparison of multiple lines for each CRE to validate. The bulk of this study is validation of the new specifically inserted, dual FP system including showing that including insulator sequences between the CREs of interest is necessary to prevent crosstalk. The last two figures demonstrate the utility of the system to interrogate spatial and temporal regulation of CRE variants and the quantitative expression levels of a mutant and WT CRE pair. This is an exciting tool with clear potential to uniquely compare CRE activities in vivo, and the results are clearly presented. However, given that the impact of this study is as a technical improvement over previous methods and that it is aimed to demonstrate the robustness and utility of the reporter system, additional controls are necessary to demonstrate that FP choice does not influence the temporal or quantitative readouts.

      The dual reporter system uses EGFP and mCherry to report the activities of two different CREs in the same animal. However, EGFP and mCherry have drastically different fluorescence properties which have not been measured particularly well in vivo and especially not in zebrafish. They have different maturation times (mCherry is much quicker). Both are quite stable in vivo, but mCherry is particularly stable in cell culture and in vivo, even resisting lysosomal degradation (EGFP does not - it is acid and protease sensitive) (Katayama et al., 2008; McWilliams et al., 2016). Often, promoter activity assays in zebrafish employ short lived "destabilized" FPs, such as destabilized GFP and destabilized dsRed. With stable FPs, false positives could be reported due to the fluorescent signal remaining for a long period of time after promoter activity has ceased. Replacing the traditional FPs with destabilized versions could be one way to improve the temporal resolution of this assay. This is probably not necessary to do in the present study but might be a worthy future direction. However, no matter which pair of FPs is chosen, there will be differences in signal intensity/brightness and decay rate. Thus, the FP swap experiments should be employed for any experiment claiming a temporal (Fig. 4) or quantitative (Fig. 5) difference between CRE activation or deactivation. If the EGFP/mCherry swap experiments show the same results, the confidence in the assay will be significantly bolstered.

      We estimate the proposed experiments to take about 4 months to allow for molecular cloning of the FP swapped constructs, injection into the "landing" strain, raising to sexual maturity (2.5 mo), screening for founders, and performing the imaging. These are the only two suggested experiments I would need to feel confident in the results and to recommend publication.

      Significance

      The impact of this study is as a technical improvement over previous methods and is aimed to demonstrate the robustness and utility of the reporter system.

      The manuscript is geared towards zebrafish experts with an interest in the imaging of intracellular and transcriptional processes.

      Our laboratory has expertise in zebrafish developmental genetics and live imaging of reporters.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a technical manuscript that describes a new transgenic reporter system in zebrafish that is designed to simultaneously test the activity of two cis-regulatory elements (CREs) in the same living embryo. This is an extension of previous work from the authors that established methods to compare two CREs in transgenic zebrafish embryos (published in PLoS Genetics; DOI: 10.1371/journal.pgen.1005193). Here, to address the problem of position effects caused by random transgene integration, the authors have created a dual reporter transgene that can be integrated into a specific neutral site (using phiC31 recombination) in the zebrafish genome. Expression of different fluorescent proteins (EGFP and mCherry) are regulated by two CREs of interest in the zebrafish embryo, which allows visualization of the temporal and spatial activity of the CREs in real time during embryonic development. The authors propose this system could be used to directly compare wild-type and mutant CREs, and then provide several lives of evidence that establish proof-of-concept. Overall, the results are clearly presented, and the conclusions are convincing. The description of methods (including supplemental tables) is extensive, which will facilitate reproducibility. The manuscript is succinct, and describes a useful approach to characterize CREs. However, I have a few points for the authors to consider:

      Major points:

      1)It is convincingly shown that adding insulator elements (cHS4) reduces crosstalk between the two PAX6 CREs tested (Fig. 3). However, it is unclear if this approach will work for other CREs. This point should be discussed, and perhaps the authors could give some troubleshooting advice (e.g. adding more insulators or trying different insulator elements?).

      2)All CREs used in proof-of-concept experiments in this work have well known activities in zebrafish embryos. A new/uncharacterized CRE has not been tested yet using this system. It is unclear from the workflow (Fig. 1B) what happens if the CRE does not drive detectable levels of EGFP/mCherry. How does one determine whether lack of reporter expression is due to technical problem (with the transgene or phiC31 integration) or that the CRE is not active in zebrafish? Perhaps adding a PCR-based genotyping step could address this potential problem?

      3)Other limitations of the system should also be discussed. For example, the system appears to be useful for identifying variant CREs that result in a change (either loss or gain) of temporal or spatial activity, but it is not clear how subtle changes in expression level (either slightly increased or decreased) would be identified or quantified. Perhaps other approaches could be used in combination with this system to fully analyze mutant CRE activity. Another limitation is that this approach is only be applicable to CREs that are active in the first few days of zebrafish embryonic development.

      Minor points:

      1)Although it is discussed in the previous work published in PLoS Genetics, it is probably worth mentioning here why the gata2 minimal promoter was chosen for the reporter system.

      2)It would be helpful if the cSH4 element is briefly described (e.g. "insulator element") in Fig.1 legend.

      3)It is not clear from the manuscript whether the new reagents reported here-including dual reporter vectors and transgenic attB landing site zebrafish strains-will be made available to the scientific community, or how these reagents would be distributed.

      Significance

      This work introduces a new method to analyze cis-regulatory element (CRE) activity in vivo. By generating transgenic zebrafish with a neutral phiC31 landing site for reporter transgene integration, this work improves on previous methods by overcoming the problem of position effects caused by random transgene integration. This will be useful approach to characterize CREs during embryonic development, and variant CREs associated with human disease. This paper will be of interest to developmental biologists, and geneticists trying to understand CRE activity. I have expertise in zebrafish genetics, with extensive experience using Tol2 transgenesis, and some experience using phiC31 recombination. The described experimental approach here is straightforward, and will be easy to apply in labs with experience in zebrafish transgenesis, and imaging fluorescent protein expression in embryos.

  5. Nov 2020
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer__ #1 (Evidence, reproducibility and clarity (Required)):__ Septins are highly conserved small GTPase cytoskeletal proteins that function as molecular scaffolds for dynamic cell wall and plasma membrane-remodeling, as well as diffusion barriers restricting movement of membrane and cell wall-associated molecules. Recent work has started to unravel the functional connections between the septins, cell wall integrity MAPK pathway signaling, and lipid metabolism, however most studies have focused on a small sub-set of septin monomers and/or were conducted in primarily yeast-type fungi. Here the authors show in the filamentous fungus A. nidulans that the core hexamer septins are required for proper coordination of the cell wall integrity pathway, that all septins are involved in lipid metabolism. Especially sphingolipid, but not sterols and phosphoinositides, contributes to the localization and stability of core septins at the plasma membrane. The experiments are simple and clear, therefore the conclusion is convincing. Fig.8 model, I would like to see the situation of septin mutant.

      We thank the reviewer for the positive comments. In response to the request from this reviewer and a similar one from reviewer 2 for more on the effect of the loss of individual septins, we added text clarifying the roles of core hexamer, core octamer and noncore septins throughout the manuscript including in the legend to Fig 8 (li 439-444) and the discussion (li 388-402). Please see responses to reviewer 2 comments for more detail.

      Reviewer #1 (Significance (Required)):

      Since localization of cell wall synthesis proteins, lipid domains and septins are likely to depend on each other, sometimes difficult to evaluate the effect is direct or indirect. The comprehensive analyses like performed here are helpful to catch the overview in the field.

      Reviewer__ #2 (Evidence, reproducibility and clarity (Required)):__ **Summary** The study by Mela and Momany describes the function of core septins of A. nidulans and links with the requirement of the cell wall integrity pathway and the sphingolipids which, are required for membrane and cell wall stability. The study is of interest for the fungal genetics community, and the authors have conducted a substantial amount of work in a field they have substantial experience. However, one of the main weaknesses of the manuscript is the assumption whether the CWI pathway controls de septin function of if the core septins control it.

      We agree that while our data clearly indicate interactions between the septins and the CWI pathway, which component controls the other is not clear. We have modified the text to address this concern in several places as detailed in responses to the reviewer’s specific comments below.

      **Major comments** In the abstract, the authors claim that double mutant analysis suggested core septins function downstream of the final kinase of the cell wall integrity pathway. However, from the experiments showed, it is difficult to be convinced about that. The authors should make efforts do make it clear in the manuscript and the discussion. For example: -Line 25-26 (abstract): "Double mutant analysis suggested core septins function downstream of the final kinase of the cell wall integrity pathway."

      We agree that while the double mutant analysis shows interaction of septins with the CWI pathway, the evidence for them being downstream is not strong. We have revised the abstract as follows:

      Li29-30: Double mutant analysis with Δ**mpkA suggested core septins interact with the cell wall integrity pathway.”

      -Line 181-182; 219-220 (results) "Double mutant analyses suggest core septins modulate the cell wall integrity pathway downstream of the kinase cascade." This conclusion is one of the most important of the manuscript. However, this reviewer argues that it cannot be convincingly addressed if at least the phosphorylation ok the MAP kinase MpkA in the septins background is not evaluated under conditions of cell stress and sphingolipid biosynthesis inhibition. The genetic analysis alone maybe not enough to infer if septins control the CWI or the other way around. There may have compensatory effects when the CWI pathway is impaired. For example, most of the septins and mpkA double mutants seems to suppress the defect of the delta mpkA under cell wall stress. The authors should consider this idea.

      Although we discuss the epistasis experiments as one possible interpretation, we agree the genetic analysis is not enough to definitively show that the septins are upstream of the CWI pathway or the other way around. The suppression of cell wall defects by deletion of septins in a mpkA null mutant background under cell wall stress suggests a bypass of the CWI pathway for remediation of the cell wall or some other alternate regulatory node. One possible interpretation of these data could be that by inactivation of normal CWI integrity function through deletion of the final kinase, in addition to deletion of septins (possibly acting as negative regulators of CWI components), there may be a parallel node by which cell wall remediation could still occur.

      Wording throughout the abstract, results, and discussion has been modified accordingly.

      Li 29-30: Double mutant analysis with Δ**mpkA suggested core septins interact with the cell wall integrity pathway.

      Li 208-209: Double mutant analyses suggest the core septin aspB cdc3 modulates the cell wall integrity pathway in the ∆mpkA background under cell wall stress.

      Li 221-225: When challenged with low concentrations of CASP and CFW, the ∆aspBcdc3**∆mpkAslt2 and ∆aspE ∆mpkA slt2 mutants were more sensitive than ∆aspBcdc3 and ∆aspE single mutants, but suppressed the colony growth defects of ∆mpkA slt2. The novel phenotype of the double mutants shows that septins are involved in cell wall integrity and raises the possibility that they act in a bypass or parallel node for remediation of cell wall defects (Fig 4).

      Li 227-228: Fig 4. Double mutant analyses suggest core septins modulate the cell wall integrity pathway.

      Li 464-468: Double mutant analyses between septins and CWI pathway kinases also support a role for core septins in maintaining cell wall integrity under stress (Fig 4). Suppression of cell wall defects under cell wall stress by deletion of septins in an ∆mpkA slt2 background suggests a parallel node by which septins negatively regulate cell wall integrity pathway sensors or kinases could exist.

      There is no clear evidences on the manuscript that the core septins AspA, AspB, AspC, and ApsD are epithastic in A. nidulans. Therefore, the authors choice of using different Asp deletion mutants as a proxy for all the septins mutants is questionable. For example, there is no mention of why AspB was chosen for Figure 2 (chitin and β-1,3-glucan deposition), and AspA was chosen for Figure 3 (chitin synthase localization) since these experiments are correlated. The same is true for Figure S1 where AspB and AspE were used. One can wonder if some of the core septins would have a major impact in the chitin content.

      We agree with the reviewer that not all four core septins are equivalent. Previously published work from our lab shows that AspACdc11, AspBCdc3, AspCCdc12, and AspDCdc10 form octamers and that AspACdc11, AspBCdc3, and AspCCdc12 form hexamers, that both of these heteropolymers co-exist, and that the noncore septin AspE is not part of either core heteropolymer, though it appears to influence them possibly through brief interactions (Lindsay et al., 2010; Hernandez-Rodriguez et al., 2012; Hernandez-Rodriguez et al., 2014). This previous work also clearly shows that strains in which the hexameric septins have been deleted (ΔaspA, ΔaspB, and ΔaspC) have very similar phenotypes while strains in which the octamer-exclusive septin has been deleted (ΔaspD) have different phenotypes.

      In our attempt to simplify the current manuscript we discussed the four core septins as a group. In retrospect this caused us to miss important distinctions on the roles of hexamer vs octamer septins and we are grateful to the reviewer for pointing this out. We have modified language throughout the revised manuscript to specify whether results and interpretations apply to core hexamer septins, core octamer septins, the noncore septin, or individual septins. This more detailed analysis has given us several new ideas to test in future work.

      While we cannot exclude the possibility that interesting results might be produced by analyzing null alleles of each individual septin gene for all experiments, we agree with the cross-reference by Reviewer #3 that there is a very low likelihood that we would see different results by analyzing all individual septins within each subgroup (hexamer, octamer or noncore).

      To the reviewer’s questions on choice of septins for Fig 2, Fig 3, and Fig S1:

      ΔaspA, ΔaspB, and ΔaspC showed similar sensitivity to cell wall-disturbing agents in the plate-based assays in Fig 1 and are all part of the core hexamer. We have modified text including the figure legends to make it clear which septins were used in the experiments and which group they belong to.

      In a related comment about Figure 3, the reallocation of chitin synthases in the absence of septins is very interesting, but consider that all the core septin genes should be tested. Without a fully functioning cell wall, the formation of septa will be impaired. It makes their results less surprising.

      In the case of Fig 3, we were unable to recover ChsB-GFP in the ΔaspB or ΔaspC backgrounds but were able to recover it in the ΔaspA background. We have clarified as follows:

      Li184-187: To determine the localization of synthases, a chitin synthase B-GFP (chsB-GFP) strain was crossed with strains in which core hexamer septins were deleted. After repeated attempts, the only successful cross was with core hexamer deletion strain ∆aspA cdc11.

      Figure 3, Panels A and B, chitin was also labeled by Calcofluor White which clearly shows that the formation of septa was not impaired even in the septin null mutant background (this is in agreement with previous work form our lab which shows that septa still forms in individual septin null mutants). The results showed that unlike WT cells, chitin synthase is not only absent in most branch tips in the septin null mutant background, but seems to be limited primarily to longer (presumably actively growing/non-aborted) branches; these findings were surprising to us, considering other major cell wall synthesis events, such as targeting of cell wall synthases to septa during septation appeared to be unimpaired (based on the presence of fully-developed, chitin-labeled septa).

      The labeling of septa by calcofluor is now noted in the legend to Figure 3 as follows:

      Li 201: Calcofluor White labeling shows the presence of the polymer chitin at septa, main hyphal tips, branches, and …

      Why was chitin synthase B chosen to be analyzed in terms of reallocation? How many chitin synthases are in the A. nidulans genome. This rationale should be explained in the manuscript.

      We have added the following:

      Lines 173-182: A. nidulans contains six genes for chitin synthases: chsA, chsB, chsC, chsD, csmA, and csmB. Chitin synthase B localizes to sites of polarized growth in hyphal tips, as well as developing septa in vegetative hyphae and conidiophores, a pattern very similar to septin localization. Deletion of chitin synthase B shows severe defects in most filamentous fungi analyzed thus far, and repression of the chitin synthase b gene expression in chsA, chsC, and chsD double mutants exacerbated growth defects from a number of developmental states observed in each single mutant, suggesting it plays a major role in chitin synthesis at most growth stages (Fukuda et al., 2009). For these reasons, we chose chitin synthase B as a candidate to observe in septin mutant background for possible defects in localization.

      Figure 3 and Figure 4. The authors should make efforts to quantify the phonotypes they claim. They are overall very subtle, especially for Figure 3. Also, a decrease of fluorescence is a tricky observation that should be better reported by quantification.

      Line scans of aniline blue and CFW label were conducted and added as Fig S1. Quantitation was performed and added as Fig S3. See author’s response to Reviewer #3 below for details.

      Again, in Figures 5, 6, and 7, it is clear that the different septins respond differently when ergosterol or sphingolipids synthesis is impaired. It also raises the question again if there are differences in the role of septin genes. Can the authors use previous information about differences in septin function to improve the model (Figure 8)

      As described above, we have modified the manuscript throughout to clarify which phenotypes are seen for core hexamer, core octamer, and noncore septin deletions. As the reviewer notes, these are especially relevant for the sphingolipid-disrupting agents. Our model includes interaction of septins with sterol rich domains that contain both sphingolipids and ergosterol. Because it is not yet clear how subgroups of septins interact with each other and are organized at SRDs, we show all core septins in our model without distinguishing hexamers and octamers in the drawing, but we have now added text to clarify roles and outstanding questions.

      The changes are summarized in the abstract as follows:

      Li 37-40: Our data suggest that the core hexamer and octamer septins are involved in cell wall integrity signaling with the noncore septin playing a minor role; that all five septins are involved in monitoring ergosterol metabolism; that the hexamer septins are required for sphingolipid metabolism; and that septins require sphingolipids to coordinate the cell wall integrity response.

      The clarifications are reflected in the Figure 8 legend (and associated sections of the discussion) as follows:

      Li 436-441: As described in the text, our data suggest that all five septins are involved in cell wall and membrane integrity coordination. The core septins that participate in hexamers appear to be most important for sphingolipid metabolism while all septins appear to be involved in ergosterol metabolism and cell wall integrity. Because SRDs contain both sphingolipids and ergosterol and because it is not yet clear how subgroups of septins interact with each other at SRDs, we show all core septins in our model without distinguishing hexamers and octamers.

      For the above-discussed reasons, the conclusion on lines 384-388 (discussion) is not completely supported by the experiments shown in the manuscript. The authors need to make a better structured and more straightforward story emphasizing the stronger points and reducing descriptions of more speculative points.

      As discussed above, we have made changes throughout the manuscript to clarify which subgroups of septins are involved in which process and to refine our conclusions accordingly. The beginning of the discussion section has been changed as follows:

      Li 384-399: Our data show that A. nidulans septins play roles in both plasma membrane and cell wall integrity and that distinct subgroups of septins carry out these roles. Previous work has shown that the five septins of A. nidulans septins form hexamers (AspACdc11, AspBCdc3, and AspCCdc12) and octamers (AspACdc11, AspBCdc3, AspCCdc12, and AspDCdc10) and that the noncore septin AspE does not appear to be a stable member of a heteropolymer (20). The current work suggests that though all septins are involved in coordinating cell wall and membrane integrity, the roles of hexamers, octamers, and the noncore septin are somewhat different. Core hexamer septins appear to be most important for sphingolipid metabolism, all five septins appear to be involved in ergosterol metabolism, and core septins are most important for cell wall integrity pathway with the noncore septin possibly playing a minor role. As summarized in Figure 8 and discussed in more detail below, our previous and current data are consistent with a model in which: (A) All five septins assemble at sites of membrane and cell wall remodeling in a sphingolipid-dependent process; (B) All five septins recruit and/or scaffold ergosterol and the core hexamer septins recruit and/or scaffold sphingolipids and associated sensors at these sites, triggering changes in lipid metabolism; and (C) The core septins recruit and/or scaffold cell wall integrity machinery to the proper locations and trigger changes in cell wall synthesis. The noncore septin might play a minor role in this process.

      Minor comments Overall the figure caption could be shortened. They are too descriptive and contain details that are easily inferred for the images and from the materials and methods.

      Legends to the following figures have been streamlined by removing portions that belong in the methods: Figure 2, Fig 3, and Fig 6

      The authors made every effort to cove the precedent literature, but the manuscript has 115 references. The authors should evaluate if all the cited literature is extremely relevant. The manuscript would benefit for that conciseness.

      Because this manuscript addresses septins, ergosterol, sphingolipids, cell wall integrity, and multiple different pathways, there is a lot of literature underlying our approaches. Our strong preference is to cite primary literature, however we can shorten our reference list by relying on reviews if requested by the journal.

      Line 124, 493: Replace 10ˆ7, 10ˆ4 to 107, 104, etc

      “10^7” and all other scientific notation was altered to replace carrots “^7” with superscripts “7” throughout.

      The use of fludioxonil as a probe to detect cell wall impairment is perhaps out of context. This drug responds primarily to the HOG pathway and also respond to oxidative damage. So, these results could be suppressed.

      Previous work by Kojima et al., 2006 showed that in addition to the HOG pathway, cell wall integrity is required for resistance to fludioxonil treatment. C. neoformans cell wall integrity mutants bck1, mkk1, and mpk1 (Aspergillus nidulans bckA, mkkA, and mpkA homologues) all exhibit hypersensitivity to fludioxonil, and this was shown to be remediated by the addition of osmotic stabilizers, suggesting cell wall impairment was involved in the growth defect produced by this treatment. Although this drug seems to act primarily through the HOG pathway, the CWI and HOG pathways have been shown to antagonize/negatively regulate one another through a parallel pathway (SVG pathway in yeast) (Lee and Elion, 1999). It has been hypothesized that internal accumulation of glycerol by constitutive activation of the HOG pathway causes decreased cell wall integrity. Due to the apparent cross-pathway control between the HOG and CWI pathways, as well as the high level of conservation of these pathway components in filamentous fungi, we thought this treatment was rightfully dual-purposed to investigate both cell wall impairment in the septin mutants and any possible involvement of the HOG pathway. This seems to be would a reasonable drug treatment to look at cell wall impairment that is not likely to be redundant with the modes of action observed in the other Figure 1 treatments (e.g. CFW, Congo Red, and Caspofungin).

      The text clarifies this point as follows: li 110-112: Fludioxonil (FLU), a phenylpyrrol fungicide that antagonizes the group III histidine kinase in the osmosensing pathway and consequently affects cell wall integrity pathway signaling (Fig 1)(58-67).

      Line 140: "exposure" would be more appropriate than architecture. Please also consider that the difference in the cell wall reported in Figure S1 are very subtle. Are they relevant?

      The differences in the cell wall content reported in Figure S1 (Figure S2 in the revised manuscript) showed that the peak for 4-Glc was almost identical in WT and aspB null mutant, however the overall ratio of peaks switched, where 4-GlcNac content exceeded the 4-Glc content in the mutant compared to WT. By comparison, this was not the case with the septin aspE null mutant. Although this could be considered a ‘subtle’ change in chitin content, we believe this was an important unbiased analysis of the cell wall polysaccharide content and addressed some of the cell wall sensitivity phenotypes we observed, not only between WT and the septin mutants, but also between the septin null mutants which showed sensitivity to cell wall disturbing agents (i.e. aspA, aspB, and aspC) vs. those that did not show significant sensitivity (e.g. aspE). For these reasons we believe this warranted at the very least a supplemental figure for these data.

      Though our idea of cell wall architecture includes changes in polymer exposure, as pointed out by the reviewer, others might use the phrase to mean only content changes. To avoid this misunderstanding, we have replaced the word “architecture” with “organization” in Li 147-148: These data show that cell wall organization is altered in ∆aspB cdc3 and raise the possibility that it might be altered in other core hexamer septin null mutants as well.

      Line 144: explain briefly what it is about and why it was chosen instead of the total detection of chitin sugar monomers. Line 538: Cell wall extraction section. Is this a new method? There is no supporting literature.

      We chose this method because it provides an analysis of all cell wall polysaccharide components and associated linkages. Detection of chitin sugar monomers would have also been a reasonable analysis if this were the only component of the cell wall we were investigating initially. The results showed differences in cell wall chitin content, so these were the data we presented.

      This was addressed on lines 574-576: “Cell walls were isolated from a protocol based on (Bull, 1970); cell wall extraction and lyophilization were conducted as previously described in (Guest and Momany, 2000) with slight modifications listed in full procedure below.”

      The results described on lines 232-257 are marginal to the study and are not exploited by the authors to address the central question of the manuscript, which is the role of the CWI pathway, septins, and sphingolipids. This section could be suppressed or very briefly mentioned in the preceding section.

      We agree that these data did not show any additional involvement of septins in the Calcineurin and cAMP-PKA pathways, and the relevance of the TOR signaling pathway connection is still quite unclear. For this reason, these data were added as a supplemental figure. On the other hand, there are a number of important signaling pathways which have been shown to affect the Cell Wall Integrity pathway directly and indirectly (these three pathways in particular), which is part of the central question of the manuscript. Considering such extensive ‘cross-talk’ between pathways (references produced on Line 65) in filamentous fungi, we felt it necessary to inspect possible involvement of these pathways in septin function via plate-based assays and feel that this s most clearly communicated as its own brief section in the text.

      Reviewer #2 (Significance (Required)): The topic of the manuscript is highly relevant to the fungal biology field and employs a very important genetic model. The cooperation of signaling pathways in mains aspects of fungal physiology is the main significant contribution of this manuscript. Reviewer__ #3 (Evidence, reproducibility and clarity (Required)):__ **Summary:** In this work the authors use genetic analysis in Aspergillus nidulans to identify phenotypes of septin mutants that point to roles for septins in coordinating the cell wall integrity pathway with lipid metabolism in a manner involving sphingolipids. Most of the major conclusions derive from monitoring the effects of combined genetic or chemical manipulations that target specific components of the pathways of interest. Additionally, the authors monitor the subcellular localization of septins, cell-wall modifying enzymes, and components of the cell wall itself. **Major comments:** The key conclusions are convincing, with the unavoidable caveat that null mutations of this sort and chemical inhibitors of these kinds could have unanticipated effects, such as upregulation of unexpected pathways or other compensatory alterations. The authors qualify their conclusions appropriately in this regard. The methods are explained very clearly and the data are presented appropriately. In some cases results are shown as representative images illustrating altered localization of a protein or a cell wall component. The changes observed in the experimental conditions are fairly obvious, but some quantification would not be difficult and would likely make the results even more obvious. For example, the Calcofluor White staining patterns might be nicely quantified by linescans along the hyphal length, and the same is true for AspB-GFP localization upon addition of drugs.

      We thank the reviewer for the positive comments and have made the suggested changes as follows:

      Line scans of aniline blue and CFW label were conducted and added as Fig S1. Text has been modified accordingly (Li 140-147).

      Quantification of Chitin synthase-GFP localization and CFW staining and statistical analysis have now been added as Figure S3 and main text (Li 187-191) has been modified accordingly.

      I could imagine one simple experiment that might generate interesting and relevant results, but by no means would this be a critical experiment for this study. In yeast, exposure to Calcofluor triggers increased chitin deposition in the wall. It would be interesting to know how Calcofluor staining looks in WT or septin-mutant cells that have been growing the presence of Calcofluor for some time, particularly with regard to the localization of chitin deposition in these cells. Such experiments could help connect the idea of septins as sensors of membrane lipid status and also effectors of CWI signaling.

      This is a cool idea that we will pursue in future work. Thanks!

      **Minor comments:** • Body text refers to Figure 1A and 1B but the figure itself does not have panels labeled A or B.

      Figure 1 was revised to show panels A and B labeled clearly.

      • Line 885: "S3" is missing from the beginning of the title of the figure.

      “S” was added to the figure title.

      Reviewer Identity: This is Michael McMurray, PhD, Associate Professor of Cell and Developmental Biology, University of Colorado Anschutz Medical Campus

      Reviewer #3 (Significance (Required)): This is an important conceptual advance in our understanding of septin function because previous work in fungal septins mostly points toward them being important in directing or restricting the localization of other proteins that modify the cell wall or plasma membrane. This new work suggests that septins can play a sensing role, as well. As a fungal (budding yeast) septin researcher myself, I think that other fungal septin researchers would be very interested in these results, and I also think the broader septin community would appreciate it. Additionally, those studying fungal cell wall and plasma membrane biogenesis and coordination, including the Cell Wall Integrity Pathway, will be interested. REFEREES CROSS COMMENTING After reading Reviewer #1's comments, I agree that it would be appropriate to modify the wording of the authors' conclusions about where the septins lie in the CWI pathway (upstream or downstream). While they do mention that there may be other ways to interpret their results, a reader would have to search for the mention of these caveats and if the reader did not, then the strong conclusion statements might be taken as fact.

      The abstract, main text, and discussion have been modified to show that while there is evidence that the septins interact with the CWI pathway, it is not clear which component is upstream vs downstream. See response to reviewer 2 above for details.

      On the other hand, I don't think additional experiments looking at deletions of the other core septins will be worthwhile. I think that there is sufficient evidence to suspect that any single core septin deletion mutant will behave similar to another, and therefore that any one can be taken as representative. While it's possible that the authors might find something informative by looking at other mutants, I personally find the likelihood too low to justify additional experimentation along those lines.

      Based on results from previous work from our lab, there are two subgroups of core septins in A. nidulans (hexamer and octamer) and septins within subgroups appear to behave similarly. The results from the current work support this idea with the same groups of mutants behaving in very similar ways. So, the core hexamer septins, AspACdc11, AspBCdc3, and AspCCdc12 can be used to make predictions about each other, but not about the octamer-exclusive septin AspDCdc10 or the noncore septin AspE. We agree with reviewer 3 that repeating analysis on multiple septins within a subgroup is not likely to give new insight. However, we were not careful in the original version of the manuscript to distinguish between core hexamer and octamer septins. As detailed in the response to reviewer 2 above, we have modified the manuscript throughout to make clear which subgroup of septins were being examined and to put conclusions into this context.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this work the authors use genetic analysis in Aspergillus nidulans to identify phenotypes of septin mutants that point to roles for septins in coordinating the cell wall integrity pathway with lipid metabolism in a manner involving sphingolipids. Most of the major conclusions derive from monitoring the effects of combined genetic or chemical manipulations that target specific components of the pathways of interest. Additionally, the authors monitor the subcellular localization of septins, cell-wall modifying enzymes, and components of the cell wall itself.

      Major comments:

      The key conclusions are convincing, with the unavoidable caveat that null mutations of this sort and chemical inhibitors of these kinds could have unanticipated effects, such as upregulation of unexpected pathways or other compensatory alterations. The authors qualify their conclusions appropriately in this regard.

      The methods are explained very clearly and the data are presented appropriately. In some cases results are shown as representative images illustrating altered localization of a protein or a cell wall component. The changes observed in the experimental conditions are fairly obvious, but some quantification would not be difficult and would likely make the results even more obvious. For example, the Calcofluor White staining patterns might be nicely quantified by linescans along the hyphal length, and the same is true for AspB-GFP localization upon addition of drugs.

      I could imagine one simple experiment that might generate interesting and relevant results, but by no means would this be a critical experiment for this study. In yeast, exposure to Calcofluor triggers increased chitin deposition in the wall. It would be interesting to know how Calcofluor staining looks in WT or septin-mutant cells that have been growing the presence of Calcofluor for some time, particularly with regard to the localization of chitin deposition in these cells. Such experiments could help connect the idea of septins as sensors of membrane lipid status and also effectors of CWI signaling.

      Minor comments:

      • Body text refers to Figure 1A and 1B but the figure itself does not have panels labeled A or B. • Line 885: "S3" is missing from the beginning of the title of the figure.

      Reviewer Identity: This is Michael McMurray, PhD, Associate Professor of Cell and Developmental Biology, University of Colorado Anschutz Medical Campus

      Significance

      This is an important conceptual advance in our understanding of septin function because previous work in fungal septins mostly points toward them being important in directing or restricting the localization of other proteins that modify the cell wall or plasma membrane. This new work suggests that septins can play a sensing role, as well. As a fungal (budding yeast) septin researcher myself, I think that other fungal septin researchers would be very interested in these results, and I also think the broader septin community would appreciate it. Additionally, those studying fungal cell wall and plasma membrane biogenesis and coordination, including the Cell Wall Integrity Pathway, will be interested.

      REFEREES CROSS COMMENTING

      After reading Reviewer #1's comments, I agree that it would be appropriate to modify the wording of the authors' conclusions about where the septins lie in the CWI pathway (upstream or downstream). While they do mention that there may be other ways to interpret their results, a reader would have to search for the mention of these caveats and if the reader did not, then the strong conclusion statements might be taken as fact. On the other hand, I don't think additional experiments looking at deletions of the other core septins will be worthwhile. I think that there is sufficient evidence to suspect that any single core septin deletion mutant will behave similar to another, and therefore that any one can be taken as representative. While it's possible that the authors might find something informative by looking at other mutants, I personally find the likelihood too low to justify additional experimentation along those lines.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The study by Mela and Momany describes the function of core septins of A. nidulans and links with the requirement of the cell wall integrity pathway and the sphingolipids which, are required for membrane and cell wall stability. The study is of interest for the fungal genetics community, and the authors have conducted a substantial amount of work in a field they have substantial experience. However, one of the main weaknesses of the manuscript is the assumption whether the CWI pathway controls de septin function of if the core septins control it.

      Major comments

      In the abstract, the authors claim that double mutant analysis suggested core septins function downstream of the final kinase of the cell wall integrity pathway. However, from the experiments showed, it is difficult to be convinced about that. The authors should make efforts do make it clear in the manuscript and the discussion.

      For example:

      -Line 25-26 (abstract): "Double mutant analysis suggested core septins function downstream of the final kinase of the cell wall integrity pathway."

      -Line 181-182; 219-220 (results) "Double mutant analyses suggest core septins modulate the cell wall integrity pathway downstream of the kinase cascade."

      This conclusion is one of the most important of the manuscript. However, this reviewer argues that it cannot be convincingly addressed if at least the phosphorylation ok the MAP kinase MpkA in the septins background is not evaluated under conditions of cell stress and sphingolipid biosynthesis inhibition. The genetic analysis alone maybe not enough to infer if septins control the CWI or the other way around. There may have compensatory effects when the CWI pathway is impaired. For example, most of the septins and mpkA double mutants seems to suppress the defect of the delta mpkA under cell wall stress. The authors should consider this idea.

      There is no clear evidences on the manuscript that the core septins AspA, AspB, AspC , and ApsD are epithastic in A. nidulans. Therefore, the authors choice of using different Asp deletion mutants as a proxy for all the septins mutants is questionable. For example, there is no mention of why AspB was chosen for Figure 2 (chitin and β-1,3-glucan deposition), and AspA was chosen for Figure 3 (chitin synthase localization) since these experiments are correlated. The same is true for Figure S1 where AspB and AspE were used. One can wonder if some of the core septins would have a major impact in the chitin content.

      In a related comment about Figure 3, the reallocation of chitin synthases in the absence of septins is very interesting, but consider that all the core septin genes should be tested. Without a fully functioning cell wall, the formation of septa will be impaired. It makes their results less surprising.

      Why was chitin synthase B chosen to be analyzed in terms of reallocation? How many chitin synthases are in the A. nidulans genome. This rationale should be explained in the manuscript.

      Figure 3 and Figure 4. The authors should make efforts to quantify the phonotypes they claim. They are overall very subtle, especially for Figure 3. Also, a decrease of fluorescence is a tricky observation that should be better reported by quantification.

      Again, in Figures 5, 6, and 7, it is clear that the different septins respond differently when ergosterol or sphingolipids synthesis is impaired. It also raises the question again if there are differences in the role of septin genes. Can the authors use previous information about differences in septin function to improve the model (Figure 8)

      For the above-discussed reasons, the conclusion on lines 384-388 (discussion) is not completely supported by the experiments shown in the manuscript. The authors need to make a better structured and more straightforward story emphasizing the stronger points and reducing descriptions of more speculative points. Minor comments Overall the figure caption could be shortened. They are too descriptive and contain details that are easily inferred for the images and from the materials and methods.

      The authors made every effort to cove the precedent literature, but the manuscript has 115 references. The authors should evaluate if all the cited literature is extremely relevant. The manuscript would benefit for that conciseness.

      Line 124, 493: Replace 10ˆ7, 10ˆ4 to 107, 104, etc

      The use of fludioxonil as a probe to detect cell wall impairment is perhaps out of context. This drug responds primarily to the HOG pathway and also respond to oxidative damage. So, these results could be suppressed.

      Line 140: "exposure" would be more appropriate than architecture. Please also consider that the difference in the cell wall reported in Figure S1 are very subtle. Are they relevant?

      Line 144: explain briefly what it is about and why it was chosen instead of the total detection of chitin sugar monomers. Line 538: Cell wall extraction section. Is this a new method? There is no supporting literature.

      The results described on lines 232-257 are marginal to the study and are not exploited by the authors to address the central question of the manuscript, which is the role of the CWI pathway, septins, and sphingolipids. This section could be suppressed or very briefly mentioned in the preceding section.

      Significance

      The topic of the manuscript is highly relevant to the fungal biology field and employs a very important genetic model. The cooperation of signaling pathways in mains aspects of fungal physiology is the main significant contribution of this manuscript.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Septins are highly conserved small GTPase cytoskeletal proteins that function as molecular scaffolds for dynamic cell wall and plasma membrane-remodeling, as well as diffusion barriers restricting movement of membrane and cell wall-associated molecules. Recent work has started to unravel the functional connections between the septins, cell wall integrity MAPK pathway signaling, and lipid metabolism, however most studies have focused on a small sub-set of septin monomers and/or were conducted in primarily yeast-type fungi.

      Here the authors show in the filamentous fungus A. nidulans that the core hexamer septins are required for proper coordination of the cell wall integrity pathway, that all septins are involved in lipid metabolism. Especially sphingolipid, but not sterols and phosphoinositides, contributes to the localization and stability of core septins at the plasma membrane.

      The experiments are simple and clear, therefore the conclusion is convincing. Fig.8 model, I would like to see the situation of septin mutant.

      Significance

      Since localization of cell wall synthesis proteins, lipid domains and septins are likely to depend on each other, sometimes difficult to evaluate the effect is direct or indirect. The comprehensive analyses like performed here are helpful to catch the overview in the field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors use focused-ion beam (FIB) milling coupled with cryo-electron tomography and subtomogram averaging to uncover the structure of the elusive proximal and distal centrioles, as well as different regions of the axoneme in the sperm of 3 mammalian species: pig, horse, and mouse. The in-situ tomograms of the sperm neck region beautifully illustrate the morphology of both the proximal centriole, confirming the partial degeneration of mouse sperm, and intriguingly, asymmetry in the microtubule wall of pig sperm. In distal centrioles, the authors show that in all mammalian species, microtubule doublets of the centriole wall are organized around a pair of singlet microtubules. The presented segmentation of the connecting piece is beautiful and nicely shows the connecting piece forming a nine-fold, asymmetric, chamber the centrioles. The authors further use subtomogram averaging to provide the first maps of the mammalian central pair and identify sperm-specific radial spoke-bridging barrel structures. Lastly, the authors perform further subtomogram averaging to show to the connecting site of the outer dense fibers to the microtubule doublet of the proximal principal piece and confirm the presence of the TAILS microtubule inner protein complex (Zabeo et al, 2018) in the singlet microtubules occupying the tip of sperm tails.

      The manuscript provides the clearest insight into flagellar base morphology to date, giving insight into the morphological difference between different mammalian cilia and centriole types. The manuscript is suitable for publication, once the following questions are addressed.

      We are ecstatic that the reviewer shares our enthusiasm for this work. We are particularly grateful that the reviewer appreciates the significance of the unique, and hitherto under-explored biology of the sperm centrioles and the flagellar base.

      **Major Points:**

      How many centrioles and axonemes were used in generating the averages presented in the paper? If too few samples were used, especially in centrioles undergoing dramatic remodeling or degeneration, the reality of MIPs and MAPs being present might be completely affected. For instance, In figure 1d, the authors present a cryoET map of the centriole microtubule triplet. However, centrioles are divided into several regions with different accessory elements. Here, the authors could show the presence of only part of the A-C linker. The A-C linker covers only 40% of the centriole, so does it mean that this centriole is made only of the accessories that characterize the proximal side of the centriole? In the same line, what were the boundaries governing subtomogram extraction? For example, in the distal centriole, were microtubules extracted from just before the start of the transition zone, to the end of the microtubule vaulting, more pronounced at the end of the proximal region? There are known heterogeneities in centriole, as well as flagella, ultrastructure along the proximal distal axis. If no pre-classification was performed for subtomogram longitudinal position along with the centriole and axoneme, structural features may be averaged out, and or present and not reflecting their real longitudinal localization. The classification should be applied here if it was not the case.

      These are all valid points. Because there is no easy way to target the PC/DC when cryo-FIB milling, and because there is only one of each structure in every cell, the chances of catching them in ~150-nm-thin lamellae are slim (not to mention the number of things that can and do go wrong when doing cryo-ET on lamellae). As such, the averages of the PC were generated from 3 tomograms (3 cells) and those of the DC from 2 tomograms (2 cells). We do have more tomograms with the PC/DC, but these were used for segmentation/visual inspection since we only used the best tomograms for averaging. These numbers are not entirely atypical for cryo-FIB datasets; the only other in situ centriole structures are from 5-6 centrioles (from Chlamydomonas, from Le Guennec et al 2020 doi: 10.1126/sciadv.aaz4137 and Klena et al 2020 doi: 10.15252/embj.2020106246).

      To allow readers to adjust their interpretations according to the small number of cells analysed, we explicitly stated the number of animals/cells/tomograms used to generate averages in Table S1. Furthermore, we amended the text to clarify which regions of the centrioles our averages represent. These changes are detailed below:

      (1) proximal centriole

      The lamellae used for averaging PC triplets caught mostly the proximal end of the centriole, and essentially all of the particles come from the most proximal ~ 400 nm. In a sense, this was a form of pre-classification. We now state explicitly that our structure represents only the proximal region and that proximal/distal differences may be identified in the future (see section on distal centriole below). Despite the limited particle number, we are confident in the presence of the MIPs as these are also visible in the raw data (the striations in Fig. 1a, now Fig. 1d, for instance). Page 7, Line 165 was edited accordingly as well as the legend to Fig. 1.

      (2) distal centriole

      The subtomograms used for the DC average were extracted from the region of the distal centriole closest to the base of the axoneme (i.e; the region marked “distal centriole” in Fig. 2h-i). Because the DC doublet average in Fig. 2j was generated from very few particles, we tried to be very conservative when interpreting it. Page 9, Line 216 was edited accordingly likewise the legend to Fig. 2.

      (3) axoneme

      We did attempt to average the axoneme from different regions of flagella (midpiece, proximal principal piece, distal principal piece). This is shown in Fig. 6d-l. The major difference we found was at the doublet-ODF connection. We did not find any striking differences in MIP densities, or in radial spoke densities along the proximodistal axis. As such, the averages in Fig. 5 are from the entire principal piece (but not the midpiece), which we state in the figure legend.

      Because mammalian sperm flagella are very long, it is possible that we missed more subtle differences. We now state this in the Discussion (page 20, line 491):

      **Minor Points:**

      • In line 3, motile cilia are not only used to swim, they can move liquid or mucus for instance.

      Done. Page 3, line 64

      • In line 175, the authors stated " a prominent MIP associated with protofilament A9, was also reported in centrioles isolated from CHO cells (Greenan et al. 2018) and in basal bodies from bovine respiratory epithelia (Greenan et al 2020). Actually, this MIP has been seen in many other centrioles from other species, such as Trichonympha (https://doi.org/10.1016/j.cub.2013.06.061 ), Chlamydomonas, and Paramecium ( DOI: 10.1126/sciadv.aaz4137 ). Citing these studies will reinforce the evolutionary conservation of this MIP and therefore its potential crucial role in the A microtubule.

      We thank the reviewer for pointing out these very important papers, we added them to the manuscript (page 7, lines 175-176).

      • In Line178, the authors stated: "Protofilaments A9 and A10 are proposed to be the location of the seam (Ichikawa et 2017)". High-resolution cryoEM maps confirmed it: https://doi.org/10.1016/j.cell.2019.09.030 . This publication should be cited. Moreover, authors should also refer to this paper when discussing MIPs in the microtubule doublet.

      Done (page 7, lines 178-179 and page 13, line 329).

      We also now cite Ma et al (along with Ichikawa et al 2019 doi: 10.1073/pnas.1911119116 and Khalifa et al 2020 doi: 10.7554/eLife.52760) in the Discussion when alluding to high-resolution structures as a possible means of identifying MIPs (page 19, lines 479).

      • In Line 187-189 the authors stated, "We resolved density of the A-C linker (gold) which is associated with protofilaments C9 and C10." The A-C linker interconnects the triplets of the proximal centriole (Guichard et. al. 2013, Li et. al. 2019, Klena et. al. 2020) with distinct regions binding the C-tubule, as shown by the authors in gold, as well as an A-link, making contact with the A-tubule through various protofilaments in a species-specific manner, but always on protofilament A9. The authors may have identified the A-link, labeled in green, on the outside of protofilament A8/A9 in Figure 1d.

      We thank the reviewer for pointing this out. The position of the olive green density associated with A8/A9 is indeed consistent with the A-link, and this is also now illustrated more clearly in the new version of Fig. 1e (now Fig. 1h, see below). We accordingly edited page 8, lines 187-188.

      • In figure 1e, the authors provide a 9-fold representation of the centriole based on their map. How relevant is this model ? the distance between triplet is inconsistent here, which has not been observed before. Do they use true 3D coordinates to generate this model? The A-C linker, which is only partially reconstructed, does not contact the A microtubule. Is it really the case? did the authors see that the A-link density of the A-C linker has disappeared? If these points are not clearly specified, this representation might be misleading.

      In order to avoid misleading readers, we replaced this panel with a model generated directly by plotting back the averages into their original positions and orientations in the tomogram (new Fig. 1h). This model now shows that the olive green density on A8/A9 is in the right position to form part of the A-C linker (as Reviewer 1 correctly pointed out in their previous point). We have amended the figure legend accordingly. We also described how the plotback was generated in the Materials and Methods section (page 26, line 648).

      As the reviewer points out, the distance between triplets does indeed seem inconsistent in the plotback. This is an interesting observation, but we feel it is a bit too preliminary to discuss in detail here. This can be explored in a follow-up study more focused on sperm centriole geometry.

      • The nomenclature regarding MIPs is sometimes confusing in this manuscript. For example, in lines 228-229 "We then determined the structure of DC doublets, revealing the presence of MIPs distinct from those in the PC." Does this include the gold and turquoise labeled structures in Figure 2j? These densities appear to correspond to the inner scaffold stem in the gold density presented in Figure 2j, and armA, presented in the turquoise density (Li et. al. 2011, Le Guennec et. al. 2020). The presence of this Stem here is important as it correlates with the presence of the molecular player making the inner scaffold (POC5, POC1B, CENTRIN): https://doi.org/10.1038/s41467-018-04678-8

      While we were initially very conservative with interpreting the DC doublet average (as stated above it comes from very few particles), we agree with the reviewer’s assessment that the gold and turquoise densities in Fig. 2j are consistent with the Stem and armA respectively of the inner scaffold. Because the inner scaffold contributes to centriole rigidity, it will be interesting to determine if and how it changes during remodelling of the atypical DC in mammalian sperm. Intriguingly, at least some inner scaffold components (including POC5, POC1B) reorganise into two rods in the mammalian sperm DC (Fishman et al 2018 doi: 10.1038/s41467-018-04678-8). We expanded the section on the DC average (page 9, lines 218-220):

      • The connecting piece is composed of column vaults emanating from the striated columns is compelling and beautiful segmentation data. However, it is important to note how many pig sperm proximal centrioles had immediate-short triplet side contact with the Y-shaped segmented column 9, as well as in how many mouse centrioles have the two electron-dense structures flanking the striated columns.

      Done. Material and Methods Page 25, lines 615-619.

      The resolution of the mammalian central pair is an important development brought by this work. The structural similarity between the central pair of pig and horse is convincing. However, with only 281 subtomograms being averaged for the murine central pair, corresponding to an estimated resolution of 49Å, the absence of the helical MIP of C1 with 8 nm periodicity suggests that there is simply not enough signal to capture it in the average. The same could be said for the smaller MIP displayed in Figure 4 c, panel ii. This point should be clearly stated.

      We agree with the reviewer that the quality of the mouse CPA structure is not on par with the pig and horse CPA structures. We now explicitly state this caveat in the text (pages 11, lines 276-277):

      Another piece of compelling data presented in this study is the attachment of the outer dense fibers to the axoneme of the midpiece and proximal and distal principal pieces. From the classification data presented along the flagellar length, it is clear that the only ODF contact made with the axoneme is at the proximal principle plate. However, this is far from obvious in the native top view images presented. Is it possible to include a zoomed inset of the connection between the A-tubule and ODF connection?

      We are very happy that the reviewer finds this data exciting. As Fig. 6 is quite cluttered as is, we instead tried to better annotate the cross-section views of the axoneme by tracing one doublet-ODF pair in each image (or only a doublet in the case of the distal principal piece). This shows that there is a gap between the doublet and the ODF in the midpiece, and that there is no such gap in the principal piece. We also hope that annotating one doublet-ODF pair helps the reader see that the same pattern holds true for the other doublets/ODFs. The legend to Fig. 6 was changed accordingly.

      Reviewer #1 (Significance (Required)):

      This work is of good quality and provides crucial information on the structure of centriole and axoneme in 3 different species. This work complements well the previous works.

      The audience for this type of study is large as it is of interest to researchers working on centrioles, cilium, and sperm cell architecture.

      We are pleased the reviewer appreciate the quality of our work and see the interest for broad audience.

      My expertise is cryo-tomography and centriole biology

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, Leung et al. used state-of-the-art EM imaging techniques, including FIB cryo-milling, Volta Phase plate, cryo-electron tomography and subtomogram averaging, to study the structure of sperm flagella from three mammalian species, pig, horse and mouse. First, they described two unique centrioles in the sperm, the PC and the DC. They found the PCs are composed of a mixture of triplet and doublet MTs. In contrast, the DCs are composed mainly of doublet and singlet MTs. By using subtomogram averaging, they identified a number of accessory proteins, including many MIPs bound to the MT wall. Many are unique to the mammalian sperm. They further described the connecting piece region of the sperm enclosing the centrioles and found an asymmetric arrangement. Furthermore, the authors presented the structure of sperm axonemes from all three species. These include the DMT and the CPA. Finally, they described the tail region of the sperm and described how the DMTs transitioned to the singlet MTs.

      This is a beautiful piece of work! It is by far the most comprehensive structural study of mammalian sperm cells. These findings will serve as a valuable resource for structure and function analysis of the mammalian flagella in the future. Now the stage is set for identifying the molecular nature of the structures and densities described in this study.

      We thank the reviewer for their positive evaluation! We are very happy that they share our excitement for the work, and that they also see it as “setting the stage” for future studies at the molecular level.

      The manuscript is clearly written. The data analysis is thorough. The conclusions are solid and not overstated. I don't have any major issues for its publication. A number of minor suggestions are listed below. Most are related to the figures and figure legends.

      Figure 1d, the figure legend should mention this is the subtomogram average of PC triplet MTs from pig sperm, though this is mentioned in the text. Also, for convenience, the color codes for the MIPs should be mentioned in the figure legend.

      Done.

      Figure 2J, similarly, the figure legend should mention this is the subtomogram average of DC doublets. It also needs a description of the color codes of the identified MIPs. For the DMT, please indicate the A- and B-tubule, which are colored in light or dark blue.

      Done, except we would prefer not to enumerate the MIPs as we did not name them nor discuss them extensively in the main text as we do not want to over-interpret the MIPs at this point as the average is from relatively small number of particles. However, we did specify that the gold and turquoise densities on the luminal surface are consistent with the inner scaffold. The figure legend was edited accordingly.

      Line 228, "We then determined the structure of DC doublet by subtomogram averaging"

      Done.

      For both Fig 2 and Fig 3. the DC doublets are colored in dark and light blue, please specify which is the A- or B-tubule in the figure legends.

      Done.

      Line 273, need space between "goldenrod"

      We would prefer to keep “goldenrod” spelled as is since this is how the color is referred to in Chimera and ChimeraX.

      Figure 4. need to expand the figure legend. Panels I, ii, iii, iv, are cut-through view of the lumen of CPA microtubules C1 and C2.

      Done.

      Line 338, Interestingly, the RS1 barrel is radially distributed asymmetrically around the axoneme

      Done.

      Figure 5, need color codes for the arrowheads (light pink, pink, magenta) in panels i~n,

      Done.

      Figure 7, (a-c) please use arrowheads to indicate the location of caps in the singlet MT.

      Done.

      Reviewer #2 (Significance (Required)):

      This is a beautiful and significant work - by far the most comprehensive analysis of mammalian sperm structure

      We are thrilled the reviewer appreciate the novelty of our work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This is a very interesting study that explores the structural diversity of mammalian sperm flagella, in pig, mouse and horse, at high resolution using cryo-FIB milling and cryo-tomography. The study provides the first in situ cryo-EM structure of a mammalian centriole and describes a number of microtubule associated structures, such as MIPs and plugs at the plus-end of microtubules, that were not been reported so far. Additionally, the authors identify several asymmetries in the overall structure of the flagellum in the three species, which have implications for the understanding of the flagellar beat and waveform geometry in sperm, which are discussed by the authors. Although this study does not provide mechanistic novel information on the function of the described structures, it will undoubtedly serve as a reference for future theoretical and empirical work on the role of these structures in shaping the flagellar beat.

      With the exception of a couple of "eclectic word choices" in the Introduction (see detailed feedback in Minor Comments), the manuscript is also well written. Image acquisition and analysis are sound.

      We thank the reviewer for positively evaluating our work. We are glad that they feel our study will “serve as a reference” to inform future studies.

      However, I have some suggestions that should help the authors to strengthen their claims and present their results. The study is in principle suitable to be published, after the following points will be addressed:

      **Major comments:**

      • A major concern is that it is not clear how many animals, sperms and lamellae the authors used to acquire the data presented in the manuscript. This information needs to be provided, because it not uncommon to encounter aberrant flagella, even in a wildtype animal. The authors should state how many animals, and how many flagella per each animal were analyzed, in order to allow the reader to have an opinion on the reliability of their observations.

      • The figures are esthetically pleasing; however, the figures legends should be carefully revised to include necessary information about color codes, image annotations.

      We thank the reviewer for raising these points. We completely agree that the numbers of animals and cells are important pieces of information. As such, we now explicitly state the number of animals/cells/tomograms used for each average in Table S1. For more qualitative observations (such as the relationship between the asymmetry of the pig sperm PC and the Y-shaped segmented columns), we now state in the number of cells and animals in which we see each feature (see detailed response to Reviewer 1).

      **Minor comments:**

      • Line 26. I do not think that the word "menagerie" is properly used in this context.

      • Line 29. The same is true for the word "Bewildering" in this sentence.

      We apologise for our somewhat eclectic word choice. We see the reviewer’s point that unconventional word choice may distract readers, so we replaced these two words with ‘diverse’ and ‘an extensive’, respectively.

      • Line 286 "Our structures of the CPA are the first from any mammalian system, and our structures of the doublets are the first from any mammalian sperm, thus filling crucial gaps in the gallery of axoneme structures." Sentences like this one would fit much better in the Conclusions or at least in the Discussion.

      We thank the reviewer for this suggestion, but we would prefer to keep this sentence where it is, if possible. We think it is useful to tell the audience upfront why these structures are significant, especially since readers who aren’t deep in the field may be bogged down by all the details.

      • Line 377 "Large B-tubule MIPs have so far only been seen in human respiratory cilia (Fig. 5j) and in Trypanosoma (the ponticulus, Fig. 5n), but the morphometry of these MIPs differs from the helical MIPs in mammalian sperm." Please insert the citations for the studies about respiratory cilia and Trypanosoma flagella.

      Done.

      • In Figure 1. What do the stars shown in panel a and a' indicate?

      We indeed failed to specify what the asterisks/stars indicate. They are meant to emphasise that the electron-dense material in the lumen of the PC is continuous with the CP. We have now specified this in the text (page 10, lines 245).

      Given the complexity of the structures that compose the flagellar system of sperms, it would be helpful to add an illustration of the sperm with careful annotation of the centriole structures and the various segments of the flagellum.

      This is an excellent suggestion. To help orient readers, we added three panels to Fig. 1 (Fig. 1a-c) showing low-magnification images of whole sperm cells. We annotated different parts of the flagellum (neck, midpiece, principal piece, endpiece) so that readers can refer back to these panels in case they want to know which part of the cell the averages are from.

      • Figure 2. Explanation of the used color codes is missing. Additionally, the authors should include an explanation for the black and white arrows and for the 2 insets in i.

      Done. For the color code, please see response to Reviewer 2. For the black and white arrows, we edited the figure legend.

      • In "(j) In situ structure of the pig sperm DC with the tubulin backbone in grey and microtubule inner protein densities colored individually" ...it should be written "...sperm DC microtubule doublet..."

      Done.

      • In this figure, but also in every other figure that shows centriole, axoneme, or even microtubule averages it is important to indicate the microtubule polarity. Please add the symbol + and - to indicate microtubule polarity in the figures.

      Done. In order to avoid overcrowding, we only labelled the pig structures as the horse and the mouse structures are always shown in the same orientations as the pig.

      • Figure 3. Additional to the images in a,b, and c, the original tomographic slices (without segmentation) should be shown here, to allow the reader to visualize the structure.

      We now include three additional supplementary movies slicing through the respective tomograms.

      • Figure 7. Scale bars are missing in d-f.

      Done.

      • Scale bars are missing in most Supplementary figures.

      Done.

      • Table S1. The Information about horse and mouse centriole data is missing.

      The reviewer is correct, but this information is missing because we did not average from the horse and the mouse. For the mouse, the triplets were in various stages of degeneration, resulting in heterogeneity that precluded us from averaging. For the horse, we simply did not catch enough centrioles to generate a meaningful structure.

      Reviewer #3 (Significance (Required)):

      This study provides several novel structural insights in to the sperm flagellum structure that have implications for the understanding of the flagellar beat and waveform geometry in sperm. Although this study does not provide mechanistic novel information on the function of the described structures, it will undoubtedly serve as a reference for future theoretical and empirical work on the role of these structures in shaping the flagellar beat.

      Great to see the reviewer appreciate the novelty of our work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is a very interesting study that explores the structural diversity of mammalian sperm flagella, in pig, mouse and horse, at high resolution using cryo-FIB milling and cryo-tomography. The study provides the first in situ cryo-EM structure of a mammalian centriole and describes a number of microtubule associated structures, such as MIPs and plugs at the plus-end of microtubules, that were not been reported so far. Additionally, the authors identify several asymmetries in the overall structure of the flagellum in the three species, which have implications for the understanding of the flagellar beat and waveform geometry in sperm, which are discussed by the authors. Although this study does not provide mechanistic novel information on the function of the described structures, it will undoubtedly serve as a reference for future theoretical and empirical work on the role of these structures in shaping the flagellar beat. With the exception of a couple of "eclectic word choices" in the Introduction (see detailed feedback in Minor Comments), the manuscript is also well written. Image acquisition and analysis are sound.

      However, I have some suggestions that should help the authors to strengthen their claims and present their results. The study is in principle suitable to be published, after the following points will be addressed:

      Major comments:

      • A major concern is that it is not clear how many animals, sperms and lamellae the authors used to acquire the data presented in the manuscript. This information needs to be provided, because it not uncommon to encounter aberrant flagella, even in a wildtype animal. The authors should state how many animals, and how many flagella per each animal were analyzed, in order to allow the reader to have an opinion on the reliability of their observations.
      • The figures are esthetically pleasing; however, the figures legends should be carefully revised to include necessary information about color codes, image annotations.

      Minor comments:

      • Line 26. I do not think that the word "menagerie" is properly used in this context.
      • Line 29. The same is true for the word "Bewildering" in this sentence.
      • Line 286 "Our structures of the CPA are the first from any mammalian system, and our structures of the doublets are the first from any mammalian sperm, thus filling crucial gaps in the gallery of axoneme structures." Sentences like this one would fit much better in the Conclusions or at least in the Discussion.
      • Line 377 "Large B-tubule MIPs have so far only been seen in human respiratory cilia (Fig. 5j) and in Trypanosoma (the ponticulus, Fig. 5n), but the morphometry of these MIPs differs from the helical MIPs in mammalian sperm." Please insert the citations for the studies about respiratory cilia and Trypanosoma flagella.
      • In Figure 1. What do the stars shown in panel a and a' indicate? Given the complexity of the structures that compose the flagellar system of sperms, it would be helpful to add an illustration of the sperm with careful annotation of the centriole structures and the various segments of the flagellum.
      • Figure 2. Explanation of the used color codes is missing. Additionally, the authors should include an explanation for the black and white arrows and for the 2 insets in i.
      • In "(j) In situ structure of the pig sperm DC with the tubulin backbone in grey and microtubule inner protein densities colored individually" ...it should be written "...sperm DC microtubule doublet..."
      • In this figure, but also in every other figure that shows centriole, axoneme, or even microtubule averages it is important to indicate the microtubule polarity. Please add the symbol + and - to indicate microtubule polarity in the figures.
      • Figure 3. Additional to the images in a,b, and c, the original tomographic slices (without segmentation) should be shown here, to allow the reader to visualize the structure.
      • Figure 7. Scale bars are missing in d-f.
      • Scale bars are missing in most Supplementary figures.
      • Table S1. The Information about horse and mouse centriole data is missing.

      Significance

      This study provides several novel structural insights in to the sperm flagellum structure that have implications for the understanding of the flagellar beat and waveform geometry in sperm. Although this study does not provide mechanistic novel information on the function of the described structures, it will undoubtedly serve as a reference for future theoretical and empirical work on the role of these structures in shaping the flagellar beat.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Leung et al. used state-of-the-art EM imaging techniques, including FIB cryo-milling, Volta Phase plate, cryo-electron tomography and subtomogram averaging, to study the structure of sperm flagella from three mammalian species, pig, horse and mouse. First, they described two unique centrioles in the sperm, the PC and the DC. They found the PCs are composed of a mixture of triplet and doublet MTs. In contrast, the DCs are composed mainly of doublet and singlet MTs. By using subtomogram averaging, they identified a number of accessory proteins, including many MIPs bound to the MT wall. Many are unique to the mammalian sperm. They further described the connecting piece region of the sperm enclosing the centrioles and found an asymmetric arrangement. Furthermore, the authors presented the structure of sperm axonemes from all three species. These include the DMT and the CPA. Finally, they described the tail region of the sperm and described how the DMTs transitioned to the singlet MTs.

      This is a beautiful piece of work! It is by far the most comprehensive structural study of mammalian sperm cells. These findings will serve as a valuable resource for structure and function analysis of the mammalian flagella in the future. Now the stage is set for identifying the molecular nature of the structures and densities described in this study.

      The manuscript is clearly written. The data analysis is thorough. The conclusions are solid and not overstated. I don't have any major issues for its publication. A number of minor suggestions are listed below. Most are related to the figures and figure legends.

      Figure 1d, the figure legend should mention this is the subtomogram average of PC triplet MTs from pig sperm, though this is mentioned in the text. Also, for convenience, the color codes for the MIPs should be mentioned in the figure legend.

      Figure 2J, similarly, the figure legend should mention this is the subtomogram average of DC doublets. It also needs a description of the color codes of the identified MIPs. For the DMT, please indicate the A- and B-tubule, which are colored in light or dark blue.

      Line 228, "We then determined the structure of DC doublet by subtomogram averaging"

      For both Fig 2 and Fig 3. the DC doublets are colored in dark and light blue, please specify which is the A- or B-tubule in the figure legends.

      Line 273, need space between "goldenrod"

      Figure 4. need to expand the figure legend. Panels I, ii, iii, iv, are cut-through view of the lumen of CPA microtubules C1 and C2.

      Line 338, Interestingly, the RS1 barrel is radially distributed asymmetrically around the axoneme

      Figure 5, need color codes for the arrowheads (light pink, pink, magenta) in panels i~n,

      Figure 7, (a-c) please use arrowheads to indicate the location of caps in the singlet MT.

      Significance

      This is a beautiful and significant work - by far the most comprehensive analysis of mammalian sperm structure

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors use focused-ion beam (FIB) milling coupled with cryo-electron tomography and subtomogram averaging to uncover the structure of the elusive proximal and distal centrioles, as well as different regions of the axoneme in the sperm of 3 mammalian species: pig, horse, and mouse. The in-situ tomograms of the sperm neck region beautifully illustrate the morphology of both the proximal centriole, confirming the partial degeneration of mouse sperm, and intriguingly, asymmetry in the microtubule wall of pig sperm. In distal centrioles, the authors show that in all mammalian species, microtubule doublets of the centriole wall are organized around a pair of singlet microtubules. The presented segmentation of the connecting piece is beautiful and nicely shows the connecting piece forming a nine-fold, asymmetric, chamber the centrioles. The authors further use subtomogram averaging to provide the first maps of the mammalian central pair and identify sperm-specific radial spoke-bridging barrel structures. Lastly, the authors perform further subtomogram averaging to show to the connecting site of the outer dense fibers to the microtubule doublet of the proximal principal piece and confirm the presence of the TAILS microtubule inner protein complex (Zabeo et al, 2018) in the singlet microtubules occupying the tip of sperm tails. The manuscript provides the clearest insight into flagellar base morphology to date, giving insight into the morphological difference between different mammalian cilia and centriole types. The manuscript is suitable for publication, once the following questions are addressed.

      Major Points: How many centrioles and axonemes were used in generating the averages presented in the paper? If too few samples were used, especially in centrioles undergoing dramatic remodeling or degeneration, the reality of MIPs and MAPs being present might be completely affected. For instance, In figure 1d, the authors present a cryoET map of the centriole microtubule triplet. However, centrioles are divided into several regions with different accessory elements. Here, the authors could show the presence of only part of the A-C linker. The A-C linker covers only 40% of the centriole, so does it mean that this centriole is made only of the accessories that characterize the proximal side of the centriole? In the same line, what were the boundaries governing subtomogram extraction? For example, in the distal centriole, were microtubules extracted from just before the start of the transition zone, to the end of the microtubule vaulting, more pronounced at the end of the proximal region? There are known heterogeneities in centriole, as well as flagella, ultrastructure along the proximal distal axis. If no pre-classification was performed for subtomogram longitudinal position along with the centriole and axoneme, structural features may be averaged out, and or present and not reflecting their real longitudinal localization. The classification should be applied here if it was not the case.

      Minor Points:

      • In line 3, motile cilia are not only used to swim, they can move liquid or mucus for instance.
      • In line 175, the authors stated " a prominent MIP associated with protofilament A9, was also reported in centrioles isolated from CHO cells (Greenan et al. 2018) and in basal bodies from bovine respiratory epithelia (Greenan et al 2020). Actually, this MIP has been seen in many other centrioles from other species, such as Trichonympha (https://doi.org/10.1016/j.cub.2013.06.061 ), Chlamydomonas, and Paramecium ( DOI: 10.1126/sciadv.aaz4137 ). Citing these studies will reinforce the evolutionary conservation of this MIP and therefore its potential crucial role in the A microtubule.
      • In Line178, the authors stated: "Protofilaments A9 and A10 are proposed to be the location of the seam (Ichikawa et 2017)". High-resolution cryoEM maps confirmed it: https://doi.org/10.1016/j.cell.2019.09.030 . This publication should be cited. Moreover, authors should also refer to this paper when discussing MIPs in the microtubule doublet.
      • In Line 187-189 the authors stated, "We resolved density of the A-C linker (gold) which is associated with protofilaments C9 and C10." The A-C linker interconnects the triplets of the proximal centriole (Guichard et. al. 2013, Li et. al. 2019, Klena et. al. 2020) with distinct regions binding the C-tubule, as shown by the authors in gold, as well as an A-link, making contact with the A-tubule through various protofilaments in a species-specific manner, but always on protofilament A9. The authors may have identified the A-link, labeled in green, on the outside of protofilament A8/A9 in Figure 1d.
      • In figure 1e, the authors provide a 9-fold representation of the centriole based on their map. How relevant is this model ? the distance between triplet is inconsistent here, which has not been observed before. Do they use true 3D coordinates to generate this model? The A-C linker, which is only partially reconstructed, does not contact the A microtubule. Is it really the case? did the authors see that the A-link density of the A-C linker has disappeared? If these points are not clearly specified, this representation might be misleading.
      • The nomenclature regarding MIPs is sometimes confusing in this manuscript. For example, in lines 228-229 "We then determined the structure of DC doublets, revealing the presence of MIPs distinct from those in the PC." Does this include the gold and turquoise labeled structures in Figure 2j? These densities appear to correspond to the inner scaffold stem in the gold density presented in Figure 2j, and armA, presented in the turquoise density (Li et. al. 2011, Le Guennec et. al. 2020). The presence of this Stem here is important as it correlates with the presence of the molecular player making the inner scaffold (POC5, POC1B, CENTRIN): https://doi.org/10.1038/s41467-018-04678-8
      • The connecting piece is composed of column vaults emanating from the striated columns is compelling and beautiful segmentation data. However, it is important to note how many pig sperm proximal centrioles had immediate-short triplet side contact with the Y-shaped segmented column 9, as well as in how many mouse centrioles have the two electron-dense structures flanking the striated columns.

      The resolution of the mammalian central pair is an important development brought by this work. The structural similarity between the central pair of pig and horse is convincing. However, with only 281 subtomograms being averaged for the murine central pair, corresponding to an estimated resolution of 49Å, the absence of the helical MIP of C1 with 8 nm periodicity suggests that there is simply not enough signal to capture it in the average. The same could be said for the smaller MIP displayed in Figure 4 c, panel ii. This point should be clearly stated.

      Another piece of compelling data presented in this study is the attachment of the outer dense fibers to the axoneme of the midpiece and proximal and distal principal pieces. From the classification data presented along the flagellar length, it is clear that the only ODF contact made with the axoneme is at the proximal principle plate. However, this is far from obvious in the native top view images presented. Is it possible to include a zoomed inset of the connection between the A-tubule and ODF connection?

      Significance

      This work is of good quality and provides crucial information on the structure of centriole and axoneme in 3 different species. This work complements well the previous works. The audience for this type of study is large as it is of interest to researchers working on centrioles, cilium, and sperm cell architecture.

      My expertise is cryo-tomography and centriole biology

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      This manuscript follows on from previous work from the Rhind lab to investigate whether the load of MCMs at origins is a factor in when the origin activate (as a population average) during S phase. The authors use budding yeast and a auxin degron system to modulate the levels of an MCM subunit. This allows them to titrate down the concentration of the MCM hexamer and observe the effect. Crucially, they assay both the reduction in MCM load at origins and the subsequent replication dynamics in the same experiment. This is the power of their approach and allows them to rigorously test their hypothesis.

      **Major comments**

      1.I found the introductory paragraph discussing the Rhind lab hypothesis about the possibility of multiple MCM being loaded at origins somewhat misleading. The first paragraph of the discussion was much clear. However, I feel that the introductory paragraph should deal with the difference between the two proposals: 0-1 MCM-DH per origin (de Moura et al), vs 0-50+ MCM-DH (Yang et al). It s also important to note that Foss et al find that "In budding yeast, [MCM] complexes were present in sharp peaks comprised largely of single double-hexamers" - i.e. consistent with 0-1 MCM-DH per origin.

      To improve the balance of the introduction, I think the authors should briefly introduce the concepts behind the 0-1 MCM-DH per origin; this was defined as origin competence by Stillman and clearly described by McCune et al (2008; see figure 8) prior to the work from de Moura et al.

      Furthermore, in the discussion the authors should be more even-handed. To date there is no data to conclusively rule one way or the other in distinguishing between single vs multiple MCMs. The authors cite Lynch et al and state "overexpression of origin-activating factors in S phase causes most all origins to fire early in S phase, consistent with most origins having at least one MCM loaded". However, Lynch et al report equivalent (roughly equal) origin efficiencies, but the assay doesn't distinguish between all going up to high efficiency or all going to a lower intermediary efficiency. Given that fork factors (polymerases, etc) are likely to become limiting at some point (or checkpoints could be activated due to limited dNTP supplies) it would seem plausible that uniform origin efficiency could be a consequence of less than maximal origin firing. As part of this discussion it would be useful for the authors to include what conclusions have been reached on MCM load from in vitro systems (with chromatin substrates).

      Because the main focus of the paper is not dependent on whether MCM stoichiometry varies from 0 to 1 or 0 to many, we had relegated our discussion of absolute stoichiometry to the Discussion. However, it is clear from multiple reviewer's comments that it is something very much on readers minds. Therefore, we have now included a brief introduction to the 0-to-1 and 0-to-many scenarios in the Introduction and moved the bulk of the discussion of the data supporting the two scenarios to the Discussion.

      2.The authors are not the first to look at the consequence of reduced MCM concentrations on origin function. This was essentially the basis for the MCM screen undertaken by Bik Tye's lab that first identified the MCM genes. In addition to temperature sensitive mutants, the Tye group also examined heterozygotes (Lei et al., 1996) to show differential effect on the ability of two origins to support plasmid replication. The authors finds are entirely consistent with these early studies, particularly since ARS416 (formerly ARS1) was found to highly sensitive to reduced MCM levels and ARS1021 (formerly ARS121) was found to be insensitive to MCM levels. The authors find a signifiant reduction in MCM load at ARS416, but the MCM load at ARS1021 is unaltered by reduced MCM concentration. It would be worth the authors noting this consistency. The authors do cite the Lei study, but not in this context. The original MCM screen was published here:

      Maine, G., Sinha, P., Tye, B. (1984). Mutants of S. cerevisiae defective in the maintenance of minichromosomes Genetics 106(3), 365 - 385.

      Furthermore, at the end of the discussion the authors state that "it will be interesting to dissect the specific cis- and trans-acting factors that make origins sensitive or resistant to changes in MCM levels". The equivalent effect reported by the Tye lab has already been dissected by the Donaldson lab (Nieduszynski et al., 2006) and perhaps it would be worth briefly mentioning their findings.

      We have included both of these literature precedents in the Discussion.

      3.The authors should show the flow cytometry data for each of their cell cycle experiments, if only in supplementary figures. This is important to allow a reader (and reviewer) to judge the level of synchrony achieved when interpreting the results.

      This data is now included as Figure S1

      4.I think the authors should show the ChIP signal at some example origins, including ones sensitive and insensitive to the reduction in MCM concentration. Currently all the high resolution ChIP data (i.e. over 1400 bp, e.g. Fig 3a) is presented as meta-analyses of many origins.

      We will include this analysis in a subsequent revision.

      5.When describing the results in Fig 4a the authors focus on changes (highlighted in black boxes) that fit their expectation. However, there are other sites that should at least be mentioned that don't seem to fit the authors model, e.g. ARS517, ARS518. It would be worth discussing what fraction of the timing data can be explained by the reduced MCM load.

      We now explicitly point out that Figures 4c and 4d address this issue of the robustness of the correlation. Although there is significant variation, as the reviewer points out, the trend is seen genome wide. As it happens, both ARS517 and ARS518 do fit the model reasonably well. They have intermediate loss of MCM signal and intermediate delay in timing.

      **Minor comments**

      -These data, rather than this data (throughout).

      I suspect that the journal style and/or copy editors will make the final call. However, I will point out that although 'data' is most certainly plural in Latin, its predominate modern English usage is as a mass noun, such as water or sand or information. In general, users do not think of, or use, 'data' as a collection of discrete elements, each on being a 'datum', a contention supported by the very infrequent use of the word datum. For instance, in ChIP-seq experiment, what is a datum? Each individual read? Each individual nucleotide in each read? The quality score for each individual nucleotide in each read? Each pixel in each image from the sequencer? When one wants to refer to an individual piece of data, common usage is to refer to a data point, just as one would refer to a grain of sand. Moreover, if 'data' were plural, it would be incorrect to use it in phrases such as "there is very little data available". Would the review really suggest using "there are very few data available"?

      -the authors should clearly state in figure legends what window size has been used in analysing genomic data.

      All analyses were done using 1kb windows, as now stated in the figure legends.

      -in figure 2a the authors show pairwise comparisons between conditions, it would be nice to see the 3rd pairwise comparisons perhaps as a supplementary figure

      We have included the third comparison in Figure 2a.

      -in figure 2c it would be clearer to use the same colour for the lines and the points

      The regression lines are in the same colors as the data points they fit. x=y is shown in blue for comparison, as now noted in the figure legend.

      -the authors should avoid the use of red/green colour combinations in their figures (see: https://thenode.biologists.com/data-visualization-with-flying-colors/research/)

      All figures will be redrawn in colorblind-accessible colors in a subsequent revision.

      -in the text the authors state "ORC binding to the ACS and subsequent MCM loading is a directional process dependent on a ACS- site and a similar but inverted nearby sequence (Xu et al., 2006)". I think it would be more appropriate to cite the following study here:

      Coster, G., Diffley, J. (2017). Bidirectional eukaryotic DNA replication is established by quasi-symmetrical helicase loading Science (New York, NY) 357(6348), 314 - 318. https://dx.doi.org/10.1126/science.aan0063

      The Coster reference has been included.

      -the list of factors that influence replication timing should include Rif1, whereas it is less clear that Rpd3 acts within the unique genome (as opposed to indirectly via repetitive DNA, e.g. rDNA)

      Rif1 has been added to the list.

      -figure 4 - it might help to mark the centromere on panel a. Also, why do the ChIP peaks and annotated origins appear to line up so poorly?

      The shift between the peaks and the ACS positions was introduced during the construction of the figure. Thanks for catching it. The alignment has been corrected and the centromere annotation has been added.

      -figure 4d - would it not be better to use fraction of lost MCM signal on the x-axis as in previous figures?

      If T_rep was a linear function of MCM stoichiometry, fraction lost would work as well as amount lost. However, we find that there is a lower correlation between fraction of MCM signal lost and T_rep delay than between absolute MCM signal lost and T_rep delay, suggesting a more complicated relationship.

      -"with galactose or raffinose, to induce or repress Mcm2-7 overexpression, respectively." This is incorrect, raffinose does not repress this promoter (that requires glucose).

      Fixed.

      -the S. pombe spike in is a great addition to the over expression experiments. It's a shame that it wasn't included in the auxin experiments.

      Yes, we agree.

      -why does the data in fig 5d appear to be at much lower resolution that the previous ChIP data?

      The resolution was inadvertently reduced during the rendering of the figure. The resolution has restored.

      -in the sequencing analysis pipeline for MCM ChIP the authors use a 650 bp upper size limit; why have such a large threshold compared to the size of a nucleosome? Are the analyses and findings sensitive to this size threshold?

      Although the MNase digestion was optimized to produce mostly mononucleosomal-sized digestion, some di- and very little tri- nucleosomal fragments still remain. In order to capture as many of the MCM-protected immunoprecipitated fragments as possible, the upper limit was set at 650 bp (up to 4 nucleosomes-worth of DNA). However, there is a very minimal contribution from fragments larger than mononucleosomes, qualitatively as well as quantitatively in 1kb windows around origins. Figure 3a provides a qualitative depiction of the contribution of dinucleosomes (input, ~300bp).

      -the repliscope package was published here:

      Batrakou, D., Müller, C., Wilson, R., Nieduszynski, C. (2020). DNA copy-number measurement of genome replication dynamics by high-throughput sequencing: the sort-seq, sync-seq and MFA-seq family. Nature Protocols 15(3), 1255 - 1284. https://dx.doi.org/10.1038/s41596-019-0287-7

      The reference has been corrected.

      Reviewer #1 (Significance):

      This work builds upon a body of work from the Rhind group (and others) to determine the contribution of MCM load to replication origin activation dynamics. To my mind this is the most convincing dataset and analysis to date and goes a long way to supporting the model that the efficiency of MCM loading is a major factor in determining the mean replication time of an origin. As the authors state, they are still not able to distinguish between two different models of MCM load (single vs multiple). It would be interesting for the authors to discuss how these two models could be distinguished in the future (perhaps with single cell/molecule experiments).

      This study will be of interest to those in the fields of DNA replication and genome stability.

      My field of expertise is DNA replication and replication origin function.

      Reviewer #2 (Evidence, reproducibility and clarity):

      **Summary:**

      This is a nice study that characterizes the consequences of limiting or increasing Mcm expression on the replication program. Prior ChIP experiments in yeast have observed that not all origins exhibit the same level of Mcm enrichment and that increased mcm enrichment was correlated with origin activity. These observations led to two different models -- a) that multiple Mcm2-7 double hexamer complexes are loaded at some origins and b) a probabilistic model where the differential enrichment of Mcm2-7 reflected the fraction of cells in a population that had loaded the Mcm2-7 complex at a specific origin. While the titration experiments presented here don't provide any conclusive support for either model, they do provide some novel and relevant insights for the replication field, in part, due to the increased resolution and quantification afforded by the MNase ChIP-seq approach (and S. pombe spike in). The authors very nicely demonstrate that origins are differentially sensitive to Mcm2-7 depletion and that loss of Mcm2-7 loading results in an altered replication timing profile. The origins most impacted by loss of Mcm2-7 are 'weak' origins as described by the Fox group. Intriguingly, the authors find that the 5X overexpression of Mcm2-7 does not perturb the relative Mcm2-7 loading at individual origins, but rather instead globally represses Mcm2-7 association at all origins. They also find that overexpression of both Cdt1 and Mcm2-7 is detrimental to the cell (although no obvious replication phenotype was observed). Finally, the authors present a reasonable interpretation of their data in the context of models for replication timing which was very well articulated.

      **Major Comments:**

      From the methods it appears that different analyses were performed with different replicates?

      "Replicate #1 was used for all analyses except for V plots, for which the higher resolution Replicate #2 was used."

      Ideally all of the conclusions should be supported by all the replicates independently, or if the replicates are concordant -- they should be merged (at a similar sequencing depth) prior to doing the analyses.. Even the v-plots with merged replicates will be informative due to the greater sequencing depth.

      Though we agree that greater sequencing depth would be informative for aggregation analysis, we think that one of the main strengths of our study is the analysis of MCM quantitation and replication timing in the same population of cells. Although the experiments were performed in exactly the same way, there is always slight biological or temporal differences between the replicates, due to the complicated nature of the experimental design. This variation increases the noise between the MCM ChIP and the replication timing analyses. Therefore, were analyzed the replicates separately. However, we did do all of the analyses on both replicates and got similar results. We have now explicitly stated as much.

      The authors should provide a separate analysis for the larger nucleosomal sized fragments and smaller putative MCM double hexamer fragments with regards to the Mcm loading and relationship to ACS and orientation. They may represent an interesting intermediate with mechanistic consequences for the interpretation.

      We will include the suggested analysis in a subsequent revision.

      The authors should present the v-plots and an analysis of which side the Mcm's load for the overexpression studies. I was surprised that there was no further in-depth analysis for these two extremes. Perhaps similar conclusions will be reached, but it should at least be mentioned/presented as a supplementary figure.

      We will include the suggested analysis in a subsequent revision.

      **Minor Comments:**

      This is largely semantic, but the majority of MNase ChIP-seq signal recovered is associated with the nucleosomes and not in the NDR and as the signal in the NDR is differentially sensitive to digestion, I would suggest rephrasing the following sentence:

      "In contrast to previous genome-wide reports (Belsky et al., 2015), but in agreement with recent in-vitro cryo-EM structures (Miller et al., 2019), we also observe MCM signal in the nucleosome-depleted region (NDR) of origins. "

      to :

      "In agreement with a previous genome-wide report (belsky 2015), we found that the bulk of the MCM signal was associated with nucleosomal sized fragments; however the increased resolution afforded by our approach allowed us to also detect protected fragments in the NDR as predicted by recent in vitro cryo em structures..."

      We have modified the sentence as suggested.

      As a sanity check, please double check V-plots and presence of small fragments with the digestion conditions. In the Henikoff manuscript the bulk of sub-nucleosomal fragments were lost with the longer digestion time. Specifically, the TF footprints were more pronounced with minimal digestion. While it might be argued that the longer digestion more tightly resolved the binding site, in many cases they were completely lost with the 20 minute digestion. This is just a simple check -- I don't doubt the results as reported given the experimental conditions are very different. For example, the henikoff manuscript did not use cross linking or an antibody enrichment step.

      We double checked and confirmed that more small fragments are found in the more digested library. The reason that we see more small fragments when we digest more, in contrast to the contrary observation in the Henikoff paper is presumably because MCM has a larger footprint than a transcription factor and protects that footprint more effectively.

      Last paragraph of the "MCM associates with nucleosomes section" which reports that the Mcm2-7 complex is loaded up or downstream from the ACS independent of orientation should cite Belsky 2015 (Figure 5 and discussion) for the initial observation.

      Done.

      The authors argue that the global reduction in MCM loading associated with overexpression may be a technical artifact given that all origins exhibit a proportional reduction in mcm2-7 loading. However, this is exactly what the S. pombe spike in control is intended for. The relative difference between individual origins resulting from Mcm2-7 depletion would still be evident without the spike in. The authors do discuss different possibilities, but I would not be so keen to discard this as technical artifact.

      We, too, are reluctant to dismiss this result as a technical artifact. However, we are at a loss to offer any other explanation. We raise a handful of biological possibilities in the Discussion, but dismiss each one as failing to account for our results. We would be happy to entertain other suggestions.

      Reviewer #2 (Significance):

      This work has several advances that will be appreciated by the replication field -- including a high resolution view of Mcm2-7 loading in the context of chromatin; the impact of titrating (low and high) MCM expression on MCM loading and replication timing program; and a well reasoned discussion of how different models of MCM loading would impact origin activation and replication timing program. The work builds on prior studies in the field (eg. Belsky 2015), while some of the conclusions regarding the localization of the Mcm2-7 complex relative to the ACS and surrounding nucleosomes are confirmatory, the increased resolution provides new insight (like the enrichment of small fragments in the NDR) that could be further strengthened by additional analysis (see above).

      My expertise is DNA replication and chromatin.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors use Auxin-mediated degradation of Mcm4 to reduce the concentration of the MCM helicase complex in yeast, and determine the effects of this reduction on both MCM-origin association (interpreted as MCM loading) by MNase-MCM-ChIPSeq and on replication origin function by Sync-Seq replication timing experiments (deep sequencing of a yeast population as it progresses through a synchronized S-phase). Complementary experiments testing the effect of induced MCM complex over-expression on MCM-origin association are also performed.

      The authors find that reducing Mcm4 levels (and thus loading-competent MCM complexes) causes yeast cells to be more sensitive to DNA replication stress. In addition, not all origins are equally susceptible to reductions in MCM levels; the origins that do lose MCM binding at reduced MCM levels show a reduction in activity and an associated delay in their replication time under those conditions. Finally, over-expression of the MCM complex has no effect on MCM-origin association or origin function, suggesting that MCM levels are not limiting for origin licensing in yeast under normal lab conditions. The strengths of the study are the well-executed experiments and very nice data that are presented. However, there are several weaknesses. The authors make conclusions that are not supported by their data; and several of the outcomes are not at all unexpected based on extensive published studies in yeast and mammalian cells, raising issues about whether this study advances and/or clarifies the current gaps in the field. While some of the relevant past studies were referenced, the authors did not place their own study in the context to published work and current models in the field, which reduced the scholarly value of their study. Because the work was not placed in context of the field, some of the rationale and conclusions were misleading.

      **Some specific major comments:**

      1,The title is misleading. The authors have clearly shown that when MCM levels are be made limiting in an engineered system, some origins are substantially less active, which means that these origin loci are replicated "passively" (i.e. by a Replication Fork (RF) emanating from a distal origin) rather than actively (i.e. by "firing" and initiating replication). Their own replication data show that. But this competition is only revealed when MCM levels are artificially/experimentally lowered. What is the evidence that competition for MCM complexes among individual origins establishes replication timing patterns in yeast? If anything, the over-expression experiment suggests the opposite--that MCM levels are not limiting and therefore do not play a substantial role in establishing the replication timing patterns that are observed in yeast. Instead those patterns appear to result primarily from the fact that MCM complex activation factors are present in limiting concentrations relative to origins.

      We agree with the reviewer's analysis and have revised the title to "The Capacity of Origins to Load MCM Establishes Replication Timing Patterns".

      2,The abstract states that "the number of MCMs loaded onto origins has been proposed to be a key determinant of when those origins initiate DNA replication during S-phase". While it is true that this lab has proposed this model in budding yeast, the current study performs no experiments that directly address this model--i.e. that i. individual origins possess a different number of MCM complexes and or ii that these differences underlie timing differences. They acknowledge this point in their Discussion--a ChIPSeq experiment is an ensemble experiment--there is no way to know that differences in MCM signals correspond to a different number of MCM complexes per origin versus a differences in the fraction of cells that contain and MCM complex at all at a given origin . But this statement in the abstract, combined with their conclusion in the same section of the paper: "Our results support a model in which the loading activity of origins, controlled by their ability to recruit ORC and compete for MCM, determines the number of helicases loaded, which in turn affects replication timing" implies that they have tested a model that they have not tested. Given how quickly readers "skim" the literature these days, a misleading abstract can do a lot of damage to a field. The results presented in this study neither support nor refute the model for the number of helicases loaded per origin, and the fact that reducing origin licensing efficiency by making the major substrate limiting reduces the number of licensed origins in a cell population is fully expected based on the current state of the field .

      Four questions are addressed in this comment. The first is whether there is variable MCM stoichiometry at origins. The second is whether that variation ranges from 0 to 1 and 0 to many. The third is if the variation is stoichiometry affects replication timing. The fourth is how this variation in stoichiometry comes about.

      Our work is based on the conclusion, supported by a substantial body of literature, that MCM loading stoichiometry varies among origins. Our data in this paper further supports this conclusion.

      As the reviewer notes, and as we had tried to make clear, the data is this paper does not address the range of the variation. Moreover, as we also tried to make clear, our hypotheses, results and conclusions are not affected by whether the range is 0 to 1 or 0 to many.

      This paper focuses on Questions 3 and 4. We have reworked the introduction to make these distinctions more clear.

      We have also corrected the abstract to refer to "the stoichiometry", instead of "the number", of MCMs.

      3,The rationale for the study as stated in the Introduction: "Although the molecular biochemistry of initiation at individual origins continues to be elucidated in great detail (Bleichert, 2019), the mechanism governing the time at which different regions of the genome replicate has remained largely elusive (Boos and Ferreira, 2019)." Is also misleading. In fact, in budding yeast (and other organisms) there have been several advances in this area particularly with respect to DNA replication origin activation. The S-phase origin activation factors are limiting for origin function, and factors such as Ctf19 at centromeres and Fkh1/2 at non-centromeric early-acting origins help to directly recruit the limiting S-phase factor, Dbf4, to origins. It is misleading to ignore this substantial progress and not make an effort to place this current study, which is important and one of the first to look directly at MCM loading control in yeast, into a relevant context with respect to what is known. What's interesting is that this S-phase model assumes/requires that most origins are, in fact, licensed and thus that differences in licensing efficiency are not a major driving of replication timing patterns in yeast. But we do not know why there are only subtle differences in MCM loading---this study may help explain that.

      We have broadened the scope of our Introduction and Discussion to address these points. However, it is not the case that "there are only subtle differences in MCM loading". MCM ChIP-seq (, and this paper) and MCM ChEC-seq both show well over ten-fold variation in MCM stoichiometry at origins. We have now explicitly made this point in the Introduction.

      4,The authors link the differential ability of MCM loading deficiencies when MCM is made limiting to differences in ORC binding categories. The "weak" origins, that presumably bind ORC weakly, were most affected by reductions in MCM. Are these origins less efficient than the other categories, DNA and chromatin-dependent (using the origin efficiency metric data from the Whitehouse lab) where MCM binding is not reduced as much? In normal cells are these early or late origins? Is the idea that the role of excess MCM is to achieve a sufficient number or "back up" origins per cell to deal with potential stress, as proposed by the Blow and Schwob labs in tissue culture cells many years ago? It seems likely that the data reported here are in fact confirmations of those early studies in mammalian cells---which is useful to know even if not unexpected.

      We will include the suggested analyses in a subsequent revision.

      Excess MCM do, as has been long appreciated and as we discuss, contribute to replication-stress tolerance. However, that is not a major point of our paper.

      5,Aren't the results that losing MCM signal corresponds to loss of origin activity peaks entirely expected? The same result would be obtained if you made a point mutation in that origin's ACS. Of course preventing an origin from being licensed will delay that region's replication time in S-phase because it now must be replicated passively. Licensing affects replication timing patterns because the MCM complex is the substrate for limiting S-phase factors, but that is far different from concluding that the number of MCMs at an origin is what controls the time in S-phase when an origin is activated.

      Yes, "the results that losing MCM signal corresponds to loss of origin activity peaks [are] entirely expected". However, this is not the important result. The key result is that the distribution of MCM at origins is not uniformly affected, which leads to our conclusions that, in wild-type cells, origin capacity dominates MCM stoichiometry and that, when MCM become limiting, origin activity (probably determined by ORC affinity) becomes critical—neither of which were expected results. In any case, the expected correlation between MCM loading and origin activity was observed as a consequence of measuring MCM stoichiometry and replication timing and is an obvious analysis to include, so we did so.

      6,The authors stated that the measured MCM abundance for the 43% of origins that are not known to be controlled by the multiple mechanisms that have been shown to control origin replication time. Is this because they think that MCM loading contributes to the timing control of only these origins? Was MCM loading not affected at any of these other origins when MCM levels were reduced? Are those 43% of origins in the "weak" binding category in terms of ORC? The rationale for eliminating so many origins from these analyses were not clear.

      We propose that the probability of origin activation is the product of the stoichiometry of MCM at the origin and the rate of MCM activation, which may be affected by trans-acting factors. For the 43% of origins for which there is no known trans-acting regulation, the correlation with stoichiometry is stronger. However, the correlation holds when looking at all origin, too. The suggestion to look at only the 57% of origins with known trans-action regulation is a good one. We will include this analysis and the other suggested analyses in a subsequent revision.

      7,Doesn't the data in Figure 4c at 0 mM auxin support the conclusion that differences in MCM ChIP signals have negligible effects on origin activation time, in contrast to the publication by Das, 2015 from this lab? Or is the point that these origins are sensitive to reductions in MCM levels and the more sensitive they are the more delayed their replication time (but again, doesn't that have to be true? If they are losing MCM signals they cannot function as origins, so they are replicated passively and, by definition, will show delayed replication timing. An origin is defined as such by a loaded MCM complex.)

      No. The reason the correlation in 4c is not a good as in our previous work is that in Das 2015 we compared origin-activation efficiency (calculated from our stochastic model in Yang 2010), instead of T_rep, which we used here. T_rep is a convolution of origin-activation time and passive-replication time, reducing to correlation. The important observation is that the correlation gets better as MCM levels are reduced.

      The correlation between MCM stoichiometry and activation efficiency may seem trivial, but just because a model is simple does not mean it is not correct. If stoichiometry was the only factor regulating origin activation, we would expect a stronger correlation. So, we conclude that there are other factors at play, quite possible the trans-acting factors that the reviewer mentions in their second point. However, if stoichiometry played no role, we would expect no correlation. So, we propose that MCM stoichiometry is "an important determinant of replication timing".

      8,I do not understand the conclusions from Figure 4d. There is an extremely small positive correlation between how much of an MCM signal is lost and delay in replication time of an origin, but this correlation is not surprising as an unlicensed origin cannot be an origin and will be replicated passively. What seems most surprising about these data is that the effect is so weak, not that it exists. There is quite a lot of scatter in this plot at 500 uM auxin, with some origins losing a given amount of signal (x) and being only slightly delayed in replication time, and others losing the same amount of signal (x) and being substantially delayed. What underlies this outcome?--Are the ones that are not substantially delayed closer to origins that have not been affected at all by MCM reductions? Why is the correlation so weak? The other regulators of origin activation time have stronger and more precise effects--for example the centromere-control can be precisely eliminated so that only the replication time of the centromere-proximal origins are delayed.

      We believe that much of the noise in Figure 4d is due, as the reviewer suggests, to passive replication of origins which lose most of their MCM signal and become inactive but happen to reside next to origins which don’t lost any MCM signal and fire early. And excellent example is ARS 510 (see Figure 4a). ARS510 loses most of its MCM signal and clearly loses its initiation peak in the T_rep plot. However, because it is next to ARS511, which does not lose much MCM signal and which remains a efficient origin, ARS510 is still replicated early. We will include this example in a subsequent revision.

      9,Multiple studies in yeast and mammalian cells indicate that MCM subunits are in excess relative to other licensing and S-phase initiation factors, so it is not unexpected that over-expressing MCM did not lead to enhanced levels of licensing. It seems much more plausible that Cdc6 or Cdt1 or both factors are present in limiting amounts for MCM loading, so I did not understand the point of over-producing MCM subunits. If the "weak" origins are the ones that are most dramatically affected by reducing MCM to "limiting" levels, isn't the question whether you can increase licensing at these origins when you over-produce a factor that is likely limiting for licensing, such as Cdt1 or Cdc6 (or both) while leaving MCM at its normal levels. The fact that MCM levels are not limiting for licensing is not surprising and, if anything, argues against these levels having a regulatory role in origin activation timing---which seems to be the opposite of what the authors want to conclude.

      Orc1-6, Cdc6 and Cdt1 are all substoichiometric to MCM. However, they all act catalytically to load MCM. So, although they may be kinetically limiting, they do not prevent most or all MCMs being loaded in wild-type cells. The fact that overexpressing MCMs (with or without Cdt1) does not allow for more MCM loading suggests that under normal conditions origins are saturated with MCMs and have little or no capacity to load more MCM, even when given plenty of time to do so. From this result, we conclude that origin capacity is a major determinant of MCM loading in wild-type cells. From our MCM-reduction experiments, we also conclude that, when MCM is limiting, origin competition affects which origins load MCMs faster. However, we agree with the reviewer's first point, that our title gave the incorrect impression that we concluded that origin competition is the primary determinant of MCM loading in wild-type cells. Thus, as suggested, we have changed the title. We have also reworked the Introduction and Discussion to more clearly explain that competition is only a determining factor when MCMs are limited.

      In summary, I think the technical aspects of the experiments were quite strong, but I do not think that the experiments answered the question that was posed by the authors.

      **Minor points:**

      Many places where "This data" should be changed to "These data". Data are plural.

      See comments on this point in the response to Reviewer #2.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, the authors use Auxin-mediated degradation of Mcm4 to reduce the concentration of the MCM helicase complex in yeast, and determine the effects of this reduction on both MCM-origin association (interpreted as MCM loading) by MNase-MCM-ChIPSeq and on replication origin function by Sync-Seq replication timing experiments (deep sequencing of a yeast population as it progresses through a synchronized S-phase). Complementary experiments testing the effect of induced MCM complex over-expression on MCM-origin association are also performed.

      The authors find that reducing Mcm4 levels (and thus loading-competent MCM complexes) causes yeast cells to be more sensitive to DNA replication stress. In addition, not all origins are equally susceptible to reductions in MCM levels; the origins that do lose MCM binding at reduced MCM levels show a reduction in activity and an associated delay in their replication time under those conditions. Finally, over-expression of the MCM complex has no effect on MCM-origin association or origin function, suggesting that MCM levels are not limiting for origin licensing in yeast under normal lab conditions. The strengths of the study are the well-executed experiments and very nice data that are presented. However, there are several weaknesses. The authors make conclusions that are not supported by their data; and several of the outcomes are not at all unexpected based on extensive published studies in yeast and mammalian cells, raising issues about whether this study advances and/or clarifies the current gaps in the field. While some of the relevant past studies were referenced, the authors did not place their own study in the context to published work and current models in the field, which reduced the scholarly value of their study. Because the work was not placed in context of the field, some of the rationale and conclusions were misleading.

      Some specific major comments:

      1,The title is misleading. The authors have clearly shown that when MCM levels are be made limiting in an engineered system, some origins are substantially less active, which means that these origin loci are replicated "passively" (i.e. by a Replication Fork (RF) emanating from a distal origin) rather than actively (i.e. by "firing" and initiating replication). Their own replication data show that. But this competition is only revealed when MCM levels are artificially/experimentally lowered. What is the evidence that competition for MCM complexes among individual origins establishes replication timing patterns in yeast? If anything, the over-expression experiment suggests the opposite--that MCM levels are not limiting and therefore do not play a substantial role in establishing the replication timing patterns that are observed in yeast. Instead those patterns appear to result primarily from the fact that MCM complex activation factors are present in limiting concentrations relative to origins.

      2,The abstract states that "the number of MCMs loaded onto origins has been proposed to be a key determinant of when those origins initiate DNA replication during S-phase". While it is true that this lab has proposed this model in budding yeast, the current study performs no experiments that directly address this model--i.e. that i. individual origins possess a different number of MCM complexes and or ii that these differences underlie timing differences. They acknowledge this point in their Discussion--a ChIPSeq experiment is an ensemble experiment--there is no way to know that differences in MCM signals correspond to a different number of MCM complexes per origin versus a differences in the fraction of cells that contain and MCM complex at all at a given origin . But this statement in the abstract, combined with their conclusion in the same section of the paper: "Our results support a model in which the loading activity of origins, controlled by their ability to recruit ORC and compete for MCM, determines the number of helicases loaded, which in turn affects replication timing" implies that they have tested a model that they have not tested. Given how quickly readers "skim" the literature these days, a misleading abstract can do a lot of damage to a field. The results presented in this study neither support nor refute the model for the number of helicases loaded per origin, and the fact that reducing origin licensing efficiency by making the major substrate limiting reduces the number of licensed origins in a cell population is fully expected based on the current state of the field .

      3,The rationale for the study as stated in the Introduction: "Although the molecular biochemistry of initiation at individual origins continues to be elucidated in great detail (Bleichert, 2019), the mechanism governing the time at which different regions of the genome replicate has remained largely elusive (Boos and Ferreira, 2019)." Is also misleading. In fact, in budding yeast (and other organisms) there have been several advances in this area particularly with respect to DNA replication origin activation. The S-phase origin activation factors are limiting for origin function, and factors such as Ctf19 at centromeres and Fkh1/2 at non-centromeric early-acting origins help to directly recruit the limiting S-phase factor, Dbf4, to origins. It is misleading to ignore this substantial progress and not make an effort to place this current study, which is important and one of the first to look directly at MCM loading control in yeast, into a relevant context with respect to what is known. What's interesting is that this S-phase model assumes/requires that most origins are, in fact, licensed and thus that differences in licensing efficiency are not a major driving of replication timing patterns in yeast. But we do not know why there are only subtle differences in MCM loading---this study may help explain that.

      4,The authors link the differential ability of MCM loading deficiencies when MCM is made limiting to differences in ORC binding categories. The "weak" origins, that presumably bind ORC weakly, were most affected by reductions in MCM. Are these origins less efficient than the other categories, DNA and chromatin-dependent (using the origin efficiency metric data from the Whitehouse lab) where MCM binding is not reduced as much? In normal cells are these early or late origins? Is the idea that the role of excess MCM is to achieve a sufficient number or "back up" origins per cell to deal with potential stress, as proposed by the Blow and Schwob labs in tissue culture cells many years ago? It seems likely that the data reported here are in fact confirmations of those early studies in mammalian cells---which is useful to know even if not unexpected.

      5,Aren't the results that losing MCM signal corresponds to loss of origin activity peaks entirely expected? The same result would be obtained if you made a point mutation in that origin's ACS. Of course preventing an origin from being licensed will delay that region's replication time in S-phase because it now must be replicated passively. Licensing affects replication timing patterns because the MCM complex is the substrate for limiting S-phase factors, but that is far different from concluding that the number of MCMs at an origin is what controls the time in S-phase when an origin is activated.

      6,The authors stated that the measured MCM abundance for the 43% of origins that are not known to be controlled by the multiple mechanisms that have been shown to control origin replication time. Is this because they think that MCM loading contributes to the timing control of only these origins? Was MCM loading not affected at any of these other origins when MCM levels were reduced? Are those 43% of origins in the "weak"binding category in terms of ORC? The rationale for eliminating so many origins from these analyses were not clear.

      7,Doesn't the data in Figure 4c at 0 mM auxin support the conclusion that differences in MCM ChIPsignals have negligible effects on origin activation time, in contrast to the publication by Das, 2015 from this lab? Or is the point that these origins are sensitive to reductions in MCM levels and the more sensitive they are the more delayed their replication time (but again, doesn't that have to be true? If they are losing MCM signals they cannot function as origins, so they are replicated passively and, by definition, will show delayed replication timing. An origin is defined as such by a loaded MCM complex.)

      8,I do not understand the conclusions from Figure 4d. There is an extremely small positive correlation between how much of an MCM signal is lost and delay in replication time of an origin, but this correlation is not surprising as an unlicensed origin cannot be an origin and will be replicated passively. What seems most surprising about these data is that the effect is so weak, not that it exists. There is quite a lot of scatter in this plot at 500 uM auxin, with some origins losing a given amount of signal (x) and being only slightly delayed in replication time, and others losing the same amount of signal (x) and being substantially delayed. What underlies this outcome?--Are the ones that are not substantially delayed closer to origins that have not been affected at all by MCM reductions? Why is the correlation so weak? The other regulators of origin activation time have stronger and more precise effects--for example the centromere-control can be precisely eliminated so that only the replication time of the centromere-proximal origins are delayed.

      9,Multiple studies in yeast and mammalian cells indicate that MCM subunits are in excess relative to other licensing and S-phase initiation factors, so it is not unexpected that over-expressing MCM did not lead to enhanced levels of licensing. It seems much more plausible that Cdc6 or Cdt1 or both factors are present in limiting amounts for MCM loading, so I did not understand the point of over-producing MCM subunits. If the "weak" origins are the ones that are most dramatically affected by reducing MCM to "limiting" levels, isn't the question whether you can increase licensing at these origins when you over-produce a factor that is likely limiting for licensing, such as Cdt1 or Cdc6 (or both) while leaving MCM at its normal levels. The fact that MCM levels are not limiting for licensing is not surprising and, if anything, argues against these levels having a regulatory role in origin activation timing---which seems to be the opposite of what the authors want to conclude.

      In summary, I think the technical aspects of the experiments were quite strong, but I do not think that the experiments answered the question that was posed by the authors.

      Minor points:

      Many places where "This data" should be changed to "These data". Data are plural.

      Significance

      Significance: see above

      Referees Cross Commenting

      Reviewer 3. My overall conclusions about this study are that the data are extremely nice and useful to the field, but that their potential to advance the field or clarify it for 'outsiders' are limited by 1, a biased. model-centric presentation that fails to put the work in context of a lot of strong previous work. Some of the conclusions cannot event be tested by the experimental design 2, some of the data analyses, for example the parsing of origins for analyses of MCM effects versus effects on replication time seem arbitrary and were not clearly justified. 3, The correlation between reductions in MCM loading and Trep delay seemed weak, even after parsing for origins expected to experience the largest effects, which is actually kind of interesting, but was ignored in favor of the pre-determined interpretation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This is a nice study that characterizes the consequences of limiting or increasing Mcm expression on the replication program. Prior ChIP experiments in yeast have observed that not all origins exhibit the same level of Mcm enrichment and that increased mcm enrichment was correlated with origin activity. These observations led to two different models -- a) that multiple Mcm2-7 double hexamer complexes are loaded at some origins and b) a probabilistic model where the differential enrichment of Mcm2-7 reflected the fraction of cells in a population that had loaded the Mcm2-7 complex at a specific origin. While the titration experiments presented here don't provide any conclusive support for either model, they do provide some novel and relevant insights for the replication field, in part, due to the increased resolution and quantification afforded by the MNase ChIP-seq approach (and S. pombe spike in). The authors very nicely demonstrate that origins are differentially sensitive to Mcm2-7 depletion and that loss of Mcm2-7 loading results in an altered replication timing profile. The origins most impacted by loss of Mcm2-7 are 'weak' origins as described by the Fox group. Intriguingly, the authors find that the 5X overexpression of Mcm2-7 does not perturb the relative Mcm2-7 loading at individual origins, but rather instead globally represses Mcm2-7 association at all origins. They also find that overexpression of both Cdt1 and Mcm2-7 is detrimental to the cell (although no obvious replication phenotype was observed). Finally, the authors present a reasonable interpretation of their data in the context of models for replication timing which was very well articulated.

      Major Comments:

      From the methods it appears that different analyses were performed with different replicates?

      "Replicate #1 was used for all analyses except for V plots, for which the higher resolution Replicate #2 was used."

      Ideally all of the conclusions should be supported by all the replicates independently, or if the replicates are concordant -- they should be merged (at a similar sequencing depth) prior to doing the analyses.. Even the v-plots with merged replicates will be informative due to the greater sequencing depth.

      The authors should provide a separate analysis for the larger nucleosomal sized fragments and smaller putative MCM double hexamer fragments with regards to the Mcm loading and relationship to ACS and orientation. They may represent an interesting intermediate with mechanistic consequences for the interpretation.

      The authors should present the v-plots and an analysis of which side the Mcm's load for the overexpression studies. I was surprised that there was no further in-depth analysis for these two extremes. Perhaps similar conclusions will be reached, but it should at least be mentioned/presented as a supplementary figure.

      Minor Comments:

      This is largely semantic, but the majority of MNase ChIP-seq signal recovered is associated with the nucleosomes and not in the NDR and as the signal in the NDR is differentially sensitive to digestion, I would suggest rephrasing the following sentence:

      "In contrast to previous genome-wide reports (Belsky et al., 2015), but in agreement with recent in-vitro cryo-EM structures (Miller et al., 2019), we also observe MCM signal in the nucleosome-depleted region (NDR) of origins. "

      to :

      "In agreement with a previous genome-wide report (belsky 2015), we found that the bulk of the MCM signal was associated with nucleosomal sized fragments; however the increased resolution afforded by our approach allowed us to also detect protected fragments in the NDR as predicted by recent in vitro cryo em structures..."

      As a sanity check, please double check V-plots and presence of small fragments with the digestion conditions. In the Henikoff manuscript the bulk of sub-nucleosomal fragments were lost with the longer digestion time. Specifically, the TF footprints were more pronounced with minimal digestion. While it might be argued that the longer digestion more tightly resolved the binding site, in many cases they were completely lost with the 20 minute digestion. This is just a simple check -- I don't doubt the results as reported given the experimental conditions are very different. For example, the henikoff manuscript did not use cross linking or an antibody enrichment step.

      Last paragraph of the "MCM associates with nucleosomes section" which reports that the Mcm2-7 complex is loaded up or downstream from the ACS independent of orientation should cite Belsky 2015 (Figure 5 and discussion) for the initial observation.

      The authors argue that the global reduction in MCM loading associated with overexpression may be a technical artifact given that all origins exhibit a proportional reduction in mcm2-7 loading. However, this is exactly what the S. pombe spike in control is intended for. The relative difference between individual origins resulting from Mcm2-7 depletion would still be evident without the spike in. The authors do discuss different possibilities, but I would not be so keen to discard this as technical artifact.

      Significance

      This work has several advances that will be appreciated by the replication field -- including a high resolution view of Mcm2-7 loading in the context of chromatin; the impact of titrating (low and high) MCM expression on MCM loading and replication timing program; and a well reasoned discussion of how different models of MCM loading would impact origin activation and replication timing program. The work builds on prior studies in the field (eg. Belsky 2015), while some of the conclusions regarding the localization of the Mcm2-7 complex relative to the ACS and surrounding nucleosomes are confirmatory, the increased resolution provides new insight (like the enrichment of small fragments in the NDR) that could be further strengthened by additional analysis (see above).

      My expertise is DNA replication and chromatin.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript follows on from previous work from the Rhind lab to investigate whether the load of MCMs at origins is a factor in when the origin activate (as a population average) during S phase. The authors use budding yeast and a auxin degron system to modulate the levels of an MCM subunit. This allows them to titrate down the concentration of the MCM hexamer and observe the effect. Crucially, they assay both the reduction in MCM load at origins and the subsequent replication dynamics in the same experiment. This is the power of their approach and allows them to rigorously test their hypothesis.

      Major comments

      1.I found the introductory paragraph discussing the Rhind lab hypothesis about the possibility of multiple MCM being loaded at origins somewhat misleading. The first paragraph of the discussion was much clear. However, I feel that the introductory paragraph should deal with the difference between the two proposals: 0-1 MCM-DH per origin (de Moura et al), vs 0-50+ MCM-DH (Yang et al). It s also important to note that Foss et al find that "In budding yeast, [MCM] complexes were present in sharp peaks comprised largely of single double-hexamers" - i.e. consistent with 0-1 MCM-DH per origin.

      To improve the balance of the introduction, I think the authors should briefly introduce the concepts behind the 0-1 MCM-DH per origin; this was defined as origin competence by Stillman and clearly described by McCune et al (2008; see figure 8) prior to the work from de Moura et al. Furthermore, in the discussion the authors should be more even-handed. To date there is no data to conclusively rule one way or the other in distinguishing between single vs multiple MCMs. The authors cite Lynch et al and state "overexpression of origin-activating factors in S phase causes most all origins to fire early in S phase, consistent with most origins having at least one MCM loaded". However, Lynch et al report equivalent (roughly equal) origin efficiencies, but the assay doesn't distinguish between all going up to high efficiency or all going to a lower intermediary efficiency. Given that fork factors (polymerases, etc) are likely to become limiting at some point (or checkpoints could be activated due to limited dNTP supplies) it would seem plausible that uniform origin efficiency could be a consequence of less than maximal origin firing. As part of this discussion it would be useful for the authors to include what conclusions have been reached on MCM load from in vitro systems (with chromatin substrates).

      2.The authors are not the first to look at the consequence of reduced MCM concentrations on origin function. This was essentially the basis for the MCM screen undertaken by Bik Tye's lab that first identified the MCM genes. In addition to temperature sensitive mutants, the Tye group also examined heterozygotes (Lei et al., 1996) to show differential effect on the ability of two origins to support plasmid replication. The authors finds are entirely consistent with these early studies, particularly since ARS416 (formerly ARS1) was found to highly sensitive to reduced MCM levels and ARS1021 (formerly ARS121) was found to be insensitive to MCM levels. The authors find a signifiant reduction in MCM load at ARS416, but the MCM load at ARS1021 is unaltered by reduced MCM concentration. It would be worth the authors noting this consistency. The authors do cite the Lei study, but not in this context. The original MCM screen was published here: Maine, G., Sinha, P., Tye, B. (1984). Mutants of S. cerevisiae defective in the maintenance of minichromosomes Genetics 106(3), 365 - 385. Furthermore, at the end of the discussion the authors state that "it will be interesting to dissect the specific cis- and trans-acting factors that make origins sensitive or resistant to changes in MCM levels". The equivalent effect reported by the Tye lab has already been dissected by the Donaldson lab (Nieduszynski et al., 2006) and perhaps it would be worth briefly mentioning their findings.

      3.The authors should show the flow cytometry data for each of their cell cycle experiments, if only in supplementary figures. This is important to allow a reader (and reviewer) to judge the level of synchrony achieved when interpreting the results.

      4.I think the authors should show the ChIP signal at some example origins, including ones sensitive and insensitive to the reduction in MCM concentration. Currently all the high resolution ChIP data (i.e. over 1400 bp, e.g. Fig 3a) is presented as meta-analyses of many origins.

      5.When describing the results in Fig 4a the authors focus on changes (highlighted in black boxes) that fit their expectation. However, there are other sites that should at least be mentioned that don't seem to fit the authors model, e.g. ARS517, ARS518. It would be worth discussing what fraction of the timing data can be explained by the reduced MCM load.

      Minor comments

      -These data, rather than this data (throughout).

      -the authors should clearly state in figure legends what window size has been used in analysing genomic data.

      -in figure 2a the authors show pairwise comparisons between conditions, it would be nice to see the 3rd pairwise comparisons perhaps as a supplementary figure

      -in figure 2c it would be clearer to use the same colour for the lines and the points

      -the authors should avoid the use of red/green colour combinations in their figures (see: https://thenode.biologists.com/data-visualization-with-flying-colors/research/)

      -in the text the authors state "ORC binding to the ACS and subsequent MCM loading is a directional process dependent on a ACS- site and a similar but inverted nearby sequence (Xu et al., 2006)". I think it would be more appropriate to cite the following study here: Coster, G., Diffley, J. (2017). Bidirectional eukaryotic DNA replication is established by quasi-symmetrical helicase loading Science (New York, NY) 357(6348), 314 - 318. https://dx.doi.org/10.1126/science.aan0063

      -the list of factors that influence replication timing should include Rif1, whereas it is less clear that Rpd3 acts within the unique genome (as opposed to indirectly via repetitive DNA, e.g. rDNA)

      -figure 4 - it might help to mark the centromere on panel a. Also, why do the ChIP peaks and annotated origins appear to line up so poorly?

      -figure 4d - would it not be better to use fraction of lost MCM signal on the x-axis as in previous figures?

      -"with galactose or raffinose, to induce or repress Mcm2-7 overexpression, respectively." This is incorrect, raffinose does not repress this promoter (that requires glucose).

      -the S. pombe spike in is a great addition to the over expression experiments. It's a shame that it wasn't included in the auxin experiments.

      -why does the data in fig 5d appear to be at much lower resolution that the previous ChIP data?

      -in the sequencing analysis pipeline for MCM ChIP the authors use a 650 bp upper size limit; why have such a large threshold compared to the size of a nucleosome? Are the analyses and findings sensitive to this size threshold?

      -the repliscope package was published here:

      Batrakou, D., Müller, C., Wilson, R., Nieduszynski, C. (2020). DNA copy-number measurement of genome replication dynamics by high-throughput sequencing: the sort-seq, sync-seq and MFA-seq family. Nature Protocols 15(3), 1255 - 1284. https://dx.doi.org/10.1038/s41596-019-0287-7

      Significance

      This work builds upon a body of work from the Rhind group (and others) to determine the contribution of MCM load to replication origin activation dynamics. To my mind this is the most convincing dataset and analysis to date and goes a long way to supporting the model that the efficiency of MCM loading is a major factor in determining the mean replication time of an origin. As the authors state, they are still not able to distinguish between two different models of MCM load (single vs multiple). It would be interesting for the authors to discuss how these two models could be distinguished in the future (perhaps with single cell/molecule experiments).

      This study will be of interest to those in the fields of DNA replication and genome stability.

      My field of expertise is DNA replication and replication origin function.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Abrams and Nance describes how the polarity proteins PAR-6 and PKC-3/aPKC promote lumen extension of the unicellular excretory canal in C. elegans. Using tissue-specific depletion methods they find that CDC-42 and the RhoGEF EXC-5/FGD are required for luminal localization of PAR-6, which recruits the exocyst complex required for lumen extension. Interestingly, they show that the ortholog of the mammalian exocyst receptor, PAR-3, is dispensable for luminal membrane extension. Overall, this is a well-written and interesting manuscript.*

      1.Because depletion of PAR-3 in the canal causes milder defects than PAR-6 or CDC-42 the authors suggest that they cannot rule out the possibility that an alternative isoform of PAR-3 is expressed and buffering the defect. They should perform canal-specific RNAi-mediated depletion of the entire PAR-3 gene to determine if this is true.

      We agree with Reviewer 1 that it would be useful to provide additional evidence that an alternative isoform of PAR-3 lacking the ZF1 degron is not expressed. While tissue-specific RNAi could be used, we have not been successful obtaining complete knockdown in previous tissue-specific RNAi experiments. Moreover, this approach does not target any maternal PAR-3 protein that may be inherited by the excretory cell. As an alternative approach to address this point, we will analyze par-3::zf1::yfp/par-3(null) worms following excretory-cell-specific expression of zif-1, and compare to par-3::zf1::yfp/par-3::zf1::yfp siblings. We would expect the excretory cell phenotype to become more severe if additional, ‘phenotype-buffering’ forms of PAR-3 were present, or if there was incomplete degradation of PAR-3::ZF1::YFP in our previous experiments.

      2.The authors suggest that GTP-loaded (activated) CDC-42 recruits PAR-6 to the luminal membrane. It would be nice if they could use a biosensor, such as the GBD-WSP-1 reagent from Buechner's lab to confirm that EXC-5 depletion also reduces activated CDC-42, as would be expected. This should be achievable since there is strong CDC-42 signal, even in the cytoplasm.

      This is an excellent suggestion. We will utilize a CDC-42 biosensor – an integrated cdc42p::gfp::wsp-1(gbd) strain created in our lab and previously validated and characterized (Zilberman et al. 2017). We have confirmed that the biosensor is detected in the excretory canal and appears enriched at or near the lumenal membrane. We will cross the biosensor into the exc-5::zf1::mScarlet background. This will allow us to assess lumenal enrichment, and using heat shock inducible ZIF-1, determine if there is a reduction in biosensor lumenal enrichment when EXC-5::ZF1::mScarlet is acutely degraded.

      If the biosensor is difficult to measure at the canal lumen, an alternative approach would be to use an available exc-5 null allele to examine genetically if cdc-42 and exc-5 are acting in the same pathway. We could cross CDC-42exc(-) larvae into exc-5(rh232) and quantify excretory canal phenotypes. If CDC-42 and EXC-5 are indeed functioning in the same pathway we would expect no enhancement of the CDC-42exc(-) phenotype.

      3.Related to point 2, (i) does mutation of the CRIB domain of PAR-6 impair its recruitment to the luminal membrane, and (ii) does this mutant exacerbate canal defects when PAR-3 is depleted?

      (i) Our lab has previously generated and characterized a transgenic par6P::par-6(**CRIB)::gfp strain (Zilberman et al., 2017). We will examine this strain to determine if expression is detectable in the excretory canal, and if so, we will compare lumenal enrichment of PAR-6(CRIB)::GFP to control worms expressing wild-type PAR-6::GFP.

      (ii) This is a very interesting experiment, as it would help address if the mild phenotype observed in PAR-3 depleted animals is due to the remaining PAR-6 that is recruited by CDC-42. Our lab has previously shown that par6P::par-6(**CRIB)::gfp cannot rescue the embryonic lethality of a par-6 mutant, in contrast to par-6::gfp (Zilberman et al. 2017). This indicates that the CRIB domain is needed for PAR-6 function during embryogenesis and suggests that CRIB domain mutations introduced by CRISPR would almost certainly be lethal, precluding analysis of the excretory cell.

      As an alternative experiment, we would determine if PAR-3 localizes to the lumenal membrane independently of CDC-42; such a finding would imply that PAR-3 and CDC-42 likely have independent contributions to PAR-6 localization (rather than CDC-42 promoting PAR-6 localization by localizing PAR-3). To do this, we will degrade ZF1::YFP::CDC-42 in the excretory cell and examine the localization of PAR-3::mCherry compared to controls. We have all of the strains needed for this experiment.

      4.The authors hypothesize that partial recruitment of PAR-6 by CDC-42 is sufficient for luminal membrane extension to explain the mild defects caused by PAR-3 depletion. Since depletion of PAR-6 and CDC-42 alone causes milder canal truncations the authors should co-deplete these proteins (as well as PAR-3 and CDC-42) to determine if there is an additive effect.

      This is an excellent suggestion in principal. However, it is not possible to know in any given degradation experiment whether the targeted protein is completely degraded; we can only say it is no longer detectable by fluorescence. Thus, any degron allele (in the presence of ZIF-1) could behave like a strong hypomorph rather than a null. It would not be possible to interpret double degradation experiments in such a case, as a more severe phenotype in the double could simply be a result of combining two hypomorphic alleles, further reducing pathway activity even if the genes function together in the same pathway. To interpret this experiment properly, a null allele of at least one of the genes would have to be used. This is not possible since par and cdc-42 null mutants are lethal and there is also maternal contribution. As an alternative to these double depletion experiments, we will deplete PAR-6::ZF1::YFP or PAR-3::ZF1::YFP in exc-5 null mutant larvae, as unlike cdc-42, exc-5 is not an essential gene.

      5.In figure 2, the authors show that depletion of PKC-3 causes more severe canal truncations than PAR-6. Since these proteins function in the same complex what do they think is the reason for this difference? This point could be discussed more in the manuscript.

      As described in the previous point, incomplete degradation could produce modestly different phenotypes even for genes that act in the same pathway. Therefore, it is not possible to determine whether PAR-6 and PKC-3 have different roles using this approach. We will add text to the discussion bringing up this point.

      6.Related to point 5, more experiments with PKC-3 should be done to determine if, for example, localization of SEC-10 is similarly affected as ablation of PAR-3, PAR-6 and CDC-42.

      We agree, and will address this point by acutely degrading ZF1::GFP::PKC-3 and examining transgenic SEC-10::mCherry, as we have done for other par genes.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): The manuscript by Abrams & Nance describes a precise investigation of the role of PAR proteins in the recruitment of the exocyst during and after the extension of the C. elegans excretory canal. State-of-the-art genetic techniques are used to acutely deplete proteins only in the targeted cell, and examine the localization of endogenously expressed markers. Experiments are well described and carefully quantified, with systematic statistical analysis. The manuscript is easy to follow and the bibliography is very good. Most conclusions are well supported.

      1) I am not entirely convinced by the presence of CDC-42 at the lumenal membrane (Fig3G); it seems to be more sub-lumenal that really lumenal. It peaks well before PAR-6 (Fig3H) which itself seem slightly less apical that PAR-3 (Fig3F). Could you use super-resolution microscopy (compatible with endogenous expression levels) to more precisely localize CDC-42? Similar point for PAR-3 and PAR-6 which do not seem to colocalize completely - a longitudinal line scan along the lumenal membrane might provide the answer even without super-resolution; this could help explain why these two proteins do not have the same function. These suggestions are easy to do provided the authors can have access to super-resolution (Airyscan to name it; although other methods will be perfectly acceptable I believe it is the most simple one).

      We agree that the CDC-42 localization peak does not precisely match the PAR-6 peak. As the reviewer notes, resolving the subcellular localization of these two proteins will not be feasible using standard confocal microscopy. We will image the ZF1::YFP::CDC-42; PAR-6:mKate strain using a Zeiss LSM 880 with Airyscan to determine if their subcellular localization patterns are distinct.

      To examine PAR-3 and PAR-6 colocalization at the lumen, we will acquire additional confocal images of the PAR-6-ZF1-YFP; PAR-3-mCherry strain and examine colocalization of the clusters along the lumenal membrane. As a positive control for two proteins that should co-localize, we will image ZF1::GFP::PKC-3; PAR-6-mKate; these two proteins bind directly and co-localize in nearly all cells in which they have been examined.

      2) The same group has described a CDC-42 biosensor to detect its active form. It could be used here to precisely pinpoint where active CDC-42 is required: in the cytoplasm? At the lumenal membrane? colocalizing with what other protein? This will require the expression of a transgene under an excretory cell specific promotor and a simple injection strategy while helping to strengthen the description of the CDC-42 role.

      See Reviewer 1 point #2.

      3) As the authors certainly know, there is a PAR-6 mutation which prevents its binding to CDC-42. They could express this construct in the excretory canal a simple extrachromosomal array should be sufficient) to validate the direct interaction between these proteins in this cell.

      See Reviewer 1 point #3.

      4) What is the lethality of ZIF-1-mediated depletion of the various factors under the exc promoter? Can homozygous strains be maintained? Authors just have to add a sentence in the Mat&Met section.

      All of the strains with excretory cell-specific degradation we have examined are viable when grown on NGM plates. We will add this point to the materials and methods.

      Provided that the authors have access to an Airyscan, all the questions asked here can be answered in two months (one month for constructs, one month for injection and data analysis) at a very minor cost.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Strengths of this manuscript include the use of endogenously tagged proteins (rather than over-expressed transgenes) for high resolution imaging and a cell-type specific acute depletion strategy that avoids complicating pleiotropies and allows tests of molecular epistasis. While some results were fairly expected based on prior studies of Cdc42, PAR proteins, and the exocyst in other tissues or systems, differences in the requirements for par-6 and pkc-3 vs. par-3 strongly suggest that the former genes play more important roles in exocyst recruitment. I was also excited to see a connection made between EXC-5 and PKC-3 localization.

      1.Lumen formation vs. lumen extension. The abstract and introduction use these two terms almost interchangeably, but they are not the same and more care should be taken to avoid the former term. The data here do not demonstrate any roles for par or other genes in lumen formation, but do demonstrate roles in lumen extension and organization/shaping.

      We agree and will correct wording to indicate that lumen extension is affected.

      2.Related to the above, mutant phenotypes here are surprisingly mild and variable. The authors discuss possible reasons for the particularly mild phenotype of par-3 mutants, but don't specifically address the mild phenotypes of the others. Clearly quite a bit of polarization and apical membrane addition occurs in ALL of the mutants. Is this because those early steps use other/redundant molecular players, or is depletion too late or incomplete to reveal an early role?

      We agree with Reviewer 3 and will bring up these points in the discussion. Degradation of proteins strongly predicted to function together (RAL-1 and SEC-5; PAR-6 and PKC-3) produce similar although not identical phenotypes; as discussed above we consider it likely that these differences reflect minor differences in degradation efficiency below our ability to detect by fluorescence. As Reviewer 3 points out, the excretory-specific driver we use to express ZIF-1 may not be active at the very earliest stages of lumen formation, and degradation could take 45 minutes or more after the promoter becomes active (Armenti et al, 2014). Thus, we agree that phenotypes could be more severe if it were possible to completely deplete each tagged protein prior to the onset of lumen formation. However, this caveat does not change the interpretations of our experiments since all proteins are degraded with the same driver. We have avoided mentioning that the phenotypes we observe reflect the ‘null’ phenotype for these reasons. We will emphasize these points in the discussion.

      The authors introduce a new reagent, "excP" (the promoter for T28H11.8), which they use to drive canal cell expression of ZIF-1 for their degron experiments. Please provide more information about when in embryogenesis this promoter becomes active, how that compares to when the par genes, sec-5, ral-1 and cdc-42 are first expressed, and what canal length is at that time. It would also be helpful to show the timeframe for degron-based depletion using this reagent (Figure 1C shows only depletion at L4, days later).

      Publicly available single cell RNA seq data (https://pubmed.ncbi.nlm.nih.gov/31488706/ and https://cello.shinyapps.io/celegans_explorer/) suggest that canal expression of the endogenous T28H11.8 gene doesn't really ramp up until the 580-650 minute timepoint, which is several hours after par gene canal expression (270-390 minutes) and the initiation of canal lumen formation (bean stage, 400-450 minutes). These data suggest that excP might come on too late to test requirements in lumen formation and early stages of extension. This caveat should be at least mentioned.

      See point #2 above. We agree that providing more information on expression from the T28H11.8 promoter would be important for interpreting the severity of phenotypes. We will raise this point in the discussion, and include existing published expression data and a more detailed analysis of the excP::mCherry transgene.

      3.There are two major aspects to the mutant phenotypes observed here: short lumens and cystic lumens. A short lumen makes sense intuitively, but the cysts could use a little more explanation. (What are cysts? What is thought to be the basis of their formation?). It is intriguing that cysts in sec-5 vs. ral-1 mutants (Figure 1) and par-6 vs. pkc-3 mutants (Figure 4) seem to have a very different size and overall appearance. Are these consistent differences, and if so, what could be the explanation for them?

      This is an interesting point. Since it is not practical to perform time-lapse imaging to watch canal cysts form, we analyzed only L1 and L4 larvae. We believe from our imaging that these are discontinuous regions of the lumen. One explanation for the expansion and dilation of the cystic lumens by L4 stage could be that the canal lumen has been expanded by fluid buildup resulting from a defect in canal function in osmoregulation, but we have not tested this directly. The reviewer also raises an interesting point regarding different appearances of cysts in SEC-5 and RAL-1 depleted larvae compared to PAR-6 and PKC-3. It is possible that these differences arise because SEC-5 and RAL-1 might direct whether vesicles will fuse at all, whereas PAR proteins direct where they will fuse in the cell (i.e. there could be fusion at basal surfaces, or just reduced apical fusion). We will bring up these points in the discussion.

      4.The authors did not test if PKC-3, like PAR-6, is required to recruit exocyst to the canal cell apical membrane, but their prior studies in the embryo suggested that it is (Armenti et al 2014). They also did not test if EXC-5 is required to recruit PAR-6 and the exocyst (along with PKC-3), or if CDC-42 is required to recruit PKC-3 (along with PAR-6). There seems to be an assumption that PAR-6 and PKC-3 are regulated and function in a common manner (as is often the case), but that has not been demonstrated here specifically. The basis for this assumption and alternatives to the linear model should be acknowledged.

      As discussed above (Reviewer 1 point #6), we will directly test whether PKC-3 is required to recruit SEC-10::mCherry to the lumenal membrane. We agree with Reviewer 3 that we have not shown that PAR-6 and PKC-3 always function similarly, although this is expected based on their similar phenotypes and co-dependent functions in other cells. We will mention this caveat in the discussion.

      5.EXC-5 is presumed to act upstream of CDC-42 based on shared phenotypes and the known Rho GEF activity of its mammalian homologs. However, direct evidence for this is currently lacking. In future, the authors might test if depleting EXC-5 affects CDC-42 activation/GTP-loading by using CDC-42 biosensors that have been reported in the literature (e.g. Lazetic et al 2018).

      See Reviewer 1 point #2.

      \*Minor comments:** Figure 1, Figure 4, Figure S3, Figure S4 Blue color/CFP indicates the apical/luminal membrane or the apical region of the canal cytoplasm, not the actual lumen as the labels suggest. The lumen is a hollow cavity on the opposite side of the plasma membrane from these markers, and it is shown as white in the Figure 1A upper right cartoon.*

      Thank you for pointing this out. We will correct the figure labelling.

      Figure 2, Figure S2 I'm not confident in the statistical analysis used here (Fisher's Exact test on two bins, 50% canal length), given that four length bins (not two) were defined. I recommend consulting a statistician.

      Our rationale for using two bins for the statistical analysis was because control larvae nearly all have a similar canal length (L1 stage: 99% of larvae have canal length that is 51-75% of body length; L4 stage: 98% of larvae have canal length that is 76-100% of body length), making it straightforward to ask if mutants are shorter. We chose not to make more granular phenotypic comparisons, as we cannot rule out that subtle differences in degradation efficiency, rather than differences in biological function, underlie any differences in canal length of the degron mutants. We will consult with a statistician to determine if this is an acceptable way to statistically compare controls and mutants.

      p.3 "Born during late embryogenesis..." Actually, the canal cell is born at ~270 minutes after first cleavage, which is in the first half of embryogenesis, not what I would call "late".

      We agree and will correct the wording.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The C. elegans excretory canal cell is a classic model for studying single cell tubulogenesis, where a cell establishes an intracellular apical domain that extends to form a lumen. Prior studies in this system have identified a set of gene products that localize to the growing apical domain and/or are important for its organization and size, but the molecular pathways through which these various gene products act remain poorly understood. Here, Abrams and Nance are able to connect the dots among several of these to flesh out a pathway for apical membrane addition. Specifically, they demonstrate that CDC-42 is needed to recruit PAR-6, and that PAR-6 is needed to recruit the exocyst to the apical membrane and to promote proper apical membrane growth and organization. EXC-5, a candidate GEF for CDC-42, also appears to act in this pathway.

      Strengths of this manuscript include the use of endogenously tagged proteins (rather than over-expressed transgenes) for high resolution imaging and a cell-type specific acute depletion strategy that avoids complicating pleiotropies and allows tests of molecular epistasis. While some results were fairly expected based on prior studies of Cdc42, PAR proteins, and the exocyst in other tissues or systems, differences in the requirements for par-6 and pkc-3 vs. par-3 strongly suggest that the former genes play more important roles in exocyst recruitment. I was also excited to see a connection made between EXC-5 and PKC-3 localization.

      Major comments:

      1.Lumen formation vs. lumen extension. The abstract and introduction use these two terms almost interchangeably, but they are not the same and more care should be taken to avoid the former term. The data here do not demonstrate any roles for par or other genes in lumen formation, but do demonstrate roles in lumen extension and organization/shaping.

      2.Related to the above, mutant phenotypes here are surprisingly mild and variable. The authors discuss possible reasons for the particularly mild phenotype of par-3 mutants, but don't specifically address the mild phenotypes of the others. Clearly quite a bit of polarization and apical membrane addition occurs in ALL of the mutants. Is this because those early steps use other/redundant molecular players, or is depletion too late or incomplete to reveal an early role?

      The authors introduce a new reagent, "excP" (the promoter for T28H11.8), which they use to drive canal cell expression of ZIF-1 for their degron experiments. Please provide more information about when in embryogenesis this promoter becomes active, how that compares to when the par genes, sec-5, ral-1 and cdc-42 are first expressed, and what canal length is at that time. It would also be helpful to show the timeframe for degron-based depletion using this reagent (Figure 1C shows only depletion at L4, days later).

      Publicly available single cell RNA seq data (https://pubmed.ncbi.nlm.nih.gov/31488706/ and https://cello.shinyapps.io/celegans_explorer/) suggest that canal expression of the endogenous T28H11.8 gene doesn't really ramp up until the 580-650 minute timepoint, which is several hours after par gene canal expression (270-390 minutes) and the initiation of canal lumen formation (bean stage, 400-450 minutes). These data suggest that excP might come on too late to test requirements in lumen formation and early stages of extension. This caveat should be at least mentioned.

      3.There are two major aspects to the mutant phenotypes observed here: short lumens and cystic lumens. A short lumen makes sense intuitively, but the cysts could use a little more explanation. (What are cysts? What is thought to be the basis of their formation?). It is intriguing that cysts in sec-5 vs. ral-1 mutants (Figure 1) and par-6 vs. pkc-3 mutants (Figure 4) seem to have a very different size and overall appearance. Are these consistent differences, and if so, what could be the explanation for them?

      4.The authors did not test if PKC-3, like PAR-6, is required to recruit exocyst to the canal cell apical membrane, but their prior studies in the embryo suggested that it is (Armenti et al 2014). They also did not test if EXC-5 is required to recruit PAR-6 and the exocyst (along with PKC-3), or if CDC-42 is required to recruit PKC-3 (along with PAR-6). There seems to be an assumption that PAR-6 and PKC-3 are regulated and function in a common manner (as is often the case), but that has not been demonstrated here specifically. The basis for this assumption and alternatives to the linear model should be acknowledged.

      5.EXC-5 is presumed to act upstream of CDC-42 based on shared phenotypes and the known Rho GEF activity of its mammalian homologs. However, direct evidence for this is currently lacking. In future, the authors might test if depleting EXC-5 affects CDC-42 activation/GTP-loading by using CDC-42 biosensors that have been reported in the literature (e.g. Lazetic et al 2018).

      Minor comments:

      Figure 1, Figure 4, Figure S3, Figure S4 Blue color/CFP indicates the apical/luminal membrane or the apical region of the canal cytoplasm, not the actual lumen as the labels suggest. The lumen is a hollow cavity on the opposite side of the plasma membrane from these markers, and it is shown as white in the Figure 1A upper right cartoon.

      Figure 2, Figure S2 I'm not confident in the statistical analysis used here (Fisher's Exact test on two bins, <50% and >50% canal length), given that four length bins (not two) were defined. I recommend consulting a statistician.

      p.3 "Born during late embryogenesis..." Actually, the canal cell is born at ~270 minutes after first cleavage, which is in the first half of embryogenesis, not what I would call "late".

      Significance

      Polarized plasma membrane addition is critical for the development of epithelial tissues, so understanding the mechanisms that control this is of broad interest to many cell and developmental biologists. This study will be of particularly high interest to researchers working on PAR proteins, the exocyst, or single cell tube development.

      The results here add to the existing body of evidence for PAR-dependent recruitment of exocyst to expanding apical/luminal surfaces (e.g. Bryant et al 2010; Jones et al 2011, 2014; Armenti et al 2014) and to evidence for key functional distinctions between PAR-6 & PKC-3 vs. PAR-3 (e.g. Achilleos et al 2010; Jones et al 2014). The results here are more robust than in those prior studies and more clearly illustrate directionality due to the authors' acute depletion strategy, which avoids major tissue disruptions that could secondarily affect protein localization.

      expertise keywords: C. elegans, epithelia, tubulogenesis

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Abrams & Nance describes a precise investigation of the role of PAR proteins in the recruitment of the exocyst during and after the extension of the C. elegans excretory canal. State-of-the-art genetic techniques are used to acutely deplete proteins only in the targeted cell, and examine the localization of endogenously expressed markers. Experiments are well described and carefully quantified, with systematic statistical analysis. The manuscript is easy to follow and the bibliography is very good. Most conclusions are well supported.

      I only have a few minor questions or remarks:

      1) I am not entirely convinced by the presence of CDC-42 at the lumenal membrane (Fig3G); it seems to be more sub-lumenal that really lumenal. It peaks well before PAR-6 (Fig3H) which itself seem slightly less apical that PAR-3 (Fig3F). Could you use super-resolution microscopy (compatible with endogenous expression levels) to more precisely localize CDC-42? Similar point for PAR-3 and PAR-6 which do not seem to colocalize completely - a longitudinal line scan along the lumenal membrane might provide the answer even without super-resolution; this could help explain why these two proteins do not have the same function. These suggestions are easy to do provided the authors can have access to super-resolution (Airyscan to name it; although other methods will be perfectly acceptable I believe it is the most simple one).

      2) The same group has described a CDC-42 biosensor to detect its active form. It could be used here to precisely pinpoint where active CDC-42 is required: in the cytoplasm? At the lumenal membrane? colocalizing with what other protein? This will require the expression of a transgene under an excretory cell specific promotor and a simple injection strategy while helping to strengthen the description of the CDC-42 role.

      3) As the authors certainly know, there is a PAR-6 mutation which prevents its binding to CDC-42. They could express this construct in the excretory canal a simple extrachromosomal array should be sufficient) to validate the direct interaction between these proteins in this cell.

      4) What is the lethality of ZIF-1-mediated depletion of the various factors under the exc promoter? Can homozygous strains be maintained? Authors just have to add a sentence in the Mat&Met section.

      Provided that the authors have access to an Airyscan, all the questions asked here can be answered in two months (one month for constructs, one month for injection and data analysis) at a very minor cost.

      Significance

      The reviewer has an expertise in cell polarity and membrane trafficking, using C. elegans as a model.

      The manuscript by Abrams & Nance describes a precise investigation of the role of PAR proteins in the recruitment of the exocyst during and after the extension of the C. elegans excretory canal. The interactions between these factors have already been examined in a number of models and contexts. In particular it follows a previous study from the same group (Armenti et al, Dev Biol, 2014) which established that the exocyst and RAL-1 controls polarized secretion in this model, and that PAR proteins are required for the polarized localization of the exocyst, but using the early embryo. This new manuscript is entirely focused on the excretory canal and 1) confirms the previous results, and 2) significantly extends them by precisely dissecting the role of CDC-42 and the apical PAR proteins. It will be of interest to researchers investigating the links between polarity and membrane trafficking with the description of a molecular cascade required for membrane trafficking in the context of a single-cell tube.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Abrams and Nance describes how the polarity proteins PAR-6 and PKC-3/aPKC promote lumen extension of the unicellular excretory canal in C. elegans. Using tissue-specific depletion methods they find that CDC-42 and the RhoGEF EXC-5/FGD are required for luminal localization of PAR-6, which recruits the exocyst complex required for lumen extension. Interestingly, they show that the ortholog of the mammalian exocyst receptor, PAR-3, is dispensable for luminal membrane extension. Overall, this is a well-written and interesting manuscript.

      Major comments

      1.Because depletion of PAR-3 in the canal causes milder defects than PAR-6 or CDC-42 the authors suggest that they cannot rule out the possibility that an alternative isoform of PAR-3 is expressed and buffering the defect. They should perform canal-specific RNAi-mediated depletion of the entire PAR-3 gene to determine if this is true.

      2.The authors suggest that GTP-loaded (activated) CDC-42 recruits PAR-6 to the luminal membrane. It would be nice if they could use a biosensor, such as the GBD-WSP-1 reagent from Buechner's lab to confirm that EXC-5 depletion also reduces activated CDC-42, as would be expected. This should be achievable since there is strong CDC-42 signal, even in the cytoplasm.

      3.Related to point 2, (i) does mutation of the CRIB domain of PAR-6 impair its recruitment to the luminal membrane, and (ii) does this mutant exacerbate canal defects when PAR-3 is depleted?

      4.The authors hypothesize that partial recruitment of PAR-6 by CDC-42 is sufficient for luminal membrane extension to explain the mild defects caused by PAR-3 depletion. Since depletion of PAR-6 and CDC-42 alone causes milder canal truncations the authors should co-deplete these proteins (as well as PAR-3 and CDC-42) to determine if there is an additive effect.

      5.In figure 2, the authors show that depletion of PKC-3 causes more severe canal truncations than PAR-6. Since these proteins function in the same complex what do they think is the reason for this difference? This point could be discussed more in the manuscript.

      6.Related to point 5, more experiments with PKC-3 should be done to determine if, for example, localization of SEC-10 is similarly affected as ablation of PAR-3, PAR-6 and CDC-42.

      Significance

      This manuscript builds off their previous work on the role of the exocyst in excretory canal extension and in our view represents an important advance that is relevant to biological tube development across phyla. Therefore, this work should be of interest to biologists studying tubulogenesis in many different model systems.

      My areas of expertise include model organism genetics, biological tube development, and biochemistry.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are grateful to Review Commons for the opportunity to get valuable comments on our manuscript “Trim39 regulates neuronal apoptosis by acting as a SUMO-targeted E3 ubiquitin-ligase for the transcription factor NFATc3”. We would like to acknowledge the very nice and constructive reviews that our manuscript received. We found all of the reviewer comments well founded and we are taking them into careful consideration in preparing the revised version. We are currently performing additional experiments to address the questions raised by the reviewers. We are not yet able to provide a revised version of the manuscript, but you will find below our response to the reviewers and our plan of revision. It is difficult to anticipate exactly how much time we will need to get the requested results and to prepare a complete revised version, as it will depend on whether we can work normally and whether we encounter technical problems. However, it should be possible within a few months.

      Reviewer #1

      **Summary:**

      Desagher and co-workers investigate the regulation of the NFAT family member NFATc3, a transcription factor in neurons with a pro-apoptotic role. They identify TRIM39 as a ubiquitin E3 ligase regulating NFATc3. They demonstrate that TRIM39 can bind and ubiquitinate NFATc3 in vitro and in cells. They identify a critical SUMO interaction motif in TRIM39, that is required for its interaction with NFATc3 and for its ability to ubiquitinate NFATc3. Moreover, mutating sumoylation sites in NFATc3 reduces the interaction with TRIM39 and reduces its ubiquitination. Silencing TRIM39 increases the protein levels of NFATc3 and its transcriptional activity, leading to apoptosis of neurons. TRIM17 modulates the TRIM39-NFATc3 axis. Combined, TRIM39 appears to be a SUMO-targeted ubiquitin ligase (STUbL) for NFATc3 in neurons.

      **Major points:**

      1.This manuscript containing two stories: the rather exciting story that TRIM39 is a STUbL for NFATc3 (as mentioned in the title) and the second less exciting story: TRIM17 modulates the regulation of NFATc3 by TRIM39. These stories are now mixed in a confusing manner, disrupting the flow of the first story. It would be better to focus the current manuscript on the first story and strengthen it further and develop the second story in a second manuscript.

      We understand that the reviewer is more interested in the part of our manuscript related to the characterization of Trim39 as a STUbL due to his/her field of expertise. However, the two other reviewers are also interested in the other parts of our work. Notably the third reviewer would like us to highlight the physiological importance of our findings. Indeed, the main goal of this article is to describe the mechanisms regulating the stability of the transcription factor NFATc3. Trim17 plays a role in this regulation by inhibiting Trim39. It is particularly important for understanding the impact of these mechanisms on neuronal apoptosis as Trim17 is induced in these conditions. As we want to reach a wide audience, we prefer not to focus our manuscript on the identification of a new STUbL. However, we agree with the reviewer that it would be very interesting to strengthen this part of our work and we are grateful for his/her suggestions.

      2.Whereas the cellular experiments to indicate that TRIM39 acts as a STUbL are properly carried out, the observed effects are not necessarily direct. Direct evidence that TRIM39 is indeed a STUbL for sumoylated NFATc3 needs to be obtained in vitro, using purified recombinant proteins. Does TRIM39 indeed preferentially ubiquitinate sumoylated NFATc3? Is ubiquitination reduced for non-sumoylated NFATc3? Is ubiquitination of sumoylated NFATc3 dependent on SIM3 of TRIM39? Do other SIMs in TRIM39 contribute?

      We agree with the reviewer that additional in vitro experiments using purified recombinant proteins would strengthen the characterization of Trim39 as a STUbL. In order to answer the specific questions of the reviewer, we propose to perform in vitro ubiquitination using different forms of GST-Trim39 (WT/mSIM3/mSIM1&2) following in vitro SUMOylation (or not) of NFATc3 produced by TnT (wheat germ) and purified by immunoprecipitation. Preliminary results using WT Trim39 show that indeed the in vitro ubiquitination of NFATc3 is improved by prior in vitro SUMOylation. We have to confirm these results and to test the SIM mutants of Trim39 in the same conditions.

      3.Rule out potential roles for other STUbLs by including control knockdowns of RNF4 and RNF111 and verify the sumoylation of NFATc3 and ubiquitination of wildtype and sumoylation-mutant NFATc3.

      Our data show that silencing of Trim39 deeply decreases the ubiquitination level of NFATc3 in Neuro2A cells, indicating that Trim39 plays a major role in this process. We agree that this does not exclude the possible involvement of other STUbLs in NFATc3 ubiquitination in this model but their potential contribution would be limited. This point will be better addressed in the discussion.

      4.Figure 6B: use SUMO inhibitor ML-792 to demonstrate that ubiquitination of wildtype NFATc3 by TRIM39 is dependent on sumoylation.

      We thank the reviewer for suggesting this experiments that can easily improve the strength of our demonstration. Our preliminary results indeed indicate that in vivo ubiquitination of NFATc3 by Trim39 is strongly decreased following treatment with the SUMO inhibitor ML-792. We have to confirm these results.

      **Minor points:**

      5.Figure 1A and B: demonstrate by immunoprecipitation and Western that the endogenous counterparts indeed interact.

      We are currently setting the conditions to immunoprecipitate endogenous NFATc3 and Trim39 in order to demonstrate that they indeed interact.

      6.Figure 1C and 1E: Quantify the PLA results properly and perform statistics.

      We will perform these quantification and statistical analysis as requested.

      7.Figure 2B: Correct unequal loading of samples.

      We agree with the reviewer (as with reviewer #2) that the blots showing the total lysates of this experiment are confusing. As mentioned in the legend, some material has been lost during the TCA precipitation resulting in unequal loading. However, these experiments have been performed a very long time ago and we do not have the protein extracts anymore. We are currently trying to produce efficient shRNA-expressing lentiviruses to reproduce this experiment and provide a better figure.

      8.Figure 6B: proper statistics are needed here from at least three independent experiments.

      The reviewer is right. Statistics are needed to reinforce the significance of these results. We have quantified three independent experiments and made graphs and statistics that will be presented in the revised version of the manuscript. They better support our conclusion.

      Reviewer #1 (Significance (Required)):

      Humans have over 600 different ubiquitin E3s. Currently, RNF4 and RNF111 are the only known human SUMO-Targeted Ubiquitin Ligases (STUbLs). Here, the authors present evidence that the ubiquitin E3 ligase TRIM39 is a STUbL for sumoylated NFATc3. Identification of a new STUbL is an exciting finding for the ubiquitin and SUMO field and for the field of ubiquitin-like signal transduction in general, but needs to be strengthened as outlined above. My field of expertise is SUMO and ubiquitin signal transduction.

      Reviewer #2

      In this manuscript, the authors analyze the effect of TRIM39, a ubiquitin E3 ligase, on NFATc3, a transcription factor that regulates apoptosis in the nervous system. The authors show that TRIM39 can promote the ubiquitination of NFATc3 and regulate its half-life. Furthermore, ubiquitination depends on the SUMOylation state of NFATc3, which suggests that TRIM39 could be a new example of SUMOylation-dependent ubiquitin ligase or STUbL. **In addition, the authors show that TRIM17 interferes with TRIM39 ubiquitination, representing a new regulatory level for NFATc3 degradation. This has consequences on the regulation of apoptosis in cells derived from the nervous system.

      The authors show well-controlled, sound results for the most part. The manuscript is well written, and argumentation is convincing. Given the fact that only 2 STUbLs were previously characterized in mammals, the results are relevant and represent an advance in the field. Overall, this is a nice piece of work. Here are some comments.

      **Major comments**

      -In Fig. 2B, the levels of material loaded are uneven, which difficult the interpretation.

      We agree with the reviewer (as with reviewer #1) that the blots showing the total lysates of this experiment are confusing. As mentioned in the legend, some material has been lost during the TCA precipitation, resulting in unequal loading. In the other experiments, we have the same problem or the background is too high. We are currently trying to produce efficient shRNA-expressing lentiviruses to reproduce this experiment and provide a better figure.

      However, it seems that the control shRNA also has an effect on NFATc3 ubiquitination, which should not be the case.

      It is true that, in the present figure, the ubiquitination signal is decreased in cells transduced with the control shRNA. However, this is likely due to reduced expression of transfected NFATc3 following lentiviral infection, as it can be seen on the western blot of total lysates.

      Also, by reducing ubiquitination by TRIM39, shouldn't you expect an increase in the levels of NFATc3, if this ubiquitination was driving degradation? The authors do not specify whether those cells were treated or not with proteasomal inhibitor.

      We agree that an increase in the protein level of NFATc3 is expected following silencing of Trim39. However, in the assay presented in Figure 2B, NFATc3 is transfected and the part of overexpressed NFATc3 that is ubiquitinated by endogenous Trim39 is certainly low. Therefore, silencing of Trim39 cannot have a visible impact on the total protein level of NFATc3.

      Indeed, cells were treated with proteasome inhibitor. It is mentioned in the legend of Figure 2A. To avoid repeating it in the legend of Figure 2B, we just wrote that, after 24h transfection, cells were treated as in A, with includes MG-132 treatment for 6h.

      Same applies in Figure 4B, where no reduction in NFATc3 are seen after including TRIM39 in the reaction (beyond the fact that it looks reduced because the presence of ubiquitinated forms).

      In Figure 4B, the reaction of ubiquitination is performed in an acellular medium with purified recombinant proteins. Although NFATc3 is produced by in vitro transcription/translation in wheat germ extract, it is purified by immunoprecipitation before in vitro ubiquitination. Therefore, the reaction does not contain any proteasome and NFATc3 should not be degraded following its ubiquitination by TRIM39.

      -After the experiments in vitro shown in Fig. 2C, the authors conclude that the NFATc3 is a direct substrate of TRIM39. I think the authors used the right approach by using bacterially produced GST-TRIM39 for the ubiquitination reaction. However NFATc3 is produced by an in vitro transcription-translation system, which could in principle provide other contaminant proteins to the reaction. Did the authors try to use bacterially produced NFATc3? This might be difficult in the case of big proteins, in which case the authors could add some caution note in the text. Same applies in Figure 4B.

      The reviewer is right. It would have been preferable to use NFATc3 produced in bacteria. Indeed, we started with this approach. However, it was very difficult to get NFATc3 expressed in bacteria, and when we succeeded, most of the protein was degraded. We tried different protease inhibitor cocktails and we used a strain of bacteria (BL21-CodonPlus(DE3)-RP) that is mutated on the genes coding for the proteases Lon and OmpT and is further engineered to express tRNAs that are often limiting when expressing mammalian proteins. Unfortunately, this did not improve our production enough.

      We agree that, in principle, in vitro transcription-translation (TnT) systems can include contaminant proteins. However, we used wheat germ extract to produce NFATc3 by TnT. Moreover, we immunopurified NFATc3 from the TnT reaction prior to the ubiquitination reaction. The probability that proteins modifying NFATc3 are expressed in plants and are co-purified with NFATc3 is low. Nevertheless, we will discuss this point in the result section of the revised version of the manuscript, when describing results of Figure 2B and 4B.

      -In Fig. 6B, higher levels of ubiquitination in the different SUMOylation mutants are shown. Is this effect consistent? How this can be explained?

      We are grateful to the reviewer for pointing out this inconsistency in our manuscript. It will be corrected. Indeed, the values indicated in red in Figure 6B are confusing and are certainly not consistent. We calculated them by normalizing the intensity of the ubiquitination signal by the intensity of NFATc3 in total lysates, which seems to have introduced a bias. Variations in NFATc3 levels are probably responsible for the artificially higher levels of ubiquitination for different SUMOylation mutants after normalization. When quantifying three independent experiments, as requested by reviewer #1, we realized that results are much more consistent without normalization. Therefore, in the revised version of manuscript, we will add a graph showing the average and standard deviation of three independent experiments quantified without normalization. We will also replace the experiment currently presented in Figure 6B by another one in which the levels of NFATc3 show lower variations in the total lysates.

      In addition, variations in the levels of NFATc3 are shown in the total lysate, despite the use of proteasomal inhibitors. How the author explain this effect?

      These variations in NFATc3 levels in the total lysates may be due to differential protein precipitation by TCA. That is why, in more recent experiments, we collected a portion of the homogenous cell suspension before lysis in the guanidinium buffer, to assess the expression level of transfected proteins (as presented in Figures 4A and 7E).

      It is true that treatment with proteasome inhibitor should attenuate differences in protein level due to different ubiquitination levels. However, cells are transfected for 24h and then treated with MG-132 for 6h before lysis. Proteasome inhibition cannot compensate for what occurred in the cells during the 24h transfection. It is added essentially to accumulate poly-ubiquitinated forms of NFATc3.

      Somehow, this is contradictory with the general message of SUMOylation-dependent ubiquitination.

      The reduced levels of SUMOylation mutants in total lysates may appear to be contradictory with SUMOylation-dependent ubiquitination. However, as mentioned above, this could be due to differential protein precipitation by TCA or to different transfection efficiencies. In contrast, the half-life measurement of WT and EallA mutant, that does not rely on initial expression levels, clearly shows a stabilization of the SUMOylation mutant. Moreover, the average of the three ubiquitination experiments is really convincing. Therefore, we believe that the data that will be presented in the revised manuscript will strongly support our hypothesis.

      -In Fig. 7E, not clear to me what the big bands above 130 KDa is after the nickel beads. Do they correspond to monoUb NFATc3 or to the unmodified protein that is sticky to the beads? Do the authors have side-by-side gels of the initial lysate next to the nickel beads eluates to show the increase in molecular weight?

      The big bands above 130 kD among nickel bead-purified proteins in Figure 7A are unlikely to be unmodified NFATc3 sticking to the beads. Indeed, in the control condition, in which NFATc3 is overexpressed in the absence of His-ubiquitin, these bands are not visible. Therefore, they might be mono-ubiquitinated forms of NFATc3, or degradation products of poly-ubiquitinated NFATc3. We will correct the figure to clarify this point. Unfortunately, we do not have a gel with nickel bead eluates and total lysates side by side for this experiment.

      -Quantifications in some pictures (i.e. Figures 5A, 5B, 6B, 7) is shown in red above or below the bands. Not clear whether the quantifications shown correspond to that single experiment or is the average of several experiments. In the first case, the number would not be very valuable. Authors could add quantification graphs with standard deviations or error bars to the experiments if they want to make the point of changes (significant or not) in the levels. Alternatively, indicate in the Figure legends whether the numbers correspond to the average of several experiments.

      These quantifications correspond to the representative experiments shown in the different figures. We will clarify this point in the Figure legends of the revised manuscript. We added these quantifications to normalize the amount of co-precipitated proteins by the amount of the precipitated partner (Fig 5A, 5B, 7B, 7C, 7D) which is not always precipitated with the same efficiency in the different conditions. We think that it should help the reader to assess the degree of interaction. We also added quantifications to Figure 7E to normalize the ubiquitination signal by the amount of NFATc3 expressed in the total lysate. However, we did not want to overload the figures by adding too many graphs.

      For Figure 6B, where TCA precipitation of total lysates created an inconsistency, we will provide a graph with the average and standard deviation of three independent experiments, as requested by reviewer #1.

      -In Fig. 8, the quantification of apoptotic nuclei has been done just based on the morphology after DAPI staining. Could you use an apoptosis marker (i.e. cleaved caspase Abs) to label the apoptotic cells?

      We have been using primary cultures of cerebellar granule neurons (CGN) as an in vitro model of neuronal apoptosis for many years. Nuclear condensation, visualized after DAPI staining, is very characteristic in these neurons and allows a reliable assessment of neuronal apoptosis. In a previous study (Desagher et al. JBC 2005), we have shown that the kinetics of apoptosis in CGN is the same whether we measure cytochrome c release, active caspase 3 or nuclear condensation (Fig 1b). We therefore believe that the counting of apoptotic nuclei is sufficient to support our conclusions, notably for transfection experiments in Figure 8A which would require a lot of work to be repeated with active caspase 3 staining. However, if we can produce efficient shRNA-expressing lentiviruses, we will reproduce the experiment presented in Figure 8B and we will perform a western blot using anti-active caspase 3 to confirm our conclusion.

      **Minor comments**

      -In Figs. 1 and 5, the red channel should be put in black and white, as it is much easier to see the signal. Not relevant to have DAPI alone in B&W (it does not hurt either), as it is well visible in the merge picture. Also, quantification of the PLA positive dots should be shown in Fig. 1.

      We thank the reviewer for these suggestions. We will modify the figures and we will quantify the PLA dots in Figure 1 as requested.

      -In Fig. 3C, is the difference in TRIM17 expression between empty plasmid and NFATc3 plasmid significant? If so, indicate it in the graph. The same in panels D, E, indicate all significant differences. Same in other Figures.

      No, the difference in Trim17 expression is not statistically significant between NFATc3 and empty plasmid although it clearly increases. However, we agree with the reviewer that more significant differences could be shown in the figures, particularly in Figure 3. Nonetheless, we will try not to overload the figures and will restrict ourselves to comparisons that make sense.

      -It would be nice to show a scheme on the location of SIMs in TRIM39 in relation to the other feature of the protein.

      We are grateful to the reviewer for this suggestion. We will be happy to add a scheme of Trim39 showing its different domains and the location of its SIMs in the revised Figure 7.

      -In Fig. 2 legend, "Note that in the presence of ubiquitin the unmodified form of WT GST-Trim39 is lower due to high Trim39 ubiquitination." Please change to "...in the presence of ubiquitin the levels of the unmodified form..."

      -In Fig. 7 legend, the phrases "The intensity of the bands ... " are not clear. Please rephrase.

      -In Fig. 8 legend, "\** * PWe thank the reviewer for pointing out typographical errors and awkward sentences in our manuscript. Changes will be made as requested.

      Reviewer #2 (Significance (Required)):

      In this manuscript, the authors analyze the effect of TRIM39, a ubiquitin E3 ligase, on NFATc3, a transcription factor that regulates apoptosis in the nervous system. The authors show that TRIM39 can promote the ubiquitination of NFATc3 and regulate its half-life. Furthermore, ubiquitination depends on the SUMOylation state of NFATc3, which suggests that TRIM39 could be a new example of SUMOylation-dependent ubiquitin ligase or STUbL. In addition, the authors show that TRIM17 interferes with TRIM39 ubiquitination, representing a new regulatory level for NFATc3 degradation. This has consequences on the regulation of apoptosis in cells derived from the nervous system.

      The authors show well-controlled, sound results for the most part. The manuscript is well written, and argumentation is convincing. Given the fact that only 2 STUbLs were previously characterized in mammals, the results are relevant and represent an advance in the field. Overall, this is a nice piece of work.

      Audience: researchers interested on proteostasis in general and on nervous system regulation

      My expertise: postranslational modifications

      Reviewer #3

      **Summary:**

      In this study, Shrivastava et al. elucidated the previously unknown function of TRIM39 in regulating protein stability of NFATc3, the predominant member of the NFAT family of transcription factor in neurons, where it plays a pro-apoptotic role. NFATs have been shown to be regulated by multiple mechanisms, including at the level of protein stability. In this study, the authors identify TRIM39 as the E3 ligase for NFATc3. Interestingly, TRIM39 recognizes the SUMOylated form of NFATc3 and the interaction facilitates its ubiquitylation and subsequent proteasomal degradation. They further showed that binding of TRIM39 to NFATc3 can also be regulated by TRIM17. Like TRIM39, TRIM17 is a ring-finger containing protein previously shown by this group that it binds NFATc3 but the interaction resulted in an up- rather than down-regulation of NFATc3. In this study, they offer insight to the paradox that overexpression of TRIM17 binding to TRIM39 is to inhibit TRIM39-mediated ubiquitylation of NFATc3. Furthermore, they showed activation of NFATc3 transcriptionally activates TRIM17 expression, thus forming a feedback loop between NFATc3 and TRIM17. Hence, an TRIM17-TRIM39-NFATc3 signaling axis for modulating the protein stability for promoting the activity of NFATc3 in regulating apoptosis in the cerebellar granule neurons induced by KCl deprivation is proposed

      The key conclusions are convincing. The data in general are of good quality and with many of the key interactions vigorously documented **by conducting reciprocal interaction analysis. For knockdown expeRIMents, two shRNA independent sequences were used. However, some issues remain to be addressed:

      **Major comments:**

      1.Figure 1D - the authors should demonstrate that the depletion of TRIM39 expression by shRNA in Neuro2A by Western blotting

      We agree with the reviewer that it would be better to provide this control. Unfortunately, we have never been able to observe a convincing decrease in the protein level of Trim39, following knockdown, by Western blotting in Neuro2A cells. This is surprising because the decrease is clearly visible by immunofluorescence in Neuro2A cells, and by western blotting in neurons (see Figure 8C). It is possible that Neuro2A cells, but not neurons, express a protein that is non-specifically recognized by our best anti-Trim39 antibody in western blots and that migrates at the same size as Trim39, thus preventing the investigator to detect the depletion of Trim39. We will test additional anti-Trim39 antibodies to address this question.

      2.Figure 3 - the author should show overexpression of TRIM39 resulted in reduction of basal level of endogenous NFATc3 due to its effect on protein stability by using CHX or other pulse chase method.

      This is an important point and we have performed many experiments using cycloheximide to measure the half-life of NFATc3 in the presence or the absence of overexpressed Trim39. The results were neither consistent nor reproducible. This is certainly due to the fact that the half-life of endogenous NFATc3 is longer than that of overexpressed Trim39 and that cycloheximide inhibits the expression of both proteins. Therefore, we will perform pulse-chase experiments after metabolic labelling of cells with [35S]-Met. We are currently setting up the conditions to immunoprecipitate endogenous NFATc3 to be able to perform these experiments.

      3.Figure 3 - Does overexpression or knockdown of TRIM39 has an effect on affecting levels of NFATc3 mRNAs?

      The reviewer is right. It is important to control that overexpression and knockdown of Trim39 do not modify the mRNA level of NFATc3. Therefore, we are currently measuring NFATc3 mRNA levels in all the experiments used to make the graphs of Figure 3. These results will be added to the revised version of the manuscript as supplemental data. First results show no significant change of NFATc3 mRNA levels in these experiments.

      4.Figure 6A - the authors should confirm the multiple bands that are slower migrating are SUMO form of NFATc9 by demonstrating the presence of SUMO in these forms of NFATc3, or alternatively, perform His-SUMO pull-down and probe for NFATc3.

      The reactions shown in Figure 6B have been performed in vitro, with purified recombinant proteins and with NFATc3 produced by in vitro transcription/translation. The wheat germ extract used to produce NFATc3 is unlikely to provide the material needed for post-translation modification of a mammalian protein. However, we agree that it would be better to confirm that slower migrating bands are indeed SUMOylated forms of NFATc3. We may hybridize the membranes with an anti-SUMO antibody but it would give a smear as the enzymes added to the reaction mix are themselves SUMOylated. Therefore, we will show an experiment in which the reaction mix has been incubated with and without SUMO. The results show no slower migrating bands in the absence of SUMO although all conditions were otherwise identical. This will be added to the revised Figure 6.

      5.Figure 7C - the quantification for mSIM1 does not seem to agree with the band intensity.

      Yes, we agree with the reviewer that the quantification (122%) does not seem to reflect the amount of SUMO-chains bound to GST-Trim39 mSIM1. This is due to the normalization of the SUMO signals by the intensity of GST-Trim39 bands. Indeed, it is difficult to control exactly how much recombinant protein is used. GST-Trim39 mSIM1 was slightly less abundant than the other GST-Trim39 proteins in this experiment, explaining why less SUMO-chains were eluted in this condition. The normalization is mentioned in the legend of Figure 7C.

      6.TRIM17 reduces TRIM39/NFATc3 interaction and inhibits TRIM39 E3 activity, which results in stabilization of NFATc3. NFATc3 in turn transcriptionally induces TRIM17 expression, thus forming a feedback loop between NFATc3 and TRIM17. It will be good if the authors can discuss the possibility of the existence of this feedback mechanism in physiological context? Is the protein level of NFATc3 level, which should be low abundance at the resting state, elevated by KCI deprivation? If so, can the authors discuss the possible signalling event(s) that that lead to activation of NFATc3 upon KCI deprivation? For instance, does KCL deprivation cause de-SUMOylation of NFATc3?

      We thank the reviewer for these suggestions. Our preliminary results suggest that the protein level of NFATc3 is increased in neurons following KCl deprivation. We are currently performing additional experiments to confirm this result. If proved, this increase may be due to the transcriptional induction of Trim17 that should result in the stabilization of NFATc3 through the inhibition of Trim39. It may also be due to a possible deSUMOylation of NFATc3 following apoptosis induction, as suggested by the reviewer. To address the latter point, we are currently setting up PLA using anti-NFATc3 and anti-SUMO antibodies to assess the SUMOylation level of endogenous NFATc3 in neurons. If they are of good quality, we will add these data to Figure 8 and we will discuss the possible existence of feedback loops in neuronal apoptosis, as suggested by the reviewer.

      **Minor comments:**

      1.Line 294 - it should be "SUMOylation" instead of "SUMO".

      We thank the reviewer for pointing out this typographical error that will be corrected.

      2.Figure 8 - to include TRIM39/NFATc3 double knockdown to show the effect on increased neuronal apoptosis in the cells with TRIM39 knocked down was due to elevation of NFATc3 rather than other target(s) of TRIM39.

      We agree that it would be interesting to test whether the increase on neuronal apoptosis following Trim39 silencing is mainly due to its effect on NFATc3. We will therefore perform double silencing of Trim39 and NFATc3 in neurons in order to address this point.

      3.The discussion may be shortened and revised to highlight the physiological importance of the findings linked to cerebellar granule neurons survival.

      As suggested by the reviewer, we will modify the discussion to better highlight the physiological implications of our data, particularly by discussing the results of the additional experiments we will conduct in neurons.

      Reviewer #3 (Significance (Required)):

      Prior to this study, the mechanism by which protein stability of NFATc3, the pre-dominant member of the NFAT family of transcription factor in neurons, is regulated remains poorly understood. Shrivastava et al. have unravelled the interplay between ubiquitylation and SUMOylation involving TRIM39 and TRIM17 to have an important role in regulating protein stability of NFATc3. The work is interesting and bears significance towards understanding how apoptosis could be finely controlled in cerebellar granule neurons. Furthermore, the study has also expanded the understanding of the role and regulation of the TRIM family of proteins. The senior author is an expert in this field and over the years, her group has contributed many key discoveries on the function of TRIM family of E3 ubiquitin ligases and their critical ubiquitylation substrates in neuronal survival and its relevance to neuronal biology and diseases.

      The referee's field of expertise in in the field of mitochondrial apoptosis signalling. The referee extensively involved in studying how protein stability of regulators in apoptosis signalling are regulated by the ubiquitin-proteasome system (UPS) and how does the regulation play a role in physiology and diseases.

      Key words: apoptosis, ubiquitylation, cell signaling, liver diseases

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, Shrivastava et al. elucidated the previously unknown function of TRIM39 in regulating protein stability of NFATc3, the predominant member of the NFAT family of transcription factor in neurons, where it plays a pro-apoptotic role. NFATs have been shown to be regulated by multiple mechanisms, including at the level of protein stability. In this study, the authors identify TRIM39 as the E3 ligase for NFATc3. Interestingly, TRIM39 recognizes the SUMOylated form of NFATc3 and the interaction facilitates its ubiquitylation and subsequent proteasomal degradation. They further showed that binding of TRIM39 to NFATc3 can also be regulated by TRIM17. Like TRIM39, TRIM17 is a ring-finger containing protein previously shown by this group that it binds NFATc3 but the interaction resulted in an up- rather than down-regulation of NFATc3. In this study, they offer insight to the paradox that overexpression of TRIM17 binding to TRIM39 is to inhibit TRIM39-mediated ubiquitylation of NFATc3. Furthermore, they showed activation of NFATc3 transcriptionally activates TRIM17 expression, thus forming a feedback loop between NFATc3 and TRIM17. Hence, an TRIM17-TRIM39-NFATc3 signaling axis for modulating the protein stability for promoting the activity of NFATc3 in regulating apoptosis in the cerebellar granule neurons induced by KCl deprivation is proposed.

      The key conclusions are convincing. The data in general are of good quality and with many of the key interactions vigorously documented by conducting reciprocal interaction analysis. For knockdown expeRIMents, two shRNA independent sequences were used. However, some issues remain to be addressed:

      Major comments:

      1.Figure 1D - the authors should demonstrate that the depletion of TRIM39 expression by shRNA in Neuro2A by Western blotting

      2.Figure 3 - the author should show overexpression of TRIM39 resulted in reduction of basal level of endogenous NFATc3 due to its effect on protein stability by using CHX or other pulse chase method.

      3.Figure 3 - Does overexpression or knockdown of TRIM39 has an effect on affecting levels of NFATc3 mRNAs?

      4.Figure 6A - the authors should confirm the multiple bands that are slower migrating are SUMO form of NFATc9 by demonstrating the presence of SUMO in these forms of NFATc3, or alternatively, perform His-SUMO pull-down and probe for NFATc3.

      5.Figure 7C - the quantification for mSIM1 does not seem to agree with the band intensity.

      6.TRIM17 reduces TRIM39/NFATc3 interaction and inhibits TRIM39 E3 activity, which results in stabilization of NFATc3. NFATc3 in turn transcriptionally induces TRIM17 expression, thus forming a feedback loop between NFATc3 and TRIM17. It will be good if the authors can discuss the possibility of the existence of this feedback mechanism in physiological context? Is the protein level of NFATc3 level, which should be low abundance at the resting state, elevated by KCI deprivation? If so, can the authors discuss the possible signalling event(s) that that lead to activation of NFATc3 upon KCI deprivation? For instance, does KCL deprivation cause de-SUMOylation of NFATc3?

      Minor comments:

      1.Line 294 - it should be "SUMOylation" instead of "SUMO".

      2.Figure 8 - to include TRIM39/NFATc3 double knockdown to show the effect on increased neuronal apoptosis in the cells with TRIM39 knocked down was due to elevation of NFATc3 rather than other target(s) of TRIM39.

      3.The discussion may be shortened and revised to highlight the physiological importance of the findings linked to cerebellar granule neurons survival.

      Significance

      Prior to this study, the mechanism by which protein stability of NFATc3, the pre-dominant member of the NFAT family of transcription factor in neurons, is regulated remains poorly understood. Shrivastava et al. have unravelled the interplay between ubiquitylation and SUMOylation involving TRIM39 and TRIM17 to have an important role in regulating protein stability of NFATc3. The work is interesting and bears significance towards understanding how apoptosis could be finely controlled in cerebellar granule neurons. Furthermore, the study has also expanded the understanding of the role and regulation of the TRIM family of proteins. The senior author is an expert in this field and over the years, her group has contributed many key discoveries on the function of TRIM family of E3 ubiquitin ligases and their critical ubiquitylation substrates in neuronal survival and its relevance to neuronal biology and diseases.

      The referee's field of expertise in in the field of mitochondrial apoptosis signalling. The referee extensively involved in studying how protein stability of regulators in apoptosis signalling are regulated by the ubiquitin-proteasome system (UPS) and how does the regulation play a role in physiology and diseases.

      Key words: apoptosis, ubiquitylation, cell signaling, liver diseases

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, the authors analyze the effect of TRIM39, a ubiquitin E3 ligase, on NFATc3, a transcription factor that regulates apoptosis in the nervous system. The authors show that TRIM39 can promote the ubiquitination of NFATc3 and regulate its half-life. Furthermore, ubiquitination depends on the SUMOylation state of NFATc3, which suggests that TRIM39 could be a new example of SUMOylation-dependent ubiquitin ligase or STUbL. In addition, the authors show that TRIM17 interferes with TRIM39 ubiquitination, representing a new regulatory level for NFATc3 degradation. This has consequences on the regulation of apoptosis in cells derived from the nervous system. The authors show well-controlled, sound results for the most part. The manuscript is well written, and argumentation is convincing. Given the fact that only 2 STUbLs were previously characterized in mammals, the results are relevant and represent an advance in the field. Overall, this is a nice piece of work. Here are some comments.

      Major comments

      -In Fig. 2B, the levels of material loaded are uneven, which difficult the interpretation. However, it seems that the control shRNA also has an effect on NFATc3 ubiquitination, which should not be the case. Also, by reducing ubiquitination by TRIM39, shouldn't you expect an increase in the levels of NFATc3, if this ubiquitination was driving degradation? The authors do not specify whether those cells were treated or not with proteasomal inhibitor. Same applies in Figure 4B, where no reduction in NFATc3 are seen after including TRIM39 in the reaction (beyond the fact that it looks reduced because the presence of ubiquitinated forms).

      -After the experiments in vitro shown in Fig. 2C, the authors conclude that the NFATc3 is a direct substrate of TRIM39. I think the authors used the right approach by using bacterially produced GST-TRIM39 for the ubiquitination reaction. However NFATc3 is produced by an in vitro transcription-translation system, which could in principle provide other contaminant proteins to the reaction. Did the authors try to use bacterially produced NFATc3? This might be difficult in the case of big proteins, in which case the authors could add some caution note in the text. Same applies in Figure 4B.

      -In Fig. 6B, higher levels of ubiquitination in the different SUMOylation mutants are shown. Is this effect consistent? How this can be explained? In addition, variations in the levels of NFATc3 are shown in the total lysate, despite the use of proteasomal inhibitors. How the author explain this effect? Somehow, this is contradictory with the general message of SUMOylation-dependent ubiquitination.

      -In Fig. 7E, not clear to me what the big bands above 130 KDa is after the nickel beads. Do they correspond to monoUb NFATc3 or to the unmodified protein that is sticky to the beads? Do the authors have side-by-side gels of the initial lysate next to the nickel beads eluates to show the increase in molecular weight?

      -Quantifications in some pictures (i.e. Figures 5A, 5B, 6B, 7) is shown in red above or below the bands. Not clear whether the quantifications shown correspond to that single experiment or is the average of several experiments. In the first case, the number would not be very valuable. Authors could add quantification graphs with standard deviations or error bars to the experiments if they want to make the point of changes (significant or not) in the levels. Alternatively, indicate in the Figure legends whether the numbers correspond to the average of several experiments.

      -In Fig. 8, the quantification of apoptotic nuclei has been done just based on the morphology after DAPI staining. Could you use an apoptosis marker (i.e. cleaved caspase Abs) to label the apoptotic cells?

      Minor comments

      -In Figs. 1 and 5, the red channel should be put in black and white, as it is much easier to see the signal. Not relevant to have DAPI alone in B&W (it does not hurt either), as it is well visible in the merge picture. Also, quantification of the PLA positive dots should be shown in Fig. 1.

      -In Fig. 3C, is the difference in TRIM17 expression between empty plasmid and NFATc3 plasmid significant? If so, indicate it in the graph. The same in panels D, E, indicate all significant differences. Same in other Figures.

      -It would be nice to show a scheme on the location of SIMs in TRIM39 in relation to the other feature of the protein.

      -In Fig. 2 legend, "Note that in the presence of ubiquitin the unmodified form of WT GST-Trim39 is lower due to high Trim39 ubiquitination." Please change to "...in the presence of ubiquitin the levels of the unmodified form..."

      -In Fig. 7 legend, the phrases "The intensity of the bands ... " are not clear. Please rephrase.

      -In Fig. 8 legend, " P<0.001". Change to "* P<0.001".

      Significance

      In this manuscript, the authors analyze the effect of TRIM39, a ubiquitin E3 ligase, on NFATc3, a transcription factor that regulates apoptosis in the nervous system. The authors show that TRIM39 can promote the ubiquitination of NFATc3 and regulate its half-life. Furthermore, ubiquitination depends on the SUMOylation state of NFATc3, which suggests that TRIM39 could be a new example of SUMOylation-dependent ubiquitin ligase or STUbL. In addition, the authors show that TRIM17 interferes with TRIM39 ubiquitination, representing a new regulatory level for NFATc3 degradation. This has consequences on the regulation of apoptosis in cells derived from the nervous system.

      The authors show well-controlled, sound results for the most part. The manuscript is well written, and argumentation is convincing. Given the fact that only 2 STUbLs were previously characterized in mammals, the results are relevant and represent an advance in the field. Overall, this is a nice piece of work.

      Audience: researchers interested on proteostasis in general and on nervous system regulation

      My expertise: postranslational modifications

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Desagher and co-workers investigate the regulation of the NFAT family member NFATc3, a transcription factor in neurons with a pro-apoptotic role. They identify TRIM39 as a ubiquitin E3 ligase regulating NFATc3. They demonstrate that TRIM39 can bind and ubiquitinate NFATc3 in vitro and in cells. They identify a critical SUMO interaction motif in TRIM39, that is required for its interaction with NFATc3 and for its ability to ubiquitinate NFATc3. Moreover, mutating sumoylation sites in NFATc3 reduces the interaction with TRIM39 and reduces its ubiquitination. Silencing TRIM39 increases the protein levels of NFATc3 and its transcriptional activity, leading to apoptosis of neurons. TRIM17 modulates the TRIM39-NFATc3 axis. Combined, TRIM39 appears to be a SUMO-targeted ubiquitin ligase (STUbL) for NFATc3 in neurons.

      Major points:

      1.This manuscript containing two stories: the rather exciting story that TRIM39 is a STUbL for NFATc3 (as mentioned in the title) and the second less exciting story: TRIM17 modulates the regulation of NFATc3 by TRIM39. These stories are now mixed in a confusing manner, disrupting the flow of the first story. It would be better to focus the current manuscript on the first story and strengthen it further and develop the second story in a second manuscript.

      2.Whereas the cellular experiments to indicate that TRIM39 acts as a STUbL are properly carried out, the observed effects are not necessarily direct. Direct evidence that TRIM39 is indeed a STUbL for sumoylated NFATc3 needs to be obtained in vitro, using purified recombinant proteins. Does TRIM39 indeed preferentially ubiquitinate sumoylated NFATc3? Is ubiquitination reduced for non-sumoylated NFATc3? Is ubiquitination of sumoylated NFATc3 dependent on SIM3 of TRIM39? Do other SIMs in TRIM39 contribute?

      3.Rule out potential roles for other STUbLs by including control knockdowns of RNF4 and RNF111 and verify the sumoylation of NFATc3 and ubiquitination of wildtype and sumoylation-mutant NFATc3.

      4.Figure 6B: use SUMO inhibitor ML-792 to demonstrate that ubiquitination of wildtype NFATc3 by TRIM39 is dependent on sumoylation.

      Minor points:

      5.Figure 1A and B: demonstrate by immunoprecipitation and Western that the endogenous counterparts indeed interact.

      6.Figure 1C and 1E: Quantify the PLA results properly and perform statistics.

      7.Figure 2B: Correct unequal loading of samples.

      8.Figure 6B: proper statistics are needed here from at least three independent experiments.

      Significance

      Humans have over 600 different ubiquitin E3s. Currently, RNF4 and RNF111 are the only known human SUMO-Targeted Ubiquitin Ligases (STUbLs). Here, the authors present evidence that the ubiquitin E3 ligase TRIM39 is a STUbL for sumoylated NFATc3. Identification of a new STUbL is an exciting finding for the ubiquitin and SUMO field and for the field of ubiquitin-like signal transduction in general, but needs to be strengthened as outlined above. My field of expertise is SUMO and ubiquitin signal transduction.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      **A. Summary:**

      In this modeling study, the authors devised a multicellular model to investigate how circadian clocks in different parts (organs) of plants coordinate their timing. The model uses a plausible mechanism to explain how having a different sensitivity to light leads to different phase and period of circadian clock, which is observed in different plant organs. The model allows for entrainment in Light-Dark (LD) cycles and then a release in always-light (LL) environments. The model disentangles numerous factors that have confounded previous experiments. In one instance, the authors assigned different light sensitivities to the different organs (e.g., root tip, hypocotyl, etc.) which unambiguously show that this one element alone - spatially differing sensitivity to light - is sufficient for recapitulating experimentally observed differences in periods and phases between plant organs. The model also recapitulates the spatial waves of gene expression within and between organs that experimentalists reported. At the sub-tissue level, the model-produced waves have similar patterns as the experimentally observed waves. This confirmation further validates the model. By having the cells share clock mRNA, from any clock component genes, showed the same, experimentally observed spatial dynamics. The main conclusion of the study is that regional differences (e.g., between different organs) in light senilities, when combined with cell-to-cell sharing of clock-gene mRNAs, enables a robust, yet flexible, circadian timing under noisy environmental cycles.

      Thank you for your assessment of our work. We plan to make the following revisions based on your feedback.

      **B. Specific points:**

      1.Lines 125-127: "To simulate the variability observed in single cell clock rhythms, we multiplied the level of each mRNA and protein by a time scaling parameter that was randomly selected from a normal distribution." - Why not add a white (Gaussian) noise term to these equations? How does multiplying by a random variable (for rescaling time) different from my proposal? Some explanation should be given in the text here.

      We opted for a time scaling approach as this generates between-cell period differences but avoids within-cell period differences. This is consistent with single cell experiments (S1 Fig; Gould et al., 2018, eLife). We will provide an explanation of this in the text.

      2.Does the spatial network model simplify calculations by assuming separations of timescales (e.g., for equilibration in concentrations of mRNAs that diffuse between cells)? If so, it would be good to spell these out in the beginning of the Results section (where the model is described).

      We agree that a more detailed discussion of the model assumptions would be beneficial and we will provide this in the text.

      3.Lines 161-162: "....in a phase only model by local...." should be "....in a phase model only by local...."

      Thank you for your correction.

      4.Lines 188-190: The authors observed that qualitatively similar/indistinguishable behaviors arose regardless of which elements are varied (e.g., global versus local cell-cell coupling, setting light input to be equal in all regions of the seedling, etc.). Then they claim here that "...these results show that the assumptions of local cell-to-cell coupling and differential light sensitivity between regions are the key aspects of our model that allow a match to experimental data." - I don't see how this follows from the observation almost any of the variations lead to the same behaviors in this section (spatial waves). Show the reasoning in the text here.

      We observed spatial waves with different local coupling regimes (4 v. 8 nearest neighbours). However, we did not observe spatial waves with global coupling (S10 Fig). This led us to conclude that local coupling is a key aspect. In addition, we do not observe waves when setting the light input to be equal in all regions of the seedling (S11 Fig). This confirms that local differences in light sensitivity are also required in our simulations to generate spatial waves. We will clarify these points with revisions to the text.

      5.Pgs. 9-10: Section on "Cell-to-cell coupling maintains global coordination under noisy light-dark cycles": The simulation results rigorously support the authors' main conclusion here, which is that local cell-to-cell coupling allows for global coordination under noisy LD cycles. But I'm missing an intuitive explanation (or just any explanation) for why this is. At the end of this section, the authors should provide some intuition or qualitative explanation for the observations that they produced using their model in this section.

      We will revise the text to provide an intuitive explanation of these results. The coupling decreases the within-region phase differences. Despite the between-regions phase differences persisting, this effect is sufficient to improve the overall global synchrony.

      6.Lines 261-262: Replace the present tenses with past tenses.

      Thank you for your correction.

      7.Is the main idea that cell-to-cell coupling allows for averaging of fluctuations, between organs or cells within the same organ, while allowing for coordination of the average quantities? Is this responsible for both the flexibility and robustness observed under noisy environmental cycles?

      The cell-to-cell-coupling allows for the averaging of fluctuations between cells and the regional flexibility arises from the different light sensitivities in each region. What was interesting to us was that under light-dark cycles the regional flexibility was not lost due to either the noise in the light or the averaging effect of the cell-to-cell coupling. We will revise the text to emphasize these points. Thank you for your prompts.

      8.Line 304: Is it really true that the mammalian circadian rhythm is centralized? Don't some parts of our bodies have different circadian clock (e.g., slight differences in phase) than some other parts of our bodies?

      There are indeed some small phase differences between parts of our bodies because the mammalian system, like the plant system, is imperfectly coupled. However, the mammalian system is considered more centralized because the suprachiasmatic nucleus in the brain receives the key entraining signal of light and then coordinates rhythms across the body (Bell-Pedersen et al., 2005, Nat Rev Gen; Brown & Azzi, 2013, Circadian Clocks). We will expand on these interesting points by adding a paragraph to the discussion.

      Reviewer #1 (Significance):

      **Overall assessment:**

      I enthusiastically recommend this work for publication after the authors address my comments below (please see "Specific points").

      The model's main strength is that the authors could vary each ingredient separately - light sensitivity of each cell/organ, which gene's mRNA diffuses between cells, cellular noise, local versus global cell-cell coupling, etc. Afterwards, the authors could determine which of these variations produces which experimentally observed behaviors. Another strength of the model is that it can reproduce not just one, but numerous, experimentally observed behaviors that are important for understanding circadian clocks in plants. Thus, the model is grounded in experimental truth and produces experimentally observed results. Crucially, since the authors could vary every single element in the model independently of the other elements, the authors are able to provide plausible explanations for why the experiments produced the results that they did (experimentally, a number of confounding factors prevented one from pinpointing to which element produced which observation).

      Another strength of the model is also extendable, by other researchers to investigate other plant physiologies in the future (e.g., circadian clock's influence on cell division). The authors highlight these future uses in the discussion section. Therefore, I believe that this work will be valuable to plant biologists, non-plant biologists who are interested in circadian clocks, and systems biologists in general.

      The manuscript is also well written and relatively easy to follow, even for non-plant biologists like myself.

      Thank you for the positive feedback - we are pleased that you find the manuscript of broad interest to a range of readers.

      Comment on Reviewer #2:

      I agree with his/her major criticism #3 (ELF4 long-distance movement). I find this to be a reasonable request. Fulfilling it would increase the paper's impact.

      Please see our response to reviewer #2.

      Comment on Reviewer #3:

      The reviewer's point (1) asks for a reasonable request.

      Regarding his/her point (2): This is also reasonable. I'd recommend his/her suggestion (a). In the end, I'd be interested to see how the authors respond to this (what function they choose to let adjacent cells be subjected to some correlated light-input intensity. I'd be happy with something simple such as + noise, where is a deterministic term that, for example, decreases exponentially as one moves away from some central cell. Basically, I'd let the authors decide how to implement this and accept their current implementation - no correlation in light-intensity between adjacent cells - as an extreme scenario, as this reviewer points out.

      Please see our response to reviewer #3.

      Reviewer #2 (Evidence, reproducibility and clarity):

      **Summary:**

      The manuscript presents an improved model of the circadian clock network that accounts for tissue-specific clock behavior, spatial differences in light sensitivity, and local coupling achieved through intercellular sharing of mRNA. In contrast to whole-plant or "phase-only" models, the authors' approach enables them to address the mechanism behind coupling and how the clock maintains regional synchrony in a noisy environment. Using 34 parameters to describe clock activity and applying the properties mentioned above, the authors demonstrate that their model can recapitulate the spatial waves in circadian gene expression observed and can simulate how the plant maintains local synchrony with regional differences in rhythms under noisy LD cycles. Spatial models that incorporate cell-type-specific sensitivities to environmental inputs and local coupling mechanisms will be most accurate for simulating clock activity under natural environments.

      Thank you for your assessment of our work. We plan to make the following revisions based on your feedback.

      *We have the following **major criticisms** as follows*

      1) When assigning light sensitivities in different regions of the plant, the authors assign a higher sensitivity value to the root tip (L=1.03) than they do to the other part of the root (L=0.90). We are curious why the root tip would have higher light sensitivity than the rest of the root. Is this based on experimental data (if so, please cite in this section or methods)? It seems that these L values were assigned simply to make sure they recapitulated the period differences observed in Fig. 2A. Are these values based on PhyB expression in those organs? Or perhaps based on cell density in those locations?

      We assign the light sensitivity to match observed experimental period differences across the plant (Fig 2A,B). This is based on previous experiments demonstrating that experimental period differences are dependent on light input through the light sensing gene PHYB (Greenwood et al., 2019, PLoS Bio; Nimmo et al., 2020, Physiologia Plantarum). For example, in WT seedlings, the root tip oscillates faster than the root, but this difference is lost in the phyb-9 mutant (Greenwood et al., 2019). Thus, we assume the root tip to be more sensitive to light than the roots.

      Further supporting this assumption, there is evidence that expression of phytochromes and cryptochromes are increased in the root tip relative to the root (e.g., Somers & Quail, 1995, Plant J; Bognar et al., 1999, PNAS; Toth et al., 2001, Plant Physiol), as the reviewer proposes. However, further experiments would be needed to verify that these differences in expression are what lead to the differences in clock timing. We will add a discussion of these experiments to the text.

      2) In the discussion of the test where they set the "light inputs to be equal" in all regions to simulate the phyb-9 mutant, could the authors please clarify whether that means they set the L light sensitivity value equal in all regions?

      This is indeed what we mean, we will rephrase the text for clarity.

      a. If they are referring to setting the L value equal to all regions, we suggest that this discussion be moved to the section about different light sensitivities instead of the local sharing of mRNA section.

      Thank you for your suggestion, we agree and will move this discussion.

      b. Additionally, is it possible to set the light sensitivity to zero for all parts of the plant? We think this would be more suitable to simulate the phyb-9 mutant phenotype.

      We thank the reviewer for this suggestion. We will include a simulation with light sensitivity set to zero in the revised manuscript, in addition to the existing simulations with light sensitivity set to 1.

      3) Based on the recent Chen et al. (2020) paper showing ELF4 long-distance movement, we think it would be of great interest for the authors to model ELF4 protein synthesis/translation as the coupling factor, in addition to the modeling using CCA1/LHY mRNA sharing. We understand you may be saving this analysis for a future modeling paper, but this addition to the paper could increase the impact of this paper.

      Thank you for the suggestion to improve our manuscript. We agree it will be of interest to model ELF4 protein as the local coupling factor. In the revision, we will simulate each clock protein (including ELF4) as the local coupling factor and compare.

      In addition, we will also modify the coupling mechanism to simulate the long-distance transport of ELF4 proposed by Chen et al., 2020. Our preliminary simulations show that we can couple shoot rhythms to those in the root tip, but that this long range coupling can not on its own generate the spatial structure observed in experiments. We agree with the reviewers that this analysis and an associated discussion will further increase the impact of the paper.

      4) This model is able to simulate circadian rhythms under 12:12 LD cycles, which represents two days of the year-the equinoxes. We are curious if the model can simulate rhythms under short days and long days as well. We understand this analysis may be outside the scope of this paper and may require changing the values of the 34 parameters used but think it could be a useful addition here or in future work.

      We agree it would be interesting to observe the behavior of the model under different day lengths. We will include simulations under short and long days in the revision.

      *And **minor criticisms** as follows*

      1) In the first paragraph of the results section, it would be helpful for the authors to reference Table S1 when they mention the 34 parameters used to model oscillator function

      We agree and we will implement this helpful suggestion.

      2) In the first paragraph of the section titled "Local flexibility persists under idealized and noisy LD cycles", it would be helpful for the authors to reference S12 Fig after the last sentence that starts "However, ELF4/LUX appeared more synchronized..."

      We agree and we will implement this helpful suggestion.

      3) In the first paragraph of the section titled "Cell-to-cell coupling maintains global communication under noisy light-dark cycles", the authors refer to a "Table 1" but I think they mean to refer to Table S1"

      Thank you, we will implement this helpful suggestion.

      4) In Fig. 1, panel C is described as demonstrating the cell-to-cell coupling through the "level of CCA1/LHY". This phrasing is vague and we think could be improved to the "mRNA level of CCA1/LHY".

      We agree and will implement this helpful suggestion.

      Reviewer #2 (Significance (Required)):

      This work would be broadly interesting to other researchers studying cell-to-cell signaling and coupling of circadian rhythms in plants and other species where spatial waves of gene expression have been observed (i.e., mice and humans). Additionally, the computational modeling aspect of this work was easily interpretable for someone outside this expertise. Our expertise lies in plant circadian biology.

      We thank the reviewer for recognising the broad appeal of our work.

      Reviewer #3 (Evidence, reproducibility and clarity):

      **Summary:**

      The authors start by taking a previously published model of the plant circadian clock and implement five changes: 1) updating the network topology to reflect some recent experimental findings, 2) make a spatial model loosely based on a seedling template 3) introduce coupling between cells based on shared levels of CCA1/LHY 4) randomly rescale time in each cell to induce inter-cell differences in period, 5) include a light sensitivity that depends on the region considered.

      For a certain configuration of light sensitivities/intensities, the different periods of oscillations in each seedling region roughly match that of experiments. With a sufficiently high coupling between cells, the system can also generate spatial waves, which are also observed in the experimental system.

      With pulsed light inputs the spatial pattern is still produced. The authors then investigate the robustness to environmental noise by generating stochastic light signals and show that the global synchrony, as measured with a synchronisation index, increases with cell-to-cell coupling strength. The paper is overall well-written, and the background and details of the analysis are well presented.

      Thank you for your assessment of our work. We plan to make the following revisions based on your feedback.

      **Major comments:**

      For the first part of paper, the output of the model is certainly the focus. There is virtually no discussion of the inferred parameters and how much confidence the authors have in their values.

      Thank you for this point. We will add discussion of the inferred parameters to the initial part of the results.

      My main issue with the paper is about the section with noisy light signals, which is included in the title and is ultimately one of the main themes of the article.

      Specifically, on line 224:

      "This decrease in cell-to-cell variation revealed an underlying spatial structure (Fig 4D, middle and right, and S13 Fig), comparable to that observed under idealized LD cycles (Fig 4B, middle and right, and S12 Fig)."

      Firstly, I don't feel these conclusions match with the data presented. Comparing figure 4D middle and right with figure 4B middle and right shows a clear and pronounced loss in spatial structure. In its current form, this statement has to change, but I believe there are at least two other major issues with this figure:

      We agree there are some differences in the spatial structure between idealized (Fig 4B) and noisy (Fig 4D) LD cycles. Preliminary simulations suggest that this is due to the way the noisy LD cycles are programmed.

      In the current implementation of noisy LD cycles, the maximum intensity of L, L**max, differs between each region, such that relative differences in light sensitivity between regions are maintained. This means that some phase differences between regions are maintained. However, as the reviewer correctly points out in point 1 below, due to the noise fluctuations, the average level of light is lower than under idealized LD cycles, and with considerable day-to-day variation. We believe this is why the spatial structure differs.

      Preliminary simulations suggest that if we normalize the mean light intensity such that the mean is equal between the two conditions (as the reviewer suggests in point 1 below), the spatial structure appears similar. We will present this analysis in the revision.

      1) The figure is clearly designed to invite a comparison between the noise-free light cycles on the left with the noisy cycles on the right. However, due to how the noisy light is simulated, the variance of light signal increases AND the average intensity of light decreases by 50%. When comparing the left and the right, we therefore don't know whether the changes are due to differences in the average signal or differences from the stochasticity. I think the authors should simulate a noisy light signal with the same mean intensity level as the deterministic signal.

      As discussed above, we agree that the average intensity of the light decreases due to the noise, and this complicates interpretation. We will simulate idealized and noisy light cycles with the same mean light level upon revision.

      2) The noise model for the light doesn't seem realistic. On line 484 is says:

      "We made the simplifying assumption that each cell is exposed to an independent noisy LD cycle due to their unique positions in the environment. LD cycles were input to the molecular model through the parameter L".

      In fact, this could be considered as an incredibly complex signal, because for 800 cells it means drawing 800 random light signals. The implication is that two adjacent cells receive statistically independent light signals. Depending on chance, one cell might receive tropical levels of light while its neighbour experiences a cloudy day. This affects the interpretation and conclusions from figures 4 and 5. I propose two different ways of improving the simulation of the noisy light signal:

      a) In one extreme case, all cells receive the same noisy light signal, and the other extreme, they all receive independent signals. You could consider a mixture model of light signals, where each cell receives \lambda L_global(t) + (1-\lambda) L_individual(t), where L_global(t) is a global light signal that is shared by all cells and L_individual(t) is a light signal unique to an individual cell. The mixing parameter \lambda controls how similar the light signal is between cells

      b) Clearly the light signal will differ depending on the region, but there will be some spatial correlation. You could also consider methods of simulating light such that neighbouring cells receive correlated signals, although this might be difficult.

      Thank you for your proposals. We agree that our current implementation of noisy LD cycles represents an extreme scenario. Given that there is no environmental data at sufficient resolution to reliably evaluate which implementation is most realistic, we will explore different approaches based on your suggestions and present them in our revision.

      Assuming that the problem with the mean signal is corrected, do you expect the average spatial pattern to be the same between figure 4 B and D with no coupling (J=0) (although an increase in the variance between cells)? Perhaps not (owing to nonlinearities in the system), but it would be interesting to comment.

      We agree that the decreased light intensity complicates interpretation of the spatial structure. Although in the current implementation relative light differences between regions are maintained, the spatial structure is altered because the mean intensities are lower. Preliminary simulations with the mean intensity fixed do result in spatial patterns more similar to that seen in Fig 4B, but with increased variance. Comprehensive simulations will be included in the revised manuscript.

      The different periods in the different regions of the seedling are caused by differences in light sensitivity, which the authors claim is justified from refs 12-15. An alternative hypothesis is the that biochemical parameters such as degradation rates are different between regions. This is briefly alluded to in the introduction, but I think it would be interesting to discuss further. What would be the pros and cons of the two different mechanisms?

      We agree that an alternative hypothesis is that biochemical parameters such as degradation rates may differ between regions. Experimental evidence, however, more supports the light sensitivity hypothesis. This is because, for example, mutations in light signalling remove the spatial differences between regions. We agree though that this is an important point, and will add a paragraph to the discussion discussing the pros and cons of the two different mechanisms.

      I understand that the authors used a pre-existing model, but I must say that I find the way that light is incorporated into the model a bit confusing.

      On line 345 it says:

      "L(t) represents the input light signal (L = 0, lights off; L > 0, lights on) and D(t) denotes a corresponding darkness input signal (D = 1, lights off; D = 0, lights on)."

      Surely the only thing that matters biophysically is the number of photons hitting the plant? Could you explain why the model needs to have a separate "darkness signal" compared to just a single light signal?

      A darkness signal has been introduced in many circadian clock models because degradation rates of the clock genes can depend upon the light or dark condition. We agree with the reviewer that we should explain this clearer in the text.

      In the model, the light intensity changes depending on the region. It might make more sense for interpretability if instead there is an additional light-sensitivity coefficient that depends on the region, because at the moment I'm not sure what units L(t) is supposed to take.

      Thank you for your suggestion. We will try to implement this approach.

      **Minor comments**

      Could you more explicitly describe a possible molecular mechanism through which the coupling acts?

      Thank you for your suggestion. We will more explicitly discuss likely transport mechanisms in the text.

      In Figure 1C it looks like different genes are coupling to different genes, so you may need to rearrange it.

      In our model, the level of CCA1/LHY is shared. Thus, CCA1/LHY from one cell can be considered to repress the expression of other interacting genes in the neighbour cell.

      Line 103: "We found that regional differences persist even under LD cycles, but cell to-cell minimized differences between neighbor cells." Missing word.

      Thank you for your correction.

      Line 124: "The coupling strength was set to 2 (Methods)." This is meaningless in isolation, so it would be better to briefly explain what the coupling parameter is before mentioning its value.

      Thank you for your suggestion, we will describe the coupling function in more detail.

      Through the text, I think De Caluwe should be corrected to De Caluwé

      Thank you for your correction.

      Typo line 493

      Thank you for your correction.

      Code and data are not made available.

      Model code will be made available from our project GitLab page: https://gitlab.com/slcu/teamJL/greenwood_tokuda_etal_2020

      Output of analysis of experimental data and simulations will also be made available on the GitLab page.

      Reviewer #3 (Significance (Required)):

      The authors motivate the paper by highlighting that their proposed model improves on phase-based models in that it describes underlying molecular mechanisms.

      From an experimental side, it's interesting that a model is developed and directly compared with measured spatio-temporal waves of gene expression. From a theoretical side, the authors address questions relating to oscillations, multi-scale modelling and noise robustness that also generalise to other systems. I therefore expect that both experimental and theoretical audiences will be interested in the results.

      There are many possible additions and modifications that could be made to the model, and so the model and analysis could provide a platform for future research. However, I can't comment on whether there are similar pre-existing models of the plant circadian clock that contain both a molecular description of the circadian clock as well as a spatial scale.

      We appreciate the reviewer’s view that the work is interesting to both experimental and theoretical audiences.

      Comments on Review #1:

      The time is rescaled in each cell, meaning that each cell has a unique period, but the dynamics remain deterministic and hence the peak-to-peak times will be exactly the same for each cell. I imagine this isn't completely consistent with single-cell data (if available), where peak-to-peak times are very likely to be variable due to noisy gene expression. In a future paper it would be interesting to analyse the system using stochastic differential equations.

      Please see our response to reviewer #1.

      Comments on Review #2:

      I agree on the following two points:

      1) It would add value to discuss whether the different ranking of light sensitivities by organ matches any available experimental data.

      Please see our response to reviewer #2.

      2) As the Reviewers point out, there are many possibilities for testing the robustness of the system to light clues, including varying the length of the day. Although outside of the scope of this paper, I wonder if it's possible to find data from a light sensor measuring light intensity across an entire year? Plugging such data into the model and measuring how the amplitude and period changes would be really interesting, in my opinion.

      Thank you for your suggestion. We also see this as an interesting future direction.