6,040 Matching Annotations
  1. Last 7 days
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      SUMMARY OF THE PRESENTED FINDINGS

      Abstract

      1. LCOR (Ligand-dependent corepressor), which suppresses tumor growth by inducing the antigen presentation machinery (APM) of the tumor cells and constrains cellular plasticity.
      2. poly β-(amino esters) (pBAE) nanoparticles (NPs).. Our results show optimal endosomal escape, which results in high transfection efficiency in vitro and in vivo
      3. the combination of Lcor mRNA-loaded NPs with anti-PDL1 or anti-CTLA4 immunotherapies eradicated most of the tumors in our preclinical TNBC model.

      Introduction

      a. These structures facilitate endosomal escape due to protonation of tertiary amines at lower pH7.

      Results

      b. In human models MDAMB-231 and MCF7 cells, the NPs also showed high eGFP mRNA transfection efficiency

      c. The efficiency of eGFP mRNA-loaded pBAE-NPs to transfect mRNA into different mouse breast cancer cells (AT3, 4T07, EO771, EMT6, 66cl4, EpRAS, and 4T1) was tested using NPs encapsulating eGFP mRNA,

      d. Synthetic Lcor mRNA contained a Cap1, 5' and 3' untranslated regions (UTR) and a standard polyA tail (Fig. S2A), and all uracil were replaced for 5-methoxyuracil (5-moU) to avoid immunogenic reactions27,28. First, we measured and detected high levels of Lcor mRNA by qRT-PCR

      e. NPs were stable at 25ºC for 24 h (Fig. S2C). In contrast, under conditions simulating the physiological environment (37ºC), a decrease in FRET signaling was detected ... indicating disassembly of the NPs after 2 h (Fig. S2C).

      f. Lcor mRNA NPs, induces the expression of APM genes in AT3 and 4T07 cell lines

      g. AT3 cells that constitutively overexpress ovalbumin (OVA). In these cells, OVA is cleaved, generating the SIINFEKL antigen peptide presented in the H-2Kb context. This can be used to measure APM activity using the anti-SIINFEKL antibody via flow cytometry.

      h. We also observed a time- and dose-dependent effect regarding APM induction.

      i. When tumors reached 0.5 x 0.5 cm2, we treated them intratumorally with pBAE-NPs loaded with 5 ug of synthetic FLuc or eGFP mRNA. We detected BLI at 3 h, meaning that tumor cells had taken up the mRNA-loaded NPs and translated a luciferase active protein within 3 h. In both models, expression peaked around 6 to 10 hours after administration

      j. After local administration of 5 μg of Lcor mRNA-loaded NPs, we observed a rapid increase in Lcor mRNA in the tumor tissue, followed by a decrease, reaching baseline levels after 24 h (Fig. 3C). ..To unravel the protein dynamics, we used ... LCOR-HA protein and uniquely detect the ectopic protein using anti-HA by IF. As expected, LCOR-HA protein expression was delayed, peaking 3 h after administration (Fig. 3D). Linked to protein expression, at 3 h and 6 h after administration, we detected an increase in APM genes by RT-qPCR (Fig. 3E and S3D).

      k. the combination of Lcor mRNA-loaded NPs with anti-PDL1 therapy not only reduced tumor growth but also led to tumor eradication in 5 out of 7 mice.

      l. The combination of Lcor mRNA-loaded NPs with different ICIs showed high efficiency in preclinical models, thus supporting the feasibility of starting clinical studies and thus bringing the treatment closer to patients.

      Major points

      L. 277: "NPs were stable at 25ºC for 24 h (Fig. S2C). In contrast, under conditions simulating the physiological environment (37ºC), a decrease in FRET signaling was detected ... indicating disassembly of the NPs after 2 h (Fig. S2C)." - The disassembly of the NPs after 2 h is key to the performance of the chosen approach.

      L. 296: "The results showed an increased number of cells with higher OVA-SIINFEKL presentation, indicating the enhanced activity of the APM induced by the Lcor mRNA-loaded pBAE-NPs... demonstrate the efficiency of this mRNA nanotechnology to rescue the function of the LCOR TF in inducing tumor cell immunogenicity and thus modulating tumor phenotypes." - There is a key difference between activating antigen-presenting machinary and inducing immunogenicity, i.e. recognition by the immune system and activation of effector cells. There is no indication on how effective endogenous immune responses (e.g. antibody titers, TIL infiltration, cytokine release) are to the administration of Lcor mRNA-loaded NPs.

      L. 325: "Based on these results, we estimated an optimal therapeutic regimen of Lcor-mRNA-loaded pBAE-NPs administration in our preclinical experimental models would be every 3 days." - It is highly unclear how the authors came to this conclusion, as it should be based on the time frame of optimal immune responses.

      L. 332: "Lcor mRNA-loaded NPs were administered at a dose of 250 μg/kg by intratumoral (i.t.) injection twice a week" - This possibly is the strongest limitation of this study. Intratumor injections of largely unfeasible/unrealistic in clinical setting. Even more, the management of metastatic disease appears out of question.

      L. 337: "the results revealed that Lcor mRNA monotherapy was enough to reduce 4T07 tumor 338 growth." - These effects appear rather limited (Fig. 4A,B) and are not statistically significant in Fig. S4B and Fig. S5A.

      L. 338: "the combination of Lcor mRNA-loaded NPs with anti-PDL1 therapy not only reduced tumor growth but also led to tumor eradication in 5 out of 7 mice" - Fig. 4A bottom left panel. Three of the tumor growth curves abruptly stop at below 200 mm3. Typically, this is mouse death. This reduces the tumor pool to four xenografts. Among these, we notice two complete responses and two tumor progressions. Two tumor progressions are seen also in the combination Lcor mRNA+ α-PD-L1 group. We are unsure about the statistics of this experiment.

      L. 350: "The combination of Lcor mRNA-loaded NPs with different ICIs showed high efficiency in preclinical models, thus supporting the feasibility of starting clinical studies and thus bringing the treatment closer to patients."

      • Please see comment on L. 332. It appears unrealistic to consider clinical studies in patients unless a systemic administration of Lcor mRNA-loaded NPs is tackled and corresponding therapeutic efficacy is shown.

      Significance

      General assessment:strengths and limitations.

      The identification of a candidate therapeutic means, by supplying Lcor mRNA for induction of antigen-presenting molecules is of potential interest. As this is not a basic science study, but aims at developing feasible therapeutics, it falls short in this respect, as most likely unfeasible in patients. The combined effect with anti-immune blockade agents is of interest. However, if one assumes that effective immunostimulation was indeed induced by Lcor mRNA, its overall impact on tumor growth is per se weak, if any. Maybe only antigen presentation is induced, but this is in the absence of costimulatory signals? This needs to be investigated.

      Advance

      This article is based on good papers that were published years ago. The science novelty is limited. As the idea is to develop a novel therapeutic approach, the lack of realistic feasibility severely limits merits.

      Audience

      Scientists involved in preclinical studies.

      Reviewer expertise

      This reviewer and his research group have cloned the genes and biochemically characterized novel tumor drivers. He identified their function as stimulators of tumor cell growth and of metastatic spreading, together with roles in cell-cell adhesion, signal transduction and local cancer invasion. This led to the discovery of their prognostic / predictive relevance in human cancer. Two murine models of rare genetic diseases were generated by ablating the corresponding murine genes. He then pioneered the development of software for the identification of fusion oncogenes and of transcription factor-DNA binding sites. This reviewer fostered novel anti-cancer immunotherapies. He generated anti-cancer cytotoxic T lymphocytes, by the use of in vitro engineered antigen presenting cells. Using proprietary discovery platforms, this reviewer developed novel anti-cancer monoclonal antibodies, that selectively target cancer cells. This led to the engineering of humanized antibody-drug conjugates, bispecific anti-CD3/activated Trop-2 antibodies and innovative CAR-T designs. ADCs are now being tested in clinical trials in cancer patients.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Serra-Mir et al investigate the therapeutic potential of delivering the mRNA of LCOR transcription factor via nanoparticles to enhance the efficacy of immune checkpoint inhibitors. The authors show that the mRNA delivery mediated by H and R-nanoparticles was efficient in multiple breast cancer cell lines in vitro. Moreover, using mouse models, they show that LCOR mRNA delivery may improve the efficacy of the treatment with anti-PDL1 or anti-CTLA4 checkpoint inhibitors against tumors. Although this proof-of-concept study has promising aspects, there are significant weaknesses that should be addressed. Details below.

      Major points:

      1. In vitro delivery of LCOR appears to be effective in both AT3 and 4T07 cell lines when continuously exposed to the mRNA loaded nanoparticles. However, the impact of LCOR on antigen presentation machinery (APM) is rather mixed and not very convincing. The expression pattern and kinetics of several APM genes are inconsistent with LCOR kinetics and at several timepoints the expression in LCOR samples is essentially the same as in mutant LCOR negative controls (Figure 2D). Moreover, the APM reporter assay experiments show that APM in LCOR transduced 4T07 cells is induced rather modestly at best (Figure 2E). The APM effect needs to be demonstrated more rigorously to be convincing.
      2. Considering that previous studies by the authors suggest a role for LCOR in regulating stem cell properties in normal and malignant mammary cells (Celia-Terrassa et al Nat Cell Bio 2017; Perez-Nunez et al Nat Can 2022), it is important to address whether transduced LCOR mRNA impacts these properties. Moreover, other autocrine cell functions such as proliferation and apoptosis are also relevant and should be analyzed.
      3. The impact of LCOR delivery on immune responses in mouse models could be more rigorous. Analysis of APM genes shows rather modest difference in these gene after LCOR transduction (Figure 3E). Is this sufficient to induce effective anti-tumor immune response? What is the status of T cell activity or exhaustion? Furthermore, LCOR may regulate cytokines and chemokines that are critical for modulation of the immune environment. Did the authors measure any immune-modulating cytokines in the tumor microenvironment, following LCOR expression? Finally, whereas the study focuses on APM and its function, LCOR may directly modulate expression of checkpoint activators on cancer cells. The impact of LCOR transduction on PD-L1, PD-L2 and CTLA-4 expression in cancer cells should be determined.
      4. In line with point nr 2, it would be important to analyze the impact of delivered LCOR mRNA on cell functions such as proliferation and apoptosis in the mouse tumors. Even if LCOR delivery sensitizes tumors to checkpoint inhibitors, it cannot be assumed that the impact of LCOR is primarily due to induction of the APM.
      5. The experiments analyzing treatment efficacy in the 4T07 model in mice show lack of consistency and a substantial variation between mice that are treated in the same manner. Even the group treated with PBS and Ctr-mRNA contains mice with tumors that regress (Figure 5A). This inconsistency suggests that more mice are required to generate a convincing pattern. Furthermore, the inclusion of a second model would provide a stronger case for a broad applicability of the LCOR treatment with checkpoint inhibitors. Indeed, it is surprising that the authors did not use the AT3 model in vivo considering that mRNA delivery and LCOR expression is substantially more efficient in AT3 compared to 4T07.
      6. Following the injection of LCOR nanoparticles to the tumor, the proportion and spatial distribution of LCOR expressing cells should be determined. This is particularly relevant in light of the almost complete elimination of the tumors treated with combination therapy (Figures 4 and 5). Is this striking impact on tumors in spite of mRNA being delivered only to a small portion of cells within the tumor?
      7. The in vivo results indicate that expression levels of Fluc mRNA decline rapidly post-treatment, returning to baseline within 24 hours after peaking at 10 hours (Supplementary Figure 3). Although the investigators treat mice every 3rd day with LCOR nanoparticles in their therapeutic experiments, the analysis of durability of immune responses after single injection should be done and can provide important practical insights to guide therapeutic design.

      Minor points:

      1. The authors mention that LCOR mRNA delivery synergizes with checkpoint inhibitor treatment. However, synergy has a specific meaning when drug interaction is analyzed. This was not really addressed or calculated.
      2. There seems to be a mistake in the text (lines 261-263). Based on Figure 1C the mRNA delivery efficiency is higher in AT3 cells compared to 4T07 cells (very difficult to determine anything from Figure 3D, since the cell density is not visible).
      3. It is surprising how little expression of luciferase is observed in the 4T07 model (Figure S3), even if almost 60% of cancer cells and 40% of stromal cells are positive (Figure 3A). What could explain this discrepancy?
      4. Representative FACS plots from Figure 3 should be shown.
      5. There are issues with the figure legends of Figure 3 (from 3C onwards) and Figure S2 (from 2D onwards) that need to be fixed.

      Significance

      The study is a proof-of-concept investigation addressing whether LCOR mRNA can be delivered by nanoparticles to sensitize tumors to immunotherapy. This approach aims to overcome the limitations and difficulties of targeting transcription factors for therapeutic purposes. However, although the delivery of LCOR mRNA appears to be sufficient, further characterization of the resulting impact needs to be done. This includes both impact on immune responses as well as cell-autonomous impact on cancer cell proliferation and apoptosis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02946

      Corresponding author(s): Margaret, Frame

      Roza, Masalmeh

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We thank the reviewers for recognizing the significance of our work and for their constructive feedback and suggestions, most of which we have implemented in our revised manuscript.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1

      Evidence, reproducibility and clarity

      Review of Masalmeh et al. Title: "FAK modulates glioblastoma stem cell energetics..."

      Previous studies have implicated FAK and the related tyrosine kinase PYK2 in glioblastoma growth, cell migration, and invasion. Herein, using a murine stem cell model of glioblastoma, the authors used CRISPR to inactivate FAK, FAK-null cells selected and cloned, and lentiviral re-expression of murine FAK in the FAK-null cells (termed FAK Rx) was accomplished. FAK-/- cells were shown to possess epithelial characteristics whereas FAK Rx cells expressed mesenchymal markers and increased cell migration/invasion in vitro. Comparisons between FAK-/- and FAK Rx cells showed that FAK re-expressed increased mitochondrial respiration and amino acid uptake. This was associated with FAK Rx cells exhibiting filamentous mitochondrial morphology (potentially an OXPHOS phenotype) and decreased levels of MTFR1L S235 phosphorylation (implicated in mito morphology fragmentation). Mito and epithelial cell morphology of FAK-/- cells was reversed by treatment with Rho-kinase inhibitors that also increased mito metabolism and cell viability. Last, FAK-dependent glioblastoma tumor growth was shown by comparisons of FAK-/- and FAK Rx implantation studies.

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      Some questions that would enhance potential impact. 1. Cell generation. Please describe the analysis of FAK-/- clones in more detail. The "low viability" phenotype needs further explanation with regard to clonal expansion and growth characteristics?

      Response:

      • We included a better description and a supplementary figure in our revised manuscript to indicate that we have examined several FAK -/- clones and confirmed that our observations were not due to clonal variation; multiple clones displayed similar morphological changes (Figure S1D). We also show that the elongated mesenchymal-like morphology was observed at 48 h after nucleofecting the cells with the FAK‑expressing vector, before beginning G418 selection to enrich for cells expressing FAK (Figure S1C). We also included experiments to acutely modulate FAK signalling (detaching and seeding cells on fibronectin) (Figure S2D, E, F and Figure S3) to exclude the possibility that the profound effects are due to protocols/selection we used for generating FAK-deleted cells.
      • Regarding the term “low viability”, we have clarified in the text that there is no significant difference in cell number (Figure S1A) or ‘cell viability’ when it is assessed by trypan blue exclusion (a non-mitochondria-dependent read-out) (Figure S1B) between FAK-expressing FAK Rx and FAK-/- cells cultured for three days under normal conditions. Therefore, we agree the term ‘cell viability’ in this context could be confusing and have replace "cell viability” with “metabolic activity as measured by Alamar Blue.” in Figure 1D and Figure 5B, and the corresponding text in the original manuscript. This wording more accurately reflects the data.

      Figure 1F: need further support of MET change upon FAK KO and EMT reversion.

      Response: We have added a heatmap (Figure S1E) illustrating the changes in protein expression of core-enriched EMT/MET genes products (by proteomics) after FAK gene deletion (EMT genes as defined in Howe et al., 2018) ; this strengthens the conclusion that the MET reversion morphological phenotype is accompanied by recognised MET protein changes.

      Fig. 2: Need further support if FAK effects impact glycolysis or oxidative phosphorylation in particular as implicated by the stem cell model.

      Response: We show that FAK impacts both glycolysis (Figure 2A, 2E, and 2F) and mitochondrial oxidative phosphorylation on the basis of the oxygen consumption rate (OCR) (Figure 2B, and 2D), showing both are contributing pathways to FAK-dependent energy production. We have clarified this in the text.

      Is there a combinatorial potential between FAKi and chemotherapies used for glioblastoma. Need to build upon past studies.

      Response: Yes, previous studies suggest that inhibiting FAK can sensitize GBM cells to chemotherapy (Golubovskaya et al., 2012; Ortiz-Rivera et al., 2023). We have included a paragraph in the discussion section to make sure this is clearer. Although it is not the subject of this study, we appreciate it is useful context.

      The notation of changes in glucose transporter expression should be followed up with regard to the potential that FAK-expressing cells may have different uptake of carbon sources and other amino acids. Altered uptake could be one potential explanation for increase glycolysis and glutamine flux.

      Response: We agree with the reviewer that glucose uptake could be contributing and we include data that 2 glucose transporters are indeed FAK-regulated namely Glucose transporter 1 (GLUT1, encoded by Slc2a1 gene) and Glucose transporter 3 (GLUT 3, encoded by Slc2a3 gene) (shown in Figure S2B and C).

      It would be helpful to support the confocal microscopy of mitos with EM.

      Response:

      We are concerned (and in our experience) that Electron microscopy (EM) may introduce artefacts during sample preparation. In contrast, immunofluorescence sample preparation is less susceptible to artefacts. The SORA system we used is not a conventional point-scanning confocal microscope, but is a super-resolution module based on a spinning disk confocal platform (CSU-W1; Yokogawa) using optical pixel reassignment with confocal detection. This method enhances resolution in all dimensions with resolution in our samples measured at 120nm. This has been instructive in defining a new level of changes in mitochondrial morphology upon FAK gene deletion.

      Lack of FAK expression with increased MTFR1 phosphorylation is difficult to interpret.

      Response: We do not directly show that this phosphorylation event is causal in our experiments; however, we think it important to document this change since it has been published that phosphorylation of MTFR1 has been causally linked to the mitochondrial morphology we observed in other systems (Tilokani et al., 2022).

      Need to have better support between loss of FAK and the increase in Rho signaling. Use of Rho kinase inhibitors is very limited and the context to FAK (and or Pyk2) remains unclear. Past studies have linked integrin adhesion to ECM as a linkage between FAK activation and the transient inhibition of RhoA GTP binding. Is integrin signaling and FAK involved in the cell and metabolism phenotypes in this new model?

      Response: To better support the antagonistic effect of FAK on Rho-kinase (ROCK) signalling, we included a new experiment in which the integrin-FAK signalling pathway has been disrupted by treating FAK WT cells with an agent that causes detachment from the substratum, Accutase, and growing the cells in suspension in laminin-free medium. We present ROCK activity data, as judged by phosphorylated MLC2 at serine 19 (pMLC2 S19), relating this to induced FAK phosphorylation at Y397 (a surrogate for FAK activity) that is supressed after integrin disengagement. These measurements have been compared with conditions whereby integrin-FAK signalling is activated by growing the cells on laminin coated surfaces. We observed a time-dependent decrease in pFAK(Y397) levels (normalised to total FAK) in suspended cells compared to those spread on laminin, while pMLC2(S19) levels increased in a reciprocal manner over time in detached cells relative to spread cells (S4A and B). There is therefore an inverse relationship between integrin-FAK signalling and ROCK-MLC2 activity, consistent with findings from FAK gene deletion experiments. In the former case, we do not rely on gene deletion cell clones.

      Significance

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      __Response: __

      Deleting the gene encoding FAK in mouse embryonic fibroblasts leads to elevated Pyk2 expression (Sieg, 2000). However, in the GBM stem cell model we used here, Pyk2 was not expressed (determined by both transcriptomics and proteomics). We have included Figure S1E to show that PYK2 expression was undetectable in FAK -/- and FAK Rx cells at the RNA level (Figure S1F). We conclude that there is no compensatory increase in Pyk2 upon FAK loss in these cells. In the transformed neural stem cell model of GBM, we do not consistently or robustly detect nuclear FAK.

      Review #2

      Masalmeh and colleagues employ a neural stem/progenitor cell-based glioma model (NPE cells) to investigate the role of Focal Adhesion Kinase (FAK) in GBM, with a focus on potential links between the regulation of morphological/adhesive and metabolic GBM cell properties. For this, the authors employ wt cells alongside newly generated FAK-KO and -reexpressing cells, as well as pharmacological interventions to probe the relevance of specific signaling pathways. The authors´ main claims are that FAK crucially modulates glioma cell morphology, cell-cell and cell-substrate interactions and motility, as well as their metabolism, and that these effects translate to changes to relevant in vivo properties such as invasion and tumor growth.

      My main issues are with the model chosen by the authors.

      As per the methods section, generation of FAK-KO and -"Rx" NPE cells entailed protracted selection/expansion processes, which may have resulted in inadvertent selection for cellular/molecular properties unrelated to the desired one (loss or gain of FAK expression) and which may have had cascading effects on NPE cells. The authors nonetheless repeatedly claim the parameters they quantify, such as mitochondrial or cytoskeletal properties or metabolic features, to have directly resulted from FAK loss or reintroduction. Examples of such causal inferences are to be found in lines 123, 134/135, 165, 181. Such causal claims are, in my view, unsupported.

      Acute perturbation of FAK expression/activity, genetically or pharmacologically, followed by a rapid assessment of the processes under investigation, would be needed to begin to assess causality, even if acute genetic perturbations may be technically challenging as sufficient gene expression reduction or restoration to physiologically relevant levels may be hard to achieve.

      Response:

      We would like to first comment on the model we used here, which we think will clarify the validity of our approach. The model is a transformed stem cell model of GBM that was published in (Gangoso et al., Cell, 2021) and is now used regularly in the GBM field. As mentioned in the response to Reviewer 1, we have added text (page 4 and 5 in the revised manuscript) and a new supplementary figure (Figure S1D) clarifying that the morphological changes we observed were consistent across multiple FAK -/- clones, showing this was not due to any inter-clonal variability. We also added images showing that the morphological changes were apparent at 48 h after nucleofecting FAK -/- cells with the FAK‑expressing vector specifically (not the empty vector), prior to starting G418 selection to enrich for FAK‑expressing cells (Figure S1C), addressing the worry that clonal variation and selection was the cause of the FAK-dependent phenotypes we observed. We believe that our model provides a type of well controlled, clean genetic cancer cell system of a type that is commonly used in cancer cell biology, allowing us to attribute phenotypes to individual proteins.

      We have also carried out a more acute treatment by using the FAK inhibitor VS4718 to perturb FAK kinase activity and assessed the effects on glycolysis and glutamine oxidation after 48h treatment (Figure S2D, E and F). We found that treating the transformed neural stem cells (parental population) with FAK inhibitor (300nM VS4718) decreases glucose incorporation into glycolysis intermediates and glutamine incorporation into TCA cycle intermediates, consistent with a role for FAK’s kinase activity in maintaining glycolysis and glutamine oxidation.

      The employed pharmacological modulation of ROCK activity is the only approach that, given the presumably acute nature of the treatment, may have allowed the authors to probe the proposed functional links. The methods section of the manuscript does not however comprise details as to the duration of these treatments, which leaves open the possibility of long-term treatment having been carried out (data shown in Figure 5B refers to 72hr treatment).

      __Response: __

      We have added the duration of the treatment to the Methods section and Figure Legends, to clarify that cells were treated with ROCK inhibitors for 24h, before assessing the effects on mictochondria (Figure 4C, D, S4C and D) and glutamine oxidation (Figure 5A, and S5). For metabolic activity by AlamarBlue assay, cells were treated with ROCK inhibitors for 72h (Figure 5B).

      Even in the case of ROCK inhibitor experiments, it is however unclear if and how the effects on cell morphology and adhesion, mitochondrial organization and metabolic activity may be connected to each other and, if at all, to FAK expression.

      Given the above uncertainties due to the nature of the model and experimental approaches, it is hard to assess the reliability and thus the relevance of the findings.

      Response:

      FAK suppresses ROCK activity (as judged by pMLC2 S19, Figure 4A and B). Treating FAK -/- cells with two different ROCK inhibitors restored mesenchymal-like cell morphology, mitochondrial morphology and glutamine oxidation. As mentioned above, to strengthen our evidence for the antagonistic role of FAK in ROCK-MLC2 signalling, we have now introduced an experiment whereby integrin-FAK signalling was disrupted through treatment with a detachment agent (Accutase), and subsequently maintaining the cells in suspension in laminin-free medium. We assessed pMLC2 S19 levels (a measure of ROCK activity) relating this to FAK phosphorylation that is supressed after integrin disengagement. These results were evaluated relative to spread wild type cells growing on laminin where Integrin-FAK signalling was active (Figure S4A and B). We observed an inverse relationship between Integrin-FAK signalling and ROCK-MLC2 activity in keeping with our conclusions (Figure 4A and B).

      Experimental support for the ability of cell-substrate interaction modulation to concomitantly impact cellular metabolism and motility/invasion would be significant both in terms of advancing our understanding of glioma cell biology and of its translational potential, but the evidence being provided is at best compatible with the proposed model.

      Response: We carried out a new experiment to support the ability of cell-substrate interaction modulation to impact metabolism; specifically, we inhibited cell-substrate interactions by plating the cells on Poly-2-hydroxyethyl methacrylate (Poly 2-HEMA)-coated dishes. This suppressed FAK phosphorylation at Y397, as expected, with concomitant reduction in glutamine utilisation in the TCA cycle (Figure S3A, B and C).

      My background/expertise is in developmental and adult neurogenesis, in vivo modelling of gliomagenesis and cell fate control/reprogramming, with a focus on molecular mechanisms of differentiation and quantitative aspects of lineage dynamics; molecular details of the control of cellular metabolism, cell-cell adhesion and cytoskeletal dynamics are not core expertise of mine.

      We appreciate this reviewer’s expertise are not necessarily in the cancer cell biology and genetic intervention aspects of our study. We hope that the explanations we have provided satisfy the reviewer that our conclusions are valid.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Masalmeh and colleagues employ a neural stem/progenitor cell-based glioma model (NPE cells) to investigate the role of Focal Adhesion Kinase (FAK) in GBM, with a focus on potential links between the regulation of morphological/adhesive and metabolic GBM cell properties. For this, the authors employ wt cells alongside newly generated FAK-KO and -reexpressing cells, as well as pharmacological interventions to probe the relevance of specific signaling pathways. The authors´ main claims are that FAK crucially modulates glioma cell morphology, cell-cell and cell-substrate interactions and motility, as well as their metabolism, and that these effects translate to changes to relevant in vivo properties such as invasion and tumor growth. My main issues are with the model chosen by the authors.

      As per the methods section, generation of FAK-KO and -"Rx" NPE cells entailed protracted selection/expansion processes, which may have resulted in inadvertent selection for cellular/molecular properties unrelated to the desired one (loss or gain of FAK expression) and which may have had cascading effects on NPE cells. The authors nonetheless repeatedly claim the parameters they quantify, such as mitochondrial or cytoskeletal properties or metabolic features, to have directly resulted from FAK loss or reintroduction. Examples of such causal inferences are to be found in lines 123, 134/135, 165, 181. Such causal claims are, in my view, unsupported. Acute perturbation of FAK expression/activity, genetically or pharmacologically, followed by a rapid assessment of the processes under investigation, would be needed to begin to assess causality, even if acute genetic perturbations may be technically challenging as sufficient gene expression reduction or restoration to physiologically relevant levels may be hard to achieve.

      The employed pharmacological modulation of ROCK activity is the only approach that, given the presumably acute nature of the treatment, may have allowed the authors to probe the proposed functional links. The methods section of the manuscript does not however comprise details as to the duration of these treatments, which leaves open the possibility of long-term treatment having been carried out (data shown in Figure 5B refers to 72hr treatment). Even in the case of ROCK inhibitor experiments, it is however unclear if and how the effects on cell morphology and adhesion, mitochondrial organization and metabolic activity may be connected to each other and, if at all, to FAK expression.

      Significance

      Given the above uncertainties due to the nature of the model and experimental approaches, it is hard to assess the reliability and thus the relevance of the findings.

      Experimental support for the ability of cell-substrate interaction modulation to concomitantly impact cellular metabolism and motility/invasion would be significant both in terms of advancing our understanding of glioma cell biology and of its translational potential, but the evidence being provided is at best compatible with the proposed model.

      My background/expertise is in developmental and adult neurogenesis, in vivo modelling of gliomagenesis and cell fate control/reprogramming, with a focus on molecular mechanisms of differentiation and quantitative aspects of lineage dynamics; molecular details of the control of cellular metabolism, cell-cell adhesion and cytoskeletal dynamics are not core expertise of mine.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Review of Masalmeh et al.

      Title: "FAK modulates glioblastoma stem cell energetics..."

      Previous studies have implicated FAK and the related tyrosine kinase PYK2 in glioblastoma growth, cell migration, and invasion. Herein, using a murine stem cell model of glioblastoma, the authors used CRISPR to inactivate FAK, FAK-null cells selected and cloned, and lentiviral re-expression of murine FAK in the FAK-null cells (termed FAK Rx) was accomplished. FAK-/- cells were shown to possess epithelial characteristics whereas FAK Rx cells expressed mesenchymal markers and increased cell migration/invasion in vitro. Comparisons between FAK-/- and FAK Rx cells showed that FAK re-expressed increased mitochondrial respiration and amino acid uptake. This was associated with FAK Rx cells exhibiting filamentous mitochondrial morphology (potentially an OXPHOS phenotype) and decreased levels of MTFR1L S235 phosphorylation (implicated in mito morphology fragmentation). Mito and epithelial cell morphology of FAK-/- cells was reversed by treatment with Rho-kinase inhibitors that also increased mito metabolism and cell viability. Last, FAK-dependent glioblastoma tumor growth was shown by comparisons of FAK-/- and FAK Rx implantation studies.

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

      Some questions that would enhance potential impact.

      1. Cell generation. Please describe the analysis of FAK-/- clones in more detail. The "low viability" phenotype needs further explanation with regard to clonal expansion and growth characteristics?
      2. Figure 1F: need further support of MET change upon FAK KO and EMT reversion.
      3. Fig. 2: Need further support if FAK effects impact glycolysis or oxidative phosphorylation in particular as implicated by the stem cell model.
      4. Is there a combinatorial potential between FAKi and chemotherapies used for glioblastoma. Need to build upon past studies.
      5. The notation of changes in glucose transporter expression should be followed up with regard to the potential that FAK-expressing cells may have different uptake of carbon sources and other amino acids. Altered uptake could be one potential explanation for increase glycolysis and glutamine flux.
      6. It would be helpful to support the confocal microscopy of mitos with EM.
      7. Lack of FAK expression with increased MTFR1 phosphorylation is difficult to interpret.
      8. Need to have better support between loss of FAK and the increase in Rho signaling. Use of Rho kinase inhibitors is very limited and the context to FAK (and or Pyk2) remains unclear. Past studies have linked integrin adhesion to ECM as a linkage between FAK activation and the transient inhibition of RhoA GTP binding. Is integrin signaling and FAK involved in the cell and metabolism phenotypes in this new model?

      Significance

      The studies by Masalmeh provide interesting findings associating FAK expression with changes in mitochondrial morphology, energy metabolism, and glutamate uptake. According to the authors model, FAK expression is supporting a glioblastoma stem cell like phenotype in vitro and tumor growth in vivo. What remains unclear is the mechanistic connection to cell changes and whether or not these are be dependent on intrinsic FAK activity or as the Frame group has previously published, potentially FAK nuclear localization. The associations with MTFR1L phosphorylation and effects by Rho kinase inhibition are likely indirect and remind this reviewer of long-ago studies with FAK-null fibroblasts that exhibit epithelial characteristics, still express PYK2, exhibited elevated RhoA GTPase activity. Some of these phenotypes were linked to changes in RhoGEF and RhoGAP signaling with FAK and/or Pyk2. At a minimum, it would be informative to know whether Pyk2 signaling is relevant for observed phenotypes and whether the authors can further support their associations with FAK-targeted or FAK-Pyk2-targeted inhibitors or PROTACs.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'Please download the Response to Reviewers file, which contains Reviewers comments and authors responses and new data'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this work, the authors investigate the cytoplasmatic roles of Mei2, an RNA-binding protein in fission yeast, in particular its interactions with processing bodies (PBs) in the cytoplasm. The manuscript rests heavily on microscopy data, using a combination of time-resolved microscopy and molecular mutation and tagging techniques.

      Mei2 is known for its role in the nucleus of zygotic cells. Here, it is shown that Mei2 co-localizes with the PB markers Dcp2 and Edc3. This happens in zygotes but not in gametes (e.g. when fusion is blocked in fus1 mutants) (Fig 4E). <br /> This co-localization in PBs is counteracted by Pat1-driven phosphorylation of Mei2. Phosphorylation by Pat1 is known to suppress Mei2 activity. Mei3 inhibits Pat1; in a mei3 mutant Mei2 cannot accumulate in PBs, the same happens with a non-phosphorylatable mei2 allele (Fig. 5). In a pat1Δ mutant, constitutively active Mei2 is compatible with growth if it stays in the nucleus (mei2-NLS), but not if Mei2 is forced to the cytoplasm (mei2-NES) (Fig. 3G). This indicates that it is the cytoplasmic function of Mei2 that is critical.

      Forcing Pat1 to be cytoplasmic (Pat1-NES) allowed normal vegetative growth and mating (Fig. 3A-C), whereas nuclear Pat1 (Pat1-NLS) produced premature mating (Fig. 3A,B). Thus, cytoplasmic Pat1 phosphorylation of Mei2 is critical for controlling the transition from mitotic growth to fusion and zygote formation.

      Mei2 shuttles between the nucleus and cytoplasm, and one of its RNA-binding domains (RRM1) drives nuclear import, while both RRM1 and RRM3 are required for export to the cytoplasm (Fig. 2 and S2). Little was known previously of the role of RRM1.

      They present evidence that this localization to PBs is required for development. Knocking out the RNA helicase Ste13 (ortholog of S. cerevisiae Dhh1 which is a PB component) reduces PB formation (Fig. 6A). Even a non-phosphorylatable mei2 allele (i.e. it cannot be inactivated by Pat1) is incapable of driving sporulation in a ste13Δ background (Fig. 6B-D). This demonstrates that Mei2 activity is dependent on PBs.

      The study is well conceived and performed, and the conclusions mostly well backed by data. Experimental and statistical procedures are well described, and the number of replicates is sufficient.

      There are some minor questions however:

      In the literature, Mei2 is described as appearing as a nuclear dot in zygotic cells, but invisible in mitotic cells. Here, the authors demonstrate a Mei2 dot already 30 minutes before fertilization (Fig. 2A). Is the reason for this a more sensitive microscopic technique, or something else?

      The authors claim that the RRM1 RNA-binding region of Mei2 is essential for cytoplasmic Mei2 function and recruitment to PBs. This contrasts with previous publications (Watanabe 1994, Watanabe 1997, Otsubo 2014), as pointed out by the authors, where RRM1 appears to be dispensable for development. How do the authors argue about this discrepancy?

      Significance

      Overall, this paper presents major advances in our understanding of the cytoplasmic functions of this intensely studied RNA-binding protein, Mei2, in the transitions between the mitotic and meiotic cell cycles.

      It builds on the original observations of Mei2 as an essential protein for fusion and meiosis (Watanabe EMBO J 1988), being RNA-binding (Watanabe Cell 1994), and forming a nuclear dot in meiotic cells (Yamashita Cell 1998). These were followed by e.g. reports how Pat1 phosphorylation regulates Mei2 degradation (Matsuo J Cell Sci 2007) and its binding to RNA (Shen J Mol Cell Biol 2022). The present manuscript gives a broader view of the functions of Mei2 beyond its previously described role in the nucleus, and characterizes its interactions with the other players in fusion and meiosis.

      These findings will be of great interest not only to the fission yeast community, but to a wide range of scientists specializing in meiosis and fertilization, and to the RNA biologists at large. Since Mei2 is conserved across many branches of the eukaryotic tree as an RNA-binding protein, albeit with somewhat different functions in e.g. plants, the work has general relevance.

      I have read this manuscript with a background in general yeast cell and molecular biology, including post-transcriptional regulation. I am no microscopy expert, however I find the experimental setup with fluorescent tagging, combinations of mutations in key components in the pathway, and high resolution microscopy data from time series, convincing.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Araoyinbo et al. present a wealth of detailed data analyzing the cellular behavior of mainly three proteins, the RNA-binding protein Mei2, the kinase Pat1 and its inhibitor the protein Mei3 during mating and subsequent initiation of meiosis in fission yeast. This analysis involve also the detailed testing of potential models about how these protein act on each other to fulfil their different functions, such as to block remating on zygotes, the initiation of zygotic S-phase and the initiation of meiosis and sporulation. These data converge to a model whereby Pat1 inhibition by Mei3 expression upon cell fusion unleashes Mei2 function in the cytoplasm. This is due to the subsequent dephosphorylation of Mei2, and its RNA-recognition motif RRM1 interacting with and recruiting Mei2-bound RNAs to P-bodies, where their translation is most likely repressed (at least the translation of a synthetic mRNA - Mei2 pair is repressed when the pair is targeted to P-bodies). Together, this study provides detailed insights into how the meiotic cycle is induced upon mating of fission yeast cells but not in gametes).

      Overall, this is a very carefully controlled study and the data are very convincing and very interesting. It makes a compelling case for the model proposed and makes many original observations and far reaching observations, such as the role of nucleo-cytoplasmic compartmentalization and P-bodies in implementing developmental decisions. Since the notion that P-bodies have a function at all has been strongly questioned in recent years, this study will be very useful for the field.

      The only limitations that I have concerns the readability of the manuscript. It is extremely dense and that makes it a laborious read. Furthermore, the manuscript is not particularly well motivated, such that it is not very obvious what questions the authors are after. This becomes more or less clear only slowly as the reader progresses, or in the second read. Therefore, this very nice piece of work may escape people who are not working on fission yeast mating and meiosis, which would be a pity. I therefore recommend working on better motivating the study and its different parts for a general audience, streamlining the fission yeast intricacies and explaining more precisely what is conceptually learnt from these studies, on a broad sense and possibly in a way that would be relevant beyond the model used. This paper is opening a reach area of research and it would be unfortunate to not make that point more clearly.

      Significance

      Overall, this is a very carefully controlled study and the data are very convincing and very interesting. It makes a compelling case for the model proposed and makes many original observations and far reaching observations, such as the role of nucleo-cytoplasmic compartmentalization and P-bodies in implementing developmental decisions. Since the notion that P-bodies have a function at all has been strongly questioned in recent years, this study will be very useful for the field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The study by Araoyinbo et al. explores the role of the RNA-binding protein Mei2 in fission yeast zygotic development. It highlights Mei2's cytosolic functions, its interaction with P-bodies, and nucleocytoplasmic shuttling. Mei2's regulation by Mei3 and Pat1, and the importance of its RNA recognition motifs (RRM1 and RRM3) are also discussed.

      The main conclusion of the manuscript is somewhat unexpected from previous studies about Mei2. Particularly, the cytoplasmic function of Mei2 is a novel point in this field.

      Lots of experiments have been done to make the scenario of the manuscript. The experiments and results are technically sound, and I potentially agree with the interpretation by the authors. It would require some more explanation as well as additional experiments to conclude in the way the authors wish to do.

      Major points

      1. Page 4. "Taken together, these results show that fertilization, and Mei3 expression in particular, promote Mei2 nuclear export." It is also possible that Mei2-NLS-GFP was degraded somewhere in the cell (as Mei2 may be still shuttling even if NLS was fused) upon mating (120 min onwards in Fig2D) rather than exported to the cytoplasm. In mei3∆ (Fig 2E) Mei2-NLS-GFP might be somehow escaped from the degradation. Also, nuclear signal of Mei2 is very bright but cytosolic signal seems vague. I wonder the entire results in the manuscript could be interpreted from the viewpoint of degradation/protein stability/protein amount, rather than regulation of localization such as nuclear import and export.
      2. Page 4. "We conclude that RRM1 promotes nuclear import of Mei2." This may be true, but is it also possible that RRM1 inhibits nuclear export of Mei2? This type of possible dual explanation can be applied to the entire manuscript. This is expected to be neutralized or clarified at each point.
      3. Page 5. "Thus, diminishing nuclear Pat1 levels does not compromise its roles during growth and mating." It is interesting for me to find that Pat1-NLS induced ectopic meiosis. This is a fine finding. I wonder just addition of NLS (basic residues) at the C-terminus of Pat1 might deteriorate the activity of Pat1, apart from localization shift. Is it possible to exclude this possibility by making NES-Pat1-NLS-3GFP fusion, in which NLS and NES are fused doubly and distally, because proximal double fusion such as Pat1-NLS-NES-3GFP might just mutually cancel the NLS NES activities.
      4. In general in the Results section. What confused me is when each event occurred. Nutritional conditions, -N but not yet conjugated, after conjugation, premeiotic S or meiotic prophase (or even later). It is particularly hard to catch the story when the timing issue and the location issue (nuclear and cytosolic localization, NLS and NES...) are discussed at the same time. Explanation in chronological order, hopefully at the earlier stages such as explanation for Figures 2 and 3, would be appreciated. The model shown in Figure 8 is quite helpful for my understanding.

      Minor points

      1. "Fertilization" in the title, and "Mei2 is expressed in gametes" in the main text on pare 2. Authors try to generalize fission yeast mating as fertilization of higher organisms as both are events in which two haploids conjugate. I personally do not agree with this type of explanation. This is mainly because S. pombe conjugation (mating) is a part of sexual differentiation and therefore is biologically distinct from fertilization of higher organisms. S. pombe grows and divides in the haploid state, which is distinct from general gametes. To avoid such confusion, I would propose authors to neutralize expression throughout the manuscript.
      2. I found quite a few "surprising(ly)", which are hopefully neutralized, as it is somewhat emotional.

      Significance

      General assessment: strengths and limitations:

      Strengths: It provides novel understanding of molecular mechanisms of meiotic initiation of fission yeast. Technically sound. Lots of experiments. Limitations: The story is very confusing and difficult to catch. Explanation can be simplified.

      Advance: compare the study to existing published knowledge: does it fill a gap? What kind of advance does it make (conceptual, clinical, fundamental, methodological, incremental,,,,)? It is a big advancement. It is conceptually novel regarding how meiosis is initiated in fission yeast.

      Audience: which communities will be interested/influenced, what kind of audience (broad, specialized, clinical, basic research, applied science, fields and subfields,,,) It is mainly for audience of basic research, biology, molecular mechanism of gene explanation, meiosis or yeast cellular events. For non-yeast researchers, this manuscript is probably very hard to read/understand, although the authors tried to generalize yeast-specific events with general words.

      Describe your expertise:

      Yeast genetics, Meiosis, Cell biology, Gene expression regulation

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'The authors do not wish to provide a response at this time.'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This manuscript presents a large-scale comparative genomics analysis of Salmonella genomes to identify and characterize the repertoire of Type VI Secretion System (T6SS) effectors. The authors combine bioinformatic predictions with experimental validation of one novel toxin domain (Tox-Act1), revealing a unique catalytic activity not previously reported in bacterial toxins. While the study is comprehensive and offers valuable insights into T6SS diversity, the insufficient description of computational methods and limited accessibility of underlying data reduce reproducibility and impact.

      Major comments

      1. The computational methods are inadequately described in the Materials and Methods section, and the authors did not provide the underlying datasets. These omissions make it impossible to reproduce the analysis or to apply the approach to other organisms.
      2. The criteria used to distinguish between T6SS effectors and non-effectors are unclear. The reliance on proximity to structural genes ("guilt-by-association") is insufficient and may have led to the omission of cargo effectors not proximal to these structural genes.
      3. No information is provided in the Materials and Methods section about the graph-based clustering strategy mentioned in the main text (Rows 109-111), including the Jaccard index and Louvain algorithm.
      4. The definition and identification of T6SS subtypes, including the use of the term "orphan," are not explained (Rows 111-112).
      5. The phylogenetic analysis of the newly identified domain Tox-Act1 lacks consistency and detail. For example, Rows 324-326 state: "To predict the function of Tox-Act1, we sought to understand its evolutionary relationship by constructing a phylogenetic tree using the sequences of Tox-Act1, TseH and additional permuted members, such as LRAT and YiiX." However, this contradicts Rows 342-344 and Figure 4A, which describe the phylogenetic tree as being built from permuted NlpC/P60 members, and indicate that a single query was used for PSI-BLAST, marked with a red star. It is unclear whether Tox-Act1, TseH, or another sequence was used as the initial PSI-BLAST query.
      6. The Tox-Act1 domain investigated is labeled as an acyltransferase, but the evidence presented supports only phospholipid-degrading activity. In my opinion, the naming should better reflect the activity demonstrated by the data.
      7. Table S1 should include representative protein accessions for each T6SS toxin domain. This is essential for evaluating the novelty of the identified domains and for enabling their use in future analyses. The repeated use of "This study" (96 times) as a reference, without further detail, is confusing and unhelpful. In my view, referencing the current study is appropriate only when the manuscript provides sufficient information on the corresponding domain.
      8. In general, the authors should place greater emphasis on ensuring that the proteins and genomes analyzed in this study can be reliably identified. Genomic accessions and locus tags should be traceable in public databases such as NCBI, and the supplemental information must correspond accurately to the main text. For example, I was unable to find information on FD01543424_00914, which was used as the query for the alignment of STox_15 (the name used in the supplemental information, while in the main text it is referred to as Tox-Act1; see related comment below).
      9. A supplementary table listing all Salmonella effectors and their domain annotations is missing. This is essential for transparency, reproducibility, and future use of the data.
      10. The GitHub repository contains a large volume of data and code but lacks detailed documentation and clear instructions, including example files. This greatly limits reproducibility and usability. The current organization of the repository makes it difficult to locate specific results; for example, Tox-Act1 is referred to as STox_15 in the GitHub files, but this is not mentioned in the manuscript. The authors should improve data organization and provide a README file for clarity.

      Minor comments

      1. The introduction should discuss previous work on Salmonella T6SS effectors, including Blondel et al. (2023) (ref 71 in the manuscript), Amaya et al. (2022), and Amaya et al. (2024).
      2. In Figure 1C, genomic examples should include strain names and locus tags.
      3. In Figure 1F, 'ND' should be replaced with 'Unknown' or 'Not Determined'.
      4. Figure 1E is overly complex and, in my opinion, does not add value, especially since the accompanying text is sufficient on its own. Moreover, the authors acknowledge that their initial analysis missed the similarity between Tox-Act1 and both DUF4105 and the TseH effector, which raises concerns about the accuracy and usefulness of this graph.
      5. Figure 3D lacks information about the number of replicates (n=?).
      6. Discrepancies in domain annotations:
        • Row 232: STox_47 is missing from Table S1.
        • Row 233: STox_18 is pore-forming and STox_53 is a nuclease (per Table S1), which contradicts the main text.
      7. Multiple grammatical and typographical errors exist throughout the text, including:
        • Row 41: "provide" should be "provides"
        • Rows 131, 222: "immunities" should be "immunity proteins"
        • Rows 170, 253, 288: "thee" should be "three"
        • Row 388: "corresponds" should be "correspond"
        • Row 389: "chomatogram" should be "chromatogram"
      8. Rows 257-259: The claim that PAAR and RHS domains assist in translocation across the bacterial inner membrane is presented as fact, but this is only a hypothesis and should be stated more cautiously.
      9. Figure 3A: The selection of representative genomic loci is unclear. For example, FD01843896 is shown in the figure, but cloning was performed using FD01848827, and the HHPred analysis was based on FD01543424. The rationale for using different sequences at each step should be clarified.
      10. Rows 296-299: The absence of a secretion assay in the study is notable. If this is due to the inability to activate the SPI-6 T6SS of Salmonella enterica serovar Typhimurium, as discussed in these lines, it should be explicitly mentioned in the text.
      11. Figure 4C (sequence logo) is not described in the Materials and Methods section.
      12. Row 467: The retrieval date of the gff files from the 10KSG database is missing.
      13. Rows 474-476: The domain models used for T6SS cluster prediction are not described.

      Significance

      This is a comprehensive study involving a large number of Salmonella genomes, potentially identifying many new T6SS effectors and toxic activities. One new domain analyzed in this work is experimentally investigated and shown to have a unique catalytic activity not previously observed in toxins. However, the bioinformatic methods are not described in sufficient detail, making it difficult to assess or reproduce the work. Protein accession numbers are missing, even for representative toxins, and locus tags are not traceable, making the identified effectors not readily accessible. There are many inaccuracies throughout the text and supplemental data. The Tox-Act1 domain investigated is labeled as an acyltransferase, but the evidence only supports phospholipid-degrading activity. While the study includes many graphs and histograms, they often obscure the main findings. Consequently, the audience is likely to be limited.

      Nevertheless, despite these concerns, I believe this is an important work that could be valuable to the broad community once a more thorough revision is undertaken, not only by addressing the specific comments raised, but also by rechecking the analyses, reorganizing the presentation, and ensuring that all data and annotations are clearly accessible and traceable.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript titled "Genome-directed study reveals the diversity of Salmonella T6SS effectors and identifies a novel family of lipid-targeting antibacterial toxins" presents a comprehensive in silico analysis of T6SS-associated effector and immunity genes across approximately 10,000 Salmonella genomes. In addition, the authors selected one of the newly identified effectors, Tox-Act1, for detailed biochemical characterization. To my knowledge, this study represents the most extensive genome-wide mining effort to date for T6SS-associated effectors and immunity proteins in Salmonella, employing a range of state-of-the-art computational prediction tools. The in vitro enzymatic characterization of Tox-Act1 further validates the in silico approach and adds a novel functional perspective to the dataset. Overall, the study provides a rich and comprehensive dataset. However, for readers without a strong bioinformatics background, the logic and workflow of the in silico prediction pipeline may be challenging to follow. Consequently, my comments focus primarily on the biochemical analysis of Tox-Act1, rather than the computational aspects of the study.

      Major comments:

      1. In Figure 3, the authors first demonstrated that Tox-Act1 and Imm-Act1 constitute a functional antibacterial toxin-immunity pair using a heterologous E. coli expression system. They then proceeded to an in vivo mouse colonization model, showing that prey cells lacking the tox-act1/imm-act1 locus exhibited reduced competitiveness when co-infected with a Salmonella strain carrying the endogenous tox-act1, compared to a ∆tssL mutant. As this is the first report identifying and characterizing Tox-Act1 function in Salmonella, the authors should provide additional experimental evidence addressing the following key points: (i) Whether Tox-Act1 is secreted by Salmonella in a T6SS-dependent manner; (ii) Whether target cells lacking imm-act1 (in either Salmonella or E. coli) can be intoxicated by Salmonella secreting Tox-Act1; (iii) Whether the observed competitive advantage in vitro conferred by Tox-Act1 is dependent on its phospholipase activity. Given that Salmonella T6SS can be activated by hns deletion, such experiments should be feasible and are crucial for the functional validation of any newly identified T6SS effector. Addressing these points would substantially strengthen the mechanistic basis of the study and reinforce the biological importance and relevance of Tox-Act1.
      2. In Figure 4, the authors present the evolutionary relationship between Tox-Act1 and the previously identified T6SS effector TseH from Vibrio, and they propose that these two effectors may share similar enzymatic activities and overlapping cellular targets. Given the ongoing debate and unresolved questions regarding the biochemical function of TseH, the authors should leverage their established in vitro phospholipase assay to test whether TseH exhibits phospholipase activity similar to that of Tox-Act1. Demonstrating such activity would not only substantiate the proposed functional conservation but also provide critical biochemical insight into a long-standing question in the T6SS field.
      3. In Figures 5C and 5D, the authors performed lipidomic analyses on E. coli cells heterologously expressing Tox-Act1 and reported that specific phospholipid species are altered in a manner dependent on Tox-Act1's phospholipase activity. However, the data presented in Figure 5D only include changes in the abundance of PG, FFA, LPG, and LPE. To provide a comprehensive overview of the lipidomic alterations, the authors should present the full dataset of all identified phospholipid species. This is essential to evaluate the extent and specificity of lipid remodeling induced by Tox-Act1. It is currently unclear whether the observed reduction in PG is the only statistically significant change or if additional lipid species were similarly affected but not shown. Furthermore, the authors claim that Tox-Act1 functions as a phospholipase A1. However, in Figures 5A and 5B, the signal corresponding to intact phospholipids remains relatively high, raising concerns about the apparent weak enzymatic activity in this assay. This observation contrasts with previously characterized phospholipase toxins in the antibacterial toxin field, such as Tle1 from Burkholderia, which exhibit robust activity under in vitro conditions. To substantiate the enzymatic potency of Tox-Act1 and clarify this discrepancy, the authors should include a side-by-side comparison using the same in vitro assay with a well-established phospholipase toxin (e.g., Tle1) as a positive control. This would allow for a direct evaluation of the relative enzymatic strength of Tox-Act1 and support the interpretation of its lipid-targeting function.

      Minor Comments:

      1. Line 32: Please specify "Type VI Secretion System (T6SS)" when first introducing the term in the abstract, to ensure clarity for a broad readership.
      2. There are inconsistencies between the numerical values reported in the main text and those shown in the figures. For instance, the manuscript repeatedly states that approximately 10,000 Salmonella genomes were analyzed in the in silico search, whereas Figure 1 indicates a total of 10,419 genomes. Similarly, Line 108 mentions 42,560 genomic sites, yet Figure 1 displays a count of 49,080. Please ensure that all numerical data are consistent across the manuscript and figures to avoid confusion or misinterpretation.
      3. The definition of "Orphan clusters" is not provided. Please specify the criteria used to define these clusters and clarify the rationale for grouping them separately from the other clusters (i1-i4) shown in Figure 1A. It would be helpful to explicitly state how they differ from the canonical clusters.
      4. Lines 114-119: The sentence structure in this section is overly long and difficult to follow. Please revise this portion for clarity and conciseness to ensure that the intended message is clearly conveyed.
      5. The color coding in Figure 1C is incomplete; only a few categories are indicated in the legend. Please revise the legend to include all color codes used in the figure for accurate interpretation.
      6. Lines 278-280: The authors state that "cells lysed without losing their rod shape, which suggests that the peptidoglycan was not affected... indicating that this is not the target of Tox-Act1." Please provide appropriate references or supporting evidence for this interpretation. Clarification is needed to explain the morphological criteria being used to infer peptidoglycan integrity.
      7. Please define "competitive index" in the legend of Figure 3D to ensure the metric is clearly understood by readers unfamiliar with the term.
      8. It is unclear to me why the author use (data not shown) in Line 315. Please provide evidence to support the claim in the paragraph.
      9. In Figure 4D, the authors compare the activity of wild-type and catalytic mutant Tox-Act1, but protein expression levels are not shown. Please include immunoblot or other relevant data to confirm equivalent expression of both constructs, to rule out differential expression as a confounding factor.

      Referee cross-commenting

      I agree with Reviewer #3 that the authors should provide more details on their search for better reproducibility.

      Significance

      This manuscript presents a large-scale in silico analysis of Salmonella T6SS effectors and immunity proteins, accompanied by the biochemical characterization of a novel phospholipase effector, Tox-Act1. The genome-wide dataset is comprehensive, representing the most extensive mining effort of its kind to date. The study is strengthened by in vitro validation of Tox-Act1 activity and its role in interbacterial competition. However, the manuscript would benefit from additional experimental data to confirm key mechanistic aspects, including T6SS-dependent secretion of Tox-Act1, its toxicity toward target cells lacking immunity, and the contribution of phospholipase activity to its antibacterial function. Comparative assays with established T6SS phospholipases (e.g., Tle1) are recommended to clarify enzymatic potency. Further, the authors should apply their phospholipase assay to test TseH activity and resolve long-standing questions in the field. Several areas also require clarification or correction, including inconsistencies in reported genome counts, incomplete figure legends, unclear terminology (e.g., "Orphan clusters"), and missing experimental controls (e.g., protein expression levels, full lipidomic dataset). Minor edits to improve clarity and consistency are also suggested. Overall, the study is significant and of high potential impact but requires additional experimental validation and revisions to improve clarity and completeness.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: In this study, authors used in silico approaches to analyse 10,000 bacterial genomes and identified 128 candidate effectors secreted via the T6SS of Salmonella. Among these, Tox-Act1 was selected for detailed characterisation. The authors demonstrated that Tox-Act1 harbours a permuted NlpC/P60 catalytic domain with phospholipase activity, targeting key membrane lipids. Furthermore, they confirmed that Tox-Act1 is secreted in a T6SS-dependent manner and enhances bacterial competitiveness during gut colonisation in mice, providing new insights into lipid-targeting toxin domains in interbacterial interactions. My concerns raised are all minor and should be readily addressable by the authors.

      Minor Concerns:

      Line 279-280: The statement that the peptidoglycan is not a target of Tox-Act1 is somewhat strong at this stage of the manuscript. The preservation of cell shape does not necessarily imply that the peptidoglycan remains unaltered at a subcellular level. Given that Tox-Act1 belongs to the NlpC/P60 family, members of which include known peptidases, the authors should moderate this assertion. Replacing "is not" with "is likely not" or using conditional phrasing would be more appropriate here.

      Lines 328-331: The conclusion that the Tox-Act1 clade is deployed in biological conflicts is not fully explained or substantiated. The authors are encouraged to provide a brief rationale to support this conclusion.

      Figure 4D: There appears to be a labelling inconsistency. The immunity protein is referred to as "Slmm15," which may relate to the original name of Tox-Act1 (i.e., STox_15), but the correct label should likely be "Imm-Act1."

      Line 401 and elsewhere: The correct spelling is "L-arabinose" with a capital "L". The manuscript should be checked for consistency in this regard.

      Throughout the text and figures: Bacterial species names are often incorrectly formatted, e.g., "S. Panama" (Line 226) should be written in scientific style as S. panama, with italics and the species name in lowercase. A systematic revision of species names is recommended to enhance rigour.

      Figure 3D: The X-axis labelling is somewhat confusing. The use of terms such as "attackers" and "prey" is misleading in this context, as the experiment tests the in vivo survival capacity of different Salmonella strains (WT or T6SS mutants mixed with toxin/immunity double mutants) in a mouse model, rather than a direct bacterial killing assay. Clarifying this would greatly improve readability.

      Significance

      Overall, this study is well-executed. The approach used to identify a previously uncharacterised diversity of T6SS effectors in Salmonella is robust and provides a valuable framework that could be extended to other systems involved in interbacterial competition. This renders the work relevant and of interest for publication. While the manuscript occasionally lacks clarity in explaining the rationale behind certain experimental choices, the narrative remains generally accessible.

      Field of expertise: Secretion systems, interbacterial competition, bacterial predation, live-cell imaging, protein network

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02191

      Corresponding author: Jan Rehwinkel

      1. General Statements

      The authors wish to thank all three reviewers and the Review Commons team for carefully evaluating our study. We have addressed all points raised as detailed below.

      We have thoroughly revised our bulk RNAseq analysis, which is now performed at the transcript level using the latest GENCODE release. We have updated Figure 3 and associated supplementary figures and tables. This change from gene to transcript level was important for accurate motif analysis as requested by reviewer 2: matching promoters to individual IFN-regulated transcripts – rather than aggregating all promoters per gene – avoids significant signal dilution. This strategy yields higher-resolution expression data and is biologically preferable. Indeed, several well characterised IFN-regulated RNAs (e.g., the ADAR1-202 transcript encoding the p150 isoform) originate from promoters located far from the constitutive promoters of their host genes. In our revised manuscript, we now provide in the new supplementary figure 13 the requested promoter motif analysis. Using two computational approaches – de novo motif search and analysis of a curated motif database – we find strong enrichment of interferon-stimulated response elements (ISREs) in promoters of type I IFN regulated transcripts. No other motifs reached similarly high levels of enrichment, and our analysis did not reveal differences between different type I IFNs. These new data show that all type I IFNs engage a common regulatory pathway, supporting our overall conclusion that different type I IFNs do not induce qualitatively different responses in PBMCs.

      Regrettably, in the process of analysing the bulk RNAseq data at transcript level, we noticed that our original lncRNA analysis contained numerous false positives. Closer inspection showed that many “differentially expressed” LNCipedia models were likely not full-length transcripts and commonly shared a single IFN-induced set of exons that artificially inflated expression estimates for every overlapping model. To correct this issue, we replaced LNCipedia with the latest high-quality non-coding RNA catalogue from GENCODE, most entries of which were defined by full-length RNA sequencing [1]. We also tightened our filtering criteria and now report only transcripts that are robustly expressed in our dataset and are either classified as high-confidence by GENCODE or robustly supported at every splice junction by our RNAseq.

      We hope our manuscript is sufficiently improved and suitable for publication in PLoS Biology. New or revised text is highlighted in green in our revised manuscript.

      2. Point-by-point description of the revisions


      Reviewer #1

      Evidence, reproducibility and clarity:

      The study can be directly connected to a landmark paper in the field (Mostafavi et al. , Cell 2016). By comparison with this study, the authors use improved technologies to address the question if and how responses to type I IFN differ between human peripheral blood-derived cells types. In line with Mostafavi et al. the authors conclude that only a comparably low number of interferon-stimulated genes (ISG) is induced in all cell types and that considerable differences exist between cell types in the IFN-induced transcriptome. The authors address a second relevant aspect, whether and how the many different subtypes of type I IFN differ in the way they engage IFN signals to produce transcriptome changes. The data lead the authors to conclude that any differences are of quantitative rather than qualitative nature.

      The authors' conclusions are based on a mass cytometry approach to phenotype STAT activation in different cell types, bulk RNA sequencing to study ISG expression in PBMC, and single cell sequencing to study ISG responses in individual cell types. The data are solid, clear and reproducible in biological replicates (eg different blood donors).

      Significance: While some of the data can be considered confirmatory, the comprehensive analysis of cell-type specificity and IFN-I subtype specificity advances the field and provides a reference for future analyses. The study is complete and there is no obvious lack of a critical experiment. The number of scientists interested in the multitude of open questions around type I IFN is large, thus the study is likely to attract a broad readership.

      We thank the reviewer for her/his positive assessment of our study.

      The biggest limitation is to my opinion the low sequencing depth of scRNAseq which is clearly the downside of this technology. Using 11 hematopoietic cell types and bulk RNA sequencing the total number of ISG was determined to be 975 by Mostafavi et al. and the core ISG numbered 166. This is in stark contrast to this studies' 10 core ISG. The authors limitations paragraph should discuss the fact that scRNAseq reduces the overall ISG number that can be analyzed.

      Thank you for this valid comment. We amended the limitations paragraph as requested. We agree that the Mostafavi et al. 2016 Cell paper [2] is important but note that there are many differences to our study: Mostafavi et al. use mice, a seemingly very high IFN dose (10,000 Units) and microarrays (not RNAseq).

      A minor point concerns the 25 supplementary figures of the study. There must be a better way to support the conclusions with the necessary data.

      We agree that our supplementary materials are extensive. However, this is not unusual for studies reporting multiple large datasets. We would be delighted to organise our supplementary information differently in due course according to journal guidelines.



      Reviewer #2

      Evidence, reproducibility and clarity:

      The manuscript entitled “Single-cell analysis of signalling and transcriptional responses to type I interferon" by Rigby et al. examines the response to type I IFN subtypes in PBMCs using an integrative proteomics and transcriptomics approach. Some of the analysis could be deepened to provide better insights into what governs the magnitude of change in gene expression as well as the cell type-specific response to expression and generate more excitement for the study.

      We thank the reviewer for evaluating our study and the suggestions made.

      *Major Comments: *

      • Although the authors appropriately conclude that type I IFNs induce qualitatively similar, the response is not quantitatively similar. What elements in the promoters of ISGs make them more responsive to IFN subtypes? (PMID: 32847859) We thank the reviewer for the suggestion to study the promoters of genes regulated by type I IFNs. The analyses outlined below were performed by A. Fedorov, who is now a new co-author of our study. To investigate promoter features that might underlie the observed transcriptional responses across type I IFNs, we first performed a de novo*motif search using STREME [3] on our bulk RNAseq dataset (Figure 3). Specifically, we compared the promoters of transcripts that were up- or down-regulated by each IFN subtype (e.g., IFN-β-induced) either with one another or with promoters of robustly expressed RNAs that remained unresponsive to any treatment. No significant motifs emerged from these comparisons, except when we compared promoters of IFN-induced transcripts to the background set of unresponsive RNAs. This comparison consistently yielded strong enrichment of interferon-stimulated response element (ISRE)-like motifs in the promoters of up-regulated RNAs (new Figure S13a).

      Next, we conducted a complementary analysis using known transcription factor (TF) motifs from the JASPAR database [4]. We screened all promoters of annotated RNAs using clustered JASPAR motifs and Z-standardised motif scores relative to all high-confidence GENCODE RNAs, including those not expressed in PBMCs. We reasoned that TFs actively mediating IFN responses would likely bind promoters with high motif scores (Z ≥ 2), while promoters with low scores (Z ≤ -1) would represent an unregulated background. This approach produced two sets of RNAs per TF cluster: putatively regulated and unregulated. We then restricted each set to RNAs expressed in our dataset and associated each transcript with its estimated fold change in response to each type I IFN, regardless of statistical significance. Next, we compared median fold changes between the likely regulated and unregulated sets across all TF clusters and IFN subtypes (Figure S13b). Among all tested TF motifs, only the ISRE-like cluster showed strong and consistent associations with transcriptional changes across all IFN subtypes. We also observed statistically significant but much weaker associations for other TFs, including a known negative regulator of innate antiviral signaling, NRF1 [5]. However, effect sizes for these motifs were dwarfed by those of ISRE-like motifs, suggesting that no JASPAR TFs other than those within the ISRE-like cluster play a major role in PBMCs under our conditions. Overall, these findings support the idea that all type I IFNs engage a common regulatory pathway, differing primarily in the magnitude rather than the nature of their transcriptional effects.

      How do they relate to the activation of kinases by IFN subtypes?

      We did not analyse the activation of the canonical kinases (i.e., TYK2 and JAK1) downstream of IFNAR. This would be interesting and may be possible using phospho-specific antibodies to these kinases in our CyTOF setup. However, this would require a very large investment of time and resources to identify specific antibodies, optimise a new CyTOF staining panel and to acquire and analyse new datasets. We therefore believe this should be pursued as a separate future study.

      *Are there distinct features that dictate differential responses in monocytes and lymphocytes? *

      Following the computational approach described above, we applied STREME to identify DNA motifs that could distinguish promoters associated with monocyte- and lymphocyte-specific ISGs. Regrettably, this analysis did not yield any significant motifs, likely due in part to the limited number of genes in each category.

      • Figure 2a, d-h - Consider using the same scale for all heatmaps. This will allow for comparison of pSTATs median expression. Consider increasing the range in the color scale as some of the subtle changes in STAT phosphorylation across subtypes are not well appreciated. This also applies to Supplementary figures related to Figure 2.*

      Thank you for this suggestion. We tried using the same scale for all heatmaps. However, given that the values for pSTAT1 are higher than those for other pSTATs, the resulting heatmaps did not show differences for the other pSTATs well. We therefore decided to leave these panels unchanged. Please also note that Figures 2b and S3b provide comparison between pSTATs (and other markers) using the same scale.

      Minor Comments:

      • The title of subsections are a bit generic (e.g "Analysis of the signalling response to type I IFNs using mass cytometry". Consider updating them to reflect some of the findings from each analysis.* Thank you for this suggestion. We have amended sub-headers accordingly.

      • Figure 3 and S3 - Increase the heatmap scale to better appreciate changes in gene expression.*

      The scales have been enlarged for better visibility as requested.

      • Consider combining panel a and b in figure S7 for better contrasts of the response to IFNa1 or IFNb. *

      Thank you for the suggestion. We combined these panels.

      • Figure 4 - The authors could visualize ISGs that are unique across IFN types or cell types. *

      Figure 5 and several accompanying supplementary figures already depict ISGs unique to IFN subtypes or cell types. Whilst we appreciate the suggestion, we prefer not to add additional figures to avoid redundancies.

      • The gene ontology analysis should be performed with higher statistical stringency to capture the most significant IFN responsive processes. *

      Thank you for this comment. We changed the presentation of the GO analysis in Fig S11 by sorting on p-value (instead of % of hits in category). We hope this shows more clearly that GO category enrichment amongst genes encoding IFN-induced transcripts had high statistical significance (log10 p-values of about -5 or lower for many categories).

      Significance:* ** The authors provide an extensive compendium of cell type specific changes in response to type I IFN stimulation. They have created a public repository which extends the value of this dataset. *

      Audience: *** This is a valuable resource for immunologists, virologists, and bioinformaticians.*

      Thank you for these encouraging comments.



      Reviewer #3


      Evidence, reproducibility and clarity:

      *Summary *

      Rigby and collaborators analyzed the signaling responses and changes in gene expression of human PBMCs stimulated with different IFN type I subtypes, using mass cytometry, bulk and single-cell RNA sequencing. Their study represents the first single-cell atlas of human PBMCs stimulated with five type I IFN subtypes. The generated datasets are useful resources for anyone interested in innate immunity. The data and the methods are well presented. We thus recommend publication.

      Thank you for your positive assessment of our work and for recommending publication.

      *Major comments: *

      • *

      *Two of the key conclusions are not very convincing. *

      • *

      First, the authors claim that the magnitude of the responses varied between the 5 types of IFNs, however, as they point out in the 'limitation' paragraph, doses of the different IFNs were normalized using bioactivity. Knowing that this bioactivity is based on assays performed on A549 lung cells, this normalization likely induces a bias. How do the authors explain similar antiviral bioactivity but differing magnitudes of modulation of ISG expression? Would the authors expect the same differences of expression between the several IFNs tested in A549 cells? We thus recommend being very cautious when comparing magnitude of the response between the 5 types of IFNs.

      We thank the reviewer for this important point and included the following reasoning in our discussion:

      “An important technical consideration for our study was the normalisation of type I IFN doses used to treat cells (see also ‘Limitations of the study’ below). We relied on bioactivity (U/ml) that is measured by the manufacturer of recombinant type I IFNs using a cytopathic effect (CPE) inhibition assay. In brief, the lung cancer cell line A549 is treated with type I IFN and is infected with the cytopathic encephalomyocarditis virus (EMCV). Control cells not treated with IFN are killed by EMCV, whereas cells treated with sufficient IFN survive. How, then, is it possible that different type I IFNs induce differing magnitudes of STAT phosphorylation and ISG expression despite being used at the same bioactivity? Cell survival in the CPE inhibition assay may be due to one or a few ISGs. Indeed, single ISGs can mediate powerful antiviral defence. For example, MX1 is crucial for host defence against influenza A virus [6]. Thus, similar bioactivity of different IFNs in A549 cells against EMCV-triggered cell death may not reflect the breadth of effects on many ISGs. Moreover, IFN-induced survival of A549 cells following EMCV infection is a binary readout. Induction of the relevant ISG(s) mediating protection beyond a threshold required for cell survival is unlikely to register in this assay. Thus, similar antiviral bioactivity (in the CPE inhibition assay) and differing magnitudes of modulation of ISG expression (at transcriptome level) are compatible.”

      We believe inclusion of this paragraph demonstrates an appropriate level of caution in our data interpretation. Further, we would expect to make similar observations if we were to apply transcriptomic analysis to A549 cells treated with different type I IFNs. However, given our focus in this study on primary, normal cells, we decided not to pursue work with the transformed and lab adapted A549 cell line.

      Second, the qualitatively different responses to type I IFN subtypes claimed by the authors were not apparent. This seems true at the level of the bulk population (Fig. S10) but not at cell-type level (Fig. S15/S16).

      We believe there may be a misunderstanding here. In relation to Figure S10, we do not claim “qualitatively different responses to type I IFN subtypes”. Instead, we conclude that “differences in expression between the different type I IFNs were quantitative” (page 8; lines 229-230, now: 238-239). Moreover, Figures S15/S16 (now: S16/S17) do not refer to analyses of responses to different type I IFN subtypes.

      The authors state (line 311-312) that 'Consistent with our bulk RNAseq data, differences were again quantitative rather than qualitative' at the cell-type level. The response between cell types seems very different to us since a core set of only 10 ISGs are shared by all cell types and all 5 type I IFNs. Knowing that the expression of hundreds, sometimes thousands of genes, are induced by IFN, this seems like a rather small overlap (and thus qualitatively different responses). Fig S15 and S16 nicely illustrate that the responses are qualitatively different between cell-type. Please modify this conclusion accordingly.

      Thank you for highlighting this. The statement in lines 311-312 does not refer to differences between cell types but to differences between type I IFN subtypes. We are sorry this was not clear and changed this sentence (now lines 357-358). Furthermore, we have made it clearer in the revised text that qualitative differences were observed between cell types (e.g. lines 329 and 350-352).

      *No additional experiments are needed to support the claims. However, we believe that two additional analyses could provide useful information. *

      • *

      The levels of IFNAR1 and IFNAR2 expressed at the plasma membrane probably vary between cell types and may thus influence the magnitude of the IFN response. While it would be difficult to measure these levels by flow cytometric analysis on the different cell types, could the authors extract information from their scRNAseq analysis on the expression level of IFNAR1/2 in all cell types? This would give a hint about potential differences in expression (and thus in magnitude).

      We analysed IFNAR1/2 transcript levels in our scRNAseq dataset (Figure R1 below). Unfortunately, for many cells, IFNAR1 and IFNAR2 transcripts were not detected (see width of violin plots at zero), probably due to low sequencing depth inherent to scRNAseq analysis. We therefore prefer not to draw conclusions from these data.

      Could the authors investigate further the expression of lncRNAs at the single-cell levels? It would be useful to also define a core set of lncRNAs that are shared between cell types and IFN subtypes. If such a core set does not exist (since lncRNAs are less conserved than coding genes), it would be nice to mention it.

      Thank you for this suggestion. The expression of lncRNAs is generally lower than protein-coding genes, resulting in high drop-out rates in 10X datasets. Indeed, Zhao et al. comment that “current development of single-cell technologies may not yet be optimized for lncRNA detection and quantification” [7]. We only detected a small number of lncRNAs in our scRNAseq analysis, and only four lncRNAs were significantly differentially expressed between cell types. We thus could not perform a meaningful analysis of lncRNAs in our scRNAseq dataset. This is now mentioned in the limitations paragraph at the end of the manuscript.

      Minor comments:

      There is a typo in line 355 Fig.4C =>6C.

      Thank you for spotting this.

      ***Referees cross-commenting** *

      We agree with Reviewer 1 that the low sequencing depth of scRNAseq restricts the analysis and must be discussed in the 'limitation' paragraph. This would explain why the authors identified only 10 ISGs that are common to all cell types and all 5 IFN subtypes. Of note, as a comparison, Shaw et al (10.1371/journal.pbio.2004086) identified a core set of 90 ISGs that are upregulated upon IFN treatment in cells isolated mainly from kidney and skin of nine mammalian species ("core mammalian ISGs"). It is thus expected that stimulated blood cells isolated from a single mammalian species share more than 10 ISGs.

      We amended the limitations section as requested. Shaw et al. [8] used a single type I IFN (universal or IFNα, depending on species) at a very high dose (1000 U/ml). Taken together with the use of bulk RNAseq in this study, it is unsurprising that our work identified fewer core ISGs. We believe our small list of core ISGs is nonetheless both a high confidence and a high utility set of ISGs: these genes are induced by multiple type I IFNs, in all major cell types in blood and their regulation can be measured even when sequencing depth is low.

      Significance (Required)

      *Multiple single-cell RNAseq analysis of PBMCs, stimulated or not, have been previously performed in multiple contexts (for instance with PBMCs isolated from the blood of patients infected with influenza virus or SARS-CoV-2). The technical advance is thus limited. *

      • *

      *However, the work represents a conceptual advance for the field since it provides the first single-cell atlas of PBMCs stimulated with five type-I IFN subtypes. The generated datasets represent a great resource for anyone interested in innate immunity (virologists, immunologists and cancerologists). *

      • *

      Of note, we are studying innate immunity in the context of RNA virus infection but we have no expertise on scRNA sequencing. We may thus have missed a flaw in the analyses.

      We thank the reviewer for their positive assessment of the advances of our study and the value of our IFN resource.

      A

      B

      C

      D

      Figure R1. IFNAR1/2 expression in scRNAseq data.

      Violin plots showing expression of IFNAR1 (A,C) or IFNAR2 (B,D) in different cell types. In (A,B), data were pooled across conditions. In (C,D), data are shown separately for unstimulated control cells and cells stimulated with different type I IFNs.

      References

      Kaur G, Perteghella T, Carbonell-Sala S, Gonzalez-Martinez J, Hunt T, Madry T, et al. GENCODE: massively expanding the lncRNA catalog through capture long-read RNA sequencing. bioRxiv. 2024. Epub 20241031. doi: 10.1101/2024.10.29.620654. PubMed PMID: 39554180; PubMed Central PMCID: PMCPMC11565817. Mostafavi S, Yoshida H, Moodley D, LeBoite H, Rothamel K, Raj T, et al. Parsing the Interferon Transcriptional Network and Its Disease Associations. Cell. 2016;164(3):564-78. Epub 2016/01/30. doi: 10.1016/j.cell.2015.12.032. PubMed PMID: 26824662; PubMed Central PMCID: PMCPMC4743492. Bailey TL. STREME: accurate and versatile sequence motif discovery. Bioinformatics. 2021;37(18):2834-40. doi: 10.1093/bioinformatics/btab203. PubMed PMID: 33760053; PubMed Central PMCID: PMCPMC8479671. Rauluseviciute I, Riudavets-Puig R, Blanc-Mathieu R, Castro-Mondragon JA, Ferenc K, Kumar V, et al. JASPAR 2024: 20th anniversary of the open-access database of transcription factor binding profiles. Nucleic acids research. 2024;52(D1):D174-D82. doi: 10.1093/nar/gkad1059. PubMed PMID: 37962376; PubMed Central PMCID: PMCPMC10767809. Zhao T, Zhang J, Lei H, Meng Y, Cheng H, Zhao Y, et al. NRF1-mediated mitochondrial biogenesis antagonizes innate antiviral immunity. The EMBO journal. 2023;42(16):e113258. Epub 20230706. doi: 10.15252/embj.2022113258. PubMed PMID: 37409632; PubMed Central PMCID: PMCPMC10425878. Grimm D, Staeheli P, Hufbauer M, Koerner I, Martinez-Sobrido L, Solorzano A, et al. Replication fitness determines high virulence of influenza A virus in mice carrying functional Mx1 resistance gene. Proceedings of the National Academy of Sciences of the United States of America. 2007;104(16):6806-11. Epub 20070410. doi: 10.1073/pnas.0701849104. PubMed PMID: 17426143; PubMed Central PMCID: PMCPMC1871866. Zhao X, Lan Y, Chen D. Exploring long non-coding RNA networks from single cell omics data. Comput Struct Biotechnol J. 2022;20:4381-9. Epub 20220804. doi: 10.1016/j.csbj.2022.08.003. PubMed PMID: 36051880; PubMed Central PMCID: PMCPMC9403499. Shaw AE, Hughes J, Gu Q, Behdenna A, Singer JB, Dennis T, et al. Fundamental properties of the mammalian innate immune system revealed by multispecies comparison of type I interferon responses. PLoS Biol. 2017;15(12):e2004086. Epub 2017/12/19. doi: 10.1371/journal.pbio.2004086. PubMed PMID: 29253856.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Rigby and collaborators analyzed the signaling responses and changes in gene expression of human PBMCs stimulated with different IFN type I subtypes, using mass cytometry, bulk and single-cell RNA sequencing. Their study represents the first single-cell atlas of human PBMCs stimulated with five type I IFN subtypes. The generated datasets are useful resources for anyone interested in innate immunity. The data and the methods are well presented. We thus recommend publication.

      Major comments:

      Two of the key conclusions are not very convincing.

      First, the authors claim that the magnitude of the responses varied between the 5 types of IFNs, however, as they point out in the 'limitation' paragraph, doses of the different IFNs were normalized using bioactivity. Knowing that this bioactivity is based on assays performed on A549 lung cells, this normalization likely induces a bias. How do the authors explain similar antiviral bioactivity but differing magnitudes of modulation of ISG expression? Would the authors expect the same differences of expression between the several IFNs tested in A549 cells? We thus recommend being very cautious when comparing magnitude of the response between the 5 types of IFNs.

      Second, the qualitatively different responses to type I IFN subtypes claimed by the authors were not apparent. This seems true at the level of the bulk population (Fig. S10) but not at cell-type level (Fig. S15/S16). The authors state (line 311-312) that 'Consistent with our bulk RNAseq data, differences were again quantitative rather than qualitative' at the cell-type level. The response between cell types seems very different to us since a core set of only 10 ISGs are shared by all cell types and all 5 type I IFNs. Knowing that the expression of hundreds, sometimes thousands of genes, are induced by IFN, this seems like a rather small overlap (and thus qualitatively different responses). Fig S15 and S16 nicely illustrate that the responses are qualitatively different between cell-type. Please modify this conclusion accordingly.

      No additional experiments are needed to support the claims. However, we believe that two additional analyses could provide useful information.

      The levels of IFNAR1 and IFNAR2 expressed at the plasma membrane probably vary between cell types and may thus influence the magnitude of the IFN response. While it would be difficult to measure these levels by flow cytometric analysis on the different cell types, could the authors extract information from their scRNAseq analysis on the expression level of IFNAR1/2 in all cell types? This would give a hint about potential differences in expression (and thus in magnitude).

      Could the authors investigate further the expression of lncRNAs at the single-cell levels? It would be useful to also define a core set of lncRNAs that are shared between cell types and IFN subtypes. If such a core set does not exist (since lncRNAs are less conserved than coding genes), it would be nice to mention it.

      Minor comments:

      There is a typo in line 355 Fig.4C =>6C.

      Referees cross-commenting

      We agree with Reviewer 1 that the low sequencing depth of scRNAseq restricts the analysis and must be discussed in the 'limitation' paragraph. This would explain why the authors identified only 10 ISGs that are common to all cell types and all 5 IFN subtypes. Of note, as a comparison, Shaw et al (10.1371/journal.pbio.2004086) identified a core set of 90 ISGs that are upregulated upon IFN treatment in cells isolated mainly from kidney and skin of nine mammalian species ("core mammalian ISGs"). It is thus expected that stimulated blood cells isolated from a single mammalian species share more than 10 ISGs.

      Significance

      Multiple single-cell RNAseq analysis of PBMCs, stimulated or not, have been previously performed in multiple contexts (for instance with PBMCs isolated from the blood of patients infected with influenza virus or SARS-CoV-2). The technical advance is thus limited.

      However, the work represents a conceptual advance for the field since it provides the first single-cell atlas of PBMCs stimulated with five type-I IFN subtypes. The generated datasets represent a great resource for anyone interested in innate immunity (virologists, immunologists and cancerologists).

      Of note, we are studying innate immunity in the context of RNA virus infection but we have no expertise on scRNA sequencing. We may thus have missed a flaw in the analyses.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript entitled "Single-cell analysis of signalling and transcriptional responses to type I interferon" by Rigby et al. examines the response to type I IFN subtypes in PBMCs using an integrative proteomics and transcriptomics approach. Some of the analysis could be deepened to provide better insights into what governs the magnitude of change in gene expression as well as the cell type-specific response to expression and generate more excitement for the study.

      Major Comments:

      1. Although the authors appropriately conclude that type I IFNs induce qualitatively similar, the response is not quantitatively similar. What elements in the promoters of ISGs make them more responsive to IFN subtypes? (PMID: 32847859) How do they relate to the activation of kinases by IFN subtypes? Are there distinct features that dictate differential responses in monocytes and lymphocytes?
      2. Figure 2a, d-h - Consider using the same scale for all heatmaps. This will allow for comparison of pSTATs median expression. Consider increasing the range in the color scale as some of the subtle changes in STAT phosphorylation across subtypes are not well appreciated. This also applies to Supplementary figures related to Figure 2.

      Minor Comments:

      1. The title of subsections are a bit generic (e.g "Analysis of the signalling response to type I IFNs using mass cytometry". Consider updating them to reflect some of the findings from each analysis.
      2. Figure 3 and S3 - Increase the heatmap scale to better appreciate changes in gene expression.
      3. Consider combining panel a and b in figure S7 for better contrasts of the response to IFNa1 or IFNb.
      4. Figure 4 - The authors could visualize ISGs that are unique across IFN types or cell types.
      5. The gene ontology analysis should be performed with higher statistical stringency to capture the most significant IFN responsive processes.

      Significance

      Significance:

      The authors provide an extensive compendium of cell type specific changes in response to type I IFN stimulation. They have created a public repository which extends the value of this dataset.

      Audience:

      This is a valuable resource for immunologists, virologists, and bioinformaticians.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study can be directly connected to a landmark paper in the field (Mostafavi et al. , Cell 2016). By comparison with this study, the authors use improved technologies to address the question if and how responses to type I IFN differ between human peripheral blood-derived cells types. In line with Mostafavi et al. the authors conclude that only a comparably low number of interferon-stimulated genes (ISG) is induced in all cell types and that considerable differences exist between cell types in the IFN-induced transcriptome. The authors address a second relevant aspect, whether and how the many different subtypes of type I IFN differ in the way they engage IFN signals to produce transcriptome changes. The data lead the authors to conclude that any differences are of quantitative rather than qualitative nature. The authors' conclusions are based on a mass cytometry approach to phenotype STAT activation in different cell types, bulk RNA sequencing to study ISG expression in PBMC, and single cell sequencing to study ISG responses in individual cell types. The data are solid, clear and reproducible in biological replicates (eg different blood donors).

      Significance

      While some of the data can be considered confirmatory, the comprehensive analysis of cell-type specificity and IFN-I subtype specificity advances the field and provides a reference for future analyses. The study is complete and there is no obvious lack of a critical experiment. The number of scientists interested in the multitude of open questions around type I IFN is large, thus the study is likely to attract a broad readership.

      The biggest limitation is to my opinion the low sequencing depth of scRNAseq which is clearly the downside of this technology. Using 11 hematopoietic cell types and bulk RNA sequencing the total number of ISG was determined to be 975 by Mostafavi et al. and the core ISG numbered 166. This is in stark contrast to this studies' 10 core ISG. The authors limitations paragraph should discuss the fact that scRNAseq reduces the overall ISG number that can be analyzed.

      A minor point concerns the 25 supplementary figures of the study. There must be a better way to support the conclusions with the necessary data.

  2. Jul 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank all reviewers assigned by Review Commons for their thoughtful and constructive feedback, which helped us to further improve the quality and clarity of our manuscript. In this study, we developed a novel fluorescence-based live-cell imaging platform for detecting mitochondria-endoplasmic reticulum contact sites (MERCS), which we named MERCdRED. This system enables quantitative analysis of MERCS dynamics in living cells by combining stable gene expression of dimerization-dependent fluorescent proteins with single-cell cloning. Using this tool, we uncovered a nutrient-dependent regulatory mechanism of MERCS formation mediated by the ER-localized tethering protein PDZD8. We appreciate that all the reviewers acknowledged the methodological robustness of this work. In response to reviewers' comments, we will significantly improve the manuscript by adding the live-cell imaging to assess the reversible propertyof MERCdRED, and investigating the physiological impacts of MERCS remodeling in regulating metabolism in response to nutrient starvation. We believe that both the methodological advance and the biological findings presented in this study will be of broad interest to the cell biology community.

      1. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: In this study, the authors successfully established a stable cell line expressing MERCdRED, a dimerization-dependent fluorescent protein (ddFP)-based sensor for monitoring mitochondrial-ER contact sites (MERCS). Through light and electron microscopy analyses, they demonstrated that MERCS formation is regulated by nutrient availability and requires PDZD8. While the work is technically sound and well-presented, the biological implications of nutrient-dependent MERCS regulation remain underexplored.

      Major Concerns: Although the manuscript is methodologically robust and suitable for a Methods-type article, its biological significance is limited. The findings primarily serve as proof-of-concept for the MERCdRED tool, without substantially advancing our understanding of MERCS regulation.

      We appreciate the reviewer for acknowledging the methodological robustness of our study. We would like to respectfully emphasize that, using the MERCdRED cell system, we uncovered the distinct features of MERCS dynamics by comparing structures of various sizes (Figure 4A-D). Furthermore, we discovered an unexpected biological finding: nutrient starvation leads to a reduction in MERCS formation, which contrasts with previous reports using cell lines (former Figure 4E-H). Additionally, we revealed that PDZD8 mediates nutrient-dependent MERCS regulation (former Figure 4E-H).

      To clarify these findings, we have now separated the former Figure 4 into two distinct figures (now Figure 4 and 5). Furthermore, to assess the functional relevance of PDZD8-mediated MERCS regulation upon nutritional change, we will perform rescue experiments by overexpressing PDZD8 in starved cells, along with a metabolomic analysis in these conditions. We will add these new data in Figure 6.

      Taken together, we believe that our data provide novel mechanistic insights into how MERCS are modulated and utilized for the regulation of metabolism under physiological stress, thereby contributing to a deeper understanding of the roles and regulation of MERCS beyond the scope of a mere proof-of-concept study.

      Reviewer #1 (Significance (Required)):

      To enhance the impact of the study, the authors could use this sensor to investigate novel biological questions-such as the molecular pathways linking nutrient sensing to MERCS dynamics-or explore downstream activities of nutrient-dependent MERCS formation. Deeper mechanistic insights would significantly strengthen the work's contribution to the field.

      We thank the reviewer for their constructive suggestions. We fully agree that the MERCdRED cell system has great potential for investigating upstream signaling pathways regulating MERCS dynamics, as well as the downstream consequences of nutrient-dependent MERCS modulation. As mentioned above, this study already presents important findings, including the discovery of PDZD8 as a key protein linking the nutrient starvation and MERCS remodeling, and a relationship between MERCS dynamics and contact site size.

      To further assess the biological consequence of the MERCS remodeling, we will perform metabolomics analysis in PDZD8-overexpressing cells under starved conditions.

      Additionally, to further reinforce the utility of MERCdRED and extend the findings presented in this study, we performed live-cell imaging experiments using MERCdRED. The preliminary results demonstrated dynamic and reversible changes in MERCS in response to nutrient starvation and subsequent recovery (Please see the response to Reviewer 3 below, Reviewer-only Figure 1).

      These new data will significantly strengthen the contribution of this study to the field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript entitled "Live-cell imaging reveals nutrient-dependent dynamics of ER-mitochondria contact formation via PDZD8" by Saeko Aoyama-Ishiwatari et al., describes a novel methodology for visualizing contacts between mitochondria and the endoplasmic reticulum (MERCs) by fluorescence microscopy. Inter-organelle contacts, defined as membrane proximities below ~30 nm, fall below the diffraction limit of conventional light microscopy. The method developed by Hirabayashi's laboratory leverages dimerization-dependent fluorescence complementation to create a reporter capable of both visualizing and quantifying ER-mitochondria contacts (MERCs).

      Reviewer #2 (Significance (Required)):

      This timely study provides a valuable and innovative approach to overcoming a longstanding technical limitation in the field, enabling dynamic analysis of ER-mitochondria contacts.

      We appreciate the reviewer for recognizing the timeliness and innovation of our work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors develop a new system to study mitochondrial-ER contact sites in living mouse embryonic fibroblast cells and explore the impact that nutritent starvation has on these contact sites in real time. By stably expressing a bicistroic reporter construct of a dimerization-dependent fluorescent protein that will generate a signal once the two moities, one anchored in the ER by Sec61b, and the other anchored in the outer membrane of mitochondria, via TOM20, comme in close apposition. This cell model is validated using sofisticated CLEM experiments and via the ablation of known regulators of MERCs, such as PDZD8 and FKBP8.

      Major comments

      The authors claim to have developed a new for the study of MERCs. They indeed have benchmarked this system using very sophisticated CLEM approaches and through the ablation of known regulators of MERCs, all of which is very carefully performed and convincing.

      We appreciate the reviewer for acknowledging our efforts in the development and validation of the MERCdRED system presented in this study.

      They argue that the generation of a stable cell line via lentiviral delivery is an improvement over the transient transfection approaches that have been applied in the past (see cited references), which I would generally agree. However, they have not contrasted or compared their system to the widly-used SLPICs system from the Tito Cali group (Vallese, F. et al. An expanded palette of improved SPLICS reporters detects multiple organelle contacts in vitro and in vivo. Nat. Commun. 11, 6069 (2020)) which measures bi and tri-partite interactions with other membrane contacts, including mitochondria and ER at two specific distances, which in my opinion has been more extensivley used to study cell and tissue physiology. They accurately point out that the reversability of this and other systems is challenging and it would be important to define highlight whether the current system allows the study of reversible MERCs. It does not appear as though the reversability of MERCs has been explored in this study.

      We thank the reviewer for these thoughtful suggestions and agree that further investigation into the reversibility of MERCS using the MERCdRED system would be valuable. Following the reviewer's suggestion, we performed a live-cell imaging experiment using MERCdRED to monitor dynamic changes in MERCS in response to nutrient starvation and subsequent recovery. The preliminary results were obtained as shown in Reviewer-only Figure 1, which strongly suggests the utility of MERCdRED for detecting reversible MERCS formation. The data will be added in Figure 5 if the reproducibility is confirmed. This new data set highlights the distinct utility of the MERCdRED system in studying MERCS dynamics.

      We acknowledge that the SPLICS system has been widely adopted for studying membrane contact sites. In the revised manuscript, we will include a comparative discussion of MERCdRED, SPLICS, and other existing MERCS reporters, particularly with respect to their capabilities in capturing the reversible nature of these contacts.

      The genetic (PDZD8, FKBP8) and nutritional (starvation) interventions are very helpful to benchmark the system. The description of the methods and data appear to be reproducible and the stastical analyses are acceptable.

      We thank the reviewer for their positive evaluation of our data and analyses.

      Minor comments

      As mentioned above, it would be helpful to reference and compare the current study in the context of reversability, which the current MERCdRED system has the potential to provide beyond the state-of-the-art.

      We thank the reviewer for this helpful suggestion. We will include additional discussion comparing the reversibility of the MERCdRED system with that of existing tools, highlighting the potential advantages of MERCdRED in capturing dynamic and reversible MERCS.

      Reviewer #3 (Significance (Required)):

      Significance

      The major strength of this study is the development of a stable cell line that allows for the study of MERCs, which has the potential to study the reversible nature of these membrane contact sites. It is debatable as a stable cell line rather than a transient transfection offers a major advancement, even if it does make the study of the system more straightforward, especially if the phenomenon of reversibilty is to be explored.I believe that the CLEM study offers a very informative and precise way to benchmark the ddFP system. Defining how MERC formation and separation (once again the reversibility discovery) have impacts in cell physiology beyond the distances altered by starvation would improve the study. Examining the impact on calcium homeostasis, lipid metabolism, and other aspects of biology that are known to be influenced by MERCs would be interesting. As such, there are no new conceptual, mechanistic, or functional advances, simply minor technical advances in the creation of a stable cell line followed by very solid benchmarking experiments. More complex tri-partite interactions, studied elsewhere, which are conceptually very important for cell and organelle biology, have not been attempted here. Similarly, the notion of studied different types of MERCs, which have been proposed to be important for cell biology, has not been explored using this single reporter. The target audience for this study is one that is interested in membrane contact sites and quantitative biology. My expertise is in mitochondrial fluorescence imaging and biology. I am not an expert in CLEM.

      We thank the reviewer for their thoughtful and detailed comments. We would like to respectfully emphasize that the establishment of a clonal cell line has enabled us to uncover a striking and unexpected biological finding-namely, that nutrient starvation leads to a reduction, rather than an increase, in MERCS formation, and that this change is regulated by PDZD8. This observation directly contradicts previous reports and highlights the value of our robust and quantitative system for re-evaluating previously held assumptions.

      We agree that demonstrating the reversibility of MERCS formation using our system would further strengthen the utility and reliability of the MERCdRED platform. To address this, as mentioned above, we performed a live-cell imaging to assess the dynamic reversibility of MERCS formation (Reviewer-only Figure 1) and will add the results in the revised manuscript.

      We agree that investigation of tri-partite interactions is conceptually important for understanding the broader landscape of organelle communication. However, assessing tri-partite organelle contacts is beyond the scope of this study. We recognize that this is one of the key directions for future studies and believe that the MERCdRED platform is a promising tool for exploring such complex interactions.

      Regarding different types of MERCS, we would like to clarify that our study does address this point to some extent. We identified distinct features of MERCS behavior by comparing structures of different sizes-an aspect that, to our knowledge, has not been previously examined. These findings contribute conceptually to our understanding of the dynamic and heterogeneous nature of ER-mitochondria contacts.

      We believe that our methodological development provides important mechanistic insights into MERCS dynamics, as described above. In line with the reviewer's suggestion, we will investigate the physiological impacts of MERCS remodeling in regulating metabolism in response to nutrient starvation. We hope these forthcoming data will further enhance the biological relevance of our findings.

      Taken together, we believe our study provides both a solid technical advance and novel mechanistic insights into MERCS biology, which will be of interest to researchers working on membrane contact sites, organelle dynamics, and cell physiology.

      We will revise the manuscript to more clearly convey the significance and implications of this study.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #2

      Major points:

      In Figure 1E (and the rest of the manuscript), the meaning of the label "MERCdRED on Mito" is unclear. A portion of the MERCdRED signal does not co-localize with mitochondria. The authors should clearly define what "MERCdRED on Mito" represents which appears to be the intensity of the MERCdRED signal within the mitochondrial mask. How about the global MERCdRED signal intensity? When the authors knocked-out PDZD8, did the global fluorescence intensity of the MERCdRED signal decrease?

      As the reviewer pointed out, some red signals appear outside of mitochondria in MERCdRED cells, which are presumably due to autofluorescence. While the global red channel fluorescence intensity also decreased upon PDZD8 conditional knockout (cKO), as shown in Reviewer-only Figure 2A, the reduction was less pronounced than the decrease observed when only the red signals on mitochondria were measured (Reviewer-only Figure 2B). We consider the mitochondrial red signals to represent MERCdRED signals, and we agree that the label "MERCdRED on Mito" may be misleading. To improve clarity, we revised the figure labels as follows: "MERCdRED" was changed to "Red channel," and "MERCdRED on Mito" was changed to "MERCdRED (Red signals on Mito)."

      1. While the authors demonstrate that MERCdRED can quantify a reduction in MERCs (e.g., in PDZD8 knockout conditions), it would be valuable to assess its sensitivity to increases in MERCs as well. For example, previous work from the authors (Nakamura et al., 2025) showed that FKBP8 overexpression leads to an increase in MERCs.

      We thank the reviewer for suggesting this valuable experiment. To assess whether the dynamic range of MERCdRED covers increased MERCS formation, we overexpressed PDZD8 in MERCdRED cells. Notably, PDZD8 overexpression resulted in a significant increase in MERCdRED signal intensity, demonstrating that the system is indeed capable of detecting enhanced MERCS formation. These new data were added in the revised manuscript as new Figure 3D-E.

      Minor points: 1. Please revise the sentence "First, signals from MERCdRED fluorescence overlapped with the mitochondrial marker Tomm20-iRFP were detected by confocal microscopy in living cells."

      We revised this sentence to "First, fluorescence from MERCdRED and the mitochondrial marker Tomm20-iRFP wasdetected by confocal microscopy in living cells."

      1. Description of analyses that authors prefer not to carry out

      Reviewer #2

      Major points: 1. The authors claim that their construct enables balanced expression of the RA and GB moieties of the reporter. This should be substantiated by showing protein expression levels via Western blot analysis.

      We thank the reviewer for pointing this out. In our system, Tomm20-GB and RA-Sec61β are expressed from a single plasmid using a self-cleaving P2A peptide sequence, which ensures that the two proteins are produced in equimolar amounts upon translation. Therefore, their expression levels are expected to be approximately equal. Given that comparing the expression levels of these two proteins by Western blotting would require extensive work, including obtaining reconstituted proteins to normalize band intensities, but remains inconclusive due to the semi-quantitative nature of the method, we have decided not to pursue this approach.

      Minor points:

      1. In Figure 2, the ER structures are not segmented in the EM images. It would enhance the manuscript to show the three-dimensional spatial relationship between mitochondria and the ER, rather than only highlighting the regions identified as contacts.

      We agree that visualizing the entire ER structure would enhance the reader's understanding of the three-dimensional spatial relationship between mitochondria and the ER. However, complete segmentation of the ER in EM images is extremely labor-intensive. Given the scope and focus of this study, we have decided not to include full ER segmentation in this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, the authors develop a new system to study mitochondrial-ER contact sites in living mouse embryonic fibroblast cells and explore the impact that nutritent starvation has on these contact sites in real time. By stably expressing a bicistroic reporter construct of a dimerization-dependent fluorescent protein that will generate a signal once the two moities, one anchored in the ER by Sec61b, and the other anchored in the outer membrane of mitochondria, via TOM20, comme in close apposition. This cell model is validated using sofisticated CLEM experiments and via the ablation of known regulators of MERCs, such as PDZD8 and FKBP8.

      Major comments

      The authors claim to have developed a new for the study of MERCs. They indeed have benchmarked this system using very sophisticated CLEM approaches and through the ablation of known regulators of MERCs, all of which is very carefully performed and convincing. They argue that the generation of a stable cell line via lentiviral delivery is an improvement over the transient transfection approaches that have been applied in the past (see cited references), which I would generally agree. However, they have not contrasted or compared their system to the widly-used SLPICs system from the Tito Cali group (Vallese, F. et al. An expanded palette of improved SPLICS reporters detects multiple organelle contacts in vitro and in vivo. Nat. Commun. 11, 6069 (2020)) which measures bi and tri-partite interactions with other membrane contacts, including mitochondria and ER at two specific distances, which in my opinion has been more extensivley used to study cell and tissue physiology. They accurately point out that the reversability of this and other systems is challenging and it would be important to define highlight whether the current system allows the study of reversible MERCs. It does not appear as though the reversability of MERCs has been explored in this study. The genetic (PDZD8, FKBP8) and nutritional (starvation) interventions are very helpful to benchmark the system. The description of the methods and data appear to be reproducible and the stastical analyses are acceptable.

      Minor comments

      As mentioned above, it would be helpful to reference and compare the current study in the context of reversability, which the current MERCdRED system has the potential to provide beyond the state-of-the-art.

      Significance

      The major strength of this study is the development of a stable cell line that allows for the study of MERCs, which has the potential to study the reversible nature of these membrane contact sites. It is debatable as a stable cell line rather than a transient transfection offers a major advancement, even if it does make the study of the system more straightforward, especially if the phenomenon of reversibilty is to be explored. I believe that the CLEM study offers a very informative and precise way to benchmark the ddFP system. Defining how MERC formation and separation (once again the reversibility discovery) have impacts in cell physiology beyond the distances altered by starvation would improve the study. Examining the impact on calcium homeostasis, lipid metabolism, and other aspects of biology that are known to be influenced by MERCs would be interesting. As such, there are no new conceptual, mechanistic, or functional advances, simply minor technical advances in the creation of a stable cell line followed by very solid benchmarking experiments. More complex tri-partite interactions, studied elsewhere, which are conceptually very important for cell and organelle biology, have not been attempted here. Similarly, the notion of studied different types of MERCs, which have been proposed to be important for cell biology, has not been explored using this single reporter. The target audience for this study is one that is interested in membrane contact sites and quantitative biology.

      My expertise is in mitochondrial fluorescence imaging and biology. I am not an expert in CLEM.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript entitled "Live-cell imaging reveals nutrient-dependent dynamics of ER-mitochondria contact formation via PDZD8" by Saeko Aoyama-Ishiwatari et al., describes a novel methodology for visualizing contacts between mitochondria and the endoplasmic reticulum (MERCs) by fluorescence microscopy. Inter-organelle contacts, defined as membrane proximities below ~30 nm, fall below the diffraction limit of conventional light microscopy. The method developed by Hirabayashi's laboratory leverages dimerization-dependent fluorescence complementation to create a reporter capable of both visualizing and quantifying ER-mitochondria contacts (MERCs).

      Major points:

      1. The authors claim that their construct enables balanced expression of the RA and GB moieties of the reporter. This should be substantiated by showing protein expression levels via Western blot analysis.
      2. In Figure 1E (and the rest of the manuscript), the meaning of the label "MERCdRED on Mito" is unclear. A portion of the MERCdRED signal does not co-localize with mitochondria. The authors should clearly define what "MERCdRED on Mito" represents which appears to be the intensity of the MERCdRED signal within the mitochondrial mask. How about the global MERCdRED signal intensity? When the authors knocked-out PDZD8, did the global fluorescence intensity of the MERCdRED signal decrease?
      3. While the authors demonstrate that MERCdRED can quantify a reduction in MERCs (e.g., in PDZD8 knockout conditions), it would be valuable to assess its sensitivity to increases in MERCs as well. For example, previous work from the authors (Nakamura et al., 2025) showed that FKBP8 overexpression leads to an increase in MERCs.

      Minor points:

      1. Please revise the sentence "First, signals from MERCdRED fluorescence overlapped with the mitochondrial marker Tomm20-iRFP were detected by confocal microscopy in living cells."
      2. In Figure 2, the ER structures are not segmented in the EM images. It would enhance the manuscript to show the three-dimensional spatial relationship between mitochondria and the ER, rather than only highlighting the regions identified as contacts.

      Significance

      This timely study provides a valuable and innovative approach to overcoming a longstanding technical limitation in the field, enabling dynamic analysis of ER-mitochondria contacts.

      Expertise: cell biology, membrane contact sites, lipid transfer proteins

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors successfully established a stable cell line expressing MERCdRED, a dimerization-dependent fluorescent protein (ddFP)-based sensor for monitoring mitochondrial-ER contact sites (MERCS). Through light and electron microscopy analyses, they demonstrated that MERCS formation is regulated by nutrient availability and requires PDZD8. While the work is technically sound and well-presented, the biological implications of nutrient-dependent MERCS regulation remain underexplored.

      Major Concerns:

      Although the manuscript is methodologically robust and suitable for a Methods-type article, its biological significance is limited. The findings primarily serve as proof-of-concept for the MERCdRED tool, without substantially advancing our understanding of MERCS regulation.

      Significance

      To enhance the impact of the study, the authors could use this sensor to investigate novel biological questions-such as the molecular pathways linking nutrient sensing to MERCS dynamics-or explore downstream activities of nutrient-dependent MERCS formation. Deeper mechanistic insights would significantly strengthen the work's contribution to the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Major points

      • *

        • The introduction describes the effects of different environmental cues and aging on fibroblast phenotype, but it would be good to note the developmental origins of dermal fibroblasts, which specifies their fate and function (Driskell et al, Nature 2013).* Our response:

      In accordance with the reviewers' suggestions, we have incorporated a summary of prior research regarding the developmental origins of dermal fibroblasts into lines 53–56 of the Introduction.

      • In Fig 2, how do TEWL measurements compare to constructs without an epidermal layer or human skin? It may seem obvious that barrier function would be negligible in these models, but it would be a helpful negative control for interpreting the relative effects of vasculature on barrier function.*

      We appreciate your valuable comments regarding the accurate interpretation of TEWL measurements. Estimated TEWL values for human skin have been reported in a systematic review and meta-analysis by Kottner et al. Specifically, the estimated TEWL (95% CI) for individuals aged 18–64 years varies by anatomical site: 15.4 (13.9–17.0) g/m²h for the right cheek, 6.5 (6.2–6.8) g/m²h for the midvolar right forearm, and 36.3 (29.5–43.1) g/m²h for the right palm. In comparison, the TEWL of our EDV model was 9.68 g/m²h, a value relatively close to that of human skin.

      We also considered measuring TEWL in artificial skin models lacking epidermis. However, we found that such models remain moist due to culture medium, and pressing the measurement probe against them risks water droplets adhering to the sensor and causing damage. Although we recognize the significance of this measurement as a negative control, we refrained from conducting it due to the limitations of the equipment.

      This information has been added to the Results section, lines 178–182.

      • *

      • The mechanical measurements in Fig 2 are a nice idea, but it is a bit difficult to interpret without comparison to other conditions (e.g. human skin) or by reporting more universal mechanical parameters (e.g. Young's modulus).*

      We greatly appreciate your insightful comments regarding the interpretation of skin viscoelasticity measurements using the Cutometer. The Cutometer is a device that applies negative pressure to the skin to elevate its surface, allowing for the calculation of biomechanical properties based on the temporal changes in skin displacement. Notably, the R7 parameter—defined as the ratio of immediate retraction after pressure release to the maximum deformation during suction—has been shown to correlate significantly with age.

      In this study, we evaluated HSEs under the same measurement conditions as those used in previous human clinical studies. Accordingly, we have cited past Cutometer data for human skin and discussed the relationship between those findings and our HSE measurements. These revisions have been made to lines 205–215.

      We determined that performing Cutometer measurements on human skin would be impractical due to the ethical committee procedures and associated costs. Although evaluating Young’s modulus using techniques such as AFM to assess the mechanical properties of collagen fibers is a fascinating and informative approach, we have opted not to pursue this analysis due to the substantial time and cost required for sample preparation.

      • The induction of region-specific fibroblast markers is interesting and a bit unexpected since all the fibroblasts came from the same source before seeding into HSEs. The conclusions require additional support from quantification of the IF staining in Fig 3.*

      Our response:

      Thank you for your valuable advice on strengthening the conclusion of our manuscript. We are currently conducting quantitative analysis through manual counting across multiple fields for all mesenchymal cell markers and Vimentin immunostaining data presented in Fig. 3.

      • *

      • Likewise, could the authors clarify whether the cells were passaged before seeding into the HSE, and if so, what passage number. Could passaging affect the responses observed? Please add a discussion point about this.*

      Our response:

      For all cell types, passage 4 or 5 cells were utilized for the reconstitution of human skin equivalents (HSE). Indeed, Philippeos et al. demonstrated that while CD39, CD90, and CD36 are detectable in primary CD31⁻CD45⁻Ecad⁻ dermal cells, the expression of CD39 is lost after a single passage. In contrast, CD90 and CD36 remain detectable for up to four passages. These findings underscore the impact of in vitro culture on the depletion of fibroblast marker expression. Since we employed NHDFs that had undergone four to five passages for HSE reconstruction, it is reasonable to assume that these cells had already lost specific fibroblast subpopulations, including CD39⁺ cells. Consistent with this, our scRNA-seq analysis revealed that most fibroblasts cultured in 2D formed an artificial population comprising cells in the S and G2M phases, along with secretory-reticular fibroblasts. Additionally, immunohistochemical analysis confirmed a near-complete absence of CD39⁺, CD90⁺, FAP⁺, NG2⁺, and αSMA⁺ cells in the dermis of both D and DV models, further indicating that serial passaging significantly reduces the expression of markers associated with papillary fibroblasts, reticular fibroblasts, and pericytes. Interestingly, the introduction of vascular endothelial cells into the HSE appears to facilitate a partial restoration of fibroblast heterogeneity in cells passaged four to five times. However, whether this effect can be replicated in more extensively passaged fibroblasts remains to be verified. It is well established that excessive passaging induces cellular senescence, leading to reduced proliferative and differentiation capacities in mesenchymal stem cells. Therefore, it is conceivable that fibroblasts beyond a certain passage number may fail to recapitulate dermal mesenchymal cell heterogeneity, even in the presence of endothelial cells.

      We have added this discussion to the revised manuscript on lines 372-385, 391–397, and 470-471. However, due to the prolonged culture period required, we regret that we are unable to perform the additional validation experiments at this time.

      • The scRNA-seq suggests that the in vitro populations do not discriminate between secretory papillary and pro-inflammatory fibroblasts. Could the authors add some further analysis or discussion regarding this point?*

      Our response:

      We are currently conducting an additional enrichment analysis on fibroblast subpopulations #0, 1, 2, 6, 8, and 11, identified through UMAP analysis integrating HSE and human skin datasets. We believe that this analysis will elucidate the functional characteristics of each in vitro subpopulation and enable us to speculate on the underlying factors contributing to the observed differences from the human skin analysis results.

      • In Fig 6, it will be important to add quantification of epidermal thickness and differentiation marker expression to support the conclusions.*

      Our response:

      Thank you for your valuable advice regarding quantitative analysis. We are currently measuring the thickness of the entire epidermal layer, the CK5-positive cell layer, and the CK10-positive cell layer based on HE-stained and IHC-stained images.

      • A key question is how NP and AA conditions affect the fibroblast populations as this seems to be a key factor in HSE maturation and would then link back to the previous sections. It would be good to stain for fibroblast markers in these samples.*

      Our response:

      We are grateful for your insightful comments, which are crucial for a more precise understanding of the physiological relevance of the NP culture model. In response, we are currently undertaking additional analyses to investigate the expression patterns of dermal mesenchymal markers under both NP and AA conditions.

      • As noted above, the ability of the vasculature to direct differentiation of a common fibroblast population into different phenotypes is one of the key findings of the study. To strengthen these observations, could additional analysis of the transcriptional data be possible. For example, would trajectory analysis potentially show how the different populations are evolving or related? In addition, could the CellChat analysis be performed between the vasculature and the different populations in Fig 5, which are mapped to in vivo populations? This might be a more relevant analysis than the populations in Fig 4.*

      Our response:

      As pointed out by reviewers, we acknowledge that elucidating the process and underlying mechanisms by which fibroblasts, whose heterogeneity is compromised in 2D culture, re-differentiate into distinct dermal mesenchymal subtypes constitutes a critical additional analysis to strengthen our findings. Accordingly, we are currently conducting trajectory analysis using Monocle3. This includes identifying branch points that regulate the differentiation of dermal mesenchymal clusters shown in Fig. 4b, as well as predicting transcription factors and cell signaling pathways playing pivotal roles at those branch points. Furthermore, we are planning a CellChat analysis between vascular endothelial cells and dermal mesenchymal cells. We anticipate that integrating the results of these two analyses will provide valuable insights into the differentiation processes of dermal mesenchymal cells, particularly the induction of perivascular cell differentiation.

      • *

      • *

      Reviewer #1

      Minor points

      • *

        • The abstract states that enabling in vitro evaluation of drug efficacy using methodologies that are identical to those used in human clinical studies. This seems to be an over interpretation of the study and not well supported by the data. Please consider revising or removing.*

      Our Response:

      Upon thorough consideration, we have deleted the statements that may be regarded as exaggerated (line 26-28 and 346-348).

      • Check referencing formatting in lines 118-121*

      Our Response:

      We appreciate your attention to the reference format error. The necessary revisions have been completed.

      Reviewer #2 Major comments:

        • Despite its strengths, the study has several limitations that warrant further investigation. The authors describe a "senescent-like" phenotype under nutrient-poor (NP) conditions, yet do not provide direct evidence of cellular senescence using canonical markers such as SA-β-gal staining, p16^INK4a or p21 expression, or SASP profiling-weakening their aging-related conclusions.*

      Our Response

      Thank you for your valuable advice, which has helped clarify the physiological phenomena modeled by the NP condition. We are planning additional experiments involving histological analysis, including SA-β-gal staining and the detection of p16^INK4a and/or p21.

      • The 500 μM dose of ascorbic acid (AA), while within the reported range for skin models, is at the higher end compared to commonly used concentrations (100-300 μM) and lacks justification via dose/response data. Normal physiological levels and changes in aging dermis should be referenced in discussion. AA is also an additive in their standard HSE media, but this was not sufficiently emphasized to draw attention. Would its removal from the baseline media make a difference?*

      Our Response

      We sincerely appreciate the important comment regarding the rationale behind the ascorbic acid concentration used in the culture medium. As Reviewer 3 rightly pointed out, concentrations around 100–300 μM are commonly employed in general in vitro assays. In our artificial skin model, we opted for a concentration of 500 μM AA in the growth medium based on two considerations: (1) the model contains a high cell density of approximately 4 × 10⁶ cells immediately after reconstruction, which is expected to result in substantial AA consumption, and (2) AA is not sufficiently stable in culture medium. Given the relatively long medium exchange interval of 48–72 hours, we deemed it necessary to maintain a certain AA level throughout this period. While no rigorous dose–response validation has been conducted, we have confirmed that this concentration does not induce toxicity or abnormalities in skin morphogenesis.

      As part of the revision, we considered revisiting the basal medium formulation; however, due to the significant time and resource demands, we have decided to forgo further optimization at this stage.

      As described on lines 307–311, the NP medium was formulated to evaluate the potential impact of age-related declines in plasma component transport. We apologize for any confusion regarding the relationship between the HSE growth medium and the NP medium. In response to the reviewer’s suggestion, we have added clarifying explanations and cautionary notes regarding the composition and rationale of these two media in both the Results and Methods sections (line 307-311 and 634-636).

      • Mechanistically, fibroblast heterogeneity is attributed to keratinocyte and vascular signals, but the signaling pathways involved (e.g., Wnt, TGF-β, VEGF) are not directly examined. Validating which paracrine factors (VEGF, PDGF, LAMA5, KGF) are mediating fibroblast transitions using inhibitors or RNA profiling could shed more light.*

      Our response:

      As pointed out by reviewers, we acknowledge that elucidating the process and underlying mechanisms by which fibroblasts, whose heterogeneity is compromised in 2D culture, re-differentiate into distinct dermal mesenchymal subtypes constitutes a critical additional analysis to strengthen our findings. Accordingly, we are currently conducting trajectory analysis using Monocle3. This includes identifying branch points that regulate the differentiation of dermal mesenchymal clusters shown in Fig. 4b, as well as predicting transcription factors and cell signaling pathways playing pivotal roles at those branch points. Furthermore, we are planning a CellChat analysis between vascular endothelial cells and dermal mesenchymal cells. We anticipate that integrating the results of these two analyses will provide valuable insights into the differentiation processes of dermal mesenchymal cells, particularly the induction of perivascular cell differentiation. We fully recognize that validation using specific inhibitors is crucial to substantiate the mechanisms suggested by the scRNA-seq analysis. However, given that the reconstruction and reanalysis of the artificial skin model requires more than three months, we have decided not to include these experiments in the current revision and instead consider them as important subjects for future investigation.

      Minor comments: 1. The role of pericytes is also underexplored; while their presence is confirmed, functional assays or transcriptomic analyses to elucidate their contribution to ECM remodeling or vascular stability are not fully explored. The origin of pericyte-like cells remains uncertain without lineage tracing or barcoding to distinguish whether they derive from fibroblasts, endothelial cells, or culture artifacts. Since they observe induced differentiation of fibroblast-like cells in 3D culture, it would be compelling to reconstruct differentiation trajectories (pseudotime analysis) from progenitor states to papillary/reticular/pericyte-like states from their scRNAseq data.

      Our respnse:

      This point will be addressed and validated through our response to Major Comment 3 from Reviewer #2.

      • Although AA enhanced collagen production and elasticity in the vascularized EDV model, the lack of response in the ED model is not addressed mechanistically.*

      Our response

      We have planned additional experiments to examine two hypotheses regarding the mechanism underlying the improved responsiveness of the EDV model to AA. The first hypothesis posits that the behavior of ascorbic acid uptake in the cells constituting the EDV model differs from that in the ED model. To investigate this, we plan to analyze the expression patterns of transporter genes potentially involved in the uptake and efflux of ascorbic acid, such as SVCT1 (SLC23A1), SVCT2 (SLC23A2), GLUT1 (SLC2A1), GLUT3 (SLC2A3), GLUT4 (SLC2A4), and MRP4, using scRNA-seq data. The second hypothesis suggests that the absence of bFGF signaling and low FBS treatment under NP conditions may affect subpopulations of dermal mesenchymal cells in the HSEs. To test this, we plan to analyze the expression patterns of dermal mesenchymal cell markers by IHC under NP and AA conditions, following the same approach as shown in Fig. 3.

      • The omission of immune cells which are key players in skin aging and homeostasis could increase physiological relevance of the model.*

      Our response:

      As rightly noted by Reviewer 2, immune cells are integral to skin aging and the maintenance of tissue homeostasis, underscoring the necessity of incorporating them into future research models. Nonetheless, the primary aim of the present study is to elucidate the influence of vascular endothelial cells on dermal mesenchymal cell heterogeneity and to establish an in vitro research model specifically addressing this heterogeneity, with particular emphasis on perivascular cells. Accordingly, we would prefer to consider the analysis of immune cells as a subject for future investigation.

      • The exclusive use of standard HUVECs may not fully capture the behavior of tissue-specific microvascular endothelial cells, potentially limiting the fidelity of the vascular niche.*

      In this study, we opted to use HUVECs as vascular endothelial cells due to their relative ease of expansion in culture. Consequently, we acknowledge the potential limitation in fully recapitulating the functions of tissue-specific endothelial cells. To address this concern, we have revised and expanded the Discussion section on lines 352–356.

      Reviewer #3 Major comments:

        • Are the key conclusions convincing? The core claim-that tricellular interactions recapitulate dermal mesenchymal heterogeneity and enhance skin functionality-is well-supported by histology, immunohistochemistry, functional assays (TEWL, elasticity), and scRNA-seq.
      1. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? The assertion that HSEs enable "identical" methodology to clinical studies (p. 2, line 29) is exaggerated. While elasticity was measured via Cutometer (used clinically), the model lacks immune/neural components and long-term stability for full translational equivalence.* Our Response:

      Upon thorough consideration, we have deleted the statements that may be regarded as exaggerated (line 26-28 and 346-348).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Adequacy of Experimental Evidence & Need for Additional Experiments: No essential control appears to be missing: the authors include conditions {plus minus}ascorbic acid and {plus minus}vascular cells to isolate those effects. One could suggest a few additional experiments to further bolster the conclusions, but they are not strictly required for the main message. For example, to pinpoint the contribution of each mesenchymal subset, the authors could engineer HSE variants lacking one component at a time (omit pericytes or use only papillary vs. only reticular fibroblasts) to see how each omission affects barrier or elasticity. This would directly confirm each cell type's role. However, such experiments may be technically involved (especially isolating pure papillary vs. reticular fibroblast populations and ensuring viability in 3D culture) and might be beyond the scope of a single study. Another possible extension could be mechanistic assays, such as examining specific molecular signals: e.g., testing if blocking known paracrine factors from pericytes or fibroblast subsets diminishes the observed improvements. Given that pericytes can secrete laminin-511 and other factors that promote keratinocyte growth, the authors might, in future work, explore whether such factors mediate the enhanced epidermal proliferation seen with the vascularized HSE. Overall, the current data are sufficiently convincing that additional experiments are not absolutely necessary for publication.
      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments-*

      Our response

      We are deeply grateful for the reviewer’s constructive feedback. As rightly pointed out, cell ablation and mechanistic assays utilizing signaling inhibitors to assess the contribution of individual mesenchymal subsets are indispensable for reinforcing our findings and claims. However, as the reviewer has also indicated, these experiments would require no less than four months to complete. Consequently, we have opted to forgo high-cost additional experiments such as the optimization of HSE construction protocols and inhibitor-based assays. Instead, we are proactively conducting mechanism-oriented analyses using our existing scRNA-seq and histological datasets. Specifically, we are currently implementing an integrated approach combining Monocle3 and CellChat to pinpoint critical branch points in dermal mesenchymal cell differentiation and to elucidate the signaling pathways orchestrating these bifurcations.

      • Are the experiments adequately replicated and statistical analysis adequate? The manuscript's data are presented in a manner that generally supports reproducibility. The authors state that all data are presented as "mean {plus minus} SD" (Methods, p.36). This is acceptable and clearly reported. However, I suggest that the authors consider using mean {plus minus} SEM for specific datasets where the primary goal is to assess statistical significance between groups - for example, for the Ki67-positive cell proliferation data (Fig. 6c) - as SEM better reflects the precision of the group mean for inferential comparisons. In contrast, for functional measures that inherently exhibit biological variation across samples (e.g., TEWL, skin elasticity), using mean {plus minus} SD remains fully appropriate, as SD reflects true inter-sample variability. To improve clarity and reproducibility, I encourage the authors to briefly state in the Methods or figure legends why SD or SEM is used in each case, in line with best practice guidelines.*

      Our Response:

      We appreciate your guidance regarding appropriate statistical analysis and data presentation. We planned to revise the depiction of error margins in accordance with best practice guidelines.

      Reviewer #3 Minor comments: 1. For Figure 4e, it would be helpful if the authors could clarify in the figure legend or Methods whether the heatmap shows log-normalized expression values (as derived from the Seurat object) or z-scored expression across cells or samples. This distinction affects the interpretation of relative versus absolute expression levels of the collagen and elastic fiber-related genes, which are central to the study's conclusions about ECM remodeling.

      Our response:

      Thank you for pointing out the inconsistency in data representation. We have revised the manuscript to clearly indicate that Fig. 4e presents the Z-score normalized average expression levels.

      • Typos: "factr" → "factor" (p. 16, line 244); "severl" → "several" (p. 22, line 367).

      *

      Our response

      Thanks for pointing out the typo, we have corrected it.

      Reviewer #4

      Minor Points:

        • The human skin control in Fig. 1c seems thinner than normal and would suggest that the ED and EDV models are hyperproliferative. Replacing the control with one that shows normal thickness would prevent incorrect conclusions of the data.* Our response:

      In accordance with the reviewer’s suggestion, the display area of the human skin image in Fig. 1c has been modified.

      KI67 and TEWL readings for human skin as controls for Fig. 2b-c would help gauge how the organoids perform and whether they are abnormal. What is the elasticity index for facial sagging?

      Thank you for your valuable advice, which has deepened our understanding of the evaluation results of HSEs. We are currently planning and conducting an additional analysis by including the quantification of Ki67-positive cells in human skin samples. Regarding the assessment of skin barrier and viscoelasticity using TEWL and Cutometer measurements, we have reffered data from previous clinical studies and added an explanation of the functional differences between HSEs and human skin.

      • Ascorbic acid utilizes SLC23A1 and SLC23A2 to transport across cell membranes. Are their expression more pronounced in cluster 14 fibroblasts? This would help connect the scRNA-seq data to the ascorbic acid experiments.

      *

      Our response:

      We appreciate the valuable suggestions provided to investigate the mechanisms underlying the altered VC responsiveness observed in the EDV model. We plan to analyze the expression patterns of transporter genes potentially involved in the uptake and efflux of ascorbic acid, such as SVCT1 (SLC23A1), SVCT2 (SLC23A2), GLUT1 (SLC2A1), GLUT3 (SLC2A3), GLUT4 (SLC2A4), and MRP4, using scRNA-seq data.

      There seems to be quite a bit of variability between replicant immunostains, in particular, vimentin in Fig. 3. Can the authors discuss this variability and whether any of the HSE organoid combinations reduced this variability?

      Our response:

      Thank you for your comments regarding the immunostaining. A reanalysis of the data, including newly acquired immunostaining images during the revision process, is planned.

      • Please provide number of replicates throughout figure legends.*

      Our response:

      Thank you for your valuable advice. We have added the number of replicates to all figure legends.

      • Line 148 states "E and EV models were transparent and extremely soft", should read "E and ED models".*

      Our response:

      The photographic data for the EV and ED models in Fig. 1b was incorrect and has therefore been corrected. We sincerely apologize for our oversight. As it was actually the E and EV models that appeared transparent, the description in the text remains unchanged.

      • Line 150-151 states "In the E and EV models, an abnormal epidermis lacking a basal cell layer formed". The Krt5 staining in Figure 2 clearly shows a basal cell layer in these models, albeit abnormal. Stating that this the abnormal epidermis displayed a disrupted basal cell layer or columnar shape of basal cells were disrupted is more appropriate. In addition, these results do not show "crosstalk between NHEKs and NHDFs is essential for epithelialization" as the E and EV organoid models show epithelial stratification.*

      Our response:

      We sincerely appreciate your insightful guidance regarding the accurate presentation of the histological analysis results. Accordingly, we have revised lines 154–156 in the Results section in line with your recommendations.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The manuscript by Kimura et al. define how epidermal morphogenesis in human skin equivalents (HSE) differ by combining vascular endothelial cells, epidermal keratinocytes, and dermal fibroblasts using staining and single-cell RNA-sequencing (scRNA-seq). The three cell system (EDV) displayed higher levels of Ki67+ cells, decreased levels of TEWL, and higher elasticity in comparison to the keratinocyte and fibroblast HSE system (ED). The overall structural morphology between the two systems is quite similar, though the expression of cytokeratin markers varies. EDV organoids specifically express COL1 and COL4 collagen markers surrounding the blood vessels. VEGF-VEGFR1 signaling between endothelia-fibroblasts seems to be pronounced in the EDV organoids according to scRNA-seq, suggesting active signaling between these two cell types. And ascorbic acid appeared to help nutrient poor ED and EDV organoids proliferate compared to controls. This work is well detailed and interesting, helping to define how endothelial cells function to make HSE organoids more faithfully mimic in vivo human skin. Only minor clarifications detailed below are needed.

      1. The human skin control in Fig. 1c seems thinner than normal and would suggest that the ED and EDV models are hyperproliferative. Replacing the control with one that shows normal thickness would prevent incorrect conclusions of the data.
      2. KI67 and TEWL readings for human skin as controls for Fig. 2b-c would help gauge how the organoids perform and whether they are abnormal. What is the elasticity index for facial sagging?
      3. Ascorbic acid utilizes SLC23A1 and SLC23A2 to transport across cell membranes. Are their expression more pronounced in cluster 14 fibroblasts? This would help connect the scRNA-seq data to the ascorbic acid experiments.
      4. There seems to be quite a bit of variability between replicant immunostains, in particular, vimentin in Fig. 3. Can the authors discuss this variability and whether any of the HSE organoid combinations reduced this variability?
      5. Please provide number of replicates throughout figure legends.
      6. Line 148 states "E and EV models were transparent and extremely soft", should read "E and ED models".
      7. Line 150-151 states "In the E and EV models, an abnormal epidermis lacking a basal cell layer formed". The Krt5 staining in Figure 2 clearly shows a basal cell layer in these models, albeit abnormal. Stating that this the abnormal epidermis displayed a disrupted basal cell layer or columnar shape of basal cells were disrupted is more appropriate. In addition, these results do not show "crosstalk between NHEKs and NHDFs is essential for epithelialization" as the E and EV organoid models show epithelial stratification.

      Significance

      This work is well detailed and interesting, helping to define how endothelial cells function to make HSE organoids more faithfully mimic in vivo human skin. Only minor clarifications detailed below are needed.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The study develops a tricellular human skin equivalent (HSE) model incorporating epidermal keratinocytes (NHEKs), dermal fibroblasts (NHDFs), and vascular endothelial cells (HUVECs). This model autonomously organizes pericytes, papillary fibroblasts, and reticular fibroblasts, mimicking in vivo dermal mesenchymal heterogeneity. The EDV model (all three cell types) demonstrates enhanced epidermal barrier function (reduced TEWL), dermal elasticity, collagen deposition, and vascular organization compared to simpler models. Single-cell RNA-seq confirms the emergence of pericyte-like and fibroblast subpopulations resembling in vivo counterparts. Nutrient-poor (NP) culture replicates aging phenotypes (reduced proliferation, barrier dysfunction, disordered collagen), rescued by ascorbic acid (AA), highlighting vascular cells' role in skin homeostasis. However, several key methodological clarifications (e.g., heatmap normalization, statistical reporting), more precise qualification of certain claims, and enhanced contextualization within the literature are needed before the work can be considered suitable for publication; I therefore recommend major revision.

      Major comments:

      1. Are the key conclusions convincing?<br /> The core claim-that tricellular interactions recapitulate dermal mesenchymal heterogeneity and enhance skin functionality-is well-supported by histology, immunohistochemistry, functional assays (TEWL, elasticity), and scRNA-seq.
      2. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? The assertion that HSEs enable "identical" methodology to clinical studies (p. 2, line 29) is exaggerated. While elasticity was measured via Cutometer (used clinically), the model lacks immune/neural components and long-term stability for full translational equivalence.
      3. Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Adequacy of Experimental Evidence & Need for Additional Experiments: No essential control appears to be missing: the authors include conditions {plus minus}ascorbic acid and {plus minus}vascular cells to isolate those effects. One could suggest a few additional experiments to further bolster the conclusions, but they are not strictly required for the main message. For example, to pinpoint the contribution of each mesenchymal subset, the authors could engineer HSE variants lacking one component at a time (omit pericytes or use only papillary vs. only reticular fibroblasts) to see how each omission affects barrier or elasticity. This would directly confirm each cell type's role. However, such experiments may be technically involved (especially isolating pure papillary vs. reticular fibroblast populations and ensuring viability in 3D culture) and might be beyond the scope of a single study. Another possible extension could be mechanistic assays, such as examining specific molecular signals: e.g., testing if blocking known paracrine factors from pericytes or fibroblast subsets diminishes the observed improvements. Given that pericytes can secrete laminin-511 and other factors that promote keratinocyte growth, the authors might, in future work, explore whether such factors mediate the enhanced epidermal proliferation seen with the vascularized HSE. Overall, the current data are sufficiently convincing that additional experiments are not absolutely necessary for publication.
      4. Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments

      5. Are the data and the methods presented in such a way that they can be reproduced? Yes
      6. Are the experiments adequately replicated and statistical analysis adequate? The manuscript's data are presented in a manner that generally supports reproducibility. The authors state that all data are presented as "mean {plus minus} SD" (Methods, p.36). This is acceptable and clearly reported. However, I suggest that the authors consider using mean {plus minus} SEM for specific datasets where the primary goal is to assess statistical significance between groups - for example, for the Ki67-positive cell proliferation data (Fig. 6c) - as SEM better reflects the precision of the group mean for inferential comparisons. In contrast, for functional measures that inherently exhibit biological variation across samples (e.g., TEWL, skin elasticity), using mean {plus minus} SD remains fully appropriate, as SD reflects true inter-sample variability. To improve clarity and reproducibility, I encourage the authors to briefly state in the Methods or figure legends why SD or SEM is used in each case, in line with best practice guidelines.

      Minor comments:

      1. For Figure 4e, it would be helpful if the authors could clarify in the figure legend or Methods whether the heatmap shows log-normalized expression values (as derived from the Seurat object) or z-scored expression across cells or samples. This distinction affects the interpretation of relative versus absolute expression levels of the collagen and elastic fiber-related genes, which are central to the study's conclusions about ECM remodeling.
      2. Typos: "factr" → "factor" (p. 16, line 244); "severl" → "several" (p. 22, line 367).

      Significance

      The study innovatively reconstructs dermal mesenchymal heterogeneity using commercially available cells and autonomous tricellular interactions, bypassing costly cell-sorting approaches. This democratizes complex HSE models for broader labs. This study demonstrates that vascularization is critical not only for nutrient supply but for instructing fibroblast/pericyte differentiation and ECM organization. The NP+AA paradigm (Fig. 6) offers a facile in vitro model for skin aging interventions, highlighting AA's efficacy via perivascular mechanisms.

      Audience: Tissue engineers, dermatologists, cosmetic/pharma researchers (anti-aging screening), and developmental biologists studying mesenchymal niche regulation.

      Placement in existing literature: Recent advances in skin tissue engineering have highlighted the importance of dermal fibroblast heterogeneity in skin homeostasis and regeneration. Single-cell transcriptomic studies (Tabib et al., J Invest Dermatol 2018; Solé-Boldo et al., Commun Biol 2020) have established that papillary and reticular fibroblasts exhibit distinct gene expression and functional roles. Prior engineered skin models incorporating fibroblast subtypes (Moreira et al., Biomater Sci 2023) or pericytes (Paquet-Fifield et al., J Clin Invest 2009) demonstrated improvements in vascularization or epidermal differentiation. However, a unified 3D human skin equivalent integrating vascular cells, pericytes, and spatially organized fibroblast subpopulations has not been systematically achieved. The present work by Kimura et al. advances the field by demonstrating that autonomous interaction among keratinocytes, endothelial cells, pericytes, and heterogeneous fibroblasts significantly enhances both barrier function and dermal elasticity, thus bringing engineered skin models closer to physiological skin. This addresses a key gap between prior single-cell descriptive studies and functional tissue engineering.

      Define your field of expertise with a few keywords: experimental dermatology, skin cancer, tissue engineering and 3D skin models, cell biology, tumor microenvironment, and the skin microbiome and barrier function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors present a novel and well-executed approach to reconstructing human skin equivalents (HSEs) that more faithfully replicate the functional complexity of native skin by incorporating the natural heterogeneity of dermal mesenchymal cells, including spatially organized pericytes, papillary fibroblasts, and reticular fibroblasts. Through autonomous interactions among keratinocytes, fibroblasts, and vascular endothelial cells, the fully tricellular EDV model emerged as the most functionally complete among seven engineered HSE variants, demonstrating enhanced epithelialization, barrier integrity, dermal elasticity, and angiogenic architecture. The study's strengths lie in its realistic aging induction via nutrient deprivation by mimicking aspects of vascular insufficiency in the papillary dermis, and its integration of diverse and rigorous evaluation methods, including histological and molecular analyses (Ki67, ECM markers), barrier function (TEWL), and mechanical testing. Notably, ascorbic acid treatment improved epidermal turnover and extracellular matrix organization, particularly through effects on perivascular niche cells, highlighting its translational relevance for anti-aging interventions. Although the EDV model showed superior elasticity via suction testing, more comprehensive mechanical characterization and longitudinal ECM analysis could further elucidate how mesenchymal heterogeneity supports biomechanical resilience. Overall, this work underscores the importance of multicellular crosstalk in skin physiology and positions the EDV model as a robust in vitro platform with high relevance for regenerative medicine, aging research, and therapeutic screening, offering the potential to eliminate animal models in skin biology.

      Major comments:

      Despite its strengths, the study has several limitations that warrant further investigation. The authors describe a "senescent-like" phenotype under nutrient-poor (NP) conditions, yet do not provide direct evidence of cellular senescence using canonical markers such as SA-β-gal staining, p16^INK4a or p21 expression, or SASP profiling-weakening their aging-related conclusions.

      The 500 μM dose of ascorbic acid (AA), while within the reported range for skin models, is at the higher end compared to commonly used concentrations (100-300 μM) and lacks justification via dose/response data. Normal physiological levels and changes in aging dermis should be referenced in discussion. AA is also an additive in their standard HSE media, but this was not sufficiently emphasized to draw attention. Would its removal from the baseline media make a difference? Mechanistically, fibroblast heterogeneity is attributed to keratinocyte and vascular signals, but the signaling pathways involved (e.g., Wnt, TGF-β, VEGF) are not directly examined. Validating which paracrine factors (VEGF, PDGF, LAMA5, KGF) are mediating fibroblast transitions using inhibitors or RNA profiling could shed more light.

      Minor comments:

      The role of pericytes is also underexplored; while their presence is confirmed, functional assays or transcriptomic analyses to elucidate their contribution to ECM remodeling or vascular stability are not fully explored. The origin of pericyte-like cells remains uncertain without lineage tracing or barcoding to distinguish whether they derive from fibroblasts, endothelial cells, or culture artifacts. Since they observe induced differentiation of fibroblast-like cells in 3D culture, it would be compelling to reconstruct differentiation trajectories (pseudotime analysis) from progenitor states to papillary/reticular/pericyte-like states from their scRNAseq data. Although AA enhanced collagen production and elasticity in the vascularized EDV model, the lack of response in the ED model is not addressed mechanistically. The omission of immune cells which are key players in skin aging and homeostasis could increase physiological relevance of the model. The exclusive use of standard HUVECs may not fully capture the behavior of tissue-specific microvascular endothelial cells, potentially limiting the fidelity of the vascular niche.

      Significance

      This study presents a robust and innovative approach to human skin equivalent (HSE) reconstruction by integrating pericyte-like and endothelial cells with dermal fibroblast subtypes, using only commercially available cell types. A key strength lies in its ability to recapitulate aspects of in vivo fibroblast heterogeneity, including papillary, reticular, and perivascular populations, and to demonstrate functional consequences on tissue architecture, barrier integrity, ECM dynamics, and mechanical properties under aging-like, nutrient-poor conditions. The spontaneous emergence of a pericyte-like population without relying on freshly isolated primary pericytes or complex sorting protocols represents a methodological advance that increases the model's accessibility and scalability. Furthermore, the use of ascorbic acid to reverse aging-associated features in a vascular cell-dependent manner adds a compelling functional dimension, linking cell composition with therapeutic response.

      Compared to existing models that either lack vascular cell compartments or do not account for dermal fibroblast heterogeneity, this study fills an important gap at the intersection of skin aging, vascular biology, and mesenchymal-epithelial interactions. The advance is both conceptual by elucidating the role of vascular and perivascular cells in shaping fibroblast identity and function and methodological, through the generation of a human skin model that approximates in vivo complexity without requiring animal models or ethically limited human tissue. The work will be of strong interest to basic science researchers in dermatology, tissue engineering, and aging, and has potential influence in regenerative medicine, cosmetic science, and drug screening, especially in the context of skin repair and anti-aging therapies. The audience is broad but most relevant to specialized communities in skin biology, mesenchymal cell biology, vascular biology, and organoid modeling, and may also attract attention from those developing non-animal testing platforms in applied and translational settings.

      As a reviewer with expertise in inflammatory skin disease modeling using both animal systems and 3D organoid cultures, I bring a critical understanding of how cellular composition, microenvironmental cues, and co-culture conditions influence skin physiology and pathology. My interest in developing advanced co-culture systems to recapitulate human skin complexity positions me well to evaluate the relevance, innovation, and translational potential of this vascularized HSE model. I am especially qualified to assess the biological fidelity of the reconstructed skin architecture, the functional outcomes of introducing pericyte-like populations, and the implications of nutrient deprivation and ascorbic acid supplementation as aging-relevant perturbations.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Kimura et al investigates the role of different cell populations in the development of human skin equivalents (HSEs). The observe that the addition of vascular endothelial cells to HSEs improves epidermal differentiation and barrier function, alongside differentiation of fibroblasts into papillary, reticular, and pericyte like mesenchymal cells. The authors also use single-cell transcriptomics to characterise the gene signatures and putative signalling pathway in the fibroblasts. Finally, the authors use nutrient poor medium and ascorbic acid to modulate HSE develop.

      One of the most significant questions arising from the findings is how the presence of vasculature can induce differentiation of fibroblasts from a common population, especially given that previous studies have shown that fibroblast identity is programmed during development. Some specific comments and suggestions for improving the manuscript are listed below.

      Major points:

      1. The introduction describes the effects of different environmental cues and aging on fibroblast phenotype, but it would be good to note the developmental origins of dermal fibroblasts, which specifies their fate and function (Driskell et al, Nature 2013).
      2. In Fig 2, how do TEWL measurements compare to constructs without an epidermal layer or human skin? It may seem obvious that barrier function would be negligible in these models, but it would be a helpful negative control for interpreting the relative effects of vasculature on barrier function.
      3. The mechanical measurements in Fig 2 are a nice idea, but it is a bit difficult to interpret without comparison to other conditions (e.g. human skin) or by reporting more universal mechanical parameters (e.g. Young's modulus).
      4. The induction of region-specific fibroblast markers is interesting and a bit unexpected since all the fibroblasts came from the same source before seeding into HSEs. The conclusions require additional support from quantification of the IF staining in Fig 3.
      5. Likewise, could the authors clarify whether the cells were passaged before seeding into the HSE, and if so, what passage number. Could passaging affect the responses observed? Please add a discussion point about this.
      6. The scRNA-seq suggests that the in vitro populations do not discriminate between secretory papillary and pro-inflammatory fibroblasts. Could the authors add some further analysis or discussion regarding this point?
      7. In Fig 6, it will be important to add quantification of epidermal thickness and differentiation marker expression to support the conclusions.
      8. A key question is how NP and AA conditions affect the fibroblast populations as this seems to be a key factor in HSE maturation and would then link back to the previous sections. It would be good to stain for fibroblast markers in these samples.
      9. As noted above, the ability of the vasculature to direct differentiation of a common fibroblast population into different phenotypes is one of the key findings of the study. To strengthen these observations, could additional analysis of the transcriptional data be possible. For example, would trajectory analysis potentially show how the different populations are evolving or related? In addition, could the CellChat analysis be performed between the vasculature and the different populations in Fig 5, which are mapped to in vivo populations? This might be a more relevant analysis than the populations in Fig 4.

      Minor points:

      1. The abstract states that enabling in vitro evaluation of drug efficacy using methodologies that are identical to those used in human clinical studies. This seems to be an over interpretation of the study and not well supported by the data. Please consider revising or removing.
      2. Check referencing formatting in lines 118-121

      Significance

      Overall, the study represents a systematic analysis of how vasculature contributes to skin model development, and the impact on fibroblast differentiation is an interesting observation. It would have been more impactful if some of the pathways and genes were followed up with mechanistic studies, but the findings are still useful to the field. Likewise, further insight into exactly how the vasculature regulates fibroblast phenotype would add to the impact as this is an unexpected but important finding.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):*

      As stated by the authors in the introduction, the RNA-binding protein Sxl is foundational to understanding sex determination in Drosophila. Sxl has been extensively studied as the master regulator of female sex determination in the soma, where it is known to initiate an alternative splicing cascade leading to the expression of DsxF. Additionally, Sxl has been shown to be responsible for keeping X chromosome dosage compensation off in females, while males hyperactivate their X chromosome. While these roles have been well defined, the authors explore an aspect of Sxl that is quite separate from its role as master regulator of female fate. They describe Sxl-RAC, a Sxl isoform that is expressed in the male and female nervous system. Using several genomic techniques, the authors conclude that the Sxl-RAC isoform associates with chromatin in a similar pattern to the RNA polymerase II/III subunit, Polr3E, and Sxl depends on Polr3E for chromatin-association. Further, neuronal loss of Sxl causes changes in lifetime and geotaxis in a similar manner as loss of Polr3E. The work is thorough and significant and should be appropriate for publication if a few issues can be addressed.

      Major Concerns:*

      * 1) How physiological is the Sxl chromatin-association assay? As binding interactions are concentration-dependent, how similar is Sxl-DAM expression to wt Sxl expression in neurons? In addition, does the Sxl-DAM protein function as a wt Sxl protein? Does UAS-Sxl-DAM rescue any Sxl loss phenotypes?*

      Author response:

      As Reviewer 3 correctly notes, Targeted DamID relies on ribosomal re-initiation (codon slippage) to produce only trace amounts of the Dam-fusion protein. By design, this results in expression levels that are significantly lower than those of the endogenous protein. As such, the experiment can be interpreted within a near–wild-type context, rather than as an overexpression model. The primary aim of this experiment was to determine whether Sxl associates with chromatin, and our dataset provides clear evidence supporting such binding.

      2) Is Polr3E chromatin-association also dependent on Sxl? They should do the reciprocal experiment to their examination of Sxl chromatin-association in Polr3E knockdown. This might also help address point 1-if wt Sxl is normally required for aspects of Polr3E chromatin binding, then concerns about whether the Sxl-DAM chromatin-association is real or artifactual would be assuaged.

      Author response:

      This is an interesting thought, however, if Sxl were required for Polr3E recruitment to RNA Pol III, then, in most male Drosophila melanogaster cells, Polr3E would not be incorporated, and males would not be viable (as it is essential for Pol III activity). While it is possible that there could be a subtle effect on Polr3E recruitment, such an experiment, would not alter the central conclusion of our study - that Sxl is recruited to chromatin (accessory to the Pol III complex) via Polr3E.

      Minor concerns:

      * The observed Sxl loss of function phenotypes are somewhat subtle (although perhaps any behavior phenotype at all is a plus). Did they try any other behaviour assays-courtship, learning/memory, anything else at all to test nervous system function?*


      Author response:

      Given the exploratory nature of this study, we focused on broader behavioural and transcriptional assays.

      While well written, it is sometimes difficult to understand how the experiment was performed or what genotypes were used without looking into the methods sections. One example is they should describe the nature of the Sxl-DAM fusion protein clearly in the results.

      Author response:

      We will revise these sections to improve clarity and ensure there is no confusion.

      * Reviewer #1 (Significance (Required)):

      This manuscript represents a dramatic change in our thinking about the action of the Sex-lethal protein. Previously, Sxl was known as the master regulator of both sex determination and dosage compensation, and performed these roles as an RNA-binding protein affecting RNA splicing and translational regulation. Here, the authors describe a sex-non-specific role of Sxl in the male and female nervous system. Further, this activity appears independent of Sxl's RNA binding activity and instead Sxl functions as a chromatin-associating protein working with the RNA pol2/3 factor Polr3E to regulate gene expression. Thus, this represents a highly significant finding. *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):*

      Summary: In this paper, the authors report on an unexpected activity for Sex lethal (Sxl) (a known splicing regulator that functions in sex determination and dosage compensation) in binding to chromatin. They show, using DamID, that Sxl binds to approximately the same chromatin regions as Polr3E (a subunit of RNA Pol III). They show that this binding to chromatin is unaffected by mutations in the RNA binding domains or by deletions of either N or C terminal regions of the Sxl protein. This leads the authors to conclude that Sxl must bind to chromatin through some interacting protein working through the central region of the Sxl protein. They show that Sxl binding is dependent on Polr3E function. They show that male-specific neuronal knockdown of Sxl gives similar phenotypes to knockdown of Polr3E in terms of lethality and improved negative geotaxis. They show gene expression changes with knockdown of Sxl in male adult neurons - mainly that metabolic and pigmentation genes go down in expression. They also show that expression of a previously discovered male adult specific form of Sxl (that does not have splicing activity) in the same neurons also leads to changes in gene expression, including more upregulated than downregulated tRNAs. But they don't see (or don't show) that the same tRNA genes are down with knockdown of Sxl. Nonetheless, based on these findings, they suggest that Sxl plays an important role in regulating Pol III activity through the Polr3E subunit.

      Major comments:

      *

      *To be honest, I'm not convinced that the conclusions drawn from this study are correct. The fact that every mutant form of Sxl shows the same result from the DamID labelling is a little concerning. I would like to see independent evidence of the SxlRac protein binding chromatin. *

      Do antibodies against this form (or any form) of Sxl bind chromatin in salivary gland polytene chromosomes, for example? Does Sxl from other insects where Sxl has no role in sex determination bind chromatin?


      __Author Response: __

      Regarding the reviewer’s overall concerns about the legitimacy of the Sxl binding data:

      1. i) The fold differences between Dam-Sxl-mutants and the Dam-only control are very robust (up to 9 log2 fold change (500-fold change)), which is higher than what we observe with most transcription factors using Targeted DamID.
      2. ii) We observed that Sxl binding was significantly reduced upon knockdown of Polr3E, confirming that the signal we observe is biologically specific and not due to technical noise or background. iii) If the concern relates to potential Sxl binding in non-neuronal tissues such as salivary glands, we would like to clarify that all DamID constructs were expressed under elav-GAL4, a pan-neuronal driver. Furthermore, dissections were performed to isolate larval brains, with salivary glands carefully removed. This ensures that chromatin profiles were derived from neuronal tissue exclusively.

      3. iv) Salivary gland polytene chromosome staining with a Sxl antibody in a closely related species (Drosophila virilis) show __binding of Sxl to chromatin __in both sexes (Bopp et al., 1996). We will include more text in the revised manuscript to emphasise these points.

      Do antibodies against this form (or any form) of Sxl bind chromatin in salivary gland polytene chromosomes, for example? Does Sxl from other insects where Sxl has no role in sex determination bind chromatin?

      Author Response:

      Prior work in Drosophila virilis (where Sxl is also required for sex determination and Sxl-RAC is conserved) has already demonstrated Sxl-chromatin association (using a full-length Sxl antibody) in salivary glands using polytene chromosome spreads (Bopp et al., 1996). Binding is observed in both sexes and across the genome, reflecting our observations. We will incorporate this into the revised discussion to support the chromatin-binding role of Sxl across species.

      There is a clear and long-overlooked precedent for Sxl's alternative, sex-independent roles, findings that have been largely overshadowed by the gene’s canonical function. Our study not only validates and extends these observations but also brings much-needed attention to this understudied aspect of fundamental biology.

      Bopp D, Calhoun G, Horabin JI, Samuels M, Schedl P. Sex-specific control of Sex-lethal is a conserved mechanism for sex determination in the genus Drosophila. Development. 1996 Mar;122(3):971-82. doi: 10.1242/dev.122.3.971. PMID: 8631274.

      I would like to see independent evidence of the SxlRac protein binding chromatin.

      * *__Author Response: __

      We do not believe this is necessary:

      1. i) Our data demonstrated that a large N-terminal truncation of Sxl (removing far more of the N-terminal region than is absent in Sxl-RAC) does not impair chromatin binding.
      2. ii) Our deletion experiments show that it is the central domain __of Sxl that is required for chromatin association (as removal of the N-or C-terminal domain has no effect). This central domain is __unaffected in Sxl-RAC. iii) Independent Y2H experiments have shown that it is exclusively the__ RBD-1 __(RNA binding domain 1) of the central domain of Sxl that interacts with Polr3E (Dong et al., 1999). Sxl-RAC contains this region, therefore will be recruited by Polr3E.

      iv) Review 3 also believes that this is not necessary (see cross-review below) and highlights the robustness of the Y2H experiments performed by Dong et al., 1999.

      • *

      Also, given that their DamID experiments reveal that Sxl binds half of the genes encoded in the Drosophila genome, finding that it binds around half of the tRNA genes is perhaps not surprising.


      __Author Response: __

      Our data show that Sxl binds to a range of Pol III-transcribed loci, and this binding pattern supports the proposed model that Sxl plays a broader regulatory role in Pol III activity. Within these Pol III targets, tRNA genes represent a specific and biologically relevant subset. The emphasis on tRNAs is not to suggest they are the exclusive or primary targets of Sxl, but rather to__ highlight a functionally important class of Pol III-transcribed elements__ that align with the model we are proposing. We will revise the text to better reflect this framing and avoid any confusion regarding the scope of Sxl’s binding profile.

      *I would like to see evidence beyond citing a 1999 yeast two-hybrid study that Sxl and Polr3E directly interact with one another. *


      Author response:

      We do not believe this is necessary (these points were also mentioned above):

      1. i) The Dong et al., 1999 study was highly comprehensive in its characterisation of Sxl binding to Polr3E.
      2. ii) Our DamID data provide strong complementary evidence for this interaction: knockdown of Polr3E robustly reduces Sxl’s recruitment to chromatin, strongly supporting the relevance of the interaction in vivo. iii) Review 3 highlights the robustness of the Y2H experiments performed by Dong et al., 1999.

      In my opinion, the differences in lethality observed with loss of Sxl versus control are unlikely to be meaningful given the different genetic backgrounds. The similar defects in negative geotaxis could be meaningful, but I'm unsure how often this phenotype is observed. What other class of genes affect negative geotaxis? It's a little unclear why having reduced expression of metabolic and pigment genes or of tRNAs would improve neuronal function.


      Author response:

      While the differences in survival were indeed subtle, they were statistically significant and thus warranted inclusion. Our primary aim in this section was to demonstrate that knockdown of Sxl or Polr3E results in comparable behavioural and transcriptional phenotypes, suggesting overlapping functional roles. In this context, we believe the data were presented transparently and effectively support our interpretation.

      Regarding the negative geotaxis phenotype, we appreciate the reviewer’s interest and agree that it is both intriguing and atypical. For this reason, we performed the assay multiple times, particularly in Polr3e knockdowns, to confirm the robustness of the result. To address potential confounding variables, we carefully selected control lines that account for genetic background and transgene insertion site, including KK controls and attP40-matched lines. We also employed multiple independent RNAi lines targeting Sxl to validate the phenotype across different genetic backgrounds.

      Although the observed improvement in climbing is unexpected, it is not without precedent in the RNA polymerase III field. Notably, Malik et al. (2024) demonstrated that heterozygous Polr3DEY/+ mutants exhibit a significantly delayed decline in climbing ability with age. We allude to this in the discussion and will revise the text to emphasise this connection more explicitly.

      Finally, while we recognise that negative geotaxis is a relatively broad assay and thus does not pinpoint the precise cellular mechanisms involved, we interpret the phenotype as suggesting a neural basis and a functional role for Sxl in the nervous system.

      One would expect that not just the same classes of genes would be affected by loss and overexpression of Sxl, but the same genes would be affected - are the same genes changing in opposite directions in the two experiments or just the same classes of genes. Likewise, are the same genes changing expression in the same direction with both Sxl and the Polr3E loss? Also, why are tRNA genes not also affected with Sxl loss. Finally, they describe the changes in gene expression as being in male adult neurons, but the sequencing was done of entire heads - so no way of knowing which cell type is showing differential gene expression.

      Author response:

      While we do examine gene classes, our approach also includes pairwise correlation analyses of gene expression changes between specific genotypes. Notably, we observed a significant positive correlation between Polr3e knockdowns and Sxl knockdowns, and a significant negative correlation between Sxl-RAC–expressing flies and Sxl knockdowns. Furthermore, we examined Sxl-DamID target genes within our RNA-seq datasets and found a consistent relationship between Sxl targets and genes differentially expressed in Polr3e knockdowns.

      Regarding the Pol III qPCR results, we note that tRNA expression changes may require a longer duration of RNAi induction (e.g., beyond 4 days) to become apparent, especially given that phenotypic effects such as changes in lifespan and negative geotaxis only emerge after 20 days or more. It is also plausible that Sxl knockdown leads to a partial reduction in Pol III efficiency, which may not be readily detectable through bulk Pol III qPCRs. We are willing to repeat Pol III qPCRs at later timepoints to further investigate this trend.

      Finally, we infer that gene expression changes observed in our RNA-seq data are of neuronal origin, as all knockdown and overexpression constructs used in this study were driven pan-neuronally using elav-/nSyb-GAL4. While we acknowledge that bulk RNA-seq does not provide cell-type resolution, tissue-specific assumptions are widely used in the field when driven by a relevant promoter.

      I'm also not sure what I'm supposed to be seeing in panel 5F (or in the related supplemental figure) and if it has any meaning - If they are using the Sxl-T2A-Gal4 to drive mCherry, I think one would expect to see expression since Sxl transcripts are made in both males and in females. Also, one would expect to see active protein expression (OPP staining) in most cells of the adult male brain and I think that is what is observed, but again, I'm not sure what I'm supposed to be looking at given the absence of any arrows or brackets in the figures.

      Author Response:

      Due to the presence of the T2A tag and the premature stop codon in exon 3 of early male Sxl transcripts, GAL4 expression is not expected in males unless the head-specific SxlRAC isoform is produced. The aim of panel 5F is to demonstrate the spatial overlap between SxlRAC expression (as we are examining male brains) and regions of elevated protein synthesis, as detected by OPP staining.

      To quantitatively assess this relationship, we performed colocalisation analysis using ImageJ, which showed a positive correlation between Sxl and OPP signal intensity, supporting this interpretation. It is also evident from our images that regions with lower levels of protein synthesis (such as the neuropil - as shown in independent studies Villalobos-Cantor et al., 2023) concurrently lack Sxl-related signal. We have highlighted regions in Fig. 5 exhibiting higher/lower levels of Sxl/OPP signal to better illustrate this relationship. We can also test the effects of knockdown/overexpression on general protein synthesis if required.

      Villalobos-Cantor S, Barrett RM, Condon AF, Arreola-Bustos A, Rodriguez KM, Cohen MS, Martin I. Rapid cell type-specific nascent proteome labeling in Drosophila. Elife. 2023 Apr 24;12:e83545. doi: 10.7554/eLife.83545. PMID: 37092974; PMCID: PMC10125018.

      Minor comments:

      * Line 223 - 225 - I believe that it is expected that Sxl transcripts would be broadly expressed in the male and female adult, given that it is only the spliced form of the transcript that is female specific in expression. *

      As explained above, the only isoform that will be ‘trapped’ by the T2A-GAL4 in males is the Sxl-RAC isoform (as the other isoforms contain premature stop codons). Our immunohistochemistry data indicate that Sxl-RAC is expressed in the male brain, specifically in neurons. Therefore, knockdown experiments in males will reduce all mRNA isoforms, of which, Sxl-RAC is the only one producing a protein.

      Line 236 - 238 - Sentence doesn't make sense.

      We have addressed and clarified this.

      Reviewer #2 (Significance (Required)):

      It would be significant to discover that a gene previously thought to function in only sex determination and dosage compensation also moonlights as a regulator of RNA polymerase III activity. Unfortunately, I am not convinced by the work presented in this study that this is the case.

      My expertise is in Drosophila biology, including development, transcription, sex determination, morphogenesis, genomics, transcriptomics, DNA binding

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):*

      Storer, McClure and colleagues use genome-wide DNA-protein binding assays, transcriptomics, and genetics to work out that Drosophila Sxl, widely known as an RNA-binding protein which functions as a splicing factor to determine sex identity in Drosophila and related species, is also a chromatin factor that can stimulate transcription by Pol III and Pol II of genes involved with metabolism and protein homeostasis, specifically some encoding tRNAs.

      The evidence for the tenet of the paper -- that Sxl acts as a chromatin regulator with Polr3E, activating at least some of its targets with either Pol III or Pol II -- is logical and compelling, the paper is well written and the figures well presented. Of course, more experiments could always be wished for and proposed, but I think this manuscript could be published in many journals with just a minor revision not involving additional experiments. I have a few specific comments below, all minor.*

      Scientific points: - The approach taken for the evaluation of Sxl DNA-binding activity in Fig2 is not entirely clear. I assume these are crosses of elav-Gal4 x different UAS- lines, then using males or females for UAS-Sxl-Full-Length. But what about the others? Were the experiments done in males only? This is hinted at in the main text but not explicitly indicated in the figure or the methods (at least, that I could easily find). And is this approach extended to all other experiments? Longevity? Climbing assays? Considering the role of Sxl, it may be helpful to be fastidiously systematic with this.


      Author Response:

      We have revised the wording to ensure greater clarity. Males were used for all survival and behavioural experiments (as only males can be leveraged for knocking down Sxl-RAC without affecting the canonical Sxl-F isoform).

      - In the discussion, lines 360-61, the authors say: Indeed, knockdown of Polr3E leads to a loss of Sxl binding to chromatin, suggesting a cooperative mechanism. Maybe I am misunderstanding the authors, but when I read "cooperation" in this context I think of biochemical cooperative binding. This is possible, but I do not think a simple 'requirement' test can suggest specifically that this mechanistic feature of biochemical binding is at play. I would expect, for starters, a reciprocal requirement for binding (which is not tested), and some quantitative features that would be difficult to evaluate in vivo. I do not think cooperative binding needs to be invoked anyway, as the authors do not make any specific point or prediction about it. But if they do think this is going on, I think it would need to be referred to as a speculation.


      Author Response:

      We appreciate that the original wording may have been unclear and will revise the text to more accurately reflect a functional relationship, rather than implying direct cooperation.

      - In lines 428-432, the authors discuss the ancestral role of Sxl and make a comparison with ELAV, in the context of an RNA-binding protein that has molecular functions beyond those of a splicing factor, considering the functions of ELAV in RNA stability and translation, and finishing with "suggesting that similar regulatory mechanisms may be at play". I do not understand this latter sentence. Which mechanisms are these? Are the authors referring to the molecular activities of ELAV and SXL? But what would be the similarity? SXL seems to have a dual capacity to bind RNA and protein interactors, which allows it to work both in chromatin-level regulation as well as post-transcriptionally in splicing; but ELAV seems rather to take advantage of its RNA binding function to make it work in multiple RNA-related contexts, all post-transcriptional. I do not see an obvious parallel beyond the fact that RNA binding proteins can function at different levels of gene expression regulation -- but I would not say this parallel are "similar regulatory mechanisms", so I find the whole comparison a bit confusing.


      Author Response:

      We have reduced this section, as it is largely speculative and intended to highlight potential, though indirect, links in higher organisms. Our goal was primarily to illustrate the possibility that Sxl may have an ancestral role distinct from its well-characterised function, and to suggest a potential avenue for future research into ELAV2’s involvement in chromatin or Pol III regulation.

      - One aspect of the work that I find is missing in the discussion is the possibility that the simultaneous capacity of Sxl for RNA binding and Polr3E binding: are these mutually exclusive? if so, are they competitive or hierarchical? how would they be coordinated anyway?


      Author Response:

      This is an interesting point, and we have expanded on it further in the Discussion section.

      - The only aspect of the paper where I found that one could make an experimental improvement is the claim that Sxl induces the expression of genes that have the overall effect of stimulating protein synthesis. The OPP experiment shows a correlation between the expression of Sxl and the rate of protein synthesis initiation. However, a more powerful experiment would be, rather obviously, to introduce Sxl knock-down in the same experiment, and observe whether in Sxl-expressing neurons the incorporation of OPP is reduced. I put this forth as a minor point because the tenet of the paper would not be affected by the results (though the perception of importance of the newly described function could be reinforced).

      • *

      Author Response:

      This could be a valid experiment and we are prepared to perform it if required.

      - In a similar way, it would be interesting to know whether the recruitment of Polr3E and Sxl to chromatin is co-dependent or Sxl follows Polr3E. This is also a minor point because this would possibly refine the mechanism of recruitment but does not alter the main discovery.

      Author Response:

      We have addressed a similar point for Reviewer 2 (see below) and will include a Discussion point for this:

      If Sxl were required for Polr3E recruitment to RNA Pol III, then, in most male Drosophila melanogaster cells, Polr3E would not be incorporated, and males would not be viable (as it is essential for Pol III activity). While it is possible that there could be a subtle effect on Polr3E recruitment, such an experiment, would not alter the central conclusion of our study - that Sxl is recruited to chromatin (accessory to the Pol III complex) via Polr3E.

      * Figures and reporting:

      • In Figure 2, it would be helpful to see the truncation coordinate for the N and C truncations.

      • In Figure 3D, genomic coordinates are missing.

      • In Figure 3E, the magnitude in the Y axis is not entirely clear (at least not to me). How is the amount of binding across the genome quantified? Is this the average amplitude of normalised TaDa signal across the genome? Or only within binding intervals?

      • Figure S3E-F: it would be interesting to show the degree of overlap between the downregulated genes that are also binding targets (regardless of the outcome).

      • Figure 5C-E: similarly to Figure S3, it would be interesting to know how the transcriptional effects compare with the binding targets.

      • Authors use Gehan-Breslow-Wilcoxon to test survival, which is a bit unusual, as it gives more weight to the early deaths (which are rare in most Drosophila longevity experiments). Is there any rationale behind this? It may be even favour their null hypothesis.*


      Author response:

      Thank you for the detailed feedback on our figures. We have__ incorporated__ the suggested changes.

      We agree that examining the overlap between Sxl binding sites and transcriptional changes is valuable, and we aimed to highlight this in the pie charts shown in Figures S3 and S5. If the reviewer is suggesting a more explicit quantification of the proportion of Sxl-Dam targets with significant transcriptomic changes, we are happy to include this analysis in the final version of the manuscript.

      As noted in the Methods, both Gehan–Breslow–Wilcoxon (GBW) and Kaplan–Meier tests were used. The significance in Figure 4a is specific to the GBW test, which we indicated by describing the effect as mild. Our focus here is not on the magnitude of survival differences, but on the consistent trends observed in both Polr3e and Sxl knockdowns.

      Writing and language:*

      • Introduction finishes without providing an outline of the findings (which is fine by me if that is what the authors wanted).

      • In lines 361-5, the authors say "We speculate that this interaction not only facilitates Pol III transcription but may also influence chromatin architecture and RNA Pol II-driven transcription as observed with Pol III regulation in other organisms". "This interaction" refers to Polr3E-Sxl-DNA interaction and with "Pol III transcription" I presume the authors refer to transcription executed by Pol III. I am not clear about the meaning of the end of the sentence "as observed with Pol III regulation in other organisms". What is the observation, exactly? That Pol III modifies chromatin in Pol II regulated loci, or that Pol III interactors change chromatin architecture?

      • DPE abbreviation is not introduced (and only used once).

      • A few typos: Line 41 ...splicing of the Sxl[late] transcripts, which is [ARE?] constitutively transcribed (Keyes et al.,... Line 76 ...sexes but appears restricted to the nervous system [OF] male pupae and adults (Cline et Line 289 ...and S41). To assess any effect [ON]translational output, O-propargyl-puromycin (OPP)o Line 323 ...illustrating that the majority (72%) changes in tRNA levels [ARE] due to upregulation...hi Line 402 ...it was discovered [WE DISCOVERED] Line 792 ...Sxl across chromosomes X, 2 L/R, 3 L/R and 4. The y-axis represents the log[SYMBOL] ratio... This happens in other figure legends as well.*


      Author response:

      Thank you for the detailed feedback, we have clarified and incorporated the suggested changes.

      **Referee Cross-commenting***

      Reviewer 1 asks how physiological is the Sxl chromatin-association assay. I think the loss of association in Polr3E knock-down and the lack of association of other splicing factors goes a long way into answering this question. It is true that having positive binding data specifically for Sxl-RAC and negative binding data for a deletion mutant of the RMM domain would provide more robust conclusions (see below), but I am not sure it is completely necessary -- though this will depend on which journal the authors want to send the paper to.

      I think that the comment of reviewer 1 about the levels of expression of Sxl-DAM does not apply here because of the way TaDa works - it relies on codon slippage to produce minimal amounts of the DAM fusion protein, so by construction it will be expressed at much lower levels than the endogenous protein.

      Reviewer 1 also asks whether Polr3E chromatin-association is also dependent on Sxl, to round up the model and also as a way to address whether Sxl association to chromatin is real. While I agree with this on the former aim (this would be a nice-to-have), I think I disagree on the latter; there is no need for Polr3E recruitment to depend on Sxl for Sxl association to chromatin to be physiologically relevant. Polr3E is a peripheral component of Pol III and unlikely to depend on a factor of restricted expression like Sxl to interact with chromatin. The recruitment of Sxl could well be entirely 'hierarchical' and subject to Polr3E.

      Revewer 2 is concerned with the fact that every mutant form of Sxl shows the same result from the DamID labelling. I have to agree with this to a point. A deletion mutant of RMM domains would address this. Microscopy evidence in salivary glands would be nice, certainly, but the system may not lend itself to this particular interaction, which might be short-lived and/or weak. I do not immediately see the relevance of the chromatin binding capacity of non-Drosophilidae Sxl -- though it might indicate that the impact of the discovery is less likely to go beyond this group.

      Reviewer 2 does not find surprising that some tRNA genes (less than half) are regulated by Sxl. I think the value of that observation is just qualitative, as tRNAs are Pol III-produced transcripts, but their point is correct. A hypergeometric test could settle this.

      Reviewer 2 is concerned that the evidence of direct interaction between Sxl and Polr3E is a single 1999 two-hybrid study. But that paper contains also GST pull-downs that narrow down the specific domains that mediate binding, and perform the binding in competitive salt conditions. I think it is enough. The author team, I think, are not biochemists, so finding the right collaborators and performing these experiments would take time that I am not sure is warranted.

      Reviewer 2 is also concerned that the longevity assays may not be meaningful due to the difference in genetic backgrounds. This is a very reasonable concern (which I would extend to the climbing assays - any quantitative phenotype is sensitive to genetic background). However, I think the authors here may have already designed the experiment with this in mind - the controls express untargeted RNAi constructs, but I lose track of which one is control of which. This should be clarified in Methods.

      Other comments are in line, I think, with what I have pointed out and I generally agree with everything else that has been said.

      Reviewer #3 (Significance (Required)):

      Drosophila Sxl is widely known as an RNA-binding protein which functions as a splicing factor to determine sex identity in Drosophila and related species. It is a favourite example of how splicing factors and alternative can have profound influence in biology and used cleverly in the molecular circuitry of the cell to enact elegant regulatory decisions.

      In this work, Storer, McClure and colleagues use genome-wide DNA-protein binding assays, transcriptomics, and genetics to work out that Sxl is also a chromatin factor with an sex-independent, neuron-specific role in stimulating transcription by Pol III and Pol II, of genes involved with metabolism and protein homeostasis, including some encoding tRNAs.

      This opens a large number of interesting biological questions that range from biochemistry, gene regulation or neurobiology to evolution. How is the simultaneous capacity of binding RNA and chromatin (with the same protein domain, RRM) regulated/coordinated? How did this dual activity evolve and which one is the ancestral one? How many other RRM-containin RNA-binding proteins can also bind chromatin? How is Sxl recruited to chromatin to both Pol II and Pol III targets and are they functionally related? If so, how is the coordination of cellular functions activated through different RNA polymerases taking place and what is the role of Sxl in this? What are the functional consequences to neuronal biology? Does this affect similarly all Sxl-expressing neurons?

      The evidence for the central tenet of the paper -- that Sxl acts as a chromatin regulator with Polr3E, activating at least some of its targets with either Pol III or Pol II -- is logical and compelling, the paper is well written and the figures well presented. Of course, more experiments could always be wished for and proposed, but I think this manuscript could be published in many journals with just a minor revision not involving additional experiments.*

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      *The convincing analysis demonstrates a role for the Drosophila Sex determining gene sex lethal in controlling aspects of transcription in the nervous system independent of its role in splicing. Interaction with an RNA Pol III subunit mediating Sxl association with chromatin and similar knockdown phenotypes strongly support the role of Sxl in the regulation of neuronal metabolism. Given that Sxl is an evolutionary recent acquisition for sex determination, the study may reveal an ancestral role for Sxl.

      The conclusions are well justified by the datasets presented and I have no issues with the study or the interpretation. Throughout the work is well referenced, though perhaps the authors might take a look at Zhang et al (2014) (PMID: 24271947) for an interesting evolutionary perspective for the discussion.*

      Author Response:

      Thank you for the thoughtful suggestion. We will be sure to incorporate the findings from Zhang et al. regarding the evolution of the sex determination pathway.

      *I have some minor comments for clarification:

      There is no Figure 2b, should be labelled 2 or label TaDa plots as 2b

      Clarify if Fig 2 data are larval or adult *

      *Larval

      Fig 3d - are these replicates or female and male?

      Please elaborate on tub-GAL80[ts] developmental defects

      Fig 4e, are transcriptomics done with the VDRC RNAi line? The VDRC and BDSC RNAi lines exhibit different behaviours - former has "better" survival and Better negative geotaxis, the latter seems to have poorer survival but little geotaxis effect?*

      *Fig S3 - volcano plot for Polr3E?

      Fig S4a - legend says downregulated genes?

      The discussion should at least touch on the fact that Sxl amorphs (i.e. Sxl[fP7B0] are male viable and fertile, emphasising that the newly uncovered role is not essential.*

      Author Response:

      We agree with the suggestions outlined in the comments and have made the appropriate revisions.

      Reviewer #4 (Significance (Required)):*

      A nonessential role for Sxl in the nervous system independent of sex-determination contributes to better understanding a) the evolution of sex determining mechanisms, b) the role of RNA PolIII in neuronal homeostasis and c) more widely to the neuronal aging field. I think this well-focused study reveals a hitherto unsuspected role for Sxl.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The convincing analysis demonstrates a role for the Drosophila Sex determining gene sex lethal in controlling aspects of transcription in the nervous system independent of its role in splicing. Interaction with an RNA Pol III subunit mediating Sxl association with chromatin and similar knockdown phenotypes strongly support the role of Sxl in the regulation of neuronal metabolism. Given that Sxl is an evolutionary recent acquisition for sex determination, the study may reveal an ancestral role for Sxl.

      The conclusions are well justified by the datasets presented and I have no issues with the study or the interpretation. Throughout the work is well referenced, though perhaps the authors might take a look at Zhang et al (2014) (PMID: 24271947) for an interesting evolutionary perspective for the discussion. I have some minor comments for clarification:

      There is no Figure 2b, should be labelled 2 or label TaDa plots as 2b

      Clarify if Fig 2 data are larval or adult

      Fig 3d - are these replicates or female and male?

      Please elaborate on tub-GAL80[ts] developmental defects

      Fig 4e, are transcriptomics done with the VDRC RNAi line? The VDRC and BDSC RNAi lines exhibit different behaviours - former has "better" survival and Better negative geotaxis, the latter seems to have poorer survival but little geotaxis effect?

      Fig S3 - volcano plot for Polr3E?

      Fig S4a - legend says downregulated genes?

      The discussion should at least touch on the fact that Sxl amorphs (i.e. Sxl[fP7B0] are male viable and fertile, emphasising that the newly uncovered role is not essential

      Significance

      A nonessential role for Sxl in the nervous system independent of sex-determination contributes to better understanding a) the evolution of sex determining mechanisms, b) the role of RNA PolIII in neuronal homeostasis and c) more widely to the neuronal aging field. I think this well-focused study reveals a hitherto unsuspected role for Sxl.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Storer, McClure and colleagues use genome-wide DNA-protein binding assays, transcriptomics, and genetics to work out that Drosophila Sxl, widely known as an RNA-binding protein which functions as a splicing factor to determine sex identity in Drosophila and related species, is also a chromatin factor that can stimulate transcription by Pol III and Pol II of genes involved with metabolism and protein homeostasis, specifically some encoding tRNAs.

      The evidence for the tenet of the paper -- that Sxl acts as a chromatin regulator with Polr3E, activating at least some of its targets with either Pol III or Pol II -- is logical and compelling, the paper is well written and the figures well presented. Of course, more experiments could always be wished for and proposed, but I think this manuscript could be published in many journals with just a minor revision not involving additional experiments. I have a few specific comments below, all minor.

      Scientific points:

      • The approach taken for the evaluation of Sxl DNA-binding activity in Fig2 is not entirely clear. I assume these are crosses of elav-Gal4 x different UAS- lines, then using males or females for UAS-Sxl-Full-Length. But what about the others? Were the experiments done in males only? This is hinted at in the main text but not explicitly indicated in the figure or the methods (at least, that I could easily find). And is this approach extended to all other experiments? Longevity? Climbing assays? Considering the role of Sxl, it may be helpful to be fastidiously systematic with this.
      • In the discussion, lines 360-61, the authors say: Indeed, knockdown of Polr3E leads to a loss of Sxl binding to chromatin, suggesting a cooperative mechanism. Maybe I am misunderstanding the authors, but when I read "cooperation" in this context I think of biochemical cooperative binding. This is possible, but I do not think a simple 'requirement' test can suggest specifically that this mechanistic feature of biochemical binding is at play. I would expect, for starters, a reciprocal requirement for binding (which is not tested), and some quantitative features that would be difficult to evaluate in vivo. I do not think cooperative binding needs to be invoked anyway, as the authors do not make any specific point or prediction about it. But if they do think this is going on, I think it would need to be referred to as a speculation.
      • In lines 428-432, the authors discuss the ancestral role of Sxl and make a comparison with ELAV, in the context of an RNA-binding protein that has molecular functions beyond those of a splicing factor, considering the functions of ELAV in RNA stability and translation, and finishing with "suggesting that similar regulatory mechanisms may be at play". I do not understand this latter sentence. Which mechanisms are these? Are the authors referring to the molecular activities of ELAV and SXL? But what would be the similarity? SXL seems to have a dual capacity to bind RNA and protein interactors, which allows it to work both in chromatin-level regulation as well as post-transcriptionally in splicing; but ELAV seems rather to take advantage of its RNA binding function to make it work in multiple RNA-related contexts, all post-transcriptional. I do not see an obvious parallel beyond the fact that RNA binding proteins can function at different levels of gene expression regulation -- but I would not say this parallel are "similar regulatory mechanisms", so I find the whole comparison a bit confusing.
      • One aspect of the work that I find is missing in the discussion is the possibility that the simultaneous capacity of Sxl for RNA binding and Polr3E binding: are these mutually exclusive? if so, are they competitive or hierarchical? how would they be coordinated anyway?
      • The only aspect of the paper where I found that one could make an experimental improvement is the claim that Sxl induces the expression of genes that have the overall effect of stimulating protein synthesis. The OPP experiment shows a correlation between the expression of Sxl and the rate of protein synthesis initiation. However, a more powerful experiment would be, rather obviously, to introduce Sxl knock-down in the same experiment, and observe whether in Sxl-expressing neurons the incorporation of OPP is reduced. I put this forth as a minor point because the tenet of the paper would not be affected by the results (though the perception of importance of the newly described function could be reinforced).
      • In a similar way, it would be interesting to know whether the recruitment of Polr3E and Sxl to chromatin is co-dependent or Sxl follows Polr3E. This is also a minor point because this would possibly refine the mechanism of recruitment but does not alter the main discovery.

      Figures and reporting:

      • In Figure 2, it would be helpful to see the truncation coordinate for the N and C truncations.
      • In Figure 3D, genomic coordinates are missing.
      • In Figure 3E, the magnitude in the Y axis is not entirely clear (at least not to me). How is the amount of binding across the genome quantified? Is this the average amplitude of normalised TaDa signal across the genome? Or only within binding intervals?
      • Figure S3E-F: it would be interesting to show the degree of overlap between the downregulated genes that are also binding targets (regardless of the outcome).
      • Figure 5C-E: similarly to Figure S3, it would be interesting to know how the transcriptional effects compare with the binding targets.
      • Authors use Gehan-Breslow-Wilcoxon to test survival, which is a bit unusual, as it gives more weight to the early deaths (which are rare in most Drosophila longevity experiments). Is there any rationale behind this? It may be even favour their null hypothesis.

      Writing and language:

      • Introduction finishes without providing an outline of the findings (which is fine by me if that is what the authors wanted).
      • In lines 361-5, the authors say "We speculate that this interaction not only facilitates Pol III transcription but may also influence chromatin architecture and RNA Pol II-driven transcription as observed with Pol III regulation in other organisms". "This interaction" refers to Polr3E-Sxl-DNA interaction and with "Pol III transcription" I presume the authors refer to transcription executed by Pol III. I am not clear about the meaning of the end of the sentence "as observed with Pol III regulation in other organisms". What is the observation, exactly? That Pol III modifies chromatin in Pol II regulated loci, or that Pol III interactors change chromatin architecture?
      • DPE abbreviation is not introduced (and only used once).
      • A few typos: Line 41 ...splicing of the Sxl[late] transcripts, which is [ARE?] constitutively transcribed (Keyes et al.,... Line 76 ...sexes but appears restricted to the nervous system [OF] male pupae and adults (Cline et Line 289 ...and S41). To assess any effect [ON]translational output, O-propargyl-puromycin (OPP)o Line 323 ...illustrating that the majority (72%) changes in tRNA levels [ARE] due to upregulation...hi Line 402 ...it was discovered [WE DISCOVERED] Line 792 ...Sxl across chromosomes X, 2 L/R, 3 L/R and 4. The y-axis represents the log[SYMBOL] ratio... This happens in other figure legends as well.

      Referee Cross-commenting

      Reviewer 1 asks how physiological is the Sxl chromatin-association assay. I think the loss of association in Polr3E knock-down and the lack of association of other splicing factors goes a long way into answering this question. It is true that having positive binding data specifically for Sxl-RAC and negative binding data for a deletion mutant of the RMM domain would provide more robust conclusions (see below), but I am not sure it is completely necessary -- though this will depend on which journal the authors want to send the paper to.

      I think that the comment of reviewer 1 about the levels of expression of Sxl-DAM does not apply here because of the way TaDa works - it relies on codon slippage to produce minimal amounts of the DAM fusion protein, so by construction it will be expressed at much lower levels than the endogenous protein.

      Reviewer 1 also asks whether Polr3E chromatin-association is also dependent on Sxl, to round up the model and also as a way to address whether Sxl association to chromatin is real. While I agree with this on the former aim (this would be a nice-to-have), I think I disagree on the latter; there is no need for Polr3E recruitment to depend on Sxl for Sxl association to chromatin to be physiologically relevant. Polr3E is a peripheral component of Pol III and unlikely to depend on a factor of restricted expression like Sxl to interact with chromatin. The recruitment of Sxl could well be entirely 'hierarchical' and subject to Polr3E.

      Revewer 2 is concerned with the fact that every mutant form of Sxl shows the same result from the DamID labelling. I have to agree with this to a point. A deletion mutant of RMM domains would address this. Microscopy evidence in salivary glands would be nice, certainly, but the system may not lend itself to this particular interaction, which might be short-lived and/or weak. I do not immediately see the relevance of the chromatin binding capacity of non-Drosophilidae Sxl -- though it might indicate that the impact of the discovery is less likely to go beyond this group.

      Reviewer 2 does not find surprising that some tRNA genes (less than half) are regulated by Sxl. I think the value of that observation is just qualitative, as tRNAs are Pol III-produced transcripts, but their point is correct. A hypergeometric test could settle this.

      Reviewer 2 is concerned that the evidence of direct interaction between Sxl and Polr3E is a single 1999 two-hybrid study. But that paper contains also GST pull-downs that narrow down the specific domains that mediate binding, and perform the binding in competitive salt conditions. I think it is enough. The author team, I think, are not biochemists, so finding the right collaborators and performing these experiments would take time that I am not sure is warranted.

      Reviewer 2 is also concerned that the longevity assays may not be meaningful due to the difference in genetic backgrounds. This is a very reasonable concern (which I would extend to the climbing assays - any quantitative phenotype is sensitive to genetic background). However I think the authors here may have already designed the experiment with this in mind - the controls expres untargeted RNAi constructs, but I lose track of which one is control of which. This should be clarified in Methods.

      Other comments are in line, I think, with what I have pointed out and I generally agree with everything else that has been said.

      Significance

      Drosophila Sxl is widely known as an RNA-binding protein which functions as a splicing factor to determine sex identity in Drosophila and related species. It is a favourite example of how splicing factors and alternative can have profound influence in biology and used cleverly in the molecular circuitry of the cell to enact elegant regulatory decisions.

      In this work, Storer, McClure and colleagues use genome-wide DNA-protein binding assays, transcriptomics, and genetics to work out that Sxl is also a chromatin factor with an sex-independent, neuron-specific role in stimulating transcription by Pol III and Pol II, of genes involved with metabolism and protein homeostasis, including some encoding tRNAs.

      This opens a large number of interesting biological questions that range from biochemistry, gene regulation or neurobiology to evolution. How is the simultaneous capacity of binding RNA and chromatin (with the same protein domain, RRM) regulated/coordinated? How did this dual activity evolve and which one is the ancestral one? How many other RRM-containin RNA-binding proteins can also bind chromatin? How is Sxl recruited to chromatin to both Pol II and Pol III targets and are they functionally related? If so, how is the coordination of cellular functions activated through different RNA polymerases taking place and what is the role of Sxl in this? What are the functional consequences to neuronal biology? Does this affect similarly all Sxl-expressing neurons?

      The evidence for the central tenet of the paper -- that Sxl acts as a chromatin regulator with Polr3E, activating at least some of its targets with either Pol III or Pol II -- is logical and compelling, the paper is well written and the figures well presented. Of course, more experiments could always be wished for and proposed, but I think this manuscript could be published in many journals with just a minor revision not involving additional experiments.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this paper, the authors report on an unexpected activity for Sex lethal (Sxl) (a known splicing regulator that functions in sex determination and dosage compensation) in binding to chromatin. They show, using DamID, that Sxl binds to approximately the same chromatin regions as Polr3E (a subunit of RNA Pol III). They show that this binding to chromatin is unaffected by mutations in the RNA binding domains or by deletions of either N or C terminal regions of the Sxl protein. This leads the authors to conclude that Sxl must bind to chromatin through some interacting protein working through the central region of the Sxl protein. They show that Sxl binding is dependent on Polr3E function. They show that male-specific neuronal knockdown of Sxl gives similar phenotypes to knockdown of Polr3E in terms of lethality and improved negative geotaxis. They show gene expression changes with knockdown of Sxl in male adult neurons - mainly that metabolic and pigmentation genes go down in expression. They also show that expression of a previously discovered male adult specific form of Sxl (that does not have splicing activity) in the same neurons also leads to changes in gene expression, including more upregulated than downregulated tRNAs. But they don't see (or don't show) that the same tRNA genes are down with knockdown of Sxl. Nonetheless, based on these findings, they suggest that Sxl plays an important role in regulating Pol III activity through the Polr3E subunit.

      Major comments:

      To be honest, I'm not convinced that the conclusions drawn from this study are correct. The fact that every mutant form of Sxl shows the same result from the DamID labelling is a little concerning. I would like to see independent evidence of the SxlRac protein binding chromatin. Do antibodies against this form (or any form) of Sxl bind chromatin in salivary gland polytene chromosomes, for example? Does Sxl from other insects where Sxl has no role in sex determination bind chromatin?

      Also, given that their DamID experiments reveal that Sxl binds half of the genes encoded in the Drosophila genome, finding that it binds around half of the tRNA genes is perhaps not surprising.

      I would like to see evidence beyond citing a 1999 yeast two-hybrid study that Sxl and Polr3E directly interact with one another. In my opinion, the differences in lethality observed with loss of Sxl versus control are unlikely to be meaningful given the different genetic backgrounds. The similar defects in negative geotaxis could be meaningful, but I'm unsure how often this phenotype is observed. What other class of genes affect negative geotaxis? It's a little unclear why having reduced expression of metabolic and pigment genes or of tRNAs would improve neuronal function.

      One would expect that not just the same classes of genes would be affected by loss and overexpression of Sxl, but the same genes would be affected - are the same genes changing in opposite directions in the two experiments or just the same classes of genes. Likewise, are the same genes changing expression in the same direction with both Sxl and the Polr3E loss? Also, why are tRNA genes not also affected with Sxl loss. Finally, they describe the changes in gene expression as being in male adult neurons, but the sequencing was done of entire heads - so no way of knowing which cell type is showing differential gene expression.

      I'm also not sure what I'm supposed to be seeing in panel 5F (or in the related supplemental figure) and if it has any meaning - If they are using the Sxl-T2A-Gal4 to drive mCherry, I think one would expect to see expression since Sxl transcripts are made in both males and in females. Also, one would expect to see active protein expression (OPP staining) in most cells of the adult male brain and I think that is what is observed, but again, I'm not sure what I'm supposed to be looking at given the absence of any arrows or brackets in the figures.

      Minor comments:

      Line 223 - 225 - I believe that it is expected that Sxl transcripts would be broadly expressed in the male and female adult, given that it is only the spliced form of the transcript that is female specific in expression.

      Line 236 - 238 - Sentence doesn't make sense.

      Significance

      It would be significant to discover that a gene previously thought to function in only sex determination and dosage compensation also moonlights as a regulator of RNA polymerase III activity. Unfortunately, I am not convinced by the work presented in this study that this is the case.

      My expertise is in Drosophila biology, including development, transcription, sex determination, morphogenesis, genomics, transcriptomics, DNA binding

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      As stated by the authors in the introduction, the RNA-binding protein Sxl is foundational to understanding sex determination in Drosophila. Sxl has been extensively studied as the master regulator of female sex determination in the soma, where it is known to initiate an alternative splicing cascade leading to the expression of DsxF. Additionally, Sxl has been shown to be responsible for keeping X chromosome dosage compensation off in females, while males hyperactivate their X chromosome. While these roles have been well defined, the authors explore an aspect of Sxl that is quite separate from its role as master regulator of female fate. They describe Sxl-RAC, a Sxl isoform that is expressed in the male and female nervous system. Using several genomic techniques, the authors conclude that the Sxl-RAC isoform associates with chromatin in a similar pattern to the RNA polymerase II/III subunit, Polr3E, and Sxl depends on Polr3E for chromatin-association. Further, neuronal loss of Sxl causes changes in lifetime and geotaxis in a similar manner as loss of Polr3E. The work is thorough and significant and should be appropriate for publication if a few issues can be addressed.

      Major Concerns

      1. How physiological is the Sxl chromatin-association assay? As binding interactions are concentration-dependent, how similar is Sxl-DAM expression to wt Sxl expression in neurons? In addition, does the Sxl-DAM protein function as a wt Sxl protein? Does UAS-Sxl-DAM rescue any Sxl loss phenotypes?
      2. Is Polr3E chromatin-association also dependent on Sxl? They should do the reciprocal experiment to their examination of Sxl chromatin-association in Polr3E knockdown. This might also help address point 1-if wt Sxl is normally required for aspects of Polr3E chromatin binding, then concerns about whether the Sxl-DAM chromatin-association is real or artifactual would be assuaged.

      Minor concerns

      The observed Sxl loss of function phenotypes are somewhat subtle (although perhaps any behavior phenotype at all is a plus). Did they try any other behaviour assays-courtship, learning/memory, anything else at all to test nervous system function?

      While well written, it is sometimes difficult to understand how the experiment was performed or what genotypes were used without looking into the methods sections. One example is they should describe the nature of the Sxl-DAM fusion protein clearly in the results.

      Significance

      This manuscript represents a dramatic change in our thinking about the action of the Sex-lethal protein. Previously, Sxl was known as the master regulator of both sex determination and dosage compensation, and performed these roles as an RNA-binding protein affecting RNA splicing and translational regulation. Here, the authors describe a sex-non-specific role of Sxl in the male and female nervous system. Further, this activity appears independent of Sxl's RNA binding activity and instead Sxl functions as a chromatin-associating protein working with the RNA pol2/3 factor Polr3E to regulate gene expression. Thus, this represents a highly significant finding.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      • This manuscript represents a full revision incorporating all reviewer recommendations; the additional follow-up experiments and expanded analyses will be presented in dedicated subsequent manuscripts.
      • Congenital dyserythropoietic anemia type I (CDA-I) is a rare hereditary disease characterized by ineffective erythropoiesis and mutations in Codanin1 and CDIN1.
      • Our study reveals the structural and functional dynamics of the CDIN1-Codanin1 complex, shedding light on the molecular mechanisms of protein-protein interactions implicated in CDA-I pathology.
      • The main goal of our study was to examine the interaction between CDIN1 and the C‑terminal binding domain of Codanin1 using complementary biophysical approaches.
      • We quantified binding and identified interacting regions of Codanin1 and CDIN1.
      • We found that CDA-I-associated mutations in interacting regions disturb CDIN1‑Codanin1 complex.
      • We proposed a hypothetical molecular model of CDIN1-Codanin1 role in CDA-I hallmarks development.
      • Our initial studies on BioRxiv (2023) have been cited by leading publications in the field (Jeong, Frater et al. 2025, Sedor and Shao 2025, Nature Communications) and prompted further research on this topic.

      2. Point-by-point description of the revisions

      *Here we provide a point-by-point reply describing the revisions already carried out and included in the transferred manuscript. *

      Reply to the reviewers

      Reviewer #1 – Evidence, reproducibility and clarity

      This is a rigorous biophysical characterization of a protein-protein interaction relevant to CDA-1 disease. The two proteins were purified in an E. coli host but CD and DLS was performed to ensure that the purified protein is well folded. An impressive native protein EMSA was used to show a 1:1 complex. While common for protein-nucleic acid complexes, EMSAs are much more challenging with protein complexes. A higher-running complex, likely a heterotetramer was implied at higher protein concentrations. These results were supported with SEC-MALS analysis and analytic ultracentrifugation analysis. Thermophoresis and ITC were used to report a nanomolar affinity of the proteins for each other. SEC-SAXS supported the conclusions about stoichiometry and composition inferred from the earlier methods and suggested that the dimerization interface comes from CDIN1. Next HDX-MS was used to identify putative interface residues, which were then mutated in each of the proteins and assessed for binding using coimmunoprecipitation. This study uses at least 10 orthogonal biophysical and/or biochemical methodologies to characterize an important protein-protein interaction and the analysis is clear and so is the writing. I couldn't (reading it once) find any grammatical or other errors in the text or figures. This manuscript is top-quality and suitable for publication.

      __Reviewer #1 – Significance __

      Such detailed structural and mechanistic studies are greatly lacking in many clinical conditions for which mutations are known (unless they cause cancer, neurodegenerative disease, and so on). We need more such studies on disease topics! This study will be of interest to the hematologic diseases community.

      1. Response – ____Significance

      We thank Reviewer #1 for the thoughtful and encouraging evaluation of our work. We are particularly grateful for recognizing the significance of studying protein-protein interaction in the context of CDA-I disease, as well as the rigor and clarity of our biophysical and biochemical characterization.

      We appreciate the reviewer's acknowledgment of the challenges associated with native protein EMSAs. We are pleased that our use of multiple orthogonal techniques was recognized as a strength of the study. We are gratified that the comprehensiveness and coherence of our data and the manuscript's clarity were well received.

      We thank the reviewer for noting the broader impact of our findings on the hematologic disease community. As highlighted, there is a pressing need for a mechanistic understanding of non-oncologic, non-neurodegenerative diseases, and our studies address this gap.

      We are honored by the reviewer's endorsement of our manuscript as "top-quality and suitable for publication". We value the reviewer's highly supportive and motivating feedback.

      __Reviewer #2 – 1. Evidence, reproducibility and clarity __

      This manuscript presents structural and biochemical characterization of the interaction between CDIN1 and the C-terminal domain of Codanin1, shedding light on a complex implicated in Congenital Dyserythropoietic Anemia Type I (CDA-I). While the authors provide valuable structural insights and identify disease-associated mutations that impair CDIN1-Codanin1 binding, I think several important concerns should be addressed to strengthen both the mechanistic claims and their functional relevance.

      Contradiction Between Stoichiometry Models:

      The authors propose that CDIN1 and Codanin1Cterm primarily form a heterodimer in vitro. However, this appears to contradict previous reports indicating a tetra-heteromeric arrangement. Additionally, while CDIN1 homodimerize seems confusing to me, do the authors suggest it is stable without Codanin1? This seems contrary to findings that CDIN1 is unstable in the absence of Codanin1 (Sedor, S.F., Shao, S. nature comm 2025, Swickley, G., Bloch, Y., Malka, L. et al 2020 BMC Mol and Cell Biol). These inconsistencies raise concerns about whether the observed stoichiometries are physiologically relevant or artifacts of in vitro reconstitution, especially since full-length Codanin1 was not studied.

      2.1 Response ____– Consistent stoichiometry of Codanin1Cterm

      We thank Reviewer #2 for raising critical points regarding the stoichiometry and physiological relevance of the CDIN1-Codanin1 interaction. The following response clarifies the rationale and interpretation in relation to previous findings.

      Stoichiometry of CDIN1-Codanin1Cterm complex:

      Recent Cryo-EM studies of full-length Codanin1 (Jeong, Frater et al. 2025, Sedor and Shao 2025) suggest independent internal dimerization domains (452-798 and 841-1000 amino acid residue) driving homodimer formation, with each Codanin1 monomer binding one CDIN1 via the C-terminal region (1005-1227 amino acid residue), resulting in a tetra-heteromeric complex. Therefore, the complete assembly appears as a dimer of heterodimers in the full-length context.

      In our study, Codanin1 was truncated to retain only the CDIN1-binding C-terminus (1005-1227 amino acid residues), eliminating the homodimerization ability of Codanin1. Hence, in the case of truncated Codanin1Cterm, the minimal complex we observe is a 1:1 heterodimer of CDIN1-Codanin1Cterm, which is fully consistent with the equimolar stoichiometry of CDIN1-Codanin1 complex seen in the full-length structure.

      Stability and oligomeric state of CDIN1 in the absence of Codanin1:

      We concur with the reviewer that Sedor et al. (2025) and Swickley et al. (2020) reported decreased CDIN1 levels in cells lacking Codanin1, implying in vivo dependence of CDIN1 on Codanin1 partner for stability (Swickley, Bloch et al. 2020, Sedor and Shao 2025). The purified CDIN1 is monodisperse (Supplementary Figure 2D), exhibits thermal stability with a melting temperature of 48 °C (Supplementary Figure 2E), and displays proper folding as indicated by CD measurements (Supplementary Figure 2B). Additionally, SAXS profiles of CDIN1 correspond to AlphaFold predictions (Fig. 2B). Together, our findings indicate that the recombinant CDIN1 forms a stable conformation in vitro without Codanin1. To the best of our knowledge, no previous research has directly identified the endogenous oligomeric states of CDIN1 within cellular content.

      We fully acknowledge that future analysis of the full-length Codanin1-CDIN1 assembly in a cellular context will be necessary for understanding physiological stoichiometries. As outlined in the General statements, our study focuses on the C-terminus of Codanin1 to describe the binding interface and complex biophysical properties of the CDIN-Codanin1Cterm complex.

      __Reviewer #2 – ____2. Unvalidated Functional Claims: __

      The manuscript identifies several CDA-I-associated mutations that disrupt CDIN1-Codanin1 interaction. However, the authors do not test how these mutations affect the biological function of the complex, particularly its role in ASF1 sequestration or histone trafficking. Given the central importance of this axis in their disease model, functional validation (e.g., ASF1 localization, histone deposition assays) is necessary to support these mechanistic conclusions.

      2.2 Response – ____Hypothetical model as discussion merit

      We thank the reviewer for the comment regarding the functional implications of CDA-I-associated mutations and their potential impact on ASF1 sequestration and histone trafficking hypothesized within the Discussion. We fully agree that understanding the downstream biological consequences of disrupted CDIN1-Codanin1 interaction is critical for elucidating the full molecular basis of CDA-I pathogenesis.

      In the Future research directions of the Discussion, we have acknowledged and emphasized the need for follow-up studies using erythroblast cell lines to determine whether specific disease-associated mutations disrupt CDIN1-Codanin1 binding, leading to functional defects relevant to erythropoiesis and nuclear architecture typical for CDA-I disease.

      However, as we respectfully note in General Statements, the main aim of the present study was to provide a rigorous biophysical characterization of the CDIN1-Codanin1Cterm interaction. Proposed cellular experiments, though relevant, are beyond the conceptual scope of the presented studies.

      Reviewer #2 – ____3. Speculative and Potentially Contradictory Model:

      The proposed model suggests that CDIN1 competes with ASF1 for Codanin1 binding, thereby indirectly promoting histone delivery to the nucleus. However, emerging data indicate that Codanin1, CDIN1, and ASF1 can form a stable ternary complex, calling into question this competitive binding hypothesis (Sedor, S.F., Shao, S. nature comm 2025). The authors do not acknowledge or discuss these findings, and the model in its current form may therefore be oversimplified or inaccurate.

      2.3 Response – ____Hypothetical model fully aligned with current knowledge

      We fully acknowledged and discussed in the current manuscript the recent findings demonstrating that Codanin1, CDIN1, and ASF1 can form a ternary complex (Sedor, S.F., Shao, S. Nature Comm. 2025; Jeong, T. K. et al. Nature Comm. 2025). Our revised model was updated accordingly to reflect the collaborative binding of Codanin1, CDIN1, and ASF1, and is presented in alignment with published data.

      While earlier versions of our work published on the BioRxiv server (May 26, 2023) proposed a competitive hypothesis, the current manuscript incorporates recent literature and prior reviewer feedback to offer a refined model. We believe that the updated hypothesis suggests a plausible mechanism for how CDIN1 modulates Codanin1 function, which will be further tested in future cellular studies.

      Reviewer #2 – 4. Significance:

      Overall, the study adds to our structural understanding of CDIN1 and Codanin1 interactions, but the functional interpretations are currently speculative, and in some cases in conflict with existing literature. The manuscript would benefit significantly from addressing these discrepancies, incorporating relevant data on ASF1, and clarifying whether the observed assemblies reflect physiological complexes.

      __2.4 Response – Significance __

      We thank Reviewer #2 for the constructive feedback. As noted in General Statements, our current manuscript is primarily dedicated to defining the molecular architecture and interactions of the CDIN1–Codanin1Cterm core interface. We agree that follow-up ASF1‑dependent functional assays will be critical to fully validate observed assemblies, but these experiments lie outside the scope of the present study and are ongoing in our laboratory.

      To address the reviewer's concern about possible speculative interpretation, we have:

      • Used cautious language in Results and Discussion to prevent overstatement (e.g., page 31, line 754, “leads” exchanged to “may contribute” in legend of Fig. 4).
      • Described in the Discussion how our results enhance and add understanding to the body of published structural data of CDIN1–Codanin1Cterm.
      • Updated our hypothetical model in Fig. 4 to be fully in line with published data.
      • Clearly stated that the working hypothesis is connected with a subset of CDA-I mutations (p. 31, l. 758-759, “The proposed model represents a working hypothesis relating to a subset of CDA-I mutations and is not currently substantiated by experimental evidence at the cellular level.”)
      • Stated in Future research directions of Discussion that functional validation, including ASF1, will motivate future critical studies, p. 32, l. 771-773: “The ability of Codanin1 to interact with both CDIN1 and ASF1 motivates further investigation of how CDIN1 and ASF1 affect the function of full-length Codanin1, which even recent cryo-EM data has not addressed yet.”
      • Highlighted the necessity of complementary in vivo studies in erythroblast cell lines to determine if CDA-I-related mutations in CDIN1-Codanin1 interaction region cause typical CDA-I phenotypes, aiming to clarify the molecular mechanisms of inherited CDA-I anemia. We state in Future research directions in Discussion, p. 32, l. 774-780: “…follow-up research utilizing erythroblast model cell lines must be conducted to determine if specific mutations that disrupt CDIN1-Codanin1 binding also affect ASF1 localization and cause a phenotype typical of CDA-I. In future work, additional Codanin1 mutations, including those outside the C-terminal region, should be evaluated to determine how the mutations affect ASF1’s nuclear concentration and subcellular localization. The proposed research directions will provide additional deeper insights into the underlying mechanisms of the molecular origin of inherited anemia CDA-I.” We believe that the revisions objectively clarify the significance and the limits of the current work and set the stage for the detailed functional studies to follow.

      __Reviewer #3 – Evidence, reproducibility and clarity: __

      Congenital Dyserythropoietic Anemia Type I (CDA I) is an autosomal recessive disorder characterized by ineffective erythropoiesis and distinctive nuclear morphology ("Swiss cheese" heterochromatin) in erythroblasts. CDA I is caused by mutations in CDAN1 and CDIN1. Codanin1, encoded by CDAN1, is part of the cytosolic ASF1-H3.1-H4-Importin-4 complex, which regulates histone trafficking to the nucleus. CDIN1 has been shown to bind the C-terminal domain of Codanin-1, but until now, pathogenic mutations had not been directly linked to the disruption of this interaction.

      In this study, the authors used biophysical techniques to characterize the interaction between Codanin-1's C-terminal region (residues 1005-1227) and CDIN1, demonstrating high-affinity, equimolar binding. HDX-MS identified interaction hotspots, and disease-associated mutations in these regions disrupted complex formation. The authors propose that such disruption prevents ASF1 sequestration in the cytoplasm, thereby reducing nuclear histone levels and contributing to the chromatin abnormalities seen in CDA I.

      Major Comments:

      1. Use of Codanin-1 Fragment:

      Most experiments were conducted using only the C-terminal 223 amino acids of Codanin-1. While this region is known to bind CDIN1, it is unclear whether its conformation is maintained in the context of the full-length protein. This could affect binding properties and structural interpretations. The authors should discuss how structural differences between the isolated C-terminus and the full-length Codanin-1 may influence the conclusions.

      Response of authors ____#3

      3.1 Response: Use of Codanin-1 Fragment as biding part to CDIN1

      We thank the reviewer for the important observation regarding the use of the C-terminal fragment of Codanin1. As noted in the manuscript (e.g., p. 30, line 721 and p. 32, line 761), we fully acknowledge that the truncation of Codanin1 may influence its conformational dynamics or contextual folding relative to the full-length protein.

      However, several lines of evidence suggest that the C-terminal 223 amino acid residues—responsible for CDIN1 binding—are structurally autonomous and have minimal intramolecular contacts with upstream regions. Published cryo-EM and biochemical data (Jeong, Frater et al. 2025, Sedor and Shao 2025), in conjunction with AlphaFold structural predictions (Fig. 2D) and our co-immunoprecipitation assays (Fig. 3F), consistently support a model wherein the CDIN1-binding region is flexible and spatially isolated from the core structural domains of Codanin1. Additionally, results from our co-immunoprecipitation assay (Fig. 3F) indicate that full-length Codanin1 and truncated Codanin1Cterm interact with CDIN1 similarly, further supporting the isolated manner of the C-terminal fragment. The available data together imply that the C-terminal fragment used in our study retains its native conformation and binding properties when expressed independently.

      While our findings are confined to the interaction domain and do not reflect full-length Codanin1’s architecture, we believe the use of the C-terminal minimal fragment of Codanin1 enables precise dissection of the CDIN1-binding interface and yields mechanistic insights without introducing significant structural artifacts.

      We agree with the reviewer that future work incorporating full-length Codanin1, especially in a cellular context, will be instrumental to fully characterize higher-order assembly and regulatory functions.

      __Reviewer #3 – 2. ____Graphical Abstract and Domain Independence: __

      The graphical abstract presents the Codanin-1 C-terminus as an independent domain, but no direct evidence is provided to support its structural autonomy in vivo.

      The authors should clarify whether the C-terminal region functions as a distinct domain in the context of the full-length protein.

      __3.2 Response –____ Independent C-terminal domain __

      We thank the reviewer for bringing up the question of the independence of the C-terminal domain. Although direct in vivo proof of C-terminal autonomy is not yet available, published cryo-EM structures of full-length Codanin1, our biophysical characterization, and AlphaFold models all consistently indicate that the C-terminal 223 amino acid residues of Codanin1 form a structurally independent binding module. In the graphical abstract, we illustrated the C‑terminal domain as a loosely connected part of Codanin1 to highlight its independence and to emphasize the specific focus of our studies.

      To articulate limitations of our studies focused on the C-terminal part of Codanin1, we stated in the Functional implications of CDA-I-related mutations in the Discussion, p. 30, l. 721-724: “However, our measurements do not exclude the possible role of the disordered regions in full-length Codanin1. For example, CDIN1 could potentially stabilize full-length Codanin1 by rearranging the disordered regions into a more condensed structure, thereby augmenting the structural stability of Codanin1.”

      Reviewer #3 – 3.____Pathogenic Mutations Beyond the Binding Site:

      The study highlights a triplet mutation that impairs CDIN1 binding. However, most CDA I‑associated mutations in CDAN1 are dispersed across the entire protein and may not affect CDIN1 interaction directly.

      The authors should discuss alternative mechanisms by which mutations in other regions of Codanin-1 might cause disease.

      3.3 Response – Pathogenic mutations outside the binding site – alternative mechanisms

      We appreciate the reviewer noting that most CDA-I-associated CDAN1 mutations are outside the CDIN1-Codanin1 binding site and suggesting alternative mechanisms. In the revised Discussion, we added a paragraph on alternative pathogenic models, p. 29, l. 702-713:

      "Our study centers on the CDIN1-binding C-terminus, however, most CDA-I-associated CDAN1 mutations lie elsewhere and probably act through alternative mechanisms. Mutations such as P672L and F868I in the LOBE2 (452-798 amino acid residue) and F868I in the coiled-coil (841-1000 amino acid residue) domains may disturb Codanin1 homodimerization and higher-order complex assembly, directly affecting ASF1 sequestration (Jeong, T. K. et al. Nature Comm. 2025). Other mutant variants may also interfere with ASF1 sequestration, nuclear targeting, or chromatin-remodeling functions, while destabilizing mutations may induce misfolding and proteasomal degradation. Moreover, CDA-I-associated mutations, such as R714W and R1042W, might compromise the interaction between Codanin1 and ASF1 (Ask, Jasencakova et al. 2012). Collectively, the complementary alternative pathogenic mechanisms associated with Codanin1 mutations in distal regions and mutations in CDIN1‑binding C-terminus of Codanin1 may contribute to erythroid dysfunction in CDA-I."

      Reviewer #3 – 4. ____Contradictory Functional Models:

      Ask et al. (EMBO J, 2012) reported that Codanin-1 depletion increases nuclear ASF1 and accelerates DNA replication. This contrasts with the current hypothesis that disruption of the Codanin-1/CDIN1 complex reduces nuclear ASF1.

      The authors should attempt to reconcile this apparent contradiction, possibly by proposing a context-specific or dual-function model for Codanin-1 in histone trafficking.

      3.4 Response – ____Clarified explanation of hypothetical functional model

      We thank the reviewer for raising this point, which improved the clarity of our work. There is no real discrepancy between Ask et al. and our findings; both agree that Codanin1 restrains ASF1 in the cytoplasm. Ask et al. examined the complete loss of Codanin1, which abolishes cytoplasmic ASF1 sequestration and thus leads to maximal nuclear accumulation. We suggest the CDA-I-associated mutations selectively disrupt the CDIN1-Codanin1 interface, releasing ASF1 from the cytoplasm into the nucleus.

      To enhance clarity, we now state in the legend of Figure 4 describing the hypothesis (p. 31, l. 752-753): "…CDA-I-associated mutations prevent CDIN1-Codanin1 complex formation, thus prevent ASF1 sequestration to cytoplasm; ASF1 remains accumulated in nucleus."

      Reviewer #3 – 5. ____Conclusions and Claims:

      The proposed model of CDA I pathogenesis (Fig. 4) is plausible but not yet fully supported by the available data. The authors suggest that disruption of the Codanin-1/CDIN1 interaction leads to nuclear histone depletion, but this has not been experimentally confirmed.

      Claims about the general pathogenesis of CDA I should be clearly qualified as hypothetical and applicable to a subset of mutations. The presence and localization of ASF1 in the nucleus following disruption of the Codanin-1/CDIN1 complex should be tested experimentally.

      3.5 Response – __Tempered ____conclusions and claims: __

      We thank the reviewer for underscoring the need to temper our conclusions and to distinguish hypotheses from available results. We fully agree that our Fig. 4 model—linking disruption of the Codanin1-CDIN1 interface to nuclear histone imbalance—remains a working hypothesis, currently supported by indirect biochemical and structural data.

      Accordingly, we have:

      • Revised the text to explicitly state that this model is hypothetical and pertains to a subset of CDA-I-associated CDAN1 mutations. Specifically, we

      • Added to the last paragraph of the section Functional implications of CDA-I-related mutations in Discussion (p. 31, l. 744-749): “In considering functional implications of our findings within available data, it is essential to qualify that mechanistic claims regarding the general pathogenesis of CDA-I remain hypothetical and are restricted to a specific subset of mutations. Furthermore, direct experimental validation, such as immunolocalization or live-cell imaging, to assess ASF1’s nuclear presence and distribution following disruption of the CDIN1-Codanin1 complex is required to substantiate the proposed model.”

      • Included in the legend of Fig. 4: ”The proposed model represents a working hypothesis relating to a subset of CDA-I mutations and is not currently substantiated by experimental evidence at the cellular level.”
      • Replaced any associated definitive language (e.g., “leads to”) with qualified phrasing (e.g., “may contribute to”) in the legend of Fig. 4.
      • Clarified in the Discussion that direct measurement of nuclear ASF1 redistribution and histone levels following interface disruption has not yet been performed. Specifically, we added to the section Functional implications of CDA-I-related mutations in Discussion (p. 30, l. 734-735): “It should be noted, however, that direct quantification of nuclear ASF1 redistribution and histone levels after CDIN1-Codanin1 disruption has not yet been conducted.” Although experimental verification of nuclear ASF1 localization upon CDIN1-Codanin1 complex disruption falls beyond the current manuscript’s scope, we acknowledge its importance and have emphasized the need for such studies in future work within the Future research directions of the Discussion. Specifically, we concluded by stating (p. 32, l. 774-776): “Finally, follow‑up research utilizing erythroblast model cell lines must be conducted to determine if specific mutations that disrupt CDIN1-Codanin1 binding, also affect ASF1 localization and cause a phenotype typical of CDA-I.”

      __Reviewer #3 – 6.____Broader Mutation Analysis and ASF1 Localization: __

      To strengthen the link between Codanin-1/CDIN1 disruption and disease pathogenesis, it would be important to test the effects of additional CDAN1 mutations, including those outside the C-terminal region. Similarly, the impact on ASF1 nuclear concentration and localization should be directly assessed. These experiments would significantly bolster the central hypothesis. If feasible, they should be pursued or at least acknowledged as important future directions.

      3.6 Response – Broader mutation analysis and ASF1 localization in future directions

      We thank Reviewer #3 for emphasizing the value of a broader mutation survey and direct ASF1 localization studies. As noted above, our current manuscript is centered on delineating the molecular architecture of the CDIN1-Codanin1Cterm core interface; comprehensive mutational analyses outside the C-terminal binding region and ASF1-dependent functional assays will be critical to extend these findings but fall beyond the scope of the present work and will be the objective of our following studies. To address the reviewer’s concern, we have:

      • Expanded the Future Directions section to specify that additional CDA-I-linked CDAN1 variants, including non-C-terminal mutations, and quantitative assessments of ASF1 nuclear localization will be the subject of ongoing and planned investigations. Specifically, we added (p. 32, l. 776-778):” In future work, additional Codanin1 mutations, including those outside the C-terminal region, should be evaluated to determine how the mutations affect ASF1’s nuclear concentration and subcellular localization.”

      • Emphasized the need for complementary in vivo validation in erythroblast models to confirm whether the disturbance of CDIN1-Codanin1 binding recapitulates CDA-I phenotypes. We acknowledged the need for cell-line studies in future work within the Future research directions of Discussion (p. 32, l. 774-776): “Finally, follow-up research utilizing erythroblast model cell lines must be conducted to determine if specific mutations that disrupt CDIN1-Codanin1 binding, also affect ASF1 localization and cause a phenotype typical of CDA-I.” We believe these changes more precisely delimit the scope and significance of the current study while laying out a clear roadmap for the essential follow-up experiments.

      Reviewer #3 – 7. ____Rigor and Presentation and Cross-commenting

      __Minor Comments: __

      • Methods and Reproducibility:

      The experimental methods are well described, and the results appear reproducible.

      • Presentation:

      The text and figures are clear and well organized.

      Referee Cross-commenting

      I agree with reviewer 1 that the paper presents detailed structure study of Codanin-1 and CDIN1 protein. However, as reviewer 2 claims functional studies are missing and therefore the hypothesis regarding the pahtogenesis of CDAI is speculaltive especially with no studies regarding ASF1.

      3____.7 Response ____–____ Rigor and Presentation and Cross-commenting:

      We thank the reviewers for their positive appraisal of our results' reproducibility, presentation, and method descriptions. We also appreciate the cross-comment that, while our structural analysis of the CDIN1-Codanin1 complex is thorough, functional validation, particularly regarding ASF1, remains to be addressed.

      As outlined above, we have revised the manuscript to:

      • Emphasize that pathogenic hypotheses drawn from structural data are provisional (refer to Responses 2.2, 2.3, and 3.5).
      • Include follow-up studies for ASF1 localization assays and broader mutation profiling in our Future Directions (refer to Responses 2.4, 3.5, 3.6).
      • Integrate cautious language throughout to clearly delineate verified findings from model-based speculation (refer to Responses 2.4, 3.5, 3.6). The implemented adjustments ensure that the current work is positioned as a detailed structural and interaction foundation, upon which the essential functional studies will build. We believe that all extensions and clarifications fully satisfy the reviewers’ collective recommendations.

      __Reviewer #3 –____ Significance: __

      Nature and Significance of the Advance:

      This study extends prior work (e.g., Swickley et al., BMC Mol Cell Biol 2020; Shroff et al., Biochem J 2020) on Codanin-1/CDIN1 interaction by applying high-resolution biophysical techniques to identify mutations that disrupt this complex. It provides a plausible cellular mechanism by which specific mutations may lead to CDA I through impaired histone trafficking.

      Nevertheless, key question remains: How do mutations outside the Codanin-1 C-terminus contribute to the pathology?

      3.8 Response – Significance:

      • We thank Reviewer #3 for this important point. Although our work specifically dissects the C-terminal CDIN1-binding domain of Codanin1, we fully acknowledge that CDA-I-associated mutations throughout Codanin1 may operate via additional mechanisms. To address the additional mechanisms, we have added a new paragraph describing other possible pathogenic models to the Discussion (please refer to Response 3.3).
      • We also fully acknowledged the need for systematic functional assays of non-C-terminal mutations and their impact on ASF1 localization (please refer to Response 3.6).
      • We revised the text to clarify how mutations beyond the C-terminus may contribute to CDA-I pathogenesis and present the significance of our current structural analyses, biophysical characterizations, and molecular insights as a foundation for future research (please refer to Response 3.6). __Audience: __

      • Molecular and cellular biologists investigating nuclear-cytoplasmic trafficking mechanisms

      • Hematologists and geneticists studying rare red cell disorders
      • Clinicians managing CDA I patients and researchers exploring targeted therapies __Reviewer Expertise: __

      Pediatric hematologist with over 20 years of research experience in CDA I, including the initial identification of CDAN1 and the elucidation of Codanin-1's role in embryonic erythropoiesis. Not a specialist in the biophysical techniques used in this study.

      References

      Ask, K., Z. Jasencakova, P. Menard, Y. Feng, G. Almouzni and A. Groth (2012). "Codanin-1, mutated in the anaemic disease CDAI, regulates Asf1 function in S-phase histone supply." The EMBO Journal 31(8): 2013–2023.

      Jeong, T.-K., R. C. M. Frater, J. Yoon, A. Groth and J.-J. Song (2025). "CODANIN-1 sequesters ASF1 by using a histone H3 mimic helix to regulate the histone supply." Nature Communications 16(1): 2181.

      Sedor, S. F. and S. Shao (2025). "Mechanism of ASF1 engagement by CDAN1." Nature Communications 16(1): 2599.

      Swickley, G., Y. Bloch, L. Malka, A. Meiri, S. Noy-Lotan, A. Yanai, H. Tamary and B. Motro (2020). "Characterization of the interactions between Codanin-1 and C15Orf41, two proteins implicated in congenital dyserythropoietic anemia type I disease." Molecular and Cell Biology 21(1).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Congenital Dyserythropoietic Anemia type I (CDA I) is an autosomal recessive disorder characterized by ineffective erythropoiesis and distinctive nuclear morphology ("Swiss cheese" heterochromatin) in erythroblasts. CDA I is caused by mutations in CDAN1 and CDIN1. Codanin-1, encoded by CDAN1, is part of the cytosolic ASF1-H3.1-H4-Importin-4 complex, which regulates histone trafficking to the nucleus. CDIN1 has been shown to bind the C-terminal domain of Codanin-1, but until now, pathogenic mutations had not been directly linked to the disruption of this interaction. In this study, the authors used biophysical techniques to characterize the interaction between Codanin-1's C-terminal region (residues 1005-1227) and CDIN1, demonstrating high-affinity, equimolar binding. HDX-MS identified interaction hotspots, and disease-associated mutations in these regions disrupted complex formation. The authors propose that such disruption prevents ASF1 sequestration in the cytoplasm, thereby reducing nuclear histone levels and contributing to the chromatin abnormalities seen in CDA I.

      Major Comments:

      1. Use of Codanin-1 Fragment: Most experiments were conducted using only the C-terminal 223 amino acids of Codanin-1. While this region is known to bind CDIN1, it is unclear whether its conformation is maintained in the context of the full-length protein. This could affect binding properties and structural interpretations. The authors should discuss how structural differences between the isolated C-terminus and the full-length Codanin-1 may influence the conclusions.
      2. Graphical Abstract and Domain Independence: The graphical abstract presents the Codanin-1 C-terminus as an independent domain, but no direct evidence is provided to support its structural autonomy in vivo. The authors should clarify whether the C-terminal region functions as a distinct domain in the context of the full-length protein.
      3. Pathogenic Mutations Beyond the Binding Site: The study highlights a triplet mutation that impairs CDIN1 binding. However, most CDA I-associated mutations in CDAN1 are dispersed across the entire protein and may not affect CDIN1 interaction directly. The authors should discuss alternative mechanisms by which mutations in other regions of Codanin-1 might cause disease.
      4. Contradictory Functional Models: Ask et al. (EMBO J, 2012) reported that Codanin-1 depletion increases nuclear ASF1 and accelerates DNA replication. This contrasts with the current hypothesis that disruption of the Codanin-1/CDIN1 complex reduces nuclear ASF1. The authors should attempt to reconcile this apparent contradiction, possibly by proposing a context-specific or dual-function model for Codanin-1 in histone trafficking.
      5. Conclusions and Claims: The proposed model of CDA I pathogenesis (Fig. 4) is plausible but not yet fully supported by the available data. The authors suggest that disruption of the Codanin-1/CDIN1 interaction leads to nuclear histone depletion, but this has not been experimentally confirmed. Claims about the general pathogenesis of CDA I should be clearly qualified as hypothetical and applicable to a subset of mutations. The presence and localization of ASF1 in the nucleus following disruption of the Codanin-1/CDIN1 complex should be tested experimentally.
      6. Broader Mutation Analysis and ASF1 Localization: To strengthen the link between Codanin-1/CDIN1 disruption and disease pathogenesis, it would be important to test the effects of additional CDAN1 mutations, including those outside the C-terminal region. Similarly, the impact on ASF1 nuclear concentration and localization should be directly assessed. These experiments would significantly bolster the central hypothesis. If feasible, they should be pursued or at least acknowledged as important future directions.

      Minor Comments:

      • Methods and Reproducibility: The experimental methods are well described, and the results appear reproducible.
      • Presentation: The text and figures are clear and well organized.

      Referee Cross-commenting

      I agree with reviewer 1 that the paper present detailed strucutre study of Codann-1 and CDIN1 protein. However, as reviewer 2 claims functional studies are missing and therefore the hypothesis regarding the pahtogenesis of CDAI is speculaltive especially with no studies regarding ASF1.

      Significance

      Nature and Significance of the Advance:

      This study extends prior work (e.g., Swickley et al., BMC Mol Cell Biol 2020; Shroff et al., Biochem J 2020) on Codanin-1/CDIN1 interaction by applying high-resolution biophysical techniques to identify mutations that disrupt this complex. It provides a plausible cellular mechanism by which specific mutations may lead to CDA I through impaired histone trafficking. Nevertheless, key question remains: How do mutations outside the Codanin-1 C-terminus contribute to the pathology?

      Audience:

      Molecular and cellular biologists investigating nuclear-cytoplasmic trafficking mechanisms Hematologists and geneticists studying rare red cell disorders Clinicians managing CDA I patients and researchers exploring targeted therapies

      Reviewer Expertise:

      Pediatric hematologist with over 20 years of research experience in CDA I, including the initial identification of CDAN1 and the elucidation of Codanin-1's role in embryonic erythropoiesis. Not a specialist in the biophysical techniques used in this study

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript presents structural and biochemical characterization of the interaction between CDIN1 and the C-terminal domain of Codanin1, shedding light on a complex implicated in Congenital Dyserythropoietic Anemia Type I (CDA-I). While the authors provide valuable structural insights and identify disease-associated mutations that impair CDIN1-Codanin1 binding, I think several important concerns should be addressed to strengthen both the mechanistic claims and their functional relevance.

      Contradiction Between Stoichiometry Models:

      The authors propose that CDIN1 and Codanin1Cterm primarily form a heterodimer in vitro. However, this appears to contradict previous reports indicating a tetra-heteromeric arrangement. Additionally, while CDIN1 homodimerize seems confusing to me, do the authors suggest it is stable without Codanin1? This seems contrary to findings that CDIN1 is unstable in the absence of Codanin1(Sedor, S.F., Shao, S. nature comm 2025, Swickley, G., Bloch, Y., Malka, L. et al 2020 BMC Mol and Cell Biol). These inconsistencies raise concerns about whether the observed stoichiometries are physiologically relevant or artifacts of in vitro reconstitution, especially since full-length Codanin1 was not studied.

      Unvalidated Functional Claims:

      The manuscript identifies several CDA-I-associated mutations that disrupt CDIN1-Codanin1 interaction. However, the authors do not test how these mutations affect the biological function of the complex, particularly its role in ASF1 sequestration or histone trafficking. Given the central importance of this axis in their disease model, functional validation (e.g., ASF1 localization, histone deposition assays) is necessary to support these mechanistic conclusions.

      Speculative and Potentially Contradictory Model:

      The proposed model suggests that CDIN1 competes with ASF1 for Codanin1 binding, thereby indirectly promoting histone delivery to the nucleus. However, emerging data indicate that Codanin1, CDIN1, and ASF1 can form a stable ternary complex, calling into question this competitive binding hypothesis (Sedor, S.F., Shao, S. nature comm 2025). The authors do not acknowledge or discuss these findings, and the model in its current form may therefore be oversimplified or inaccurate.

      Significance

      Overall, the study adds to our structural understanding of CDIN1 and Codanin1 interactions, but the functional interpretations are currently speculative, and in some cases in conflict with existing literature. The manuscript would benefit significantly from addressing these discrepancies, incorporating relevant data on ASF1, and clarifying whether the observed assemblies reflect physiological complexes.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a rigorous biophysical characterization of a protein-protein interaction relevant to CDA-1 disease. The two proteins were purified in an E. coli host but CD and DLS was performed to ensure that the purified protein is well folded. An impressive native protein EMSA was used to show a 1:1 complex. While common for protein-nucleic acid complexes, EMSAs are much more challenging with protein complexes. A higher-running complex, likely a heterotetramer was implied at higher protein concentrations. These results were supported with SEC-MALS analysis and analytic ultracentrifugation analysis. Thermophoresis and ITC were used to report a nanomolar affinity of the proteins for each other. SEC-SAXS supported the conclusions about stoichiometry and composition inferred from the earlier methods and suggested that the dimerization interface comes from CDIN1. Next HDX-MS was used to identify putative interface residues, which were then mutated in each of the proteins and assessed for binding using coimmunoprecipitation. This study uses at least 10 orthogonal biophysical and/or biochemical methodologies to characterize an important protein-protein interaction and the analysis is clear and so is the writing. I couldn't (reading it once) find any grammatical or other errors in the text or figures. This manuscript is top-quality and suitable for publication.

      Significance

      Such detailed structural and mechanistic studies are greatly lacking in many clinical conditions for which mutations are known (unless they cause cancer, neurodegenerative disease, and so on). We need more such studies on disease topics! This study will be of interest for the hematologic diseases community.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      I would like to thank the reviewers for their comments and interest in the manuscript and the study.

      Reviewer #1

      1. I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning.

      The directional positioning of CTCF-binding sites at chromatin interaction sites was analyzed by CRISPR experiment (Guo Y et al. Cell 2015). We found that the machine learning and statistical analysis showed the same directional bias of CTCF-binding motif sequence and RAD21-binding motif sequence at chromatin interaction sites as the experimental analysis of Guo Y et al. (lines 229-253, Figure 3b, c, d and Table 1). Since CTCF is involved in different biological functions (Braccioli L et al. Essays Biochem. 2019 ResearchGate webpage), the directional bias of binding sites may be reduced in all binding sites including those at chromatin interaction sites (lines 68-73). In our study, we investigated the DNA-binding sites of proteins using the ChIP-seq data of DNA-binding proteins and DNase-seq data. We also confirmed that the DNA-binding sites of SMC3 and RAD21, which tend to be found in chromatin loops with CTCF, also showed the same directional bias as CTCF by the computational analysis.

      __2. Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. __

      Following the reviewer's advice, I performed the same analysis with the DNA-binding sites that do no overlap with the DNA-binding sites of CTCF and cohesin (RAD21 and SMC3) (Fig. 6 and Supplementary Fig. 4). The result showed the same tendency in the distribution of DNA-binding sites. The height of a peak on the graph became lower for some DNA-binding proteins after removing the DNA-binding sites that overlapped with those of CTCF and cohesin. I have added the following sentence on lines 435 and 829: For the insulator-associated DBPs other than CTCF, RAD21, and SMC3, the DNA-binding sites that do not overlap with those of CTCF, RND21, and SMC3 were used to examine their distribution around interaction sites.

      3. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As suggested by the reviewer, I have added the insulator scores and boundary sites from the 4D nucleome data portal as tracks in the UCSC genome browser. The insulator scores seem to correspond to some extent to the H3K27me3 histone marks from ChIP-seq (Fig. 4a and Supplementary Fig. 3). We found that the DNA-binding sites of the insulator-associated DBPs were statistically overrepresented in the 5 kb boundary sites more than other DBPs (Fig. 4d). The direction of DNA-binding sites on the genome can be shown with different colors (e.g. red and green), but the directionality of insulator-associated DNA-binding sites is their overall tendency, and it may be difficult to notice the directionality from each binding site because the directionality may be weaker than that of CTCF, RAD21, and SMC3 as shown in Table 1 and Supplementary Table 2. We also observed the directional biases of CTCF, RAD21, and SMC3 by using Micro-C chromatin interaction data as we estimated, but the directionality was more apparent to distinguish the differences between the four directions of FR, RF, FF, and RR using CTCF-mediated ChIA-pet chromatin interaction data (lines 287 and 288).

       I found that the CTCF binding sites examined by a wet experiment in the previous study may not always overlap with the boundary sites of chromatin interactions from Micro-C assay (Guo Y et al. *Cell* 2015). The chromatin interaction data do not include all interactions due to the high sequencing cost of the assay, and include less long-range interactions due to distance bias. The number of the boundary sites may be smaller than that of CTCF binding sites acting as insulators and/or some of the CTCF binding sites may not be locate in the boundary sites. It may be difficult for the boundary location algorithm to identify a short boundary location. Due to the limitations of the chromatin interaction data, I planned to search for insulator-associated DNA-binding proteins without using chromatin interaction data in this study.
      
       I discussed other causes in lines 614-622: Another reason for the difference may be that boundary sites are more closely associated with topologically associated domains (TADs) of chromosome than are insulator sites. Boundary sites are regions identified based on the separation of numerous chromatin interactions. On the other hand, we found that the multiple DNA-binding sites of insulator-associated DNA-binding proteins were located close to each other at insulator sites and were associated with distinct nested and focal chromatin interactions, as reported by Micro-C assay. These interactions may be transient and relatively weak, such as tissue/cell type, conditional or lineage-specific interactions.
      
       Furthermore, I have added the statistical summary of the analysis in lines 372-395 as follows: Overall, among 20,837 DNA-binding sites of the 97 insulator-associated proteins found at insulator sites identified by H3K27me3 histone modification marks (type 1 insulator sites), 1,315 (6%) overlapped with 264 of 17,126 5kb long boundary sites, and 6,137 (29%) overlapped with 784 of 17,126 25kb long boundary sites in HFF cells. Among 5,205 DNA-binding sites of the 97 insulator-associated DNA-binding proteins found at insulator sites identified by H3K27me3 histone modification marks and transcribed regions (type 2 insulator sites), 383 (7%) overlapped with 74 of 17,126 5-kb long boundary sites, 1,901 (37%) overlapped with 306 of 17,126 25-kb long boundary sites. Although CTCF-binding sites separate active and repressive domains, the limited number of DNA-binding sites of insulator-associated proteins found at type 1 and 2 insulator sites overlapped boundary sites identified by chromatin interaction data. Furthermore, by analyzing the regulatory regions of genes, the DNA-binding sites of the 97 insulator-associated DNA-binding proteins were found (1) at the type 1 insulator sites (based on H3K27me3 marks) in the regulatory regions of 3,170 genes, (2) at the type 2 insulator sites (based on H3K27me3 marks and gene expression levels) in the regulatory regions of 1,044 genes, and (3) at insulator sites as boundary sites identified by chromatin interaction data in the regulatory regions of 6,275 genes. The boundary sites showed the highest number of overlaps with the DNA-binding sites. Comparing the insulator sites identified by (1) and (3), 1,212 (38%) genes have both types of insulator sites. Comparing the insulator sites between (2) and (3), 389 (37%) genes have both types of insulator sites. From the comparison of insulator and boundary sites, we found that (1) or (2) types of insulator sites overlapped or were close to boundary sites identified by chromatin interaction data.
      

      4. The suggested alternative transcripts function, also highlighted in the manuscripts abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.

      According to the reviewer's comment, I performed the genome-wide analysis of alternative transcripts where the DNA-binding sites of insulator-associated proteins are located near splicing sites. The DNA-binding sites of insulator-associated DNA-binding proteins were found within 200 bp centered on splice sites more significantly than the other DNA-binding proteins (Fig. 4e and Table 2). I have added the following sentences on lines 405 - 412: We performed the statistical test to estimate the enrichment of insulator-associated DNA-binding sites compared to the other DNA-binding proteins, and found that the insulator-associated DNA-binding sites were significantly more abundant at splice sites than the DNA-binding sites of the other proteins (Fig 4e and Table 2; Mann‒Whitney U test, p value 5. Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.

      I believe that the Figure 1 would help researchers in other fields who are not familiar with biological phenomena and functions to understand the study. More explanation has been included in the Figures and legends of Figs. 4 and 5 to help readers outside the immediate research field understand the figures.

      6. Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Reviewer #2's comments would be related to this comment. I have introduced a more detailed explanation of the method in the Results section, as shown in the responses to Reviewer #2's comments.

      Reviewer #2

      1. Introduction, line 95: CTCF appears two times, it seems redundant.

      On lines 91-93, I deleted the latter CTCF from the sentence "We examine the directional bias of DNA-binding sites of CTCF and insulator-associated DBPs, including those of known DBPs such as RAD21 and SMC3".

      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?

      Although CTCF is known to be the main insulator protein in vertebrates, we found that 97 DNA-binding proteins including CTCF and cohesin are associated with insulator sites by modifying and developing a machine learning method to search for insulator-associated DNA-binding proteins. Most of the insulator-associated DNA-binding proteins showed the directional bias of DNA-binding motifs, suggesting that the directional bias is associated with the insulator.

       I have added the sentence in lines 96-99 as follows: Furthermore, statistical testing the contribution scores between the directional and non-directional DNA-binding sites of insulator-associated DBPs revealed that the directional sites contributed more significantly to the prediction of gene expression levels than the non-directional sites. I have revised the statement in lines 101-110 as follows: To validate these findings, we demonstrate that the DNA-binding sites of the identified insulator-associated DBPs are located within potential insulator sites, and some of the DNA-binding sites in the insulator site are found without the nearby DNA-binding sites of CTCF and cohesin. Homologous and heterologous insulator-insulator pairing interactions are orientation-dependent, as suggested by the insulator-pairing model based on experimental analysis in flies. Our method and analyses contribute to the identification of insulator- and chromatin-associated DNA-binding sites that influence EPIs and reveal novel functional roles and molecular mechanisms of DBPs associated with transcriptional condensation, phase separation and transcriptional regulation.
      

      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS.

      On lines 121-124, to explain the procedure for the SNP of an eQTL, I have added the sentence in the Methods: "If a DNA-binding site was located within a 100-bp region around a single-nucleotide polymorphism (SNP) of an eQTL, we assumed that the DNA-binding proteins regulated the expression of the transcript corresponding to the eQTL".

      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details.

      On line 119, I have included the explanation of the eQTL dataset of GTEx v8 as follows: " The eQTL data were derived from the GTEx v8 dataset, after quality control, consisting of 838 donors and 17,382 samples from 52 tissues and two cell lines". On lines 681 and 865, I have added the filename of the eQTL data "(GTEx_Analysis_v8_eQTL.tar)".

      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.

      The reviewer would mention Figure 2, not Figure 1. If so, the matrices in panels a and b in Figure 2 are equivalent. I have shown it in the figure: The same figure in panel a is rotated 90 degrees to the right. The green boxes in the matrix show the regions with the ChIP-seq peak of a DNA-binding protein overlapping with a SNP of an eQTL. I used eQTL data to associate a gene with a ChIP-seq peak that was more than 2 kb upstream and 1 kb downstream of a transcriptional start site of a gene. For each gene, the matrix was produced and the gene expression levels in cells were learned and predicted using the deep learning method. I have added the following sentences to explain the method in lines 133 - 139: Through the training, the tool learned to select the binding sites of DNA-binding proteins from ChIP-seq assays that were suitable for predicting gene expression levels in the cell types. The binding sites of a DNA-binding protein tend to be observed in common across multiple cell and tissue types. Therefore, ChIP-seq data and eQTL data in different cell and tissue types were used as input data for learning, and then the tool selected the data suitable for predicting gene expression levels in the cell types, even if the data were not obtained from the same cell types.

      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?

      As suggested by the reviewer, to help readers understand the observation, I have added Supplementary Fig. S4c to show the distribution of DNA-binding sites of "CTCF, RAD21, and SMC3" and "BACH2, FOS, ATF3, NFE2, and MAFK" around chromatin interaction sites. I have modified the following sentence to indicate the figure on line 501: Although a DNA-binding-site distribution pattern around chromatin interaction sites similar to those of CTCF, RAD21, and SMC3 was observed for DBPs such as BACH2, FOS, ATF3, NFE2, and MAFK, less than 1% of the DNA-binding sites of the latter set of DBPs colocalized with CTCF, RAD21, or SMC3 in a single bin (Fig. S4c).

       In Aljahani A et al. *Nature Communications* 2022, we find that depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Together, our data show that loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression. Goel VY et al. *Nature Genetics* 2023 mentioned in the abstract: Microcompartments frequently connect enhancers and promoters and though loss of loop extrusion and inhibition of transcription disrupts some microcompartments, most are largely unaffected. These results suggested that chromatin loops can be driven by other DBPs independent of the known CTCF/Cohesin.
      
      I added the following sentence on lines 569-577: The depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. Furthermore, the loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression.
      
       FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates (Ji D et al. *Molecular Cell* 2024). CTCF have also found to form transcriptional condensate and phase separation (Lee R et al. *Nucleic acids research* 2022). FOS was found to be an insulator-associated DNA-binding protein in this study and is potentially involved in chromatin remodeling, transcription condensation, and phase separation with the other factors such as BACH2, ATF3, NFE2 and MAFK. I have added the following sentence on line 556: FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates.
      

      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?

      Goel VY et al. Nature Genetics 2023 identified highly nested and focal interactions through region capture Micro-C, which resemble fine-scale compartmental interactions and are termed microcompartments. In the section titled "Most microcompartments are robust to loss of loop extrusion," the researchers noted that a small proportion of interactions between CTCF and cohesin-bound sites exhibited significant reductions in strength when cohesin was depleted. In contrast, the majority of microcompartmental interactions remained largely unchanged under cohesin depletion. Our findings indicate that most P-P and E-P interactions, aside from a few CTCF and cohesin-bound enhancers and promoters, are likely facilitated by a compartmentalization mechanism that differs from loop extrusion. We suggest that nested, multiway, and focal microcompartments correspond to small, discrete A-compartments that arise through a compartmentalization process, potentially influenced by factors upstream of RNA Pol II initiation, such as transcription factors, co-factors, or active chromatin states. It follows that if active chromatin regions at microcompartment anchors exhibit selective "stickiness" with one another, they will tend to co-segregate, leading to the development of nested, focal interactions. This microphase separation, driven by preferential interactions among active loci within a block copolymer, may account for the striking interaction patterns we observe.

       The authors of the paper proposed several mechanisms potentially involved in microcompartments. These mechanisms may be involved in looping with insulator function. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently (Hsieh TS et al. *Nature Genetics* 2022). Among the identified insulator-associated DNA-binding proteins, Maz and MyoD1 form loops without CTCF (Xiao T et al. *Proc Natl Acad Sci USA* 2021 ; Ortabozkoyun H et al. *Nature genetics* 2022 ; Wang R et al. *Nature communications* 2022). I have added the following sentences on lines 571-575: Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. I have included the following explanation on lines 582-584: Maz and MyoD1 among the identified insulator-associated DNA-binding proteins form loops without CTCF.
      
       As for the directionality of CTCF, if chromatin loop anchors have some structural conformation, as shown in the paper entitled "The structural basis for cohesin-CTCF-anchored loops" (Li Y et al. *Nature* 2020), directional DNA binding would occur similarly to CTCF binding sites. Moreover, cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops (Davidson IF et al. *Nature Reviews Molecular Cell Biology* 2021). Regarding loop extrusion, the 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions (Guerin TM et al. *EMBO Journal* 2024). I have added the following sentences on lines 543-547: Cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops. I have included the following sentences on lines 577-582: The 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions.
      
       Another model for the regulation of gene expression by insulators is the boundary-pairing (insulator-pairing) model (Bing X et al. *Elife* 2024) (Ke W et al. *Elife* 2024) (Fujioka M et al. *PLoS Genetics* 2016). Molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies. Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent. I have summarized the model on lines 559-567: Other types of chromatin regulation are also expected to be related to the structural interactions of molecules. As the boundary-pairing (insulator-pairing) model, molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies (Fig. 7). Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent.
      

      8. Do the authors think that the identified DBPs could work in that way as well?

      The boundary-pairing (insulator-pairing) model would be applied to the insulator-associated DNA-binding proteins other than CTCF and cohesin that are involved in the loop extrusion mechanism (Bing X et al. Elife 2024) (Ke W et al. Elife 2024) (Fujioka M et al. PLoS Genetics 2016).

       Liquid-liquid phase separation was shown to occur through CTCF-mediated chromatin loops and to act as an insulator (Lee, R et al. *Nucleic Acids Research* 2022). Among the identified insulator-associated DNA-binding proteins, CEBPA has been found to form hubs that colocalize with transcriptional co-activators in a native cell context, which is associated with transcriptional condensate and phase separation (Christou-Kent M et al. *Cell Reports* 2023). The proposed microcompartment mechanisms are also associated with phase separation. Thus, the same or similar mechanisms are potentially associated with the insulator function of the identified DNA-binding proteins. I have included the following information on line 554: CEBPA in the identified insulator-associated DNA-binding proteins was also reported to be involved in transcriptional condensates and phase separation.
      

      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?

      Snead WT et al. Molecular Cell 2019 mentioned that protein post-transcriptional modifications (PTMs) facilitate the control of molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin (Tang X et al. Nature Communications 2024). I found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Supplementary Fig. 2d). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation by PTMs. I have added the following explanation on lines 584-590: Furthermore, protein post-transcriptional modifications (PTMs) facilitate control over the molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin. We found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Fig. 4f and Supplementary Fig. 3c). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation through PTMs.

      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Structural molecular model of cohesin-CTCF-anchored loops has been published by Li Y et al. Nature 2020. The structural conformation of CTCF and cohesin in the loops would be the cause of the directional bias of CTCF binding sites, which I mentioned in lines 539 - 543 as follows: These results suggest that the directional bias of DNA-binding sites of insulator-associated DBPs may be involved in insulator function and chromatin regulation through structural interactions among DBPs, other proteins, DNAs, and RNAs. For example, the N-terminal amino acids of CTCF have been shown to interact with RAD21 in chromatin loops.

       To investigate the principles underlying the architectural functions of insulator-insulator pairing interactions, two insulators, Homie and Nhomie, flanking the *Drosophila even skipped *locus were analyzed. Pairing interactions between the transgene Homie and the eve locus are directional. The head-to-head pairing between the transgene and endogenous Homie matches the pattern of activation (Fujioka M et al. *PLoS Genetics* 2016).
      

      Reviewer #3

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.

      When a protein complex binds to DNA, one protein of the complex binds to the DNA directory, and the other proteins may not bind to DNA. However, the DNA motif sequence bound by the protein may be registered as the DNA-binding motif of all the proteins in the complex. The molecular structure of the complex of CTCF and Cohesin showed that both CTCF and Cohesin bind to DNA (Li Y et al. Nature 2020). I think there is a possibility that if the molecular structure of a protein complex becomes available, the previous recognition of the DNA-binding ability of a protein may be changed. Therefore, I searched the Pfam database for 99 insulator-associated DNA-binding proteins identified in this study. I found that 97 are registered as DNA-binding proteins and/or have a known DNA-binding domain, and EP300 and SIN3A do not directory bind to DNA, which was also checked by Google search. I have added the following explanation in line 257 to indicate direct and indirect DNA-binding proteins: Among 99 insulator-associated DBPs, EP300 and SIN3A do not directory interact with DNA, and thus 97 insulator-associated DBPs directory bind to DNA. I have updated the sentence in line 20 of the Abstract as follows: We discovered 97 directional and minor nondirectional motifs in human fibroblast cells that corresponded to 23 DBPs related to insulator function, CTCF, and/or other types of chromosomal transcriptional regulation reported in previous studies.

      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.

      As the reviewer mentioned, I recognize enhancers are relatively small regions. In the paper, I intended to examine further upstream and downstream of promoter regions where enhancers are found. Therefore, I have modified the sentence in lines 929 - 931 of the Fig. 2 legend as follows: Enhancer-gene regulatory interaction regions consist of 200 bins of 10 kbp between -1 Mbp and 1 Mbp region from TSS, not including promoter.

      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.

      Following the reviewer's advice, I have added the ChIP-seq data of H3K9me3 as a truck of the UCSC Genome Browser. The distribution of H3K9me3 signal was different from that of H3K27me3 in some regions. I also found the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions and took some screenshots of the UCSC Genome Browser of the regions around the sites in Supplementary Fig. 3b. I have modified the following sentence on lines 974 - 976 in the legend of Fig. 4: a Distribution of histone modification marks H3K27me3 (green color) and H3K9me3 (turquoise color) and transcript levels (pink color) in upstream and downstream regions of a potential insulator site (light orange color). I have also added the following result on lines 356 - 360: The same analysis was performed using H3K9me3 marks, instead of H3K27me3 (Fig. S3b). We found that the distribution of H3K9me3 signal was different from that of H3K27me3 in some regions, and discovered the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions (Fig. S3b).

      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      The resolution of the Micro-C assay is considered to be 100 bp and above, as the human nucleome core particle contains 145 bp (and 193 bp with linker) of DNA. However, internucleosomal DNA is cleaved by endonuclease into fragments of multiples of 10 nucleotides (Pospelov VA et al. Nucleic Acids Research 1979). Highly nested focal interactions were observed (Goel VY et al. Nature Genetics 2023). Base pair resolution was reported using Micro Capture-C (Hua P et al. Nature 2021). Sub-kilobase (20 bp resolution) chromatin topology was reported using an MNase-based chromosome conformation capture (3C) approach (Aljahani A et al. Nature Communications 2022). On the other hand, Hi-C data was analyzed at 1 kb resolution. (Gu H et al. bioRxiv 2021). If the resolution of Micro-C interactions is at best at 1 kb, the binding sites of a DNA-binding protein will not show a peak around the center of the genomic locations of interaction edges. Each panel shows the number of binding sites of a specific DNA-binding protein at a specific distance from the midpoint of all chromatin interaction edges. I have modified and added the following sentences in lines 593-597: High-resolution chromatin interaction data from a Micro-C assay indicated that most of the predicted insulator-associated DBPs showed DNA-binding-site distribution peaks around chromatin interaction sites, suggesting that these DBPs are involved in chromatin interactions and that the chromatin interaction data has a high degree of resolution. Base pair resolution was reported using Micro Capture-C.

      Minor Comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., ____https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2____or ____https://pubmed.ncbi.nlm.nih.gov/37486787____/). The authors should discuss how that would impact their results.

      The directional bias of CTCF binding sites was identified by ChIA-pet interactions of CTCF binding sites. The analysis of the contribution scores of DNA-binding sites of proteins considering the binding sites of CTCF as an insulator showed the same tendency of directional bias of CTCF binding sites. In the analysis, to remove the false-positive prediction of DNA-binding sites, I used the binding sites that overlapped with a ChIP-seq peak of the DNA-binding protein. This result suggests that the DNA-binding sites of CTCF obtained by the current analysis have sufficient quality. Therefore, if the accuracy of prediction of DNA-binding sites is improved, although the number of DNA-binding sites may be different, the overall tendency of the directionality of DNA-binding sites will not change and the results of this study will not change significantly.

       As for the first reference in the reviewer's comment, chromatin interaction data from Micro-C assay does not include all chromatin interactions in a cell or tissue, because it is expensive to cover all interactions. Therefore, it would be difficult to predict all chromatin interactions based on machine learning. As for the second reference in the reviewer's comment, pioneer factors such as FOXA are known to bind to closed chromatin regions, but transcription factors and DNA-binding proteins involved in chromatin interactions and insulators generally bind to open chromatin regions. The search for the DNA-binding motifs is not required in closed chromatin regions.
      

      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      In the DeepLIFT paper, the authors explain that DeepLIFT is a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input (Shrikumar A et al. ICML 2017). DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

       Truly explainable AI would be able to find cause and reason, and to make choices and decisions like humans. DeepLIFT does not perform causal inferences. I did not use the term "Explainable AI" in our manuscript, but I briefly explained it in Discussion. I have added the following explanation in lines 623-628: AI (Artificial Intelligence) is considered as a black box, since the reason and cause of prediction are difficult to know. To solve this issue, tools and methods have been developed to know the reason and cause. These technologies are called Explainable AI. DeepLIFT is considered to be a tool for Explainable AI. However, DeepLIFT does not answer the reason and cause for a prediction. It calculates scores representing the contribution of the input data to the prediction.
      
       Furthermore, to improve the readability of the manuscript, I have included the following explanation in lines 159-165: we computed DeepLIFT scores of the input data (i.e., each binding site of the ChIP-seq data of DNA-binding proteins) in the deep leaning analysis on gene expression levels. DeepLIFT compares the importance of each input for predicting gene expression levels to its 'reference or background level' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.
      
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Osato and Hamada propose a systematic approach to identify DNA binding proteins that display directional binding. They used a modified Deep Learning method (DEcode) to investigate binding profiles of 1356 DBP from GTRD database at promoters (30 of 100bp bins around TSS) and enhancers (200 bins of 10Kb around eSNPs) and use this to predict expression of 25,071 genes in Fibroblasts, Monocytes, HMEC and NPC. This method achieves a good prediction power (Spearman correlation between predicted and actual expression of 0.74). They then use PIQ, and overlap predicted binding sites with actual ChIP-seq data to investigate the motifs of TFs that are controlling gene expression. They find 99 insulator proteins showing either a specific directional bias or minor non-directional bias, corresponding to 23 DBP previously reported to have insulator function. Of the 23 proteins they identify as regulating enhancer promoter interactions, 13 are associated with CTCF. They also show that there are significantly more insulator proteins binding sites at borders of polycomb domains, transcriptionally active or boundary regions based on chromatin interactions than other proteins.

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.
      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.
      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.
      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.
      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Referee Cross-Commenting

      I would like to mention that I agree with the comments of reviewers 1 and 2.

      Significance

      General assessment:

      This is the first study to my knowledge that attempts to use Deep Learning to identify insulators and directional biases in binding. One of the limitations is that no additional methods were used to show that these DBP have directional binding bias. It is not necessarily to employ additional methods, but it would definitely strengthen the paper.

      Advancements:

      This is a useful catalogue of potential DNA binding proteins of interest, beyond just CTCF. Some known TFs are there, but also new ones are found.

      Audience:

      Basic research mainly, with particular focus on chromatin conformation and TF binding fields.

      My expertise:

      ML/AI methods in genomics, TF binding models, epigenetics and 3D chromatin interactions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see my points below). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the following list.

      Also, I encourage the authors to integrate the current presentation of the data with other (published) data about chromatin architecture, to make more robust the claims and go deeper into the biological implications of the current work. Se my list below.

      It follows a specific list of relevant points to be addressed:

      Specific points:

      1. Introduction, line 95: CTCF appears two times, it seems redundant;
      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?
      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS;
      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details;
      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.
      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?
      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?
      8. Do the authors think that the identified DBPs could work in that way as well?
      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?
      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Significance

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, chromatin organization is an important topic in the context of a constantly expanding research field. Therefore, the work is timely and could be useful for the community. The paper appears overall well written and the figures look clear and of good quality. Nevertheless, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see list of specific points). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the above reported points.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Osato and Hamada aims at computationally identifying a set of novel putative insulator-associated DNA binding proteins (DBPs) via estimation of their contribution to the expression of genes in the same chromosome region of their binding sites (+- 1Mbp from TSS). To achieve this, the authors leverage a deep learning architecture already published via which ChIP-seq peaks of DBPs in the TSS of a given gene are used to predict its expression level in four human cell lines.

      Building on this, the authors used another tool called DeepLIFT to evaluate the weight of each DBP binding site on the final gene expression value. Hence they made the assumption that if a given DBP had an insulator function they could restrict the prediction of the gene's expression to the region included between pairs of that DBP binding sites, and evaluate the pair's motif directionality bias in the distribution of weights. They exemplify their approach's validity by the fact that they can predict the known directionality bias of CTCF/cohesin-bound sites as the highest of the lot, with the F-R orientation of the pairs the most enriched, recapitulating what already known in literature: i.e., that F-R chromatin interaction peaks are the most enriched. In addition, they find several new DBPs showing significant directionality bias; hence they could be candidates for insulation activity. They then provide correlation between these putative insulator binding sites and sites of transition between euchromatin and heterochromatin by independently using histone mark and gene expression datasets. This, of course, is not surprising because (a) there is insulation between regions with heterotypic chromatin identities, and (b) it was already known from the first papers describing insulated chromatin domains that their boundaries were well-enriched for active transcription and transcriptional regulators (e.g., Dixon et al, Nature 2012).

      Finally, they use chromatin interaction (looping) sites to check the overlap between CTCF and all other DBPs and define a subset of putative insulator DBPs not overlapping CTCF peaks, suggesting potentially new insulatory mechanisms. These factors were all known transcriptional activators, but this part of the findings carry most of the novelty in the work and have the potential of opening up new directions for research in chromatin organization.

      Overall, the methodology applied here is adequate, clear, and reproducible. The major issue, in our view, is that the entire manuscript's findings relies on the usage of deepLIFT, a tool which was not benchmarked previously or by the current study. In fact, deepLIFT is public as regards its code, and also appears as a preprint from 2017 on biorXiv and published in the Proceedings of Machine Learning Research conference. Also, this key tool was developed by the Kundaje lab (who produce high quality alogrithms), and not by the authors. Therefore, the manuscript is predominantly based on the execution of existing workflows to publicly-available data. This does not take anything away from the interesting question posed here, but at the same time does not provide the community with any new algorithm/workflow.

      Finally, although I appreciate that the authors are purely computational and have likely no capacity for experimental validation of their claims of new DBPs having insulator roles, I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning. Using this kind of data, effects on gene expression can at least be tested in regard to the authors' predictions. Moreover, in terms of validation, Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As secondary issues, we would point out that:

      • The suggested alternative transcripts function, also highlighted in the manuscript;s abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.
      • Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.
      • Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Significance

      The scientific novelty of the work lies primarily in the identification of a set of DBPs that are proposed to confer insulator activity genome-wide. This has been long sought after in human data (whilst it is well understood and defined in Drosophila). The authors produce a quantitative ranking of the putative insulation effect of these DBPs and, most importantly, go on to identify a smaller subset that are apparently non-overlapping with anchors of CTCF-cohesin loop anchors; the presence of strong motif orientation biases in many DBPs can also be of broad interest, especially those that cannot be trivially ascribable to the loop extrusion process.

      However, although these findings open the way for speculation on multiple insulation mechanisms via proteins with multiple regulatory functions, the manuscript provide no experimental or computational means to test the proposed roles of these DBPs - and, as such, this limits the potential impact of the work and mostly targets researchers in the field of genome organization that can test these findings. Having said this, if validated, this work can significantly broaden our understanding of how chromatin is organized in 3D nuclear space.

      I typically identify myself to the authors: A. Papantonis, expertise in 3D genome architecture, chromatin biology, and genomics/bioinformatics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors provide a detailed characterization of the tumor microenvironment (TME) of 91 ovarian cancer patients, broken down in long and short-term survivors (post 5 years). The focus on the role of a subgroup of T cells, gamma/delta γδ) T cells with reported anti but also pro tumorigenic properties, Prior work of the lab has established a link between a subgroup of γδ T cells expressing CD73 and poor prognosis, due to the ability of these cells to produce immunosuppressive cytokines, such as IL10 or IL8 and the production of adenosine, by CD73, in the micromilieu. The data is further backed up by the analysis of fresh tumor specimens and tissue culture work.

      Here they continue this story by investigating the TME using tumor microarrays (91 samples), single cell RNA seq (12 patients), imaging mass cytometry (> 30 samples) and flow cytometry (form confirmatory purposes) to define cellular neighborhoods of CD73+ and CD73- γδ T cells. This revealed differences in cellular composition and spatial transcriptome analysis further helped to define the transcriptomes in γδ T cells, cancer cells and cancer associated fibroblasts.

      The authors conclude the in ovarian cancer γδ T cells expressing CD73 dampen anti-tumor immunity and propose detection and evaluation of CD73+ γδ T cells as prognostic marker.

      The manuscript is well written, and despite its descriptive nature, easy to follow. Data is presented in a clear and easy to read fashion.

      Reviewer #1 (Significance (Required)):

      Using a well characterized cohort of ovarian cancer patients with detailed clinical follow up the authors report on the predictive power of a subset of γδ T cells expressing CD73, with immune suppressive / regulatory capacity, reading out patient survival in high grade serous ovarian cancer, a still deadly disease. As such the identification of reliable markers predicting survival is a clear medical need. These findings contrast others made in different solid cancers, suggesting tumor type specific differences, which are only starting to emerge, but are of clear clinical relevance.

      What is unclear to me and needs to be addressed, is if these patient specimens were taken before or after initial therapy, whether the samples have been stratified according the treatment that they got, assuming it will be mostly platinum compounds (but maybe not), and that the p53 status of the tumors are (if genetics are available this would help to add some granularity to the study that, as it stands is largely descriptive, even though with extremely high resolution. This data should be available and could be integrated.

      We thank the reviewer for this insightful and constructive comment. We agree that clinical context and treatment stratification are essential to strengthen the interpretation and translational value of our findings.

      We confirm that all tumor samples used in this study were obtained prior to any systemic treatment, i.e., before first-line chemotherapy, during the Biopsy realized for the diagnosis. This information has now been clearly stated in the Methods and Results section (page 4, line 103) and also in Table S1.

      Although our primary aim was not to evaluate correlations with mutational status, we recognize the critical role that tumor genetics play in shaping the immune microenvironment. Using available clinical genomics data, we found that the TP53 mutational status of our cohort aligns with that of previous analyses. As expected for high-grade serous ovarian cancer (HGSOC), nearly all tumors exhibited TP53 mutations (present in 95% of patients). Due to the lack of variability in TP53 status, no meaningful stratification was observed based on this factor. This information has been added in the Materials and methods part (page 4 lines 104 to 106)

      Some minor issues

      • I would stick to CD73, and not mix it with NT5E, which is confusing at first (Fig 2).

      We appreciate this suggestion. To clarify the nomenclature and avoid confusion, we have consistently indicated throughout the text and figure legends that NT5E refers to the CD73 gene.

      • I would ask to compare the overall survival of CD73+ between densities - is it still significantly different in fig 1 - meaning is it about density, CD73 expression, or both. Comparing survival of tumors with a low density of CD73+ γδ T cells does not seem to be different from those having a low density of CD73- γδ T cells, which could be considered in data interpretation. Same for high density tumors.

      In the manuscript, the term “density” specifically refers to the density of γδ____ T cells and not the density of CD73 molecules expressed by these cells. Additionally, it is not feasible to conduct a density analysis of molecules using the data obtained from immunofluorescence (IF) staining of sample sections.

      Kaplan-Meier analyses were performed to assess patient survival based on the density of total γδ____ T cells, as well as the subsets of CD73⁺ and CD73⁻ γδ T cells. The results indicate that a higher density of γδ____ T cells is associated with poorer patient survival, with a more pronounced effect seen in those with a high density of CD73⁺ γδ T cells compared to those with CD73⁻ γδ T cells.

      As the reviewer pointed out, patients with a low density of CD73⁺ γδ T cells do not show significantly different survival outcomes compared to those with a low density of CD73⁻ γδ T cells (IC50 for low CD73⁺ = 6.0 years vs. IC50 for low CD73⁻ = 6.2 years). In response, we have revised the corresponding sentence in the text and included the IC50 values for greater clarity and informativeness (page 9).

      • figure 1, caption should include the word "patients" at the end, I guess.

      The modification has been done.

      • labelling and font can be improved in many panels, eg. the dot plots in Fig 2, panel B, right, same for panel C and D

      We appreciate the feedback on figure presentation. We have now updated Figure 2 with improved labeling, consistent font size, and enhanced resolution to ensure better readability across all panels, particularly panels B–D. The revised figure has been updated in the main manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript ("Deciphering the tumor-infiltrating CD73+ regulatory γδ T cell ecosystem associated with poor survival of patients with ovarian cancer"), Chabab et al. report on the phenotype and location of CD73+ γδ T cells in ovarian cancer. CD73+ γδ T cells can be immunosuppressive via the production of cytokines (IL-8, IL-10) and the expression of PD-L1. Here, the authors investigated the phenotype and location of CD73+ and -neg γδ T cells in ovarian cancers with a particular focus on the cells surrounding the γδ T cells in the tumour.

      Overall, the study is informative and well-performed. However, the way some of the data are presented does not allow to fully evaluate them. Besides this, this reviewer only has some minor comments.

      General comments:

      • The data provided in this manuscript are descriptive/correlative, and as such, causation cannot be inferred. Therefore, the language needs to reflect this; statements like "we investigated the impact of CD73+ regulatory γδ T cells in ovarian cancer" (L89) and "CD73+ γδ T cells were in close contact with more aggressive tumor cells" (L426), among others, are incorrect without functional data. The authors are advised to adjust the text throughout.

      We thank the reviewer for this thoughtful point. We have amended the text to make it consistent with the data.

      • Please make the figure legends self-explanatory without the need to search for the information in the M&M. For example, the graphs in fig1 and 3 contain many dots, but it is not explained what these dots represent. Please also add n for each experiment shown and state how often the experiment was performed independently.

      As requested by the reviewer, we have revised the figure legends to make them more explicit. We have indicated the number of biological replicates (n) and how many times each experiment was performed independently. This information has been added to each legend where consistent and relevant, to ensure clarity and reproducibility.

      • It would be helpful for the reader if abbreviations introduced in the M&M were also explained the first time they appeared in the results section.

      This point has been addressed as requested by the reviewer.

      • Please explain all abbreviations, e.g. FIGO, CST, NT5E, etc.
      • L235: typo 'that' instead of 'than'; L258 'reduced'; L259 'fig1d-f'; L451f twice 'CD73+'; 'naive' instead of 'naïve' throughout; SF2 legend: '2f' instead of '3f', SF9 legend: '1.105'.
      • L280ff: "Tumor cells ... were the most important cell type" - it may be clearer to use 'most frequent';

      All these points have been addressed.

      M&M - Please be consistent, if you provide catalogue numbers or dilutions (antibody, reagents) [which is good, maybe even adding the RRID number], do so for all items.

      This point has been addressed as requested by the reviewer.

      • The M&M does not state for the CAFs how long they were cultured before the supernatant was taken for the cytokine measurements.

      This point has been added in M&M section.

      • For the IL-6 ELISA, it is stated that the "cells were harvested"; what happened to them, and how do you get any SN from these cells?

      We have amended the protocol of IL-6 Elisa in M&M section for clarification.

      Figures Fig.1: - The authors used the word 'predict' in the heading, which seems not appropriate for a retrospective study; something like 'correlate' seems better.

      The word “predict” has been replaced by “correlate” as suggested by the reviewer.

      • Similarly, the title of the figure legends claims that the 'impact of γδ T cells' is shown, while only a correlation is presented.

      The title of the figure has been modified

      • For Fig1a-c, only summary data are presented. Please add exemplary pictures as well.

      Pictures of IF have been added as Supplementary Fig 1.

      • For Fig1d-f, the label for the x-axis is missing.

      The figure has been corrected.

      Fig.2 - It seems funny to call the patients 'naïve', maybe 'untreated' is clearer.

      We appreciate this suggestion and agree that ‘untreated’ is a clearer and more appropriate term in this context. We have replaced all instances of ‘naïve’ with ‘untreated’ throughout the manuscript to avoid ambiguity.

      • The graph in Fig2e does not allow comparing the cell frequencies properly. This would require either bar graphs or a table. Furthermore, the statistical analysis is missing. Without that, a statement like "associated with higher proportion of CAFs" (L265) is not supported.

      We thank the reviewer for this valuable observation. In response, we have replaced the original visualization in Figure 2E with grouped bar graphs showing the mean ± SEM of the relative proportions of each major cell type in the NT5E_low and NT5E_high groups, based on the median split. This format allows for clearer visual comparison of cell frequencies across conditions.

      Furthermore, we performed statistical comparisons using a t-test (a parametric test) on each population to evaluate differences in cell type proportions between the two groups. The results indicate a significantly higher proportion of CAFs and γδ T cells in the NT5E_high tumor profile. The corresponding p-values are provided in the figure legend. We hope this revised analysis and clearer presentation address the reviewer’s concerns.

      Fig.3 - For Fig3b+c, the IMC are derived from 4 patients (not clear for the flow data)

      As stated in both the figure legend and the text, the IMC analysis was conducted on 38 ROIs from four patient samples, while the flow cytometry analysis was performed on tumor samples from seven ovarian cancer patients.

      • did the authors noticed differences between patients?

      "As shown in new Figures 3b and 3c, no significant differences were observed between patients. Each individual patient is represented by a different color."

      • For Fig3e, the description in the text does not reflect the figure, e.g. cluster 1 does not show LAG3 expression, but this is claimed in the text (other descriptions are off as well).

      The text describing Fig. 3e has been amended in the new version of the manuscript.

      • In Fig3h, the authors stain cytokines in γδ T cells purified from ovarian cancer samples. The text seems to imply that the cytokine staining was performed directly ex vivo, without an in vitro stimulation of the cells, e.g. with PMA/ionomycin (if so, the description is missing). In any case, the values appear surprisingly high. Exemplary data are needed to clarify how the gating was done (for γδ T cells and the cytokines) and what the primary data looked like.

      The protocol has been amended in the “Materials and Methods” section. A gating strategy and primary data analysis from one representative patient are included in a supplementary Figure 4c.

      We agree with the reviewer’s comments that it is surprising that γδ T cell stimulation is not required for IL-8, IL10 and IFNγ production. However, one possible explanation is the high reactivity of γδ T cells compared to other T cell subsets, as well as their localization in the tumor microenvironment rather than in healthy tissue or blood.

      • In Fig3h, it is not clear what is meant with "IL-8 / IL-10", please explain.

      This analysis shows the percentage of cells that are positive for both IL-8 and IL-10.

      The figure and its legend have been amended for clarity.

      Fig.4 - Please provide the values and the statistical analyses for all cell populations.

      We performed statistical analyses (Wilcoxon signed-rank test) for all cell populations and provide the data in the Supplementary Fig. 5A. However, due to the heterogeneity of ROIs, a significant difference was observed for tumor cells, which were more prevalent more in the neighborhood of CD73- than CD73+ γδ T cells (p

      Fig.5/6 - In Fig5, the authors state that 8 cell populations were differentially enriched around CD73+ or -neg γδ T cells. However, in Fig4, only 4 of these populations are mentioned. Please add the remaining 4 to fig4 and name the 8 clusters in fig5 in line with the gating strategy used in fig4.

      We thank the reviewer for highlighting that the description of Figure 5 in our text was unclear. We have revised the text for clarification and specify that based on Supplementary Figure 7, which shows the number of cells for each cell type found in the neighborhood of all γδ T cell subsets (CD73- and CD73+) in all ROIs. We decided to perform phenotypic analysis on only four cell types (those with a sufficient cell counts), setting the cutoff at 700 cells.

      The four cell types are analyzed in Figures 5 and 6. Figure 5A shows tumor cells, with eight clusters identified, while Figure 5B represents fibroblasts, with seven clusters identified. Figure 6A shows CD4 T cells, with eight clusters, and Figure 6B CD8 T cells, with ten clusters.

      • Furthermore, the authors want to show in fig5 how the phenotype of these 8 cell populations differs depending on whether they are close to CD73+/- γδT cells. tSNE plots do not allow illustrating this (BTW: the plots lack the colour code). The frequencies of the cell types/phenotypes in the vicinity of CD73+/- γδ T cells need to be depicted differently (e.g. bar graphs). Furthermore, the claim that differences are observed, needs to be supported by showing the statistical values obtained. The same argument applies to Fig6 and SF8.

      We have added the code color of tSNE plots in Figures 5, 6, and SF9. The tables in Supplementary Figure 8 show the percentage of cells in each cluster within the vicinity of CD73+/- γδ T cells, allowing for an investigation of the neighborhood of each γδ T cell subset.

      • Fig6: This reviewer disagrees with the notion that the expression of HLA-DR or CD279 is enough to imply a functional state of the cell.

      As requested by the reviewer, we have amended the text to clarify that: “Cluster analysis revealed that CD4+ T cells in contact with effector γδ T cells (i.e., the CD73- subset) express HLA-DR and/or PD-1, both activation markers.”

      Supplements - SF2a: please check the labels; how can CD8+ CD4+ cells be labelled 'CD8 T cells' and why do the authors exclude the possibility that e.g. B cells could express HLA-DR?

      We thank the reviewer for pointing out the error in Figure 2a, which has now been corrected. The CD8+ cells have been relabeled as 'CD8 T cells,' and the B cells are now shown expressing HLA-DR.

      • SF7 is not clear to this reviewer. If the clusters represent different cell types, how can e.g. tumours be found in all of them?

      We believe the reviewer is referring to SF9 rather than SF7 in this comment. SF9 analyzes γδ T cells in proximity to CD73+ and CD73- γδ T cells. As in Figures 5 and 6, γδ T cell neighbors of CD73+ and CD73- γδ T cells were identified, and a clustering analysis revealed five distinct clusters. Tumor cells was not analyzed in this figure. We have clarified the text to prevent confusion

      • SF9b lacks a negative control and a statistical analysis, and SF9c lacks the summary data and statistical analysis.

      As requested by the reviewer, we have performed statistical analysis for SF9b and added a negative control. Additionally, we have included summary data with a statistical analysis in SF9c.

      • In the text, the authors state, "We and others reported that in ovarian tumors, IL-6 is mainly produced by CAFs and induces CD73 expression by γδ T cells (Extended Data Fig. 9 and 15)." The data in SF9b are not enough to make this claim and reference 15 is a review article that does not even mention 'IL-6'. This needs to be corrected.

      We have updated Supplementary Figure 9B to provide more robust data. We thank the reviewer for pointing out our error. The publication we intend to cite is a research article, not a review.” Hu G, Cheng P, Pan J, Wang S, Ding Q, Jiang Z, et al. An IL6-Adenosine Positive Feedback Loop between CD73+ γδ Tregs and CAFs Promotes Tumor Progression in Human Breast Cancer. Cancer Immunol Res. 2020;8:1273–86.” we made the correction in the manuscript.

      Reviewer #2 (Significance (Required)):

      In this manuscript ("Deciphering the tumor-infiltrating CD73+ regulatory γδ T cell ecosystem associated with poor survival of patients with ovarian cancer"), Chabab et al. report on the phenotype and location of CD73+ γδ T cells in ovarian cancer.

      CD73+ γδ T cells can be immunosuppressive via the production of cytokines (IL-8, IL-10) and the expression of PD-L1. Here, the authors investigated the phenotype and location of CD73+ and -neg γδ T cells in ovarian cancers with a particular focus on the cells surrounding the γδ T cells in the tumour.

      Overall, the study is informative and well-performed. However, the way some of the data are presented does not allow to fully evaluate them. Besides this, this reviewer only has some minor comments.

      To enable a full evaluation of the data, we have added new figures, amended others, and clarified certain points in the text, hoping that the reviewer will find these modifications sufficient to consider our manuscript for publication.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this article, Chabab et al. analyze sample from ovarian cancer patients, with a specific focus on gamma-delta T cells (Tγδ). The authors claim that CD73+ cells are associated with poor prognosis in ovarian cancer, and that CD73 expression is correlated with the composition and polarization of the microenvironment. Using imaging mass cytometry data, they also claim that the neighborhoods of CD73+ and CD73- Tγδ cells differs in composition.

      Major comments: - The prognostic value of CD73/NT5E is analyzed in TCGA-Ovarian RNAseq data. In the context of this article, it is implied that this should reflect CD73 expression by Tγδ but it is likely that other cell types are contributing to bulk CD73 expression.

      We appreciate the reviewer’s insightful comment. In fact, due to low proportion of Tγδ in TME we have stratified on NT5E total expression. We agree that this signal likely includes contributions from multiple cell types beyond γδ T cells, such as cancer-associated fibroblasts and endothelial cells, which are also known to express CD73 (NT5E gene).

      The stratification of patient based on NT5E total expression showed an association between high NT5E expression and poorer overall survival and increase in Tγδ gene markers (TRDC, TRGC1/2) and percentage of cells (Fig2E) in the patient cohort (Fig2C). To clarify this point, we have revised the Results and Discussion sections to explicitly state that the TCGA-based survival analysis reflects total intratumoral NT5E enrichment and cannot be attributed specifically to γδ T cells. We now refer to this analysis as an independent validation of the clinical relevance of CD73, while noting that its cell-type-specific contribution remains to be resolved in future studies using spatial transcriptomics or deconvolution approaches.

      • In the analysis of scRNAseq data, multiple public datasets are aggregated and the overall level of CD73 is used for stratification. Is this stratification confounded by dataset of origin?

      We thank the reviewer for raising this critical point regarding potential batch effects and dataset-driven bias in our stratification strategy. To address this, we performed additional analyses to assess whether NT5E (CD73) expression is confounded by dataset of origin.

      First, we verified that all single-cell datasets (GSE147082, GSE241221, and GSE235931) were processed using a harmonized integration workflow, including SCTransform normalization and integration using Seurat’s reciprocal PCA approach, which effectively minimizes batch-related variability.

      • The last part of the results discusses the role of IL6 produced by CAFs on Tγδ, but very little data is shown to support the proposed mechanisms. The authors report expression of CD73 by flow cytometry on blood-sorted Tγδ following culture with IL2, IL6, IL21. The data shown however only represents one donor and should therefore be repeated on multiple donors.

      We appreciate the reviewer’s insightful comment. We have added data and updated Supplementary Figure 9 to provide more robust findings. Regarding the role of IL-6, our data in ovarian cancer are consistent with the study by Hu et al. in breast cancer, which reports an IL-6-Adenosine Positive Feedback Loop between CD73+ γδ Tregs and CAFs that promotes tumor progression in human breast cancer."

      Minor comments:

      • The authors stratify their cohort by Tγδ density but I could not find the threshold used for stratification

      The threshold has been added in figure and text.

      • Labels for CD8+ and CD4+CD8+ T cells are swapped in Extended Data Fig 2A

      The correction of figure has been made.

      • The legend of graphs shown in multiple panels (for instance: Fig 3F) are not very clear: is each dot representing the average expression of one cluster in one patient?
      • In figure 3G there is no color scale, the authors need to add it with appropriate units so that readers can interpret the data shown

      These points have all been amended and corrected in the next version of the manuscript.

      Reviewer #3 (Significance (Required)):

      This paper shows interesting imaging mass cytometry data of ovarian cancer specimens. The focus on CD73 expression by Tγδ is fairly specific, although the exonucleotidases pathway involving CD73 is currently extensively studied for its immunosuppressive role.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this article, Chabab et al. analyze sample from ovarian cancer patients, with a specific focus on gamma-delta T cells (Tgd). The authors claim that CD73+ cells are associated with poor prognosis in ovarian cancer, and that CD73 expression is correlated with the composition and polarization of the microenvironment. Using imaging mass cytometry data, they also claim that the neighborhoods of CD73+ and CD73- Tgd cells differs in composition.

      Major comments:

      • The prognostic value of CD73/NT5E is analyzed in TCGA-Ovarian RNAseq data. In the context of this article, it is implied that this should reflect CD73 expression by Tgd but it is likely that other cell types are contributing to bulk CD73 expression.
      • In the analysis of scRNAseq data, multiple public datasets are aggregated and the overall level of CD73 is used for stratification. Is this stratification confounded by dataset of origin ?
      • The last part of the results discuss the role of IL6 produced by CAFs on Tgd, but very little data is shown to support the proposed mechanisms. The authors report expression of CD73 by flow cytometry on blood-sorted Tgd following culture with IL2, IL6, IL21. The data shown however only represents one donor and should therefore be repeated on multiple donors.

      Minor comments:

      • The authors stratify their cohort by Tgd density but I could not find the threshold used for stratification
      • Labels for CD8+ and CD4+CD8+ T cells are swapped in Extended Data Fig 2A
      • The legend of graphs shown in multiple panels (for instance : Fig 3F) are not very clear : is each dot representing the average expression of one cluster in one patient ?
      • In figure 3G there is no color scale, the authors need to add it with appropriate units so that readers can interpret the data shown

      Significance

      This paper shows interesting imaging mass cytometry data of ovarian cancer specimens. The focus on CD73 expression by Tgd is fairly specific, although the exonucleotidases pathway involving CD73 is currently extensively studied for its immunosuppressive role.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript ("Deciphering the tumor-infiltrating CD73+ regulatory γδ T cell ecosystem associated with poor survival of patients with ovarian cancer"), Chabab et al. report on the phenotype and location of CD73+ gd T cells in ovarian cancer. CD73+ gd T cells can be immunosuppressive via the production of cytokines (IL-8, IL-10) and the expression of PD-L1. Here, the authors investigated the phenotype and location of CD73+ and -neg gd T cells in ovarian cancers with a particular focus on the cells surrounding the gd T cells in the tumour. Overall, the study is informative and well-performed. However, the way some of the data are presented does not allow to fully evaluate them. Besides this, this reviewer only has some minor comments.

      General comments:

      • The data provided in this manuscript are descriptive/correlative, and as such, causation cannot be inferred. Therefore, the language needs to reflect this; statements like "we investigated the impact of CD73+ regulatory γδ T cells in ovarian cancer" (L89) and "CD73+ γδ T cells were in close contact with more aggressive tumor cells" (L426), among others, are incorrect without functional data. The authors are advised to adjust the text throughout.
      • Please make the figure legends self-explanatory without the need to search for the information in the M&M. For example, the graphs in fig1 and 3 contain many dots, but it is not explained what these dots represent. Please also add n for each experiment shown and state how often the experiment was performed independently.
      • It would be helpful for the reader if abbreviations introduced in the M&M were also explained the first time they appeared in the results section.
      • Please explain all abbreviations, e.g. FIGO, CST, NT5E, etc.
      • L235: typo 'that' instead of 'than'; L258 'reduced'; L259 'fig1d-f'; L451f twice 'CD73+'; 'naive' instead of 'naïve' throughout; SF2 legend: '2f' instead of '3f', SF9 legend: '1.105'.
      • L280ff: "Tumor cells ... were the most important cell type" - it may be clearer to use 'most frequent';

      M&M

      • Please be consistent, if you provide catalogue numbers or dilutions (antibody, reagents) [which is good, maybe even adding the RRID number], do so for all items.
      • The M&M does not state for the CAFs how long they were cultured before the supernatant was taken for the cytokine measurements.
      • For the IL-6 ELISA, it is stated that the "cells were harvested"; what happened to them, and how do you get any SN from these cells?

      Figures

      Fig.1:

      • The authors used the word 'predict' in the heading, which seems not appropriate for a retrospective study; something like 'correlate' seems better.
      • Similarly, the title of the figure legends claims that the 'impact of gd T cells' is shown, while only a correlation is presented.
      • For Fig1a-c, only summary data are presented. Please add exemplary pictures as well.
      • For Fig1d-f, the label for the x-axis is missing.

      Fig.2

      • It seems funny to call the patients 'naïve', maybe 'untreated' is clearer.
      • The graph in Fig2e does not allow comparing the cell frequencies properly. This would require either bar graphs or a table. Furthermore, the statistical analysis is missing. Without that, a statement like "associated with higher proportion of CAFs" (L265) is not supported.

      Fig.3

      • For Fig3b+c, the IMC are derived from 4 patients (not clear for the flow data) - did the authors noticed differences between patients?
      • For Fig3e, the description in the text does not reflect the figure, e.g. cluster 1 does not show LAG3 expression, but this is claimed in the text (other descriptions are off as well).
      • In Fig3h, the authors stain cytokines in gd T cells purified from ovarian cancer samples. The text seems to imply that the cytokine staining was performed directly ex vivo, without an in vitro stimulation of the cells, e.g. with PMA/ionomycin (if so, the description is missing). In any case, the values appear surprisingly high. Exemplary data are needed to clarify how the gating was done (for gd T cells and the cytokines) and what the primary data looked like.
      • In Fig3h, it is not clear what is meant with "IL-8 / IL-10", please explain.

      Fig.4

      • Please provide the values and the statistical analyses for all cell populations.

      Fig.5/6

      • In Fig5, the authors state that 8 cell populations were differentially enriched around CD73+ or -neg gd T cells. However, in Fig4, only 4 of these populations are mentioned. Please add the remaining 4 to fig4 and name the 8 clusters in fig5 in line with the gating strategy used in fig4.
      • Furthermore, the authors want to show in fig5 how the phenotype of these 8 cell populations differs depending on whether they are close to CD73+/- gdT cells. tSNE plots do not allow illustrating this (BTW: the plots lack the colour code). The frequencies of the cell types/phenotypes in the vicinity of CD73+/- gd T cells need to be depicted differently (e.g. bar graphs). Furthermore, the claim that differences are observed, needs to be supported by showing the statistical values obtained. The same argument applies to Fig6 and SF8.
      • Fig6: This reviewer disagrees with the notion that the expression of HLA-DR or CD279 is enough to imply a functional state of the cell.

      Supplements

      • SF2a: please check the labels; how can CD8+ CD4+ cells be labelled 'CD8 T cells' and why do the authors exclude the possibility that e.g. B cells could express HLA-DR?
      • SF7 is not clear to this reviewer. If the clusters represent different cell types, how can e.g. tumours be found in all of them?
      • SF9b lacks a negative control and a statistical analysis, and SF9c lacks the summary data and statistical analysis.
      • In the text, the authors state, "We and others reported that in ovarian tumors, IL-6 is mainly produced by CAFs and induces CD73 expression by γδ T cells (Extended Data Fig. 9 and 15)." The data in SF9b are not enough to make this claim and reference 15 is a review article that does not even mention 'IL-6'. This needs to be corrected.

      Significance

      In this manuscript ("Deciphering the tumor-infiltrating CD73+ regulatory γδ T cell ecosystem associated with poor survival of patients with ovarian cancer"), Chabab et al. report on the phenotype and location of CD73+ gd T cells in ovarian cancer. CD73+ gd T cells can be immunosuppressive via the production of cytokines (IL-8, IL-10) and the expression of PD-L1. Here, the authors investigated the phenotype and location of CD73+ and -neg gd T cells in ovarian cancers with a particular focus on the cells surrounding the gd T cells in the tumour. Overall, the study is informative and well-performed. However, the way some of the data are presented does not allow to fully evaluate them. Besides this, this reviewer only has some minor comments.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors provide a detailed characterizsation of the tumor microenvironment (TME) of 91 ovarian cancer patients, brokend down in long and short term survivors (post 5 years). The focus on the role of a subgroup of T cells, gamma/delta (gd) T cells with reported anti but also pro tumorigenic properties, Prior work of the lab has established a link between a subgroup of gd T cells expressing CD73 and poor prognosis, due to the ability of these cells to produce immunosuppressive cytokines, such as IL10 or IL8 and the production of adenosine, by CD73, in the micromilieu. The data is further backed up by the analysis of fresh tumor specimens and tissue culture work.

      Here they continue this story by investigating the TME using tumor microarrays (91 samples), single cell RNA seq (12 patients), imaging mass cytometry (> 30 samples) and flow cytometry (form confirmatory purposes) to define cellular neighborhoods of CD73+ and CD73- gd T cells. THis revealed differences in cellular composition and spatial transcriptome analysis further helped to define the ttranscriptomes in gd T cells, cancer cells and cancer associated fibroblasts.

      The authors conclude the in ovarian cancer gd T cells expressing CD73 dampen anti-tumor immun ity and propose detection and evaluation of CD73+ gd T cells as prognostic marker.

      The manuscript is well written, and despite its descriptive nature, easy to follow. Data is presented in a clear and easy to read fashion.

      Significance

      Using a well characterized cohort of ovarian cancer patients with detailed clinical follow up the authors report on the predictive power of a subset of gd T cells expressing CD73, with immune suppressive / regulatory capacity, reading out patient survival in high grade serous ovarian cancer, a still deadly disease. As such the identificaiton of reliable markers predicting survival is a clear medical need. These findings contrast others made in different solid cancers, suggesting tumor typ specific differences, which are only starting to emerge, but are of clear clinical relevance.

      What is unclear to me and needs to be addressed, is if these patient specimens were taken before or after initial therapy, whether the samples have been stratified according the treatment that they got, assuming it will be mostly platinum compounds (but maybe not), and that the p53 status of the tumors are (if genetics are available this would help to add some granularity to the study that, as as it stands is largely descriptive, even though with extremely high resolution. This data should be available and could be integrated.

      Some minor issues

      • I would stick to CD73, and not mix it with NT5E, which is confusing at first (Fig 2).
      • I would ask to compare the overall survival of CD73+ between densities - is it still significantly different in fig 1 - meaning is it about density, CD73 expression, or both. Comparing survival of tumors with a low density of CD73+ gd T cells does not seem to be different from those having a low density of CD73- gd T cells, which could be considered in data interpretation. Same for high density tumors.
      • figure 1, caption should include the word "patients" at the end, I guess.
      • labelling and font can be improved in many panels, eg. the dot plots in Fig 2, panel B, right, same for panel C and D
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors describe a modified version of single molecule Fluorescence In-situ Hybridization (smFISH) method they have adapted to successfully measure RNA levels in isolated human donor T cells, that are very hard to grow on glass and have small amounts of cytoplasm relative to cell size, a challenge for all researchers working with small cells that only grow in suspension cultures. Using this methodology, the authors have queried transcription status and mRNA localization and fate of the two cytokines, IFNG and TNF, upon T-cell activation. The main findings of the study are: (1) activation of T-cells results in rapid accumulation of IFNG an TNF mRNA; there is differential distribution of the cytokine mRNAs between the nucleus and cytoplasm with greater accumulation in the cytoplasm as activation progresses resulting in increased protein production. There is significant transcriptional heterogeneity in response to T-cell activation. (2) The cytokine mRNA turnover appears to be controlled by translation. (3) HUR, an RBP appears to control poly(A) tail length of TNF mRNA in response to T-cell activation. The successful implementation of a modified smFISH protocol used in this study is a welcome resource for all labs that want to study small human primary cells that are difficult to culture on glass coverslips and grow as suspension cultures. Although the authors have very exciting observations, they have shied away from discussing their results in the context of the biology of T-cell activation and how their observations may explain prior studies on cytokine gene expression patterns during T-cell activation.

      In my opinion, the authors should discuss their observations in depth from the context of T-cell activation and cytokine expression. I have enumerated several specific comments that may help the authors in revising the manuscript if they choose to do so.

      Specific comments:

      1. Based on the data presented in Figure 1 D and E, it is clear there is depletion of IFNG and TNF mRNA 4hrs after activation and then the mRNA levels go up at 6h in both cases. However, the authors suggest that only TNF mRNA is depleted at 4hrs of activation (lines 169-172). The median number of IFNG mRNA gradually decreases after 1h of activation and reaches a low at 4h and then substantially increases by 6h. Did the authors measure gene expression of these mRNAs at later time points in the activation process? Perhaps transcription is coupled to mature mRNA levels in the cytoplasm and transcription is ramped up again once the cytoplasmic mRNA levels reach a lower threshold. Is this just an anomaly of the system or is gene expression pattern of cytokines upon T-cell activation cyclical?
      2. In data presented Figure 2 and Suppl Figure 2, the authors show correlation between dual cytokine expression and biallelic expression. However, not all dual cytokine expressing cells show bi-allelic expression of both cytokines. It will be useful to know what fraction of cells are biallelic for both genes. Since the experiment was done using two color smFISH, a scatter plot will cluster those dual expressor cells for both cytokines that are also bi-allelic for both genes. Extending this further would be to systematically address protein expression in the various combination of expression patterns. Combining smFISH with immunofluorescence will help address this. Overall, these results will be helpful in getting a better understanding of gene expression patterns during T-cell activation.
      3. The mRNA localization data presented in Figure 3A and the associated supplemental figure: A better analysis and representation of the data presented in 3A would be a scatter plot of individual cells for their nuclear and cytoplasmic localization of mature mRNA. The authors might also want to extend this analysis based on the data presented in Figure 2 for dual expressors and bi-allelic expression. In other words, do cells with bi-allelic expression have more mRNA localized in the cytoplasm, and does this hold true in dual expressor cells? In the context of translation dependent decay of mRNA, do the dual expressor cells with biallelic expression fare better thereby producing and secreting cytokines continuously?
      4. The data presented for IFNG in Figure 4 is quite intriguing. In HuR-KO cells at 2h post induction, two of the three donors cell lines have only a small fraction of cells producing protein compared to the controls, however, they are substantially higher than the KO cells at time "0". Surprisingly, the amount of protein produced by these cells (panel B), although statistically lower than the control, is substantially higher than KO cells at "0"h. Does the lone donor cell line with higher number of protein producing cells contribute to majority of the protein produced? There appears to be substantial difference between the three donor cell lines in the number of protein producing cells and mature IFNG mRNA after activation (Suppl Figure 1G & H). The authors may wish to compare the results before combining the data of all three donor cell lines before interpreting the data.
      5. Also intriguing, HuR knock out results in a significant increase in transcription of IFNG at time "0" (Figure 4, panel E). Despite this, there is a significant loss in transcription of IFNG 2h post activation. However, there is significant accumulation of mature mRNA (panel D). Combined with the protein expression data presented in panels A & B, and the fact that translation induces mRNA decay, how do the authors reconcile this data?
      6. The differential effect of HuR knock out on poly(A) tail length of IFNG and TNF mRNA is of great importance and the most striking finding in this study! It is generally accepted that poly(A) tail length contributes to mRNA stability and survival. The results presented in Figures 4 and 5 argue otherwise. Only a small fraction of TNF mRNA have full length poly(A) tails, however, the number of mature TNF mRNA in KO cells is much greater than the control even at "0"h. In addition, the TNF mRNA appear to be well translocated into the cytoplasm and effectively translated. Given these conflicting observations, what possible mechanism do the authors envision that can explain this result.<br /> Again, plotting the data presented in Figure 5A as a scatter plot between # of RNA in the cytosol vs nucleus will give a better picture of the localization changes in individual cells.
      7. A more elaborate discussion of the results as it relates to the biology of cytokine gene expression during T-cell activation will immensely strengthen the manuscript.

      Minor comment:

      1. Images of cells with smFISH data (Figures 1, 3 & 4) must be bigger for better visualization. Show images with only a couple of cells enlarged to show the mRNA spots more visibly. Include images with more cells in the supplement instead.

      Referee's cross-commenting

      I must confess I am not an immunologist, so my knowledge of the intricacies of gene expression in T-cells in very limited. However, I do have a fair sense of transcription regulation and use single molecule approaches, especially smFISH, to address these questions. I agree with the other reviewer the study is of significance, especially the advancement in the ability to do smFISH in primary cells, a challenge that I know first hand. I also have to agree with the other reviewer that the discussion was too short and the authors shied away from the bigger picture of being able to comment on regulation of expression of cytokines during T-cell activation. It is remarkable that they see heterogeneity in gene expression of the individual target genes and bi-allelic expression. The other point of interest is the difference in p(A) tail length and its potential role in regulating TNF gene expression.

      Significance

      The successful implementation of a modified smFISH protocol used in this study is a welcome resource for all labs that want to study small human primary cells that are difficult to culture on glass coverslips and grow as suspension cultures.

      Overall, this work is of high quality and can be better presented to fully explore and discuss the biological implications of the observations from the study. It is not clear to me if the authors wished to present this manuscript reporting an advancement in technology tool to study gene expression during T-cell activation, or a more in-depth study of gene expression.

      The study will benefit the larger community that use single molecule approaches to understand genew expression.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Lattanzio and colleagues uses advanced single molecule FISH, adapted specifically for T cells, to examine RNA transcriptional dynamics in polyclonally stimulated human T cells. By examining the subcellular localisation of both IFNg and TNF mRNAs (nascent and mature), they are able to characterise rate things like rate of transcription and RNA stability. Key findings include the identification of bi-allelic vs mono-allelic transcription at the single cell level which maps to polyfucntion vs monofunctional T cells. Moreover, they identified distinct mechanisms regulating RNA stability and the role of RNA binding protein HuR in mediating that.

      Overall, this is really a proof of concept paper that uses elegant technologies and analysis tools showing just how much information can be obtained from this approach. The ability to examine RNA dynamics and the imapct of RNA binding proteins in regulating RNA stability/translation/transriptoin at a single cell level will be an advance for the field, not just those interested in T cell biology but all cell types.

      There are no specific experimental issues that came to mind that need to be addressed and it is really only some minor comments, particularly for the discussion that would strengthen the implicaitons of the study.

      1. I might have missed it but it wasn't exactly clear from the results or from the methods exactly how nascent vs mature RNA was discriminated. Was this just from the subcellular localisation (i.e nuclear vs cytoplasmic)? RNA imaged close to the TSS? If so, this should be noted somewhere. If there was some other way of precisely ascribing RNA status, this should be outlined (use of primers that targeted intergenic sequences).
      2. The discussion was very brief and would have benefited from a bit more speculation about the implications of their findings. Specifically, why would there be a need for different cytokine RNAs to be regulated in such distinct ways (IFNg vs IL-2)? Do the authors have any thoughts? Another point is the proposed explanations for the distinct T cell subsets observed that produce cytokines at different levels. While the authors propose three possible explanations, they are presented as being mutually exclusive, of course it could be a combination of all three. Moreover, putting the importance of higher functioning T cells (i.e those that produce more cytokines) into context is also important. Early studies from La Gruta et al., (J Immunol, 2004; doi: 10.4049/jimmunol.172.9.5553), Betts et al, (Blood, 2006 doi.org/10.1182/blood-2005-12-4818) and Darrah et al. (Nat Med, 2007 doi.org/10.1038/nm1592) linked higher cytokine production/multifunctionality to better immune outcomes, while Denton et al (PNAS, 2011, 10.1073/pnas.1112520108) linked the extent of cytokine production to cellular differentiation (and epigenetic landscape at the TNF and IFNg locus). These studies should be cited to provide a setting where this approach will be relevant.

      A minor point relates to line 329, that sentence stating "Even though most activated Teff cells express cytokine mRNAs, they display a two order of magnitude difference in mRNA and protein expression."

      It is not clear what this is relevant or compared to. A two order of magnitude difference compared to what?

      Significance

      This is a proof of concept study that demonstrates the utility of the T cell smFISH approach to delineate high resolution analysis of cytokine RNA dynamics at a single cell level, for multiple cytokine RNA species. It clearly provides interesting biology and further understanding of RNA dynamics in activated T cells. I especially appreciated the observation of bi-allelic vs mono-allelic transcription, and the ability to explore the role of RNA binding proteins in RNA regulation.

      This technique will have broader applicability and hence will be of interest to those outside T cell immunology. It only requires some minor corrections/revisions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Significance (Required)):

      • *

      This study aims to bridge a gap between the mechanisms of preeclampsia and neurodegenerative disorders, and this through the existence of misfolded proteins in the preeclamptic placenta which has been reported before, in particular the beta amyloid protein, known as operative in Alzheimer's disease (AD) in particular.

      • *

      Our response: We sincerely appreciated the reviewer’s constructive comments.

      • *

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      Minor remarks

      1. It is classical now to present in extenso the WB as supplementary data for Fig 3, 4 and 5. Our response: We will include the full blots in Supplemental information.

      2. *

      It seems that the beta amyloid signal is not stronger for the early onset and the late onset PE samples. Have the authors an interpretation?

      • *

      Our response: The current manuscript includes both early-onset and late-onset cases. Thus, we are certain that amyloid beta deposition is involved in both early- and late-onset PE. We will discuss this matter.

      The figure 4b does not show the BeWo labeling in forskolin with or without beta amyloid peptides, why? It would be illustrative to show a decrease in the fusion processes

      • *

      Our response: In Fig. 4A, we pretreated BeWo cells with Aβ fibrils and after that, cell fusion was induced by Fsk. On the other hand, in Fig. 4b, we treated BeWo with Aβ fibrils and investigated the protein levels and subcellular localization of ZO-1 and E-cadherin. Fig.4b shows that expressions of proteins involved in cell-cell interaction were reduced by Aβ fibril treatment without Fsk. Cell-cell interaction before syncytialization is required for cell fusion, and these proteins disappear after cell fusion. Thus, our results demonstrate that elimination of cell-cell interaction by Aβ fibrils resulted in reduced cell fusion induced by Fsk. This is why we treated BeWo cells with Aβ fibrils before the induction of cell fusion by Fsk, and BeWo labeling in forskolin with or without Aβ fibrils will result in a loss of ZO-1 and E-cadherin regardless of the occurrence of cell fusion. We will discuss this matter in more detail.

      • *

      How do the authors explain that exposure to fibrils did not seem to slow down significantly the fusion process, even though markers are decreased?

      • *

      Our response: Since we previously demonstrated that loss of membrane E-cadherin slows the fusion (Iwahashi et al., Endocrinology, 2019, PMID: 30551188), we believe that reduction of membrane localization of E-cadherin also slows the fusion process. We will discuss this matter further.

      • Could the authors attempt a labeling with the Di-8, an interesting quantitative marker of cell fusion (see ref PMID: 38019394).*

      • *

      Our response: We have shown that pretreatment of BeWo cells and human primary cytotrophoblasts (CTBs) inhibited induction of syncytin-1 and β-hCG. Syncytin-1 is a critical driver of syncytialization and formation of the syncytiotrophoblast layer, and β-hCG is one of the major products of syncytiotrophoblasts. Thus, induction of these proteins is widely used as syncytialization markers of trophoblasts. On the other hand, Di-8-ANEPPS is a potentiometric fluorescent dye that may be used assess cell fusion simply and economically. Although we understand the robustness of this method, we believe that the current data are sufficient to demonstrate that Aβ fibril pretreatment inhibited syncytialization of BeWo cells and CTBs.

      * *

      • *

      Reviewer #2 (Significance (Required)):

      • *

      Investigating the deposition of Aβ in the placenta could enhance our understanding of pregnancy complications such as PE, fetal growth restriction, and neurodevelopmental risks. However, further research on this topic is necessary.

      • *

      Our response: We sincerely appreciate the critical reading and constructive comments of the reviewer. We agree that further research on protein aggregation and the pathogenesis of preeclampsia is necessary. We will discuss this matter in the discussion.

      • *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • *

      Major comments

        • If CTBs are treated with Aβ, and if it affects STB, what happens with EVT? Why didn't they check with EVT if the authors wanted to link with PE?*
      • *

      Our response: We thank the reviewer for the critical comment. We investigated Aβ generation by an EVT model cell, HTR8/SVneo cells. We found that HTR8/SVneo cells produced much less amount of Aβ compared to BeWo cells (unpublished). Gao et al. reported that Aβ aggregates induced autophagy in HTR8/SVneo cells and suggested that an excessive autophagy may be detrimental and be involved in the development of preeclampsia (Gao et al., J Mol Histol, 2024, PMID: 38777993). We will discuss this matter in the discussion.

      On the other hand, we have already investigated the effects of Aβ monomers in EVTs, and discovered that even low levels of Aβ produced by EVTs promote EVT invasiveness and have a physiological function. Please see below. We will add these new data in the revised manuscript.


      • *

      • *

        • Did the authors look for pathologies related to Aβ deposition on PE placentas?*
      • *

      Our response: We did not observe any pathologies near the Aβ deposition.

        • Line# 103, the IF images don't show that BACE1 is around HIF1. There are no merged images, and the results are over- or underestimated.*
      • *

      Our response: We agree with the reviewer. There is a time discrepancy between HIF activation and BACE1 induction. Our immunohistochemical analysis showed that PE placentas are in a chronic hypoxia condition and that BACE1 was increased in PE placentas. Our cell-based assay supports that HIF1α stabilization by Roxadustat increased BACE1 levels in BeWo cells. We will tone down the results section of the immunohistochemical analysis.

      • *

        • What is the intended purpose of using Roxadustat? If it inhibits HIF1α, could you explain the reason behind the increased expression of HIF1α? Furthermore, is there evidence to support the efficacy of this compound?*
      • *

      Our response: Roxadustat inhibits the proline hydroxylation of HIF1α and thereby inhibits the ubiquitination and degradation of HIF1α via the ubiquitin proteasomal system. In this study, we used Roxadustat as a HIF1α stabilizer to investigate whether BACE1 levels are increased with hypoxia and HIF1. Our data showed that treatment of BeWo cells with Roxadustat increased HIF1α levels, supporting the efficacy of Roxadustat. We will include this information in the result section for clarity.

        • Is Aβ deposition very specific to PE, or can it also occur for other reasons during pregnancy?*
      • *

      Our response: To date, no report has been found showing Aβ deposition in placentas other than PE. The deposition of protein aggregates, including those of Aβ and transthyretin, has previously been reported in PE. However, the presence and role of these protein deposits in placentas under pathological conditions, in addition to PE, remains to be elucidated. Several stresses such as hypoxia and ER stress may lead to deposition of protein aggregates in the placenta. These points will be discussed in the discussion.

      • *

        • BACE1 is expressed in Normal#2 and #3 but not in #1, #4, and #5. Why is this expressed in #2 and #3? Is there anything wrong with these samples? If patients had gestational hypertension or some other complications?*
      • *

      Our response: We did not find any other complications in the normal placentas. In the brain, Aβ is constitutively generated and thus, thought to play physiological roles. The amount of Aβ is determined by the balance between the production and the clearance. A sustained imbalance of Aβ production and Aβ clearance will lead Aβ aggregation and deposition. We found that BeWo cells expressed BACE1 in a normoxic condition and thus, normal placentas may express BACE1 and generate small amounts of Aβ. Our results suggested that chronic hypoxia in PE placentas resulted in increased BACE1 expression and increased Aβ production, which may eventually result in Aβ aggregation and deposition, because the aggregation process of Aβ is concentration-dependent. We will include this point in the revised manuscript.

      • *

        • PE placentae were compared with GA matched placentae. What is the expression of BACE1 and RB4CD12 in term control placentae?*
      • *

      Our response: We used RB4CD12 as a protein aggregation marker. As shown in Table 1, the current study includes 3 placentas whose gestational ages are over 37 weeks. We did not observe RB4CD12 and Aβ deposition in gestational age-matched control and observed BACE1 expression in one 37 weeks gestational age control. We will include these points in the result section.

      • *

        • If AB fibril deposition is hypoxia dependent, what happens at the early gestation, where oxygen conc is 1-2%?*
      • *

      Our response: At the early gestation, physiological hypoxia promotes the EVT invasion and helps the remodeling of spiral arteries for oxygen supply. Please see our response above. Severe hypoxia on the CTB side in early gestation may result in a miscarriage before PE develops.

      Minor comments

        • The authors only performed IF and IHC. Please confirm and correct the methods accordingly.*
      • *

      Our response: We thank the reviewer for pointing this out. We will correct the methods.

      • *

        • Was the BeWo-b21 clone cell line used for all the experiments in this paper? This is the only clone that can be used for BeWo-STB models.*
      • *

      Our response: We do not have information about the clone number of BeWo cells used in this study. We purchased them from the American Type Culture Collection (Manassas, VA) and they were authenticated by JCRB Cell Bank (National Institute of Biomedical Innovation Japan, report no. KBN0410). By using the same cells, we published three articles in which we successfully analyzed syncytialization of BeWo cells (Yamamoto et al., Endocrinology, 2017, PMID: 28938427; Iwahashi et al., Endocrinology, 2019, PMID: 30551188; Matsukawa et al., Biomolecules, 2022, PMID: 36008943). We would like to apologize for our mistake in the description of BeWo cells in the methods section and thank the reviewer for providing us with an opportunity to correct our mistake. We will note that BeWo cells were purchased from the American Type Culture Collection (Manassas, VA) and authenticated by JCRB Cell Bank (National Institute of Biomedical Innovation Japan, report no. KBN0410) in the methods section, and will upload the authentication report KBN0410 as a review process file.

      • *

        • Have all the experiments on BeWo only been performed once?*
      • *

      Our response: We repeated 6 experiments (the repetitions are biological, not technical, replicates). The results are shown as means ± SEM (n = 6) as stated in the Figure legends.

      • *

      * *

      • *

      Reviewer #3 (Significance (Required)):

      • *

      While Aβ is present in human placentas and accumulates in preeclamptic placentas, the production and role of Aβ in the human placenta remain unclear. The current findings suggest that increased Aβ production in cytotrophoblast by hypoxia may lead to the formation of Aβ fibrils, which inhibit syncytiotrophoblast formation and are detrimental to pregnancy, revealing a novel role of Aβ fibrils in the pathogenesis of preeclampsia.

      • *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      • *

      The authors found that Amyloid β suppressed cytotrophoblasts syncytialization, which is innovative. The authors used human patient samples and human primary CTB culture which are powerful data.

      • *

      Our response: We appreciate the reviewer’s thoughtful feedback and support.

      • *

      Fig. 3. The authors used Roxadustat to stimulate HIF-1α and showed BACE1 increase. It would be better to have the cells in real hypoxia condition.

      • *

      Our response: There is a time discrepancy between the increase in HIF-1α levels by hypoxia and induction of BACE1. Because the purpose of this experiment is to show that increased HIF1-α correlated BACE1 induction, we used Roxadustat as a HIF1-α stabilizer and showed that sustained induction of HIF increased BACE1 levels. However, we do understand the reviewer’s concern. We will include data showing an increase in BACE1 in hypoxic conditions by performing new Western blotting experiments.

      • *

      Fig. 4 and 5. The authors used external Amyloid β for stimulation. Would the endogenous Amyloid β levels reach the concentration of external one? It would be better to see the quantitative levels of Amyloid β in Fig. 3b.

      • *

      Our response: Because the aggregation of Aβ requires a high concentration of a micromolar order, we used synthetic Aβ fibrils for stimulation. We propose that chronic hypoxia in preeclampsia leads to an elevated local concentration of Aβ through a sustained increase in Aβ production, which eventually results in Aβ fibrillogenesis and deposition of Aβ fibrils. Therefore, it will be difficult for the Aβ concentrations generated by BeWo cells to reach a level sufficient for fibrillogenesis. We will discuss this point in the revised manuscript. In addition, we have already performed ELISA assays to quantitatively analyze Aβ generation by BeWo cells. We will include these ELISA data in the revised manuscript.


      • *

      * *

      • *

      Reviewer #4 (Significance (Required)):

      • *

      The manuscript addresses an important theme recently identified to address the heterogeneous etiology of preeclampsia. Although the authors have used in vitro approaches, the study could have been a solid if not for some major concerns.

      • *

      The authors have focused on an already demonstrated phenomenon but have tried to validate the findings using their in vitro approaches. The manuscript is well written but some lapses for correct references.

      • *

      Our response: We thank the reviewer for the critical reading of our manuscript and his/her constructive comments. As the reviewer pointed out, recent studies suggest that preeclampsia is a proteinopathy. However, the mechanisms by which protein aggregate plays detrimental roles in placentation has not been well-understood. In the present study, we discovered a detrimental role of Aβ fibrils in syncytiotrophoblast formation.

      • *

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Major comments:

        • In lines 50 and 51 of Introduction, the authors provide references to two publications. However, several other articles have appeared before or after these publications that demonstrated evidence for proteinopathy in the placenta and circulation of preeclampsia patients. The reviewer has gone through the literature and found several publications. For example, Kalkunte et al were the first ones to demonstrate the etiology of proteinopathy in preeclampsia placenta and focused on a protein called transthyretin (Am J Pathol. 2013, 183(5):1425-1436). Similarly, Cheng et al demonstrated using a novel blood test that serum from early onset and late onset preeclampsia manifestations contained Aβ and transthyretin (Nature Sci Rep. 2021;11:15934). Jash et al showed the presence of cis P-tau in the placenta and serum of early and late onset preeclampsia patients (Nat Commun. 2023;14:5414). This article and another article by Cheng et al (Hypertension 79(8):1738-1754) revealed that aggregated cis P-tau and transthyretin are etiologically critical for the onset of preeclampsia. There have been several other review and original articles that have talked about Alzheimer's like etiology in preeclampsia (Olie et al, JAMA Netw Open, 2024; e2412870; Basit et al, BMJ 2018; 363:k4109; Schliep et al, Hypertension 2023; 80:257-267, Cheng et al, Am J Reprod Immunol. 2016;75:372-381).*
      1. *

      Our response: We sincerely appreciate the reviewer for his/her helpful comment. We will revise the introduction by citing the references recommended by the reviewer.

      • *

      • Following up on the comments made above, the authors talk about induction of Aβ in hypoxia-treated human trophoblasts represented by an established cell line, BeWo, and primary human trophoblasts. However, it is not clear whether Aβ42 as stated in the manuscript was detected as an aggregated structure or a protein coupled with RB4CD12 aggregate marker. It would have been helpful if the authors could provide direct evidence for Aβ aggregation.*

      • *

      Our response: Based on our previous findings showing that highly sulfated domains of heparan sulfate are common components of protein aggregate deposits, we used RB4CD12, which recognizes these domains, as a marker of protein aggregate deposition. These include aggregates of Aβ in Alzheimer’s disease, transthyretin in ATTR, and p53 aggregates in p53-mutated cancers (Hoshono-fukao et al., Am J Pathol, 2012, PMID: 22429964; Kameyama et al., Am J Pathol, 2019, PMID: 30414409; Iwahashi, PNAS, 2020, PMID: 33318190). Please also see our reply to Comment 5 below. We will perform additional immunohistochemical analysis with the β0001 anti-Aβ antibody and the ProteoStat dye that recognizes protein aggregates.

      • *

      • What appeared to be more surprising is the statement on lines 162 and 163 that cultured CTBs produced Aβ40/42. Again, it is not clear whether the authors are talking about aggregated Aβ or just induction of Aβ. Why should normal CTBs produce Aβ? It is not clear whether this is a transient expression or a long-term phenomenon. The issue is distinction between normal and adverse pregnancy conditions, and the latter associated with protein aggregation as suggested in the literature.*

      • *

      Our response: BeWo cells and cultured CTBs produce Aβ peptides in a normoxic condition. In the brain, neurons constitutively produce Aβ peptides, which have physiological roles such as controlling neuronal hyperexcitability, enhancing of synaptic plasticity, and improving memory (reviewed in Kent et al., Acta Neuropathol, 2020, PMID: 32728795). The amount of Aβ in the brain is regulated by the balance between Aβ production and Aβ clearance, and the imbalance of the production and the clearance may result in an increase in Aβ local concentration and Aβ aggregation. Our results showing that hypoxia increased Aβ production in BeWo cells suggest that chronic hypoxia, which is a risk of preeclampsia, may lead to a sustained increase in Aβ production and an elevated local concentration of Aβ at or near the site of Aβ production. We will discuss these points in the discussion.

      In the present study, we showed that aggregated form of Aβ (i.e., Aβ fibrils) was detrimental to the CTB differentiation. On the other hand, we already found that Aβ monomers promoted EVT invasion (please see the below). We believe that promotion of EVT invasion by Aβ monomers represent a physiological function of Aβ in the placenta. We will include these new data in the revised manuscript and we will also perform experiments with BeWo cells and Aβ monomers in order to investigate whether Aβ monomers have some roles in CTB differentiation.

      • *

      • *

      • *

      • The authors have adequately pointed to importance of hypoxia in the onset of preeclampsia-like features. As a matter of fact, Lai et al demonstrated in a mouse pre-clinical model that hypoxia could induce severe features of preeclampsia (Hypertension. 2011;57:505-514). The use of hypoxia as driver of Aβ induction is appreciated.*

      • *

      Our response: We agree with the reviewer that studies using preclinical animal models are an important topic for the future. __We will discuss this point in the discussion. __

      • *

      • In Fig. 1, although the authors have used DIC approach, it would have been helpful if they presented individual Aβ and RB4CD12 green and red channels, and a merged profile. For example, PE #4 sample does not appear to have much RB4CD12. Again, there is a question of aggregated or native protein structures. It is difficult to have a satisfactory statistical analysis. Did the authors look for Aβ in the anchoring villi region of the placenta?*

      • *

      Our response: We will show the green and red channel images individually. We have noticed that we detected Aβ deposition without RB4CD12 signals. Aβ is small peptides of 40 to 42 amino acid residues and is extracellularly released after the production. Non-deposited Aβ monomers are not detected by immunohistochemical analysis, because these soluble Aβ peptides are spread out in the tissue fluid. Thus, in our statistical analysis, we calculated only merged signals of Aβ and RB4CD12, which suggests that our data show the aggregated and deposited Aβ. We will note this point in the results. In addition, we will perform immunohistochemical analysis with the anti-Aβ antibody and the ProteoStat dye. Please also see our response to Comment 2 above. We did not observe Aβ deposition at the anchoring villi.

      • *

      • Fig. 2 does not show significant staining for HIF1-α in PE placental tissue.*

      • *

      Our response: In a normoxic condition, HIF1-α is constitutively expressed but degraded via the proline-hydroxylation and the subsequent ubiquitination and degradation in the proteasome. Because the proline-hydroxylation is oxygen-dependent, hypoxia induce HIF1-α accumulation. Thus, our data suggest a hypoxic environment in the preeclamptic placentas. We will note this point in the results section.

      • *

      • Fig. 3B, why should there be Aβ40/42 under normoxic conditions? This is the most pertinent concern and the authors are validating significant expression of Aβ40/42 under normal conditions. In normal pregnancy placenta, this protein has not been detected.*

      • *

      Our response: Aβ peptides are constitutively produced in BeWo cells, and the production was enhanced by hypoxia. Aβ is small peptides of 40 to 42 amino acid residues. We did not observe Aβ signals in the immunohistochemical analysis of the normal pregnancy placentas, because Aβ peptides that do not aggregate and deposit in the placenta were distributed in the tissue fluid and lost before and during the processing of the placentas for the paraffin-embedding and immunostaining. Our immunohistochemical analysis detects only Aβ deposition. Thus, the absence of Aβ signals in the immunohistochemical analysis of normal placentas does not mean that normal placenta does not produce any Aβ peptides.

      • *

      • Figs. 4 and 5 present the crux of the conclusions that the authors are trying to draw from their study. Aβ peptide solution was incubated for 5 days at 370C to prepare so called Aβ fibril-like structures. What is the purity of fibril structures? Does this preparation show toxic effects on cell viability? Human trophoblasts expressing E-cadherin fail to participate in endovascular cross-talk with endothelial cells, a process required for spiral arteries. It appears that either BeWo cells or primary trophoblasts used in this study represent trophoblasts from third trimester. It is not clear why should Aβ fibril like structures should inhibit ZO-1 and E-cadherin or β-hCG (Fig. 5) for that matter. In Fig. 5C, there does not seem to be a major effect of Aβ fibrils. Did the authors try synthetic Aβ as a control. These experiments could have been meaningful but for proper controls.*

      • *

      Our response: Synthetic Aβ was purchased from Peptide Institute (Osak, Japan). The purity is >95%. We will include the data sheet as a review process file. In case that the reviewer wants to know the fibril content of the preparation, we will calculate the fibril content by using Native PAGE followed by Western blotting. We did not observe any cytotoxicity of the preparation as shown in Supplemental Fig. S3.

      We previously showed that membrane localization of cell-cell interaction proteins such as ZO-1 and E-cadherin in cytotrophoblasts is required for syncytialization (Iwahashi et al., Endocrinology, 2019, PMID: 30551188; Matsukawa et al., Biomolecules, 2022, PMID: 36008943). Because Aβ aggregates disrupt membrane localization of tight junction proteins partly by inducing excess autophagy (Marco et al., Neurosci Lett, 2006, PMID: 16644119; Chan et al., Exp Cell Res, 2012, PMID: 29856989), we hypothesized that Aβ fibrils may also disrupt membrane localization of ZO-1 and E-cadherin in BeWo cells. We are focusing on the effect of Aβ fibrils on cytotrophoblasts at the late stage of pregnancy when the remodeling of spiral arteries is completed. We understand the importance of investigating the effects of Aβ and Aβ fibrils on early pregnancy. We will cite an article showing the effects of Aβ aggregates on EVTs (Gao et al., J Mol Histol, 2024, PMID: 38777993) and include our data showing the Aβ monomer functions on EVT invasion. Please also see our reply to Comment 3 above. As for Fig. 5C, we will improve the quality of images. We will also perform experiments to investigate whether Aβ monomers alone affect syncytialization of BeWo cells.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      This manuscript focuses on the role of amyloid β (Aβ) in hypoxia-exposed human trophoblasts. Recent reports in the literature have confirmed the presence Aβ and other proteins, including Tau, transthyretin, and TDP-43, in placental tissue derived from preeclampsia deliveries. These proteins are recognized as hallmark causative factors for Alzheimer's disease related dementias. Hypoxia has also been shown to induce expression of these proteins, including Aβ, in human trophoblasts. In this regard, detection of Aβ hypoxia-exposed human trophoblast may not be a novel finding. This said, the manuscript presents some solid information and could have been very informative study. However, several conceptual, technical and literature concerns remain unaddressed and dampen the reviewer's enthusiasm for this study.

      Major comments:

      1. In lines 50 and 51 of Introduction, the authors provide references to two publications. However, several other articles have appeared before or after these publications that demonstrated evidence for proteinopathy in the placenta and circulation of preeclampsia patients. The reviewer has gone through the literature and found several publications. For example, Kalkunte et al were the first ones to demonstrate the etiology of proteinopathy in preeclampsia placenta and focused on a protein called transthyretin (Am J Pathol. 2013, 183(5):1425-1436). Similarly, Cheng et al demonstrated using a novel blood test that serum from early onset and late onset preeclampsia manifestations contained Aβ and transthyretin (Nature Sci Rep. 2021;11:15934). Jash et al showed the presence of cis P-tau in the placenta and serum of early and late onset preeclampsia patients (Nat Commun. 2023;14:5414). This article and another article by Cheng et al (Hypertension 79(8):1738-1754) revealed that aggregated cis P-tau and transthyretin are etiologically critical for the onset of preeclampsia. There have been several other review and original articles that have talked about Alzheimer's like etiology in preeclampsia (Olie et al, JAMA Netw Open, 2024; e2412870; Basit et al, BMJ 2018; 363:k4109; Schliep et al, Hypertension 2023; 80:257-267, Cheng et al, Am J Reprod Immunol. 2016;75:372-381).
      2. Following up on the comments made above, the authors talk about induction of Aβ in hypoxia-treated human trophoblasts represented by an established cell line, BeWo, and primary human trophoblasts. However, it is not clear whether Aβ42 as stated in the manuscript was detected as an aggregated structure or a protein coupled with RB4CD12 aggregate marker. It would have been helpful if the authors could provide direct evidence for Aβ aggregation.
      3. What appeared to be more surprising is the statement on lines 162 and 163 that cultured CTBs produced Aβ40/42. Again, it is not clear whether the authors are talking about aggregated Aβ or just induction of Aβ. Why should normal CTBs produce Aβ? It is not clear whether this is a transient expression or a long-term phenomenon. The issue is distinction between normal and adverse pregnancy conditions, and the latter associated with protein aggregation as suggested in the literature.
      4. The authors have adequately pointed to importance of hypoxia in the onset of preeclampsia-like features. As a matter of fact, Lai et al demonstrated in a mouse pre-clinical model that hypoxia could induce severe features of preeclampsia (Hypertension. 2011;57:505-514). The use of hypoxia as driver of Aβ induction is appreciated.
      5. In Fig. 1, although the authors have used DIC approach, it would have been helpful if they presented individual Aβ and RB4CD12 green and red channels, and a merged profile. For example, PE #4 sample does not appear to have much RB4CD12. Again, there is a question of aggregated or native protein structures. It is difficult to have a satisfactory statistical analysis. Did the authors look for Aβ in the anchoring villi region of the placenta?
      6. Fig. 2 does not show significant staining for HIF1-α in PE placental tissue.
      7. Fig. 3B, why should there be Aβ40/42 under normoxic conditions? This is the most pertinent concern and the authors are validating significant expression of Aβ40/42 under normal conditions. In normal pregnancy placenta, this protein has not been detected.
      8. Figs. 4 and 5 present the crux of the conclusions that the authors are trying to draw from their study. Aβ peptide solution was incubated for 5 days at 370C to prepare so called Aβ fibril-like structures. What is the purity of fibril structures? Does this preparation show toxic effects on cell viability? Human trophoblasts expressing E-cadherin fail to participate in endovascular cross-talk with endothelial cells, a process required for spiral arteries. It appears that either BeWo cells or primary trophoblasts used in this study represent trophoblasts from third trimester. It is not clear why should Aβ fibril like structures should inhibit ZO-1 and E-cadherin or β-hCG (Fig. 5) for that matter. In Fig. 5C, there does not seem to be a major effect of Aβ fibrils. Did the authors try synthetic Aβ as a control. These experiments could have been meaningful but for proper controls.

      Significance

      The manuscript addresses an important theme recently identified to address the heterogeneous etiology of preeclampsia. Although the authors have used in vitro approaches, the study could have been a solid if not for some major concerns.

      The authors have focused on an already demonstrated phenomenon but have tried to validate the findings using their in vitro approaches. The manuscript is well written but some lapses for correct references.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors found that Amyloid β suppressed cytotrophoblasts syncytialization, which is innovative. The authors used human patient samples and human primary CTB culture which are powerful data.

      Fig. 3. The authors used Roxadustat to stimulate HIF-1α and showed BACE1 increase. It would be better to have the cells in real hypoxia condition.

      Fig. 4 and 5. The authors used external Amyloid β for stimulation. Would the endogenous Amyloid β levels reach the concentration of external one? It would be better to see the quantitative levels of Amyloid β in Fig. 3b.

      Significance

      While Aβ is present in human placentas and accumulates in preeclamptic placentas, the production and role of Aβ in the human placenta remain unclear. The current findings suggest that increased Aβ production in cytotrophoblast by hypoxia may lead to the formation of Aβ fibrils, which inhibit syncytiotrophoblast formation and are detrimental to pregnancy, revealing a novel role of Aβ fibrils in the pathogenesis of preeclampsia.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, the authors examine the deposition of amyloid-β (A β) peptides that accumulate in the brains of patients with Alzheimer's disease (AD). The authors demonstrated the expression of HIF-1 in the pre-eclamptic (PE) placental tissue using immunofluorescence (which is not novel), alongside the expression of BACE1. These experiments were also validated using BeWo and primary trophoblast cells cultured under hypoxia to mimic one of the characteristics of PE. However, this manuscript is quite preliminary, and many additional experiments are necessary to confirm the deposition of Aβ fibrils in PE. The authors treated CTB and observed the effects on STB, but in PE, the main cell lineage affected is extravillous trophoblast (EVT) cells, which invade the spiral artery. The defect in this invasion is one of the major causes of PE. Therefore, the authors should investigate the effect of hypoxia and Aβ deposition on EVT invasion. Overall, this work appears very incomplete, and further experiments are warranted.

      Major comments

      • If CTBs are treated with Aβ, and if it affects STB, what happens with EVT? Why didn't they check with EVT if the authors wanted to link with PE?
      • Did the authors look for pathologies related to Aβ deposition on PE placentas?
      • Line# 103, the IF images don't show that BACE1 is around HIF1. There are no merged images, and the results are over- or underestimated.
      • What is the intended purpose of using Roxadustat? If it inhibits HIF1, could you explain the reason behind the increased expression of HIF1? Furthermore, is there evidence to support the efficacy of this compound?
      • Is Aβ deposition very specific to PE, or can it also occur for other reasons during pregnancy?
      • BACE1 is expressed in Normal#2 and #3 but not in #1, #4, and #5. Why is this expressed in #2 and #3? Is there anything wrong with these samples? If patients had gestational hypertension or some other complications?
      • PE placentae were compared with GA matched placentae. What is the expression of BACE1 and RB4CD12 in term control placentae?
      • If AB fibril deposition is hypoxia dependent, what happens at the early gestation, where oxygen conc is 1-2%?

      Minor comments

      • The authors only performed IF and IHC. Please confirm and correct the methods accordingly.
      • Was the BeWo-b21 clone cell line used for all the experiments in this paper? This is the only clone that can be used for BeWo-STB models.
      • Have all the experiments on BeWo only been performed once?

      Significance

      Investigating the deposition of Aβ in the placenta could enhance our understanding of pregnancy complications such as PE, fetal growth restriction, and neurodevelopmental risks. However, further research on this topic is necessary.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Proving that more Beta-amyloid are produced in preeclampsia, and that impacts negatively trophoblast cell fusion is interesting and provides a potential mechanism for interpreting some specific cases of preeclampsia. The authors analyzed the placenta from five control and five preeclamptic pregnancies (4 early onset et 1 late onset).

      The authors show first by IHC that amyloid beta and aggregate markers are apparently exclusively detected in the PE samples, the same observation is done for detection of HIF1alpha and BACE1, the enzyme that is responsible for the generation of amyloid peptides from digestion of the APP membrane neuron protein. After having used placental samples, the authors moved to the BeWo cell model, where they could analyze specifically cell biology in the context of syncytialization. The authors inhibited HIF1a prolylation (thus stabilizing it even in normoxia), and this leaded to the increase of BACE1, of beta-amyloid molecules, as shown by WB analyses; the same result was obtained by exposure to hypoxia, while a BACE1 inhibitor had the opposite effect.

      An interesting issue is the demonstration provided by the authors that in this model, syncytialization is decreased by Beta-amyloid fibrils, together with decreased hCG expression and decreased Syncytin-1. The authors also validate these results on primary human CTB from the third trimester.

      Minor remarks

      1. It is classical now to present in extenso the WB as supplementary data for Fig 3, 4 and 5.
      2. It seems that the beta amyloid signal is not stronger for the early onset and the late onset PE samples. Have the authors an interpretation?
      3. The figure 4b does not show the BeWo labeling in forskolin with or without beta amyloid peptides, why? It would be illustrative to show a decrease in the fusion processes
      4. How do the authors explain that exposure to fibrils did not seem to slow down significantly the fusion process, even though markers are decreased?
      5. Could the authors attempt a labeling with the Di-8, an interesting quantitative marker of cell fusion (see ref PMID: 38019394).

      Significance

      This study aims to bridge a gap between the mechanisms of preeclampsia and neurodegenerative disorders, and this through the existence of misfolded proteins in the preeclamptic placenta which has been reported before, in particular the beta amyloid protein, known as operative in Alzheimer's disease (AD) in particular.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statement

      *Our lab was totally destroyed on June 15th by an Iranian missile. All stocks, equipment and reagents were lost. While we performed many of the experiments requested by the reviewers, unfortunately some were never completed. We thank you for your understanding. *

      We thank the three reviewers for their thoughtful comments and useful suggestions on how to improve our paper. Some of the reviewers claimed that the paper is “preliminary”. We would like to highlight that in our opinion “preliminary” has two possible meanings in this context: 1) the data does not yet support the claims that the authors wrote; 2) the story is short and should be extended. While we totally agree that type 1 “preliminary” should be addressed (and we have addressed that to the best of our abilities), type 2 “preliminary” is a matter of scope, the length of the paper/project and the publication home. We believe that this story, which has been led by an outstanding master’s student (and as such has had a limited timespan) is worthwhile of publication in its current scope.

      2. Point-by-point description of the revisions

      Reviewers’ comments are in BLUE while our responses are in BLACK.

      Reviewer 1 Summary: This study reports a role for matrix metalloproteinases (MMPs) in the developmental pruning of gamma Kenyon cells (KCs) in the fruit fly Mushroom Body during larval-pupal metamorphosis. The authors show through gene expression studies that MMP genes are upregulated in late larval stages as part of the early program for this type of neuronal pruning. They show through cell-targeted RNAi studies of both secreted MMP-1 and membrane-anchored MMP-2, that both genes are required in glial cells and to a lesser extent within KCs.

      Both MMPs have secreted and membrane-anchored isoforms and we did not assess whether the secreted/anchored isoforms are involved; e.g. see LaFever et al. 2017.

      The authors show that MMP secreted from glial is required for normal levels of Mushroom Body developmental neuronal pruning. They mention that MMP genes have been identified in schizophrenic patient screens in patients, and that perhaps a comparable pruning mechanism could be involved in the loss of grey matter (loss of synapses) in patients. The authors propose that MMP levels may be a potential therapeutic marker in the future.

      We thank the reviewer for his comments. We find it important to clarify that we do not think our work suggests that the MMPs levels may be a potential therapeutic marker without much additional work in the future. In the original text we added a claim from another paper suggesting MMPs as therapeutic target. However, due to the arising confusion, we decided to delete this statement from the text (original line 198). We also added a general disclaimer towards the end of the discussion regarding the genetic power of Drosophila but its limited implication into human health (new lines 276-278).

      Major Comments: Overall, the work is of a reasonable standard, but very preliminary

      Please see general note on two types of “preliminary” – we thank the reviewer for helping us substantiate our claims and strengthen our paper but we do not plan to significantly increase its scope.

      The study lacks the substance to completely convince me of any of the results. There is SUBSTANTIAL work that needs to be done to make this publishable. There are a lot of writing mistakes; so many that I do not list them in detail here

      We are not absolutely sure that we understand to which mistakes this reviewer is eluding. However, we carefully rewrote the manuscript, streamlined many of our claims and added many new and more recent references.

      The references citations are fairly old, but I do not list update replacements here

      Thanks – we added many newer and relevant citations.

      The text is very brief, and the overall writing needs to include significantly more description and detail

      We have included more descriptions and details, as will be elaborated later on, but – again - this is a short report and will remain as such.

      This is evident in all aspects of the manuscript, but especially notable in the Methods and Figure Legends

      Thanks for raising this comment, which was reverberated also by other reviewers – we have now included more details, with a particular focus on the genotypes (Table 2), that somehow were erroneously not included in the original submission, as well as more detailed figure legends.

      None of the Figure Legends include full genotypes of any of the fly lines, and these full fly lines are also not included in the Methods. This is vital to compare the experimental lines to the controls

      True – our apologies for this mistake, we now added the full genotypes in Table 2.

      Major points are listed below:

      1. Figure 2: It is important to note of the specific age of animals in these images when talking about the loss of genes in development. Are all the animals age-matched? High levels of synaptic pruning occur post-eclosion), and it is important to understand when these pruning defects occur. It is mentioned that that overlap for the gene expression data is upregulated during 6-18h APF is this when these images are taken? This is very important in the context of pruning as SCZ symptom presentation is very late relative to these early events.

      We thank the reviewer for this comment which suggests we were not clear enough in our description. We do not claim to have generated an SCZ model and have clarified this better in the text (lines 275-278). Furthermore, axon pruning happens during pupal development, but in all the main figures in this manuscript we dissected young adult flies (3-5 days post eclosion) and show the remnants of unpruned axons (as we have done in numerous studies). To make sure that initial development occurred normally, we also include larval brains in the Figure S7. We now clarified the fact that we are imaging adult brains as a readout to investigate whether pruning occurred during metamorphosis or not (line 124-126).

      1. Figure 2: In the figure legend, it is indicated that the arrows are unpruned axons, however in the controls these areas appear to be highly innervated. Further explanation is needed about the context of the arrows, as there are clear visual differences between these images and the controls, but they appear to have a more expansive phenotype than "unpruned axons". The data does not match the visual representation in comparison to the control.

      We apologize for this confusion. Unfortunately, the driver which we use to label the γ-axons, R71G10-QF2, is not absolutely specific to the γ type KCs but also expressed (sometimes) in the ɑ/β KCs. As the ɑ/β axons are very stereotypic in shape and also express high levels of FasII (which we stain for), we can easily distinguish between the ɑ lobe and unpruned γ axons. To clarify this point, we now clearly demarcate all lobes in the control images and specifically the ɑ lobe in all panels. Additionally, we added new schemes in Figure 2A and 2O to better clarify the anatomy and experimental design.

      1. Figure 2: There needs to be more descriptive definitions and clarifications to the defects labeled in panel K. This could be done in the figure legend, but it would be more useful to label the images provided. For example, if Mmp2 is a "mild pruning affect, put that in the pie chart somewhere, to help guide the description of the phenotype to what those confocal images look like.

      We understand that the pie chart in Figure 2 was confusing and therefore simplified it in the current version (Fig. 2B and 2P). Also, thanks to this great point, we now include a new Figure S3 that includes examples for the ranking categories, which were now performed by two independent investigators in a blind manner.

      Figure 3: The time points of the images of the Mushroom Body (MB) are vital to understanding the process and regulation of these genes.

      Please see our comment to point #1 – unless specifically stated otherwise, all images are MBs of adult flies, as now clearly mentioned in the figure legends, in the text and in the Material and Methods section.

      1. Figure 3D: Significant description of this graph needs to be added for clarity. What parameters separate each phenotypic defect? Labeling the images and showing images that belong in different groups would be very helpful and improve the paper significantly.

      We now included a new Figure S3 (also see our response to comment #3).

      1. Figure S1: Additional experiments would help answer the strength of the phenotype for the ALG-Gal 4 driver. The authors need to perform the rescue experiment. Use a MMP-2 null and then drive it back in the ALG-GAL4 to see if this is sufficient to rescue the neuron pruning. This also isolates the mechanisms to one subtype of glia.

      These are excellent suggestions that are, unfortunately, not doable. To perform a rescue experiment, one would need a viable loss-of-function phenotype of an Mmp2 mutant. There is one published Mmp2 loss-of-function null allele which is lethal during pupal development (Page-McCaw et al, 2003). Our previous data, using tissue specific (ts)CRISPR, suggested the involvement of Mmp2 in neurons for their remodeling (Meltzer et al, 2019). We therefore independently generated an Mmp2 germline mutant using CRISPR (harboring an indel resulting in a premature stop codon and predicted to encode a truncated, 77 amino-acid long protein), now described in Fig. S5A (and in the Materials and Methods). This allele is, as expected, unfortunately also lethal. We attempted to overcome lethality by generating MARCM (mosaic) clones in neurons, but as expected, because Mmp2 is largely secreted, there was no pruning defect phenotype (Fig. S5B-C). Unfortunately, it is not yet possible to generate glial clones.

      Figure 3 and 4: The other glial subtypes need to be analyze to make any conclusion about their involvement, as well as the involvement of the astrocytes. Running these exact same experiments on the cortex glial and ensheathing glia will provide essential insight into what glial subtype is involved. The presumed lack of phenotypes in these other glial subtypes will also strengthen the argument that the astrocytes are specifically involved in this process. These are vital experiments.

      We currently limited our analysis (and conclusions) to astrocytes. Despite the fact that this experiment is beyond our initial scope, we obtained reagents and performed preliminary experiments (using the R77A03-Gal4 driver for cortex glia, and the R83E12-Gal4 for ensheathing glia). In both cases, we observed extremely mild pruning defects, not comparable to those with Repo- or Alrm-Gal4. In these preliminary experiments we lacked a proper control, and now, unfortunately, due to the loss of our lab, we are unable to complete these experiments in a reasonable amount of time.

      1. Figure 4: Again, description of the phenotypes and examples of these would improve the quality of this figure substantially.

      Absolutely agree – see our response to comment #3 (and Fig. S3).

      1. Figure 5: An improvement on the quantifications of these phenotypes would strengthen the paper substantially. More detailed description of the phenotypes and how they related to the control would significantly improve the overall quality of the work.

      Thanks again for highlighting that we neglected to include the full genotypes that are now added (Table 2). We also thank the reviewer for raising the point regarding quantification. First, we generated a new Fig. S3A-E to show examples of the ranking by two independent rankers. Second, ranking was performed by looking at TdTomato positive vertical axons that are outside of the ɑ lobe (high FasII) – this is now better explained in the materials and methods. Additionally, while we would love to have a better scoring, and automatic, system – and even published a semi-automated scoring algorithm in Alyagor et al. 2018 (Figure 3O in the Alyagor paper), because the driver also labels vertical axons (ɑ/β) and because unpruned γ axons often express FasII, this quantification method does not always work. What we have done in previous cases, as we have also done here, is to provide independent ranking by two investigators and compare their ranking (Fig. S3F-G). Finally, we are working with our AI hub to develop automatic scoring systems that will not require human ranking – however this is beyond the scope for this manuscript.

      Minor Comments: 1. Figure 1A: I would suggest labeling the KC (gamma) and potentially one of the others (a/B, a'/B') to orient the reader to the differences between these two subsets of the KCs, and to emphasize which neurons are undergoing pruning and where the cell bodies are and where the axons project.

      Thanks for the suggestions – we now better annotated the scheme in Figure 1A as well as additional schematics in Figure 2 and, finally, better annotations in selected panels. Specifically, the ɑ lobe is outlined in magenta throughout all relevant panels.

      1. Figure 1C: This panel needs further labeling to explain the findings in the heat map. Labeling some of the genes that were found and where they were would be helpful. This could also be done in the figure legend, however without any further labeling or context the heatmap is confusing.

      We apologize for the incomplete figure. We did not want to overload the figure with data, which is why we are showing only the important clusters and did not include gene names. To keep the figure simple, but at the same time provide the complete information, we now include the full data in Fig. S1 (that includes the original heatmap with all the dynamic clusters I-IX, and including all the gene names). For the full raw data, including non-dynamic clusters, the reader is referred to look in Supplemental excel file 1. We hope this provides the clarity that this reviewer rightfully asks for.

      1. Figure 3B,C: The full genotypes need to be labeled. What is the exact genotype used for the control?

      The full genotypes of all figure panels are now included in Table 2 in the Materials and Methods.

      1. Figure S1: The stock number for the ALG-GAL4 is missing, there are multiple different drivers, therefore this could be helpful in understanding this phenotype, as some are better than others.

      Indeed, Alrm-Gal4 comes on two chromosomes – we used BDSC #67032, which is on chromosome III and this is now clearly mentioned the Materials and Methods section.

      1. Figures 3 and 4: Labeling needs to remain consistent; Figure 3 "Glia-Gal4", Figure 4 "glia-gal4".

      Thanks, done.

      Reviewer #1 (Significance (Required)):

      General Assessment: An interesting study on MMP function during an unusual type of neural development (axon pruning). Most of the MMP function appears to be in glia, although the MMP role in this context in unclear. The MMP function in the neurons being pruned is unexpected and even less clear. The study is somewhat poorly described in terse language lacking essential information, which gives the overall impression of a preliminary report.

      Advance: Glial MMP function has been described for neuronal clearance mechanisms following injury. The main advance here is to describe a similar function during normal development. Audience: Developmental neuroscientists, MMP biologists, possibly schizophrenia clinician researchers

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Neuropsychiatric conditions are often influenced by genetic factors. Schizophrenia is a complex mental disorder characterised by a mixture of hallucinations, delusions and disorganised thinking that causes lifelong problems in daily life. GWAS have identified a number of genes associated with the risk of developing schizophrenia, although genetic predisposition alone is not sufficient and additional environmental factors are required. In the current manuscript, the authors aim to exploit the strength of the Drosophila system to explore a link between schizophrenia-associated genes and neuronal remodelling during development. They focus on the mushroom body in the adult brain, where pronounced neuronal remodelling occurs during metamorphosis. To assess the potential role of the genes identified by the GWAS, they performed a targeted RNAi-based screen. They focus on the role of metalloproteases and find that they are required in neurons and in glia for the pruning of mushroom body axons. The study starts with a selection of 32 genes, 29 of which are listed (a bit hidden) in materials and methods and the identification of the Drosophila orthologs. The expression patterns of these genes in Kenyon cells are presented in Figure 1 - but unfortunately no information is given on who is expressed when

      We apologize for the confusion. We attempted to keep Figure 1 simple but this resulted in the absence of critical information, as the reviewer suggests. We now include a Figure S1 that includes the entire heatmap of the dynamically expressed clusters I-IX with all the gene names. Additionally, we now augmented the information in Table 1 to include the screen phenotypes. Finally, Supplemental excel file 1, also included in our original submission, includes all the data, and is now better referred to throughout the text.

      In a next step, Kenyon cell specific RNAi knockdown experiments are shown that identify a pruning phenotype for several genes. They demonstrate that Mmp2 (and similarly Mmp1) is also required in glia. Although Mmp2 was identified by neuronal RNAi-based knockdown, double knockdown experiments led the authors conclude that its primary function is in glia. The study emphasises the use of the advanced genetic model to understand complex human diseases. However, the paper does not go far enough in making use of the excellent genetics available. Basically, the report is about the identification of a few hits in a small RNAi screen, which is fine in itself, but leaves many questions unanswered. Do mmp1/2 mutants have a phenotype?

      This is a very important question that cannot be answered, unfortunately. There is one published Mmp2 loss of function null allele which is lethal during pupal development (Page-MaCaw et al, 2003). Our previous data, using tissue specific (ts)CRISPR, suggested the involvement of Mmp2 in neurons for their remodeling (Meltzer et al, 2019). We therefore independently generated an Mmp2 germline mutant using CRISPR (harboring an indel resulting in a premature stop codon and predicted to encode a truncated, 77 amino-acid long protein), now described in Fig. S5A (and in the Materials and Methods). This allele is, as expected, unfortunately also lethal. We attempted to overcome lethality by generating MARCM (mosaic) clones in neurons, but as expected, because Mmp2 is largely secreted, there was no pruning defect phenotype (Fig. S5B-C). Unfortunately, it is not yet possible to generate glial clones. Additionally, available Mmp1 mutants are, sadly, also homozygous lethal. That said, in our revised manuscript we now include data demonstrating that expression of a dominant negative variant of Mmp1 inhibits pruning (Fig. 3J-K). We strengthened the evidence regarding the reliability of Mmp1 RNAi using an antibody mix (Fig. S4), and for Mmp2 – we refer to a manuscript that tested its efficiency (Harmansa et al., 2023). Lastly, we added new data using an additional RNAi line targeting Mmp2 from the VDRC collection (Fig. 3L).

      Can the phenotype be rescued?

      Unfortunately, without a viable mutant LOF phenotype, a rescue experiment is impossible. Regardless, in an attempt to rescue the RNAi phenotype, we designed and generated an RNAi-resistant Mmp2 overexpression transgene. Unfortunately, due to the destruction of our lab – several days after we received this transgenic line from Bestgene – this experiment is not included in the revision.

      Does TIMP expression lead to similar phenotypes?

      This is an interesting question which we addressed in our experiments but did not include in the text. Unfortunately, overexpression of TIMP did not have any effect on MB development. We are adding this figure here as Reviewer Figure 1, but we think that adding this information to the paper will not improve it for several reasons. The lack of phenotype by overexpression of Timp can result from a technical issue such as low expression or mislocalization of the protein, or a biological issue such as more complicated involvement of TIMP or other MMP inhibitors.

      What is the temporal requirement for Mmp1/2?

      This is an excellent suggestion, not an easy experiment, but one that we initiated, using a temperature sensitive Gal80 to control the expression of the RNAi only during metamorphosis. However, to the unfortunate destruction of our lab, this experiment was never completed.

      What are the target proteins of Mmp2?

      This is the million-dollar question – but unfortunately is beyond the scope of this short report.

      Is Mmp2 still required when astrocyte motility is blocked? What is the morphology of glia after Mmp1/2 knockdown?

      Thank you for this wonderful suggestion. We initiated two types of experiments using sparse labeling techniques (both MARCM and SPARC) to identify the morphology of single astrocytes in WT vs. MMP KD. However, these are complicated crosses that were not completed prior to the destruction of our lab.

      Reviewer #2 (Significance (Required)):

      The strength of the study is to identify a pruning phenotype after RNAi-based knockdown. The limitations is that this study is very superficial, it is the beginning of a paper. The initial claim to use Drosophila because to its advanced genetics is not met. The results section is shorter than the discussion.

      While we agree with much of the reviewer’s statement this also relates to our general comment about “preliminary” type 1 and type 2 – True, this could be the beginning of a big paper and it would definitely be a more comprehensive and deep story. Most of the papers from my lab are indeed a 5 year endeavor. However, this short report (which is now longer, more detailed, and includes additional experiments) is a result of the work of an outstanding master’s student who came up with the idea for the project entirely by herself. Thus – given the data that she has acquired, and the fact that my lab will not continue to study MMPs or schizophrenia, the question needs to be whether the data supports the claims and whether this is an advance of science worthwhile of publication in a respectable journal. Our clear and decisive opinion is that the answer to that question is yes.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this work, Schuldiner and colleagues explore the role of Mmp1 and Mmp2 in neuronal remodeling in the mushroom body of Drosophila. Overall, this work is very interesting, but in its current form seems quite preliminary. The biggest limitation of the study is that single RNAi lines are used with no validation that the lines are working, despite the fact that Mmp antibodies are available as are endogenously tagged Mmp lines that could have been used to validate the genetic manipulations. Specific concerns are listed below.

      We thank reviewer 3 for his generally positive assessment of our work and we now performed additional experiments to strengthen and validate the original RNAi findings – for specifics see our reply to the points below.

      Major concerns 1) The scoring system for pruning of mushroom body neurons seems very variable, even in controls (where scoring can range from very mild to moderate), and it is very hard to assess from the images what one is looking at (rather than using our own judgment, we rely on the authors' words). It would be necessary to have better labeling and examples of what phenotypes are considered "mild", "severe", "wild type-like". It would also help to understand how phenotype assessment is guided by the overlap between the signals from TdTomato fluorescence and FasII stain.

      We thank the reviewer for raising this point, that has also been highlighted by other reviewers in some form. First, we have generated Figure S3A-E to show examples of the ranking, which was now performed by two independent investigators. Second, ranking was performed by looking at TdTomato positive vertical axons that are outside of the αlobe (high FasII) – this is now better explained in the materials and methods. Additionally, while we would love to have a better scoring, and automatic, system – and even published a semi-automated scoring algorithm in Alyagor et al. 2018 (Figure 3O in the Alyagor paper), because the driver also labels vertical axons (ɑ/β) and because unpruned γ axons often express FasII, this quantification method does not always work. What we have done in previous cases, as we have also done here, is to provide independent ranking by two investigators and compare their ranking (Fig. S3F-G). Finally, we are working with our AI hub to develop automatic scoring systems that will not require human ranking – however this is beyond the scope for this manuscript.

      2) The biggest limitations of the approach are that single RNAi lines are used to screen, with no accompanying validation of the tool (see above)

      We agree. Unfortunately not all RNAis are “equal” and thus not all of them work. To support the RNAi data, we have better clarified previous experiments that demonstrate the importance of neuronal Mmp2 via tissue specific (ts) CRISPR (Meltzer, et al, 2019). Unfortunately, the Mmp2 null mutant that is available is lethal during pupal development (Page-MaCaw et al, 2003). We therefore independently generated an Mmp2 germline mutant using CRISPR (harboring an indel resulting in a premature stop codon and predicted to encode a truncated, 77 amino-acid long protein), now described in Fig. S5A (and in the Materials and Methods). This allele is, as expected, unfortunately also lethal. We attempted to overcome lethality by generating MARCM (mosaic) clones in neurons, but as expected, because Mmp2 is largely secreted, there was no pruning defect phenotype (Fig. S5B-C). Unfortunately, it is not yet possible to generate glial clones. Additionally, available Mmp1 mutants are, sadly, also homozygous lethal. That said, in our revised manuscript we now include data demonstrating that expression of a dominant negative variant of Mmp1 inhibits pruning (Fig. 3J-K). We strengthened the evidence regarding the reliability of Mmp1 RNAi using an antibody mix (Fig. S4), and for Mmp2 – we refer to a manuscript that tested its efficiency (Harmansa et al., 2023). Lastly, we added new data using an additional RNAi line targeting Mmp2 from the VDRC collection (Fig. 3L).

      3) RNAi-based knockdown is used to infer epistatic information-this is not appropriate as epistasis experiments need to be done with null alleles to make firm conclusions. Additional concerns: ● Even with the same driver, knockdown efficiency for 2 different genes could be variable and dependent of the specific RNAi used. ● The comparison between drivers is even harder, as driver strength varies greatly. ● The knockdown efficiency drops with increasing numbers of RNAi used. ● The specific genotypes used for this experiment should be clarified, as it would be very important to ensure that the UAS dosage is equal across conditions.

      We agree that RNAi is not optimal to assess epistasis. And indeed, we did not mean to claim epistasis relationship between Mmp1 and Mmp2, nor between neurons and glia. We now use better language to clarify this. To define epistatic relationships, the use of mutants would be required, unfortunately the use of nulls is not possible because they are lethal and secreted (thus not enabling mosaic analyses). We agree that increasing the number of RNAi lines is expected to reduce their efficiency – this is why it is even more significant when we see an increased defective phenotype in the double knockdown experiments. Finally, we totally agree about the genotype comment and apologize that it was erroneously omitted in the original submission– all of which have been now added (Table 2 in materials and methods).

      4) To further deepen the rigor of this work, a few simple yet important things could have been done. First, it would be important to rule out that knocking down Mmps does not affect astrocyte numbers and health (could be assessed by counting numbers and observing their morphology). Also, the authors previously showed that astrocytes actively infiltrate the axon bundle prior to pruning to facilitate axon defasciculation and pruning (Marmor-Kollet et al., 2023). It would have provided an important insight to examine if astrocytes can infiltrate the axon bundle if Mmp2 and/or Mmp1 are knocked down.

      Thank you for these wonderful suggestions. We embarked on a few experiments as detailed below, unfortunately these are complicated crosses that were not completed prior to the destruction of our lab. 1) We initiated two types of experiments using sparse labeling techniques (both MARCM and SPARC) to identify the morphology of single astrocytes in WT vs. MMP KD. 2) Testing astrocytic infiltrations requires three binary systems, we obtained and generated stocks required for these experiments, but these were prematurely terminated. 3) We initiated experiments to count the number of glial nuclei in the vicinity of the degenerating axonal lobe (at the onset of pruning). Preliminary experiments with a small n (3 controls, 4 Mmp1 RNAi, and 5 Mmp2 RNAi) suggest that the number of glial nuclei is not significantly different between these conditions.

      Minor The introduction puts big emphasis on the role of glia, but then to narrows down candidate genes for the screen a γ-KCs transcriptional data set is used, and the initial screen is done via knockdown of those candidates in neurons (there is a disconnect between rationale and approach).

      We totally agree with this reviewer which is why we now changed the paper to include both neuronal and glial loss-of-function screens. Figure 1 is now augmented with the glial data.

      Rationale for looking into axon pruning and how that translates into insights about synaptic pruning defects in schizophrenia should be more clearly stated.

      Indeed, our belief that synapse pruning and axon pruning share molecular mechanisms remains yet unproven. However, both are steps during neuronal remodeling, which has been previously implicated in schizophrenia. That said, we now added an additional disclaimer to acknowledge the limitation of our findings in the context of human disease and synapse elimination (lines 275-279).

      Figure 1C: data visualization for this heat map should be improved. Parts of the data are faded, and the differences between gene clusters are unclear.

      We apologize for the incomplete figure. We did not want to overload the figure with data, which is why we are showing only the important clusters and did not include gene names. To keep the figure simple, but at the same time provide the complete information, we now include the full data in Fig. S1 (that includes the original heatmap with all the dynamic clusters I-IX, and including all the gene names). For the full raw data, including non-dynamic clusters, the reader is referred to look in Supplemental excel file 1. We hope this provides the clarity that this reviewer rightfully asks for.

      Reviewer #3 (Significance (Required)):

      In this work, Schuldiner and colleagues explore the role of Mmp1 and Mmp2 in neuronal remodeling in the mushroom body of Drosophila. Overall, this work is very interesting, but in its current form seems quite preliminary. The biggest limitation of the study is that single RNAi lines are used with no validation that the lines are working, despite the fact that Mmp antibodies are available as are endogenously tagged Mmp lines that could have been used to validate the genetic manipulations.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this work, Schuldiner and colleagues explore the role of Mmp1 and Mmp2 in neuronal remodeling in the mushroom body of Drosophila. Overall, this work is very interesting, but in its current form seems quite preliminary. The biggest limitation of the study is that single RNAi lines are used with no validation that the lines are working, despite the fact that Mmp antibodies are available as are endogenously tagged Mmp lines that could have been used to validate the genetic manipulations. Specific concerns are listed below.

      Major concerns

      1. The scoring system for pruning of mushroom body neurons seems very variable, even in controls (where scoring can range from very mild to moderate), and it is very hard to assess from the images what one is looking at (rather than using our own judgment, we rely on the authors' words). It would be necessary to have better labeling and examples of what phenotypes are considered "mild", "severe", "wild type-like". It would also help to understand how phenotype assessment is guided by the overlap between the signals from TdTomato fluorescence and FasII stain.
      2. The biggest limitations of the approach are that single RNAi lines are used to screen, with no accompanying validation of the tool (see above)
      3. RNAi-based knockdown is used to infer epistatic information-this is not appropriate as epistasis experiments need to be done with null alleles to make firm conclusions. Additional concerns:
        • Even with the same driver, knockdown efficiency for 2 different genes could be variable and dependent of the specific RNAi used.
        • The comparison between drivers is even harder, as driver strength varies greatly.
        • The knockdown efficiency drops with increasing numbers of RNAi used.
        • The specific genotypes used for this experiment should be clarified, as it would be very important to ensure that the UAS dosage is equal across conditions.
      4. To further deepen the rigor of this work, a few simple yet important things could have been done. First, it would be important to rule out that knocking down Mmps does not affect astrocyte numbers and health (could be assessed by counting numbers and observing their morphology). Also, the authors previously showed that astrocytes actively infiltrate the axon bundle prior to pruning to facilitate axon defasciculation and pruning (Marmor-Kollet et al., 2023). It would have provided an important insight to examine if astrocytes can infiltrate the axon bundle if Mmp2 and/or Mmp1 are knocked down.

      Minor

      The introduction puts big emphasis on the role of glia, but then to narrows down candidate genes for the screen a γ-KCs transcriptional data set is used, and the initial screen is done via knockdown of those candidates in neurons (there is a disconnect between rationale and approach).

      Rationale for looking into axon pruning and how that translates into insights about synaptic pruning defects in schizophrenia should be more clearly stated.

      Figure 1C: data visualization for this heat map should be improved. Parts of the data are faded, and the differences between gene clusters are unclear.

      Significance

      In this work, Schuldiner and colleagues explore the role of Mmp1 and Mmp2 in neuronal remodeling in the mushroom body of Drosophila. Overall, this work is very interesting, but in its current form seems quite preliminary. The biggest limitation of the study is that single RNAi lines are used with no validation that the lines are working, despite the fact that Mmp antibodies are available as are endogenously tagged Mmp lines that could have been used to validate the genetic manipulations.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Neuropsychiatric conditions are often influenced by genetic factors. Schizophrenia is a complex mental disorder characterised by a mixture of hallucinations, delusions and disorganised thinking that causes lifelong problems in daily life. GWAS have identified a number of genes associated with the risk of developing schizophrenia, although genetic predisposition alone is not sufficient and additional environmental factors are required.

      In the current manuscript, the authors aim to exploit the strength of the Drosophila system to explore a link between schizophrenia-associated genes and neuronal remodelling during development. They focus on the mushroom body in the adult brain, where pronounced neuronal remodelling occurs during metamorphosis. To assess the potential role of the genes identified by the GWAS, they performed a targeted RNAi-based screen. They focus on the role of metalloproteases and find that they are required in neurons and in glia for the pruning of mushroom body axons.

      The study starts with a selection of 32 genes, 29 of which are listed (a bit hidden) in materials and methods and the identification of the Drosophila orthologs. The expression patterns of these genes in Kenyon cells are presented in Figure 1 - but unfortunately no information is given on who is expressed when. In a next step, Kenyon cell specific RNAi knockdown experiments are shown that identify a pruning phenotype for several genes. They demonstrate that Mmp2 (and similarly Mmp1) is also required in glia. Although Mmp2 was identified by neuronal RNAi-based knockdown, double knockdown experiments led the authors conclude that its primary function is in glia.

      The study emphasises the use of the advanced genetic model to understand complex human diseases. However, the paper does not go far enough in making use of the excellent genetics available. Basically, the report is about the identification of a few hits in a small RNAi screen, which is fine in itself, but leaves many questions unanswered. Do mmp1/2 mutants have a phenotype? Can the phenotype be rescued? Does TIMP expression lead to similar phenotypes? What is the temporal requirement for Mmp1/2? What are the target proteins of Mmp2? Is Mmp2 still required when astrocyte motility is blocked? What is the morphology of glia after Mmp1/2 knockdown?

      Significance

      The strength of the study is to identify a pruning phenotype after RNAi-based knockdown. The limitations is that this study is very superficial, it is the beginning of a paper. The initial claim to use Drosophila because to its advanced genetics is not met. The results section is shorter than the discussion.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This study reports a role for matrix metalloproteinases (MMPs) in the developmental pruning of gamma Kenyon cells (KCs) in the fruit fly Mushroom Body during larval-pupal metamorphosis. The authors show through gene expression studies that MMP genes are upregulated in late larval stages as part of the early program for this type of neuronal pruning. They show through cell-targeted RNAi studies of both secreted MMP-1 and membrane-anchored MMP-2, that both genes are required in glial cells and to a lesser extent within KCs. The authors show that MMP secreted from glial is required for normal levels of Mushroom Body developmental neuronal pruning. They mention that MMP genes have been identified in schizophrenic patient screens in patients, and that perhaps a comparable pruning mechanism could be involved in the loss of grey matter (loss of synapses) in patients. The authors propose that MMP levels may be a potential therapeutic marker in the future.

      Major Comments:

      Overall, the work is of a reasonable standard, but very preliminary. The study lacks the substance to completely convince me of any of the results. There is SUBSTANTIAL work that needs to be done to make this publishable. There are a lot of writing mistakes; so many that I do not list them in detail here. The references citations are fairly old, but I do not list update replacements here. The text is very brief, and the overall writing needs to include significantly more description and detail. This is evident in all aspects of the manuscript, but especially notable in the Methods and Figure Legends. None of the Figure Legends include full genotypes of any of the fly lines, and these full fly lines are also not included in the Methods. This is vital to compare the experimental lines to the controls. Major points are listed below:

      1. Figure 2: It is important to note of the specific age of animals in these images when talking about the loss of genes in development. Are all the animals age-matched? High levels of synaptic pruning occur post-eclosion), and it is important to understand when these pruning defects occur. It is mentioned that that overlap for the gene expression data is upregulated during 6-18h APF is this when these images are taken? This is very important in the context of pruning as SCZ symptom presentation is very late relative to these early events.
      2. Figure 2: In the figure legend, it is indicated that the arrows are unpruned axons, however in the controls these areas appear to be highly innervated. Further explanation is needed about the context of the arrows, as there are clear visual differences between these images and the controls, but they appear to have a more expansive phenotype than "unpruned axons". The data does not match the visual representation in comparison to the control.
      3. Figure 2: There needs to be more descriptive definitions and clarifications to the defects labeled in panel K. This could be done in the figure legend, but it would be more useful to label the images provided. For example, if Mmp2 is a "mild pruning affect, put that in the pie chart somewhere, to help guide the description of the phenotype to what those confocal images look like.
      4. Figure 3: The time points of the images of the Mushroom Body (MB) are vital to understanding the process and regulation of these genes.
      5. Figure 3D: Significant description of this graph needs to be added for clarity. What parameters separate each phenotypic defect? Labeling the images and showing images that belong in different groups would be very helpful and improve the paper significantly.
      6. Figure S1: Additional experiments would help answer the strength of the phenotype for the ALG-Gal 4 driver. The authors need to perform the rescue experiment. Use a MMP-2 null and then drive it back in the ALG-GAL4 to see if this is sufficient to rescue the neuron pruning. This also isolates the mechanisms to one subtype of glia.
      7. Figure 3 and 4: The other glial subtypes need to be analyze to make any conclusion about their involvement, as well as the involvement of the astrocytes. Running these exact same experiments on the cortex glial and ensheathing glia will provide essential insight into what glial subtype is involved. The presumed lack of phenotypes in these other glial subtypes will also strengthen the argument that the astrocytes are specifically involved in this process. These are vital experiments.
      8. Figure 4: Again, description of the phenotypes and examples of these would improve the quality of this figure substantially.
      9. Figure 5: An improvement on the quantifications of these phenotypes would strengthen the paper substantially. More detailed description of the phenotypes and how they related to the control would significantly improve the overall quality of the work.

      Minor Comments:

      1. Figure 1A: I would suggest labeling the KC (gamma) and potentially one of the others (a/B, a'/B') to orient the reader to the differences between these two subsets of the KCs, and to emphasize which neurons are undergoing pruning and where the cell bodies are and where the axons project.
      2. Figure 1C: This panel needs further labeling to explain the findings in the heat map. Labeling some of the genes that were found and where they were would be helpful. This could also be done in the figure legend, however without any further labeling or context the heatmap is confusing.
      3. Figure 3B,C: The full genotypes need to be labeled. What is the exact genotype used for the control?
      4. Figure S1: The stock number for the ALG-GAL4 is missing, there are multiple different drivers, therefore this could be helpful in understanding this phenotype, as some are better than others.
      5. Figures 3 and 4: Labeling needs to remain consistent; Figure 3 "Glia-Gal4", Figure 4 "glia-gal4".

      Significance

      General Assessment: An interesting study on MMP function during an unusual type of neural development (axon pruning). Most of the MMP function appears to be in glia, although the MMP role in this context in unclear. The MMP function in the neurons being pruned is unexpected and even less clear. The study is somewhat poorly described in terse language lacking essential information, which gives the overall impression of a preliminary report.

      Advance: Glial MMP function has been described for neuronal clearance mechanisms following injury. The main advance here is to describe a similar function during normal development.

      Audience: Developmental neuroscientists, MMP biologists, possibly schizophrenia clinician researchers

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for providing us the opportunity to revise our manuscript titled “Identifying regulators of associative learning using a protein-labelling approach in C. elegans.” We appreciate the insightful feedback that we received to improve this work. In response, we have extensively revised the manuscript with the following changes: we have (1) clarified the criteria used for selecting candidate genes for behavioural testing, presenting additional data from ‘strong’ hits identified in multiple biological replicates (now testing 26 candidates, previously 17), (2) expanded our discussion of the functional relevance of validated hits, including providing new tissue-specific and neuron class-specific analyses, and (3) improved the presentation of our data, including visualising networks identified in the ‘learning proteome’, to better highlight the significance of our findings. We also substantially revised the text to indicate our attempts to address limitations related to background noise in the proteomic data and outlined potential refinements for future studies. All revisions are clearly marked in the manuscript in red font. A detailed, point-by-point response to each comment is provided below.

      1. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Rahmani et al., utilize the TurboID method to characterize the global proteome changes in the worm's nervous system induced by a salt-based associative learning paradigm. Altogether, Rahmani et al., uncover 706 proteins that are tagged by the TurboID method specifically in samples extracted from worms that underwent the memory inducing protocol. Next, the authors conduct a gene enrichment analysis that implicates specific molecular pathways in salt-associative learning, such as MAP-kinase and cAMP-mediated pathways. The authors then screen a representative group of the hits from the proteome analysis. The authors find that mutants of candidate genes from the MAP-kinase pathway, namely dlk-1 and uev-3, do not affect the performance in the learning paradigm. Instead multiple acetylcholine signaling mutants significantly affected the performance in the associative memory assay, e.g., acc-1, acc-3, gar-1, and lgc-46. Finally, the authors demonstrate that the acetylcholine signaling mutants did not exhibit a phenotype in similar but different conditioning paradigms, such as aversive salt-conditioning or appetitive odor conditioning, suggesting their effect is specific to appetitive salt conditioning.

      Major comments:

      1. The statistical approach and analysis of the behavior assay: The authors use a 2-way ANOVA test which assumes normal distribution of the data. However, the chemotaxis index used in the study is bounded between -1 and 1, which prevents values near the boundaries to be normally distributed.

      Since most of the control data in this assay in this study is very close to 1, it strongly suggests that the CI data is not normally distributed and therefore 2-way ANOVA is expected to give skewed results.

      I am aware this is a common mistake and I also anticipate that most conclusions will still hold also under a more fitting statistical test.

      We appreciate the point raised by Reviewer 1 and understand the importance of performing the correct statistical tests.

      The statistical tests used in this study were chosen since parametric tests, particularly ANOVA tests to assess differences between multiple groups, are commonly used to assess behaviour in the C. elegans learning and memory field. Below is a summary of the tests used by studies that perform similar behavioural tests cited in this work, as examples:

      Table 1 | A summary for the statistical tests performed by similar studies for chemotaxis assay data. References (listed in the leftmost column) were observed to (A) use parametric tests only or (B) performed either a parametric or non-parametric test on each chemotaxis assay dataset depending on whether the data passed a normality test. Listings for ANOVA tests are in bold to demonstrate their common use in the C. elegans learning and memory field.

      Reference

      Parametric test/s used in the reference

      Non-parametric test/s used in the reference

      Beets et al., 2020

      Two-way ANOVA

      None

      Hiroki & Iino 2022

      One-way ANOVA

      None

      Hiroki et al., 2022

      One-way ANOVA

      None

      Hukema et al., 2006

      T-tests

      None

      Hukema et al., Learn. Mem. 2008

      T-tests

      None

      Jang et al., 2019

      ANOVA

      None

      Kitazono et al., 2017

      Two-way ANOVA and t-tests

      None

      Lans et al., 2004

      One-way ANOVA

      None

      Lim et al., 2018

      Two-way ANOVA

      Wilcoxon rank sum test adjusted with the Benjamini–Hochberg method

      Lin et al., 2010

      Two-way or three-way ANOVA

      None

      Nagashima et al., 2019

      One-way ANOVA

      None

      Ohno et al., 2014

      None

      Sakai et al., 2017

      One-way ANOVA or t-tests

      None

      Stein & Murphy 2014

      Two-way ANOVA and t-tests

      None

      Tang et al., 2023

      One-way ANOVA or t-tests

      None

      Tomioka et al., 2006

      T tests

      None

      Watteyne et al., 2020

      One-way ANOVA

      Two-sided Kruskal–Wallis

      We note Reviewer 1's concern that this may stem from a common mistake. As stated, Two-way ANOVA generally relies on normally distributed data. We used GraphPad Prism to perform the Shapiro-Wilk normality test on our chemotaxis assay data as it is generally appropriate for sample sizes Table 2 | Shapiro-Wilk normality test results for chemotaxis assay data in Figure S8C. Chemotaxis assay data was generated to assess salt associative learning capacity for wild-type (WT) versus lgc-46(-) mutant C. elegans. Three experimental groups were prepared for each C. elegans strain (naïve, high-salt control, and trained). From top-to-bottom, the data below displays the ‘W’ value, ‘P value’, a binary yes/no for whether the data passes the Shapiro-Wilk normality test, and a ‘P value summary’ (ns = non-significant). W values measure the similarity between a normal distribution and the chemotaxis assay data. Data is considered normal in the Shapiro-Wilk normality test when a W value is near 1.0 and the null hypothesis is not rejected (i.e., P value > 0.05).*

      WT naïve

      WT high-salt control

      WT trained

      lgc-46 naïve

      lgc-46 high-salt control

      lgc-46 trained

      W

      0.9196

      0.9114

      0.8926

      0.8334

      0.8151

      0.8769

      P value

      0.5272

      0.4758

      0.3705

      0.1475

      0.1070

      0.2954

      Passed normality test (alpha=0.05)?

      Yes

      Yes

      Yes

      Yes

      Yes

      Yes

      P value summary

      ns

      ns

      ns

      ns

      ns

      ns

      The manuscript now includes the use of the Shapiro-Wilk normality test to assess chemotaxis assay data before using two-way ANOVA on page 51.

      Nevertheless an appropriate statistical analysis should be performed. Since I assume the authors would wish to take into consideration both the different conditions and biological repeats, I can suggest two options:

      • Using a Generalized linear mixed model, one can do with R software.
      • Using a custom bootstrapping approach. We thank Reviewer 1 for suggesting these two options. We carefully considered both approaches and consulted with the in-house statistician at our institution (Dr Pawel Skuza, Flinders University) for expert advice to guide our decision. In summary:

      • Generalised linear mixed models: Generalised linear mixed models (GLMMs) are generally most appropriate for nested/hierarchal data. However, our chemotaxis assay data does not exhibit such nesting. Each biological replicate (N) consists of three technical replicates, which are averaged to yield a single chemotaxis index per N. Our statistical comparisons are based solely on these averaged values across experimental groups, making GLMMs less applicable in this context.

      • __Bootstrapping: __Based on advice from our statistician, while bootstrapping can be a powerful tool, its effectiveness is limited when applied to datasets with a low number of biological replicates (N). Bootstrapping relies on resampling existing data to simulate additional observations, which may artificially inflate statistical power and potentially suggest significance where the biological effect size is minimal or not meaningful. Increasing the number of biological replicates to accommodate bootstrapping could introduce additional variability and compromise the interpretability of the results. The total number of assays, especially controls, varies quite a bit between the tested mutants. For example compare the acc-1 experiment in Figure 4.A., and gap-1 or rho-1 in Figure S4.A and D. It is hard to know the exact N of the controls, but I assume that for example, lowering the wild type control of acc-1 to equivalent to gap-1 would have made it non significant. Perhaps the best approach would be to conduct a power analysis, to know what N should be acquired for all samples.

      We thoroughly evaluated performing the power analysis: however, this is typically performed with the assumption that an N = 1 represents a singular individual/person. An N =1 in this study is one biological replicate that includes hundreds of worms, which is why it is not typically employed in our field for this type of behavioural test.

      Considering these factors, we have opted to continue using a two-way ANOVA for our statistical analysis. This choice aligns with recent publications that employ similar experimental designs and data structures. Crucially, we have verified that our data meet the assumptions of normality, addressing key concerns regarding the suitability of parametric testing. We believe this approach is sufficiently rigorous to support our main conclusions. This rationale is now outlined on page 51.

      To be fully transparent, our aim is to present differences between wild-type and mutant strains that are clearly visible in the graphical data, such that the choice of statistical test does not become a limiting factor in interpreting biological relevance. We hope this rationale is understandable, and we sincerely appreciate the reviewer’s comment and the opportunity to clarify our analytical approach.

      We hope that Reviewer 1 will appreciate these considerations as sufficient justification to retain the statistical tests used in the original manuscript. Nevertheless, to constructively address this comment, we have performed the following revisions:

      1. __Consistent number of biological replicates: __We performed additional biological replicates of the learning assay to confirm the behavioural phenotypes for the key candidates described (KIN-2 , F46H5.3, ACC-1, ACC-3, LGC-46). We chose N = 5 since most studies cited in this paper that perform similar behavioural tests do the same (see the table below). Table 3 | A summary for sample sizes generated by similar studies for chemotaxis assay data. References (listed in the leftmost column) were observed to the sample sizes (N) below corresponding to biological replicates of chemotaxis assay data. N values are in bold when the study uses N ≤ 5.

      Reference

      N used in the study for chemotaxis assay data

      Beets et al., 2020

      8

      Hiroki & Iino 2022

      5-8

      Hiroki et al., 2022

      6-7

      Hukema et al., 2006

      ≥ 4

      Hukema et al., Learn. Mem. 2008

      ≥ 4

      Jang et al., 2019

      ≥ 4

      Kitazono et al., 2017

      ≥ 4

      Kauffman et al., 2010

      ≥ 3

      Kauffman et al., J. Vis. Exp. 2011

      ≥ 3

      Lans et al., 2004

      2

      Lim et al., 2018

      2-4

      Lin et al., 2010

      ≥ 4

      Nagashima et al., 2019

      ≥ 7

      Ohno et al., 2014

      ≥ 11

      Sakai et al., 2017

      ≥ 4

      Stein & Murphy 2014

      3-5

      Tang et al., 2023

      ≥ 9

      Watteyne et al., 2020

      ≥ 10

      __Grouped presentation of behavioural data: __We now present all behavioural data by grouping genotypes tested within the same biological replicate, including wild-type controls, rather than combining genotypes tested separately. This ensures that each graph displays data from genotypes sharing the same N, also an important consideration for performing parametric tests. Accordingly, we re-performed statistical analyses using this reduced Nfor relevant graphs. As anticipated, this rendered some comparisons non-significant. All statistical comparisons are clearly indicated on each graph. Improved clarity of figure legends: __We revised figure legends for __Figures 5, 6, S7, S8, & S9 to make clear how many biological replicates have been performed for each genotype by adding N numbers for each genotype in all figures.

      The authors use the phrasing "a non-significant trend", I find such claims uninterpretable and should be avoided. Examples: Page 16. Line 7 and Page 18, line 16.

      This is an important point. While we were not able to find the specific phrasing "a non-significant trend" from this comment in the original manuscript, we acknowledge that referring to a phenotype as both a trend and non-significant may confuse readers, which was originally stated in the manuscript in two locations.

      The main text has been revised on pages 27 & 28 when describing comparisons between trained groups between two C. elegans lines, by removing mentions of trends and retaining descriptions of non-significance.

      Neuron-specific analysis and rescue of mutants:

      Throughout the study the authors avoid focusing on specific neurons. This is understandable as the authors aim at a systems biology approach, however, in my view this limits the impact of the study. I am aware that the proteome changes analyzed in this study were extracted from a pan neuronally expressed TurboID. Yet, neuron-specific changes may nevertheless be found. For example, running the protein lists from Table S2, in the Gene enrichment tool of wormbase, I found, across several biological replicates, enrichment for the NSM, CAN and RIG neurons. A more careful analysis may uncover specific neurons that take part in this associative memory paradigm. In addition, analysis of the overlap in expression of the final gene list in different neurons, comparing them, looking for overlap and connectivity, would also help to direct towards specific circuits.

      This is an important and useful suggestion. We appreciate the benefit in exploring the data from this study from a neuron class-specific lens, in addition to the systems-level analyses already presented.

      The WormBase gene enrichment tool is indeed valuable for broad transcriptomic analyses (the findings from utilising this tool are now on page 16); however, its use of Anatomy Ontology (AO) terms also contains annotations from more abundant non-neuronal tissues in the worm. To strengthen our analysis and complement the Wormbase tool, we also used the CeNGEN database as suggested by Reviewer 3 Major Comment 1 (Taylor et al., 2021), which uses single cell RNA-Seq data to profile gene expression across the C. elegans nervous system. We input our learning proteome data into CeNGEN as a systemic analysis, identifying neurons highly represented by the learning proteome (on pages 16-20). To do this, we specifically compared genes/proteins from high-salt control worms and trained worms to identify potential neurons that may be involved in this learning paradigm. Briefly, we found:

      • WormBase gene enrichment tool: Enrichment for anatomy terms corresponding to specific interneurons (ADA, RIS, RIG), ventral nerve cord neurons, pharyngeal neurons (M1, M2, M5, I4), PVD sensory neurons, DD motor neurons, serotonergic NSM neurons, and CAN.
      • CeNGEN analysis: Representation of neurons previously implicated in associative learning (e.g., AVK interneurons, RIS interneurons, salt-sensing neuron ASEL, CEP & ADE dopaminergic neurons, and AIB interneurons), as well as neurons not previously studied in this context (pharyngeal neurons I3 & I6, polymodal neuron IL1, motor neuron DA9, and interneuron DVC). Methods are detailed on pages 50 & 51. These data are summarised in the revised manuscript as Table S7 & Figure 4.

      To further address the reviewer’s suggestion, we examined the overlap in expression patterns of the validated learning-associated genes acc-1, acc-3, lgc-46, kin-2, and F46H5.3 across the neuron classes above, using the CeNGEN database. This was done to explore potential neuron classes in which these regulators may act in to regulate learning. This analysis revealed both shared and distinct expression profiles, suggesting potential functional connectivity or co-regulation among subsets of neurons. To summarise, we found:

      • All five learning regulators are expressed in RIM interneurons and DB motor neurons.
      • KIN-2 and F46H5.3 share the same neuron expression profile and are present in many neurons, so they may play a general function within the nervous system to facilitate learning.
      • ACC-3 is expressed in three sensory neuron classes (ASE, CEP, & IL1).
      • In contrast, ACC-1 and LGC-46 are expressed in neuron classes (in brackets) implicated in gustatory or olfactory learning paradigms (AIB, AVK, NSM, RIG, & RIS) (Beets et al., 2012, Fadda et al., 2020, Wang et al., 2025, Zhou et al., 2023, Sato et al., 2021), neurons important for backward or forward locomotion (AVE, DA, DB, & VB) (Chalfie et al., 1985), and neuron classes for which their function is yet detailed in the literature (ADA, I4, M1, M2, & M5). These neurons form a potential neural circuit that may underlie this form of behavioural plasticity, which we now describe in the main text on pages 16-20 & 34-35 and summarise in Figure 4.

      OPTIONAL: A rescue of the phenotype of the mutants by re-expression of the gene is missing, this makes sure to avoid false-positive results coming from background mutations. For example, a pan neuronal or endogenous promoter rescue would help the authors to substantiate their claims, this can be done for the most promising genes. The ideal experiment would be a neuron-specific rescue but this can be saved for future works.

      We appreciate this suggestion and recognise its potential to strengthen our manuscript. In response, we made many attempts to generate pan-neuronal and endogenous promoter re-expression lines. However, we faced several technical issues in transgenic line generation, including poor survival following microinjection likely due to protein overexpression toxicity (e.g., C30G12.6, F46H5.3), and reduced animal viability for chemotaxis assays, potentially linked to transgene-related reproductive defects (e.g., ACC-1). As we have previously successfully generated dozens of transgenic lines in past work (e.g. Chew et al., Neuron 2018; Chew et al., Phil Trans B 2018; Gadenne/Chew et al., Life Science Alliance 2022), we believe the failure to produce most of these lines is not likely due to technical limitations. For transparency, these observations have been included in the discussion section of the manuscript on pages 39 & 40 as considerations for future troubleshooting.

      Fortunately, we were able to generate a pan-neuronal promoter line for KIN-2 that has been tested and included in the revised manuscript. This new data is shown in Figure 5B __and described on __pages 23 & 24. Briefly, this shows that pan-neuronal expression of KIN-2 from the ce179 mutant allele is sufficient to reproduce the enhanced learning phenotype observed in kin-2(ce179) animals, confirming the role of KIN-2 in gustatory learning.

      To address the potential involvement of background mutations (also indicated by Reviewer 4 under ‘cross-commenting’), we have also performed experiments with backcrossed versions of several mutants. These experiments aimed to confirm that salt associative learning phenotypes are due to the expected mutation. Namely, we assessed kin-2(ce179) mutants that had been backcrossed previously by another laboratory, as well as C30G12.6(-) and F46H5.3(-) animals backcrossed in this study. Although not all backcrossed mutants retained their original phenotype (i.e., C30G12.6) (Figure 6D, a newly added figure), we found that backcrossed versions of KIN-2 and F46H5.3 both robustly showed enhanced learning (Figures 5A & 6B). This is described in the text on pages 23-26.

      __Minor comments: __

      1. Lack of clarity regarding the validation of the biotin tagging of the proteome. The authors show in Figure 1 that they validated that the combination of the transgene and biotin allows them to find more biotin-tagged proteins. However there is significant biotin background also in control samples as is common for this method. The authors mention they validated biotin tagging of all their experiments, but it was unclear in the text whether they validated it in comparison to no-biotin controls, and checked for the fold change difference.

      This is an important point: We validated our biotin tagging method prior to mass spectrometry by comparing ‘no biotin’ and ‘biotin’ groups. This is shown in Figure S1 in the revised manuscript, which includes a western blot comparing untreated and biotin treated animals that are non-transgenic or expressing TurboID. As expected, by comparing biotinylated protein signal for untreated and treated lanes within each line, biotin treatment increased the signal 1.30-fold for non-transgenic and 1.70-fold for TurboID C. elegans. This is described on __page 8 __of the revised manuscript.

      To clarify, for mass spectrometry experiments, we tested a no-TurboID (non-transgenic) control, but did not perform a no-biotin control. We included the following four groups: (1) No-TurboID ‘control’ (2) No-TurboID ‘trained’, (3) pan-neuronal TurboID ‘control’ and (4) pan-neuronal TurboID ‘trained’, where trained versus control refers to whether ‘no salt’ was used as the conditioned stimulus or not, respectively (illustrated in Figure 1A). Due to the complexity of the learning assay (which involves multiple washes and handling steps, including a critical step where biotin is added during the conditioning period), and the need to collect sufficient numbers of worms for protein extraction (>3,000 worms per experimental group), adding ‘no-biotin’ controls would have doubled the number of experimental groups, which we considered unfeasible for practical reasons. This is explained on __pages 8 & 9 __of the revised manuscript.

      Also, it was unclear which exact samples were tested per replicate. In Page 9, Lines 17-18: "For all replicates, we determined that biotinylated proteins could be observed ...", But in Page 8, Line 24 : "We then isolated proteins from ... worms per group for both 'control' and 'trained' groups,... some of which were probed via western blotting to confirm the presence of biotinylated proteins".

      • Could the authors specify which samples were verified and clarify how?

      Thank you for pointing out these unclear statements: We have clarified the experimental groups used for mass spectrometry experiments as detailed in the response above on pages 8 &____ 9. In addition, western blots corresponding to each biological replicate of mass spectrometry data described in the main text on page 10 and have been added to the revised manuscript (as Figure S3). These western blots compare biotinylation signal for proteins extracted from (1) No-TurboID ‘control’ (2) No-TurboID ‘trained’, (3) pan-neuronal TurboID ‘control’ and (4) pan-neuronal TurboID ‘trained’. These blots function to confirm that there were biotinylated proteins in TurboID samples, before enrichment by streptavidin-mediated pull-down for mass spectrometry.

      OPTIONAL: include the fold changes of biotinylated proteins of all the ones that were tested. Similar to Figure 1.C.

      This is an excellent suggestion. As recommended by the reviewer, we have included fold-changes for biotinylated protein levels between high-salt control and trained groups (on pages 9 & 10 for replicate #1 and in __Table S2 __for replicates #2-5). This was done by measuring protein levels in whole lanes for each experimental group per biological replicate within western blots (__Figure 1C __for replicate #1 and __Figure S3 __for replicates #2-5) of protein samples generated for mass spectrometry (N = 5).

      Figure 2 does not add much to the reader, it can be summarized in the text, as the fraction of proteins enriched for specific cellular compartments.

      • I would suggest to remove Figure 2 (originally written as figure 3) to text, or transfer it to the supplementry material.

      As noted in cross-comment response to Reviewer 4, there were typos in the original figure references, we have corrected them above. Essentially, this comment is referring to Figure 2.

      We appreciate this feedback from Reviewer 1. We agree that the original __Figure 2 __functions as a visual summary from analysis of the learning proteome at the subcellular compartment level. However, it also serves to highlight the following:

      • Representation for neuron-specific GO terms is relatively low, but even this small percentage represents entire protein-protein networks that are biologically meaningful, but that are difficult to adequately describe in the main text.
      • TurboID was expressed in neurons so this figure supports the relevance of the identified proteome to biological learning mechanisms.
      • Many of these candidates could not be assessed by learning assay using single mutants since related mutations are lethal or substantially affect locomotion. These networks therefore highlight the benefit in using strategies like TurboID to study learning. We have chosen to retain this figure, moving it to the supplementary material as Figure S4 in the revised manuscript, as suggested.

      • OPTIONAL- I would suggest the authors to mark in a pathway summary figure similar to Figure 3 (originally written as Figure 4) the results from the behavior assay of the genetic screen. This would allow the reader to better get the bigger picture and to connect to the systemic approach taken in Figures 2 and 3.

      We think this is a fantastic suggestion and thank Reviewer 1 for this input. In the revised manuscript, we have added Figure 7, which summarises the tested candidates that displayed an effect on learning, mapped onto potential molecular pathways derived from networks in the learning proteome. This figure provides a visual framework linking the behavioural outcomes to the network context. This is described in the main text on pages 32-33.

      Typo in Figure 3: the circle of PPM1: The blue right circle half is bigger than the left one.

      We thank the Reviewer for noticing this, the node size for PPM-1.A has been corrected in what is now Figure 2 in the revised work.

      Unclarity in the discussions. In the discussion Page 24, Line 14, the authors raise this question: "why are the proteins we identified not general learning regulators?. The phrasing and logic of the argumentation of the possible answers was hard to follow. - Can you clarify?

      We appreciate this feedback in terms of unclarity, as we strive to explain the data as clearly and transparently as possible. Our goal in this paragraph was to discuss why some candidates were seen to only affect salt associative learning, as opposed to showing effects in multiple learning paradigms (i.e., which we were defining as a ‘general learning regulator’). We have adjusted the wording in several places in this paragraph now on pages 36 & 37 to address this comment. We hope the rephrased paragraph provides sufficient rationalisation for the discussion regarding our selection strategy used to isolate our protein list of potential learning regulators, and its potential limitations.

      ***Cross-Commenting** *

      Firstly, we would like to express our appreciation for the opportunity for reviewers to cross-comment on feedback from other reviewers. We believe this is an excellent feature of the peer review process, and we are grateful to the reviewers for their thoughtful engagement and collaborative input.

      I would like to thank Reviewer #4 for the great cross comment summary, I find it accurate and helpful.

      I also would like to thank Reviewer #4 for spotting the typos in my minor comments, their page and figure numbers are the correct ones.

      We have corrected these typos in the relevant comments, and have responded to them accordingly.

      Small comment on common point 1 - My feeling is that it is challanging to do quantitative mass spectrometry, especially with TurboID. In general, the nature of MS data is that it hints towards a direction but a followup validation work is required in order to assess it. For example, I am not surprised that the fraction of repeats a hit appeared in does not predict well whether this hit would be validated behavioraly. Given these limitations, I find the authors' approach reasonable.

      We thank Reviewer 1 for this positive and thoughtful feedback. We also appreciate Reviewer 4’s comment regarding quantitative mass spectrometry and have addressed this in detail below (see response to Reviewer 4). However, we agree with Reviewer 1 that there are practical challenges to performing quantitative mass spectrometry with TurboID, primarily due to the enrichment for biotinylated proteins that is a key feature of the sample preparation process.

      Importantly, we whole-heartedly agree with Reviewer 1’s statement that “In general, the nature of MS data is that it hints towards a direction but a follow-up validation work is required in order to assess it”. This is the core of our approach: however, we appreciate that there are limitations to a qualitative ‘absent/present’ approach. We have addressed some of these limitations by clarifying the criteria used for selecting candidate genes, based additionally on the presence of the candidate in multiple biological replicates (categorised as ‘strong’ hits). Based on this method, we were able to validate the role of several novel learning regulators (Figures 5, 6, & S7). We sincerely hope that this manuscript can function as a direction for future research, as suggested by this Reviewer.

      I also would like to highlight this major comment from reviewer 4:

      "In Experimental Procedures, authors state that they excluded data in which naive or control groups showed average CI 0.5499 for N2 (page 36, lines 5-7). "

      This threshold seems arbitrary to me too, and it requires the clarifications requested by reviewer 4.

      As detailed in our response to Reviewer 4, Major Comment 2, data were excluded only in rare cases, specifically when N2 worms failed to show strong salt attraction prior to training, or when trained N2 worms did not exhibit the expected behavioural difference compared to untrained controls – this can largely be attributed to clear contamination or over-population issues, which are visible prior to assessing CTX plates and counting chemotaxis indices.

      These criteria were initially established to provide an objective threshold for excluding biological replicates, particularly when planning to assay a large number of genetic mutants. However, after extensive testing across many replicates, we found that N2 worms (that were not starved, or not contaminated) consistently displayed the expected phenotype, rendering these thresholds unnecessary. We acknowledge that emphasizing these criteria may have been misleading, and have therefore removed them from page 50 in the revised manuscript to avoid confusion and ensure clarity.

      Reviewer #1 (Significance (Required)):

      This study does a great job to effectively utilize the TurboID technique to identify new pathways implicated in salt-associative learning in C. elegans. This technique was used in C. elegans before, but not in this context. The salt-associative memory induced proteome list is a valuable resource that will help future studies on associative memory in worms. Some of the implicated molecular pathways were found before to be involved in memory in worms like cAMP, as correctly referenced in the manuscript. The implication of the acetylcholine pathway is novel for C. elgeans, to the best of my knowledge. The finding that the uncovered genes are specifically required for salt associative memory and not for other memory assays is also interesting.

      However overall I find the impact of this study limited. The premise of this work is to use the Turbo-ID method to conduct a systems analysis of the proteomic changes. The work starts by conducting network analysis and gene enrichment which fit a systemic approach. However, since the authors find that ~30% of the tested hits affect the phenotype, and since only 17/706 proteins were assessed, it is challenging to draw conclusive broad systemic claims. Alternatively, the authors could have focused on the positive hits, and understand them better, find the specific circuits where these genes act. This could have increased the impact of the work. Since neither of these two options are satisfied, I view this work as solid, but not wide in its impact and therefore estimate the audience of this study would be more specialized.

      My expertise is in C. elegans behavior, genetics, and neuronal activity, programming and machine learning.

      We thank the Reviewer for these comments and appreciate the recognition of the value of the proteomic dataset and the identification of novel molecular pathways, including the acetylcholine pathway, as well as the specificity of the uncovered genes to salt-associative memory.

      Regarding the reviewer’s concern about the overall impact and scope of the study, we respectfully offer the following clarification. Our aim was to establish a systems-level approach for investigating learning-related proteomic changes using TurboID, and we acknowledge that only a subset of the identified proteins was experimentally tested (now 26/706 proteins in the revised manuscript). Although only five of the tested single gene mutants showed a robust learning phenotype in the revised work (after backcrossing, more stringent candidate selection, improved statistical analysis in addressing reviewer comments), our proteomic data provides us a unique opportunity to define these candidates within protein-protein networks (as illustrated in Figure 7). Importantly, our functional testing focused on single-gene mutants, which may not reveal phenotypes for genes that act redundantly (now mentioned on pages 28-30). This limitation is inherent to many genetic screens and highlights the value of our proteomic dataset, which enables the identification of broader protein-protein interaction networks and molecular pathways potentially involved in learning.

      To support this systems-level perspective, we have added Figure 7, which visually integrates the tested candidates into molecular pathways derived from the learning proteome for learning regulators KIN-2 and F46H5.3. We also emphasise more explicitly in the text (on pages 32-33) the value of our approach by highlighting the functional protein networks that can be derived from our proteomics dataset.

      We fully acknowledge that the use of TurboID across all neurons limits the resolution needed to pinpoint individual neuron contributions, and understand the benefit in further experiments to explore specific circuits. Many circuits required for salt sensing and salt-based learning are highly explored in the literature and defined explicitly (see Rahmani & Chew, 2021), so our intention was to complement the existing literature by exploring the protein-protein networks involved in learning, rather than on neuron-neuron connectivity. However, we recognise the benefit in integrating circuit-level analyses, given that our proteomic data suggests hundreds of candidates potentially involved in learning. While validating each of these candidates is beyond the scope of the current study, we have taken steps to suggest candidate neurons/circuits by incorporating tissue enrichment analyses and single-cell transcriptomic data (Table S7 & Figure 4). These additions highlight neuron classes of interest and suggest possible circuits relevant to learning.

      We hope this clarification helps convey the intended scope and contribution of our study. We also believe that the revisions made in response to Reviewer 1’s feedback have strengthened the manuscript and enhanced its significance within the field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      __Summary: __

      In this study by Rahmani in colleagues, the authors sought to define the "learning proteome" for a gustatory associative learning paradigm in C. elegans. Using a cytoplasmic TurboID expressed under the control of a pan-neuronal promoter, the authors labeled proteins during the training portion of the paradigm, followed by proteomics analysis. This approach revealed hundreds of proteins potentially involved in learning, which the authors describe using gene ontology and pathways analysis. The authors performed functional characterization of some of these genes for their requirement in learning using the same paradigm. They also compared the requirement for these genes across various learning paradigms, and found that most hits they characterized appear to be specifically required for the training paradigm used for generating the "learning proteome".

      Major Comments:

      1. The definition of a "hit" from the TurboID approach is does not appear stringent enough. According to the manuscript, a hit was defined as one unique peptide detected in a single biological replicate (out of 5), which could give rise to false positives. In figure S2, it is clear that there relatively little overlap between samples with regards to proteins detected between replicates, and while perhaps unintentional, presenting a single unique peptide appears to be an attempt to inflate the number of hits. Defining hits as present in more than one sample would be more rigorous. Changing the definition of hits would only require the time to re-list genes and change data presented in the manuscript accordingly. We thank Reviewer 2 for this valuable comment, and the following related suggestion. We agree with the statement that “Defining hits as present in more than one sample would be more rigorous”. Therefore, to address this comment, we have now separated candidates into two categories in Table 2 __in the revised manuscript: ‘__strong’ (present in 3 or more biological replicates) and ‘weak’ candidates (present in 2 or fewer biological replicates). However, we think these weaker candidates should still be included in the manuscript, considering we did observe relationships between these proteins and learning. For example, ACC-1, which influences salt associative learning in C. elegans, was detected in one replicate of mass spectrometry as a potential learning regulator (Figure S8A). We describe this classification in the main text on pages 21-22.

      We also agree with Reviewer 2 that the overlap between individual candidate hits is low between biological replicates; the inclusion of Figure S2 __in the original manuscript serves to highlight this limitation. However, it is also important to consider that there is notable overlap for whole molecular pathways between biological replicates of mass spectrometry data as shown in __Figure 2 __in the revised manuscript (this consideration is now mentioned on __pages 13-14). We have included Figure 3 to illustrate representation for two metabolic processes across several biological replicates normally indispensable to animal health, as an example to provide additional visual aid for the overlap between replicates of mass spectrometry. We provide this figure (described on pages 13 & 15) to demonstrate the strength of our approach in that it can detect candidates not easily assessable by conventional forward or reverse genetic screens.

      We also appreciate the opportunity to explain our approach. The criteria of “at least one unique peptide” was chosen based on a previous work for which we adapted for this manuscript (Prikas et al., 2020). It was not intended to inflate the number of hits but rather to ensure sensitivity in detecting low-abundance neuronal proteins. We have clarified this in our Methods (page 46).

      The "hits" that the authors chose to functionally characterize do not seem like strong candidate hits based on the proteomics data that they generated. Indeed, most of the hits are present in a single, or at most 2, biological replicate. It is unclear as to why the strongest hits were not characterized, which if mutant strains are publicly available, would not be a difficult experiment to perform.

      We thank the reviewer for this important suggestion. To address this, we have described two molecular pathways with multiple components that appear in more than one biological replicate of mass spectrometry data in Figure 3 (main text on page 13). In addition, we have included __Figures 6 & S7 __where 9 additional single mutants corresponding to candidates in three or more biological replicates of mass spectrometry were tested for salt associative learning. Briefly, we found the following (number of replicates that a protein was unique to TurboID trained animals is in brackets):

      • Novel arginine kinase F46H5.3 (4 replicates) displays an effect in both salt associative learning and salt aversive learning in the same direction (Figures 6A, 6B, & S9A, pages 31-32 & 37-38).
      • Worms with a mutation for armadillo-domain protein C30G12.6 (3 replicates) only displayed an enhanced learning phenotype when non-backcrossed, not backcrossed. This suggests the enhanced learning phenotype was caused by a background mutation (Figure 6, pages 24-25).
      • We did not observe an effect on salt associative learning when assessing mutations for the ciliogenesis protein IFT-139 (5 replicates), guanyl nucleotide factors AEX-3 or TAG-52 (3 replicates), p38/MAPK pathway interactor FSN-1 (3 replicates), IGCAM/RIG-4 (3 replicates), and acetylcholine components ACR-2 (4 replicates) and ELP-1 (3 replicates) (Figure S7, on pages 27-30). However, we note throughout the section for which these candidates are described that only single gene mutants were tested, meaning that genes that function in redundant or compensatory pathways may not exhibit a detectable phenotype. Because of the lack of strong evidence that these are indeed proteins regulated in the context of learning based on proteomics, including evidence of changes in the proteins (by imaging expression changes of fluorescent reporters or a biochemical approach), would increase confidence that these hits are genuine.

      We thank Reviewer 2 for this suggestion – we agree that it would have been ideal to have additional evidence suggesting that changes in candidate protein levels are associated directly with learning. Ideally, we would have explored this aspect further; however, as outlined in response to Reviewer 1 Major Comment 2 (OPTIONAL), this was not feasible within the scope of the current study due to several practical challenges. Specifically, we attempted to generate pan-neuronal and endogenous promoter rescue lines for several candidates, but encountered significant challenges, including poor survival post-microinjection (likely due to protein overexpression toxicity) and reduced viability for behavioural assays, potentially linked to transgene-related reproductive defects. This information is now described on pages 39 & 40 of the revised work.

      To address these limitations, we performed additional behavioural experiments where possible. We successfully generated a pan-neuronal promoter line for kin-2, which was tested and included in the revised manuscript (Figure 5B, pages 30 & 31). In addition, to confirm that observed learning phenotypes were due to the expected mutations and not background effects, we conducted experiments using backcrossed versions of several mutant lines as suggested by Reviewer 4 Cross Comment 3 (Figure 6, pages 23-24 & 24-26). Briefly, this shows that pan-neuronal expression of KIN-2 from the ce179 mutant allele is sufficient to repeat the enhanced learning phenotype observed in backcrossed kin-2(ce179) animals, providing additional evidence that the identified hits are required for learning. We also confirmed that F46H5.3 modulates salt associative learning, given both non-backcrossed and backcrossed F46H5.3(-) mutants display a learning enhancement phenotype. The revised text now describes this data on the page numbers mentioned above.

      Minor Comments:

      1. The authors highlight that the proteins they discover seem to function uniquely in their gustatory associative paradigm, but this is not completely accurate. kin-2, which they characterize in figure 4, is required for positive butanone association (the authors even say as much in the manuscript) in Stein and Murphy, 2014. We appreciate this correction and thank the Reviewer for pointing this out. We have amended the wording appropriately on page 31 to clarify our meaning.

      2. “Although kin-2(ce179) mutants were not shown to impact salt aversive learning, they have been reported previously to display impaired intermediate-term memory (but intact learning and short-term memory) for butanone appetitive learning (Stein and Murphy, 2014).”*

      Reviewer #2 (Significance (Required)):

      • General Assessment: The approach used in this study is interesting and has the potential to further our knowledge about the molecular mechanisms of associative behaviors. Strengths of the study include the design with carefully thought out controls, and the premise of combining their proteomics with behavioral analysis to better understand the biological significance of their proteomics findings. However, the criteria for defining hits and prioritization of hits for behavioral characterizations were major wweaknesses of the paper.
      • Advance: There have been multiple transcriptomic studies in the worm looking at gene expression changes in the context of behavioral training (Lakhina et al., 2015, Freytag 2017). This study compliments and extends those studies, by examining how the proteome changes in a different training paradigm. This approach here could be employed for multiple different training paradigms, presenting a new technical advance for the field.
      • Audience: This paper would be of interest to the broader field of behavioral and molecular neuroscience. Though it uses an invertebrate system, many findings in the worm regarding learning and memory translate to higher organisms.
      • I am an expert in molecular and behavioral neuroscience in both vertebrate and invertebrate models, with experience in genetics and genomics approaches. We appreciate Reviewer 2’s thoughtful assessment and constructive feedback. In response to concerns regarding definition and prioritisation of hits, we have revised our approach as detailed above to place more consideration on ‘strong’ hits present in multiple biological replicates. We have also added new behavioural data for additional mutants that fall into this category (Figures 6 & S7). We hope these revisions strengthen our study and enhance its relevance to the behavioural/molecular neuroscience community.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      __Summary: __

      In the manuscript titled "Identifying regulators of associative learning using a protein-labelling approach in C. elegans" the authors attempted to generate a snapshot of the proteomic changes that happen in the C. elegans nervous system during learning and memory formation. They employed the TurboID-based protein labeling method to identify the proteins that are uniquely found in samples that underwent training to associate no-salt with food, and consequently exhibited lower attraction to high salt in a chemotaxis assay. Using this system they obtained a list of target proteins that included proteins represented in molecular pathways previously implicated in associative learning. The authors then further validated some of the hits from the assay by testing single gene mutants for effects on learning and memory formation.

      Major Comments:

      In the discussion section, the authors comment on the sources of "background noise" in their data and ways to improve the specificity. They provide some analysis on this aspect in Supplementary figure S2. However, a better visualization of non-specificity in the sample could be a GO analysis of tissue-specificity, and presented as a pie chart as in Figure 2A. Non-neuronal proteins such as MYO-2 or MYO-3 repeatedly show up on the "TurboID trained" lists in several biological replicates (Tables S2 and S3). If a major fraction of the proteins after subtraction of control lists are non-specific, that increases the likelihood that the "hits" observed are by chance. This analysis should be presented in one of the main figures as it is essential for the reader to gauge the reliability of the experiment.

      We agree with this assessment and thank Reviewer 3 for this constructive suggestion. In response, we have now incorporated a comprehensive tissue-specific analysis of the learning proteome in the revised manuscript. Using the single neuron RNA-Seq database CeNGEN, we identified the proportion of neuronal vs non-neuronal proteins from each biological replicate of mass spectrometry data. Specifically, we present Table 1 __on page 17 (which we originally intended to include in the manuscript, but inadvertently left out), which shows that 87-95% (i.e. a large majority) of proteins identified across replicates corresponded to genes detected in neurons, supporting that the TurboID enzyme was able to target the neuronal proteome as expected. __Table 1 is now described in the main text of the revised work on page 16.

      In addition, we performed neuron-specific analyses using both the WormBase gene enrichment tool and the CeNGEN single-cell transcriptomic database, which we describe in detail on our response to Reviewer 1 Major Comment 2. To summarise, these analyses revealed enrichment of several neuron classes, including those previously implicated in associative learning (e.g., ASEL, AIB, RIS, AVK) as well as neurons not previously studied in this context (e.g., IL1, DA9, DVC) (summarised in Table S7). By examining expression overlap across neuron types, we identified shared and distinct profiles that suggest potential functional connectivity and candidate circuits underlying behavioural plasticity (Figure 4). Taken together, these data show that the proteins identified in our dataset are (1) neuronal and (2) expressed in neurons that are known to be required for learning. Methods are detailed on pages 50-51.

      Other than the above, the authors have provided sufficient details in their experimental and analysis procedures. They have performed appropriate controls, and their data has sufficient biological and technical replaictes for statistical analysis.

      We appreciate this positive feedback and thank the Reviewer for acknowledging the clarity of our experimental and analysis procedures.

      Minor Comments:

      There is an error in the first paragraph of the discussion, in the sentences discussing the learning effects in gar-1 mutant worms. The sentences in lines 12-16 on page 22 says that gar-1 mutants have improved salt-associative learning and defective salt-aversive learning, while in fact the data and figures state the opposite.

      We appreciate the Reviewer noting this discrepancy. As clarified in our response to Reviewer 1, Major Comment 1 above, we reanalysed the behavioural data to ensure consistency across genotypes by comparing only those tested within the same biological replicates (thus having the same N for all genotypes). Upon this reanalysis, we found that the previously reported phenotype for gar-1 mutants in salt-associative learning was not statistically different from wild-type controls. Therefore, we have removed references to GAR-1 from the manuscript.

      __Reviewer #3 (Significance (Required)): __Strengths and limitations: This study used neuron-specific TurboID expression with transient biotin exposure to capture a temporally restricted snapshot of the C. elegans nervous system proteome during salt-associative learning. This is an elegant method to identify proteins temporally specific to a certain condition. However, there are several limitations in the way the experiments and analyses were performed which affect the reliability of the data. As the authors themselves have noted in the discussion, background noise is a major issue and several steps could be taken to improve the noise at the experimental or analysis steps (use of integrated C. elegans lines to ensure uniformity of samples, flow cytometry to isolate neurons, quantitative mass spec to detect fold change vs. strict presence/absence). Advance: Several studies have demonstrated the use of proximity labeling to map the interactome by using a bait protein fusion. In fact, expressing TurboID not fused to a bait protein is often used as a negative control in proximity labeling experiments. However, this study demonstrates the use of free TurboID molecules to acquire a global snapshot of the proteome under a given condition. Audience: Even with the significant limitations, this study is specifically of interest to researchers interested in understanding learning and memory formation. Broadly, the methods used in this study could be modified to gain insights into the proteomic profiles at other transient developmental stages. The reviewer's field of expertise: Cell biology of C. elegans neurons.

      We thank the reviewer for their thoughtful evaluation of our work. We appreciate the recognition of the novelty and potential of using neuron-specific TurboID to capture a temporally restricted snapshot of the C. elegans nervous system proteome during learning. We agree that this approach offers a unique opportunity to identify proteins associated with specific behavioural states in future studies.

      We also appreciate the reviewer’s comments regarding limitations in experimental and analytical design. In revising the manuscript, we have taken several steps to address these concerns and improve the clarity, rigour, and interpretability of our data. Specifically:

      • We now provide a frequency-based representation of proteomic hits (Table 2), which helps clarify how candidate proteins were selected and highlights differences between trained and control groups.
      • We have added neuron-specific enrichment analyses using both WormBase and CenGEN databases (Table S7 & Figure 4), which help identify candidate neurons and potential circuits involved in learning (methods on pages 50-51).
      • We have clarified the rationale for using qualitative proteomics in the context of TurboID, in addition to acknowledging the challenges of integrating quantitative mass spectrometry with biotin-based enrichment (page 39). Additional methods for improving sample purity, such as using integrated lines or FACS-enrichment of neurons, could further refine this approach in future studies. For transparency, we did attempt to integrate the TurboID transgenic line to improve the strength and consistency of biotinylation signals. However, despite four rounds of backcrossing, this line exhibited unexpected phenotypes, including a failure to respond reliably to the established training protocol. As a result, we were unable to include it in the current study. Nonetheless, we believe our current approach provides a valuable proof-of-concept and lays the groundwork for future refinement. By addressing the major concerns of peer reviewers, we believe our study makes a significant and impactful contribution by demonstrating the feasibility of using TurboID to capture learning-induced proteomic changes in the nervous system. The identification of novel learning-related mutants, including those involved in acetylcholine signalling and cAMP pathways, provides new directions for future research into the molecular and circuit-level mechanisms of behavioural plasticity.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, authors used a learning paradigm in C. elegans; when worms were fed in a saltless plate, its chemotaxis to salt is greatly reduced. To identify learning-related proteins, authors employed nervous system-specific transcriptome analysis to compare whole proteins in neurons between high-salt-fed animals and saltless-fed animals. Authors identified "learning-specific genes" which are observed only after saltless feeding. They categorized these proteins by GO analyses and pathway analyses, and further stepped forward to test mutants in selected genes identified by the proteome analysis. They find several mutants that are defective or hyper-proficient for learning, including acc-1/3 and lgc-46 acetylcholine receptors, gar-1 acetylcholine receptor GPCR, glna-3 glutaminase involved in glutamate biosynthesis, and kin-2, a cAMP pathway gene. These mutants were not previously reported to have abnormality in the learning paradigm.

      Major comments:

      1) There are problems in the data processing and presentation of the proteomics data in the current manuscript which deteriorates the utility of the data. First, as the authors discuss (page 24, lines 5-12), the current approach does not consider amount of the peptides. Authors state that their current approach is "conservative", because some of the proteins may be present in both control and learned samples but in different amounts. This reviewer has a concern in the opposite way: some of the identified proteins may be pseudo-positive artifacts caused by the analytical noise. The problem is that authors included peptides that are "present" in "TurboID, trained" sample but "absent" in the "Non-Tg, trained" and "TurboID, control" samples in any one of the biological replicates, to identify "learning proteome" (706 proteins, page 8, last line - page 9, line 8; page 32, line 21-22). The word "present" implies that they included even peptides whose amounts are just above the detection threshold, which is subject to random noise caused by the detector or during sample collection and preparation processes. This consideration is partly supported by the fact that only a small fraction of the proteins are common between biological replicates (honestly and respectably shown in Figure S2). Because of this problem, there is no statistical estimate of the identity in "learning proteome" in the current manuscript. Therefore, the presentation style in Tables S2 and S3 are not very useful for readers, especially because authors already subtracted proteins identified in Non-Tg samples, which must also suffer from stochastic noise. I suggest either quantifying the MS/MS signal, or if authors need to stick to the "present"/"absent" description of the MS/MS data, use the number of appearances in biological replicates of each protein as estimate of the quantity of each protein. For example, found in 2 replicates in "TurboID, learned" and in 0 replicates in "Non-Tg, trained". One can apply statistics to these counts. This said, I would like to stress that proteins related to acquisition of memory may be very rare, especially because learning-related changes likely occur in a small subset of neurons. Therefore, 1 time vs 0 time may be still important, as well as something like 5 times vs 1 time. In summary, quantitative description of the proteomics results is desired.

      We thank the reviewer for these valuable comments and suggestions.

      We acknowledge that quantitative proteomics would provide beneficial information; however, as also indicated by Reviewer 1 (in cross-comment), it is practically challenging to perform with TurboID. We have included discussion of potential future experiments involving quantitative mass spectrometry, as well as a comprehensive discussion of some of the limitations of our approach as summarised by this Reviewer, in the Discussion section (page 39). However, we note that our qualitative approach also provides beneficial knowledge, such as the identification of functional protein networks acting within biological pathways previously implicated in learning (Figure 2), and novel learning regulators ACC-1/3, LGC-46, and F46H5.3.

      We agree with the assessment that the frequency of occurrence for each candidate we test per biological replicate is useful to disclose in the manuscript as a proxy for quantification. This was also highlighted by Reviewer 2 (Major Comment 1). As detailed above in response to R2, we have now separated candidates into two categories: ‘strong’ (present in 3 or more biological replicates) and ‘weak’ candidates (present in 2 or fewer biological replicates). We have also added behavioural data after testing 9 of these strong candidates in Figures 6 & S7.

      We have also added Table 2 to the revised manuscript, which summarises the frequency-based representation of the proteomics results, as suggested. This is described on pages 22-23. Briefly, this shows the range of candidates further explored using single mutant testing. Specifically, this data showed that many of the tested candidates were more frequently detected in trained worms compared to high-salt controls. This includes both strong and weak candidates, providing a clearer view of how proteomic frequency informed our selection for functional testing.

      2) There is another problem in the treatment of the behavioural data. In Experimental Procedures, authors state that they excluded data in which naive or control groups showed average CI 0.5499 for N2 (page 36, lines 5-7). How were these values determined? One common example for judging a data point as an outlier is > mean + 1.5, 2 or 3 SD, or Thank you for pointing this out. As mentioned by both Reviewer 1 and Reviewer 4, the original manuscript states the following: “Data was excluded for salt associative learning experiments when wild-type N2 displayed (1) an average CI ≤ 0.6499 for naïve or control groups and/or (2) an average CI either 0.5499 for trained groups.”

      To clarify, we only excluded experiments in rare cases where N2 worms did not display robust high salt attraction before training, or where trained N2 did not display the expected behavioural difference compared to untrained or high-salt control N2. These anomalies were typically attributable to clear contamination or starvation issues that could clearly be observed prior to counting chemotaxis indices on CTX plates.

      We established these exclusion criteria in advance of conducting multiple learning assays to ensure an objective threshold for identifying and excluding assays affected by these rare but observable issues. However, these criteria were later found to be unnecessary, as N2 worms robustly displayed the expected untrained and trained phenotypes for salt associative learning when not compromised by starvation or contamination.

      We understand that the original criteria may have appeared to introduce arbitrary bias in data selection. To address this concern, we have removed these criteria from the revised manuscript from page 50.

      Minor comments:

      1) Related to Major comments 1), the successful effect of neuron-specific TurboID procedure was not evaluated. Authors obtained both TurboID and Non-Tg proteome data. Do they see enrichment of neuron-specific proteins? This can be easily tested, for example by using the list of neuron-specific genes by Kaletsky et al. (http://dx.doi.org/10.1038/nature16483 or http://dx.doi.org/10.1371/journal.pgen.1007559), or referring to the CenGEN data.

      We thank this Reviewer for this helpful suggestion, which was echoed by Reviewer 3 (Major Comment 1). As indicated in the response to R3 above, the revised manuscript now includes Table 1 as a tissue-specific analysis of the learning proteome, using the single neuron RNA-Seq database CeNGEN to identify the proportion of neuronal proteins from each biological replicate of mass spectrometry data. Generally, we observed a range of 87-95% of proteins corresponded to genes from the CeNGEN database that had been detected in neurons, providing evidence that the TurboID enzyme was able to target the neuronal proteome as expected. Table 1 is now described in the main text of the revised work on pages 16 & 17.

      2) The behavioural paradigm needs to be described accurately. Page 5, line 16-17, "C. elegans normally have a mild attraction towards higher salt concentration": in fact, C. elegans raised on NGM plates, which include approximately 50mM of NaCl, is attracted to around 50mM of NaCl (Kunitomo et al., Luo et al.) but not 100-200 mM.

      We thank the Reviewer for pointing this out. We agree that clarification is necessary. The revised text reads as follows on page 5: “C. elegans are typically grown in the presence of salt (usually ~ 50 mM) and display an attraction toward this concentration when assayed for chemotaxis behaviour on a salt gradient (Kunitomo et al., 2013, Luo et al., 2014). Training/conditioning with ‘no salt + food’ partially attenuates this attraction (group referred to ‘trained’).”

      Authors call this assay "salt associative learning", which refers to the fact that worms associate salt concentration (CS) and either presence or absence of food (appetitive or aversive US) during conditioning (Kunitomo et al., Luo et al., Nagashima et al.) but they are looking at only association with presence of food, and for proteome analysis they only change the CS (NaCl concentration, as discussed in Discussion, p24, lines 4-5). It is better to attempt to avoid confusion to the readers in general.

      Thank you Reviewer 4 for highlighting this clarity issue. We clarify our definition of “salt associative learning” for the purpose of this study in the revised manuscript on page 6 with the following text:

      “Similar behavioural paradigms involving pairings between salt/no salt and food/no food have been previously described in the literature (Nagashima et al. 2019). Here, learning experiments were performed by conditioning worms with either ‘no salt + food’ (referred to as ‘salt associative learning’) or ‘salt + no food’ (called ‘salt aversive learning’).”

      3) page 32, line 23: the wording "excluding" is obscure and misleading because the elo-6 gene was included in the analysis.

      We appreciate this Reviewer for pointing out this misleading comment, which was unintentional. We have now removed it from the text (on page 21).

      4) Typo at page 24, line 18: "that ACC-1" -> "than ACC-1".

      This has been corrected (on page 37).

      5) Reference. In "LEO, T. H. T. et al.", given and sir names are flipped for all authors. Also, the paper has been formally published (http://dx.doi.org/10.1016/j.cub.2023.07.041).

      We appreciate the Reviewer drawing our attention to this – the reference has been corrected and updated.

      I would like to express my modest cross comments on the reviews:

      1) Many of the reviewers comment on the shortage in the quantitative nature of the proteome analysis, so it seems to be a consensus.

      Thank you Reviewer 4 for this feedback. We appreciate the benefit in performing quantitative mass spectrometry, in that it provides an additional way to parse molecular mechanisms in a biological process (e.g., fold-changes in protein expression induced by learning). However, we note that quantitative mass spectrometry is challenging to integrate with TurboID due to the requirement to enrich for biotinylated peptides during sample processing (we now mention this on page 39). Nevertheless, it would be exciting to see this approach performed in a future study.

      To address the limitations of our original qualitative approach and enhance the clarity and utility of our dataset, we have made the following revisions in the manuscript:

      • Candidate selection criteria: We now clearly define how candidates were selected for functional testing, based on their frequency across biological replicates. Specifically, “strong candidates” were detected in three or more replicates, while “weak candidates” appeared in two or fewer.
      • Frequency-based representation (_Table 2_):__We appreciate the suggestion by Reviewer 4 (Major Comment 1) to quantify differences between high-salt control and trained groups. We now provide the frequency-based representation of the candidates tested in this study within our proteomics data in __Table 2. This data showed that many of the tested candidates were more frequently detected in trained worms compared to high-salt controls. This includes both strong and weak candidates We hope these additions help clarify our approach and demonstrate the value of the dataset, even within the constraints of qualitative proteomics.

      2) Also, tissue- or cell-specificity of the identified proteins were commonly discussed. In reviewer #3's first Major comment, appearance of non-neuronal protein in the list was pointed out, which collaborate with my (#4 reviewer's) question on successful identification of neuronal proteins by this method. On the other hand, reviewer #1 pointed out subset neuron-specific proteins in the list. Obviously, these issues need to be systematically described by the authors.

      We agree with Reviewer 4 that these analyses provide a critical angle of analysis that is not explored in the original manuscript.

      Tissue analysis (Reviewer 3 Major Comment 1): We have used the single neuron RNA-Seq database CeNGEN, to identify that 87-95% (i.e. a large majority) of proteins identified across replicates corresponded to genes detected in neurons. These findings support that the TurboID enzyme was able to target the neuronal proteome as expected. Table 1 provides this information as is now described in the main text of the revised work on page 16.

      __Neuron class analyses (Reviewer 1 Major Comment 2): __In response, we have used the suggested Wormbase gene enrichment tool and CeNGEN. We specifically input proteins from the learning proteome into Wormbase, after filtering for proteins unique to TurboID trained animals. For CeNGEN, we compared genes/proteins from control worms and trained worms to identify potential neurons that may be involved in this learning paradigm.

      Briefly, we found highlight a range of neuron classes known in learning (e.g., RIS interneurons), cells that affect behaviour but have not been explored in learning (e.g., IL1 polymodal neurons), and neurons for which their function/s are unknown (e.g., pharyngeal neuron I3). Corresponding text for this new analysis has been added on pages 16-20, with a new table and figure added to illustrate these findings (Table S7 & Figure 4). Methods are detailed on pages 50-51.

      3) Given reviewer #1's OPTIONAL Major comment, as an expert of behavioral assays in C. elegans, I would like to comment based on my experience that mutants received from Caenorhabditis Genetics Center or other labs often lose the phenotype after outcrossing by the wild type, indicating that a side mutation was responsible for the observed behavioral phenotype. Therefore, outcrossing may be helpful and easier than rescue experiments, though the latter are of course more accurate.

      Thank you for this suggestion. To address the potential involvement of background mutations, we have done experiments with backcrossed versions of mutants tested where possible, as shown in Figure 6. We found that F46H5.3(-) mutants maintained enhanced learning capacity after backcrossing with wild type, compared to their non-backcrossed mutant line. This was in contrast to C30G12.6(-) animals which lost their enhanced learning phenotype following backcrossing using wild type worms. This is described in the text on pages 24-26.

      4) Just let me clarify the first Minor comment by reviewer #2. Authors described that the kin-2 mutant has abnormality in "salt associative learning" and "salt aversive learning", according to authors' terminology. In this comment by reviewer #2, "gustatory associative learning" probably refers to both of these assays.

      Reviewer 4 is correct. We have amended the wording appropriately on page 31 to clarify our meaning to address Reviewer 2’s comment.

      • “Although kin-2(ce179) mutants were not shown to impact salt aversive learning, they have been reported previously to display impaired intermediate-term memory (but intact learning and short-term memory) for butanone appetitive learning (Stein and Murphy, 2014).”*

      5) There seem to be several typos in reviewer #1's Minor comments.

      "In Page 9, Lines 17-18" -> "Page 8, Lines 17-18".

      "Page 8, Line 24" -> "Page 7, Line 24".

      "I would suggest to remove figure 3" -> "I would suggest to remove figure 2"

      "summary figure similar to Figure 4" -> "summary figure similar to Figure 3"

      "In the discussion Page 24, Line 14" -> "In the discussion Page 23, Line 14"

      (I note that because a top page was inserted in the "merged" file but not in art file for review, there is a shift between authors' page numbers and pdf page numbers in the former.)

      It would be nice if reviewer #1 can confirm on these because I might be wrong.

      We appreciate Reviewer 4 noting this, and can confirm that these are the correct references (as indicated by Reviewer 1 in their cross-comments)

      Reviewer #4 (Significance (Required)):

      1) Total neural proteome analysis has not been conducted before for learning-induced changes, though transcriptome analysis has been performed for odor learning (Lakhina et al., http://dx.doi.org/10.1016/j.neuron.2014.12.029). This guarantees the novelty of this manuscript, because for some genes, protein levels may change even though mRNA levels remain the same. We note an example in which a proteome analysis utilizing TurboID, though not the comparison between trained/control, has led to finding of learning related proteins (Hiroki et al., http://dx.doi.org/10.1038/s41467-022-30279-7). As described in the Major comments 1) in the previous section, improvement of data presentation will be necessary to substantiate this novelty.

      We appreciate this thoughtful feedback. We agree that while the neuronal transcriptome has been explored in Lakhina et al., 2015 for C. elegans in the context of memory, our study represents the first to examine learning-induced changes in the total neuronal proteome. We particularly agree with the statement that “for some genes, protein levels may change even though mRNA levels remain the same”. This is essential rationale that we now discuss on page 42.

      Additionally, we acknowledge the relevance of the study by Hiroki et al., 2022, which used TurboID to identify learning-related proteins, though not in a trained versus control comparison. Our work builds on this by directly comparing trained and control conditions, thereby offering new insights into the proteomic landscape of learning. This is now clarified on page 36.

      To substantiate the novelty and significance of our approach, we have revised the data presentation throughout the manuscript, including clearer candidate selection criteria, frequency-based representation of proteomic hits (Table 2), and neuron-specific enrichment analyses (Table S7 & Figure 4). We hope these improvements help convey the unique contribution of our study to the field.

      2) Authors found six mutants that have abnormality in the salt learning (Fig. 4). These genes have not been described to have the abnormality, providing novel knowledge to the readers, especially those who work on C. elegans behavioural plasticity. Especially, involvement of acetylcholine neurotransmission has not been addressed. Although site of action (neurons involved) has not been tested in this manuscript, it will open the venue to further determine the way in which acetylcholine receptors, cAMP pathway etc. influences the learning process.

      Thank you Reviewer 4, for this encouraging feedback. To further strengthen the study and expand its relevance, we have tested additional mutants in response to Reviewer 3’s comments, as shown in Figures 6 & S7. These results provide even more candidate genes and pathways for future exploration, enhancing the significance and impact of our study.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, authors used a learning paradigm in C. elegans; when worms were fed in a saltless plate, its chemotaxis to salt is greatly reduced. To identify learning-related proteins, authors employed nervous system-specific transcriptome analysis to compare whole proteins in neurons between high-salt-fed animals and saltless-fed animals. Authors identified "learning-specific genes" which are observed only after saltless feeding. They categorized these proteins by GO analyses and pathway analyses, and further stepped forward to test mutants in selected genes identified by the proteome analysis. They find several mutants that are defective or hyper-proficient for learning, including acc-1/3 and lgc-46 acetylcholine receptors, gar-1 acetylcholine receptor GPCR, glna-3 glutaminase involved in glutamate biosynthesis, and kin-2, a cAMP pathway gene. These mutants were not previously reported to have abnormality in the learning paradigm.

      Major comments:

      1. There are problems in the data processing and presentation of the proteomics data in the current manuscript which deteriorates the utility of the data. First, as the authors discuss (page 24, lines 5-12), the current approach does not consider amount of the peptides. Authors state that their current approach is "conservative", because some of the proteins may be present in both control and learned samples but in different amounts. This reviewer has a concern in the opposite way: some of the identified proteins may be pseudo-positive artifacts caused by the analytical noise. The problem is that authors included peptides that are "present" in "TurboID, trained" sample but "absent" in the "Non-Tg, trained" and "TurboID, control" samples in any one of the biological replicates, to identify "learning proteome" (706 proteins, page 8, last line - page 9, line 8; page 32, line 21-22). The word "present" implies that they included even peptides whose amounts are just above the detection threshold, which is subject to random noise caused by the detector or during sample collection and preparation processes. This consideration is partly supported by the fact that only a small fraction of the proteins are common between biological replicates (honestly and respectably shown in Figure S2). Because of this problem, there is no statistical estimate of the identity in "learning proteome" in the current manuscript. Therefore, the presentation style in Tables S2 and S3 are not very useful for readers, especially because authors already subtracted proteins identified in Non-Tg samples, which must also suffer from stochastic noise. I suggest either quantifying the MS/MS signal, or if authors need to stick to the "present"/"absent" description of the MS/MS data, use the number of appearances in biological replicates of each protein as estimate of the quantity of each protein. For example, found in 2 replicates in "TurboID, learned" and in 0 replicates in "Non-Tg, trained". One can apply statistics to these counts. This said, I would like to stress that proteins related to acquisition of memory may be very rare, especially because learning-related changes likely occur in a small subset of neurons. Therefore, 1 time vs 0 time may be still important, as well as something like 5 times vs 1 time. In summary, quantitative description of the proteomics results is desired.
      2. There is another problem in the treatment of the behavioural data. In Experimental Procedures, authors state that they excluded data in which naive or control groups showed average CI < 0.6499, and/or trained groups showed average CI < -0.0499 or > 0.5499 for N2 (page 36, lines 5-7). How were these values determined? One common example for judging a data point as an outlier is > mean + 1.5, 2 or 3 SD, or < mean - 1.5, 2 or 3 SD. Are these values any of these standards, or determined through other methods? If these values were determined simply by authors' decision, it could potentially introduce a bias and in the worst cases lead to incorrect conclusions. A related question is: authors state "trained animals showed a lower CI (~0.3)" where in the referred Figure 1B, the corresponding data shows averages close to 0. Why is the inconsistency? The assay that authors use is close to those described in the previous literature (Kunitomo et al., http://dx.doi.org/10.1038/ncomms3210). In this previous paper, it was described that animals conditioned under no salt with food show negative CI and are attracted to the low salt concentration area. Quantitative analysis of behavioural patterns showed migration bias towards lower salt concentrations (negative chemotaxis). Essentially the same concept was reported by Luo et al. (http://dx.doi.org/10.1016/j.neuron.2014.05.010). The experimental procedure employed in the current work is very similar with those by the Japanese group, with a notable difference: the chemotaxis assay plate included 50mM NaCl in Kunitomo et al, while authors used chemotaxis plate without added NaCl (p35, line 18). The latter is expected to cause shallow gradient towards the low-salt area, which may be the reason for the weak negative CI in the trained animals. In any case, the value of CI itself is not a problem, and authors' current assay is valid. The only concern of mine is the potential of author-introduced cognitive bias, possibly affecting, for example, whether a certain mutant has a significant defect or not. What happens if the cut-offs of -0.0499 and 0.5499 are omitted and all data were included in the analyses? What are the average CIs of N2 in all performed experiments for each of naive, control and trained groups?

      Minor comments:

      1. Related to Major comments 1), the successful effect of neuron-specific TurboID procedure was not evaluated. Authors obtained both TurboID and Non-Tg proteome data. Do they see enrichment of neuron-specific proteins? This can be easily tested, for example by using the list of neuron-specific genes by Kaletsky et al. (http://dx.doi.org/10.1038/nature16483 or http://dx.doi.org/10.1371/journal.pgen.1007559), or referring to the CenGEN data.
      2. The behavioural paradigm needs to be described accurately. Page 5, line 16-17, "C. elegans normally have a mild attraction towards higher salt concentration": in fact, C. elegans raised on NGM plates, which include approximately 50mM of NaCl, is attracted to around 50mM of NaCl (Kunitomo et al., Luo et al.) but not 100-200 mM. Authors call this assay "salt associative learning", which refers to the fact that worms associate salt concentration (CS) and either presence or absence of food (appetitive or aversive US) during conditioning (Kunitomo et al., Luo et al., Nagashima et al.) but they are looking at only association with presence of food, and for proteome analysis they only change the CS (NaCl concentration, as discussed in Discussion, p24, lines 4-5). It is better to attempt to avoid confusion to the readers in general.
      3. page 32, line 23: the wording "excluding" is obscure and misleading because the elo-6 gene was included in the analysis.
      4. Typo at page 24, line 18: "that ACC-1" -> "than ACC-1".
      5. Reference. In "LEO, T. H. T. et al.", given and sir names are flipped for all authors. Also, the paper has been formally published (http://dx.doi.org/10.1016/j.cub.2023.07.041).

      Cross-Commenting

      I would like to express my modest cross comments on the reviews:

      1. Many of the reviewers comment on the shortage in the quantitative nature of the proteome analysis, so it seems to be a consensus.
      2. Also, tissue- or cell-specificity of the identified proteins were commonly discussed. In reviewer #3's first Major comment, appearance of non-neuronal protein in the list was pointed out, which collaborate with my (#4 reviewer's) question on successful identification of neuronal proteins by this method. On the other hand, reviewer #1 pointed out subset neuron-specific proteins in the list. Obviously, these issues need to be systematically described by the authors.
      3. Given reviewer #1's OPTIONAL Major comment, as an expert of behavioral assays in C. elegans, I would like to comment based on my experience that mutants received from Caenorhabditis Genetics Center or other labs often lose the phenotype after outcrossing by the wild type, indicating that a side mutation was responsible for the observed behavioral phenotype. Therefore, outcrossing may be helpful and easier than rescue experiments, though the latter are of course more accurate.
      4. Just let me clarify the first Minor comment by reviewer #2. Authors described that the kin-2 mutant has abnormality in "salt associative learning" and "salt aversive learning", according to authors' terminology. In this comment by reviewer #2, "gustatory associative learning" probably refers to both of these assays.
      5. There seem to be several typos in reviewer #1's Minor comments. "In Page 9, Lines 17-18" -> "Page 8, Lines 17-18". "Page 8, Line 24" -> "Page 7, Line 24". "I would suggest to remove figure 3" -> "I would suggest to remove figure 2" "summary figure similar to Figure 4" -> "summary figure similar to Figure 3" "In the discussion Page 24, Line 14" -> "In the discussion Page 23, Line 14" (I note that because a top page was inserted in the "merged" file but not in art file for review, there is a shift between authors' page numbers and pdf page numbers in the former.) It would be nice if reviewer #1 can confirm on these because I might be wrong.

      Significance

      1. Total neural proteome analysis has not been conducted before for learning-induced changes, though transcriptome analysis has been performed for odor learning (Lakhina et al., http://dx.doi.org/10.1016/j.neuron.2014.12.029). This guarantees the novelty of this manuscript, because for some genes, protein levels may change even though mRNA levels remain the same. We note an example in which a proteome analysis utilizing TurboID, though not the comparison between trained/control, has led to finding of learning related proteins (Hiroki et al., http://dx.doi.org/10.1038/s41467-022-30279-7). As described in the Major comments 1) in the previous section, improvement of data presentation will be necessary to substantiate this novelty.
      2. Authors found six mutants that have abnormality in the salt learning (Fig. 4). These genes have not been described to have the abnormality, providing novel knowledge to the readers, especially those who work on C. elegans behavioural plasticity. Especially, involvement of acetylcholine neurotransmission has not been addressed. Although site of action (neurons involved) has not been tested in this manuscript, it will open the venue to further determine the way in which acetylcholine receptors, cAMP pathway etc. influences the learning process.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In the manuscript titled "Identifying regulators of associative learning using a protein-labelling approach in C. elegans" the authors attempted to generate a snapshot of the proteomic changes that happen in the C. elegans nervous system during learning and memory formation. They employed the TurboID-based protein labeling method to identify the proteins that are uniquely found in samples that underwent training to associate no-salt with food, and consequently exhibited lower attraction to high salt in a chemotaxis assay. Using this system they obtained a list of target proteins that included proteins represented in molecular pathways previously implicated in associative learning. The authors then further validated some of the hits from the assay by testing single gene mutants for effects on learning and memory formation.

      Major comments:

      In the discussion section, the authors comment on the sources of "background noise" in their data and ways to improve the specificity. They provide some analysis on this aspect in Supplementary figure S2. However, a better visualization of non-specificity in the sample could be a GO analysis of tissue-specificity, and presented as a pie chart as in Figure 2A. Non-neuronal proteins such as MYO-2 or MYO-3 repeatedly show up on the "TurboID trained" lists in several biological replicates (Tables S2 and S3). If a major fraction of the proteins after subtraction of control lists are non-specific, that increases the likelihood that the "hits" observed are by chance. This analysis should be presented in one of the main figures as it is essential for the reader to gauge the reliability of the experiment.

      Other than the above, the authors have provided sufficient details in their experimental and analysis procedures. They have performed appropriate controls, and their data has sufficient biological and technical replaictes for statistical analysis.

      Minor comments:

      There is an error in the first paragraph of the discussion, in the sentences discussing the learning effects in gar-1 mutant worms. The sentences in lines 12-16 on page 22 says that gar-1 mutants have improved salt-associative learning and defective salt-aversive learning, while in fact the data and figures state the opposite.

      Significance

      Strengths and limitations:

      This study used neuron-specific TurboID expression with transient biotin exposure to capture a temporally restricted snapshot of the C. elegans nervous system proteome during salt-associative learning. This is an elegant method to identify proteins temporally specific to a certain condition. However, there are several limitations in the way the experiments and analyses were performed which affect the reliability of the data. As the authors themselves have noted in the discussion, background noise is a major issue and several steps could be taken to improve the noise at the experimental or analysis steps (use of integrated C. elegans lines to ensure uniformity of samples, flow cytometry to isolate neurons, quantitative mass spec to detect fold change vs. strict presence/absence).

      Advance:

      Several studies have demonstrated the use of proximity labeling to map the interactome by using a bait protein fusion. In fact, expressing TurboID not fused to a bait protein is often used as a negative control in proximity labeling experiments. However, this study demonstrates the use of free TurboID molecules to acquire a global snapshot of the proteome under a given condition.

      Audience:

      Even with the significant limitations, this study is specifically of interest to researchers interested in understanding learning and memory formation. Broadly, the methods used in this study could be modified to gain insights into the proteomic profiles at other transient developmental stages.

      The reviewer's field of expertise: Cell biology of C. elegans neurons.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study by Rahmani in colleagues, the authors sought to define the "learning proteome" for a gustatory associative learning paradigm in C. elegans. Using a cytoplasmic TurboID expressed under the control of a pan-neuronal promoter, the authors labeled proteins during the training portion of the paradigm, followed by proteomics analysis. This approach revealed hundreds of proteins potentially involved in learning, which the authors describe using gene ontology and pathways analysis. The authors performed functional characterization of some of these genes for their requirement in learning using the same paradigm. They also compared the requirement for these genes across various learning paradigms, and found that most hits they characterized appear to be specifically required for the training paradigm used for generating the "learning proteome".

      Major Comments:

      • The definition of a "hit" from the TurboID approach is does not appear stringent enough. According to the manuscript, a hit was defined as one unique peptide detected in a single biological replicate (out of 5), which could give rise to false positives. In figure S2, it is clear that there relatively little overlap between samples with regards to proteins detected between replicates, and while perhaps unintentional, presenting a single unique peptide appears to be an attempt to inflate the number of hits. Defining hits as present in more than one sample would be more rigorous. Changing the definition of hits would only require the time to re-list genes and change data presented in the manuscript accordingly.
      • The "hits" that the authors chose to functionally characterize do not seem like strong candidate hits based on the proteomics data that they generated. Indeed, most of the hits are present in a single, or at most 2, biological replicate. It is unclear as to why the strongest hits were not characterized, which if mutant strains are publicly available, would not be a difficult experiment to perform. Because of the lack of strong evidence that these are indeed proteins regulated in the context of learning based on proteomics, including evidence of changes in the proteins (by imaging expression changes of fluorescent reporters or a biochemical approach), would increase confidence that these hits are genuine.

      Minor Comments:

      • The authors highlight that the proteins they discover seem to function uniquely in their gustatory associative paradigm, but this is not completely accurate. kin-2, which they characterize in figure 4, is required for positive butanone association (the authors even say as much in the manuscript) in Stein and Murphy, 2014.

      Significance

      • General Assessment: The approach used in this study is interesting and has the potential to further our knowledge about the molecular mechanisms of associative behaviors. Strengths of the study include the design with carefully thought out controls, and the premise of combining their proteomics with behavioral analysis to better understand the biological significance of their proteomics findings. However, the criteria for defining hits and prioritization of hits for behavioral characterizations were major wweaknesses of the paper.
      • Advance: There have been multiple transcriptomic studies in the worm looking at gene expression changes in the context of behavioral training (Lakhina et al., 2015, Freytag 2017). This study compliments and extends those studies, by examining how the proteome changes in a different training paradigm. This approach here could be employed for multiple different training paradigms, presenting a new technical advance for the field.
      • Audience: This paper would be of interest to the broader field of behavioral and molecular neuroscience. Though it uses an invertebrate system, many findings in the worm regarding learning and memory translate to higher organisms.
      • I am an expert in molecular and behavioral neuroscience in both vertebrate and invertebrate models, with experience in genetics and genomics approaches.
    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Rahmani et al., utilize the TurboID method to characterize the global proteome changes in the worm's nervous system induced by a salt-based associative learning paradigm. Altogether, Rahmani et al., uncover 706 proteins that are tagged by the TurboID method specifically in samples extracted from worms that underwent the memory inducing protocol. Next, the authors conduct a gene enrichment analysis that implicates specific molecular pathways in salt-associative learning, such as MAP-kinase and cAMP-mediated pathways. The authors then screen a representative group of the hits from the proteome analysis. The authors find that mutants of candidate genes from the MAP-kinase pathway, namely dlk-1 and uev-3, do not affect the performance in the learning paradigm. Instead multiple acetylcholine signaling mutants significantly affected the performance in the associative memory assay, e.g., acc-1, acc-3, gar-1, and lgc-46. Finally, the authors demonstrate that the acetylcholine signaling mutants did not exhibit a phenotype in similar but different conditioning paradigms, such as aversive salt-conditioning or appetitive odor conditioning, suggesting their effect is specific to appetitive salt conditioning.

      Major comments:

      1. The statistical approach and analysis of the behavior assay:
        • The authors use a 2-way ANOVA test which assumes normal distribution of the data. However, the chemotaxis index used in the study is bounded between -1 and 1, which prevents values near the boundaries to be normally distributed. Since most of the control data in this assay in this study is very close to 1, it strongly suggests that the CI data is not normally distributed and therefore 2-way ANOVA is expected to give skewed results. I am aware this is a common mistake and I also anticipate that most conclusions will still hold also under a more fitting statistical test. Nevertheless an appropriate statistical analysis should be performed. Since I assume the authors would wish to take into consideration both the different conditions and biological repeats, I can suggest two options:
        • Using a Generalized linear mixed model, one can do with R software.
        • Using a custom bootstrapping approach.
        • The total number of assays, especially controls, varies quite a bit between the tested mutants. For example compare the acc-1 experiment in Figure 4.A., and gap-1 or rho-1 in Figure S4.A and D. It is hard to know the exact N of the controls, but I assume that for example, lowering the wild type control of acc-1 to equivalent to gap-1 would have made it non significant. Perhaps the best approach would be to conduct a power analysis, to know what N should be acquired for all samples.
        • The authors use the phrasing "a non-significant trend", I find such claims uninterpretable and should be avoided. Examples: Page 16. Line 7 and Page 18, line 16.
      2. Neuron-specific analysis and rescue of mutants:
        • Throughout the study the authors avoid focusing on specific neurons. This is understandable as the authors aim at a systems biology approach, however, in my view this limits the impact of the study. I am aware that the proteome changes analyzed in this study were extracted from a pan neuronally expressed TurboID. Yet, neuron-specific changes may nevertheless be found. For example, running the protein lists from Table S2, in the Gene enrichment tool of wormbase, I found, across several biological replicates, enrichment for the NSM, CAN and RIG neurons. A more careful analysis may uncover specific neurons that take part in this associative memory paradigm. In addition, analysis of the overlap in expression of the final gene list in different neurons, comparing them, looking for overlap and connectivity, would also help to direct towards specific circuits.
        • OPTIONAL: A rescue of the phenotype of the mutants by re-expression of the gene is missing, this makes sure to avoid false-positive results coming from background mutations. For example, a pan neuronal or endogenous promoter rescue would help the authors to substantiate their claims, this can be done for the most promising genes. The ideal experiment would be a neuron-specific rescue but this can be saved for future works.

      Minor comments:

      1.Lack of clarity regarding the validation of the biotin tagging of the proteome. The authors show in Figure 1 that they validated that the combination of the transgene and biotin allows them to find more biotin-tagged proteins. However there is significant biotin background also in control samples as is common for this method. The authors mention they validated biotin tagging of all their experiments, but it was unclear in the text whether they validated it in comparison to no-biotin controls, and checked for the fold change difference. Also, it was unclear which exact samples were tested per replicate. In Page 9, Lines 17-18: "For all replicates, we determined that biotinylated proteins could be observed ...", But in Page 8, Line 24 : "We then isolated proteins from ... worms per group for both 'control' and 'trained' groups,... some of which were probed via western blotting to confirm the presence of biotinylated proteins". - Could the authors specify which samples were verified and clarify how? - OPTIONAL: include the fold changes of biotinylated proteins of all the ones that were tested. Similar to Figure 1.C. 2.Figure 2 does not add much to the reader, it can be summarized in the text, as the fraction of proteins enriched for specific cellular compartments. - I would suggest to remove figure 3 to text, or transfer it to the supplementry material. - OPTIONAL: I would suggest the authors to mark in a pathway summary figure similar to Figure 4 the results from the behavior assay of the genetic screen. This would allow the reader to better get the bigger picture and to connect to the systemic approach taken in Figures 2 and 3. 3. Typo in Figure 3: the circle of PPM1: The blue right circle half is bigger than the left one. 4. Unclarity in the discussions. In the discussion Page 24, Line 14, the authors raise this question: "why are the proteins we identified not general learning regulators?. The phrasing and logic of the argumentation of the possible answers was hard to follow. - Can you clarify?

      Cross-Commenting

      I would like to thank Reviewer #4 for the great cross comment summary, I find it accurate and helpful. I also would like to thank Reviewer #4 for spotting the typos in my minor comments, their page and figure numbers are the correct ones.

      Small comment on common point 1 - My feeling is that it is challanging to do quantitative mass spectrometry, especially with TurboID. In general, the nature of MS data is that it hints towards a direction but a followup validation work is required in order to assess it. For example, I am not surprised that the fraction of repeats a hit appeared in does not predict well whether this hit would be validated behavioraly. Given these limitations, I find the authors' approach reasonable.

      I also would like to highlight this major comment from reviewer 4: "In Experimental Procedures, authors state that they excluded data in which naive or control groups showed average CI < 0.6499, and/or trained groups showed average CI < -0.0499 or > 0.5499 for N2 (page 36, lines 5-7). " This threshold seems arbitrary to me too, and it requires the clarifications requested by reviewer 4.

      Significance

      This study does a great job to effectively utilize the TurboID technique to identify new pathways implicated in salt-associative learning in C. elegans. This technique was used in C. elegans before, but not in this context. The salt-associative memory induced proteome list is a valuable resource that will help future studies on associative memory in worms. Some of the implicated molecular pathways were found before to be involved in memory in worms like cAMP, as correctly referenced in the manuscript. The implication of the acetylcholine pathway is novel for C. elgeans, to the best of my knowledge. The finding that the uncovered genes are specifically required for salt associative memory and not for other memory assays is also interesting.

      However overall I find the impact of this study limited. The premise of this work is to use the Turbo-ID method to conduct a systems analysis of the proteomic changes. The work starts by conducting network analysis and gene enrichment which fit a systemic approach. However, since the authors find that ~30% of the tested hits affect the phenotype, and since only 17/706 proteins were assessed, it is challenging to draw conclusive broad systemic claims. Alternatively, the authors could have focused on the positive hits, and understand them better, find the specific circuits where these genes act. This could have increased the impact of the work. Since neither of these two options are satisfied, I view this work as solid, but not wide in its impact and therefore estimate the audience of this study would be more specialized.

      My expertise is in C. elegans behavior, genetics, and neuronal activity, programming and machine learning.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Wasilewska and colleagues generated tmbim5-/- zebrafish line and demonstrated that tmbim5 loss of function leads to decrease in zebrafish size and induces muscle atrophy. Authors used immunohistochemistry to suggest that tmbim5-/- zebrafish shows reduced glycogen levels in muscle and liver. However, most of the immunohistochemistry is not quantitated and only qualitative differences are shown. Next, the authors measured mitochondrial calcium levels in the brain of tmbim5-/- zebrafish but there was no behavioral phenotype in the fish. It would have be better to measure mitochondrial calcium levels in the muscles of tmbim5-/- zebrafish as phenotype is muscle atrophy. Further, it is reported that the mitochondrial membrane potential and glycogen levels were perturbed in tmbim5-/- zebrafish.

      Next, the authors generated a scl8b1-/- (a probable NCLX ortholog in zebrafish) zebrafish, which did not show any drastic phenotype. However, neither slc8b1 function nor the phenotype of scl8b1-/- zebrafish was well characterized. Further, authors created two double knockout zebrafish lines i.e. tmbim5-/-/mcu-/- and tmbim5-/-/slc8b1-/-. Interestingly, both these lines were viable and do not show any drastic phenotypes. The authors concluded that in these transgenic fishes compensatory and/or alternative mitochondrial Ca2+ mobilization pathways counterbalance the effects of silencing of these proteins.

      Although it is an interesting study, the conclusions are not well supported with the data. At several places only qualitative images are shown and quantitative data is missing. Similarly, Ca2+ imaging in muscles of tmbim5-/- zebrafish is not performed. Finally, no molecular mechanism or molecular details are provided. Though Tmbim5's potential role in EMRE degradation is discussed, it is not experimentally investigated. The quality of the manuscript would significantly enhance if authors perform the suggested experiments.

      Major Comments:

      1. As a potential mechanism, Tmbim5's potential role in EMRE degradation is discussed but it is not experimentally investigated. It is very easy to test this hypothesis. If this is the case, it would be a very good contribution to the field.
      2. On Page 16, authors state that slc8b1 does not constitutes the major mitochondrial Ca2+ efflux transport system. Authors should do calcium imaging experiments just like they did with tmbim5 and mcu double knockouts (data presented in Figure 4C) to make any comments on functioning of slc8b1 in mitochondrial Ca2+ transport. This is important because slc8b1 is only a predictive ortholog of human NCLX and it is not experimentally examined yet.
      3. The data presented in Fig. 4C is very important but it is not fully explained and discussed in the results. Please discuss all the data sets presented in Fig4C in detail. As such, it is very difficult to follow and interpret the data.
      4. In tmbim5-/- zebrafish, what happens to mitochondrial Ca2+ signaling in muscle as phenotype is muscle atrophy only?
      5. Please validate the observation of decreased glycogen levels in tmbim5-/- fish by one more way. Only immunohistochemistry that too without quantitation is not convincing (Fig. 2E-H).

      Minor Comments:

      1. Authors state that tmbim5 loss of function leads to metabolic changes but the only data provided is decrease in glycogen levels. It would be helpful for the authors to focus comments specifically on the data presented in the manuscript to avoid potential over-interpretation.
      2. While discussing Fig4., authors mention that Tmbim5 may act as a MCU independent Ca2+ uptake mechanism and therefore they crossed tmbim5 mutants with mcu KO fish. But from the data presented in Fig.3 and as concluded by the authors themselves tmbim5 mutants do not show changes in the mitochondrial Ca2+ levels. Authors may clarify this point.
      3. Does tmbim5 contributes to mitochondrial Ca2+ uptake in presence or along with MCU. Further analysis of Fig4C may shed some light on this. Authors should test significance between tmbim5-/- and WT as well as between tmbim5-/- and tmbim5+/+ in mcu-/- background.
      4. Please check the labeling on traces in Fig3D.
      5. Please include quantitation of data presented in EV2E-F.
      6. Please include quantitation of immunohistochemistry data presented in 2E-H.

      Referee cross-commenting

      Several comments are common between the reviewers highlighting that those experiments are critical. Secondly, I agree with the concerns raised by other two reviewers.

      Significance

      In this study, authors report couple of new transgenic zebrafish lines. However, further characterization of slc8b1-/- is required. This study reinforces the existing idea that there are very robust compensatory mechanisms that maintain mitochondrial Ca2+ homeostasis. While the work provides useful insights, it could benefit from a broader scope to provide substantial advancement to existing knowledge.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The work of Wasilewska et al. focusses on the MCU independent basal Ca2+ uptake mechanisms and the effects of MCU, NCLX, and TMBIM5 KO on Zebrafish Ca2+ homeostasis, mortality, anatomy and metabolism. The authors found evidence that tmbim5 potentially has a bidirectional mode of operation and is able to extrude Ca2+ from the matrix as well as transfer Ca2+ into mitochondria. Further, a reduced membrane potential in tmbim5-/- fish and altered metabolism was found. While the conclusion drawn are well argumented, a few points have to be addressed.

      Major Points:

      1. While all mitochondrial genes seem collectively reduced compared to control, it would be interesting to assess the mitochondrial mass and/or mitochondrial turnover rate in regard to e.g. mitophagy. The reduced membrane potential could lead to PINK1 accumulation on the outer mitochondrial membrane to mediate mitophagy leading overall to reduced mitochondrial count and mass.
      2. The characterization of slc8b1-KO fish needs some improvement to facilitate a better understanding of the molecular interactions of slc8b1 and tmbim5. This would also greatly improve the understanding of the phenotypical characterization and behavioral response to CGP.
      3. Functional Ca2+ measurements of the activity of slc8b1 gene product have to be done to ensure a KO phenotype. Especially in light of the surprising results presented in Figure 6A showing an effect of CGP on slc8b1-KO fish but not on tmbim5-KO fish I advise mitochondrial isolation to conduct mitochondrial basal and extrusion Ca2+experiments of slc8b1-KO fish, tmbim5-KO fish, and double KO-fish.

      Minor Points:

      The authors claim that mRNA levels of mitochondrial proteins involved in Ca2+ transport in tmbim5-/- are unaffected (Figure EV3). While the T-tests show no significant alteration, what happens if a 2-way ANOVA shows a more general effect revealed between WT and TMBIM5-/-?

      Significance

      This is a well-designed and carefully executed piece of work. The experimental design is thoughtfully elaborated, and the topic is worthy of investigation. The strengths of this study lie in translating our knowledge of TMBIN5 from single cells to organism and organ function. Moreover, the work provides important new information that will help the scientific community working on mitochondrial regulation AND muscle diseases to understand how ions coordinately regulate mitochondrial function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Although the experimental approach is promising (see below), the results do not significantly expand our current understanding. This is partly due to the challenges of interpreting negative results, which are nonetheless worth reporting. Some of the conclusions and interpretations of the results could benefit from further clarification and contextualization to enhance their impact:

      • Figure 1D: The distribution of fiber size in wt vs. Tmbim5-ko fish shows a notable difference limited to one size range. Can the authors clarify this observation? Could this indicate a switch in fiber type? Is there a correlation between this finding and the differential PAS staining?
      • Figure 3: one of the advantages of the zebrafish model is its transparency, allowing for fluorescence imaging. Unfortunately, this proves to be impossible in the case of cepia2mt. The data provided by the authors show that the fluorescence of this probe does not vary following physiological stimuli. The only change is that induced by CCCP (Fig 3C-D), which according to the authors causes a discharge of mitochondrial calcium. However, the use of CCCP with GFP-based probes should be avoided, as the acidification caused by CCCP treatment leads to quenching of the fluorophore, resulting in a fluorescence decrease which is independent of Ca2+ levels. Although the experimental approach aims to detect dynamic changes in mitochondrial Ca2+ levels, the presented results in Figure 3 do not provide conclusive evidence to support this capability. While significant experimental effort is evident, these findings may require further validation or additional data to strengthen their impact. Alternatively, the authors could remove this Figure 3 and relevant text from the manuscript.
      • Figure 6A: In my opinion, this dataset is impossible to understand. To my knowledge, the precise molecular target of CGP-37157 remains elusive. While CGP is often considered an NCLX inhibitor, this classification lacks definitive experimental support. Although CGP is known to inhibit mitochondrial Na+-dependent Ca2+ extrusion, direct binding of CGP to NCLX has yet to be conclusively demonstrated. With this in mind, the authors show that pharmacological intervention with CGP elicits a distinct phenotype in the fish model. While this effect appears to persist in SLC8B1-KO fish, it is absent in Tmbim5-KO fish, suggesting Tmbim5 as a potential molecular target for CGP. However, this interpretation is inconsistent with the following observations: i) CGP remains effective in Tmbim5/Slc8b1 double-KO fish and ii) Tmbim5-KO fish exhibit no discernible phenotype. A comprehensive explanation that reconciles these findings is sought.
      • Figure 6B: according to the authors, the phenotype induced by CGP treatment is specific because a different substance with a completely different effect, CCCP, causes the same phenotype in both wt and Tmbim5-KO fish. Also in this case, the rationale and reasoning behind this experiment in not very evident. As I see it, CCCP blocks zebrafish motility because it is a metabolic poison, and its effect does not depend on any transporter.

      Significance

      The manuscript submitted by Wasilewska et al investigates the functional relationship between different mitochondrial calcium transporters using zebrafish as a model. The topic is of great interest. In the last 15 years, many mitochondrial calcium transporters have been identified. In some cases, their mechanism is not fully understood, such as in the case of TMBIM5, recently described by some as an H/Ca exchanger, or as a Ca channel by others. Furthermore, the functional relationship between different transporters has so far been studied in a partial and superficial way. I believe that this work is therefore of great interest because it aims to contribute to a fundamental problem that is still poorly studied. The idea of using zebrafish is interesting, as it is an organism that is easy to manipulate and phenotype, and because it is transparent, making it possible to use specific biosensors to characterize mitochondrial calcium dynamics, at least in principle. The paper therefore deserves attention.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their helpful comments and suggestions. Below you may find the point-by-point replies to their concerns.

      Reviewer #1

      “The research is meticulously conducted, and the data are compelling, as they demonstrate that the Nova-agrin-Lrp4-MuSK axis is also operational in non-vertebrates. The conclusions drawn by the authors are generally adequate; however, I find some instances of "it is the first time..." to be unnecessary.”

      We have removed all unnecessary claims to that effect.

      “The work also presents an unexpected finding that mouse Nova protein is unable to splice the Ciona agrin mini-gene (Figure 3). I believe the inability of mouse Nova1 and Nova2 to splice the Ciona agrin could also be due to insufficient expression levels of the mouse proteins. Therefore, the authors should include either a positive control (e.g., mouse agrin mini-gene) or demonstrate that the proteins are expressed at comparable levels.

      We have now included two additional datasets supporting our conclusion. First, we have included the positive control with the mouse Agrin minigene as suggested by the reviewer, which shows that mouse Nova1 and Nova2 are indeed still able to splice the mouse Agrin minigene in our assay (Figure 3C). Second, we included fluorescence images of the GFP-fused mouse Nova1 and Nova2 showing their proper expression in the cells (Figure S7).

      “I am also not fully convinced that the model of autoinhibition for Ciona Nova is supported by sufficient experimental data. Again, there are no data showing that the levels of the various deletion mutants of Nova are consistent and hence, there could be issues with the stbsility of some of the deletion mutants and this could explain the observed difference in activity.”

      We have added a few more datasets to further investigate the model. First, we have added an independent biological replicate of the “MLN” Nova isoform deletion mutant assays (Figure S8), as well as a separate assay using deletion mutants based on the “MMM” isoform (Figure S9). The results were consistent in both cases, confirming our initial observations. Next, we tested more directly the idea proposed by the reviewer that there are issues with stability, by looking at the fluorescence of the GFP-fused mutants. We did notice that the N/C-terminal deletion mutants were not expressed as well, but this was always mitigated by concurrent deletion of the KH3 domain. We have now expanded our discussion in the text to propose that there may be a negative effect of the KH3 domain on Nova expression/stability in the absence of the N/C termini. Although different from the model in which KH3 directly inhibits KH1/KH2, there does seem to be some inhibitory effect of KH3 on Nova expression/stability. “- In all schematic presentations, exon Z6 appears larger than exon Z5. However, Z6 is only 24 bp long, while Z5 is 3434 bp long. Please adjust this representation.”

      To clarify, in Ciona Z6 is 18 bp long, and Z5 is 15 bp long, hence they code for 6 and 5 amino-acids, respectively. This is different from the mammalian Z exons, which may be the source of the confusion here. In our schematics, we are only representing the Ciona Z exons.

      “- Is there consistency in the relative proportions of the 24-bp (Z6), 33-bp (Z5), and 57-bp (Z6 + Z5) PCR products? Studies in vertebrates have shown that AChR clustering activity is highest with the Z8 and Z19 products, while the Z11 product appears to be somewhat less active. It would be nice to also point out the different splice products are detected in Ciona.”

      It was not clear if there was any consistency in the relative proportions of Ciona Agrin splice products in the minigene assays as performed in cultured mammalian cells, though in Figure 1 we have pointed out a more detailed characterization of the different splice products in vivo in Ciona. The different splice products’ confirmed sequences are also shown in the supplemental sequences file.

      “Line 111: 'Z11' Agrin should be corrected to 'Z19' Agrin.

      To clarify again, we are only referring to the Ciona Agrin Z exons, which are not the same sizes as the mammalian Z exons. While Z19 would refer to the combination of exons Z8 and Z11 (8+11 = 19) in mammals, here in Ciona the equivalent combination is Z11 (Z5 + Z6).

      “Line 168: "Figure H" should be updated to "Figure 2H."”

      Fixed.

      Reviewer #2:

      __*“44 - ALS, references 8-12. These are old papers. A new review should be cited, either instead of in addition.”

      *__

      We have read some newer reviews and cite three more recent reviews (references 10-12) now.

      __* 56 - "many" cases of CMS - some are not due to mutations in this pathway

      *__

      We have altered this to say “many”.

      __* 57 - refs 29-46. This is a very large number of references for a point this is quite unimportant to the story. It would be better to cite recent reviews.

      *__

      We have removed some references and also cited more recent reviews here (references 38, 39).

      __* 168 - should be 2H

      *__

      Fixed.

      __* 205 - make N terminal extension more apparent in Figure 3D

      *__

      We have recolored the N terminus to be red, as to make it more apparent, in figures 3 and S8 and S9.

      __* 235 - not a complete sentence

      *__

      Fixed.

      308+ - can the authors clarify whether EBF knockdown has a selective effect on Nova vs general failure of the neurons to acquire a MN phenotype

      We have been investigating this in a separate study on MN specification and differentiation in Ciona, which will be published as a preprint soon. EBF does not have a selective effect on Nova expression, as it appears to be regulating multiple aspects of neuronal differentiation, consistent with its role as previously studied in Ciona and other organisms (e.g. Kratsios et al. 2012, Catela et al. 2019, etc).__*

      614 - explain in figure legend the decrease in apparent MR from left to right in 4B

      *__

      This is just an example of “bowed” or “curved” bands frequently seen in electrophoresis, usually due to uneven heat dissipation or other electrophoresis issues. However, the bands all correspond to the same products (Z+). We added an explanation in the legend.

      General - three other key components of the pathway are MuSK, rapsyn, and DOK-7. Functional studies of these genes fall beyond the scope of this paper, but it would be helpful to know whether they are expressed in muscle and, if so, whether expression is muscle-specific.

      We have added this to the discussion. While Musk and Dok-7 remain unstudied in Ciona, it has been shown that Rapsyn is muscle-specific in Ciona (Nishino et al. 2011).

      Reviewer #3:

      __*“1) The authors report two main Nova isoforms that seem to be produced by alternative promoters. They also claim that the MLN isoform is more strongly expressed in two of the studied conditions compared to the MMM protein (eggs and heart in Fig 1G), while both are equally abundant in st. 22.5 embryos and brain (Fig S1 and line 130). Therefore, both isoforms are likely involved in the regulation of the Agrin AS event. When performing the experiments that require to express the Nova protein, the authors choose to work with the "MLN" isoform arguing that it is more "ubiquitous" than the "MMM" isoform, although the last has a more evident nuclear localization signal (NLS) sequence. In the minigene analysis, the MLN isoform fails to produce transcripts with Z6 exon (which seems to be the most common Z+ isoform in the brain), and the amount of Z11-containing transcripts is very low compared to st. 22.5. Given that the N-terminal domain has a regulatory influence, as demonstrated by the authors, and that the MMM isoform is potentially more "neural-restricted" than the MLN, an intriguing possibility is that the MMM isoform might enhance the inclusion of Z6 and Z11 isoforms. To solve this issue, I suggest two experiments:

      • Perform the minigene assay with the MMM isoform of Nova and the wild type version of the minigene to check the level of inclusion of Z6 and Z6+Z5 (Z11) exons.*__”

      We have added additional minigene assay data using the MMM isoform (S9). We did not detect Z6 isoforms with MMM, though there may be slight differences in the ratio of Z5 and Z11 compared to the MLN assay. We believe this indicates that nuclear localization is not rate-limiting in our heterologous mammalian cell minigene assay, although it very well may regulate splicing activity more meaningfully in vivo in Ciona. This may be especially true in post-mitotic cells, as opposed to during embryogenesis when actively proliferating cells will break down and then reconstitute their nuclear envelopes over and over again, thus potentially allowing some of the MLN isoform to find its way into the nucleus. We still believe the production of the Z6 isoform may depend on additional Ciona-specific factors missing from the mammalian cells in our heterologous assay.

      “- Test the regulatory activity of the upstream genomic region of exon 1a, in an equivalent way as for exon 1b in Fig 7A and B, to explore whether the promoter of the MMM isoform has a neural-restricted expression that could explain the AS pattern observed in st. 22.5 and brain.” We have done this, shown in Figure S15, which revealed that the promoter upstream of exon 1a (encoding the MMM isoform) drives only expression in mesenchyme and some epidermal cells, with no neuronal expression visible. This suggests that the majority of the neural expression is due to the cis-regulatory elements in the region between exons 1a and 1b. However, this region does not necessarily activate transcription only at exon 1b (encoding MLN isoform), as intronic elements can loop back and regulate transcription off “upstream” promoters. Thus we propose that the Nova [1b] -2011/+6 region drives expression of both MLN and MMM isoforms, though this remains to be fully tested. We believe the regulation and function of the different Nova isoforms in Ciona is beyond the scope of the current paper, though we are interested in investigating this more thoroughly in follow-up studies.

      __*“2) The authors unveil the conservation of an Agrin AS event between mammals and a tunicate species with similar functional consequences for AChR clustering. While this is absolutely correct, the relatively low similarity of the AS exons between Ciona and mammals shown in Fig 1A may raise confusion or doubts in the readers regarding the homology of the event (as it did in my own case before I checked it in more detail). Therefore, an explicit alignment of both constitutive and alternative exons in a supplementary figure to clearly demonstrate the homology of the AS event across major taxonomic groups (with a few vertebrate and tunicate species) might help.

      Furthermore, expression of Nova in motor neurons of amphioxus (Branchiostoma lanceolatum) was previously reported (ref. 60), and a quick look into publicly available Agrin transcripts (____https://www.ncbi.nlm.nih.gov/gene/136443694____) reveals a homologous AS event in this cephalochordate species.

      C1 "Z7/Z6/Z8" C2 (partial)

      Bla QADPAPLRQEGVG--LDGTTILNYPNAINK ... E-SNSIRE ... QEPNQDDNHFEVTFRTTSDHGLLLWNHKPGGG-DFIALAI Cro HSTDLLQDEQATAIYLDGTTKIMYRNAVKA ... --PNDFRE ... SRART-HNNYEIVFRTTARHGLLLMVGKAREGVDYIALAI Mmu IVEKSVGDLETLA--FDGRTYIEYLNAVTE ... ELTNEIPA ... EKALQ-SNHFELSLRTEATQGLVLWIGKVGERADYMALAI : . :** * : * **:. .*.: ... *::*: :** : :**:* * *::****

      These two facts suggests a potential origin of the Nova-Agrin regulation at the base of the chordate phylum (and not restricted to Olfactores), which could be mentioned in the discussion as a relevant possibility.*__”

      We thank the reviewer for this suggestion. Indeed, we have now added a more detailed alignment with Agrin sequences from more species in Figures S2 and S3, including amphioxus as so helpfully identified by the reviewer. We have added the observation that amphioxus Agrin appears to have a single Z exon encoding the NxI/V motif (no evidence for two Z exons as in tunicates or vertebrates). This indeed suggests that this pathway may be a chordate innovation, as we now discuss. We also add AlphaFold-assisted predictions of the NxF motif binding to the equivalent pocket in Lrp4 in both Ciona and mammals (Figure S1).

      Line 168: Figure 2H instead of Figure H.

      Fixed.

      Line 287: "Taken together, these results reveal that a Nova-Agrin-Lrp4 pathway for AChR receptor clustering at the neuromuscular synapse is conserved from mammals to tunicates." While this sentence might be true, from mammals to tunicates might imply that it is conserved in all vertebrate and tunicate lineages, and this is not explored in the manuscript (there might be secondary losses). It would be more technically correct to say something similar like "...the neuromuscular synapse is conserved in the studied mammalian and tunicate lineages" or "...the neuromuscular synapse originated before the evolutionary divergence of tunicates and vertebrates"

      We have fixed this now in a few places.

      “Line 342. At the end of this paragraph, the possibility of conservation of the mechanism also in amphioxus could be discussed.”

      We now discuss the amphioxus sequence and the idea that this mechanism was present in the last common chordate ancestor.

      “Line 383: "the the apparent".”

      Fixed.

      “I agree that the mouse-specific agrin minigene to test the functionality of Nova1 and Nova2 would be a suitable positive control to discard protein stability/expression issues.

      We have tested this now with GFP fusion images (Figure S7) and using the mouse Agrin minigene (Figure 3C). Both indicate proper expression/splicing activity of mouse Nova1 and Nova2, supporting the idea that there is still some type of cross-species incompatibility as tested in mammalian cells.

      “The only minor limitation, in my opinion, is that it lacks testing of the MMM Nova isoform in the minigene assay, to explore whether it has (or not) a complementary function to the MLN isoform that could fully explain the endogenous AS pattern.”

      We have added MMM minigene assays, and these were largely identical to MLN assays. We propose that the N-terminus and nuclear localization do not significantly impact activity of Ciona Nova as tested in mammalian cells, however we cannot exclude the possibility that things may be different in vivo in Ciona.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: The manuscript reports the conservation of the Nova-Agrin-Lrp4 pathway for AChR clustering in neuromuscular junctions beyond vertebrates, using the tunicate Ciona robusta as a model species. In addition, it also reveals Ebf as a key transcriptional activator of Nova in the motor neurons of Ciona. One of the main focuses of the work is the detailed study of an alternative splicing event in the Agrin gene of Ciona, demonstrating its regulation by Nova and the developmental cascade of consequences in AChR clustering due to misregulation of the Nova-Agrin-Lrp4 pathway through multiple functional experiments. Furthermore, it also explores molecular elements involved in the regulation of this event in trans and cis, including the KH and N-terminal domains of Nova (and their interactions) and the intronic YCAY binding domains.

      Major comments: The claims and conclusions of the manuscript are generally very well supported with appropriate and reproducible functional experiments. For instance, the work demonstrates a key regulatory link between Nova and the studied AS event of Agrin using both a minigene system in human cells and a set of CRISPR/Cas9 Ciona mutants. Although analyzing mosaic embryos can be challenging, the authors successfully test different combinations of gRNAs to achieve efficient mutagenesis. Moreover, using the AChRA1::GFP clusters to measure the impact of the different mutants is very convincing.

      While generally very robust and satisfying, the manuscript could benefit from addressing a few issues to improve its quality:

      1. The authors report two main Nova isoforms that seem to be produced by alternative promoters. They also claim that the MLN isoform is more strongly expressed in two of the studied conditions compared to the MMM protein (eggs and heart in Fig 1G), while both are equally abundant in st. 22.5 embryos and brain (Fig S1 and line 130). Therefore, both isoforms are likely involved in the regulation of the Agrin AS event. When performing the experiments that require to express the Nova protein, the authors choose to work with the "MLN" isoform arguing that it is more "ubiquitous" than the "MMM" isoform, although the last has a more evident nuclear localization signal (NLS) sequence. In the minigene analysis, the MLN isoform fails to produce transcripts with Z6 exon (which seems to be the most common Z+ isoform in the brain), and the amount of Z11-containing transcripts is very low compared to st. 22.5. Given that the N-terminal domain has a regulatory influence, as demonstrated by the authors, and that the MMM isoform is potentially more "neural-restricted" than the MLN, an intriguing possibility is that the MMM isoform might enhance the inclusion of Z6 and Z11 isoforms. To solve this issue, I suggest two experiments:
        • Perform the minigene assay with the MMM isoform of Nova and the wild type version of the minigene to check the level of inclusion of Z6 and Z6+Z5 (Z11) exons.
        • Test the regulatory activity of the upstream genomic region of exon 1a, in an equivalent way as for exon 1b in Fig 7A and B, to explore whether the promoter of the MMM isoform has a neural-restricted expression that could explain the AS pattern observed in st. 22.5 and brain.
      2. The authors unveil the conservation of an Agrin AS event between mammals and a tunicate species with similar functional consequences for AChR clustering. While this is absolutely correct, the relatively low similarity of the AS exons between Ciona and mammals shown in Fig 1A may raise confusion or doubts in the readers regarding the homology of the event (as it did in my own case before I checked it in more detail). Therefore, an explicit alignment of both constitutive and alternative exons in a supplementary figure to clearly demonstrate the homology of the AS event across major taxonomic groups (with a few vertebrate and tunicate species) might help.

      Furthermore, expression of Nova in motor neurons of amphioxus (Branchiostoma lanceolatum) was previously reported (ref. 60), and a quick look into publicly available Agrin transcripts (https://www.ncbi.nlm.nih.gov/gene/136443694) reveals a homologous AS event in this cephalochordate species.

                    C1                   "Z7/Z6/Z8"            C2 (partial)
      

      Bla QADPAPLRQEGVG--LDGTTILNYPNAINK ... E-SNSIRE ... QEPNQDDNHFEVTFRTTSDHGLLLWNHKPGGG-DFIALAI Cro HSTDLLQDEQATAIYLDGTTKIMYRNAVKA ... --PNDFRE ... SRART-HNNYEIVFRTTARHGLLLMVGKAREGVDYIALAI Mmu IVEKSVGDLETLA--FDGRTYIEYLNAVTE ... ELTNEIPA ... EKALQ-SNHFELSLRTEATQGLVLWIGKVGERADYMALAI : . : * : * :. ..: ... ::: : : :: * ::***

      These two facts suggests a potential origin of the Nova-Agrin regulation at the base of the chordate phylum (and not restricted to Olfactores), which could be mentioned in the discussion as a relevant possibility.

      Minor comments:

      Line 168: Figure 2H instead of Figure H.

      Line 287: "Taken together, these results reveal that a Nova-Agrin-Lrp4 pathway for AChR receptor clustering at the neuromuscular synapse is conserved from mammals to tunicates." While this sentence might be true, from mammals to tunicates might imply that it is conserved in all vertebrate and tunicate lineages, and this is not explored in the manuscript (there might be secondary losses). It would be more technically correct to say something similar like "...the neuromuscular synapse is conserved in the studied mammalian and tunicate lineages" or "...the neuromuscular synapse originated before the evolutionary divergence of tunicates and vertebrates"

      Line 342. At the end of this paragraph, the possibility of conservation of the mechanism also in amphioxus could be discussed.

      Line 383: "the the apparent".

      Referees cross-commenting

      I agree that the mouse-specific agrin minigene to test the functionality of Nova1 and Nova2 would be a suitable positive control to discard protein stability/expression issues.

      Significance

      General assessment: This manuscript describes and demonstrates the deep evolutionary origin of a complex molecular pathway in the neuromuscular synapses of chordates. This work takes advantage of the broad genetic tools available in the tunicate Ciona robusta to support its main claims rigorously and strongly with a focused set of functional experiments. Moreover, it expands the known pathway revealing an upstream regulator of Nova in Ciona (Ebf) and opening a new research line in vertebrate motor neurons. The only minor limitation, in my opinion, is that it lacks testing of the MMM Nova isoform in the minigene assay, to explore whether it has (or not) a complementary function to the MLN isoform that could fully explain the endogenous AS pattern. Nevertheless, the current version of the manuscript is sufficiently robust to sustain its main conclusions.

      Advance: Previous studies have reported deeply conserved AS events regulated by homologous tissue-specific splicing factors suggesting a putative similar function, such as the case of Esrp regulating the splicing of FGFRs in amphioxus and vertebrates (Burguera et al. 2017). However, to my knowledge, this work is the first to analyse the functional conservation of an alternative splicing event between chordate clades in its endogenous context while demonstrating an homologous ontogenetic role. In addition, it provides new insights of the molecular interactions between the N-terminal and KH domains of Nova and how they bind to the NISE elements in the Agrin pre-mRNA.

      Audience: I consider this work interesting for a substantially broad audience, given that it reveals a surprisingly deep conservation of a molecular pathway across chordate lineages that is essential for the proper establishment of neuromuscular synapses. Thus, this study is imaginably interesting for evolutionary and molecular biologists, physiologists and even biomedical researchers that might be interested to explore a potential regulatory connection between Ebf genes and Nova in human motor neurons.

      Fields of expertise: evo-devo, alternative splicing, chordates, transcriptomic and genome evolution.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: Formation of the postsynaptic apparatus at the mammalian skeletal neuromuscular junction is controlled by a signaling pathway in which NOVA-mediated splicing generates an active form of the proteoglycan agrin (z-agrin), which is released from motoneurons, and interacts with LRP4 in muscle, leading to clustering of acetylcholine receptors beneath the nerve termina. This delightful paper demonstrates striking conservation of the pathway in the non-vertebrate chordate Ciona robusta, and also reveals some striking differences.

      Major comments: none

      Minor comments:

      44 - ALS, references 8-12. These are old papers. A new review should be cited, either instead of in addition.

      56 - "many" cases of CMS - some are not due to mutations in this pathway

      57 - refs 29-46. This is a very large number of references for a point this is quite unimportant to the story. It would be better to cite recent reviews.

      168 - should be 2H

      205 - make N terminal extension more apparent in Figure 3D

      235 - not a complete sentence

      308+ - can the authors clarify whether EBF knockdown has a selective effect on Nova vs general failure of the neurons to acquire a MN phenotype

      614 - explain in figure legend the decrease in apparent MR from left to right in 4B

      General - three other key components of the pathway are MuSK, rapsyn, and DOK-7. Functional studies of these genes fall beyond the scope of this paper, but it would be helpful to know whether they are expressed in muscle and, if so, whether expression is muscle-specific.

      Significance

      The paper is complete, and the results are compelling, nicely explained, and carefully documented. My comments are minor.

      It provides an interesting example of ways in which a pathway for synaptic development is conserved across distant vertebrate species, as well as ways inhich it is modified. I know of no other papers that do this.

      Audience: basic research interested in neuroscience, evolution and development

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript examines the role of the splicing factor Nova in Ciona robusta, a tunicate that is the closest relative to vertebrates. The authors demonstrate the co-expression of Agrn and Nova mRNA during development in motor neurons, highlighting the correlative appearance of Nova-spliced exons Z6 and Z5. Importantly, CRISPR/Cas9-mediated inhibition of agrin splicing, deletion of its receptor Lrp4, and loss of Nova result in a significant reduction in the number of motor neuron-muscle synapses. This finding supports the notion that neuromuscular synapse formation is similarly regulated in non-vertebrates as it is in vertebrates. The authors subsequently investigate the domains within Nova responsible for agrin splicing and identify the transcription factor Ebf as a regulator of Nova expression in motor neurons.

      The research is meticulously conducted, and the data are compelling, as they demonstrate that the Nova-agrin-Lrp4-MuSK axis is also operational in non-vertebrates. The conclusions drawn by the authors are generally adequate; however, I find some instances of "it is the first time..." to be unnecessary.

      The work also presents an unexpected finding that mouse Nova protein is unable to splice the Ciona agrin mini-gene (Figure 3). I believe the inability of mouse Nova1 and Nova2 to splice the Ciona agrin could also be due to insufficient expression levels of the mouse proteins. Therefore, the authors should include either a positive control (e.g., mouse agrin mini-gene) or demonstrate that the proteins are expressed at comparable levels.

      I am also not fully convinced that the model of autoinhibition for Ciona Nova is supported by sufficient experimental data. Again, there are no data showing that the levels of the various deletion mutants of Nova are consistent and hence, there could be issues with the stbsility of some of the deletion mutants and this could explain the observed difference in activity.

      Minor Points:

      • In all schematic presentations, exon Z6 appears larger than exon Z5. However, Z6 is only 24 bp long, while Z5 is 3434 bp long. Please adjust this representation.
      • Is there consistency in the relative proportions of the 24-bp (Z6), 33-bp (Z5), and 57-bp (Z6 + Z5) PCR products? Studies in vertebrates have shown that AChR clustering activity is highest with the Z8 and Z19 products, while the Z11 product appears to be somewhat less active. It would be nice to also point out the different splice products are detected in Ciona.
      • Writing errors:

      Line 111: 'Z11' Agrin should be corrected to 'Z19' Agrin.

      Line 168: "Figure H" should be updated to "Figure 2H."

      Referees cross-commenting

      In my view, the other reviews provide interesting insights that will further strengthen the manuscript.

      Significance

      This manuscript examines the role of the splicing factor Nova in Ciona robusta, a tunicate that is the closest relative to vertebrates. The authors demonstrate the co-expression of Agrn and Nova mRNA during development in motor neurons, highlighting the correlative appearance of Nova-spliced exons Z6 and Z5. Importantly, CRISPR/Cas9-mediated inhibition of agrin splicing, deletion of its receptor Lrp4, and loss of Nova result in a significant reduction in the number of motor neuron-muscle synapses. This finding supports the notion that neuromuscular synapse formation is similarly regulated in non-vertebrates as it is in vertebrates. The authors subsequently investigate the domains within Nova responsible for agrin splicing and identify the transcription factor Ebf as a regulator of Nova expression in motor neurons.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note : The original preprint version of our manuscript has been reviewed by 3 subject experts for Review Commons. All the three reviewers’ comments on the original version of our manuscript have been fully addressed. Their input was extremely valuable in helping us clarify and refine the presentation of our results and conclusions. Their feedback contributed to making the study both more thoroughly developed and more accessible to a broad readership, while preserving its mechanistic depth. We believe that this revised version more effectively highlights the conceptual advances brought by our findings.

      Reviewer #1

      Evidence, reproducibility and clarity

      The manuscript "Key roles of the zona pellucida and perivitelline space in promoting gamete fusion and fast block to polyspermy inferred from the choreography of spermatozoa in mice oocytes" by Dr. Gourier and colleagues explores the poorly understood process of gamete fusion and the subsequent block to polyspermy by live-cell imaging of mouse oocytes with intact zona pellucida in vitro. The new component in this study is the presence of the ZP, which in prior studies of live-cell imaging had been removed before. This allowed the authos to examine contributions of the ZP to the block in polyspermy in relation to the timing of sperm penetrating the ZP and sperm fusing with the oocyte. By carefully analysing the timing of the cascade of events, the authors find that the first sperm that reaches the membrane of the mouse oocyte is not necessarily the one that fertilizes the oocytes, revealing that other mechanisms post-ZP-penetration influence the success of individual sperm. While the rate of ZP penetration remains constant in unfertilized oocytes, it decreases upon fertilization for subsequent sperm, providing direct evidence for the known 'slow block to polyspermy' provided by changes to the ZP adhesion/ability to be penetrated. Careful statistical analyses allow the authors to revisit the role of the ZP in preventing polyspermy: They show that the ZP block resulting from the cortical reaction is too slow (in the range of an hour) to contribute to the immediate prevention of polyspermy in mice. The presented analyses reveal that the ZP does contribute to the block to polyspermy in two other ways, namely by effectively limiting the number of sperm that reach the oocyte surface in a fertilization-independent manner, and by retaining components like JUNO and CD9, that are shed from the oocyte plasma membrane after fertilization, in the perivitelline space, which may help neutralize surplus spermatozoa that are already present in the PVS. Lastly, the authors report that the ZP may also contribute to channeling the flagellar oscillations of spermatozoa in the PVS to promote their fusion competence.

      Major comments:

      • Are the key conclusions convincing?

      The authors provide a careful analysis of the dynamics of events, though the analyses are correlative, and can only be suggestive of causation. While this is a limitation of the study, it provides important analysis for future research. Moreover, by analysing also control oocytes without fertilization and the timing of events, the authors have in some instances clear 'negative controls' for comparison.

      Some claims would benefit from rewording or rephrasing to put the findings better in the context of what is already known and what is novel:

      • the phrasing 'challenging prior dogma' might be too strong since it had been observed before that it is not necessarily the first sperm that gets through the ZP that fertilizes the egg (though I am afraid that I do not have any citations or references for this). However, given that in the field people generally think it is not necessarily and always the first sperm, the authors may want to consider weakening this claim.

      Only real-time imaging of in vitro fertilization of zona pellucida-intact oocytes, as performed in our study, is capable of determining which spermatozoon crossing the zona pellucida fuses with the oocyte. However, such studies are rare, and most do not specifically address this question. As Reviewers 1 & 3, we have not found any citation or reference telling or showing that it is not necessarily the first spermatozoon to penetrate the zona pellucida that fertilizes the egg. In contrast, at least one reference (Sato et al., 1979) explicitly reports the opposite. If, as suggested by Reviewer 1 and 3, it has indeed been observed before that the first sperm to pass the ZP is not always the one that fertilizes, and if this idea is generally accepted in the field, then it is all the more important that a study demonstrates and publishes this point. This is precisely what our study makes possible. However, in case we may have overlooked a previous reference making the same observation as ours, we have removed the phrasing ‘challenging prior dogma’. That being said, the key issue is not so much that it is not necessarily the first spermatozoon penetrating the perivitelline space that fertilizes, but rather why spermatozoa that successfully reach the PVS of an unfertilized oocyte may fail to achieve fertilization. This is one of the central questions our study sought to address.

      • I do think the cortical granule release could still contribute to the block to polyspermy though - as the authors here nicely show - at a later time-point only, and thus not the major and not the immediate block as previously thought. The wording in the abstract should therefore be adjusted (since it could still contribute...)

      We are concerned that we may disagree on this point. The penetration block resulting from cortical granule release progressively reduces the permeability of the zona pellucida to spermatozoa, relative to its baseline permeability prior to sperm–oocyte fusion. Any decrease in this baseline permeability occurring before the fusion block becomes fully effective can contribute to the prevention of polyspermy by limiting the number of sperm that can access the oolemma at a time when fusion is still possible. In contrast, once the fusion block is fully established, limiting the number of spermatozoa traversing the ZP becomes irrelevant regarding the block to polyspermy, as the fusion block alone is sufficient to prevent additional fertilizations, rendering the penetration block obsolete. The only scenario that could challenge this obsolescence is if the fusion block were transient. In that case, as Reviewer 1 suggests, the penetration block could indeed play a role at a later time-point. However, taken together, our study and that of Nozawa et al. (2018) support the conclusion that this is not the case in mice:

      • Our in vitro study using kinetic tracking shows that the time constant for completion of the fusion block is typically 6.2 ± 1.3 minutes. During this time window, we observe that the permeability of the zona pellucida to spermatozoa does not yet decrease significantly from the baseline level it exhibited prior to sperm–oocyte fusion (see Figures 5B and S1B in the revised manuscript, and Figures 5A and 5B in the initial version). Consequently, before the fusion block is fully established, the penetration block can contribute only marginally—if at all—to the prevention of polyspermy. In contrast, the naturally low baseline permeability of the ZP—independent of any fertilization-triggered penetration block—as well as the relatively long timing of fusion ( minutes on average) after sperm penetration in the perivitelline space, are factors that contribute to the preservation of monospermic while the fusion block is still being established.
      • Our in vitro study using kinetic tracking shows that once the fusion block is completed following the first fusion event, no additional spermatozoa are able to fuse with the oocyte until the end of the experiment, 4 hours post-insemination (see blue points and fitting curve in Figure 5C). Meanwhile, one or more additional spermatozoa—most of them motile and therefore viable—are present in the perivitelline space in 50% of the oocytes analyzed (purple point in Figure 5C). This demonstrates that, once established, the fusion block remains effective for at least the entire duration of the experiment, supporting the idea of a fully functional and long-lasting fusion block.
      • Nozawa et al. (2018) found that female mice lacking ovastacin—the protease released during the cortical reaction that renders the zona pellucida impenetrable—are normally fertile. They additionally reported that the oocytes recovered from these females after mating are monospermic despite the systematic presence of additional spermatozoa in the perivitelline space. These findings further support the conclusion that in mice the fusion block is both permanent and sufficient to prevent polyspermy. For all these reasons, we believe that even at a later time-point, the penetration block does not contribute to the prevention of polyspermy in mice.

      To clarify the fact that the penetration block does not necessarily contribute to prevent polyspermy, which indeed challenges the commonly accepted view, we have substantially revised the discussion. Furthermore, Figure 9 from the initial version of the manuscript has been replaced by Figure 8 in the revised version. This new figure provides a more didactic illustration of the inefficacy of the penetration block in preventing polyspermy in mice, by showing the respective impact of the fusion block, the penetration block, as well as fusion timing and the natural baseline permeability of the zona pellucida, on the occurrence of polyspermy.

      As for the abstract, it has also been thoroughly revised. The content related to this section is now expressed in a way that emphasizes the factors that actively contribute to the prevention of polyspermy in mice, rather than those with no or marginal contribution (such as the penetration block in this case).

      • release of OPM components - in the abstract it's unclear what the authors mean by this - in the results part it becomes clear. Please already make it clear in the abstract that it is the fertility factors JUNO/CD9 that could bind to sperm heads upon their release and thus 'neutralize' them? I would also recommend not referring to it as 'outer' plasma membrane (there is no 'inner plasma membrane'). Moreover, in the abstract please clarify that this release is happening only after fusion of the first sperm and not all the time. In the abstract it sounds as if this was a completely new idea, but there is good prior evidence that this is in fact happening (as also then cited in the results part) - maybe frame it more as the retention inside the PVS as new finding.

      We thank reviewer 1 for pointing out the lack of precision in the abstract regarding the “components” released from the oolemma, and the fact that our phrasing may have given the impression that the post-fertilization release of CD9 and JUNO is a novel observation. The new observation is that CD9 and JUNO, which are known to be massively released from the oolemma after fertilization, bind to spermatozoa in the perivitelline space. However, we cannot rule out the possibility that other oocyte-derived molecules not investigated here may undergo a similar process. This is why we employed the broader term “components”, which encompasses both CD9 and JUNO as well as potential additional molecules. That said, we acknowledge the lack of precision introduced by this terminology. To address this, we have revised the corresponding sentence in the abstract to better reflect our new findings relative to previous ones, and to eliminate the ambiguity introduced by the word “component”.

      The revised sentence of the abstract reads as follows:

      “Our observation that non-fertilizing spermatozoa in the perivitelline space are coated with CD9 and JUNO oocyte’s proteins, which are known to be massively released from the oolemma after gamete fusion, supports the hypothesis that the fusion block involves an effective perivitelline space-block contribution consisting in the neutralization of supernumerary spermatozoa in the perivitelline space by these and potentially other oocyte-derived factors.”

      Moreover, we cannot state in the abstract that the release of CD9 and JUNO occurs only after the fusion of the first spermatozoon and not before, since some CD9 and JUNO are already detectable in the perivitelline space (PVS) prior to fusion. What our study shows is that, before fertilization, CD9 and JUNO are predominantly localized at the oocyte membrane. In contrast, after fusion (four hours post-insemination), oocyte CD9 is distributed between the membrane and the PVS, and the only JUNO signal detectable in the oocyte is found in the PVS. This is what we describe in the Results section on page 15.

      Regarding the acronym “OPM” in the initial version of the manuscript, although it was defined in the introduction as referring to the oocyte plasma membrane and not the outer plasma membrane (which, indeed, would not be meaningful), we acknowledge that it may have caused confusion to people in the field due to its resemblance to the commonly used meaningful acronym “OAM” for outer acrosomal membrane. To avoid any ambiguity, we have replaced the acronym “OPM” throughout the revised manuscript with the term “oolemma”, which unambiguously refers to the plasma membrane of the oocyte.

      It is unclear to me what the relevance of dividing the post-fusion/post-engulfment into different phases as done in Fig 2 (phase 1, and phase 2) - also for the conclusions of this paper this seems rather irrelevant and overly complicated, since the authors never get back to it and don't need it (it's not related to the polyspermy block analyses). I would remove it from the main figures and not divide into those phases since it is distracting from the main focus.

      Sperm engulfment and PB2 extrusion are two processes that follow sperm–oocyte fusion. As such, they are clear indicators that fusion has occurred and that meiosis has resumed. Their progression over time is readily identifiable in bright-field imaging: sperm engulfment is characterized by the gradual disappearance of the spermatozoon head from the oolemma, whereas PB2 extrusion is observed as the progressive emergence of a rounded protrusion from the oocyte membrane (Figure 2 in the initial manuscript and Figure S2 A&B in the revised version). The kinetics of these events, measured from the arrest of “push-up–like” movement of the sperm head against the oolemma —assumed to coincide with sperm-oocyte fusion, as further justified in a later response to Reviewer 1—provide reliable temporal landmarks for estimating the timing of fusion when the fusion event itself is not directly observed in real time (Figure S2 C&D).

      The four landmarks used in this estimation are:

      (i) the disappearance of the sperm head from the oolemma due to internalization (28 ± 2 minutes post-arrest, mean ± SD);

      (ii) the onset of PB2 protrusion from the oolemma (28 ± 2 minutes post-arrest);

      (iii) the moment when the contact angle between the PB2 protrusion and the oolemma shifts from greater than to less than 90° (49 ± 6 minutes post-arrest);

      (iv) the completion of PB2 extrusion (73 ± 10 minutes post-arrest).

      The approach used to determine the fusion time window of a fertilizing spermatozoon from these landmarks is detailed in the “Determination of the Fertilization Time Windows” section of the Materials and Methods. Compared to the initial version of the manuscript, we have added a paragraph explaining the rationale for using the arrest of the push-up–like movement as a reliable indicator for sperm–oocyte fusion and have clarified the description of the approach used to determine fertilization timing.

      The timed characterization of sperm engulfment and PB2 extrusion kinetics is highly relevant to the analysis of the penetration and fusion blocks, however we agree that its place is more appropriate in the Supplementary Information than in the main text. In accordance with the reviewer’s recommendation, this section has therefore been moved to the Supplementary Information SI2.

      For the statistical analysis, I am not sure whether the assumption "assumption that the probability distribution of penetration or fertilization is uniform within a given time window" is in fact true since the probability of fertilizing decreases after the first fertilization event.... Maybe I misunderstood this, but this needs to be explained (or clarified) better, or the limitation of this assumption needs to be highlighted.

      During in vitro fertilization experiments with kinetic tracking, each oocyte is observed sequentially in turn. As a result, sperm penetration into the perivitelline space or fusion with the oolemma may occur either during an observation round or in the interval between two rounds. In the former case, penetration or fusion is directly observed in real time, allowing for high temporal precision in determining the moment of the event. In contrast, when penetration or fusion occurs between two observation rounds, the precise timing cannot be directly determined. We can only ascertain that the event took place within the time window we have determined. Because, within a given penetration or fusion time window, we do not know the exact moment at which the event occurred, there is no reason to favor one time over another. This justifies the assumption that all time points within the window are equally probable. This explanation has been added in the section Statistical treatment of penetration and fertilization chronograms to study the kinetics of fertilization, penetration block and fusion block of the main text and in the section Statistical treatment of penetrations and fertilizations chronograms to study penetration and fusion blocks of the material and methods.

      -Suggestion for additional experiments:

      If I understood correctly, the onset of fusion in Fig 2C is defined by stopping of sperm beating? If it is by the sudden stop of the beating flagellum, this should be confirmed in this situation (with the ZP intact) that it correctly defines the time-point of fusion since this has not been measured in this set-up before as far as I understand. In order to measure this accurately, the authors will need to measure this accurate to be able to acquire those numbers (of time from fusion to end of engulfment), e.g. by pre-loading the oocyte with Hoechst to transfer Hoechst to the fusing sperm upon membrane fusion.

      The nuclear dye Hoechst is widely used as a marker of gamete fusion, as it transfers from the ooplasm—when preloaded with the dye—into the sperm nucleus upon membrane fusion, thereby signaling the happening of the fusion event. This technique is applicable in the context of in vitro fertilization using ZP-free oocytes. However, it is not suitable when cumulus–oocyte complexes are inseminated, as is the case in both in vitro experimental conditions of the present study (standard IVF and IVF with kinetic tracking). Indeed, when cumulus–oocyte complexes are incubated with Hoechst to preload the oocytes, the numerous surrounding cumulus cells also take up the dye. Consequently, upon insemination, spermatozoa acquire fluorescence while traversing and dispersing the cumulus mass—before reaching the ZP—thus rendering Hoechst labeling ineffective as a specific marker of membrane fusion. This remains true even under optimized conditions involving brief Hoechst incubation of cumulus–oocyte complexes ( Nonetheless, we have strong evidence supporting the use of the arrest of sperm movement as a surrogate marker for the moment of fusion. In our previous study (Ravaux et al., 2016; ref. 4 in the revised manuscript), we investigated the temporal relationship between the abrupt cessation of sperm head movement on the oolemma—resulting from strong flagellar beating arrest—and the fusion event, using ZP-free oocytes preloaded with Hoechst. That study revealed a temporal delay of less than one minute between the cessation of sperm oscillations and the actual membrane fusion, thereby supporting the conclusion that in ZP-free oocytes, the arrest of vigorous sperm movement at the oolemma is a reliable indicator of the moment at which fusion occurs. In the same study, the kinetics of sperm head internalization into the ooplasm were also characterized, typically concluding within 20–30 minutes after movement cessation. These findings are fully consistent with our current observations in ZP-intact oocytes, where sperm head engulfment was completed approximately 24 ± 3 minutes after the arrest of sperm oscillations. Taken together, these results strongly support the conclusion that, in both ZP-free and ZP-intact oocytes, the arrest of sperm movement is a reliable indicator of the fusion event. This assumption formed the basis for our determination of fertilization time points in the present study.

      These justifications were not fully detailed in the original version of the manuscript. We have addressed this in the revised version by explicitly presenting this rationale in the Materials and Methods section under Determination of the Fertilization Time Windows.

      Fig 8: 2 comments

      • To better show JUNO/CD9 pre-fusion attachment to the oocyte surface and post-fusion loss from the oocyte surface (but persistence in the PVS), an image after removal of the ZP (both for pre-fertilization and post-fertilization) would be helpful - the combination of those images with the ones you have (ZP intact) would make your point more visible.

      We have followed this recommendation. Figure 8 of the initial manuscript has been replaced by Figure 6 in the revised manuscript, which illustrates the four situations encountered in this study: fertilized and unfertilized oocytes, each with and without unfused spermatozoa in their PVS. To better show JUNO/CD9 pre-fusion presence to the oocyte plasma membrane, as well as their post-fusion partial (for CD9) and near-complete (for JUNO) loss from the oocyte membrane (but persistence in the PVS), paired images of the same oocyte before and after of ZP removal are now provided, both for unfertilized (Figure 6A) and fertilized oocytes (Figure 6C).

      • You show that the heads of spermatozoa post fusion are covered in CD9 and JUNO, yet I was missing an image of sperm in the PVS pre-fertilization (which should then not yet be covered).

      As staining and confocal imaging of the oocytes were performed 4 hours after insemination, images of sperm in the PVS of an oocyte “pre-fertilization” cannot be strictly obtained. However, we can have images of spermatozoa present in the PVS of oocytes that remained unfertilized. This situation, now illustrated in Figure 6B of the revised manuscript, shows that these spermatozoa are also covered in JUNO and CD9, which they may have progressively acquired over time from the baseline presence of these proteins in the PVS of unfertilized oocytes. This also may provide a mechanistic explanation for their inability to fuse with the oolemma, and, consequently, for the failure of fertilization in these oocytes.

      Minor comments:

      • The videos were remarkable to look at, and great to view in full. However, for the sake of time, the authors might want to consider cropping them for the individual phases to have a shorter video (with clear crop indicators) with the most important different stages visible in a for example 1 min video (e.g. video.

      We have followed this recommendation. The videos have been cropped and annotated in order to highlight the key events that support the points made in the result section from page 9 to 11 in the revised manuscript.

      • In general, given that the ZP, PVS and oocyte membrane are important components, a general scheme at the very beginning outlining the relative positioning of each before and during fertilization (and then possibly also including the second polar body release) would be extremely helpful for the reader to orient themselves.

      A general scheme addressing Reviewer 1 request, summarizing the key components and concepts discussed in the article and intended to help guide the reader, has been added to the introduction of the revised manuscript as Figure 1.

      • first header results "Multi-penetration and polyspermy under in vivo conditions and standard and kinetics in vitro fertilization conditions" is hard to understand - simplify/make clearer (comparison of in vivo and in vitro conditions? Establishing the in vitro condition as assay?)

      The title of the first Results section has been revised in accordance with Reviewer 1 suggestion. It now reads: Comparative study of penetration and fertilization rates under in vivo and two distinct in vitro fertilization conditions.

      • Large parts of the statistical analysis (the more technical parts) could be moved to the methods part since it disrupts the flow of the text.

      In the revised version of our manuscript, we have restructured this part of the analysis to ensure that more technical or secondary elements do not disrupt the flow of the main text. Accordingly, the equations have been reduced to only what is strictly necessary to understand our approach, their notation has been greatly simplified, and the statistical analysis of unfertilized oocytes whose zona pellucida was traversed by one or more spermatozoa has been moved to the Supplementary Information (SI1).

      • To me, one of the main conclusions was given in the text of the results part, namely that "This suggests that first fertilization contributes effectively to the fertilization-block, but less so to the penetration block". I would suggest that the authors use this conclusion to strengthen their rationale and storyline in the abstract.

      We agree with Reviewer 1 suggestion. Accordingly, we have not only thoroughly revised our abstract, but also the introduction and discussion, in order to better highlight the rationale of our study, its storyline, and the new findings which not only challenge certain established views but also open new research directions in the mechanisms of gamete fusion and polyspermy prevention.

      • Wording: To characterize the kinetics with which penetration of spermatozoa in the PVS falls down after a first fertilization," falls down should be replaced with decreases (page 10 and page 12)

      Falls down has been removed from the new version and replaced with decreases


      Significance

      Overall, this manuscript provides very interesting and carefully obtained data which provides important new insights particularly for reproductive biology. I applaud the authors on first establishing the in vivo conditions (how often do multiple sperm even penetrate the ZP in vivo) since studies have usually just started with in vitro condition where sperm at much higher concentration is added to isolated oocyte complexes. Thank you for providing an in vivo benchmark for the frequency of multiple sperm being in the PVS. While this frequency is rather low (somewhat expectedly, with 16% showing 2-3 sperm in the PVS), this condition clearly exists, providing a clear rationale for the investigation of mechanisms that can prevent additional sperm from entering.

      My own expertise is experimentally - thus I don't have sufficient expertise to evaluate the statistical methods employed here.

      __ __


      Reviewer #2

      Evidence, reproducibility and clarity

      Overall, this is a very interesting and relevant work for the field of fertilization. In general, the experimental strategies are adequate and well carried out. I have some questions and suggestions that should be considered before the work is published.

      1) Why are the cumulus cells not mentioned when the AR is triggered before or while the sperms cross it? It seems the paper assumes from previous work that all sperm that reach ZP and the OPM have carried out the acrosome reaction. This, though probably correct, is still a matter of controversy and should be discussed. It is in a way strange that the authors do not make some controls using sperm from mice expressing GFP in the acrosome, as they have used in their previous work.

      We do not mention the cumulus cells or whether the acrosome reaction is triggered before, during, or after their traversal (i.e., upon sperm binding to the ZP), as this question, while scientifically relevant, pertains to a distinct line of investigation that lies beyond the scope of the present study. Even with the use of spermatozoa expressing GFP in the acrosome, addressing this question would require a complete redesign of our kinetic tracking protocol, which was specifically conceived to monitor in bright field the dynamic behavior of spermatozoa from the moment they begin to penetrate the perivitelline space of an oocyte. Accordingly, we imaged oocytes that were isolated 15 minutes after insemination of the cumulus–oocyte complexes, by which time most (if not all) cumulus cells had detached from the oocytes, as explained in the fourth paragraph of the material and methods of both the initial and revised versions of the manuscript. The spermatozoa we had access to were therefore already bound to the zona pellucida at the time of removal from the insemination medium, and had thus necessarily passed through the cumulus layer. It is unclear for us why Reviewer 2 believes that we “assume from previous work that all sperm that reach ZP has carried out the acrosome reaction”. We could not find any statement in our manuscript suggesting, let alone asserting, such an assumption, which we know to be incorrect. Based on both published work from Hirohashi’s group in 2011 (Jin et al., 2011, DOI: 10.1073/pnas.1018202108) and our own unpublished observation (both involving cumulus-oocyte masses inseminated with spermatozoa expressing GFP in the acrosome), it is established that only a subset of spermatozoa reaching the ZP after crossing the cumulus layer has undergone acrosome reaction. Moreover, from the same sources—as well as from a recent publication by Buffone’s group (Jabloñsky et al., 2023 DOI: 10.7554/eLife.93792 ) which is the one to which reviewer 2 refers in her/his 3rd comment, it is also well established that spermatozoa have all undergone acrosome reaction when they enter the PVS. To the best of our knowledge, this latter point has long been widely accepted and is not questioned. Therefore, stating this in the first paragraph of the Discussion in the revised manuscript, while referencing the two aforementioned published studies, should be appropriate. What remains a matter of ongoing debate, however, is the timing and the physiological trigger(s) of the acrosome reaction in fertilizing spermatozoa. The 2011 study by Hirohashi’s group challenged the previously accepted view that ZP binding induces the acrosome reaction, showing instead that most spermatozoa capable of crossing the ZP and fertilizing the oocyte had already undergone the acrosome reaction prior to ZP binding. However, as this issue lies beyond the scope of our study, we do not consider it appropriate to include a discussion of it in the manuscript.

      2) In the penetration block equations, it is not clear to me why (𝑡𝑃𝐹1) refers to both PIPF1 and 𝜎𝜎𝑃I𝑃𝐹1. Is it as function off?

      That is correct: (tPF1) means function of the time post-first fertilization. Both the post-first fertilization penetration index (i.e. PIPF1) and its incertainty (i.e. 𝜎𝑃I𝑃𝐹1 ) vary as a function of this time. However, as mentioned in a previous response to Reviewer 1, this section has been rewritten to improve clarity and readability. The equations have been limited to those strictly necessary for understanding our approach, and their notation has been significantly simplified.

      3) Why do the authors think that the flagella stops. The submission date was 2024-10-01 07:27:26 and there has been a paper in biorxiv for a while that merits mention and discussion in this work (bioRxiv [Preprint]. 2024 Jul 2:2023.06.22.546073. doi: 10.1101/2023.06.22.546073.PMID: 37904966).

      Our experimental approach allows us to determine when the spermatozoon stops moving, but not why it stops. We thank Reviewer 3 for pointing out this very relevant paper from Buffone’s group (doi: 10.7554/eLife.93792) which shows the existence of two distinct populations of live, acrosome-reacted spermatozoa. These correspond to two successive stages, which occur either immediately upon acrosome reaction in a subset of spermatozoa, or after a variable delay in others, during which the sperm transitions from a motile to an immotile state. The transition from the first to the second stage was shown to follow a defined sequence: an increase in the sperm calcium concentration, followed by midpiece contraction associated with a local reorganization of the helical actin cortex, and ultimately the arrest of sperm motility. For fertilizing spermatozoa in the PVS, this transition was shown to occur upon fusion. However, it was also reported in some non-fertilizing spermatozoa that this transition took place within the PVS. These findings are consistent with the requirement for sperm motility in order to achieve fusion with the oolemma. Moreover, the fact that some spermatozoa may prematurely transition to the immotile state within the PVS can therefore be added to the list of possible reasons why a spermatozoon that penetrates the PVS of an oocyte might fail to fuse.

      This discussion has been added to the first paragraph of the Discussion section of our revised manuscript.

      4) Please correct at the beginning of Materials and Methos: Sperm was obtained from WT male mice, it should say were.

      Thank you, the correction has been done.

      5) This is also the case in the fourth paragraph of this section: oocyte were not was.

      The sentence in question has been modified as followed: “In the in vitro fertilization experiments with kinetic tracking, a subset of oocytes—together with their associated ZP-bound spermatozoa—was isolated 15 minutes post-insemination and transferred individually into microdrops of fertilization medium to enable identification.”


      Significance

      Understanding mammalian gamete fusion and polyspermy inhibition has not been fully achieved. The authors examined real time brightfield and confocal images of inseminated ZP-intact mouse oocytes and used statistical analyses to accurately determine the dynamics of the events that lead to fusion and involve polyspermy prevention under conditions as physiological as possible. Their kinetic observations in mice gamete interactions challenge present paradigms, as they document that the first sperm is not necessarily the one that fertilizes, suggesting the existence of other post-penetration fertilization factors. The authors find that the zona pellucida (ZP) block triggered by the cortical reaction is too slow to prevent polyspermy in this species. In contrast, their findings indicate that ZP directly contributes to the polyspermy block operating as a naturally effective entry barrier inhibiting the exit from the perivitelline space (PVS) of components released from the oocyte plasma membrane (OPM), neutralizing unwanted sperm fusion, aside from any block caused by fertilization. Furthermore, the authors unveil a new important ZP role regulating flagellar beat in fertilization by promoting sperm fusion in the PVS.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      SUMMARY: This study by Dubois et al. utilizes live-cell imaging studies of mouse oocytes undergoing fertilization. A strength of this study is their use of three different conditions for analyses of events of fertilization: (1) eggs undergoing fertilization retrieved from females at 15 hr after mating (n = 211 oocytes); (2) cumulus-oocyte complexes inseminated in vitro (n = 220 oocytes), and (3) zona pellucida (ZP)-intact eggs inseminated in vitro, transferred from insemination culture once sperm were observed bound to the ZP for subsequent live-cell imaging (93 oocytes). This dataset and these analyses are valuable for the field of fertilization biology. Limitations of this manuscript are challenges arise with some conclusions, and the presentation of the manuscript. There are some factual errors, and also some places where clearer explanations should to be provided, in the text and potentially augmented with illustrations to provide more clarity on the models that the authors interpret from their data.

      MAJOR COMMENTS:

      The authors are congratulated on their impressive collection of data from live-cell imaging. However, the writing in several sections is challenging to understand or seems to be of questionable accuracy. The lack of accuracy is suspected to be more an effect of overly ambitious attempts with writing style, rather than to mislead readers. Nevertheless, these aspects of the writing should be corrected. There also are multiple places where the manuscript contradicts itself. These contradictions should be corrected. Finally, there are factual points from previous studies that need correction.

      Second, certain claims and the conclusions as presented are not always clearly supported by the data. This may be connected to the issues with writing style, word and phrasing choices, etc. The conclusions could be expressed more clearly, and thus may not require additional experiments or analyses to support them. The authors might also consider illustrations as ways to highlight the points they wish to make. (Figure 7 is a strong example of how they use illustrations to complement the text).

      In response to Reviewer 3's concern about the writing style, which made several sections difficult to understand, we have thoroughly revised the entire manuscript to improve clarity, and precision. To further enhance comprehension, we have added illustrations in the revised version of the manuscript:

      • Figure 1A presents the gamete components; Figure 1B depicts the main steps of fertilization considered in the present study; and Figure 1C illustrates the penetration and fusion blocks, along with the respective contributing mechanisms: the ZP-block for the penetration block, and the membrane-block and PVS-block for the fusion block

      • Figure 2A provides a description of the three experimental protocols used in this study: Condition 1, in vivo fertilization after mating; Condition 2, standard in vitro fertilization following insemination of cumulus-oocyte complexes; and Condition 3, in vitro fertilization with kinetic tracking of oocytes isolated from the insemination medium 15 min after insemination of the cumulus-oocyte complexes.

      • Figure 4 (formerly Figure 7 in the initial version) now highlights all fusing and non-fusing situations documented in videos 1-6 and associated paragraphs of the Results section.

      • In the Discussion, Figure 9 from the original version has been replaced by Figure 8, which now provides a more pedagogical illustration of the inefficacy of the penetration block in preventing polyspermy in mice. This figure illustrates the respective contributions of the fusion block, the penetration block, fusion timing, and the intrinsic permeability of the zona pellucida to the occurrence of polyspermy.

      We hope that this revised version of the article will guide the reader smoothly throughout, without causing confusion.

      Regarding the various points that Reviewer 3 perceives as contradictions or factual errors, or the claims and the conclusions which, as presented, should not always supported by the data, we will provide our perspective on each of them as they are raised in the review.

      SPECIFIC COMMENTS:

      (1) The authors should use greater care in describing the blocks to polyspermy, particularly because they appear to be wishing to reframe views about prevention of polyspermic fertilization. The title mentions of "the fast block to polyspermy;" this problematic for a couple of different reasons. There is no strong evidence for block to polyspermy in mammals that occurs quickly, particularly not in the same time scale as the first-characterized fast block to polyspermy. To many biologists, the term "fast block to polyspermy" refers to the block that has been described in species like sea urchins and frogs, meaning a rapid depolarization of the egg plasma membrane. However, such depolarization events of the egg membrane have not been detected in multiple mammalian species. Moreover, the change in the egg membrane after fertilization does not occur in as fast a time scale as the membrane block in sea urchins and frogs (i.e., is not "fast" per se), and instead occurs in a comparable time frame as the conversation of the ZP associated with the cleavage of ZP2. Thus, it is misleading to use the terms "fast block" and "slow block" when talking about mammalian fertilization. This also is an instance of where the authors contradict themselves in the manuscript, stating, "the membrane block and the ZP block are established in approximatively the same time frame" (third paragraph of Introduction). This statement is indeed accurate, unlike the reference to a fast block to polyspermy in mammals.

      We fully agree with Reviewer 3 on the importance of clearly defining the two blocks examined in the present study—the penetration block and the fusion block (as referred to in the revised version) —and of situating them in relation to the three blocks described in the literature: the ZP-block, membrane-block, and PVS-block. We acknowledge that this distinction was not sufficiently clear in the original version of the manuscript. In the revised version, these two blocks and their relationship to the ZP-, membrane-, and PVS-blocks are now clearly introduced in the second paragraph of the Introduction section and illustrated in the first figure of the manuscript (Fig. 1C). They are then discussed in detail in two dedicated paragraphs of the Discussion, entitled Relation between the penetration block and the ZP-block and Relation between the fusion block and the membrane- and PVS-blocks.

      The penetration block refers to the time-dependent decrease in the number of spermatozoa penetrating the perivitelline space (PVS) following fertilization, whereas the fusion block refers to the time-dependent decrease in sperm-oolemma fusion events after fertilization. It is precisely to the characterization of these two blocks that our in vitro fertilization experiments with kinetic tracking allow us to access.

      In this study, as in the literature, fusion-triggered modifications of the ZP that hinder sperm traversal of the ZP are referred to as the ZP-block (also known as ZP hardening). The ZP-block thus contributes to the post-fertilization reduction in sperm penetration into the PVS and thereby underlies the penetration block. Similarly, fusion-triggered alterations of the PVS and the oolemma that reduce the likelihood of spermatozoa that have reached the PVS successfully to fuse with the oolemma are referred to as the PVS-block and membrane-block, respectively. These two blocks act together to reduce the probability of sperm-oolemma fusion after fertilization, and thus contribute to the fusion block.

      The time constant of the penetration block was found to be 48.3 ± 9.7 minutes, which is consistent with the typical timeframe of ZP-block completion—approximately one hour post-fertilization in mice—as reported in the literature. By contrast, the time constant of the fusion block was determined to be 6.2 ± 1.3 minutes, which is markedly faster than the time typically reported in the literature for the completion of the fusion-block (more than one hour in mice). This strongly suggests that the kinetics of the fusion block are not primarily governed by its membrane-block component, but rather by its PVS-block component—about which little to nothing was previously known.

      Contrary to what Reviewer 3 appears to have understood from our initial formulation, there is therefore no contradiction or error in stating that "the membrane block and the ZP block are established within approximately the same timeframe", while the fusion block, which proceeds much more rapidly, is likely to rely predominantly on the PVS-block. We have thoroughly revised the manuscript to clarify this key message of the study.

      However, we understand Reviewer 3’s objection to referring to the fusion block (or the PVS-block) as a fast block, given that this term is conventionally reserved for the immediate fertilization-triggered membrane depolarization occurring in sea urchins and frogs. Although the kinetics we report for the fusion block are considerably faster than those of the penetration block, they occur on the scale of minutes, and not seconds. In line with the reviewer's recommendation, we have therefore modified both the title and the relevant passages in the text to remove all references to the term fast block in the revised version.

      (2) The authors aim to make the case that events occurring in the perivitelline space (PVS) prevent polyspermic fertilization, but the data that they present is not strong enough to make this conclusion. Additional experiments would optional for this study, but data from such additional experiments are needed to support the authors' claims regarding these functions in fertilization. Without additional data, the authors need to be much more conservative in interpretations of their data. The authors have indeed observed phenomena (the presence of CD9 and JUNO in the PVS) that could be consistent with a molecular basis of a means to prevent fertilization by a second sperm. However, the authors would need additional data from additional experimental studies, such as interfering with the release of CD9 and JUNO and showing that this experimental manipulation leads to increased polyspermy, or creating an experimental situation that mimics the presence of CD9 and JUNO (in essence, what the authors call "sperm inhibiting medium" on page 20) and showing that this prevents fertilization.

      A major section of the Results section here (starting with "The consequence is that ... ") is speculation. Rather than be in the Results section, this should be in the Discussion. The language should be also softened regarding the roles of these proteins in the perivitelline space in other portions of the manuscript, such as the abstract and the introduction.

      Finally, the authors should do more to discuss their results with the results of Miyado et al. (2008), which interestingly, posited that CD9 is released from the oocytes and that this facilitates fertilization by rendering sperm more fusion-competent. There admittedly are two reports that present data that suggest lack of detection of CD9-containing exosomes from eggs (as proposed by Miyado et al.), but nevertheless, the authors should put their results in context with previous findings.

      We generally agree with all the remarks and suggestions made here. In the revised version of the manuscript, we have retained in the Results section (pp. 14–15) only the factual data concerning the localization of CD9 and JUNO in unfertilized and fertilized oocytes, as well as in the spermatozoa present in the PVS of these oocytes. We have taken care not to include any interpretive elements in this section, which are now presented exclusively in a dedicated paragraph of the Discussion, entitled “Possible molecular bases of the membrane-block and ZP-block contributing to the fusion block” (p. 21). There, we develop our hypothesis and discuss it in light of both the findings from the present study and previous work by other groups. In doing so, we also address the data reported by Miyado et al. (2008, https://doi.org/10.1073/pnas.0710608105), as well as subsequent studies by two other groups—Gupta et al. (2009, https://doi.org/10.1002/mrd.21040) and Barraud-Lange et al. (2012, https://doi.org/10.1530/REP-12-0040)—that have challenged Miyado’s findings.

      We are fully aware that our interpretation of the coverage of unfused sperm heads in the perivitelline space (PVS) by CD9 and JUNO, released from the oolemma—as a potential mechanism of sperm neutralization contributing to the PVS block—remains, at this stage, a plausible hypothesis or working model that, as such, warrants further experimental investigation. It is precisely in this spirit that we present it—first in the abstract (p.1), then in the Discussion section (p. 21), and subsequently in the perspective part of the Conclusion section (p. 22).

      (3) Many of the authors' conclusions focus on their prior analyses of sperm interaction - beautifully illustrated in Figure 7. However, the authors need to be cautious in their interpretations of these data and generalizing them to mammalian fertilization as a whole, because mouse and other rodent sperm have sperm head morphology that is quite different from most other mammalian species.

      In a similar vein, the authors should be cautious in their interpretations regarding the extension of these results to mammalian species other than mouse, given data on numbers of perivitelline sperm (ranging from 100s in some species to virtually none in other species), suggesting that different species rely on different egg-based blocks to polyspermy to varying extents. While these observations of embryos from natural matings are subject to numerous nuances, they nevertheless suggest that conclusions from mouse might not be able to be extended to all mammalian species.

      It is not clear to us whether Reviewer 3’s comment implies that we have, at some point in the manuscript, generalized conclusions obtained in mice to other mammalian species—which we have not—or whether it is simply a general, common-sense remark with which we fully agree: that findings established in one species cannot, by default, be assumed to apply to another.

      We would like to emphasize that throughout the manuscript, we have taken care to restrict our interpretations and conclusions to the mouse model, and we have avoided any unwarranted extrapolation to other species.

      To definitively close this matter—if there is indeed a matter—we have added the following clarifying statements in the revised version of the manuscript:

      In the introduction, second paragraph (pp. 2–3):"The variability across mammalian species in both the rate of fertilized oocytes with additional spermatozoa in their PVS (from 0 to more than 80%) after natural mating and the number of spermatozoa present in the PVS of these oocytes (from 0 to more than a hundred) suggests that the time for completion of the penetration block and thus its efficiency to prevent polyspermy can vary significantly between species."

      At the end of the preamble to the Results section (p. 4):"This experimental study was conducted in mice, which are the most widely used model for studying fertilization and polyspermy blocks in mammals. While there are many interspecies similarities, the findings presented here should not be directly extrapolated to humans or other mammalian species without species-specific validation."

      In the Conclusion, the first sentence is (p.22) : “This study sheds new light on the complex mechanisms that enable fertilization and ensure monospermy in mouse model.”

      Within the Conclusion section, among the perspectives of this work (p. 22):"In parallel, comparative studies in other mammalian species will be needed to assess the generality of the PVS-block and its contribution relative to the membrane-block and ZP-blocks, as well as the generality of the mechanical role played by flagellar beating and ZP mechanical constraint in membrane fusion."

      (4) Results, page 4 - It is very valuable that the authors clearly define what they mean by a penetrating spermatozoon and a fertilizing spermatozoon. However, they sometimes appear not to adhere to these definitions in other parts of the manuscript. An example of this is on page 10; the description of penetration of spermatozoon seems to be referring to membrane fusion with the oocyte plasma membrane, which the authors have alternatively called "fertilizing" or fertilization - although this is not entirely clear. The authors should go through all parts of the manuscript very carefully and ensure consistent use of their intended terminology.

      Overall, while these definitions on page 4 are valuable, it is still recommended that the authors explicitly state when they are addressing penetration of the ZP and fertilization via fusion of the sperm with the oocyte plasma membrane. This help significantly in comprehension by readers. An example is the section header in the middle of page 9 - this could be "Spermatozoa can penetrate the ZP after the fertilization, but have very low chances to fertilize."

      We chose to define our use of the term penetration at the beginning of the Results section because, as readers of fertilization studies, we have encountered on multiple occasions ambiguity as to whether this term was referring to sperm entry into the perivitelline space following zona pellucida traversal, or to the fusion of the sperm with the oolemma. To avoid such ambiguity, we were particularly careful throughout the writing of our original manuscript to use the term penetration exclusively to describe sperm entry into the PVS. The terms fertilizing and fusion were reserved specifically for membrane fusion between the gametes. However, as occasional lapses are always possible, we followed Reviewer 3’s recommendation and carefully re-examined the entire manuscript to ensure consistent use of our intended terminology. We did not identify any inconsistencies, including on page 10, which was cited as an example by Reviewer 3. We therefore confirm that, in accordance with our predefined terminology, all uses of the term penetration, on that page and anywhere else in our original manuscript, refer exclusively to sperm entry into the PVS and do not pertain to fusion with the oolemma.

      That said, it is important that all readers— including those who may only consult selected parts of the article—are able to understand it clearly. Therefore, despite the potential risk of slightly overloading the text, Reviewer 3’s suggestion to systematically associate the term penetration with ZP seems to us a sound one. However, we have opted instead to associate penetration with PVS, as our study focuses on the timing of sperm penetration into the perivitelline space, rather than on the traversal of the zona pellucida itself. Accordingly, except in a few rare instances where ambiguity seemed impossible, we have systematically used the phrasing “penetration into the PVS” throughout the revised version of the manuscript.

      Another variation of this is in the middle of page 9, where the authors use the terms "fertilization block" and "penetration block." These are not conventional terms, and venture into being jargon, which could leave some readers confused. The authors could clearly define what they mean, particularly with respect to "penetration block,"

      This point has already been addressed in our response to Comment 1 from Reviewer 3. We invite Reviewer 3 to refer to that response.

      This extends to other portions of the manuscript as well, such as Figure 2C, with the label on the y-axis being "Time after fertilization." It seems that what the authors actually observed here was the cessation of sperm tail motility. (It is not evident they they did an assessment of sperm-oocyte fusion here.)

      Regarding Figure 2C (original version), it has been merged with Figure 2B (original version) to form a single figure (Figure S2D), now included in Supplementary Information SI2. This new figure retains all the information originally presented in Figure 2C and indicates the time axis origin as the time when oscillatory movements of the sperm cease.

      That said, for the reasons detailed in our response to Reviewer 1 and in the Materials and Methods, we explain why it is legitimate to use the cessation of sperm head oscillations on the oolemma as a marker for the timing of the fusion event. We invite the reviewers to refer to that response for a full explanation of our rationale.

      (5) Several points that the authors try to make with several pieces of data do not come across clearly in the text, including Figure 2 on page 6, Figure 4 on page 9, and the various states utilized for the statistical treatment, "post-first penetration, post-first fertilization, no fertilization, penetration block and polyspermy block" on page 10. Either re-writing and clearer definitions'explanations are needed, and/or schematic illustrations could be considered to augment re-written text. Illustrations could be a valuable way present the intended concepts to readers more clearly and accurately. For example, Figure 4 and the associated text on page 9 get particularly confusing - although this sounds like a quite impressive dataset with observations of 138 sperm. Illustrations could be helpful, in the spirit of "a picture is worth 1000 words," to show what seem to be three different situations of sequences of events with the sperm they observed. Finally, the text in the Results about the 138 sperm is quite difficult to follow. It also might help comprehension to augment the percentages with the actual numbers of sperm - e.g., is 48.6% referring 67 of the total 138 sperm analyzed? Does the 85.1% refer to 57 of these 67 sperm?

      Figure 2 in the original version of our manuscript concerns sperm engulfment and PB2 extrusion. As already mentioned in our response to Reviewer 1, the characterization of sperm engulfment and PB2 extrusion kinetics is highly relevant to the analysis of the penetration and fusion blocks. However, we agree that its presence in the main text may distract the reader from the main focus of the study. Therefore, this figure and the associated text have been moved to the Supplementary Information in the revised manuscript (SI 2, pages 26–27).

      Regarding Figure 4 (original version), in response to Reviewer 3’s concern about the difficulty in grasping the message conveyed in its three graphs and associated text we have completely rethought the way these data are presented. Since the three graphs of Figure 4 were directly derived from the experimental timing data of sperm entry in the PVS and fusion with the oolemma in fertilized oocytes (originally shown in Figure 3A), we have combined them into a single figure in the revised manuscript: Figure 3 (page 8). This new Figure 3 now comprises three components:

      • Figure 3A remains unchanged from the original version and shows the timing of sperm penetration and fusion in fertilized oocytes. Each sperm category (fused or non-fused , penetrated in the PVS before fusion or after fusion) is represented using a color code clearly explained in the main text (last paragraph of page 7).
      • Figure 3B focuses specifically on the first spermatozoon to penetrate the PVS of each oocyte. It reports how many of these first-penetrating spermatozoa succeeded in fusing versus how many failed to do so, highlighting that being the first to arrive is not sufficient for fusion—other factors are involved. This is explained simply in the first paragraph of page 9.
      • Figure 3C considers all spermatozoa that entered the PVS of fertilized oocytes, classifying them into three categories: those that penetrated the PVS before fertilization, those that did so after fertilization, and those for which the timing could not be precisely determined. Such classification makes it apparent that the number of spermatozoa penetrating before and after fertilization is of the same order of magnitude, indicating that fertilization is not very effective at preventing further sperm entry into the PVS for the duration of our observations (~4 hours). To facilitate the identification of these three categories, the same color code used in Figure 3A is applied. In addition, within each category, the number of spermatozoa that successfully fused are indicated in black. This allows the reader to quickly assess the fertilization probability for each category—high for sperm entering before fertilization, very low or null for those entering after fertilization. This analysis shows that fertilization is far more effective at blocking sperm fusion than at blocking sperm penetration. This is clearly explained in the second paragraph of page 9. Regarding__ statistical analysis__, as already mentioned in our responses to Reviewers 1 and 2, this section has been rewritten to improve clarity and readability. The notation has also been significantly simplified. To improve the overall fluidity of the text related to the statistical analysis, Figure 3B (original version), which presented the timing of penetration into the perivitelline space of oocytes that remained unfertilized, along with its associated statistical analysis previously in Figure 5B), have been revised and transferred together in a single Figure S1 of the Supplementary Information (SI1, pages 26; now Figures S1A and S1B).

      (6) Introduction, page 2 - it is inaccurate to state that only diploid zygotes can develop into a "new being." Triploid zygotes typically fail early in develop, but can survive and, for example, contribute to molar pregnancies. Additionally, it would be beneficial to be more scientifically precise term than saying "development into a new being." This is recommended not only for scientific accuracy, but also due to current debates, including in lay public circles, about what defines "life" or human life.

      In response to Reviewer 3’s comment, we no longer state in the revised version of the manuscript that only diploid zygotes can develop into a new being. We have modified our wording as follows, on page 2, second paragraph: “In mammals, oocytes fertilized by more than one spermatozoon cannot develop into viable offspring.”

      (7) Introduction, page 2 - The mammalian sperm must pass through three layers, not just two as stated in the first paragraph of the Introduction. The authors should include the cumulus layer in this list of events of fertilization.

      The sentence from the introduction from the original manuscript mentioned by Reviewer 3 was: “To fertilize, a spermatozoon must successively pass two oocyte’s barriers.” This statement is accurate in the sense that the cumulus cell layer is not part of the oocyte itself, unlike the two oocyte’s barriers: the zona pellucida and the oolemma. Moreover, the traversal of the cumulus layer is not within the scope of our study, unlike the traversal of the zona pellucida and fusion with the oolemma. However, it is also correct that in our study the spermatozoa have passed through the cumulus layer before reaching the oocyte. Therefore, in response to Reviewer 3’s comment, we have revised the sentence to clarify this point as follows:

      “Once a spermatozoon has passed through the cumulus cell layer surrounding the oocyte, it still must overcome two oocyte’s barriers to complete fertilization.”

      (8) Introduction, page 2 - While there is evidence that zinc is released from mouse egg upon fertilization, the evidence is not convincing or conclusive that zinc is released from cortical granules or via cortical granule exocytosis.

      To better highlight the rationale, storyline, and scope of our study, the introduction has been thoroughly streamlined. In this context, the section discussing the cortical reaction and zinc release seemed more appropriate in the Discussion, specifically within the paragraph titled “Relationship between the penetration block and the ZP-block.”

      To address the uncertainty raised by Reviewer 3 regarding the origin of the zinc spark release, we have rephrased this part as follows:

      “The fertilization-triggered processes responsible for the changes in ZP properties are generally attributed to the cortical reaction—a calcium-induced exocytosis of secretory granules (cortical granules) present in the cortex of unfertilized mammalian oocytes—and to zinc sparks. As a result, proteases, glycosidases, lectins, and zinc are released into the perivitelline space (PVS), where they act on the components of the zona pellucida. This leads to a series of modifications collectively referred to as ZP hardening or the ZP-block”.

      (9) The authors inaccurately state, "only if monospermic multi-penetrated oocytes are able to develop normally, which to our knowledge has never been proven in mice" (page 4) - This was demonstrated with the Astl knockout, assuming that the authors use of "multi-penetrated oocytes" here refers to the definition of penetration that they use, namely penetrating the ZP. This also is one of the instances where the authors contradict themselves, as they note the results with this knockout on page 18.

      Thank you for bringing this point to our attention. Nozawa et al. (2018) found that female mice lacking ovastacin (Astl)—the protease released during the cortical reaction that plays a key role in rendering the zona pellucida impenetrable—are normally fertile. They also reported that oocytes recovered from these females after mating were monospermic, despite the consistent presence of additional spermatozoa in the perivitelline space. We can indeed consider that taken together these findings demonstrate that the presence of multiple spermatozoa in the PVS does not impair normal development, as long as the oocyte remains monospermic. In our study, we re-demonstrated this in a different way (by reimplantation of monospermic oocytes with additional spermatozoa in their PVS) in a more physiological context of WT oocytes, but we agree that we cannot state: “which to our knowledge has never been proven in mice.” This part of the sentence has therefore been removed. In the revised version of the manuscript, the sentence is now formulated in the first paragraph of page 5 as follows: “However, the contribution of the fusion block to prevent polyspermy has physiological significance only if monospermic oocytes with additional spermatozoa in their PVS can develop into viable pups.”

      Minor comments:

      There are numerous places where this reader marked places of confusion in the text. A sample of some of these:

      We will indicate hereinafter how we have modified the text in the specific examples provided by Reviewer 3. Beyond these, however, we would like to emphasize that we have thoroughly revised the entire manuscript to improve clarity and precision.

      Page 4 - "continuously relayed by other if they detach" - don't know what this means

      Replaced now p 5 by “can be replaced by others if they detach”

      Page 6 - "hernia" - do the authors mean "protrusion" on the oocyte surface?

      The paragraph from the Results section in question has now been moved to the Supplementary Information, on pages 26 and 27. The term hernia has been systematically replaced with protrusion, including in the Materials and Methods section on page 24.

      Page 10 - "penetration of spermatozoa in the PVS falls down" - don't know what this means

      Falls down has been removed from the new version and replaced with decreases

      Page 12 - "spermatozoa linked to the oocyte ZP" - not clear what "linked" means here

      Replaced now page 16 by “spermatozoa bound to the oocyte ZP”

      Page 14 - "by dint of oscillations" - don't know what this means

      Replaced now page 10 by “the persistent flagellum movements”

      Specifics for Materials and Methods:

      Exact timing of females receiving hCG and then being put with males for mating - assume this was immediate but this is an important detail regarding the timing for the creation of embryos in vivo.

      That is correct: females were placed with males for mating immediately after receiving hCG. This clarification has been added in the revised version of the manuscript.

      Please provide the volumes in which inseminations occurred, and how many eggs were placed in this volume with the 10^6 sperm/ml.

      The number of eggs may vary from one cumulus–oocyte complex to another. It is therefore not possible to specify exactly how many eggs were inseminated. However, we now indicate on page 23 the number of cumulus–oocyte complexes inseminated (4 per experiment), the volume in which insemination was performed (200 mL), and the sperm concentration used 106 sperm/mL.

      **Referees cross-commenting**

      I concur with Reviewer 1's comment, that the 'challenging prior dogma' about the first sperm not always being the one to fertilize the egg is too strong. As Reviewer 1 notes, "it had been observed before that it is not necessarily the first sperm that gets through the ZP that fertilizes the egg." I even thought about adding this comment to my review, although held off (I was hoping to find references, but that was taking too long).

      Please refer to our response to Reviewer 1 regarding this point.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This study by Dubois et al. utilizes live-cell imaging studies of mouse oocytes undergoing fertilization. A strength of this study is their use of three different conditions for analyses of events of fertilization: (1) eggs undergoing fertilization retrieved from females at 15 hr after mating (n = 211 oocytes); (2) cumulus-oocyte complexes inseminated in vitro (n = 220 oocytes), and (3) zona pellucida (ZP)-intact eggs inseminated in vitro, transferred from insemination culture once sperm were observed bound to the ZP for subsequent live-cell imaging (93 oocytes). This dataset and these analyses are valuable for the field of fertilization biology. Limitations of this manuscript are challenges arise with some conclusions, and the presentation of the manuscript. There are some factual errors, and also some places where clearer explanations should to be provided, in the text and potentially augmented with illustrations to provide more clarity on the models that the authors interpret from their data.

      Major comments:

      The authors are congratulated on their impressive collection of data from live-cell imaging. However, the writing in several sections is challenging to understand or seems to be of questionable accuracy. The lack of accuracy is suspected to be more an effect of overly ambitious attempts with writing style, rather than to mislead readers. Nevertheless, these aspects of the writing should be corrected. There also are multiple places where the manuscript contradicts itself. These contradictions should be corrected. Finally, there are factual points from previous studies that need correction.

      Second, certain claims and the conclusions as presented are not always clearly supported by the data. This may be connected to the issues with writing style, word and phrasing choices, etc. The conclusions could be expressed more clearly, and thus may not require additional experiments or analyses to support them. The authors might also consider illustrations as ways to highlight the points they wish to make. (Figure 7 is a strong example of how they use illustrations to complement the text).

      Specific comments:

      1. The authors should use greater care in describing the blocks to polyspermy, particularly because they appear to be wishing to reframe views about prevention of polyspermic fertilization. The title mentions of "the fast block to polyspermy;" this problematic for a couple of different reasons. There is no strong evidence for block to polyspermy in mammals that occurs quickly, particularly not in the same time scale as the first-characterized fast block to polyspermy. To many biologists, the term "fast block to polyspermy" refers to the block that has been described in species like sea urchins and frogs, meaning a rapid depolarization of the egg plasma membrane. However, such depolarization events of the egg membrane have not been detected in multiple mammalian species. Moreover, the change in the egg membrane after fertilization does not occur in as fast a time scale as the membrane block in sea urchins and frogs (i.e., is not "fast" per se), and instead occurs in a comparable time frame as the conversation of the ZP associated with the cleavage of ZP2. Thus, it is misleading to use the terms "fast block" and "slow block" when talking about mammalian fertilization.

      This also is an instance of where the authors contradict themselves in the manuscript, stating, "the membrane block and the ZP block are established in approximatively the same time frame" (third paragraph of Introduction). This statement is indeed accurate, unlike the reference to a fast block to polyspermy in mammals.<br /> 2. The authors aim to make the case that events occurring in the perivitelline space (PVS) prevent polyspermic fertilization, but the data that they present is not strong enough to make this conclusion. Additional experiments would optional for this study, but data from such additional experiments are needed to support the authors' claims regarding these functions in fertilization. Without additional data, the authors need to be much more conservative in interpretations of their data. The authors have indeed observed phenomena (the presence of CD9 and JUNO in the PVS) that could be consistent with a molecular basis of a means to prevent fertilization by a second sperm. However, the authors would need additional data from additional experimental studies, such as interfering with the release of CD9 and JUNO and showing that this experimental manipulation leads to increased polyspermy, or creating an experimental situation that mimics the presence of CD9 and JUNO (in essence, what the authors call "sperm inhibiting medium" on page 20) and showing that this prevents fertilization.

      A major section of the Results section here (starting with "The consequence is that ... ") is speculation. Rather than be in the Results section, this should be in the Discussion. The language should be also softened regarding the roles of these proteins in the perivitelline space in other portions of the manuscript, such as the abstract and the introduction.

      Finally, the authors should do more to discuss their results with the results of Miyado et al. (2008), which interestingly, posited that CD9 is released from the oocytes and that this facilitates fertilization by rendering sperm more fusion-competent. There admittedly are two reports that present data that suggest lack of detection of CD9-containing exosomes from eggs (as proposed by Miyado et al.), but nevertheless, the authors should put their results in context with previous findings. 3. Many of the authors' conclusions focus on their prior analyses of sperm interaction - beautifully illustrated in Figure 7. However, the authors need to be cautious in their interpretations of these data and generalizing them to mammalian fertilization as a whole, because mouse and other rodent sperm have sperm head morphology that is quite different from most other mammalian species.

      In a similar vein, the authors should be cautious in their interpretations regarding the extension of these results to mammalian species other than mouse, given data on numbers of perivitelline sperm (ranging from 100s in some species to virtually none in other species), suggesting that different species rely on different egg-based blocks to polyspermy to varying extents. While these observations of embryos from natural matings are subject to numerous nuances, they nevertheless suggest that conclusions from mouse might not be able to be extended to all mammalian species.<br /> 4. Results, page 4 - It is very valuable that the authors clearly define what they mean by a penetrating spermatozoon and a fertilizing spermatozoon. However, they sometimes appear not to adhere to these definitions in other parts of the manuscript. An example of this is on page 10; the description of penetration of spermatozoon seems to be referring to membrane fusion with the oocyte plasma membrane, which the authors have alternatively called "fertilizing" or fertilization - although this is not entirely clear. The authors should go through all parts of the manuscript very carefully and ensure consistent use of their intended terminology.

      Overall, while these definitions on page 4 are valuable, it is still recommended that the authors explicitly state when they are addressing penetration of the ZP and fertilization via fusion of the sperm with the oocyte plasma membrane. This help significantly in comprehension by readers. An example is the section header in the middle of page 9 - this could be "Spermatozoa can penetrate the ZP after the fertilization, but have very low chances to fertilize."

      Another variation of this is in the middle of page 9, where the authors use the terms "fertilization block" and "penetration block." These are not conventional terms, and venture into being jargon, which could leave some readers confused. The authors could clearly define what they mean, particularly with respect to "penetration block,"

      This extends to other portions of the manuscript as well, such as Figure 2C, with the label on the y-axis being "Time after fertilization." It seems that what the authors actually observed here was the cessation of sperm tail motility. (It is not evident they they did an assessment of sperm-oocyte fusion here.) 5. Several points that the authors try to make with several pieces of data do not come across clearly in the text, including Figure 2 on page 6, Figure 4 on page 9, and the various states utilized for the statistical treatment, "post-first penetration, post-first fertilization, no fertilization, penetration block and polyspermy block" on page 10 . Either re-writing and clearer definitions'explanations are needed, and/or schematic illustrations could be considered to augment re-written text. Illustrations could be a valuable way present the intended concepts to readers more clearly and accurately. For example, Figure 4 and the associated text on page 9 get particularly confusing - although this sounds like a quite impressive dataset with observations of 138 sperm. Illustrations could be helpful, in the spirit of "a picture is worth 1000 words," to show what seem to be three different situations of sequences of events with the sperm they observed. Finally, the text in the Results about the 138 sperm is quite difficult to follow. It also might help comprehension to augment the percentages with the actual numbers of sperm - e.g., is 48.6% referring 67 of the total 138 sperm analyzed? Does the 85.1% refer to 57 of these 67 sperm?<br /> 6. Introduction, page 2 - it is inaccurate to state that only diploid zygotes can develop into a "new being." Triploid zygotes typically fail early in develop, but can survive and, for example, contribute to molar pregnancies. Additionally, it would be beneficial to be more scientifically precise term than saying "development into a new being." This is recommended not only for scientific accuracy, but also due to current debates, including in lay public circles, about what defines "life" or human life. <br /> 7. Introduction, page 2 - The mammalian sperm must pass through three layers, not just two as stated in the first paragraph of the Introduction. The authors should include the cumulus layer in this list of events of fertilization. 8. Introduction, page 2 - While there is evidence that zinc is released from mouse egg upon fertilization, the evidence is not convincing or conclusive that zinc is released from cortical granules or via cortical granule exocytosis.<br /> 9. The authors inaccurately state, "only if monospermic multi-penetrated oocytes are able to develop normally, which to our knowledge has never been proven in mice" (page 4) - This was demonstrated with the Astl knockout, assuming that the authors use of "multi-penetrated oocytes" here refers to the definition of penetration that they use, namely penetrating the ZP. This also is one of the instances where the authors contradict themselves, as they note the results with this knockout on page 18.

      Minor comments:

      There are numerous places where this reader marked places of confusion in the text. A sample of some of these:

      Page 4 - "continuously relayed by other if they detach" - don't know what this means

      Page 6 - "hernia" - do the authors mean "protrusion" on the oocyte surface?

      Page 10 - "penetration of spermatozoa in the PVS falls down" - don't know what this means

      Page 12 - "spermatozoa linked to the oocyte ZP" - not clear what "linked" means here

      Page 14 - "by dint of oscillations" - don't know what this means

      Specifics for Materials and Methods:

      Exact timing of females receiving hCG and then being put with males for mating - assume this was immediate but this is an important detail regarding the timing for the creation of embryos in vivo.

      Please provide the volumes in which inseminations occurred, and how many eggs were placed in this volume with the 10^6 sperm/ml.

      Referees cross-commenting

      I concur with Reviewer 1's comment, that the 'challenging prior dogma' about the first sperm not always being the one to fertilize the egg is too strong. As Reviewer 1 notes, "it had been observed before that it is not necessarily the first sperm that gets through the ZP that fertilizes the egg." I even thought about adding this comment to my review, although held off (I was hoping to find references, but that was taking too long).

      Significance

      This manuscript brings interesting new observations for the field of gamete and fertilization biology. For very obvious reasons, the understanding of mammalian fertilization has lagged behind the understanding of fertilization of species with external fertilization. Decades ago, developmental biologists first focused on studies of fertilization on gametes from species that release sperm and egg into water, either spontaneously or with relatively easy stimulation, and gametes that could be easily cultured and enabled to create embryos as researchers watched. Studies of mammalian fertilization have since caught up, with the elucidation of conditions that support in vitro fertilization in various mammalian species, most notably mouse as an experimental model.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Overall, this is a very interesting and relevant work for the field of fertilization. In general, the experimental strategies are adequate and well carried out. I have some questions and suggestions that should be considered before the work is published.

      1. Why are the cumulus cells not mentioned when the AR is triggered before or while the sperms cross it? It seems the paper assumes from previous work that all sperm that reach ZP and the OPM have carried out the acrosome reaction. This, though probably correct, is still a matter of controversy and should be discussed. It is in a way strange that the authors do not make some controls using sperm from mice expressing GFP in the acrosome, as they have used in their previous work.
      2. In the penetration block equations, it is not clear to me why (𝑡𝑃𝐹1) refers to both PIPF1 and 𝜎𝜎𝑃I𝑃𝐹1. Is it as function off?
      3. Why do the authors think that the flagella stops. The submission date was 2024-10-01 07:27:26 and there has been a paper in biorxiv for a while that merits mention and discussion in this work (bioRxiv [Preprint]. 2024 Jul 2:2023.06.22.546073. doi: 10.1101/2023.06.22.546073.PMID: 37904966).
      4. Please correct at the beginning of Materials and Methos: Sperm was obtained from WT male mice, it should say were.
      5. This is also the case in the fourth paragraph of this section: oocyte were not was.

      Significance

      Understanding mammalian gamete fusion and polyspermy inhibition has not been fully achieved. The authors examined real time brightfield and confocal images of inseminated ZP-intact mouse oocytes and used statistical analyses to accurately determine the dynamics of the events that lead to fusion and involve polyspermy prevention under conditions as physiological as possible. Their kinetic observations in mice gamete interactions challenge present paradigms, as they document that the first sperm is not necessarily the one that fertilizes, suggesting the existence of other post-penetration fertilization factors. The authors find that the zona pellucida (ZP) block triggered by the cortical reaction is too slow to prevent polyspermy in this species. In contrast, their findings indicate that ZP directly contributes to the polyspermy block operating as a naturally effective entry barrier inhibiting the exit from the perivitelline space (PVS) of components released from the oocyte plasma membrane (OPM), neutralizing unwanted sperm fusion, aside from any block caused by fertilization. Furthermore, the authors unveil a new important ZP role regulating flagellar beat in fertilization by promoting sperm fusion in the PVS.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript "Key roles of the zona pellucida and perivitelline space in promoting gamete fusion and fast block to polyspermy inferred from the choreography of spermatozoa in mice oocytes" by Dr. Gourier and colleagues explores the poorly understood process of gamete fusion and the subsequent block to polyspermy by live-cell imaging of mouse oocytes with intact zona pellucida in vitro. The new component in this study is the presence of the ZP, which in prior studies of live-cell imaging had been removed before. This allowed the authos to examine contributions of the ZP to the block in polyspermy in relation to the timing of sperm penetrating the ZP and sperm fusing with the oocyte. By carefully analysing the timing of the cascade of events, the authors find that the first sperm that reaches the membrane of the mouse oocyte is not necessarily the one that fertilizes the oocytes, revealing that other mechanisms post-ZP-penetration influence the success of individual sperm. While the rate of ZP penetration remains constant in unfertilized oocytes, it decreases upon fertilization for subsequent sperm, providing direct evidence for the known 'slow block to polyspermy' provided by changes to the ZP adhesion/ability to be penetrated. Careful statistical analyses allow the authors to revisit the role of the ZP in preventing polyspermy: They show that the ZP block resulting from the cortical reaction is too slow (in the range of an hour) to contribute to the immediate prevention of polyspermy in mice. The presented analyses reveal that the ZP does contribute to the block to polyspermy in two other ways, namely by effectively limiting the number of sperm that reach the oocyte surface in a fertilization-independent manner, and by retaining components like JUNO and CD9, that are shed from the oocyte plasma membrane after fertilization, in the perivitelline space, which may help neutralize surplus spermatozoa that are already present in the PVS. Lastly, the authors report that the ZP may also contribute to channeling the flagellar oscillations of spermatozoa in the PVS to promote their fusion competence.

      Major comments:

      • Are the key conclusions convincing?

      The authors provide a careful analysis of the dynamics of events, though the analyses are correlative, and can only be suggestive of causation. While this is a limitation of the study, it provides important analysis for future research. Moreover, by analysing also control oocytes without fertilization and the timing of events, the authors have in some instances clear 'negative controls' for comparison.

      Some claims would benefit from rewording or rephrasing to put the findings better in the context of what is already known and what is novel: - the phrasing 'challenging prior dogma' might be too strong since it had been observed before that it is not necessarily the first sperm that gets through the ZP that fertilizes the egg (though I am afraid that I do not have any citations or references for this). However, given that in the field people generally think it is not necessarily and always the first sperm, the authors may want to consider weakening this claim. - I do think the cortical granule release could still contribute to the block to polyspermy though - as the authors here nicely show - at a later time-point only, and thus not the major and not the immediate block as previously thought. The wording in the abstract should therefore be adjusted (since it could still contribute...) - the finding that the ZP presents a natural effective barrier for sperm entry is not that novel (as suggested here) - there are mutants that prevent sperm from getting through the ZP and thus to the oocyte and those lead to sterility - release of OPM components - in the abstract it's unclear what the authors mean by this - in the results part it becomes clear. Please already make it clear in the abstract that it is the fertility factors JUNO/CD9 that could bind to sperm heads upon their release and thus 'neutralize' them? I would also recommend not referring to it as 'outer' plasma membrane (there is no 'inner plasma membrane'). Moreover, in the abstract please clarify that this release is happening only after fusion of the first sperm and not all the time. In the abstract it sounds as if this was a completely new idea, but there is good prior evidence that this is in fact happening (as also then cited in the results part) - maybe frame it more as the retention inside the PVS as new finding.

      It is unclear to me what the relevance of dividing the post-fusion/post-engulfment into different phases as done in Fig 2 (phase 1, and phase 2) - also for the conclusions of this paper this seems rather irrelevant and overly complicated, since the authors never get back to it and don't need it (it's not related to the polyspermy block analyses). I would remove it from the main figures and not divide into those phases since it is distracting from the main focus.

      For the statistical analysis, I am not sure whether the assumption "assumption that the probability distribution of penetration or fertilization is uniform within a given time window" is in fact true since the probability of fertilizing decreases after the first fertilization event.... Maybe I misunderstood this, but this needs to be explained (or clarified) better, or the limitation of this assumption needs to be highlighted. - Suggestion for additional experiments:

      If I understood correctly, the onset of fusion in Fig 2C is defined by stopping of sperm beating? If it is by the sudden stop of the beating flagellum, this should be confirmed in this situation (with the ZP intact) that it correctly defines the time-point of fusion since this has not been measured in this set-up before as far as I understand. In order to measure this accurately, the authors will need to measure this accurate to be able to acquire those numbers (of time from fusion to end of engulfment), e.g. by pre-loading the oocyte with Hoechst to transfer Hoechst to the fusing sperm upon membrane fusion.

      Fig 8: 2 comments - To better show JUNO/CD9 pre-fusion attachment to the oocyte surface and post-fusion loss from the oocyte surface (but persistence in the PVS), an image after removal of the ZP (both for pre-fertilization and post-fertilization) would be helpful - the combination of those images with the ones you have (ZP intact) would make your point more visible. - You show that the heads of spermatozoa post fusion are covered in CD9 and JUNO, yet I was missing an image of sperm in the PVS pre-fertilization (which should then not yet be covered).

      Minor comments:

      • The videos were remarkable to look at, and great to view in full. However, for the sake of time, the authors might want to consider cropping them for the individual phases to have a shorter video (with clear crop indicators) with the most important different stages visible in a for example 1 min video (e.g. video 1)
      • In general, given that the ZP, PVS and oocyte membrane are important components, a general scheme at the very beginning outlining the relative positioning of each before and during fertilization (and then possibly also including the second polar body release) would be extremely helpful for the reader to orient themselves.
      • first header results "Multi-penetration and polyspermy under in vivo conditions and standard and kinetics in vitro fertilization conditions" is hard to understand - simplify/make clearer (comparison of in vivo and in vitro conditions? Establishing the in vitro condition as assay?)
      • Large parts of the statistical analysis (the more technical parts) could be moved to the methods part since it disrupts the flow of the text.
      • To me, one of the main conclusions was given in the text of the results part, namely that "This suggests that first fertilization contributes effectively to the fertilization
      • block, but less so to the penetration block". I would suggest that the authors use this conclusion to strengthen their rationale and storyline in the abstract.
      • Wording: To characterize the kinetics with which penetration of spermatozoa in the PVS falls down after a first fertilization," falls down should be replaced with decreases (page 10 and page 12)

      Significance

      Overall, this manuscript provides very interesting and carefully obtained data which provides important new insights particularly for reproductive biology. I applaud the authors on first establishing the in vivo conditions (how often do multiple sperm even penetrate the ZP in vivo) since studies have usually just started with in vitro condition where sperm at much higher concentration is added to isolated oocyte complexes. Thank you for providing an in vivo benchmark for the frequency of multiple sperm being in the PVS. While this frequency is rather low (somewhat expectedly, with 16% showing 2-3 sperm in the PVS), this condition clearly exists, providing a clear rationale for the investigation of mechanisms that can prevent additional sperm from entering.

      My own expertise is experimentally - thus I don't have sufficient expertise to evaluate the statistical methods employed here.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors use Dyngo-4a, a known Dynami inhibitor to test its influence on caveolar assembly and surface mobility. They investigate, whether it incorporates into membranes with Quartz-Crystal Microbalance, they investigate how it is organized in membranes using simulations. Finally, they use lipid-packing sensitive dyes to investigate lipid packing in the presence of Dyngo-4a, membrane stiffness using AFM and membrane undulation using fluorescence microscopy. They also use a measure they call "caveola duration time" to claim that something happens to caveolae after Dyngo-4a addition and using this parameter, they do indeed see an increase in it in response to Dyngo-4a, which is reduced back to the baseline after addition of cholesterol.

      Overall, the authors claim: 1) Dyngo-4a inserts into the membrane and this 2) results in "a dramatic dynamin-independent inhibition of caveola scission". 3) Dyngo-4a was inserted and positioned at the level of cholesterol in the bilayer and 4) Dyngo-4a-treatment resulted in decreased lipid packing in the outer leaflet of the plasma membrane 5) but Dyngo-4a did not affect caveola morphology, caveolae- associated proteins, or the overall membrane stiffness 6) acute addition of cholesterol counteracts the block in caveola scission caused by Dyngo-4a

      Overall, in this reviewers opinion, claims 1, 3, 4, 5 are well-supported by the presented data from electron and live cell microscopy, QCM-D and AFM. However, there is no convincing assay for caveolar endocytosis presented besides the "caveola duration" which although unclearly described seems to be the time it takes in imaging until a caveolae is not picked up by the tracking software anymore in TIRF microscopy. Since the main claim of the paper is a mechanism of caveolar endocytosis being blocked by Dyngo-4a, a true caveolar internalization assays is required to make this claim. This means either the intracellular detection of not surface connected caveolar cargo or the quantification of caveolar movement from TIRF into epifluorescence detection in the fluorescence microscope. Otherwise, the authors could remove the claim and just claim that caveolar mobility is influenced.

      Response: We thank the reviewer for the nice constructive comments, and we very much appreciate the positive critique. We have now included a FRAP experiment of endocytic Cav1-GFP supporting the effect on internalization. In addition, we are currently preforming CTxB HRP experiments to quantify the number of caveolae at PM using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long.

      Reviewer #1 (Significance (Required)):

      A number of small molecule inhibitors for the GTPase dynamics exist, that are commonly used tools in the investigation of endocytosis. This goes as far that the use of some of these inhibitors alone is considered in some publications as sufficient to declare a process to be dynamin-dependent. However, this is not correct, as there are considerable off-target effects, including the inhibition of caveolar internalization by a dynamin-independent mechanism. This is important, as for example the influence of dynamin small molecule inhibitors on chemotherapy resistance is currently investigated (see for example Tremblay et al., Nature Communications, 2020).

      The investigation of the true effect of small molecules discovered as and used as specific inhibitors and their offside effects is extremely important and this reviewer applauds the effort. It is important that inhibitors are not used alone, but other means of targeting a mechanism are exploited as well in functional studies. The audience here thus is besides membrane biophysicists interested in the immediate effect of the small molecule Dyngo-4a also cell biologists and everyone using dynamic inhibitors to investigate cellular function.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      This manuscript uses the small molecule dynamin inhibitors dynasore and dyngo to show that in dynamin triple knockout cells that these inhibitors impact lipid packing and organization in the plasma membrane. Data showing that dyngo affects caveolin dynamics using tirf microscopy is also shown and is interpreted to reflect inhibition of caveolae scission from the membrane.

      This data showing that dyngo and dynasore target membrane order is quite compelling and argues that the effects of these inhibitors is not dynamin specific and that inhibition of endocytosis by these small molecule inhibitors is dynamin-independent. The in vitro and in vivo data they provide is convincing.

      Similarly, the data showing that dynasore and dyngo affect caveolin dynamics and clathrin endocytosis (transferrin) is quite convincing and argues that altered lipid packing is impacting membrane dynamics at the plasma membrane. What is less convincing is the conclusion is that dyngo is preventing caveolae scission from the membrane. Study of caveolae endocytosis is based on a TIRF assay that has inherent limitations: - Caveolae are defined as bright cav1-positive spots in diffraction limited TIRF and their disappearance presumed to be endocytic events. Cav1 spots are presumed to be caveolae but the authors do not consider that they may be flat non-caveolar oligomers. The diffraction limited TIRF approach interprets the large structures as caveolae but evidence to that effect is lacking.

      Response: This is a valid comment and to address this we have now included data showing colocalization of cavin1 and EHD2 to the Cav1-GFP spots. We can however not determine if they are flat or invaginated. We do have extensive experience imaging caveolae using TIRF microscopy and carefully chose cells that display low expression of fluorescently labelled caveolin to avoid non-caveolar structures.

      • The analysis (and the diagram presented in figure 4) considers that caveolae can either diffuse laterally in the membrane or internalize and does not consider that caveolae can flatten and possibly fragment in the membrane. Is it not possible that loss of Cav1 spots is a fragmentation event and not necessarily a scission event?

      Response: This is a good question, yet, fragmentation and disassembly would result in shorter track durations and this is not what is observed in data. We have now also included data showing that cavin1 is persistently associated with the Cav1 spots identified as caveolae during Dyngo-4a treatment indicating that these are caveolae. Furthermore, IF stainings showing colocalization of Cav1GFP with cavin1 or EHD2 after Dyngo-4a treatment have also been added. We have now also expanded on the different interpretations of the data in the results section.

      • The analysis is based on overexpression of Cav1-GFP that may alter the stoichiometry between Cav1 and cavin1 such that while caveolae may be expressed, larger non-caveolar structures may accumulate.

      Response: Yes, this is correct, we have specifically imaged cell expressing low levels of Cav1-GFP to avoid accumulated non-caveolar structures that can be spotted in cells with high expression.

      • Cav1 has been shown to be internalized via the CLIC pathway (Chaudary et al, 2014) and if dyngo is impacting clathrin then maybe it is also impacting CLIC endocytosis and thereby Cav1 endocytosis via this pathway?

      Response: Dyngo-4a has been shown to not affect CLIC endocytosis (McCluskey et al., 2013) and in our data we do not see internalization following Dyngo-4a treatment.

      • The longer Cav1 TIRF track time and shorter displacement with dyngo is consistent with inhibition of caveolae scission. However, as the authors discuss, could not reduced membrane undulations due to dyngo's impact on membrane order be responsible for the longer tracks? Alternatively, perhaps the altered lipid packing is corralling Cav1 movement and reducing non-caveolar Cav1 endocytosis, resulting in shorter tracks of longer duration? The proposed interaction of dyngo with cholesterol could prevent scission but also stabilize large (flat?) Cav1 oligomers in the membrane, perhaps reducing Cav1 oligomer fragmentation.

      Response: We completely agree that membrane undulations contribute to instability of the TIRF-field and therefore disruption of cav1-GFP tracks as we discuss in the results section and have been described in previous work (Larsson et al., 2023). Yet, we have also shown that internalization of caveolae results in shorter tracks (Hubert et al., 2020; Larsson et al., 2023; Mohan et al., 2015). Furthermore, the tracked Cav1-GFP spots are persistently positive for cavin1 both with and without Dyngo-4a treatment showing that the majority do not disassemble become internalized by other pathways. Additionally, the added IF stainings after 30 min Dyngo-4a treatment also show that the Cav1-GFP spots remain positive for cavin1 and EHD2 just as ctrl-treated cells.

      My point here is not to discredit the data but only to suggest that the TIRF approach used is an indirect measure of caveolae scission from the membrane that requires substantiation using other approaches.

      Response: We appreciate these comments and have tried to address these by adding new data and discussions on the interpretation of the tracking data in the results section.

      Dyngo is certainly generally affecting lipid packing via cholesterol and thereby affecting Cav1 dynamics in the plasma membrane. The claim of caveolae scission should be qualified and alternative possibilities considered and discussed. If the authors persist in arguing that dyngo is affecting caveolae scission then the effect should be substantiated by accumulation of caveolae by quantitative EM and high spatial and temporal resolution imaging of Cav1 and cavin1 to define the endocytic events. As the latter represents a new, and potentially very challenging, line of experimentation, I would suggest that it is beyond the scope of the current study. As indicated above the additional experiments are not necessary and qualification of the claims would be sufficient.

      -Response: We have now included a FRAP experiment of endocytic Cav1-GFP supporting the effect on internalization. We are also currently preforming CTxB HRP experiments to quantify the number of caveolae at the PM using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long.

      Other points

      Figure 1C - Cav1 positive spots cannot be interpreted to be caveolae from diffraction limited confocal images. Same comment applies to Fig 4G - caveola? duration.

      -Response: We completely agree with this and that the claims should be qualified. We have added IF stainings showing that the Cav1-GFP structures are also positive for cavin1. We have now clarified that we cannot distinguish between flat or different curved states of caveolae using this methodology. We have also changed the labelling of Fig. 4G.

      Figure 4C - it is not clear why this EM data is not quantified - for both the number of caveolae and clathrin coated pits - as this would help clarify the interpretation of the effect reported.

      -Response: We are currently preforming CTxB HRP experiments to quantify the number of caveolae using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long.

      Figure 4D - the AFM experiments should perhaps be repeated as the non-significant effect of dyngo on the Young's modulus may be a result of insufficient n values. -Response: We would like to clarify that to ensure the robustness of our AFM measurements, we performed the experiments with sufficient biological and technical replicates. Specifically, each data point shown in Figure 4D represents a Young’s modulus value averaged from approximately sixty force-distance curves per cell. For each condition, we collected force-distance maps on eight to nine individual cells, obtained from two separate petri dishes per day. We repeated this process on two independent days. In total, we analysed thirty-one cells for the DMSO control and thirty-three cells for the Dyngo-4a treatment. We performed the “student’s t-test with Welch’s correction” to access the statistical significance between the two conditions, as described in the main text. We believe that the sample size and statistical approach are sufficient to support the conclusions presented. Furthermore, we also analysed cell stiffness by calculating the slope of the linear portion of the force-distance curves. This analysis also did not reveal any statistically significant differences between the conditions (data not shown), further supporting our conclusion that Dyngo-4a treatment does not significantly alter the Young’s modulus under our experimental setup (or conditions).

      Reviewer #2 (Significance (Required)):

      This data showing that dyngo and dynasore target membrane order is quite compelling and argues that the effects of these inhibitors is not dynamin specific and that inhibition of endocytosis by these small molecule inhibitors is dynamin-independent. The in vitro and in vivo data they provide is convincing.

      Similarly, the data showing that dynasore and dyngo affect caveolin dynamics and clathrin endocytosis (transferrin) is quite convincing and argues that altered lipid packing is impacting membrane dynamics at the plasma membrane. What is less convincing is the conclusion is that dyngo is preventing caveolae scission from the membrane.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      Larsson et al present experimental and computational data on the role of Dyngo4a (a compound that was developed to inhibit dynamin) on the dynamics of caveolae. The manuscript mostly documents effects of Dyngo on caveolae, with one experiment to suggest a mechanism for this result. This one rather unconvincing result forms the focus of the manuscript contributing to a disconnect between the data and the presentation. Additionally, there are concerns with data interpretation. The writing could also benefit from revision to address grammar mistakes, strengthen referencing, and increase precision. Overall, the manuscript requires substantial revisions before being considered for publication. The central claim, in particular, needs stronger evidence to support the proposed mechanism. -Response: We thank the reviewer for the thorough review and for experimental suggestions that we believe has strengthened our data further.

      Significant issues (in approximate order of importance): 1. The data supporting the central mechanistic explanation appears limited. There is no evidence that Dyngo remains in one leaflet

      Response:The simulations show that the energy barrier for moving in between bilayers is very high. Furthermore, simulations of C-Laurdan has shown that it does not readily flip in between membrane leaflets (Barucha-Kraszewska et al., 2013) supporting that it reports on the outer lipid leaflet when added to cells. We have however now changed this and state that Dyngo-4a decreased the lipid order in the plasma membrane.

      the GP of the PM is very low compared to previous measurements,

      Response: The absolute GP-values will vary between setups depending on what filters are used so they are not comparable between laboratories. What is of importance is that we found a significant change in the relative GP-values in cells treated with Dyngo-4a and control cells. It is this change that we report. We have not performed any GP-measurements on this cell type earlier so it is unclear what previous measurements reviewer #3 are referring to.

      effects on other membranes are not explored,

      Response: The order of the intracellular membranes is as expected lower than that of the plasma membrane. Differentiating different intracellular membranes of interest like endocytotic vesicles from other intracellular membranes would be very difficult but, more importantly, our study is focused on what is happening in the plasma membrane where caveolae reside and would be of minor interest for plasma membrane dynamics.

      dynamin-directed effects of Dyngo are not considered,

      Response: In the discussion section we discuss the difficulties with disentangling dynamin-direct and indirect effects.

      The QCM-D measurements and claims require explanation as several aspects remains unclear. In Fig S2, the 'softness' (what does this mean?) changes by 4-fold with DMSO alone (what does this mean?), then fractionally more with Dyngo. Then fractionally more again when Dyngo is removed (why?). Then it remains somewhat higher when both Dyngo and DMSO are removed, which is somehow interpreted as Dyngo remaining in the bilayer, but not DMSO.

      Response:We understand the confusion of the reviewer and hope our explanations provide clarity. QCM-D measurements are based on an oscillating quartz crystal sensor. Specifically, alterations in oscillation frequency (ΔF) and the rate of energy dissipation from the sensor surface (ΔD) are what is measured. Allowing the measurement of: 1) materials adsorbing to the sensor surface, 2) changes in the viscoelastic properties of a solution in contact with the sensor surface, 3) changes in the material adsorbed to the sensor surface upone exposure to different solutions. The ratio of ΔD/-ΔF reports the mechanical softness or rigidity of an adsorbed material, in this case the SLB.A “buffer shift” is the term used when there is not an adsorption to the sensor surface, but rather an effect from altering the solution above the sensor surface. One reason is because different solutions can have different densities (e.g., a DMSO-buffer mixture vs buffer alone), which impacts the oscillations of the sensor. It was observed that the DMSO-buffer mixture alone gave a large buffer shift in comparison to the adsorption of the Dyngo-4a into the SLB, thereby muddling the data interpretation. Thus, in Fig. S2 the system was first equilibrated with the DMSO-buffer mixture prior to addition of the Dyngo-4a solution to allow for clearer visualization of the two events. In QCMD to assess if something has made a permeant change to the system you change back to the solutions used before the addition, thus first we washed with a DMSO-Buffer mixture followed by buffer alone. Control experiments were carried out in which no Dyngo-4a was added (also shown in Fig. S2). The control shows the same “buffer shift” from the DMSO-buffer mixture occurs in both systems and that upon returning to a buffer only condition there is no permanent change to the system caused from exposure to the DMSO. In contrast, once the system that received Dyngo-4a is changes back to a buffer only system we see that mass has been added to the system (ΔF) with little change to the dissipation (ΔD), thereby resulting in a lower ratio of ΔD/-ΔF, which is to say that the SLB after the adsorption of Dyngo-4a was more rigid that the SLB without Dyngo-4a.

      These interpretations are difficult to grasp, as the authors seem to be implying simple amphiphilic partitioning into the membrane, which should all be removable by efficient washing.

      Response: Amphiphilic partitioning is not fully reversible by “efficient washing” it depends on partitioning coefficients.

      I do not doubt that this compound interacts with membranes, but the quantifications appear ambiguous. A bilayer with 16 mol% (or worse, 30% if all in one leaflet) Dyngo is very unlikely (to remain a bilayer). Even if such a bilayer was conceivable, the authors are claiming an ADDITION of Dyngo that would INCREASE the area of one leaflet by 30%, which needs explanation as it appears unlikely.

      -Response: We understand that in our attempt provide numbers in the results section for the amount of binding observed in QCM-D, this can easily be interpreted as this is what is observed to insert into the PM. However, as discussed in the discussion, we also see aggregations of Dyngo-4a that associate with the membrane in the simulations which likely could contribute to the binding observed in QCM-D prior to washing. The precise amount of membrane inserted Dyngo-4a is difficult to measure as we discuss in the text. In order to make this clearer, we have now moved all these details to the discussion section where we elaborate on this. Furthermore, since Dyngo-4a, like cholesterol, is intercalating in between the head groups of the lipids the area would not increase in direct proportion to the mol%.

      Also, there are no replicates shown, so unclear how reproducible these effects are?

      Response: For clarity, only single experiments are shown. However, multiple experiments were performed and the range in measured values for 3 technical repeats can be observed in the standard deviations found in the main text (e.g., 6 ± 2 mol%).

      The simulations are insufficiently described and difficult to interpret. How big are these systems? Why do the figures show the aqueous system with lateral boundaries?

      Response: There are no explicit boundaries used in the simulations, periodic boundary conditions are applied in all three dimensions. The lateral boundaries observed in the figures correspond to the simulation box edges and are a visual artifact of 2D projections with QuickSurf representation. No artificial wall or constraints were introduced laterally. Additional technical details, including the system size and periodic boundary conditions have now been added to the methods section.

      It seems quite important that multiple Dyngo molecules aggregate rather than partition into membranes - is this likely to occur in experiment?

      Response: Yes, this is important and with the additional simulation experiments suggested by Reviewer #3 it has been clarified that they contribute a great deal to the change in lipid packing of lipid bilayers containing cholesterol. However, it is hard to test aggregation is the cellular system, but we believe that this happens and contribute to the effect on membranes. We have now emphasized the effect of the aggregates in the text.

      PMF simulations are strongly suggesting that Dyngo does not spontaneously cross membranes, which is inconsistent with its drug-like amphiphilicity (cLogP~2.5 is optimally suited for membrane permeation) and known effects on intracellular proteins. This suggests an artefact in these PMFs.

      Response:As stated in the submitted version of the manuscript, logP was used to validate the topology and the observed value was in a very good agreement with cLogP. Moreover, this validation complemented the standard procedure of CHARMM-GUI ligand modelling, that provided a reasonable penalty score (around 20) for the Dyngo-4a topology. POPC and cholesterol molecules are standard in the force field and validated by numerous studies. The parameters used for the membrane simulations and AWH in particular are very common for this type of studies. Thus, we do not see what may cause any artifacts in the free energy profile construction. In fact, amphiphilicity of the molecule may be one of the key reasons that Dyngo-4a molecule remains at the aqueous interface of the membrane and does not cross the membrane spontaneously. Also, we believe that the energy barrier of 40-60 kJ/mol is not prohibitively high and Dyngo-4a molecules may still overcome the barrier eventually, though we expect majority to reside in the upper leaflet*. *

      The authors should experimentally measure the permeation of Dyngo through bilayers (or lack thereof), to more robustly support their finding that Dyngo does not cross membranes spontaneously.

      -Response: We thank the reviewer for the suggestion, however this if very technically challenging and would require establishment of precise systems which is beyond the scope of this manuscript.

      Why not measure effect of Dyngo on lipid packing directly and more broadly in model membranes?

      -Response: With the added modelling experiments supporting the previous simulations and the calculated GP values from the C-Laurdan experiments on cellular plasma membrane, we do not find it necessary to include more model membranes experiments than the already existing ones on lipid monolayers and supported lipid bilayers.

      Statistics should not be done on individual cells (n>26), but rather on independent experiment (N=3?)

      -Response: We have performed the statistics on live cell particle tracking according to previous literature on similar systems (Boucrot et al., 2011; Larsson et al., 2023; Shvets et al., 2015; Stoeber et al., 2012).

      Fig 1G is important but rather unclear. Firstly, these kymographs are an odd way to show that the caveolae are not moving. More importantly, caveolae in normal cells have been shown to be quite stable and immobile (eg doi: 10.1074/jbc.M117.791400), yet here they are claimed to be very mobile.

      -Response: Although this might be an odd and unconventional way to depict dynamic processes, we believe that this is a very illustrative way to show track stability over time in bulk rather than just a kymograph over a few structures in a cell. Furthermore, we are not claiming that caveolae are very mobile but rather the opposite very stable in agreement with previous work (Boucrot et al., 2011; Larsson et al., 2023; Mohan et al., 2015). We have now edited the text to make this even clearer.

      Also, if Dyngo prevents caveolae scission, there should be more of them at the membrane - why no quantification like Fig 1C to show accumulation of caveolae upon Dyngo treatment? Or directly counting caveolae via EM, as in Fig 4C?

      -Response: We are currently preforming CTxB HRP experiments using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long. However, Dynasore has previously been shown, by EM, to increase the number of caveolae at the PM (Moren et al., 2012; Sinha et al., 2011).

      The writing can be made more precise and referencing could be strengthened. Response: The introduction was written in a short format, and we have now extended this and made it more precise. Some examples: (a) 'scissoned' is not a word in English,

      Response: Thanks, we have now changed this.

      (b) what is meant by "Cav1 assembly is driven by high chol content"? There are many types of caveolin assemblies.

      Response: We agree that this can be made more precise and have now clarified this in the introduction.

      (c) "This generates a unique membrane domain with distinct lipid packing and a very high curvature." Unclear what 'this' refers to and there is no reference here, so what is the evidence for either of these claims? Caveolin-8S oligomers are not curved. Perhaps 'this' is caveolae, but they are relatively large and also not very highly curved and I am unaware of measurements of lipid packing therein.

      Response: caveolae are around 50 nm which in biology is a very high curvature of a membrane. It has been extensively proven that caveolae have a distinct lipid composition highly enriched in cholesterol and sphingolipids, which thereby also will generate a unique lipid packing as compared to the surrounding membrane. Yet, the reviewer is correct that lipid packing has not been measured in a caveola for obvious technical challenges. Thus, we have now changed the text to “special lipid composition”.

      The sentence following that one again makes a specific, but unreferenced, claim. (d) intro claims that lipid packing is critical for fission, but it is unclear quite what is meant by this claim. The references do not help, as they are often about the basic biophysics of lipids, rather than how packing affects fission.

      Response: We have now edited the text.

      (e) intro strongly implies that caveolae remain membrane attached because of stalled scission. How strong is the evidence for this? The fact that EHD2 is at the neck is not definitive,

      Response: We used the term stalled scission to describe that all omega shaped membrane invaginations do not scission in the same automatic way as clathrin coated vesicles. We have now changed this in the text. Caveolae are shown to be released (undergo scission) and be detected as internal caveolae if the protein EHD2 is removed. Hence this must be interpreted as if EHD2 stalls scission. The evidence includes data compiled over the last 12 years from others and us which include for example: 1) Caveolae with EHD2 have a longer duration time (Larsson et al., 2023; Mohan et al., 2015; Moren et al., 2012; Stoeber et al., 2012), Knock down of EHD2 results in more internalized caveolae as measured by CTxB HRP using EM (Moren et al., 2012) and shorter duration time at the PM (Hubert et al., 2020; Larsson et al., 2023; Mohan et al., 2015; Stoeber et al., 2012). 2) EHD2 overexpression results in less internalized caveolae as measured by CTxB HRP using EM (Stoeber et al., 2012). Furthermore, 3) overexpression or acute addition of purified EHD2 via microinjection counteracts lipid induced scission of caveolae and hence result in caveolae stabilization at the PM (Hubert et al., 2020). It is very hard to see that the release and internalization of caveolae could result from anything else than that these have undergone scission. EHD2 has been found around the rim of caveolae (Matthaeus et al., 2022) and overexpression of EHD2 oligomerizing mutants have been shown to expand the caveola neck (Hoernke et al., 2017; Larsson et al., 2023).

      (f) unclear what is meant by 'lipid packing frustration' and how Dyngo supposedly induces it.

      Response: Lipid packing frustration refers to what is usually referred to as lipid packing defect, but since lipid membranes are describe as a fluid system it should not have defects whereby, we believe that lipid packing frustration is more accurate. However, we have now changed the text and use “decreased lipid packing” or “decreased lipid order” more thoroughly to describe the effect on the plasma membrane.

      IF of Cav1 is insufficient to claim puncta as caveolae. Co-stained puncta of caveolin with cavin are much stronger evidence. Same issue for Cav1-GFP puncta.

      Response: We agree and have now provided IF showing cavin1 and EHD2 colocalization to Cav1GFP in non and Dyngo-4a-treated cells.

      Fig 3E claims that "preferred position of Dyngo-4a was closer to the head groups" but the minimum looks to be in similar place as Fig 3B without cholesterol.

      Response:We appreciate the reviewer’s observation. The PMF minima in the POPC and POPC:Chol membranes are indeed close in absolute position (~1.1–1.2 nm from the bilayer center). However, as clarified in the revised text, the presence of cholesterol leads to a slight shift of Dyngo-4a closer to the headgroup region and broadens the positional distribution. This is also evident from the added density profiles (Fig. S3A) and is now described more precisely in the manuscript.

      Critically, these results do not support the notion that Dyngo affects lipid packing sufficiently, which is not measured in the simulations (though could be).

      -Response: We thank the reviewer for the excellent suggestion. In response, we have now included a detailed analysis of Dyngo-4a’s effect on lipid packing in the simulations. As described in the revised manuscript, we measured deuterium order parameters, area per lipid (APL), and lipid–Dyngo–cholesterol spatial distributions (Figs. 3-H, S3C-E). The results demonstrate that Dyngo-4a decreases lipid order in POPC:Chol membranes. Both single molecules and clusters reduce the order parameter by up to 0.04 units, particularly in the upper leaflet, where Dyngo-4a reside.The reduction is most pronounced in the midchain region of the sn1 tail and around the double bond of the sn2 tail. These effects were accompanied by increased APL in POPC:Chol membranes and by colocalization of Dyngo-4a near cholesterol-rich regions. Together, these data confirm that Dyngo-4a perturbs membrane organization and lipid packing in a composition-dependent manner. We believe these additions directly address the concern and demonstrate that the simulations indeed support the conclusion that Dyngo-4a modulates lipid packing.

      Finally, the simulation data do not show "that Dyngo-4a is competing with cholesterol"; it is unclear what 'competition' means in this context, but regardless, the data only shows that Dyngo sits at a similar location as cholesterol.

      We agree with the reviewer that “competition” was an imprecise term. We have rephrased the relevant sections to clarify that Dyngo-4a and cholesterol localize to overlapping regions and exhibit spatial coordination. As now stated in the manuscript, cholesterol appears to partially displace Dyngo-4a from its preferred depth seen in pure POPC, broadens its membrane distribution, and alters lipid packing. According to the order parameters there is an interplay between chol and Dyngo-4a and the heatmaps show that the distribution of chol in the membrane gets less uniform in the presence of Dyngo-4a. These interactions suggest that Dyngo-4a perturbs cholesterol-rich domains.

      As new analysis routines were added to the study, we have now also added the details on those to the Methods section of the text.

      AFM measures the stiffness of the cell (as correctly explained in Results section) not "overall stiffness of the PM" as stated in the Discussion.

      Response: We thank the reviewer for pointing this out, we have now altered this in the discussion section.

      Fig2A: what was the starting lipid surface pressure? How does Dyngo insertion depend on initial lipid packing?

      Response: The starting pressure lipid pressure was 20 mN m-1 which we now have incorporated in the figure legend. We performed several such experiments with a starting pressure ranging from 20-23 mN m-1 showing consistent results which we described in the materials and methods section. Given that we also performed QCMD analysis and simulations on bilayers showing that Dyngo-4a adsorbed and inserted respectively, we have not performed a titration of starting pressures resulting in a MIP of Dygo-4a.

      Fig 4B is a strange approach to measure membrane motion. Why not RMSD or some other displacement based method? As its shown, it implies that the area of the cell changes.

      Response: The method that we used to quantify the area of the cell which is attached (or close to) the glass and thereby is visible in TIRF microscopy. This is area indeed changes over time which has been frequently observed and used to describe and quantify the mobility, lamellipodia and filopodia formation among other things. We agree that RMSD can also be used to analyze the data before and after treatments and we have now included RMSD­­­­ analysis in the manuscript.

      Reviewer #3 (Significance (Required)):

      The title, abstract, and introduction of the manuscript are largely framed around lipid packing, but most of the data investigate other unexpected effects of treating cells with Dyngo4a. The only measurement for lipid packing (or any other membrane properties) is Fig 4E-F. Therefore, this paper is effectively an investigation of an artefact of a common reagent, which itself could be a valuable contribution. However, the mechanism to explain its effect requires stronger evidence, and its broad biological significance needs further exploration.

      Overall, the impact of documenting the effects of Dyngo4a on membranes appears modest but may be valuable to the membrane trafficking community.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Larsson et al present experimental and computational data on the role of Dyngo4a (a compound that was developed to inhibit dynamin) on the dynamics of caveolae. The manuscript mostly documents effects of Dyngo on caveolae, with one experiment to suggest a mechanism for this result. This one rather unconvincing result forms the focus of the manuscript contributing to a disconnect between the data and the presentation. Additionally, there are concerns with data interpretation. The writing could also benefit from revision to address grammar mistakes, strengthen referencing, and increase precision.

      Overall, the manuscript requires substantial revisions before being considered for publication. The central claim, in particular, needs stronger evidence to support the proposed mechanism.

      Significant issues (in approximate order of importance):

      1. The data supporting the central mechanistic explanation appears limited. There is no evidence that Dyngo remains in one leaflet, the GP of the PM is very low compared to previous measurements, effects on other membranes are not explored, dynamin-directed effects of Dyngo are not considered,
      2. The QCM-D measurements and claims require explanation as several aspects remains unclear. In Fig S2, the 'softness' (what does this mean?) changes by 4-fold with DMSO alone (what does this mean?), then fractionally more with Dyngo. Then fractionally more again when Dyngo is removed (why?). Then it remains somewhat higher when both Dyngo and DMSO are removed, which is somehow interpreted as Dyngo remaining in the bilayer, but not DMSO. These interpretations are difficult to grasp, as the authors seem to be implying simple amphiphilic partitioning into the membrane, which should all be removable by efficient washing. I do not doubt that this compound interacts with membranes, but the quantifications appear ambiguous. A bilayer with 16 mol% (or worse, 30% if all in one leaflet) Dyngo is very unlikely (to remain a bilayer). Even if such a bilayer was conceivable, the authors are claiming an ADDITION of Dyngo that would INCREASE the area of one leaflet by 30%, which needs explanation as it appears unlikely. Also, there are no replicates shown, so unclear how reproducible these effects are?
      3. The simulations are insufficiently described and difficult to interpret. How big are these systems? Why do the figures show the aqueous system with lateral boundaries? It seems quite important that multiple Dyngo molecules aggregate rather than partition into membranes - is this likely to occur in experiment? PMF simulations are strongly suggesting that Dyngo does not spontaneously cross membranes, which is inconsistent with its drug-like amphiphilicity (cLogP~2.5 is optimally suited for membrane permeation) and known effects on intracellular proteins. This suggests an artefact in these PMFs. The authors should experimentally measure the permeation of Dyngo through bilayers (or lack thereof), to more robustly support their finding that Dyngo does not cross membranes spontaneously.
      4. Why not measure effect of Dyngo on lipid packing directly and more broadly in model membranes?
      5. Statistics should not be done on individual cells (n>26), but rather on independent experiment (N=3?)
      6. Fig 1G is important but rather unclear. Firstly, these kymographs are an odd way to show that the caveolae are not moving. More importantly, caveolae in normal cells have been shown to be quite stable and immobile (eg doi: 10.1074/jbc.M117.791400), yet here they are claimed to be very mobile. Also, if Dyngo prevents caveolae scission, there should be more of them at the membrane - why no quantification like Fig 1C to show accumulation of caveolae upon Dyngo treatment? Or directly counting caveolae via EM, as in Fig 4C?
      7. The writing can be made more precise and referencing could be strengthened. Some examples: (a) 'scissoned' is not a word in English, (b) what is meant by "Cav1 assembly is driven by high chol content"? There are many types of caveolin assemblies. (c) "This generates a unique membrane domain with distinct lipid packing and a very high curvature." Unclear what 'this' refers to and there is no reference here, so what is the evidence for either of these claims? Caveolin-8S oligomers are not curved. Perhaps 'this' is caveolae, but they are relatively large and also not very highly curved and I am unaware of measurements of lipid packing therein. The sentence following that one again makes a specific, but unreferenced, claim. (d) intro claims that lipid packing is critical for fission, but it is unclear quite what is meant by this claim. The references do not help, as they are often about the basic biophysics of lipids, rather than how packing affects fission. (e) intro strongly implies that caveolae remain membrane attached because of stalled scission. How strong is the evidence for this? The fact that EHD2 is at the neck is not definitive, (f) unclear what is meant by 'lipid packing frustration' and how Dyngo supposedly induces it.
      8. IF of Cav1 is insufficient to claim puncta as caveolae. Co-stained puncta of caveolin with cavin are much stronger evidence. Same issue for Cav1-GFP puncta.
      9. Fig 3E claims that "preferred position of Dyngo-4a was closer to the head groups" but the minimum looks to be in similar place as Fig 3B without cholesterol. Critically, these results do not support the notion that Dyngo affects lipid packing sufficiently, which is not measured in the simulations (though could be). Finally, the simulation data do not show "that Dyngo-4a is competing with cholesterol"; it is unclear what 'competition' means in this context, but regardless, the data only shows that Dyngo sits at a similar location as cholesterol.
      10. AFM measures the stiffness of the cell (as correctly explained in Results section) not "overall stiffness of the PM" as stated in the Discussion.
      11. Fig2A: what was the starting lipid surface pressure? How does Dyngo insertion depend on initial lipid packing?
      12. Fig 4B is a strange approach to measure membrane motion. Why not RMSD or some other displacement based method? As its shown, it implies that the area of the cell changes.

      Significance

      The title, abstract, and introduction of the manuscript are largely framed around lipid packing, but most of the data investigate other unexpected effects of treating cells with Dyngo4a. The only measurement for lipid packing (or any other membrane properties) is Fig 4E-F. Therefore, this paper is effectively an investigation of an artefact of a common reagent, which itself could be a valuable contribution. However, the mechanism to explain its effect requires stronger evidence, and its broad biological significance needs further exploration.

      Overall, the impact of documenting the effects of Dyngo4a on membranes appears modest but may be valuable to the membrane trafficking community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript uses the small molecule dynamin inhibitors dynasore and dyngo to show that in dynamin triple knockout cells that these inhibitors impact lipid packing and organization in the plasma membrane. Data showing that dyngo affects caveolin dynamics using tirf microscopy is also shown and is interpreted to reflect inhibition of caveolae scission from the membrane.

      This data showing that dyngo and dynasore target membrane order is quite compelling and argues that the effects of these inhibitors is not dynamin specific and that inhibition of endocytosis by these small molecule inhibitors is dynamin-independent. The in vitro and in vivo data they provide is convincing.

      Similarly, the data showing that dynasore and dyngo affect caveolin dynamics and clathrin endocytosis (transferrin) is quite convincing and argues that altered lipid packing is impacting membrane dynamics at the plasma membrane. What is less convincing is the conclusion is that dyngo is preventing caveolae scission from the membrane. Study of caveolae endocytosis is based on a TIRF assay that has inherent limitations:

      • Caveolae are defined as bright cav1-positive spots in diffraction limited TIRF and their disappearance presumed to be endocytic events. Cav1 spots are presumed to be caveolae but the authors do not consider that they may be flat non-caveolar oligomers. The diffraction limited TIRF approach interprets the large structures as caveolae but evidence to that effect is lacking.
      • The analysis (and the diagram presented in figure 4) considers that caveolae can either diffuse laterally in the membrane or internalize and does not consider that caveolae can flatten and possibly fragment in the membrane. Is it not possible that loss of Cav1 spots is a fragmentation event and not necessarily a scission event?
      • The analysis is based on overexpression of Cav1-GFP that may alter the stoichiometry between Cav1 and cavin1 such that while caveolae may be expressed, larger non-caveolar structures may accumulate.
      • Cav1 has been shown to be internalized via the CLIC pathway (Chaudary et al, 2014) and if dyngo is impacting clathrin then maybe it is also impacting CLIC endocytosis and thereby Cav1 endocytosis via this pathway?
      • The longer Cav1 TIRF track time and shorter displacement with dyngo is consistent with inhibition of caveolae scission. However, as the authors discuss, could not reduced membrane undulations due to dyngo's impact on membrane order be responsible for the longer tracks? Alternatively, perhaps the altered lipid packing is corralling Cav1 movement and reducing non-caveolar Cav1 endocytosis, resulting in shorter tracks of longer duration? The proposed interaction of dyngo with cholesterol could prevent scission but also stabilize large (flat?) Cav1 oligomers in the membrane, perhaps reducing Cav1 oligomer fragmentation.

      My point here is not to discredit the data but only to suggest that the TIRF approach used is an indirect measure of caveolae scission from the membrane that requires substantiation using other approaches.

      Dyngo is certainly generally affecting lipid packing via cholesterol and thereby affecting Cav1 dynamics in the plasma membrane. The claim of caveolae scission should be qualified and alternative possibilities considered and discussed. If the authors persist in arguing that dyngo is affecting caveolae scission then the effect should be substantiated by accumulation of caveolae by quantitative EM and high spatial and temporal resolution imaging of Cav1 and cavin1 to define the endocytic events. As the latter represents a new, and potentially very challenging, line of experimentation, I would suggest that it is beyond the scope of the current study. As indicated above the additional experiments are not necessary and qualification of the claims would be sufficient.

      Other points

      Figure 1C - Cav1 positive spots cannot be interpreted to be caveolae from diffraction limited confocal images. Same comment applies to Fig 4G - caveola? duration.

      Figure 4C - it is not clear why this EM data is not quantified - for both the number of caveolae and clathrin coated pits - as this would help clarify the interpretation of the effect reported.

      Figure 4D - the AFM experiments should perhaps be repeated as the non-significant effect of dyngo on the Young's modulus may be a result of insufficient n values.

      Significance

      This data showing that dyngo and dynasore target membrane order is quite compelling and argues that the effects of these inhibitors is not dynamin specific and that inhibition of endocytosis by these small molecule inhibitors is dynamin-independent. The in vitro and in vivo data they provide is convincing.

      Similarly, the data showing that dynasore and dyngo affect caveolin dynamics and clathrin endocytosis (transferrin) is quite convincing and argues that altered lipid packing is impacting membrane dynamics at the plasma membrane.

      What is less convincing is the conclusion is that dyngo is preventing caveolae scission from the membrane.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors use Dyngo-4a, a known Dynami inhibitor to test its influence on caveolar assembly and surface mobility. They investigate, whether it incorporates into membranes with Quartz-Crystal Microbalance, they investigate how it is organized in membranes using simulations. Finally, they use lipid-packing sensitive dyes to investigate lipid packing in the presence of Dyngo-4a, membrane stiffness using AFM and membrane undulation using fluorescence microscopy. They also use a measure they call "caveola duration time" to claim that something happens to caveolae after Dyngo-4a addition and using this parameter, they do indeed see an increase in it in response to Dyngo-4a, which is reduced back to the baseline after addition of cholesterol.

      Overall, the authors claim: 1) Dyngo-4a inserts into the membrane and this 2) results in "a dramatic dynamin-independent inhibition of caveola scission". 3) Dyngo-4a was inserted and positioned at the level of cholesterol in the bilayer and 4) Dyngo-4a-treatment resulted in decreased lipid packing in the outer leaflet of the plasma membrane 5) but Dyngo-4a did not affect caveola morphology, caveolae- associated proteins, or the overall membrane stiffness 6) acute addition of cholesterol counteracts the block in caveola scission caused by Dyngo-4a

      Overall, in this reviewers opinion, claims 1, 3, 4, 5 are well-supported by the presented data from electron and live cell microscopy, QCM-D and AFM. However, there is no convincing assay for caveolar endocytosis presented besides the "caveola duration" which although unclearly described seems to be the time it takes in imaging until a caveolae is not picked up by the tracking software anymore in TIRF microscopy. Since the main claim of the paper is a mechanism of caveolar endocytosis being blocked by Dyngo-4a, a true caveolar internalization assays is required to make this claim. This means either the intracellular detection of not surface connected caveolar cargo or the quantification of caveolar movement from TIRF into epifluorescence detection in the fluorescence microscope. Otherwise, the authors could remove the claim and just claim that caveolar mobility is influenced.

      Significance

      A number of small molecule inhibitors for the GTPase dynamics exist, that are commonly used tools in the investigation of endocytosis. This goes as far that the use of some of these inhibitors alone is considered in some publications as sufficient to declare a process to be dynamin-dependent. However, this is not correct, as there are considerable off-target effects, including the inhibition of caveolar internalization by a dynamin-independent mechanism. This is important, as for example the influence of dynamin small molecule inhibitors on chemotherapy resistance is currently investigated (see for example Tremblay et al., Nature Communications, 2020).

      The investigation of the true effect of small molecules discovered as and used as specific inhibitors and their offside effects is extremely important and this reviewer applauds the effort. It is important that inhibitors are not used alone, but other means of targeting a mechanism are exploited as well in functional studies. The audience here thus is besides membrane biophysicists interested in the immediate effect of the small molecule Dyngo-4a also cell biologists and everyone using dynamic inhibitors to investigate cellular function.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, Reproducibility, and Clarity)

      Reviewer comment: This is a very well conceived study of responses to plasma membrane stresses in yeast that signal through the conserved TORC2 complex. Physical stress through small molecular intercalators in the plasma membrane is shown to be independent of their biochemistry and then studies for its effect on plasma membrane morphology and the distribution of free ergosterol (the yeast equivalent of cholesterol), with free being the pool of cholesterol that is available to probes and/or sterol transfer proteins. Experiments nicely demonstrate a negative feedback loop consisting of: stress -> increased free sterol and TORC2 inhibition -> activation of LAM proteins (as demonstrated by Relents and co-workers previously) -> removal of free sterol -> return to unstressed state of PM and TORC2.

      Author response: We thank the reviewer for their positive and encouraging feedback. We are pleased to submit our revised manuscript and have addressed all points raised below.

      Comment: Fig 2A: Is detection of PIP/PIP2/PS linear for target, or possibly just showing availability that is increased due to local positive curvature?

      Response: This is an excellent and fundamental question. While FLARE signal likely reflects lipid availability, its detection is indeed influenced by factors such as membrane curvature and lipid composition, due to varying insertion depths of the lipid-binding domains. For example, studies using NMR suggest that the PLCδ PH domain partially inserts into membranes, potentially conferring curvature sensitivity (Flesch et al., 2005; Uekama et al., 2009). Similarly, curvature influences lactadherin binding, though it's unclear if this extends to its isolated C2 domain (Otzen et al., 2012; Shao et al., 2008; Shi et al., 2004). We could not find direct evidence for curvature sensitivity of P4C(SidC), but assume some influence exists.

      To avoid overinterpreting these limitations, we now describe our data based solely on the FLAREs used, rather than inferring enrichment of specific lipid species. We refer to these PM structures as "PI(4,5)P₂-containing", consistent with prior literature (Riggi et al., 2018) and have revised our manuscript accordingly.

      Comment: Can any marker be identified for the D4H spots at 2 minutes? In particular, are they early endosomes (shown by brief pre-incubation with FM4-64)?

      Response: We appreciate the reviewer's suggestion and have now added new data (Fig. S2E-H). We tested colocalization of D4H spots with FM4-64 (early endosomes), GFP-VPS21 (early endosome marker), and LipidSpot{trade mark, serif} 488 (lipid droplets), but found no overlap. This later observation was not unexpected given that D4H does not recognize Sterol esters. D4H foci also did not overlap with ER (dsRED-HDEL), though they were frequently adjacent to it. While their exact identity remains unknown, we agree this is an intriguing direction for future investigation.

      Comment: Is there any functional (& direct) link between Arp inhibition (as in the Pombe study of LAMs by the lab of Sophie Martin) and PM disturbance by amphipathic molecules?

      Response: We have explored this connection and now present new data (see final paragraph of Results). Briefly, we show that CK-666 induces internalization of PM sterols in a Lam2/4-dependent manner, and that TORC2 activity is more strongly reduced in lam2Δ lam4Δ cells compared to WT. These findings support the idea that, like PalmC, Arp2/3 inhibition triggers a PM stress that is counteracted by sterol internalization.

      Minor Comment: Fig 2A: Labels not clear. Say for each part what FP is used for pip2.

      Response: As noted above, we revised image labels to clarify which FLAREs were used, and refer to data accordingly throughout.

      Minor Comment: Move fig s2d to main ms. The 1 min and 2 min data are integral to the story.

      Response: We agree and have incorporated the 1-min and 2-min data into the main figures. Vehicle-treated controls were moved to Fig. S2.

      Minor Comment: The role of Lam2 and Lam4 in retrograde sterol transport has in vivo only been linked to one of their two StART domains not both, as mentioned in the text.

      Response: Thank you for pointing this out. We have corrected the text to:

      "[...]Lam2 and Lam4[...] contain two START domains, of which at least one has been demonstrated to facilitate sterol transport between membranes (Gatta et al., 2015; Jentsch et al., 2018; Tong et al., 2018)."

      Minor Comment: Throughout, images of tagged D4H should be labelled as such, not as "Ergosterol".

      Response: We have updated all relevant figure labels and text to refer to "D4H" rather than "Ergosterol", in line with this recommendation.

      Reviewer #1 (Significance):

      These results in budding yeast are likely to be directly applicable to a wide range of eukaryotic cells, if not all of them. I expect this paper to be a significant guide of research in this area. The paper specifically points out that the current experiments do not distinguish the precise causation among the two outcomes of stress: increased free sterol and TORC2 inhibition. Of these two outcomes which causes which is not yet known. If data were added that shed light on this causation that would make this work much more signifiant, but I can understand 100% that this extra step lies beyond - for a later study for which the current one forms the bedrock.

      Response:

      We thank the reviewer for their generous assessment. We agree that understanding the causality between increased free sterol and TORC2 inhibition is a critical next step.

      Based on our current data, we believe the increase in free ergosterol precedes TORC2 inhibition. For example, TORC2 inhibition alone (e.g., via pharmacological means) does not initially increase free sterol, while it does enhance Lam2/4 activity, promoting sterol internalization (Fig. 3A). Baseline TORC2 activity also inversely correlates with free PM sterol levels in lam2Δ lam4Δ versus LAM2T518A LAM4S401A cells (Figs. 2D, S2C).

      Additionally, during sterol depletion, we observe an initial increase in TORC2 activity before growth inhibition occurs, after which activity declines-likely due to compromised PM integrity (Fig. S2M). We now also show that adaptation to several other stresses (e.g., osmotic shock, heat shock, CK-666) partially depends on sterol internalization, which correlates with TORC2 activation (Fig. 4, S4B).

      While these findings strengthen the model that PM stress perturbs sterol availability and secondarily impacts TORC2, we cannot yet definitively demonstrate causality. As suggested by Reviewer 3, we tested cholesterol-producing yeast (Souza et al., 2011), but found their response to PalmC indistinguishable from WT, making it difficult to draw mechanistic conclusions (Rebuttal Fig. 2).

      Taken together, we favour a model where sterols affect PM properties sensed by TORC2, probably lipid-packing, rather than acting as direct effectors. We hope our revised manuscript more clearly conveys this model and serves as a strong foundation for future mechanistic studies.

      Reviewer #2 (Evidence, Reproducibility, and Clarity)

      Reviewer comment: This manuscript describes multiple effects of positively-charged membrane-intercalating amphipaths (palmitoylcarnitine, PalmC, in particular) on TORC2 in yeast plasma membranes. It is a "next step" in the Loewith laboratory's characterization of the effect of this agent on this system. The study confirms the findings of Riggi et al.(2018) that PalmC inhibits TORC2 and drives the formation of membrane invaginations that contain phosphatidylinositol-bis-phosphate (PIP2) and other anionic phospholipids. It also demonstrates that PalmC intercalates into the membrane, acts directly (rather than through secondary metabolism) and is representative of a class of cationic amphipaths. The interesting finding here is that PalmC causes a rapid initial increase in the plasma membrane ergosterol accessible to the DH4 sterol probe followed by a decrease caused by its transfer to the cytoplasm through its transporter, LAM2/4. TORC2 is implicated in these processes. Loewith et al. have pioneered in this area and this study clearly shows their expertise. Several of the findings reported here are novel. However, I am concerned that PalmC may not be revealing the physiology of the system but rather adding tangential complexity. (This concern applies to the precursor studies using PalmC to probe the TORC2 system.) In particular, I am not confident that the data justify the authors' conclusions "...that TORC2 acts in a feedback loop to control active sterol levels at the PM and [the results] introduce sterols as possible TORC2 signalling modulators."

      Author response:

      We thank Reviewer #2 for the constructive and critical evaluation of our work. We appreciate the acknowledgment of the novelty and technical strength of several of our findings, and we understand the concern that PalmC could be eliciting non-physiological effects. Our study was designed precisely to use PalmC and similar membrane-active amphipaths as tools to strongly perturb the plasma membrane (PM) in a controlled and tractable way. We now state this intention explicitly in both the Introduction and Discussion sections. To address concerns about the specificity and physiological relevance of PalmC, we have expanded our dataset to include additional PM stressors (hyperosmotic shock, Arp2/3 inhibition, and heat shock), all of which reproduce key features observed with PalmC-namely, TORC2 inhibition, PM invaginations, and retrograde sterol transport (Fig. 4, S4).

      We hope this more comprehensive dataset, along with revised discussion and clarified claims, addresses the reviewer's concerns regarding physiological interpretation and artifact.

      Major issues 1 and 2: 1. The invaginations induced by PalmC may not be physiologic but simply the result of the well-known "bilayer couple" bending of the bilayer due to the accumulation of cationic amphipaths in the inner leaflet of the plasma membrane bilayer which is rich in anionic phospholipids. Such unphysiological effects make the observed correlation of invagination with TORC2 inhibition etc. hard to interpret.

      Electrostatic/hydrophobic association of PIP2 with PalmC could sequester the anionic phospholipid(s). Such associations could also drive the accumulation of PIP2 in the invaginations. This could explain PalmC inhibition of TORC2 through a simple physical rather than biological process. So, it is difficult to draw any physiological conclusion about PIP2 from these experiments.

      Response to major issues 1 and 2:

      We agree that amphipath-induced bilayer stress, including via the bilayer-couple mechanism, may contribute to PM curvature changes. However, the reviewer's assumption that PalmC inserts preferentially into the inner leaflet appears inconsistent with both literature and our observations. PalmC is zwitterionic, not cationic, and is unlikely to electrostatically sequester anionic lipids such as PIP2. For clarification, we included a short summary of our proposed mechanism of PalmC in the context of the current literature in our Discussion:

      "[...] study it was also demonstrated that addition of phospholipids to the outer PM leaflet causes an excess of free sterol at the inner PM leaflet, and its subsequent retrograde transport to lipid droplets (Doktorova et al., 2025). Although we cannot exclude that it is the substrate of a flippase or scramblase, PalmC is not a metabolite found in yeast, nor, given its charged headgroup, is it likely to spontaneously flip to the inner leaflet (Goñi, Requero and Alonso, 1996). Thus, we propose that PalmC accumulates in the outer leaflet, disrupts the lipid balance with the inner leaflet which is, similarly to the mammalian cell model (Doktorova et al., 2025), rectified by sterol mobilization, flipping and internalization (Fig. 5B)."

      While we agree that PM invaginations per se are not the central focus of this study, they are indeed a reproducible and biologically intriguing phenomenon. We emphasize that similar invaginations occur not only during PalmC treatment but also in response to other physiological stresses, such as hyperosmotic shock and Arp2/3 inhibition (Fig. 4), and have been reported independently by others (Phan et al., 2025). Furthermore, related structures have been documented in yeast mutants with altered PIP2 metabolism or TORC2 hyperactivity (Rodríguez-Escudero et al., 2018; Sakata et al., 2022; Stefan et al., 2002), and even in mammalian neurons with SJ1 phosphatase mutations (Stefan et al., 2002). These observations support our interpretation that the observed invaginations represent an exaggerated manifestation of a physiologically relevant stress-adaptive process. In our previous study we indeed proposed that PI(4,5)P2 enrichment in PM invaginations was important for PalmC-induced TORC2 inactivation, using the heat sensitive PI(4,5)P2 kinase allele mss4ts - a rather blunt tool (Riggi et al., 2018). We have now come to the conclusion that different mechanisms other than, or in addition to, PIP2 changes drive TORC2 inhibition in our system. In this study, we use the 2xPH(PLC) FLARE exclusively as a generic PM marker, not as a readout of PIP2 biology. Rather, we propose that sterol redistribution and/or the biophysical impact that this has on the PM are central drivers, with TORC2 acting as a signaling node that senses and adjusts PM composition accordingly.

      We now clarify these arguments in the revised Discussion and have reframed our use of PalmC as a probe to explore the capacity of the PM to adapt to acute stress via dynamic lipid rearrangements.

      Major issue 3:

      As the authors point out, a large number of intercalated amphipaths displace sterols from their association with bilayer phospholipids. This unphysiologic mechanism can explain how PalmC causes the transient increase in the availability of plasma membrane ergosterol to the D4H probe and its subsequent removal from the plasma membrane via LAM2/4. TORC2 regulation may not be involved. In fact, the authors say that "TORC2 inhibition, and thereby Lam2/4 activation, cannot be the only trigger for PalmC induced sterol removal." Furthermore, the subsequent recovery of plasma membrane ergosterol could simply reflect homeostatic responses independent of the components studied here.

      Response:

      We agree that increased free sterols in the inner leaflet likely initiate retrograde transport. Our results suggest that TORC2 inhibition facilitates this process by disinhibiting Lam2/4, allowing more efficient clearance of ergosterol from the PM (Fig. 3A, S2C). However, the process is not exclusively dependent on TORC2, and we state this explicitly.

      We do not observe recovery of PM ergosterol on the timescales measured, while TORC2 activity recovers, suggesting that restoration likely occurs later via biosynthetic or anterograde trafficking pathways, which are outside the scope of this study. These points are clarified in the revised Discussion.

      Major issue 3a:

      The data suggest that LAM2/4 mediates the return of cytoplasmic ergosterol to the plasma membrane. To my knowledge, this is a nice finding that not been reported previously and is worth confirming more directly.

      Response:

      We thank the reviewer for this observation but would like to clarify a misunderstanding: our data do not suggest that Lam2/4 mediates anterograde sterol transport. Our results and prior work (Gatta et al., 2015; Roelants et al., 2018) show that Lam2/4 mediate retrograde transport from the PM to the ER, and TORC2 inhibits this process. We now clarify this point in the revised manuscript, stating:

      "In vivo, Lam2/4 seem to predominantly transport sterols from the PM to the ER, following the concentration gradient (Gatta et al., 2015; Jentsch et al., 2018; Tong et al., 2018)."

      Major issue 4:

      I agree with the authors that "It is unclear if the excess of free sterols itself is part of the inhibitory signal to TORC2..." Instead, the inhibition of TORC2 by PalmC may simply result from its artifactual aggregation of the anionic phospholipids (especially, PIP2) needed for TORC2 activity. This would not be biologically meaningful. If the authors wish to show that accessible ergosterol inhibits TORC2 activity or vice versa, they should use more direct methods. For example, neutral amphipaths that do not cause the aforementioned PalmC perturbations should still increase plasma membrane ergosterol and send it through LAM2/4 to the ER.

      Response:

      We now provide evidence that three orthologous treatments (hyperosmotic shock, heat shock and Arp2/3 inhibition) similarly cause sterol mobilization and, in the absence of sterol clearance from the PM, prolonged TORC2 inhibition. These results do not support the reviewer's contention that the inhibition of TORC2 by PalmC is simply resulting from its artifactual aggregation of the anionic phospholipids. Furthermore, PalmC is zwitterionic, and its interaction with anionic lipids should be somewhat limited.

      In our experimental setup, neutral amphipaths did not trigger TORC2 inhibition or D4H redistribution While this differs from prior in vitro work (Lange et al., 2009), we attribute this in part to a discrepancy to experimental setup differences, including flow chamber artifacts that we discuss in the methods section.

      Importantly, only amphipaths with a charged headgroup, including zwitterionic (PalmC) and positively charged analogs, produced robust effects. A negatively charged derivative also seemed to have a minor effect on TORC2 activity and PM sterol internalization (Palmitoylglycine (Fig. 1D, Rebuttal Fig. 1). This suggests that in vivo, charge-based membrane perturbation is required to alter PM sterol distribution and TORC2 activity.

      Major issue 5.:

      The mechanistic relationship between TORC2 activity and ergosterol suggested in the title, abstract, and discussion is not secure. I agree with the concluding section of the manuscript called "Limitations of the study". It highlights the need for a better approach to the interplay between TORC2 and ergosterol.

      Response:

      This may have been true of the previous submission, but we now demonstrate that provoking PM stress in four orthogonal ways triggers mobilization of sterols, which left uncleared, prevents normal (re)activation of TORC2 activity. We thus conclude that free sterols, directly or more likely indirectly, inhibit TORC2. The role that TORC2 plays in sterol retrotranslocation has been demonstrated previously (Roelants et al., 2018). We believe our expanded data and clarified framework make a compelling case for a stress-adaptive role of sterol retrograde transport that is supervised and modulated-but not fully driven-by TORC2 activity.

      Thus, we feel in the present version of this manuscript that the title is now justified.

      Minor issue: Based on earlier work using the reporter fliptR, the authors claim that PalmC reduces membrane tension. They should consider that this intercalated dye senses many variables including membrane tension but also lipid packing. I suspect that, by intercalating into and thereby altering the bilayer, PalmC is affecting the latter rather than the former.

      Response:

      We thank the reviewer for this important point regarding the multifactorial sensitivity of intercalating dyes such as Flipper-TR®, including to membrane tension and lipid packing.

      We respectfully note, however, that our current study does not include any new data generated using Flipper-TR®. We referred to earlier work (Riggi et al., 2018) for context, where Flipper-TR® was used as a membrane tension reporter.

      We fully agree that the response of such "smart" membrane probes integrates multiple biophysical parameters-including tension, packing, and hydration-which are themselves interrelated as consequences of membrane composition (Colom et al., 2018; Ragaller et al., 2024; Torra et al., 2024). Indeed, this interconnectedness is central to our interpretation of PalmC's pleiotropic effects on the plasma membrane (PM). In our previous study, we observed that PalmC treatment not only reduced apparent PM tension (as measured by Flipper-TR®) but also increased membrane order ((Riggi et al., 2018); see laurdan GP, Fig. 6C), and here we show that it promotes the redistribution of free sterol away from the PM.

      Furthermore, PalmC's effect on membrane tension was supported by orthogonal in vitro data: its addition to giant unilamellar vesicles (GUVs) led to a measurable increase in membrane surface area and decreased tension, as shown by pipette aspiration ((Riggi et al., 2018), Fig. 3F). This provides complementary evidence that the membrane tension reduction is not merely an artifact of Flipper-TR® reporting.

      That said, we agree with the reviewer that in the case of TORC2 inhibition or hyperactivation, the observed changes in PM tension are based solely on Flipper-TR® data, without additional orthogonal validation. To address this concern, we have revised the relevant text in the manuscript to more cautiously reflect this complexity. The revised sentence now reads:

      "Consistent with this role, data generated with the lipid packing reporter dye Flipper-TR® suggest that acute chemical inhibition of TORC2 increases PM tension, while Ypk1 hyperactivation decreases it."

      This revised phrasing acknowledges both the utility and the limitations of Flipper-TR® as a probe of membrane biophysics.

      Reviewer #2 Significance:

      This is an interesting topic. However, use of the exogenous probe, palmitoylcarnitine, could be causing multiple changes that complicate the interpretation of the data.

      Reviewers #1 and #3 were much more impressed by this study than I was. I am not a yeast expert and so I may have missed or confused something. I would therefore welcome their expert feedback regarding my comments (#2). Ted Steck

      Response:

      Thank you for your constructive feedback.

      We believe that the manuscript is now much improved, and we hope to have convinced you that the mechanisms that we've elucidated using PalmC represent a general adaptation response to physiological PM stressors.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Reviewer comment: The authors describe the effects of surfactant-like molecules on the plasma membrane (PM) and its associated TORC2 complex. Addition of the surfactants with a positively-charged headgroup and a hydro-carbon tail of at least 16 caused the rapid clustering of PI-4,5P2 together with PI-4P and phosphatidylserine in large membrane invaginations. The authors convincingly demonstrate that this effect of the surfactants on the PM is likely caused by a direct disturbance of the PM organization and/or lipid composition. Interestingly, upon PalmC treatment, free ergosterol of the PM was found to first concentrate in the clusters, but within The kinetics of the changes in free ergosterol levels and the changes in TORC2 activity do not match. Ergosterol is rapidly depleted after PalmC treatment (The Lam2/4 data support the idea that ergosterol transport plays a role in the TORC2 recovery, but what role this is, is not clear to me. I think the data fit better with a model in which PalmC causes low tension of the PM which in turn disrupts normal lipid organization and thus causes TORC2 to shut down, maybe not by changes in free ergosterol but by changes, for instance, in lipid raft formation (which is in part effected by ergosterol levels). The transport of ergosterol is only one mechanism that is involved in restoring PM tension and TORC2 activity. However, sensing free ergosterol alone is most likely not the mechanism explaining how TORC2 senses PM tension.

      Therefore, I recommend that the model is revised (or supported by more data), reflecting the fact that free ergosterol levels do not directly correlate with the TORC2 activity, but instead might be only one of the PM parameters that regulate TORC2.

      Author response:

      We thank the reviewer for their thoughtful assessment and constructive suggestions. As described in more detail above, we have included in our revised version of this manuscript a variety of new data, including the sterol-internalization dependent adaptation of the PM and regulation of TORC2 during additional stresses. We think that these data vastly improve on our previous manuscript version. We have addressed each point risen by the reviewer below and revised the manuscript accordingly, including a rewritten discussion and updated model to better reflect the limitations of our current understanding of how TORC2 senses changes in the plasma membrane (PM). It is true that the appearance of PM invaginations tracks well with TORC2 inhibition, but it is not clear to us if they are upstream of this inhibition or merely another symptom of the preceding PM perturbation (PalmC-induced free sterol increase can be observed after 10s (Fig. S2A), but PM invaginations become visible only after ~1 min - meanwhile we can observe near complete TORC2 inhibition after 30s). In this study, we are mostly interested in the role of PM sterol redistribution in stress response. Indeed we think that the role of free sterol clearance during stresses is to adapt the PM to these stresses - thus restoring PM parameters which in turn reactivates TORC2. This can be seen for hyperosmotic stress and the newly introduced PM stressors, Arp2/3 inhibition and heat shock response (Fig. 4). We have therefore softened our model and updated discussion and final figure (Fig. 5) to reflect that TORC2 likely responds to broader changes in PM organization or tension, with sterol redistribution representing one of several contributing factors rather than the sole signal.

      Comment: - If TORC2 is indeed inhibited by free ergosterol, the addition of ergosterol to the growth medium should be able to trigger similar effects as PalmC. If this detection of free ergosterol is very specific (e.g. if TORC2 has a binding pocket for ergosterol) we would expect that addition of other sterols such a cholesterol or ergosterol precursors should not inhibit TORC2.

      Response:

      We appreciate this suggestion and agree that testing whether exogenous ergosterol can mimic PalmC effects would help assess specificity. However, yeast do not readily take up sterols under aerobic conditions, which renders artificial sterol enrichment at the yeast PM rather difficult. We have now included additional data characterizing our Lam2/4 mutants (see below), and pharmacological sterol synthesis inhibition, showing that a depletion of free sterols from the PM correlates with lower TORC2 activity (Fig. 2D, S2C). Additionally, as suggested, we tried to probe if ergosterol directly interacts with TORC2 through a specific binding pocket, by treating a yeast strain expressing cholesterol rather than ergosterol (Souza et al., 2011) with PalmC. However, the response of TORC2 activity in these cells was very similar to that of WT cells (Rebuttal Fig. 2). In conclusion, we agree that at present we do not know mechanistically how sterols affect TORC2 activity, although it does indeed seem more likely to be through an indirect mechanism linked to changes in PM parameters. The nature of such a mechanism will be subject to further studies. We hope that the introduced changes to the manuscript adequately reflect these considerations.

      Rebuttal Fig. 2: WT yeast cells which produce ergosterol as main sterol, and mutant cells which produce cholesterol instead were treated with 5 µM PalmC, and TORC2 activity was assessed by relative phosphorylation of Ypk1 on WB. One representative experiment out of two replicates.

      Comment: - The experiment in Figure 1C is not controlled for differences in membrane intercalation of the different compounds. For instance, does C16 choline and C16 glycine accumulate at the same rate in the PM (measure similar to experiment in Figure 1B). Maybe the positive charge at the headgroup of the surfactants increases the local concentration at the PM and therefore can explain the difference in effect on the PM.

      Response:

      We agree with the reviewer that the effects of the various PalmC derivatives are not directly controlled for differences in membrane intercalation. Our structure-activity screen was intended to demonstrate the general biophysical mode of action of PalmC-like compounds and to define minimal structural requirements for activity.

      We now note in the manuscript that differential membrane insertion could contribute to the observed variation in efficacy, particularly in relation to tail length. While we considered this additional suggested experiment, it was ultimately judged to be outside the scope of this study due to its complexity and limited impact on the central conclusions.

      A clarifying sentence has been added to the relevant results section to explicitly acknowledge this limitation:

      "We did not control for differences in PM intercalation efficiency."

      We also include a discussion here to further clarify our interpretation. Prior in vitro studies have shown that while intercalation is necessary, it is not sufficient for PM perturbation. For example, palmitoyl-CoA intercalates into membranes but does not induce the same biophysical effects as PalmC (Goñi et al., 1996; Ho et al., 2002). Thus, we believe that intercalation is only part of the story, and that the intrinsic propensity of different headgroups to perturb the PM plays a key role in the disruption of PM lipid organization.

      Comment: - Are the intracellular ergosterol structures associated (or in close proximity) with lipid droplets (ergosterol being modified and delivered into a lipid droplet)?

      Response:

      We thank the reviewer for raising this point. We now include additional data (Fig. S2H) showing that intracellular D4H-positive structures do not reside near or colocalize with lipid droplets. The latter is not entirely unexpected as D4H does not recognize esterified sterols. However, we do observe an increase in overall LD volume following PalmC treatment, consistent with the idea that internalized PM sterols may be stored in LDs as sterol esters over time - although we did not test if this increase in LD volume is Lam2/4 dependent. This increase is mentioned in the revised results text. An increase in cellular LDs has also been recently reported during hyperosmotic shock (Phan et al., 2025).

      For more attempts to identify a marker for intracellular D4H foci, see reply to reviewer 1.

      Comment:

      • How does the AA and DD mutations in Lam2/4 change the localization of the ergosterol sensor (before and after PalmC treatment).

      Response:

      We thank the reviewer for this question, as in the course of generating these data we realized that our "inhibited" DD mutant was in fact not phosphomimetic but displayed the same D4H distribution as the "hyperactive" AA mutant, i.e. a marked inwards shift of D4H signal away from the PM to internal structures due to increased PM-ER retrograde transport of sterols (Fig. S2C). This led us to critically re-evaluate and ultimately repeat our TORC2 activity WB experiments for PalmC treatment in LAM2/4 mutants. In this new set of experiments, the faster TORC2 recovery after PalmC treatment in the LAM2T518A LAM4S401A mutant did unfortunately not repeat robustly. It is possible that such differences can be observed under specific conditions. Nevertheless, the improved overall quality of the Western blot data allowed us to make the observation that baseline activity was already slightly different in these strains. The Lam2/4 centered part of the results section has subsequently been updated in the manuscript:

      "Using a phosphospecific antibody, we did not observe an increase in baseline TORC2 activity in lam2Δ lam4Δ cells, which had been previously reported by electrophoretic mobility shift (Murley et al., 2017). Instead, baseline TORC2 activity was consistently slightly decreased in these cells (Fig. 2D). Ypk1, activated directly by TORC2, inhibits Lam2 and Lam4 through phosphorylation on Thr518 and Ser401, respectively (Roelants et al., 2018; Topolska et al., 2020). We substituted these residues with alanine, generating a strain in which Lam2/4 were no longer inhibited by phosphorylation (Roelants et al., 2018). In these cells, yeGFP-D4H showed that free sterols were constitutively shifted away from the PM to intracellular structures (Fig. S2C, bottom panel). Intriguingly, in opposition to lam2Δ lam4Δ cells, basal TORC2 activity was increased in LAM2T518A LAM4S401A cells (Fig. 2D). This suggests that a decrease in free PM sterols stimulates TORC2 activity [...]"

      "In LAM2T518A LAM4S401A cells, TORC2 activity recovers with similar kinetics as the WT (Fig. 2D, bottom blot), suggesting that Lam2/4 release from TORC2 dependent inhibition during PalmC treatment is a fast and efficient process in WT cells, not further expedited by these constitutively active Lams."

      As suggested, we also observed D4H localization in LAM2T518A LAM4S401A after PalmC treatment, and implemented these data to further demonstrate that PalmC causes an increase in the fraction of free ergosterol at the PM, which is subsequently removed:

      "PalmC addition to LAM2T518A LAM4S401A cells likewise resulted first in a transient increase and then a further decrease in PM yeGFP-D4H signal (Fig. 3C, S3D)."

      Comment: - Does Lam2/4 localize to ER-PM contact sites near the large PM invaginations, which could allow for efficient transport of the free ergosterol that accumulates in these structures.

      Response:

      We were curious about this too, and have now added the requested data in our supplementary material and added a sentence in our results:

      "Indeed, in cells expressing GFP-Lam2 we observed that PalmC induced PM invaginations often formed at sites with preexisting GFP-Lam2 foci (Fig. S2K, cyan arrow), although GFP-Lam2 foci did not always colocalize with invaginations (Fig. S2K, yellow arrow) and vice versa. "

      Additionally, in the effort to characterize intracellular D4H foci during PalmC as requested by reviewer 1, we also looked at the localization of these foci relative to ER, and found that

      "During early timepoints, intracellular foci are usually in close vicinity to ER (Fig. S2E)"

      Reviewer #3 (Significance (Required)): The manuscript describes the effects of small molecule surfactants on the PM organization and on TORC2 activity. This is an important set of observation that helps understanding the response of cells to environmental stressors that affect the PM. This field of study is very challenging because of the limited tools available to directly observe lipids and their movements. I consider the data and most of its interpretations of high importance, but I am not convinced of the larger model that tries to link the ergosterol data with TORC2 activity. With adjustments of the model or additional experimental support, this manuscript will be of general interest for cell biologists, especially for researchers studying membrane stress response pathways.

      Response:

      We thank the reviewer for highlighting the importance of studying PM stress responses and acknowledging the technical challenges involved. We hope the applied changes and additional data succeed in softening our claims about TORC2 regulation while convincing the reviewer that free sterol levels at the PM are one of several contributing factors that correlate with changes in TORC2 activity.

      Colom, A., Derivery, E., Soleimanpour, S., Tomba, C., Dal Molin, M., Sakai, N., González-Gaitán, M., Matile, S., Roux, A., 2018. A Fluorescent Membrane Tension Probe. Nat. Chem. 10, 1118-1125. https://doi.org/10.1038/s41557-018-0127-3

      Flesch, F.M., Yu, J.W., Lemmon, M.A., Burger, K.N.J., 2005. Membrane activity of the phospholipase C-δ1 pleckstrin homology (PH) domain. Biochem. J. 389, 435-441. https://doi.org/10.1042/BJ20041721

      Gatta, A.T., Wong, L.H., Sere, Y.Y., Calderón-Noreña, D.M., Cockcroft, S., Menon, A.K., Levine, T.P., 2015. A new family of StART domain proteins at membrane contact sites has a role in ER-PM sterol transport. eLife 4. https://doi.org/10.7554/eLife.07253

      Goñi, F.M., Requero, M.A., Alonso, A., 1996. Palmitoylcarnitine, a surface-active metabolite. FEBS Lett. 390, 1-5. https://doi.org/10.1016/0014-5793(96)00603-5

      Ho, J.K., Duclos, R.I., Hamilton, J.A., 2002. Interactions of acyl carnitines with model membranes: a (13)C-NMR study. J. Lipid Res. 43, 1429-1439. https://doi.org/10.1194/jlr.m200137-jlr200

      Jentsch, J.-A., Kiburu, I., Pandey, K., Timme, M., Ramlall, T., Levkau, B., Wu, J., Eliezer, D., Boudker, O., Menon, A.K., 2018. Structural basis of sterol binding and transport by a yeast StARkin domain. J. Biol. Chem. 293, 5522-5531. https://doi.org/10.1074/jbc.RA118.001881

      Murley, A., Yamada, J., Niles, B.J., Toulmay, A., Prinz, W.A., Powers, T., Nunnari, J., 2017. Sterol transporters at membrane contact sites regulate TORC1 and TORC2 signaling. J. Cell Biol. 216, 2679-2689. https://doi.org/10.1083/jcb.201610032

      Otzen, D.E., Blans, K., Wang, H., Gilbert, G.E., Rasmussen, J.T., 2012. Lactadherin binds to phosphatidylserine-containing vesicles in a two-step mechanism sensitive to vesicle size and composition. Biochim. Biophys. Acta BBA - Biomembr., Protein Folding in Membranes 1818, 1019-1027. https://doi.org/10.1016/j.bbamem.2011.08.032

      Phan, J., Silva, M., Kohlmeyer, R., Ruethemann, R., Gay, L., Jorgensen, E., Babst, M., 2025. Recovery of plasma membrane tension after a hyperosmotic shock. Mol. Biol. Cell 36, ar45. https://doi.org/10.1091/mbc.E24-10-0436

      Ragaller, F., Sjule, E., Urem, Y.B., Schlegel, J., El, R., Urbancic, D., Urbancic, I., Blom, H., Sezgin, E., 2024. Quantifying Fluorescence Lifetime Responsiveness of Environment-Sensitive Probes for Membrane Fluidity Measurements. J. Phys. Chem. B 128, 2154-2167. https://doi.org/10.1021/acs.jpcb.3c07006

      Riggi, M., Niewola-Staszkowska, K., Chiaruttini, N., Colom, A., Kusmider, B., Mercier, V., Soleimanpour, S., Stahl, M., Matile, S., Roux, A., Loewith, R., 2018. Decrease in plasma membrane tension triggers PtdIns(4,5)P2 phase separation to inactivate TORC2. Nat. Cell Biol. 20, 1043-1051. https://doi.org/10.1038/s41556-018-0150-z

      Rodríguez-Escudero, I., Fernández-Acero, T., Cid, V.J., Molina, M., 2018. Heterologous mammalian Akt disrupts plasma membrane homeostasis by taking over TORC2 signaling in Saccharomyces cerevisiae. Sci. Rep. 8, 7732. https://doi.org/10.1038/s41598-018-25717-w

      Roelants, F.M., Chauhan, N., Muir, A., Davis, J.C., Menon, A.K., Levine, T.P., Thorner, J., 2018. TOR complex 2-regulated protein kinase Ypk1 controls sterol distribution by inhibiting StARkin domain-containing proteins located at plasma membrane-endoplasmic reticulum contact sites. Mol. Biol. Cell 29, 2128-2136. https://doi.org/10.1091/mbc.E18-04-0229

      Sakata, K.-T., Hashii, K., Yoshizawa, K., Tahara, Y.O., Yae, K., Tsuda, R., Tanaka, N., Maeda, T., Miyata, M., Tabuchi, M., 2022. Coordinated regulation of TORC2 signaling by MCC/eisosome-associated proteins, Pil1 and tetraspan membrane proteins during the stress response. Mol. Microbiol. 117, 1227-1244. https://doi.org/10.1111/mmi.14903

      Shao, C., Novakovic, V.A., Head, J.F., Seaton, B.A., Gilbert, G.E., 2008. Crystal Structure of Lactadherin C2 Domain at 1.7Å Resolution with Mutational and Computational Analyses of Its Membrane-binding Motif*. J. Biol. Chem. 283, 7230-7241. https://doi.org/10.1074/jbc.M705195200

      Shi, J., Heegaard, C.W., Rasmussen, J.T., Gilbert, G.E., 2004. Lactadherin binds selectively to membranes containing phosphatidyl-L-serine and increased curvature. Biochim. Biophys. Acta 1667, 82-90. https://doi.org/10.1016/j.bbamem.2004.09.006

      Souza, C.M., Schwabe, T.M.E., Pichler, H., Ploier, B., Leitner, E., Guan, X.L., Wenk, M.R., Riezman, I., Riezman, H., 2011. A stable yeast strain efficiently producing cholesterol instead of ergosterol is functional for tryptophan uptake, but not weak organic acid resistance. Metab. Eng. 13, 555-569. https://doi.org/10.1016/j.ymben.2011.06.006

      Stefan, C.J., Audhya, A., Emr, S.D., 2002. The yeast synaptojanin-like proteins control the cellular distribution of phosphatidylinositol (4,5)-bisphosphate. Mol. Biol. Cell 13, 542-557. https://doi.org/10.1091/mbc.01-10-0476

      Tong, J., Manik, M.K., Im, Y.J., 2018. Structural basis of sterol recognition and nonvesicular transport by lipid transfer proteins anchored at membrane contact sites. Proc. Natl. Acad. Sci. 115, E856-E865. https://doi.org/10.1073/pnas.1719709115

      Topolska, M., Roelants, F.M., Si, E.P., Thorner, J., 2020. TORC2-Dependent Ypk1-Mediated Phosphorylation of Lam2/Ltc4 Disrupts Its Association with the β-Propeller Protein Laf1 at Endoplasmic Reticulum-Plasma Membrane Contact Sites in the Yeast Saccharomyces cerevisiae. Biomolecules 10, 1598. https://doi.org/10.3390/biom10121598

      Torra, J., Campelo, F., Garcia-Parajo, M.F., 2024. Tensing Flipper: Photosensitized Manipulation of Membrane Tension, Lipid Phase Separation, and Raft Protein Sorting in Biological Membranes. J. Am. Chem. Soc. 146, 24114-24124. https://doi.org/10.1021/jacs.4c08580

      Uekama, N., Aoki, T., Maruoka, T., Kurisu, S., Hatakeyama, A., Yamaguchi, S., Okada, M., Yagisawa, H., Nishimura, K., Tuzi, S., 2009. Influence of membrane curvature on the structure of the membrane-associated pleckstrin homology domain of phospholipase C-δ1. Biochim. Biophys. Acta BBA - Biomembr. 1788, 2575-2583. https://doi.org/10.1016/j.bbamem.2009.10.009

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors describe the effects of surfactant-like molecules on the plasma membrane (PM) and its associated TORC2 complex. Addition of the surfactants with a positively-charged headgroup and a hydro-carbon tail of at least 16 caused the rapid clustering of PI-4,5P2 together with PI-4P and phosphatidylserine in large membrane invaginations. The authors convincingly demonstrate that this effect of the surfactants on the PM is likely caused by a direct disturbance of the PM organization and/or lipid composition. Interestingly, upon PalmC treatment, free ergosterol of the PM was found to first concentrate in the clusters, but within <5min this ergosterol seemed to be transported into intracellular structures, causing an overall loss in free ergosterol of the PM. The authors speculate that the initial spike in free ergosterol might be the trigger for the shutdown of TORC2 signaling. The PalmC-triggered transport of free ergosterol from the PM to intracellular structures required the lipid transport proteins Lam2/4. Loss of these transporters caused a delay in TORC2 reactivation, supporting the idea that ergosterol transport out of the PM plays a role in the recovery of normal PM organization. Hyperosmotic shock mimics some of the effects observed with PamlC, but unlike PalmC treatment, TORC2 recovery after hyperosmotic shock is not dependent on Lam2/4.

      The presented data are of high quality and most conclusions are well supported. However, based on the presented data the model that a PalmC-triggered increase in free ergosterol is the cause of TORC2 inactivation is not obvious to me. The kinetics of the changes in free ergosterol levels and the changes in TORC2 activity do not match. Ergosterol is rapidly depleted after PalmC treatment (<5min) whereas TORC2 activity requires 30min to recover. Also, the hyperosmotic data on free ergosterol levels and TORC2 activity do not match. In fact, the presence of the large PM invaginations is a better predictor of TORC2 activity. The Lam2/4 data support the idea that ergosterol transport plays a role in the TORC2 recovery, but what role this is, is not clear to me. I think the data fit better with a model in which PalmC causes low tension of the PM which in turn disrupts normal lipid organization and thus causes TORC2 to shut down, maybe not by changes in free ergosterol but by changes, for instance, in lipid raft formation (which is in part effected by ergosterol levels). The transport of ergosterol is only one mechanism that is involved in restoring PM tension and TORC2 activity. However, sensing free ergosterol alone is most likely not the mechanism explaining how TORC2 senses PM tension. Therefore, I recommend that the model is revised (or supported by more data), reflecting the fact that free ergosterol levels do not directly correlate with the TORC2 activity, but instead might be only one of the PM parameters that regulate TORC2.

      Further comments:

      • If TORC2 is indeed inhibited by free ergosterol, the addition of ergosterol to the growth medium should be able to trigger similar effects as PalmC. If this detection of free ergosterol is very specific (e.g. if TORC2 has a binding pocket for ergosterol) we would expect that addition of other sterols such a cholesterol or ergosterol precursors should not inhibit TORC2.
      • The experiment in Figure 1C is not controlled for differences in membrane intercalation of the different compounds. For instance, does C16 choline and C16 glycine accumulate at the same rate in the PM (measure similar to experiment in Figure 1B). Maybe the positive charge at the headgroup of the surfactants increases the local concentration at the PM and therefore can explain the difference in effect on the PM.
      • Are the intracellular ergosterol structures associated (or in close proximity) with lipid droplets (ergosterol being modified and delivered into a lipid droplet)?
      • How does the AA and DD mutations in Lam2/4 change the localization of the ergosterol sensor (before and after PalmC treatment).
      • Does Lam2/4 localize to ER-PM contact sites near the large PM invaginations, which could allow for efficient transport of the free ergosterol that accumulates in these structures.

      Significance

      The manuscript describes the effects of small molecule surfactants on the PM organization and on TORC2 activity. This is an important set of observation that helps understanding the response of cells to environmental stressors that affect the PM. This field of study is very challenging because of the limited tools available to directly observe lipids and their movements. I consider the data and most of its interpretations of high importance, but I am not convinced of the larger model that tries to link the ergosterol data with TORC2 activity. With adjustments of the model or additional experimental support, this manuscript will be of general interest for cell biologists, especially for researchers studying membrane stress response pathways.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes multiple effects of positively-charged membrane-intercalating amphipaths (palmitoylcarnitine, PalmC, in particular) on TORC2 in yeast plasma membranes. It is a "next step" in the Loewith laboratory's characterization of the effect of this agent on this system. The study confirms the findings of Riggi et al.(2018) that PalmC inhibits TORC2 and drives the formation of membrane invaginations that contain phosphatidylinositol-bis-phosphate (PIP2) and other anionic phospholipids. It also demonstrates that PalmC intercalates into the membrane, acts directly (rather than through secondary metabolism) and is representative of a class of cationic amphipaths. The interesting finding here is that PalmC causes a rapid initial increase in the plasma membrane ergosterol accessible to the DH4 sterol probe followed by a decrease caused by its transfer to the cytoplasm through its transporter, LAM2/4. TORC2 is implicated in these processes.

      Loewith et al. have pioneered in this area and this study clearly shows their expertise. Several of the findings reported here are novel. However, I am concerned that PalmC may not be revealing the physiology of the system but rather adding tangential complexity. (This concern applies to the precursor studies using PalmC to probe the TORC2 system.) In particular, I am not confident that the data justify the authors' conclusions "...that TORC2 acts in a feedback loop to control active sterol levels at the PM and [the results] introduce sterols as possible TORC2 signalling modulators."

      Major issues

      1. The invaginations induced by PalmC may not be physiologic but simply the result of the well-known "bilayer couple" bending of the bilayer due to the accumulation of cationic amphipaths in the inner leaflet of the plasma membrane bilayer which is rich in anionic phospholipids. Such unphysiological effects make the observed correlation of invagination with TORC2 inhibition etc. hard to interpret.
      2. Electrostatic/hydrophobic association of PIP2 with PalmC could sequester the anionic phospholipid(s). Such associations could also drive the accumulation of PIP2 in the invaginations. This could explain PalmC inhibition of TORC2 through a simple physical rather than biological process. So, it is difficult to draw any physiological conclusion about PIP2 from these experiments.
      3. As the authors point out, a large number of intercalated amphipaths displace sterols from their association with bilayer phospholipids. This unphysiologic mechanism can explain how PalmC causes the transient increase in the availability of plasma membrane ergosterol to the D4H probe and its subsequent removal from the plasma membrane via LAM2/4. TORC2 regulation may not be involved. In fact,the authors say that "TORC2 inhibition, and thereby Lam2/4 activation, cannot be the only trigger for PalmC induced sterol removal." Furthermore, the subsequent recovery of plasma membrane ergosterol could simply reflect homeostatic responses independent of the components studied here.

      3a. The data suggest that LAM2/4 mediates the return of cytoplasmic ergosterol to the plasma membrane. To my knowledge, this is a nice finding that not been reported previously and is worth confirming more directly. 4. I agree with the authors that "It is unclear if the excess of free sterols itself is part of the inhibitory signal to TORC2..." Instead, the inhibition of TORC2 by PalmC may simply result from its artifactual aggregation of the anionic phospholipids (especially, PIP2) needed for TORC2 activity. This would not be biologically meaningful. If the authors wish to show that accessible ergosterol inhibits TORC2 activity or vice versa, they should use more direct methods. For example, neutral amphipaths that do not cause the aforementioned PalmC perturbations should still increase plasma membrane ergosterol and send it through LAM2/4 to the ER. 5. The mechanistic relationship between TORC2 activity and ergosterol suggested in the the title, abstract and discussion is not secure. I agree with the concluding section of the manuscript called "Limitations of the study". It highlights the need for a better approach to the interplay between TORC2 and ergosterol.

      Minor issue

      Based on earlier work using the reporter fliptR, the authors claim that PalmC reduces membrane tension. They should consider that this intercalated dye senses many variables including membrane tension but also lipid packing. I suspect that, by intercalating into and thereby altering the bilayer, PalmC is affecting the latter rather than the former.

      Referees cross-commenting

      Reviewers #1 and #3 were much more impressed by this study than I was. I am not a yeast expert and so I may have missed or confused something. I would therefore welcome their expert feedback regarding my comments (#2). Ted Steck

      Significance

      This is an interesting topic. However, use of the exogenous probe, palmitoylcarnitine, could be causing multiple changes that complicate the interpretation of the data.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a very well conceived study of responses to plasma membrane stresses in yeast that signal through the conserved TORC2 complex.

      Physical stress through small molecular intercalators in the plasma membrane is shown to be independent of their biochemistry and then studies for its effect on plasma membrane morphology and the distribution of free ergosterol (the yeast equivalent of cholesterol), with free being the pool of cholesterol that is available to probes and/or sterol transfer proteins. Experiments nicely demonstrate a negative feedback loop consisting of: stress -> increased free sterol and TORC2 inhibition -> activation of LAM proteins (as demonstrated by Relents and co-workers previously) -> removal of free sterol -> return to unstressed state of PM and TORC2.

      Comments

      Fig 2A: Is detection of PIP/PIP2/PS linear for target, or possibly just showing availability that is increased due to local positive curvature?

      Can any marker be identified for the D4H spots at 2 minutes? In particular, are they early endosomes (shown by brief pre-incubation with FM4-64)?

      Is there any functional (& direct) link between Arp inhibition (as in the Pombe study of LAMs by the lab of Sophie Martin) and PM disturbance by amphipathic molecules ?

      Minor

      Fig 2A: Labels not clear. Say for each part what FP is used for pip2. Move fig s2d to main ms. The 1 min and 2 min data are integral to the story

      The role of Lam2 and Lam4 in retrograde sterol transport has in vivo only been linked to one of their two StART domains not both, as mentioned in the text.

      Throughout, images of tagged D4H should be labelled as such, not as "Ergosterol".

      Significance

      These results in budding yeast are likely to be directly applicable to a wide range of eukaryotic cells, if not all of them. I expect this paper to be a significant guid elf research in this area.

      The paper specifically points out that the current experiments do not distinguish the precise causation among the two outcomes of stress: increased free sterol and TORC2 inhibition. Of these two outcomes which causes which is not yet known. If data were added that shed light on this causation that would make this work much more signifiant, but I can understand 100% that this extra step lies beyond - for a later study for which the current one forms the bedrock.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all the reviewers for their helpful and constructive comments and for their time.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):*

      Summary: Dady et al have developed fluorescent reporters to enable live imaging of cell behaviour and morphology in human pluripotent stem cell lines (PSCs). These reporters target 3 main features, the plasma membrane, nucleus and cytoskeleton. Reporter PSCs have been generated using a piggyBac transposon-mediated stable integration strategy, using a hyperactive piggyBac transposase (HyPBase). The same constructs were also used for mosaic labelling of cells within 2D cultures using lipofectamine transfection.

      The reporters used are tagged with either eGFP or mKate2 (far red) and tag the plasma membrane (pm) via the addition of a 20 amino-acid sequence from rat GAP-43 to the N-terminus of the fluorescent protein, the nucleus via Histone 2B with a laser-mediated photo-conversion option (H2B-mEos3.2), and the cytoskeleton via F-Tractin. In total, the authors produced lines with the following:

      • pm-mKate2 (far red) • pm-eGFP (green) • H2B-mEos3.2 (green to red) • F-tractin-mKate2 (far red) • H2B-mEos3.2 and pm-mKate2 (green to red, plus far red)

      The cell lines used to generate these were the human embryonic stem cell line H9 and human induced pluripotent cell line ChiPS4. The constructs were also used to label cells in a mosaic fashion, using lipofectamine transfection of the original cell lines once they had formed neural rosettes.

      Using these cells, Dady et al then performed live imaging in vitro of human spinal cord rosettes and assessed cell behaviour. In particular they analysed mitotic cleavage planes and apical positioning of neural progenitor cells (NPCs), and assessed actin dynamics within these cells. They showed a slowing of the cell cycle length after the initial expansion phase, an increase in the rate of asymmetric division of these NPCs, and abscission of the apical membrane during these divisions. The F-tractin reporter showed enrichment at the basal nuclear membrane during these cell divisions, suggested to help prevent basal chromosome displacement during mitosis.

      Major comments: The data presented are convincing and could be strengthened by the following additions and clarifications:*

      1. How long do the fluorescent reports take to be visible when transfected via lipofectamine? How efficiently are they expressed? And what concentrations were tested to enable the mosaic expression presented? * We followed the manufacturer’s instructions for Lipofectamine 3000 transfection, using the protocol recommended for set up for a 6 wells plate. We detected fluorescence the following morning ~16h. We did not assess earlier time points or optimise efficiency as we observed the mosaic pattern of expression we set out to achieve, with small groups of labelled cells and single cells as shown in Figure 3 and movies 2 and 3. This information and the detailed protocol provided below are now included in the Methods section “Labelling individual cells in human spinal cord rosettes by lipofection”.

      Manufacturer’s instructions for Lipofectamine 3000 transfection (6 well plate):

      • 1 tube containing 125 ul of Opti-MEM and 7.5 ul of Lipofectamine 3000
      • 1 tube containing 250 ul of Opti-MEM with 5 ug of DNA (total mix DNAs of 2 ug/ul) and P3000 Reagent
      • Add diluted DNA to diluted Lipofectamine 3000 (Ratio 1:1) and incubate for 10 to 15 min at Room Temperature.
      • 20 ul of DNA-Lipid complex was added to neural rosettes growing in 8 well IBIDI dishes (20 ul/well).
      • The ratio of DNA (PiggyBac plasmid) and HypBase transposase was kept at 5:1 (for a final concentration of 2ug/ul).
      • Cells in IBIDI dishes were left to develop in a sterile incubator overnight and mosaic fluorescence was observed the following morning (~16h post-lipofection).

      • Will these cell lines and constructs be made publicly available after publication?*

      The cell lines can be made available: for those reporters made in the H9 WiCell line an MTA will first have to be signed between the requesting PI and WiCell and permission for us to share the line(s) confirmed by WiCell; similarly, for reporters in ChiPS4 line an MTA will first need to be signed between the requesting PI and Cellartis/TakaraBio Europe. We will need to make a charge to cover costs. Constructs will be deposited with Addgene.

      • Were the H9 and ChiPS4 lines characterised after the reporters were added to show they still proliferate/differentiate as they did prior to the reporter integration*?

      In the Results we make clear that all lines created are polyclonal, with exception of a pm-eGFP ChiPS4 line, which is a monoclonal line (lines 145-150). We do not have direct data measuring cell proliferation but collected cell passaging data for all the reporter lines. This showed that they grow to similar densities at each passage compared to the parental line (this metadata is now provided as Supplementary data 1 and is cited in the Methods, line 348).

      As a proof of principle for this approach, we created one monoclonal line from a polyclonal line ChIPS4-pm-eGFP. The latter was made by selecting an individual clone and this was then expanded and characterised for expression of pluripotency markers (immunocytochemistry data Figure S4), and the ability to differentiate into 3 germ layers (qPCR Supplementary data 1). This information is already cited in the Methods (Lines 358-362).

      • Can the novel actin dynamics described be quantified? How many cells imaged show these novel dynamics?* Some of this quantification data was already reported in the paper (in figure 4 legend and in the Methods); we have now updated this and provide the detailed metadata in an Excel spread sheet, Supplementary data 4 (cited in the Methods, line 489)

      Minor comments: 1. Some images in the figures and supplemental movies are low in resolution, for example the DAPI in Fig 4B, making it hard to distinguish individual cells. Please increase this.

      We consider the DAPI labelling in Figure 4b to be clear, however, we wonder whether the reviewer was expecting to also see this combined with the other markers. We have therefore now provided these merged additional images in a revised Figure 4.

      • Please show a merge of Phalloidin and F-Tractin in Fig4, this will help the colocalization to be fully appreciated.*

      This has now been provided in revised Figure 4B.

      • Some additional annotation on the supplemental movies would be useful to indicate to the **reader exactly what cell to follow. *

      We have added indicative arrows to the movies, and note that more detailed labelling of the series of still images from these movies are provided in the main figures (Figures 3D and 4E & F).

      *Reviewer #1 (Significance (Required)):

      Human neurogenesis is currently poorly understood compared to many model systems used, yet key differences have already been identified between the human and the mouse, prompting the need for further investigation of human neural development. A major reason that human neurogenesis has been difficult to study is a lack of tools to enable cell morphology and behaviours to be analysed in real time.

      The reporters and reporter PSC lines generated by Dady et al will allow many of these cell characteristics to be observed using live imaging. For example, the morphology of neural progenitors during and after cell divisions, how the apical and basal processes and membranes are divided, and how the actin cytoskeleton helps to regulate these processes.

      *Importantly, PSC lines can be very heterogeneous, making generating reporter lines costly and time intensive. The use of these reporters with lipofectamine transfection, for a mosaic labelling, allows the visualisation of the plasma membrane, nucleus and cytoskeleton in any human PSC/NPC line, or even in human tissue cultures, without the need to generate each specific reporter line, making it a valuable tool for many labs in the field.

      We strongly agree with this final point; this is a major reason for our study.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):*

      The manuscript describes the generation of novel lines of human pluripotent stem cells bearing fluorescent reporters, engineered through piggyBac transposon-mediated integration. The cells are differentiated into neuronal organoids, allowing to capture cellular behaviors associated to cell division. A replating protocol allows the observation of aging neurons by reducing the thickness of the tissue thereby facilitating live imaging. The authors also leverage the transposon technology to create mosaically-labelled organoids which allows visualizing aspects of neuronal delamination, notably cytoskeleton dynamics. They discover an undescribed pattern of F-actin enrichment at the basal nuclear membrane prior to nuclear envelope breakdown.

      L104-109: "Moreover, the transposon system obviates drawbacks of directly engineering endogenous proteins...". Despite the risk of endogenous protein dysfunction, directly tagging allows the full regulation of gene expression (including the promoter, the enhancers and other regulatory regions rather than a strong constitutive promoter such as CAG). In addition, the number of copies integrated and the genomic regions are variable with PB, which does not reflect the endogenous expression. This could be rephrased by nuancing the advantages and drawbacks of each approach. The PiggyBac method is easier and faster, but it results in overexpression of a tagged protein that will be expressed since the hESC state and might not reflect the expression dynamics of the endogenous protein.* We agree and have now revised this in the Introduction L109-118.

      *L124-126: "To monitor cell shape and dynamics we used a plasma membrane (pm) localized protein tagged with eGFP or mKate2 (pm-eGFP or pm-mKate2)." Could the authors provide more details and a reference on the palmitoylated rat peptide use to force membrane expression? *

      This information, including the peptide sequence, is provided in the Methods (L330-331), we have now added a reference addressing its role in membrane localisation PMID: 2918027.

      L132-133: " Finally, to observe actin cytoskeletal dynamics we selected F-tractin, for its minimal impact on cytoskeletal homeostasis".

      A recent JCB paper (https://doi.org/10.1083/jcb.202409192) suggests that "F-tractin alters actin organization and impairs cell migration when expressed at high levels". Whether the overexpression of F-tractin in hESC using a CAG promoter reflects the physiological F-actin dynamics and/or if the high levels could lead to an alteration of cell behavior should be addressed or at least discussed. The paper we cite in this sentence (Belin et al 2014) evaluates F-tractin expression against other approaches to labelling and monitoring the actin cytoskeleton and concludes that in comparison F-tractin has minimal impact.

      We do appreciate that expression above the endogenous level has the potential to alter cell behaviour and have revised the paper to more explicitly acknowledge this: in the Introduction (L109-112), and in the Discussion/conclusion (L289-293) where we now note the recent advances reported in Shatskiy et al. 2025 PMID: 39928047.

      “A further potential limitation of this approach is that over-expression driven by the CAG promoter might not reflect physiological protein dynamics and/or alter cell behaviour; for example, high levels of F-Tractin can impair cell migration and induce actin bundling, interestingly, this can now be minimised by removing the N-terminal region (Shatskiy et al 2025)”.

      L146-147: "...to generate polyclonal cell lines selected for expression of easily detectable (medium level) fluorescence for live imaging studies". What are the criteria used to define medium level? Number of copies integrated into the genome? Or levels by FACS during clone selection?

      To clarify, all the lines presented here are polyclonal, except for one clonal line, pm-eGFP in ChiPS4. The numbers of copies integrated may vary from cell to cell in polyclonal lines. In this study, we selected cells for all lines with a FACS gate and this data is presented in Figure S1 (see line 147).

      L260-263: "Efficient stable integration and moderate expression levels were achieved by optimising, i) the quantity and ratio of piggyBac plasmids and transposase and ii) subsequent FACS to exclude high expressing cells, as well as iii) transfection methods, including temporally defined lipofection in hiPSC-derived tissues." The ration 5:1 is classically used for PB Transposase delivery, however there is still high variability in the number of copies integration. Lipofection in derived tissues has been shown to be challenging. Could the authors should provide quantitative data regarding the efficiency of their approaches, notably the level of mosaicism one could expect?

      We provide quantitative data for the efficiency of transfection using nucleoporation assays (FACS data presented in Supplementary figure S1), which shows more than 80-90% efficiency for eGFP in 82.82% of cells, mKate2 in 92.74% of cells, and H2B-mEos3 22.75% of cells, while 13.79% of cells co-expressed pm-Kate and H2B-mEos3.2. No comparative data regarding the efficiency of the tissue Lipofection assay was collected: our goal was to label single/small numbers of cells in order to monitor individual cell behaviours, and this “inefficient labelling” was readily achieved following the manufacturer’s instructions (please see response to Review 1 point 1), further details are now provided in the Methods.

      L191-194: "We further wished to monitor sub-cellular behaviour within the developing neuroepithelium. To achieve this, we devised a strategy to target a mosaic of cells in established neural rosettes using lipofection. PiggyBac constructs and HyPBase transposase were transfected into D8/D9 human spinal cord neural progenitors using lipofectamine (Felgner, et al., 1987)(Fig. 3A)." The mosaicism is not an all or nothing in this method but also leads to variations in expression levels among the positive cells. The protocol for lipofection could be better detailed to allow easy reproduction by other teams, and its expected efficiency should be discussed. It would be interesting to explore the relationship between individual cells phenotype and expression levels. Please see response to Reviewer 1 point 1 above for more detailed lipofection protocol which generated mosaic expression, this is now also included in the Methods. We agree that investigating the relationship between individual cell phenotypes and expression levels would be interesting, but we think this is beyond the scope of this paper.

      Additional comments: -Did the authors perform karyotyping of the hPSCs prior to use in the differentiation protocol?

      As these are polyclonal lines, we did not undertake karyotyping. This could be done for the one monoclonal line described here (pm-eGFP ChiPS4 line): we lack funds for commercial options, but we are exploring other possibilities.

      -Were pluripotency assays performed after reporter lines generation?

      These were carried out for the clonal pm-eGFP ChiPS4 line (lines 145-150). The latter was made by selecting an individual clone and this was then expanded and characterised for expression of pluripotency markers by IF (Figure S4), and the ability to differentiate into 3 germ layers by qPCR (Supplementary data 2). This information is provided in the Methods (Lines 358-362).

      *-Did the authors measure the cell proliferation rate in H2B-overexpressing cells and controls? Since H2B plays an important role in cytokinesis, it could interfere in cell division when H2B is overexpressed (see doi: 10.3390/cells8111391). *

      We did not directly measure cell division when H2B is over-expressed. However, we assessed cell -passaging time of all the transfected cell lines. This showed that they grow to similar densities at each passage compared to the parental line (this is now provided as Supplementary data 1 and is cited in the Methods, line 348). We also found no difference between apical visiting time of progenitors in spinal cord rosettes expressing pm-eGFP or H2B-mEoS3.2, further supporting the conclusion that levels of H2B-mEoS3.2 expression achieved in this line did not interfere with cell division (metadata provided in Supplementary data 3).

      The authors should provide data concerning the efficiency of expression of the distinct markers after electroporation. This is provided in Supplementary Figure S1 (FACS data) and detailed above for this reviewer.

      *At Fig 1C, the schematic representation describes clone selection, however in the methods it is stated (L348-349): "Sorted cells expressing medium levels of fluorescence were expanded and frozen then representative lots of each polyclonal cell line...". There is some confusion regarding which experiments were performed using polyclonal medium-level mixed populations or monoclonal populations. *

      We apologise for any confusion and have revised the Figure 1C schematic to indicate that cells can be selected to either make polyclonal lines or clonal lines.

      *Reviewer #2 (Significance (Required)):

      The study provides novel tools, as well as elements regarding neuroepithelium biology. It is well conducted and written, and the quality of images is excellent. It reads more as a resource paper in its current version, since the observation regarding neural cell division and delamination are interesting but not deeply explored, so this review will focus on those technical aspects rather than the novelty of the biological findings.

      This study would be of interests for researchers in stem cells and organoids, developmental biology, and neurosciences.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the manuscript, "Engineering fluorescent reporters in human pluripotent cells and strategies for live imaging human neurogenesis" the authors Dady et al. describe the adaptation of a recent advancement in transposase technology (HyPBase) as a method to integrate live reporters in human pluripotent stem cells. They show that these florescent reporters paired with new imaging strategies can be used to confirm the existence cellular behaviour described in other species such as the interkinetic nuclear migration (IKNM) of dividing progenitors in neural tube development. Finally, they demonstrate that this live imaging system is also able to discover novel biology by identifying previously undescribed actin polymerization at the basal nuclear surface of cortical progenitors undergoing cell division. Overall, the study presents two examples in which this adapted tool will aid in live-imaging studies of cellular biology.

      Major Concerns: 1. This work needs more controls to properly demonstrate claims that their engineering strategy provides an advancement to current Piggyback methods. Their HyPBase strategy needs to be compared and quantified in terms of efficiency with other methods to support their claims (increased detection and reduced phototoxicity).*

      We do not make specific claims for our experiments with respect to the superiority of HyPBase strategy. Our comments on this approach referred to by the reviewer here are in the Introduction (L 94-103), are supported by the literature (e.g. more stable gene expression than native piggyBac or the Tc1/mariner transposase Sleeping Beauty (Doherty, et al., 2012, Yusa, et al., 2011) and serve to explain our selection of HyPBase for our experiments. We make a case for using HyPBase as opposed to another transposase and although it would be interesting to compare efficiencies, this comment does not specify what “other methods” might be informative.

      2.Throughout the manuscript more quantification is needed of the results. How many rosettes were examined? Were all the reported cells within one rosette? Were there differences between rosettes? This should be done for both the spinal and cortical differentiations.

      The reviewer appears to have missed this information – we placed detailed quantifications in the figure legends (numbers of independent experiments and rosettes) and in the Methods in a specific section on Quantification of cell behaviour (L465-486), rather than in the main text. These has since been further updated and we now also provide additional metadata in the form of Excel spreadsheets for quantifications and analyses made for both spinal cord and cortical rosettes (Supplementary data 3 and 4 respectively).

      Minor Comments: 1. Line 246 needs quantification shown in figures of the statements made. Specifically, how many cells were measured to get this number?

      This information was provided in the figure 4 legend and we have since added numbers to these data; we were able to monitor 169 divisions in 21 rosettes; 154/166 divisions had vertical cleavage planes (symmetric) and 12/166 had horizontal cleavage planes (asymmetric).

      These detailed observations were made in two independent experiments, along with observations of basal nuclear membrane F-Tractin localisation. This is noted in figure 4 legend, Methods and detailed metadata is provided in Supplementary data 4.

      2.How many cells in the cortical rosettes had the enriched actin at the basal nuclear surface?

      We confidently observed basal nuclear membrane F-Tractin enrichment in 141/146 divisions, for the remaining 20 cases (166-146), we could not tell whether F-Tractin is enriched or not at the basal nuclear membrane either because of low expression levels or because the basal nuclear membrane was out of focus at NEB. In 5 cases, we did not see the basal nuclear enrichment despite sufficient F-Tractin expression levels and the nucleus being in focus. We have updated the Fig4 legend excluding the non-analysable cases and see detailed metadata is provided in Supplementary data 4.

      *Reviewer #3 (Significance (Required)):

      General Assessment: This manuscript makes a very minor advancement in the field of stem cell engineering and developmental biology, but one that is worthy of publication with a few edits.

      Advance: While PiggyBac reporters are widely used in stem cell engineering, Dady et al. demonstrate a new workflow using HyPBase which would be beneficial to the field. However, to increase this benefit, much more description and quantification of the methods would be needed. The biological advances of this manuscript are also very minor, but interesting as most of them confirm that human neural rosettes mimic many of the observed cell behaviours seen in animal models. Along these lines is the actin dynamics observation in cortical rosettes is interesting, but a preliminary observation and in need of follow up experiments.

      Audience: Regardless, this technique would be of interest to the wider field of stem cell engineering.

      My Expertise: Human Stem Cell Engineering, Neural Tube Development*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript, "Engineering fluorescent reporters in human pluripotent cells and strategies for live imaging human neurogenesis" the authors Dady et al. describe the adaptation of a recent advancement in transposase technology (HyPBase) as a method to integrate live reporters in human pluripotent stem cells. They show that these florescent reporters paired with new imaging strategies can be used to confirm the existence cellular behaviour described in other species such as the interkinetic nuclear migration (IKNM) of dividing progenitors in neural tube development. Finally, they demonstrate that this live imaging system is also able to discover novel biology by identifying previously undescribed actin polymerization at the basal nuclear surface of cortical progenitors undergoing cell division. Overall, the study presents two examples in which this adapted tool will aid in live-imaging studies of cellular biology.

      Major Concerns:

      1.This work needs more controls to properly demonstrate claims that their engineering strategy provides an advancement to current Piggyback methods. Their HyPBase strategy needs to be compared and quantified in terms of efficiency with other methods to support their claims (increased detection and reduced phototoxicity). 2.Throughout the manuscript more quantification is needed of the results. How many rosettes were examined? Were all the reported cells within one rosette? Were there differences between rosettes? This should be done for both the spinal and cortical differentiations

      Minor Comments:

      1.Line 246 needs quantification shown in figures of the statements made. Specifically how many cells were measured to get this number? 2.How many cells in the cortical rosettes had the enriched actin at the basal nuclear surface?

      Significance

      General Assessment: This manuscript makes a very minor advancement in the field of stem cell engineering and developmental biology, but one that is worthy of publication with a few edits.

      Advance: While PiggyBac reporters are widely used in stem cell engineering, Dady et al. demonstrate a new workflow using HyPBase which would be beneficial to the field. However, to increase this benefit, much more description and quantification of the methods would be needed. The biological advances of this manuscript are also very minor, but interesting as most of them confirm that human neural rosettes mimic many of the observed cell behaviours seen in animal models. Along these lines is the actin dynamics observation in cortical rosettes is interesting, but a preliminary observation and in need of follow up experiments.

      Audience: Regardless, this technique would be of interest to the wider field of stem cell engineering.

      My Expertise: Human Stem Cell Engineering, Neural Tube Development

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript describes the generation of novel lines of human pluripotent stem cells bearing fluorescent reporters, engineered through piggyBac transposon-mediated integration. The cells are differentiated into neuronal organoids, allowing to capture cellular behaviors associated to cell division. A replating protocol allows the observation of aging neurons by reducing the thickness of the tissue thereby facilitating live imaging. The authors also leverage the transposon technology to create mosaically-labelled organoids which allows visualizing aspects of neuronal delamination, notably cytoskeleton dynamics. They discover an undescribed pattern of F-actin enrichment at the basal nuclear membrane prior to nuclear envelope breakdown.

      L104-109: "Moreover, the transposon system obviates drawbacks of directly engineering endogenous proteins...". Despite the risk of endogenous protein dysfunction, directly tagging allows the full regulation of gene expression (including the promoter, the enhancers and other regulatory regions rather than a strong constitutive promoter such as CAG). In addition, the number of copies integrated and the genomic regions are variable with PB, which does not reflect the endogenous expression. This could be rephrased by nuancing the advantages and drawbacks of each approach. The PiggyBac method is easier and faster, but it results in overexpression of a tagged protein that will be expressed since the hESC state and might not reflect the expression dynamics of the endogenous protein.

      L124-126: "To monitor cell shape and dynamics we used a plasma membrane (pm) localized protein tagged with eGFP or mKate2 (pm-eGFP or pm-mKate2)." Could the authors provide more details and a reference on the palmitoylated rat peptide use to force membrane expression?

      L132-133: " Finally, to observe actin cytoskeletal dynamics we selected F-tractin, for its minimal impact on cytoskeletal homeostasis..". A recent JCB paper (https://doi.org/10.1083/jcb.202409192) suggests that "F-tractin alters actin organization and impairs cell migration when expressed at high levels". Whether the overexpression of F-tractin in hESC using a CAG promoter reflects the physiological F-actin dynamics and/or if the high levels could lead to an alteration of cell behavior should be addressed or at least discussed.

      L146-147: "...to generate polyclonal cell lines selected for expression of easily detectable (medium level) fluorescence for live imaging studies". What are the criteria used to define medium level? Number of copies integrated into the genome? Or levels by FACS during clone selection?

      L260-263: "Efficient stable integration and moderate expression levels were achieved by optimising, i) the quantity and ratio of piggyBac plasmids and transposase and ii) subsequent FACS to exclude high expressing cells, as well as iii) transfection methods, including temporally defined lipofection in hiPSC-derived tissues." The ration 5:1 is classically used for PB Transposase delivery, however there is still high variability in the number of copies integration. Lipofection in derived tissues has been shown to be challenging. Could the authors should provide quantitative data regarding the efficiency of their approaches, notably the level of mosaicism one could expect?

      L191-194: "We further wished to monitor sub-cellular behaviour within the developing neuroepithelium. To achieve this, we devised a strategy to target a mosaic of cells in established neural rosettes using lipofection. PiggyBac constructs and HyPBase transposase were transfected into D8/D9 human spinal cord neural progenitors using lipofectamine (Felgner, et al., 1987)(Fig. 3A)." The mosaicism is not an all or nothing in this method but also leads to variations in expression levels among the positive cells. The protocol for lipofection could be better detailed to allow easy reproduction by other teams, and its expected efficiency should be discussed. It would be interesting to explore the relationship between individual cells phenotype and expression levels.

      Additional comments:

      • Did the authors perform karyotyping of the hPSCs prior to use in the differentiation protocol?
      • Were pluripotency assays performed after reporter lines generation?
      • Did the authors measure the cell proliferation rate in H2B-overexpressing cells and controls? Since H2B plays an important role in cytokinesis, it could interfere in cell division when H2B is overexpressed (see doi: 10.3390/cells8111391). The authors should provide data concerning the efficiency of expression of the distinct markers after electroporation. At Fig 1C, the schematic representation describes clone selection, however in the methods it is stated (L348-349): "Sorted cells expressing medium levels of fluorescence were expanded and frozen then representative lots of each polyclonal cell line...". There is some confusion regarding which experiments were performed using polyclonal medium-level mixed populations or monoclonal populations.

      Significance

      The study provides novel tools, as well as elements regarding neuroepithelium biology. It is well conducted and written, and the quality of images is excellent. It reads more as a resource paper in its current version, since the observation regarding neural cell division and delamination are interesting but not deeply explored, so this review will focus on those technical aspects rather than the novelty of the biological findings.

      This study would be of interests for researchers in stem cells and organoids, developmental biology, and neurosciences.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Dady et al have developed fluorescent reporters to enable live imaging of cell behaviour and morphology in human pluripotent stem cell lines (PSCs). These reporters target 3 main features, the plasma membrane, nucleus and cytoskeleton. Reporter PSCs have been generated using a piggyBac transposon-mediated stable integration strategy, using a hyperactive piggyBac transposase (HyPBase). The same constructs were also used for mosaic labelling of cells within 2D cultures using lipofectamine transfection.

      The reporters used are tagged with either eGFP or mKate2 (far red) and tag the plasma membrane (pm) via the addition of a 20 amino-acid sequence from rat GAP-43 to the N-terminus of the fluorescent protein, the nucleus via Histone 2B with a laser-mediated photo-conversion option (H2B-mEos3.2), and the cytoskeleton via F-Tractin. In total, the authors produced lines with the following:

      • pm-mKate2 (far red)
      • pm-eGFP (green)
      • H2B-mEos3.2 (green to red)
      • F-tractin-mKate2 (far red)
      • H2B-mEos3.2 and pm-mKate2 (green to red, plus far red)

      The cell lines used to generate these were the human embryonic stem cell line H9 and human induced pluripotent cell line ChiPS4. The constructs were also used to label cells in a mosaic fashion, using lipofectamine transfection of the original cell lines once they had formed neural rosettes.

      Using these cells, Dady et al then performed live imaging in vitro of human spinal cord rosettes and assessed cell behaviour. In particular they analysed mitotic cleavage planes and apical positioning of neural progenitor cells (NPCs), and assessed actin dynamics within these cells. They showed a slowing of the cell cycle length after the initial expansion phase, an increase in the rate of asymmetric division of these NPCs, and abscission of the apical membrane during these divisions. The F-tractin reporter showed enrichment at the basal nuclear membrane during these cell divisions, suggested to help prevent basal chromosome displacement during mitosis.

      Major comments:

      The data presented are convincing and could be strengthened by the following additions and clarifications: 1. How long do the fluorescent reports take to be visible when transfected via lipofectamine? How efficiently are they expressed? And what concentrations were tested to enable the mosaic expression presented? 2. Will these cell lines and constructs be made publicly available after publication? 3. Were the H9 and ChiPS4 lines characterised after the reporters were added to show they still proliferate/differentiate as they did prior to the reporter integration? 4. Can the novel actin dynamics described be quantified? How many cells imaged show these novel dynamics?

      Minor comments:

      1. Some images in the figures and supplemental movies are low in resolution, for example the DAPI in Fig 4B, making it hard to distinguish individual cells. Please increase this.
      2. Please show a merge of Phallodin and F-Tractin in Fig4, this will help the colocalization to be fully appreciated.
      3. Some additional annotation on the supplemental movies would be useful to indicate to the reader exactly what cell to follow.

      Significance

      Human neurogenesis is currently poorly understood compared to many model systems used, yet key differences have already been identified between the human and the mouse, prompting the need for further investigation of human neural development. A major reason that human neurogenesis has been difficult to study is a lack of tools to enable cell morphology and behaviours to be analysed in real time.

      The reporters and reporter PSC lines generated by Dady et al will allow many of these cell characteristics to be observed using live imaging. For example, the morphology of neural progenitors during and after cell divisions, how the apical and basal processes and membranes are divided, and how the actin cytoskeleton helps to regulate these processes.

      Importantly, PSC lines can be very heterogeneous, making generating reporter lines costly and time intensive. The use of these reporters with lipofectamine transfection, for a mosaic labelling, allows the visualisation of the plasma membrane, nucleus and cytoskeleton in any human PSC/NPC line, or even in human tissue cultures, without the need to generate each specific reporter line, making it a valuable tool for many labs in the field.

  3. Jun 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Comments for the authors of Review Commons Manuscript RC-2024-02804:

      The author of the Review Commons manuscript "Antigen flexibility supports the avidity of hemagglutinin-specific antibodies at low antigen densities", present their recent work evaluating hemagglutinin interactions with cellular receptors and antibodies. This manuscript focuses specifically on the avidity of the hemagglutinin using a fluorescence-based assay to measure dissociation kinetics and steady-state binding of antibodies to virions. Their findings confirm that bivalent interactions can offset weak monovalent affinity and that HA ectodomain flexibility is an additional determinant of antibody avidity. These findings are key for our understanding of neutralizing antibodies. Below are some comments that I would like the authors to address as they revise the manuscript.

      Comments:

      1. Can the authors provide justification for the two influenza viruses that they used.

      We selected the lab-adapted IAV strains A/WSN/1933 (H1N1) and A/Hong Kong/1968 (H3N2) for this work because they are well-studied, including in the context of the antibodies used here, S139/1 and C05. While both antibodies bind to more contemporary H3N2 strains, they no not bind to HA from pandemic H1N1. Another feature of these strains is that their HAs have high enough affinity to both antibodies to enable strong signal in our imaging assays. This context for our strain selection has been added in lines 85-88.

      1. The use of filamentous particles is a strength, but authors should detail the role of filamentous vs. spherical in nature and lab settings. This will help researchers that plan to repeat these assays.

      We have revised the text (lines 336-339) to include more context on the biology of filamentous and spherical influenza viruses. In our experiments, HK68 naturally produces filaments in cell culture whereas WSN33 does not. To produce filaments artificially, we replace the M1 sequence from WSN33 with that of M1 from A/Udorn/1972, an H3N2 strain that is closely related to HK68.

      1. Did the authors add the Udorn M1 to the HK68 as well?

      Since HK68 naturally forms filaments, we did not introduce Udorn M1 into this strain. We note that the amino acid sequences of Udorn M1 and HK68 M1 differ only at position 167 (Alanine in Udorn, Threonine in HK68), and that this residue has previously been found to not correlate with virus morphology (10.1016/j.virol.2003.12.009).

      Reviewer #1 (Significance (Required)):

      This manuscript focuses specifically on the avidity of the hemagglutinin using a fluorescence-based assay to measure dissociation kinetics and steady-state binding of antibodies to virions. Thie findings confirm that bivalent interactions can offset weak monovalent affinity and that HA ectodomain flexibility is an additional determinant of antibody avidity. These findings are key for our understanding of neutralizing antibodies.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      In this study, Benegal et al. investigate the binding kinetics of HA-head-specific antibodies (S139/1 and C05) to intact influenza virus particles using a fluorescence microscopy-based technique to measure the dissociation rate (koff) of the antibodies. By applying their proposed equilibrium model for bivalent antibody binding to HA, the authors calculated the crosslinking rate (kx), which represents the rate at which a single-bound antibody crosslinks to an additional HA molecule. Their experiments revealed that antigen crosslinking significantly slows koff, reducing it by up to two orders of magnitude. The authors further utilized streptavidin-coated beads conjugated with biotinylated HA or biotinylated BSA at varying concentrations to control HA surface density. Their results demonstrated that the two tested HA-head-specific antibodies retained the ability to crosslink HAs even at ~10-fold lower HA surface densities. In a complementary experiment, they employed an HA-anchor-specific antibody to restrict HA flexibility, which led to reduced binding of S139/1 and C05 IgGs but not their Fab fragments. This finding suggests that HA flexibility, rather than density, is the primary determinant of antibody crosslinking and avidity. Overall, the authors present an innovative approach to elucidating the dissociation and crosslinking kinetics of antibodies targeting intact virions or nanoparticles. The study is well-designed, with alternative interpretations of the results carefully considered and addressed throughout. I have only a few minor comments and suggestions for clarification.

      Minor comments:

      1. In Figure 1, does the grey color of each IgG in panel C indicate the Fc domain? If so, please add the description of the colors to the figure legend. In fact, it may be better to explain all the colors used here (for HA1, HA2, Fab heavy chain, light chain, etc.).

      We have included this information in panel C and the caption for Figure 1.

      1. Under the section," Bivalent binding of S139/1 and C05 persists after ~10-fold reductions in HA surface densities", the beginning of the second paragraph writes, "For both S139/1 and C05 Fab, binding increases linearly with HA density, as expected for a monovalent interaction dictated by absolute HA availability rather than density (Fig. 3D). Interestingly, the same relationship is observed for S139/1 IgG."

      Visually, I think the same relationship also seems to hold for C05 IgG. Would it be better to perform some linear regression and report the R2 value for the fitting so that this assessment can be quantitative?

      We agree with the reviewer's point. In Figure 3 of the revised manuscript, we include the results from a linear regression analysis to make this assessment more quantitative.

      1. At the end of the same page, in the same paragraph, the authors mentioned, "In contrast to the IgG, Fab binding measured at twice the molar concentration of the IgG is nearly undetectable under these conditions, confirming the IgG binding is not occurring through monovalent interactions (Fig. S2E)." What are the conditions you are referring to? In Fig. S2E, there is only the Ab intensity for the Ab binding at 100% HA (and not the other percentages). For the Ab intensity of S139/1 Fab, what is the concentration of the Fab used in Figure 3D? Why could the intensity in this experiment for S139/1 Fab reach ~100,000, whereas that of the 8 nM in Fig. S2E can only reach ~20,000?

      To clarify this point, we have updated Figure 3 to include the antibody concentration used for each experiment. The experiments in Fig 3 are conducted approximately around the respective KD of each IgG or Fab to ensure both consistency and strong signal-to-noise. For S139/1, we use 4nM of IgG, and 25nM of Fab. In Fig S2E, we use a concentration of Fab fragments double to that of the IgG, to reach an equivalent concentration of binding sites and confirm that the IgG binding we see is indeed due to bivalent binding. In this case, we use 4nM of IgG and 8nM of Fab.

      1. Under the section, "Tilting of HA about its membrane anchor contributes to C05 and S139/1 avidity", in the second paragraph, the authors wrote, "If this is correct, we reasoned that avidity could be reduced by constraining tilting of the HA ectodomain. To test this hypothesis, we used FISW84, an antibody that binds to the HA anchor epitope and biases the ectodomain into a tilted conformation (Fig. 4B)."

      Can you use some computational models (maybe the same one you used for Figure 4A) to show that when an HA trimer is bounded by FISW84 Fabs, the tilting of HA is constrained? I think this will help substantiate the assertion above.

      This is an important point. The model that we employ in Figure 4A is suited to predicting the angles sampled by HAs when they are bound by an IgG antibody, but it does not take into consideration clashes with the viral membrane. It is these clashes that we predict based on published structures (reference 35 in the revised manuscript) will constrain HA tilting when FISW84 binds to the HA anchor. We have revised the text (Lines 247-249) to clarify these points.

      1. It would be good if you could mention the strain of HA used in the experiments in Figure 4 in the actual Figure as well (as supposed to just in the figure legend).

      We have added this information to Figure 4 in the revised manuscript.

      1. I do not see a method section for the structure-based model you used in Figure 4. In the text, you cited your previous study (ref 28) for the model, but it would be good to write about this briefly (and how you specifically apply the model in this study) in this current manuscript.

      We have updated the methods to include a subsection ("Geometric Model for Preferred Crosslinking Geometry") on how the structure-based model was set up, along with a corresponding visual in Fig S3 of the angles of freedom given.

      1. In Figure S1 panel D, what is the unit of the antibody concentration? Could you please add it to the graph legend?

      We have updated the figure (S1E in the revised manuscript) to include this information.

      Reviewer #2 (Significance (Required)):

      Previously, this group utilized the same fluorescence-based method to investigate the potency of anti-HA IgG1 antibodies in preventing viral entry versus egress, as well as the tendency of antibodies targeting different HA epitopes to crosslink two HA trimers in cis or in trans (He et al., J Virol, 2024). In this study, they extend their work by evaluating, in-depth, how the density and flexibility of hemagglutinin (HA) on the viral surface influence the binding avidity of anti-HA antibodies. Using two human IgG1 antibodies targeting the HA head, the authors demonstrate that these antibodies can crosslink two HA trimers in cis, even when the trimers are further apart than adjacent HAs. Notably, the study reveals that HA flexibility, rather than density, is the key determinant modulating antibody crosslinking. Even at a 10-fold reduced HA density compared to the original, the antibodies retained their ability to crosslink trimers.

      This study provides critical insights into the relationship between HA density, flexibility, and antibody function, adding to the broader understanding of antibody crosslinking-a topic frequently discussed in the field of influenza research. These findings could have significant implications for vaccine design, particularly for strategies involving the display of the HA ectodomain on nanoparticles, potentially guiding the development of more effective influenza vaccines. Furthermore, the broader relevance of these findings may extend to other viruses with similar structural and immunological properties.

      My expertise lies in the structural determination of antibody-antigen complexes in influenza and other pathogens. While I may not have sufficient expertise to evaluate specific technical details of the fluorescence-based methods employed, the authors have convincingly demonstrated the robustness of their experimental design and interpretation, supported by appropriate controls.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      SUMMARY In "Antigen flexibility supports the avidity of hemagglutinin-specific antibodies at low antigen densities", Benegal et al. develop a microscopy-based assay to measure dissociation of HA head-binding antibodies from intact virions. This assay allows the authors to explore the contribution of IgG bivalent avidity to antibody interaction with native virions, which is not accessible using other methods such as BLI. Using this assay, the authors further explore the effect of HA density on IgG avidity with engineered low-HA virions and then with artificial HA-coated microspheres. In addition to measuring antibody dissociation, the authors perform structural analyses to predict the conformational preferences of many HA IgGs from published structures. The authors conclude that low HA densities (down to ~10%) still support high avidity binding for the 2 IgGs tested, and thus there would be little evolutionary pressure for IAV to reduce the HA density as a strategy to evade immune recognition.

      MAJOR COMMENTS

      The data presented are generally convincing for the two antibodies tested, with some caveats listed below. I believe the microscopy technique is valuable and provides a significant contribution to the field, and I believe that the finding that avidity persists at low densities for IAV is compelling and worth communicating to other virologists. Overall, with the incorporation of the suggested major revisions, this manuscript represents a significant advancement in the field.

      A major limitation of the current study is the small number of antibodies tested. Two antibodies are quite few, particularly since this work attempts to generalize these observations with structural predictions of dozens or hundreds of HA antibodies. While I believe that the resilience of IgG binding to lower epitope densities is likely common to many HA antibodies (or antibodies in general), this work alone does not support this. To this end, the authors should acknowledge their limited sample size in the text or discussion and that the generalization to other antibodies is speculative. Alternatively, the authors could demonstrate with additional antibodies (such as F045-092 which is pointed out in Fig S3A and perhaps group 'i' antibodies according to Fig S3A).

      This is an important point, and we more explicitly acknowledge this limitation in lines 277-278.

      It seems to me lateral diffusion of HA in the viral membrane is an important discussion point that was missed in this manuscript. The authors should comment on what is known about the lateral mobility of HA on virions, and how this could impact the ability of an IgG to crosslink. The authors should comment about whether long range diffusion and/or short range "shuffling" of glycoproteins could contribute to crosslinking preferences of antibodies in addition to the tilt, which is the only movement discussed. As appropriate, the authors should then comment on how this may affect their interpretation of experiments using beads. In experiments on beads, there is certainly no lateral mobility of the HA trimers; what are the consequences of this on the analysis?

      We agree that this is an important consideration, and we have revised the manuscript (lines 296-298) to address these points. Briefly, we have previously performed fluorescence recovery after photobleaching of covalently labeled HA and NA on the surface of filamentous influenza particles (10.7554/eLife.43764; see Figure 1B of this reference for a representative example). This data indicates that long range diffusion does not seem to be occurring on the virion surface. Short range diffusion, or shuffling, has not been observed, but cannot be ruled out, and may increase conformations favorable to bivalent binding.

      Should the authors qualify the limitations in the scope of their experimental results and the system of choice (beads vs. virions) as described in my previous comments, I suggest three experiments that I believe are essential to support the authors' claims. Alternative to qualifying the limitations, two optional experiments are also listed that could support the authors' claims as they are - those require a more extensive experimental undertaking and are thus labeled [OPTIONAL].

      1) The photobleaching experiment shown in Figure S1A. I am concerned that measuring photobleaching in steady state conditions does not properly control for the experimental conditions. In steady state, bleached antibody could unbind and be replaced by fluorescent antibody that has diffused into the field of view. This should be more thoroughly controlled by irreversibly capturing antibody (such as with biotin) and imaging after excess antibody is washed away, or by some other method such as capturing and imaging virus that has been directly labeled with AF555. This should be possible using reagents and techniques already demonstrated by the authors.

      We have updated the supplemental information with a more rigorous control for photobleaching; the revised figures are shown in Fig S1A. In this experiment, fluorescent S139/1 IgG was bound to HK68 virions. The antibody was washed away, and the loss of fluorescence signal was imaged separately under two conditions: 1) Dissociation only; an image was collected at 0s and one at 60s. 2) Dissociation and photobleaching; an image was collected at a rate of 1 frame per second for 60 seconds. The difference between the endpoint intensities from both conditions is not statistically significant. This supports our conclusion that, in the absence of antibodies in solution that can exchange with those bound to virions, photobleaching does not make a detectable contribution to the loss of signal we observe in our antibody dissociation experiments.

      2) In imaging, the authors analyzed only filamentous virions because they exhibit the best signal to noise ratio, which is a reasonable technical simplification. However, this relies on the assumption that glycoprotein presentation is relatively constant between virions of different sizes. It would be helpful to perform some analysis of small virions in any movie where there is sufficient signal. This would support the assumption that rates for small virions are comparable to those of filaments in the same experiment. This should be possible by performing additional analysis on existing data, without requiring additional experiments.

      Thank you for calling our attention to a point that needs clarification. The analysis that was restricted to filaments was for the SEP-HA binding experiments (shown in Fig 3A&B). This was done in order to select only those particles that were not diffraction-limited, so that we could control for any systematic differences in size between the two populations by measuring HA signal per unit particle length. For the dissociation experiments (Fig 2), data was taken from all virions in the fields of view. For this analysis, the normalized dissociation curves were averaged in two ways to account for the potential discrepancy that the reviewer points out. In the first method, the average was taken with each virion equally weighted, while in the second method, the entire field of view was masked and normalized together. Both curves look very similar, suggesting that any potential differences between smaller virions and filaments are not enough to make a quantifiable difference in dissociation rate. A representative dissociation curve with both analyses shown side-by-side has been added in Figure S1B.

      3) In figure 3, C05 fab binding is used to assay HA content of the SEP HA virions. An additional method of confirming HA content that is more independent from the imaging assay would be beneficial, such as a Western blot to quantify HA relative to NP, NA, or M1 etc.

      We have used western blotting to quantify the amount of HA contained relative to M1 in each population. This new data is discussed in lines 163-168 of the revised manuscript and shown in Figure S2C. As noted in the revised text, western blot analysis suggests that the density of native HA is decreased to ~31% its normal level in SEP-HA virions, lower than the ~75% value determined via fluorescence microscopy. One possible reason for this disparity is the presence of virus-like particles in the SEP-HA sample that completely lack wildtype HA. These would be excluded from our imaging analysis but captured by the western blot.

      4) [OPTIONAL] In figure 4, it is depicted that FISW84 biases HA in a tilted conformation, and the authors reasonably propose the reduced flexibility discourages crosslinking by IgGs. From the modeling summarized in Figure S3A, are there any antibodies predicted to prefer crosslinking HA at the same angle FISW84 tilts the ectodomain? Would FISW84 enhance crosslinking by such an antibody?

      This is an interesting suggestion, and we have revised the manuscript (lines 247-249) to clarify our thinking on this point. Based on the structure of the FISW84 Fab (PDB ID 6HJQ), we conclude that binding of a single Fab fragment does not necessarily actively tilt the HA ectodomain in a specific direction. Rather, it restricts tilting in the direction that would cause a steric clash between the Fab and the membrane. As a result, HA can still sample a range of angles, but this range is no longer symmetrical about the ectodomain axis. By reducing the likelihood that two HA ectodomains would tilt towards each other at a favorable angle, we would expect all antibodies to be disadvantaged to some degree. A possible exception could be if three FISW84 Fab fragments manage to bind to a single HA trimer. In this case, the HA ectodomain would be forced to remain perpendicular to the membrane to accommodate them all. This would favor antibodies that prefer binding to HAs where the ectodomains are parallel to each other. In our analysis in Figure S3A, this includes primarily antibodies that bind to the HA central stalk, such as 31.b.09. However, we note that these antibodies may encounter barriers to bivalent binding that we do not consider here, including proximity to the FISW84 epitope and the high density of HA in the membrane.

      5) [OPTIONAL] In figure S3A, the authors display theoretical tilt and spacing preferences for many HA antibodies based on published structures. Interestingly, their group iii antibody is predicted to prefer greater spacing and tilt, and likewise the authors observe increased binding at lower densities (in figure 3E). It would be beneficial to the work to test group i antibodies (base binding) in the dissociation experiments. The behavior of a base binding antibody, particularly at low densities could reinforce the modeling performed for this work.

      This is an excellent suggestion which we are not currently able to pursue for technical reasons. In particular, it would be difficult to distinguish between increased binding of these antibodies at low antigen densities that is due to bivalent attachment (and thus reduced dissociation) versus increased accessibility of the epitope, which may be occluded at higher HA densities.

      The experiments are well explained and supported by methods that would enable reproducibility.

      The authors state "The statistical tests and the number of replicates used in specific cases are described in the figure legends" yet in many cases this information is absent. For the k values in fig 2D, some indication of error or confidence interval would be helpful.

      We have ensured that this information is included in each of the captions. Regarding the k values, formal error propagation is challenging due to the way the k values were derived. Specifically, these values were calculated by fitting the average of the three initial dissociation traces, rather than fitting each replicate individually and then averaging the rate constants. As a result, the usual methods for estimating confidence intervals or standard error of the mean are not directly applicable.

      MINOR COMMENTS

      o Some of the small details in fig 1A and fig S1 are lost due to small figure size - such as the sialic acid residues and lipid bilayer.

      We have resized the figure components.

      o Although described in the text, it could be helpful to incorporate into figure 2 why the BLI data is shown for S129 fab. Perhaps indicate in 2C that that curve is "too fast to accurately measure" and perhaps near the table in 2D indicate the blue data is from Lee et al. It may be fine to simply remove the BLI results from the figure and refer to them only in the discussion of the experiments. Even with the measured data, the difference between fab and IgG is striking enough to support the paper, and the BLI data may be more confusing in the figure than it adds.

      We have updated the caption for Figure 2D to clarify that binding between the S139/1 Fab and A/WSN/1933 HA is approaching the limit of detection in our assay, and that the additional rates are from Lee et al. We have also updated the table to make the presentation of the kinetic parameters more clear.

      o In figure 3A, better describe the fluorescent components in the fluorescent images in the legend.

      We have updated the caption for Figure 3A to describe the fluorescent components shown in the image. Specifically, the panel labeled 'HA' shows signal from a fluorescent FI6v3 scFv, while the panel labeled 'decoy' shows signal from the SEP-HA construct.

      o From personal experience, the flexibility of HA ectodomain can be significantly affected by how much of the membrane proximal linker region is retained or removed. Could the authors comment on how they chose the cutoff for their HA ectodomain used in the bead experiments and their rationale?

      This is an important point, and while the precise impact of the linker on HA flexibility remains uncertain, we agree that it may increase the freedom of motion of the ectodomain relative to the HA membrane anchor. We mention this caveat in the revised text (lines 188-191) and we have added an AlphaFold2 prediction of how our recombinant HA might look to Figure S2D.

      o In Figure S1B, if I understand correctly: black dashed line "IgG equivalent dissociation rate" is the experimental data, magenta "Crosslinking model fit" is the theoretically total antibody bound as described by the mathematical model. Then the gray lines "Double- /singly- bound antibodies plot the theoretical amount of antibody bound once and bound twice. If this is correct, I believe it would be clearer if the singly- and doubly- bound were plotted in separate colors, and that this is explained more clearly in the legend.

      We have revised the figure to show doubly- and singly-bound curves using different line styles.

      o Related to an earlier comment, if lateral diffusion may play a role, how might this differ between different types of antibodies?

      As mentioned in our previous response, we do not anticipate that lateral diffusion makes a significant contribution to antibody binding to the surface of virions, although it may be important on the cell surface.

      o Could the authors comment in the discussion on how their results on virions may translate to the surface of the infected cell, which is also decorated in viral glycoproteins? Early time points of infection could be an in vivo example of low-density HA. What extent may antibody binding and crosslinking affect viral proteins on the cell surface or the immune response?

      This is a very interesting point. Antibody binding to the infected cell surface has been shown to alter viral release and morphology, presumably at lower HA densities than those observed the viral surface. We have added a brief discussion of this point (lines 291-295) to the revised manuscript.

      o The github link in the methods is incorrect or not yet available.

      Thank you for noting this. We have updated the link.

      o Reference 1 has an incorrect or expired link.

      These references have been updated.

      Reviewer #3 (Significance (Required)):

      • This work represents a conceptual advance in our understanding of antibody action on viral pathogens. The authors adapt existing microscopy methodologies to measure antibody avidity in a new way that is better representative of in vivo conditions.

      • To my knowledge, this is the first instance of direct measurement of antibody off-rates from intact virus particles, instead of immobilized protein as in BLI, SPR, or interferometry.

      • This work should be of interest to virologist and biophysicists interested in the cooperative binding of antibodies and the relation of virus structural organization to antibody recognition. Immunologist may also be influenced by this work. This work may be followed up by other researchers similarly measuring the association and dissociation rates of antibodies with single virions, or otherwise comparing fab to IgG binding to gain insight into when crosslinking is or is not occurring.

      • Reviewer expertise: Single-virion imaging, protein complexes, biochemistry, influenza A.

      • I do not have sufficient expertise to evaluate the mathematical models and differential equations for modeling the k-on and k-off rates.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In "Antigen flexibility supports the avidity of hemagglutinin-specific antibodies at low antigen densities", Benegal et al. develop a microscopy-based assay to measure dissociation of HA head-binding antibodies from intact virions. This assay allows the authors to explore the contribution of IgG bivalent avidity to antibody interaction with native virions, which is not accessible using other methods such as BLI. Using this assay, the authors further explore the effect of HA density on IgG avidity with engineered low-HA virions and then with artificial HA-coated microspheres. In addition to measuring antibody dissociation, the authors perform structural analyses to predict the conformational preferences of many HA IgGs from published structures. The authors conclude that low HA densities (down to ~10%) still support high avidity binding for the 2 IgGs tested, and thus there would be little evolutionary pressure for IAV to reduce the HA density as a strategy to evade immune recognition.

      Major comments

      The data presented are generally convincing for the two antibodies tested, with some caveats listed below. I believe the microscopy technique is valuable and provides a significant contribution to the field, and I believe that the finding that avidity persists at low densities for IAV is compelling and worth communicating to other virologists. Overall, with the incorporation of the suggested major revisions, this manuscript represents a significant advancement in the field.

      A major limitation of the current study is the small number of antibodies tested. Two antibodies are quite few, particularly since this work attempts to generalize these observations with structural predictions of dozens or hundreds of HA antibodies. While I believe that the resilience of IgG binding to lower epitope densities is likely common to many HA antibodies (or antibodies in general), this work alone does not support this. To this end, the authors should acknowledge their limited sample size in the text or discussion and that the generalization to other antibodies is speculative.

      Alternatively, the authors could demonstrate with additional antibodies (such as F045-092 which is pointed out in Fig S3A and perhaps group 'i' antibodies according to Fig S3A).

      It seems to me lateral diffusion of HA in the viral membrane is an important discussion point that was missed in this manuscript. The authors should comment on what is known about the lateral mobility of HA on virions, and how this could impact the ability of an IgG to crosslink. The authors should comment about whether long range diffusion and/or short range "shuffling" of glycoproteins could contribute to crosslinking preferences of antibodies in addition to the tilt, which is the only movement discussed. As appropriate, the authors should then comment on how this may affect their interpretation of experiments using beads. In experiments on beads, there is certainly no lateral mobility of the HA trimers; what are the consequences of this on the analysis?

      Should the authors qualify the limitations in the scope of their experimental results and the system of choice (beads vs. virions) as described in my previous comments, I suggest three experiments that I believe are essential to support the authors' claims. Alternative to qualifying the limitations, two optional experiments are also listed that could support the authors' claims as they are - those require a more extensive experimental undertaking and are thus labeled [OPTIONAL].

      1. The photobleaching experiment shown in Figure S1A. I am concerned that measuring photobleaching in steady state conditions does not properly control for the experimental conditions. In steady state, bleached antibody could unbind and be replaced by fluorescent antibody that has diffused into the field of view. This should be more thoroughly controlled by irreversibly capturing antibody (such as with biotin) and imaging after excess antibody is washed away, or by some other method such as capturing and imaging virus that has been directly labeled with AF555. This should be possible using reagents and techniques already demonstrated by the authors.
      2. In imaging, the authors analyzed only filamentous virions because they exhibit the best signal to noise ratio, which is a reasonable technical simplification. However, this relies on the assumption that glycoprotein presentation is relatively constant between virions of different sizes. It would be helpful to perform some analysis of small virions in any movie where there is sufficient signal. This would support the assumption that rates for small virions are comparable to those of filaments in the same experiment. This should be possible by performing additional analysis on existing data, without requiring additional experiments.
      3. In figure 3, C05 fab binding is used to assay HA content of the SEP HA virions. An additional method of confirming HA content that is more independent from the imaging assay would be beneficial, such as a Western blot to quantify HA relative to NP, NA, or M1 etc.
      4. [OPTIONAL] In figure 4, it is depicted that FISW84 biases HA in a tilted conformation, and the authors reasonably propose the reduced flexibility discourages crosslinking by IgGs. From the modeling summarized in Figure S3A, are there any antibodies predicted to prefer crosslinking HA at the same angle FISW84 tilts the ectodomain? Would FISW84 enhance crosslinking by such an antibody?
      5. [OPTIONAL] In figure S3A, the authors display theoretical tilt and spacing preferences for many HA antibodies based on published structures. Interestingly, their group iii antibody is predicted to prefer greater spacing and tilt, and likewise the authors observe increased binding at lower densities (in figure 3E). It would be beneficial to the work to test group i antibodies (base binding) in the dissociation experiments. The behavior of a base binding antibody, particularly at low densities could reinforce the modeling performed for this work.

      The experiments are well explained and supported by methods that would enable reproducibility.

      The authors state "The statistical tests and the number of replicates used in specific cases are described in the figure legends" yet in many cases this information is absent. For the k values in fig 2D, some indication of error or confidence interval would be helpful.

      Minor Comments

      • Some of the small details in fig 1A and fig S1 are lost due to small figure size - such as the sialic acid residues and lipid bilayer.
      • Although described in the text, it could be helpful to incorporate into figure 2 why the BLI data is shown for S129 fab. Perhaps indicate in 2C that that curve is "too fast to accurately measure" and perhaps near the table in 2D indicate the blue data is from Lee et al. It may be fine to simply remove the BLI results from the figure and refer to them only in the discussion of the experiments. Even with the measured data, the difference between fab and IgG is striking enough to support the paper, and the BLI data may be more confusing in the figure than it adds.
      • In figure 3A, better describe the fluorescent components in the fluorescent images in the legend.
      • From personal experience, the flexibility of HA ectodomain can be significantly affected by how much of the membrane proximal linker region is retained or removed. Could the authors comment on how they chose the cutoff for their HA ectodomain used in the bead experiments and their rationale?
      • In Figure S1B, if I understand correctly: black dashed line "IgG equivalent dissociation rate" is the experimental data, magenta "Crosslinking model fit" is the theoretically total antibody bound as described by the mathematical model. Then the gray lines "Double-/singly- bound antibodies plot the theoretical amount of antibody bound once and bound twice. If this is correct, I believe it would be clearer if the singly- and doubly-bound were plotted in separate colors, and that this is explained more clearly in the legend.
      • Related to an earlier comment, if lateral diffusion may play a role, how might this differ between different types of antibodies?
      • Could the authors comment in the discussion on how their results on virions may translate to the surface of the infected cell, which is also decorated in viral glycoproteins? Early time points of infection could be an in vivo example of low-density HA. What extent may antibody binding and crosslinking affect viral proteins on the cell surface or the immune response?
      • The github link in the methods is incorrect or not yet available.
      • Reference 1 has an incorrect or expired link.

      Significance

      • This work represents a conceptual advance in our understanding of antibody action on viral pathogens. The authors adapt existing microscopy methodologies to measure antibody avidity in a new way that is better representative of in vivo conditions.
      • To my knowledge, this is the first instance of direct measurement of antibody off-rates from intact virus particles, instead of immobilized protein as in BLI, SPR, or interferometry.
      • This work should be of interest to virologist and biophysicists interested in the cooperative binding of antibodies and the relation of virus structural organization to antibody recognition. Immunologist may also be influenced by this work. This work may be followed up by other researchers similarly measuring the association and dissociation rates of antibodies with single virions, or otherwise comparing fab to IgG binding to gain insight into when crosslinking is or is not occurring.
      • Reviewer expertise: Single-virion imaging, protein complexes, biochemistry, influenza A.
      • I do not have sufficient expertise to evaluate the mathematical models and differential equations for modeling the k-on and k-off rates.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this study, Benegal et al. investigate the binding kinetics of HA-head-specific antibodies (S139/1 and C05) to intact influenza virus particles using a fluorescence microscopy-based technique to measure the dissociation rate (koff) of the antibodies. By applying their proposed equilibrium model for bivalent antibody binding to HA, the authors calculated the crosslinking rate (kx), which represents the rate at which a single-bound antibody crosslinks to an additional HA molecule. Their experiments revealed that antigen crosslinking significantly slows koff, reducing it by up to two orders of magnitude.

      The authors further utilized streptavidin-coated beads conjugated with biotinylated HA or biotinylated BSA at varying concentrations to control HA surface density. Their results demonstrated that the two tested HA-head-specific antibodies retained the ability to crosslink HAs even at ~10-fold lower HA surface densities. In a complementary experiment, they employed an HA-anchor-specific antibody to restrict HA flexibility, which led to reduced binding of S139/1 and C05 IgGs but not their Fab fragments. This finding suggests that HA flexibility, rather than density, is the primary determinant of antibody crosslinking and avidity.

      Overall, the authors present an innovative approach to elucidating the dissociation and crosslinking kinetics of antibodies targeting intact virions or nanoparticles. The study is well-designed, with alternative interpretations of the results carefully considered and addressed throughout. I have only a few minor comments and suggestions for clarification.

      Minor comments:

      1. In Figure 1, does the grey color of each IgG in panel C indicate the Fc domain? If so, please add the description of the colors to the figure legend. In fact, it may be better to explain all the colors used here (for HA1, HA2, Fab heavy chain, light chain, etc.).
      2. Under the section," Bivalent binding of S139/1 and C05 persists after ~10-fold reductions in HA surface densities", the beginning of the second paragraph writes, "For both S139/1 and C05 Fab, binding increases linearly with HA density, as expected for a monovalent interaction dictated by absolute HA availability rather than density (Fig. 3D). Interestingly, the same relationship is observed for S139/1 IgG."

      Visually, I think the same relationship also seems to hold for C05 IgG. Would it be better to perform some linear regression and report the R2 value for the fitting so that this assessment can be quantitative? 3. At the end of the same page, in the same paragraph, the authors mentioned, "In contrast to the IgG, Fab binding measured at twice the molar concentration of the IgG is nearly undetectable under these conditions, confirming the IgG binding is not occurring through monovalent interactions (Fig. S2E)." What are the conditions you are referring to? In Fig. S2E, there is only the Ab intensity for the Ab binding at 100% HA (and not the other percentages). For the Ab intensity of S139/1 Fab, what is the concentration of the Fab used in Figure 3D? Why could the intensity in this experiment for S139/1 Fab reach ~100,000, whereas that of the 8 nM in Fig. S2E can only reach ~20,000? 4. Under the section, "Tilting of HA about its membrane anchor contributes to C05 and S139/1 avidity", in the second paragraph, the authors wrote, "If this is correct, we reasoned that avidity could be reduced by constraining tilting of the HA ectodomain. To test this hypothesis, we used FISW84, an antibody that binds to the HA anchor epitope and biases the ectodomain into a tilted conformation (Fig. 4B)."

      Can you use some computational models (maybe the same one you used for Figure 4A) to show that when an HA trimer is bounded by FISW84 Fabs, the tilting of HA is constrained? I think this will help substantiate the assertion above. 5. It would be good if you could mention the strain of HA used in the experiments in Figure 4 in the actual Figure as well (as supposed to just in the figure legend). 6. I do not see a method section for the structure-based model you used in Figure 4. In the text, you cited your previous study (ref 28) for the model, but it would be good to write about this briefly (and how you specifically apply the model in this study) in this current manuscript. 7. In Figure S1 panel D, what is the unit of the antibody concentration? Could you please add it to the graph legend?

      Significance

      Previously, this group utilized the same fluorescence-based method to investigate the potency of anti-HA IgG1 antibodies in preventing viral entry versus egress, as well as the tendency of antibodies targeting different HA epitopes to crosslink two HA trimers in cis or in trans (He et al., J Virol, 2024). In this study, they extend their work by evaluating, in-depth, how the density and flexibility of hemagglutinin (HA) on the viral surface influence the binding avidity of anti-HA antibodies. Using two human IgG1 antibodies targeting the HA head, the authors demonstrate that these antibodies can crosslink two HA trimers in cis, even when the trimers are further apart than adjacent HAs. Notably, the study reveals that HA flexibility, rather than density, is the key determinant modulating antibody crosslinking. Even at a 10-fold reduced HA density compared to the original, the antibodies retained their ability to crosslink trimers.

      This study provides critical insights into the relationship between HA density, flexibility, and antibody function, adding to the broader understanding of antibody crosslinking-a topic frequently discussed in the field of influenza research. These findings could have significant implications for vaccine design, particularly for strategies involving the display of the HA ectodomain on nanoparticles, potentially guiding the development of more effective influenza vaccines. Furthermore, the broader relevance of these findings may extend to other viruses with similar structural and immunological properties.

      My expertise lies in the structural determination of antibody-antigen complexes in influenza and other pathogens. While I may not have sufficient expertise to evaluate specific technical details of the fluorescence-based methods employed, the authors have convincingly demonstrated the robustness of their experimental design and interpretation, supported by appropriate controls.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Comments for the authors of Review Commons Manuscript RC-2024-02804:

      The author of the Review Commons manuscript "Antigen flexibility supports the avidity of hemagglutinin-specific antibodies at low antigen densities", present their recent work evaluating hemagglutinin interactions with cellular receptors and antibodies. This manuscript focuses specifically on the avidity of the hemagglutinin using a fluorescence-based assay to measure dissociation kinetics and steady-state binding of antibodies to virions. Thie findings confirm that bivalent interactions can offset weak monovalent affinity and that HA ectodomain flexibility is an additional determinant of antibody avidity. These findings are key for our understanding of neutralizing antibodies. Below are some comments that I would like the authors to address as they revise the manuscript.

      Comments:

      1. Can the authors provide justification for the two influenza viruses that they used.
      2. The use of filamentous particles is a strength, but authors should detail the role of filamentous vs. spherical in nature and lab settings. This will help researchers that plan to repeat these assays.
      3. Did the authors add the Udorn M1 to the HK68 as well?

      Significance

      This manuscript focuses specifically on the avidity of the hemagglutinin using a fluorescence-based assay to measure dissociation kinetics and steady-state binding of antibodies to virions. Thie findings confirm that bivalent interactions can offset weak monovalent affinity and that HA ectodomain flexibility is an additional determinant of antibody avidity. These findings are key for our understanding of neutralizing antibodies.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In Morris, M.J. et al., the authors strive to better understand the roles for the microcephaly protein WDR62 in brain growth and function. To accomplish this, an in situ biotinylation assay was performed and identified 42 proteins proximal to WDR62 including the Hsp70 co-chaperone BAG2. Through a series of co-immunoprecipitation assays, the BAG2-WDR62 interaction was shown to be mediated through the structured N-terminal region of WDR62, and it was proposed that common WDR62 mutations disrupt this interaction. In AD293 cells, loss of WRD62 expression resulted in an increase in the expression of BAG2 expression while reducing HPRT expression. Subsequent loss of BAG2 expression by siRNA treatment restored the expression of HPRT suggesting that there is an association between the stability of HPRT and BAG2, likely mediated through its proposed association with Hsp70/90 molecular chaperones. Finally, the authors investigate the subcellular localization and ability of WRD62 to phase separate. WRD62 was shown to form discrete condensates induced by sorbitol-mediated hyperosmotic stress. The formation of WRD62 granules are hypothesized to be driven by cell volume exclusion and macromolecular crowding. These granules appear similar, both in physical appearance and characteristics, to other reported biomolecular condensates such as those reported in metabolism (e.g. purinosomes). WRD62-containing condensates were shown to colocalize with enzymes in de novo purine biosynthesis; however, this association is not required for purinosome formation and/or its stability under both purine-depleted and sorbitol-driven growth conditions. Overall, the manuscript provided a previously unrealized and exciting association between WDR62 and purine metabolism.

      EVIDENCE, REPRODUCIBILITY AND CLARITY Summary: The current manuscript reads as multiple manuscripts with findings that are at times weakly connected (in my opinion). For example, I had a hard time understanding how the BioID results relate to the discovery of WRD62 phase-separation and its colocalization with purinosomes. I would strongly encourage the authors to consider dividing the results into separate manuscripts to strengthen their claims and create a more focused and cohesive manuscript (or series of manuscripts). I believe then several of my reservations associated with the current manuscript will be addressed, and in my opinion, the hard work from the authors will be better received across the scientific community.

      Response: We thank Reviewer #1 for acknowledging the novelty of our work and appreciate the constructive feedback regarding the lack of integration among individual findings. In response, we have removed content related to condensate formation and conducted additional experiments to more thoroughly characterize the mechanisms of WDR62 interaction. These new data, along with revisions to the manuscript text, have strengthened the coherence of our findings. We believe the revised manuscript now presents a more unified narrative, highlighting the complex roles of WDR62 in regulating purine metabolism.

      I would like to commend the authors for all the work that went into the current version of the manuscript. Being part of a biochemistry and cell biology research group, I completely understand how much time and effort must have went into generating these data. That being said, I felt that there were several instances where clarification and additional information is warranted to arrive at the conclusions made by the authors. These points are outlined below.

      Major Comments:

      1. There appears to be a discrepancy between the data presented in Figure 1 and what is stated in the main text. Clarification is necessary to better understand the results:

      • The following statement (and derivatives of it) are repeated throughout the manuscript: "...we found that the WDR62 interactome comprised molecular chaperones such as HSP70, HSP90, and their co-regulators, BAG2, STIP1, and DNAJC7" (lines 91-93, 316-318, 422-425). STIP1 and DNAJC7 were not identified in the list of 42 proximal proteins to WDR62 (Figure 1D). DNAJC7 was included because of a previous report curated in the BioGRID database, and there is no mention of HSP90 in the data produced in Figure 1. Please revise the main text to reflect the data that was generated.

      Response: We thank the reviewer for this valid point and highlighting the instances where our description of results did not accurately reflect the data generated. We have reworded the relevant sections (e.g. lines 105-107) in our revised manuscript to better delineate interactors identified in BioID studies (BAG2) as opposed to those previously reported on protein interaction databases such as BioGRID (DNAJC7).

      Based on the data presented in the Venn Diagrams in Figure 1D, the author's numbers do not seem to be consistent with the sentence on lines 126-128. I count 37 proteins unique to their BioID study, 90 unique to the BioGRID database, and 5 proteins that overlap between the two data sets. This sentence needs to be revised.

      Response: We thank the reviewer for pointing out this inconsistency. There were 95 protein interactors of WDR62 from BioGRID while we identified 42 proteins in our BioID study with 5 proteins overlapping. We have revised the main text (lines 144-146) and Fig. 1D to accurately reflect the protein numbers identified.

      What data were used to generate the interaction map in Figure 1I? Enzymes tied to purine metabolism were not identified from the data presented in Figure 1D but have now appeared. A discussion of this in the main text is warranted.

      Response: We generated the interaction map in Fig. 1I using STRING to visualise WDR62 protein-protein interactions derived from both the BioGRID database and our BioID analysis. As the reviewer correctly points out, purine metabolic enzymes were not direct interactors of WDR62. Purine enzymes are linked to the molecular chaperones which, in turn, associated with WDR62 from our BioID analysis. The links between purine enzymes and chaperones were obtained from the BioGRID database. In response to this feedback, we have revised our manuscript to include a more detailed description of how the interaction map in Fig. 1I was generated, both in the main text (lines 148-157) and the legend for Figure 1. The BioGRID interactions between heat shock proteins and purine enzymes were introduced in the manuscript text at lines 264-266.

      1. This reviewer has several reservations on how the various key players in the manuscript are related to substantiate the conclusions made in the manuscript. For instance, how is HPRT, purinosomes, and WDR62 related? How about HSP90, WRD62, and HPRT? Pairwise connections were made throughout the manuscript; however, trying to tie all three together is difficult with the data presented.

      • The authors tried to connect HPRT, purinosomes, and WDR62 with BAG2; however, this study could greatly improve if we understood how a knockdown of BAG2 impacts purinosome formation and/or WDR62 colocalization with purinosome enzymes.

      Response: We have incorporated additional experiments in our revised manuscript to better connect HPRT, WDR62 and BAG2. Using proximity ligation assays (PLA) we demonstrated endogenous interactions between WDR62 and BAG2 (Fig. 4K), as well as between WDR62/HPRT and BAG2/HPRT (Fig. 6I-J). The interaction between BAG2 and HPRT was decreased in WDR62 KO cells (Fig. 6J), and recent experiments revealed that BAG2 depletion similarly disrupted the WDR62/HPRT interaction. These findings suggest that WDR62 expression, and presumably its interaction with BAG2, is necessary for BAG2-mediated regulation of HPRT.

      Further, we found that the loss of HPRT expression in WDR62 KO cells was reversed by siRNA depletion of BAG2 (Fig. 6K), supporting our model in which elevated BAG2 levels in the absence of WDR62 promote aberrant HPRT degradation. Collectively, our results suggest that proper BAG2 regulation of HPRT requires WDR62.

      To address the reviewer’s suggestion, we also examined WDR62 cytoplasmic localisation following BAG2 depletion and found that BAG2 was not required for WDR62 to form granules in response to osmotic stress. We also show that BAG2 is not responsible for purinosome assembly or for the subcellular distribution/localisation of HPRT.

      Is HPRT a client of HSP90? And how are WRD62 and HSP90 related since they do not associated (based on your BioID data)? These connections would again strengthen the arguments made in the manuscript and help to explain the HSP90 inhibition data presented in Figures 7F and 7G.

      Response: Although our BioID data did not explicitly identify an association between WDR62 and HSP90, we initially focused on HSP90 due to the established role of BAG2 in protein misfolding and degradation through its interaction with HSP90 (doi: 10.7150/thno.78492). We hypothesised that while WDR62 may not directly interact with HSP90, its interaction with BAG2 could provide an indirect link. To strengthen our conclusions and address the limitations of our HSP90 inhibition data (NVP-AUY922), we performed additional experiments using a second HSP90 inhibitor (17-AAG) and an HSP70 inhibitor (MKT-077) across both short (1 h) and long (24 h) treatment durations (Fig 6 and Fig S10). Further details are provided in our response below to minor comment #1.

      Caution is warranted when making conclusions about WDR62 (and its granules) and purinosomes.

      Response: We acknowledge the reviewer’s feedback and have revised our manuscript to focus on the functional characterisation of WDR62 interaction and co-localisation with BAG2 and related HSP co-chaperones. As part of this revision, we removed the FRAP studies and sections discussing WDR62 phase separation and purinosome assembly (further details below). Additionally, we have softened out description of cytoplasmic WDR62 granules as purinosomes. Instead, we describe WDR62 as forming dynamic puncta containing purine enzymes and discuss the possibility that these granules may represent or overlap with bona fide purinosomes.

      The authors describe the association between WDR62 and purinosomes differently throughout the text. I would recommend that the authors come to some conclusion about this and be consistent.

      Response: We thank reviewer #1 for pointing out inconsistencies in our conclusions regarding WDR62 and purinosomes between sections of our manuscript. We have revised our manuscript to ensure our description of these findings are consistent throughout. Specifically, our findings show that WDR62 responds to osmotic and metabolic stress by forming dynamic cytoplasmic granules that share many protein components with purinosomes (Fig. 5). This suggests that WDR62 may be a novel component of bona fide purinosomes or that WDR62 granules substantially overlap with purinosomes both spatially and compositionally. However, the formation of granules by purine enzymes was not perturbed by WDR62 KO (Fig. S6). Thus, we conclude that while WDR62 colocalized with purine enzyme containing granules consistent with purinosomes in response to cell stress, WDR62 was not required for granule formation by purine enzymes such as PFAS and PPAT.

      A. (Lines 339-340) "WDR62 granules represent or overlap substantially with the phase-separated metabolons known as purinosomes". Based on the data presented, it appears that these might still be different entities but either overlap or have similar components. Purinosome localization with mitochondria (approx 60-80%) and microtubules (approx 90-95%) were significantly higher than those reported for WDR62 granules (approx 40%). This comparison would suggest that not all WDR62 granules behave similarly to purinosomes. And from the dot plot in Figure 3G, about half of the characterized WDR62 granules do not align with the previously reported characteristics of purinosomes.

      Response: In Fig. 3G, we measured the diameter and distribution of WDR62 granules and found their size and number per cell closely matched those reported for BAG2 condensates (doi: 10.1038/s41467-022-30751-4). This aligns with our findings that WDR62 interacts with BAG2 and is recruited to similar subcellular compartments. The reviewer correctly notes that WDR62 granules only partially align with previously reported characteristics of purinosomes, suggesting that they may be distinct entities. Our revised manuscript acknowledges this possibility while also emphasising that WDR62 granules share features and colocalise with many purinosome components. To enhance the focus and clarity of the manuscript, we have removed Fig. 3G as the diameter and number of WDR62 granules are already reported in Fig. 3F.

      In the abstract and introduction, the authors state that WDR62 is being recruited to the purinosome and leave out the other possibility. I would recommend that the authors soften this claim in these sections because of the above possibility but also the lack of characterization of the sorbitol-induced "purinosomes". There is little discussion or evidence for how sorbitol induces purinosome formation. Is de novo purine biosynthesis activated upon sorbitol treatment? Are multiple de novo purine biosynthetic enzymes present in the sorbitol-induced "purinosomes"? Further, I agree that there is a tendency for WDR62 to associate with condensates that bear an enzyme within de novo purine biosynthesis; however many of these proteins are known to self-aggregate upon cell stress. Therefore, the entities that are being observing and called purinosomes might not be bone fide purinosomes. Additional care is necessary to make these statements. In my opinion, the current data only suggests that this might be a possibility.

      Response: As indicated, we have softened our claim that stress-induced WDR62 granules represented bona fide purinosomes. Fig. 3 of our revision more precisely describes the characteristics of WDR62 granules while Fig. 4 now reports on the co-localisation of WDR62 granules with protein chaperones and de novo purine synthesis enzymes typically associated with purinosomes. We now conclude that WDR62 may be associated with purinosomes but may also represent distinct entities with shared components and characteristics. Notably, proteins such as BAG2 and PFAS may undergo phase separation in response to stress independently of purinosome assembly.

      In additional work conducted for our revised manuscript, we find that WDR62 loss reduced rates of purine synthesis in cells cultured in the presence of purines (Fig. 5) but was not involved in de novo purine biosynthesis under purine-depleted conditions (Fig. S9). This was consistent with the finding that WDR62 loss did not prevent stress-induced formation of PFAS or PPAT granules (Fig. S6) which are likely to represent purinosomes. We concede that additional investigation is required to determine the functional significance of WDR62 granules in response to stress stimuli and purine depletion.

      (Lines 325-329) The authors reference a previous manuscript demonstrating that co-chaperones co-cluster with purinosomes. Based on this fact, they infer that WDR62 granules might represent purinosomes since WDR62 interacts with these same set of co-chaperones. These co-chaperones interact with a large number of different proteins (in fact, most kinases), so it is uncertain how the authors decided to go down this path to link purine metabolism with WDR62. Discussion of how this connection was made would help elevate the story. What additional insights did they have that lead them down these investigations?

      Response: BAG2 functions as a co-chaperone that regulates the activity of HSP70/90. While the reviewer correctly points out that co-chaperones such as BAG2 have a broad number of clients, numerous studies have established the role of HSP70/90 in purine metabolism (e.g. doi: 10.1016/j.isci.2020.101058, 10.1073/pnas.1300173110) and in neurodevelopment (10.3389/fnins.2018.00821). Moreover, purines are critical for normal brain development and dysregulation is well known to lead to congenital defects including microcephaly. As such, when we identified a role for WDR62 in the chaperone network through interaction with BAG2, it was not a leap to hypothesise that neurometabolic defects stemming from dysregulated purine production or salvage might be involved in WDR62-associated microcephaly.

      Indeed, we show that WDR62 are localised with purine enzymes in response to purine-depletion and that WDR62 depletion leads to metabolic dysregulation. WDR62 has several binding partners with multiple cellular functions, and we do not exclude alternative mechanisms involved in cortical development. However, the mechanistic link with heat shock proteins and purine metabolism is a novel one that would be of broad interest in molecular neurodevelopmental biology. On this feedback, we have revised main text (lines 214-218, lines 260-263, lines 292-295, lines 378-383) to better explain the rationale underlying our experiments and overall study focus.

      If WDR62 is not required for purinosome formation, why would it localize with the purinosome? Is there any hypothesis that could be readily tested to better help understand this observation? Providing a better understanding of this would greatly elevate the work.

      Response: Given the role of HSP70/90 in purinosome assembly and the interaction of WDR62 with BAG2, and purine enzymes PFAS and PPAT, we were initially surprised that WDR62 depletion did not affect stress-induced PFAS and PPAT granule formation (Fig. S6). At the time of writing the original manuscript, we interpreted these granules as purinosomes. However, it remains possible that WDR62 might have a function in purine synthesis or in purinosome assembly that remains unidentified. Indeed, we have not yet tested different cell types or additional conditions that induce purinosome formation or determined the localisation or activity of other purine synthesis enzymes. Thus, we concede our conclusions on WDR62 and purinosome formation were premature.

      As our revised manuscript is now focused on the WDR62-BAG2-HPRT interaction and given the reviewer’s prior comment that PFAS and PPAT colocalization in granules may not represent purinosomes in all contexts, we acknowledge that potential WDR62 functions in purinosomes warrants further investigation beyond this study. In the revised discussion (lines 473-497) we address these limitations and propose alternative interpretations of our findings.

      (OPTIONAL) Please validate that the associations between WDR62 and the purine biosynthetic enzymes occur on the endogenous level (void of transient transfection). Many methods such as immunofluorescence and proximity ligation assays have been used by others to demonstrate protein-purinosome interactions. This result would reduce any concern that the association is a result of overexpression (artifact).

      Response: As suggested, we conducted proximity ligation assays (PLA) to validate endogenous interactions between WDR62 and BAG2, HPRT, and PFAS (Fig. 4K, Fig. 6I-K). Notably, sorbitol treatment increased the interaction between WDR62 and HPRT (Fig. 6H, I), supporting the role of WDR62 in regulating HPRT under cellular stress. Additionally, WDR62 deletion appear to reduce the interaction between BAG2 and HPRT (Fig. 6K), while BAG2 depletion similarly reduces the interaction between WDR62 and HPRT (Fig. 6J). These findings support a model in which WDR62 and BAG2 cooperatively regulate HPRT stability.

      Figures 6F and 6G conclude that nucleosides from purine-depleted growth conditions accumulate while the corresponding monophosphates do not change between WRD62 knock-out and wildtype cells. Given that purine-depleted growth conditions activate de novo purine biosynthesis (uncertain if this has been demonstrated in AD293 cells), could this result simply demonstrate that purine salvage is no longer used and the nucleosides have accumulated and are awaiting degradation (or exportation) rather than a loss of HPRT expression as inferred from the stated conclusions? The conclusions could be better substantiated with the use of a stable isotope incorporation assay.

      Is there a difference in the contribution of de novo purine biosynthesis and purine salvage to the generation of the monophosphates (AMP, GMP) between WDR62 knockout and wildtype AD293 cells? Use of a stable isotope (such as 15N-glutamine) could help to come to the appropriate conclusion.

      __Response: __We thank the reviewer for this helpful suggestion to better characterize WDR62-dependent purine defects in more detail. In our revised manuscript we performed targeted metabolomics experiments and tracked the incorporation of 13C2-glycine and 13C5-hypoxanthine into purine nucleosides to assess purine synthesis and purine salvage flux between WT and WDR62 KO cells (n=5). Indeed, purine nucleotides in KO cells showed a significant loss of incorporation of 13C2 from 13C2-glycine, consistent with impaired de novo synthesis in cells cultured in presence of purines. In contrast, labelling from 13C5-hypoxanthine showed no overt differences between WT and KO cells, suggesting that incorporation via the salvage pathway is not grossly altered under these conditions. We have subsequently added a section to the discussion (lines 498-521) to discuss these results which suggest that the reduced HPRT levels in KO cells may be sufficient to sustain rates of purine salvage which are not altered with WDR62 loss. Thus, the accumulation of nucleosides is most likely due to increase conversion from monophosphates or reduced degradation to uric acid. Nonetheless, we show that WDR62 is required for purine synthesis under basal conditions and has a complex role in regulating purine metabolism.

      (Lines 483-485) If there is a change in de novo purine biosynthesis, are there any detectable changes in AICAR levels that might influence purine metabolism at the transcriptional level?

      __Response: __This remains a possibility. However, we did not detect the AICAR intermediate in our untargeted LC-MS/MS metabolomics analysis perhaps due to low relative abundance and/or low stability. As a result, we were unable to comment on AICAR levels but this would be an interesting research direction to pursue in subsequent follow up studies.

      Are the data and the methods presented in such a way that they can be reproduced? Are the experiments adequately replicated and statistical analysis adequate? 1. For purine-depleted studies (metabolite analyses, microscopy), how long were the cells grown in purine-depleted medium before the analysis? And how was the purine-depleted medium generated? Please reference any source that might have been used.

      __Response: __We removed purines from the cell culture environment by incubating cells for 7 days with DMEM supplemented with FBS dialyzed to remove small molecules such as nucleosides and nucleobases. This important methodological detail was omitted in error in our original submission. Our revised manuscript includes description of how we depleted cells of purines in the Materials & Methods at Lines 636-640 with reference to source materials and prior studies.

      Details describing the BioID experiment are minimal. How many replicates were performed, was label-free or TMT quantitation used for the protein identification. Further the data analysis and mining of the proteins from the BioID study are missing - What database was used to identify the proteins from the peptides? Please include this information in the Materials and Methods section as well as a link to a repository where the LC-MS/MS data generated can be found. Additionally, it would be very helpful to have a spreadsheet or table that lists the biotinylated proteins and expectant or p values for each.

      __Response: __We performed three independent biological replicates (n = 3) for the BioID experiment. We apologise for the omission and have now included this information in the Fig. 1 legend. Label-free quantitation was used for protein identification, and peptides were identified using the ProteinPilot™ Software (v. 4.5) database. As part of our revision, we have updated the Materials and Methods section to include these details and will also provide a spreadsheet listing all biotinylated proteins across replicates, including their p-values. Furthermore, we have submitted our LC-MS/MS data as supplementary files associated with this manuscript.

      Please include information about the streptavidin pulldown presented in Figure 1C.

      __Response: __Streptavidin pulldown followed by immunoblot for known WDR62 interacting proteins is described in our Materials & Methods section at line 753-759. __ __Proteins bound to Streptavidin agarose beads were eluted with Laemmli buffer following washing. Pulldown fractions and total lysates were then resolved on SDS-PAGE, transferred to PVDF and blotted with primary antibodies to detect WDR62 interacting proteins such as CEP170, JNK and AURKA. We also used this method to confirm biotin-labelling and affinity isolation of BAG2 in Fig. 1C.

      Many of the figure legends could benefit from a statistical description.

      Response: As requested, we have updated the legends for all relevant figures and supplementary figures to include statistical descriptions, specifying analyses used and replicate (n) numbers. These additions complement the detailed description of our statistical methods provided in the Materials & Methods section (line 884).

      There seems to be only two data points for Figure S3A. While there is no significant difference observed, it would be ideal to have additional replicates.

      Response: We have completed an additional replicate and updated Fig. S3A for our revised submission. This study which now includes n = 3 independent biological replicates. While we observed a slight increase in the proportion of cells with MAPKBP1 granules in response to sorbitol stress, this change was not statistically significant. In contrast, WDR62 formed granules in a much larger proportion of cells (~90%) in response to stress (Fig. 3E).

      (Figure 5I) Please provide statistical analysis to demonstrate the colocalization between FGAMS and WDR62 is robust in purine-depleted AD293 cells.

      Response: Our revised manuscript now includes three independent replicates assessing WDR62 co-localisation with PFAS in purine-depleted AD293 cells (Fig. 4I in revision). We consistently observed a high degree of co-localisation, as quantified by Pearson’s correlation coefficient (mean = 0.8), which was significantly different from control conditions.

      1. The use of HSP90 inhibitors is a little confusing given the connections being made with BAG2 and other HSP70 co-chaperones in Figure 1.

      • Does the same conclusions hold true with an HSP70 inhibitor or siRNA?

      • (OPTIONAL) There are a lot of discrepancies between Hsp90 inhibitors and effective treatment concentrations. For example, NVP-AUY922 caused purinosomes to disassemble whereas STA9090 cause purinosomes to change morphology and adopt a more aggregated state. Do other Hsp90 inhibitors share the same phenotypic response as NVP-AUY922 in this study.

      • The treatment time (24 h) with NVP-AUY922 is very long. Given that Hsp90 interacts with hundreds of proteins, it is hard to understand whether the effect of Hsp90 inhibition is direct or indirect. Shorter times (1 h or less) would be more insightful.

      __Response: __To address these specific comments on the specificity of effects from HSP90 inhibitor treatment, we have conducted additional experiments using NVP-AUY922, in addition to another HSP90 inhibitor, 17-AAG, and the HSP70 inhibitor, MKT-077, at both 24-hour and 1-hour timepoints.

      Our results demonstrate that NVP-AUY922 can rescue the aggregated HPRT phenotype in WDR62 KO cells even after 1 hour of treatment (Fig. 6F, G). Similarly, 17-AAG exhibits a comparable effect, reinforcing the role of HSP90 inhibition in modulating the spatial distribution of HPRT in the cytosol (Fig. 6F, G). Additionally, we found that MKT-077, a HSP70 inhibitor, also rescues the aggregated HPRT phenotype, with the effect being most pronounced at 24 hours but still evident at 1 hour (Fig. S10A, B). We also utilized BAG2 siRNA but determined that BAG2 depletion rescued WDR62 KO effects on HPRT expression (Fig. 6L) but did not reverse the effect on HPRT spatial distribution (Fig. S10C).

      (OPTIONAL) Does the 2.6-fold increase in BAG2 increase its association with WDR62?

      Response: We observed a ~2.6-fold increase in BAG2 levels following WDR62 deletion (Fig. 6A). However, as WDR62 is not present in KO cells, it is not possible to verify whether there would be an increase association with WDR62 and we did not conduct an experiment to overexpress BAG2 in WT cells. However, we presume that increased cellular levels of BAG2 would lead to increased pulldown with WDR62 by immunoprecipitation for example.

      Is the degradation of HPRT occurring through BAG2-mediated proteasomal degradation? Showing HPRT recovery by treating the cells with MG132 along with CHX would provide meaningful clues as to how BAG2 and HPRT might be related - Is BAG2 concentration increasing to facilitate the enhanced degradation of HPRT?

      __Response: __We thank the reviewer for this useful suggestion. However, our initial experiments with MG132 and chloroquine to inhibit proteosomal and autophagic pathways respectively gave mixed results. Our preliminary findings suggest neither was sufficient to substantially rescue HPRT levels in WDR62 KO cells. However, this needs extensive follow up with more precise dissection of cell degradation pathways with additional inhibitor or genetic targeting of degradation machinery. Thus, we did not include these studies in the revision and will instead include this in a follow up paper once we have completed a more comprehensive investigation.

      Does HPRT colocalize with WDR62 in cells?

      __Response: __ In response to this comment, we have demonstrated that osmotic stress induces the spatial reorganisation of endogenous HPRT into puncta that juxtapose and co-localize with WDR62 granules in a stress-dependent manner (Fig. 6H). This was further validated by examining the endogenous WDR62-HPRT interaction using PLA, which also revealed a stress-induced increase upon sorbitol treatment (Fig. 6I).

      (OPTIONAL) It would be nice to see validation experiments of some of the hits in Figure 1D or 1E in a co-immunoprecipitation experiment conducted similar to Figure 1C.

      __Response: __Our BioID assay, presented in Fig. 1D and E, identified WDR62 interactors, such as AURKA, JNK, CEP170 and MAPKBP1, that have been previously validated by co-IP by our group and others. Among the chaperones identified, we focused on BAG2 in this particularly study and validated BAG2-WDR62 interactions between by coIP (Fig. 2) and by proximity ligation assays (Fig. 4).

      The authors presented the findings that suggest that BAG2 interacts differently with commonly observed WDR62 mutations in MCPH2? How do these mutations affect WDR62 condensation, colocalization with purinosomes, or alter HPRT activity? Tying back the observations to something clinical would help elevate the overall significance of the findings.

      Response: We investigated the condensation of mutant WDR62. Interestingly, R438H mutant, which binds BAG2 (Fig. 2), forms granules constitutively prior to stress treatment while the 3936dupC mutant, which does not bind BAG2 (Fig. 2), did not form granules in response to sorbitol stress treatment. We also find that PFAS is colocalized with R438H granules in the absence of stress, although this requires repeated analysis and quantification. However, WDR62 deletion does not prevent PFAS or PPAT granule formation (Fig. S6) and, given reviewer advice to focus the topic of our revised manuscript, we have not included the effects of WDR62 mutations on granule formation in our revised manuscript.

      However, in response to these comments, we have conducted rescue experiments with patient-identified MCPH mutant variants of WDR62. Expression of the R438H or 3936dupC mutant in WDR62 KO cells did not rescue HPRT to the same extent as full-length WDR62 with wild-type sequence (Fig. 6B). Additionally, attempts to restore BAG2 levels in WDR62 KO cells by expressing mutant WDR62 showed no discernible difference from full-length WDR62. Thus, mutations to WDR62 associated with MCPH alters binding to BAG2 (Fig. 2, increased with R438H and decreased with 3936dupC), this was associated with dysregulated levels of BAG2 and HPRT. In our revised manuscript, we also examined the effect of HPRT depletion on neurodevelopment in vivo (Fig. 7) and included description of these findings at lines 417-442.

      Are the text and figures clear and accurate?

      1. There are many times throughout the manuscript that the wrong figure is being referenced. These mistakes caused significant confusion at many times while reviewing the manuscript. Please double check all in-text references to figures. For example, I believe that you meant to use Figure S1C instead of Figure 2E with the statement on lines 183-185. Again, I believe that correct figure reference on line 501 is Figure 7G not Figure 7E.

      Response: We apologize for this oversight. We have amended the errors indicated by the reviewer. Line 544 (501 in first submission) now refers to the correct figure (Fig. 6F) and lines 204-206 (183-185 in first submission) correctly refers to Fig. S1C in addition to Fig 2E. Each of the authors have also revised the rest of the manuscript to ensure all figures are correctly referenced in the main text.

      The figure legend on Figure S4 does not match the figure and the main text references. Please verify that the text in the figure legends correspond correctly to the figure.

      Response: We apologize for these inconsistencies in the figure legend relating to Fig S4 in our original submission. In the revised manuscript, we have amended the figure legend and the main text referencing Fig. S4 to correctly correspond to order of data panels in this figure.

      Please provide this data for the sentence on lines 399-400 in the supplemental file.

      __Response: __As requested, we have revised the manuscript to include results on HPRT cytoplasmic localisation following osmotic stress. We show that osmotic stress induces the spatial reorganisation of HPRT into puncta that juxtapose and co-localize with WDR62 granules in a stress-dependent manner (Fig. 6H). This was further validated by examining the endogenous WDR62-HPRT interaction using PLA, which also revealed a stress-induced increase upon sorbitol treatment (Fig. 6I).

      I believe that the authors use the phrase "cell proliferation" to describe cell viability in the main text. In the Materials and Methods section, the authors state "The XTT cell proliferation assay enables quantification of cellular redox potential, providing a colorimetric readout of cell viability." Cell proliferation, viability, and cytotoxicity are different measurements, so please revise to reflect the correct experiment that was performed.

      __Response: __The XTT colorimetric assay can be used to determine cell proliferation or loss of cell viability depending on the specific experiment. The reviewer is correct in pointing out that our study using XTT to measure cell numbers in the context of purine-depletion (Figure 5B) is a measure of cell viability. We apologize for the misleading text in our description of the XTT methods in our original submission. In our revised manuscript, we have amended our description of the XTT assay in our methods and in the figure legend to more accurately reflect the experiment performed.

      Other Minor Comments:

      1. Move the sentence "In contrast, despite reduced mRNA..." (lines 387-388) to the last section when a reduction in PFAS expression was first mentioned.

      __Response: __As requested, we have moved this line referring to PFAS protein levels in WDR62 KO cells to the previous section to when a reduction in PFAS mRNA was first mentioned.

      1. Please reference the following in the manuscript: • BioGRID database in the main text and Materials and Methods section • The reported study showing the DNAJC7-WDR62 interaction (as curated from BioGRID) • Fiji in the Materials and Methods section

      __Response: __We have now included references to these in our revised manuscript. References to BioGRID database are in the main text (line 146) and in the Materials and Methods (line 765). The report of DNAJC7-WDR62 interaction (Ref #37) curated from BioGRID was added at line 157 and reference (Ref #82) to Fiji plug-in was indicated at line 690 in Materials & Methods.

      (Line 461-463) The authors state the following: "the loss of WDR62 leads to an increase in BAG2 and vice-versa (Fig. 7A) (Fig. S9B). I am not sure that the vice-versa (i.e. loss of BAG2 increases WDR62) is true. From the data presented in Figure 7H, I do not see a significant change in WDR62 expression upon BAG2 siRNA treatment.

      __Response: __We apologize for the incorrect use of the term “vice versa” in this context. We had meant that while WDR62 loss led to an increase in BAG2, the converse increased expression in WDR62 resulted in a decrease in BAG2 levels. The reviewer is correct that the siRNA knockdown in BAG2 did not substantially alter WDR62 levels. We have amended the text at lines 465-467 to clarify this statement.

      For your BioID study, do you know how many or the proportion of cells that were mitotically arrested with the low dose of nocodazole (200 ng/mL)? Given the small number of unique proteins that were in the mitotic only population, it is curious to know how enriched the cells were and whether WDR62 localization is important in the context of this study.

      __Response: __The overnight treatment with low dose nocodazole results in an enrichment of cells arrested in late prometaphase which we estimate at 50-60% of AD293 cells compared to

      1. Just to clarify, the WDR62-HA lane (third in each set) in Figure 1C is not WDR62-BirA*-HA and that it is only being used as a control.

      Response: This is correct. To improve clarity, we have amended the labels on the WDR62-HA lanes in Figure 1C to say “WDR62-HA only”.

      1. In the Discussion (lines 439-441) "We also show that WDR62 forms dynamic, phase-separated granules that co-localise with chaperones and purine metabolic enzymes, resembling purinosomes." I believe that the authors meant to say co-chaperones instead of chaperones given no microscopy data was presented showing the colocalization of HSP70/90 with WDR62 granules. Please revise.

      __Response: __This sentence (line 473) has been revised as suggested.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The authors provide evidence to reveal the novel functions of WDR62 protein in maintaining the stability and activity of purine metabolic enzymes and overall purine homeostasis. WDR62 interacted with BAG2, and they are recruited to purinosome. WDR62 loss caused accelerated degradation of purine salvage enzyme HPRT, and led to the accumulation of purine nucleotide intermediates.

      While this study is compelling and significant for the field of neurodevelopmental disorders including microcephaly and purine metabolism, there are several concerns that should be addressed before publication.

      __Response: __We thank reviewer #2 for their constructive criticisms and supportive comments noting the statement reinforcing significance of our study in the field. We have made a meaningful and concerted effort to address the reviewer comments with extensive additional experimental work and substantial revision of our manuscript.

      Major comments:

      Although all experiments are conducted using non-neuronal cultured cells, does this phenomenon also occur in neuronal cells?

      __Response: __To address this comment and reviewer concerns regarding the links between WDR62 and HPRT in a neuronal context, we performed in utero-electroporation to determine the effects of HPRT depletion on formation of neocortex in mouse embryos. We electroporated embryonic day 14 (E14) mouse brains with siRNA targeting Wdr62, and Hprt and assessed neural progenitor proliferation, migration and differentiation using immunofluorescence. We find that the loss of both WDR62 and HPRT leads to a similar precocious delamination of neural progenitors from the apical ventricular surface (Fig. 7). This process is the first step in neural migration and required to generate a diversity of cells, both self-renewed (eg. outer radial glia) and differentiated neurons and glial cells in the developing neocortex (doi.org/10.1146/annurev-cellbio-101011-155801). Interestingly, we also uncovered that HPRT loss promoted the self-renewal of delaminated intermediate progenitors (IPs) which is unlike impaired the self-renewal of neural progrenitors observed following WDR62 depletion (Fig. 7). Thus, brain development is sensitive to HPRT levels and the HPRT depletion phenocopies WDR62 in cell delamination which supports a neural role for WDR62-HPRT. Moreover, our findings suggest WDR62 loss has more severe neurodevelopmental defects with hints at the complex metabolic functions of WDR62 (discussed in lines 563-577).

      What is the interaction between endogenous WDR62 and Bag2? This is because in overexpression systems, multiple chaperones may interact with the target protein during protein folding.

      Is endogenous WDR62 also present in the purinosome in purine depleted or sorbitol condition?

      __Response: __In response to these comments and similar concerns by reviewer #1, we examined interactions between WDR62, BAG2, HPRT, and PFAS at the endogenous level by utilising proximity ligation assays (PLA, Fig. 4+6). We determined a robust interaction between endogenous WDR62 and BAG2 (Fig. 4K), evident by abundant PLA puncta which were nuclear excluded and localised to the cytosol, consistent with our results in overexpression systems (Fig. 4). We also confirmed endogenous WDR62 interactions with purine enzymes PFAS (Fig. 4K) and HPRT (Fig. 6I) in a similar fashion. To determine whether sorbitol stress promotes their interaction, we assessed changes in the per cell numbers of these puncta in response to sorbitol stress. We confirmed that endogenous WDR62 interaction with HPRT was dependent on BAG2 (Fig. 6J). WDR62-HPRT interactions increased with sorbitol stress (Fig. 6I).

      Regarding Fig6 and Fig7, when HPRT decreases and inosine accumulates in WDR62-KO condition, did the levels of hypoxanthine, xanthine, and uric acid change?

      __Response: __ In Fig. 5G we used an untargeted metabolomics approach that relies on identification databases such as MS-DIAL and associated spectral libraries. Unlike targeted approaches, this method does not always allow for the confident identification of all metabolites of interest. As a result, hypoxanthine, xanthine, uric acid, and other purine intermediates (e.g., AICAR) were not positively identified. This is likely due to limitations in database coverage, spectral similarity to other compounds, or constrains related to our extraction method.

      Does HPRT and the three microcephaly-associated WDR62 mutants also recruited in the purinosome in purine depleted or sorbitol condition?

      __Response: __In response to this, and a similar comment by reviewer #1, we examined whether endogenous HPRT co-localised with WDR62 granules induced by sorbitol. We show that hyperosmotic stress induces the spatial reorganisation of HPRT into puncta that juxtapose and co-localize with WDR62 granules in a stress-dependent manner (Fig. 6H). This was further validated by examining the endogenous WDR62-HPRT interaction using PLA, which also revealed a stress-induced increase upon sorbitol treatment (Fig. 6I).

      As to whether mutant WDR62 was recruited to purinosomes, as detailed in our response to reviewer #1 above (minor comment #6), we find that R438H mutant formed condensed granules prior to stress treatment while 3936dupC mutant did not form granules in response to stress. Therefore, MCPH mutations appear to disrupt the stress-induced formation of WDR62 granules in the cytoplasm. Since we also find that WDR62 KO did not prevent stress-induced formation of PFAS and PPAT granules, which may represent or overlap with purinomes, we chose to not include our findings on granule localization of mutant WDR62 localization in our current revised manuscript. We instead focused on rescue of HPRT and BAG2 levels with patient-derived MCPH mutant variants of WDR62. We confirmed that, unlike WT WDR62, mutant WDR62 could not fully return HPRT or BAG2 levels in WDR62 KO cells (Fig. 6B).

      In Fig7C, HPRT/tubulin ratio appears to decrease in WT from 0hr to 24h, but the graph does not show this decrease. Additionally, quantification of PFAS(FGAMS) and BAG2/tubulin should be performed.

      Response: While slight variations in HPRT signals are visible from 0 h to 24 h in the representative blot, quantification across the n = 9 biological replicates do not support a significant decrease, with these variations falling within the SEM shown in the graph. This representative blot was selected for its clarity and since it most clearly depicts the key trend which is the increasing difference in the HPRT/Tubulin ratio between WT and KO cells with increased duration of CHX treatment. Additionally, in response to this comment, we have now quantified PFAS and BAG2/Tubulin and have inserted these data into Fig. 6C.

      Fig7D is problematic. HPRT in WDR62-KO cells seems to localize in the nucleus, possibly due to stronger exposure in KO conditions compared to WT. Also, the line scan is drawn in areas with low signal in WT. The comparison should be performed in areas with high perinuclear signal.

      __Response: __We appreciate the reviewer’s feedback and acknowledge their concern of an apparent differences in fluorescence intensity in WDR62 KO vs WT cells. In the original submission, slight differences in fluorescence intensity between the WT and WDR62 KO panels may have exaggerated differences in HPRT levels in the nucleus. To address this, we have replaced the representative images with those with more consistent fluorescence intensity across conditions and better represent the average population of sampled cells. Regardless, quantified the change in HPRT cytoplasmic redistribution in response to WDR62 loss across multiple independent biological replicates (n=4) and multiple cells (>12 cells per repeat) within each biological replicate to confirm a change in HPRT distribution in KO cells (Fig. 6E+G).

      The localization of HPRT should be compared in WT and WDR62-KO with BAG2 siRNA. It is also possible to confirm whether the phenotypes observed in KO, such as cell proliferation and xanthosine/inosine levels, are rescued.

      __Response: __We conducted a series of immunofluorescence experiments to assess the impact of BAG2 knockdown (siRNA) on the spatial distribution of HPRT in WT and WDR62 KO cells. BAG2 depletion had no effect on HPRT distribution and did not rescue its aggregated-like appearance in WDR62 KO cells (Fig. S10C). Thus, while abnormal HPRT localization in absence of WDR62 was due to excessive of HSP70/90 activity (Fig. 6F), this was not reversed by BAG2 siRNA. However, BAG2 siRNA reduced BAG2 levels to below wild-type cells (Fig. 6I). An imbalance of HSP and co-chaperone levels are known to be involved in aggregation of cytoplasmic proteins. (doi.org/10.1096/fj.202002645R). Therefore, while BAG2 siRNA may have returned HPRT levels, it may not have appropriately corrected the levels of HSP70/90 activity required for normal HPRT localization (lines 407-413 in revision).

      We did not attempt to rescue cell proliferation and xanthosine/inosine levels with BAG2 siRNA in order to prioritize other studies requested by reviewers such as neurodevelopment function of HPRT and flux analysis of purine synthesis/salvage.

      It should be considered that the induction of Bag2 in WDR62-KO might allow purinosome formation to proceed normally due to compensation. The co-localization of WDR62 and purine enzymes during purinosome formation should be compared when BAG2 expression is suppressed. Similarly, any changes in BAG2 localization in WDR62-KO should be examined. Furthermore, the purinosome formation ability should be compared in WDR62KO + Bagl2 siRNA condition.

      __Response: __To address these insightful comments and requests by reviewer #2 response, we have performed additional experiments to assess whether BAG2 facilitates WDR62 granule assembly, purinosome assembly, and the WDR62-HPRT interaction. siRNA-mediated BAG2 depletion did not prevent stress-induced assembly of WDR62 or PFAS granules (Fig. S6D+E). Thus, unlike HSP70/90 activity, purinosome assembly and WDR62 localization to purinosomes did not appear to require BAG2. Rather we demonstrated a role for WDR62-BAG2 in regulating HPRT (Fig. 6, lines 400-411).

      The reduction of HPRT in WDR62-KO should be examined for potential effects of enhanced degradation via the ubiquitin-proteasome system or the autophagy-lysosome system.

      __Response: __See our response to reviewer #1, minor comment #3. Briefly, neither MG132 blockade of proteosomal degradation nor chloroquine inhibition of autophagy was sufficient to return HPRT levels in WDR62 KO cells. However, these studies are not exhaustive and we are currently pursuing alternative and more specific inhibitors of UPS or lysosomal degradation. As this is not essential for the main findings of the current manuscript, we will include delineation of HPRT degradation pathway in a future publication.

      Although it is known that HPRT-KO mice do not exhibit any effects on normal brain development except in some dopaminergic neurons, what are your thoughts on this?

      Response: We thank the author for raising this interesting point. While global HPRT KO mice appear not to exhibit widespread brain development defects (doi: 10.1007/s00018-022-04326-x) this does not preclude a role for impaired HPRT to contribute to specific neurodevelopmental defects in context of WDR62 mutation or loss. In utero electroporation studies, we find that WDR62 or HPRT depletion results in precocious delamination of apical precursors which may trigger premature differentiation. However, while WDR62 depletion led to reduced proliferation of delaminated radial glia ventricular/subventricular zone, we observed increased proliferation with HPRT loss (Fig. 7). Our findings are in good concordance with the study mentioned by reviewer #2, Witteveen et al. 2022 (doi: 10.1007/s00018-022-04326-x), who similarly reported an increase in proliferation and abnormal cell migration patterns which may be attributed to apical delamination of radial glia. The increased proliferation of progenitors in the intermediate zone or outer ventricular/subventricular zone may compensate for premature differentiation of apical progenitors to explain the lack of overall reduction in brain size with HPRT deficiency alone. Thus, our findings indicate that defects in WDR62-HPRT may contribute to the premature apical delamination of radial glia but WDR62 has additional functions that are indispensable for normal brain development. This may include complex functions in regulating purine metabolism independent of HPRT. We have now included the paper by Witteveen et al. 2022 in our revised manuscript and the above was discussed in detail at lines 565-577.

      Minor comments: • Please write the full name before the abbreviation of the gene. • There is no measurement data for Fig7C, and a measurement line is drawn only in one panel of the ROI. • The line 488 "Fig11" looks like a typo.

      __Response: __As requested by the reviewer, we have included the full name of genes before their abbreviation and corrected the typographical error (line 548 in revision). For Fig S7C (Fig. S6B in revision), we have removed the measurement line which was included in error in our original manuscript. This supplementary figure demonstrates that the stress-stimulated granule assembly of ectopically expressed PFAS and PPAT was not altered or appreciably different in WDR62 KO cells. We quantified this for sorbitol treatment (Fig S6A). We performed the purine-depletion experiment twice with identical results. Given this was a negative result we focused our efforts elsewhere.

      The table could not be found.

      __Response: __We apologise for this oversight. The Supplementary Information file containing Tables S1-3 was excluded from the original submission has now been included in our revised submission.

      It is strange that all measurement values for WT or control in Fig2, Fig7, and FigS9 are exactly 1.0 without any variation. Please check the measurement method again.

      __Response: __Our densometric band measurements in western blots within the indicated figures are normalized against WT control cells as a reference condition. This removes variation in arbitrary densitometric values that changes from blot to blot even for identical samples. Thus, values are fold-change in protein levels relative to WT control conditions. Hence values for WT or control cells are 1 (no change relative to itself) as the reference points and there is no variation between replicate experiments. We apologize for not explaining this in our original submission. Our revision now describes this quantification and processing of raw data in methods and materials (lines 668-671).

      Please write the method for purine depleted medium.

      __Response: __Our revised manuscript includes description of how we depleted cells of purines in the Materials & Methods at lines 636-640 with reference to source materials and prior studies.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In the present work, authors describe a novel role of the microcephaly associated protein WDR62 in purine metabolism under cell stress conditions. In the proposed cellular model (AD293 WDR62 overexpression system), the WDR62 proximity binding partners are firstly identified and categorized according to their functional role in the cell (protein folding, purine metabolism, and stress granules). Among them, authors focus on BAG2 - a HSP70/90 co-chaperone involved in cellular stress responses. After the characterization of the WDR62-BAG2 physical interaction sites, suggested to be disrupted by WDR62 pathogenic mutations, their functional interaction in cellular stress responses is investigated. WDR62-associated granules are extensively characterized for their physical and dynamic properties under different conditions (i.e., hyper-osmotic stress). Further, through the evaluation of N-and C-terminally truncated form of WDR62 authors characterize the protein regions responsible for WDR62-containing granule condensation - suggesting a potential mechanism disruption in the event of pathological WDR62 mutations. Lastly, authors provide evidence that WDR62 condensation does not occur in canonical stress granules but in the so called-purinosomes, where it participates in the regulation of purine metabolic pathway stabilizing HPRT (purine salvage enzyme) via WDR62-BAG2-HSP70/90 axis.

      Major comments:

      Overexpression system and the employed cell line are a major limitation of the study. There is no experimental data on human neural cells and on endogenous WDR62, underestimating the potential difference in cell type-specific metabolism. In light of this consideration, the provided introduction and conclusions on neural development and microcephaly have to be re-formulated. I suggest providing a more general introduction/conclusions on WDR62 role (and alterations) in cell division and cell metabolism (neurodevelopment and cancer share common patterns) since purine homeostasis is not exclusive of neural progenitor cells.

      This reviewer thinks that the structure of the work is a bit convoluted (too many results in main figures that are not substantial). I suggest to re-organize and to prioritize the most relevant results. Further, it would be clinically relevant to add WDR62 mutant constructs in the functional evaluation of purine metabolism to better dissect the physiological role of WDR62 and the impact of the mutations on cellular physiology.

      Response: We are appreciative for this constructive evaluation of our manuscript and frank comments on the limitations of our study from reviewer #3. We have now included extensive new studies that provide evidence supporting endogenous mechanisms and insights into in vivo functions in neurodevelopment. We have also removed and combined several figures relating to the stress-induced purinosome assembly of WDR62 to better focus our manuscript on WDR62 interaction mechanisms and their purine metabolic function.

      Fig. 1: Overexpressed WDR62 fluorescence signal might be artifactual and may hide more detailed localization pattern during interphase. Authors should also provide endogenous WDR62 immunofluorescence panel in AD293 cells. Additionally, the "cytosolic" localization of WDR62 during interphase (indicated in the introduction, lines 88-89) has been re-defined in recent works pointing out that the protein is dynamically associated with the interphasic centrosome, the Golgi apparatus, and spindle poles during mitosis.

      __Response: __In response to this point, we have added text in the introduction (line 100-102) to clarify the dynamic association of WDR62 in cytoplasmic compartment during interphase includes the golgi apparatus. We have also added reference to the study by Dell’Amico and co-workers (doi: 10.7554/eLife.81716, Ref #24 in revision) alluded to by reviewer #3.

      We utilized ectopic expression of tagged WDR62 constructs to determine redistribution to stress-responsive cytoplasmic granules and co-localization with purine enzymes. Immunofluorescence staining of endogenous WDR62 also appears to reveal granule assembly but these findings are not as clear as the primary antibodies also detect additional proteins independent of WDR62 (validated using our KO cells). We agree that protein overexpression may result in artificial localization patterns but this can be mitigated with careful controls. We find that stress-induced WDR62 granule localization is highly dynamic and reversible. We observe the same response with full-length protein using different fluorescent protein or small affinity tags at either N- or C-terminus. High expression of mutant WDR62 (e.g. 3936dupC) or a closely related family member (MAPKBP1) do not form the same purinosome-associated granules. Moreover, in response to related comments by reviewer #1 and #2, we have now included proximity ligation assays confirm interactions between WDR62, BAG2 and purine enzymes (Fig. 3 and Fig. 6).

      Fig. 1C lacks quantification of BAG2/CEP170/AURKA signal. Further, how can authors exclude that is not nocodazole effect on microtubules disruption which impairs WDR62 spindle pole localization and therefore protein-protein interactions? A panel showing that "low dose" nocodazole do not impinge endogenous and exogenous WDR62 localization in mitotic cells is needed.

      __Response: __WDR62-BirA specific biotinyation and affinity isolation of BAG2, CEP170 and AURKA, compared to BirA or WDR62-HA only controls, was very clear in Fig. 1C. We did not quantify the extent that mitotic synchronization increased or decreased binding to WDR62 as the mitosis specific context was not a focus in our subsequent figures. Rather we focused on and quantified in detail WDR62-BAG2-HPRT mechanisms in response to cell stress.

      We are also very confident that low dose nocodazole treatment does not prevent spindle pole localization. This treatment impinges on microtubule dynamics to trigger spindle checkpoints, arresting cells in mitosis. The bipolar organization of spindles is lost but spindle microtubules and minus-end microtubule directed localization of WDR62 at spindle asters are retained under these conditions and is specific to mitotic cells. The robust WDR62-BirA biotinylation of AURKA, which is spindle pole-associated, specifically in mitotic arrested cells further confirms WDR62 is retained at the spindle. We demonstrated this in our previous papers (Ref. 5+6). Others have also shown that both endogenous (doi: 10.7554/elife.81716) and exogenous WDR62 (doi: 10.1083/jcb.202007167, doi: 10.1242/jcs.157537) retain spindle pole localisation under similar conditions.

      Fig. 3 H-J: The fluorescence signals are saturated (also evident in the intensity profile plot) and thus not applicable for any analysis. Further, how these linear ROIs are chosen? The signal pattern is not homogenously distributed in those images. Please provide a more consistent fluorescence analysis.

      __Response: __We acknowledge reviewer #3 concerns but while some granules, particularly those expressing G3BP-EGFP, exhibit saturated fluorescence signals, this does not impact or prevent our analysis. Our aim was not to quantify subtle fluorescence intensity changes within individual granules, but rather to compare fluorescence signal between granules across different channels to identify overlap. The linear ROIs were selected at random to illustrate that WDR62 and G3BP signals do not overlap between WDR62 and G3BP-positive granules.

      Minor comments:

      Abstract, line 49: How can these WDR62 mutations can result in a complete loss of the protein ("In cells lacking WDR62") if authors report co-IP experiments (Fig. 2) with clear mutant WDR62 bands? Rephrase accordingly.

      __Response: __The statement in our original abstract referenced by reviewer #3 referred to results presented in Fig 7 (now Fig. 6 in our revision) comparing WDR62 KO with WT cells and not co-IP experiments with mutant WDR62 in Fig 2. We have revised our abstract substantially to incorporate additional experimental work and to ensure clarity in our statements related to KO cells lacking WDR62 and cells expressing WDR62 mutants.

      Result referred to Fig. 2D reports that "BAG2 co-immunoprecipitated with WDR62(N)-EGFP but not WDR62(C)-EGFP". The blot and the relative quantification in figure 2D instead show BAG2 signal in the WDR62(C)-EGFP - even if significantly lower. Please rephrase accordingly.

      __Response: __We have revised line 192 of the main text to more accurately state the reduced interaction between WDR62(C)-EGFP and BAG2.

      Lines 186-187: authors declare that the C-terminal tail comprising the helix-loop-helix domain is required for BAG2 to bind full-length WDR62. There are no such data in support of this. The C-terminal fragment includes both the disordered region and the dimerization domain. How can authors conclude that the dimerization domain alone is sufficient to bind BAG2?

      __Response: __In Fig. 2, we show that the co-IP of BAG2 was significantly impaired in cells expressing WDR62(3936dupC), which lacks the C-terminal helix-loop-helix (HLH) domain. Additionally, we demonstrate that the C-terminal half of WDR62, which includes the HLH domain, does not bind BAG2. Based on these findings, we conclude that while the HLH domain is necessary for BAG2 binding to full-length WDR62, it is alone not sufficient. To ensure clarity, we have revised the main text (lines 207-209) to state “…the C-terminal helix-loop-helix domain—required for WDR62 dimerisation—is necessary but not sufficient for BAG2 to bind full-length WDR62.”

      Lines 189-190: results in AD293 cell line are not directly applicable in demonstrating that poor WDR62-BAG2 interaction can lead to alterations in brain development. Please rephrase.

      __Response: __We established that WDR62 interacts with BAG2 co-chaperone and MCPH mutations in WDR62 disrupt this interaction. Although our results were performed in AD293 cells, it seemed reasonable to speculate that WDR62 interactions with chaperones might contribute to brain development given well established WDR62 functions in this context. However, we acknowledge that this speculation may not be appropriate at this point of the manuscript, so we have removed this text (line 210) in our revised manuscript.

      Line 196: Indicate here, as the first mention, stress granules as "SGs" and use the abbreviation consistently throughout the manuscript.

      __Response: __We have abbreviated stress granules as suggested (first mentioned at line 102) and utilized this abbreviation consistently throughout the manuscript.

      Line 255: are human neural progenitor cells enough sensitive to sorbitol? If not, the proposed experimental design is a bit artifactual and the results/conclusions cannot be related to neural development alterations. I suggest applying more "physiological" stressors and frame the results in meaningful neurodevelopmental/tumorigenic environment. Please add this point to the discussion.

      __Response: __Neural progenitors are likely sensitive to sorbitol, as hyperosmotic stress has been used to induce phase separation of a wide variety of proteins in neural contexts (doi: doi.org/10.1038/s41598-023-39090-w, doi.org/10.1016/j.celrep.2018.06.094). In this study, we leveraged sorbitol-induced hyperosmotic stress as a controlled and reproducible means of triggering WDR62 phase separation, enabling us to examine its downstream interactions with BAG2, HPRT, and other purine enzymes. We further extend these observations to metabolic cell stress with purine-depletion.

      We found that WDR62 phase separation occurs rapidly at low sorbitol concentrations (~50 mM) (Fig. 3B), suggesting that even milder osmotic stress, particularly under prolonged exposure, could similarly drive WDR62 condensation in physiological settings. As requested by the reviewer, we have added a small section to the discussion (lines 472-480) to discuss the physiological implications of sorbitol stress on WDR62 granule assembly.

      Line 240: WDR62 granules association with microtubules and especially mitochondria is not convincing (Fig. S5). This data seems to be a bit qualitative, please provide more detailed quantification of this parameter.

      __Response: __The association of WDR62 granules with microtubules and mitochondria is quantitatively assessed using two methods, as shown in the graphs to the right of the images. One graph presents the proportion of WDR62 granules overlapping with CytC/Tubulin, providing a binary measure of colocalization. We also examined the degree of signal correlation across the entire ROI by calculating Pearson’s correlation coefficient. In response to sorbitol, we showed a higher association of WDR62 with Tubulin and CytC compared to randomised controls. We have updated the Materials and Methods to include a detailed description of this analysis (lines 708-720).

      Fig. 4 is convoluted. I suggest moving some data to supplementary to improve the clarity of the figure.

      __Response: __In addressing this comment and related comments from other reviewers to focus our manuscript, we have removed our data on fluorescence recovery and post-stress disassembly of WDR62 granules from what was Fig. 4 in our original submission and combined remaining components with Fig. 3 to centre on stress-induced assembly of WDR62 granules for our revised manuscript.

      Line 273: "Liquid-like protein condensates also exchange their contents with the bulk cytosol [52]". Reference 52 reviews the existing literature referred to biomolecular condensates that exert nuclear function (e.g., genome organization, gene expression, and DNA repair). No mention on events involving cytoplasm. Please add a more relevant reference.

      __Response: __We thank the reviewer for highlighting this inconsistency. However, this reference is no longer required and has been removed from our revised manuscript as the section of the main text has been deleted in alignment with the above response where figure panels relating to WDR62 phase separation were removed for focus and clarity.

      Lines 290-291: have authors considered the effect of sorbitol on microtubules dynamic that might reflect in granules dynamic changes?

      __Response: __We thank the authors for this insightful comment. Hyperosmotic stressors such as sorbitol are known to reduce microtubule dynamicity (doi.org/10.1016/j.devcel.2022.02.001), likely due to increased cytoplasmic viscosity and crowding effects. While we have not directly assessed microtubule dynamics in our study, it is certainly possible that these changes could influence WDR62 granule dynamics, given their association with microtubules (Fig. S6). While we have reduced emphasis on the dynamic nature of WDR62 granules in our revision, a useful direction for future studies to explore how alterations in microtubule dynamics induced by physiological stressors facilitate changes in WDR62 granule assembly or dynamics (e.g., fission, fusion).

      Line 295: I suggest moving the prediction of the disordered region of WDR62 when first mentioned (e.g., Supplement referred to Fig. 2)

      __Response: __This text is no longer required as we have removed this dataset from our revised manuscript to address reviewer consensus feedback to enhance cohesiveness and clarity.

      Fig. S6C-E, I: Unclear which is the criterion by which a cell is marked as "with" or "without" granules.

      __Response: __This text is no longer required as we have removed this dataset from our revised manuscript to address reviewer consensus feedback to enhance cohesiveness and clarity.

      Fig. S8: Unclear, also from the micrograph showed in the figure, how authors have counted/considered the microtubules/mitochondria associated purinosomes. Seems very qualitative and observer dependent. Please provide a more reliable analysis.

      __Response: __We apologise for omitting a description of the methodology used in the analysis of the images in Fig. S8 (now Fig. S6 in revision). We have now provided a detailed description in the Materials and Methods section (lines 709-721) of how microtubule- and mitochondria-associated purinosomes were identified and quantified.

      Fig. 6A: The same blot of WDR62 KO is shown in Fig. S7. Please remove one.

      __Response: __As requested, we have removed a set of blots demonstrating WDR62 protein deletion in KO cells from Fig. S7 (Fig. S6 in revision).

      Fig. 6C, D: Method for cell proliferation measure is indirect and "rounded cells" as indicator of cell death is sub-optimal. Analysis with specific markers would be preferable in both cases.

      Response: We used an XTT assay to measure cell viability as a function of cell number. In revised text, and also detailed in our response to reviewer #1 (point 4 under Text and Figures), we clarified that this was a measure of cell viability in response to purine-depletion as oppose to a direct measure of cell proliferation. Our amended text attributes the results in Fig 6C (now Fig. 5B in revision) to changes in cell viability rather than proliferation.

      With regards to additional measure of cell death, we had also performed LDH release assays to quantify cell death in addition to our measurement of cell rounding. The LDH assay is widely used and accepted measure of cell death or cytotoxicity and is indicated in Figure 5D in the revision.

      Fig.7B: Why the transfection control vector "EGFP only" significantly increases/decreases the BAG2/HPRT expression with respect to the negative control?

      __Response: __The reviewer comment here on Fig. 7B (now Fig. 6B in revision) refers to the control vector (EGFP only) transfected into WDR62 KO cells, as opposed to WT cells. Therefore, the difference in protein expression in this condition does not match the WT cells in the first lane as BAG2 and HPRT are increased and decreased respectively in KO cells compared to WT. This aligns with results presented in Fig. 6A.

      Paragraph from line 410 to 434: very confusing, the reported results are not well conveyed and therefore not convincing. To be reformulated.

      __Response: __We thank the reviewer for the direct and constructive feedback. The revised section (lines 378–416) addresses whether WDR62-BAG2 regulates HPRT levels. It has been substantially updated to include new experimental data and to reflect our latest findings and conclusions. We believe these revisions have significantly improved the logical flow and clarity of the discussion.

      Lines 524-526: the author's conclusion that: "...the loss of purine metabolic enzymes, including HPRT, disrupts neurogenesis, resulting in microcephaly, cell cycle defects, ciliopathies, and abnormalities in proliferation and neural progenitor fate decisions, mirroring the loss of WDR62." is not supported by the cited literature [29] and by the results presented in this work. Please provide additional references or remove the statement.

      __Response: __ As requested by the reviewer, we have removed the statement and substantially revised this section of the discussion (lines 563-677) to incorporate findings from our additional studies such as in utero electroporation.

      Lines 527-529: if authors state that "...other WD repeat-containing and microcephaly-associated proteins interact with purine enzymes..." have to provide additional references in addition to the NWD1 one. Otherwise, these lines should be rephrased as "another WD repeat containing and microcephaly-associated protein...".

      __Response: __We have amended this statement (line 589-592 in revision) as requested.

      Reference 62 is not well indexed in the Reference section. Please adjust.

      __Response: __We thank the reviewer for pointing out this error. The reference to Rauch et al. (2014) [Ref. 60 in the revised manuscript] has been corrected and now includes the complete bibliographic details.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In the present work, authors describe a novel role of the microcephaly associated protein WDR62 in purine metabolism under cell stress conditions. In the proposed cellular model (AD293 WDR62 overexpression system), the WDR62 proximity binding partners are firstly identified and categorized according to their functional role in the cell (protein folding, purine metabolism, and stress granules). Among them, authors focus on BAG2 - a HSP70/90 co-chaperone involved in cellular stress responses. After the characterization of the WDR62-BAG2 physical interaction sites, suggested to be disrupted by WDR62 pathogenic mutations, their functional interaction in cellular stress responses is investigated. WDR62-associated granules are extensively characterized for their physical and dynamic properties under different conditions (i.e., hyper-osmotic stress). Further, through the evaluation of N-and C-terminally truncated form of WDR62 authors characterize the protein regions responsible for WDR62-containing granule condensation - suggesting a potential mechanism disruption in the event of pathological WDR62 mutations. Lastly, authors provide evidence that WDR62 condensation does not occur in canonical stress granules but in the so called-purinosomes, where it participates in the regulation of purine metabolic pathway stabilizing HPRT (purine salvage enzyme) via WDR62-BAG2-HSP70/90 axis.

      Major comments:

      Overexpression system and the employed cell line are a major limitation of the study. There is no experimental data on human neural cells and on endogenous WDR62, underestimating the potential difference in cell type-specific metabolism. In light of this consideration, the provided introduction and conclusions on neural development and microcephaly have to be re-formulated. I suggest providing a more general introduction/conclusions on WDR62 role (and alterations) in cell division and cell metabolism (neurodevelopment and cancer share common patterns) since purine homeostasis is not exclusive of neural progenitor cells.

      This reviewer thinks that the structure of the work is a bit convoluted (too many results in main figures that are not substantial). I suggest to re-organize and to prioritize the most relevant results. Further, it would be clinically relevant to add WDR62 mutant constructs in the functional evaluation of purine metabolism to better dissect the physiological role of WDR62 and the impact of the mutations on cellular physiology.

      Fig. 1: Overexpressed WDR62 fluorescence signal might be artifactual and may hide more detailed localization pattern during interphase. Authors should also provide endogenous WDR62 immunofluorescence panel in AD293 cells. Additionally, the "cytosolic" localization of WDR62 during interphase (indicated in the introduction, lines 88-89) has been re-defined in recent works pointing out that the protein is dynamically associated with the interphasic centrosome, the Golgi apparatus, and spindle poles during mitosis.

      Fig. 1C lacks quantification of BAG2/CEP170/AURKA signal. Further, how can authors exclude that is not nocodazole effect on microtubules disruption which impairs WDR62 spindle pole localization and therefore protein-protein interactions? A panel showing that "low dose" nocodazole do not impinge endogenous and exogenous WDR62 localization in mitotic cells is needed.

      Fig. 3 H-J: The fluorescence signals are saturated (also evident in the intensity profile plot) and thus not applicable for any analysis. Further, how these linear ROIs are chosen? The signal pattern is not homogenously distributed in those images. Please provide a more consistent fluorescence analysis.

      Minor comments:

      Abstract, line 49: How can these WDR62 mutations can result in a complete loss of the protein ("In cells lacking WDR62") if authors report co-IP experiments (Fig. 2) with clear mutant WDR62 bands? Rephrase accordingly.

      Result referred to Fig. 2D reports that "BAG2 co-immunoprecipitated with WDR62(N)-EGFP but not WDR62(C)-EGFP". The blot and the relative quantification in figure 2D instead show BAG2 signal in the WDR62(C)-EGFP - even if significantly lower. Please rephrase accordingly.

      Lines 186-187: authors declare that the C-terminal tail comprising the helix-loop-helix domain is required for BAG2 to bind full-length WDR62. There are no such data in support of this. The C-terminal fragment includes both the disordered region and the dimerization domain. How can authors conclude that the dimerization domain alone is sufficient to bind BAG2?

      Lines 189-190: results in AD293 cell line are not directly applicable in demonstrating that poor WDR62-BAG2 interaction can lead to alterations in brain development. Please rephrase.

      Line 196: Indicate here, as the first mention, stress granules as "SGs" and use the abbreviation consistently throughout the manuscript.

      Line 255: are human neural progenitor cells enough sensitive to sorbitol? If not, the proposed experimental design is a bit artifactual and the results/conclusions cannot be related to neural development alterations. I suggest applying more "physiological" stressors and frame the results in meaningful neurodevelopmental/tumorigenic environment. Please add this point to the discussion.

      Line 240: WDR62 granules association with microtubules and especially mitochondria is not convincing (Fig. S5). This data seems to be a bit qualitative, please provide more detailed quantification of this parameter.

      Fig. 4 is convoluted. I suggest moving some data to supplementary to improve the clarity of the figure.

      Line 273: "Liquid-like protein condensates also exchange their contents with the bulk cytosol [52]". Reference 52 reviews the existing literature referred to biomolecular condensates that exert nuclear function (e.g., genome organization, gene expression, and DNA repair). No mention on events involving cytoplasm. Please add a more relevant reference.

      Lines 290-291: have authors considered the effect of sorbitol on microtubules dynamic that might reflect in granules dynamic changes?

      Line 295: I suggest moving the prediction of the disordered region of WDR62 when first mentioned (e.g., Supplement referred to Fig. 2)

      Fig. S6C-E, I: Unclear which is the criterion by which a cell is marked as "with" or "without" granules.

      Fig. S8: Unclear, also from the micrograph showed in the figure, how authors have counted/considered the microtubules/mitochondria associated purinosomes. Seems very qualitative and observer dependent. Please provide a more reliable analysis.

      Fig. 6A: The same blot of WDR62 KO is shown in Fig. S7. Please remove one.

      Fig. 6C, D: Method for cell proliferation measure is indirect and "rounded cells" as indicator of cell death is sub-optimal. Analysis with specific markers would be preferable in both cases.

      Fig.7B: Why the transfection control vector "EGFP only" significantly increases/decreases the BAG2/HPRT expression with respect to the negative control?

      Paragraph from line 410 to 434: very confusing, the reported results are not well conveyed and therefore not convincing. To be reformulated.

      Lines 524-526: the author's conclusion that: "...the loss of purine metabolic enzymes, including HPRT, disrupts neurogenesis, resulting in microcephaly, cell cycle defects, ciliopathies, and abnormalities in proliferation and neural progenitor fate decisions, mirroring the loss of WDR62." is not supported by the cited literature [29] and by the results presented in this work. Please provide additional references or remove the statement.

      Lines 527-529: if authors state that "...other WD repeat-containing and microcephaly-associated proteins interact with purine enzymes..." have to provide additional references in addition to the NWD1 one. Otherwise, these lines should be rephrased as "another WD repeat containing and microcephaly-associated protein...".

      Reference 62 is not well indexed in the Reference section. Please adjust.

      Referees cross-commenting

      This reviewer thinks that the points raised by reviewer #1 and #2 are very accurate and significant. Some of them are also shared between our three review reports and in general are referred to: clarity of the manuscript improvement, little consistency between the results displayed in the figures and the text/conclusions in some points, concerns about the reliability of some measurements/result and the employed cellular model, and the lack of endogenous protein data.

      Significance

      General assessment:

      The here described new role of WDR62 in purine metabolism and the proposed pathway are novel and relevant to shed light on pathophysiological cellular and molecular mechanisms that potentially underlie neurodevelopmental defects and carcinogenesis - processes in which WDR62 is implicated. The experimental design is extended and generally well-conceived even though quite dispersive in some points.

      The strength of the work resides in its versatility - making these findings potentially applicable to different cell types and different contexts (e.g., from neural development to malignancies) - and in the protein-protein interactions characterization under several conditions.

      Similarly, the major weakness is the generalist trait of the findings that describes WDR62 cellular behavior mostly in an over-expression system in an immortalized cell line, underestimating the intrinsic metabolic and protein expression-level differences among cell types.

      Advance:

      WDR62 is a scaffold protein with pleiotropic functions and a plethora of molecular interactors. Literature reports many molecular pathways involving WDR62 mainly in cell cycle progression, primary cilia biogenesis and centrosomal functions in a neurodevelopmental context. In the present work, authors describe mechanistic insights of a never reported WDR62-BAG2-HSP70/90 molecular pathway shedding new light on the role of this protein in cellular metabolism thus providing a new perspective on WDR62 pathophysiological functions.

      Audience:

      Basic research audience will be interested in this research work. The described molecular pathway involving WDR62 in purine metabolism might be relevant to other research on how WDR62 cellular and molecular dynamics are impactful on neural development and malignancies.

      Expertise:

      Human neural development and alterations, iPSCs, neural stem cells, CRISPR-Cas9

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The authors provide evidence to reveal the novel functions of WDR62 protein in maintaining the stability and activity of purine metabolic enzymes and overall purine homeostasis. WDR62 interacted with BAG2, and they are recruited to purinosome. WDR62 loss caused accelerated degradation of purine salvage enzyme HPRT, and led to the accumulation of purine nucleotide intermediates.

      While this study is compelling and significant for the field of neurodevelopmental disorders including microcephaly and purine metabolism, there are several concerns that should be addressed before publication.

      Major comments:

      • Although all experiments are conducted using non-neuronal cultured cells, does this phenomenon also occur in neuronal cells?
      • What is the interaction between endogenous WDR62 and Bag2? This is because in overexpression systems, multiple chaperones may interact with the target protein during protein folding.
      • Is endogenous WDR62 also present in the purinosome in purine depleted or sorbitol condition?
      • Regarding Fig6 and Fig7, when HPRT decreases and inosine accumulates in WDR62-KO condition, did the levels of hypoxanthine, xanthine, and uric acid change?
      • Does HPRT and the three microcephaly-associated WDR62 mutants also recruited in the purinosome in purine depleted or sorbitol condition?
      • In Fig7C, HPRT/tubulin ratio appears to decrease in WT from 0hr to 24h, but the graph does not show this decrease. Additionally, quantification of PFAS(FGAMS) and BAG2/tubulin should be performed.
      • Fig7D is problematic. HPRT in WDR62-KO cells seems to localize in the nucleus, possibly due to stronger exposure in KO conditions compared to WT. Also, the line scan is drawn in areas with low signal in WT. The comparison should be performed in areas with high perinuclear signal.
      • The localization of HPRT should be compared in WT and WDR62-KO with BAG2 siRNA. It is also possible to confirm whether the phenotypes observed in KO, such as cell proliferation and xanthosine/inosine levels, are rescued.
      • It should be considered that the induction of Bag2 in WDR62-KO might allow purinosome formation to proceed normally due to compensation. The co-localization of WDR62 and purine enzymes during purinosome formation should be compared when BAG2 expression is suppressed. Similarly, any changes in BAG2 localization in WDR62-KO should be examined. Furthermore, the purinosome formation ability should be compared in WDR62KO + Bag2 siRNA condition.
      • The reduction of HPRT in WDR62-KO should be examined for potential effects of enhanced degradation via the ubiquitin-proteasome system or the autophagy-lysosome system.
      • Although it is known that HPRT-KO mice do not exhibit any effects on normal brain development except in some dopaminergic neurons, what are your thoughts on this?

      Minor comments:

      • Please write the full name before the abbreviation of the gene.
      • There is no measurement data for Fig7C, and a measurement line is drawn only in one panel of the ROI.
      • The line 488 "Fig11" looks like a typo.
      • The table could not be found.
      • It is strange that all measurement values for WT or control in Fig2, Fig7, and FigS9 are exactly 1.0 without any variation. Please check the measurement method again.
      • Please write the method for purine depleted medium.

      Referees cross-commenting

      I concur with the accurate point observations by the other reviewers. The authors should address the most of the comments provided, as many of the suggested experiments are feasible. If the paper aims to elucidate the one of the causes of microcephaly, specifically, the issues related to cell type and endogenous proteins experiments need to be resolved, and addressing these issues would substantially enhance its quality and impact.

      Significance

      Most of the roles of purinosomes in the central nervous system remain unknown. The discovery that the WDR62/MCPH2 gene, responsible for microcephaly, is related to purinosomes will have a major impact on this field. Additionally, the ability to easily induce purinosomes through sorbitol phase separation is a significant technical advance in terms of cost and simplicity. Furthermore, many genes related to microcephaly, such as MCPH, are factors directly involved in cell division by regulating the mitotic spindle and centrosomes. This study has revealed a new role for WDR62, uncovering part of a novel molecular mechanism for microcephaly.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In Morris, M.J. et al., the authors strive to better understand the roles for the microcephaly protein WDR62 in brain growth and function. To accomplish this, an in situ biotinylation assay was performed and identified 42 proteins proximal to WDR62 including the Hsp70 co-chaperone BAG2. Through a series of co-immunoprecipitation assays, the BAG2-WDR62 interaction was shown to be mediated through the structured N-terminal region of WDR62, and it was proposed that common WDR62 mutations disrupt this interaction. In AD293 cells, loss of WRD62 expression resulted in an increase in the expression of BAG2 expression while reducing HPRT expression. Subsequent loss of BAG2 expression by siRNA treatment restored the expression of HPRT suggesting that there is an association between the stability of HPRT and BAG2, likely mediated through its proposed association with Hsp70/90 molecular chaperones. Finally, the authors investigate the subcellular localization and ability of WRD62 to phase separate. WRD62 was shown to form discrete condensates induced by sorbitol-mediated hyperosmotic stress. The formation of WRD62 granules are hypothesized to be driven by cell volume exclusion and macromolecular crowding. These granules appear similar, both in physical appearance and characteristics, to other reported biomolecular condensates such as those reported in metabolism (e.g. purinosomes). WRD62-containing condensates were shown to colocalize with enzymes in de novo purine biosynthesis; however, this association is not required for purinosome formation and/or its stability under both purine-depleted and sorbitol-driven growth conditions. Overall, the manuscript provided a previously unrealized and exciting association between WDR62 and purine metabolism.

      EVIDENCE, REPRODUCIBILITY AND CLARITY

      Summary: The current manuscript reads as multiple manuscripts with findings that are at times weakly connected (in my opinion). For example, I had a hard time understanding how the BioID results relate to the discovery of WRD62 phase-separation and its colocalization with purinosomes. I would strongly encourage the authors to consider dividing the results into separate manuscripts to strengthen their claims and create a more focused and cohesive manuscript (or series of manuscripts). I believe then several of my reservations associated with the current manuscript will be addressed, and in my opinion, the hard work from the authors will be better received across the scientific community.

      I would like to commend the authors for all the work that went into the current version of the manuscript. Being part of a biochemistry and cell biology research group, I completely understand how much time and effort must have went into generating these data. That being said, I felt that there were several instances where clarification and additional information is warranted to arrive at the conclusions made by the authors. These points are outlined below.

      Major Comments:

      1. There appears to be a discrepancy between the data presented in Figure 1 and what is stated in the main text. Clarification is necessary to better understand the results:
        • The following statement (and derivatives of it) are repeated throughout the manuscript: "...we found that the WDR62 interactome comprised molecular chaperones such as HSP70, HSP90, and their co-regulators, BAG2, STIP1, and DNAJC7" (lines 91-93, 316-318, 422-425). STIP1 and DNAJC7 were not identified in the list of 42 proximal proteins to WDR62 (Figure 1D). DNAJC7 was included because of a previous report curated in the BioGRID database, and there is no mention of HSP90 in the data produced in Figure 1. Please revise the main text to reflect the data that was generated.
        • Based on the data presented in the Venn Diagrams in Figure 1D, the author's numbers do not seem to be consistent with the sentence on lines 126-128. I count 37 proteins unique to their BioID study, 90 unique to the BioGRID database, and 5 proteins that overlap between the two data sets. This sentence needs to be revised.
        • What data were used to generate the interaction map in Figure 1I? Enzymes tied to purine metabolism were not identified from the data presented in Figure 1D but have now appeared. A discussion of this in the main text is warranted.
      2. This reviewer has several reservations on how the various key players in the manuscript are related to substantiate the conclusions made in the manuscript. For instance, how is HPRT, purinosomes, and WDR62 related? How about HSP90, WRD62, and HPRT? Pairwise connections were made throughout the manuscript; however, trying to tie all three together is difficult with the data presented.
        • The authors tried to connect HPRT, purinosomes, and WDR62 with BAG2; however, this study could greatly improve if we understood how a knockdown of BAG2 impacts purinosome formation and/or WDR62 colocalization with purinosome enzymes.
        • Is HPRT a client of HSP90? And how are WRD62 and HSP90 related since they do not associated (based on your BioID data)? These connections would again strengthen the arguments made in the manuscript and help to explain the HSP90 inhibition data presented in Figures 7F and 7G.
      3. Caution is warranted when making conclusions about WDR62 (and its granules) and purinosomes.
        • The authors describe the association between WDR62 and purinosomes differently throughout the text. I would recommend that the authors come to some conclusion about this and be consistent.

      A. (Lines 339-340) "WDR62 granules represent or overlap substantially with the phase-separated metabolons known as purinosomes". Based on the data presented, it appears that these might still be different entities but either overlap or have similar components. Purinosome localization with mitochondria (approx 60-80%) and microtubules (approx 90-95%) were significantly higher than those reported for WDR62 granules (approx 40%). This comparison would suggest that not all WDR62 granules behave similarly to purinosomes. And from the dot plot in Figure 3G, about half of the characterized WDR62 granules do not align with the previously reported characteristics of purinosomes.

      B. In the abstract and introduction, the authors state that WDR62 is being recruited to the purinosome and leave out the other possibility. I would recommend that the authors soften this claim in these sections because of the above possibility but also the lack of characterization of the sorbitol-induced "purinosomes". There is little discussion or evidence for how sorbitol induces purinosome formation. Is de novo purine biosynthesis activated upon sorbitol treatment? Are multiple de novo purine biosynthetic enzymes present in the sorbitol-induced "purinosomes"? Further, I agree that there is a tendency for WDR62 to associate with condensates that bear an enzyme within de novo purine biosynthesis; however many of these proteins are known to self-aggregate upon cell stress. Therefore, the entities that are being observing and called purinosomes might not be bone fide purinosomes. Additional care is necessary to make these statements. In my opinion, the current data only suggests that this might be a possibility.

      • (Lines 325-329) The authors reference a previous manuscript demonstrating that co-chaperones co-cluster with purinosomes. Based on this fact, they infer that WDR62 granules might represent purinosomes since WDR62 interacts with these same set of co-chaperones. These co-chaperones interact with a large number of different proteins (in fact, most kinases), so it is uncertain how the authors decided to go down this path to link purine metabolism with WDR62. Discussion of how this connection was made would help elevate the story. What additional insights did they have that lead them down these investigations?
      • If WDR62 is not required for purinosome formation, why would it localize with the purinosome? Is there any hypothesis that could be readily tested to better help understand this observation? Providing a better understanding of this would greatly elevate the work.

      A. (OPTIONAL) Please validate that the associations between WDR62 and the purine biosynthetic enzymes occur on the endogenous level (void of transient transfection). Many methods such as immunofluorescence and proximity ligation assays have been used by others to demonstrate protein-purinosome interactions. This result would reduce any concern that the association is a result of overexpression (artifact).

      B. Figures 6F and 6G conclude that nucleosides from purine-depleted growth conditions accumulate while the corresponding monophosphates do not change between WRD62 knock-out and wildtype cells. Given that purine-depleted growth conditions activate de novo purine biosynthesis (uncertain if this has been demonstrated in AD293 cells), could this result simply demonstrate that purine salvage is no longer used and the nucleosides have accumulated and are awaiting degradation (or exportation) rather than a loss of HPRT expression as inferred from the stated conclusions? The conclusions could be better substantiated with the use of a stable isotope incorporation assay.

      Is there a difference in the contribution of de novo purine biosynthesis and purine salvage to the generation of the monophosphates (AMP, GMP) between WDR62 knockout and wildtype AD293 cells? Use of a stable isotope (such as 15N-glutamine) could help to come to the appropriate conclusion.

      (Lines 483-485) If there is a change in de novo purine biosynthesis, are there any detectable changes in AICAR levels that might influence purine metabolism at the transcriptional level?

      Are the data and the methods presented in such a way that they can be reproduced? Are the experiments adequately replicated and statistical analysis adequate?

      1. For purine-depleted studies (metabolite analyses, microscopy), how long were the cells grown in purine-depleted medium before the analysis? And how was the purine-depleted medium generated? Please reference any source that might have been used.
      2. Details describing the BioID experiment are minimal. How many replicates were performed, was label-free or TMT quantitation used for the protein identification. Further the data analysis and mining of the proteins from the BioID study are missing - What database was used to identify the proteins from the peptides? Please include this information in the Materials and Methods section as well as a link to a repository where the LC-MS/MS data generated can be found. Additionally, it would be very helpful to have a spreadsheet or table that lists the biotinylated proteins and expectant or p values for each.
      3. Please include information about the streptavidin pulldown presented in Figure 1C.
      4. Many of the figure legends could benefit from a statistical description.
      5. There seems to be only two data points for Figure S3A. While there is no significant difference observed, it would be ideal to have additional replicates.
      6. (Figure 5I) Please provide statistical analysis to demonstrate the colocalization between FGAMS and WDR62 is robust in purine-depleted AD293 cells.

      Minor Comments:

      Do you have suggestions that would help the authors improve the presentation of their ideas and conclusions?

      1. The use of HSP90 inhibitors is a little confusing given the connections being made with BAG2 and other HSP70 co-chaperones in Figure 1.
        • Does the same conclusions hold true with an HSP70 inhibitor or siRNA?
        • (OPTIONAL) There are a lot of discrepancies between Hsp90 inhibitors and effective treatment concentrations. For example, NVP-AUY922 caused purinosomes to disassemble whereas STA9090 cause purinosomes to change morphology and adopt a more aggregated state. Do other Hsp90 inhibitors share the same phenotypic response as NVP-AUY922 in this study?
        • The treatment time (24 h) with NVP-AUY922 is very long. Given that Hsp90 interacts with hundreds of proteins, it is hard to understand whether the effect of Hsp90 inhibition is direct or indirect. Shorter times (1 h or less) would be more insightful.
      2. (OPTIONAL) Does the 2.6-fold increase in BAG2 increase its association with WDR62?
      3. Is the degradation of HPRT occurring through BAG2-mediated proteasomal degradation? Showing HPRT recovery by treating the cells with MG132 along with CHX would provide meaningful clues as to how BAG2 and HPRT might be related - Is BAG2 concentration increasing to facilitate the enhanced degradation of HPRT?
      4. Does HPRT colocalize with WDR62 in cells?
      5. (OPTIONAL) It would be nice to see validation experiments of some of the hits in Figure 1D or 1E in a co-immunoprecipitation experiment conducted similar to Figure 1C.
      6. The authors presented the findings that suggest that BAG2 interacts differently with commonly observed WDR62 mutations in MCPH2? How do these mutations affect WDR62 condensation, colocalization with purinosomes, or alter HPRT activity? Tying back the observations to something clinical would help elevate the overall significance of the findings.

      Are the text and figures clear and accurate?

      1. There are many times throughout the manuscript that the wrong figure is being referenced. These mistakes caused significant confusion at many times while reviewing the manuscript. Please double check all in-text references to figures. For example, I believe that you meant to use Figure S1C instead of Figure 2E with the statement on lines 183-185. Again, I believe that correct figure reference on line 501 is Figure 7G not Figure 7E.
      2. The figure legend on Figure S4 does not match the figure and the main text references. Please verify that the text in the figure legends correspond correctly to the figure.
      3. Please provide this data for the sentence on lines 399-400 in the supplemental file.
      4. I believe that the authors use the phrase "cell proliferation" to describe cell viability in the main text. In the Materials and Methods section, the authors state "The XTT cell proliferation assay enables quantification of cellular redox potential, providing a colorimetric readout of cell viability." Cell proliferation, viability, and cytotoxicity are different measurements, so please revise to reflect the correct experiment that was performed.

      Other Minor Comments:

      1. Move the sentence "In contrast, despite reduced mRNA..." (lines 387-388) to the last section when a reduction in PFAS expression was first mentioned.
      2. Please reference the following in the manuscript:
        • BioGRID database in the main text and Materials and Methods section
        • The reported study showing the DNAJC7-WDR62 interaction (as curated from BioGRID)
        • Fiji in the Materials and Methods section
      3. (Line 461-463) The authors state the following: "the loss of WDR62 leads to an increase in BAG2 and vice-versa (Fig. 7A) (Fig. S9B). I am not sure that the vice-versa (i.e. loss of BAG2 increases WDR62) is true. From the data presented in Figure 7H, I do not see a significant change in WDR62 expression upon BAG2 siRNA treatment.
      4. For your BioID study, do you know how many or the proportion of cells that were mitotically arrested with the low dose of nocodazole (200 ng/mL)? Given the small number of unique proteins that were in the mitotic only population, it is curious to know how enriched the cells were and whether WDR62 localization is important in the context of this study.
      5. Just to clarify, the WDR62-HA lane (third in each set) in Figure 1C is not WDR62-BirA*-HA and that it is only being used as a control.
      6. In the Discussion (lines 439-441) "We also show that WDR62 forms dynamic, phase-separated granules that co-localise with chaperones and purine metabolic enzymes, resembling purinosomes." I believe that the authors meant to say co-chaperones instead of chaperones given no microscopy data was presented showing the colocalization of HSP70/90 with WDR62 granules. Please revise.

      Referees cross-commenting

      I agree with the comments and recommendations by the other reviewers. Many of our shared comments are those that need to be addressed to substantiate the claims made by the authors throughout the manuscript. The proposed experiments across the reviewer comments appear feasible given that similar experiments have already been presented in this version of the manuscript. I strongly encourage the authors to consider these comments when revising their manuscript to help strengthen their claims and boost its overall significance and impact.

      Significance

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. Place the work in the context of the existing literature (provide references, where appropriate).

      The work presented explains a previously unknown role for WDR62 in the regulation of purine metabolism. Despite all the hard work that was performed to reach their conclusions, the use of the AD293 cell line and the lack of correlating the common WDR62 disease-promoting mutations to the observed findings throughout the entire manuscript slightly reduced my enthusiasm for this work. The presented study leverages a lot of existing literature to establish connections between WR62, co-chaperones, and purine metabolic enzymes, with an emphasis on purinosome metabolon, a condensate comprised of the enzymes in de novo purine biosynthesis.

      State what audience might be interested in and influenced by the reported findings.

      The audience that might be interested in the reported findings would likely be those tied to biomolecular condensates in cellular metabolism and their connection to disease. I also feel that researchers that study microcephaly might be interested in this work. In my opinion, I believe that a broader readership could happen if additional studies were performed to make stronger connections between studies presented.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      My field of expertise is tied to understanding the regulation of cellular metabolism through the use of biochemical and biophysical techniques. I am not as familiar with the in depth details of proteomic analysis such as those required for accurate reporting of data tied to protein proximity labeling (BioID) methods.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      The authors wish to thank the reviewers for fair and constructive comments and Review Commons for facilitating the process.

      2. Point-by-point description of the revisions

      Point-by-point replies to reviewers' comments on the original submitted manuscript are below. Authors' responses are in plain font.

      Reviewers' comments:

      Reviewer #1

      (Evidence, reproducibility and clarity (Required)):

      Summary: The authors identify cancer-associated ERBB4 mutations that are selected for functional characterization. Utilizing the BaF3 and MCF10A models, the authors investigate the potential oncogenic role for 11 recurrent ERBB4 mutations. Three mutants (S303F, E452K and L798R) were strongly transforming with the ability to transform both cell models, S303F being unique in its ability to transform both models in the absence of NRG-1. The authors perform modeling to decipher potential mechanisms of action of the ERBB4 S303F, E452K and L798R mutations. The authors assess the ability of HER3 mutations to dimerize with other HER family members and demonstrate that ERBB4 S303F can mediate its activating functions by stabilizing homo- and heterodimers with other ERBB receptors and that the heterodimerization is likely cell/tissue context dependent. The authors demonstrate that transforming ERBB4 mutants are sensitive to pan-ERBB inhibitors and drive resistance to EGFR-targeted therapy in EGFR-mutant NSCLC cells.

      Major comments:

      Patient data analysis is performed in more than 15 months ago in January 2024. This analysis should be updated.

      We thank the reviewer for pointing out the aspect of constantly expanding mutation data in clinical cancer sample databases. We reanalyzed the patient data in cBioPortal (data download 02 May 2025). In this new analysis, the distribution of mutations in ERBB4 did not change (Reviewer only Fig. 1A), and the 18 selected mutations were still the most recurrently mutated ERBB4 mutations (Reviewer only Fig. 1B). Reanalysis of updated patient data did not change the initial rationale of the study, or the conclusions in the submitted manuscript.

      Reviewer only Figure 1. Comparison of patient data derived from cBioPortal on January 2024 (01/2024) or May 2025 (05/2025). A) Figure 1B of the original submitted manuscript. B) Supplementary Figure S1C of the original submitted manuscript.

      The rationale for selecting the mutations to be studied is not entirely clear. There are no references to support studying mutations in Fig 1B red boxes.

      We apologize for not being sufficiently clear on our rationale for selecting the mutations for analysis. The spectrum of mutations across the ERBB4 gene do not demonstrate clear hotspots as seen in for example EGFR, KRAS, or BRAF. However, we observed that there are regions (not necessarily individual amino acid changes) in ERBB4 that seem to accumulate more mutations than other regions. Looking more closely, we observed that these "hot regions" tend to be located in areas where activating mutations have been described for other oncogenic ERBB family members and/or target structurally important regions for receptor activation such as dimerization interfaces. We hypothesized that these characteristics would suggest functional relevance for the mutations in these "hot regions". In the revised manuscript (on page 11), we have revised the text describing the selection of mutations for further analysis, and added references to justify our selection:

      "While the missense mutations were distributed across the 1,308 amino acid sequence of ERBB4, lacking obvious hotspot mutations such as observed for example in EGFR or KRAS, clusters of recurrent mutations could be identified (Fig. 1B). These clusters tended to be located in specific regions that are targeted by activating mutations in other oncogenic ERBB family members (Greulich et al., 2005, 2012; Lee et al., 2006; Bose et al., 2013; Jaiswal et al., 2013)and/or are important for receptor activation (Ferguson et al., 2003; Bouyain et al., 2005; Liu et al., 2012), suggesting functional relevance (red boxes in Fig. 1B). Some recurrent mutations were located in the unstructured C-terminal tail of ERBB4 (Fig. 1B). We selected in total 18 ERBB4 missense mutations (indicated in Fig. 1B) that were recurrent and/or located in the abovementioned regions of interest for functional characterization (indicated in Fig. 1B and Supplementary Fig. S1C) - hypothesizing that these mutations would be actionable. Of the different mutants at the same position of ERBB4 amino acid sequence, the most recurrent amino acid change was selected for characterization."

      Cell proliferation should be shown for BaF3 cells for continuity in Figure 2 instead of doubling time.

      We agree that it may cause confusion that the results for the Ba/F3 and MCF10a experiments in Fig. 2C and D (Fig. 2D and E in the revised manuscript) are reported using a different metric. The reason for this is that these assays measure different outputs: in the Ba/F3 assay, the emergence of proliferating cells under IL3 deprivation is measured, with repeated cell viability measurements over time. In the MCF10a experiment, the ability of ERBB4 mutations to sustain the proliferation of MCF10a cells in the absence of EGF is measured, using a fixed time point (8 days). Thus, doubling time, as an indicator for the time required for the emergence of proliferating cells, is more suitable metric to quantify the relative transforming capability of the different ERBB4 mutations in the Ba/F3 cells. In the case of MCF10a cells, the relevant metric is the cell viability (as a surrogate marker for the number of cells) at the endpoint measurement.

      The relative expression of HER3 constructs must be shown for BaF3 and MCF10A cells in Figure 2.

      We assume the reviewer is asking to demonstrate the expression levels of different ERBB4 mutants in the Ba/F3 and MCF10a cells used in experiments in Fig. 2C and D (Fig. 2D and E in the revised manuscript). We would like to thank the reviewer for this very relevant point. Western blots demonstrating the expression levels of different ERBB4 mutants in the Ba/F3 and MCF10a cells have now been added as a data new panel in the Figure 2 (Fig. 2B in the revised manuscript). No ERBB3 expression constructs were introduced into the cells.

      Blots in Figure 4 must be quantified.

      The blots in Figure 4 have now been quantified, and the relative signal intensities are shown below each blot. We thank the reviewer for suggesting this relevant analysis. The analysis revealed two issues that we have now revised:

      1) in Fig. 4D, the dimerization of EGFR with ERBB4 S303F is not convincingly increased when compared to EGFR dimerization with wild-type ERBB4. Therefore, we have omitted that conclusion from the results section:

      "Taking into account these expression level differences, ERBB4 S303F did indeed co-immunoprecipitate more efficiently than wild-type ERBB4 with ERBB2 and EGFR both in the presence or absence of NRG-1 (Fig. 4D), demonstrating that the S303F mutation promotes the formation of ERBB heterodimers."

      Omitting this data does not change our final conclusion, that the ERBB4 S303F mutation leads to enhanced ERBB4 heterodimerization.

      2) In Fig. 4C, the previously published ERBB4 D595V mutant, used as a control in the experiment, does not clearly demonstrate enhanced ERBB4 homodimerization after quantifying the blots. Therefore, we have cropped the lanes representing the ERBB4 D595V mutant from the blot, and omitted the part of the results text that discusses this ERBB4 mutant:

      "ERBB4 homodimers were assessed by crosslinking cell surface proteins with a cell membrane impermeable BS3, enabling detection of ERBB4 dimers as high molecular weight species of ERBB4 in western blot. Another activating extracellular ERBB4 mutation, D595V, was used as a positive control, as we have previously demonstrated D595V to stabilize ERBB4 dimers using the same assay (Kurppa et al., 2016). As predicted by the structural analyses, S303F resulted in more abundant active, phosphorylated ERBB4 dimers than wild-type ERBB4 in the presence of NRG-1, while the activating intracellular domain mutation L798R, that served as a negative control for dimer stabilization, did not (Fig. 4C)."

      Omitting these data does not change our final conclusion, that the ERBB4 S303F mutation leads to enhanced ERBB4 homodimerization.

      There are major concerns with Supplemental files. It is imperative that the effectiveness of HER3 shRNA be shown in S Fig3. These data are not interpretable without this.

      We apologize for confusion related to the supplemental files. The effectiveness of the ERBB3 (HER3) shRNA is shown in the Supplementary Figure S3B of the original submitted manuscript.

      Lanes in S Fig 4 are not marked again making data not interpretable.

      Some of the lanes in the Supplementary Figure 4B were not marked because the experiment contained other ERBB4 constructs in addition to the ones that are marked and discussed in the manuscript text. The reason for leaving the unmarked lanes in the final figure was to emphasize that the bands indicated come from the same membrane, blot and exposure. We understand how this may cause confusion, and thus have now cropped the blots to include only the lanes discussed in the manuscript text.

      It's unclear why Table 1 is included as this is already published data. This previously published data should be summarized in the text.

      We are happy to elaborate the novelty of the data in Table 1 of the original submitted manuscript. The data is from the SUMMIT trial (NCT01953926) (Hyman et al., 2018), the results of which have been published. However, the three patients in the top part of the table were enrolled to the SUMMIT trial based on the ERBB4mutation in their tumor, and the data for these patients have not previously been published. We received these data directly from Puma Biotechnology. In addition, while the ERBB4 mutation status for the patients in the lower part of the table has been published in the supplementary files of the Hyman and others publication, we feel that the patients' ERBB4 mutations merit discussion, and including these patient data in the table would complement the data on the three patients in the top part of the table. Due to these reasons, we feel that the table contains unpublished and relevant data for the study, and would like to keep the table in the manuscript by moving it into the Supplementary Data (Supplementary Table S2).

      To clarify the sources of the patient data, we have modified the methods section related to the table as follows:

      "Neratinib efficacy data, cancer types and co-alterations of patients harboring an ERBB4 alteration, enrolled in PUMA-NER-5201, the SUMMIT trial (NCT01953926), and treated with neratinib as a single agent (240 mg/day) were obtained from Puma Biotechnology (for patients enrolled based on an ERBB4 mutation - previously unpublished data) and cBioPortal (for patients with ERBB4 as a co-altered gene, enrolled based on an ERBB2 or ERBB3 mutation)."

      This text is now moved to "Supplementary Methods" under a new section "Neratinib efficacy in patients" on page 9 of the revised Supplementary Data -file

      There is a disconnect why the last two figures focus on a single model of NSCLC whereas the three most transforming mutations are found most commonly in breast, melanoma and GI tract cancers.

      The reviewer is correct in that the most transforming ERBB4 mutations are indeed found most commonly in beast and esophagogastric cancers and in melanoma. However, in the context of targeted therapy resistance,mutations that confer resistance are often acquired during therapy, and may not represent the typical cancer type-specific mutational patterns. The strongest evidence for a potential role of mutant ERBB4 in therapy resistance comes from the context of EGFR-targeted therapies and lung cancer. As mentioned in the results and discussion sections of the submitted manuscript, ERBB4 mutations identified in patients who developed resistance to EGFR-targeted therapy (Cremolini et al., 2019; Jänne et al., 2022), include the same mutation or mutation in the same residue as analyzed in the current study: the strongly transforming S303F or L798I. In addition, a recent study showed that EGFR-mutant lung cancer patients with co-occurring ERBB4 mutations have shorter relapse-free survival on osimertinib treatment (Vokes et al., 2022). Therefore, we focused on EGFR-mutant lung cancer as the model system to assess, as proof-of-concept, whether activating, transforming ERBB4 mutations are able to confer resistance to EGFR-targeted therapy. To make the transition to cancer therapy resistance and the rationale for choosing the model context more clear, we have added text to the start of the "Activating ERBB4 mutations drive resistance to EGFR-targeted therapy in EGFR-mutant NSCLC cells" -chapter of the revised manuscript:

      "There is emerging evidence associating ERBB4 with cancer therapy resistance across various cancer types and treatment regimens (Merimsky et al., 2001, 2002; Mendoza-Naranjo et al., 2013; Nafi et al., 2014; Saglam et al., 2017; Wege et al., 2018; Wang et al., 2019; Zhang et al., 2023; Debets et al., 2023; Albert et al., 2024; Arribas et al., 2024), including ERBB4 mutations that have been found in patient tumors after acquisition of therapy resistance (Cremolini et al., 2019; Jänne et al., 2022; Vokes et al., 2022; Yaeger et al., 2023; Yuan et al., 2023). Intriguingly, the ERBB4 mutations identified in patients who developed resistance to EGFR-targeted therapy (Cremolini et al., 2019; Jänne et al., 2022), include the same mutation or mutation in the same residue as analyzed in the current study: the strongly transforming S303F or L798I. In addition, co-occurring ERBB4 mutations in EGFR-mutant lung cancer patients have been shown to associate with shorter progression-free survival on EGFR inhibitor therapy (Vokes et al., 2022). These observations point to the possibility that mutant ERBB4 could promote resistance to targeted therapies."

      What are the differences in the recurrent ERBB4 mutant tumors versus ERBB4 wild-type tumors described in Figure 7?

      The reviewer points out a very relevant question. We suspect that in the tumors expressing mutant ERBB4, the activating ERBB4 mutants are able to compensate for the loss of EGFR signaling, particularly since the on-treatment cancer cells demonstrate elevated levels of ERBB4 ligands (Fig. 7C, D). This is analogous to accumulating evidence suggesting that ERBB4 independently and together with ERBB3 (and/or with increased availability of their ligands) compensate for survival and growth signaling upon ERBB2- or EGFR-targeted therapy (Carrión-Salip et al., 2012; Wilson et al., 2012; Nafi et al., 2014; Canfield et al., 2015; Yonesaka et al., 2015; Donoghue et al., 2018; Shi et al., 2018; Debets et al., 2023; Udagawa et al., 2023). Unfortunately, we are unable to approach this hypothesis using samples from the in vivo experiment in Fig.7. The treatment of the mice was stopped after 189 days of treatment in order to assess how many tumors grew back (i.e. how many mice were cured by the treatment). For this reason, we do not have the appropriate controls to analyze ERBB4 mutant-associated changes in on-treatment tumors.

      Figure 7C, D should be moved to supplemental as this is from previously published data and not strictly relevant to data shown in Fig 7.

      The data shown in Fig. 7C and D are a re-analysis of published single-cell RNA-seq data. While the single cell RNA-sequencing data set is previously published, the analysis of ERBB4 ligand expression performed, and shown in Fig. 7C and D has not been published before. We feel that these data provide evidence of a previously unrecognized upregulation of ERBB4 ligand expression in on-treatment EGFR-mutant NSCLC cells in vivo. Furthermore, as discussed in the results section of the original submitted manuscript (page 26; page 28 of the revised manuscript), the upregulation of ERBB4 ligands in the on-treatment tumors provides a plausible mechanism supporting mutant ERBB4 activation upon EGFR inhibitor treatment, as the transforming ERBB4 mutants seem to retain at least partly the dependency of ligand stimulation. Thus, we feel that these data are unpublished and relevant for the manuscript, and we would like to keep these data panels in the main Figure 7.

      Limitations should include consideration of endogenous levels of ERBB4 in the model systems used and disparate expression levels of wt ERBB4 versus ERBB4 mutation.

      We thank the reviewer for pointing out that we have not thoroughly disclosed the endogenous levels of ERBB4 expression in the used model systems. None of the used model systems (MCF10a, Ba/F3, COS-7, PC-9) express detectable levels of ERBB4 protein. This was mentioned in the original submitted manuscript for COS-7 (page 19; page 20 of the revised manuscript), Ba/F3 cells (page 18; page 19 of the revised manuscript), and PC-9 cells (page 24; page 24 of the revised manuscript), but not for MCF10a cells. We have now made this point more clear, and added a sentence "Neither of these models express detectable levels of ERBB4" in the results section under the chapter "Majority of the recurrent ERBB4 mutations are transforming in Ba/F3 or MCF10a cells" (page 12-13 of the revised manuscript), as well as to the discussion section (page 30 of the revised manuscript).

      Regarding the expression levels of different ERBB4 mutants versus ERBB4 wild-type, we have now added the new Figure 2B, showing the expression of all ERBB4 mutants and ERBB4 wild-type in Ba/F3 and MCF10a cells. We have also included the following text describing the expression levels of ERBB4 mutants in the results section under "Majority of the recurrent ERBB4 mutations are transforming in Ba/F3 or MCF10a cells" (page 13 of the revised manuscript):

      "The different ERBB4 mutants demonstrated similar expression levels compared to wild-type ERBB4 in both model systems with the exception of R106C and G907E mutants that were expressed predominantly as immature receptor forms in both models, suggesting defective receptor maturation. Also, the R1304W mutant demonstrated lower expression levels in the Ba/F3 cells, and could not be expressed at all in the MCF10a cells (Fig. 2B)."

      Minor comments:

      Fig1B lists ERBB3 V104V mutation?

      Thank you for noticing this mistake. This has now been corrected in the revised Figure 1B.

      List frequency of ERBB4 mutations in the introduction

      We thank the reviewer for the suggestion and have revised the introduction to include an example of the high frequency of ERBB4 missense mutations in cancer as follows:

      "Yet, despite the high frequency of ERBB4 missense mutations in various cancer types (up to 30% in non-melanoma skin cancer, Supplementary Fig. S1A, B) and characterization of several potentially oncogenic ERBB4 mutations (Prickett et al. 2009; Nakamura et al. 2016; Chakroborty et al. 2022; Kurppa et al. 2016; Tvorogov et al. 2009), the rationale for clinically targeting ERBB4 in cancer has not been fully developed."

      Clarification throughout if cells are serum-starved (how long) if stimulated with NRG-1

      We thank the reviewer for the thoughtful suggestion and have revised the main text and figure legends accordingly; in the revised manuscript on pages 6, 8, 9, 13, 17, 20, 25 and 26 "(10% serum)", on page 25 "following short-term stimulation with NRG-1 after overnight serum starvation (Fig. 6A).", as well as figure legends of Fig. 2, 4, 5, 6, S2, and S3.

      Reviewer #1 (Significance (Required)):

      General assessment: This work fills a gap in cancer research understanding if ERBB4 mutations could be targeted. Concerns and comments need to be addressed before definitive conclusions can be made.

      The authors wish to thank the reviewer for the positive assessment.


      Reviewer #2

      (Evidence, reproducibility and clarity (Required)):

      Ojala et al. report a very extensive exploration of the functional relevance of somatic mutations occurring in the ERBB4 gene. The Authors demonstrate that 11 out of 18 mutations they studied have oncogenic potential, with some of them actionable using clinically available ERBB inhibitors, while giving resistance to EGFR inhibitors.

      A very minor comment. At the beginning of page 21, I'd not define PD as the best respone. The Authors can write that all four patients progressed under treatment.

      We would like to thank the reviewer for the comment. We agree with the reviewer, and have now revised the sentence in question as follows:

      "Two of the three patients that were qualified for the SUMMIT trial due to a mutation in ERBB4, with no other qualifying mutations in ERBB family genes, had an ERBB4 mutation characterized in this study to be transforming (R544W and V840I) (Supplementary Table S2). Yet, neither of these patients, nor the patient with an ERBB4 VUS N465K, responded to neratinib and progressed under treatment (Supplementary Table S2)."

      Reviewer #2 (Significance (Required)):

      The work by Ojala et al. is the most detailed study of mutations occurring in ERBB4. Since these are relatively rare, they have not been properly studied up to now. The study is very well done.

      The authors wish to thank the reviewer for the very positive statement.


      Reviewer #3

      (Evidence, reproducibility and clarity (Required)):

      Summary - This work has mined cBioPortal to identify candidate cancer driver mutations in the gene encoding the ERBB4 receptor tyrosine kinase (Figure 1). These ERBB4 mutations occurred in clusters that are paralogous to activating mutations in other ERBB receptor genes or in clusters predicted to serve as dimerization interfaces of ERBB4. Eighteen such ERBB4 mutations were selected for characterization.

      • These mutants were tested in BaF3 and MCF-10A cells in the context of the ERBB4 JM-a CYT-2 isoform (Figure 2). Several of these ERBB4 mutants exhibited greater agonist-dependent coupling to cell proliferation than wild-type ERBB4. Moreover, some of the mutants exhibited greater agonist-independent coupling to cell proliferation than wild-type ERBB4. Five ERBB4 mutants (S303F, E452K, L798R, R992C, S1289A) exhibited greater activity in the BaF3 cells, whereas nine ERBB4 mutants (S303F, R393W, E452K, R544W, R711C, S774G, L798R, V840I, G870R) exhibited greater activity in the MCF10A cells. Thus, eleven of the ERBB4 mutants (S303F, R393W, E452K, R544W, R711C, S774G, L798R, V840I, G870R, R992C, S1289A) exhibited a gain-of-function phenotype. It should be noted that several of the ERBB4 gain-of-function mutants (R393W, R544W, R711C, V840I, G870R, R992C, S1289A) exhibited cell type specificity.

      • PyMol was used to "model" the effect of the most potent (S303F, E452K, and L798R) gain-of-function mutations on the structure of ERBB4 (Figure 3). These three mutations are predicted to cause increased ERBB4 dimerization.

      • When expressed in MCF-10A cells, the most potent (S303F, E452K, and L798R) gain-of-function ERBB4 mutants exhibited elevated ligand-dependent and ligand-independent tyrosine phosphorylation. This was accompanied by elevated EGFR, ERBB2, and ERBB4 tyrosine phosphorylation and elevated signaling by canonical effector proteins (Figure 4).

      • The homo- and heterodimerization of the most potent ERBB4 mutant (S303F) was studied following transient transfection of COS-7 cells (Figure 4). As predicted, the S303F mutant exhibited greater ERBB4 homodimerization and greater heterodimerization with EGFR and ERBB2, but not with ERBB3.

      • The data from the clinical trial NCT01953926 was mined to evaluate whether the presence of an ERBB4 activating mutation found in this work is associated with sensitivity to the pan-ERBB inhibitor neratinib (Table 1). Surprisingly, a compelling association was NOT found. In contrast, the proliferation of BaF3 cells that express gain-of-function ERBB4 mutants is sensitive to the irreversible pan-ERBB inhibitors neratinib, afatinib, and dacomitinib (Figure 5).

      • Mining the cBioPortal, AACR GENIE, and COSMIC datasets indicates that the three most potent ERBB4 gain-of-function mutants (S303F, E452K, and L798R) exhibit tissue specificity (Supplementary Figure S5). Moreover, the S303F mutation is coincident with a mutation in another ERBB receptor to a much lesser degree than other gain-of-function ERBB4 mutants, particularly E452K. This too is suggestive of differences in the mechanism of action among the gain-of-function ERBB4 mutants (Supplementary Figure S5).

      • To test the effect of ERBB4 gain-of-function mutants on resistance to EGFR inhibitors, PC-9 NSCLC cells (which contain an endogenous gain-of-function EGFR mutant but do not endogenously express ERBB4) were transduced with ERBB4 gain-of-function mutants. In these cells the S303F and L715K mutants exhibited elevated ERBB4 signaling, but the L798R and K935I mutants did not. Nonetheless, the S303F, E715K, and K935I mutants promoted osimertinib resistance upon long-term treatment in vitro, whereas the L798R mutant did not (Figure 6). Moreover, the E715K and S303F mutants caused osimertinib resistance in vivo.

      • Overall, this is an impressive body of work. The experiments have been carefully performed and the data are clearly presented. However, the breadth of this work makes it a bit unfocused and difficult to digest.

      The authors wish to thank the reviewer for the positive statement.

      Major Issues Affecting the Conclusions

      The COS-7 data in Figure 4 are probably generated using supraphysiological levels of ERBB4 expression, raising concerns about the ability to draw general conclusions from these data. This issue should be addressed.

      We appreciate the reviewer's insight on the details concerning experimentation in COS-7 cells. We acknowledge the drawbacks in experiments performed using transient overexpression of proteins in COS-7 cells using vectors with strong viral promoters. To mitigate these drawbacks, we routinely perform transient overexpression in COS-7 cells using the retroviral pBABE-vectors, which have a weak promoter and produce relatively moderate protein expression level. We have included here a reviewer-only figure (Reviewer-only Figure 2) that demonstrates the ERBB4 expression level derived from the pBABE-vector, compared to endogenous expression level of ERBB4 in T47D and MCF7 cells, as well as to ERBB4 expression derived from pcDNA3.1 vector that harbors a strong viral CMV promoter. With this, we hope to convince the reviewer that the ERBB4 expression levels in our COS-7 cell experiments are not supraphysiological.

      Reviewer-only Figure 2. The expression level of ERBB4 in T47D and MCF7 cells, as well as in COS-7 cells transiently transfected with equal amounts of pBABE-puro-gateway-ERBB4JM-aCYT-2 plasmid, or pcDNA3.1.-ERBB4JM-aCYT-2 plasmid.

      The inhibitor data shown in Figure 5 may be over-interpreted. The affinity of neratinib, afatinib, and dacomitinib for EGFR is reportedly higher than the affinity of these drugs for ERBB4. Thus, the failure of ERBB4 gain-of-function mutants to cause resistance to these inhibitors may be because the inhibitors bind to endogenous EGFR and therefore fail to bind to ERBB4.

      We thank the reviewer for the insightful comments. The experiments in Figure 5 were performed in Ba/F3 cells, which do not express endogenous EGFR, or other kinase competent ERBB receptors (Riese et al., 1995). Therefore, it is unlikely that the observed cellular responses to neratinib, afatinib, or dacomitinib are affected by the drugs' preferable binding to EGFR.

      Moreover, the conclusion that the gain-of-function ERBB4 mutants are targetable with these inhibitors appears to be an overreach.

      We have revised our conclusion into that ERBB4 mutants are "sensitive to" these inhibitors, as supported by our data in Figure 5. This revision has been made in the abstract (page 2), introduction section (page 4), results section (page 23), and in the discussion (page 31) of the revised manuscript.

      The inhibitor data shown in Figure 6 demonstrates that activating ERBB4 mutations are sufficient to drive inhibitor resistance. However, these data do not demonstrate that the mutations are necessary to drive inhibitor resistance. Thus, these data are of less value than represented in this work. Knockout or silencing (CRISPR or siRNA) experiments would be more definitive.

      We agree with the reviewer that performing knock-out or silencing experiments to demonstrate the necessity of mutant ERBB4 for inhibitor resistance would strengthen the conclusions. However, the PC-9 cells (or any other EGFR-mutant NSCLC cell lines) do not express endogenous ERBB4, and do not have endogenous ERBB4 mutations. Therefore, knock-out or silencing experiments are unfortunately not possible in this setting.

      Minor Issues That Can Confidently Be Addressed

      In Figure 2, the MCF10A data are more compelling than the BaF3 data. Thus, an argument can be made that the BaF3 data belong in a supplemental figure. However, the combination of data from both cell lines illustrate the fact that ERBB4 mutants appear to exhibit cell type specificity. If this point is emphasized in the text, then Figure 2 should remain as currently presented.

      We agree with the reviewer that our data suggest that the ERBB4 mutants demonstrate a level of context-specificity. This was mentioned in the results section of the original submitted manuscript (page 20; page 21 of the revised manuscript) as well as discussed in the discussion section (page 29; page 29 of the revised manuscript). To emphasize this further, we have revised our conclusions at the end of the "Majority of the recurrent ERBB4 mutations are transforming in Ba/F3 or MCF10a cells" -section as follows:

      "Taken together, these analyses indicate a potential oncogenic role for 11 recurrent ERBB4 mutations. Eight of the mutations were transforming in only one of the models used, suggesting context-specificity. Three mutants (S303F, E452K and L798R) were strongly transforming with the ability to transform both cell models, S303F being unique in its ability to transform both models in the absence of NRG-1."

      The modeling data shown in Figure 3 are a bit under-interpreted. It would appear that the S303F, E452K, and L798R mutants would cause increased ERBB4 signaling by (1) shifting the equilibrium of ERBB4 monomers between the tethered (inactive) state and the extended (active) state or by (2) directly fostering receptor dimerization. The modeling data should be interpreted in the context of these two paradigms.

      We thank the reviewer again for an insightful observation. We have now revised the text describing the modeling data based on the reviewer's suggestions (please see the revised manuscript, under "Structural analysis of the transforming ERBB4 mutations").

      The mechanistic data shown in Figure 4 are also a bit under-interpreted. The data from Figure 2 suggest that ERBB4 gain-of-function mutants are more likely to promote ERBB4 heterodimerization than ERBB4 homodimerization. Do the data from Figure 4 support this hypothesis?

      The authors agree with the reviewer in that the activating ERBB4 mutations lead to increased activation of other ERBB family members (Fig. 4A), supporting a hypothesis that activating ERBB4 mutations lead to increased heterodimerization. We have discussed this throughout the original submitted manuscript, for example making these conclusions:

      Results section, page 16 (page 18 of the revised manuscript): "In summary, these data indicate that S303F, E452K and L798R are activating, gain-of-function ERBB4 mutations that may co-operate with other ERBB receptors in malignant transformation.", page 19 (page 20 of the revised manuscript): "Together, these data suggest that while ERBB4 can be transforming in the absence of other ERBB receptors, mutant ERBB4 co-operates with ERBB3 to promote ligand-independent cell transformation.".

      Discussion section, page 30 (page 31 of the revised manuscript: "Together, these findings imply that ERBB4 heterodimers with other ERBB receptors can contribute to cell transformation and growth, supporting the rationale for pan-ERBB inhibition approach in targeting mutant ERBB4 in cancer."

      Reviewer #3 (Significance (Required)):

      General Assessment: Strengths and Limitations

      • This work makes a significant contribution to the hypothesis that ERBB4 gain-of-function mutants drive multiple human malignancies. However, this work dances around two issues. (1) Is heterodimerization of EGFR or ERBB2 with ERBB4 required for the transforming activity of these ERBB4 mutants? (2) Are these ERBB4 mutants found in the context of the JM-a/CYT-2 isoform or some other isoform? Are these ERBB4 mutants active in the context of isoforms other than JM-a/CYT-2?

      We thank the reviewer for the very positive assessment and insight on specific ERBB4 biology that could affect the functional effect of mutations in ERBB4. We would like to comment on these insights:

      1) Since the strongly transforming ERBB mutations all promoted the activation of EGFR, ERBB2, and ERBB3 (Fig. 4A), it is possible that heterodimerization plays a role in the transforming activity of these ERBB4 mutants. However, our data suggests that EGFR and ERBB2 are not necessary for transformation, since the Ba/F3 cells, where transformation by ERBB4 mutants was observed (Fig. 2D), do not express EGFR or ERBB2. We did see a consistent upregulation of endogenous ERBB3 upon IL3 deprivation in the ERBB4 S303F -expressing Ba/F3 cells (Fig. 4B), which contributed to the ERBB4 S303F -driven, IL3-independent transformation (Supplementary Fig. S3C-D).

      2) None of the analyzed ERBB4 mutations are located in the JM- or CYT-regions of ERBB4, and thus could hypothetically be expressed in the context of any of the four ERBB4 isoforms. However, cancer tissues almost exclusively express the JM-a isoforms of ERBB4, with roughly similar ratios of CYT-1 and CYT-2 isoforms. We chose to use the JM-a CYT-2 isoform in this study, based on our previous work that has implicated the JM-a CYT-2 isoform as being more oncogenic than JM-a CYT-1 isoform, as elaborated in the original submitted manuscript: "The ERBB4 JM-a CYT-2 isoform was used in the studies based on previous findings suggesting that JM-a CYT-2 is the more oncogenic ERBB4 isoform of the cancer-associated isoforms (Veikkolainen et al., 2011) in hematopoietic cell contexts (relevant for the Ba/F3 cell model) (Määttä et al., 2006; Chakroborty et al., 2022)". We do agree with the reviewer that future studies should determine the relative contribution of JM-a CYT-1 and JM-a CYT-2 isoforms in the ability of mutant ERBB4 to drive cancer growth.

      Advance: How Does This Work Advance the Field

      • This work will undoubtedly reinvigorate the ERBB4 field.

      Audience:

      • Those with an interest in the role that ERBB receptors play in human tumors.

      My Expertise:

      • 30+ years of experience studying ERBB receptors.

      REFERENCES

      Albert, V. et al. (2024) 'HER4 Affects Sensitivity to Tamoxifen and Abemaciclib in Luminal Breast Cancer Cells and Restricts Tumor Growth in MCF-7-Based Humanized Tumor Mice', International Journal of Molecular Sciences. Multidisciplinary Digital Publishing Institute, 25(13), p. 7475. doi: 10.3390/ijms25137475.

      Arribas, A. J. et al. (2024) 'ERBB4-mediated signaling is a mediator of resistance to BTK and PI3K inhibitors in B cell lymphoid neoplasms', Molecular Cancer Therapeutics, 23(3), pp. 368-380. doi: https://doi.org/10.1158/1535-7163.MCT-23-0068.

      Bose, R. et al. (2013) 'Activating HER2 mutations in HER2 gene amplification negative breast cancer', Cancer Discovery, 3(2), pp. 224-237. doi: 10.1158/2159-8290.CD-12-0349.

      Bouyain, S. et al. (2005) 'The extracellular region of ErbB4 adopts a tethered conformation in the absence of ligand', Proceedings of the National Academy of Sciences. National Academy of Sciences, 102(42), pp. 15024-15029. doi: 10.1073/pnas.0507591102.

      Canfield, K. et al. (2015) 'Receptor tyrosine kinase ERBB4 mediates acquired resistance to ERBB2 inhibitors in breast cancer cells', Cell Cycle. Taylor & Francis, 14(4), pp. 648-655. doi: 10.4161/15384101.2014.994966.

      Carrión-Salip, D. et al. (2012) 'Androgen-independent prostate cancer cells circumvent EGFR inhibition by overexpression of alternative HER receptors and ligands', International Journal of Oncology. Int J Oncol, 41(3), pp. 1128-1138. doi: 10.3892/ijo.2012.1509.

      Chakroborty, D. et al. (2022) 'An Unbiased Functional Genetics Screen Identifies Rare Activating ERBB4 Mutations', Cancer Research Communications, 2(1), pp. 10-27. doi: 10.1158/2767-9764.crc-21-0021.

      Cremolini, C. et al. (2019) 'Benefit from anti-EGFRs in RAS and BRAF wild-type metastatic transverse colon cancer: A clinical and molecular proof of concept study', ESMO Open, 4(2), p. e000489. doi: 10.1136/esmoopen-2019-000489.

      Debets, D. O. et al. (2023) 'Deep (phospho)proteomics profiling of pre- treatment needle biopsies identifies signatures of treatment resistance in HER2+ breast cancer', Cell Reports Medicine. Elsevier, 4(10), p. 101203. doi: 10.1016/j.xcrm.2023.101203.

      Donoghue, J. F. et al. (2018) 'Activation of ERBB4 in glioblastoma can contribute to increased tumorigenicity and influence therapeutic response', Cancers. Multidisciplinary Digital Publishing Institute (MDPI), 10(8), p. 243. doi: 10.3390/cancers10080243.

      Ferguson, K. M. et al. (2003) 'EGF Activates Its Receptor by Removing Interactions that Autoinhibit Ectodomain Dimerization', Molecular Cell, 11(2), pp. 507-517. doi: 10.1016/S1097-2765(03)00047-9.

      Greulich, H. et al. (2005) 'Oncogenic Transformation by Inhibitor-Sensitive and -Resistant EGFR Mutants', PLOS Medicine. Public Library of Science, 2(11), p. e313. doi: 10.1371/JOURNAL.PMED.0020313.

      Greulich, H. et al. (2012) 'Functional analysis of receptor tyrosine kinase mutations in lung cancer identifies oncogenic extracellular domain mutations of ERBB2', Proceedings of the National Academy of Sciences. National Academy of Sciences, 109(36), pp. 14476-14481. doi: 10.1073/pnas.1203201109.

      Hyman, D. M. et al. (2018) 'HER kinase inhibition in patients with HER2-and HER3-mutant cancers', Nature. Nature Publishing Group, 554(7691), pp. 189-194. doi: 10.1038/nature25475.

      Jaiswal, B. S. S. et al. (2013) 'Oncogenic ERBB3 Mutations in Human Cancers', Cancer Cell, 23(5), pp. 603-617. doi: 10.1016/j.ccr.2013.04.012.

      Jänne, P. A. et al. (2022) 'Efficacy and Safety of Patritumab Deruxtecan (HER3-DXd) in EGFR Inhibitor- Resistant, EGFR-Mutated Non-Small Cell Lung Cancer', Cancer Discovery. American Association for Cancer Research, 12(1), pp. 74-89. doi: 10.1158/2159-8290.CD-21-0715.

      Kurppa, K. J. et al. (2016) 'Activating ERBB4 mutations in non-small cell lung cancer', Oncogene. Nature Publishing Group, 35(10), pp. 1283-1291. doi: 10.1038/onc.2015.185.

      Lee, J. C. et al. (2006) 'Epidermal Growth Factor Receptor Activation in Glioblastoma through Novel Missense Mutations in the Extracellular Domain', PLoS Medicine | www, 3. doi: 10.1371/journal.

      Liu, P. et al. (2012) 'A single ligand is sufficient to activate EGFR dimers', Proceedings of the National Academy of Sciences, 109(27), pp. 10861-10866. doi: 10.1073/pnas.1201114109.

      Määttä, J. A. et al. (2006) 'Proteolytic cleavage and phosphorylation of a tumor-associated ErbB4 isoform promote ligand-independent survival and cancer cell growth', Molecular Biology of the Cell, 17(1), pp. 67-79. doi: 10.1091/mbc.E05-05-0402.

      Mendoza-Naranjo, A. et al. (2013) 'ERBB4 confers metastatic capacity in Ewing sarcoma', EMBO Molecular Medicine. Nature Publishing Group, 5(7), pp. 1019-1034. doi: 10.1002/emmm.201202343.

      Merimsky, O. et al. (2001) 'Correlation between c-erbB-4 receptor expression and response to gemcitabine-cisplatin chemotherapy in non-small-cell lung cancer', Annals of Oncology. Ann Oncol, 12(8), pp. 1127-1131. doi: 10.1023/A:1011665216616.

      Merimsky, O. et al. (2002) 'ErbB-4 expression in limb soft-tissue sarcoma: Correlation with the results of neoadjuvant chemotherapy', European Journal of Cancer. Eur J Cancer, 38(10), pp. 1335-1342. doi: 10.1016/S0959-8049(02)00075-8.

      Nafi, S. N. M. et al. (2014) 'Nuclear HER4 mediates acquired resistance to trastuzumab and is associated with poor outcome in HER2 positive breast cancer', Oncotarget. Impact Journals, LLC, 5(15), pp. 5934-5949. doi: 10.18632/oncotarget.1904.

      Nakamura, Y. et al. (2016) 'Afatinib against esophageal or head-and-neck squamous cell carcinoma: Significance of activating oncogenic HER4 mutations in HNSCC', Molecular Cancer Therapeutics. American Association for Cancer Research, 15(8), pp. 1988-1997. doi: 10.1158/1535-7163.MCT-15-0737.

      Prickett, T. D. et al. (2009) 'Analysis of the tyrosine kinome in melanoma reveals recurrent mutations in ERBB4.', Nature genetics, 41(10), pp. 1127-1132. doi: 10.1038/ng.438.

      Riese, D. J. et al. (1995) 'The cellular response to neuregulins is governed by complex interactions of the erbB receptor family', Molecular and Cellular Biology, 15(10), pp. 5770-5776. doi: 10.1128/mcb.15.10.5770.

      Saglam, O. et al. (2017) 'ERBB4 Expression in Ovarian Serous Carcinoma Resistant to Platinum-Based Therapy', Cancer control. NIH Public Access, 24(1), p. 89. doi: 10.1177/107327481702400115.

      Shi, J. et al. (2018) 'The HER4-YAP1 axis promotes trastuzumab resistance in HER2-positive gastric cancer by inducing epithelial and mesenchymal transition', Oncogene, 37(22), pp. 3022-3038. doi: 10.1038/s41388-018-0204-5.

      Tvorogov, D. et al. (2009) 'Somatic mutations of ErbB4: Selective loss-of-function phenotype affecting signal transduction pathways in cancer', Journal of Biological Chemistry, 284(9), pp. 5582-5591. doi: 10.1074/jbc.M805438200.

      Udagawa, H. et al. (2023) 'HER4 and EGFR Activate Cell Signaling in NRG1 Fusion-Driven Cancers: Implications for HER2-HER3-specific Versus Pan-HER Targeting Strategies', Journal of Thoracic Oncology. Elsevier Inc, 19(1), pp. 106-118. doi: 10.1016/j.jtho.2023.08.034.

      Veikkolainen, V. et al. (2011) 'Function of ERBB4 is determined by alternative splicing', Cell Cycle, 10(16), pp. 2647-2657. doi: 10.4161/cc.10.16.17194.

      Vokes, N. I. et al. (2022) 'Concurrent TP53 mutations facilitate resistance evolution in EGFR mutant lung adenocarcinoma', Journal of Thoracic Oncology. International Association for the Study of Lung Cancer, 17(6), pp. 779-792. doi: 10.1016/j.jtho.2022.02.011.

      Wang, D. S. et al. (2019) 'Liquid biopsies to track trastuzumab resistance in metastatic HER2-positive gastric cancer', Gut. BMJ Publishing Group, 68(7), pp. 1152-1161. doi: 10.1136/gutjnl-2018-316522.

      Wege, A. K. et al. (2018) 'HER4 expression in estrogen receptor-positive breast cancer is associated with decreased sensitivity to tamoxifen treatment and reduced overall survival of postmenopausal women', Breast Cancer Research. Breast Cancer Res, 20(1). doi: 10.1186/s13058-018-1072-1.

      Wilson, T. R. et al. (2012) 'Widespread potential for growth-factor-driven resistance to anticancer kinase inhibitors', Nature. Nature Publishing Group, 487(7408), pp. 505-509. doi: 10.1038/nature11249.

      Yaeger, R. et al. (2023) 'Molecular Characterization of Acquired Resistance to KRASG12C-EGFR Inhibition in Colorectal Cancer', Cancer Discovery, 13(1), pp. 41-55. doi: 10.1158/2159-8290.CD-22-0405.

      Yonesaka, K. et al. (2015) 'The pan-HER family tyrosine kinase inhibitor afatinib overcomes HER3 ligand heregulin-mediated resistance to EGFR inhibitors in non-small cell lung cancer.', Oncotarget. Oncotarget, 6(32), pp. 33602-11. doi: 10.18632/oncotarget.5286.

      Yuan, S. Q. et al. (2023) 'Residual circulating tumor DNA after adjuvant chemotherapy effectively predicts recurrence of stage II-III gastric cancer', Cancer Communications. John Wiley & Sons, Ltd, 43(12), pp. 1312-1325. doi: 10.1002/cac2.12494.

      Zhang, J. et al. (2023) 'Tracking of trastuzumab resistance in patients with HER2-positive metastatic gastric cancer by CTC liquid biopsy.', American journal of cancer research. e-Century Publishing Corporation, 13(11), pp. 5684-5697. Available at: http://www.ncbi.nlm.nih.gov/pubmed/38058840 (Accessed: 16 April 2024).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      • This work has mined cBioPortal to identify candidate cancer driver mutations in the gene encoding the ERBB4 receptor tyrosine kinase (Figure 1). These ERBB4 mutations occurred in clusters that are paralogous to activating mutations in other ERBB receptor genes or in clusters predicted to serve as dimerization interfaces of ERBB4. Eighteen such ERBB4 mutations were selected for characterization.
      • These mutants were tested in BaF3 and MCF-10A cells in the context of the ERBB4 JM-a CYT-2 isoform (Figure 2). Several of these ERBB4 mutants exhibited greater agonist-dependent coupling to cell proliferation than wild-type ERBB4. Moreover, some of the mutants exhibited greater agonist-independent coupling to cell proliferation than wild-type ERBB4. Five ERBB4 mutants (S303F, E452K, L798R, R992C, S1289A) exhibited greater activity in the BaF3 cells, whereas nine ERBB4 mutants (S303F, R393W, E452K, R544W, R711C, S774G, L798R, V840I, G870R) exhibited greater activity in the MCF10A cells. Thus, eleven of the ERBB4 mutants (S303F, R393W, E452K, R544W, R711C, S774G, L798R, V840I, G870R, R992C, S1289A) exhibited a gain-of-function phenotype. It should be noted that several of the ERBB4 gain-of-function mutants (R393W, R544W, R711C, V840I, G870R, R992C, S1289A) exhibited cell type specificity.
      • PyMol was used to "model" the effect of the most potent (S303F, E452K, and L798R) gain-of-function mutations on the structure of ERBB4 (Figure 3). These three mutations are predicted to cause increased ERBB4 dimerization.
      • When expressed in MCF-10A cells, the most potent (S303F, E452K, and L798R) gain-of-function ERBB4 mutants exhibited elevated ligand-dependent and ligand-independent tyrosine phosphorylation. This was accompanied by elevated EGFR, ERBB2, and ERBB4 tyrosine phosphorylation and elevated signaling by canonical effector proteins (Figure 4).
      • The homo- and heterodimerization of the most potent ERBB4 mutant (S303F) was studied following transient transfection of COS-7 cells (Figure 4). As predicted, the S303F mutant exhibited greater ERBB4 homodimerization and greater heterodimerization with EGFR and ERBB2, but not with ERBB3.
      • The data from the clinical trial NCT01953926 was mined to evaluate whether the presence of an ERBB4 activating mutation found in this work is associated with sensitivity to the pan-ERBB inhibitor neratinib (Table 1). Surprisingly, a compelling association was NOT found. In contrast, the proliferation of BaF3 cells that express gain-of-function ERBB4 mutants is sensitive to the irreversible pan-ERBB inhibitors neratinib, afatinib, and dacomitinib (Figure 5).
      • Mining the cBioPortal, AACR GENIE, and COSMIC datasets indicates that the three most potent ERBB4 gain-of-function mutants (S303F, E452K, and L798R) exhibit tissue specificity (Supplementary Figure S5). Moreover, the S303F mutation is coincident with a mutation in another ERBB receptor to a much lesser degree than other gain-of-function ERBB4 mutants, particularly E452K. This too is suggestive of differences in the mechanism of action among the gain-of-function ERBB4 mutants (Supplementary Figure S5).
      • To test the effect of ERBB4 gain-of-function mutants on resistance to EGFR inhibitors, PC-9 NSCLC cells (which contain an endogenous gain-of-function EGFR mutant but do not endogenously express ERBB4) were transduced with ERBB4 gain-of-function mutants. In these cells the S303F and L715K mutants exhibited elevated ERBB4 signaling, but the L798R and K935I mutants did not. Nonetheless, the S303F, E715K, and K935I mutants promoted osimertinib resistance upon long-term treatment in vitro, whereas the L798R mutant did not (Figure 6). Moreover, the E715K and S303F mutants caused osimertinib resistance in vivo.
      • Overall, this is an impressive body of work. The experiments have been carefully performed and the data are clearly presented. However, the breadth of this work makes it a bit unfocused and difficult to digest.

      Major Issues Affecting the Conclusions

      • The COS-7 data in Figure 4 are probably generated using supraphysiological levels of ERBB4 expression, raising concerns about the ability to draw general conclusions from these data. This issue should be addressed.
      • The inhibitor data shown in Figure 5 may be over-interpreted. The affinity of neratinib, afatinib, and dacomitinib for EGFR is reportedly higher than the affinity of these drugs for ERBB4. Thus, the failure of ERBB4 gain-of-function mutants to cause resistance to these inhibitors may be because the inhibitors bind to endogenous EGFR and therefore fail to bind to ERBB4. Moreover, the conclusion that the gain-of-function ERBB4 mutants are targetable with these inhibitors appears to be an overreach.
      • The inhibitor data shown in Figure 6 demonstrates that activating ERBB4 mutations are sufficient to drive inhibitor resistance. However, these data do not demonstrate that the mutations are necessary to drive inhibitor resistance. Thus, these data are of less value than represented in this work. Knockout or silencing (CRISPR or siRNA) experiments would be more definitive.

      Minor Issues That Can Confidently Be Addressed

      • In Figure 2, the MCF10A data are more compelling than the BaF3 data. Thus, an argument can be made that the BaF3 data belong in a supplemental figure. However, the combination of data from both cell lines illustrate the fact that ERBB4 mutants appear to exhibit cell type specificity. If this point is emphasized in the text, then Figure 2 should remain as currently presented.
      • The modeling data shown in Figure 3 are a bit under-interpreted. It would appear that the S303F, E452K, and L798R mutants would cause increased ERBB4 signaling by (1) shifting the equilibrium of ERBB4 monomers between the tethered (inactive) state and the extended (active) state or by (2) directly fostering receptor dimerization. The modeling data should be interpreted in the context of these two paradigms.
      • The mechanistic data shown in Figure 4 are also a bit under-interpreted. The data from Figure 2 suggest that ERBB4 gain-of-function mutants are more likely to promote ERBB4 heterodimerization than ERBB4 homodimerization. Do the data from Figure 4 support this hypothesis?

      Significance

      General Assessment: Strengths and Limitations

      This work makes a significant contribution to the hypothesis that ERBB4 gain-of-function mutants drive multiple human malignancies. However, this work dances around two issues. (1) Is heterodimerization of EGFR or ERBB2 with ERBB4 required for the transforming activity of these ERBB4 mutants? (2) Are these ERBB4 mutants found in the context of the JM-a/CYT-2 isoform or some other isoform? Are these ERBB4 mutants active in the context of isoforms other than JM-a/CYT-2?

      Advance: How Does This Work Advance the Field

      This work will undoubtedly reinvigorate the ERBB4 field.

      Audience:

      Those with an interest in the role that ERBB receptors play in human tumors.

      My Expertise:

      30+ years of experience studying ERBB receptors.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Ojala et al. report a very extensive exploration of the functional relevance of somatic mutations occurring in the ERBB4 gene. The Authors demonstrate that 11 out of 18 mutations they studied have oncogenic potential, with some of them actionable using clinically available ERBB inhibitors, while giving resistance to EGFR inhibitors.

      A very minor comment. At the beginning of page 21, I'd not define PD as the best respone. The Authors can write that all four patients progressed under treatment.

      Significance

      The work by Ojala et al. is the most detailed study of mutations occurring in ERBB4. Since these are relatively rare, they have not been properly studied up to now. The study is very well done.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The authors identify cancer-associated ERBB4 mutations that are selected for functional characterization. Utilizing the BaF3 and MCF10A models, the authors investigate the potential oncogenic role for 11 recurrent ERBB4 mutations. Three mutants (S303F, E452K and L798R) were strongly transforming with the ability to transform both cell models, S303F being unique in its ability to transform both models in the absence of NRG-1. The authors perform modeling to decipher potential mechanisms of action of the ERBB4 S303F, E452K and L798R mutations. The authors assess the ability of HER3 mutations to dimerize with other HER family members and demonstrate that ERBB4 S303F can mediate its activating functions by stabilizing homo- and heterodimers with other ERBB receptors and that the heterodimerization is likely cell/tissue context dependent. The authors demonstrate that transforming ERBB4 mutants are sensitive to pan-ERBB inhibitors and drive resistance to EGFR-targeted therapy in EGFR-mutant NSCLC cells.

      Major comments:

      1. Patient data analysis is performed in more than 15 months ago in January 2024. This analysis should be updated.
      2. The rationale for selecting the mutations to be studied is not entirely clear. There are no references to support studying mutations in Fig 1B red boxes.
      3. Cell proliferation should be shown for BaF3 cells for continuity in Figure 2 instead of doubling time. The relative expression of HER3 constructs must be shown for BaF3 and MCF10A cells in Figure 2.
      4. Blots in Figure 4 must be quantified.
      5. There are major concerns with Supplemental files. It is imperative that the effectiveness of HER3 shRNA be shown in S Fig3. These data are not interpretable without this. Lanes in S Fig 4 are not marked again making data not interpretable.
      6. It's unclear why Table 1 is included as this is already published data. This previously published data should be summarized in the text.
      7. There is a disconnect why the last two figures focus on a single model of NSCLC whereas the three most transforming mutations are found most commonly in breast, melanoma and GI tract cancers.
      8. What are the differences in the recurrent ERBB4 mutant tumors versus ERBB4 wild-type tumors described in Figure 7? Figure 7C, D should be moved to supplemental as this is from previously published data and not strictly relevant to data shown in Fig 7.
      9. Limitations should include consideration of endogenous levels of ERBB4 in the model systems used and disparate expression levels of wt ERBB4 versus ERBB4 mutation.

      Minor comments:

      1. Fig1B lists ERBB3 V104V mutation?
      2. List frequency of ERBB4 mutations in the introduction
      3. Clarification throughout if cells are serum-starved (how long) if stimulated with NRG-1

      Significance

      General assessment: This work fills a gap in cancer research understanding if ERBB4 mutations could be targeted. Concerns and comments need to be addressed before definitive conclusions can be made.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We thank the three reviewers for the time and caution taken to assess our manuscript, and for their constructive feedback that will help improve the study. We herewith provide a revision plan, expecting that the additional experiments and corrections will address the key points raised by the reviewers.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      • *

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Summary: The manuscript by Delgado et al. reports the role of the actin remodeling Arp2/3 complex in the biology of Langerhans cells, which are specialized innate immune cells of the epidermis. The study is based on a conditional KO mouse model (CD11cCre;Arpc4fl/fl), in which the deletion of the Arp2/3 subunit ArpC4 is under the control of the myeloid cell specific CD11c promoter.

      In this model, the assembly of LC networks in the epidermis of ear and tail skin is preserved when examining animals immediately after birth (up to 1 week). Subsequently however LCs from ArpC4-deleted mice start displaying morphological aberrations (reduced elongation and number of branches at 4 weeks of age). Additionally, a profound decline in LC numbers is reported in the skin of both the ear and tail of young adult mice (8-10 weeks).

      To explore the cause of such decline, the authors then opt for the complementary in vitro study of bone-marrow derived DCs, given the lack of a model to study LCs in vitro. They report that ArpC4 deletion is associated with aberrantly shaped nuclei, decreased expression of the nucleoskeleton proteins Lamin A/C and B1, nuclear envelop ruptures and increased DNA damage as shown by γH2Ax staining. Importantly, they provide evidence that the defects evoked by ArpC4 deletion also occur in the LCs in situ (immunofluorescence of the skin in 4-week old mice).

      Increased DNA damage is further documented by staining differentiating DCs from ArpC4-deleted mice with the 53BP1 marker. In parallel, nuclear levels of DNA repair kinase ATR and recruitment of RPA70 (which recruits ATR to replicative forks) are reduced in the ArpC4-deleted condition. In vitro treatment of DCs with the topoisomerase II inhibitor etoposide and the Arp2/3 inhibitor CK666 induce comparable DNA damage, as well as multilobulated nuclei and DNA bridges. The authors conclude that the ArpC4-KO phenotype might stem, at least in part, from a defective ability to repair DNA damages occurring during cell division.

      The study in enriched by an RNA-seq analysis that points to an increased expression of genes linked to IFN signaling, which the authors hypothetically relate to overt activation of innate nucleic acid sensing pathways.

      The study ends by an examination of myeloid cell populations in ArpC4-KO mice beyond LCs. Skin cDC2 and cDC2 subsets display skin emigration defects (like LCs), but not numerical defects in the skin (unlike LCs). Myeloid cell subsets of the colon are also present in normal numbers. In the lungs, interstitial and alveolar macrophages are reduced, but not lung DC subsets. Collectively, these observations suggest that ArpC4 is essential for the maintenance of myeloid cell subsets that rely on cell division to colonize or to self-maintain within their tissue of residency (including LCs).

      MAJOR COMMENTS

      1. ArpC4 and Arp2/3 expression The authors argue that LCs from Arpc4KO mice should delete the Arpc4 gene in precursors that colonize the skin around birth. It would be important to show it to rule out the possibility that the lack of phenotype (initial seeding, initial proliferative burst) in young animals (first week) could be related to an incomplete deletion of ArpC4 expression. Also important would be to show what is happening to the Arp2/3 complex in LCs from Arpc4KO mice.

      __Response: __We thank this reviewer for the careful assessment of our manuscript. Regarding this specific comment, we would like to clarify that we do not expect ArpC4 to be deleted in LC precursors, as CD11c is only expressed once the cells have entered the epidermis. Instead, we expect the deletion to take place after birth around day 2-4 (Chorro et al., 2009). For this reason, we performed a deletion PCR of epidermal cells at postnatal day 7 (P7), a time at which the proliferative burst occurs. This analysis revealed CD11c-Cre-driven recombination in the ArpC4 locus (Fig. S2C). This experiment indicates that ArpC4 deletion does not alter LC proliferation and postnatal network formation.


      Revision plan: We will revise the manuscript text to more clearly explain when ArpC4 will be deleted during development when using the CD11c-Cre transgene, and better emphasize the rationale for the deletion PCR.

      In the in vitro studies with DCs, the level of ArpC4 and Arp2/3 deletion at the protein level is also not documented.


      __Response: __We have previously analyzed the expression of ArpC4 in BMDCs in a recent study, confirming its loss in CD11c-Cre;ArpC4fl/fl cells at the protein level: Rivera et al. Immunity 2022; doi: 10.1016/j.immuni.2021.11.008. PMID: 34910930 (Fig. S2D). Therefore, in the current manuscript we only refer to that paper (Results, first paragraph).

      The authors explain that surface expression of the CD11c marker, which drives Arpc4 deletion, gradually increased during differentiation of DCs: from 50% to 90% of the cells. Does that mean that loss of ArpC4 expression is only effective in a fraction of the cells examined before day 10 of differentiation (e.g. in the RNA-seq analysis)?

      __Response: __The reviewer is correct, there is heterogeneity in CD11c expression, which is inherent of these DC culture model, implying that Arpc4 gene deletion will be partial. However, despite this, we were able to detect significant differences between the transcriptomes of control and CD11c-Cre;ArpC4fl/fl DCs in early phases during differentiation, emphasizing that the phenotype of ArpC4 loss is robust.


      Revision Plan: We will include a notion on this heterogeneity in the revised manuscript text.

      Intra-nuclear versus extra-nuclear activities of Arp2/3

      The authors favor a model whereby intra-nuclear ArpC4 helps maintaining nuclear integrity during proliferation of DCs (and possibly LCs). However, multiple pools of Arp2/3 have been described and accordingly, multiple mechanisms may account for the observed phenotype: i) cytoplasmic pool to drive the protrusions sustaining the assembly of the LC network and its connectivity with keratinocytes ; ii) peri-nuclear pool to protect the nucleus ; iii) Intra-nuclear pool to facilite DNA repair mechanisms e.g. by stabilizing replicative forks (the scenario favored by the authors).


      __Response: __The referee is correct, and this is actually discussed in our manuscript (page 11, upper paragraph): we cannot exclude that several pools of branched actin are influencing the phenotype we here describe.

      Unfortunately, we have previously tested several antibodies against ArpC4, but in our hands, and despite comprehensive optimization, they did not yield specific signals that would enable us to assess changes in subcellular localization in murine cells. Upon this reviewer's comment, we will now reassess the available tools and found an antibody against ArpC2 (Millipore, Anti-p34-Arc/ARPC2, 07-227-I-100UG) that may work based on published data. We have ordered this product to test it for IF staining of ArpC2, hoping to be able to delineate the subcellular localization of ArpC2 in DCs and potentially LCs.

      Revision plan: Upon receipt, we will test the ArpC2 antibody (Millipore, #07-227-I-100UG) both in cultured DCs and in epidermal whole mounts, running various optimization protocols regarding fixation, permeabilization and blocking reagents, next to antibody dilution. That way we hope to be able to detect the subcellular localization of Arp2/3 complex components as requested by this reviewer.

      It is recommended that the authors try to gather more supportive data to sustain the intra-nuclear role. Documenting ArpC4 presence in the nucleus would help support the claim. It could be combined with treatments aiming at blocking proliferation in order to reinforce the possibility that a main function of ArpC4 is to protect proliferating cells by favoring DNA repair inside the nucleus.

      __Response: __We thank this reviewer for this very helpful comment. As outlined in the previous response, we will aim at obtaining subcellular localization data for Arp2/3 complex components, and along with that study a potential intranuclear localization. Beyond that, in comparison to commonly cultured cell types, however, we face two hurdles addressing the nuclear Arp2/3 role in full: 1) Due to poor transduction rates and epigenetic silencing, we cannot sufficiently express exogenous constructs such as ArpC4-NLS in DCs to assess the subcellular localization of Arp2/3 complex components. 2) We have performed preliminary tests to block proliferation in DCs, using the cyclin D kinase 1 inhibitor RO3306 at different concentrations and incubation times during DC differentiation. Unfortunately, most cells were found dead after treatment. Further lowering the inhibitor concentrations (below 3.5uM) will likely not block the cell cycle, rendering this approach unsuited.

      Revision plan: We will test the suitability of additional antibodies directed against Arp2/3 complex components to assess their subcellular localization, with the aim to discriminate peripheral cytoplasmic vs. perinuclear vs. intranuclear localization. In addition, we will add a comment in the discussion, further addressing this point. In the case we remain unable to pinpoint that Arp2/3 resides in the nucleus, we will further tone down our current phrasing in the discussion, also emphasizing the possibility that cytoplasmic or perinuclear pools of the complex may indirectly help maintain the integrity of the genome in LCs.

      Nuclear envelop ruptures

      The nuclear envelop ruptures are not sufficiently documented (how many cells were imaged? quantification?). The authors employ STED microscopy to examine Lamin B1 distribution. The image shown in Figure 4A does not really highlight the nuclear envelop, but rather the entire content. Whether it is representative is questionable. We would expect Lamin B1 staining intensity to be drastically reduced given the quantification shown in Figure 3D. In addition, although the authors have stressed in the previous figure that Arpc4-KO is associated with nucleus shape aberrations, the example shown in Figure 4A is that of a nucleus with a normal ovoid shape.

      It is recommended to quantify the ruptures with Lap2b antibodies (or another staining that would better delineate the envelop) in order to avoid the possible bias due to the reduced staining intensity of Lamin B1.

      __Response: __NE ruptures were quantified by imaging NLS-GFP-expressing DCs in microchannels to visualize leakage of their nuclear content (Fig. 4B,C). The STED image mentioned by the referee (Fig. 4A,D) was only shown to further illustrate examples of NE ruptures, here using Lamin B as an immunofluorescence marker for the NE. We do agree with the reviewer that it was not chosen optimally to represent the ArpC4-KO phenotype regarding nuclear shape and Lamin B1.

      Revision plan: We will provide representative examples of nuclear illustrations of the ArpC4-KO phenotype vs. control cells. In addition, we will perform STED microscopy of Lap2B immunostained DCs as suggested by the referee.

      A missing analysis is that of nuclear envelop ruptures as a function of nucleus deformations.

      __Response: __As stated in the manuscript (page 5, third paragraph), the morphology of DCs is quite heterogeneous. As mentioned above, nuclear rupture events were quantified by live-imaging of NLS-GFP expressing DCs, enabling the tracing of rupture events. Live imaging is the only robust manner to measure nuclear membrane rupture events as they are transient due to rapid membrane repair (Raab et al. Science 2016). The NLS-GFP label itself, however, is not accurate enough to also quantify nuclear deformations. The latter therefore was quantified after cell fixation, using DAPI and/or immunostaining for NE envelope markers (Figures 3 and S3).

      Revision plan: We will quantify nuclear deformations using Lap2B staining of the nuclear envelope as suggested by the referee.

      Fig 4B-C: same frequency of Arpc4-KO and WT cells displaying nuclear envelop ruptures in the 4-µm channels; however image show a rupture for the Arpc4-KO and no rupture for the WT cells (this is somehow misleading). Are ruptures similar in Arpc4-KO and WT cells in this condition?

      __Response: __We apologize for choosing an image that better reflects our quantification, our mistake.

      Revision plan: We will choose an image that better reflects our quantification.

      Fig 4D-E: is their a direct link between nuclear envelop ruptures and ƴH2A.X?

      __Response: __At present, we can only correlate the findings of increased gH2Ax and elevated events of nuclear envelope ruptures in ArpC4-KO DCs. Rescue experiments are very difficult to impossible in DCs (e.g. restoring Lamin A/C and B levels in the KOs and subsequently assessing the amount of DNA damage). While we are afraid that we cannot address a potential link between NE ruptures and DNA damage by experiments in a manner feasible within this manuscript's revision, we have discussed this interesting aspect based on observations in immortalized cell culture systems (page 10). However, we would like to note that this was indeed shown for different cell types in Nader et al. Cell 2021. This effect results from access of cytosolic nuclease Trex1 to nuclear DNA.

      Revision plan: This point will be clarified in our revised manuscript.


      Interesting (but optional) would be to understand what is happening to DNA, histones? Is their evidence for leakage in the cytoplasm?

      __Response: __We have not investigated so far. We will attempt to do so.

      Revision plan: To address this aspect, we plan to perform immunostainings for double-stranded DNA in the cytoplasm (along with an NE marker). This approach has been used in the field to mark cytoplasmic DNA.

      RNA seq analysis

      The RNA-seq analysis suffers from a lack of direct connection with the rest of the study. The extracted molecular information is not validated nor further explored. It remains very descriptive. The PCA analysis suggests a « more pronounced transcriptomic heterogeneity in differentiating Arpc4KO DCs ». However it seems difficult to make such a claim from the comparison of 3 mice per group. In addition, such heterogeneity is not seen in the more detailed analysis (Fig 5F). The authors claim that « day 10 control and Arpc4KO DCs showed no to very little differences in gene expression, in contrast to cells at days 7-9 of differentiation ». This is not obvious from the data displayed in the corresponding figure. In addition, it is not expected that cells that may take a divergent differentiation path at days 7-9 may would return to a similar transcriptional activity at day 10.

      A point that is not discussed is that before day 10 of DC differentiation, Arpc4 KO is expected to only occur in about 50% of the cell population. This is expected to impact the RNA-seq analysis.

      Not all clusters have been exploited (e.g. cluster 3 elevated, cluster 6 partly reduced). I suggest the authors reconsider their analysis and analysis of the RNA-seq analysis (or eventually invest in complementary analysis).

      __Response: __Despite a comprehensive analysis of the different transcriptomes of control and ArpC4 mutant cells during DC differentiation, we decided to focus the presentation and discussion of our RNAseq results on the most notable findings. Of these, the elevated innate immune responses in ArpC4-KO DCs (Fig. 5E,H) caught our particular attention, as this seemed highly meaningful in light of DC and LC functions.

      Revision plan: As suggested by the referee, in the revised manuscript, we will better connect the RNAseq data to the other cellular and molecular analyses shown, complementing these results by investigating the potential involvement of innate immune responses in the ArpC4-KO phenotype.

      What causes the profound numerical drop of LC in the epidermis?

      A major open question is what causes the massive drop of LCs. Although differentiating Arpc4KO DCs start accumulating DNA damage upon proliferation, they succeed in progressing through the cell cycle. There is even a slightly elevated expression of cell cycle genes at day 7 of differentiation in the DC model.

      Only a trend for increased apoptosis is observed in ear and tail skin. It would be important to provide complementary data documenting increased death (or aberrant emigration?) of LCs in the 4-8 week time window.

      __Response: __We agree with the reviewer that this is an important question. We exclude that elevated emigration causes the decline of LCs in ArpC4-KO epidermis, as ArpC4-mutant LCs are significantly reduced (and not increased) in skin-draining lymph nodes (Fig. 7E). To assess whether increased cell death contributed to LC loss, we have tried to identify LCs that are just about to die. As the reviewer noted, we could only observe a trend of apoptosis-positive LCs in mutant epidermis. We assume that this is because of a quick elimination of compromised LCs following DNA damage, with only a short time passing until LCs with impaired genome integrity will be cleared from the system, making it very difficult to detect gH2Ax-positive cells that are positive for markers of cell death.

      Revision plan: Despite the abovementioned expected limitations to detect DNA-damage-positive but viable LCs in vivo, for the manuscript revision we will collect 6-week-old mice to analyze LC numbers and apoptosis (cleaved Caspase-3), complementing our data derived from 7-day and 4-week-old mice (Figures S2A,B, S2E,F). Suited mice have been born end of May; we are ready to analyze them at 6-weeks of age, accordingly.

      Functional consequences

      Although the study reports novel aspects of LC biology, the consequence of ArpC4 deletion for skin barrier function and immunosurveillance are not investigated. It would seem very relevant to test how this model copes with radiation, chemical and/or microorganism challenges.

      __Response: __We fully agree with this reviewer that this is a very interesting point. Therefore, next to assessing the steady-state circulation of LCs and DCs, we also addressed the consequence of ArpC4 loss for LC function in chemically challenged skin: we performed skin painting experiments using the contact sensitizer fluorescein isothiocyanate (FITC), diluted in the sensitizing agent dibutyl phthalate (DBP), to detect cutaneous-derived phagocytes within draining lymph nodes. These experiments revealed that migration of Arpc4KO LCs (as well as of Arpc4KO DCs) to skin-draining lymph nodes was impaired (Fig. 7C-E), confirming an in vivo role of ArpC4 for immune cell migration to lymphatic organs following a chemical challenge. Considering the lengthy legal approval procedures for new animal experimentation procedures, additional in vivo challenges -beyond the provided FITC challenge study- would take at least 6 additional months, which would delay excessively the revision of our manuscript.

      Revision Plan: We will better explain the FITC painting experiment to highlight its importance.

      MINOR COMMENTS:

      1- Figure 1D

      Gating strategy: twice the same empty plots. The content seems to be missing... Does this need to be shown in the main figure?

      __Response: __We apologize for this problem; there might be a problem due to file conversion of PDF reader software. In our PDF versions (including the published bioRxiv preprint) we do see the data points (see below); however, we have earlier experienced incomplete FACS plots during manuscript preparation.


      Revision plan: We will take extra care and double-check the results after converting the figures into PDFs. In addition, we will provide JPG files when submitting the revised manuscript, to prevent such problems.

      2- Figure 2

      Best would be to keep same scale to compare P1 and P7 (tail skin, figure 2A)

      Response and revision plan: We will replace the examples with micrographs of comparable scale (already solved, will be provided with manuscript revision).

      Overlay of Ki67 and MHC-II does not allow to easily visualize the double-positive cells (Fig 2C)

      Response and revision plan: We will provide single-channel image next to the merged view, and improve the visualization of double-positive cells (already solved, will be provided with manuscript revision)

      Quality of Ki67 staining different for Arpc4-KO (less intense, less focused to the nuclei): a technical issue or could that reflect something?

      Response and revision plan: We thank the reviewer for spotting this. We have re-assessed all Ki67 micrographs and noted that the originally chosen examples indeed are not fully representative. We have meantime selected more representative examples of Ki67-positive cells in control and mutant tissues, reflecting no difference in the principal nature of Ki67 staining (already taken care of, will be provided with manuscript revision).

      Fig 2C: Panels mounted differently for ear and tail skin (different order to present the individual stainings, Dapi for tail skin only).

      Response and revision plan: We will harmonize the sequence of panels in figure 2 with submission of the revised manuscript.

      3- LC branch analysis (Fig 1 and 2)

      While Fig 1 indicates that ear skin LCs form in average twice as few branches as tail skin LCs (3-4 versus 8-9 branches per cell), Fig 2 shows the opposite (10-12 versus 6-7 branches per cell).

      Is this due to a very distinct pattern between the 2 considered ages (4 weeks versus 8-10 weeks)? Could the author double-check that there is no methodological bias in their analysis?


      Response: We thank the reviewer for hinting us on this apparent inconsistency. Indeed, our initial analysis suffered from a bias in detecting LC dendrites, as the tissue cellularity and overall morphology significantly differs between 4-week-old and adult animals: In adult animals, the immunostainings show a higher baseline background signal for the skin epithelium compared to P28. We had noted this beforehand and had adjusted the imaging pipeline accordingly, with a more stringent thresholding to eliminate background signals in the case of adult tissues. While we were able to detect the described ArpC4 phenotype, this strategy resulted in a reduced ability to detect dendrites (both in control and mutant tissues), explaining the seemingly reduced number of dendrites in adult vs. 4-week-old tissues.

      Revision plan: We have double-checked both the micrographs and the corresponding quantifications and did not identify errors. Instead, our assumption -that a too high stringency for background reduction in adults caused the discrepancy- turned out correct. At present, we are re-doing the detailled analyses of LC morphology at 4-week and adult stages by confocal microscopy using a 63x objective rather than a 40x objective as done previously. First results confirm that with this approach the number of LC dendrites across these ages are largely comparable, while the phenotypes of ArpC4 loss are retained. We will provide a completely new analysis with revision of the manuscript.

      4- Fig 3 E-G

      How many animals were examined (n=5)? Reproducible accros animals? Why was it done with 4-week animals (phenotype not complete? Event occurring before loss in numbers...)

      Response and revision plan: As mentioned in the figure legend for Fig. 3F we have analysed N = 4 control and N= 5 KO mice (for clarity, we will add this information to Figure 3E and G in the revised document). We chose the 4-week time-point as this was the stage when the loss of LCs first became apparent (even though non-significant at this age). We aimed to learn whether changes in nuclear morphology and nuclear envelope markers represented early molecular and cellular events following ArpC4 loss. Compared to later stages, this strategy poses a reduced risk to detect indirect effects of ArpC4 loss. We will clarify this in the revised manuscript text.

      Staining Lamin A/C globally more intense in the Arpc4-KO epidermis (also seems to apply to the masks corresponding to the LCs). Surprising to see that the quantification indicates a major drop of Lamin A/C intensity in the LCs.

      Response and revision plan: We again thank the reviewer for this careful assessment. The originally chosen micrographs are indeed not fully representative. As with many tissue stainings, there is inter-sample variability. We have now revisited the micrographs and did not find a significant global reduction of Lamin A/C in the entire epidermis (including keratinocytes/KCs). The drop of Lamin A/C intensity is restricted to ArpC4 LCs -and not KCs- and in line with the reduced Lamin A/C expression data in DCs (Fig. 3C,D). We have selected more representative examples, which will be provided with the revised manuscript.

      Legend Fig 4D replace confocal microscopy by STED microscopy

      Revision plan: We will replace "confocal microscopy" by "STED microscopy" accordingly.

      6- Figure 4F

      Intensity/background of γH2Ax staining very distinct between the 2 micrographs shown for WT and Arpc4-KO epidermis.

      Response and revision plan: We have revisited the micrographs and now selected more representative examples, which will be provided in the revised manuscript.

      7- Figure 7C, F, H

      Gating strategies: would be better to harmonize the style of the plots (dot plots and 2 types of contour plots have been used...)

      Response and revision plan: We agree and will provide a harmonized plot illustration in the revised manuscript.

      8- Figure 7H

      Legend of lower gating strategy seems to be wrong (KO and not WT).

      Response and revision plan: We thank the reviewer for pointing out this mistake. A corrected figure display will be provided with revision.

      Reviewer #1 (Significance (Required)):

      Strengths: the general quality of the manuscript is high. It is very clearly written and it contains a very detailed method section that would allow reproducing the reported experiments. This work entails a clear novelty in that it represents the first investigation of the role of ArpC4 in LCs. It opens an interesting perspective about specific mechanisms sustaining the maintenance of myeloid cell subsets in peripheral tissues. This work is therefore expected to be of interest for a large audience of cellular immunologists and beyond. Challenging skin function with an external trigger would lift the relevance for a even wider audience (see main point 6).

      __Response: __see point 6.

      Limitations: in its current version the manuscript suffers from a lack of solidity around a few analysis (see main points on ArpC4 and Arp2/3 protein expression, nuclear envelop rupture analysis,...). It also tends to formulate a narrative centered on the ArpC4 intra-nuclear function that is not definitely proven.

      The field of expertise of this reviewer is: cellular immunology and actin remodeling.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      SUMMARY This is a study in experimental mice employing both in vitro and, importantly, in vivo approaches. EPIDERMAL LANGERHANS CELLS serve as a paradigm for the maintenance of homeostasis of myeloid cells in a tissue, epidermis in this case. In addition to well known functions of the ACTIN NETWORK in cell migration, chemotaxis, cell adherence and phagocytosis the authors reveal a critical function of actin networks in the survival of cells in their home tissue.

      Actin-related proteins (Arp), specifically here the Arp2/3 complex, are necessary to form the filamentous actin networks. The authors use conditional knock-out mice where Arpc4 (an essential component of the Arp2/3 complex) is deleted under the control of CD11c, the most prominent dendritic cell marker which is also expressed on Langerhans cells. In normal mice, epidermal Langerhans cells reside in the epidermis virtually life-long. They initially settle the epidermis around and few days after birth an establish a dense network by a burst of proliferation and then they "linger on" by low level maintenance proliferation. In the epidermis of Arpc4 knock-out mice Langerhans cells also start off with this proliferative burst but, strikingly, they do not stay but are massively reduced by the age of 8-12 weeks.

      The analyses of this decline revealed that

      -- the shape (number of nuclear lobes) and integrity of cell nuclei was compromised; they were fragile and ruptured to some degree when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- DNA damage, as detected by staining for gamma-H2Ax or 53BP1 accumulated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- recruitment of DNA repair molecules was inhibited when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- gene signatures of interferon signaling and response were increased when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- in vivo migration of dendritic cells and Langerhans cells from the skin to the draining lymph nodes in an inflammatory setting (FITC painting of the skin) was impaired when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      -- the persistence of the typical dense network of Langerhans cells in the epidermis, created by proliferation shortly after birth, is abrogated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing. Importantly, this was not the case for myeloid cell populations that settle a tissue without needing that initial burst of proliferation. For instance, numbers of colonic macrophages were not affected when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing.

      Thus, the authors conclude that the Arp2/3 complex is essential by its formation of actin networks to maintain the integrity of nuclei and ensure DNA repair thereby ascertaining the maintenance proliferation of Langerhans cells and, as the consequence, the persistence of the dense epidermal netowrk of Langerhans cells.

      Up-to-date methodology from the fields of cell biology and cellular immunology (cell isolation from tissues, immunofluorescence, multiparameter flow cytometry, FISH, "good old" - but important - transmission electronmicroscopy, etc.) was used at high quality (e.g., immunofluorescence pictures!). Quantitative and qualitative analytical methods were timely and appropriate (e.g., Voronoi diagrams, cell shape profiling tools, Cre-lox gene-deletion technology, etc.). Importantly, the authors used a clever method, that they had developed several years ago, namely the analysis of dendritic cell migration in microchannels of defined widths. Molecular biology methods such as RNAseq were also employed and analysed by appropriate bioinformatic tools.

      MAJOR COMMENTS:

      • ARE THE KEY CONCLUSIONS CONVINCING? Yes, they are.

      • SHOULD THE AUTHORS QUALIFY SOME OF THEIR CLAIMS AS PRELIMINARY OR SPECULATIVE, OR REMOVE THEM ALTOGETHER? No, I think it is ok as it stands. The authors are wording their claims and conclusions not apodictically but cautiously, as it should be. They point out explicitely which lines of investigations they did not follow up here.

      • WOULD ADDITIONAL EXPERIMENTS BE ESSENTIAL TO SUPPORT THE CLAIMS OF THE PAPER? REQUEST ADDITIONAL EXPERIMENTS ONLY WHERE NECESSARY FOR THE PAPER AS IT IS, AND DO NOT ASK AUTHORS TO OPEN NEW LINES OF EXPERIMENTATION. I think that the here presented experimental evidence suffices to support the conclusions drawn. No additional experiments are necessary.

      • ARE THE SUGGESTED EXPERIMENTS REALISTIC IN TERMS OF TIME AND RESOURCES? IT WOULD HELP IF YOU COULD ADD AN ESTIMATED COST AND TIME INVESTMENT FOR SUBSTANTIAL EXPERIMENTS. Not applicable.

      • ARE THE DATA AND THE METHODS PRESENTED IN SUCH A WAY THAT THEY CAN BE REPRODUCED? Yes, they are.

      • ARE THE EXPERIMENTS ADEQUATELY REPLICATED AND STATISTICAL ANALYSIS ADEQUATE? Yes.

      __Response: __We thank the reviewer very much for assessing our work, for providing constructive suggestions, and for acknowledging the strength of the study.

      MINOR COMMENTS:

      • SPECIFIC EXPERIMENTAL ISSUES THAT ARE EASILY ADDRESSABLE. None

      • ARE PRIOR STUDIES REFERENCED APPROPRIATELY? Essentially yes. Regarding the reduction / loss of the adult epidermal Langerhans cell network, it may be of some interest to also refer to / discuss to another one of the few examples of this phenomenon. There, the initial burst of proliferation is followed by reduced proliferation and increased apoptosis when a critical member of the mTOR signaling cascade is conditionally knocked out (Blood 123:217, 2014).

      __Response and revision plan: __As suggested, we will include into the revised manuscript further examples with related phenotypes regarding the progressive decline of LCs.

      • ARE THE TEXT AND FIGURES CLEAR AND ACCURATE? Yes they are. Figures are well arranged for easy comprehension.

      • DO YOU HAVE SUGGESTIONS THAT WOULD HELP THE AUTHORS IMPROVE THE PRESENTATION OF THEIR DATA AND CONCLUSIONS?

      1. Materials & Methods. The authors write, regarding flow cytometry of epidermal cells: "Briefly, 1cm2 of back skin from 8-14 weeks old female wild-type and knockout littermates was dissociated in 0.25 mg/mL Liberase (Sigma, cat. #5401020001) and 0.5 mg/mL DNase (Sigma, cat.#10104159001) in 1 mL of RPMI (Sigma) and mechanically disaggregated in Eppendorf tubes, FOLLOWED BY INCUBATED for 2 h at 37 {degree sign}C." Followed by what?

      __Response and revision plan: __We apologize for this mistake. The text should read: "... followed by blocking and antibody labeling of cells in single cell suspension.". We will provide the correct text in the revised manuscript.

      Materials & Methods. BMDC electronmicroscopy. What is "IF". Please specify.

      __Response and revision plan: __We also regret this mistake in the method text. It should read: "... For electron microscopy analysis, after PDMS removal, cells were fixed using 2.5% glutaraldehyde ...". We will correct this in the revised manuscript.

      RESULTS in gene expression analyses. The authors observe some increase in apoptosis (as detected by cleaved-Caspase-3 staining). Is this observation in immunofluorescence also evident in the RNAseq data (where the IFN changes were seen), i.e., in Figure 5.

      __Response and revision plan: __We will check our RNAseq data regarding any changes in apoptosis-related genes and, if so, include these in the revised manuscript.

      Figure 7 F and G. Perhaps the authors may want to swap upper and lower panels in F or G, so that macrophage FACS plots and bar graphs are in the same row - ob, obiously, DC plots and bars likewise.

      __Response and revision plan: __We agree and will harmonize the panel sequence in the revised manuscript.

      Figure 7H. "Gating strategy in ArpC4WT Lung (previously gated in Live CD45+ cells)" - The lower row is knock-out, not WT. This is indicated correctly in the legand, but in the figure both rows are labeled as WT.

      __Response and revision plan: __Indeed, the legend information is correct, but the corresponding figure panel is incorrect. We will provide a corrected version with revision.

      The reference by Park et al. 2021 is missing in the list.

      __Response and revision plan: __We will add the reference to the revised bibliography.

      Figure 1D. Sure, the bar graphs are meant to say "CD11c"? The FACS plots show "CD11b".

      __Response and revision plan: __We will check the panels and correct where necessary.

      As to cDC1. In Figure 1D the FACS plot shows an absence of CD103+ cDC1 cells. In contrast, In Figure 7A-left side panel, there is not difference in cDC1 cells between WT and KO mice. Is therefore the flow cytometry plot in Figure 1D not representative regarding cDC1 cells? Correct?

      __Response and revision plan: __The reviewer is correct about this apparent discrepancy. We have not observed differences in the control vs. Aprc4-KO cDC1 population, hence Figure 7 represents our findings. For figure 1, we have by mistake chosen a non-representative plot, with the aim of illustrating the gating strategy. We apologize for this mistake and will provide a corrected an representative FACS plot figure in the revised manuscript.

      Reviewer #2 (Significance (Required)):

      • DESCRIBE THE NATURE AND SIGNIFICANCE OF THE ADVANCE (E.G. CONCEPTUAL, TECHNICAL, CLINICAL) FOR THE FIELD. This is a conceptual advance. It adds a big step to our understanding of how immune cells in tissues (which all come from the bone marrow or are seeded before birth from embryonal hematopoietic organs such as yolk sac and fetal liver) can remain resident in these tissues. For cell types such as Langerhans cells, which establish their final population density within their tissues of residence, the presented finding convincingly buttress the role of proliferation and thereby the role for the actin-related protein complex 2/3 (Arp2/3).

      • PLACE THE WORK IN THE CONTEXT OF THE EXISTING LITERATURE (PROVIDE REFERENCES, WHERE APPROPRIATE). While we know much about actin-related proteins (Arp), as correctly cited by the authors, this knowledge is derived mostly from in vitro studies. The submitted study translates the findings to an in vivo setting for the first time.

      • STATE WHAT AUDIENCE MIGHT BE INTERESTED IN AND INFLUENCED BY THE REPORTED FINDINGS. Skin immunologists foremost, but these findings are of interest to the entire community of immunologists, but also cell biologists.

      • DEFINE YOUR FIELD OF EXPERTISE. My expertise is in skin immunology, in particular skin dendritic cells including Langerhans cells.

      We acknowledge the referee for their positive assessment of our manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The manuscript identifies a role of the Arp2/3 complex, the major regulator of actin branching in cells, for controlling the homeostasis of murine Langerhans cells (LCs), a specialized subset of dendritic cells in the skin epidermis. The findings of the study are based on the analysis of CD11c-Cre Arpc4-flox mice, a conditional knockout mouse model, which interferes with Arp2/3 function in Langerhans cells and other CD11c-expressing myeloid cells, e.g. dendritic cell or macrophage subsets. By using immunofluorescence and flow cytometry analysis of epidermis and skin tissues, the authors provide a detailed analysis of LC numbers at different developmental stages (postnatal day 1, 7, 28, and adult mice) and demonstrate that Arpc4-deficiency does not interfere with the establishment of LC networks until postnatal day 28. However, LCs in ear and tail skin are substantially reduced in Arpc4-deficient mice at 8-12 weeks of age. In parallel to their in vivo model, the authors analyze cultures of bone marrow-derived dendritic cells (BMDCs) from control and CD11c-Cre Arpc4-flox mice. Arpc4-deficiency in BMDCs, which develop over 8-10 days in culture, results in nuclear shape and lamina abnormalities, as well as signs of increased DNA damage. Aspects of this phenotype are also detected in Langerhans cells in epidermal preparations. Transcriptomic analysis of BMDCs highlights a gene signature of increased expression of the interferon response pathway and alterations in cell cycle regulation. Arpc4-deficient BMDCs show increased expression of DNA damage markers and reduced expression of certain DNA repair factors. Based on these correlative findings from the BMDC model, the authors conclude that the decline in LC numbers might develop from the accumulation of DNA damage over time, which the authors phrease "pre-mature aging of Langerhans cells". Lastly, the authors show a heterogenous picture how Arp2/3 depletion affects distinct DC populations in CD11c-Cre Arpc4-flox mice. While some tissue-resident DC subsets appear normal in numbers, others are declined in numbers in the tissue. This may be related to their proliferation potential in tissues.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      1) The authors claim that Arpc4 deficiency selectively compromises myeloid cell populations that rely on proliferation for tissue colonization (Figure 7). The presented data might give hints for such a general hypothesis, but solid experimental proof to prove this is lacking. When comparing myeloid cell subsets from foru different irgans, the authors refer to published data that some dendritic cell subsets are more proliferative in tissues than others and that CD11cCre Arpc4-flox mice appear to have reduced cell numbers in these populations. However, the presented data are purely correlative and no functional connection to cell proliferation has been made to the phenotypes. While some dendritic cell subsets (Langerhans cells, alveolar DCs) show reduced cell numbers in CD11cCre Arpc4-flox mice, other myeloid cell cells subsets are unaffected (e.g. dermal cDC1 and 2, colon macrophages).There could be plenty of other reasons that might underly the observed discrepancies between these cell subsets, e.g. Arp2/3 knockout efficiency and myeloid cell turnover in the tissue are just two examples, which have not been taken into consideration. Direct measurement of cell proliferation, e.g. BrdU labeling, and the observed phenotype would be missing to make such claims. The data could either be removed. Experimentally addressing these points could take 3-6 months.

      Response and planned revisions: We thank the referee for bringing this point. We agree that these results give hints that support our conclusion but that do not address this question directly. However, we would like to insist on the fact that our conclusion is based on studies from others showing that alveolar macrophages self-maintain themselves through proliferation (Bain et al. Mucosal Immunology 2022). In contrast, it has been reported that most colonic macrophages are derived from monocytes that are being recruited to the gut through life (Bain et al. Mucosal Immunity 2023)

      We propose to better explain and discuss these points in our revised manuscripts. In addition, we will stress that we do not exclude that different intracellular Arpc4-dependent processes might contribute to the phenotypes observed (beyond maintenance of DNA integrity). These revisions will help mitigate our conclusions and leave open the potential implication of alternative mechanisms that will be discussed as suggested by the referee.

      2) The authors claim that DC subsets (e.g. dermal cDCs), which develop from pre-DCs, are not affected by Arp2/3 depletion (Figure 7, although the FACS plot in Fig. 1D would suggest a different picture for cDC1). This is surprising in light of the data with bone marrow-derived DCs (BMDCs), the major in vitro model of this study, which develop from CDPs that again develop from pre-DCs. BMDCs did show aberrant nuclei and signs of DNA damage. How would the authors then explain the discrepancies of the BMDC model with DC subsets, where the authors feel that the pre-DC origin explains the phenotypic difference? This is a general concern of the data interpretation and conclusions.

      __Response: __We thank the referee to bring this point that indeed requires clarification. Two non-exclusive hypotheses could explain this apparent discrepancy:

      • The ontogeny of bone-marrow-derived DCs: Depending on the protocol used, there might be variations in the precursors DCs develop from. We use one of the first protocols, which was pioneered by Paola Ricciardi-Castagnoli lab (Winzler et al. J.Exp.Med. 1997). It relies on a supernatant from J558 cells transfected with GMCSF, which contains additional cytokines and mainly generate DC2-like DCs. Langerhans cells are closer to DC2s, which resemble more macrophages than DC1s. We thus chose this protocol rather than the protocols that use Flt3-L, which produce both DC1s and DC2s developed from common dendritic-cell precursors (CDPs). It is thus possible that our BM-derived DCs develop from other precursor cells that are possibly closer to monocyte precursors.
      • As shown in Figure 5C, kinetics of acquisition of CD11c expression, and thus deletion of the Arpc4 gene, might be distinct in vivo and in vitro. In vivo, as stated in our manuscript, DCs acquire CD11c as preDCs and undergo few rounds of divisions after. In vitro, as shown by our cycling experiments, BM-derived DCs continuously cycle, so they will keep dividing after having acquire CD11c (around day 7) and deleting the Arpc4 gene. __Revision plan: __We propose to mention these hypotheses in the discussion of our manuscript to explain the apparent contradiction raised by the referee.

      3) In line with point 2, the authors never show that BMDCs show reduced proliferation, reduced cell numbers or increased cell death in Arpc4-deficient cell cultures, as a consequence of the detected DNA damage and impaired DNA repair. In fact, Figure 5C even shows that cell growth rates between control and KO are equal. This is a major mismatch in the current study. Since the authors use the BMDC model to explain the declining cell numbers in Langerhans cells (which derive from fetal liver cells), this phenotype is not mirrored by the BMDC culture and it remains open whether the observed changes in nuclear DNA damage and repair are indeed directly linked to the observed phenotype of declining cell numbers in the tissue. These aspects require argumentation why cell growth is unchanged in KO cells. Additional experiments addressing these points with sufficient biological replicates (cultures from different mice) could take 2-3 months, including preparation time.

      __Response____: __We thank the referee for bringing this point, which was probably not properly discussed in the first version of our manuscript. Indeed, Arpc4KO BM-derived DCs do not show the premature cell death phenotype observed in LCs in vivo, as stated by the referee. There are at least two putative non-exclusive explanations for this. First, unlike LCs, which are long-lived cells, BM-derived DCs can be kept in culture for only 10-12 days. As DNA damage-induced cell death takes time (LCs only start to die about 3-4 weeks after network establishment), the lifespan of BM-DCs could simply not be long enough to observe this phenotype. Second, in the epidermis, LCs are physically constrained and continuously exposed to diverse signals that might increase their sensitivity to DNA damage and thereby induction of subsequent cell death.

      __Revision Plan: __We will clarify this point in our revised manuscript by providing putative explanations for the death phenotype of Arpc4-deficient LCs not being observed in BM-derived DCs. We will further explain that this does not invalidate this cellular model as it was used to raise hypotheses on the putative role played by Arpc4 in myeloid cells, i.e. maintenance of DNA integrity, which was then confirmed in vivo (Arpc4KO LCs do indeed display DNA damage in the epidermis). Without this "imperfect cellular model", we would have probably not been able to uncover this novel function of Arp2/3 in immune cells.

      4) The authors refer to a "pre-mature aging" phenotype of Arpc4-deficient BMDCs and LCs, based on reductions in Lamin B, Lamin A and increases in gH2AX and 53BP1. I find this term and overstatement of the current data and suggest that other markers for cell senescence, such as p53, Rb, p21 and b-Galactosidase are then also used to make such strong claim on "aging" and cell senescence. Experimentally addressing this point with sufficient biological replicates could take 2-3 months, including preparation time.

      __Revision Plan: __We will assess the expression of these genes and senescence signatures in our RNAseq analysis as well as in Arpc4WT and Arpc4KO-derived DCs, as suggested by the referee.

      5) The study does not provide a mechanism how the Arp2/3 complex would mediate the observed effects on DNA damage and repairs has not been addressed in the cell model, and only potential scenarios from other non-myeloid cell lines are discussed. It remains unclear whether the observed phenotypes in Arpc4-depleted myleoid cells relate to the direct nuclear function of Arp2/3 or the cytosolic function of Arp2/3, including its roles in cytoskeletal regulation that may have secondary effects on the nuclear alterations. This is a general concern of the presented data, data on mechanism might require more than 6 months.

      __Revision Plan: __The referee is correct: Our manuscript shows that Arp2/3 deficiency in specific myeloid cells impacts on their survival in vivo and proposes that this could result at least in part from impaired maintenance of DNA integrity in these cells. We do not know whether this also applies to non-myeloid cells, which, although very interesting, is beyond the scope of the present study. In addition, we do not have any experimental tool to distinguish whether the DNA damage phenotype of Arpc4KO cells involves the nuclear or cortical pool of F-actin, this is why we have left this question open in the discussion of our manuscript.

      6) OPTIONAL: The authors make a strong case arguing that the increased interferon expression signature (based on the transcriptomics data) reflects the nuclear ruptures in Arpc4-deficient cells and adds to the observed phenotype. If this is so, what happens then in STING knockout cells in the presence of CK666 inhibitor?


      __Revision Plan____: __The referee is correct in that we do not show this point experimentally and should therefore temper this conclusion.

      • Are the data and the methods presented in such a way that they can be reproduced?

      1) The analyses include quite a number of intensity calculations of immunofluorescence signals (Fig. 3D, E; Fig. 4E, Fig. 5B and 6B)? The background stainings are often variable or very high. In some cases it is even unclear whether stainings are really detecting protein and go beyond background staining (Fig. 6A, Fig. 5F). How were immunofluorescence data acquired and dealt with different background staining intensities?

      __Revision Plan: __We will carefully describe the microscopes used for image acquisition as well as the downstream analyses for each experiment, which indeed vary depending on the signals observed with distinct antibodies or construct.

      2) It remained unclear to me on which basis the nuclear deformations in Fig. 3G, H were calculated?

      __Revision Plan: __We will carefully describe the methods used to quantify nuclear deformations.

      3) The detailed phenotype of control mice is a bit unclear. It appears as if these were Cre-negative animals. Did the authors have some proof-of-principle experiments showing that CD11cCre Arpc4 +/+ animals have comparable phenotypes to Cre-negative animals?

      • Are the experiments adequately replicated and statistical analysis adequate?

      __Revision Plan: __We have never observed any decline in LC numbers in other mouse lines/genotypes (for example in cPLA2flox/flox;CD11c-Cre mice shown in the manuscript, Fig. S6B), excluding a putative role for the Cre in LC death.

      For most experiments, the number of biological replicates (mice, or BMDC cultures from different mice) and individual values (n, cells) are indicated. Statistical analysis appears adequate.

      Minor comments:

      • Prior published studies on Arp2/3 function in immune cells are referenced accordingly. A number of additional pre-print manuscripts on this topic have not been cited and could be considered referencing.


      __Revision Plan: __We will fix this point and cite additional, relevant preprints.

      • The text is very clearly and very well written. Figures are clear and accurate for most cases. There are some open questions:

      • Fig. 1B: The number of dots betwenn graph and legend do not match. The dots are not n=12 for both genotypes. Additionally: What do the symbols in the circles in the graph stand for? This is also in another later figure unclear.

      • Fig. 2C: The current IF presentation (overlay MHCII with Ki67) is not very helpful. An additional image that shows only the Ki67 signal in the MHCII mask would be very helpful.

      • Fig. 4B: BMDCs of which culture day were used for these experiments?

      • Fig. 4A and D shows the same representative cells for two biological messages, which is only moderately convincing regarding a "general" phenotype.

      • Fig. 5, B: Scale bars are missing.

      __Revision Plan: __We will fix all these points.

      Reviewer #3 (Significance (Required)):

      Strengths and Advance:

      The study provides strong data and a very detailed analysis of how the Arp2/3 complex regulates stages of Langerhans cell development and homeostasis. The role of the Arp2/3 complex as regulator of actin branching, which is involved in many cellular functions, has previously not been reported for this cell type. Previous research in immune cells have already studied the Arp2/3 complex, but studies were focussed on its role in migration and the majority of published phenotypes related to cell migration. While there are already a number of in vitro studies showing that the Arp2/3 complex can regulate aspects of cell cycle control or cell death in non-immune cells, most of these studies were performed with immortalized, non-immune cell lines, which can be more easily manipulated to dissect mechanistic aspects of the cellular phenotype, but are limited in their physiological interpretation. Hence, it is a major strength of this study to investigate the effects of Arp2/3 in a primary immune cell type, directly in the native and physiological environment. This is important because in vitro data from other cell types cannot be easily extrapolated to any other cell type and it is critical for our understanding to collect physiological data from tissues, where the biology really happens. The finding that the Arp2/3 complex regulates the tissue-residency of Langerhans cell through processes that are unrelated to migration are partially unexpected, shifting the view of this protein complex's physiological role to other cell biological processes, e.g. regulation of cell proliferation.

      Limitations: The limitations of the study are detailed in the five major points listed above. The study accumulates many experiments that characterize the phenotype of Arpc4-depleted cells, showing signs of DNA damage in Langerhans cells and cultures of BMDCs. How the Arp2/3 complex would mechanistically mediate the observed effects on DNA damage and repairs have not been addressed. It also remains open whether this is due to the effects of the Arp2/3 complex in the nucleus or the cytosol, which would be biologically extremely important to understand. Above that, there are some discrepancies regarding the phenotype of the BMDC model, which does neither entirely match the Langerhans cell phenotype in the tissue (reduced proliferation, LC derive from different progenitors), nor other endogenous DC populations, which should also derive from similar progenitors.

      Audience and reviewer background:

      In its current form, the manuscript will already be of interest for several research fields: Langerhans cell and dendritic cell homeostasis, immune cell trafficking, actin and cytoskeleton regulation in immune cells, physiological role of actin-regulating proteins. My own field of expertise is immune cell trafficking in mouse models, leukocyte migration and cytoskeletal regulation. I cannot judge the analysis and clustering of the bulk RNA sequencing data.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      • *

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript identifies a role of the Arp2/3 complex, the major regulator of actin branching in cells, for controlling the homeostasis of murine Langerhans cells (LCs), a specialized subset of dendritic cells in the skin epidermis. The findings of the study are based on the analysis of CD11c-Cre Arpc4-flox mice, a conditional knockout mouse model, which interferes with Arp2/3 function in Langerhans cells and other CD11c-expressing myeloid cells, e.g. dendritic cell or macrophage subsets. By using immunofluorescence and flow cytometry analysis of epidermis and skin tissues, the authors provide a detailed analysis of LC numbers at different developmental stages (postnatal day 1, 7, 28, and adult mice) and demonstrate that Arpc4-deficiency does not interfere with the establishment of LC networks until postnatal day 28. However, LCs in ear and tail skin are substantially reduced in Arpc4-deficient mice at 8-12 weeks of age. In parallel to their in vivo model, the authors analyze cultures of bone marrow-derived dendritic cells (BMDCs) from control and CD11c-Cre Arpc4-flox mice. Arpc4-deficiency in BMDCs, which develop over 8-10 days in culture, results in nuclear shape and lamina abnormalities, as well as signs of increased DNA damage. Aspects of this phenotype are also detected in Langerhans cells in epidermal preparations. Transcriptomic analysis of BMDCs highlights a gene signature of increased expression of the interferon response pathway and alterations in cell cycle regulation. Arpc4-deficient BMDCs show increased expression of DNA damage markers and reduced expression of certain DNA repair factors. Based on these correlative findings from the BMDC model, the authors conclude that the decline in LC numbers might develop from the accumulation of DNA damage over time, which the authors phrease "pre-mature aging of Langerhans cells". Lastly, the authors show a heterogenous picture how Arp2/3 depletion affects distinct DC populations in CD11c-Cre Arpc4-flox mice. While some tissue-resident DC subsets appear normal in numbers, others are declined in numbers in the tissue. This may be related to their proliferation potential in tissues.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      1) The authors claim that Arpc4 deficiency selectively compromises myeloid cell populations that rely on proliferation for tissue colonization (Figure 7). The presented data might give hints for such a general hypothesis, but solid experimental proof to prove this is lacking. When comparing myeloid cell subsets from foru different irgans, the authors refer to published data that some dendritic cell subsets are more proliferative in tissues than others and that CD11cCre Arpc4-flox mice appear to have reduced cell numbers in these populations. However, the presented data are purely correlative and no functional connection to cell proliferation has been made to the phenotypes. While some dendritic cell subsets (Langerhans cells, alveolar DCs) show reduced cell numbers in CD11cCre Arpc4-flox mice, other myeloid cell cells subsets are unaffected (e.g. dermal cDC1 and 2, colon macrophages).There could be plenty of other reasons that might underly the observed discrepancies between these cell subsets, e.g. Arp2/3 knockout efficiency and myeloid cell turnover in the tissue are just two examples, which have not been taken into consideration. Direct measurement of cell proliferation, e.g. BrdU labeling, and the observed phenotype would be missing to make such claims. The data could either be removed. Experimentally addressing these points could take 3-6 months.

      2) The authors claim that DC subsets (e.g. dermal cDCs), which develop from pre-DCs, are not affected by Arp2/3 depletion (Figure 7, although the FACS plot in Fig. 1D would suggest a different picture for cDC1). This is surprising in light of the data with bone marrow-derived DCs (BMDCs), the major in vitro model of this study, which develop from CDPs that again develop from pre-DCs. BMDCs did show aberrant nuclei and signs of DNA damage. How would the authors then explain the discrepancies of the BMDC model with DC subsets, where the authors feel that the pre-DC origin explains the phenotypic difference? This is a general concern of the data interpretation and conclusions.

      3) In line with point 2, the authors never show that BMDCs show reduced proliferation, reduced cell numbers or increased cell death in Arpc4-deficient cell cultures, as a consequence of the detected DNA damage and impaired DNA repair. In fact, Figure 5C even shows that cell growth rates between control and KO are equal. This is a major mismatch in the current study. Since the authors use the BMDC model to explain the declining cell numbers in Langerhans cells (which derive from fetal liver cells), this phenotype is not mirrored by the BMDC culture and it remains open whether the observed changes in nuclear DNA damage and repair are indeed directly linked to the observed phenotype of declining cell numbers in the tissue. These aspects require argumentation why cell growth is unchanged in KO cells. Additional experiments addressing these points with sufficient biological replicates (cultures from different mice) could take 2-3 months, including preparation time.

      4) The authors refer to a "pre-mature aging" phenotype of Arpc4-deficient BMDCs and LCs, based on reductions in Lamin B, Lamin A and increases in gH2AX and 53BP1. I find this term and overstatement of the current data and suggest that other markers for cell senescence, such as p53, Rb, p21 and b-Galactosidase are then also used to make such strong claim on "aging" and cell senescence. Experimentally addressing this point with sufficient biological replicates could take 2-3 months, including preparation time.

      5) The study does not provide a mechanism how the Arp2/3 complex would mediate the observed effects on DNA damage and repairs has not been addressed in the cell model, and only potential scenarios from other non-myeloid cell lines are discussed. It remains unclear whether the observed phenotypes in Arpc4-depleted myleoid cells relate to the direct nuclear function of Arp2/3 or the cytosolic function of Arp2/3, including its roles in cytoskeletal regulation that may have secondary effects on the nuclear alterations. This is a general concern of the presented data, data on mechanism might require more than 6 months.

      6) OPTIONAL: The authors make a strong case arguing that the increased interferon expression signature (based on the transcriptomics data) reflects the nuclear ruptures in Arpc4-deficient cells and adds to the observed phenotype. If this is so, what happens then in STING knockout cells in the presence of CK666 inhibitor?

      • Are the data and the methods presented in such a way that they can be reproduced?

      1) The analyses include quite a number of intensity calculations of immunofluorescence signals (Fig. 3D, E; Fig. 4E, Fig. 5B and 6B)? The background stainings are often variable or very high. In some cases it is even unclear whether stainings are really detecting protein and go beyond background staining (Fig. 6A, Fig. 5F). How were immunofluorescence data acquired and dealt with different background staining intensities?

      2) It remained unclear to me on which basis the nuclear deformations in Fig. 3G, H were calculated?

      3) The detailed phenotype of control mice is a bit unclear. It appears as if these were Cre-negative animals. Did the authors have some proof-of-principle experiments showing that CD11cCre Arpc4 +/+ animals have comparable phenotypes to Cre-negative animals?

      • Are the experiments adequately replicated and statistical analysis adequate?

      For most experiments, the number of biological replicates (mice, or BMDC cultures from different mice) and individual values (n, cells) are indicated. Statistical analysis appears adequate.

      Minor comments:

      • Prior published studies on Arp2/3 function in immune cells are referenced accordingly. A number of additional pre-print manuscripts on this topic have not been cited and could be considered referencing.

      • The text is very clearly and very well written. Figures are clear and accurate for most cases. There are some open questions:

      1) Fig. 1B: The number of dots betwenn graph and legend do not match. The dots are not n=12 for both genotypes. Additionally: What do the symbols in the circles in the graph stand for? This is also in another later figure unclear.

      2) Fig. 2C: The current IF presentation (overlay MHCII with Ki67) is not very helpful. An additional image that shows only the Ki67 signal in the MHCII mask would be very helpful.

      3) Fig. 4B: BMDCs of which culture day were used for these experiments?

      4) Fig. 4A and D shows the same representative cells for two biological messages, which is only moderately convincing regarding a "general" phenotype.

      5) Fig. 5, B: Scale bars are missing.

      Significance

      Strengths and Advance:

      The study provides strong data and a very detailed analysis of how the Arp2/3 complex regulates stages of Langerhans cell development and homeostasis. The role of the Arp2/3 complex as regulator of actin branching, which is involved in many cellular functions, has previously not been reported for this cell type. Previous research in immune cells have already studied the Arp2/3 complex, but studies were focussed on its role in migration and the majority of published phenotypes related to cell migration. While there are already a number of in vitro studies showing that the Arp2/3 complex can regulate aspects of cell cycle control or cell death in non-immune cells, most of these studies were performed with immortalized, non-immune cell lines, which can be more easily manipulated to dissect mechanistic aspects of the cellular phenotype, but are limited in their physiological interpretation. Hence, it is a major strength of this study to investigate the effects of Arp2/3 in a primary immune cell type, directly in the native and physiological environment. This is important because in vitro data from other cell types cannot be easily extrapolated to any other cell type and it is critical for our understanding to collect physiological data from tissues, where the biology really happens. The finding that the Arp2/3 complex regulates the tissue-residency of Langerhans cell through processes that are unrelated to migration are partially unexpected, shifting the view of this protein complex's physiological role to other cell biological processes, e.g. regulation of cell proliferation.

      Limitations:

      The limitations of the study are detailed in the five major points listed above. The study accumulates many experiments that characterize the phenotype of Arpc4-depleted cells, showing signs of DNA damage in Langerhans cells and cultures of BMDCs. How the Arp2/3 complex would mechanistically mediate the observed effects on DNA damage and repairs have not been addressed. It also remains open whether this is due to the effects of the Arp2/3 complex in the nucleus or the cytosol, which would be biologically extremely important to understand. Above that, there are some discrepancies regarding the phenotype of the BMDC model, which does neither entirely match the Langerhans cell phenotype in the tissue (reduced proliferation, LC derive from different progenitors), nor other endogenous DC populations, which should also derive from similar progenitors.

      Audience and reviewer background:

      In its current form, the manuscript will already be of interest for several research fields: Langerhans cell and dendritic cell homeostasis, immune cell trafficking, actin and cytoskeleton regulation in immune cells, physiological role of actin-regulating proteins. My own field of expertise is immune cell trafficking in mouse models, leukocyte migration and cytoskeletal regulation. I cannot judge the analysis and clustering of the bulk RNA sequencing data.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      • This is a study in experimental mice employing both in vitro and, importantly, in vivo approaches. EPIDERMAL LANGERHANS CELLS serve as a paradigm for the maintenance of homeostasis of myeloid cells in a tissue, epidermis in this case. In addition to well known functions of the ACTIN NETWORK in cell migration, chemotaxis, cell adherence and phagocytosis the authors reveal a critical function of actin networks in the survival of cells in their home tissue.

      • Actin-related proteins (Arp), specifically here the Arp2/3 complex, are necessary to form the filamentous actin networks. The authors use conditional knock-out mice where Arpc4 (an essential component of the Arp2/3 complex) is deleted under the control of CD11c, the most prominent dendritic cell marker which is also expressed on Langerhans cells. In normal mice, epidermal Langerhans cells reside in the epidermis virtually life-long. They initially settle the epidermis around and few days after birth an establish a dense network by a burst of proliferation and then they "linger on" by low level maintenance proliferation. In the epidermis of Arpc4 knock-out mice Langerhans cells also start off with this proliferative burst but, strikingly, they do not stay but are massively reduced by the age of 8-12 weeks.

      • The analyses of this decline revealed that

      a) the shape (number of nuclear lobes) and integrity of cell nuclei was compromised; they were fragile and ruptured to some degree when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      b) DNA damage, as detected by staining for gamma-H2Ax or 53BP1 accumulated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      c) recruitment of DNA repair molecules was inhibited when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      d) gene signatures of interferon signaling and response were increased when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      e) in vivo migration of dendritic cells and Langerhans cells from the skin to the draining lymph nodes in an inflammatory setting (FITC painting of the skin) was impaired when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing;

      f) the persistence of the typical dense network of Langerhans cells in the epidermis, created by proliferation shortly after birth, is abrogated when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing. Importantly, this was not the case for myeloid cell populations that settle a tissue without needing that initial burst of proliferation. For instance, numbers of colonic macrophages were not affected when Arpc4 was knocked out, i.e., the Arp2/3 complex was missing.

      • Thus, the authors conclude that the Arp2/3 complex is essential by its formation of actin networks to maintain the integrity of nuclei and ensure DNA repair thereby ascertaining the maintenance proliferation of Langerhans cells and, as the consequence, the persistence of the dense epidermal netowrk of Langerhans cells.

      • Up-to-date methodology from the fields of cell biology and cellular immunology (cell isolation from tissues, immunofluorescence, multiparameter flow cytometry, FISH, "good old" - but important - transmission electronmicroscopy, etc.) was used at high quality (e.g., immunofluorescence pictures!). Quantitative and qualitative analytical methods were timely and appropriate (e.g., Voronoi diagrams, cell shape profiling tools, Cre-lox gene-deletion technology, etc.). Importantly, the authors used a clever method, that they had developed several years ago, namely the analysis of dendritic cell migration in microchannels of defined widths. Molecular biology methods such as RNAseq were also employed and analysed by appropriate bioinformatic tools.

      Major comments:

      • ARE THE KEY CONCLUSIONS CONVINCING? Yes, they are.

      • SHOULD THE AUTHORS QUALIFY SOME OF THEIR CLAIMS AS PRELIMINARY OR SPECULATIVE, OR REMOVE THEM ALTOGETHER? No, I think it is ok as it stands. The authors are wording their claims and conclusions not apodictically but cautiously, as it should be. They point out explicitely which lines of investigations they did not follow up here.

      • WOULD ADDITIONAL EXPERIMENTS BE ESSENTIAL TO SUPPORT THE CLAIMS OF THE PAPER? REQUEST ADDITIONAL EXPERIMENTS ONLY WHERE NECESSARY FOR THE PAPER AS IT IS, AND DO NOT ASK AUTHORS TO OPEN NEW LINES OF EXPERIMENTATION. I think that the here presented experimental evidence suffices to support the conclusions drawn. No additional experiments are necessary.

      • ARE THE SUGGESTED EXPERIMENTS REALISTIC IN TERMS OF TIME AND RESOURCES? IT WOULD HELP IF YOU COULD ADD AN ESTIMATED COST AND TIME INVESTMENT FOR SUBSTANTIAL EXPERIMENTS. Not applicable.

      • ARE THE DATA AND THE METHODS PRESENTED IN SUCH A WAY THAT THEY CAN BE REPRODUCED? Yes, they are.

      • ARE THE EXPERIMENTS ADEQUATELY REPLICATED AND STATISTICAL ANALYSIS ADEQUATE? Yes.

      Minor comments:

      • SPECIFIC EXPERIMENTAL ISSUES THAT ARE EASILY ADDRESSABLE. None

      • ARE PRIOR STUDIES REFERENCED APPROPRIATELY? Essentially yes. Regarding the reduction / loss of the adult epidermal Langerhans cell network, it may be of some interest to also refer to / discuss to another one of the few examples of this phenomenon. There, the initial burst of proliferation is followed by reduced proliferation and increased apoptosis when a critical member of the mTOR signaling cascade is conditionally knocked out (Blood 123:217, 2014).

      • ARE THE TEXT AND FIGURES CLEAR AND ACCURATE? Yes they are. Figures are well arranged for easy comprehension.

      • DO YOU HAVE SUGGESTIONS THAT WOULD HELP THE AUTHORS IMPROVE THE PRESENTATION OF THEIR DATA AND CONCLUSIONS?

      • Materials & Methods. The authors write, regarding flow cytometry of epidermal cells: "Briefly, 1cm2 of back skin from 8-14 weeks old female wild-type and knockout littermates was dissociated in 0.25 mg/mL Liberase (Sigma, cat. #5401020001) and 0.5 mg/mL DNase (Sigma, cat.#10104159001) in 1 mL of RPMI (Sigma) and mechanically disaggregated in Eppendorf tubes, FOLLOWED BY INCUBATED for 2 h at 37 {degree sign}C." Followed by what?

      • Materials & Methods. BMDC electronmicroscopy. What is "IF". Please specify.

      • RESULTS in gene expression analyses. The authors observe some increase in apoptosis (as detected by cleaved-Caspase-3 staining). Is this observation in immunofluorescence also evident in the RNAseq data (where the IFN changes were seen), i.e., in Figure 5.

      • Figure 7 F and G. Perhaps the authors may want to swap upper and lower panels in F or G, so that macrophage FACS plots and bar graphs are in the same row - ob, obiously, DC plots and bars likewise.

      • Figure 7H. "Gating strategy in ArpC4WT Lung (previously gated in Live CD45+ cells)" - The lower row is knock-out, not WT. This is indicated correctly in the legand, but in the figure both rows are labeled as WT.

      • The reference by Park et al. 2021 is missing in the list.

      • Figure 1D. Sure, the bar graphs are meant to say "CD11c"? The FACS plots show "CD11b".

      • As to cDC1. In Figure 1D the FACS plot shows an absence of CD103+ cDC1 cells. In contrast, In Figure 7A-left side panel, there is not difference in cDC1 cells between WT and KO mice. Is therefore the flow cytometry plot in Figure 1D not representative regarding cDC1 cells? Correct?

      Significance

      • DESCRIBE THE NATURE AND SIGNIFICANCE OF THE ADVANCE (E.G. CONCEPTUAL, TECHNICAL, CLINICAL) FOR THE FIELD. This is a conceptual advance. It adds a big step to our understanding of how immune cells in tissues (which all come from the bone marrow or are seeded before birth from embryonal hematopoietic organs such as yolk sac and fetal liver) can remain resident in these tissues. For cell types such as Langerhans cells, which establish their final population density within their tissues of residence, the presented finding convincingly buttress the role of proliferation and thereby the role for the actin-related protein complex 2/3 (Arp2/3).

      • PLACE THE WORK IN THE CONTEXT OF THE EXISTING LITERATURE (PROVIDE REFERENCES, WHERE APPROPRIATE). While we know much about actin-related proteins (Arp), as correctly cited by the authors, this knowledge is derived mostly from in vitro studies. The submitted study translates the findings to an in vivo setting for the first time.

      • STATE WHAT AUDIENCE MIGHT BE INTERESTED IN AND INFLUENCED BY THE REPORTED FINDINGS. Skin immunologists foremost, but these findings are of interest to the entire community of immunologists, but also cell biologists.

      • DEFINE YOUR FIELD OF EXPERTISE. My expertise is in skin immunology, in particular skin dendritic cells including Langerhans cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      • The manuscript by Delgado et al. reports the role of the actin remodeling Arp2/3 complex in the biology of Langerhans cells, which are specialized innate immune cells of the epidermis. The study is based on a conditional KO mouse model (CD11cCre;Arpc4fl/fl), in which the deletion of the Arp2/3 subunit ArpC4 is under the control of the myeloid cell specific CD11c promoter.

      • In this model, the assembly of LC networks in the epidermis of ear and tail skin is preserved when examining animals immediately after birth (up to 1 week). Subsequently however LCs from ArpC4-deleted mice start displaying morphological aberrations (reduced elongation and number of branches at 4 weeks of age). Additionally, a profound decline in LC numbers is reported in the skin of both the ear and tail of young adult mice (8-10 weeks).

      • To explore the cause of such decline, the authors then opt for the complementary in vitro study of bone-marrow derived DCs, given the lack of a model to study LCs in vitro. They report that ArpC4 deletion is associated with aberrantly shaped nuclei, decreased expression of the nucleoskeleton proteins Lamin A/C and B1, nuclear envelop ruptures and increased DNA damage as shown by γH2Ax staining. Importantly, they provide evidence that the defects evoked by ArpC4 deletion also occur in the LCs in situ (immunofluorescence of the skin in 4-week old mice).

      • Increased DNA damage is further documented by staining differentiating DCs from ArpC4-deleted mice with the 53BP1 marker. In parallel, nuclear levels of DNA repair kinase ATR and recruitment of RPA70 (which recruits ATR to replicative forks) are reduced in the ArpC4-deleted condition. In vitro treatment of DCs with the topoisomerase II inhibitor etoposide and the Arp2/3 inhibitor CK666 induce comparable DNA damage, as well as multilobulated nuclei and DNA bridges. The authors conclude that the ArpC4-KO phenotype might stem, at least in part, from a defective ability to repair DNA damages occurring during cell division.

      • The study in enriched by an RNA-seq analysis that points to an increased expression of genes linked to IFN signaling, which the authors hypothetically relate to overt activation of innate nucleic acid sensing pathways.

      • The study ends by an examination of myeloid cell populations in ArpC4-KO mice beyond LCs. Skin cDC2 and cDC2 subsets display skin emigration defects (like LCs), but not numerical defects in the skin (unlike LCs). Myeloid cell subsets of the colon are also present in normal numbers. In the lungs, interstitial and alveolar macrophages are reduced, but not lung DC subsets. Collectively, these observations suggest that ArpC4 is essential for the maintenance of myeloid cell subsets that rely on cell division to colonize or to self-maintain within their tissue of residency (including LCs).

      Major comments:

      1. ArpC4 and Arp2/3 expression

      The authors argue that LCs from Arpc4KO mice should delete the Arpc4 gene in precursors that colonize the skin around birth. It would be important to show it to rule out the possibility that the lack of phenotype (initial seeding, initial proliferative burst) in young animals (first week) could be related to an incomplete deletion of ArpC4 expression. Also important would be to show what is happening to the Arp2/3 complex in LCs from Arpc4KO mice. In the in vitro studies with DCs, the level of ArpC4 and Arp2/3 deletion at the protein level is also not documented. The authors explain that surface expression of the CD11c marker, which drives Arpc4 deletion, gradually increased during differentiation of DCs: from 50% to 90% of the cells. Does that mean that loss of ArpC4 expression is only effective in a fraction of the cells examined before day 10 of differentiation (e.g. in the RNA-seq analysis)?

      1. Intra-nuclear versus extra-nuclear activities of Arp2/3

      The authors favor a model whereby intra-nuclear ArpC4 helps maintaining nuclear integrity during proliferation of DCs (and possibly LCs). However, multiple pools of Arp2/3 have been described and accordingly, multiple mechanisms may account for the observed phenotype: i) cytoplasmic pool to drive the protrusions sustaining the assembly of the LC network and its connectivity with keratinocytes ; ii) peri-nuclear pool to protect the nucleus ; iii) Intra-nuclear pool to facilite DNA repair mechanisms e.g. by stabilizing replicative forks (the scenario favored by the authors).

      It is recommended that the authors try to gather more supportive data to sustain the intra-nuclear role. Documenting ArpC4 presence in the nucleus would help support the claim. It could be combined with treatments aiming at blocking proliferation in order to reinforce the possibility that a main function of ArpC4 is to protect proliferating cells by favoring DNA repair inside the nucleus.

      1. Nuclear envelop ruptures

      The nuclear envelop ruptures are not sufficiently documented (how many cells were imaged? quantification?). The authors employ STED microscopy to examine Lamin B1 distribution. The image shown in Figure 4A does not really highlight the nuclear envelop, but rather the entire content. Whether it is representative is questionable. We would expect Lamin B1 staining intensity to be drastically reduced given the quantification shown in Figure 3D. In addition, although the authors have stressed in the previous figure that Arpc4-KO is associated with nucleus shape aberrations, the example shown in Figure 4A is that of a nucleus with a normal ovoid shape.

      It is recommended to quantify the ruptures with Lap2b antibodies (or another staining that would better delineate the envelop) in order to avoid the possible bias due to the reduced staining intensity of Lamin B1.

      A missing analysis is that of nuclear envelop ruptures as a function of nucleus deformations.

      Fig 4B-C: same frequency of Arpc4-KO and WT cells displaying nuclear envelop ruptures in the 4-µm channels; however image show a rupture for the Arpc4-KO and no rupture for the WT cells (this is somehow misleading). Are ruptures similar in Arpc4-KO and WT cells in this condition?

      Fig 4D-E: is their a direct link between nuclear envelop ruptures and ƴH2A.X?

      Interesting (but optional) would be to understand what is happening to DNA, histones? Is their evidence for leakage in the cytoplasm?

      1. RNA seq analysis

      The RNA-seq analysis suffers from a lack of direct connection with the rest of the study. The extracted molecular information is not validated nor further explored. It remains very descriptive. The PCA analysis suggests a « more pronounced transcriptomic heterogeneity in differentiating Arpc4KO DCs ». However it seems difficult to make such a claim from the comparison of 3 mice per group. In addition, such heterogeneity is not seen in the more detailed analysis (Fig 5F). The authors claim that « day 10 control and Arpc4KO DCs showed no to very little differences in gene expression, in contrast to cells at days 7-9 of differentiation ». This is not obvious from the data displayed in the corresponding figure. In addition, it is not expected that cells that may take a divergent differentiation path at days 7-9 may would return to a similar transcriptional activity at day 10. A point that is not discussed is that before day 10 of DC differentiation, Arpc4 KO is expected to only occur in about 50% of the cell population. This is expected to impact the RNA-seq analysis. Not all clusters have been exploited (e.g. cluster 3 elevated, cluster 6 partly reduced). I suggest the authors reconsider their analysis and analysis of the RNA-seq analysis (or eventually invest in complementary analysis).

      1. What causes the profound numerical drop of LC in the epidermis?

      A major open question is what causes the massive drop of LCs. Although differentiating Arpc4KO DCs start accumulating DNA damage upon proliferation, they succeed in progressing through the cell cycle. There is even a slightly elevated expression of cell cycle genes at day 7 of differentiation in the DC model. Only a trend for increased apoptosis is observed in ear and tail skin. It would be important to provide complementary data documenting increased death (or aberrant emigration?) of LCs in the 4-8 week time window.

      1. Functional consequences

      Although the study reports novel aspects of LC biology, the consequence of ArpC4 deletion for skin barrier function and immunosurveillance are not investigated. It would seem very relevant to test how this model copes with radiation, chemical and/or microorganism challenges.

      Minor comments:

      1. Figure 1D

      Gating strategy: twice the same empty plots. The content seems to be missing... Does this need to be shown in the main figure?

      1. Figure 2

      Best would be to keep same scale to compare P1 and P7 (tail skin, figure 2A)

      Overlay of Ki67 and MHC-II does not allow to easily visualize the double-positive cells (Fig 2C)

      Quality of Ki67 staining different for Arpc4-KO (less intense, less focused to the nuclei): a technical issue or could that reflect something?

      Fig 2C: Panels mounted differently for ear and tail skin (different order to present the individual stainings, Dapi for tail skin only).

      1. LC branch analysis (Fig 1 and 2)

      While Fig 1 indicates that ear skin LCs form in average twice as few branches as tail skin LCs (3-4 versus 8-9 branches per cell), Fig 2 shows the opposite (10-12 versus 6-7 branches per cell). Is this due to a very distinct pattern between the 2 considered ages (4 weeks versus 8-10 weeks)? Could the author double-check that there is no methodological bias in their analysis?

      1. Fig 3 E-G

      How many animals were examined (n=5)? Reproducible accros animals? Why was it done with 4-week animals (phenotype not complete? Event occurring before loss in numbers...)

      Staining Lamin A/C globally more intense in the Arpc4-KO epidermis (also seems to apply to the masks corresponding to the LCs). Surprising to see that the quantification indicates a major drop of Lamin A/C intensity in the LCs.

      1. Legend Fig 4D replace confocal microscopy by STED microscopy

      2. Figure 4F

      Intensity/background of γH2Ax staining very distinct between the 2 micrographs shown for WT and Arpc4-KO epidermis.

      1. Figure 7C, F, H

      Gating strategies: would be better to harmonize the style of the plots (dot plots and 2 types of contour plots have been used...)

      1. Figure 7H

      Legend of lower gating strategy seems to be wrong (KO and not WT).

      Significance

      Strengths: the general quality of the manuscript is high. It is very clearly written and it contains a very detailed method section that would allow reproducing the reported experiments. This work entails a clear novelty in that it represents the first investigation of the role of ArpC4 in LCs. It opens an interesting perspective about specific mechanisms sustaining the maintenance of myeloid cell subsets in peripheral tissues. This work is therefore expected to be of interest for a large audience of cellular immunologists and beyond. Challenging skin function with an external trigger would lift the relevance for a even wider audience (see main point 6).

      Limitations: in its current version the manuscript suffers from a lack of solidity around a few analysis (see main points on ArpC4 and Arp2/3 protein expression, nuclear envelop rupture analysis,...). It also tends to formulate a narrative centered on the ArpC4 intra-nuclear function that is not definitely proven.

      The field of expertise of this reviewer is: cellular immunology and actin remodeling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Revision Plan

      June 28, 2025

      Manuscript number: RC-2025-02982

      Corresponding author(s): Babita Madan, Nathan Harmston, David Virshup

      General Statements In Wnt signaling, the relative contributions of ‘canonical (β-catenin dependent) and non- canonical (β-catenin independent) signaling remains unclear. Here, we exploited a unique and highly robust in vivo system to study this. Our study is therefore the first comprehensive analysis of the β-catenin independent arm of the Wnt signaling pathway in a cancer model and illustrates how a combination of cis-regulatory elements can determine Wnt-dependent gene regulation.

      We are very pleased with the reviews; it appears we communicated our goal and our findings clearly, and in general the reviewers felt the study provided important information, was well planned and the results were “crystal clear”.

      While more experiments could strengthen and extend the results, we feel our results are already very robust due to the use of multiple replicates in the in vivo system.

      The Virshup lab in Singapore closed July 1, 2025 and so additional wet lab studies are not feasible.

      1. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Below we address the points raised by the reviewers:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The article has the merit of addressing a yet-unsolved question in the field (if beta-catenin can also repress genes) that only a limited number of studies has tried to tackle, and provides useful datasets for the community. The system employed is elegant, and the PORCN-inhibition bypassed by a ____constitutively active beta-catenin is clean and ingenious. The manuscript is clearly written.

      We thank the reviewers for their kind comments on the importance of the data. Our orthotopic model provides the opportunity to exploit robust Wnt regulated gene expression in a more responsive microenvironment than can be achieved in cell culture and simple flank xenograft models.

      Here we propose a series of thoughts and comments that, if addressed, would in our opinion improve the study and its description.

      1) We wonder why a xenograft model is necessary to induce a robust WNT response in these cells.

      The authors describe this set-up as a strength, as it is supposed to provide physiological relevance, yet it is not clear to us why this is the case.

      We welcome the opportunity to expand on our choice of an orthotopic xenograft model. It has been long established that cancer cells behave differently in different in vivo locations (Killion et al., 1998). Building on this, we confirmed this in our system that identical pancreatic cancer cells treated with the same PORCN inhibitor had very different responses in vitro, in the flank and in their orthotopic environment (Madan et al., 2018). To quote from our prior paper, “Looking only at genes decreasing more than 1.5-fold at 56 hours, we would have missed 817/1867 (44%) genes using a subcutaneous or 939/1867 (50%) using an in vitro model. Thus, the overall response to Wnt inhibition was reduced in the subcutaneous model and further blunted in vitro. An orthotopic model more accurately represents real biology.

      The reason for this is presumably the very different orthotopic microenvironment, including tissue appropriate stroma-tumor, vascular-tumor, lymphatic-tumor, and humoral interactions.

      Moreover, as the authors homogenize the tumour to perform bulk RNA-seq, we wonder whether they are not only sequencing mRNA from the cancer cells but also from infiltrating immune cells and/or from the surrounding connective tissue.

      In experiments generating RNA-seq data from xenograft models, the resulting sequences can originate from either human (graft) or mouse (host). In order to account for this, following standard practice, we filtered reads prior to alignment using Xenome (Conway et al., 2012). We have added additional text to the methods to highlight this step in our pipeline.

      2) If, as the established view implies, Wnt/beta-catenin only leads to gene activation, pathway

      inhibition would free up the transcriptional machinery - there is evidence that some of its constituents are rate-limiting. The free machinery could now activate some other genes: the net effect observed would be their increased transcription upon Wnt inhibition, irrespective of beta-catenin's presence. Could this be considered as an alternative explanation for the genes that go up in both control and bcat4A lines upon ETC-159 administration? This, we think, is in part corroborated by the absence of enrichment of biological pathways in this group of genes. The genes that are beta-catenin-dependent and downregulated (D&R) are obviously not affected by this alternative explanation.

      This is an interesting suggestion, and we will incorporate this thought into our discussion of potential mechanisms.

      3) The authors mention that HPAF-II are Wnt addicted. Do they die upon ETC-159 administration, and is this effect rescued by exogenous WNT addition?

      We and several others have previously reported that Wnt-addicted cells differentiate and/or senesce upon Wnt withdrawal in vivo but not in vitro. This is related to the broader changes in gene expression in the orthotopic tumors. The effect of PORCN inhibition has been demonstrated by us and others and is rescued by Wnt addition, downstream activation of Wnt signaling by e.g. APC mutation, and, as we show here, stabilized β-catenin.

      4) Line 120: the authors write about Figure 1C: "This demonstrates that the growth of β-cat4A cells in vitro largely requires Wnts to activate β-catenin signaling." The opposite is true: control cells require WNT and form less colony with ETC159, while β-cat4A are independent from Wnt secretion.

      We appreciate the reviewer pointing out our mis-statement. This error has now been corrected in the revised manuscript.

      5) Lines 226-229: "The β-catenin independent repressed genes were notably enriched for motifs bound by homeobox factors including GSC2, POU6F2, and MSGN1. This finding aligns with the known role of non-canonical Wnt signaling in embryonic development" This statement assumes that target genes, or at least the beta-catenin independent ones, are conserved across tissues, including developing organs. This contrasts with the view that target genes in addition to the usual suspects (e.g., AXIN2, SP5 etc.) are modulated tissue-specifically - a view that the authors (and in fact, these reviewers) appear to support in their introduction.

      We agree with the reviewer that a majority of Wnt-regulated genes are tissue specific. Indeed, the β-catenin independent Wnt-repressed genes may also be tissue specific. In other tissues, we speculate that other β-catenin independent Wnt-repressed genes may also have homeobox factor binding sites as well and so the general concept remains valid. We do not have sufficient data in other tissues to resolve this issue.

      7) The luciferase and mutagenesis work presented in Figure 5 are crystal-clear. One important aspect that remains to be clarified is whether beta-catenin and/or TCF7L2 directly bind to the NRE sites. Or do the authors hypothesize that another factor binds here? We suggest the authors to show TCF7L2 binding tracks at the NRE/WRE motifs in the main figures.

      A major question of the reviewers was, can we provide additional evidence that the NRE is bound by LEF/TCF family members. Our initial analysis of more datasets indicates TCF7L2 peaks are enriched on NREs in Wnt-β-catenin responsive cell lines like HCT116 and PANC1. These analyses appear to further support the model that the NRE binds TCF7L2, but we fully agree these analyses can neither prove nor disprove the model.

      In our revision, we will analyze additional cut and run datasets as suggested and look at the HEPG2 datasets suggested by reviewer 1. We are concerned about tissue specificity as some of the genes are not expressed in e.g. HEPG2 or HEK293 cells where datasets are available. However, our data continues to support a functional role for the NRE in the modulation of β-catenin regulated genes. The best analysis would be more ChIP-Seq or Cut and Run assays on tissues, not cells, but these studies are beyond what we can do.

      What about other TCF/LEFs and beta-catenin? Are there relevant datasets that could be explored to test whether all these bind here during Wnt activation?

      As above, We will analyze additional ChIP and Cut & Run datasets to address this question looking at β-catenin and other LEF/TCF family members. We also reflect on the fact that ChIP-Seq does not necessarily imply that the targeted factor (e.g.,TCF7L2) is bound in the target site in all the cells.

      The repression might be mediated by beta-catenin partnering with other factors that bind the NRE even by competing with TCF7L2.

      We appreciate the insightful comments and now incorporate this into our discussion.

      8) In general, while we greatly appreciate the github page to replicate the analysis, we feel that the methods' description is lacking, both concerning analytical details (e.g., the cutoff used for MACS2 peak calling) or basic experimental planning (e.g, how the luciferase assays were performed).

      We thank reviewers for the suggestions and will add further details regarding the analysis

      and experimental planning in the method sections.

      9) The paper might benefit from the addition of quality metrics on the RNA-seq. Interesting for example would be to see a PCA analysis - as a more unbiased approach - rather than the kmeans clustering.

      We have this data and will add it to the revised manuscript.

      10) It seems that in Figure 3A the clusters are mislabelled as compared to Figure 3B and Figure 1. Here the repressor clusters are labelled DR5, DR6 and DN7 whereas in the rest of the paper they are labelled DR1, DR2 and DN1.

      Thank you for pointing out this issue. This has now been corrected in Figure 3.

      11) The siCTNNB1 in Figure 5E is described to be a significant effect in the text whereas in Figure 5E this has a p value of 0.075.

      Thank you for pointing out the p value did not cross the 0.05 threshold. We have modified the text to remove the word ‘significant’.

      12) Line 396: 'Here we confirm and extend the identification of a TCF-dependent negative regulatory element (NRE), where beta-catenin interacts with TCF to repress gene expression'. We suggest caution in stating that beta-catenin and TCF directly repress gene expression by binding to NRE. In the current state the authors do not show that TCF & beta-catenin bind to these elements. See our previous point 7.

      We appreciate the suggestion of the reviewers. We will be more cautious in our interpretation.

      Further suggestions - or food for thoughts:

      13) A frequently asked question in the field concerns the off-target effects of CHIR treatment as opposed to exposure to WNT ligands. CHIR treatment - in parallel to bcat4A overexpression - would allow the authors to delineate WNT independent effects of CHIR treatment and settle this debate.

      We thank the reviewers for suggesting this interesting experiment to sort out the non- Wnt effects of GSK3 inhibition. Such a study would require a new set of animal experiments and a different analysis; we think this is beyond the scope of this manuscript.

      14) We think that Figure 4C could be strengthened by adding more public TCF-related datasets (e.g., from ENCODE) to confirm the observation across datasets from different laboratories. In particular, the HEPG2 could possibly be improved as there is an excellent TCF7L2 dataset available by ENCODE.

      Many more datasets are easily searchable through: https://www.factorbook.org/.

      As above, we will analyze the HEPG2 dataset. We plan on updating Fig 4 with data from analysis from different datasets such as (Blauwkamp et al., 2008; Zambanini et al., 2022).

      15) The authors show that there is no specific spacing between NREs and WREs. This implies that it is not likely that TCF7L2 recognizes both at the same time through the C-clamp. Do the authors think that there might be a pattern discernible when comparing the location of WRE and NRE in relation to the TCF7L2 ChIP-seq peak summit? This would allow inferring whether TCF7L2 more likely directly binds the WRE (presumably) and if the NRE is bound by a cofactor.

      This is an interesting suggestion and we will conduct this analysis as suggested on available datasets (as the result may be different in different tissue types with varying degrees of Wnt/β-catenin signaling).

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Overall, the study provides a solid framework for understanding noncanonical transcriptional ____outputs of Wnt signaling in a cancer context. The majority of the conclusions are well supported by the data. However, there are a few substantive points that require clarification before the manuscript is ready for publication.

      Major Comments

      The authors' central claim-that their findings represent a comprehensive analysis of the β-catenin- independent arm of Wnt signaling and uncover a "cis-regulatory grammar" governing Wnt-dependent gene activation versus repression-is overstated based on the presented data.

      We appreciate the reviewers concern and will temper our language.

      Specifically:

      • Figure 3B identifies TF-binding motifs enriched among different Wnt-responsive gene clusters, but the authors only functionally investigate the role of NRE in β-catenin-dependent repression, particularly in the context of TCF motif interaction.

      • To support a broader claim regarding cis-regulatory grammar, additional analyses are required:

      o What is the distribution of NREs across all clusters? Are they exclusive to β-catenin-dependent repressed clusters, or more broadly present?

      The distribution of the NREs is a statistically significant enrichment; they are observed in the repressed clusters more frequently than expected by chance alone, but they are present elsewhere as well. We have tempered our language around the cis-regulatory grammar.

      o Do NREs interact with other enriched motifs beyond TCF? Is this interaction specific to repression or also involved in activation?

      This is an interesting question beyond the scope of this analysis. Our dataset uses multiple interventions; The NREs may interact with other motifs but we would need more transcriptional analysis data with biological intervention to assess this.

      o A more comprehensive analysis of cis-element combinations is needed to draw conclusions about their collective influence on gene regulation across clusters.

      We agree; This would be a great question if we had TCF binding data in our orthotopic xenograft model. It’s a dataset we do not have, nor do we have the resources to pursue this.

      Other important clarifications:

      • The use of the term "wild-type" to describe HPAF-II cells is potentially misleading. These cells are not genetically wild-type and harbor multiple oncogenic alterations.

      Thank you for pointing this out. We will use the word “parental” in the text

      • The manuscript does not clearly present the kinetics of Wnt target downregulation upon ETC-159 treatment of HPAF-II cells. Understanding whether repression mirrors activation dynamics (e.g., delay or persistence of Wnt effects) is essential to interpreting the system's temporal behavior.

      We previously addressed the temporal dynamics of activation and repression in our more comprehensive time course papers (Harmston et al., 2020; Madan et al., 2018); there are differences in the dynamics that are difficult to tease out in this new dataset as the density of time points is less. Having said that, we will compare the time course and annotate the sets of genes identified in this current study with the data from our original study to provide more information on the temporal dynamics of this system.

      Minor Comment

      • The statement in Figure 1C (lines 119-120) that "growth of β-cat4A cells in vitro largely requires Wnts to activate β-catenin signaling" is inconsistent with the data. As the β-cat4A allele encodes a constitutively active form of β-catenin, Wnts should not be required. Please revise this conclusion for clarity.

      We thank the reviewers for pointing out this mis-statement. We have corrected this.

      Reviewer #2 (Significance (Required)):

      This study offers a systematic classification of Wnt-responsive gene expression dynamics, differentiating between β-catenin-dependent and -independent mechanisms. The insights into temporal expression patterns and the potential role of the NRE element in transcriptional repression add depth to our understanding of Wnt signaling. These findings have relevance for developmental biology, stem cell biology, and cancer research-particularly in understanding how Wnt-mediated repression may influence tumor progression and therapeutic response.

      Nice review; thank you.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      … The work advances understanding of Wnt mediated repression via cis regulatory grammar.

      Major Concerns

      1) Statistical thresholds and clustering - The criteria for classifying β catenin-dependent versus - independent genes rely on FDR cutoffs above or below 0.1. If the more stringent cutoff of 0.05 was used, how many genes would still be considered Wnt regulated?

      We can readily address this in a revised manuscript.

      2) Validation of selected β catenin-dependent and -independent Wnt target genes - While the authors identify β catenin-dependent and -independent Wnt target genes (4 selected genes from different clusters in Fig.2), RT-qPCR based validation of Axin2 has been performed in Fig. S3. Authors should also validate other 3 genes as well.

      We had considered performing qPCR to re-validate some of our gene-expression changes but qPCR analyses is intrinsically more error prone than RNAseq, and we believe the literature shows that qPCR from the same samples will not add any extra utility. Previous studies that have examined this question have reported excellent correlation between the RNAseq and pPCR (Asmann et al., 2009; Griffith et al., 2010; Wu et al., 2014).

      3) NRE mechanistic insight - The most important contribution of this manuscript is the extension of the importance of the NRE motif in Wnt regulated enhancers. But the mutagenesis data provided is insufficient to conclusively nail down that the NREs are responsible for the repression. The effects in the synthetic reporters in Fig. 4D are small - it's not clear that there is much activity in the MimRep to be repressed by the NREs. The data in Fig. 5 is a better context to test the importance of the NREs, but the authors use deletion analysis which is too imprecise and settle for single nucleotide mutants in individual NREs in the ABHD11-AS1 reporter. In the Axin2 report, they mutate sequences outside of the NRE. It's too inconsistent. They should mutate 3 or 4 positions within the NRE in BOTH motifs in the context of the ABHD11-AS1 reporter. Same for the Axin2 reporter.

      We feel our analysis, coupled with the Kim paper (Kim et al., 2017), support the role of the NRE. We agree that more data is always desirable, but in our current circumstances are we cannot add additional wetlab experiments.

      Regarding Figure 4D, this is a synthetic system lacking the endogenous elements in the promoter. We agree with the reviewer that the effect is small but we would also like to point out that adding the well-established 2WRE in front of the MinRep increased the transcription activity to 1.5 fold, which is of similar magnitude change of the 2NRE deceasing the transcriptional activity 1/1.5 = 0.6.

      In Kim et al, it was shown that mutating the 11st nucleotide of the NRE motif showed the strongest effect, so we followed their lead in only mutated the 11st nucleotide in ABHD11- AS1 NRE.

      As for the putative NRE sequence present in AXIN2 promoter, its downstream sequence is polyT (__GTGTTTTTTTT__TTTTTTTTTT), if we only mutate 11st nucleotide to G/C, we could create similar sequence to NRE, so we mutated sequences outside of the NRE to fully disrupt it.

      4) Even if the mutagenesis is done more completely, the results simply replicate that of the Goentoro group. In Kim et al 2017, they provide suggestive (not convincing) evidence that TCFs directly bind to the NRE. The authors of this manuscript should explore that in more detail, e.g., can purified TCF bind to the NRE sequence? Can the authors design experiments to directly test whether beta-catenin is acting through the NRE - their data currently only demonstrates that the NRE provide a negative input to the reporters - that's an important mechanistic difference.

      We point out that our minimal reporter studies with the NRE showed a repressive effect in HCT116 (colorectal cancer cells with stabilized β-catenin) but not HT1080 (sarcoma cells with low Wnt) supporting the importance of β-catenin acting through the NRE (Figs. 4D, 4E).

      We fully agree with the reviewers that additional study of TCF interaction with the NRE would be of value. While EMSA and culture-based ChIP assays would be of some value, the best study should be done in vivo where the system is most robust. We are not in a position to do these studies, but we will add in a discussion of this as a limitation of the current study.

      5) In vertebrates, some TCFs are more repressive than others and TLEs have been implicated in repressive. Exploring these factors in the context of the NRE would increase the value of this story.

      This is an interesting idea but beyond the scope of the current manuscript. It is likely this would be dependent on tissue specific expression, local expression levels, and local binding of co-factors. As we look at other TCF members in other datasets we may be able to address this. Further wetlab experiments are beyond the scope of this work.

      **Referees cross-commenting**

      I respectfully disagree that the luciferase assays are sufficient. Using deletion analysis to understand the function of specific binding sites is insufficient and the more specific mutations of NREs are incomplete. Regarding this paper extending our knowledge of direct transcriptional repression by Wnt/bcat signaling, I don't agree that it adds much - there are numerous datasets where Wnt signaling activates and represses genes - the trick is determining whether any of the repressed genes are the result and direct regulation by TCF/bcat. They don't explore that. The main finding is an extension of the work by Lea Goentoro on the importance of the NRE motif, but they don't address whether TCF directly associates with this sequence. Goentoro argued in the 2017 paper that it does, but that data is unconvincing to me. Can purified TCF bind the NRE? Without that information (done carefully) this manuscript is very limited.

      We respectfully disagree with the reviewer regarding the contribution of this manuscript. There are certainly many datasets looking at Wnt-regulated genes in tissue culture, but these cell-based studies are underpowered to really understand Wnt biology. There are only two papers, ours and Cantú’s, that address Wnt repressed genes in any depth. No prior papers have differentiated β-catenin dependent from β-catenin independent genes before, and certainly not in an orthotopic animal model.

      A major impact of our study is the finding that only 10% of Wnt regulated genes are independent of β-catenin, at least in pancreatic cancer. We feel this is a major contribution. We further add to this analysis by re-enforcing/extend the prior evidence on the NRE in humans (and correct the motif sequence!) for Wnt-repressed genes. Our data supports the fine-tuning of the Wnt/β-catenin regulated genes by a cis-regulatory grammar.

      Reviewer #3 (Significance (Required)):

      Overall, this study advances our understanding of the dual roles of Wnt signaling in gene activation and repression, highlighting the role of the NRE motif. But this is an extension of the original NRE paper (Kim et al 2017) with no mechanistic advance beyond that original work. The transcriptomics in the first part of the manuscript have some value, but similar data sets already exist.

      We respectfully but strongly disagree with the reviewer. First, our work examines the NRE in a large-scale in vivo transcriptome dataset, significantly extending the candidate gene approach of Kim et al. Secondly, we disagree with the comment that “similar data sets already exist.” Indeed, reviewer 1 (C. Cantú) specifically pointed out we had addressed an “yet-unsolved question in the field” on whether and how β-catenin repressed genes.

      __3. __Description of the revisions that have already been incorporated in the transferred manuscript

      To date we have only corrected several typographical errors.

      1. Description of analyses that authors prefer not to carry out

      We fully agree with the reviewers that additional study of TCF interaction with the NRE would be of value. While EMSA and cell culture-based ChIP assays would be of some modest value, they have already been done in vitro by Kim et al. (Kim et al., 2017) and the best next study should be done in vivo in Wnt-responsive cancers or tissues where the biology is most robust (Madan et al., 2018) . We are not in a position to do these studies, but we will add this into the discussion as a limitation of the current study. We also acknowledge that the NRE may interact with other currently unidentified factors.

      Reviewer 1 asked about considering experiments to determine non-Wnt effects of GSK3 inhibitors like CHIR. Such a study, while interesting, would require a new set of animal experiments and a different analysis; we think this is beyond the scope of this manuscript.

      Finally, we note that the Virshup lab at Duke-NUS Medical School in Singapore, where these in vivo studies were performed, has closed as of July 1, 2025 and the various lab members have moved on to new adventures. Because of this, we are unable to undertake new wet-lab studies.

      Thank you for your consideration,

      For the authors,

      David Virshup

      References:

      Asmann YW, Klee EW, Thompson EA, Perez EA, Middha S, Oberg AL, Therneau TM, Smith DI,

      Poland GA, Wieben ED, Kocher J-PA. 2009. 3’ tag digital gene expression profiling of human

      brain and universal reference RNA using Illumina Genome Analyzer. BMC Genom 10:531–

      1. doi:10.1186/1471-2164-10-531

      Blauwkamp TA, Chang MV, Cadigan KM. 2008. Novel TCF-binding sites specify transcriptional

      repression by Wnt signalling. The EMBO Journal 27:1436–1446. doi:10.1038/emboj.2008.80

      Conway T, Wazny J, Bromage A, Tymms M, Sooraj D, Williams ED, Beresford-Smith B. 2012.

      Xenome—a tool for classifying reads from xenograft samples. Bioinformatics 28:i172–i178.

      doi:10.1093/bioinformatics/bts236

      Griffith M, Griffith OL, Mwenifumbo J, Goya R, Morrissy AS, Morin RD, Corbett R, Tang MJ, Hou

      Y-C, Pugh TJ, Robertson G, Chittaranjan S, Ally A, Asano JK, Chan SY, Li HI, McDonald H,

      Teague K, Zhao Y, Zeng T, Delaney A, Hirst M, Morin GB, Jones SJM, Tai IT, Marra MA.

      1. Alternative expression analysis by RNA sequencing. Nat Methods 7:843–847.

      doi:10.1038/nmeth.1503

      Harmston N, Lim JYS, Arqués O, Palmer HG, Petretto E, Virshup DM, Madan B. 2020.

      Widespread Repression of Gene Expression in Cancer by a Wnt/β-Catenin/MAPK Pathway.

      Cancer Res 81:464–475. doi:10.1158/0008-5472.can-20-2129

      Killion JJ, Radinsky R, Fidler IJ. 1998. Orthotopic models are necessary to predict therapy of

      transplantable tumors in mice. Cancer metastasis reviews 17:279–284.

      Kim K, Cho J, Hilzinger TS, Nunns H, Liu A, Ryba BE, Goentoro L. 2017. Two-Element

      Transcriptional Regulation in the Canonical Wnt Pathway. Current Biology 27:2357-2364.e5.

      doi:10.1016/j.cub.2017.06.037

      Madan B, Harmston N, Nallan G, Montoya A, Faull P, Petretto E, Virshup DM. 2018. Temporal

      dynamics of Wnt-dependent transcriptome reveals an oncogenic Wnt/MYC/ribosome axis. J

      Clin Invest 128:5620–5633. doi:10.1172/jci122383

      Wu AR, Neff NF, Kalisky T, Dalerba P, Treutlein B, Rothenberg ME, Mburu FM, Mantalas GL,

      Sim S, Clarke MF, Quake SR. 2014. Quantitative assessment of single-cell RNA-sequencing

      methods. Nat Methods 11:41–46. doi:10.1038/nmeth.2694

      Zambanini G, Nordin A, Jonasson M, Pagella P, Cantù C. 2022. A new cut&run low volume-

      urea (LoV-U) protocol optimized for transcriptional co-factors uncovers Wnt/b-catenin tissue-

      specific genomic targets. Development 149. doi:10.1242/dev.201124

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors use a PORCN inhibitor (ETC 159) in an orthotopic RNF43 mutant pancreatic cancer model to distinguish β catenin-dependent from -independent Wnt target genes. They find that ~90% of Wnt regulated genes in this system are β catenin dependent. Approximately half of these genes are activated by Wnt signaling, half repressed. Clustering and functional enrichment link dependent versus independent targets to distinct pathways. They observe enrichment of sequence motifs similar to the 11 bp Negative Regulatory Element (NRE) previously identified by Lea Goentoro's group in the region around the TSS of β catenin-repressed genes. Using reporter constructs, both synthetic and regulatory DNA from Wnt targets (e.g., ABHD11 AS1, AXIN2), they provide evidence that the NREs are a negative input on expression. The work advances understanding of Wnt mediated repression via cis regulatory grammar.

      Major Concerns

      1. Statistical thresholds and clustering - The criteria for classifying β catenin-dependent versus -independent genes rely on FDR cutoffs above or below 0.1. If the more stringent cutoff of 0.05 was used, how many genes would still be considered Wnt regulated?
      2. Validation of selected β catenin-dependent and -independent Wnt target genes - While the authors identify β catenin-dependent and -independent Wnt target genes (4 selected genes from different clusters in Fig.2), RT-qPCR based validation of Axin2 has been performed in Fig. S3. Authors should also validate other 3 genes as well.
      3. NRE mechanistic insight - The most important contribution of this manuscript is the extension of the importance of the NRE motif in Wnt regulated enhancers. But the mutagenesis data provided is insufficient to conclusively nail down that the NREs are responsible for the repression. The effects in the synthetic reporters in Fig. 4D are small - it's not clear that there is much activity in the MimRep to be repressed by the NREs. The data in Fig. 5 is a better context to test the importance of the NREs, but the authors use deletion analysis which is too imprecise and settle for single nucleotide mutants in individual NREs in the ABHD11-AS1 reporter. In the Axin2 report, they mutate sequences outside of the NRE. It's too inconsistent. They should mutate 3 or 4 positions within the NRE in BOTH motifs in the context of the ABHD11-AS1 reporter. Same for the Axin2 reporter.
      4. Even if the mutagenesis is done more completely, the results simply replicate that of the Goentoro group. In Kim et al 2017, they provide suggestive (not convincing) evidence that TCFs directly bind to the NRE. The authors of this manuscript should explore that in more detail, e.g., can purified TCF bind to the NRE sequence? Can the authors design experiments to directly test whether beta-catenin is acting through the NRE - their data currently only demonstrates that the NRE provide a negative input to the reporters - that's an important mechanistic difference.
      5. In vertebrates, some TCFs are more repressive than others and TLEs have been implicated in repressive. Exploring these factors in the context of the NRE would increase the value of this story.

      Referees cross-commenting

      I respectfully disagree that the luciferase assays are sufficient. Using deletion analysis to understand the function of specific binding sites is insufficient and the more specfic mutations of NREs are incomplete. Regarding this paper extending our knowledge of direct transcriptional repression by Wnt/bcat signaling, I don't agree that it adds much - there are numerous datasets where Wnt signaling activates and represses genes - the trick is determining whether any of the repressed genes are the result and direct regulation by TCF/bcat. They don't explore that. The main finding is an extension of the work by Lea Goentoro on the importance of the NRE motif, but they don't address whether TCF directly associates with this sequence. Goentoro argued in the 2017 paper that it does, but that data is unconvincing to me. Can purified TCF bind the NRE? Without that information (done carefully) this manuscript is very limited.

      Significance

      Overall, this study advances our understanding of the dual roles of Wnt signaling in gene activation and repression, highlighting the role of the NRE motif. But this is an extension of the original NRE paper (Kim et al 2017) with no mechanistic advance beyond that original work. The transcriptomics in the first part of the manuscript have some value, but similar data sets already exist.

      Addressing the major points-especially gaining a deeper mechanistic insight into NRE function-would elevate the manuscript's impact. Major revisions are recommended.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript by Liu et al. explores lesser-known aspects of Wnt signaling, particularly focusing on genes that are repressed by β-catenin and those regulated independently of β-catenin. While canonical Wnt signaling is well characterized through the stabilization and nuclear translocation of β-catenin to activate TCF/LEF target genes, the mechanisms underlying gene repression and β-catenin-independent regulation remain relatively underexplored.

      The authors leverage the Wnt-addicted HPAF-II cancer cell line, combining PORCN inhibitor (ETC-159) treatment with ectopic expression of a stabilized β-catenin mutant (β-cat4A) in orthotopic xenograft models. Through RNA-sequencing analysis, they systematically identify Wnt-responsive gene clusters that are either dependent or independent of β-catenin stabilization. They further demonstrate that a specific cis-regulatory element, termed the Negative Regulatory Element (NRE), contributes to β-catenin-mediated transcriptional repression.

      Overall, the study provides a solid framework for understanding noncanonical transcriptional outputs of Wnt signaling in a cancer context. The majority of the conclusions are well supported by the data. However, there are a few substantive points that require clarification before the manuscript is ready for publication.

      Major Comments

      The authors' central claim-that their findings represent a comprehensive analysis of the β-catenin-independent arm of Wnt signaling and uncover a "cis-regulatory grammar" governing Wnt-dependent gene activation versus repression-is overstated based on the presented data.

      Specifically:

      • Figure 3B identifies TF-binding motifs enriched among different Wnt-responsive gene clusters, but the authors only functionally investigate the role of NRE in β-catenin-dependent repression, particularly in the context of TCF motif interaction.
      • To support a broader claim regarding cis-regulatory grammar, additional analyses are required:
        • What is the distribution of NREs across all clusters? Are they exclusive to β-catenin-dependent repressed clusters, or more broadly present?
        • Do NREs interact with other enriched motifs beyond TCF? Is this interaction specific to repression or also involved in activation?
        • A more comprehensive analysis of cis-element combinations is needed to draw conclusions about their collective influence on gene regulation across clusters.

      Other important clarifications:

      • The use of the term "wild-type" to describe HPAF-II cells is potentially misleading. These cells are not genetically wild-type and harbor multiple oncogenic alterations.
      • The manuscript does not clearly present the kinetics of Wnt target downregulation upon ETC-159 treatment of HPAF-II cells. Understanding whether repression mirrors activation dynamics (e.g., delay or persistence of Wnt effects) is essential to interpreting the system's temporal behavior.

      Minor Comment

      • The statement in Figure 1C (lines 119-120) that "growth of β-cat4A cells in vitro largely requires Wnts to activate β-catenin signaling" is inconsistent with the data. As the β-cat4A allele encodes a constitutively active form of β-catenin, Wnts should not be required. Please revise this conclusion for clarity.

      Significance

      This study offers a systematic classification of Wnt-responsive gene expression dynamics, differentiating between β-catenin-dependent and -independent mechanisms. The insights into temporal expression patterns and the potential role of the NRE element in transcriptional repression add depth to our understanding of Wnt signaling. These findings have relevance for developmental biology, stem cell biology, and cancer research-particularly in understanding how Wnt-mediated repression may influence tumor progression and therapeutic response.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Shiyang Liu and colleagues investigate the transcription induced by Wnt/beta-catenin by employing PORCN inhibition (ECT-159, blocking the secretion of WNTs) in the Wnt-addicted HPAF-II cell line. Classical targets, such as AXIN2, are downregulated by PORCN inhibition (as expected), while many other genes are upregulated, suggesting that Wnt/beta-catenin represses them. Overexpression of a GSK3/CK1-insensitive beta-catenin variant leads to the re-established upregulation of AXIN2 and the concomitant repression of the other group of repressed genes, demonstrating that the repression is mediated by beta-catenin. Other genes are repressed (activated by ECT-159) irrespective of the presence of activated beta-catenin, and the authors conclude that they are beta-catenin-independent Wnt-repressed genes. The authors observe that beta-catenin-dependent repressed genes present enrichment, in their promoters, of the Negative Regulatory Element (NRE) previously identified by the Goentoro lab. In elegant Luciferase assays, the authors now confirm that individual NRE elements are causally involved in target gene repression by -catenin. The article has the merit of addressing a yet-unsolved question in the field (if beta-catenin can also repress genes) that only a limited number of studies has tried to tackle, and provides useful datasets for the community. The system employed is elegant, and the PORCN-inhibition bypassed by a constitutively active beta-catenin is clean and ingenious. The manuscript is clearly written.

      Here we propose a series of thoughts and comments that, if addressed, would in our opinion improve the study and its description.

      1. We wonder why a xenograft model is necessary to induce a robust WNT response in these cells. The authors describe this set-up as a strength, as it is supposed to provide physiological relevance, yet it is not clear to us why this is the case. Moreover, as the authors homogenize the tumour to perform bulk RNA-seq, we wonder whether they are not only sequencing mRNA from the cancer cells but also from infiltrating immune cells and/or from the surrounding connective tissue.
      2. If, as the established view implies, Wnt/beta-catenin only leads to gene activation, pathway inhibition would free up the transcriptional machinery - there is evidence that some of its constituents are rate-limiting. The free machinery could now activate some other genes: the net effect observed would be their increased transcription upon Wnt inhibition, irrespective of beta-catenin's presence. Could this be considered as an alternative explanation for the genes that go up in both control and cat4A lines upon ETC-159 administration? This, we think, is in part corroborated by the absence of enrichment of biological pathways in this group of genes. The genes that are beta-catenin-dependent and downregulated (D&R) are obviously not affected by this alternative explanation.
      3. The authors mention that HPAF-II are Wnt addicted. Do they die upon ETC-159 administration, and is this effect rescued by exogenous WNT addition?
      4. Line 120: the authors write about Figure 1C: "This demonstrates that the growth of β-cat4A cells in vitro largely requires Wnts to activate β-catenin signaling." The opposite is true: control cells require WNT and form less colony with ETC159, while β-cat4A are independent from Wnt secretion.
      5. Lines 226-229: "The β-catenin independent repressed genes were notably enriched for motifs bound by homeobox factors including GSC2, POU6F2, and MSGN1. This finding aligns with the known role of non-canonical Wnt signaling in embryonic development." This statement assumes that target genes, or at least the beta-catenin independent ones, are conserved across tissues, including developing organs. This contrasts with the view that target genes in addition to the usual suspects (e.g., AXIN2, SP5 etc.) are modulated tissue-specifically - a view that the authors (and in fact, these reviewers) appear to support in their introduction.
      6. The luciferase and mutagenesis work presented in Figure 5 are crystal-clear. One important aspect that remains to be clarified is whether beta-catenin and/or TCF7L2 directly bind to the NRE sites. Or do the authors hypothesize that another factor binds here? We suggest the authors to show TCF7L2 binding tracks at the NRE/WRE motifs in the main figures. What about other TCF/LEFs and beta-catenin? Are there relevant datasets that could be explored to test whether all these bind here during Wnt activation? We also reflect on the fact that ChIP-Seq does not necessarily imply that the targeted factor (e.g., TCF7L2) is bound in the target site in all the cells. The repression might be mediated by beta-catenin partnering with other factors that bind the NRE even by competing with TCF7L2.
      7. In general, while we greatly appreciate the github page to replicate the analysis, we feel that the methods' description is lacking, both concerning analytical details (e.g., the cutoff used for MACS2 peak calling) or basic experimental planning (e.g, how the luciferase assays were performed).
      8. The paper might benefit from the addition of quality metrics on the RNA-seq. Interesting for example would be to see a PCA analysis - as a more unbiased approach - rather than the kmeans clustering.
      9. It seems that in Figure 3A the clusters are mislabelled as compared to Figure 3B and Figure 1. Here the repressor clusters are labelled DR5, DR6 and DN7 whereas in the rest of the paper they are labelled DR1, DR2 and DN1.
      10. The siCTNNB1 in Figure 5E is described to be a significant effect in the text whereas in Figure 5E this has a p value of 0.075.
      11. Line 396: 'Here we confirm and extend the identification of a TCF-dependent negative regulatory element (NRE), where beta-catenin interacts with TCF to repress gene expression'. We suggest caution in stating that beta-catenin and TCF directly repress gene expression by binding to NRE. In the current state the authors do not show that TCF & beta-catenin bind to these elements. See our previous point 7.

      Further suggestions - or food for thoughts: 13. A frequently asked question in the field concerns the off-target effects of CHIR treatment as opposed to exposure to WNT ligands. CHIR treatment - in parallel to bcat4A overexpression - would allow the authors to delineate WNT independent effects of CHIR treatment and settle this debate. 14. We think that Figure 4C could be strengthened by adding more public TCF-related datasets (e.g., from ENCODE) to confirm the observation across datasets from different laboratories. In particular, the HEPG2 could possibly be improved as there is an excellent TCF7L2 dataset available by ENCODE. Many more datasets are easily searchable through: https://www.factorbook.org/. 15. The authors show that there is no specific spacing between NREs and WREs. This implies that it is not likely that TCF7L2 recognizes both at the same time through the C-clamp. Do the authors think that there might be a pattern discernible when comparing the location of WRE and NRE in relation to the TCF7L2 ChIP-seq peak summit? This would allow inferring whether TCF7L2 more likely directly binds the WRE (presumably) and if the NRE is bound by a cofactor.

      Review by Claudio Cantù and Yorick van de Grift

      Why we sign: we believe that peer review should be a transparent dialogue. We strive to be critical but honest and professional, and care that our opinions and criticisms are formulated as if we were meeting the authors in person.

      Our expertise lies in the genomics impact of Wnt/beta-catenin activation, and in the search of mechanisms that drive the tissue-specific functions of this pathway across developmental and disease contexts.

      Significance

      Shiyang Liu and colleagues investigate the transcription induced by Wnt/beta-catenin by employing PORCN inhibition (ECT-159, blocking the secretion of WNTs) in the Wnt-addicted HPAF-II cell line. Classical targets, such as AXIN2, are downregulated by PORCN inhibition (as expected), while many other genes are upregulated, suggesting that Wnt/beta-catenin represses them. Overexpression of a GSK3/CK1-insensitive beta-catenin variant leads to the re-established upregulation of AXIN2 and the concomitant repression of the other group of repressed genes, demonstrating that the repression is mediated by beta-catenin. Other genes are repressed (activated by ECT-159) irrespective of the presence of activated beta-catenin, and the authors conclude that they are beta-catenin-independent Wnt-repressed genes. The authors observe that beta-catenin-dependent repressed genes present enrichment, in their promoters, of the Negative Regulatory Element (NRE) previously identified by the Goentoro lab. In elegant Luciferase assays, the authors now confirm that individual NRE elements are causally involved in target gene repression by -catenin. The article has the merit of addressing a yet-unsolved question in the field (if beta-catenin can also repress genes) that only a limited number of studies has tried to tackle, and provides useful datasets for the community. The system employed is elegant, and the PORCN-inhibition bypassed by a constitutively active beta-catenin is clean and ingenious. The manuscript is clearly written.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      PAPS is required for all sulfotransferase reactions in which a sulfate group is covalently attached to amino acid residues of proteins or to side chains of proteoglycans. This sulfation is crucial for properly organizing the apical extracellular matrix (aECM) and expanding the lumen in the Drosophila salivary gland. Loss of Papss potentially leads to decreased sulfation, disorganizing the aECM, and defects in lumen formation. In addition, Papss loss destabilizes the Golgi structures.

      In Papss mutants, several changes occur in the salivary gland lumen of Drosophila. The tube lumen is very thin and shows irregular apical protrusions. There is a disorganization of the apical membrane and a compaction of the apical extracellular matrix (aECM). The Golgi structures and intracellular transport are disturbed. In addition, the ZP domain proteins Piopio (Pio) and Dumpy (Dpy) lose their normal distribution in the lumen, which leads to condensation and dissociation of the Dpy-positive aECM structure from the apical membrane. This results in a thin and irregularly dilated lumen.

      1. The authors describe various changes in the lumen in mutants, from thin lumen to irregular expansion. I would like to know the correct lumen diameter, and length, besides the total area, by which one can recognize thin and irregular.

      We have included quantification of the length and diameter of the salivary gland lumen in the stage 16 salivary glands of control, Papss mutant, and salivary gland-specific rescue embryos (Figure 1J, K). As described, Papss mutant embryos have two distinct phenotypes, one group with a thin lumen along the entire lumen and the other group with irregular lumen shapes. Therefore, we separated the two groups for quantification of lumen diameter. Additionally, we have analyzed the degree of variability for the lumen diameter to better capture the range of phenotypes observed (Figure 1K'). These quantifications enable a more precise assessment of lumen morphology, allowing readers to distinguish between thin and irregular lumen phenotypes.

      The rescue is about 30%, which is not as good as expected. Maybe the wrong isoform was taken. Is it possible to find out which isoform is expressed in the salivary glands, e.g., by RNA in situ Hyb? This could then be used to analyze a more focused rescue beyond the paper.

      Thank you for this point, but we do not agree that the rescue is about 30%. In Papss mutants, about 50% of the embryos show the thin lumen phenotype whereas the other 50% show irregular lumen shapes. In the rescue embryos with a WT Papss, few embryos showed thin lumen phenotypes. About 40% of the rescue embryos showed "normal, fully expanded" lumen shapes, and the remaining 60% showed either irregular (thin+expanded) or slightly overexpanded lumen. It is not uncommon that rescue with the Gal4/UAS system results in a partial rescue because it is often not easy to achieve the balance of the proper amount of the protein with the overexpression system.

      To address the possibility that the wrong isoform was used, we performed in situ hybridization to examine the expression of different Papss spice forms in the salivary gland. We used probes that detect subsets of splice forms: A/B/C/F/G, D/H, and E/F/H, and found that all probes showed expression in the salivary gland, with varying intensities. The original probe, which detects all splice forms, showed the strongest signals in the salivary gland compared to the new probes which detect only a subset. However, the difference in the signal intensity may be due to the longer length of the original probe (>800 bp) compared to other probes that were made with much smaller regions (~200 bp). Digoxigenin in the DIG labeling kit for mRNA detection labels the uridine nucleotide in the transcript, and the probes with weaker signals contain fewer uridines (all: 147; ABCFG, 29; D, 36; EFH, 66). We also used the Papss-PD isoform, for a salivary gland-specific rescue experiment and obtained similar results to those with Papss-PE (Figure 1I-L, Figure 4D and E).

      Furthermore, we performed additional experiments to validate our findings. We performed a rescue experiment with a mutant form of Papss that has mutations in the critical rescues of the catalytic domains of the enzyme, which failed to rescue any phenotypes, including the thin lumen phenotype (Figure 1H, J-L), the number and intensity of WGA puncta (Figure 3I, I'), and cell death (Figure 4D, E). These results provide strong evidence that the defects observed in Papss mutants are due to the lack of sulfation.

      Crb is a transmembrane protein on the apicolateral side of the membrane. Accordingly, the apicolateral distribution can be seen in the control and the mutant. I believe there are no apparent differences here, not even in the amount of expression. However, the view of the cells (frame) shows possible differences. To be sure, a more in-depth analysis of the images is required. Confocal Z-stack images, with 3D visualization and orthogonal projections to analyze the membranes showing Crb staining together with a suitable membrane marker (e.g. SAS or Uif). This is the only way to show whether Crb is incorrectly distributed. Statistics of several papas mutants would also be desirable and not just a single representative image. When do the observed changes in Crb distribution occur in the development of the tubes, only during stage 16? Is papss only involved in the maintenance of the apical membrane? This is particularly important when considering the SJ and AJ, because the latter show no change in the mutants.

      We appreciate your suggestion to more thoroughly analyze Crb distribution. We adapted a method from a previous study (Olivares-Castiñeira and Llimargas, 2017) to quantify Crb signals in the subapical region and apical free region of salivary gland cells. Using E-Cad signals as a reference, we marked the apical cell boundaries of individual cells and calculated the intensity of Crb signals in the subapical region (along the cell membrane) and in the apical free region. We focused on the expanded region of the SG lumen in Papss mutants for quantification, as the thin lumen region was challenging to analyze. This quantification is included in Figure 2D. Statistical analysis shows that Crb signals were more dispersed in SG cells in Papss mutants compared to WT.

      A change in the ECM is only inferred based on the WGA localization. This is too few to make a clear statement. WGA is only an indirect marker of the cell surface and glycosylated proteins, but it does not indicate whether the ECM is altered in its composition and expression. Other important factors are missing here. In addition, only a single observation is shown, and statistics are missing.

      We understand your concern that WGA localization alone may not be sufficient to conclude changes in the ECM. However, we observed that luminal WGA signals colocalize with Dpy-YFP in the WT SG (Figure 5-figure supplement 2C), suggesting that WGA detects the aECM structure containing Dpy. The similar behavior of WGA and Dpy-YFP signals in multiple genotypes further supports this idea. In Papss mutants with a thin lumen phenotype, both WGA and Dpy-YFP signals are condensed (Figure 5E-H), and in pio mutants, both are absent from the lumen (Figure 6B, D). We analyzed WGA signals in over 25 samples of WT and Papss mutants, observing consistent phenotypes. We have included the number of samples in the text. While we acknowledge that WGA is an indirect marker, our data suggest that it is a reliable indicator of the aECM structure containing Dpy.

      Reduced WGA staining is seen in papss mutants, but this could be due to other circumstances. To be sure, a statistic with the number of dots must be shown, as well as an intensity blot on several independent samples. The images are from single confocal sections. It could be that the dots appear in a different Z-plane. Therefore, a 3D visualization of the voxels must be shown to identify and, at best, quantify the dots in the organ.

      We have quantified cytoplasmic punctate WGA signals. Using spinning disk microscopy with super-resolution technology (Olympus SpinSR10 Sora), we obtained high-resolution images of cytoplasmic punctate signals of WGA in WT, Papss mutant, and rescue SGs with the WT and mutant forms of Papss-PD. We then generated 3D reconstructed images of these signals using Imaris software (Figure 3E-H) and quantified the number and intensity of puncta. Statistical analysis of these data confirms the reduction of the number and intensity of WGA puncta in Papss mutants (Figure 3I, I'). The number of WGA puncta was restored by expressing WT Papss but not the mutant form. By using 3D visualization and quantification, we have ensured that our results are not limited to a single confocal section and account for potential variations in Z-plane localization of the dots.

      A colocalization analysis (statistics) should be shown for the overlap of WGA with ManII-GFP.

      Since WGA labels multiple structures, including the nuclear envelope and ECM structures, we focused on assessing the colocalization of the cytoplasmic WGA punctate signals and ManII-GFP signals. Standard colocalization analysis methods, such as Pearson's correlation coefficient or Mander's overlap coefficient, would be confounded by WGA signals in other tissues. Therefore, we used a fluorescent intensity line profile to examine the spatial relationship between WGA and ManII-GFP signals in WT and Papss mutants (Figure 3L, L').

      I do not understand how the authors describe "statistics of secretory vesicles" as an axis in Figure 3p. The TEM images do not show labeled secretory vesicles but empty structures that could be vesicles.

      Previous studies have analyzed "filled" electron-dense secretory vesicles in TEM images of SG cells (Myat and Andrew, 2002, Cell; Fox et al., 2010, J Cell Biol; Chung and Andrew, 2014, Development). Consistent with these studies, our WT TEM images show these vesicles. In contrast, Papss mutants show a mix of filled and empty structures. For quantification, we specifically counted the filled electron-dense vesicles (now Figure 3W). A clear description of our analysis is provided in the figure legend.

      1. The quality of the presented TEM images is too low to judge any difference between control and mutants. Therefore, the supplement must present them in better detail (higher pixel number?).

      We disagree that the quality of the presented TEM images is too low. Our TEM images have sufficient resolution to reveal details of many subcellular structures, such as mitochondrial cisternae. The pdf file of the original submission may not have been high resolution. To address this concern, we have provided several original high-quality TEM images of both WT and Papss mutants at various magnifications in Figure 2-figure supplement 2. Additionally, we have included low-magnification TEM images of WT and Papss mutants in Figure 2H and I to provide a clearer view of the overall SG lumen morphology.

      Line 266: the conclusion that apical trafficking is "significantly impaired" does not hold. This implies that Papss is essential for apical trafficking, but the analyzed ECM proteins (Pio, Dumpy) are found apically enriched in the mutants, and Dumpy is even secreted. Moreover, they analyze only one marker, Sec15, and don't provide data about the quantification of the secretion of proteins.

      We agree and have revised our statement to "defective sulfation affects Golgi structures and multiple routes of intracellular trafficking".

      DCP-1 was used to detect apoptosis in the glands to analyze acellular regions. However, the authors compare ST16 control with ST15 mutant salivary glands, which is problematic. Further, it is not commented on how many embryos were analyzed and how often they detect the dying cells in control and mutant embryos. This part must be improved.

      Thank you for the comment. We agree and have included quantification. We used stage 16 samples from WT and Papss mutants to quantify acellular regions. Since DCP-1 signals are only present at a specific stage of apoptosis, some acellular regions do not show DCP-1 signals. Therefore, we counted acellular regions regardless of DCP-1 signals. We also quantified this in rescue embryos with WT and mutant forms of Papss, which show complete rescue with WT and no rescue with the mutant form, respectively. The graph with a statistical analysis is included (Figure 4D, E).

      WGA and Dumpy show similar condensed patterns within the tube lumen. The authors show that dumpy is enriched from stage 14 onwards. How is it with WGA? Does it show the same pattern from stage 14 to 16? Papss mutants can suffer from a developmental delay in organizing the ECM or lack of internalization of luminal proteins during/after tube expansion, which is the case in the trachea.

      Dpy-YFP and WGA show overlapping signals in the SG lumen throughout morphogenesis. Dpy-YFP is SG enriched in the lumen from stage 11, not stage 14 (Figure 5-figure supplement 2). WGA is also detected in the lumen throughout SG morphogenesis, similar to Dpy. In the original supplemental figure, only a stage 16 SG image was shown for co-localization of Dpy-YFP and WGA signals in the SG lumen. We have now included images from stage 14 and 15 in Figure 5-figure supplement 2C.

      Given that luminal Pio signals are lost at stage 16 only and that Dpy signals appear as condensed structures in the lumen of Papss mutants, it suggests that the internalization of luminal proteins is not impaired in Papss mutants. Rather, these proteins are secreted but fail to organize properly.

      Line 366. Luminal morphology is characterized by bulging and constrictions. In the trachea, bulges indicate the deformation of the apical membrane and the detachment from the aECM. I can see constrictions and the collapsed tube lumen in Fig. 6C, but I don't find the bulges of the apical membrane in pio and Np mutants. Maybe showing it more clearly and with better quality will be helpful.

      Since the bulging phenotype appears to vary from sample to sample, we have revised the description of the phenotype to "constrictions" to more accurately reflect the consistent observations. We quantified the number of constrictions along the entire lumen in pio and Np mutants and included the graph in Figure 6F.

      The authors state that Papss controls luminal secretion of Pio and Dumpy, as they observe reduced luminal staining of both in papss mutants. However, the mCh-Pio and Dumpy-YFP are secreted towards the lumen. Does papss overexpression change Pio and Dumpy secretion towards the lumen, and could this be another explanation for the multiple phenotypes?

      Thank you for the comment. To clarify, we did not observe reduced luminal staining of Pio and Dpy in Papss mutants, nor did we state that Papss controls luminal secretion of Pio and Dpy. In Papss mutants, Pio luminal signals are absent specifically at stage 16 (Figure 5H), whereas strong luminal Pio signals are present until stage 15 (Figure 5G). For Dpy-YFP, the signals are not reduced but condensed in Papss mutants from stages 14-16 (Figure 5D, H).

      It remains unclear whether the apparent loss of Pio signals is due to a loss of Pio protein in the lumen or due to epitope masking resulting from protein aggregation or condensation. As noted in our response to Comment 11 internalization of luminal proteins seems unaffected in Papss mutants; proteins like Pio and Dpy are secreted into the lumen but fail to properly organize. Therefore, we have not tested whether Papss overexpression alters the secretion of Pio or Dpy.

      In our original submission, we incorrectly stated that uniform luminal mCh-Pio signals were unchanged in Papss mutants. Upon closer examination, we found these signals are absent in the expanded luminal region in stage 16 SG (where Dpy-YFP is also absent), and weak mCh-Pio signals colocalize with the condensed Dpy-YFP signals (Figure 5C, D). We have revised the text accordingly.

      Regulation of luminal ZP protein level is essential to modulate the tube expansion; therefore, Np releases Pio and Dumpy in a controlled manner during st15/16. Thus, the analysis of Pio and Dumpy in NP overexpression embryos will be critical to this manuscript to understand more about the control of luminal ZP matrix proteins.

      Thanks for the insightful suggestion. We overexpressed both the WT and mutant form of Np using UAS-Np.WT and UAS-Np.S990A lines (Drees et al., 2019) and analyzed mCh-Pio, Pio antibody, and Dpy-YFP signals. It is important to note that these overexpression experiments were done in the presence of the endogenous WT Np.

      Overexpression of Np.WT led to increased levels of mCh-Pio, Pio, and Dpy-YFP signals in the lumen and at the apical membrane. In contrast, overexpression of Np.S990A resulted in a near complete loss of luminal mCh-Pio signals. Pio antibody signals remained strong at the apical membrane but was weaker in the luminal filamentous structures compared to WT.

      Due to the GFP tag present in the UAS-Np.S990A line, we could not reliably analyze Dpy-YFP signals because of overlapping fluorescent signals in the same channel. However, the filamentous Pio signals in the lumen co-localized with GFP signals, suggesting that these structures might also include Dpy-YFP, although this cannot be confirmed definitively.

      These results suggest that overexpressed Np.S990A may act in a dominant-negative manner, competing with endogenous Np and impairing proper cleavage of Pio (and mCh-Pio). Nevertheless, some level of cleavage by endogenous Np still appears to occur, as indicated by the residual luminal filamentous Pio signals. These new findings have been incorporated into the revised manuscript and are shown in Figure 6H and 6I.

      Minor: Fig. 5 C': mChe-Pio and Dumpy-YFP are mixed up at the top of the images.

      Thanks for catching this error. It has been corrected.

      Sup. Fig7. A shows Pio in purple but B in green. Please indicate it correctly.

      It has been corrected.

      Reviewer #1 (Significance (Required)):

      In 2023, the functions of Pio, Dumpy, and Np in the tracheal tubes of Drosophila were published. The study here shows similar results, with the difference that the salivary glands do not possess chitin, but the two ZP proteins Pio and Dumpy take over its function. It is, therefore, a significant and exciting extension of the known function of the three proteins to another tube system. In addition, the authors identify papss as a new protein and show its essential function in forming the luminal matrix in the salivary glands. Considering the high degree of conservation of these proteins in other species, the results presented are crucial for future analyses and will have further implications for tubular development, including humans.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: There is growing appreciation for the important of luminal (apical) ECM in tube development, but such matrices are much less well understood than basal ECMs. Here the authors provide insights into the aECM that shapes the Drosophila salivary gland (SG) tube and the importance of PAPSS-dependent sulfation in its organization and function.

      The first part of the paper focuses on careful phenotypic characterization of papss mutants, using multiple markers and TEM. This revealed reduced markers of sulfation (Alcian Blue staining) and defects in both apical and basal ECM organization, Golgi (but not ER) morphology, number and localization of other endosomal compartments, plus increased cell death. The authors focus on the fact that papss mutants have an irregular SG lumen diameter, with both narrowed regions and bulged regions. They address the pleiotropy, showing that preventing the cell death and resultant gaps in the tube did not rescue the SG luminal shape defects and discussing similarities and differences between the papss mutant phenotype and those caused by more general trafficking defects. The analysis uses a papss nonsense mutant from an EMS screen - I appreciate the rigorous approach the authors took to analyze transheterozygotes (as well as homozygotes) plus rescued animals in order to rule out effects of linked mutations.

      The 2nd part of the paper focuses on the SG aECM, showing that Dpy and Pio ZP protein fusions localize abnormally in papss mutants and that these ZP mutants (and Np protease mutants) have similar SG lumen shaping defects to the papss mutants. A key conclusion is that SG lumen defects correlate with loss of a Pio+Dpy-dependent filamentous structure in the lumen. These data suggest that ZP protein misregulation could explain this part of the papss phenotype.

      Overall, the text is very well written and clear. Figures are clearly labeled. The methods involve rigorous genetic approaches, microscopy, and quantifications/statistics and are documented appropriately. The findings are convincing, with just a few things about the fusions needing clarification.

      minor comments 1. Although the Dpy and Qsm fusions are published reagents, it would still be helpful to mention whether the tags are C-terminal as suggested by the nomenclature, and whether Westerns have been performed, since (as discussed for Pio) cleavage could also affect the appearance of these fusions.

      Thanks for the comment. Dpy-YFP is a knock-in line in which YFP is inserted into the middle of the dpy locus (Lye et al., 2014; the insertion site is available on Flybase). mCh-Qsm is also a knock-in line, with mCh inserted near the N-terminus of the qsm gene using phi-mediated recombination using the qsmMI07716 line (Chu and Hayashi, 2021; insertion site available on Flybase). Based on this, we have updated the nomenclature from Qsm-mCh to mCh-Qsm throughout the manuscript to accurately reflect the tag position. To our knowledge, no western blot has been performed on Dpy-YFP or mCh-Qsm lines. We have mentioned this explicitly in the Discussion.

      The Dpy-YFP reagent is a non-functional fusion and therefore may not be a wholly reliable reporter of Dpy localization. There is no antibody confirmation. As other reagents are not available to my knowledge, this issue can be addressed with text acknowledgement of possible caveats.

      Thanks for raising this important point. We have added a caveat in the Discussion noting this limitation and the need for additional tools, such as an antibody or a functional fusion protein, to confirm the localization of Dpy.

      TEM was done by standard chemical fixation, which is fine for viewing intracellular organelles, but high pressure freezing probably would do a better job of preserving aECM structure, which looks fairly bad in Fig. 2G WT, without evidence of the filamentous structures seen by light microscopy. Nevertheless, the images are sufficient for showing the extreme disorganization of aECM in papss mutants.

      We agree that HPF is a better method and intent to use the HPF system in future studies. We acknowledge that chemical fixation contributes to the appearance of a gap between the apical membrane and the aECM, which we did not observe in the HPF/FS method (Chung and Andrew, 2014). Despite this, the TEM images still clearly reveal that Papss mutants show a much thinner and more electron-dense aECM compared to WT (Figure 2H, I), consistent to the condensed WGA, Dpy, and Pio signals in our confocal analyses. As the reviewer mentioned, we believe that the current TEM data are sufficient to support the conclusion of severe aECM disorganization and Golgi defects in Papss mutants.

      The authors may consider citing some of the work that has been done on sulfation in nematodes, e.g. as reviewed here: https://pubmed.ncbi.nlm.nih.gov/35223994/ Sulfation has been tied to multiple aspects of nematode aECM organization, though not specifically to ZP proteins.

      Thank you for the suggestion. Pioneering studies in C. elegans have highlighted the key role of sulfation in diverse developmental processes, including neuronal organization, reproductive tissue development, and phenotypic plasticity. We have now cited several works.

      Reviewer #2 (Significance (Required)):

      This study will be of interest to researchers studying developmental morphogenesis in general and specifically tube biology or the aECM. It should be particularly of interest to those studying sulfation or ZP proteins (which are broadly present in aECMs across organisms, including humans).

      This study adds to the literature demonstrating the importance of luminal matrix in shaping tubular organs and greatly advances understanding of the luminal matrix in the Drosophila salivary gland, an important model of tubular organ development and one that has key matrix differences (such as no chitin) compared to other highly studied Drosophila tubes like the trachea.

      The detailed description of the defects resulting from papss loss suggests that there are multiple different sulfated targets, with a subset specifically relevant to aECM biology. A limitation is that specific sulfated substrates are not identified here (e.g. are these the ZP proteins themselves or other matrix glycoproteins or lipids?); therefore it's not clear how direct or indirect the effects of papss are on ZP proteins. However, this is clearly a direction for future work and does not detract from the excellent beginning made here.

      My expertise: I am a developmental geneticist with interests in apical ECM

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this work Woodward et al focus on the apical extracellular matrix (aECM) in the tubular salivary gland (SG) of Drosophila. They provide new insights into the composition of this aECM, formed by ZP proteins, in particular Pio and Dumpy. They also describe the functional requirements of PAPSS, a critical enzyme involved in sulfation, in regulating the expansion of the lumen of the SG. A detailed cellular analysis of Papss mutants indicate defects in the apical membrane, the aECM and in Golgi organization. They also find that Papss control the proper organization of the Pio-Dpy matrix in the lumen. The work is well presented and the results are consistent.

      Main comments

      • This work provides a detailed description of the defects produced by the absence of Papss. In addition, it provides many interesting observations at the cellular and tissular level. However, this work lacks a clear connection between these observations and the role of sulfation. Thus, the mechanisms underlying the phenotypes observed are elusive. Efforts directed to strengthen this connection (ideally experimentally) would greatly increase the interest and relevance of this work.

      Thank you for this thoughtful comment. To directly test whether the phenotypes observed in Papss mutants are due to the loss of sulfation activity, we generated transgenic lines expressing catalytically inactive forms of Papss, UAS-PapssK193A, F593P, in which key residues in the APS kinase and ATP sulfurylase domains are mutated. Unlike WT UAS-Papss (both the Papss-PD or Papss-PE isoforms), the catalytically inactive UAS-Papssmut failed to rescue any of the phenotypes, including the thin lumen phenotype (Figure 1I-L), altered WGA signals (Figure I, I') and the cell death phenotype (Figure 4D, E). These findings strongly support the conclusion that the enzymatic sulfation activity of Papss is essential for the developmental processes described in this study.

      • A main issue that arises from this work is the role of Papss at the cellular level. The results presented convincingly indicate defects in Golgi organization in Papss mutants. Therefore, the defects observed could stem from general defects in the secretion pathway rather than from specific defects on sulfation. This could even underly general/catastrophic cellular defects and lead to cell death (as observed). This observation has different implications. Is this effect observed in SGs also observed in other cells in the embryo? If Papss has a general role in Golgi organization this would be expected, as Papss encodes the only PAPs synthatase in Drosophila. Can the authors test any other mutant that specifically affect Golgi organization and investigate whether this produces a similar phenotype to that of Papss?

      Thank you for the comment. To address whether the defects observed in Papss mutants stem from general disruption of the secretory pathway due to Golgi disorganization, we examined mutants of two key Golgi components: Grasp65 and GM130.

      In Grasp65 mutants, we observed significant defects in SG lumen morpholgy, including highly irregular SG lumen shape and multiple constrictions (100%; n=10/10). However, the lumen was not uniformly thin as in Papss mutants. In contrast, GM130 mutants-although this line was very sick and difficult to grow-showed relatively normal salivary glands morphology in the few embryos that survived to stage 16 (n=5/5). It is possible that only embryos with mild phenotypes progressed to this stages, limiting interpretation. These data have now been included in Figure 3-figure supplement 2. Overall, while Golgi disruption can affect SG morphology, the specific phenotypes seen in Papss mutants are not fully recapitulated by Grasp65 or GM130 loss.

      • A model that conveys the different observations and that proposes a function for Papss in sulfation and Golgi organization (independent or interdependent?) would help to better present the proposed conclusions. In particular, the paper would be more informative if it proposed a mechanism or hypothesis of how sulfation affects SG lumen expansion. Is sulfation regulating a factor that in turn regulates Pio-Dpy matrix? Is it regulating Pio-Dpy directly? Is it regulating a product recognized by WGA? For instance, investigating Alcian blue or sulfotyrosine staining in pio, dpy mutants could help to understand whether Pio, Dpy are targets of sulfation.

      Thank you for the comment. We're also very interested in learning whether the regulation of the Pio-Dpy matrix is a direct or indirect consequence of the loss of sulfation on these proteins. One possible scenario is that sulfation directly regulates the Pio-Dpy matrix by regulating protein stability through the formation of disulfide bonds between the conserved Cys residues responsible for ZP module polymerization. Additionally, the Dpy protein contains hundreds of EGF modules that are highly susceptible to O-glycosylation. Sulfation of the glycan groups attached to Dpy may be critical for its ability to form a filamentous structure. Without sulfation, the glycan groups on Dpy may not interact properly with the surrounding materials in the lumen, resulting in an aggregated and condensed structure. These possibilities are discussed in the Discussion.

      We have not analyzed sulfation levels in pio or dpy mutants because sulfation levels in mutants of single ZP domain proteins may not provide much information. A substantial number of proteoglycans, glycoproteins, and proteins (with up to 1% of all tyrosine residues in an organism's proteins estimated to be sulfated) are modified by sulfation, so changes in sulfation levels in a single mutant may be subtle. Especially, the existing dpy mutant line is an insertion mutant of a transposable element; therefore, the sulfation sites would still remain in this mutant.

      • Interpretation of Papss effects on Pio and Dpy would be desired. The results presented indicate loss of Pio antibody staining but normal presence of cherry-Pio. This is difficult to interpret. How are these results of Pio antibody and cherry-Pio correlating with the results in the trachea described recently (Drees et al. 2023)?

      In our original submission, we stated that the uniform luminal mCh-Pio signals were not changed in Papss mutants, but after re-analysis, we found that these signals were actually absent from the expanded luminal region in stage 16 SG (where Dpy-YFP is also absent), and weak mCh-Pio signals colocalize with the condensed Dpy-YFP signals (Figure 5C, D). We have revised the text accordingly.

      After cleavages by Np and furin, the Pio protein should have three fragments. The N-terminal region contains the N-terminal half of the ZP domain, and mCh-Pio signals show this fragment. The very C-terminal region should localize to the membrane as it contains the transmembrane domain. We think the middle piece, the C-terminal ZP domain, is recognized by the Pio antibody. The mCh-Pio and Pio antibody signals in the WT trachea (Drees et al., 2023) are similar to those in the SG. mCh-Pio signals are detected in the tracheal lumen as uniform signals, at the apical membrane, and in cytoplasmic puncta. Pio antibody signals are exclusively in the tracheal lumen and show more heterogenous filamentous signals.

      In Papss mutants, the middle fragment (the C-terminal ZP domain) seems to be most affected because the Pio antibody signals are absent from the lumen. The loss of Pio antibody signals could be due to protein degradation or epitope masking caused by aECM condensation and protein misfolding. This fragment seems to be key for interacting with Dpy, since Pio antibody signals always colocalize with Dpy-YFP. The N-terminal mCh-Pio fragment does not appear to play a significant role in forming a complex with Dpy in WT (but still aggregated together in Papss mutants), and this can be tested in future studies.

      In response to Reviewer 1's comment, we performed an additional experiment to test the role of Np in cleaving Pio to help organize the SG aECM. In this experiment, we overexpressed the WT and mutant form of Np using UAS-Np.WT and UAS-Np.S990A lines (Drees et al., 2019) and analyzed mCh-Pio, Pio antibody, and Dpy-YFP signals. Np.WT overexpression resulted in increased levels of mCh-Pio, Pio, and Dpy-YFP signals in the lumen and at the apical membrane. However, overexpression of Np.S990A resulted in the absence of luminal mCh-Pio signals. Pio antibody signals were strong at the apical membrane but rather weak in the luminal filamentous structures. Since the UAS-Np.S990A line has the GFP tag, we could not reliably analyze Dpy-YFP signals due to overlapping Np.S990A.GFP signals in the same channel. However, the luminal filamentous Pio signals co-localized with GFP signals, and we assume that these overlapping signals could be Dpy-YFP signals.

      These results suggest that overexpressed Np.S990A may act in a dominant-negative manner, competing with endogenous Np and impairing proper cleavage of Pio (and mCh-Pio). Nevertheless, some level of cleavage by endogenous Np still appears to occur, as indicated by the residual luminal filamentous Pio signals. These new findings have been incorporated into the revised manuscript and are shown in Figure 6H and 6I.

      A proposed model of the Pio-Dpy aECM in WT, Papss, pio, and Np mutants has now been included in Figure 7.

      • What does the WGA staining in the lumen reveal? This staining seems to be affected differently in pio and dpy mutants: in pio mutants it disappears from the lumen (as dpy-YFP does), but in dpy mutants it seems to be maintained. How do the authors interpret these findings? How does the WGA matrix relate to sulfated products (using Alcian blue or sulfotyrosine)?

      WGA binds to sialic acid and N-acetylglucosamine (GlcNAc) residues on glycoproteins and glycolipids. GlcNAc is a key component of the glycosaminoglycan (GAG) chains that are covalently attached to the core protein of a proteoglycan, which is abundant in the ECM. We think WGA detects GlcNAc residues in the components of the aECM, including Dpy as a core component, based on the following data. 1) WGA and Dpy colocalize in the lumen, both in WT (as thin filamentous structures) and Papss mutant background (as condensed rod-like structures), and 2) are absent in pio mutants. WGA signals are still present in a highly condensed form in dpy mutants. That's probably because the dpy mutant allele (dpyov1) has an insertion of a transposable element (blood element) into intron 11 and this insertion may have caused the Dpy protein to misfold and condense. We added the information about the dpy allele to the Results section and discussed it in the Discussion.

      Minor points:

      • The morphological phenotypic analysis of Papss mutants (homozygous and transheterozygous) is a bit confusing. The general defects are higher in Papss homozygous than in transheterozygotes over a deficiency. Maybe quantifying the defects in the heterozygote embryos in the Papss mutant collection could help to figure out whether these defects relate to Papss mutation.

      We analyzed the morphology of heterozygous Papss mutant embryos. They were all normal. The data and quantifications have now been added to Figure 1-figure supplement 3.

      • The conclusion that the apical membrane is affected in Papss mutants is not strongly supported by the results presented with the pattern of Crb (Fig 2). Further evidences should be provided. Maybe the TEM analysis could help to support this conclusion

      We quantified Crb levels in the sub-apical and medial regions of the cell and included this new quantification in Figure 2D. TEM images showed variation in the irregularity of the apical membrane, even in WT, and we could not draw a solid conclusion from these images.

      • It is difficult to understand why in Papss mutants the levels of WGA increase. Can the authors elaborate on this?

      We think that when Dpy (and many other aECM components) are condensed and aggregated into the thin, rod-like structure in Papss mutants, the sugar residues attached to them must also be concentrated and shown as increased WGA signals.

      • The explanation about why Pio antibody and mcherry-Pio show different patterns is not clear. If the antibody recognizes the C-t region, shouldn't it be clearly found at the membrane rather than the lumen?

      The Pio protein is also cleaved by furin protease (Figure 5B). We think the Pio fragment recognized by the antibody should be a "C-terminal ZP domain", which is a middle piece after furin + Np cleavages.

      • The qsm information does not seem to provide any relevant information to the aECM, or sulfation.

      Since Qsm has been shown to bind to Dpy and remodel Dpy filaments in the muscle tendon (Chu and Hayashi, 2021), we believe that the different behavior of Qsm in the SG is still informative. As mentioned briefly in the Discussion, the cleaved Qsm fragment may localize differently, like Pio, and future work will need to test this. We have shortened the description of the Qsm localization in the manuscript and moved the details to the figure legend of Figure 5-figure supplement 3.

      Reviewer #3 (Significance (Required)):

      Previous reports already indicated a role for Papss in sulfation in SG (Zhu et al 2005). Now this work provides a more detailed description of the defects produced by the absence of Papss. In addition, it provides relevant data related to the nature and requirements of the aECM in the SG. Understanding the composition and requirements of aECM during organ formation is an important question. Therefore, this work may be relevant in the fields of cell biology and morphogenesis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this work Woodward et al focus on the apical extracellular matrix (aECM) in the tubular salivary gland (SG) of Drosophila. They provide new insights into the composition of this aECM, formed by ZP proteins, in particular Pio and Dumpy. They also describe the functional requirements of PAPSS, a critical enzyme involved in sulfation, in regulating the expansion of the lumen of the SG. A detailed cellular analysis of Papss mutants indicate defects in the apical membrane, the aECM and in Golgi organization. They also find that Papss control the proper organization of the Pio-Dpy matrix in the lumen. The work is well presented and the results are consistent.

      Main comments:

      • This work provides a detailed description of the defects produced by the absence of Papss. In addition, it provides many interesting observations at the cellular and tissular level. However, this work lacks a clear connection between these observations and the role of sulfation. Thus, the mechanisms underlying the phenotypes observed are elusive. Efforts directed to strengthen this connection (ideally experimentally) would greatly increase the interest and relevance of this work.

      • A main issue that arises from this work is the role of Papss at the cellular level. The results presented convincingly indicate defects in Golgi organization in Papss mutants. Therefore, the defects observed could stem from general defects in the secretion pathway rather than from specific defects on sulfation. This could even underly general/catastrophic cellular defects and lead to cell death (as observed). This observation has different implications. Is this effect observed in SGs also observed in other cells in the embryo? If Papss has a general role in Golgi organization this would be expected, as Papss encodes the only PAPs synthatase in Drosophila. Can the authors test any other mutant that specifically affect Golgi organization and investigate whether this produces a similar phenotype to that of Papss?

      • A model that conveys the different observations and that proposes a function for Papss in sulfation and Golgi organization (independent or interdependent?) would help to better present the proposed conclusions. In particular, the paper would be more informative if it proposed a mechanism or hypothesis of how sulfation affects SG lumen expansion. Is sulfation regulating a factor that in turn regulates Pio-Dpy matrix? Is it regulating Pio-Dpy directly? Is it regulating a product recognized by WGA?<br /> For instance, investigating Alcian blue or sulfotyrosine staining in pio, dpy mutants could help to understand whether Pio, Dpy are targets of sulfation.

      • Interpretation of Papss effects on Pio and Dpy would be desired. The results presented indicate loss of Pio antibody staining but normal presence of cherry-Pio. This is difficult to interpret. How are these results of Pio antibody and cherry-Pio correlating with the results in the trachea described recently (Drees et al. 2023)?

      • What does the WGA staining in the lumen reveal? This staining seems to be affected differently in pio and dpy mutants: in pio mutants it disappears from the lumen (as dpy-YFP does), but in dpy mutants it seems to be maintained. How do the authors interpret these findings? How does the WGA matrix relate to sulfated products (using Alcian blue or sulfotyrosine)?.

      Minor points:

      • The morphological phenotypic analysis of Papss mutants (homozygous and transheterozygous) is a bit confusing. The general defects are higher in Papss homozygous than in transheterozygotes over a deficiency. Maybe quantifying the defects in the heterozygote embryos in the Papss mutant collection could help to figure out whether these defects relate to Papss mutation.

      • The conclusion that the apical membrane is affected in Papss mutants is not strongly supported by the results presented with the pattern of Crb (Fig 2). Further evidences should be provided. Maybe the TEM analysis could help to support this conclusion

      • It is difficult to understand why in Papss mutants the levels of WGA increase. Can the authors elaborate on this?

      • The explanation about why Pio antibody and mcherry-Pio show different patterns is not clear. If the antibody recognizes the C-t region, shouldn't it be clearly found at the membrane rather than the lumen?

      • The qsm information does not seem to provide any relevant information to the aECM, or sulfation.

      Significance

      Previous reports already indicated a role for Papss in sulfation in SG (Zhu et al 2005). Now this work provides a more detailed description of the defects produced by the absence of Papss. In addition, it provides relevant data related to the nature and requirements of the aECM in the SG. Understanding the composition and requirements of aECM during organ formation is an important question. Therefore, this work may be relevant in the fields of cell biology and morphogenesis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      There is growing appreciation for the important of luminal (apical) ECM in tube development, but such matrices are much less well understood than basal ECMs. Here the authors provide insights into the aECM that shapes the Drosophila salivary gland (SG) tube and the importance of PAPSS-dependent sulfation in its organization and function.

      The first part of the paper focuses on careful phenotypic characterization of papss mutants, using multiple markers and TEM. This revealed reduced markers of sulfation (Alcian Blue staining) and defects in both apical and basal ECM organization, Golgi (but not ER) morphology, number and localization of other endosomal compartments, plus increased cell death. The authors focus on the fact that papss mutants have an irregular SG lumen diameter, with both narrowed regions and bulged regions. They address the pleiotropy, showing that preventing the cell death and resultant gaps in the tube did not rescue the SG luminal shape defects and discussing similarities and differences between the papss mutant phenotype and those caused by more general trafficking defects. The analysis uses a papss nonsense mutant from an EMS screen - I appreciate the rigorous approach the authors took to analyze transheterozygotes (as well as homozygotes) plus rescued animals in order to rule out effects of linked mutations.

      The 2nd part of the paper focuses on the SG aECM, showing that Dpy and Pio ZP protein fusions localize abnormally in papss mutants and that these ZP mutants (and Np protease mutants) have similar SG lumen shaping defects to the papss mutants. A key conclusion is that SG lumen defects correlate with loss of a Pio+Dpy-dependent filamentous structure in the lumen. These data suggest that ZP protein misregulation could explain this part of the papss phenotype.

      Overall, the text is very well written and clear. Figures are clearly labeled. The methods involve rigorous genetic approaches, microscopy, and quantifications/statistics and are documented appropriately. The findings are convincing, with just a few things about the fusions needing clarification.

      Minor comments:

      1. Although the Dpy and Qsm fusions are published reagents, it would still be helpful to mention whether the tags are C-terminal as suggested by the nomenclature, and whether Westerns have been performed, since (as discussed for Pio) cleavage could also affect the appearance of these fusions.

      2. The Dpy-YFP reagent is a non-functional fusion and therefore may not be a wholly reliable reporter of Dpy localization. There is no antibody confirmation. As other reagents are not available to my knowledge, this issue can be addressed with text acknowledgement of possible caveats.

      3. TEM was done by standard chemical fixation, which is fine for viewing intracellular organelles, but high pressure freezing probably would do a better job of preserving aECM structure, which looks fairly bad in Fig. 2G WT, without evidence of the filamentous structures seen by light microscopy. Nevertheless, the images are sufficient for showing the extreme disorganization of aECM in papss mutants.

      4. The authors may consider citing some of the work that has been done on sulfation in nematodes, e.g. as reviewed here: https://pubmed.ncbi.nlm.nih.gov/35223994/ Sulfation has been tied to multiple aspects of nematode aECM organization, though not specifically to ZP proteins.

      Significance

      This study will be of interest to researchers studying developmental morphogenesis in general and specifically tube biology or the aECM. It should be particularly of interest to those studying sulfation or ZP proteins (which are broadly present in aECMs across organisms, including humans).

      This study adds to the literature demonstrating the importance of luminal matrix in shaping tubular organs and greatly advances understanding of the luminal matrix in the Drosophila salivary gland, an important model of tubular organ development and one that has key matrix differences (such as no chitin) compared to other highly studied Drosophila tubes like the trachea.

      The detailed description of the defects resulting from papss loss suggests that there are multiple different sulfated targets, with a subset specifically relevant to aECM biology. A limitation is that specific sulfated substrates are not identified here (e.g. are these the ZP proteins themselves or other matrix glycoproteins or lipids?); therefore it's not clear how direct or indirect the effects of papss are on ZP proteins. However, this is clearly a direction for future work and does not detract from the excellent beginning made here.

      My expertise: I am a developmental geneticist with interests in apical ECM

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      PAPS is required for all sulfotransferase reactions in which a sulfate group is covalently attached to amino acid residues of proteins or to side chains of proteoglycans. This sulfation is crucial for properly organizing the apical extracellular matrix (aECM) and expanding the lumen in the Drosophila salivary gland. Loss of Papss potentially leads to decreased sulfation, disorganizing the aECM, and defects in lumen formation. In addition, Papss loss destabilizes the Golgi structures.

      In Papss mutants, several changes occur in the salivary gland lumen of Drosophila. The tube lumen is very thin and shows irregular apical protrusions. There is a disorganization of the apical membrane and a compaction of the apical extracellular matrix (aECM). The Golgi structures and intracellular transport are disturbed. In addition, the ZP domain proteins Piopio (Pio) and Dumpy (Dpy) lose their normal distribution in the lumen, which leads to condensation and dissociation of the Dpy-positive aECM structure from the apical membrane. This results in a thin and irregularly dilated lumen.

      1. The authors describe various changes in the lumen in mutants, from thin lumen to irregular expansion. I would like to know the correct lumen diameter, and length, besides the total area, by which one can recognize thin and irregular.

      2. The rescue is about 30%, which is not as good as expected. Maybe the wrong isoform was taken. Is it possible to find out which isoform is expressed in the salivary glands, e.g., by RNA in situ Hyb? This could then be used to analyze a more focused rescue beyond the paper.

      3. Crb is a transmembrane protein on the apicolateral side of the membrane. Accordingly, the apicolateral distribution can be seen in the control and the mutant. I believe there are no apparent differences here, not even in the amount of expression. However, the view of the cells (frame) shows possible differences. To be sure, a more in-depth analysis of the images is required. Confocal Z-stack images, with 3D visualization and orthogonal projections to analyze the membranes showing Crb staining together with a suitable membrane marker (e.g. SAS or Uif). This is the only way to show whether Crb is incorrectly distributed. Statistics of several papas mutants would also be desirable and not just a single representative image. When do the observed changes in Crb distribution occur in the development of the tubes, only during stage 16? Is papass only involved in the maintenance of the apical membrane? This is particularly important when considering the SJ and AJ, because the latter show no change in the mutants.

      4. A change in the ECM is only inferred based on the WGA localization. This is too few to make a clear statement. WGA is only an indirect marker of the cell surface and glycosylated proteins, but it does not indicate whether the ECM is altered in its composition and expression. Other important factors are missing here. In addition, only a single observation is shown, and statistics are missing.

      5. Reduced WGA staining is seen in papas mutants, but this could be due to other circumstances. To be sure, a statistic with the number of dots must be shown, as well as an intensity blot on several independent samples. The images are from single confocal sections. It could be that the dots appear in a different Z-plane. Therefore, a 3D visualization of the voxels must be shown to identify and, at best, quantify the dots in the organ.

      6. A colocalization analysis (statistics) should be shown for the overlap of WGA with ManII-GFP.

      7. I do not understand how the authors describe "statistics of secretory vesicles" as an axis in Figure 3p. The TEM images do not show labeled secretory vesicles but empty structures that could be vesicles.

      8. The quality of the presented TEM images is too low to judge any difference between control and mutants. Therefore, the supplement must present them in better detail (higher pixel number?).

      9. Line 266: the conclusion that apical trafficking is "significantly impaired" does not hold. This implies that Papass is essential for apical trafficking, but the analyzed ECM proteins (Pio, Dumpy) are found apically enriched in the mutants, and Dumpy is even secreted. Moreover, they analyze only one marker, Sec15, and don't provide data about the quantification of the secretion of proteins.

      10. DCP-1 was used to detect apoptosis in the glands to analyze acellular regions. However, the authors compare ST16 control with ST15 mutant salivary glands, which is problematic. Further, it is not commented on how many embryos were analyzed and how often they detect the dying cells in control and mutant embryos. This part must be improved.

      11. WGA and Dumpy show similar condensed patterns within the tube lumen. The authors show that dumpy is enriched from stage 14 onwards. How is it with WGA? Does it show the same pattern from stage 14 to 16? Papass mutants can suffer from a developmental delay in organizing the ECM or lack of internalization of luminal proteins during/after tube expansion, which is the case in the trachea.

      12. Line 366. Luminal morphology is characterized by bulging and constrictions. In the trachea, bulges indicate the deformation of the apical membrane and the detachment from the aECM. I can see constrictions and the collapsed tube lumen in Fig. 6C, but I don't find the bulges of the apical membrane in pio and Np mutants. Maybe showing it more clearly and with better quality will be helpful.

      13. The authors state that Papass controls luminal secretion of Pio and Dumpy, as they observe reduced luminal staining of both in papass mutants. However, the mCh-Pio and Dumpy-YFP are secreted towards the lumen. Does papass overexpression change Pio and Dumpy secretion towards the lumen, and could this be another explanation for the multiple phenotypes? Regulation of luminal ZP protein level is essential to modulate the tube expansion; therefore, Np releases Pio and Dumpy in a controlled manner during st15/16. Thus, the analysis of Pio and Dumpy in NP overexpression embryos will be critical to this manuscript to understand more about the control of luminal ZP matrix proteins.

      14. Minor: Fig. 5 C': mChe-Pio and Dumpy-YFP are mixed up at the top of the images. Sup. Fig7. A shows Pio in purple but B in green. Please indicate it correctly.

      Significance

      In 2023, the functions of Pio, Dumpy, and Np in the tracheal tubes of Drosophila were published. The study here shows similar results, with the difference that the salivary glands do not possess chitin, but the two ZP proteins Pio and Dumpy take over its function. It is, therefore, a significant and exciting extension of the known function of the three proteins to another tube system. In addition, the authors identify papass as a new protein and show its essential function in forming the luminal matrix in the salivary glands. Considering the high degree of conservation of these proteins in other species, the results presented are crucial for future analyses and will have further implications for tubular development, including humans.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study characterizes autoimmunity in mutant lines of Arabidopsis that are lacking components of the m6A methyltransferase complex (MTC). The molecular results and bacterial pathogen resistance of the lines at low temps as compared to high temps support this hypothesis. However, the phenotypic analysis or new complete lack there of (Figure 6), makes the hypothesis and overall story much less convincing. I give some comments for improving the figures below.

      Figure 6 showing the phenotypes in its current set up is very uninformative. More informative pictures and quantitative analyses of specific developmental phenotypes should be added to show the differences between the phenotypes of the mutant and wild-type plants at the two different temperatures. As of now the reader gets a sense of nothing from the figure. Without this Figure demonstrating a major rescue of phenotype at the 27C temperature the reader is not convinced that the autoimmunity is the major cause of the phenotype

      Supplemental Figure 1 is missing from the review file.

      Significance

      It is notable that a couple of recent studies have already shown the increased resistance of MTC component mutants to pathogens (Prall et al. 2024 and Chen et al. 2024), which weakens the impact of the overall findings. In my honest assessment, this study would be well positioned for publication in a mid-tier plant specific journal (e.g. Plant Physiology) based on the currently included results.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Metheringham et al. reports on interesting new characterizations of phenotypes caused by genetic inactivation of subunits of the methyl transferase complex responsible for N6-adenosine methylation in (pre)-mRNA ("the m6A writer") in the plant Arabidopsis thaliana. The main claim of the paper is that mutants in these subunits exhibit autoimmunity, a claim that is supported by the following lines of evidence:

      • Transcriptome profiling by mRNA-seq shows a gene expression profile with differential expression of many stress- and defense-related genes.
      • The immunity-like gene expression profile is observed under growth at 17{degree sign}C but not at 27{degree sign}C, consistent with the well-known temperature-sensitivity of some (but not all) innate immunity signaling systems in plants.
      • m6A writer mutants show increased resistance to infection by the virulent Pseudomonas syringae DC3000 strain.
      • The primary biochemical defect in m6A writing is not temperature sensitive, excluding the trivial possibility that the mutant alleles chosen for study are simply ts.

      The observations are important and the manuscript is very well written, a pleasure to read: the problem is clearly presented, the experimental results are presented in a clear, logical succession, and the discussion treats important points.

      The study is valuable pending some manuscript revision on the autoimmunity interpretation of the results obtained, and a few suggested edits that can be included if the authors agree that they would improve the paper.

      The finding that an autoimmune-like state is activated in m6A writer mutants is significant because it provides a warning flag on how such mutants should be used for studying the role of m6A in stress response signaling, including reassessment of previously published work. Whether the stress state really is autoimmunity is subject to some debate, particularly because no genetic evidence to support it has been obtained. The results are nonetheless interesting and constitute an important contribution to the community, even if they remain descriptive and with nearly no insight into molecular mechanisms. My suggestions for improvement are summarized below.

      1. Although the authors do a lot to support the claim that autoimmunity is an element of m6A writer mutant phenotypes, the study does not include genetic evidence to support this claim. This is important, because if the stress/defense gene activation causes some of the morphological phenotypes of m6A writer mutants, one should be able to suppress such defects by mutation of know immune signaling components such as the appropriate nucleotide-binding leucine-rich repeat proteins, or more generic signaling components such as EDS1, PAD4 and SAG1, common to a subset of such intracellular immune receptors. Resistance to pathogens can be observed in mutants with constitutive stress response signaling, and defense-like gene expression can be induced as a secondary of other primary defects, for instance DNA damage. Similarly, while it is true that some types of immune activation are temperature sensitive, others are not 1, and clearly, elevated temperature changes so much of the physiology of the plant that sensitivity to elevated temperature cannot be used as proof of immune activation. Thus, each of the lines of evidence presented is suggestive, not conclusive. Together, they constitute a good argument, but still not a completely satisfactory proof of the main claim. I do not think that this concern means that a lot of genetic work must be undertaken to make this paper publishable, but I think that the authors should be even more careful about how they interpret their observations. I understand that they favor more or less direct activation of autoimmunity, although even if that were true, it would be unclear what the biochemical triggers of such autoimmunity would be (unmethylated RNA, absence or writer components, excess of free m6A-binding proteins etc). However, given the concerns above, I think the authors should dedicate a small paragraph in the discussion to the possibility that the primary cause of stress/defense-gene expression is unclear and may not result from innate immune surveillance of unmethylated mRNA or components of the m6A pathway as favoured by the authors.
      2. It may be of relevance to search promoters of differentially expressed genes for enrichment of cis-elements. This simple approach identified the W-box in the first papers using transcriptome profiling to characterize the immune state in Arabidopsis 2,3, and could perhaps reveal whether a WRKY-driven transcriptional program drives differential expression or whether several other transcription factor classes may also contribute substantially, as may be expected if a more complex stress-related transcriptional program is activated. I do not think that this is a deal breaker, but some additional useful information from the existing data might be gathered in this way.
      3. Stress response activation has also been clearly described in ect2 ect3 ect4 mutants4 and even if the authors find no evidence for PR1 expression in this mutant, it is still of relevance to include a mention of this result in the discussion, together with the discussion of stress response activation seen in writer mutants in earlier reports 5,6. I would not mind the authors being a bit more explicit about what their results mean for studies that try to conclude on the biological relevance of m6A in different types of stress signaling, using phenotypes writer mutants as their primary line of evidence. But this is of course up to the authors to decide on that.
      4. In the introduction on preferred m6A sequence contexts, please clarify that m6A in plants occurs both DRACH in (G)GAU contexts 7,8.
      5. When mentioning convergence on shared signaling components from immune receptors, please include a tiny bit more information for the reader. For instance, EDS1 is mentioned, but this protein is only required for signaling from (some) TIR-NBS-LRRs, not the class of CC-NBS-LRRs. Indeed, signaling by this latter class may not converge on just one to a few components, as their multimerization appears to form the ion channels required for signaling-inducing ion currents.
      6. Please clarify in the introduction and in later parts that only some forms of autoimmunity can be suppressed by elevated temperature. Sentences like "A hallmark of Arabidopsis autoimmunity is temperature sensitivity..." are a bit misleading. Temperature sensitivity has clearly been used to study some forms of EDS1-dependent immunity, to great effect in the TMV-N interaction for instance, but it is not accurate to call temperature sensitivity a "hallmark of autoimmunity".
      7. In the discussion of possible biochemical triggers of autoimmunity in m6A mutants, please consider the following:

      (A) Mention the possibility that the primary trigger may not be immune receptor-surveillance of some defect induced by lack of m6A in mRNA (as discussed above).

      (B) In connection with the consideration that lack of m6A writer components, not m6A in mRNA, may be a signal, you could include the observation from yeast that Ime4 knockouts have a much stronger phenotype than Ime4 catalytically dead mutants or knockouts of the sole yeast YTH-domain Pho92 9. Indeed, it is a bit of an embarrassment to the plant m6A community that we have not yet examined phenotypes of MTA and MTB catalytically dead mutants, and the present report should further urge the community to finally do this important experiment. 8. Just a tiny typo on page 15, Pst DC3000, not Pst D3000 (of no relevance to the overall assessment, just a help to eliminate annoying errors before final submission).

      REFERENCES

      1. Demont, H. et al. Downstream signaling induced by several plant Toll/interleukin-1 receptor-containing immune proteins is stable at elevated temperature. Cell Reports 44(2025).
      2. Petersen, M. et al. Arabidopsis MAP kinase 4 negatively regulates systemic acquired resistance. Cell 103, 1111-1120 (2000).
      3. Maleck, K. et al. The transcriptome of Arabidopsis thaliana during systemic acquired resistance. Nature Genetics 26, 403-410 (2000).
      4. Arribas-Hernández, L. et al. The YTHDF proteins ECT2 and ECT3 bind largely overlapping target sets and influence target mRNA abundance, not alternative polyadenylation. eLife 10, e72377 (2021).
      5. Bodi, Z. et al. Adenosine Methylation in Arabidopsis mRNA is Associated with the 3' End and Reduced Levels Cause Developmental Defects. Front Plant Sci 3, 48 (2012).
      6. Prall, W. et al. Pathogen-induced m6A dynamics affect plant immunity. The Plant Cell 35, 4155-4172 (2023).
      7. Arribas-Hernández, L. et al. Principles of mRNA targeting via the Arabidopsis m6A-binding protein ECT2. eLife 10, e72375 (2021).
      8. Wang, G. et al. Quantitative profiling of m6A at single base resolution across the life cycle of rice and Arabidopsis. Nature Communications 15, 4881 (2024).
      9. Ensinck, I. et al. The yeast RNA methylation complex consists of conserved yet reconfigured components with m6A-dependent and independent roles. eLife 12, RP87860 (2023).

      Significance

      The finding that an autoimmune-like state is activated in m6A writer mutants is significant because it provides a warning flag on how such mutants should be used for studying the role of m6A in stress response signaling, including reassessment of previously published work. Whether the stress state really is autoimmunity is subject to some debate, particularly because no genetic evidence to support it has been obtained. The results are nonetheless interesting and constitute an important contribution to the community, even if they remain descriptive and with nearly no insight into molecular mechanisms. I wish to congratulate the authors on another valuable contribution to the plant m6A field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors aim to understand the consequences of disrupting N6 methyladenosine, an abundant mRNA modification in plants and other organisms, in Arabidopsis. Genetic ablation of the N6 methyladenosine transferase complex is embryonic lethal in Arabidopsis. Therefore, the authors utilize a hypomorphic allele of VIRILIZER, a component of the complex, to examine gene expression changes and other phenotypes. The authors demonstrate that immune response pathway genes are misregulated in the vir mutant. This transcriptional phenotype is suppressed at higher temperatures, although developmental phenotypes are not. The manuscript provides strong evidence that reduced function of the m6A methyltransferase complex leads to upregulation of immune response genes, although a mechanistic connection between the immune response and m6A in mRNA is not discerned.

      Major comments

      The major claims of the manuscript are that disrupting the m6A writer complex triggers an autoimmune response that is present at 17C and suppressed at 27C (in line with known aspects of Arabidopsis immunity). Consistent with this, they also show that at 17C the vir-1 mutant has more cell death and is more resistant to infection by Pseudomonas syringae. All of these claims are well supported by the data. The authors also claim that polyA tail lengths are different between the two temperatures. They further speculate that mRNAs that lack m6A trigger immune signaling, but this is not directly tested in the study.

      The conclusions about transcriptional activation of the immune response at lower temperatures are sufficiently supported by two types of mRNA sequencing data (direct RNA sequencing and short-read sequences) and appropriate biological replication. The initial profiling was at 22 C, later profiling was at 17C and 27 C. How similar/overlapping were the vir-1 misregulated genes at 17C and 22C? Is the immune response transcriptional signature stronger at 17C than at 22C? The authors sought to determine whether the vir-1 response at 17C was due to pathogen infection of those plants. They used their Illumina RNA-seq data to try and identify pathogen RNAs. They report that there was no significant enrichment of plant pathogen sequences (supplemental table 7). Significant compared to what? Supplemental Table 7 does not indicate that the WT data was assessed and there's no information on significance of enrichment (or nothing obvious, based on column titles). Did the Illumina library prep preparation rely on polyA tails? If so, this is not a sensitive assay to detect bacterial transcripts.

      I found the last section on altered poly(A) tail length and site usage somewhat difficult to follow and the analysis rather cursory. The authors find no difference in polyA site usage in vir-1 at 17C or 27C (although both are different than WT). For Figure 7A, in addition to the histogram of poly A site shift, I would like to see a plot (heatmap?) that compares poly A sites shift for individual mRNAs across samples, instead of only aggregated data. Are there individual mRNAs that differ between 17 and 27C in vir-1?

      A similar comment applies to the data in 7E. Please also compare individual mRNA polyA tail length across samples. What is the significance of the change in polyA tail length? The tails are shorter in vir-1 than Col at 27 C. But vir-1 has a very similar phenotype to WT at 27 C. At 17 C, vir-1 tails are longer than WT. Together, do these two results imply that polyA tail length is unlikely to be related to the observed phenotypes? In other words, if longer tails have no effect, do shorter tails? Is there any relationship between RNAs with altered polyA site usage or tail length and those mRNAs that are misexpressed in the mutant? Are immunity gene mRNAs more likely to be m6A modified than other mRNAs?

      Minor comments

      At times it felt like the authors were stretching to fill seven figure with data. For example, in Figure 1, it was not necessary to show the data on increased PR1 expression in 6 different sub panels (B-F) to convince the reader that PR1 expression was increased. A similar comment applies to Figure 3A-D. In Figure 3 please write the common gene names above the plots.

      In Supplemental Table 3 the Enriched GO Terms tab is blank. Supplementary File 1 appeared to be missing from the submission, so I could not evaluate the sequencing statistics (# of reads per sample, mapping %, etc). Many of the Supplemental Tables would benefit from a readme that describes what analysis was performed and what the different columns mean.

      Significance

      The manuscript provides additional insight on the functional consequences of disrupting adenosine methylation in RNA, identifying features of an autoimmune response. Given the ubiquity of m6A in RNA across eukaryotes, this is a result that will be of interest to basic researchers in the plant RNA modification community and likely those working in other eukaryotes. However, the study is not able to connect the inappropriate expression of immune response genes back to the function m6A in RNA, and the effects might be indirect. Although there is speculation that RNA that lacks m6A might trigger autoimmunity, the presented experiments do not directly test that hypothesis (nor do the authors claim to).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Enterovirus genomes contain an AUG triplet at the 3'-border of the IRES, often far upstream of the initiation codon for the principal ORF that encodes the viral polyprotein. Prior in vitro and in vivo studies have shown that this upstream AUG triplet (uAUG) initiates translation of a short polypeptide ("UP") encoded by an upstream ORF (uORF) that promotes viral infection in gut epithelial cells (Refs. 5, 6). In the present thorough and rigorously controlled study, O'Connor et al. extend these observations, thereby providing further insights into the regulatory and coding potential of translation of in alternate reading frames in viral mRNAs.

      They first undertook detailed analyses of almost 10000 enterovirus genomic sequences and determined that one third contained additional AUG triplets in the vicinity of the uAUG, collectively designated upstream uAUGs (uuAUGs), that could potentially initiate translation of uuORFs that are mostly very short but that in a few instances encode UP-related polypeptides.

      Systematic studies involving (a) ribosomal profiling and (b) the use of a dual luciferase reporter system showed that uuAUG triplets are recognized by ribosomes in infected cells and are functional albeit inefficient initiation codons. The uuAUG triplet in the enterovirus CVA-13 (Flores strain) initiates translation of an 8aa-long non-UP-like peptide, and the functional importance of this uuAUG was assayed by substituting it by a GUG triplet to downregulate uuORF translation. This mutation had no effect of infection in HeLa cells, but the uuAUG-containing (wt) virus had a competitive advantage over the mutant in mixed mutant/wt infections in terminally differentiated neuronal cells and in differentiated human intestinal organoids. This differential effect was similar to the previously reported competitive advantage conferred by UP expression during enterovirus infection in differentiated cells (Ref. 5). The function of non-UP-like proteins initiating at uuAUG codons remains unknown. However, elimination of stop codons that modify their length of uORFs modulated upstream ORF expression, although the mechanism responsible for this effect remains unknown. These results suggest that the interplay between initiation, termination and recycling steps on the 5'UTR of enteroviruses has the potential to affect viral pathogenicity.

      The data in the manuscript are strong, well controlled and validated. Elements of the manuscript could be presented more clearly.

      Minor comments

      1. Line 56. Domain 1 is a cloverleaf i.e. not just a stemloop.
      2. Fig. 4A, 5B, 8C. It would be informative to add an additional 5'-terminal nucleotide to the structure of the SL-VI region to show the Kozak context of the uuAUG codon.
      3. Figs. 8C, 8E, 8F. It might be more reader-friendly to replace structural models of sections of enterovirus 5'UTRs by a schematic representations to show uuORFs, uORFs, ppORFs etc and how altered stop codons affect their overlap. The corresponding section of the manuscript could also be presented in a more straightforward manner.
      4. Lines 473-4. This statement is incorrect, because eIF4G is required for IRES-dependent initiation. 2A-mediated cleavage of eIF4G does not abrogate IRES function because it splits off the non-essential N-terminal (eIF4E-binding) region from the critical C-terminal region that binds directly to enterovirus IRESs and recruits eIF4A (Ref. 7; PMID: 19470487).
      5. Ref. 27 is annotated incorrectly

      Significance

      The study reinforces and extends the authors' previous conclusions (Ref. 5, 13, 28, 31) that the genomes of positive-sense RNA viruses can and do have coding properties that are more complex than simply encoding a single open reading frame. Careful examination of a large panel of enterovirus genomes revealed a great diversity in coding potential, and the authors are right to suggest that further correlation of coding potential (particularly alternate ORFs/alternate reading frames) with pathogenic phenotypes is merited, particularly for variants of a single virus.

      This study also provides insights into the influence of alternative upstream open reading frames on viral fitness using strong experimental models (viral infection of differentiated cells and organoids in addition to HeLa cells), and appropriate methods (e.g. an innovative competition assay to compare the competitive advantage of co-infecting variants of a virus, sophisticated reporter assays). Although the mechanistic basis for the influence of uuORFs on enterovirus infection of cells remains to be fully elucidated, these studies indicate that the topic strongly merits further study. In consequence, this report will be of interest both to molecular virologists and to scientists with an interest in gene expression mechanisms.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study provides a detailed analysis of upstream and "upstream upstream" open reading frames in enteroviruses from the Cosackiepol and Alphacoxsackie species. The work includes a comprehensive bioinformatic analysis of uORF and uuORF diversity and conservation across the EVs, along with characterization of these ORFs by Riboseq and reporter assays. The authors also include a characterization of the uORFs through the mutation of these ORFs in both cell lines and iPSC derived cells.

      The manuscript is detailed, the experiments rigorous, and their description clear. The work provides a number of orthogonal experiments to support the claims of the study.

      Significance

      General assessment: This work is of high quality, and an important addition to the literature on uORFs, though it doesn't provide much mechanistic insight into the function of these ORFs. It falls short in pushing our understanding of uORF function forward.

      • I wonder if the authors can expand their studies to address the potential mechanisms by which these ORFs function, whether through their translation or translation products.
      • Have the authors considered exploring the possible functions similar to cellular transcripts, e.g. what they reference in the discussion regarding ATF4, by modulating stress responses and assessing expression of uORF and ppORF? These studies would greatly enhance the additional insights the manuscript provides.
      • A more comprehensive accounting of the ORF diversity across the EV's would be a valuable analysis. Are there ORFs in the negative strand (as have been characterized in influenza), or elsewhere in the positive strand, that may have functions?

      In Fig. 7A: "iPCS" -> "iPSC"

      Advance: This work builds on the authors' previous characterization of the uORFs of related EV's. It provides further support consistent with their previous findings that these uORFs are of importance of these regions in the replication of the virus, especially in differentiated target tissues, suggesting they contribute to the pathogenesis of the virus as part of the known important role the IRES regions are known to play. How the translation, or the products of translation, function to confer this phenotype remains elusive.

      Audience: This will be of interest to virologists working on cryptic translational elements in viruses, which are found in many viruses, and sure to be discovered in more as we begin to appreciate their important role, however the findings are not likely to be especially relevant to very broad audience.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We appreciated the constructive suggestions from the reviewers, and the explanation of the contribution of the manuscript. We have revised the manuscript in accordance with their suggestions, as discussed below.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __ This manuscript presents a computational analysis of PRC1, a passive microtubule crosslinker important for cell division, with a focus on its role in resisting force generation within antiparallel bundles, whose sliding is promoted by active kinesin motors. Using a previously developed simulator and several assumptions, the authors successfully recapitulated the two modes of PRC1 sliding resistance - coasting and braking - that were previously observed in in vitro reconstruction assays. The simulation also reproduces the redistribution of PRC1 within the overlap region as microtubules transition into the braking mode, a phenomenon also observed experimentally. An interesting outcome of the simulation is the change in spacing between the microtubules: The distance narrows as the sliding polymers switch from the coasting to the breaking mode, associated with an increased tilt of PRC1.

      Major comments: I find that this manuscript makes a valuable contribution to the cytoskeletal community, as the role of interfilament spacing in polymer assembly has been relatively underexplored, except for more classic studies such as those on muscle contraction and flagellar beating. What I had difficulty fully visualizing the model was the behavior of PRC1 during the coasting and braking modes. In my understanding, if individual heads of PRC1 bind and unbind to and from microtubules while microtubules that they crosslink slide apart, PRC1 should experience greater stretching and thus tilt more at higher sliding speeds. When the sliding slows down, the relative polymer position changes less within a given time, and PRC1 unbinding and re-binding would more easily reset their tilt to an equilibrium angle. However, the authors' simulation shows the opposite: PRC1 exhibits a greater tilt during the braking mode. This seems counterintuitive and a more detailed description and interpretation would worth. I suggest that the authors include a schematic illustrating the configuration of individual PRC1 molecules (e.g., angle and stretch) within the ensemble, particularly during their transition phase. This would greatly help readers grasp how this important protein ensemble switches its mechanical mode depending on polymer sliding and geometry.

      We thank the reviewer for the comments on the contribution of the results of the manuscript. Braking typically initiates at higher sliding speeds, when PRC1 do experience greater stretching and tilt more as the reviewer writes. As sliding slows down, the ability of PRC1 to unbind, re-bind, and rest their tilt to the equilibrium angle is restricted by the small distance between the microtubules: PRC1 binding will tend to occur tilted in the direction of sliding, and molecules tilted in this direction promote close separations, keeping overlaps braking. To clarify why braking overlaps are stable we added text and figure 4H. Steric interactions within the clusters at the overlap edges also restrict rebinding. To illustrate the behavior of PRC1 molecules during the transition from coasting to braking , we have added in figure 4A a schematic derived from simulation data of the microtubule and PRC1 positions, separations, and tilts during the transition from coasting to braking.

      Minor comments: 1. How was the bimodal velocity distribution (Fig. 1D) obtained experimentally? Were the individual data averaged over time from the start to the end of individual sliding events? If so, does mode switching within a pair lead to under/over-estimate of the coasting and braking speeds?

      These data are reproduced from Alfieri et al. Current Biology 2021. In that paper, we acquired this data by observing the sliding separation of PRC1-crosslinked microtubule pairs and recorded two distinct velocities for each pair: the “bundled” velocity when overlap>0 and PRC1 was engaged in crosslinking and then the “escaped” velocity once the two microtubules had separated. In the vast majority of cases (>90%) each of these velocities was well measured by fitting a slope to the kymograph, as there were only very minor deviations from a linear position-versus-time relationship (e.g. we rarely saw acceleration or deceleration within an individual pair). In the rare (

      Line 158 includes typo.

      We thank the reviewer for pointing out this typo, which has been corrected.

      The fixed-separation simulation in Fig. 3D is important for demonstrating the causality. How was the average speed (V_avg) calculated in this case? Specifically, do microtubule pairs that slide at coasting mode maintain a high speed over the entire sliding event when the inter-filament spacing is fixed at a large distance?

      We thank the reviewer for raising this point, which was not clear in the original manuscript. In the fixed-separation simulation of Fig 3D the average speed is calculated for the whole simulation. We have clarified this in the figure 3 caption. We have also added a supplementary figure showing the velocity distribution. The coasting pairs do maintain high speed over the event.

      In my understanding, the attractive and repulsive lateral forces exerted by PRC1 with positive and negative tilts arise because PRC1 has a natural tilt relative to the perpendicular. Is this correct? It would be helpful to illustrate this assumption in a figure to clarify the molecular behavior being modelled.

      The reviewer raises an important point that we have clarified in the revised manuscript. The linear (spring stretch/compression) force is the primary contributor to the attractive lateral force in both braking and coasting states. The torsional force that arises from the natural tilt of PRC1 does contribute significantly to repulsion between microtubules in the coasting state. We have clarified this in the text and added a supplementary figure showing the energy and forces from PRC1 molecules as a function of angle.

      In the paragraph starting from line 258, the authors discuss Ase1 and the yeast spindles. What is the relevance to PRC1 particularly in considering that Ase1 exerts an entropic force within the confined microtubule bundles to resist sliding (e.g., Lasky et al., 2015)?

      We thank the reviewer for raising this important point. It is true that Ase1 has been shown to generate entropic forces that work to push against microtubule sliding, while this specific behavior has not been observed for PRC1. We believe that such forces are likely to arise when Ase1 is in a coasting-like mode and the individual crosslinkers are free to diffuse within the confines of the overlap, which is the mechanism Lansky et al. propose. In this paragraph of the discussion, we are highlighting the experimental observation that microtubule-microtubule spacing significantly reduces as a yeast cell proceeds from metaphase to anaphase, with late anaphase MT separations measured to be ~15nm, similar to what we predict for microtubule pairs that have engaged in a braking mode. We therefore speculate that a coasting-to-braking transition may be more generally applicable across different spindle types, at least when involving MAP65 family members such as Ase1 and PRC1. In the yeast spindle, then, we speculate that when microtubule separation is larger, Ase1 would be arranged in a coasting-like mode of binding, capable of generating entropic forces. Later, it is possible the molecules switch to a more braking-like mode, where MT-MT spacing reduces significantly as shown in EM data from yeast spindles. It will be useful in the future to acquire similar data from mammalian spindles to determine if late anaphase midzone separation also compacts when PRC1 is present, which would further validate our predictions. We have clarified the discussion of this point in the revision.

      Fig. 1B, C would benefit from additional labels, as the colors in the images do not match those in the accompanying cartoon.

      We thank the reviewer for the suggestion, and have added additional labels.

      Reviewer #1 (Significance (Required)):

      As in my major comments above. My expertise is experimental biophysics on microtubules and motors.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __The paper presents simulations of sliding antiparallel microtubules linked by PRC1 crosslinking proteins. It aims to reproduce and explain experimental observations by Alfieri et al. that suggested that PRC1 could adopt two distinct modes of resistive force production against kinesin-driving sliding forces.

      The model which the authors propose is that antiparallel sliding leads to the accumulation of PRC1 at the edges, which results in higher tilt angles of PRC1 molecules and consequently smaller microtubule separation. In the higher tilt regime PRC1 can exert more braking forces since, its angle with the Microtubules is smaller. To my understanding the key parameters for this model to work it the spontaneous tilt angle, and torsional spring that PRC-1's structure encodes. The authors demonstrates that for reasonable values very good agreement with experimental observations can be reached. The simulations are done in the CYlAks framework, which the Betterton group developed and validated in earlier work. The discussion is clear and readable

      Major Comment: While the paper goes at great length to successfully reproduce experiments, it is not discussed how sensitive the model is to changes in parameters. In particular it remains unclear how sensitive the model is to changes in the torsional spring that is being used to model PRC-1. Given that this is key to the findings presented here, I would have hoped for a more extensive discussion of the relevant physics. In particular It should be discussed how non-linearities and asymmetries in the torsional spring would affect the phenomenon identified here.

      We appreciate the reviewer’s suggestion to examine sensitivity to variation in model parameters. We note that we do present in Figure 2 a smaller exploration of parameter space; when key values are modulated by an order of magnitude, we find differences in the simulated outputs (e.g. enhanced or reduced tip clustering in response to changes in MAP diffusion or end binding). We also note the supplementary information includes the effect of varying parameters including the strength and asymmetry of the torsional spring, which addresses the specific concern noted. Given the length of the current manuscript, we propose to delay a more extensive study of parameter sensitivity to future work.

      (Very) Minor remark: the orientation of PRC-1 molecules is inconsistent between figures.

      We thank the reviewer for pointing this out. We have edited the figures to make the orientation consistent.

      __Reviewer #2 (Significance (Required)): __ PRC-1 is an important cross-linking protein in cell division, and its mechanics is at the center of much current research interest. As such this paper is timely. The key physics that is interesting here is the link between geometry, PRC1-arrangements and geometry of the MT network. The authors reproduce successfully the experimental observations, with reasonable parameters. But a parameter study that exposes the physics at play, and would help the reader generalize the concepts at play is missing.

      In its current state the paper will be of interest to experimentalists and theoreticians working on cytoskeletal filament networks. But it could be even more so, if the authors sought to generalize beyond the experiment at hand.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Steckhahn and colleagues is a computational study of the mechanics of microtubule interactions with an essential mitotic crosslinker, PRC1. PRC1 is known to act as a molecular clutch, resisting the sliding of antiparallel microtubules in order to maintain mitotic spindle integrity. The present study aims to explain the recently discovered two modes of action of this clutch: a weakly resistant 'coasting' mode and a highly resistant 'braking' mode. The authors employ their previously developed Cytoskeleton Lattice-based Kinetic Simulator (CyLaKS) model to carry out Monte Carlo/Langevin dynamics simulations of microtubule sliding, driven by a mitotic kinesin and resisted by an ensemble of PRC1 crosslinkers, with explicit account of their diffusion, binding-unbinding kinetics, stretching-compression, and volume-exclusive interactions. Their reasonable model successfully reproduces the bimodal distribution of microtubule sliding rates, and offers a simple explanation of the two modes of action of the crosslinkers. According to the authors' conclusion, in the coasting mode PRC1 molecules are almost perpendicular to the microtubules, while the microtubules are separated by about 30 nm (close to the rest length of PRC1). When the overlap between the sliding microtubules shrinks, the PRC1 molecules cluster, which facilitates their tilting. This has two effects: a projection of force bringing microtubules closer together appears, and a projection of resistive force along the microtubule axis becomes substantial, enabling more efficient 'braking'.

      The key conclusions are convincing, clearly stated, and supported by data. The simulation techniques are justified and well described. I have no concerns about the technical side of this study.

      We thank the reviewer for their clear summary of the results of the paper.

      Reviewer #3 (Significance (Required))

      I believe this is a useful piece of work, which clarifies some important aspects of the PRC1 mechanism of action by showing that a simple but rigorous mechanical consideration is sufficient to explain the observed bimodal behavior of the mitotic crosslinkers. The findings could be interesting to biophysicists and cells biologists, interested in cytoskeleton and cell division.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Steckhahn and colleagues is a computational study of the mechanics of microtubule interactions with an essential mitotic crosslinker, PRC1. PRC1 is known to act as a molecular clutch, resisting the sliding of antiparallel microtubules in order to maintain mitotic spindle integrity. The present study aims to explain the recently discovered two modes of action of this clutch: a weakly resistant 'coasting' mode and a highly resistant 'braking' mode. The authors employ their previously developed Cytoskeleton Lattice-based Kinetic Simulator (CyLaKS) model to carry out Monte Carlo/Langevin dynamics simulations of microtubule sliding, driven by a mitotic kinesin and resisted by an ensemble of PRC1 crosslinkers, with explicit account of their diffusion, binding-unbinding kinetics, stretching-compression, and volume-exclusive interactions. Their reasonable model successfully reproduces the bimodal distribution of microtubule sliding rates, and offers a simple explanation of the two modes of action of the crosslinkers. According to the authors' conclusion, in the coasting mode PRC1 molecules are almost perpendicular to the microtubules, while the microtubules are separated by about 30 nm (close to the rest length of PRC1). When the overlap between the sliding microtubules shrinks, the PRC1 molecules cluster, which facilitates their tilting. This has two effects: a projection of force bringing microtubules closer together appears, and a projection of resistive force along the microtubule axis becomes substantial, enabling more efficient 'braking'. The key conclusions are convincing, clearly stated, and supported by data. The simulation techniques are justified and well described. I have no concerns about the technical side of this study.

      Significance

      I believe this is a useful piece of work, which clarifies some important aspects of the PRC1 mechanism of action by showing that a simple but rigorous mechanical consideration is sufficient to explain the observed bimodal behavior of the mitotic crosslinkers. The findings could be interesting to biophysicists and cells biologists, interested in cytoskeleton and cell division.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper presents simulations of sliding antiparallel microtubules linked by PRC1 crosslinking proteins. It aims to reproduce and explain experimental observations by Alfieri et al. that suggested that PRC1 could adopt two distinct modes of resistive force production against kinesin-driving sliding forces.

      The model which the authors propose is that antiparallel sliding leads to the accumulation of PRC1 at the edges, which results in higher tilt angles of PRC1 molecules and consequently smaller microtubule separation. In the higher tilt regime PRC1 can exert more braking forces since, its angle with the Microtubules is smaller. To my understanding the key parameters for this model to work it the spontaneous tilt angle, and torsional spring that PRC-1's structure encodes. The authors demonstrates that for reasonable values very good agreement with experimental observations can be reached. The simulations are done in the CYlAks framework, which the Betterton group developed and validated in earlier work. The discussion is clear and readable

      Major Comment:

      While the paper goes at great length to successfully reproduce experiments, it is not discussed how sensitive the model is to changes in parameters. In particular it remains unclear how sensitive the model is to changes in the torsional spring that is being used to model PRC-1. Given that this is key to the findings presented here, I would have hoped for a more extensive discussion of the relevant physics. In particular It should be discussed how non-linearities and assymetries in the torsional spring would affect the phenomenon identified here.

      (Very) Minor remark: the orientation of PRC-1 molecules is inconsistent between figures.

      Significance

      PRC-1 is an important cross-linking protein in cell division, and its mechanics is at the center of much current research interest. As such this paper is timely. The key physics that is interesting here is the link between geometry, PRC1-arrangements and geometry of the MT network. The authors reproduce successfully the experimental observations, with reasonable parameters. But a parameter study that exposes the physics at play, and would help the reader generalize the concepts at play is missing.

      In its current state the paper will be of interest to experimentalists and theoreticians working on cytoskeletal filament networks. But it could be even more so, if the authors sought to generalize beyond the experiment at hand.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript presents a computational analysis of PRC1, a passive microtubule crosslinker important for cell division, with a focus on its role in resisting force generation within antiparallel bundles, whose sliding is promoted by active kinesin motors. Using a previously developed simulator and several assumptions, the authors successfully recapitulated the two modes of PRC1 sliding resistance - coasting and braking - that were previously observed in in vitro reconstruction assays. The simulation also reproduces the redistribution of PRC1 within the overlap region as microtubules transition into the braking mode, a phenomenon also observed experimentally. An interesting outcome of the simulation is the change in spacing between the microtubules: The distance narrows as the sliding polymers switch from the coasting to the breaking mode, associated with an increased tilt of PRC1.

      Major comments:

      I find that this manuscript makes a valuable contribution to the cytoskeletal community, as the role of interfilament spacing in polymer assembly has been relatively underexplored, except for more classic studies such as those on muscle contraction and flagellar beating. What I had difficulty fully visualizing the model was the behavior of PRC1 during the coasting and braking modes. In my understanding, if individual heads of PRC1 bind and unbind to and from microtubules while microtubules that they crosslink slide apart, PRC1 should experience greater stretching and thus tilt more at higher sliding speeds. When the sliding slows down, the relative polymer position changes less within a given time, and PRC1 unbinding and re-binding would more easily reset their tilt to an equilibrium angle. However, the authors' simulation shows the opposite: PRC1 exhibits a greater tilt during the braking mode. This seems counterintuitive and a more detailed description and interpretation would worth. I suggest that the authors include a schematic illustrating the configuration of individual PRC1 molecules (e.g., angle and stretch) within the ensemble, particularly during their transition phase. This would greatly help readers grasp how this important protein ensemble switches its mechanical mode depending on polymer sliding and geometry.

      Minor comments:

      1. How was the bimodal velocity distribution (Fig. 1D) obtained experimentally? Were the individual data averaged over time from the start to the end of individual sliding events? If so, does mode switching within a pair lead to under/over-estimate of the coasting and braking speeds?
      2. Line 158 includes typo.
      3. The fixed-separation simulation in Fig. 3D is important for demonstrating the causality. How was the average speed (V_avg) calculated in this case? Specifically, do microtubule pairs that slide at coasting mode maintain a high speed over the entire sliding event when the inter-filament spacing is fixed at a large distance?
      4. In my understanding, the attractive and repulsive lateral forces exerted by PRC1 with positive and negative tilts arise because PRC1 has a natural tilt relative to the perpendicular. Is this correct? It would be helpful to illustrate this assumption in a figure to clarify the molecular behavior being modelled.
      5. In the paragraph starting from line 258, the authors discuss Ase1 and the yeast spindles. What is the relevance to PRC1 particularly in considering that Ase1 exerts an entropic force within the confined microtubule bundles to resist sliding (e.g., Lasky et al., 2015)?
      6. Fig. 1B, C would benefit from additional labels, as the colors in the images do not match those in the accompanying cartoon.

      Significance

      As in my major comments above. My expertise is experimental biophysics on microtubules and motors.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Firstly, we would like to thank the reviewers for their time and efforts in critiquing this paper. The reviewers addressed our study to be significant, but also presented great suggestions to improve our manuscript, mainly the comparison of mRNA and eRNA for predicting subtype specificity and prognosis, the integration with independent validation datasets, etc. Our preliminary analyses showed that our classified mRNAs can predict subtypes better which is not surprising, as these subtypes were initially discovered using mRNA differences. Hence, we employed a novel approach of associating these classified mRNA and eRNA with distance and identified 71% classified eRNAs are associated with classified mRNAs. We also propose to integrate the datasets with PEGS (Briggs et al 2021) to achieve better mRNA-eRNA association and Perturb-seq validated regions to achieve functional validation of the eRNA loci. We believe that our potential improved integrative analyses will improve the novelty and power of our findings, as this is an unique approach which is employed in patient samples-based high resolution eRNA atlas for the first time. We have addressed most of the other major and minor comments of the reviewers and have provided the preliminary revised manuscript.

      Reviewer #1

      Evidence, reproducibility and clarity

      Summary<br /> This study assesses eRNA activity as a classifier of different subtypes of breast cancer and as a prognosis tool. The authors take advantage of previously published RNA-seq data from human breast cancer samples and assess it more deeply, considering the cancer subtype of the patient. They then apply two machine learning approaches to find which eRNAs can classify the different breast cancer subtypes. While they do not find any eRNA that helps distinguish ductal vs. lobular breast cancers, their approach helps identify eRNAs that distinguish luminal A, B, basal and Her2+ cancers. They also use motif enrichment analysis and ChIP-seq datasets to characterize the eRNA regions further. Through this analysis, they observe that those eRNAs where ER binds strongest are associated with a poor patient prognosis.

      Major comments:

      Part of the rationale for this study is the previous observation that eRNAs are less associated with the prognosis of breast cancer patients in comparison to mRNAs and they claim that the high heterogeneity between breast cancer subtypes would mask the importance of eRNAs. In this study, the authors solely focus on eRNAs as a classification of breast cancer subtypes and prognostic tool and do not answer whether eRNAs or mRNAs are a better predictor of cancer subtypes and of prognosis. Since the answer and the tools are already in their hands, it would be important to also see a comparative analysis where they assess which of the two (mRNAs or eRNAs) is a better predictor.

      Response: We appreciate the reviewer for this valid point about comparing the prognostic eRNAs vs mRNAs. Our study doesn’t imply that eRNA markers are better than mRNAs in predicting subtype specificity and/or prognosis, but our motivation for working with eRNAs is that they can be used to define relevant transcriptional regulators and prognosis generally if they are subtyped. As the molecular subtypes in breast cancers were established using gene expression datasets, mRNAs would perform better as predictors of subtypes and or prognosis. However, identifying regulatory networks with emphasis on transcription factor binding motif analyses is not achievable using mRNA datasets. Analysing the active enhancer regions with eRNA transcription will provide high resolution landscape of TF and epigenetic networks. These sorts of analyses usually require ATAC-seq or H3K27ac datasets, but these assays need fresh frozen tissue material and laborious experimental designs compared to RNA-seq datasets. Furthermore, eRNA-transcribing enhancers represent highly active enhancers, while ATAC and H3K27ac datasets can identify all enhancers, which can be inactive or poised, but captured due to the dynamic nature of enhancers. We demonstrate that traditional RNA-seq datasets mapped on active enhancer regions showing eRNA transcription would be sufficient to identify the highly active TF network and gene-enhancer regulatory frameworks in a subtype-specific manner, hence emphasising the potential of eRNA studies.

      Hence, the scope of our study is not to establish which RNA can predict subtype and survival, but to demonstrate the potential of studying eRNAs in patient samples using traditional RNA-seq assays. This study would be beneficial for epigenetics biologists of how enhancer transcription can be associated with gene regulation through deregulated transcription factor networks in patients. The above section had been included in the discussion in the revised manuscript.

      As the comparative analyses suggested by the reviewer will substantiate the potential of eRNAs being studied as cancer prognostic markers, we performed identical methodologies with our machine learning approaches on the published TCGA mRNA-seq datasets, identify the subtype-specific mRNAs as well as prognostic mRNAs and perform the comparative analyses of eRNAs and mRNAs. As we expected, mRNAs indeed perform better in associating with subtype specificity than eRNAs as we could identify more subtype-specific mRNAs with better statistics metrics. The results exhibit great separation across subtypes (Basal, Her2, LumA/B) as well as Ductal vs Lobular.

      We believe that eRNA and mRNA are complementary but not comparative to predict subtype-specific survival. To address this in the revised manuscript, we performed an initial selection of the eRNAs associated with their corresponding subtype-specific mRNAs within 50 kb distance which can be integrated with the above analyses, based on the suggestion from reviewer 3. In our preliminary analysis, around 71% of eRNAs are associated with the subtype-specific mRNAs and we also observed an observable separation of ductal and lobular subtypes using this method.

      Furthermore, we integrated our enhancer RNAs with the key enhancer regions which show significant impact on gene transcription, as shown in single cell CRISPRi screens (Perturb-seq) datasets derived from ATAC-matched H3K27ac datasets verified on one ER+ and one ER- breast cancer cell lines (Wang et al., Genome Biology 2025, https://genomebiology.biomedcentral.com/articles/10.1186/s13059-025-03474-0) . Our initial analyses identified at least 29 regions from the Perturb-seq datasets overlapping with 72 and 5 eRNAs of subtype classification and Her2 survival respectively.

      For the revised manuscript, we will perform the mRNA-eRNA association in a detailed manner and include the data. We will also employ our well-established tool for associating mRNAs and noncoding elements, Peak set Enrichment in Gene Sets (PEGS, Briggs et al., F1000 research, 2021 https://f1000research.com/articles/10-570/v2 ). We hypothesise that this will improve the power of the classification models used in the study and will also provide gene-enhancer RNA interaction landscape in patient samples for the first time. Furthermore, we will integrate the activity of these eRNA-mRNA pairs with chromatin accessibility and enhancer activity using ATAC-seq and H3K27ac ChIP-seq datasets to establish more robust active regulatory networks in patient samples. We will also perform motif analyses on the published ATAC-seq peaks (performed on TCGA-BRCA patient samples, Corces et al., 2018) close to the eRNA loci to identify the TF networks with better precision, hopefully unravelling novel and relevant subtype-specific TFs in an efficient manner, better than our original work. Furthermore, as an experimental functional validation of our classified eRNAs, we will investigate the regulatory effect of 29 Perturb-seq overlapped regions. Hence, our revised manuscript will potentially provide a comprehensive validated list of enhancer RNA regions which are highly active, actively transcribing, subtype and survival specific regulatory networks in breast cancer patients for the first time.

      The authors run the umaps of Fig. 1C only taking the predictor eRNAs. It is then somewhat expected to observe a separation. Coming from a single-cell omics field, what I would suggest is to take the eRNA loci and compute a umap with the highly variable regions, perform clustering on it and assess how the cancer subtypes are structured within the data. This would give a first overview of how much segregation and structure one can have with this data. Having a first step of data exploration would also strengthen the paper. If the authors have tried it, could the authors comment on it?

      Response: We appreciate the reviewer for sharing their experience from single cell omics analysis. In our case, following the scRNA like pipeline is not appropriate, given the focus of our study on identifying markers on the already annotated subtypes. Basically, we aim to assess the quality of the identified markers (the quality is quantified by the statistics provided for random forest classification), and we see that the data is well-separated in PCA using only PC1 and PC2. We showed the umap (using PC1 and PC2) for better visualization in the original manuscript and we included the PCA plots in the revised manuscript.

      'neither measures could classify any distinct eRNAs for invasive ductal vs lobular cancer samples' S1B. Just by eye, I can see a potential enrichment of ductal on the left and on the right while lobular stays in the center. This suggests to me that, while perhaps each eRNA alone does not have the power to classify the lobular vs ductal subtype, perhaps there is a difference - which could result from a cooperative model of eRNA influence - that would need further exploration. Would a PCA also show enrichments of ductal vs. lobular in specific parts of the plot? It may be worth exploring the PC loadings to see which eRNAs could play an influence. In this regard, a more unbiased visual examination, as suggested in my previous point, could help clarify whether there could be an association of certain eRNAs that cannot be captured by ML.

      Response: The subtypes of cancer patients (Basal, Her2, LumA/B) possess clear differences in mRNA expression in breast cancer studies. Given the fixed annotations of the subtypes in the patient datasets, we applied our methodologies on mRNA datasets, and the results exhibited great separation across subtypes (Basal, Her2, LumA/B) as well as Ductal vs Lobular. In addition, 70% of subtype-specific eRNAs are located next to mRNA. This ensures that we detected proper eRNA markers. Furthermore, Random Forest is the standard and powerful non-linear classifier for these types of classifying questions. Therefore, we hypothesized that the data which can distinguish Ductal vs Lobular does not exist in the used eRNA dataset. We only detected 38 subtype-specific mRNAs using information gain with standard cutoff 0.05 which they have classifying power across ductal-lobular. With this standard cutoff only one eRNA-associated gene was detected. To explore more, we used low cutoff for information gain (0.01) and then took only the eRNAs which are located near classified mRNAs (up to 50KB). In this way, we detected 96 eRNA candidates linked to 8 classified mRNAs. These 96 eRNAs could, to some extent, classify ductal vs lobular (PCA plots attached above). This observation can further verify that if a more comprehensive eRNA dataset exists, we could detect better eRNA markers and cover more (probably all) mRNA markers. Hence, cooperative model of eRNA as suggested by the reviewer can't be achieved and random forest is one of the efficient tools to decipher the cooperation if it exists. Besides, as we demonstrated in this paper that eRNA is a complementary dataset to mRNA which can assist in the identification of regulatory networks. For the revision, we will provide more detailed eRNA-mRNA associations using integration with PEGS and Perturb-seq validated regions, in both subtype classification and survival and will motivate the potential similar studies for ductal vs lobular in the discussion.

      "we employed machine learning approaches on 302,951 eRNA loci identified from RNA-seq datasets from 1,095 breast cancer patient samples from previous studies" - the previous studies from which the authors take the data [11,12] highlight the presence of ~60K enhancers in the human genome and they use less than that in their analysis. Could the authors please clarify the differences in numbers with previous studies and give a reasoning?

      Response: ~300K enhancers are derived from ENCODE H3K27ac datasets which represents all active enhancer regions marked by H3K27ac (Hnisz et al., 2013). This is a high-resolution map of eRNA loci ever presented. In Chen et al 2020, 1,531 superenhancers representing 30K eRNA loci was utilised for exploratory analysis, and the findings were generalised back to the 300K set. 65K enhancer loci covers tissue-specific enhancers initially identified by FANTOM CAGE datasets and this subset provide limited regions of eRNA expression. Hence, our analyses on ~300K eRNA loci provide unbiased information on subtype specificity and gene-TF regulatory networks. The differences had been highlighted in the methods and results in the revised manuscript.

      Also, from the methods section, they discard many patient samples due to low QC, so, from what I understand, the number of samples analyzed in the end is 975 and not 1,095.

      Response: We thank the reviewer for pointing this out and we have updated the numbers in the revised manuscript.

      Minor comments:

      Can the authors please state the parameters of the umap in methods? Although it could be intrinsic to the dataset, data points are grouped in a way that makes me think that the granularity is too forced. Could the authors please show how the umap would behave with more lenient parameters? Or even with PCA?

      Response: We used ‘umap’ function from umap package (with default parameters) in R using only PC1 and PC2, hence the granularity is not forced. As suggested by the reviewer, we have now added PCA plots in the main figures (Fig. 1E) and moved all the umap plots to the Supplementary figures (Fig.S1B) in the revised manuscript.

      'Majority of the basal' -> The majority of the basal.

      Response: We thank the reviewers for noticing the typo and we corrected this in the revised manuscript.

      Significance

      This is a paper relevant in the cancer field, particularly for breast cancer research. The significance of the paper lies in digging into the breast cancer samples, taking the different existing subtypes into account to assess the contribution of eRNAs as a classifier and as a prognostic tool. The data is already available but it has not been studied to this degree of detail. It highlights the importance of characterizing cancer samples in more depth, considering its intrinsic heterogeneity, as averaging across different subtypes would mask biology. My expertise lies in gene regulation and single-cell omics. My contribution will therefore be more focused on the analysis and extraction of biological information. The extent of its specific relevance in cancer research falls beyond my expertise.

      Response: We appreciate the reviewer for understanding our efforts to bring out the importance of subtyping and to explore the association of eRNA in breast cancer transcriptional gene regulatory networks.

      Reviewer #2

      Evidence, reproducibility and clarity

      Summary<br /> Enhancer RNAs (eRNAs) are early indicators of transcription factor (TF) activity and can identify distinct molecular subtypes and pathological outcomes in breast cancer. In this study, Patel et al. analysed 302,951 polyadenylated eRNA loci from 1,095 breast cancer patients using RNA-seq data, applying machine learning (ML) to classify eRNAs associated with specific molecular subtypes and survival. They discovered subtype-specific eRNAs that implicate both established and novel regulatory pathways and TFs, as well as prognostic eRNAs -specifically, LumA and HER2-survival- that distinguish favorable from poor survival outcomes. Overall, this ML-based approach illustrates how eRNAs reveal the molecular grammar and pathological implications underlying breast cancer heterogeneity.

      Major comments

      1. The authors define 302,951 eRNA loci based on RNA-seq data, yet it is widely known that many enhancers reside in proximity to promoters or within intronic regions (examples presented in Fig. 3B and S3). Consequently, it seems likely that reads mapped to these regions might not truly represent eRNA signals but include mRNA contamination. Could the authors clarify how they ensured that the identified eRNAs were not confounded by mRNA reads? What fraction of these enhancer loci is promoter proximal or intronic? How does H3K4me3, a well-established and standardized active promoter histone mark, behave on these loci? The reviewer considers it important to confirm that the identified eRNAs are indeed of enhancer origin rather than promoter transcripts.

      Response: For this study, we utilised pan cancer atlas-based published work (Chen et al 2018 and 2020) where the abundant RNA signals on intronic and intergenic regions are included, and promoter-based signals are excluded. These studies utilise the advantage of identifying eRNAs on large sample size and the possibility of mRNA being on introns in 1000s of patient samples is very low. A clarification of this concern had been discussed in the Introduction of these studies as follows: “because eRNA reads associated with real enhancer activity recurrently accumulate, whereas background transcription noise tends to occur stochastically. The large number of RNA-seq reads obtained would compensate for the statistical power compromised by the low eRNA expression level typically observed in a single sample.” We included clarification of this concern in the discussion. Furthermore, as per the reviewer’s suggestion, we examined the distribution of the eRNA loci across the genome and found that majority of eRNA regions are located on introns and intergenic regions. This figure had been included in the Supplementary Fig. S6A.

      2. In Fig. 1B, the F measure (0.540) of the Basal subtype using the Logmc method contradicts its extremely high precision (1.000) and sensitivity (0.890). The authors need to clarify the exact formula or method used to compute F1 and the discrepancy in the reported metrics for this subtype and perhaps other subtypes as well.

      Response: We apologise for the mistake in this section and thank the reviewer for pointing this out. We included the formulas for each statistical metric in the method section of the manuscript. The F-measure was mentioned wrong which led to the confusion here. The figure had been corrected with the F-measure of 0.94 in the revised manuscript.

      3. As shown in Fig. 4C, S4B, and most, if not all, tracks of Fig. S3, ER binding regions are not annotated as eRNA loci. It seems, in this reviewer's opinion, very unlikely that this is because they generally lack eRNA expression, but rather they do not express polyadenylated eRNA (typically 1D eRNA), which is captured in this dataset. The reviewer posits that these enhancers produce more transient, non-polyadenylated 2D eRNA. It has been widely documented in prior studies that ER-bound enhancers exhibit bimodal eRNA expression patterns [e.g., Li, W. et al. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 498, 516-520 (2013)]. Could the authors address this opinion and elaborate on how the restriction to polyadenylated transcripts might underrepresent enhancers regulated by ER and other TFs and whether this bias impacts the overall findings?

      Response: The authors appreciate the reviewer’s suggestion to address the caveats of using polyadenylated eRNAs to identify the ER binding patterns. TCGA eRNA atlas with polyadenylated eRNAs indeed possesses this disadvantage of using polyadenylated eRNAs for this study, however currently there are no data available with bidirectional transcripts in any breast cancer patient samples. The tools to profile these RNAs are not robust enough to be performed on frozen cancer tissue samples which are extremely limited in their size and availability. By utilising the polyadenylated eRNA-seq datasets, we might not only lose the accuracy of ER binding patterns, but also for other transcription factors which activate/associate with bimodal expression around enhancers. However, our integrative analysis on stable polyadenylated eRNA loci can still identify the most-relevant TF networks of each subtype.

      Furthermore, we validated this finding by analysing our own datasets of KAS-seq which represents any active transcribing bidirectional enhancers from MCF7 cell line. Independently, we also incorporated ATAC-seq, H3K27ac ChIP-seq, CAGE and GRO-seq data on the gene profiles in Fig. S3 to associate the eRNA regions identified in polyadenylated RNA datasets with ER binding sites in patients and published bidirectional transcripts in the preliminarily revised manuscript. We observed that all the ER binding sites are accompanied by open and active enhancer marks with bidirectional transcription (either GRO- or CAGE positive) but they are not on the exact location of eRNA regions. Subtype-specific eRNA regions close to genes like MLPH and XBP1 possess both active bidirectional transcribing ER bound sites far away (around 1.5 kb) from subtype-specific eRNA loci and bidirectional transcribing ER unbound sites. However, these distal ER binding sites are close to the regions from the list of 300K eRNA loci and they were simply not identified as subtype-specific regions. Hence, it can be true that the occupancy of ER might not be present on all subtype-specific eRNA loci, but our subtype-specific eRNA sites are representative of bidirectional transcription.

      Upon the suggestion from the reviewer, we discussed the potential of identifying TF networks by analysing the 1D eRNAs, in the revised manuscript.

      4. Despite the unsatisfied performance of the ML approach on classifying Her2 subtypes, the hierarchical clustering performed in Fig. 2A and S2A appears to show a reasonable separation of Her2 subtypes, showing as a clustered green band. Could the authors quantitatively assess how effective this clustering results and compare that to the ML outcome? (OPTIONAL)

      Response: The authors acknowledge this interpretation from the reviewers. Using both the measures, our ML platform can identify markers for Her2 subtype but some of the statistical metrics are poor. As the heatmaps were performed based on these identified Her2 markers, a separate analysis on this cluster would not be much informative. The poor metrics for Her2 classification was already justified, partly due to the low number of Her2+ patients in the cohort.

      5. In Fig. 4 and S4, the authors reported to have enriched binding or motif of TFs, e.g., FOXA1, AP-2, and E2A, specifically at enhancer loci with low eRNA level, which conflicts with their established roles as transcriptional activators. The reviewer asks for an address as to why these factors would be associated with basal low-eRNA regions and whether any additional data might clarify their functional role in these contexts.

      Response: The authors appreciate the reviewer’s concern, but we would like to clarify that eRNAs which are less expressed in basal subtype are classified as basal low. These regions show high expression in luminal patients. Hence, there is a strong overlap of basal low and luminal high regions. FOXA1 and AP2 factors are strongly established coactivators in luminal ER+ transcriptional signaling, hence they are associated with basal low eRNA regions. We clarified this in the discussion and provided more literature evidence in the revised manuscript to demonstrate the strong role of FOXA1 and AP2 factors in ER+ luminal breast cancer transcriptional response.

      6. Regarding Fig. 4B, the authors state that "ER binding occupies only the strongest ssDNA and GRO-seq-positive sites". Firstly, the GRO-seq data quality is poor with indiscernible peaks. This may be insufficient for a qualified representation of nascent eRNA expression. More importantly, it appears each heatmap is ranked independently, so top loci for ssDNA are not necessarily top loci for GRO-seq, ER, Pol-II, or H3K27ac. The reviewer requests clarification on how the authors plot these heatmaps and questions whether the statement is supported by the analysis as presented.

      Response: We acknowledge the reviewer’s concern and based on their suggestion, we utilised another set of GRO-seq datasets which is more deeply sequenced and published by the same lab. The average plot from these new datasets showed better profile. We also apologize for not providing enough details of how we generated the heatmaps in Fig. 4B. The heatmaps were made separately for each profile to auto scale with their own intensity levels but the order of the regions is based on KAS-seq intensity. The order of these regions was kept the same between each profile. Hence, top loci of ssDNA are not exact top loci of GRO, ER, H3K27ac and Polymerase but top loci of ssDNA also show similar high intensity in GRO, ER, H3K27ac and Polymerase, hence correlated. We also removed regions which belong to blacklisted regions of hg38 and the regions which were over-sequenced due to amplifications and showed weird signals. We provided the new heatmaps and profile plots in the revised manuscript with different clusters of KAS-seq intensity. We also updated the methods section to clarify how these heatmaps were made.

      7. In Fig. S4B and the third plot of 4C, the averaged histogram of ER binding appears in multiple sharp peaks with drastic asymmetric positioning around the enhancer centre, which is highly atypical of most published ER ChIP-seq profiles. Could the authors discuss possible "spatial syntax" or directional patterns of ER binding in relation to eRNA loci and cite any literature showing a similar pattern? Further evidence is required to substantiate these observations, as they are remarkably unique.

      Response: The authors agree with the reviewer’s point about asymmetric peaks of ER on the luminal specific eRNA regions. Due to the nature of the average profile plots and the number of regions explored here are so low, the profiles look asymmetrical and different than the published literature. Heatmaps lose their resolution when made on a very low number of regions. The focus of this analysis is to highlight that the ER is not binding to the centre of eRNA loci which is contradictory to the published findings from in vitro studies, but further away on these subtype-specific regions. We don’t have any solid evidence to demonstrate the directional patterns of ER binding related to this data. To avoid any confusion, we removed these average plots but focused on the already existing single gene profiles in Fig. S3 and discussed our interpretations in detail.

      Minor comments<br /> 1. When introducing eRNAs, the reviewer recommends mentioning that 1) eRNA levels correlate with enhancer activity and 2) eRNA expression precedes target gene transcription, thus reflecting upstream regulatory events. Relevant references include: Arner, E. et al. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science 347, 1010-1014 (2015); Carullo, N. V. N. et al. Enhancer RNAs predict enhancer-gene regulatory links and are critical for enhancer function in neuronal systems. Nucleic Acids Res. 48, 9550-9570 (2020); Kaikkonen, Minna U. et al. Remodeling of the Enhancer Landscape during Macrophage Activation Is Coupled to Enhancer Transcription. Mol. Cell 51, 310-325 (2013).

      Response: These are great recommendations from the reviewer, and we included the suggested publications in the Introduction section of the revised manuscript.

      2. H3K27ac is used initially to define these regulatory loci, and like eRNAs, H3K27ac also varies among patients. Which H3K27ac dataset(s) were used initially, and could this approach potentially overlook patient-specific enhancers? (OPTIONAL)

      Response: This is a totally valid point from the reviewer. The idea of this project is to define common subtype-specific enhancers which can be regulatory and prognostic, hence can be developed further as biomarkers providing benefit for more patients in the future. Hence, investigating the common enhancers which are activated in multiple normal and cancer cell lines defined by ENCODE is more valid than patient-specific enhancers whose activity might be influenced by specific genetic alterations. There is very limited availability of H3K27ac ChIP-seq datasets from cancer patients to explore the patient-specific enhancers, and our analyses were totally based on the published work, hence not possible to fully address this concern. The source of the H3K27ac ENCODE datasets (from 86 human cell lines and tissue samples) is clarified in the revised manuscript.

      3. In addition to the overall metrics displayed in Fig. 2B, could the authors provide precision and sensitivity values for LumA and LumB separately under the Logmc method, given the observation in Fig. 2E that LumA and LumB are not well separated in the UMAP projection?

      Response: The authors appreciate the suggestion from the reviewer. We have included the metrics separately for LumA and LumB in the revised manuscript in Fig. S1D.

      4. Could the author elaborate, in the discussion section, on why there is a substantial difference in ML performance depending on whether InfoGain or Logmc is used?

      Response: We have included the following text in the discussion to explain the differences between these two measures.

      “InfoGain measure work with the approach of binarization with k-means (k=2). It has the potential to capture both strongly expressed eRNAs which are differential between subtypes as well as low expressed sparser on and off eRNAs. In the first case, although eRNA is highly expressed in all patients, the higher expression mode becomes 1 and the lower expressed mode become 0. However, in case of low expression, more on and off expression, recentered logmc would not generate a striking high value. Furthermore, binarization is also a strong process to perform better clustering and classification, as distinguishing between data points gets better and clearer. “

      5. How does the expression pattern of Basal high, Basal low, Her2, and Lum eRNA clusters behave differentially in Basal, Her2, and LumA/B subtypes? Are Basal high eRNAs downregulated in Her2 or Lum subtypes, and vice versa? Since many downstream analyses rely on these eRNA clusters, it is suggested to include a heatmap and/or boxplot that displays how each eRNA category is expressed in each subtype to confirm that these definitions are consistent.

      Response: We thank the reviewers for this suggestion and apologise for not providing enough clarification on the expression of eRNAs in other subtypes. Indeed, Basal high expressed eRNA are expressed low in LumA and LumB and Basal low expressed eRNAs are expressed higher in lumA and lumB. Her2 subtype-specific eRNAs has a trend of expression between Basal and Lum, as it can be seen in the umap and PCA. Basically, the Basal high expressed eRNAs are Lum lower expressed eRNAs, and the Basal low expressed markers are Lum higher expressed markers. As per the suggestion from the reviewer, we provided heatmaps on eRNA expression of each subtype-specific with regulation in other subtype patients in figure S2F-K.

      Referee cross-commenting

      I share Reviewer #1's opinion that the manuscript should assess whether mRNA or eRNA is the stronger predictor of breast cancer subtypes and clinical outcomes. It will greatly improve the novelty if eRNA is shown to be a better indicator for cancer characterization.

      Also, I strongly concur with Reviewer #3 that the current informatics approach is superficial and that several conclusions are contentious. The authors need to resolve the inconsistencies in their ML statistics and the potentially misleading interpretations of the ChIPseq and motif enrichment results.

      It is further recommended that, building Reviewer #3's comment, the study integrate eRNA signatures with their proximal genes to address 1) whether genes located near these enhancers are differentially expressed-and correlated with enhancer activity-across cancer subtypes, and 2) whether it provides insights into understanding the enhancer-gene regulatory architecture in a subtype-specific context.

      Response: We thank reviewer 2 for cross-commenting on reviewer 1 and 3’s suggestions. Indeed, these are interesting points to cover and will increase the novelty of the study. Based upon these suggestions and discussed earlier for reviewer 1’s comments, we will explore the comparison of mRNAs vs eRNAs as predictor of cancer subtypes and prognosis and the association of genes-eRNAs in cis as discussed in other reviewer’s comments. Our preliminary analyses show a strong association of eRNA and mRNA specific to subtypes and an observable separation on subtypes which were harder to classify markers using eRNAs alone. Hence, we will improve these analyses, and the manuscript further as discussed above in the final revision.

      Significance

      General Assessment

      This study provides insights into the potential use of eRNA to classify breast cancer subtypes and refine prognostic markers. A strength is the integration of large-scale RNA-seq data with machine learning to identify eRNA signatures in biologically-meaningful patient samples, revealing both established and novel TF networks. The study also discovered eRNA clusters that correlate with the survival of patients, thus providing strong clinical implications. However, the ML approach yields several inconsistencies-for instance, unsatisfactory classification results for the Her2 subtype as well as the confused statistical metrics in the results. Furthermore, the ML model struggles to differentiate more nuanced molecular classes (e.g., LumA vs. LumB) and higher-level histological subtypes (e.g., lobular vs. ductal), thus limiting its power to dissect more delicate pathological and molecular mechanisms. Another limitation worth noting of this ML approach is the exclusive use of only polyadenylated eRNAs via RNA-seq, which excludes perhaps the more prominent 2D eRNA expressed in regulatory enhancers. Moreover, certain datasets appear to be of suboptimal quality, leading to assertions that would benefit from additional supporting evidence. Altogether, while the study offers a promising angle on eRNA-based tumor stratification, more robust experimental validations are needed to resolve inconsistencies and clarify the mechanistic underpinnings.

      Advance<br /> Conceptually, the study highlights the potential for eRNA-based signatures to capture regulatory variation beyond classical markers. However, the utility of these signatures is constrained by the focus on polyadenylated transcripts alone, likely underrepresenting key enhancer regions, and certain evidence presented in this study is not substantial enough to support some statements. While the work adds an important dimension to the understanding of enhancer biology in breast cancer, the resulting insights are partly hampered by limitations in data coverage and quality.

      Audience<br /> The primary audience includes cancer epigenetics, functional genomics, and bioinformatics researchers who are interested in leveraging eRNAs as biomarkers and dissecting complex regulatory networks in breast cancer. Clinically oriented scientists focusing on molecular diagnostics may also find relevance in the authors' approach to stratify subtypes and outcomes. The research is most relevant to a specialized audience within basic and translational cancer genomics, as well as computational biology groups interested in eRNA analysis.

      Field of Expertise

      I evaluate this manuscript as a researcher specializing in cancer epigenetics, functional genomics, and NGS-based data analysis. Parts of the manuscript touching on clinical outcome measures may require additional review from practicing oncologists.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This study aims to classify prognostic and subtype-specific eRNAs in breast cancer, highlighting their potential as biomarkers.<br /> Data was analysed using existing machine learning algorithms,<br /> Data analysis is superficial and it is hard to understand the key significant findings.

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers.<br /> They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

      Weaknesses include: Most of the findings are purely correlation-based and also based on a reanalysis of published datasets; it would benefit from experimental validation to support their findings. Differential expression analysis of large datasets likely yields some differences in the transcriptome. How significant are these changes?<br /> Does the expression of eRNAs affect the expression of genes in cis? Although this analysis would provide some associated gene expression differences, it can also provide some insights into subtype-specific differences in gene expression programs.<br /> If the authors find experimental validations are not feasible, I recommend validating the eRNA signature in an independent dataset.

      Response: We acknowledge the weaknesses noticed by the reviewer from this study about the correlation-based analyses of published datasets. While the TCGA eRNA atlas datasets are reanalysed, these are the high-resolution maps ever published on eRNA expression on cancer patient samples, and our study is the first to establish the subtype specific classification of eRNAs. We believe that the eRNAs are biologically relevant, as they are strongly associated with the subtype-specific pathways and epigenetic regulators. Upon suggestion from the reviewers, we will explore the association of mRNAs and eRNAs in cis to establish further significance and relevance of the eRNAs we identified (discussed earlier in reviewer 1 comments).

      We would like to focus on studying the functional relevance of eRNAs as a separate project. In vitro studies to establish the knockdown of eRNAs are not straightforward due to the toxicity and non-specific targeting of the locked nucleic acids approach or Cas13-based RNA targeting. siRNA-based approaches don't target the nuclear eRNAs effectively, even though they were widely used by other labs to target eRNAs. Hence, a lot of effort on optimisations are needed to establish functional validation of our eRNAs, hence not under the scope and time frame of this study/revision. To provide validation and significance using independent datasets, we will explore the association of these factors with the expression of subtype-specific eRNAs further in our final revised manuscript using the tools explained above for reviewer 1 (PEGS and Perturb-seq integration). Integration of our classified eRNAs with the published Perturb-seq validated regions from ER+ and ER- breast cancer cell lines will provide the functional validation of patient-associated classified enhancer/eRNAs. Hence, our study would be the first to demonstrate the validated gene-enhancer regulatory networks from breast cancer patient datasets.

      Furthermore, we included the single gene visualisation profiles of independent datasets of ER ChIP-seq from different patients (Ross-Innes et al., 2012), ATAC-seq from TCGA patients (Corces et al., 2018), H3K27ac ChIP-seq datasets from cell lines (Theodorou et al., 2013 and Hickey et al., 2021) and GRO-seq and CAGE data published in MCF7 cells close to the eRNA regions and discussed their overlap with the eRNA regions in the revised manuscript. In the final revision, we will perform further detailed integration of all these profiles. Overall, our study will provide the integratory analysis of various independent epigenetic and functional profiles to validate our classified subtype and survival-specific eRNA regions.

      Here are major points; addressing these points in the revised version is important.

      From Figure 1B, what eRNAs were identified for LumB using log2MC?

      Response: The authors acknowledge the lack of analyses on LumB eRNAs in the original version of the manuscript. In the final revised manuscript after associating with mRNAs, we will provide the heatmaps, pathway analyses and other functional annotations for LumB specific eRNAs.

      Page 8 However, sensitivity and F-measure .... It would help to include the metrics for the number of patients in each subtype. The ratio of eRNAs/number of cases in each subtype would inform if the number of eRNAs is an outcome of no. of cases or subgroup-specific.

      Response: This is a great suggestion from the reviewer, and we included the number of patients for each subtype in the table in Fig. 1D. We observed that the basal patients are low in number, but we identified more basal eRNAs. Hence, the number of eRNAs identified in subtype-specific manner is not correlated to the number of patients in the cohort.

      Page 9 "Altogether, both measurements classify eRNAs efficiently based on subtypes, InfoGain allowed us to distinguish further samples based on high and low expression of eRNAs for basal subtype and performed better in statistical metrics" Based on statistical metrics, both models seem to be performing similarly except for Her2.

      Response: We apologise for this wrong interpretation. We corrected this in the revised manuscript at page 9.

      In Fig. 1B, the F-measure metrics are wrong for basal LogMC, as it is 0.94 rather than 0.54, which could lead to a misinterpretation of the model.

      Response: We apologise for the mistake in this figure, and we included the corrected heatmap in the revised manuscript.

      Many genome browser figures, including Figure S3. TFBS is not at the same site as eRNAs detected. Is there CAGE data to show that binding these TFs at these sites leads to the expression of eRNAs? That will give direct evidence that the eRNAs are transcribed due to these TFs

      Response: This is a great suggestion from the reviewer. We incorporated ATAC-seq, H3K27ac ChIP-seq, CAGE and GRO-seq data on the gene profiles in Fig. S3 to validate the activity of these ER binding sites in the preliminarily revised manuscript. We observed that all the ER binding sites are accompanied by open and active enhancer marks with bidirectional transcription (either GRO- or CAGE positive) but they are not on the exact location of eRNA regions (250-1000 bps away from the centre of ER binding site). Subtype-specific eRNA regions close to genes like MLPH and XBP1 possess active bidirectional transcribing ER binding sites far away from subtype-specific eRNA loci and also ER unbound sites. However, these distal ER binding sites are close to the regions from the list of 300K eRNA loci and they were simply not identified as subtype-specific regions.

      Page 10, There were 30 Her2-specific eRNA regions.... Do the same enhancers also regulate these genes as those from which eRNAs are transcribed? Is it cis-effect, or could these affect the trans-regulating of other genes?

      Response: We acknowledge the concern from the reviewer, however this is hard to be validated, as functional experiments to explore the 3D interactions of enhancers and gene promoters are not robust enough to be performed in patient samples and can't be performed within the revision time frame. In the final revised manuscript, we will explore the association of enhancers and promoters of ERBB2 with PEGS association as discussed above and with available HiC datasets in Her2+ cell lines (HCC1954, GSE167150, Kim et al., 2022 https://pubmed.ncbi.nlm.nih.gov/35513575/ )

      Minor comments:

      Page 8 "InfoGain meausure..." Fig. S2A also shows high and low expressed eRNAs for the basal group

      Response: We apologise for the lack of clarity here. InfoGain measure identifies both high and low expressed eRNAs in all patients showing similar pattern of regulation among patients. However, logmc derived eRNAs are highly expressed in most patients. Low expressed eRNAs could not be identified in logmc measure as strong as InfoGain regions. The text in the results had been edited in the revised manuscript to reflect better clarity on this point.

      Page 11, Our analyses also identified the role of another..... The statement is misleading as it is the enrichment of these TFs with the eRNAs<br /> Response: We included the word “enrichment” to clarify this statement.

      Page 13, "Around 90% of eRNAs are bidirectional and non-polyadenylated [53]. TCGA expression datasets are based on RNA-seq assays, which capture only non-polyadenylated RNAs. Thus, analysing the expression of eRNAs on mRNA-seq datasets might not be adequate". It is very confusing, please check<br /> Response: We apologise for the mistake, and this has been corrected in the revised manuscript.

      Reviewer #3 (Significance (Required)):

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers.<br /> They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-03004

      Corresponding author(s): Kentaro Furukawa and Tomotake Kanki

      1. General Statements [optional]

      We would like to thank the reviewers for their constructive and positive feedback. We are encouraged that all three reviewers consider the identification of Mfi2 as an outer mitochondrial membrane fission factor required for mitophagy to be a significant and important contribution to the research field. We acknowledge the concerns raised and propose the following plan to address them through additional experiments and clarifications. We believe that these revisions will further strengthen the manuscript and enhance its impact.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Furukawa and colleagues identified Mfi2 as novel factor that promotes fragmentation and removal of damaged mitochondria by mitophagy. They report that parallel loss of Dnm1 and Mfi2 blocks mitophagy. Mfi2 acts on the outer membrane, while the previous found Atg44 functions in the intermembrane space. How the proteins cooperate remains unknown. This is an elegant study with high-quality data. The findings are interesting for a broad readership. There are some issues as outline below that should be solved.

      Response:

      We would like to thank Reviewer #1 for their thoughtful evaluation of our manuscript and for recognizing the interest and quality of the study.

      1. It remains unclear how Mfi2 is anchored into the outer mitochondrial membrane. Does it contain a transmembrane domain? The carbonate resistance indicates the presence of such transmembrane domain. However, the presented structures lack such membrane-spanning segment. This point should be clarified.

      Response:

      We performed an in silico topology prediction of Atg44 and Mfi2 using TMHMM. This tool identified a weakly hydrophobic region of Mfi2 near the N-terminus but did not predict a definitive transmembrane domain (see new Fig. EV1E) (Page 6, lines 8-9). This result implies that Mfi2 interacts with the outer membrane in a monotopic or peripheral manner, rather than as a classical transmembrane protein. Such proteins may remain in the membrane pellet after carbonate treatment due to their strong hydrophobic insertion into the lipid bilayer (e.g., yeast tafazzin/Taz1; Brandner et al., Mol. Biol. Cell, 2005; DOI: 10.1091/mbc.E05-03-0256). We will incorporate this interpretation in the revised manuscript.

      How does Mfi2 cooperate with Dnm1? Is there any interaction between these proteins? Some further information could provide mechanistic insights into the function of Mfi2.

      Response:

      While our study does not explicitly suggest that Mfi2 cooperates with Dnm1, we plan to investigate whether these proteins physically associate. We will perform co-immunoprecipitation experiments under growing and mitophagy-inducing conditions to examine potential interactions between Mfi2 and Dnm1. Further insights into their interaction could help clarify the mechanistic role of Mfi2 in mitochondrial fission and mitophagy.

      The authors report a CL-dependent binding of Mfi2 to liposomes. Is the recruitment of Mfi2 to mitochondria impaired when CL-synthesis is blocked, e.g. in crd1delta mitochondria?

      Response:

      To assess the role of cardiolipin in Mfi2 localization, we will compare the efficiency of mitochondrial targeting of endogenous Mfi2 in WT and crd1Δ cells. Additionally, as mentioned in Reviewer #3's comment, we plan to perform coarse-grained molecular dynamics simulations to further investigate the interaction between Mfi2 and cardiolipin. The results of these simulations will be incorporated into the discussion to provide deeper mechanistic insights.

      Figure 4B: a wild-type control should be added.

      Response:

      We appreciate Reviewer #1’s suggestion to include a WT control in Figure 4B. However, given the focus of this figure on the rescue of mitophagy defects in the mfi2Δ dnm1Δ strain, we believe that adding a WT control is not essential for the analysis. The key comparison here is between the mfi2Δ dnm1Δ strain and the rescue conditions, and statistical analysis was performed to support the conclusions. We hope this clarifies our approach, but we will make adjustments if necessary.

      Reviewer #1 (Significance (Required)):

      The reported findings are interesting for a broad readership.

      Response:

      We appreciate Reviewer #1’s recognition of the relevance of our findings to a broad readership.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors discover a mitochondrial fission factor, termed Mfi2, that promotes mitophagy efficiency and that functions in a partially redundant way with Dnm1 for the fission of mitochondrial outer membranes during mitophagy. The discovery helps to clarify why Dnm1 does not appear to be essential for fission mediated mitophagy by Dnm1. Mfi2 is structurally similar to the inner membrane fission factor Atg44 which is consistent with Mfi2's fission activity. The authors show that Mfi2 has membrane fission activity towards nanotubes in vitro, and that membrane binding is dependent of high levels of cardiolipin, a mitochondrially enriched lipid. In summary, the authors show that Mfi2 mediates mitochondrial outer membrane fission together with Drp1, whereas Atg44 mediates inner membrane fission, that together are necessary for mitophagy.

      Response:

      We thank Reviewer #2 for the positive assessment and for clearly summarizing the main contributions of our work.

      Major: 1. Figure 2: How do the expression levels of the Mfi2 constructs compare to the endogenous levels of the protein? This will help to gauge to what degree Mfi2 N66 overexpression is needed to achieve mitochondrial fragmentation in Atg44 delta cells and also the low level of mitophagy rescue that was observed.

      Response:

      We used the TDH3 promoter for the expression of Mfi2 in Figures 2D and 2E. Unfortunately, our Mfi2 antibody only detects full-length Mfi2, as it recognizes a C-terminal region of the protein. This means we cannot directly compare the expression levels of Mfi2(N66) to those of endogenous full-length Mfi2. To clarify the expression levels, we will provide the following data:

      (1) Mfi2 antibody: Endogenous Mfi2(Full) and overexpressed Mfi2(Full)

      (2) FLAG antibody: Overexpressed Mfi2(Full)-FLAG and overexpressed Mfi2(N66)-FLAG

      Figure 3A-B: The cardiolipin binding results in vitro are interesting but the concentration of cardiolipin is much lower on the outer membrane versus the inner membrane. Can the authors comment on whether the cardiolipin levels used on the nanotubes are relevant to that of the mitochondrial outer membrane where Mfi2 is located? Can the authors provide quantitative data for these experiments to help strengthen their conclusions?

      Can the authors also use purified MBP alone or a form of Mfi2 that cannot bind to membrane e.g. Mfi2-C33) as a control?

      Response:

      We thank the reviewer for raising this important point regarding our cardiolipin-dependent in vitro data. In our experiments, we used 20 mol% cardiolipin (CL), a concentration higher than the typical levels in the mitochondrial outer membrane, which contains less than 5% CL. However, it is known that CL translocates to the outer membrane under mitophagy-inducing conditions (e.g., Chu et al., Nat Cell Biol, 2013; Kagan et al., Cell Death Differ, 2016). Our use of elevated CL levels aligns with standard practices in in vitro reconstitution assays to ensure adequate membrane curvature and charge density, which are necessary for robust and reproducible protein-membrane interaction assessments.

      To strengthen our conclusions, we will provide a quantitative analysis of the nanotube fission experiments. This will include the percentage of severed tubes under each condition, the total number of tubes analyzed (n), and the relationship between tube diameter and fission efficiency. These additional data will allow for a more thorough evaluation of the membrane fission activity of Mfi2.

      Furthermore, we will include control experiments using purified MBP alone and a membrane-binding-deficient mutant of Mfi2 (C33), as suggested by the reviewer.

      Figure 4D: The protrusions are very difficult to visualize. Can the authors also provide zoomed in regions. Is the data representative from 3 or more independent experiments? Can the authors provide a graph of the quantitation to aid readers with analysis of the data?

      Response:

      We thank the reviewer for this helpful suggestion. In the revised manuscript, we will provide higher magnification images to improve the visibility of mitochondrial protrusions. We confirm that the presented images are representative of results obtained from three independent experiments. Additionally, as requested, we will include a graph quantifying the frequency and morphology of protrusions to facilitate data interpretation.

      Figure 4D: It is fascinating to see the mitochondrial protrusion formation being dependent on autophagy factors but not mitochondrial fission factors. To help visualize this, can the authors image one of either Atg1, Atg8 to address whether phagophores are forming on the protrusions and if so where they are positionally located on the protrusion in control and/or mfi2,dnm1,atg44 triple mutant cells?

      Response:

      We thank the reviewer for this insightful comment. In our previous study (Fukuda et al., Mol Cell, 2023), we demonstrated that Atg proteins, such as Atg8, accumulate at mitochondrial protrusions formed in atg44Δ cells, suggesting that these structures can serve as sites for phagophore assembly. However, as in our previous microscopy analysis, the resolution limitations of our imaging system make it difficult to precisely determine the exact location of phagophores on the protrusions.

      Whether similar recruitment occurs in the absence of both Mfi2 and Dnm1 remains untested. To address this, we will perform fluorescence imaging of fluorescent protein tagged Atg proteins, such as GFP-Atg8, in mfi2Δ dnm1Δ atg44Δ triple mutant cells to examine whether phagophores form on the mitochondrial protrusions under these conditions. This will help us determine whether phagophore formation requires mitochondrial fission or occurs independently of it.

      Minor: 1. Is it possible to target Atg44 to the mitochondrial outer membrane, either by attaching an OM anchor or using part of the N-terminus of Mfi2? This will help elucidate how Mfi2 reaches the outer membrane and whether Atg44 can be just as active on the outer membrane as long as it can access it.

      Response:

      We thank the reviewer for this suggestion. We will construct chimeric proteins between Atg44 and Mfi2 and examine where such proteins are localized. Additionally, we will assess whether these chimeric proteins have the functional activity of Mfi2, as this will help determine if Atg44 can be active on the mitochondrial outer membrane when properly targeted.

      Are microtubules or actin required for the protrusions to form? Using the triple mutant cells that have a high proportion of protrusions, it could be tried to add cytoskeletal depolymerizing drugs such as nocodazole for microtubules or Latrunculin A or Latrunculin B for actin.

      Response:

      We thank the reviewer for this suggestion. We will test the effect of cytoskeletal depolymerizing drugs on protrusion formation in the mfi2Δ dnm1Δ atg44Δ triple mutant cells.

      Reviewer #2 (Significance (Required)):

      Significance: The discovery of Mfi2 as an outer membrane mitophagy fission factor is an exciting, and very important and significant contribution to the field. The data are in this study are clear and the conclusions are generally well supported by the experiments. This study appears to be suitable as a report style manuscript given that there is limited mechanistic analysis of Mfi2 activity. This does not affect the importance of the work, it just means that it is suited as a report of a significant discovery. Overall, this fills an important knowledge gap in solving the mystery behind which factors are involved in mitochondrial outer membrane fission during mitophagy, and provides a clarification why Dnm1 loss alone minimally affects mitophagy. This work will appeal to researchers interested in mitochondrial biology, the autophagy field, and cell biologists interested in organelle membrane dynamics, and is also broadly important and interesting to all cell biologists.

      Reviewer expertise: mitophagy mechanisms, cell biology of mitophagy, autophagy and autophagosome formation, mitochondrial biology including OXPHOS and mitochondrial dynamics

      Response:

      We appreciate Reviewer #2’s comments on the importance and potential impact of our discovery for the mitophagy and cell biology fields.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Furukawa et al. presents a well-structured and thorough study identifying Mfi2 as a novel mitochondrial outer membrane-resident fission factor required for mitophagy in Saccharomyces cerevisiae. The authors demonstrate that Mfi2, together with the inner membrane mitofissin Atg44 and the dynamin-related GTPase Dnm1, contributes to mitochondrial fragmentation during mitophagy. Importantly, they show that while Dnm1 is dispensable on its own, Mfi2 and Dnm1 act redundantly from the outer membrane to support Atg44-mediated fission. The data are robust, the figures are clear, and the mechanistic insight into how mitophagy-specific fission is achieved is of high relevance to the field of mitochondrial quality control.

      Overall, this is a logically constructed and convincing study with important implications for understanding compartment-specific mechanisms of mitochondrial fission during selective autophagy. The conclusions are largely well supported by the data. However, a few issues and points of clarification should be addressed before publication.

      Response:

      We thank Reviewer #3 for the careful and constructive review and for acknowledging the logical structure and robustness of our data.

      Major Comments

      1. The observation that both Mfi2 and Atg44 require high cardiolipin (CL) content for membrane binding and fission in vitro is intriguing, especially given that CL is enriched in the inner membrane. The authors mention CL externalisation during mitophagy, but this connection could be made more explicit earlier in the manuscript. Furthermore, since the molecular mechanism of membrane interaction remains unresolved, I would strongly encourage the authors to undertake coarse-grained molecular dynamics simulations to explore how Mfi2 might interact with lipid bilayers of differing composition. This could clarify the role of CL and the potential structural contribution of the disordered C-terminal region. Response:

      We thank the reviewer for highlighting the need to clarify the connection between CL externalization and the observed CL-dependent membrane binding and fission activity of Mfi2 and Atg44. While we briefly mentioned CL externalization during mitophagy in the Discussion, we agree that this connection should be made more explicit earlier in the manuscript. In the revised version, we will incorporate a brief rationale in the Results section to clarify that CL translocates to the mitochondrial outer membrane under mitophagy-inducing conditions (e.g., Chu et al., Nat Cell Biol 2013). This will provide a physiological basis for our in vitro reconstitution assays using CL-containing liposomes.

      We also appreciate the reviewer’s suggestion to explore the molecular basis of Mfi2-lipid interaction through coarse-grained molecular dynamics (CGMD) simulations. In collaboration with Dr. Yuji Sakai, we will perform coarse-grained molecular dynamics (CGMD) simulations to investigate how Mfi2 interacts with lipid bilayers of varying compositions, focusing particularly on the role of cardiolipin and the structural contribution of the disordered C-terminal region. If successful, we will include the results in the revised manuscript.

      While the genetic and phenotypic data indicate that Mfi2 and Dnm1 act independently to support mitochondrial fission, the spatial and temporal organisation of their activity during mitophagy remains unclear. Do Mfi2 and Dnm1 colocalise at fission sites, or do they act at separate subdomains of the outer membrane? Live-cell imaging with fluorescently tagged Mfi2 and Dnm1, particularly during mitophagy induction, could help clarify whether these factors act in concert or at distinct locations and time points. This would also help determine whether their apparent redundancy reflects parallel mechanisms or functional compensation at shared sites. It would also be interesting to combine this with Atg44.

      Response:

      We thank the reviewer for this insightful comment. We plan to perform co-localization analysis of Mfi2 and Dnm1 during mitophagy induction to clarify whether these proteins colocalize at fission sites or act at separate subdomains of the outer membrane. Additionally, we will conduct co-immunoprecipitation experiments of Mfi2 and Dnm1 (see also Response to Reviewer #1’s major comment 2) to further investigate their potential interaction. It is challenging to analyze Mfi2, Dnm1, Atg44, and mitochondrial fission sites simultaneously, as fluorescence-tagged Atg44 has been shown to lose its function (Fukuda et al., Mol Cell, 2023).

      Minor Comments

      1. The sodium carbonate extraction and proteinase K assays (Figure 1E-F) are standard but may not be familiar to all readers. A brief explanatory sentence clarifying what these methods reveal about membrane topology would improve accessibility. Response:

      We thank the reviewer for this helpful comment. We have added a brief explanatory sentence in the revised manuscript to clarify the principles and interpretation of the sodium carbonate extraction and proteinase K assays (Page 5, lines 23-25; Page 6, lines 1-3).

      While immunoblot quantifications are shown throughout, it would be helpful to include statistical analysis where appropriate, especially in cases where differences between genotypes or constructs are modest.

      Response:

      Statistical analyses have been added for immunoblot quantifications where appropriate, particularly in cases where differences between genotypes or constructs are modest.

      The naming of Mfi2 as a mitofissin is consistent with previous terminology introduced for Atg44, but the term remains relatively new. A brief clarification distinguishing "mitofissin" from the better-known "mitofusin" family in mammals would help avoid confusion for readers less familiar with yeast-specific nomenclature.

      Response:

      We have added a brief explanation of the term "mitofissin" to distinguish it from the mammalian "mitofusin" family in Introduction (Page 3, line 26-Page 4 line 1).

      Reviewer #3 (Significance (Required)):

      This is a strong and well-executed study that provides mechanistic insight into how mitochondrial fission is coordinated during mitophagy in yeast. A major strength is the identification and characterisation of Mfi2 as a previously unrecognised outer membrane fission factor acting in parallel with Dnm1 and in coordination with the intermembrane space protein Atg44. The genetic, imaging, and in vitro biochemical data are carefully integrated, and the authors are transparent about limitations, including open questions around the C-terminal domain of Mfi2, CL dependence, and the evolutionary conservation of mitofissins.

      The work makes a conceptual advance by showing that mitophagy-specific mitochondrial fission requires the cooperation of spatially separated factors acting from both the inside and outside of mitochondria, a mechanism that had not been fully appreciated. This study helps resolve previous contradictions regarding the dispensability of Dnm1 in mitophagy, thereby filling a gap in our understanding of organelle-specific fission. While the findings are focused on yeast, they raise broader questions about whether similar principles apply to higher eukaryotes (historically yeast research was always at the forefront of autophagy field).

      The study will be of interest to specialists in autophagy, mitochondrial dynamics, and yeast cell biology, as well as researchers working on membrane remodelling and organelle quality control. While the audience is primarily specialised, the conceptual insights will resonate more broadly in the cell biology community.

      I am an expert in mitophagy mechanisms in mammalian cells, and while not a specialist in yeast models, I found the study logical, rigorous, and of clear relevance to the broader autophagy field.

      Response:

      We are grateful for Reviewer #3’s recognition of the conceptual advance provided by our study and its relevance beyond yeast biology.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Responses to Reviewer #1:

      ・We performed in silico topology prediction of Atg44 and Mfi2 using TMHMM. This tool identified a weakly hydrophobic region of Mfi2 near the N-terminus but did not predict a definitive transmembrane domain (new Fig. EV1E) (Page 6, lines 8-9).

      Responses to Reviewer #3:

      ・We have added a brief explanatory sentence in the revised manuscript to clarify the methods and interpretation of the sodium carbonate extraction and proteinase K assays (Page 5, lines 23-25; Page 6, lines 1-3).

      ・Statistical analyses have been added for immunoblot quantifications where appropriate, particularly in cases where differences between genotypes or constructs are modest.

      ・We have added a brief explanation of the term "mitofissin" to distinguish it from the mammalian "mitofusin" family in Introduction (Page 3, line 26-Page 4, line 1).

      4. Description of analyses that authors prefer not to carry out

      Response to Reviewer #1 (Major 4):

      We will not include the WT strain as a control. See our response.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Furukawa et al. presents a well-structured and thorough study identifying Mfi2 as a novel mitochondrial outer membrane-resident fission factor required for mitophagy in Saccharomyces cerevisiae. The authors demonstrate that Mfi2, together with the inner membrane mitofissin Atg44 and the dynamin-related GTPase Dnm1, contributes to mitochondrial fragmentation during mitophagy. Importantly, they show that while Dnm1 is dispensable on its own, Mfi2 and Dnm1 act redundantly from the outer membrane to support Atg44-mediated fission. The data are robust, the figures are clear, and the mechanistic insight into how mitophagy-specific fission is achieved is of high relevance to the field of mitochondrial quality control. Overall, this is a logically constructed and convincing study with important implications for understanding compartment-specific mechanisms of mitochondrial fission during selective autophagy. The conclusions are largely well supported by the data. However, a few issues and points of clarification should be addressed before publication.

      Major Comments

      1. The observation that both Mfi2 and Atg44 require high cardiolipin (CL) content for membrane binding and fission in vitro is intriguing, especially given that CL is enriched in the inner membrane. The authors mention CL externalisation during mitophagy, but this connection could be made more explicit earlier in the manuscript. Furthermore, since the molecular mechanism of membrane interaction remains unresolved, I would strongly encourage the authors to undertake coarse-grained molecular dynamics simulations to explore how Mfi2 might interact with lipid bilayers of differing composition. This could clarify the role of CL and the potential structural contribution of the disordered C-terminal region.
      2. While the genetic and phenotypic data indicate that Mfi2 and Dnm1 act independently to support mitochondrial fission, the spatial and temporal organisation of their activity during mitophagy remains unclear. Do Mfi2 and Dnm1 colocalise at fission sites, or do they act at separate subdomains of the outer membrane? Live-cell imaging with fluorescently tagged Mfi2 and Dnm1, particularly during mitophagy induction, could help clarify whether these factors act in concert or at distinct locations and time points. This would also help determine whether their apparent redundancy reflects parallel mechanisms or functional compensation at shared sites. It would also be interesting to combine this with Atg44.

      Minor Comments

      1. The sodium carbonate extraction and proteinase K assays (Figure 1E-F) are standard but may not be familiar to all readers. A brief explanatory sentence clarifying what these methods reveal about membrane topology would improve accessibility.
      2. While immunoblot quantifications are shown throughout, it would be helpful to include statistical analysis where appropriate, especially in cases where differences between genotypes or constructs are modest.
      3. The naming of Mfi2 as a mitofissin is consistent with previous terminology introduced for Atg44, but the term remains relatively new. A brief clarification distinguishing "mitofissin" from the better-known "mitofusin" family in mammals would help avoid confusion for readers less familiar with yeast-specific nomenclature.

      Significance

      This is a strong and well-executed study that provides mechanistic insight into how mitochondrial fission is coordinated during mitophagy in yeast. A major strength is the identification and characterisation of Mfi2 as a previously unrecognised outer membrane fission factor acting in parallel with Dnm1 and in coordination with the intermembrane space protein Atg44. The genetic, imaging, and in vitro biochemical data are carefully integrated, and the authors are transparent about limitations, including open questions around the C-terminal domain of Mfi2, CL dependence, and the evolutionary conservation of mitofissins.

      The work makes a conceptual advance by showing that mitophagy-specific mitochondrial fission requires the cooperation of spatially separated factors acting from both the inside and outside of mitochondria, a mechanism that had not been fully appreciated. This study helps resolve previous contradictions regarding the dispensability of Dnm1 in mitophagy, thereby filling a gap in our understanding of organelle-specific fission. While the findings are focused on yeast, they raise broader questions about whether similar principles apply to higher eukaryotes (historically yeast research was always at the forefront of autophagy field).

      The study will be of interest to specialists in autophagy, mitochondrial dynamics, and yeast cell biology, as well as researchers working on membrane remodelling and organelle quality control. While the audience is primarily specialised, the conceptual insights will resonate more broadly in the cell biology community.

      I am an expert in mitophagy mechanisms in mammalian cells, and while not a specialist in yeast models, I found the study logical, rigorous, and of clear relevance to the broader autophagy field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors discover a mitochondrial fission factor, termed Mfi2, that promotes mitophagy efficiency and that functions in a partially redundant way with Dnm1 for the fission of mitochondrial outer membranes during mitophagy. The discovery helps to clarify why Dnm1 does not appear to be essential for fission mediated mitophagy by Dnm1. Mfi2 is structurally similar to the inner membrane fission factor Atg44 which is consistent with Mfi2's fission activity. The authors show that Mfi2 has membrane fission activity towards nanotubes in vitro, and that membrane binding is dependent of high levels of cardiolipin, a mitochondrially enriched lipid. In summary, the authors show that Mfi2 mediates mitochondrial outer membrane fission together with Drp1, whereas Atg44 mediates inner membrane fission, that together are necessary for mitophagy.

      Major:

      1. Figure 2: How do the expression levels of the Mfi2 constructs compare to the endogenous levels of the protein? This will help to gauge to what degree Mfi2 N66 overexpression is needed to achieve mitochondrial fragmentation in Atg44 delta cells and also the low level of mitophagy rescue that was observed.
      2. Figure 3A-B: The cardiolipin binding results in vitro are interesting but the concentration of cardiolipin is much lower on the outer membrane versus the inner membrane. Can the authors comment on whether the cardiolipin levels used on the nanotubes are relevant to that of the mitochondrial outer membrane where Mfi2 is located? Can the authors provide quantitative data for these experiments to help strengthen their conclusions? Can the authors also use purified MBP alone or a form of Mfi2 that cannot bind to membrane e.g. Mfi2-C33) as a control?
      3. Figure 4D: The protrusions are very difficult to visualize. Can the authors also provide zoomed in regions. Is the data representative from 3 or more independent experiments? Can the authors provide a graph of the quantitation to aid readers with analysis of the data?
      4. Figure 4D: It is fascinating to see the mitochondrial protrusion formation being dependent on autophagy factors but not mitochondrial fission factors. To help visualize this, can the authors image one of either Atg1, Atg8 to address whether phagophores are forming on the protrusions and if so where they are positionally located on the protrusion in control and/or mfi2,dnm1,atg44 triple mutant cells?

      Minor:

      1. Is it possible to target Atg44 to the mitochondrial outer membrane, either by attaching an OM anchor or using part of the N-terminus of Mfi2? This will help elucidate how Mfi2 reaches the outer membrane and whether Atg44 can be just as active on the outer membrane as long as it can access it.
      2. Are microtubules or actin required for the protrusions to form? Using the triple mutant cells that have a high proportion of protrusions, it could be tried to add cytoskeletal depolymerizing drugs such as nocodazole for microtubules or Latrunculin A or Latrunculin B for actin.

      Significance

      The discovery of Mfi2 as an outer membrane mitophagy fission factor is an exciting, and very important and significant contribution to the field. The data are in this study are clear and the conclusions are generally well supported by the experiments. This study appears to be suitable as a report style manuscript given that there is limited mechanistic analysis of Mfi2 activity. This does not affect the importance of the work, it just means that it is suited as a report of a significant discovery. Overall, this fills an important knowledge gap in solving the mystery behind which factors are involved in mitochondrial outer membrane fission during mitophagy, and provides a clarification why Dnm1 loss alone minimally affects mitophagy. This work will appeal to researchers interested in mitochondrial biology, the autophagy field, and cell biologists interested in organelle membrane dynamics, and is also broadly important and interesting to all cell biologists.

      Reviewer expertise: mitophagy mechanisms, cell biology of mitophagy, autophagy and autophagosome formation, mitochondrial biology including OXPHOS and mitochondrial dynamics

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Furukawa and colleagues identified Mfi2 as novel factor that promotes fragmentation and removal of damaged mitochondria by mitophagy. They report that parallel loss of Dnm1 and Mfi2 blocks mitophagy. Mfi2 acts on the outer membrane, while the previous found Atg44 functions in the intermembrane space. How the proteins cooperate remains unknown. This is an elegant study with high-quality data. The findings are interesting for a broad readership. There are some issues as outline below that should be solved.

      1. It remains unclear how Mfi2 is anchored into the outer mitochondrial membrane. Does it contain a transmembrane domain? The carbonate resistance indicates the presence of such transmembrane domain. However, the presented structures lack such membrane-spanning segment. This point should be clarified.
      2. How does Mfi2 cooperate with Dnm1? Is there any interaction between these proteins? Some further information could provide mechanistic insights into the function of Mfi2.
      3. The authors report a CL-dependent binding of Mfi2 to liposomes. Is the recruitment of Mfi2 to mitochondria impaired when CL-synthesis is blocked, e.g. in crd1delta mitochondria?
      4. Figure 4B: a wild-type control should be added.

      Significance

      The reported findings are interesting for a broad readership.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers and Revision Plan

      We thank all three reviewers for their thoughtful and constructive comments. We are pleased that the reviewers found our work to be "very interesting," "well written," with "high quality" data that is "convincing" and will be "of broad interest for the community of axon guidance, circuit formation and brain development." We particularly appreciate the recognition that our study provides "novel functions for Cas family genes in forebrain axon organization" and uses "state-of-the art mouse genetics" with "quantitative and statistical rigor." Below are our detailed responses to each reviewer's comments, including extensive additional experiments and analyses that we will perform to significantly strengthen the manuscript.

      Reviewer #1

      We thank this reviewer for recognizing that our experiments are "carefully done and quantified" with "clear and striking" phenotypes that "support most of the conclusions in the manuscript." We appreciate their acknowledgment that this work will be "of interest to developmental neurobiologists and the axon guidance and adhesion fields."

      Major Comments:

      __ Authors clearly show that misplaced TCA axons are coordinate with cortical layer defects, with misplaced tbr1+ neurons, in EMX-Cre cas and integrin knockouts, suggesting these axons are following misplaced cells. These results are described as 100% coordinate, but since there is no figure of quantification, authors need to clarify how many embryos were examined for each genotype, as this was not described in results or legends.__ We apologize for this oversight and will provide detailed quantification of this important finding. We examined a total of 11 Emx1Cre;TcKO embryos with 13 controls, and 14 Emx1Cre;Itgb1 embryos with 13 littermate controls at two developmental stages (E16.5 and P0) to quantify the coordination between misplaced Tbr1+ neurons and cortical bundle formation. This quantification will be presented in the main text and figure legend.

      Here's a more detailed breakdown of those numbers: For Emx1Cre;TcKO knockouts, we examined 7 controls and 5 mutants at P0, and 6 controls and 6 mutant embryos at E16.5. For the Emx1Cre;Itgb1 knockouts, we examined 5 controls and 6 mutant neonates at P0, and 8 controls and 8 mutant embryos at E16.5.

      __ Are the neurons not misplaced in Nex cre cas or integrin knockouts? One would think presumably not, but then what are the tbr1+ cell migration defect caused by? I struggle with the semantics of non-neuronal autonomous role of cas in cortex, since tbr1+ neurons are misplaced, and this is what the axons are mistargeting too. So yes, potentially cas or b1 is not needed in those neurons, but those misplaced neurons are presumably driving the phenotype.__

      We agree that this important point requires better explanation. You are absolutely correct that Tbr1+ neurons are not misplaced in NexCre;TcKO mutants (Wong et al., 2023), which is precisely why these animals do not exhibit cortical bundle formation. In addition to our previously published data showing normal location of Tbr1+ neurons in those mutants, we can also provide similar analysis at E16.5 and P0 as a supplemental figure. The model we propose is that Cas genes are required in radial glial cells for proper positioning of deep layer cortical neurons. These correctly positioned neurons, in turn, provide appropriate guidance cues for TCA projections. Hence, our model is that while the role of Cas genes is non-neuronal-autonomous (acting in radial glia rather than in the neurons themselves), the mispositioned Tbr1+ neurons in Emx1Cre;TcKO mutants drive the TCA misprojection phenotype. We will clarify this mechanism in the discussion and provide a new graphical model as a supplemental figure to facilitate conceptualization of our conclusions.

      __ Authors need to clarify in the discussion that they can't rule out the cas not also needed in tca neurons, Since neither emx or nex cre would hit those cells.__

      We will add the following clarification to the discussion: The analysis of cortical bundle formation in Emx1Cre;TcKOrevealed a comparable phenotype to that observed in NestinCre;TcKO, strongly suggesting a cortical-autonomous role for Cas genes in CB formation. "However, we cannot formally exclude a thalamus-autonomous role for Itgb1 or Cas genes in TCA pathfinding, as we did not ablate these genes exclusively in the thalamus. Future studies using thalamus-specific Cre drivers would be needed to definitively address this question."

      __ Could authors add boxes in zoomed out brain images to denote zoom regions. And potentially a schematic demonstrating placement of DiI for lipophilic tracing experiments.__

      We will add boxes to denote zoom regions where possible throughout the manuscript. For some high magnification panels, we selected the best representative images, which don't necessarily correspond to specific regions of the lower magnification panels, but we will note this in the figure legends. We will also add a schematic diagram to a supplemental figure illustrating DiI placement for all lipophilic tracing experiments.

      Reviewer #2

      We thank this reviewer for describing our study as "very interesting," "well written," with data that are "of high quality" and findings that are "convincing." We appreciate their recognition that we used "state-of-the art mouse genetics" and that our work will be "of broad interest for the community of axon guidance, circuit formation and brain development."

      Major Comments:

      __ Immunofluorescence labeling for other β-integrin family members to examine expression in AC axons may provide insights into why β1-integrin deficiency does not replicate the Cas TcKO phenotype.__ This is an excellent suggestion that we will address experimentally. We will perform RNAscope analysis for integrin β5, β6, and β8 in developing piriform and S1 cortex at E14.5, E16.5, and E18.5, as these are the only other β-integrins expressed during cortical development. We anticipate that this analysis may reveal expression of alternative β-integrins in the neurons that extend axons along the developing anterior commissure, which would provide a potential explanation for why β1-integrin deficiency does not replicate the AC phenotype observed in Cas TcKO animals. These new data will be presented as part of a new figure.

      __ Is there any evidence that β1-integrin in developing cortical axons is colocalized with Cas proteins (in vivo or in vitro)?__

      We have tested multiple antibodies for p130Cas and CasL without success in cortical tissue. However, we will test two new integrin β1 antibodies and a new p130Cas antibody. While direct colocalization may be challenging due to species restrictions and tissue-specific antibody performance, we will attempt to show regional co-expression in consecutive sections. If the integrin antibodies work, we will present data as a supplemental figure demonstrating that p130Cas (using our BAC-EGFP reporter) and β1-integrin show overlapping expression patterns in developing cortical white matter tracts and neurons, supporting their potential functional interaction. In the end, while we will try to address this critique, we will be limited by the reagents that are available to us.

      Minor Comments:

      __ How long do the Cas TcKO with the various cre driver survive?__

      We have not systematically quantified survival beyond 6 months, but surprisingly, survival up to 6 months of age appears normal for all genotypes examined. This information will be included in the Methods section.

      Reviewer #3

      We thank this reviewer for acknowledging that our "main claims and conclusions are solidly supported by the data" with "good overall data quality" and "high quantitative and statistical rigor." We appreciate their recognition that we "uncover novel functions for Cas family genes in forebrain axon organization" and that our "overall reporting and discussion of findings is data-driven and refrains from excessive speculation."

      Addressing Concerns About Novelty and Impact:

      We respectfully disagree with the characterization of our findings as "somewhat incremental." While we acknowledge that similar axonal defects have been described in other lamination mutants, our study makes several novel and significant contributions:

      First demonstration of Cas family requirement in forebrain axon tract development: This is the first study to establish roles for Cas proteins in axon guidance, representing a completely new function for these well-studied signaling molecules. Novel β1-integrin-independent role for Cas proteins: Our finding that AC defects occur in Cas mutants but not β1-integrin mutants reveals a previously unknown signaling pathway and challenges the assumption that Cas proteins always function downstream of β1-integrin. Mechanistic insights into cortical-TCA interactions: While the general principle that cortical lamination affects TCA projections has been established, our work provides the first demonstration of how specific adhesion signaling molecules (Cas proteins) control this process through radial glial function. Cell-type specific requirements: Our systematic analysis using multiple Cre drivers provides unprecedented detail about where and when Cas proteins function during brain development, revealing both neuronal-autonomous (AC) and non-neuronal autonomous (TCA) roles.

      As Reviewer #2 noted, "The main advancement is a more nuanced understanding of where and when these molecules function during brain development and insights into the origin of the defects observed." This represents significant mechanistic progress in understanding forebrain circuit assembly.

      Specific Comments:

      Suggestion about cell autonomy testing: We appreciate the optional suggestion to test strict cell autonomy using sparse deletion approaches. While this would indeed be interesting, it would represent a substantial undertaking beyond the scope of the current study. However, we believe our current data using NexCre (which hits early postmitotic neurons) versus NestinCre (CNS-wide deletion) and Emx1Cre (cortical progenitors) provides supportive evidence for neuronal autonomy of the AC phenotype, as mentioned by the reviewer.

      In vitro axon guidance assays: This is an excellent suggestion for future molecular studies. In the discussion we identify specific candidate guidance molecules (e.g. Ephrins) that would be prime targets for such experiments.

      Cross-Reviewer Comments:

      We appreciate Reviewer #3's agreement with the other reviewers' suggestions and will address the quantification of neuronal mispositioning/axon bundle correlation as requested by Reviewer #1.

      Additional Improvements:

      Beyond addressing the specific reviewer comments, we will make several additional improvements to strengthen the manuscript:

      Enhanced statistical analysis: All quantifications will include appropriate statistical tests with clearly stated n values and multiple litters represented. Expanded discussion: We will better contextualize our findings within the broader axon guidance literature and discuss future directions (e.g. TCAs). New data: Additional controls, expression analysis, and quantifications will strengthen our conclusions.

      We believe these revisions, particularly the new experimental data addressing integrin family expression and the detailed quantification of phenotype coordination, will significantly strengthen our conclusions and demonstrate the novelty and impact of our findings. We hope the reviewers will find these improvements satisfactory and agree that our work makes important contributions to understanding axon guidance mechanisms in the developing forebrain.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript by Estep et al., the authors use conditional in vivo mouse genetics to study roles for Cas family intracellular adaptor proteins in forebrain axon tract development. They report two phenotypes after simultaneous nervous system-wide deletion of three Cas family genes - (1) defasciculation and misprojection of anterior commissure axons and (2) ectopic formation of thalamocortical axon bundles that penetrate the cortex. Further investigation using specific Cre lines and other conditional knockout alleles demonstrates that the anterior commissure defect results from a requirement for Cas genes in cortical projection neurons, whereas thalamocortical axons are misguided due to Cas functional requirements in cortical lamination, as ectopic axon bundles are confined to sites of disrupted cortical layer formation. Overall, this study uncovers novel functions for Cas family genes in forebrain axon organization, one of which likely reflects a direct role in axon guidance and/or fasciculation, while another one is indirect and based in the previously established role of Integrin-Cas signaling in radial glia organization and cortical neuron migration.

      The main claims and conclusions of the paper are solidly supported by the data. The study is fairly descriptive in nature, being limited to in vivo analyses of Cas expression patterns and characterization of the knockout phenotypes, and does not uncover novel molecular mechanisms for axon guidance, but it also does not attempt to make any claims to that effect. The overall reporting and discussion of findings is data-driven and refrains from excessive speculation, which is commendable. The overall data quality is good, and data organization and presentation are clear. Quantitative and statistical rigor are high.

      The characterization of Itgb1 knockout animals and various conditional Cas knockouts provides strong evidence that the thalamocortical axon phenotypes are simply a secondary consequence of cortical disorganization, as they strictly segregate with defects in cortical lamination.

      The requirement for Cas genes in anterior commissure axon organization is accurately reported as "neuronal-autonomous", but not as cell-autonomous. It would be interesting, yet not essential (i.e. this suggestion is optional), to test for strict cell autonomy by sparsely deleting Cas family genes in a subset of the neurons that project axons through the anterior commissure and analyzing the projection patterns of Cas mutant and control neurons in such a genetic mosaic side by side.

      In the discussion, the authors highlight a few of the axon guidance signaling pathways that would be strong candidates for requiring Cas in the context of the anterior commissure. If the authors wanted to develop this idea further, they should consider using in vitro axon guidance assays to study the requirement for Cas function in the axonal response to these candidate guidance molecules.

      Referee Cross-commenting

      I generally agree to the comments by reviewers 2 and 3. I especially like reviewer 1's suggestion to provide quantitative support for the correlation between sites of neuronal mispositioning and sites of ectopic axon bundle emergence in the cortex. I also agree with that reviewer's idea to box regions in micrographs that are shown in high-magnification panels.

      Significance

      The strengths of the study lie in its simplicity and limited scope, yet so do its weaknesses. The authors uncover requirements for Cas genes in axon tract organization, but mechanistic insights are extremely limited. On the plus side, the authors refrain from excessive speculation and stay very close to the data in the interpretation and discussion of their findings.

      The reported findings are novel, at least to some extent. The same group had previously established the Itgb1-Cas signaling axis as an important regulator of cortical architecture, and results presented here document a thalamocortical axon guidance phenotype that results from defective cortical lamination. Similar axonal defects have been described in other mouse models with lamination phenotypes, and these studies are cited in the manuscript at hand. So while the study is not first to show this interplay between cortical neuronal positioning and thalamocortical axon organization, it does add to the growing body of evidence for this phenomenon. As for the anterior commissure defect, the study is first to establish a role for Cas family genes in development of this axon tract, but beyond evidence that this might be a neuronal-autonomous requirement (but see earlier comment), it does not provide any mechanistic insights into this Cas function. Had the authors identified an actual signaling pathway for axon guidance or bundling that is mediated by Cas proteins and explains their requirement for anterior commissure formation, this study would be a lot more impactful. In its current form, however, the limited genetic and functional insights from this manuscript will largely be of interest to a specialized audience. The overall advance provided by this work is somewhat incremental.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this very interesting study, Estep and colleagues investigate the role of Cas family members (p130Cas, CasL/Nedd9 and Sin/Efs) by generating triple conditional mutant (Cas TcKO) mice to investigate to role in the developing brain (E14.5 - P0), focusing on thalamocortical axons (TCA) and the anterior commissure (AC). For visualization of p130Cas expressing neurons, the p130Cas-EGFP-BAC allele was used. This revealed EGFP (p130Cas) expression in all major cortical tracts and overlap with L1 distribution. Conditional ablation using Nestin-cre (Nes-cre;TcKO) revealed defects in the AC and the external capsule (EC). In addition, these mice show pathfinding defects resulting cortical bundles (CBs) in white matter within the cortical plate. Evidence is provided that these CBs originate from TCA afferents. To assess the cell autonomy of these phenotypes, Emx1-cre;TcKO and Nex-Cre;TcKO mice were generated. Analysis of these mice revealed that Cas genes function in cortical neurons is required for proper TCA development. Nex-cre;TcKO mice only replicated the AC phenotypes observed in Nestin-cre;TcKO mice. Moreover, evidence is provided that proper development of TCA afferents requires non-neuronal functions of Cas genes. Because Cas proteins function downstream of integrins, including beta1-integrin, Itgb1 cKO mice (using Emx-cre or Nex-cre) to examine similarities to Cas TcKO mice. Indeed, Emx-cre;Itgb1 cKO mice phenocopy CB defects observed in the Emx-cre; CasTcKO, while Nex-cre;Itgb1 mutants do not, and neither of the Itgb1 mutants phenocopied the Cas TcKO defects in the AC. Correlative evidence is provided that CBs observed in Cas TcKO mutants originate from disorganization of the subplate.

      Overall, this manuscript is well written, and most of the data presented are of high quality. It is also clear that a great deal of effort was put into the experiments, however some issues were identified, and the authors should address them to further clarify and strengthen the work.

      Major comments:

      1. Immunofluorescence labeling for other b-integrin family members to examine expression in AC axons may provide insights into why b1-integrin deficiency does not replicate the Cas TcKO phenotype.
      2. Is there any evidence that b1-integrin in developing cortical axons is colocalized with Cas proteins (in vivo or in vitro)

      Minor comments:

      1. How long do the Cas TcKO with the various cre driver survive?

      Significance

      Elucidation of molecular mechanisms of axon pathfinding and brain wiring in vivo. Using state-of-the art mouse genetics; this includes genetic labeling of specific axon tracts, generation of compound mutants in a cell type specific manner and gene products that are thought to function in the same pathway. This was confirmed for some fiber systems, but not for others. The findings presented are convincing and the manuscript is well written. The guidance molecules investigated are not novel and have been analyzed previously, however not with the same rigor or the use of compound mutants. The main advancement is a more nuanced understanding of where and when these molecules function during brain development and insights into the origin of the defects observed. Of broad interest for the community of axon guidance, circuit formation and brain development. I have been studying molecules that regulate axon guidance, growth and regeneration for the past 20+ years.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript by Estep et al., the authors studied Cas proteins expressed during brain development, particularly during the formation of the anterior commissure (AC) and thalamocortical (TCA) projection, using conditional alleles in mice, immunohistochemistry, and a combination of lipophilic axon tracers or genetically encoded fluorophores that mark cells that have expressed Cre. They found that Cas proteins were required for proper guidance of TCA projections and fasciculation of posterior AC axons using broad deletion of Cas gene function by crossing Cas TcKO animals with Nestin-Cre mice- CNS-wide deletions in both neuronal and glial populations, results in axon misprojection and aberrant cortical bundles, of both AC and TCA. With a time course, they find these axon misguidance phenotypes appear at different developmental time points, with AC defasciulation not apparent until e18.5, whereas aberrent TCA Cortical bundles were detected already at e14.5, and increasing over developmental time.

      They then go on to use more specific Cre drivers, EMX cre (RGCs, excitatory neurons, mqcroglia in cortex), vs Nex Cre early postmitotic neurons in cortex. EMX cre, still shows TCA defects, even though cas not knocked out of tca axons, suggesting cortical autonomous expression of cas, affects these axons. Because Nex Cre mice didn't show this phenotype, this suggested that the TCA phenotype was cortical-autonomous but not neuronal-autonomous, with mis-projecting TCA processes (cortical bundles) closely associating with misplaced subplate and deep layer neurons. cortical- and neuronal autonomous role for Cas genes during AC fasciculation showed that defasciculating AC axons originated from the dorsolateral cortex. Defects in AC fasciculation were dissimilar to β1-integrin mutants, suggesting that Cas proteins can act independently of β1-integrin during AC formation.

      Overall, these data indicate a requirement for Cas family genes during cortical white matter tract formation. The experiments are carefully done and quantified, and the phenotypes are clear and striking, and support most of the conclusions in the manuscript. I only suggest a few points for clarification.

      Authors clearly show that misplaced TCA axons are coordinate with cortical layer defects, with misplaced tbr1 + neurons, in EMX-Cre cas and integrin knockouts, suggesting these axons are following misplaced cells. These results are described as 100% coordinate, but since there is no figure of quantification, authors need to clarify how many embryos were examined for each genotype, as this was not described in results or legends.

      Are the neurons not misplaced in Nex cre cas or integrin knockouts? One would think presumably not, but then what are the tbr1+ cell migration defect caused by? I struggle a with the semantics of non-neuronal autonomous role of cas in cortex, since tbr1+ neurons are misplaced, and this is what the axons are mistargeting too. So yes, potentially cas or b1 is not needed in those neurons, but those misplaced neurons are presumably driving the phenotype.

      Authors need to clarify in the discussion that they can't rule out the cas not also needed in tca neurons, Since neither emx or nex cre would hit those cells.

      Could authors add boxes in zoomed out brain images to to denote zoom regions. And potentially a schematic demonstrating placement of DiI for lipophilic tracing experiments.

      Significance

      The study demonstrates the different requirement for Cas proteins and b1 integrin in different cell populations for appropriate white matter tract formation, and further supports the that cortical layering helps direct TCA projections. It also provides evidence for a b1 integrin independent role for cas proteins. The authors nicely discuss this with several alternative upstream receptors that may be involved detailed, that set the stage for future work, but this would be quite a large endeavor. I would add that it is unclear if cas proteins are needed in the TCA neurons, as they did not use a cre driver that would clarify this. The final limitation I will mention is that EM would likely be required to demonstrate a role for fasiculation, but this also seems beyond this original manuscript. This study will be of interest to developmental neurobiologists and the axon guidance and adhesion fields.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      In their manuscript de las Mercedes Carro et al investigated the role of Ago proteins during spermatogenesis by producing a triple knockout of Ago 1, 3 and 4. They first describe the pattern of expression of each protein and of Ago2 during the differentiation of male germ cells, then they describe the spermatogenesis phenotype of triple knockout males, study gene deregulation by scRNA seq and identify novel interacting proteins by co-IP mass spectrometry, in particular BRG1/SMARCA4, a chromatin remodeling factor and ATF2 a transcription factor. The main message is that Ago3 and 4 are involved in the regulation of XY gene silencing during meiosis, and also in the control of autosomal gene expression during meiosis. Overall the manuscript is well written, the topic, very interesting and the experiments, well-executed. However, there are some parts of the methodology and data interpretation that are unclear (see below).

      Major comments

      1= Please clarify how the triple KO was obtained, and if it is constitutive or specific to the male germline. In the result section a Cre (which cre?) is mentioned but it is not mentioned in the M&M. On Figure S1, a MICER VECTOR is shown instead of a deletion, but nothing is explained in the text nor legend. Could the authors provide more details in the results section as well as in the M&M ? This is essential to fully interpret the results obtained for this KO line, and to compare its phenotype to other lines (such as lines 184-9 Comparison of triple KO phenotype with that of Ago4 KO). Also, if it is a constitutive KO, the authors should mention if they observed other phenotypes in triple KO mice since AGO proteins are not only expressed in the male germline.

      Response: We apologize for omitting this vital information. We have now incorporated a more detailed description of how the Ago413 mutant was created in the results and M&M sections (line 120 and 686 respectively).

      As mentioned in the manuscript, Ago4, Ago1 and Ago3 are widely expressed in mammalian somatic tissues. Mutations or deletions of these genes does not disrupt development; however, there is limited research on the impact of these mutations in mammalian models in vivo. In humans, mutations in Ago1 and Ago3 genes are associated with neurological disorders, autism and intellectual disability (Tokita, M.J.,et al. 2015- doi: 10.1038/ejhg.2014.202., Sakaguchi et al. 2019- doi: 10.1016/j.ejmg.2018.09.004, Schalk et al 2021- doi: 10.1136/jmedgenet-2021-107751). In mouse, global deletion of Ago1 and Ago3 simultaneously was shown to increase mice susceptibility to influenza virus through impaired inflammation responses (Van Stry et al 2012- doi.org/10.1128/jvi.05303-11). Studies performed in female Ago413 mutants (the same mutant line used herein) have shown that knockout mice present postnatal growth retardation with elevated circulating leukocytes (Guidi et al 2023- doi: 10.1016/j.celrep.2023.113515). Other studies of double conditional knockout of Ago1 and Ago3 in the skin associated the loss of these Argonautes with decreased weight of the offspring and severe skin morphogenesis defects (Wang et al 2012- doi: 10.1101/gad.182758.111). In our study, we did not observe major somatic or overt behavioral phenotypes, and we did not observe statistical differences in body weights of null males compared to WT as shown in figure below.

      2= The paragraph corresponding to G2/M analysis is unclear to me. Why was this analysis performed? What does the heatmap show in Figure S4? What is G2/M score? (Fig 2D). Lines 219-220, do the authors mean that Pachytene cells are in a cell phase equivalent to G2/M? All this paragraph and associated figures require more explanation to clarify the method and interpretation.

      __Response: __We have modified the methods to include more information about how the cell cycle scoring used in Figures 2D and S4 were calculated and will add more information regarding the interpretation of these figures.

      3= I have concerns regarding Fig2G: to be convincing the analysis needs to be performed on several replicates, and, it is essential to compare tubules of the same stage - which does not seem to be the case. This does not appear to be the case. Besides, co (immunofluorescent) staining with markers of different cell types should be shown to demonstrate the earlier expression of some markers and their colocalization with markers of the earlier stages.

      __Response: __We agree with the Reviewer. New images with staged tubules will be added to the analysis of Figure 2G.

      4= one important question that I think the authors should discuss regarding their scRNAseq: clusters are defined using well characterized markers. But Ago triple KO appears to alter the timing of expression of genes... could this deregulation affects the interperetation of scRNAseq clusters and results?

      __Response: __We thank the reviewer for this suggestion and agree that including this information is important. We expect that, at most, this dysregulation impacts the edges of these clusters slightly. Given that marker genes that have been used to define cell types in these data are consistently expressed between the knockout and wildtype mice (see Figure S4A), we do not think that the cells in these clusters have different identities, just dysregulated expression programs. We have added the relevant sentence to the discussion, and will include additional supplemental figure panels to document this point more comprehensively.

      5= XY gene deregulation is mentioned throughout the result section but only X chromosome genes seem to have been investigated.... Even the gene content of the Y is highly repetitive, it would be very interesting to show the level of expression of Y single copy and Y multicopy genes in a figure 3 panel.

      __Response: __We agree with the reviewer that including analysis of Y-linked genes is important. We will add a supplemental figure which includes the Y:Autosome ratio and differential expression analysis.

      6= Can the authors elaborate on the observation that X gene upregulation is visible in the KO before MSCI; that is in lept/zygotene clusters (and in spermatogonia, if the difference visible in 3A is significant?)

      Response: We do see that X gene expression is upregulated before pachynema. Previous scRNA-seq studies that have looked at MCSI have seen that silencing of genes on the X and Y chromosomes starts before the cell clusters that are defined as pachynema, though silencing is not fully completed until pachynema. We have clarified this point in the manuscript.

      7 = miRNA analysis: could the authors indicate if X encoded miRNA were identified and found deregulated? Because Ago4 has been shown to lead to a downregulation of miRNA, among which many X encoded. It is therefore puzzling to see that the triple KO does not recapitulate this observation. Were the analyses performed differently in the present study and in Ago4 KO study?

      __Response: __The analysis identifying downregulation of miRNA in the original Ago4 mutant analysis was conducted relative to total small RNA expression. Amongst those altered miRNA families in the Ago4 mutants, we demonstrated both upregulation and downregulation of miRNA. We agree that confirming a similar global downregulation of miRNA counts compared to other small RNAs is important. Therefore, in a revised manuscript, we will add this information to the miRNA analysis section, especially highlighting the X chromosome-associated miRNAs, as well as whether the ratios between other small RNA classes change.

      8 = The last results paragraph would also benefit from some additional information. It is not clear why the authors focused on enhancers and did not investigate promoters (or maybe they were but it's unclear). Which regions (size and location from TSS) were investigated for motif enrichment analyses? To what correspond the "transcriptional regulatory regions previously identified using dREG" mentioned in the M&M? I understand it's based on a previous article, but more info in the present manuscript would be useful.

      Response: We thank the reviewer for this suggestion. The regions that were used for motif enrichment will be included as a supplementary information in the fully revised manuscript. We have also clarified in the methods that these transcriptional regulatory regions were downloaded from GEO and obtained from previous ChRO-seq data (from GEO) analysis. These data are run through the dREG pipeline that identifies regions predicted to contain transcription start sites, which include promoters and enhancers.

      Minor comments

      1) In the introduction: The sentence "Ago1 is not expressed in the germline from the spermatogonia stage onwards allowing us to use this model to study the roles of Ago4 and Ago3 in spermatogenesis." is misleading because Ago1 is expressed at least in spermatogonia; It would be more precise to write "after spermatogonia stage" and rephrase the sentence. Otherwise it is surprising to see AGO1 protein in testis lysate and it is not in line with the scRNA seq shown in figure 2.

      __Response: __We agree with the Reviewers suggestion and have edited the sentence on line 100. This sentence now reads "Ago1 is not expressed in the germline after the spermatogonia stage allowing us to use this model to study the roles of Ago4 and Ago3 in spermatogenesis".

      2) Could the authors precise if AGO proteins are expressed in other tissues? In somatic testicular cells?

      __Response: __Expression patterns of mammalian AGOs have been described in somatic and testicular tissues for the mouse by Gonzales-Gonzales et al (2008) by qPCR. They found that Ago2 is expressed in all the somatic tissues analyzed (brain, spleen, heart, muscle and lung) as well as the testis, with the highest expression in brain and lowest in heart. Ago1 is highly expressed in spleen compared to all the tissues analyzed, while Ago3 and Ago4 showed highest expression in testis and brain. Within somatic tissues of the testis, the four argonautes are expressed in Sertoli cells, however, Ago1,3 and 4 expression is very low compared to Ago2, with the latter showing a 10-fold higher transcript level. We have included a sentence with this information in the introduction in line 89.

      3) Pattern of expression: How do the authors explain that AGO3 disappears at the diplotene stage and reappears in spermatids?

      __Response: __ Single cell RNAseq data in the germline shows reduced transcript for Ago3 from the Pachytene stage onwards, suggesting minimal if any new transcription in round spermatids. We hypothesize that the AGO3 protein present in the round spermatid stage is cytoplasmic, presumably coming from the pool of AGO3 in the chromatoid body, a cytoplasmic structure with functional association with the nucleus in round spermatids (Kotaja et al, 2003 doi: 10.1073/pnas.05093331).

      4) It would be useful to show the timing of expression of AGO 1 to 4 throughout spermatogenesis in the first paragraph of the article. Maybe the authors could present data from fig2B earlier?

      Response: We understand the Reviewers concern, however, given that Ago expression throughout spermatogenesis was obtained from scRNA seq, we consider that this data should be presented after introducing the Ago413 knockout and the scRNA seq experiment. As Ago1-4 expression was also described in an earlier manuscript by Gonzales-Gonzales et al in the mouse male germline, and our data aligns with this report, we included a sentence about these previous findings in the earlier results section.

      5) Line 190: please modify the sentence "reveal no differences in cellular architecture of the seminiferous tubules when compared to wild-type males" to " reveal no gross differences..." since even without quantification of the different cell types it is visible that KO seminiferous tubules are different from WT tubules.

      __Response: __We agree with the reviewer, and we modified line 190 (now 173) as suggested. Grossly, seminiferous tubules from Ago413 null males contain the same cell types as in wild type tubules, including spermatozoa. However, our studies show that the number and quality of germ cells is compromised in knockouts, as shown by sperm counts and TUNEL staining.

      6) TUNEL analysis: please stage the tubules to determine the stage(s) at which apoptosis is the most predominant.

      __Response: __We have complied with the reviewer suggestion. Figure 1G now shows staged seminiferous tubules, and we have replaced the wild type image for one where the staged tubules match the knockout image.

      7) Figure S4B does not show an increase of cells at Pachytene stage but at Lepto/zygotene stage (as well as an increase of spermatogonia). Please comment this discrepancy with results shown in Fig2.

      __Response: __Figures 2 and S4 show distribution of cells in different substages of spermatogenesis and prophase I measured with very different methods: a cytological approach using chromosome spreads cells vs a transcriptomic approach that involves clustering of cells. We attribute the differences in cell type distribution to differences in the sensitivity of the methods to identify each cell type and therefore identify differences between the number of cells for each group. Moreover, our scRNA-seq data groups the leptotene and zygotene stages together, while the cytological approach allows for separation of these two sub-stages. Importantly, both results show that Ago413 spermatocytes are progressing slower from pachynema into diplonema and/or are dying after pachynema, as stated in line 194 in our manuscript.

      8) Fig5H and 5I are not mentioned in the result section. Also, it would be useful to label them with "all chromosomes" and "XY" to differentiate them easily

      __Response: __We apologize for the omission and have now cited Figures 5H and 5I in the manuscript (line 453). We have added the suggested labels.

      9) Line 530 "data provide further evidence for a functional association between AGO-dependent small RNAs and heterochromatin formation, maintenance and/or silencing." Please rephrase, the present article does not really show that AGO nuclear role depends on small RNAs.

      __Response____: __We agree with the reviewer that these data do not directly show a dependence on small RNAs. As our identified localization of AGO proteins to the pericentric heterochromatin coincides with localization of DICER shown previously by Yadav and collaborators (2020, doi: 10.1093/nar/gkaa460), we do believe that our data further implicates small RNAs in the silencing of heterochromatin. Yadav et al shows that DICER localizes to pericentromeric heterochromatin and processes major satellite transcripts into small RNAs in mouse spermatocytes, and cKO germ cells have reduced localization of SUV39H2 and H3K9me3 to the pericentromeric heterochromatin. Given the colocalization of both small RNA producing machinery and AGOs at pericentromeric heterochromatin, the AGOs may bind these small RNAs, and the statement in line 530 refers to how our results provide evidence for the involvement of other RNAi machinery in the silencing of pericentromeric heterochromatin investigated by Yadav et al which likely includes small RNAs.

      To clarify this point, we have modified the text accordingly.

      10) Line 1256: replace "cite here " by appropriate reference

      __Response: __The reference was added to line 1256.

      11) Please use SMARCA4 instead of BRG1 name as it is its official name.

      __Response: __We have replaced BRG1 with SMARCA4 in the text and figures.

      Figures:

      Figure 1: Are the pictures shown for Ago3-tagged and floxed from the same stages ? The leptotene stage in 1A looks like a zygotene, while some pachytene/diplotene stage pictures do not look alike.

      __Response: __New representative images have been added to figure 1 to match the same substages across the figure.

      Figure 1D, please label the Y scale properly (testis weight related to body weight)

      __Response: __We have fixed this.

      FigS1: Please comment the presence of non-specific bands in the figure legend

      __Response: __We have added a sentence in Figure S1 Legend.

      Fig 2E and F, please indicate on the figure (in addition to its legend), what are the X and Y axes respectively to facilitate its reading.

      __Response: __X and Y axes are now labelled in Figure 2E and F.

      2F: please use an easier abbreviation for Spermatocyte than Sp (which could spermatogonia, sperm etc..) such as Scyte I ? (same comment for Fig 3C)

      Response: The abbreviation for spermatocyte was changed from Sp to Scyte I in Figures 2 and 3.

      Overall, for all figures showing GSEA analyses, could the authors explain what a High positive NES and a High negative NES mean in the results section?

      Response: Thank you for this suggestion. We have added this information where the GSEA score of the cell markers is initially introduced.

      Significance

      Ago proteins are known for their roles in post transcriptional gene regulation via small RNA mediated cleavage of mRNA, which takes places in the cytoplasm. Some Ago proteins have been shown to be also located in the nucleus suggesting other non-canonical roles. It is the case of Ago4 which has been shown to localize to the transcriptionally silenced sex chromosomes (called sex body) of the spermatocyte nucleus, where it contributes to regulate their silencing (Modzelewski et al 2012). Interestingly, Ago4 knockout leads to Ago3 upregulation, including on the sex body indicating that Ago3 and Ago4 are involved in the same nuclear process. In their manuscript, de las Mercedes Carro et al., investigate the consequences of loss of both Ago3 and Ago4 in the male germline by the production of a triple knockout of Ago1, 3 and 4 in the mouse. With this model, the authors describe the role of Ago3 and Ago4 during spermatogenesis and show that they are involved in sex chromosome gene repression in spermatocytes and in round spermatids, as well as in the control of autosomal meiotic gene expression. Triple KO males have impaired meiosis and spermiogenesis, with fewer and abnormal spermatozoa resulting in reduced fertility. Since Ago1 male germline expression is restricted to pre-meiotic germ cells, it is not expected to contribute to the meiotic and postmeiotic phenotypes observed in the triple KO. The strengths of the study are i) the thorough analyses of mRNA expression at the single cell level, and in purified spermatocytes and spermatids (bulk RNAseq), ii) the identification of novel nuclear partners of AGO3/4 relevant for their described nuclear role: ATF2, which they show to also co-localize with the sex body, and BRG1/SMARCA4, a SWI/SNF chromatin remodeler. The main limitation of the study is the lack of information in the method regarding the production of the triple KO, as well as some aspects of the transcriptome and motif analyses. It is also surprising to see that the triple KO does not recapitulate the miRNA deregulation observed in Ago4 KO. The characterization of a non-canonical role of AGO3/4 in male germ cells will certainly influence researchers of the field, and also interest a broader audience studying Argonaute proteins and gene regulation at transcriptional and posttranscriptional levels.

      Reviewer #2

      Evidence, reproducibility and clarity

      In the manuscript titled "Argonaute proteins regulate the timing of the spermatogenic transcriptional program" by Carro et al., the authors present their findings on how Argonaute proteins regulate spermatogenic development. They utilize a mouse model featuring a deletion of the gene cluster on chromosome 4 that contains Ago1, Ago3, and Ago4 to investigate the cumulative roles of AGO3 and AGO4 in spermatogenic cells. The authors characterize the distribution of AGO proteins and their effects on key meiotic milestones such as synapsis, recombination, meiotic transcriptional regulation, and meiotic sex chromosome inactivation (MSCI). They analyze stage-specific transcriptomes in spermatogenic cells using single-cell and bulk RNA sequencing and determine the interactome of AGO3 and AGO4 through mass spectrometry to examine how AGO proteins may regulate gene expression in these cells during meiotic and post-meiotic development. The authors conclude that both AGO3 and AGO4 are essential for regulating the overall gene expression program in spermatogenic cells and specifically modulate MSCI to repress sex-linked genes in pachytene spermatocytes, which may be partially mediated by the proper distribution of DNA damage repair factors. Additionally, AGO3 is suggested to interact with the chromatin remodeler SWI/SNF factor BRG1, facilitating its removal from the sex-chromatin to enable the repression of sex-linked genes during MSCI.

      Major Comments: 1. The study utilized a triple knockout mouse model to determine the effect of AGO3 on spermatogenesis, following up on their previous report about the role of AGO4 in spermatogenesis, which resulted from an upregulation of AGO3 in Ago4-/- spermatocytes. However, the results are more difficult to interpret and ascertain the role of AGO3 in these cells, given the absence of any observable phenotype from Ago3 interruption. AGO4 regulates sex body formation, meiotic sex chromosome inactivation (MSCI), and miRNA production in spermatocytes, all of which were noted in the absence of both AGO3 and AGO4, with only an increased incidence of cells containing abnormal RNAPII at the sex chromosomes. It will be necessary to characterize how AGO3 regulates spermatogenic development, including meiotic progression and the regulation of the meiotic transcriptome, and compare these findings with the current observations to determine if the proposed mechanism involving AGO3, BRG1, and possibly AP2 is relevant in this context.

      __Response: __While we agree with Reviewer that a single Ago3 knockout will help understand distinct roles of AGO3 and AGO4 in spermatogenesis, the time and resources required to generate a new mouse model are substantial. The analysis included in this current manuscript has already taken over seven years, and with the lengthy production of a new single mutant mouse, validation of the new mouse, and then final analysis, we would be looking at another 3-5 years of analysis. In the current funding climate, and with strong concerns over ensuring reduction in utilization of laboratory mice, we consider this request to be far in excess of what is required to move this important story forward.

      The Ago413-/- mouse model has allowed us to associate a nuclear role of Argonaute proteins with a strong reproductive phenotype in the mouse germline. Given the redundancy between Ago3 and Ago4, it is likely that a single Ago3 knockout would have a mild phenotype just like the Ago4 KO. All this said, we agree with the reviewer that analysis of an Ago3 knockout mouse is a valuable next step, just not within this chapter of the story.

      1. Does Ago413-/- mice recapitulate the early meiotic entry phenotype observed in Ago4-/- mice? If not, could it be possible that AGO3 promotes meiotic entry, given its strong mRNA expression in spermatogonia according to the scRNAseq data (Fig. 2B)

      Response: Our scRNA-seq data shows strong expression of Ago3 in spermatogonia, as mentioned by the Reviewer. Analysis of cell cycle marker expression also shows that the transcriptomic profile of spermatogonia is altered, with higher levels of transcripts corresponding to the later G2/M stages (Figure 2D). Moreover, Ago413 knockouts present an increase in the number of spermatogonial stem cells (Supplementary Figure S4B). However, this cluster represents a pool of quiescent and mitotically active cells entering meiosis, therefore interpretation of these data might be challenging. While specific experiments could be conducted to answer this question, this is outside of the scope of our manuscript. The manuscript as it stands is already rather large, and a full analysis of meiotic entry dynamics would dilute the core message relating to chromatin regulation in the sex body.

      1. The authors suggested that the removal of BRG1 by AGO3 is necessary during sex body formation and the eventual establishment of MSCI. However, the BAF complex subunit ARID1A has been shown to facilitate MSCI by regulating promoter accessibility. It will be interesting to determine how BRG1 distribution changes across the genome in the absence of AGO proteins and how that correlates with alterations in sex-linked gene expression.

      __Response: __We agree that changes in BRG1 distribution across the genome would be very interesting to identify. However, in this work we show that BRG1/SMARCA4 protein changes its localization in the sex body very rapidly between early to late pachynema. These two substages are only discernable by immunofluorescence using synaptonemal complex markers, as there are currently no available techniques to enrich for these subfractions. Therefore, study of genome occupancy of BRG1 in these specific substages by techniques such as CUT&Tag are not currently possible. However, we are currently working on new methods to distinguish these cell populations and hope eventually to use these purification strategies to perform the studies suggested by this reviewer. Alternatively, the hope is that single cell CUT&Tag methods will become more reliable, and will enable us to address these questions. Both of these options are not currently available to us. The studies by Menon et al (2024-doi:10.7554/eLife.88024.5) provide strong evidence to support that ARID1A is needed to reduce promoter accessibility of XY silenced genes in prophase I through modulation of H3.3 distribution. However, this mechanism and our identification of the removal of BRG1 between early and late pachytema are not inconsistent with one another, as either SMARCA4 or SMARCA2 can associate with ARID1A as part of the cBAF complex, and ARID1A is also not in all forms of the BAF complex which BRG1 are in. The difference between our results and those seen in Menon et al likely indicate that there are multiple forms of the BAF complex which are differentially regulated during MSCI and play different roles in silencing transcription. Further studies of specific BAF subunits are needed to elucidate how different flavors of the BAF complex act at specific genomic locations and meiotic time points.

      1. The observations presented in this manuscript (Fig. 1D, 2C, 3D, and 4) suggest a haploinsufficiency of the deleted locus in spermatogenic development. How does this compare with the ablation of either Ago3 or Ago4? Please explain.

      Response: Our previous studies in single Ago4 knockouts did not present a heterozygous phenotype (Modzelewski et al 2012, doi: 10.1016/j.devcel.2012.07.003, data not shown). Triple Ago413 knockouts show a much stronger fertility phenotype than single Ago4 knockout. Testis weight of Ago413 homozygous null present a 30% reduction while heterozygous mice show a 15% reduction (Figure 1D), comparable to the 13% reduction previously observed in Ago4-/- males. Sperm counts of Ago413 null and heterozygous males are reduced by 60% and 39% compared to wild type (Figure 1E), respectively, whereas Ago4 null mice have a milder phenotype, with only a 22% reduction in sperm counts. At the MSCI level, both homozygous and heterozygous Ago413 mutant spermatocytes show a similar increase in pachytene spermatocytes with increased RNA pol II ingression into the sex body with respect to wild-type of 35% and 30%, respectively. Ago4 single knockouts show an almost 18% increase in Pol II ingression when compared to wild type. These comparisons are now included in our manuscript in lines 170, 172 and 288. A milder phenotype of the Ago4 knockout and haploinsufficiency in triple Ago413 knockouts but not in Ago4 single knockouts is likely a consequence of the overlapping functions of Ago3 and Ago4 in mammals (and/or overexpression of Ago3 in Ago4 knockouts). In the context of their role in RISC, Wang et al (doi: 10.1101/gad.182758.111) studied the effects of single and double conditional knockouts for Ago1 and Ago2 in miRNA-mediated silencing. They discovered that the interaction between miRNAs and AGOs is highly correlated with the abundance of each AGO protein, and only double knockouts presented an observable phenotype.

      Minor Comments: Based on the interactome analysis, it was argued that AGO3 and AGO4 may function separately. Please discuss how AGO3 might compensate for AGO4 (Line 109).

      Response: We hypothesize that the combined function of AGO3 and AGO4 is needed for proper sex chromosome inactivation during meiosis. We base this hypothesis on the facts that (i) both proteins localize to the sex body in pachytene spermatocytes, (ii) loss of Ago4 leads to upregulation of Ago3, and (iii) the MSCI phenotype of Ago413 knockout mice is much stronger than the single Ago4 knockout (see above). However, AGO3 and AGO4 might not induce silencing through the same mechanism or pathway. In this work, we observed that their temporal expression in prophase I is different; while AGO3 protein seems to disappear by the diplotene stage, AGO4 is present in the sex body of these cells. Moreover, the proteomic analysis revealed a very low number of common interactors, an observation which could support the idea of AGO3 and AGO4 acting by different (albeit perhaps related) mechanisms to achieve MSCI. It is also possible that common interactors were not identified in our proteomic analysis due to the low abundance of AGO3 and AGO4 in the germ cells, limiting the resolution of the proteomics analysis (note that in order to visualize AGO proteins in WB experiments, at least 60 μg of enriched germ cell lysate must be loaded per lane). Moreover, given the difficulty in obtaining enough isolated pachytene and diplotene spermatocytes to perform immunoprecipitation experiments, we performed IP experiments in whole germ cell lysates, which limits the interpretation of our analysis. If AGO3 and AGO4 protein interactors overlap, then AGO3 would directly substitute for AGO4 leading to silencing in single Ago4 knockouts. However, if AGO3 and AGO4 work together through different, complementary mechanisms, then Ago4 mutant mice likely compensates loss of Ago4 by upregulation of Ago3along with specific interactors of the given pathway. We have added a sentence addressing this matter in line 411 of the results section and lines 506 and 513 of the discussion in the revised manuscript.

      In Line 221, it is unclear what is meant by 'cell cycle transcripts'. Does this refer to meiotic transcripts? It is also important to discuss the relevance of the G2/M cell cycle marker genes at later stages of meiotic prophase.

      Response: Thank you for this suggestion. We have changed the relevant text to remove redundancies and include more information. We agree that considering the importance of these genes across meiotic prophase is needed, as cells which are in the dividing stage will already have produced the proteins necessary for division. These cells likely correspond to the diplotene/M cluster cells that have a lower G2/M score, potentially causing the bimodal distribution seen in Figure 2D. We have added a sentence addressing this to the manuscript.

      While identified as a common interactor of both AGO3 and AGO4 in lines 440-445, HNRNPD is not listed among AGO4 interactors in Table S6. Please correct or explain this discrepancy.

      Response: HNRPD was originally identified as an AGO4 interactor using a less strict criteria than the one used in our manuscript: we required consistent enrichment in at least two rounds of IP MS experiments. This reference to HNRNPD was a mistake, given that HNRPD was only enriched in one of our three replicates. Thus, we apologize and have removed the sentence in lines 440-445.

      It is unclear whether wild-type cell lysate or lysate containing FLAG-tagged AGO3 was used for BRG1 immunoprecipitation, and which antibody was used to detect AGO3 in the BRG1 IP sample. A co-IP experiment demonstrating interaction between BRG1 and wild-type AGO3 would be ideal in this context. Furthermore, co-localization by IF would be beneficial to determine the subcellular localization and the cell stages the interaction may be occurring. Additionally, co-IP and Western blot methodologies should be included in the methods section.

      __Response: __MYC-FLAG tagged AGO3 protein lysates were used for BRG1 Co-Immunoprecipitation, along with an anti MYC antibody to detect AGO3. This is now detailed in the Methods section of our revised manuscript (line 1133).

      Regarding BRG1 and AGO3 colocalization by IF, we can confidently show that both AGO3 and BRG1 localize to the sex chromosomes in early pachynema by comparing BRG1/SYCP3 and FLAG-AGO3/SYCP3 stained spreads. We were not able to show colocalization simultaneously on the same cells, given the lack of appropriate antibodies. Our anti FLAG antibody is raised in mouse, while anti BRG1 is raised in rabbit, therefore a non-rabbit, non-mouse anti SYCP3 would be needed to identify prophase I substages, and our lab does not possess such a validated antibody. However, we now have access to a multiplexing kit that allows to use same-species antibodies for immunofluorescence and we can perform these experiments for a revised manuscript.

      __Response: __The methods section now includes description of co-IP methodologies (line 1132). Western Blot methodologies are explained in lane 718, under the "Immunoblotting" title.

      In line 599, it is unclear what is meant by 'persistence of sex chromosome de-repression'. Please correct or clarify this.

      Response: This sentence has been changed and reads: "The persistence of sex chromosome gene expression".

      If possible, please add an illustration to summarize the findings together.

      Response: We thank the reviewer for this suggestion, and have now added this in Figure 6

      Significance

      Overall, this study enhances the understanding of gene expression regulation by AGO proteins during spermatogenesis. Several approaches, including functional, histological, and molecular characterization of the triple knockout phenotype, were instrumental in elucidating the role of AGO proteins in MSCI and meiotic as well as postmeiotic gene regulation. The main limitation of the study is that it is challenging to appreciate the role of AGO3 in addition to the previously published role of AGO4 without the inclusion of necessary control groups. Furthermore, the mechanism of action for AGO proteins in meiotic gene regulation was left relatively unexplored. This study presents new findings that will be significant for the research community interested in gene regulation, chromatin biology, and reproductive biology with the above suggestions considered.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      The authors characterize a CRISPR-Cas9 mouse mutant that targets 3 genes that encode AGO family proteins, 2 of which are expressed during spermatogenesis (AGO3 and AGO4) and one that is said is not expressed, AGO1. This mouse mutant showed that AGO3 and AGO4 both contribute to spermatogenesis success as the "Ago413" mutation gave rise to an additive reduction in testis weight, due to spermatocyte apoptosis, and reduction in sperm count. Furthermore, they use insertion mouse mutants for Ago3 and Ago2 that express tagged versions of their corresponding proteins, which they use in combination with pan-AGO antibodies and Ago mutants to show differential expression and localization properties of AGO2, AGO3, and AGO4 (and the absence of AGO1) during spermatogenesis with a particular focus on meiotic prophase. They perform single-cell RNAseq and intricate analyses to demonstrate a change in distribution of meiotic stages in Ago413 mutants, and the overall cell cycle in spermatogonia and spermatocytes is altered. This analysis shows that the mutation leads to an inability to downregulate prior spermatogonia/spermatocyte stage transcripts in a timely manner. On the other hand, later-stage spermatocytes are abnormally expressing spermiogenesis genes. Similar to the Ago4 mutant previously characterized MSCI is disrupted. The authors also show that AGO3 has different interaction partners compared to AGO4 and focus their final assessment on a novel interaction partner of AGO3, BRG1. They show that this factor, which is involved in chromatin remodeling, is aberrantly localized to the sex body during meiotic prophase and diplonema. As BRG1 is involved in open chromatin, it is proposed that AGO3 restricts BRG1 (and related proteins) from the XY chromosome to ensure MSCI. Overall, this paper is very well constructed with mechanistic insights that make this a very impactful contribution to the research community. Major Comments:

      1. The abstract contains "Ago413-/- mouse" without any explanation of what that is. The abstract needs to be a stand-alone document that does not require any referencing for context.

      Response: We have included a sentence describing Ago413 in line 27

      Figure 2C. - The significance bars are confusing as they appear to overlap strangely.

      Response: We have modified this figure and now present the significance bars are on top of the data points.

      On line 235, the authors state that "we first identified the top non-overlapping upregulated genes for Ago413+/+ germ cells in each cluster. Why did the authors not also select down-regulated genes in each cluster to perform a similar analysis?

      __Response: __Thank you for this question. As our goal was to identify genes that are markers of the transcriptional program in each cell type, we used only uniquely upregulated genes for each cluster. Genes that are downregulated for a cluster may be indicative of the transcription in several other cell types, which is not easily interpretable. For a revised manuscript, we will perform this analysis to determine if there is any specific alterations in these downregulated genes.

      Their Ago413 mutant characterization does a good job of assessing meiotic prophase and spermatozoa. However, their assessment of the stages in between these is lacking (meiotic divisions and spermiogenesis).

      Response: We understand the reviewer's concern, however, it is not usual to study stages between the first meiotic division and spermiogenesis because meiosis II is so rapid and thus we lack tools to dissect it. In general, any defect that impacts meiosis I (and particularly prophase I) leads to cell death during prophase I or at metaphase I due to strictly adhered checkpoints that eradicate defective cells. Thus, the increased TUNEL staining in prophase I indicates to us that defective cells are cleared before exit from meiosis I, and those cells progressing to the spermatid stage are "normal" for meiosis II progression. For these cells that did complete meiosis I and progressed normally through meiosis II, we analyzed their spermiogenic outcome extensively (see section entitled "Post-meiotic spermatids from Ago413-/- males exhibit defective spermiogenesis and poor spermatozoa function"). This section included extensive sperm morphology, sperm motility and sperm fertility through in vitro fertilization assays. That said, we have added a sentence on line 268 to explain the transit through meiosis II.

      The discovery of the interaction between BRG1 and AGO3 is exciting. They should assess BRG1 localization in later sub-stages, including late diplonema and diakinesis.

      __Response: __BRG1(SMARCA4) was analyzed throughout prophase I, as shown in image 5G, including quantification of fluorescence intensity included the analysis of diplonema (5H-I). However, diakinesis was not included here since there was no observable signal of BRG1 in these cells. We have explained this in lines 459.

      ATF2 should have been assessed in more detail, as was done for BRG1 in Figure 5.

      __Response: __We agree with the Reviewer, however, staining of chromosome spreads with the anti ATF2 antibody was not possible in our hands after several attempts and changes in staining conditions. However, as staining of sections was successful, we showed localization of ATF2 on spermatocytes by co staining sections with SYCP3 and ATF2.

      Reviewer #3 (Significance (Required)): Overall, this paper is very well constructed with mechanistic insights, as described in my reviewer comments, that make this a very impactful contribution to the research community.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors characterize a CRISPR-Cas9 mouse mutant that targets 3 genes that encode AGO family proteins, 2 of which are expressed during spermatogenesis (AGO3 and AGO4) and one that is said is not expressed, AGO1. This mouse mutant showed that AGO3 and AGO4 both contribute to spermatogenesis success as the "Ago413" mutation gave rise to an additive reduction in testis weight, due to spermatocyte apoptosis, and reduction in sperm count. Furthermore, they use insertion mouse mutants for Ago3 and Ago2 that express tagged versions of their corresponding proteins, which they use in combination with pan-AGO antibodies and Ago mutants to show differential expression and localization properties of AGO2, AGO3, and AGO4 (and the absence of AGO1) during spermatogenesis with a particular focus on meiotic prophase. They perform single-cell RNAseq and intricate analyses to demonstrate a change in distribution of meiotic stages in Ago413 mutants, and the overall cell cycle in spermatogonia and spermatocytes is altered. This analysis shows that the mutation leads to an inability to downregulate prior spermatogonia/spermatocyte stage transcripts in a timely manner. On the other hand, later-stage spermatocytes are abnormally expressing spermiogenesis genes. Similar to the Ago4 mutant previously characterized MSCI is disrupted. The authors also show that AGO3 has different interaction partners compared to AGO4 and focus their final assessment on a novel interaction partner of AGO3, BRG1. They show that this factor, which is involved in chromatin remodeling, is aberrantly localized to the sex body during meiotic prophase and diplonema. As BRG1 is involved in open chromatin, it is proposed that AGO3 restricts BRG1 (and related proteins) from the XY chromosome to ensure MSCI. Overall, this paper is very well constructed with mechanistic insights that make this a very impactful contribution to the research community.

      Major Comments:

      1. The abstract contains "Ago413-/- mouse" without any explanation of what that is. The abstract needs to be a stand-alone document that does not require any referencing for context.
      2. Figure 2C. - The significance bars are confusing as they appear to overlap strangely.
      3. On line 235, the authors state that "we first identified the top non-overlapping upregulated genes for Ago413+/+ germ cells in each cluster. Why did the authors not also select down-regulated genes in each cluster to perform a similar analysis?
      4. Their Ago413 mutant characterization does a good job of assessing meiotic prophase and spermatozoa. However, their assessment of the stages in between these is lacking (meiotic divisions and spermiogenesis).
      5. The discovery of the interaction between BRG1 and AGO3 is exciting. They should assess BRG1 localization in later sub-stages, including late diplonema and diakinesis.
      6. ATF2 should have been assessed in more detail, as was done for BRG1 in Figure 5.

      Significance

      Overall, this paper is very well constructed with mechanistic insights, as described in my reviewer comments, that make this a very impactful contribution to the research community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript titled "Argonaute proteins regulate the timing of the spermatogenic transcriptional program" by Carro et al., the authors present their findings on how Argonaute proteins regulate spermatogenic development. They utilize a mouse model featuring a deletion of the gene cluster on chromosome 4 that contains Ago1, Ago3, and Ago4 to investigate the cumulative roles of AGO3 and AGO4 in spermatogenic cells. The authors characterize the distribution of AGO proteins and their effects on key meiotic milestones such as synapsis, recombination, meiotic transcriptional regulation, and meiotic sex chromosome inactivation (MSCI). They analyze stage-specific transcriptomes in spermatogenic cells using single-cell and bulk RNA sequencing and determine the interactome of AGO3 and AGO4 through mass spectrometry to examine how AGO proteins may regulate gene expression in these cells during meiotic and post-meiotic development. The authors conclude that both AGO3 and AGO4 are essential for regulating the overall gene expression program in spermatogenic cells and specifically modulate MSCI to repress sex-linked genes in pachytene spermatocytes, which may be partially mediated by the proper distribution of DNA damage repair factors. Additionally, AGO3 is suggested to interact with the chromatin remodeler SWI/SNF factor BRG1, facilitating its removal from the sex-chromatin to enable the repression of sex-linked genes during MSCI.

      Major Comments:

      The study utilized a triple knockout mouse model to determine the effect of AGO3 on spermatogenesis, following up on their previous report about the role of AGO4 in spermatogenesis, which resulted from an upregulation of AGO3 in Ago4-/- spermatocytes. However, the results are more difficult to interpret and ascertain the role of AGO3 in these cells, given the absence of any observable phenotype from Ago3 interruption. AGO4 regulates sex body formation, meiotic sex chromosome inactivation (MSCI), and miRNA production in spermatocytes, all of which were noted in the absence of both AGO3 and AGO4, with only an increased incidence of cells containing abnormal RNAPII at the sex chromosomes. It will be necessary to characterize how AGO3 regulates spermatogenic development, including meiotic progression and the regulation of the meiotic transcriptome, and compare these findings with the current observations to determine if the proposed mechanism involving AGO3, BRG1, and possibly AP2 is relevant in this context.

      Does Ago413-/- mice recapitulate the early meiotic entry phenotype observed in Ago4-/- mice? If not, could it be possible that AGO3 promotes meiotic entry, given its strong mRNA expression in spermatogonia according to the scRNAseq data (Fig. 2B) The authors suggested that the removal of BRG1 by AGO3 is necessary during sex body formation and the eventual establishment of MSCI. However, the BAF complex subunit ARID1A has been shown to facilitate MSCI by regulating promoter accessibility. It will be interesting to determine how BRG1 distribution changes across the genome in the absence of AGO proteins and how that correlates with alterations in sex-linked gene expression. The observations presented in this manuscript (Fig. 1D, 2C, 3D, and 4) suggest a haploinsufficiency of the deleted locus in spermatogenic development. How does this compare with the ablation of either Ago3 or Ago4? Please explain.

      Minor Comments:

      Based on the interactome analysis, it was argued that AGO3 and AGO4 may function separately. Please discuss how AGO3 might compensate for AGO4 (Line 109).<br /> In Line 221, it is unclear what is meant by 'cell cycle transcripts'. Does this refer to meiotic transcripts? It is also important to discuss the relevance of the G2/M cell cycle marker genes at later stages of meiotic prophase.<br /> While identified as a common interactor of both AGO3 and AGO4 in lines 440-445, HNRNPD is not listed among AGO4 interactors in Table S6. Please correct or explain this discrepancy. It is unclear whether wild-type cell lysate or lysate containing FLAG-tagged AGO3 was used for BRG1 immunoprecipitation, and which antibody was used to detect AGO3 in the BRG1 IP sample. A co-IP experiment demonstrating interaction between BRG1 and wild-type AGO3 would be ideal in this context. Furthermore, co-localization by IF would be beneficial to determine the subcellular localization and the cell stages the interaction may be occurring. Additionally, co-IP and Western blot methodologies should be included in the methods section. In line 599, it is unclear what is meant by 'persistence of sex chromosome de-repression'. Please correct or clarify this. If possible, please add an illustration to summarize the findings together.

      Significance

      Overall, this study enhances the understanding of gene expression regulation by AGO proteins during spermatogenesis. Several approaches, including functional, histological, and molecular characterization of the triple knockout phenotype, were instrumental in elucidating the role of AGO proteins in MSCI and meiotic as well as postmeiotic gene regulation. The main limitation of the study is that it is challenging to appreciate the role of AGO3 in addition to the previously published role of AGO4 without the inclusion of necessary control groups. Furthermore, the mechanism of action for AGO proteins in meiotic gene regulation was left relatively unexplored. This study presents new findings that will be significant for the research community interested in gene regulation, chromatin biology, and reproductive biology with the above suggestions considered.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their manuscript de las Mercedes Carro et al investigated the role of Ago proteins during spermatogenesis by producing a triple knockout of Ago 1, 3 and 4. They first describe the pattern of expression of each protein and of Ago2 during the differentiation of male germ cells, then they describe the spermatogenesis phenotype of triple knockout males, study gene deregulation by scRNA seq and identify novel interacting proteins by co-IP mass spectrometry, in particular BRG1/SMARCA4, a chromatin remodeling factor and ATF2 a transcription factor. The main message is that Ago3 and 4 are involved in the regulation of XY gene silencing during meiosis, and also in the control of autosomal gene expression during meiosis. Overall the manuscript is well written, the topic, very interesting and the experiments, well-executed. However, there are some parts of the methodology and data interpretation that are unclear (see below).

      Major comments

      1. Please clarify how the triple KO was obtained, and if it is constitutive or specific to the male germline. In the result section a Cre (which cre?) is mentioned but it is not mentioned in the M&M. On Figure S1, a MICER VECTOR is shown instead of a deletion, but nothing is explained in the text nor legend. Could the authors provide more details in the results section as well as in the M&M ? This is essential to fully interpret the results obtained for this KO line, and to compare its phenotype to other lines (such as lines 184-9 Comparison of triple KO phenotype with that of Ago4 KO). Also, if it is a constitutive KO, the authors should mention if they observed other phenotypes in triple KO mice since AGO proteins are not only expressed in the male germline.
      2. The paragraph corresponding to G2/M analysis is unclear to me. Why was this analysis performed? What does the heatmap show in Figure S4? What is G2/M score? (Fig 2D). Lines 219-220, do the authors mean that Pachytene cells are in a cell phase equivalent to G2/M? All this paragraph and associated figures require more explanation to clarify the method and interpretation.
      3. I have concerns regarding Fig2G: to be convincing the analysis needs to be performed on several replicates, and, it is essential to compare tubules of the same stage - which does not seem to be the case. This does not appear to be the case. Besides, co (immunofluorescent) staining with markers of different cell types should be shown to demonstrate the earlier expression of some markers and their colocalization with markers of the earlier stages.
      4. one important question that I think the authors should discuss regarding their scRNAseq: clusters are defined using well characterized markers. But Ago triple KO appears to alter the timing of expression of genes... could this deregulation affects the interperetation of scRNAseq clusters and results?
      5. XY gene deregulation is mentioned throughout the result section but only X chromosome genes seem to have been investigated.... Even the gene content of the Y is highly repetitive, it would be very interesting to show the level of expression of Y single copy and Y multicopy genes in a figure 3 panel.
      6. Can the authors elaborate on the observation that X gene upregulation is visible in the KO before MSCI; that is in lept/zygotene clusters (and in spermatogonia, if the difference visible in 3A is significant?)
      7. miRNA analysis: could the authors indicate if X encoded miRNA were identified and found deregulated? Because Ago4 has been shown to lead to a downregulation of miRNA, among which many X encoded. It is therefore puzzling to see that the triple KO does not recapitulate this observation. Were the analyses performed differently in the present study and in Ago4 KO study?
      8. The last results paragraph would also benefit from some additional information. It is not clear why the authors focused on enhancers and did not investigate promoters (or maybe they were but it's unclear). Which regions (size and location from TSS) were investigated for motif enrichment analyses? To what correspond the "transcriptional regulatory regions previously identified using dREG" mentioned in the M&M? I understand it's based on a previous article, but more info in the present manuscript would be useful.

      Minor comments

      1. In the introduction: The sentence "Ago1 is not expressed in the germline from the spermatogonia stage onwards allowing us to use this model to study the roles of Ago4 and Ago3 in spermatogenesis." is misleading because Ago1 is expressed at least in spermatogonia; It would be more precise to write "after spermatogonia stage" and rephrase the sentence. Otherwise it is surprising to see AGO1 protein in testis lysate and it is not in line with the scRNA seq shown in figure 2.
      2. Could the authors precise if AGO proteins are expressed in other tissues? In somatic testicular cells?
      3. Pattern of expression: How do the authors explain that AGO3 disappears at the diplotene stage and reappears in spermatids?
      4. It would be useful to show the timing of expression of AGO 1 to 4 throughout spermatogenesis in the first paragraph of the article. Maybe the authors could present data from fig2B earlier?
      5. Line 190: please modify the sentence "reveal no differences in cellular architecture of the seminiferous tubules when compared to wild-type males" to " reveal no gross differences..." since even without quantification of the different cell types it is visible that KO seminiferous tubules are different from WT tubules.
      6. TUNEL analysis: please stage the tubules to determine the stage(s) at which apoptosis is the most predominant.
      7. Figure S4B does not show an increase of cells at Pachytene stage but at Lepto/zygotene stage (as well as an increase of spermatogonia). Please comment this discrepancy with results shown in Fig2.
      8. Fig5H and 5I are not mentioned in the result section. Also, it would be useful to label them with "all chromosomes" and "XY" to differentiate them easily
      9. Line 530 "data provide further evidence for a functional association between AGO-dependent small RNAs and heterochromatin formation, maintenance and/or silencing." Please rephrase, the present article does not really show that AGO nuclear role depends on small RNAs.
      10. Line 1256: replace "cite here " by appropriate reference
      11. Please use SMARCA4 instead of BRG1 name as it is its official name.

      Figures:

      Figure 1: Are the pictures shown for Ago3-tagged and floxed from the same stages ? The leptotene stage in 1A looks like a zygotene, while some pachytene/diplotene stage pictures do not look alike.

      Figure 1D, please label the Y scale properly (testis weight related to body weight)

      FigS1: Please comment the presence of non-specific bands in the figure legend

      Fig 2E and F, please indicate on the figure (in addition to its legend), what are the X and Y axes respectively to facilitate its reading.

      2F: please use an easier abbreviation for Spermatocyte than Sp (which could spermatogonia, sperm etc..) such as Scyte I ? (same comment for Fig 3C)

      Overall, for all figures showing GSEA analyses, could the authors explain what a High positive NES and a High negative NES mean in the results section?

      Significance

      Ago proteins are known for their roles in post transcriptional gene regulation via small RNA mediated cleavage of mRNA, which takes places in the cytoplasm. Some Ago proteins have been shown to be also located in the nucleus suggesting other non-canonical roles. It is the case of Ago4 which has been shown to localize to the transcriptionally silenced sex chromosomes (called sex body) of the spermatocyte nucleus, where it contributes to regulate their silencing (Modzelewski et al 2012). Interestingly, Ago4 knockout leads to Ago3 upregulation, including on the sex body indicating that Ago3 and Ago4 are involved in the same nuclear process. In their manuscript, de las Mercedes Carro et al., investigate the consequences of loss of both Ago3 and Ago4 in the male germline by the production of a triple knockout of Ago1, 3 and 4 in the mouse. With this model, the authors describe the role of Ago3 and Ago4 during spermatogenesis and show that they are involved in sex chromosome gene repression in spermatocytes and in round spermatids, as well as in the control of autosomal meiotic gene expression. Triple KO males have impaired meiosis and spermiogenesis, with fewer and abnormal spermatozoa resulting in reduced fertility. Since Ago1 male germline expression is restricted to pre-meiotic germ cells, it is not expected to contribute to the meiotic and postmeiotic phenotypes observed in the triple KO. The strengths of the study are i) the thorough analyses of mRNA expression at the single cell level, and in purified spermatocytes and spermatids (bulk RNAseq), ii) the identification of novel nuclear partners of AGO3/4 relevant for their described nuclear role: ATF2, which they show to also co-localize with the sex body, and BRG1/SMARCA4, a SWI/SNF chromatin remodeler. The main limitation of the study is the lack of information in the method regarding the production of the triple KO, as well as some aspects of the transcriptome and motif analyses. It is also surprising to see that the triple KO does not recapitulate the miRNA deregulation observed in Ago4 KO. The characterization of a non-canonical role of AGO3/4 in male germ cells will certainly influence researchers of the field, and also interest a broader audience studying Argonaute proteins and gene regulation at transcriptional and posttranscriptional levels.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We would like to thank the reviewers for their comments, we see great value in the suggestions they made to strengthen our work. We are glad to see that they are in general positive about the manuscript. In the following, we include a point-by-point response to their comments, which are in general consistent with each other.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Sanchez-Cisneros and colleagues, examine how tracheal cell adhesion to the ECM underneath the epidermis helps shape the tracheal system. They show that if cell-ECM adhesion is perturbed the development of the tracheal system and the epidermis is disrupted. They also detect protrusions extending from the dorsal trunk cells towards the ECM. The work is novel, the figures are clear, and the questions are well addressed. However, I find that some of the claims are not completely supported by the data presented. I have some suggestions that will, I believe, clarify certain points.

      Major comments

      At the beginning of the results section as in the introduction the authors claim that "It is generally assumed that trunk displacement occurs due to tip cells pulling on the trunks so that they follow their path dorsally." This sentence is not referenced, and I do not know where it has been shown or proposed to be like this. In addition, the comparison with the ventral branches is also not referenced and the movie does not really show this. Forces generated by tracheal branch migration have been shown to drive intercalation (Caussinus E, Colombelli J, Affolter M. Tip-cell migration controls stalk-cell intercalation during Drosophila tracheal tube elongation. Curr Biol. 2008;18(22):1727-1734. doi:10.1016/j.cub.2008.10.062), but not dorsal trunk (DT) displacement.

      • *

      We agree that dorsal trunk displacement has not been discussed in previous works, just the fact that tip-cell migration influences stalk cell intercalation. We will rephrase this sentence, stating that dorsal trunk displacement has not been studied.

      However, to rule out the possibility that DT displacement and the phenotype observed in XXX is due to dorsal branch pulling forces, the authors should analyze what happens in the absence of dorsal branches (in condition of Dpp signalling inhibition as in punt mutants or Dad overexpression conditions).

      This is a great idea, and we thank the reviewer for suggesting this. We tried to achieve a similar goal by expressing a Dominant Negative FGFR (Breathless-DN) in the tracheal system, since its expression under btl-gal4 affects tip cell migration. However, the phenotype arises too late to have an effect in dorsal branch migration during the stages we were interested in analyzing. The alternative proposed by the reviewer should be more efficient, as blocking Dpp signalling prevents the formation of dorsal branches completely. We have just received flies carrying the UAS-Dad construct. We will express Dad under btl-gal4 and see how this affects dorsal trunk displacement.

      I am concerned about the TEM observations. The authors claim they can identify tracheal cells by their lumen (Fig. 2 C'). However, at stage 15, the tracheal lumen should be clearly identifiable, and the interluminal DT space should be wider relative to the size of the cells. In this case, there is nothing telling us that we are not looking at a dorsal branch or lateral trunk cell. Furthermore, at embryonic stage 15, the tracheal lumen is filled with a chitin filament, which is not visible in these micrographs. Also, there is quite a lot of tissue detachment and empty spaces between cells, which might be a sign of problems in sample fixing. Better images and more accurate identification of dorsal trunk cells is necessary to support the claim that "These experiments revealed a novel anatomical contact between the epidermis and tracheal trunks".

      The protocol that we use for TEM involves performing 1-μm sections that allow us to stage embryos and to identify the anatomical regions using light microscopy and then switch to ultra-thin sections for electron microscopy once we have found the right position within the sample. This approach also allows us to determine the integrity of the sample. We attach here a micrograph of the last section we analyzed before we decided to do the EM analysis. The asterisk (*) points to a region where the multicellular lumen of the trunk is visible. Due to its proximity to the posterior spiracles, we are confident this is the dorsal trunk and not the lateral trunk. We realize now, after comparing this image with an atlas of development (Campos-Ortega and Hartenstein, 2013), that the stage we chose to illustrate the interaction is a stage 14 embryo instead of the stage 15 we indicated in the manuscript. We will change the stage but given that dorsal closure has already started by stage 14, this does not affect our analysis. Still, we apologize for the mis-staging of the embryo.

      In the light-microscopy image, we have overlaid the EM section to the corresponding region of interest. We agree that the lumen should be thicker compared to the length of the cells, if the section would be cutting the trunk through its largest diameter. However, the protrusions we see do not emerge from the middle part of the trunk where the lumen is found but are seen towards the dorsal side of the trunk, where the lumen will no longer be visible in a longitudinal section as the ones we present. In the embryo shown in Figure 2A-C, our interpretation is that the section was done through a very shallow section of the lumen (represented below). We interpret this from the fact that we see abundant electron-dense areas which we think are adherens junctions from multiple cells. These junctions are visible in Figure 2C but are currently not labelled. We will add arrows to increase their visibility.

      Given that protruding cells lie at the base of dorsal branches, it would be expected that in some sections we would find the protrusions close to the dorsal branches. This is in fact what we show in the micrograph shown in Figure 2D, with a lower magnification overview image shown in Figure S2D. In this case, we see a cell in close proximity to the tendon cells on one side (Figure 2D), which is connected to a dorsal branch on the opposite side (shown in Figure S2D). This dorsal branch is clearly autocellular and chitin deposition is visible as expected for the developmental stage. Again, in Figure S2E we see an electron-dense patch near the lumen that corresponds to the adherens junctions that seal the lumen. We see that all this needs to be better explained in the manuscript, so we will elaborate on the descriptions, and incorporate the light microscopy micrograph to the supplemental figures. This should also aid with the anatomical descriptions requested by Reviewer #3. Nevertheless, we think these observations confirm that what we are describing are the contact points between the dorsal trunk and tendon cells.

      Timelapse imaging of the protrusions in DT cells is done with frames every 4 minutes (Video S3). This is not enough to properly show cellular protrusions and the images do not really show interaction with the epidermis. Video S4 has a better time resolution but it is very short and only shows the cut moment. Video S4, shows the cut, but the reported (and quantified recoil) is not clear. Nevertheless, the results are noteworthy and should be further analysed.

      We will acquire high temporal resolution time-lapse images using E-Cadherin::GFP and btl-gal4, UAS-PH::mCherry to show the behaviour of the protrusions on a short time scale.

      • *

      Provided these embryos survive, would it be possible to check if embryos after laser cutting will develop wavy DTs?

      We think it would be interesting to carry out this experiment, but the laser cut experiments were done under a collaborative visit and we would not be able to repeat it in a short-term period.

      What happens to the larvae under the genetic conditions presented in Fig.S3? Do they reach pupal stages? Do these animals reach adult stages?

      We have seen escapers out of these crosses, but we have not quantified the lethality of the experiment. We will analyse this and include it in the manuscript.

      The kayak phenotypes are very interesting and perhaps the authors could explore them more. As in inhibition of adhesion to the ECM, kay mutants display wavy dorsal trunks. Do they have defective adhesion? Fos being a transcription factor, this is a possibility. The authors should at least discuss the kay phenotypes more extensively and present a suitable hypothesis for the phenotype.

      We agree that the kayak experiments might bring more consequences than just preventing dorsal closure. We will complement this approach by blocking dorsal closure by other independent means. We will use pannier-gal4 (a lateral epidermis driver), engrailed-gal4 (a driver for epidermal posterior compartment), and 332-gal4 (an amnioserosa driver) to express dominant-negative Moesin. In our experience, this also delays dorsal closure and it should result in a similar tracheal phenotype as the one we see in kayak embryos.

      Minor comments

      Page 2 Line 9/10 The sentence "tracheal tubes branch and migrate over neighbouring tissues of different biochemical and mechanical properties to ventilate them." should be rewritten. Tracheal cells do not migrate over other tissues to ventilate them.

      We meant to say that tracheal cells migrate over other tissues at the same time as they branch and interconnect to allow gas exchange in their surroundings after tracheal morphogenesis is completed. Ventilation is used here as a synonym for gas exchange or breathing. We will rephrase this if the reviewer considers it confusing.

      Page 2 Line 24/25 The sentence "It has been generally assumed that trunks reach the dorsal side of the embryo because of the pulling forces of dorsal branch migration." needs to be backed up by a reference.

      As explained above, we will rephrase this sentence.

      Page 7 Line 32/23 In this sentence, the references are not related to dorsal closure "Similarly, the signals that regulate epidermal dorsal closure do not participate in tracheal development, or vice versa (Letizia et al., 2023; Reichman-Fried et al., 1994)."

      Our goal in this sentence was to explain that while JNK is required for proper epidermal dorsal closure, loss of JNK signaling in the trachea does not affect tracheal development (as shown by Letizia et al., 2023). At the same time, Reichman-Fried et al., 1994 described the phenotypes of loss of breathless (btl). We will remove this last reference as the work does not study the epidermis. We will rephrase the sentence as: “Similarly, the signals that regulate epidermal dorsal closure do not participate in tracheal development; namely, JNK signaling (Letizia et al., 2023).”

      Page 12 Line 1 "Muscles attach to epidermal tendon cells through a dense meshwork of ECM" this sentence must be referenced.

      We will add the corresponding references for this statement: (Fogerty et al., 1994; Prokop et al., 1998; Urbano et al., 2009). We will change “dense” for “specialized”.

      Fig. S1- Single channel images (A'-C' and A'-C') should be presented in grayscale.

      Fig. S4- Single channel images (A'-D' and A'-D') should be presented in grayscale.

      We will add the grayscale, single-channel images for these figures.

      Reviewer #1 (Significance (Required)):

      The findings shown in this manuscript shed light on the interactions and cooperation between two organs, the tracheal system and the epidermis. These interactions are mediated by cell-ECM contacts which are important for the correct morphogenesis of both systems. The strengths of the work lie on its novelty and live analysis of these interactions. However, its weaknesses are related to some claims not completely backed by the data, some technical issues regarding imaging and some over-interpreted conclusions.

      This basic research work will be of interest to a broad cell and developmental biology community as they provide a functional advance on the importance of cell-ECM interactions for the morphogenesis of a tubular organ. It is of specific interest to the specialized field of tubulogenesis and tracheal morphogenesis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In this paper, the authors explore the relationships between two Drosophila tissues - the epidermis and tracheal dorsal trunk (DT) - that get dorsally displaced during mid-late embryogenesis. The show a nice temporal correlation between the movements of the epithelia during dorsal closure and DT displacement. They also show a correlation between the movement of an endogenously tagged version of collagen and the DT, suggesting that the ECM may contribute to this coordinated movement. Through high magnification TEM, they show that tracheal cells make direct contact with the subset of epithelial cells, known as tendon cells, that also serve as muscle attachment sites. In between these contact sites, tracheae are separated from the epithelia by the muscles. Furthermore, the TEMs and confocal imaging of tracheal cells expressing a membrane marker at these contact sites show that the tracheal cells are extending filopodia toward the tendon cells. The authors then explore how a variety of perturbations to the ECM produced by the tendon and DT cells affect DT and epithelial movement. They find that expressing membrane-associated matrix metalloproteases (MMP1 or MMP2) in tendon cells as well as perturbations in integrin or integrin signaling components leads to delays in dorsal displacement as well as defective lengthening of the tracheal DT tubes. They find that defects in the association between the tracheal and epidermal ECM attachments affect dorsal displacement of the epidermis, disrupting dorsal closure.

      Major comments: I like the goals of this paper testing the idea that the ECM plays important roles in the coordination of tissue placement, and I think they have good evidence of that from this study. However, I disagree with the conclusions of the authors that disrupting contact between DT and the tendon cells has no effect on DT dorsal displacement. DT tracheal positioning is clearly delayed; the fact that it takes a lot longer indicates that the ECM does affect the process. It's just that there are likely backup systems in place - clearly not as good since the tracheal tubes end up being the wrong length.

      We agree with this view; in our deGradFP experiments we see a delayed DT displacement. We focused our analyses on the coordination with epidermal remodelling, which remained unaltered, but we in fact see a delayed progression in dorsal displacement of both tissues (Figure 5I-J). We will emphasize this in the corresponding section of the Results.

      It also seems important that the parts of the DT where the dorsal branches (DB) emanate are moving dorsally ahead of the intervening portions of the trachea. This suggests to me that the DB normally does contribute to DT dorsal displacement and that this activity may be what helps the DT eventually get into its final position. The authors should test whether the portions of the DT that contact the DB are under tension. If the DB migration is providing some dorsal pulling force on the DT, this may also contribute to the observed increases in DT length observed with the perturbations of the ECM between the tendon cells and the trachea - if tube lengthening is a consequence of the pulling forces that would be created by parts of the trachea moving dorsally ahead of the other parts. Here again, it would be good to test if the DT itself is under additional tension when the ECM is disrupted.

      • *

      We thank the reviewer for the suggested experiments. We agree with the fact that the dorsal branches should pull on the dorsal trunk and that this interaction should generate tension. Unfortunately, we are unable to test this with the experiments proposed by the reviewer, but we propose an alternative strategy to overcome this. We understand that the reviewer suggests we do laser cut experiments in dorsal branches to see if there is a recoil in the opposite direction of dorsal branch migration. We carried out our laser cut experiments using a 2-photon laser through a visit to the EMBL imaging facility, using funds from a collaborative grant. Funding a second visit would require us to apply for extra funding, which would delay the preparation of the experiments. We are aware of UV-laser setups within our university, however, UV-laser cuts would also affect the epidermis above the dorsal branches, which we think might contribute to recoil we would expect to see.

      Instead of doing laser cuts, we have designed an experiment based on the suggestion of reviewer #1 of blocking Dpp signaling (with UAS-Dad), which would prevent the formation of dorsal branches. We expect that in this experimental setup, the trunk will bend ventrally in response to thepulling forces of the ventral branches. We will also co-express UAS-Dad (to prevent dorsal branch formation) and UAS-Mmp2 (to ‘detach’ the dorsal trunk from the epidermis), and we would expect to at least partially rescue the wavy trunk phenotype.

      Minor comments: The authors need to do a much better job in the intro and in the discussion of citing the work of the people who made many of the original findings that are relevant to this study. Many citations are missing (especially in the introduction) or the authors cite their own review (which most people will not have read) for almost everything (especially in the discussion). This fails to give credit to decades of work by many other groups and makes it necessary for someone who would want to see the original work to first consult the review before they can find the appropriate reference. I know it saves space (and effort) but I think citing the original work is important.

      • *

      The reviewer is right; we apologize for falling into this practice. We will reference the original works wherever it is needed.

      Figure 7 is not a model. It is a cartoon depicting what they see with confocal and TEM images.

      We will change the figure; we will include our interpretations of the phenotypes we observed under different experimental manipulations.

      Reviewer #2 (Significance (Required)):

      Overall, this study is one of the first to focus on how the ECM affects coordination of tissue placement. The coordination of tracheal movement with that of the epidermis is very nicely documented here and the observation that the trachea make direct contact with the tendon cells/muscle attachment sites is quite convincing. It is less clear from the data how exactly the cells of the trachea and the ECM are affected by the different perturbations of the ECM. It seems like this could be better done with immunostaining of ECM proteins (collagen-GFP?), cell type markers, and super resolution confocal imaging with combinations of these markers. What happens right at the contact site between the tendon cell and the trachea with the perturbation? I think that at the level of analysis presented here, this study would be most appropriate for a specialized audience working in the ECM or fly embryo development field.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary The manuscript by Sanchez-Cisneros et al provides a detailed description of the cellular interactions between cells of the Drosophila embryonic trachea and nearby tendon and epidermal cells. The researchers use a combination of genetic experiments, light sheet style live imaging and transmission electron microscopy. The live imaging is particularly clear and detailed, and reveals protruding cells. The results overall suggest that interactions mediated through the ECM contribute to development of trachea and dorsal closure of epidermis. One new aspect is the existence of dorsal trunk filipodia that are under tension and may impact tracheal morphogenesis through required integrin/ECM interactions.

      Major comments: - Are the key conclusions convincing? Generally, the key conclusions are well supported by the data, and the movies are very impressive. Interactions between the cell types are clearly shown, as is the correlations in their development. However, some of the images are challenging to decipher for a non-expert in Drosophila trachea, especially the EM images, and some of the data is indirect or a bit weak.

      We thank the reviewer for their observations. As mentioned above in response to Reviewer #1, we will add an overview image of the embryo we processed for TEM that is presented in Figure 2.

      The data related to failure of dorsal closure affecting trachea relies on one homozygous allele of one gene (kayak), and so this is somewhat weak evidence. Even though kay is not detected in trachea, there could be secondary effects of the mutation or another lesion on the mutant chromosome. The segments look a bit uneven in the mutant examples.

      • *

      The reviewer is right; as we proposed before, we will complement the kayak experiments with independent approaches that will delay dorsal closure.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Some of the experiments have low n values, especially in imaging experiments, so these may be more preliminary, but they are in concordance with other data.

      The problem we face in our live-imaging experiments is related to the probability of finding the experimental embryos. In most of our experiments we combine double-tissue labelling plus the expression of genetic tools. This generally corresponds to a very small proportion of the progeny. We will aim to have at least 4 embryos per condition.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Higher n-values would substantiate the claims. To strengthen the argument that dorsal closure affects trachea morphogenesis mechanically, the authors might consider using of a combination of kay mutant alleles or other mutant genes in this pathway to provide stronger evidence. Or they could try a rescue experiment in epidermis and trachea separately for the kay mutants.

      We think our experiments delaying dorsal closure using the Gal4/UAS system and a variety of drivers should address the point of the possible indirect effects of kay in tracheal development.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Imaging data can take awhile to obtain, but the genetic experiments could be done in a couple of months, and the authors should be able to obtain any needed lines within a few weeks.

      The reviewer is correct, we will be able to plan our crosses for the proposed experiments within a couple of months.

      • Are the data and the methods presented in such a way that they can be reproduced? Generally, yes. For the deGrad experiments, it is not clear how the fluorescent intensity was normalized - was this against a reference marker?

      Briefly, we used signals from within the embryo as internal controls. In the case of en-gal4, we normalized the signal to the sections of the embryo where en is not expressed and therefore, beta-integrin levels should not be affected. In the case of btl-gal4, we normalized against the signal surrounding the trunks which should also not be affected by the deGradFP system. We will elaborate on these analyses in the methods section.

      Are the experiments adequately replicated and statistical analysis adequate? There are several experiments with low n values, so this could fall below statistical significance. For example, data shown in Fig 1G: n=3; Fig 4D n=4, n=3; Fig 6J n=4

      As mentioned above, we will increase our sample sizes.

      Minor comments: - Specific experimental issues that are easily addressable. To make the TEM images more easily interpreted, it would be helpful to provide a fluorescent image of all the relevant cell types (especially trachea, epidermis, muscle, and tendon cells, plus segmental boundaries) labelled accordingly, so that reader can correlate them more easily with the TEM images. They might also include a schematic of an embryo to show where the TEM field of view is.

      We believe this should be addressed by adding the light microscopy section of the embryo with the TEM image overlaid as illustrated above.

      It is hard to be confident that the EM images reflect the cells they claim and that the filopodia are in fact that, at least for people not used to looking at these types of images.

      As we explained in the response to Reviewer #1, we will elaborate on the descriptions of our TEM data. We think that adding the reference micrograph will aid with the interpretations of the TEM images.

      • Are prior studies referenced appropriately? yes
      • Are the text and figures clear and accurate? yes

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? The writing could be revised to be a bit clearer. Since the results of the experiments do not support the initial hypothesis, I found it a bit confusing as I read along. It may help to introduce an alterative hypothesis earlier to make the paper more logical and easy to follow. To be more specific, On page 3, the authors say they "show that dorsal trunk displacement is mechanically coupled to the remodelling of the epidermis" and also in the results comment that "With two opposing forces pulling the trunks other factors likely participate in their dorsal displacement, but so far these have remained unstudied." But that doesn't end up being what they find. The results from figure 5 and related interpretation on page 17 says "cell-ECM interactions are important for proper trunk morphology, but not for its displacement." So this was confusing to read and I would encourage the authors to frame the issues a bit differently in terms of tube morphogenesis.

      We see how this might be confusing. We will rewrite the introduction so that the work is easier to follow. To achieve this, we will state from the beginning the mechanisms we anticipate that regulate trunk displacement: 1) adhesion to the epidermis, 2) pulling forces from the dorsal branches and 3) a combination of both.

      Some minor presentation issues: What orientation is the cross-sectional view in figure 1C and movie 1?

      We will add a dotted box that indicates the region that we turned 90° to show the cross-section.

      On page 12, the authors say the "Electron micrographs also suggested high filopodial activity" but activity suggests dynamics that are not clear from EM. This could be re-phrased.

      As the reviewer indicates, we cannot conclude dynamics from a static image. We will replace “suggested high filopodial activity” with “revealed filopodial abundance”.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. The results of the paper are significant in that they characterize a mechanical interaction between two tissue types in development, which are linked by the extracellular matrix that sits between them. It is not clear to me that this describes a "novel mechanism for tissue coordination" as stated in the abstract, but it does characterize this type of interaction in a detailed cellular way.

      • Place the work in the context of the existing literature (provide references, where appropriate). For specialists, the work identifies a novel protruding cell type in the fly embryonic trachea, and provides beautiful and detailed imaging data on tracheal development. The "wavy" trachea phenotype is also uncommon and very interesting, so this result could be linked to the few papers that also describe this phenotype and be built up.

      • State what audience might be interested in and influenced by the reported findings. As it stands, this is most interesting for a specialized audience because it requires some understanding of the development of this system in particular. As it characterizes this to a new level of detail, it could be influential to those in the field. Some addition clarification of the results and re-framing could make the manuscript more clear and interesting for non-specialists.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. I work with Drosophila and have studied embryonic and adult cell types, although not trachea specifically. I am familiar with all the genetic techniques and imaging techniques used here.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Sanchez-Cisneros et al provides a detailed description of the cellular interactions between cells of the Drosophila embryonic trachea and nearby tendon and epidermal cells. The researchers use a combination of genetic experiments, light sheet style live imaging and transmission electron microscopy. The live imaging is particularly clear and detailed, and reveals protruding cells. The results overall suggest that interactions mediated through the ECM contribute to development of trachea and dorsal closure of epidermis. One new aspect is the existence of dorsal trunk filipodia that are under tension and may impact tracheal morphogenesis through required integrin/ECM interactions.

      Major comments:

      • Are the key conclusions convincing?

      Generally, the key conclusions are well supported by the data, and the movies are very impressive. Interactions between the cell types are clearly shown, as is the correlations in their development. However, some of the images are challenging to decipher for a non-expert in Drosophila trachea, especially the EM images, and some of the data is indirect or a bit weak.

      The data related to failure of dorsal closure affecting trachea relies on one homozygous allele of one gene (kayak), and so this is somewhat weak evidence. Even though kay is not detected in trachea, there could be secondary effects of the mutation or another lesion on the mutant chromosome. The segments look a bit uneven in the mutant examples. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Some of the experiments have low n values, especially in imaging experiments, so these may be more preliminary, but they are in concordance with other data. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Higher n-values would substantiate the claims. To strengthen the argument that dorsal closure affects trachea morphogenesis mechanically, the authors might consider using of a combination of kay mutant alleles or other mutant genes in this pathway to provide stronger evidence. Or they could try a rescue experiment in epidermis and trachea separately for the kay mutants. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Imaging data can take awhile to obtain, but the genetic experiments could be done in a couple of months, and the authors should be able to obtain any needed lines within a few weeks. - Are the data and the methods presented in such a way that they can be reproduced?

      Generally, yes. For the deGrad experiments, it is not clear how the fluorescent intensity was normalized - was this against a reference marker? - Are the experiments adequately replicated and statistical analysis adequate?

      There are several experiments with low n values, so this could fall below statistical significance. For example, data shown in Fig 1G: n=3; Fig 4D n=4, n=3; Fig 6J n=4

      Minor comments:

      • Specific experimental issues that are easily addressable.

      To make the TEM images more easily interpreted, it would be helpful to provide a fluorescent image of all the relevant cell types (especially trachea, epidermis, muscle, and tendon cells, plus segmental boundaries) labelled accordingly, so that reader can correlate them more easily with the TEM images. They might also include a schematic of an embryo to show where the TEM field of view is.

      It is hard to be confident that the EM images reflect the cells they claim and that the filopodia are in fact that, at least for people not used to looking at these types of images. - Are prior studies referenced appropriately?

      yes - Are the text and figures clear and accurate?

      yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The writing could be revised to be a bit clearer. Since the results of the experiments do not support the initial hypothesis, I found it a bit confusing as I read along. It may help to introduce an alterative hypothesis earlier to make the paper more logical and easy to follow. To be more specific, On page 3, the authors say they "show that dorsal trunk displacement is mechanically coupled to the remodelling of the epidermis" and also in the results comment that "With two opposing forces pulling the trunks other factors likely participate in their dorsal displacement, but so far these have remained unstudied." But that doesn't end up being what they find. The results from figure 5 and related interpretation on page 17 says "cell-ECM interactions are important for proper trunk morphology, but not for its displacement." So this was confusing to read and I would encourage the authors to frame the issues a bit differently in terms of tube morphogenesis.

      Some minor presentation issues:

      What orientation is the cross-sectional view in figure 1C and movie 1? On page 12, the authors say the "Electron micrographs also suggested high filopodial activity" but activity suggests dynamics that are not clear from EM. This could be re-phrased.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The results of the paper are significant in that they characterize a mechanical interaction between two tissue types in development, which are linked by the extracellular matrix that sits between them. It is not clear to me that this describes a "novel mechanism for tissue coordination" as stated in the abstract, but it does characterize this type of interaction in a detailed cellular way. - Place the work in the context of the existing literature (provide references, where appropriate).

      For specialists, the work identifies a novel protruding cell type in the fly embryonic trachea, and provides beautiful and detailed imaging data on tracheal development. The "wavy" trachea phenotype is also uncommon and very interesting, so this result could be linked to the few papers that also describe this phenotype and be built up. - State what audience might be interested in and influenced by the reported findings.

      As it stands, this is most interesting for a specialized audience because it requires some understanding of the development of this system in particular. As it characterizes this to a new level of detail, it could be influential to those in the field. Some addition clarification of the results and re-framing could make the manuscript more clear and interesting for non-specialists. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I work with Drosophila and have studied embryonic and adult cell types, although not trachea specifically. I am familiar with all the genetic techniques and imaging techniques used here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this paper, the authors explore the relationships between two Drosophila tissues - the epidermis and tracheal dorsal trunk (DT) - that get dorsally displaced during mid-late embryogenesis. The show a nice temporal correlation between the movements of the epithelia during dorsal closure and DT displacement. They also show a correlation between the movement of an endogenously tagged version of collagen and the DT, suggesting that the ECM may contribute to this coordinated movement. Through high magnification TEM, they show that tracheal cells make direct contact with the subset of epithelial cells, known as tendon cells, that also serve as muscle attachment sites. In between these contact sites, tracheae are separated from the epithelia by the muscles. Furthermore, the TEMs and confocal imaging of tracheal cells expressing a membrane marker at these contact sites show that the tracheal cells are extending filopodia toward the tendon cells. The authors then explore how a variety of perturbations to the ECM produced by the tendon and DT cells affect DT and epithelial movement. They find that expressing membrane-associated matrix metalloproteases (MMP1 or MMP2) in tendon cells as well as perturbations in integrin or integrin signaling components leads to delays in dorsal displacement as well as defective lengthening of the tracheal DT tubes. They find that defects in the association between the tracheal and epidermal ECM attachments affect dorsal displacement of the epidermis, disrupting dorsal closure.

      Major comments: I like the goals of this paper testing the idea that the ECM plays important roles in the coordination of tissue placement, and I think they have good evidence of that from this study. However, I disagree with the conclusions of the authors that disrupting contact between DT and the tendon cells has no effect on DT dorsal displacement. DT tracheal positioning is clearly delayed; the fact that it takes a lot longer indicates that the ECM does affect the process. It's just that there are likely backup systems in place - clearly not as good since the tracheal tubes end up being the wrong length. It also seems important that the parts of the DT where the dorsal branches (DB) emanate are moving dorsally ahead of the intervening portions of the trachea. This suggests to me that the DB normally does contribute to DT dorsal displacement and that this activity may be what helps the DT eventually get into its final position. The authors should test whether the portions of the DT that contact the DB are under tension. If the DB migration is providing some dorsal pulling force on the DT, this may also contribute to the observed increases in DT length observed with the perturbations of the ECM between the tendon cells and the trachea - if tube lengthening is a consequence of the pulling forces that would be created by parts of the trachea moving dorsally ahead of the other parts. Here again, it would be good to test if the DT itself is under additional tension when the ECM is disrupted.

      Minor comments: The authors need to do a much better job in the intro and in the discussion of citing the work of the people who made many of the original findings that are relevant to this study. Many citations are missing (especially in the introduction) or the authors cite their own review (which most people will not have read) for almost everything (especially in the discussion). This fails to give credit to decades of work by many other groups and makes it necessary for someone who would want to see the original work to first consult the review before they can find the appropriate reference. I know it saves space (and effort) but I think citing the original work is important.

      Figure 7 is not a model. It is a cartoon depicting what they see with confocal and TEM images.

      Significance

      Overall, this study is one of the first to focus on how the ECM affects coordination of tissue placement. The coordination of tracheal movement with that of the epidermis is very nicely documented here and the observation that the trachea make direct contact with the tendon cells/muscle attachment sites is quite convincing. It is less clear from the data how exactly the cells of the trachea and the ECM are affected by the different perturbations of the ECM. It seems like this could be better done with immunostaining of ECM proteins (collagen-GFP?), cell type markers, and super resolution confocal imaging with combinations of these markers. What happens right at the contact site between the tendon cell and the trachea with the perturbation? I think that at the level of analysis presented here, this study would be most appropriate for a specialized audience working in the ECM or fly embryo development field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Sanchez-Cisneros and colleagues, examine how tracheal cell adhesion to the ECM underneath the epidermis helps shape the tracheal system. They show that if cell-ECM adhesion is perturbed the development of the tracheal system and the epidermis is disrupted. They also detect protrusions extending from the dorsal trunk cells towards the ECM.

      The work is novel, the figures are clear, and the questions are well addressed. However, I find that some of the claims are not completely supported by the data presented. I have some suggestions that will, I believe, clarify certain points.

      Major comments

      At the beginning of the results section as in the introduction the authors claim that "It is generally assumed that trunk displacement occurs due to tip cells pulling on the trunks so that they follow their path dorsally." This sentence is not referenced, and I do not know where it has been shown or proposed to be like this. In addition, the comparison with the ventral branches is also not referenced and the movie does not really show this. Forces generated by tracheal branch migration have been shown to drive intercalation (Caussinus E, Colombelli J, Affolter M. Tip-cell migration controls stalk-cell intercalation during Drosophila tracheal tube elongation. Curr Biol. 2008;18(22):1727-1734. doi:10.1016/j.cub.2008.10.062), but not dorsal trunk (DT) displacement. However, to rule out the possibility that DT displacement and the phenotype observed in XXX is due to dorsal branch pulling forces, the authors should analyze what happens in the absence of dorsal branches (in condition of Dpp signalling inhibition as in punt mutants or Dad overexpression conditions).

      I am concerned about the TEM observations. The authors claim they can identify tracheal cells by their lumen (Fig. 2 C'). However, at stage 15, the tracheal lumen should be clearly identifiable, and the interluminal DT space should be wider relative to the size of the cells. In this case, there is nothing telling us that we are not looking at a dorsal branch or lateral trunk cell. Furthermore, at embryonic stage 15, the tracheal lumen is filled with a chitin filament, which is not visible in these micrographs. Also, there is quite a lot of tissue detachment and empty spaces between cells, which might be a sign of problems in sample fixing. Better images and more accurate identification of dorsal trunk cells is necessary to support the claim that "These experiments revealed a novel anatomical contact between the epidermis and tracheal trunks".

      Timelapse imaging of the protrusions in DT cells is done with frames every 4 minutes (Video S3). This is not enough to properly show cellular protrusions and the images do not really show interaction with the epidermis. Video S4 has a better time resolution but it is very short and only shows the cut moment. Video S4, shows the cut, but the reported (and quantified recoil) is not clear. Nevertheless, the results are noteworthy and should be further analysed. Provided these embryos survive, would it be possible to check if embryos after laser cutting will develop wavy DTs?

      What happens to the larvae under the genetic conditions presented in Fig.S3? Do they reach pupal stages? Do these animals reach adult stages?

      The kayak phenotypes are very interesting and perhaps the authors could explore them more. As in inhibition of adhesion to the ECM, kay mutants display wavy dorsal trunks. Do they have defective adhesion? Fos being a transcription factor, this is a possibility. The authors should at least discuss the kay phenotypes more extensively and present a suitable hypothesis for the phenotype.

      Minor comments

      Page 2 Line 9/10 The sentence "tracheal tubes branch and migrate over neighbouring tissues of different biochemical and mechanical properties to ventilate them." should be rewritten. Tracheal cells do not migrate over other tissues to ventilate them.

      Page 2 Line 24/25 The sentence "It has been generally assumed that trunks reach the dorsal side of the embryo because of the pulling forces of dorsal branch migration." needs to be backed up by a reference.

      Page 7 Line 32/23 In this sentence, the references are not related to dorsal closure "Similarly, the signals that regulate epidermal dorsal closure do not participate in tracheal development, or vice versa (Letizia et al., 2023; Reichman-Fried et al., 1994)."

      Page 12 Line 1 "Muscles attach to epidermal tendon cells through a dense meshwork of ECM" this sentence must be referenced.

      Fig. S1- Single channel images (A'-C' and A'-C') should be presented in grayscale.

      Fig. S4- Single channel images (A'-D' and A'-D') should be presented in grayscale.

      Significance

      The findings shown in this manuscript shed light on the interactions and cooperation between two organs, the tracheal system and the epidermis. These interactions are mediated by cell-ECM contacts which are important for the correct morphogenesis of both systems. The strengths of the work lie on its novelty and live analysis of these interactions. However, its weaknesses are related to some claims not completely backed by the data, some technical issues regarding imaging and some over-interpreted conclusions.

      This basic research work will be of interest to a broad cell and developmental biology community as they provide a functional advance on the importance of cell-ECM interactions for the morphogenesis of a tubular organ. It is of specific interest to the specialized field of tubulogenesis and tracheal morphogenesis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their careful and thoughtful read of our work.

      Reviewer 1 helpfully suggested that the audience might not know what fuzzy matching is. The manuscript contains the following explanation of fuzzy matching:

      "That subjects had almost no exact matches to SARS-CoV-2-specific IGH sequences did not exclude the possibility that they have sequences that are functionally similar to these reference sequences. The same possibility exists for TRBs. A standard method for finding similar sequences is using the Levenshtein (edit) distance. Sequences with a distance of less than or equal to a tolerance t are considered similar (for example, sequences that differ by no more than t=1 amino acid). This is known as “fuzzy matching” with tolerance t. (Note that exact matches are just fuzzy matches with tolerance 0.)"

      We now also add the word "approximate" in conjunction with earlier uses of the word "fuzzy."

      Reviewer 2 asked whether we "focused on potential contributions [to CDR3 length variations] based on germline gene usage, rather than directly observed contributions from the V and J segments within the CDR3 regions;" the answer is, the latter. Reviewer 2 also pointed out that it would be valuable to have HLA typing for a more comprehensive analysis. We wholeheartedly agree and have added a sentence to this effect in the discussion.

      Reviewer 3 had several specific comments. The first was regarding the overall implication of the study. There are several:

      • binding capacity is as predictive as, and more robust than, prior approaches. As we write: "We found that repertoires’ binding capacity to known SARS-CoV-2-specific CD4+ TRBs performs as well as the best hand-tuned approximate or “fuzzy” matching at predicting a protective level of NAbs, while also being more robust to repertoire sample size and not requiring hand-tuning."

      • the importance of looking for unexpected patterns, for example in non-productive joins as was done here, and for global small-scale perturbations that together result in unavoidable signals. As we write, "B- and T-cell adaptive responses to SARS-CoV-2 infection and vaccination are surprising, subtle, and diffuse," and "One open question is to what extent infection affects antibody and TCR repertoires as a whole vs. enriching specific clones within it. One can refer to these ends of the continuum of possible effects as “diffuse” vs. “precise.”"

      • caution against over-interpreting correlations with specific gene segments and. As we write: "With these caveats in mind, to our knowledge previous studies have identified 20 IGH V genes to be enriched in sequences produced during various immune responses to SARS-CoV-2.8–16 Given that human genomes encode 54 IGH V genes,17 collectively these studies implicate 37% of V genes in the response to this single viral exposure, indicating that the SARS-CoV-2 response is either quite broad within individuals, quite heterogeneous among individuals, or both."

      Each of these challenges prevailing approaches, understanding, and conclusions about patterns and signatures in repertoire sequence. We should hope this would be of some benefit.

      Reviewer 3 also asked what type of vaccines the participants received. We have now clarified that they all received an mRNA vaccine: 80% receiving Pfizer Comirnaty and the rest receiving Moderna Spikevax.

      We looked at anti-spike neutralizing antibodies because this is where the evidence for neutralization is strongest. It would have been great to have diagnostics for every protein as well as Fc function, but these were not available and therefore not possible to study.

      Reviewer 3 noted that we mention that it is impossible to know a priori what study size would be adequate to identify public sequences comprehensively in COVID-19 and asks if 251 individuals are enough. Assuming this is in reference to the size of our study, we would like to point out that this study does not claim to identify public sequences comprehensively. The rationale is more is better. The statistics tell the reader the extent to which to reject the null hypotheses put forth.

      Regarding comorbidities: we probably could perform an analysis on their impact in a future study. We thank the reviewer for this idea.

      Regarding timing: samples were collected from the vaccinee cohort 4 to 84 days (mean = 44.3 days, standard deviation = 15.3 days) after administration of the initial vaccine dose. Supplementary Figure S13 shows sampling times vs. NAb titers.

      We feel the length of the introduction is required to contextualize the implications and benefits of this study.

      We were unable to find the typos referred to but did run the manuscript through spelling and grammar checks again. We thank the Reviewer for the thoughtful attentiveness.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study by Braun et al. looked at B and T cell receptor repertoires in SARS-CoV-2 infected and vaccinated individuals in comparison to healthy controls to evaluate the impact on neutralizing antibody titers. The results are clearly presented. As expected, vaccination and infection have differential effects on the repertoires. The major finding is that vaccinated and infected individuals with more SARS-CoV-2-specific TRBs have higher neutralizing antibody titers.

      Major comments:

      • It is not clear what the overall implication of this study is? What are the benefits?

      • The authors did not specify what type of vaccines the participants received. If different vaccines were used, a comparison is necessary.

      • The authors only looked at anti-spike neutralizing antibodies while other SARS-CoV-2 proteins could have a different impact. It is also known that Fc-functions participate to protection and should be studied.

      • Line 55 the authors mentioned it's difficult to know what study size is enough to represent the population. What is the rationale for including 251 individuals? Is this enough to represent the population?

      • With so many comorbidities in the different cohorts, could the authors perform an analysis of the impact of such comorbidities?

      • The timing of the SARS-CoV-2 infection or vaccination of the individuals is missing. Were all samples collected at the same time? As we know neutralizing antibodies are waning over time, it is important to include this data.

      Minor comments:

      • Introduction is quite long and needs to be summarized better.

      • Include a table with all the abbreviations

      • Missing data in the individuals demographics table

      • Typo line 65

      • Typo in the methods section

      Significance

      This study highlights the effects of SARS-CoV-2 vaccination versus infection on antibody and T cell repertoires. Increasing evidence shows the differential effects of vaccination compared to infection. This substantial study provides new data to the field. However, for a virus for which many vaccines and treatments have already been developed and proven effective, the implications and benefits to the field should be more clearly explained.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      SUMMARY

      The paper focuses on identifying signatures in specific antibody and T-cell receptor repertoires related to SARS-CoV-2 infection and vaccination within a cohort of 250 patients. This study addresses the limitations of prior research, which often relied on small sample sizes, possibly leading to an underestimation of the variability in adaptive immune responses to SARS-CoV-2 infection and vaccination. Researchers analyzed functional features by sequencing IGH, TRD, and TRB, aiming to identify signatures that correlate antibody and T-cell responses with SARS-CoV-2 exposure.

      MAJOR COMMENTS

      • The study's major limitations, as acknowledged by the authors, primarily relate to the timing of sample collections during the pandemic and current methodological constraints, such as the challenge of reliably predicting receptor-antigen binding using a unified approach. Despite these challenges, the methodologies and statistical approaches employed were carefully designed to minimize potential biases. The tools and findings from this study could prove valuable for future research, particularly in this rapidly evolving field.

      • One intriguing finding was the observed pattern of IGH CDR3 lengths among vaccinated individuals. When investigating the contributions of germline genes to CDR3 regions as a potential explanation for length variations, did I understand correctly that authors focused on potential contributions based on germline gene usage, rather than directly observed contributions from the V and J segments within the CDR3 regions? This discovery highlights a potential impact of vaccination on V-D-J recombination machinery.

      • OPTIONAL For future studies, it may be valuable to have HLA typing for a more comprehensive analysis.

      Significance

      Overall, this manuscript uncovers previously undescribed patterns of immune responses to SARS-CoV-2 infection and vaccination. It is supported by a statistically robust methodological approach to effectively interpret the complex features resulting from exposure to a specific immunogen.

      The manuscript could be of broad interest for immunologists, clinicians and bioinformaticians.

      My area of expertise lies in the molecular biology of T cells, with a focus on applying multi-omics approaches (e.g., transcriptomics, epigenomics) to elucidate the molecular mechanisms governing T cell function and their role in the immune responses.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors sequenced B- and T-cell receptor repertoires (recombined immunoglobulin heavy-chain, TCRβ, and TCRδ genes) from the blood of infected, vaccinated, and control subjects (tested for negative). They focused on their hypervariable CDR3 regions and correlated this AIRRseq data with demographics and clinical findings from subject data. They investigated whether features of these repertoires could predict subjects' SARS-CoV-2 neutralizing antibody titer. They discovered that age affected NAb levels in vaccinated subjects but not infectees. Furthermore, they found that vaccination, but not infection, substantially affects non-productively recombined IGHs, and that repertoires' binding capacity to known SARS-CoV-2-specific CD4+ TCRβ performs as well as the best matching at predicting a protective level of NAbs. The overall conclusion from this dataset is that B- and T-cell adaptive responses to SARS-CoV-2 infection and vaccination are subtle and diffuse.

      The data support the claims and the conclusions and do not require additional analyses.

      The study is robust and large, with over 250 subjects, and involved sequencing IGH and TCRdelta as well as TCRbeta, to a depth of over 100000 cells/subject.

      Significance

      The study is very specific and sectorial, and I do not think it is easily accessible to a broad audience of immunologists; despite this, however, the authors have managed to explain quite understandably the results achieved, the challenges faced, and the conclusions obtained. If the aim is to inform the scientific community that deals with immunology, I suggest not assuming that the audience knows what fuzzy is. So, I would recommend explaining the statistical tools used in a few words.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1.1. It would be helpful if the authors could discuss whether there is any correlation between cryptic sites and the extent of experimental validation in the Phosphosite database (e.g. those that were only identified in one or a few MS experiments). It is difficult to determine stoichiometry of phosphorylation experimentally, but can any inference be made on the extent of phosphorylation of cryptic sites vs. more conventional sites located in IDRs or on the surface of globular domains?

      We thank the reviewer for this valuable suggestion. To investigate the extent of the experimental validation of phosphosites, we examined the number of supporting studies for each site reported in the PhosphoSitePlus database. Specifically, we summed the values of the LT_LIT (literature-based experiments), MS_LIT (mass spectrometry literature), and MS_CST (Cell Signaling Technology mass spectrometry) fields to count the number of independent studies supporting each phosphorylation site, either cryptic or non-cryptic. To visualize the results, we plotted the number of supporting references vs the relative solvent accessibility (RSA) distribution of phosphosites (Figure R1). The analysis revealed a direct correlation between the RSA of phosphosites and the number of studies supporting their phosphorylation. This observation may arise from an intrinsic difficulty in studying cryptic phosphosites due to their destabilizing effects on native proteins. Notably, no differences were observed in the number of supporting studies within cryptic phosphosites (Figure R1B). We have not mentioned these analyses in the new version of the manuscript. However, we would gladly add it if the editor or the reviewer advises accordingly.

      1.2. The authors note that a larger percentage of tyrosine phosphorylation sites are cryptic compared with serine/threonine sites. I assume that tyrosine itself is more highly enriched in the hydrophobic cores of proteins relative to serine or threonine, due to its bulky hydrophobic side chain. Is the increased proportion of cryptic tyrosine phosphorylation sites more, less, or the same as the proportion of tyrosine in hydrophobic cores relative to serine and threonine?

      We thank the reviewer for this insightful comment. As correctly noted, tyrosine residues tend to be enriched in the hydrophobic cores of proteins, as reflected by their generally lower relative solvent accessibility (RSA) values, regardless of phosphorylation state. This enrichment is likely due to the tyrosine side chain's bulky and partially hydrophobic nature. To address the reviewer's question, we compared the RSA distributions of phosphorylated tyrosine, serine, and threonine residues with that of the same residues non-phosphorylated in the human proteome (Figure R2). In order to statistically compare the two distributions, we employed the Mann-Whitney test. The large sample size inevitably yields very low p-values, even when the distributions differ mildly (pThr, pSer vs non-p Thr, Ser, p 1.3. Fig. 5D and E: I had some trouble interpreting these figures. Indicating where the native state is in the plots would be helpful (stated in text as lower right, but a rectangle on the plot would make this more obvious). The text discusses three metastable intermediates, but what is the fourth one shown on the figures (well A, close to the native state)? This could be more explicitly explained.

      We added the missing rectangles into the original Fig. 5D and E (see below Figure R3 and R4). The three metastable intermediates discussed in the original text reflect protein conformers in which the cryptic site is exposed to the solvent. Conversely, the fourth state, and the final native state, are conformations in which the site is already partially or fully cryptic. The observation that the masking of cryptic sites coincides with the latest folding steps allows us to hypothesize a mechanism by which cryptic phosphorylation may regulate protein folding. Following the reviewer's suggestion, we now specify more explicitly each conformation in the new version of the legends of the relative figures (text file with track changes, lines 950 and 1017).

      1.4. The fact that phosphomimetic mutations of cyptic sites in SMAD2 and CHK1 lead to lower expression levels and shorter half-lives is not surprising, given the expected disruption of the hydrophobic core by introduction of a charged residue. The results certainly show that if phosphorylated, these sites would decrease expression and half-life. With respect to half-life, however, if the authors are correct and cryptic sites are predominately phosphorylated co-translationally, one would expect that the half-life curves for the wt protein would not be a simple exponential, but would instead reflect two distinct populations: those that are phosphorylated during translation, and are almost immediately degraded, and those that escape phosphorylation and have the same half-life as the non-phosphorylatable mutant. Are the actual experimental results consistent with this two-population model? If not, this would be evidence that some of these cryptic sites can be exposed post-translation, either by thermal fluctuation or biological interactions.

      We thank the reviewer for this insightful point. The readout employed in our study (i.e., western blotting) measures the aggregate signal from the total protein population in the cell culture. It thus reflects average protein levels rather than the dynamics of individual molecules. As such, it is not well-suited to resolving coexisting populations with distinct half-lives. We agree that if phosphorylation of cryptic sites occurs strictly co-translationally, one might expect a biphasic decay curve. However, due to methodological constraints, our assay provides only a single exponential fit to the global turnover kinetics. While we cannot entirely exclude the possibility that cryptic sites may become exposed post-translationally (e.g., due to thermal fluctuations or interactions), our molecular dynamics simulations did not reveal such exposure events within the simulated timescales. Therefore, while the two-population model remains plausible in principle, our results are consistent with a co-translational phosphorylation and degradation model. Forthcoming experiments aimed at characterizing the phosphorylation of ribosome-associated nascent chains in the human proteome may further validate this conclusion.

      1.5. The authors make a point that cryptic phosphosites are more highly conserved than non-cryptic phosphosites, but it is not clear to me whether it is the side chain itself or its ability to be phosphorylated that is conserved. Supplemental Fig. 9, if I am interpreting it correctly, would suggest it is the residue itself and not its phosphorylation that is conserved. If so, wouldn't this suggest that phosphorylation of these cryptic sites is just an inevitable consequence of the conservation of serine, threonine, and tyrosine residues in hydrophobic core regions? If the authors have evidence that argues against this simple hypothesis, they should discuss it (e.g., cryptic phosphosites are more highly conserved in some cases than non-phosphorylated tyrosine, serine, and threonine residues that are not solvent accessible).

      We agree with the reviewer's interpretation. The higher conservation of cryptic phosphosites likely reflects the evolutionary constraint on hydrophobic core residues, which tend to be more conserved due to their role in structural stability. This conservation does not imply phosphorylation at those sites is functionally selected across species. Instead, when such residues are phosphorylated, as we observe in the human proteome, the effect is often destabilizing and associated with protein degradation. Our analysis does not establish that the phosphorylation of cryptic residues is conserved across species, only that the residues themselves are. We appreciate the reviewer's suggestion and now explicitly discuss this point in the revised manuscript to clarify the distinction between residue conservation and phosphorylation conservation (text file with track changes, line 618)

      1.6. Regarding the evolutionary conservation of cryptic sites, have the authors taken into consideration that tyrosine-specific kinases, phosphatases, and reader domains first appeared in the first metazoans, and are for the most part not seen in non-metazoan eukaryotes? I notice some of the proteomes used for the conservation analysis include plants and yeast, which lack most tyrosine phosphorylation.

      We thank the reviewer for this insightful comment. In response to the suggestion, we have recalculated the entropic conservation score by restricting the analysis to metazoan species. This analysis ensures that the evolutionary context more accurately reflects the presence and functional relevance of tyrosine-specific kinases, phosphatases, and reader domains. The comparison between the entropic score distribution calculated by including or not non-metazoan orthologues show statistically significant differences for both serine and threonine, and tyrosine. However, the large sample sizes translate inevitably into statistically significant p-values, even when the differences in mean are minimal and the standard deviations relatively small. To better assess the practical relevance of these differences, we calculated Cohen's d as a measure of effect size (Table R1). The coefficient helps assess the size and biological significance of a difference (>0.2 = small effect; >0.5 = medium effect; >0.8 = large effect). The analysis indicates a very modest deviation in entropic scores by including or not non-metazoan orthologues.

      1.7. I find the argument that phosphorylation of exposed core residues is part of normal protein quality control/proteostasis to be convincing. Can the authors provide any experimental evidence to support this model (for example, greater phosphorylation of cryptic sites under stress conditions)? I don't think these experiments are necessary, but would seem to be a logical next step and could be done quite easily through collaboration.

      We appreciate the reviewer's suggestion and fully agree that showing more significant phosphorylation of cryptic sites under stress conditions could represent an exciting future direction. We are conducting experiments on individual tumor suppressors such as p53 and PTEN, which harbor cryptic phosphosites, to test whether cellular stress conditions enhance phosphorylation at these positions. These studies assess whether such modifications contribute to altered protein stability or function in stress or disease contexts, particularly cancer. We plan to communicate these results in forthcoming publications and are currently open to collaborations to broaden this line of investigation.

      1.8. The authors note at the end of the discussion that targeting cryptic phosphosites might be a strategy to selectively degrade some proteins in cancer. Practically, how would this work? I can't think of how, but perhaps the authors can provide more specific suggestions.

      We thank the reviewer for raising this important point. One promising approach to therapeutically exploit cryptic phosphosites builds on the PPI-FIT principles (Pharmacological Protein Inactivation by Folding Intermediate Targeting). This strategy targets transient structural pockets appearing only in folding intermediates (Spagnolli et al., Comm Biology 2021). In this context, kinases that phosphorylate cryptic sites could be modulated, either inhibited or redirected, so that misfolded or oncogenic proteins are selectively marked for degradation. For example, selectively enhancing the phosphorylation of a cryptic site on an oncogenic protein could destabilize it and promote its degradation via the proteasome. Conversely, preventing phosphorylation at a cryptic site on a tumor suppressor (e.g., by inhibiting the specific kinase) could enhance protein stability and restore function. While this concept is still emerging, it offers an exciting therapeutic avenue that complements our findings. We added a paragraph addressing this point in the discussion section of the new version of the manuscript (text file with track changes, line 716).

      1.9. Introduction: "It involves the addition of a phosphate to an hydroxyl group found in the side chain of specific amino acids, typically serine, threonine or tyrosine residues." Of course serine, threonine, and tyrosine are the only standard amino acids with a simple hydroxyl group, so "typically" is not needed here.

      We have removed the word "typically" to reflect the accurate chemical specificity of phosphorylation events (text file with track changes, line 82).

      1.10. In my view this is an important study, bringing rigor and a broad proteomic perspective to a phenomenon that (to my knowledge) had not been carefully examined previously. In terms of the big picture, I am of two minds. On the one hand, showing that phosphorylation of hydrophobic core residues exposed during translation or the early stages of folding can regulate steady state levels of some proteins provides an intriguing new mechanism to control the complement of proteins in the cell, and is potentially an area of regulation in normal physiology or in disease. On the other hand, if this is just part of the normal proteostatic mechanisms (hydrophobic core residues exposed for too long consign the protein to degradation, before it can lead to aggregation and other problems), that is a little less interesting to me. I think future work to tease out whether this mechanism is actually regulated and used by the cell to transmit information will be key. But the first step is showing that the phenomenon is real and widespread, and in my view this preprint accomplishes that goal very well.

      We appreciate the reviewer's thoughtful summary and agree that distinguishing between passive proteostatic clearance and active regulatory function is essential. Toward this goal, we plan to carry out a phosphoproteomic analysis of ribosome-associated nascent chains. By mapping phosphorylation events during translation, we aim to validate our cryptic phosphosite dataset in a co-translational context and potentially identify novel regulatory modifications. This approach will also help us assess whether phosphorylation at cryptic sites is modulated context-dependently, thereby supporting a role in regulated protein expression rather than solely quality control.

      2.1. Evolutionary comparison whether cryptic and non-cryptic sites are differently conserved. Two distinct distributions for cryptic and non-cryptic phospho-sites are observed and Figure 6 shows two entropy distributions of cryptic v non-cryptic. Here it is unclear whether this is significant given the different distributions of the two types when non modified.

      We thank the reviewer for raising this critical point. Due to the large sample sizes in our analysis, statistical tests inevitably yield very low p-values, even when differences in mean are minimal and the standard deviations relatively small. To better assess the practical relevance of these differences, we calculated Cohen's d as a measure of effect size (Table R2). The comparison between cryptic and non-cryptic phosphosites yielded an effect size (Cohen's d = 0.4028) slightly lower than the one obtained for residues lying within protein cores or exposed on protein surfaces (Cohen's d = 0.5126), both indicating a modest but meaningful shift in entropic scores. In contrast, the comparisons between cryptic phosphosites and all core residues, as well as non-cryptic phosphosites and all surface residues, showed negligible effect sizes (Cohen's d = 0.0245 and 0.1326, respectively). These findings suggest that while statistical significance is achieved in all cases, only the difference between cryptic and non-cryptic phosphosites, or core and surface residues, reflects a meaningful biological signal. We have now included these data in the new version of the manuscript (text file with track changes, line 544).

      2.2. The identification of buried modification sites and what the biological meaning / implications are is a very interesting topic. However PTM distribution on proteins is very skewed (many papers have identified ____clusters, hot spots, structural dependencies etc...) and therefore comparing modified sites on different residues and in different protein regions and with non-modified residues has to be very stringently controlled.

      We fully agree with the reviewer that PTM distribution is non-random and influenced by structural and functional constraints, making comparative analyses challenging. To ensure rigor, we implemented a robust computational pipeline. Unlike other PTMs found almost exclusively on solvent-exposed residues, phosphorylation uniquely showed a distinct subset of sites with extremely low solvent accessibility. This pattern held even after applying stringent structural and dynamical filters. Specifically, we excluded low-confidence residues, small or unstructured domains, and sites that become exposed due to thermal fluctuations, using the SPECTRUS-based dynamic analysis. While we cannot entirely rule out context-specific exposure in fully folded proteins (e.g., during protein-protein interactions), we validated selected cryptic sites experimentally, and our findings were consistent with the computational predictions. We believe this multilayered approach strengthens the reliability of our classification and distinguishes cryptic phosphosites from the broader PTM landscape.

      2.3. Very basic question: How do you assessed the RSA value of the residues from the alphafold structure. If it is sequence based, then it is unclear what the alpha fold structure actually contributes in this step? Although I assume it is structure based, it is not well described, only a reference.

      We calculated the RSA values using the Shrake-Rupley algorithm implemented in the MDTraj Python library. This is a structure-based metric: for each PTM-carrying residue, we evaluated the absolute SASA from the 3D AlphaFold structure and normalized it against the theoretical maximum exposure for that residue in a Gly-X-Gly tripeptide, as defined in Tien et al. (2013). Thus, AlphaFold structures directly provide the atomic coordinates necessary for solvent accessibility estimation. We have now revised the Methods section to describe this process more explicitly (text file with track changes, lines 110 and 113).

      2.4. Given that the different residues S,T,Y but also K for glycosylations etc. have a very different baseline RSA distribution, the distributions of modified residues as such are not so informative. Are the distributions of residues with the alpha fold LOD 0.65 different between modified and non-modified?

      2.5. Same point: it is very clear that "tyrosine presenting a larger proportion of cryptic phosphor-sites", as they mainly are within folded domains to begin with. The pattern of phosphorylation and clustering is very different between the modified amino acid residue T,S,Y and needs consideration, given the large number of PTMs, a simple distribution is not sufficient to argue.

      As already discussed in point 1.2 above, and correctly noted also by this reviewer, tyrosine residues are generally enriched in the hydrophobic cores of proteins, which is reflected by their typically low RSA, regardless of phosphorylation status. This tendency likely arises from the bulky and partially hydrophobic nature of the tyrosine side chain. To address the reviewer's question, we compared the RSA distributions of phosphorylated tyrosine, serine, and threonine residues with those of all these amino acids in the human proteome. We found that phosphorylated residues consistently exhibit higher RSA values than the overall averages for their respective amino acids. This is expected, as phosphorylation within protein cores would likely be destabilizing. Indeed, the existence of low-RSA phosphorylated residues, represents a significant deviation from the intrinsic tendency of tyrosine, serine, and threonine residues and suggests that cryptic sites may become accessible only transiently along protein folding pathways.

      2.6. Figure 3E (proteins need names in the figure ): the cryptic site T222 (Chk1) is not in the quasi ridged domain, it is in a light color region. What is actually the SPECTRUS cutoff? The Pidc is only one sentence in the main text? It says fewer than 80% intradomain contacts in rigid domains i.e. >0.8, right, but is the domain rigid?

      We have revised the original figure in the new version of the manuscript to include protein names, and clarified the domain assignments. The cryptic phosphosite T222 in Chk1 lies within a quasi-rigid domain, as identified by SPECTRUS. The color of the image does not reflect any structural property but instead it is used to distinguish different quasi-rigid domains. In particular, black regions identify unstructured domains, whereas shadows from dark grey to white identify quasi rigid domains. We apologize for the lack of clarity. We have corrected the figure legend accordingly (text file with track changes, line 912).

      There is no cutoff in SPECTRUS' identification of quasi-rigid domain. Non quasi-rigid domains are simply regions of the protein that SPECTRUS cannot process properly. Meaning regions that, due to the large degree of intrinsic fluctuations, cannot be modelled as quasi-rigid.

      We also expanded the description of Pidc in the main text to clarify that it quantifies the proportion of intra-domain contacts made by the phosphosite's side chain, and that a cutoff of {greater than or equal to}0.8 was used to retain only residues well-integrated within rigid domains (text file with track changes, line 243).

      We hope these updates will resolve the ambiguities noted and more clearly define the criteria used in our filtering pipeline.

      2.7. The evolutionary comparison (which is not my core expertise), seems again like comparing different things. Why not comparing cryptic and non-cryptic sites in the same protein regions? Also p-Y are, evolutionarily speaking, very different to p-S and p-T. How is this possibly considered in one distribution. p-Y analysis needs to be separated from the p-T and p-S analyses here.

      We want to clarify that our evolutionary analyses compare residues at the aligned positions in orthologous proteins across multiple species. This approach ensures that each cryptic or non-cryptic phosphosites is assessed in its native structural and sequence context. Therefore, the comparison is not between different regions but evaluates the evolutionary conservation of specific sites across species, allowing for a direct and meaningful comparison of cryptic and non-cryptic phosphosites. In order to address the second point, we report below the entropic score distributions for serine/threonine and tyrosine, separately (Figure R5).

      2.8. Have the authors thought of randomization of their data to see whether the distributions are significant?

      We are unsure we fully understand what the referee means by randomizing the data in this case.

      However, according to the mathematical definition of entropic score, the limit case in which, within each orthogroup, the phosphorylated amino acid is replaced by a completely random residue yields an entropic score of 1. The opposite limit, in which all members of the orthogroups have the same amino acid in the position of the phosphorylated amino acid, yields an ES of 0. We have added a paragraph in the methods to stress this point (text file with track changes, line 354).

      2.9. Labeling in Suppl Figures is insufficient. E.g. In S6 what are the various WT, A and D numbering, are this independent stable transfections/clones? Figure S7 what is R? Thank you for pointing this out. We have now corrected the missing information in the revised version of the manuscript (text file with track changes, from line 992 to 1008)

      2.10. Whether or not findings are "impressive" should be up to the reader, please remove these attributes in the text.

      We agree with the reviewer's suggestion. We have removed subjective language such as "impressive" from the revised manuscript to ensure an objective and neutral tone, allowing readers to independently evaluate the significance of our findings (text file with track changes, line 454).

      3.1. Residues with pLDDT scores below 65 were excluded from the analysis. The high-confidence measure applies to individual residues, regardless of whether the domains they belong to are also predicted with high confidence. Identifying the number of domains containing PTMs with overall high-confidence predictions could provide better insights into the orientation of modified residues within domain structures. To assess the relationship between residue-specific confidence and domain stability, we can analyze the correlation between high-confidence modified residues and the overall prediction accuracy of their domains. This could be quantified using the average error scores of domain residues. Additionally, using the average pLDDT score would indicate how many individual residues were predicted with high local structural confidence. In contrast, the average PAE (Predicted Aligned Error) score would provide insights into how well each residue's position is predicted relative to others within the domain, reflecting overall domain structural confidence.

      Our analysis excluded residues with pLDDT scores below 65 to ensure high local confidence. While pLDDT provides residue-level structural confidence, assessing domain-wide prediction quality offers additional insights into modified residues' spatial organization and exposure. However, a domain-level interpretation is currently limited by the format of AlphaFold structural predictions. Specifically, AlphaFold does not provide Predicted Aligned Error (PAE) matrices for sequences split into overlapping fragments, a method used for proteins longer than 2,700 amino acids. These fragment predictions are only available in the downloadable AlphaFold proteome archives, not through the web interface, and lack the global alignment metrics (such as PAE) necessary for analyzing domain stability or inter-residue confidence within the domain context.

      3.2. "Approximately 65% of proteins with cryptic phosphosites contained only one or two such residues, while less than 10% had five or more sites (Supp. Figure 3)." To better interpret this trend, it would be useful to analyze the total number of cryptic PTMs on proteins part of this study, including all modification types-not just phosphorylation. This would help determine whether the observed pattern is specific to phosphorylation or if it extends to other post-translational modifications as well.

      To compare the occurrence of different cryptic PTMs, we extended our analysis to include all cryptic post-translational modifications annotated in PhosphoSitePlus, including phosphorylation, glycosylation, methylation, sumoylation, and ubiquitination. The approach allowed us to assess whether the observed distribution of cryptic phosphosites is unique or represents a more general feature of all cryptic PTMs. We observed extensive variation among the different PTMs in the proportion of proteins carrying 1, 2, or more of the same cryptic PTM (see Table R3). However, it must be noted that the relatively low number of cryptic PTMs, excluding phosphorylation, could make it difficult to determine whether these patterns reflect actual biological trends or are simply influenced by the sample size. We have not included these data in the new version of the manuscript, but we would be willing to add them if the editor or the reviewer advises us accordingly.

      3.3. For the validation of cryptic sites, selecting domains under 200 amino acids was mentioned. However, was there also a minimum length threshold applied, similar to the filtering criteria used for false positives (less than 40 ignored)?

      The 40-residue threshold was applied because protein domains that are too small cannot be reliably subdivided into quasi-rigid domains. Trying to run SPECTRUS on structures with fewer than 40 residues inevitably returns a warning, reflecting the intrinsic cooperative nature of quasi-rigid domains. In fact, entities composed of too few amino acids cannot properly arrange themselves into 3D structures and tend to be disordered. The same reasoning was applied when choosing the proteins to simulate. In particular, for the refolding simulations, we selected protein domains possessing the following properties:

      1. Shorter than 200 amino acids to limit the computational demands.
      2. Long enough to fold into an ordered 3-dimensional conformation reliably.
      3. Have an experimentally determined NMR or X-ray crystal structure 3.4. To test their hypothesis that phosphorylation affects protein expression, they selected candidates for serine and threonine but excluded tyrosine. What were the reasons for not including tyrosine-related PTMs in their analysis?

      Our experimental assays relied on phosphomimetic substitutions to mimic the effect of phosphorylation. While serine/threonine phosphorylation can be reasonably mimicked by E or D substitutions, there is no reliable single-residue mimic for phosphotyrosine. Indeed, E or D substitutions do not recapitulate the structural or electronic features of pTyr. Given these limitations, we excluded tyrosine phosphosites from experimental validation to avoid generating inconclusive or misleading data.

      3.5. Do we know that the regulatory role of S300 on PYST1 is associated with the dual specificity of the phosphatase, and is this why it was selected as a negative regulator? While the regulatory roles of the other analyzed phosphosites on SMAD and CHK1 are discussed, there is limited mention of the specific role of S300 on PYST1 within the scope of the study.

      S300 of PYST1 was selected not due to known regulatory relevance, but for technical convenience. PYST1 is a relatively small protein, facilitating computational simulations. We also had suitable reagents for detection (i.e., expression vector), and importantly, S300 was identified as a false-positive cryptic phosphosite removed by our dynamic filtering. It was a practical and structurally matched negative control for validating our computational pipeline.

      3.6. When comparing the entropic scores between cryptic and non-cryptic residues, the medians are 0.43 and 0.52, respectively. Although this difference is not very high, they do observe that cryptic residues have lower scores than non-cryptic ones. The distributions also show greater overlap (Figure 6). I'm wondering if any statistical testing would help assess how distinct these two groups really are.

      We thank the reviewer for the comment raised by reviewer #2, for which we provide an answer above. Briefly, given our large sample sizes, statistical tests often yield very low p-values even for minor differences. To assess the biological significance, we calculated Cohen's d (Table R2 above). The effect size between cryptic and non-cryptic phosphosites (d = 0.4028) was modest but meaningful, and slightly lower than between core and surface residues (d = 0.5126).

      3.7. Why did the authors choose to rely on AlphaFold data instead of examining PDB structures? I didn't see any explanation or rationale provided for preferring AlphaFold predictions over experimentally determined structures from the PDB.

      We appreciate the value of this comment. We focused on AlphaFold to maximize proteome-wide coverage. Indeed, although PDB structures offer experimentally validated conformations, their sparse and uneven proteome coverage (particularly for membrane proteins, low-abundance factors, and intrinsically disordered regions) precludes a truly global analysis. AlphaFold2 models, by contrast, deliver accurate, full-length structures for nearly the entire human proteome, enabling unbiased, large-scale mapping of cryptic phosphosites. Nonetheless, we performed the same analysis using high-resolution structures from the Protein Data Bank (PDB). The results were fully consistent with those based on AlphaFold predictions, indicating that our findings are consistent across the two databases (see Figure R6 below).

      3.8. Novelty - The concept that cryptic site modifications can dysregulate signaling in cancer and other diseases is known, but systematically categorizing PTM sites into cryptic and non-cryptic to generate hypotheses for a wide range of identified PTMs remains an underdeveloped approach. This study establishes a framework for classifying PTMs based on their structural accessibility, integrating AlphaFold predictions, molecular dynamics simulations, solvent accessibility analysis, and phylogenetic conservation metrics. This approach not only enhances our understanding of PTM-mediated regulatory mechanisms but also provides a foundation for exploring how cryptic modifications contribute to protein function, stability, and disease progression.

      We appreciate the reviewer's comment. To our knowledge, this is the first study to introduce and define "cryptic phosphosites" as a structurally distinct and functionally relevant subset of phosphorylation sites. While some individual cases of buried amino acids influencing cancer-related proteins have been reported, no previous study has systematically mapped, filtered, and analyzed these sites across the human proteome using integrated structural, dynamical, evolutionary, and experimental criteria.

      3.9. The study relies primarily on predicted protein structures (e.g., AlphaFold), without exploring experimentally derived structures, which could provide more accurate and physiologically relevant insights.

      We have addressed this point above (see reply to #3.7).

      3.10. While the research demonstrates the impact of cryptic PTMs on protein function, it would be valuable to also investigate non-cryptic sites from their annotated data. By examining the effects of modifications on these non-cryptic sites, the study could further validate the importance of the cryptic versus non-cryptic classifications and help clarify the functional relevance of both types of sites.

      We thank the referee for this thoughtful suggestion. We compared the proportion of cryptic or non-cryptic phosphosites associated with cancer- and disease-related mutations in each group from the COSMIC and PTMVar datasets. The percentage of phosphosites associated with the two repositories is essentially the same for cryptic and non-cryptic sites. This observation suggests that, despite their different structural and regulatory features, both site types occur similarly in disease contexts (see Table R4). We have included these data in the new version of the manuscript (text file with track changes, line 1067; and new Supp. Table 3).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The methods applied in this study were thoughtfully designed. The study's goals and the experiments performed to test several of their hypotheses were meticulously planned, ensuring that the research approach was robust and aligned with the objectives. The experimental design effectively addressed the key questions and provided reliable insights into the role of cryptic PTMs in protein function and disease mechanisms.

      This study investigates cryptic post-translational modification (PTM) sites in the human proteome and their role in protein folding and expression, with significant implications for disease mechanisms. This work seeks to bridge the gap between the abundance of identified PTM sites and their regulatory roles in signaling pathways. A key focus of the study is on intermediate protein conformations-states that exist between fully folded and unfolded structures to determine whether these transient states contribute to disease by affecting protein synthesis, activity, stability, and degradation. To classify PTM sites as cryptic or non-cryptic, the authors used AlphaFold-predicted structures and relative solvent accessibility (RSA) scores, excluding those within quasi-rigid domain interfaces. This enabled them to create a database of mapped PTM sites, distinguishing based on their cryptic nature. Their analysis revealed that most PTMs occur at solvent-exposed residues, but unexpectedly, one-third of tyrosine phosphosites were cryptic. To assess the impact of cryptic phosphorylation on protein expression, they performed molecular dynamics (MD) simulations on SMAD and CHK1 phosphsites, showing that cryptic sites can become transiently exposed during protein folding. Their computational simulations further supported the finding that this exposure enhancing the chances of being modified and ultimately a potential mechanism for destabilization of its structure (due to that modification) to trigger degradation in physiological conditions. Experimentally, western blotting and protein half-life measurements confirmed that phosphomimetic substitutions affected protein expression, supporting their hypothesis that cryptic phosphorylation can influence protein stability and function. From an evolutionary and functional perspective, their phylogenetic analysis using entropy scores indicates that cryptic sites are more conserved. They also show that the cryptic PTM sites identified in this study were found to be substituted by phosphomimetic mutations in tumor-suppressor proteins, leading to dysregulation of their function and suppression of downstream signaling essential for tumor cell death. This study provides a framework for mapping cryptic PTM sites and understanding their role within intermediate protein folding states. By linking cryptic PTMs to their effects on protein stability, signaling pathways, and disease progression, the findings highlight a potential regulatory mechanism through which cryptic modifications contribute to cancer and other diseases.

      Minor revisions

      1. Result 1 - Residues with pLDDT scores below 65 were excluded from the analysis. The high-confidence measure applies to individual residues, regardless of whether the domains they belong to are also predicted with high confidence. Identifying the number of domains containing PTMs with overall high-confidence predictions could provide better insights into the orientation of modified residues within domain structures. To assess the relationship between residue-specific confidence and domain stability, we can analyze the correlation between high-confidence modified residues and the overall prediction accuracy of their domains. This could be quantified using the average error scores of domain residues. Additionally, using the average pLDDT score would indicate how many individual residues were predicted with high local structural confidence. In contrast, the average PAE (Predicted Aligned Error) score would provide insights into how well each residue's position is predicted relative to others within the domain, reflecting overall domain structural confidence.
      2. "Approximately 65% of proteins with cryptic phosphosites contained only one or two such residues, while less than 10% had five or more sites (Supp. Figure 3)." To better interpret this trend, it would be useful to analyze the total number of cryptic PTMs on proteins part of this study, including all modification types-not just phosphorylation. This would help determine whether the observed pattern is specific to phosphorylation or if it extends to other post-translational modifications as well.
      3. For the validation of cryptic sites, selecting domains under 200 amino acids was mentioned. However, was there also a minimum length threshold applied, similar to the filtering criteria used for false positives (less than 40 ignored)?
      4. To test their hypothesis that phosphorylation affects protein expression, they selected candidates for serine and threonine but excluded tyrosine. What were the reasons for not including tyrosine-related PTMs in their analysis?
      5. Do we know that the regulatory role of S300 on PYST1 is associated with the dual specificity of the phosphatase, and is this why it was selected as a negative regulator? While the regulatory roles of the other analyzed phosphosites on SMAD and CHK1 are discussed, there is limited mention of the specific role of S300 on PYST1 within the scope of the study.
      6. When comparing the entropic scores between cryptic and non-cryptic residues, the medians are 0.43 and 0.52, respectively. Although this difference is not very high, they do observe that cryptic residues have lower scores than non-cryptic ones. The distributions also show greater overlap (Figure 6). I'm wondering if any statistical testing would help assess how distinct these two groups really are.
      7. Why did the authors choose to rely on AlphaFold data instead of examining PDB structures? I didn't see any explanation or rationale provided for preferring AlphaFold predictions over experimentally determined structures from the PDB.

      Significance

      Novelty - The concept that cryptic site modifications can dysregulate signaling in cancer and other diseases is known, but systematically categorizing PTM sites into cryptic and non-cryptic to generate hypotheses for a wide range of identified PTMs remains an underdeveloped approach. This study establishes a framework for classifying PTMs based on their structural accessibility, integrating AlphaFold predictions, molecular dynamics simulations, solvent accessibility analysis, and phylogenetic conservation metrics. This approach not only enhances our understanding of PTM-mediated regulatory mechanisms but also provides a foundation for exploring how cryptic modifications contribute to protein function, stability, and disease progression.

      Strengths - This study benefits from its use of multiple validation methods and false-positive filtering, resulting in a high-confidence dataset of annotated PTM sites. The combination of computational predictions and experimental analyses strengthens the validity of their findings. This integrative approach enhances the reliability of the data and provides a comprehensive understanding of cryptic versus non-cryptic PTMs in protein regulation.

      Limitations

      1. The study relies primarily on predicted protein structures (e.g., AlphaFold), without exploring experimentally derived structures, which could provide more accurate and physiologically relevant insights.
      2. While the research demonstrates the impact of cryptic PTMs on protein function, it would be valuable to also investigate non-cryptic sites from their annotated data. By examining the effects of modifications on these non-cryptic sites, the study could further validate the importance of the cryptic versus non-cryptic classifications and help clarify the functional relevance of both types of sites.

      Audience - The broader implications of this work extend to biomedical research, drug discovery, and therapeutic development. Researchers in cell signaling and systems biology who aim to understand which modification sites are crucial for evaluating the outcomes of signaling pathways can benefit from the insights generated by this study. It provides a pathway for identifying novel drug targets and enhances our understanding of disease mechanisms, particularly in cancer and other diseases. Additionally, this work encourages and motivates computational biologists to develop more efficient methods for capturing protein folding dynamics, enabling more accurate hypotheses regarding the effects of specific PTM sites and how they influence protein function and disease progression.

      My expertise lies primarily in structural biology, with a strong background in developing and utilizing bioinformatics and computational tools. While I currently have less hands-on experience with experimental techniques, my comprehensive understanding of experimental methodologies, combined with an awareness of the expected outcomes, has enabled me to effectively evaluate and interpret experimental results.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review on Gasparotto et al "Mapping Cryptic Phosphorylation Sites in the Human Proteome"

      Gasparotte et al assess the solvent accessibility of 87,138 post-translationally modified amino acids in the human proteome (from phosphosite plus). There initial observation is that a large fraction of modified sites are buried, a finding that is pronounced for phosphorylation but not other modifications. Their approach is using alpha fold 3D structures (0.65 cut off) and RSA prediction to get a set of buried sites. Further refinement includes the removing of low-confidence segments (such as loops, linkers, or short disordered regions) and to use SPECTRUS to identified quasi-rigid domains. The idea is that quasi rigid domains may not breathe and thus will be modified during the synthesis or folding.

      They generated a final dataset of 10,606 cryptic T, S and Y phosphor-sites in 5,496 proteins and state that: "These data indicate that ~5% of all known phospho-sites are cryptic. Impressively, the number translates to ~33% of phosphorylated proteins in the human proteome presenting at least one cryptic phospho-site." They focus on S417 of the SMAD2, T382 of Chk1, known to be associated with loss of function effects or proteasomal degradation and S300 of PYST1 negative control. They stably express these proteins as phospho-mimicry or alanine substitution in HEK293. Expression levels were reduced in the phosphor-D- mutant versions and upon cycloheximide treatment a reduction of the turnover time for the phospho-D CHK1 was observed. I think we are looking a large clonal difference in the supplemental figures.

      The examples are supported by MD simulations that suggest that cryptic phospho-sites can occur during the folding process and affect protein homeostasis by drastically increasing degradation rate and leading to rapid turnover; Essentially the phospho-versions show a solvent exposure. Evolutionary comparison whether cryptic and non-cryptic sites are differently conserved. Two distinct distributions for cryptic and non-cryptic phospho-sites are observed and Figure 6 shows two entropy distributions of cryptic v non-cryptic. Here it is unclear whether this is significant given the different distributions of the two types when non modified. Finally, overlay of the sites with cancer mutations lists 221 mutations in COSMIC associated with cryptic phosphosites that have been annotated as cancer-related and 138 mutations in PTMVar linked to cancer and other human pathologies. The identification of buried modification sites and what the biological meaning / implications are is a very interesting topic. However PTM distribution on proteins is very skewed (many papers have identified cluster, hot spots, structural dependencies etc...) and therefore comparing modified sites on different residues and in different protein regions and with non-modified residues has to be very stringently controlled.

      Points for consideration

      • Very basic question: How do you assessed the RSA value of the residues from the alphafold structure. If it is sequence based, then it is unclear what the alpha fold structure actually contributes in this step? Although I assume it is structure based, it is not well described, only a reference.
      • Given that the different residues S,T,Y but also K for glycosylations etc. have a very different baseline RSA distribution, the distributions of modified residues as such are not so informative. Are the distributions of residues with the alpha fold LOD 0.65 different between modified and non-modified?
      • Same point: it is very clear that "tyrosine presenting a larger proportion of cryptic phosphor-sites", as they mainly are within folded domains to begin with. The pattern of phosphorylation and clustering is very different between the modified amino acid residue T,S,Y and needs consideration, given the large number of PTMs, a simple distribution is not sufficient to argue.
      • Figure 3 E (proteins need names in the figure ): the cryptic site T222 (Chk1) is not in the quasi ridged domain, it is in a light color region. What is actually the SPECTRUS cutoff? The Pidc is only one sentence in the main text? It says fewer than 80% intradomain contacts in rigid domains i.e. >0.8, right, but is the domain rigid?
      • The evolutionary comparison (which is not my core expertise), seems again like comparing different things. Why not comparing cryptic and non-cryptic sites in the same protein regions? Also p-Y are, evolutionarily speaking, very different to p-S and p-T. How is this possibly considered in one distribution. p-Y analysis needs to be separated from the p-T and p-S analyses here.
      • Have the authors thought of randomization of their data to see whether the distributions are significant?
      • Labeling in Suppl Figures is insufficient. E.g. In S6 what are the various WT, A and D numbering, are this independent stable transfections/clones? Figure S7 what is R?
      • Whether or not findings are "impressive" should be up to the reader, please remove these attributes in the text.

      Significance

      The identification of buried modification sites and what the biological meaning / implications are is a very interesting topic. However PTM distribution on proteins is very skewed (many papers have identified cluster, hot spots, structural dependencies etc...) and therefore comparing modified sites on different residues and in different protein regions and with non-modified residues has to be very stringently controlled.

      main conclusion: 5% of all known phospho-sites are cryptic, at least one in 1/3 of structured protein regions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This preprint uses bioinformatic and experimental approaches to explore the prevalence and consequences of the phosphorylation of residues normally buried in the hydrophobic core of proteins. By cross-referencing validated human phosphosites (PhosphositePlus) with the predicted 3D structures of the human proteome (from the AlphaFold predicted protein structure database), they identified potential "cryptic" phosphosites not expected to be solvent-accessible. They further refined the list using a variety of tools and conclude that a significant percentage (roughly 25%) of known phosphosites in folded domains are cryptic. They go on to experimentally test the consequences of mutating several of these sites in known proteins either to non-phosphorylateable or phospho-mimetic residues, and found that the phosphomimetic mutants had lower half-lives and average expression levels than either the wt or non-phosphorylatable versions. Finally, they show that putative cryptic phosphorylation sites are more highly conserved that those that are surface-accessible, and that some of these cryptic sites are found in tumor suppressor genes and that phosphomimetic mutations at these sites can be found in tumor mutation databases.

      Major comments:

      Overall the experimental approach is relatively straightforward, and in general the authors' interpretation of the results seems reasonable. There were several areas where I believe additional analysis or discussion might clarify the interpretation, however.

      1. It would be helpful if the authors could discuss whether there is any correlation between cryptic sites and the extent of experimental validation in the Phosphosite database (e.g. those that were only identified in one or a few MS experiments). It is difficult to determine stoichiometry of phosphorylation experimentally, but can any inference be made on the extent of phosphorylation of cryptic sites vs. more conventional sites located in IDRs or on the surface of globular domains?
      2. The authors note that a larger percentage of tyrosine phosphorylation sites are cryptic compared with serine/threonine sites. I assume that tyrosine itself is more highly enriched in the hydrophobic cores of proteins relative to serine or threonine, due to its bulky hydrophobic side chain. Is the increased proportion of cryptic tyrosine phosphorylation sites more, less, or the same as the proportion of tyrosine in hydrophobic cores relative to serine and threonine?
      3. Fig. 5D and E: I had some trouble interpreting these figures. Indicating where the native state is in the plots would be helpful (stated in text as lower right, but a rectangle on the plot would make this more obvious). The text discusses three metastable intermediates, but what is the fourth one shown on the figures (well A, close to the native state)? This could be more explicitly explained.
      4. The fact that phosphomimetic mutations of cyptic sites in SMAD2 and CHK1 lead to lower expression levels and shorter half-lives is not surprising, given the expected disruption of the hydrophobic core by introduction of a charged residue. The results certainly show that if phosphorylated, these sites would decrease expression and half-life. With respect to half-life, however, if the authors are correct and cryptic sites are predominately phosphorylated co-translationally, one would expect that the half-life curves for the wt protein would not be a simple exponential, but would instead reflect two distinct populations: those that are phosphorylated during translation, and are almost immediately degraded, and those that escape phosphorylation and have the same half-life as the non-phosphorylatable mutant. Are the actual experimental results consistent with this two-population model? If not, this would be evidence that some of these cryptic sites can be exposed post-translation, either by thermal fluctuation or biological interactions.
      5. The authors make a point that cryptic phosphosites are more highly conserved than non-cryptic phosphosites, but it is not clear to me whether it is the side chain itself or its ability to be phosphorylated that is conserved. Supplemental Fig. 9, if I am interpreting it correctly, would suggest it is the residue itself and not its phosphorylation that is conserved. If so, wouldn't this suggest that phosphorylation of these cryptic sites is just an inevitable consequence of the conservation of serine, threonine, and tyrosine residues in hydrophobic core regions? If the authors have evidence that argues against this simple hypothesis, they should discuss it (e.g., cryptic phosphosites are more highly conserved in some cases than non-phosphorylated tyrosine, serine, and threonine residues that are not solvent accessible).
      6. Regarding the evolutionary conservation of cryptic sites, have the authors taken into consideration that tyrosine-specific kinases, phosphatases, and reader domains first appeared in the first metazoans, and are for the most part not seen in non-metazoan eukaryotes? I notice some of the proteomes used for the conservation analysis include plants and yeast, which lack most tyrosine phosphorylation.
      7. I find the argument that phosphorylation of exposed core residues is part of normal protein quality control/proteostasis to be convincing. Can the authors provide any experimental evidence to support this model (for example, greater phosphorylation of cryptic sites under stress conditions)? I don't think these experiments are necessary, but would seem to be a logical next step and could be done quite easily through collaboration.
      8. The authors note at the end of the discussion that targeting cryptic phosphosites might be a strategy to selectively degrade some proteins in cancer. Practically, how would this work? I can't think of how, but perhaps the authors can provide more specific suggestions.

      Minor comment:

      1. Introduction: "It involves the addition of a phosphate to an hydroxyl group found in the side chain of specific amino acids, typically serine, threonine or tyrosine residues." Of course serine, threonine, and tyrosine are the only standard amino acids with a simple hydroxyl group, so "typically" is not needed here.

      Significance

      In my view this is an important study, bringing rigor and a broad proteomic perspective to a phenomenon that (to my knowledge) had not been carefully examined previously. In terms of the big picture, I am of two minds. On the one hand, showing that phosphorylation of hydrophobic core residues exposed during translation or the early stages of folding can regulate steady state levels of some proteins provides an intriguing new mechanism to control the complement of proteins in the cell, and is potentially an area of regulation in normal physiology or in disease. On the other hand, if this is just part of the normal proteostatic mechanisms (hydrophobic core residues exposed for too long consign the protein to degradation, before it can lead to aggregation and other problems), that is a little less interesting to me. I think future work to tease out whether this mechanism is actually regulated and used by the cell to transmit information will be key. But the first step is showing that the phenomenon is real and widespread, and in my view this preprint accomplishes that goal very well.

      I come from a background of studying post-translational modifications in signaling, hence my hope that a regulatory role can be found. But even if cryptic phosphorylation turns out to be unregulated, the work provides important new insight into normal proteostasis, and therefore is a valuable contribution. I should note that I don't have extensive expertise in bioinformatic methods or the computational tools to study protein dynamics, but I assume other reviewers will critically evaluate these methods.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements

      We sincerely thank all three reviewers for their thoughtful and constructive feedback. Your comments were invaluable in improving the clarity and quality of our work.

      In this study, we revisit a previously overlooked lipophilic dye, demonstrating its utility for live-cell imaging that transport in a non-vesicular pathway and label autophagy related structures. Against the backdrop of increasing attention to membrane contact sites (MCSs), bridge-like lipid transfer proteins (BLTPs), and organelle biogenesis, we aim to propose the possibility of a reversible one-way phospholipid transfer activity that really takes place in living cells.

      As Reviewer #1 noted, recent cryo-EM studies (e.g., Oikawa et al.) have highlighted the importance of lipids in autophagosome formation. And there are some existed in vitro studies. However, we believe that we have to think about the consistence of simplified in vitro reconstitution and the complex real cellular environment. In addition, to our knowledge, no studies have directly tracked lipid flow dynamics over time in living cells. We believe our work contributes to this gap by combining three interesting technical approaches: (a) R18 as a lipid-tracing dye, (b) FRAP analysis on the isolation membrane, and (c) the use of Ape1 overexpression to stall autophagosome closure, enabling us to visualize reversible lipid flow in vivo. While these techniques may not appear "fancy," we hope they offer new insights that can inspire further exploration in lipid dynamics story in a real cellular environment.

      We appreciate Reviewer #2's comments on our high imaging quality and Reviewer #3's recognition of our approach as an elegant way to study lipid transfer. We have revised the manuscript accordingly and included additional explanations, figure clarifications, and planned experiments to address remaining concerns.

      As two key concerns were raised repeatedly by all reviewers, we would like to address them here:

      1. Regarding the concern that the evidence for reversible lipid transfer from the IM to the ER is not sufficiently strong:

      We are deeply grateful to Reviewer #2 for the insightful suggestion to compare the fluorescence recovery of the adjacent bleached ER to that of the ER-IM MCS, to exclude the possibility that recovery at the ER-IM MCS originates from nearby ER rather than from the IM. Following this suggestion, we performed a quantitative analysis using unbleached ER as a background. Interestingly, in every sample, the adjacent bleached ER consistently showed a significantly lower fluorescence recovery than the ER-IM MCS. We also used the IM as a background for normalization, the difference became even more pronounced, further supporting the idea that the adjacent ER could not be the source of the recovery signal at the ER-IM MCS. These findings strengthen our conclusion that phospholipid recovery at the MCS could be derived from the IM. The updated analysis and corresponding figure panels (Figure 5K, 5L, and 5M), along with the relevant text (lines 384-396), have been revised accordingly.

      Regarding the concern that the evidence for R18 transfer via Atg2 as a bridge-like lipid transfer protein is not sufficiently direct:

      In addition to the evidence presented in this manuscript, we have now cited our parallel study currently under revision (Sakai et al., bioRxiv 2025.05.24.655882v1), where we provide direct evidence that Atg2 indeed functions as a bridge-like lipid transfer protein, rather than a shuttle. Importantly, we also show in that study that R18 transfer requires the bridge-like structure of Atg2. This new reference has been cited in the revised manuscript, and relevant textual explanations have been added to provide further support.

      We hope that the revisions and our revision plan can address the reviewers key concerns. Please find our detailed point-by-point responses below.

      Response to the Reviewer ____#____1

      In their study, Hao and colleagues exploited the fluorescent fatty acid R18 to follow phospholipid (PL) transfer in vivo from the endoplasmic reticulum to the IM during autophagosome formation. Although the results are interesting, especially the retrograde transport of PLs, based on the provided data, additional control experiments are needed to firmly support the conclusions.

      We sincerely thank the reviewer for the positive assessment and agree that additional controls are necessary to support our conclusion. Detailed responses and corresponding revisions are provided below.

      An additional point is that the authors also study the internalization of R18 into cells and found a role of lipid flippases and oxysterol binding proteins. While this information could be useful for researchers using this dye, these analyses/findings have no specific connection with the topic of the manuscript, i.e. the PL transfer during autophagosome formation. Therefore, they must be removed.

      We thank the reviewer for the thoughtful comment. We understand the concern that the R18 internalization analysis may appear peripheral to the manuscript's main focus on phospholipid transfer during autophagosome formation. However, we respectfully believe that this section is critical for establishing the mechanistic basis as this study represents the first detailed in vivo application of R18 for tracing lipid dynamics. We believe it is interesting that R18 entry is not due to chemically passive diffusion or non-specific adsorption, but occurs through a biologically regulated, non-vesicular lipid transport pathway. This mechanistic context underpins the reliability of using R18 to monitor ER-to-IM lipid transport in the autophagy pathway.

      To improve clarity and coherence, we have added explanatory text in the Introduction and at the start of the Results section to explicitly link the internalization assay to the subsequent autophagy-related experiments (line 94-98, 185-187). We hope this helps guide the reader through the rationale and relevance of this part of the study.

      Major points:

      1) In general, the quality of the microscopy images are quite poor and this make it difficult to assert some of the authors' conclusions.

      We thank the reviewer for the feedback. To better address this concern, we would appreciate clarification regarding which specific images or figure panels were found to be of low quality. Overall, we believe the microscopy data presented are of sufficient resolution and clarity to support our main conclusions, as also noted by Reviewer #2 ("the high-quality images and FRAP experiments").

      We acknowledge that certain phenomena-such as occasional R18 labeling of the vacuole-were not clearly explained in the original manuscript. We have now included additional clarification in the results section and mentioned this limitation in the discussion (lines 170-171, 436-438), along with a note on ongoing experiments to further investigate this point.

      2) It would be important to perform some lipidomics analysis to determine in which PLs and other lipids or lipid intermediates R18 is incorporated. First, it will be important to know which the major PL species are are labelled under the conditions of the experiments done in this study. Second, the authors assume that all the R18 is exclusively incorporated into PLs and this is what they follow in their in vivo experiments. What about acyl-CoA, which has been shown to be a key player in the IM elongation (Graef lab, Cell)?

      We thank the reviewer for raising this point. However, we believe this is based on a misunderstanding of the chemical nature of R18. R18 is not a free fatty acid analog and cannot be incorporated into phospholipids or acyl-CoA via metabolic pathways. Due to its chemical structure-a bulky rhodamine headgroup attached to a long alkyl chain-it cannot undergo enzymatic conjugation or incorporation into membrane lipids. This is why we did not pursue lipidomics analysis. Instead, we focused on characterizing the biological behavior of R18 through a range of live-cell assays, including temperature and ATP dependency, involvement of flippases, OSBP proteins, and Atg2, all of which support a regulated, non-vesicular lipid transport pathway. Additionally, the AF3 structural model presented in this study is consistent with this interpretation, showing no evidence of R18 forming chemical bonds with phospholipids.

      3) Figure 1A and 1B. The authors conclude that Atg2 is involved in the lipid transfer since R18 does not localize to the PAS/ARS in the atg2KO cells. However, another possible explanation is that in those cells the IM is not formed and does not expand, and con sequetly R18 is present in low amounts not detectable by fluorescence microscopy. To support their conclusion, the authors must assess PAS-labelling with R18 in cells lacking another ATG gene in which Atg2 is still recruited to the PAS.

      We thank the reviewer for this important suggestion. As noted, the absence of R18 at the PAS in atg2Δ cells may reflect a lack of membrane formation rather than impaired lipid transfer. However, in support of our interpretation, our previous work (Hirata E, Ohya Y, Suzuki K, 2017) has shown that R18 accumulates at PAS-like structures in delipidation mutants, where the IM fails to expand but Atg2 is still recruited (please refer to the attached revision plan for further details). This suggests that the presence of Atg2, rather than the mere existence of a mature IM, contributes to R18 localization.

      To address this, we revised our statement to the more cautious: "R18 was undetectable at the PAS in atg2Δ cells," to avoid overinterpretation (lines 119-120). 4)

      4) Figure 2. As written, the paragraph this figure seems to indicate that flippases are directly involved in the translocation of R18 from the PM to the ER. As correctly indicated by the authors, flippases flip PLs, not fatty acids. Moreover, there are no PL synthesizing at the PM and thus probably R18 is not flipped upon incorporation into PL. As a result, the relevance of flippase in R18 internalization is probably indirect. This must be explained clearly to avoid confusion/misunderstandings.

      We thank the reviewer for this important clarification. We fully agree that flippases act on phospholipids, not fatty acids, and that R18 is not metabolically incorporated into phospholipids at the plasma membrane. However, our ongoing work (Rev. Figure 1) shows that R18 preferential labeling affinity for PS and PE in vivo (yeast phospholipid synthesis mutants), consistent with its flippase-dependent localization. Flippases are known to specifically flip PS and PE. While R18 itself is not enzymatically modified or incorporated into phospholipids, its membrane distribution may thus depend on the lipid environment and the activity of lipid-translocating proteins.

      Preliminary data supporting this observation are included in the "Supplementary Figures for reviewer reference only" and are not part of the public submission.

      5) A couple of manuscript has shown a (partial) role of Drs2 in autophagy. The authors must explain the discrepancy between their own results and what published, especially because they use the GFP-Atg8 processing assay, which is less sensitive than the Pho8delta60 used in the other studies.

      We thank the reviewer for raising this important point. We are aware of prior reports implicating Drs2 in autophagy and in fact discussed this work directly with the authors during the course of our experiments, who kindly provided helpful suggestions. While our GFP-Atg8 processing assay did not show significant defects upon Drs2 deletion, strain background differences may explain this discrepancy. We also appreciate the suggestion to use the Pho8Δ60 assay and plan to include it in future experiments.

      Additionally, authors should check whether the Atg2 and Atg18 proteins are present at the IM-ER membrane contact sites in the same rates after nutrient replenished than when cells are nitrogen-starved, since this complex would determine the lipid transfer dynamics at this membrane contact site.

      We thank the reviewer for the helpful suggestion. We plan to perform additional experiments to monitor Atg18 localization during the nutrient replenishment assay.

      6) Authors used a predicted Atg2 lipid-transfer mutant (Srinivasan et al, J Cel Biol, 2024), but not direct prove that this mutant is defective for this activity. As previously done for other Atg2/ATG2-related manuscripts (Osawa et al, Nat Struct Mol Biol, 2019; Valverde et al, J Cel Biol, 2019), this must be measure in vitro. Moreover, they do not show whether other known functions of Atg2 are unaffected when expressing this Atg2 mutant, e.g. formation of the IM-ER MCSs, Atg2 interaction with Atg9 and localization at the extremity of the IM...

      We thank the reviewer for this concern. The lipid-transfer-deficient Atg2 mutant used here is based on the same structural rationale as in our recent parallel study (Sakai et al., bioRxiv 2025; https://www.biorxiv.org/content/10.1101/2025.05.24.655882v1, currently under revision). In that study, we addressed whether Atg2 indeed functions as a bridge-like lipid transfer protein, and also used R18 to directly demonstrate the lipid transfer defect of this Atg2 mutant in vivo.

      We therefore believe that referencing this study provides mechanistic support for the use of this Atg2 mutant in the current manuscript. A citation and brief explanation have now been added to the revised text (line 315-316, 439-441). We also plan to perform the lipid transfer assay in vitro.

      7) The mNG-Atg8 signal is not recovered in the fluorescent recovery assays. Based on the observation that R18 signal comes back after photobleaching, authors suggest that the supply of Atg8 is not required for IM expansion. This idea is opposite to data where the levels of Atg8 and deconjugation of lipidated Atg8 determines the size of the forming autophagosomes (e.g., Xie et al, Mol Biol Cell, 2008; Nair et al, Autophagy, 2012). Similar results have also been obtained in mammalian cells (Lazarou and Mizushima results in cell lacking components of the two ubiquitin-like conjugation systems). This discrepancy requires an explanation.

      We thank the reviewer for pointing out this imprecise interpretation, and we sincerely apologize for the confusion it may have caused. We fully agree that Atg8 is essential for the expansion of the isolation membrane (IM), as supported by previous studies. In our FRAP data, mNG-Atg8 showed gradual recovery at the later timepoints, indicating that Atg8 can be replenished over time. The reason why R18 recovery appears much more rapid is likely due to the inherently fast lipid transfer activity of Atg2, the bridge-like lipid transport protein. In contrast, Atg8 signal recovery may have been delayed for two reasons: (1) slower recruitment kinetics to the IM, and (2) partial depletion of the available mNG-Atg8 protein pool due to photobleaching during the experiment.

      We have revised the relevant paragraph in the manuscript (line 326-330) to clarify these points and avoid potential misinterpretation.

      8) Although authors claim that there is a retrograde lipid transfer from the IM to the ER, based on the data, it quite difficult to extract these conclusions as they show a decrease in the lipid flow dynamics rather to an inversion of the lipid flow per se. Can the authors exclude that ER microdomains are formed at the ERES in contact with the IM, and consequently what they measure is a slow diffusion of R18-labeled lipid from other part of the ER to these ERES?

      We appreciate the reviewer's insightful comment. Indeed, we are also considering the possibility that lipid-enriched microdomains may form in the ER and contribute to complex lipid dynamics at contact sites. However, direct visualization of such domains in cells remains technically challenging, this remains one of the important directions we aim to pursue in future studies. While our current data do not allow us to definitively state that all recovered lipids originate from the IM, our FRAP experiments provide indirect yet strong support for the possibility that at least a substantial portion of the recovered lipid signal in the ER derives from the IM. Moreover, following Reviewer 2's major point No.4, we performed a direct comparison of R18 fluorescence recovery between the photobleached ER-IM MCS region and the adjacent bleachedER region (Figure 5K and 5M). Interestingly, each sample consistently showed lower fluorescence recovery in the adjacent bleached ER near the ER-IM MCS (mean = 0.20), compared to the ER-IM MCS region (mean = 0.28). To further validate this observation, we also used the IM as a background reference for normalization. This analysis revealed a more significant difference, with the adjacent bleached ER near the ER-IM MCS showing a lower recovery (mean = 0.47) than the ER-IM MCS (mean = 0.80).

      As the Reviewer2 pointed out, these results support our reversible lipid transfer model by demonstrating that fluorescence recovery at the ER-IM MCS is due to the signal coming from the IM, rather than from the adjacent bleached ER, which recovers more slowly and less efficiently. We have incorporated this new analysis into Figure 5, and accordingly revised the figure legend and main text (lines 384-396).

      9) The retrograde PL transfer is studied in cells overexpressing Ape1, in which IM elongation is stalled. This is a non-physiological experimental setup and consequently it is unclear whether what observed applies to normal IM/autophagosomes. This event should be shown to occur in WT cells as well.

      We thank the reviewer for this point. Indeed, it remains technically difficult to visualize lipid flow during normal IM expansion in vivo, as this process is rapid and transient. And to date, there are no reports directly addressing lipid flow in this process.

      But the Ape1 overexpression system provides a strategic advantage by temporally extending the IM elongation phase and spatially enlarging the IM, thus offering a unique opportunity to capture membrane behavior that would otherwise be transient and difficult to resolve. Importantly, this system arrests autophagosome closure, which we leveraged to investigate the potential reversibility of phospholipid transfer in a controlled and prolonged context. Without this system, it would be exceedingly difficult for reaserchers to examine the lipid flow directionality in living cells.

      Furthermore, the use of Ape1 overexpression has been widely employed in previous high-impact autophagy studies. We emphasize that our aim is to understand Atg2-mediated lipid transfer, and in this context, the Ape1 system provides a valuable and informative tool without compromising the validity of our conclusions.

      10) From the images provided, it appears that R18 also labels the vacuole. The vacuole form MCSs with the IM. Can the author exclude a passage of R18 from the vacuole to the IM?

      We thank the reviewer for the insightful comment. Our data suggest that R18 traffics from the plasma membrane to the ER, then to autophagy-related structures. Actually, following that, as we kown, autophagosomes will eventually reaches and fused with the vacuole. This explains the occasional weak R18 signals at the vacuole membrane, particularly in late-stage cells. We have revised the figure and clarified this point in the text to avoid oversimplification of R18 localization (lines 169-171, 426-428)

      Here we also added the results of our onging work (in preparation). R18 tends to accumulate in a dot-like compartment after prolonged rapamycin treatment and incubation (Rev. Figure 2). And the vacuolar labeling of R18 correlates with the degradation status of autophagosomes, rather than reverse lipid transport from the vacuole to the IM (Rev. Figure 2). Taken together, we believe that R18 transport from the vacuole back to the IM is unlikely.

      Preliminary data supporting this response are included in the "Supplementary Figures for reviewer reference only" and are not part of the public submission.

      Minor points:

      1) L66. One report has indicated that Vps13 may also play a role in the transfer of lipids from the ER to the IM (Graef lab, J. Cell Biol).

      Thank you for pointing this out. Their excellent work also suggested that the inherent lipid transfer activity of Atg2 is required for IM expansion. We have revised the sentence (lines 67-68, 312-314) and included the appropriate citation at these two places.

      2) L70. It must be indicated that IM is also called phagophore.

      We have revised the sentence (line 70-71). Thank you for pointing this out.

      3) L74. It is mentioned "Additionally, a hydrophobic cavity in the N-terminal region of Atg2 directly tethers Atg2 to the ER, particularly the ER exit site (ERES), which is considered a key hub for autophagosome biogenesis", but there is no experimental evidence supporting that Atg2 is involved in the tethering with the ERES.

      Thank you for pointing this out. We have removed the N-terminal region part and revised the sentence accordingly (line 79-81) to avoid overstatement.

      4) L90. PAS must be listed between the ARS.

      We have revised the sentence (line 97-98). Thank you for pointing this out.

      5) Upon deletion of ATG39 and ATG40, there is a pronounced reduction of mNG-Atg8 labelled with R18. This would suggest that these two ER-phagy receptors are required for the PL transfer from the ER to the IM, which is not the case as autophagy is mildly affected by the absence of them (e.g., Zhang et al, Autophagy, 2020).

      We thank the reviewer for the important comment and agree that Atg39 and Atg40 are not required for phospholipid transfer from the ER to the IM. We have revised the text (lines 155-157). We appreciate if the reviewer could provide the DOI or PubMed ID for this paper.

      6) Authors referred that "no direct evidence has been found to confirm lipid transfer at the ER-IM MCS in living cells" (lines 282-283). However, a recent paper has shown that de novo-synthesized phosphatidylcholine is incorporated from the ER to the autophagosomes and autophagic bodies (Orii et al, J Cel Biol, 2021). This reference should be mentioned in the manuscript.

      Thank you for your insightful reminder. This paper beautifully demonstrated the importance of de novo-synthesized phosphatidylcholine in autophagy using electron microscopy. We have now included its citation and brief discussion in the revised manuscript (lines 74-76, 297-298). However, we respectfully note that direct observation of lipid transfer at the ER-IM MCS in living cells still remains unproven.

      7) In lines 252-253, the sentence "R18 transport from the PM to the ER was partially impaired in osh1Δ osh2Δ, osh6Δ osh7Δ, and oshΔ osh4-1 cells (Figure S3). These results suggest that Osh proteins participate in transferring R18 from the PM to the ER" does not recapitulate what is observed in Fig. S3. Moreover, the Emr lab has generate a tertadeletion mutant in which the PM-ER MCSs are abolished. The authors could examine this mutant.

      We thank the reviewer for this helpful comment and sincerely apologize for the lack of clarity in our original description. Our conclusion was primarily based on the partial PM accumulation of R18 observed in some osh mutant strains shown in Figure S3, which motivated us to further investigate this pathway using the OSW-1 inhibitor. We have revised the corresponding text to improve the logic and clarity of this section.

      We appreciate the recommendation of the tether∆ mutant. Our preliminary tests indicate that R18 still properly labels the ER in tether∆ cells, suggesting that its localization is not due to passive diffusion at membrane contact sites, but rather involves specific transport mechanisms. As this is an initial observation, we plan to confirm the result and include it in a future revision.

      Reviewer #1 (Significance (Required)):

      General assistent: Strength: potential new system to monitor lipid flow Limitations: Indirect evidences and in the case of the retrograde transport of phospholipids, it could be an artefact of the employed experimental approach. Advance: Little advances because something in part already shown in vitro. No new mechanisms uncovered. Audience: Autophagy and membrane contact site fields.

      We sincerely thank the reviewer for the overall evaluation. We agree that our current system offers indirect but promising evidence for lipid transfer events at ER-IM contact sites in vivo. While Atg2-mediated lipid transport has been proposed in vitro, our study adds value by (1) establishing a live-cell imaging way to monitor lipid flow in a non-vesicular transport pathway, (2) proposing a model of reversible one-way lipid transfer activity, and (3) addressing whether findings from simplified in vitro reconstitution accurately reflect the dynamics in the more complex real cellular environment.

      We recognize the limitations of our current approach and plan to include additional analyses to more cautiously interpret the observed retrograde movement. Although we do not claim to identify a new mechanism, we believe our work provides an interesting framework to inspire future efforts aimed at directly probing lipid flow at membrane contact sites in vivo.

      We also sincerely appreciate the reviewer's recognition of the potential value of this system for the autophagy and membrane contact site communities.

      Response to the Reviewer ____#2

      Non-vesicular lipid transfer plays an essential role in organelle biogenesis. Compared to vesicular lipid transfer, it is faster and more efficient to maintain proper lipid levels in organelles. In this study, Hao et al. introduced a high lipophilic dye octadecyl rhodamine B (R18), which specifically labels the ER structures and autophagy-related structures in yeast and mammalian cells. They characterised its distinct lipid entry into yeast cells via lipid flippase Neo1 and Drs2 on the plasma membrane, rather than through the endocytic pathway. They then demonstrated that R18 intracellular trafficking through plasma membrane to ER depends on "box-like" lipid transfer Osh proteins. They further looked into the "bridge-like" lipid transfer protein Atg2, using R18 as a lipid probe to track lipid transfer from ER to the isolation membrane (IM) during membrane expansion and reversible lipid transfer through IM to the ER-IM membrane contact sites (MCS) when autophagy is terminated by nutrient replenishment. The authors provide an interesting model of reversible directionality of Atg2 lipid transfer during autophagy induction and termination.

      We sincerely thank the reviewer for the thoughtful and constructive summary of our work. We are grateful for the recognition of the novelty of using R18 to visualize non-vesicular lipid transfer in vivo and for highlighting the conceptual contribution of our proposed model of reversible Atg2-mediated transport during autophagy.

      In response to the reviewer's valuable suggestions, we have revised key parts of the manuscript and prepared a detailed revision plan to address the specific concerns. We truly appreciate the reviewer's insights, which have been instrumental in improving the clarity of our study.

      Major points:

      1. Line 299-309: The FRAP assays were interesting and well performed. The authors photobleached R18 and Atg8 signal, and found R18 fluorescence recovery but not Atg8, which suggests lipid transfer occurs between ER and the IM and faster than Atg8 lipidation process during IM expansion. These results gave clear evidence that R18 can be transferred during IM expansion. The supply of Atg8 may not be not able to track within this time frame or the recovered amount of Atg8 may not be able to visualized due to the threshold limitation with confocal microcopy. This does not imply the supply of Atg8 to the IM is not required during IM expansion. This should be clarified.

      We thank the reviewer for this valuable comment and fully agree that Atg8 is essential for IM expansion. We apologize for any ambiguity that may have suggested otherwise.

      As pointed out, the lack of mNG-Atg8 recovery in our FRAP assay likely reflects the slower turnover of lipidated Atg8, limited observation time, and photobleaching of the existing protein pool. Notably, we observed a weak but gradual signal recovery at later time points, supporting this view. We have revised the relevant paragraph in the manuscript (line 326-330) to clarify these points and avoid potential misinterpretation.

      Please clarify how the length of the IM is measured and determined in Figure 4H and Figure 5D.

      We thank the reviewer for the vaulable comment. We have now clarified the method for quantifying IM length in the revised manuscript. Specifically, we modified the Statistical Analysis section of the Methods (line 642-643).

      Line 336-342: The description of the results should be clarified. Based on Figure 5H, the authors observed a significant decrease in the mNG-Atg8 signal during photobleaching of the R18 signal.

      We thank the reviewer for pointing out the ambiguity. We have now clarified the description in the revised manuscript. The sentence has been modified (line 360-362) as follows: "To determine whether nutrient replenishment terminates autophagy, we selectively photobleached the R18 signal and monitored the R18 (photobleached) and mNG-Atg8 (without photobleaching) signal following nutrient replenishment."

      The authors photobleached ER-IM MCS and the ER region (boxed region in Figure 5J) and quantified fluorescence recovery, normalized to the IM region and an ER control. The ER control was taken from the other cell. It would be helpful to compare and analyse the fluorescence recovery of R18 in the bleached ER region near the ER-IM MCS to that in the ER-IM MCS. This would help to confirm the ER-IM MCS fluorescence recovery is due to signal coming from the IM.

      We sincerely thank the reviewer for this insightful suggestion. We have now performed the suggested comparison. Interestingly, each sample consistently showed lower fluorescence recovery in the adjacent bleached ER near the ER-IM MCS (mean = 0.20), compared to the ER-IM MCS region (mean = 0.28). To further validate this observation, we also used the IM as a background reference for normalization. This analysis revealed a more significant difference, with the adjacent bleached ER near the ER-IM MCS showing a lower recovery (mean = 0.47) than the ER-IM MCS (mean = 0.80).

      As the reviewer pointed out, these results support our reversible lipid transfer model by demonstrating that fluorescence recovery at the ER-IM MCS is due to the signal coming from the IM, rather than from the adjacent bleached ER, which recovers more slowly and less efficiently. We have incorporated this new analysis into Figure 5, and accordingly revised the figure legend and main text (lines 384-396). Again, we appreciate this constructive and helpful suggestion.

      In figure 5K, the autophagic structure or IM labelled by R18 seems to be maintained when the mNG-Atg8 signal decreases or dissociates from the IM. Could the authors comment on that how they interpret the termination of the prolonged IM structure and IM shrinkage?

      We thank the reviewer for this insightful observation. Based on our live-cell imaging, we speculate that following the initial dissociation of Atg8, the IM membrane undergoes a relatively slow disassembly process, potentially retracting toward the ER-IM MCS, which often localizes near ER exit sites (ERES). This suggests that IM shrinkage may proceed via Atg8-independent mechanisms. Although the precise pathway remains unclear, we occasionally observed vesiculation events during this phase, supporting the idea that membrane remodeling continues even in the absence of Atg8. In response to this comment, we have revised our manuscript to reflect these interpretations (line 494-496).

      The author has shown that Atg2Δ and Atg2LT lipid transfer mutant impair R18 labelling of autophagic structures in Figure 4C. However, the evidence supporting that R18 fluorescence recovery at ER-IM MCS is mediated by reversible Atg2 lipid transfer is not direct. It would be helpful to clarify whether Atg2 stays on the enlarged autophagic membranes when the membrane has reached to its maximum length and no longer grows.

      We thank the reviewer for this important suggestion. As noted in our response to Reviewer 1 (Major Point 8-2), clarifying whether Atg2/Atg18 remains at the ER-IM contact sites after IM expansion is indeed important for supporting the reversible lipid transfer model. We plan to monitor the localization of Atg18 during the nutrient replenishment assay.

      Minor points:

      1. Figure 2A "Dpm-GFP" is missing. The experiment replicates in Figure 2M should be indicated.

      We thank the reviewer for pointing out these issues. The label for "Dpm-GFP" has been added in Figure 2A, and the number of experimental replicates for Figure 2M is now indicated in the figure legend.

      Figure S2, the magenta panel should be "R18".

      We thank the reviewer for catching this labeling error. We have corrected the magenta panel label in Figure S2 to "R18" in the revised version of the figure.

      Line 341-342: "Figure 5H and 5J" should be "Figure 5H and 5I"

      We thank the reviewer for pointing out this error. The citation has been corrected from "Figure 5H and 5J" to "Figure 5H and 5I" in the revised manuscript.

      Please describe how the lipid docking model of Atg2 is generated.

      We thank the reviewer for this question. We have added a description of the modeling approach in the Methods section of the revised manuscript (lines 640-646). We also added the configuration files of AlphaFold3 to the supplementary information.

      Reviewer #2 (Significance (Required)):

      Currently, lipid probes are emerging as powerful tools to understand membrane dynamics, integrity, and the lipid-mediated cellular functions. In this manuscript, the authors performed a detailed characterisation of octadecyl rhodamine B (R18) as a potential lipid probe, which specifically labels ER and autophagic membranes. They present high quality imaging data and performed FRAP experiments to monitor the membrane dynamics and investigate the lipid transfer directionality between the ER and autophagic structure. However, the evidence of Atg2-mediated reversible lipid transfer may not be direct and sufficient. The proposed reversible lipid transfer model is interesting and provides an explanation of lipid level regulation during autophagosome formation.

      We sincerely thank the reviewer for the positive assessment of our work and for acknowledging the potential of R18 as a lipid probe, as well as the quality of our imaging and FRAP experiments. We are particularly grateful that the reviewer found the proposed model of reversible lipid transfer both interesting and relevant to the broader question of lipid regulation during autophagosome formation.

      Regarding the reviewer's concern that the evidence for Atg2-mediated reversible lipid transfer may not be sufficiently direct, we agree this is a critical point. While technical limitations currently prevent direct visualization of lipid flow reversal at single-molecule resolution in vivo, we hope our revision plan strengthen the proposed model and better convey its biological relevance, while also acknowledging the current limitations and the need for further mechanistic work.

      Response to the ____Reviewer #3

      The authors address the question of how autophagic membrane seeds expand into autophagosomes. After nucleation, IMs expand in dependence of the bridge-like lipid transfer protein Atg2, which has been shown to tether the IM to the ER. Several studies have shown in vitro evidence for direct lipid transfer by Atg2 between tethered membranes, and previous evidence has shown that the hydrophobic groove of Atg2 implicated in lipid transfer is required for autophagosome biogenesis in vivo in yeast and mammalian cells.

      In this manuscript, the authors take advantage of the dye R18, which they show accumulates mainly in the ER after a few minutes. They show specifically that the import of R18 into cells and transfer to the ER depends on the activity of flippases in the plasma membrane and OSPB-related lipid transporter. Using different sets of FRAT experiments, the authors track the fluorescence recovery of R18 in the IM, the IM-ER membrane contact site and the neighboring ER. From these experiments the authors conclude that (a) R18 is transferred to IM from the ER when IMs expand and (b) can be transferred from IMs back to the ER when autophagy is deactivated.

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

      We sincerely thank the reviewer for this thoughtful and encouraging summary. We appreciate the recognition of our approach using R18 to visualize lipid dynamics at ER-IM contact sites, and agree that in vivo quantitative data are critically needed to advance our understanding of autophagic membrane expansion.

      We also fully agree with the reviewer that our current study provides indirect-but conceptually informative-support for Atg2-mediated reversible one way lipid transfer. While prior in vitro studies have demonstrated the lipid transfer capability of Atg2, our goal here was to develop a live-cell system that allows the dynamic tracking of lipid flow in vivo, and to explore the possibility of reversible transport during autophagy termination. We hope our story will offer unique insights for future studies aiming to directly probe lipid transfer mechanisms in live cells.

      Regarding the reviewer's concern about the lack of direct evidence that Atg2's lipid transfer activity underlies the observed R18 dynamics, we fully acknowledge this limitation. To address this point, we would like to cite our parallel study currently under revision (Sakai et al., bioRxiv 2025.05.24.655882v1), which provides additional mechanistic evidence linking R18 dynamics to the lipid transfer function of Atg2. Further details and planned revisions are described in the responses below.

      Major points:

      (1) The authors use R18in FRAP experiments to follow its transfer from the ER into IMs. However, whether this transfer is mediated by Atg2 via its inherent lipid transfer activity remains indirect. The only evidence that implicates Atg2 directly is the observation that a lipid transfer deficient Atg2 variant fails to support IM expansion and autophagosome biogenesis. A similar full-length Atg2 mutant has previously been shown to block autophagosome formation in Dabrowski et al. 2023 in yeast, which the authors do not cite or discuss, suggesting the inherent lipid transfer activity of Atg2 is required for IM expansion. However, aside from this experiment, the mechanisms underlying R18 transfer remain unclear and, while they likely depend on or are at least partially mediated by Atg2, they may involve alternative mechanisms including vesicle transport or continuous membrane contacts. Moreover, for the assays with stalled or dissolving IM, it is essential for the authors to test whether Atg2 is still associated with these IMs. It is quite possible that Atg2 dissociates from maximally expanded or dissolving IMs, which would make their interpretation of the data very unlikely. Thus, it will be critical to provide consistent evidence that lipid transfer from the IM to the ER is mediated by Atg2. Ideally, the authors would label IM with BFP-Atg8, R18, and Atg2-GFP and perform their in vivo analysis.

      We sincerely thank the reviewer for the critical comments and valuable suggestions. To further support the link between R18 transfer and Atg2, we would like to highlight two complementary findings. As noted in our response to Reviewer 1 (Major Point 3), R18 can still label the PAS even when Atg2 is recruited but IM expansion is impaired, suggesting that R18 trafficking occurs in an Atg2-dependent manner. In addition, in our parallel study (bioRxiv, 2025.05.24.655882v1), we demonstrated that Atg2 acts as a bridge-like lipid transfer protein. Notably, when we mutated the bridge-forming region of Atg2, R18 transport to the IM was also disrupted.

      We greatly appreciate the reviewer's reminder regarding the study by Dabrowski et al., 2023, which we have now cited and discussed in the revised manuscript (lines 66-68, 312-314). Their findings that the inherent lipid transfer activity of Atg2 is required for autophagosome formation in vivo strongly reinforce our model.

      Regarding the possibility of vesicle transport, we consider this contribution minimal based on R18's preferential labeling of continuous membranes and its divergence from FM4-64 staining. As for the role of continuous membrane contacts, as also mentioned in our response to Reviewer 1, our preliminary tests indicate that R18 still properly labels the ER in tether∆ cells, suggesting that its localization is not due to passive diffusion at membrane contact sites, but rather involves specific transport mechanisms. As this is an initial observation, we plan to confirm the result and include it in a future revision.

      We also thank the reviewer for the suggestion to monitor Atg2 localization at the dissolving IM. As similarly pointed out by two other reviewers, we plan to track Atg18 during the nutrient replenishment assay.

      Finally, we appreciate the idea of triple-labeling with BFP-Atg8, R18, and Atg2-GFP. While our preliminary attempts encountered technical difficulties such as abnormal BFP-Atg8 localization and severe bleaching during long-term imaging in yeast, we plan to optimize this approach in future experiments.

      (2) Given the ER forms contact sites with many organelles using bridge-like lipid transfer proteins, how do the authors explain the preferential accumulation of R18 in ARS and not in for example PM (Fmp27), mitochondria, endosomes or vacuole (Vps13)? Why should R18 specifically transferred by Atg2 and not or to a much lower rate by Fmp27 or Vps13?

      We sincerely thank the reviewer for raising this insightful question. Indeed, we have carefully considered this point. Our data indicate that R18 labeling of autophagy-related structures (ARS) depends on Atg2, as demonstrated in the present manuscript and supported by our parallel study currently under revision (bioRxiv, 2025.05.24.655882v1).

      We speculate that the preferential accumulation of R18 in ARS may arise from structural and contextual differences among bridge-like LTPs, such as Atg2, Vps13, and Fmp27. Although all are capable of mediating lipid transfer, these proteins differ in their membrane tethering modes, cargo specificity, and spatial regulation. For example, Atg2 localizes specifically to ER-IM contact sites during autophagosome formation, where membrane expansion requires rapid lipid supply. In contrast, Vps13 and Fmp27 may function at more stable or less dynamic contacts, where lipid turnover or probe accessibility is more limited. We have added a brief discussion of this point in the revised manuscript to reflect this important consideration (lines 439-444).

      (3) Does R18 label autophagic bodies after they are formed. Could the authors add R18 after autophagic bodies have formed in atg15 or pep4 cells?

      We thank the reviewer for this excellent suggestion. To address whether R18 can label autophagic bodies post-formation, we plan to perform additional experiments by adding R18 after autophagic bodies have accumulated in atg15Δ or pep4Δ cells. This will help clarify whether R18 incorporates into pre-formed autophagic bodies or requires earlier membrane dynamics for its labeling.

      (4) Since Neo1- or OSBP-defective cells do not transfer R18 from the PM to the ER or other membranes, the authors should include these strains as controls for ER-dependent R18 transfer to ARSs.

      We thank the reviewer for this insightful suggestion. To further validate the ER-dependency of R18 transfer to autophagy-related structures, we plan to include Neo1- and OSBP-deficient strains as additional controls.

      Comments:

      The authors neglect to mention or discuss important recent literature directly related to their study:

      Schutter et al., Cell (2020); Orii et al., JCB (2021); Polyansky et al., EMBOJ (2022); Dabrowski et al., JCB (2023); Shatz et al., Dev Cell (2024)

      We sincerely thank the reviewer for pointing out these important and highly relevant studies. We apologize for our oversight in not citing them earlier. Each of these works has provided valuable insights that are directly related to and have greatly informed our current study. We have now cited and discussed these references in appropriate sections of the revised manuscript.

      Figure 1A and B: The authors need to describe how these cells were stained with R18 in the figure legend or text to help the reader to understand how these experiments were performed. Figure legends need to indicate at which time point after rapamycin treatment cells were analyzed.

      Thank you for the helpful suggestion. We have now added the corresponding information to the figure legends to clarify the staining procedure and time points.

      The authors need to clarify whether mNG-Atg8 colocalization with R18 was included for dot- and ring-like structures for WT cells as shown separately in 1A but not in 1B.

      Thank you for the comment. The quantification in Figure 1B includes both dot- and ring-like structures of mNG-Atg8 colocalized with R18 in WT cells, as shown in Figure 1A. We have now clarified this point in the revised figure legend.

      Figure 1C: The figure legend needs to describe the conditions cells were treated with and when cells were analyzed after rapamycin treatment (presumably).

      Thank you for the helpful suggestion. We have now added the corresponding information to the figure legends.

      Figure 1C: The authors should combine atg15 and pep4 deletions with atg2 or atg7 as controls in which autophagic bodies are not formed.

      Thank you for the valuable suggestion. We plan to perform these experiments that combine atg15 and pep4 deletions with atg2 or atg7 as controls.

      Figure 1E and F: R18 stains more than just the ER in the cells shown. In addition to atg39 and atg40, authors should include atg11 to inhibit all forms of selective autophagy.

      Thank you very much for the insightful comment. We agree and plan to include the atg11Δ mutant to inhibit all forms of selective autophagy.

      Figure S2A and B: The figures are mislabeled. Instead of FM4-64 it should say R18. In addition to the ER, in several images it is obvious to see R18 staining the vacuole membrane (for example Figure 2A 30 degrees) and others. Thus, the strong thresholding in S2 may give the reader an oversimplified view on R18 localization. This needs to be corrected.

      Thank you very much for pointing this out. We have corrected the labeling error in Figure S2A and B. Regarding the observation that R18 occasionally labels the vacuole membrane, we agree with the reviewer's comment. Based on our data, we believe that this signal likely reflects autophagosomes that have reached and fused with the vacuole, as expected in the later stages of autophagy. We have clarified this point in the text to avoid oversimplification of R18 localization (lines 169-171, 426-428).

      Figure 1G and H: In 1G, there are number of R18-stained patches not co-labeled by GFP-ER. What are these patches and which organelles to they represent? In 1H, given the tight association of the ER (omegasome) with forming IMs, it is difficult to discern whether R18 labels surrounding ER membrane or the IM itself. This needs to be more closely analyzed. The authors need to quantify these data similar to the yeast data.

      Thank you for the suggestion. We plan to perform additional quantification and colocalization analysis to clarify the identity of R18-positive signals in 1G and 1H.

      Figure 4A-C: A full-length PLT-deficient variant of Atg2 has been analyzed by Dabrowski et al, JCB 2023 in vivo. This work needs to be cited and discussed. The analysis needs to include punctate Atg8 structures for WT cells to exclude effects due to expansion defects.

      Thank you for the suggestion. We have now cited and discussed the work by Dabrowski et al., JCB 2023 in the revised manuscript (lines 67-68, 312-314). In addition, we have included an analysis of punctate Atg8 structures in WT cells to address the concern regarding potential expansion defects.

      Figure 4F-H: To measure the size changes in IMs, the authors would need to perform these experiments without bleaching the mNG-Atg8 signals.

      We apologize for the lack of clarity. The method for measuring IM size has now been added to the revised manuscript. In Figure 4, we note that mNG-Atg8 fluorescence actually shows a slow recovery over time. This limited recovery likely reflects both the slower turnover of Atg8 and the fact that the pre-existing Atg8 pool at the IM was partially photobleached. We have now revised the main text to clarify this point and included additional explanation (line 326-330).

      Figure 5C: The authors need to indicate the bleached areas in the mNG-Atg8 image for easier orientation. It looks to me that the area that the authors mark as IM-ER MCS is really the IM in proximity to the ER. Thus, if lipid transfer to the IM has ceased, I would not expect recovery here. If the IM-ER MCS area includes IM and the ER to similar extent, I would expect exactly what the authors show: IM does not recover while ER quickly recovers. On average, we would observe reduced recovery as shown in 5D.

      Thank you for the helpful suggestion, and we apologize for the oversight during figure preparation. We have now clearly indicated the bleached areas in the merged image in Figure 5C for better orientation. Additionally, we have carefully re-examined the defined ER-IM MCS region and confirm that the quantified area indeed corresponds to the contact site between the ER and the IM. And double checked the measurements shown in the figure remain correct.

      Figure 5L: Since mNG-Atg8 signal homogenously disappears from the IM, it is meaningless to measure size. How do the authors measure the size of something they cannot detect?

      Thank you for pointing this out. We agree with the reviewer's comment and have removed the panel from the revised version accordingly.

      Figure 5K: The authors need to show the whole bleached area overtime for the reader to be able to see where the recovered R18 signal might be coming from. Currently, it is impossible to discern whether the signal comes from the IM or from slow recovery from neighboring ER.

      We appreciate this insightful comment. To address the concern and following the suggestion from Reviewer 2 (Major Point No.4), we have now revised the figure to include an additional measurement of fluorescence recovery in the adjacent bleached ER (Figure 5K and 5M) (lines 384-396). These results further support our reversible lipid transfer model by demonstrating that fluorescence recovery at the ER-IM MCS originates from the IM, rather than from the adjacent bleached ER, which shows slower and less efficient recovery.

      We have also added time-lapse videos to the supplementary information due to space limitations in the main figure.

      Reviewer #3 (Significance (Required)):

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

      We sincerely thank the reviewer for this encouraging and thoughtful comment. We appreciate the recognition that our live-cell approach using a lipophilic dye provides a valuable framework to visualize lipid dynamics during autophagosome biogenesis. As the reviewer pointed out, quantitative in vivo evidence is critically needed in this field, and we hope our study contributes meaningfully toward that goal.

      We also fully acknowledge the limitation. While our current data offer indirect evidence for Atg2-mediated lipid transfer, we would like to support this by our revision plan and also our parallel study (bioRxiv, 2025.05.24.655882v1) that shows Atg2 is indeed a bridge-like LTP and R18 transfer is lost in the bridge-structure defective strain. Together, we hope these can suggest that the lipid transfer activity of Atg2 underlies the observed R18 dynamics in vivo.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors address the question of how autophagic membrane seeds expand into autophagosomes. After nucleation, IMs expand in dependence of the bridge-like lipid transfer protein Atg2, which has been shown to tether the IM to the ER. Several studies have shown in vitro evidence for direct lipid transfer by Atg2 between tethered membranes, and previous evidence has shown that the hydrophobic groove of Atg2 implicated in lipid transfer is required for autophagosome biogenesis in vivo in yeast and mammalian cells.

      In this manuscript, the authors take advantage of the dye R18, which they show accumulates mainly in the ER after a few minutes. They show specifically that the import of R18 into cells and transfer to the ER depends on the activity of flippases in the plasma membrane and OSPB-related lipid transporter. Using different sets of FRAT experiments, the authors track the fluorescence recovery of R18 in the IM, the IM-ER membrane contact site and the neighboring ER. From these experiments the authors conclude that (a) R18 is transferred to IM from the ER when IMs expand and (b) can be transferred from IMs back to the ER when autophagy is deactivated.

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

      Major points:

      1. The authors use R18in FRAP experiments to follow its transfer from the ER into IMs. However, whether this transfer is mediated by Atg2 via its inherent lipid transfer activity remains indirect. The only evidence that implicates Atg2 directly is the observation that a lipid transfer deficient Atg2 variant fails to support IM expansion and autophagosome biogenesis. A similar full-length Atg2 mutant has previously been shown to block autophagosome formation in Dabrowski et al. 2023 in yeast, which the authors do not cite or discuss, suggesting the inherent lipid transfer activity of Atg2 is required for IM expansion. However, aside from this experiment, the mechanisms underlying R18 transfer remain unclear and, while they likely depend on or are at least partially mediated by Atg2, they may involve alternative mechanisms including vesicle transport or continuous membrane contacts. Moreover, for the assays with stalled or dissolving IM, it is essential for the authors to test whether Atg2 is still associated with these IMs. It is quite possible that Atg2 dissociates from maximally expanded or dissolving IMs, which would make their interpretation of the data very unlikely. Thus, it will be critical to provide consistent evidence that lipid transfer from the IM to the ER is mediated by Atg2. Ideally, the authors would label IM with BFP-Atg8, R18, and Atg2-GFP and perform their in vivo analysis.
      2. Given the ER forms contact sites with many organelles using bridge-like lipid transfer proteins, how do the authors explain the preferential accumulation of R18 in ARS and not in for example PM (Fmp27), mitochondria, endosomes or vacuole (Vps13)? Why should R18 specifically transferred by Atg2 and not or to a much lower rate by Fmp27 or Vps13?
      3. Does R18 label autophagic bodies after they are formed. Could the authors add R18 after autophagic bodies have formed in atg15 or pep4 cells?
      4. Since Neo1- or OSBP-defective cells do not transfer R18 from the PM to the ER or other membranes, the authors should include these strains as controls for ER-dependent R18 transfer to ARSs.

      Comments:

      The authors neglect to mention or discuss important recent literature directly related to their study:

      Schutter et al., Cell (2020); Orii et al., JCB (2021); Polyansky et al., EMBOJ (2022); Dabrowski et al., JCB (2023); Shatz et al., Dev Cell (2024)

      Figure 1A and B: The authors need to describe how these cells were stained with R18 in the figure legend or text to help the reader to understand how these experiments were performed. Figure legends need to indicate at which time point after rapamycin treatment cells were analyzed.

      The authors need to clarify whether mNG-Atg8 colocalization with R18 was included for dot- and ring-like structures for WT cells as shown separately in 1A but not in 1B.

      Figure 1C: The figure legend needs to describe the conditions cells were treated with and when cells were analyzed after rapamycin treatment (presumably).

      The authors should combine atg15 and pep4 deletions with atg2 or atg7 as controls in which autophagic bodies are not formed.

      Figure 1E and F: R18 stains more than just the ER in the cells shown. In addition to atg39 and atg40, authors should include atg11 to inhibit all forms of selective autophagy.

      Figure S2A and B: The figures are mislabeled. Instead of FM4-64 it should say R18. In addition to the ER, in several images it is obvious to see R18 staining the vacuole membrane (for example Figure 2A 30 degrees) and others. Thus, the strong thresholding in S2 may give the reader an oversimplified view on R18 localization. This needs to be corrected.

      Figure 1G and H: In 1G, there are number of R18-stained patches not co-labeled by GFP-ER. What are these patches and which organelles to they represent? In 1H, given the tight association of the ER (omegasome) with forming IMs, it is difficult to discern whether R18 labels surrounding ER membrane or the IM itself. This needs to be more closely analyzed. The authors need to quantify these data similar to the yeast data.

      Figure 4A-C: A full-length PLT-deficient variant of Atg2 has been analyzed by Dabrowski et al, JCB 2023 in vivo. This work needs to be cited and discussed. The analysis needs to include punctate Atg8 structures for WT cells to exclude effects due to expansion defects.

      Figure 4F-H: To measure the size changes in IMs, the authors would need to perform these experiments without bleaching the mNG-Atg8 signals.

      Figure 5C: The authors need to indicate the bleached areas in the mNG-Atg8 image for easier orientation. It looks to me that the area that the authors mark as IM-ER MCS is really the IM in proximity to the ER. Thus, if lipid transfer to the IM has ceased, I would not expect recovery here. If the IM-ER MCS area includes IM and the ER to similar extent, I would expect exactly what the authors show: IM does not recover while ER quickly recovers. On average, we would observe reduced recovery as shown in 5D.

      Figure 5L: Since mNG-Atg8 signal homogenously disappears from the IM, it is meaningless to measure size. How do the authors measure the size of something they cannot detect?

      Figure 5K: The authors need to show the whole bleached area overtime for the reader to be able to see where the recovered R18 signal might be coming from. Currently, it is impossible to discern whether the signal comes from the IM or from slow recovery from neighboring ER.

      Significance

      The use of a lipophilic dye to monitor lipid dynamics during IM expansion or dissolution is an elegant way to probe the mechanisms of lipid transfer across ER-IM contact sites. Quantitative in vivo data is critically needed to address this fundamental question in autophagy and contact site biology. However, the study remains limited in providing direct evidence that it is indeed the lipid transfer activity of Atg2, which underlies the R18 dynamics in IMs in vivo.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Non-vesicular lipid transfer plays an essential role in organelle biogenesis. Compared to vesicular lipid transfer, it is faster and more efficient to maintain proper lipid levels in organelles. In this study, Hao et al. introduced a high lipophilic dye octadecyl rhodamine B (R18), which specifically labels the ER structures and autophagy-related structures in yeast and mammalian cells. They characterised its distinct lipid entry into yeast cells via lipid flippase Neo1 and Drs2 on the plasma membrane, rather than through the endocytic pathway. They then demonstrated that R18 intracellular trafficking through plasma membrane to ER depends on "box-like" lipid transfer Osh proteins. They further looked into the "bridge-like" lipid transfer protein Atg2, using R18 as a lipid probe to track lipid transfer from ER to the isolation membrane (IM) during membrane expansion and reversible lipid transfer through IM to the ER-IM membrane contact sites (MCS) when autophagy is terminated by nutrient replenishment. The authors provide an interesting model of reversible directionality of Atg2 lipid transfer during autophagy induction and termination.

      Major points:

      1. Line 299-309: The FRAP assays were interesting and well performed. The authors photobleached R18 and Atg8 signal, and found R18 fluorescence recovery but not Atg8, which suggests lipid transfer occurs between ER and the IM and faster than Atg8 lipidation process during IM expansion. These results gave clear evidence that R18 can be transferred during IM expansion. The supply of Atg8 may not be not able to track within this time frame or the recovered amount of Atg8 may not be able to visualized due to the threshold limitation with confocal microcopy. This does not imply the supply of Atg8 to the IM is not required during IM expansion. This should be clarified.
      2. Please clarify how the length of the IM is measured and determined in Figure 4H and Figure 5D.
      3. Line 336-342: The description of the results should be clarified. Based on Figure 5H, the authors observed a significant decrease in the mNG-Atg8 signal during photobleaching of the R18 signal.
      4. The authors photobleached ER-IM MCS and the ER region (boxed region in Figure 5J) and quantified fluorescence recovery, normalized to the IM region and an ER control. The ER control was taken from the other cell. It would be helpful to compare and analyse the fluorescence recovery of R18 in the bleached ER region near the ER-IM MCS to that in the ER-IM MCS. This would help to confirm the ER-IM MCS fluorescence recovery is due to signal coming from the IM.
      5. In figure 5K, the autophagic structure or IM labelled by R18 seems to be maintained when the mNG-Atg8 signal decreases or dissociates from the IM. Could the authors comment on that how they interpret the termination of the prolonged IM structure and IM shrinkage?
      6. The author has shown that Atg2Δ and Atg2LT lipid transfer mutant impair R18 labelling of autophagic structures in Figure 4C. However, the evidence supporting that R18 fluorescence recovery at ER-IM MCS is mediated by reversible Atg2 lipid transfer is not direct. It would be helpful to clarify whether Atg2 stays on the enlarged autophagic membranes when the membrane has reached to its maximum length and no longer grows.

      Minor points:

      1. Figure 2A "Dpm-GFP" is missing. The experiment replicates in Figure 2M should be indicated.
      2. Figure S2, the magenta panel should be "R18".
      3. Line 341-342: "Figure 5H and 5J" should be "Figure 5H and 5I"
      4. Please describe how the lipid docking model of Atg2 is generated.

      Significance

      Currently, lipid probes are emerging as powerful tools to understand membrane dynamics, integrity, and the lipid-mediated cellular functions. In this manuscript, the authors performed a detailed characterisation of octadecyl rhodamine B (R18) as a potential lipid probe, which specifically labels ER and autophagic membranes. They present high quality imaging data and performed FRAP experiments to monitor the membrane dynamics and investigate the lipid transfer directionality between the ER and autophagic structure. However, the evidence of Atg2-mediated reversible lipid transfer may not be direct and sufficient. The proposed reversible lipid transfer model is interesting and provides an explanation of lipid level regulation during autophagosome formation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In their study, Hao and colleagues exploited the fluorescent fatty acid R18 to follow phospholipid (PL) transfer in vivo from the endoplasmic reticulum to the IM during autophagosome formation. Although the results are interesting, especially the retrograde transport of PLs, based on the provided data, additional control experiments are needed to firmly support the conclusions. An additional point is that the authors also study the internalization of R18 into cells and found a role of lipid flippases and oxysterol binding proteins. While this information could be useful for researchers using this dye, these analyses/findings have no specific connection with the topic of the manuscript, i.e. the PL transfer during autophagosome formation. Therefore, they must be removed.

      Major points:

      1. In general, the quality of the microscopy images are quite poor and this make it difficult to assert some of the authors' conclusions.
      2. It would be important to perform some lipidomics analysis to determine in which PLs and other lipids or lipid intermediates R18 is incorporated. First, it will be important to know which the major PL species are are labelled under the conditions of the experiments done in this study. Second, the authors assume that all the R18 is exclusively incorporated into PLs and this is what they follow in their in vivo experiments. What about acyl-CoA, which has been shown to be a key player in the IM elongation (Graef lab, Cell)?
      3. Figure 1A and 1B. The authors conclude that Atg2 is involved in the lipid transfer since R18 does not localize to the PAS/ARS in the atg2KO cells. However, another possible explanation is that in those cells the IM is not formed and does not expand, and con sequetly R18 is present in low amounts not detectable by fluorescence microscopy. To support their conclusion, the authors must assess PAS-labelling with R18 in cells lacking another ATG gene in which Atg2 is still recruited to the PAS.
      4. Figure 2. As written, the paragraph this figure seems to indicate that flippases are directly involved in the translocation of R18 from the PM to the ER. As correctly indicated by the authors, flippases flip PLs, not fatty acids. Moreover, there are no PL synthesizing at the PM and thus probably R18 is not flipped upon incorporation into PL. As a result, the relevance of flippase in R18 internalization is probably indirect. This must be explained clearly to avoid confusion/misunderstandings.
      5. A couple of manuscript has shown a (partial) role of Drs2 in autophagy. The authors must explain the discrepancy between their own results and what published, especially because they use the GFP-Atg8 processing assay, which is less sensitive than the Pho8delta60 used in the other studies.
      6. Authors used a predicted Atg2 lipid-transfer mutant (Srinivasan et al, J Cel Biol, 2024), but not direct prove that this mutant is defective for this activity. As previously done for other Atg2/ATG2-related manuscripts (Osawa et al, Nat Struct Mol Biol, 2019; Valverde et al, J Cel Biol, 2019), this must be measure in vitro. Moreover, they do not show whether other known functions of Atg2 are unaffected when expressing this Atg2 mutant, e.g. formation of the IM-ER MCSs, Atg2 interaction with Atg9 and localization at the extremity of the IM...
      7. The mNG-Atg8 signal is not recovered in the fluorescent recovery assays. Based on the observation that R18 signal comes back after photobleaching, authors suggest that the supply of Atg8 is not required for IM expansion. This idea is opposite to data where the levels of Atg8 and deconjugation of lipidated Atg8 determines the size of the forming autophagosomes (e.g., Xie et al, Mol Biol Cell, 2008; Nair et al, Autophagy, 2012). Similar results have also been obtained in mammalian cells (Lazarou and Mizushima results in cell lacking components of the two ubiquitin-like conjugation systems). This discrepancy requires an explanation.
      8. Although authors claim that there is a retrograde lipid transfer from the IM to the ER, based on the data, it quite difficult to extract these conclusions as they show a decrease in the lipid flow dynamics rather to an inversion of the lipid flow per se. Can the authors exclude that ER microdomains are formed at the ERES in contact with the IM, and consequently what they measure is a slow diffusion of R18-labeled lipid from other part of the ER to these ERES? Additionally, authors should check whether the Atg2 and Atg18 proteins are present at the IM-ER membrane contact sites in the same rates after nutrient replenished than when cells are nitrogen-starved, since this complex would determine the lipid transfer dynamics at this membrane contact site.
      9. The retrograde PL transfer is studied in cells overexpressing Ape1, in which IM elongation is stalled. This is a non-physiological experimental setup and consequently it is unclear whether what observed applies to normal IM/autophagosomes. This event should be shown to occur in WT cells as well.
      10. From the images provided, it appears that R18 also labels the vacuole. The vacuole form MCSs with the IM. Can the author exclude a passage of R18 from the vacuole to the IM?

      Minor points:

      1. L66. One report has indicated that Vps13 may also play a role in the transfer of lipids from the ER to the IM (Graef lab, J. Cell Biol).
      2. L70. It must be indicated that IM is also called phagophore.
      3. L74. It is mentioned "Additionally, a hydrophobic cavity in the N-terminal region of Atg2 directly tethers Atg2 to the ER, particularly the ER exit site (ERES), which is considered a key hub for autophagosome biogenesis", but there is no experimental evidence supporting that Atg2 is involved in the tethering with the ERES.
      4. L90. PAS must be listed between the ARS.
      5. Upon deletion of ATG39 and ATG40, there is a pronounced reduction of mNG-Atg8 labelled with R18. This would suggest that these two ER-phagy receptors are required for the PL transfer from the ER to the IM, which is not the case as autophagy is mildly affected by the absence of them (e.g., Zhang et al, Autophagy, 2020).
      6. Authors referred that "no direct evidence has been found to confirm lipid transfer at the ER-IM MCS in living cells" (lines 282-283). However, a recent paper has shown that de novo-synthesized phosphatidylcholine is incorporated from the ER to the autophagosomes and autophagic bodies (Orii et al, J Cel Biol, 2021). This reference should be mentioned in the manuscript.
      7. In lines 252-253, the sentence "R18 transport from the PM to the ER was partially impaired in osh1Δ osh2Δ, osh6Δ osh7Δ, and oshΔ osh4-1 cells (Figure S3). These results suggest that Osh proteins participate in transferring R18 from the PM to the ER" does not recapitulate what is observed in Fig. S3. Moreover, the Emr lab has generate a tertadeletion mutant in which the PM-ER MCSs are abolished. The authors could examine this mutant.

      Significance

      General assessment:

      Strength: potential new system to monitor lipid flow Limitations: Indirect evidences and in the case of the retrograde transport of phospholipids, it could be an artefact of the employed experimental approach.

      Advance: Little advances because something in part already shown in vitro. No ne mechanisms uncovered.

      Audience: Autophagy and membrane contact site fields.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      *We would like to thank all the reviewers for their positive comments and valuable feedback. In addition, we would like to address reviewer 1 query on novelty, which was not questioned by the other 2 reviewers. Our study uncovered two main aspects of hypoxia biology: first we addressed the role of NF-kappaB contribution towards the transcriptome changes in hypoxia, and second, this revealed a previously unknown aspect, that NF-kappaB is required for gene repression in hypoxia. While we know a lot about gene induction in hypoxia, much less is known about repression of genes. In times of energy preservation, gene repression is as important as gene induction. *

      .

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The work from Shakir et al uses different cell line models to investigate the role of NF-kB in the transcriptional adaptation of cells to hypoxia, which is relevant. In addition, the manuscript contains a large amount of data that could be of interest and even useful for researchers in the field of hypoxia and NF-kB. However, in my opinion, there are several concerns that should be revised and additional experiments that could be included to strengthen the relevance of the work.

      We thank this reviewer for their positive comments.

      Specific issues: In Figure 1A, the authors examine which of the genes induced by hypoxia require NF-kB by RNA sequencing analysis of cells knocked down for specific NF-kB subunits and exposed to hypoxia for 24 hours. The knockdown is about 40-60% at the RNA level, but it would be helpful to show the effect of knockdown at the protein level.

      We agree with this and have added Western blot data (Sup. Figure S1F), which shows the effects of the siRNA are much more pronounced at the protein level.

      All the data regarding genes induced by hypoxia in control or NF-kB siRNA-treated cells are somewhat confusing. If I understand correctly, when the data from the three different siRNAs are crossed, only 1070 genes are upregulated and 295 are downregulated in an NF-kB-independent manner. If this is the case, I think it would be easier to use this information in Figure 2 to define the hypoxia-induced genes that are NF-kB-dependent by simply considering those induced in the control that are not in the NF-kB-independent subset (rather than repeating the integration of the data without additional explanation). If the authors do this simple analysis, are the resulting genes the same or similar? In any case, the way these numbers are obtained should be shown more clearly (i.e., a new Venn diagram showing genes up- or down-regulated in the siRNA control that are not up- or down-regulated in any of the siRNA-NF-kB treatments).

      Figure 1 shows the effects on gene expression of hypoxia in control and NF-____k____B ____subunit____-depleted cells compared to normoxia control cells. Figures 1F/1G compares genes up/downregulated in hypoxia when RelA, RelB, and cRel are depleted, compared to normoxia control. Figure 1 does not display N____F-____k____B____-dependent/independent hypoxia-responsive genes____, but rather the overall effect of siRNA control and siNF-____k____B treatments in hypoxia, compared to siRNA control in normoxia. Figure 2 then defines NF-____k____B-dependent ____and independent hypoxia-responsive genes. We actually define these exactly as the reviewer suggested and agree that we should show the way these numbers are obtained more clearly. We have added the suggested Venn diagrams (Sup. Figure S2) and added extra information to the methods section (page 5 of revised manuscript). We felt it was important to show all the data upfront in Figure 1 and then integrate and focus on NF-____k____B-dependent ____hypoxia-induced genes in Figure 2.

      Figure 2H shows that approximately 80% of the NF-kB-dependent genes up- or down-regulated in hypoxia were identified as RelA targets, which is statistically significant compared to RelB or cRel targets. However, what is the proportion of genes identified as RelA targets in the subset of NF-kB-independent hypoxia-induced genes? And in a randomly selected set of 500-600 genes? In my opinion, this statistical analysis should be included to demonstrate a relationship between NF-kB recruitment and hypoxia-induced upregulation (expected) and downregulation (unexpected). In this context, it is surprising that HIF consensus sites are preferentially detected in the genes that are supposed to be NF-kB dependent instead of RelA consensus.

      We thank the reviewer for this question, which is really helpful. The way we have displayed the stars on the graph for Figure 2H was slightly misleading we realize now. As such, we have amended the graph. RelA, RelB, and cRel bound genes (from the ChIP atlas) are all significantly enriched within our N____F-____k____B-dependent hypoxia-responsive genes, there is no statistical testing between RelA bound vs RelB bound or cRel bound. We have also performed this analysis on the NF-____k____B____-independent hypoxia-responsive genes ____and see the same trend (Sup. Figure S5B). This indicates that the enrichment of Rel binding sites from the ChIP atlas is not specific to NF-____k____B____-dependent hypoxia-responsive genes____. We have moved Figure 2H to (Sup. Figure S5A) and amended our description of the finding. This showcases how DNA binding does not necessarily mean functionality. We have amended our description of this result and limitation of the study.

      Figure 3 is just a confirmation by qPCR of the data obtained in the RNA-seq analysis, which in my opinion should be included as supplementary information. Moreover, both the effects of hypoxia and reversion by RelB siRNA are modest in several of the genes tested. The same is true for Figures 4 and 5 with very modest and variable results across cell types and genes.

      We appreciate this comment; we would like to keep this as a main figure for full transparency and show validation of our RNA-sequencing results.

      Figure 6 shows the effect of NF-kB knockdown on the induction of ROS in response to hypoxia. In the images provided, the effect of hypoxia is minimal in control cells, with the only clear differences shown in RelA-depleted cells.

      The quantification of the IF data (Figure 6B) shows ROS induction in hypoxia which is reduced in Rel-depleted cells, with RelA depletion having the strongest effect. ROS generation in hypoxia, although counterintuitive, is well documented and used for important signalling events. We believe our data supports the previously reported levels of ROS induction (reviewed in {Alva, 2024}) in hypoxia and importantly, that NF-____k____B depletion can at least partially____ reverse this.

      In 6B it is not clear what the three asterisks in the normoxia control represent (compared to the hypoxia siRNA control?). This should be clarified in the figure legend or text.

      We apologize for the lack of clarity we have now added this information to the figure legend.

      In the Western blot of 6C, there are no differences in the levels of SOD1 after RelA depletion. Again, there is no reason not to include the NF-kB subunits in the Western blot analysis.

      We have added the Western blot analysis to this figure. We were trying to simplify it. Although depletion of RelA does not rescue the hypoxia-induced repression of SOD1, depletion of RelB does. Furthermore, cRel although not statistically significant, has a trend for the rescue of this effect, see Figure 6C-D.

      Finally, regarding Figure 7, the authors mention that "we confirmed that hypoxia led to a reduction in several proteins represented in this panel (of proteins involved in oxidative phosphorylation), such as UQCRC2 and IDH1 (Figure 7A-B)". The authors cannot say this because it is not seen in the Western blot in 7A or in the quantification shown in 7B. In my personal opinion, stating something that is not even suggested in the experiments is very negative for the credibility of the whole message.

      We really do not agree with this comment. We do see reductions in the levels of the proteins we mentioned. We have made the figure less complex given that some proteins are very abundant while others are not. We hope the changes are now clear and apparent. We have changed the quantification normalisation to reflect this as well and modified our description of the results, see Figure 7 and Sup. Figure S18.

      In conclusion, this paper contains a large amount of relevant information, but i) non-essential data should be moved to Supplementary, ii) protein levels of relevant players need to be shown in addition to RNA, iii) minimal or undetectable differences need to be considered as no-differences, and iv) a model showing what is the interpretation of the data provided is needed to better understand the message of the paper. I mean, is it p65 or RelB binding to some of these genes leading to their activation or repression, or is it RelA or RelB inducing HIF1beta leading to NF-kB-dependent gene activation by hypoxia? If this were the case, experimental evidence that NF-kB regulates a subset of hypoxia genes through HIF1beta would make the story more understandable.

      We apologise but we do not know why the reviewer mentions HIF1beta. For gene induction, there is cooperation with the HIF system in some genes but not all. The most interesting and unexpected finding is that NF-kappaB is required for gene repression in hypoxia. We have added a new figure, investigating how HDAC inhibition could reverse the repression. A mechanism known to be employed by NF-kappaB when repressing genes. We have added all the blots for NF-kB, clarified the quantification and included other approaches including a CRISPR KO cell lines for both IKKs. We hope this is now clear.

      Reviewer #1 (Significance (Required)):

      The work presented here is interesting but does not provide a major advance over previous publications, the main message being that a subset of hypoxia-regulated genes are NF-kB dependent. However, there is no mechanistic explanation of how this regulation is achieved and several data that are not clearly connected. A more comprehensive analysis of the data and additional experimental validation would greatly enhance the significance of the work.

      We politely disagree with the reviewer. Our main finding is that NF-____k____B____ does play an important role in gene regulation in hypoxia but unexpectedly, this occurs mostly via gene repression. While there is vast knowledge on gene induction in hypoxia, gene repression, which typically does not occur directly via HIF, is virtually unknown. A previous study had identified Rest as a transcriptional repressor {PMID: 27531581} but this could only account for 20% of gene repression. Our findings reveal up to 60% of genes repressed in hypoxia require NF-____k____B____, hence this is a significant finding and a major advance over previous knowledge. Furthermore, we feel this paper is an excellent data resource for the field, as it is, to our knowledge, the first study characterising the extent to which NF-____k____B is required for hypoxia-induced gene changes, on a transcriptome-wide scale. Furthermore, we have validated this across multiple cell types and also used different approaches to investigate the role of NF-kB in the hypoxia transcriptional response. We are happy that the other reviewers agree with our novel findings.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors have interrogated the role of NF-kappaB in the cellular transcriptional response to hypoxia. While HIF is considered the master regulator of the cellular response to hypoxia, it has long been known that mutliple transcription factors also play a role both independently of HIF and through the regulation of HIF-1alpha levels. Chief amongst these is NF-kappaB, a regulator of cell death and inflammation amongst other things. While NF-kappB has been known to be activated in hypoxia through altered PhD activity, the impact of this on global gene expression has remained unclear and this study addresses this important question. Of particular interest, genes downregulated in hypoxia appear to be repressed in a NF-kappaB-dependent manner. Overall, this nice study reveals an important role for NF-kappaB in the control of the global cellular transcriptional response to hypoxia.

      We thank this reviewer for their positive comments.

      Reviewer #2 (Significance (Required)):

      Some questions for the authors to consider with experiments or discussion: -One caveat of the current study which should be discussed is that while interesting and extensive, the analysis is restricted to cancer cell lines which have dysfunctional gene expression systems which may differ from "normal" cells. This should be discussed.

      We thank the reviewer for these comments. This is indeed an important aspect, which we now expand on in the discussion section. We also took advantage of RNA-seq datasets for HUVECs (a non-transformed cell lines) in response to hypoxia (Sup. Figure S15), TNF-alpha with and without RelA depletion (Sup. Figure S16). These data support our findings that in hypoxia NF-kB is important for transcriptional repression, with some contributions to gene induction, even in a non-transformed cell system.

      In the publicly available data sets analyzed, were the same hypoxic conditions used as in this study. This information should be included.

      We apologize if this was not clear, the hypoxia RNA-seq studies are the same oxygen level and time (1%, 24 hours), this is in the legend of Figure 4A and Sup. Figure S9 and in Sup. Table S2. We have added this information to the main text also.

      • What is known about NF-kappaB as a transcriptional repressor in other systems such as the control of cytokine or infection driven inflammation? This is briefly discussed but should be expanded. This is important as a key question in the study of hypoxia is what regulates gene repression.

      We have included this in the discussion and also analysed available data in HUVECs in response to cytokine stimulation with and without RelA depletion (Sup. Figure S16). This analysis revealed equal importance of RelA for activation and repression of genes upon TNF-alpha stimulation. Around 40% of genes require RelA for their induction or repression in response to TNF-a. In the discussion we have also included other references where NF-kappaB has been found to repress genes.

      NF-kappaB has previously been shown to regulate HIF-1alpha transcription. What are the effects of NF-kappaB subunit siRNAs on basal HIF-1alpha transcription? In figure 7, it appears that NF-kappaB subunit siRNA is without effect on hypoxia-induced HIF protein expression. Could this account for some of the effects of NF-kappaB depletion on the hypoxic gene signature? This point needs to be clarified in light of the data presented.

      We have included data for HIF-1α RNA levels in HeLa cells with/without NF-____k____B____ depletion followed by 24 hours of hypoxia (Sup. Figure S20) and we see a small reduction (~10-20%). The reviewer is correct, there was not much effect of NF-____k____B____ depletion on HIF-1α protein levels following 24 hours hypoxia in HeLa cells. Effects of NF-kappaB depletion can be found usually with lower times of hypoxia exposure or when more than one subunit is depleted at the same time. We have added this as a discussion point in the revised manuscript.

      NRF-2 is a key cellular sensor of oxidative stress in a similar way to HIF being a hypoxia sensor. The authors demonstrate using a dye that ROS are paradoxically increased in hypoxia (a more controversial finding than the authors present). It would be of interest to know if NFR-2 is induced in hypoxia as a marker of cellular oxidative stress. Similarly, it would be interesting to determine by metabolic analysis whether oxidative phosphorylation (O2 consumption) is decreased as the transcriptional signature would suggest (although the difficulty of performing metabolic analysis in hypoxia is acknowledged).

      To investigate if NRF2 is induced, we performed a western blot at 0, 1, and 24 hours 1% oxygen, but didn’t see any induction of NRF2 protein levels (____Sup. Figure S17A). We also overlapped our hypoxia upregulated genes with NRF2 target genes from {PMID:24647116 and PMID: 38643749} (Sup. Figure S17B) and found limited evidence of NRF2 target genes being induced. Based on these findings, it seems that NRF2 is not being induced in hypoxia, at least not at the hypoxia level/time point we have analysed. We also agree it would be ideal to measure oxygen consumption in hypoxia, but unfortunately, we do not have the technical ability to do this at present.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Strengths This manuscript attempts to integrate multiple strands of data to determine the role of NFkB in hypoxia -induced gene expression. This analysis looks at multiple NFkB subunits in multiple cell lines to convincingly demonstrate that NFkB does indeed play a central role in the regulation of hypoxia-induced gene expression. This broad approach integrates new experimental data with findings from the published literature.

      A significant amount of work has been performed both experimentally and bioinformatically to test experimental hypotheses.

      We thank this reviewer for their positive comments.

      Limitations

      The main analysis in the paper involves comparing the impact of knocking down different NFkB family members in hypoxia and comparing transcriptional responses. I am surprised that the authors did not include the impact of knockdown of the NFkB family members in normoxia too. The absence of these control experiments allows us to understand the role of NFkB in hypoxia, but does not give us information as to how many of those impacts are specific/ induced in hypoxic conditions. i.e. many of the observed effects of NFkB knockdown could be due to basal suppression of NFkB target genes that happen to be hypoxia sensitive. This finding is obviously important, but it would be nice to know how many of those genes are only / preferentially regulated by NFkB in hypoxia. This would give a much deeper insight into the role of NFkB in hypoxia induced gene expression.

      We agree this would have been ideal. For financial reasons we limited our analysis to hypoxia samples. We have performed qPCR analysis depleting RelA, RelB and cRel under normal oxygen conditions in HeLa (Sup. Figure S8). We find that the majority of the validated genes in HeLa cells which require____ NF-____k____B for gene changes in hypoxia, are not regulated by N____F-____k____B under normal oxygen conditions____. We have also added this limitation into our discussion section.

      The broad experimental approach while a strength of the paper in many ways also has its limitations e.g. Motif analysis revealing e.g. HIF-1a binding site enrichment in RelA and RelB-dependent DEGs is correlative observation and does not prove HIF involvement in NFkB-dependent hypoxia induced gene activation. Comparing responses with responses seen in one cell type with responses that have been described in a database comprised of many studies in a variety of different cells also has some limitations. These points can be described more fully in the discussion

      We agree these are mere correlations and hence a limitation and we have not formerly tested the involvement of HIF. We have included this in the discussion as suggested. For HIF binding site correlation, we do also compare to HIF ChIP-seq in HeLa cells exposed to 1% oxygen, albeit at 8 hours and not 24 hours (Sup. Figure S4).

      For siRNA transfections, single oligonucleotide sequences were used for RelA, RelB and cRel. This increases the potential likelihood of 'off targets' compared to pooled oligos delivered at lower concentrations. This limitation should at least be mentioned.

      We agree and have now included this as a limitation in the discussion section. We have now also included analysis using wild type and 2 different IKK____________ double KO CRISPR cell lines generated in the following publication {PMID: 35029639}. Out of the 9 genes we identified as NF-____k____B-dependent hypoxia upregulated genes from HeLa cell RNA-seq and validated by qPCR, which are also hypoxia-responsive in HCT116 cells (Sup. Figure S11D), 6 displayed ____NF-____k____B dependence in HCT116 cells (Sup. Figure S14). We also provide new protein data in this cell system for oxidative phosphorylation markers, which show as with the siRNA depletion, rescue of repression of these proteins when NF-____k____B is inactivated.

      RNA-seq experiments are performed on n=2 data which means relatively low statistical power. How has the statistical analysis been performed on normalised counts (corresponding to 2 n- numbers) to yield statistical significance? I am not familiar with hypergeometric tests - please justify their use here.

      __*We use DESeq2 for differential expression analysis and filter for effect size (> -/+ 0.58 log2 fold change) and statistical significance (FDR I am not familiar with hypergeometric tests - please justify their use here.

      The hypergeometric test (equivalent to a one-sided Fisher's exact test) is routinely used to determine whether the observed overlap between two gene lists is statistically significant compared to what would be expected by chance. It is also the statistical test of choice for popular bioinformatics tools which perform over representation analysis (ORA) to see which gene sets/groups/pathways/ontologies are over-represented in a gene list, examples include Metascape, clusterProfiler, WebGestalt (used in this study), and gProfiler.

      P14 RelB is described as having the most widespread impact of hypoxia dependent gene changes across all cell systems tested. Could this be due to a more potent silencing of RelB and / or due to particularly high/ low expression of RelB in these cells in general?

      This is an excellent point, at the RNA level the RelB depletion is slightly more efficient (Sup. Figure S1), at the protein level, silencing is highly potent with all 3 siRNAs (Sup. Figure S1). We looked at the RNA levels of RelA, RelB and cRel in HeLa cells at basal conditions, and RelA shows the highest abundance compared to RelB and cRel, while RelB and cRel have similar expression levels (see below). However, RelB is very dynamic in response to hypoxia, something we have observed but have not published yet.

      P18 For western blot analysis best practise is to have 2 MW markers per blot presented

      We have and have added the second MW markers suggested.

      For quantification, I suggest avoiding performing statistical analysis on semi-quantitative data unless a dynamic range of detection (with standards) has been fully established.

      We agree this has many limitations, we will keep the quantification but moved into supplementary information.

      P19 There is clearly an effect of reciprocal silencing with the NFkB knockdown experiments ie. siRelA affects RelB levels in hypoxia and vice versa. The implications of this for data interpretation should be discussed.

      Indeed, it is well known that RelB and cRel are RelA targets. Less is known about RelA as it is not a known NF-____k____B____ target. We have added a discussion in the revised manuscript.

      P20 The literature can be better cited in relation RelB and hypoxia A brief search reveals a few papers that should be mentioned/ discussed. Oliver et al. 2009 Patel et al. 2017 Riedl et al. 2021

      We have looked into these suggestions. Oliver et al, refer to hypercapnia, not hypoxia and the other two only briefly mentioned RelB with no effects toward the goals of their studies. We have tried to incorporate what is currently known as much as possible.

      I suggest leaving out mention of IkBa sumoylation and supplementary figure 10. I'm not sure the data in the paper as a whole merits focus on this very specific point.

      We thank the reviewer for this suggestion and we have removed this aspect from the manuscript.

      There is a very strong reliance on mRNA and TPM data. Some additional protein data in support of key findings will enhance

      We have added additional protein level analysis where we could obtain antibodies, see Figures 6, 7 and Sup. Figures S17, S18, and S19 for our protein level analysis.

      A graphical abstract summarising key findings with exemplar genes highlighted will enhance.

      We have added a model to summarise our findings as suggested.

      Both HIF and NFKB are ancient evolutionarily conserved pathways. Can lessons be learned from evolutionary biology as to how NFkB regulation of hypoxia induced genes occured. Does the HIF pathway pre-date the NFkB pathway or vice versa. This approach could be valuable in supporting the findings from this study.

      We have investigated this. Unfortunately, there are very little available data on hypoxia gene expression in lower organisms. However, we have added a few sentences on the evolution of NF-____k____B____ and HIF.

      Minor comments P2 please briefly explain how 5 genes give rise to 7 proteins

      We have added this to the introduction as requested.

      P2 there seems to be some recency bias in the studies cited as being associated with NFkB activation in response to hypoxia. Mention of Koong et al (1994) and Taylor et al (1999) and other early papers in the field will enhance

      We have added these as suggested.

      P3 The role of PHD enzymes in the regulation of NFkB in hypoxia can be introduced and / or discussed

      We have added a reference to this aspect as suggested.

      P8 I suggest use of proportional Venn diagrams to demonstrate the patterns more clearly

      We have added these as suggested.

      P11 To what extent might NFkB and Rest co-operate/ co-regulate gene repression in hypoxia?

      This is a good question. We have overlapped our datasets with Rest-dependent hypoxia-regulated genes identified by Cavadas et al., (Figure below), and find that these appear to act independently of each other for the most part, with very few genes co-regulated by both.

      Reviewer #3 (Significance (Required)):

      Shakir et al. present a manuscript titled 'NFkB is a central regulator of hypoxia-induced gene expression'.

      The research group are experts in both NFkB and hypoxia signaling and are the ideal group to perform these studies.

      Hypoxia and inflammation are co-incident in many physiological and pathophysiological conditions, where the microenvironment affects disease severity and patient outcome. The cross talk between inflammatory and hypoxia signaling pathways is not fully described. Thus, this manuscript takes a novel approach to an established question and concludes clearly that NFkB is a central regulator of hypoxia-induced gene expression.

      We thank the reviewer for these positive comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Strengths

      This manuscript attempts to integrate multiple strands of data to determine the role of NFkB in hypoxia -induced gene expression. This analysis looks at multiple NFkB subunits in multiple cell lines to convincingly demonstrate that NFkB does indeed play a central role in the regulation of hypoxia-induced gene expression. This broad approach integrates new experimental data with findings from the published literature.

      A significant amount of work has been performed both experimentally and bioinformatically to test experimental hypotheses.

      Limitations

      The main analysis in the paper involves comparing the impact of knocking down different NFkB family members in hypoxia and comparing transcriptional responses. I am surprised that the authors did not include the impact of knockdown of the NFkB family members in normoxia too. The absence of these control experiments allows us to understand the role of NFkB in hypoxia, but does not give us information as to how many of those impacts are specific/ induced in hypoxic conditions. i.e. many of the observed effects of NFkB knockdown could be due to basal suppression of NFkB target genes that happen to be hypoxia sensitive. This finding is obviously important, but it would be nice to know how many of those genes are only / preferentially regulated by NFkB in hypoxia. This would give a much deeper insight into the role of NFkB in hypoxia induced gene expression.

      The broad experimental approach while a strength of the paper in many ways also has its limitations e.g. Motif analysis revealing e.g. HIF-1a binding site enrichment in RelA and RelB-dependent DEGs is correlative observation and does not prove HIF involvement in NFkB-dependent hypoxia induced gene activation. Comparing responses with responses seen in one cell type with responses that have been described in a database comprised of many studies in a variety of different cells also has some limitations. These points can be described more fully in the discussion

      For siRNA transfections, single oligonucleotide sequences were used for RelA, RelB and cRel. This increases the potential likelihood of 'off targets' compared to pooled oligos delivered at lower concentrations. This limitation should at least be mentioned.

      RNA-seq experiments are performed on n=2 data which means relatively low statistical power. How has the statistical analysis been performed on normalised counts (corresponding to 2 n- numbers) to yield statistical significance? I am not familiar with hypergeometric tests - please justify their use here.

      P14 RelB is described as having the most widespread impact of hypoxia dependent gene changes across all cell systems tested. Could this be due to a more potent silencing of RelB and / or due to particularly high/ low expression of RelB in these cells in general?

      P18 For western blot analysis best practise is to have 2 MW markers per blot presented

      For quantification, I suggest avoiding performing statistical analysis on semi-quantitative data unless a dynamic range of detection (with standards) has been fully established.

      P19 There is clearly an effect of reciprocal silencing with the NFkB knockdown experiments ie. siRelA affects RelB levels in hypoxia and vice versa. The implications of this for data interpretation should be discussed.

      P20 The literature can be better cited in relation RelB and hypoxia A brief search reveals a few papers that should be mentioned/ discussed. Oliver et al. 2009 Patel et al. 2017 <br /> Riedl et al. 2021

      I suggest leaving out mention of IkBa sumoylation and supplementary figure 10. I'm not sure the data in the paper as a whole merits focus on this very specific point.

      There is a very strong reliance on mRNA and TPM data. Some additional protein data in support of key findings will enhance

      A graphical abstract summarising key findings with exemplar genes highlighted will enhance.

      Both HIF and NFKB are ancient evolutionarily conserved pathways. Can lessons be learned from evolutionary biology as to how NFkB regulation of hypoxia induced genes occured. Does the HIF pathway pre-date the NFkB pathway or vice versa. This approach could be valuable in supporting the findings from this study.

      Minor comments

      P2 please briefly explain how 5 genes give rise to 7 proteins

      P2 there seems to be some recency bias in the studies cited as being associated with NFkB activation in response to hypoxia. Mention of Koong et al (1994) and Taylor et al (1999) and other early papers in the field will enhance

      P3 The role of PHD enzymes in the regulation of NFkB in hypoxia can be introduced and / or discussed

      P8 I suggest use of proportional Venn diagrams to demonstrate the patterns more clearly

      P11 To what extent might NFkB and Rest co-operate/ co-regulate gene repression in hypoxia?

      Significance

      Shakir et al. present a manuscript titled 'NFkB is a central regulator of hypoxia-induced gene expression'.

      The research group are experts in both NFkB and hypoxia signaling and are the ideal group to perform these studies.

      Hypoxia and inflammation are co-incident in many physiological and pathophysiological conditions, where the microenvironment affects disease severity and patient outcome. The cross talk between inflammatory and hypoxia signaling pathways is not fully described. Thus, this manuscript takes a novel approach to an established question and concludes clearly that NFkB is a central regulator of hypoxia-induced gene expression.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors have interrogated the role of NF-kappaB in the cellular transcriptional response to hypoxia. While HIF is considered the master regulator of the cellular response to hypoxia, it has long been known that mutliple transcription factors also play a role both independently of HIF and through the regulation of HIF-1alpha levels. Chief amongst these is NF-kappaB, a regulator of cell death and inflammation amongst other things. While NF-kappB has been known to be activated in hypoxia through altered PhD activity, the impact of this on global gene expression has remained unclear and this study addresses this important question. Of particular interest, genes downregulated in hypoxia appear to be repressed in a NF-kappaB-dependent manner. Overall, this nice study is reveals an important role for NF-kappaB in the control of the global cellular transcriptional response to hypoxia.

      Significance

      Some questions for the authors to consider with experiments or discussion:

      • One caveat of the current study which should be discussed is that while interesting and extensive, the analysis is restricted to cancer cell lines which have dysfunctional gene expression systems which may differ from "normal" cells. This should be discussed.
      • In the publicly available data sets analysed, were the same hypoxic conditions used as in this study. This information should be included.
      • What is known about NF-kappaB as a transcriptional repressor in other systems such as the control of cytokine or infection driven inflammation? This is briefly discussed but should be expanded. This is important as a key question in the study of hypoxia is what regulates gene repression.
      • NF-kappaB has previously been shown to regulate HIF-1alpha transcription. What are the effects of NF-kappaB subunit siRNAs on basal HIF-1alpha transcription? In figure 7, it appears that NF-kappaB subunit siRNA is without effect on hypoxia-induced HIF protein expression. Could this account for some of the effects of NF-kappaB depletion on the hypoxic gene signature? This point needs to be clarified in light of the data presented.
      • NRF-2 is a key cellular sensor of oxidative stress in a similar way to HIF being a hypoxia sensor. The authors demonstrate using a dye that ROS are paradoxically increased in hypoxia (a more controversial finding than the authors present). It would be of interest to know if NFR-2 is induced in hypoxia as a marker of cellular oxidative stress. Similarly it would be interesting to determine by metabolic analysis whether oxidative phosphorylation (O2 consumption) is decreased as the transcriptional signature would suggest (although the difficulty of performing metabolic analysis in hypoxia is acknowleged).
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The work from Shakir et al uses different cell line models to investigate the role of NF-kB in the transcriptional adaptation of cells to hypoxia, which is relevant. In addition, the manuscript contains a large amount of data that could be of interest and even useful for researchers in the field of hypoxia and NF-kB. However, in my opinion, there are several concerns that should be revised and additional experiments that could be included to strengthen the relevance of the work.

      Specific issues:

      In Figure 1A, the authors examine which of the genes induced by hypoxia require NF-kB by RNA sequencing analysis of cells knocked down for specific NF-kB subunits and exposed to hypoxia for 24 hours. The knockdown is about 40-60% at the RNA level, but it would be helpful to show the effect of knockdown at the protein level. All the data regarding genes induced by hypoxia in control or NF-kB siRNA-treated cells are somewhat confusing. If I understand correctly, when the data from the three different siRNAs are crossed, only 1070 genes are upregulated and 295 are downregulated in an NF-kB-independent manner. If this is the case, I think it would be easier to use this information in Figure 2 to define the hypoxia-induced genes that are NF-kB-dependent by simply considering those induced in the control that are not in the NF-kB-independent subset (rather than repeating the integration of the data without additional explanation). If the authors do this simple analysis, are the resulting genes the same or similar? In any case, the way these numbers are obtained should be shown more clearly (i.e., a new Venn diagram showing genes up- or down-regulated in the siRNA control that are not up- or down-regulated in any of the siRNA-NF-kB treatments). Figure 2H shows that approximately 80% of the NF-kB-dependent genes up- or down-regulated in hypoxia were identified as RelA targets, which is statistically significant compared to RelB or cRel targets. However, what is the proportion of genes identified as RelA targets in the subset of NF-kB-independent hypoxia-induced genes? And in a randomly selected set of 500-600 genes? In my opinion, this statistical analysis should be included to demonstrate a relationship between NF-kB recruitment and hypoxia-induced upregulation (expected) and downregulation (unexpected). In this context, it is surprising that HIF consensus sites are preferentially detected in the genes that are supposed to be NF-kB dependent instead of RelA consensus. Figure 3 is just a confirmation by qPCR of the data obtained in the RNA-seq analysis, which in my opinion should be included as supplementary information. Moreover, both the effects of hypoxia and reversion by RelB siRNA are modest in several of the genes tested. The same is true for Figures 4 and 5 with very modest and variable results across cell types and genes. Figure 6 shows the effect of NF-kB knockdown on the induction of ROS in response to hypoxia. In the images provided, the effect of hypoxia is minimal in control cells, with the only clear differences shown in RelA-depleted cells. In 6B it is not clear what the three asterisks in the normoxia control represent (compared to the hypoxia siRNA control?). This should be clarified in the figure legend or text. In the Western blot of 6C, there are no differences in the levels of SOD1 after RelA depletion. Again, there is no reason not to include the NF-kB subunits in the Western blot analysis. Finally, regarding Figure 7, the authors mention that "we confirmed that hypoxia led to a reduction in several proteins represented in this panel (of proteins involved in oxidative phosphorylation), such as UQCRC2 and IDH1 (Figure 7A-B)". The authors cannot say this because it is not seen in the Western blot in 7A or in the quantification shown in 7B. In my personal opinion, stating something that is not even suggested in the experiments is very negative for the credibility of the whole message. In conclusion, this paper contains a large amount of relevant information, but i) non-essential data should be moved to Supplementary, ii) protein levels of relevant players need to be shown in addition to RNA, iii) minimal or undetectable differences need to be considered as no-differences, and iv) a model showing what is the interpretation of the data provided is needed to better understand the message of the paper. I mean, is it p65 or RelB binding to some of these genes leading to their activation or repression, or is it RelA or RelB inducing HIF1beta leading to NF-kB-dependent gene activation by hypoxia? If this were the case, experimental evidence that NF-kB regulates a subset of hypoxia genes through HIF1beta would make the story more understandable.

      Significance

      The work presented here is interesting but does not provide a major advance over previous publications, the main message being that a subset of hypoxia-regulated genes are NF-kB dependent. However, there is no mechanistic explanation of how this regulation is achieved and several data that are not clearly connected. A more comprehensive analysis of the data and additional experimental validation would greatly enhance the significance of the work.

  4. May 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Review

      Manuscript number: RC-2024-02391

      Corresponding author(s): John Varga

      Dibyendu Bhattacharyya

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      Dear editor,

      We are pleased to submit a full revised version of the manuscript that addresses all the points raised by the reviewers. We have included new experiments and modified the text and figures based on the reviewers’ suggestions. We thank all the reviewers for their insightful feedback, which has significantly enhanced the quality of the manuscript. We are confident and optimistic that our improved manuscript will be accepted by the journal of our choice.

      This document is supposed to contain a few images, which were somehow missing after the processing through the manuscript submission path. For convenience we also included a PDF version of the response to reviewers.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1

      • To reliably quantify the ciliary length in different cell types, and in independent ciliary marker needs to be included for comparison and the ciliary base needs to be labeled (e.g., g-TUBULIN). This needs to combined with a non-biased, high-throughput analysis, e.g., CiliaQ, Response: As suggested, we compared primary cilia length measurements using antibodies against Arl13b and γ-tubulin. The comparison between healthy controls (HC) and systemic sclerosis (SSc) is presented in Supplementary Figure S1. No significant differences in primary cilia length were observed compared to our previous measurements. Cilia length was quantified using ImageJ version 1.48v (http://imagej.nih.gov/ij) with the maximum intensity projection (MIP) method and visualized through 3D reconstruction using the ImageJ 3D Viewer.

      • As mentioned in the study, TGFbhas been implicated to drive myofibroblast transition. Thus TGFb stimulate ciliary signaling in the presented primary cells? The authors should provide a read-out for TGFb signaling in the cilium (ICC for protein phosphorylation etc.). Furthermore, canonical ciliary signaling pathways have been suggested to act as fibrotic drivers, such as Hedgehog and Wnt signaling - does stimulation of these pathways evoke a similar effect? Response: Yes, TGF-β1 stimulates ciliary signaling in growth-arrested foreskin fibroblasts. Clement et al. (2013) showed that TGF-β1 induces p-SMAD2/3 at the ciliary base, followed by the nuclear translocation of p-SMAD2/3 after 90 minutes. To assess whether canonical ciliary signaling pathways influence primary cilia length, we treated foreskin fibroblasts with Wnt (#908-SH, R&D) and a Shh agonist (#5036-WN, R&D) at 100 ng/mL each for 24 hours. We did not observe any changes in primary cilia length under either condition. These data are shown here for reference but are not included in the manuscript.

      Clement, Christian Alexandro, et al. "TGF-β signaling is associated with endocytosis at the pocket region of the primary cilium." Cell reports 3.6 (2013): 1806-1814.

      • Does TGFbinduce cell proliferation? If yes, this would force cilium disassembly and, thereby, reduce ciliary length, which is independent of a "shortening" mechanism proposed by the authors. Response: Yes, TGF-β induces cell proliferation in fibroblasts (Lee et al., 2013; Liu et al., 2016). However, we did serum starvation to stop proliferation. In our study, we observed a few percentage of Ki67-positive cells under TGF-β treatment at 24 hours (Supplementary Figure S2C). However, cell proliferation mainly stopped after 48 hours. Typically, proliferating cells rarely display any PC or show very small puncta. In our case, we observe a significantly elongated PC structure (although shorter than that of untreated cells) under TGF-beta-treated conditions. Our results display that a majority of cells are not proliferating but still display PC shortening under TGF-β treatment, suggesting that PC shortening is not due to cell division-induced PC disassembly. TGF beta-induced PC shortening is also reported in another fibroblast type previously (Kawasaki et al., 2024).

      Kawasaki, Makiri, et al. "Primary cilia suppress the fibrotic activity of atrial fibroblasts from patients with atrial fibrillation in vitro." Scientific Reports 14.1 (2024): 12470.

      Lee, J., Choi, JH. & Joo, CK. TGF-β1 regulates cell fate during epithelial–mesenchymal transition by upregulating survivin. Cell Death Dis 4, e714 (2013). https://doi.org/10.1038/cddis.2013.244.

      Liu, Y. et al. TGF-β1 promotes scar fibroblasts proliferation and transdifferentiation via up-regulating MicroRNA-21. Sci. Rep. 6, 32231; doi: 10.1038/srep32231 (2016).

      • As PGE2 has been shown to signal through EP4 receptors in the cilium, is the restoration of primary cilia length due to ciliary signaling? Response: As per your suggestion, we measured cilia length in the presence and absence of the EP4 receptor antagonist (#EP4 Receptor Antagonist 1; #32722; Cayman Chemicals; 500 nM) with PGE2. Interestingly, we did not observe a change in cilia length between the PGE2 and TGFβ (with EP4 receptor antagonist) treatment groups, as shown in supplementary figure S3. We believe that PGE2 works with the EP2 receptor under our experimental conditions. Kolodsick et al., 2003, also observed that PGE2 inhibits myofibroblast differentiation via activation of EP2 receptors and elevations in cAMP levels in healthy lung fibroblasts.

      Kolodsick, Jill E., et al. "Prostaglandin E2 inhibits fibroblast to myofibroblast transition via E. prostanoid receptor 2 signaling and cyclic adenosine monophosphate elevation." American journal of respiratory cell and molecular biology 29.5 (2003): 537-544.

      • Primary cilia length is regulated by cAMP signaling in the cilium vs. cytoplasm - does cAMP signaling play a role in this context? PGE2 is potent stimulator of cAMP synthesis - does this underlie the rescue of primary cilia length? Response: Yes, cAMP levels are important for both myofibroblast dedifferentiation and cilia length elongation. Kolodsick et al., 2003 observed that PGE2 inhibits myofibroblast differentiation via activation of EP2 receptors and elevations in cAMP levels in healthy lung fibroblasts. In a parallel set of experiments, treatment with forskolin (a cAMP activator) also reduced α-SMA protein levels by 40%. Forskolin is also known to increase PC length.

      Kolodsick, Jill E., et al. "Prostaglandin E2 inhibits fibroblast to myofibroblast transition via E. prostanoid receptor 2 signaling and cyclic adenosine monophosphate elevation." American journal of respiratory cell and molecular biology 29.5 (2003): 537-544.

      • The authors describe that they wanted to investigate how aSMA impacted primary cilia length. They only provide a knock-down experiment and measured ciliary length, but the mechanistic insight is missing. How does loss of aSMA expression control ciliary length? Response: We measured acetylated α-tubulin levels in ACTA2 siRNA-treated cells compared to control-treated cells. Acetylated α-tubulin levels increased under ACTA2 siRNA-treated conditions, as shown in Figure 4D, and TPPP3 levels were also elevated (Figure S8A). Interestingly, TPPP3 levels negatively correlated with disease severity in SSc fibroblasts (r = -0.2701, p = 0.0183), and TPPP3 expression significantly reduced in SSc skin biopsies, as shown in Figures 6C and 6D. These results strengthen our hypothesis that microtubule polymerization and actin polymerization, while they counterbalance each other, also contrarily affect PC length. We agree that a much more detailed study is needed to extensively delineate the intricate homeostasis of the actin network and microtubule network in conjunction with fibrosis and primary cilia length. We have mentioned this in the discussion.

      • The authors used LiCl in their experiments, which supposedly control Hh signaling. Coming back to my second questions, is this Hh-dependent? And what is the common denominator with respect to TGFbsignaling? And how is this mechanistically connected to actin and microtubule polymerization? Response: We used Shh inhibitor (Cyclopamine hydrate #C4116 Sigma-Aldrich) in both SSc and foreskin fibroblasts (with and without TGFβ). We found that PC length is significantly increased and αSMA intensity is reduced in the Shh inhibitor treated group (data not included in the Manuscript)

      • How was the aSMA Mean intensity determined? Response: We quantified aSMA mean intensity using ImageJ, and the procedure has been added to the respective figure legend and materials and methods section under ‘Quantification of immunofluorescence’ (each point represents mean intensity from three randomly selected hpf/slide was performed using ImageJ).

      • Fig: 1D: Statistical test is missing in Figure Legend and presentation of the p-values for the left graph is confusing Response: We added statistical test information in Figure Legend.

      • Some graphs are presented {plus minus} SD and some {plus minus} SEM, but this is not correctly stated in the Material & Methods Part __Response: __We added information to the figure legend as well as in the Material & Methods section.

        • 4D&E: Statistical test is missing in Figure Legend* Response: We added it now.
      • In general, text should be checked again for spelling mistakes and sentences may be re-written to promote readability. In particular, this applies to the discussion. __Response: __We checked and corrected.

      • Figure Legends are not written consistently, information is missing (e.g., statistical tests, see above). __Response: __We carefully checked and added information accordingly.

      • Figures should be checked again, and all text should be the same size and alignment of images should be improved. __Response: __We checked and corrected.

      Significance

      The authors present a novel connection between the regulation of primary cilia length and fibrogenesis. However, the study generally lacks mechanistic insight, in particular on how TGFb signaling, aSMA expression, and ciliary length control are connected. The spatial organization of the proposed signaling components is also not clear - is this a ciliary signaling pathway? If so, how does it interact with cytoplasmic signaling and vice versa?

      Response: Thank you for your thoughtful and constructive feedback. We appreciate your recognition of the novelty of our study linking primary cilia length regulation to fibrogenesis. In our revised manuscript, we did provide a mechanistic insight, though. Our results suggest that during the fibrotic response, higher-order actin polymerization, along with microtubule destabilization resulting from tubulin deacetylation, drives the shortening of PC length. In contrast, PC length elongation via stabilization of microtubule polymerization mitigates the fibrotic phenotype in fibrotic fibroblasts. We agree that a deeper mechanistic understanding particularly regarding how TGFβ signaling, αSMA expression, and ciliary length control intersect is essential for fully elucidating the pathway. We also acknowledge the importance of clarifying the spatial organization of the signaling components and plan to incorporate such analyses in future studies.

      Reviewer #2

      *I found the paper to be rather muddled and its presentation made if somewhat difficult to follow. For example, the Figures are disorganised (Fig 1 is a great example of this) and there was reference to Sup data that appeared out of order (eg Sup Fig 2 appeared before Sup Fig 1 in the text). *

      Response: We carefully revised the manuscript and arranged the figures.

      *Images in a single figure should be the same size. Currently they are almost random and us different magnifications. Overall, the paper needs to be better organized. *

      Response: We carefully revised the manuscript and figures provided with same magnification.

      *I have some significant concerns about how the PC length data was generated. To my mind the length may be hard to determine from the type of images shown in the paper (which may represent the best images?). Some of the images presented appear to show shorter, fatter PCs in the cells from fibrosis cases. Is this real or is it some kind of artefact? Would a shorter, fatter PCs have a similar or larger surface area? What would be the consequence of this? *

      Response: Primary cilia length was measured with ImageJ1.48v (using maximum intensity projection (MIP) method and visualized by 3D reconstruction with the ImageJ 3D viewer. Each small dot represents the PC length from an individual cell, and each large dot represents the average of the small dots for one cell line.

      *I am confused as to exactly what is meant by matched healthy controls. Age, sex and ethnicity, where stated seem to be very variable? What are CCL210 fibroblasts? *

      Response: We appreciate this comment. This is correct. The age, sex, and ethnicity are not matched for the available healthy controls. We have corrected that in the text. CCL210 is a commercially available fibroblast cell line that was isolated from the lung of a normal White, 20-year-old, female patient.

      *What does a change in PC length signify? DO shot PC foe a cellular transition or are they a consequence of it? What would happen is you targeted PCs with a drug and that influenced the length on all cell types? Is the effect on PC fibroblast specific? *

      __Response: __Significance and regulation of PC length are greatly debated and investigated still. It appears that PC length signify different features in different cell types. Although these are very interesting questions but such experiments are beyond the scope of our present work.

      Minor concerns

      *Page 4 second paragraph. I think it should be clarified that it is this group who have suggested a link between PCs and myofibroblast transition? *

      __Response: __We agree with the reviewer and clarified it.

      *Page 4 second paragraph. The use of the word "remarkably' is a bit subjective. *

      __Response: __We agree with the reviewer and have removed it.

      *Reference 27 is a paper on multiciliogenesis rather than primary ciliogenesis. *

      __Response: __We agree with the reviewer and have removed it.

      Figure 1 panel D. Make the image with the same sized vertical scale

      __Response: __We have replaced it with a new Figure 1.

      Significance

      Reviewer #2 (Significance (Required)):

      To my mind this is a novel paper and the data presented in it may be of interest to the cilia community as well as to the fibrosis field. This could be considered to be a significant advance and I am unaware that other groups are actively working in this area.

      Presentation of the data in the current form does not instil confidence in the work.

      Response: ____Thank you for recognizing the novelty and potential significance of our work. We appreciate your comments and fully acknowledge the concern regarding the presentation of the data. We have carefully revised the manuscript and reorganized the figures to improve clarity and overall presentation.

      Reviewer #3

      Major comments:

      • Need to demonstrate if the fibrotic phenotypes seen are produced through a ciliary-dependent mechanism. For example, to see if LiCl effects on Cgn1 are through ciliary expression or by other mechanisms. To achieve that objective, The authors should repeat the experiments in cells with a knockdown or knockout of ciliary proteins such as IFT20, IFT88, etc. The same approach should be applied to the tubacin experiments. Response: We silenced foreskin fibroblasts with IFT88/IFT20, both in the presence and absence of TGF-β1, followed by treatment with LiCl and Tubacin. Both LiCl and Tubacin can rescue cilia length and mitigate the myofibroblast phenotype in the presence of silenced IFT88/IFT20 gene, as shown in supplementary figure S9. Our result suggests that LiCl and Tubacin functions are both independent of the IFT-mediated ciliary mechanism. Regulation of PC length is still an enigma and highly debated. Moreover, PC length can be affected in multiple ways and is not solely dependent on IFTs (Avasthi and Marshall, 2012). One such method is the direct modification of the axoneme by altering microtubule stability through the acetylation state (Avasthi and Marshall, 2012), a pathway most likely the case for Tubacin. Another mode of PC length regulation is through a change in Actin polymerization. The remodeling of actin between contractile stress fibers and a cortical network alters conditions that are hospitable to basal body docking and maintenance at the cell surface (Avasthi and Marshall, 2012), causing PC length variation. Our results suggest that PC length functions as a sensor of the status of the fibrotic condition, as evidenced by the aSMA levels of the cells.

      Avasthi, P., and W.F. Marshall. 2012. Stages of ciliogenesis and regulation of ciliary length. Differentiation. 83:S30-42.

      • The use of LiCl to increase ciliary length is complicated. What are the molecular mechanisms underlying this effect? It is known that it may be affecting GSK-3b, which can have other ciliary-independent effects. Therefore, using ciliary KO/KD cells (IFT88 or IFT20) as controls may help assess the specificity of the proposed treatments. Response: As explained in the previous paragraph, PC length regulations are dependent on multiple factors and many of them are not IFT dependent. One such method is directly modifying the axoneme by altering microtubule stability/polymerization through the acetylation state(Avasthi and Marshall, 2012), a pathway most likely the case for Tubacin. Another mode of PC length regulation is through a change in Actin polymerization. The remodeling of actin between contractile stress fibers and a cortical network alters conditions that are hospitable to basal body docking and maintenance at the cell surface (Avasthi and Marshall, 2012), causing PC length variation. Higher order microtubule polymerization inhibit actin polymerization. By interrogating RNA-seq data we determined that several PC-disassembly related genes (KIF4A, KIF26A, KIF26B, KIF18A), as well as microtubule polymerization protein genes (TPPP, TPPP3, TUBB, TUBB2A etc), were differentially expressed in LiCl-treated SSc fibroblasts (Suppl. Fig. S6D). Altogether, these findings suggest that microtubule polymerization/depolymerization mechanisms may regulate PC elongation and attenuation of fibrotic responses after either LiCl or Tubacin treatment.

      • Also, assessing the frequency of ciliary-expressing cells is important. That may give another variable important to predict fibrotic phenotypes. Or do 100% of the cultured cells express cilia in those conditions? Response: We carefully checked and observed almost 95% cells express cilia in cultured conditions.

      • Have the authors evaluated if TGF-b1 treatments induce cell cycle re-entry and proliferation in these experimental conditions? This is important to exclude ciliary resorption due to cell cycle re-entry instead of the myofibroblast activation process. __Response:__Yes, TGF-β induces cell proliferation in fibroblasts (Lee et al., 2013; Liu et al., 2016). However, we did serum starvation to stop proliferation. In our study, we observed a few percentage of Ki67-positive cells under TGF-β treatment at 24 hours (Supplementary Figure S2C). However, cell proliferation mainly stopped after 48 hours. Typically, proliferating cells rarely display any PC or show very small puncta. In our case, we observe a significantly elongated PC structure (although shorter than that of untreated cells) under TGF-beta-treated conditions. Our results display that a majority of cells are not proliferating but still display PC shortening under TGF-β treatment, suggesting that PC shortening is not due to cell division-induced PC disassembly. TGF beta-induced PC shortening is also reported in another fibroblast type previously (Kawasaki et al., 2024).

      Kawasaki, Makiri, et al. "Primary cilia suppress the fibrotic activity of atrial fibroblasts from patients with atrial fibrillation in vitro." Scientific Reports 14.1 (2024): 12470.

      Lee, J., Choi, JH. & Joo, CK. TGF-β1 regulates cell fate during epithelial–mesenchymal transition by upregulating survivin. Cell Death Dis 4, e714 (2013). https://doi.org/10.1038/cddis.2013.244.

      Liu, Y. et al. TGF-β1 promotes scar fibroblasts proliferation and transdifferentiation via up-regulating MicroRNA-21. Sci. Rep. 6, 32231; doi: 10.1038/srep32231 (2016).

      • The authors described that they focused on the genes that are affected in opposite ways (supp table 4), but TEAD2, MICALL1, and HDAC6 are not listed in that table. Response: The list in Supplementary Table S3 includes common genes defined as differentially expressed based on a fold change >1 or Minor comments:

      • Figure 1A,B,C should also show lower magnification images where several cells/field are visualized. Response: We have replaced it with a new Figure 1.

      • The number of patients analyzed is not clear. For example, M&M describes 5 healthy and 8 SSc, but only 3 and 4 are shown in the figure. Furthermore, for orbital fibrosis, 2 healthy vs. 2 TAO are mentioned in the figure legend, but only one of each showed. Finally, the healthy control for lung fibroblast seems to be 3 independent experiments of the CCL210 cell line; please show the three independent controls and clarify on the X-axis and in the figure legend that these are CCL210 cells. Response: A total of 5 healthy and 8 SSc skin explanted fibroblast cell lines were used, as described in the Materials and Methods. Since these are patient-derived skin fibroblasts, maintaining equal numbers in each experiment is challenging. Revised graphs for orbital fibroblasts and CCL210 have been added in the new Figures 1B and 1C.

      • For the same set of experiments, please clarify and consistently describe the conditions that promote PC: 12hs serum starvation as described in M&M? Or 24hs as described in the text? Or 16 as described in figure legend 1? Or 24hs as described in supp figure 2? Response: We serum-starved the cells overnight, and this is also mentioned in the manuscript.

      • Please confirm in figure legends and M&M that 100 cells per group were counted. Response: We measured only 100 cells per cell line in Supplementary Figure S1B. To eliminate any confusion, we have now created a superplot for cilia analysis. Each small dot represents the PC length from an individual cell, and each large dot represents the average of the small dots for one cell line. An unpaired two-tailed t-test was performed on the small dots (mean ± SD).

      • Figure 2 should also provide lower magnification to show several cells per field. Response: Foreskin fibroblasts treated with TGF-β1 are added in S2A.

      • How do you explain that the increase in length of primary cilia after siACTA2 doesn't change COL1A1? Wouldn't it be a good approach to also check by Western Blot? Response: We believe that depletion of aSMA was sufficient to reduce the PC length for the reason described earlier (Avasthi and Marshall, 2012), but was not sufficient enough to change COL1A1 level. We added the western blot in Supplementary Figure S8B.

      • Once more, figure 5 will benefit from low mag images. How consistent is the effect of LiCl in the cultured cells? What is the percentage of rescued cells? Response: LiCl treatment was consistent for almost all the cells (~95%) as shown below and added in S4A.

      • Figure 5, panels F and G need better explanation in the results text as well as in the figure legend. Response: We added now.

      • 9) Some figures/supp figures are wrongly referenced in the text. *

      __ Response:__ We carefully revised the manuscript and corrected the references.

      10) Figure 6, panel A is confusing. Is it a comparison between SSC skin fibroblasts and foreskin fibroblasts? Maybe show labels on the panel.

      __ Response:__ We updated the figure legend for Panel A in Figure 6.

      11) Where is Figure 8 mentioned in the text?

      __ Response:__ In the discussion section.

      12) The work will benefit from an initial paragraph in the discussion enumerating the findings and a summary of the conclusion at the end.

      Response: We agree and modified the discussion accordingly.

      13) The nintedanib experiments are not described in the results section at all.

      Response: All nintedanib experiments are now included in Figure S5C-F and are described in the Results section.

      Significance

      Reviewer #3 (Significance (Required)): Beyond the lack of in situ ciliary expression assessment, the work is exciting, and the potential implications of treating/preventing fibrosis with small molecules to modulate ciliary length could be transformative in the field. Furthermore, there are a few HDAC6 inhibitors already in clinical trials for different tumors, which increases the significance of the work.

      Response: Thank you for your encouraging comments regarding the potential impact of our findings. We agree that the therapeutic implications of modulating ciliary length, particularly using small molecules such as HDAC6 inhibitors already in clinical trials, could be transformative in the context of fibrosis. We also acknowledge the importance of in situ assessment of ciliary expression and plan to incorporate such analyses in future studies to further strengthen our findings.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The author's main research topic in this work is the relationship between ciliary length and the level of fibrosis. Fibrotic samples show shorter primary cilia, profibrotic treatment with TGFB decreases the ciliary length, and posterior dedifferentiation of fibroblasts shows longer cilia. Cells with a decrease of αSMA by using siACTA2 siRNA, also show increased ciliary length. Most importantly, inducing the increase of ciliary length with LiCl or Tubacin has an inverse association with fibrosis phenotypes. The modulation of primary cilia length may represent a potential therapeutic strategy for fibrosis-associated diseases.

      The premise is relevant and exciting, and the methods are appropriate. The experiments partially sustain the conclusion. The results open a new potential area for studying fibrosis. The tables and figures aid in understanding the paper. The paper is clear and easy to read for a basic research specialized audience.

      Major comments:

      1. Need to demonstrate if the fibrotic phenotypes seen are produced through a ciliary-dependent mechanism. For example, to see if LiCl effects on Cgn1 are through ciliary expression or by other mechanisms. To achieve that objective, The authors should repeat the experiments in cells with a knockdown or knockout of ciliary proteins such as IFT20, IFT88, etc. The same approach should be applied to the tubacin experiments.
      2. The use of LiCl to increase ciliary length is complicated. What are the molecular mechanisms underlying this effect? It is known that it may be affecting GSK-3b, which can have other ciliary-independent effects. Therefore, using ciliary KO/KD cells (IFT88 or IFT20) as controls may help assess the specificity of the proposed treatments.
      3. Also, assessing the frequency of ciliary-expressing cells is important. That may give another variable important to predict fibrotic phenotypes. Or do 100% of the cultured cells express cilia in those conditions?
      4. Have the authors evaluated if TGF-b1 treatments induce cell cycle re-entry and proliferation in these experimental conditions? This is important to exclude ciliary resorption due to cell cycle re-entry instead of the myofibroblast activation process.
      5. The authors described that they focused on the genes that are affected in opposite ways (supp table 4), but TEAD2, MICALL1, and HDAC6 are not listed in that table.

      Minor comments:

      1. Figure 1A,B,C should also show lower magnification images where several cells/field are visualized.
      2. The number of patients analyzed is not clear. For example, M&M describes 5 healthy and 8 SSc, but only 3 and 4 are shown in the figure. Furthermore, for orbital fibrosis, 2 healthy vs. 2 TAO are mentioned in the figure legend, but only one of each showed. Finally, the healthy control for lung fibroblast seems to be 3 independent experiments of the CCL210 cell line; please show the three independent controls and clarify on the X-axis and in the figure legend that these are CCL210 cells.
      3. For the same set of experiments, please clarify and consistently describe the conditions that promote PC: 12hs serum starvation as described in M&M? Or 24hs as described in the text? Or 16 as described in figure legend 1? Or 24hs as described in supp figure 2?
      4. Please confirm in figure legends and M&M that 100 cells per group were counted.
      5. Figure 2 should also provide lower magnification to show several cells per field.
      6. How do you explain that the increase in length of primary cilia after siACTA2 doesn't change COL1A1? Wouldn't it be a good approach to also check by Western Blot?
      7. Once more, figure 5 will benefit from low mag images. How consistent is the effect of LiCl in the cultured cells? What is the percentage of rescued cells?
      8. Figure 5, panels F and G need better explanation in the results text as well as in the figure legend.
      9. Some figures/supp figures are wrongly referenced in the text.
      10. Figure 6, panel A is confusing. Is it a comparison between SSC skin fibroblasts and foreskin fibroblasts? Maybe show labels on the panel.
      11. Where is Figure 8 mentioned in the text?
      12. The work will benefit from an initial paragraph in the discussion enumerating the findings and a summary of the conclusion at the end.
      13. The nintedanib experiments are not described in the results section at all.

      Significance

      Beyond the lack of in situ ciliary expression assessment, the work is exciting, and the potential implications of treating/preventing fibrosis with small molecules to modulate ciliary length could be transformative in the field. Furthermore, there are a few HDAC6 inhibitors already in clinical trials for different tumors, which increases the significance of the work.

      Expertise: primary cilium functions, cell biology, cancer biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is an interesting paper that appears to show that explanted fibroblasts from a range of fibrotic conditions exhibit a reduction in the length of their primary cilia (PC). The paper employs a number of different experimental approaches that appear to show that the modulation of fibroblast/myofibroblast differentiation is associated with alterations in PC length. The rational for the study is that actin polymerization has previously been associated with PC length. The authors suggest that modulation of PC dynamics may represent a potential theraputic strategy for fibrotic disease. To me that seems like a big jump.

      Major concerns.

      I found the paper to be rather muddled and its presentation made if somewhat difficult to follow. For example, the Figures are disorganised (Fig 1 is a great example of this) and there was reference to Sup data that appeared out of order (eg Sup Fig 2 appeared before Sup Fig 1 in the text). Images in a single figure should be the same size. currently they are almost random and us different magnifications. Overall, the paper needs to be better organised.

      I have some significant concerns about how the PC length data was generated. To my mind the length may be hard to determine from the type of images shown in the paper (which may represent the best images?). Some of the images presented appear to show shorter, fatter PCs in the cells from fibrosis cases. Is this real or is it some kind of artefact? Would a shorter, fatter PCs have a similar or larger surface area? What would be the consequence of this?

      I am confused as to exactly what is meant by matched healthy controls. Age, sex and ethnicity, where stated seem to be very variable? What are CCL210 fibroblasts?

      What does a change in PC length signify? DO shot PC foe a cellular transition or are they a consequence of it? What would happen is you targeted PCs with a drug and that influenced the length on all cell types? Is the effec on PC fibroblast specific?

      Minor concerns

      Page 4 second paragraph. I think it should be clarified that it is this group who have suggested a link between PCs and myofibroblast transition?

      Page 4 second paragraph. The use of the word "remarkably' is a bit subjective.

      Reference 27 is a paper on multiciliogenesis rather than primary ciliogenesis.

      Figure 1 panel D. Make the image with the same sized vertical scale

      Significance

      To my mind this is a novel paper and the date presented in it may be of interest to the cilia community as well as to the fibrosis field. This could be considered to be a significant advance and I am unaware that other groups are actively working in this area.

      Presentation of the data in the current form does not instil confidence in the work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Verma et al. describes the involvement of primary cilia length control in driving pro-fibrotic progression of fibroblasts in fibrotic diseases. This is shown in primary cells from several organs from patients, suffering from different fibrotic diseases. They demonstrate that primary cilia are shorter in fibroblasts from different fibrotic conditions and that pro-fibrotic signaling, as exemplified by TGFb stimulation, causes shortening of the cilium. Vice versa, elongation of the cilium via different pharmacological substances reverses the pro-fibrotic phenotype.

      Major comments

      1. To reliably quantify the ciliary length in different cell types, and in independent ciliary marker needs to be included for comparison and the ciliary base needs to be labeled (e.g., -TUBULIN). This needs to combined with a non-biased, high-throughput analysis, e.g., CiliaQ,
      2. As mentioned in the study, TGF has been implicated to drive myofibroblast transition. Thus TGF stimulate ciliary signaling in the presented primary cells? The authors should provide a read-out for TGF signaling in the cilium (ICC fro protein phosphorylation etc.). Furthermore, canonical ciliary signaling pathways have been suggested to act as fibrotic drivers, such as Hedgehog and Wnt signaling - does stimulation of these pathways evoke a similar effect?
      3. Does TGF induce cell proliferation? If yes, this would force cilium disassembly and, thereby, reduce ciliary length, which is independent of a "shortening" mechanism proposed by the authors.
      4. As PGE2 has been shown to signal through EP4 receptors in the cilium, is the restoration of primary cilia length due to ciliary signaling?
      5. Primary cilia length is regulated by cAMP signaling in the cilium vs. cytoplasm - does cAMP signaling play a role in this context? PGE2 is potent stimulator of cAMP synthesis - does this underlie the rescue of primary cilia length?
      6. The authors describe that they wanted to investigate how aSMA impacted primary cilia length. They only provide a knock-down experiment and measured ciliary length, but the mechanistic insight is missing. How does loss of aSMA expression control ciliary length?
      7. The authors used LiCl in their experiments, which supposedly control Hh signaling. Coming back to my second questions, is this Hh-dependent? And what is the common denominator with respect to TGF signaling? And how is this mechanistically connected to actin and microtubule polymerization?
      8. How was the SMA Mean intensity determined?
      9. Fig: 1D: Statistical test is missing in Figure Legend and presentation of the p-values for the left graph is confusing.
      10. Some graphs are presented {plus minus} SD and some {plus minus} SEM, but this is not correctly stated in the Material & Methods Part.
      11. Fig. 4D&E: Statistical test is missing in Figure Legend.

      Minor comments

      • In general, text should be checked again for spelling mistakes and sentences may be re-written to promote readability. In particular, this applies to the discussion.
      • Figure Legends are not written consistently, information is missing (e.g., statistical tests, see above).
      • Figures should be checked again and all text should be the same size and alignment of images should be improved.

      Significance

      The authors present a novel connection between the regulation of primary cilia length and fibrogenesis. However, the study generally lacks mechanistic insight, in particular on how TGF signaling, SMA expression, and ciliary length control are connected. The spatial organization of the proposed signaling components is also not clear - is this a ciliary signaling pathway? If so, how does it interact with cytoplasmic signaling and vice versa?

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors describe a novel pattern of ncRNA processing by Pac1. Pac1 is a RNase III family member in S. pombe that has previously been shown to process pre-snoRNAs. Other RNase III family members, such as Rnt1 in S. cerevisiae and Dosha in human, have similar roles in cleaving precursors to ncRNAs (including miRNA, snRNA, snoRNA, rRNA). All RNAse III family members share that they recognize and cleave dsRNA regions, but differ in their exact sequence and structure requirement. snoRNAs can be processed from their own precursor, a polycistronic pre-cursor, or the intron of a snoRNA host gene. After the intron is spliced out, the snoRNA host gene can either encode an protein or be a non-functional by product.

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. snR107 is encoded within an intron and processed from the spliced out intron, similar to a typical intron-encoded snoRNA. What is different is that upon splicing, the host gene can adopt a new secondary structure that requires base-pairing between exon 1 and exon2, generating a Pac1 recognition site. This site is recognized, resulting in cleaving of the RNA and further processing of the 3' cleavage product into U14 snoRNA. In addition, the 5' cleavage product is processed into a ncRNA named mamRNA. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA. They conclude that making the formation of the Pac1 binding site dependent on previous splicing ensures that both snoRNAs are produced in the correct order and amount. Some of the authors findings are further confirmed by a different pre-print (reference 19), but the other preprint did not reveal the involvement of Pac1.

      While the analysis on the mamRNA/snR107/U14 precursor is convincing, as a single example the impact of these findings is uncertain. In Figure 4 and supplemental table 1, the authors use bioinformatic searches and identify other candidate loci in plans and animals that may be processed similarly. Each of these loci encode a putative precursor that results in one snoRNA processed from an intron, a different snoRNA processed from an exon, and a double stranded structure that can only form after splicing. While is potentially interesting, it is also the least developed and could be discussed and developed further as detailed below.

      Major comments:

      1. The proposal that plant and animal pre-snoRNA clusters are processed similarly is speculative. the authors provide no evidence that these precursors are processed by an RNase III enzyme cutting at the proposed splicing-dependent structure. This should not be expected for publication, but would greatly increase the interest.

      All three reviewers expressed a similar concern, and we now provide additional evidence supporting the conservation of the proposed mechanism. Specifically, we focused on the SNHG25 gene in H. sapiens, which hosts two snoRNAs—one intronic, as previously shown in Figure 4B, and one non-intronic. We substantiated our predictions through the re-analysis of multiple sequencing datasets in human cell lines, as outlined below:

      I. Analysis of CAGE-seq and nano-COP datasets indicates a single major transcription initiation site at the SNHG25 locus. Both the intronic and non-intronic snoRNAs are present within the same nascent precursor transcripts (Supplementary Figure 4D).

      II. Degradome-seq experiments in human cell lines reveal that the predicted splicing-dependent stem-loop structure within the SNHG25 gene is subject to endonucleolytic cleavage (Supplementary Figure 4D). The cleavage sites are located at the apical loop and flanking the stem, displaying a staggered symmetry characteristic of RNase III activity (Figure 4C). Importantly, the nucleotide sequence surrounding the 3' cleavage site and the 3' splice-site are conserved in other vertebrates (Supplementary Figure 4.D).

      III. fCLIP experiments demonstrate that DROSHA associates with the spliced SNHG25 transcript (Supplementary Figure 4D).

      Together, these analyses support the generalizability of our model beyond fission yeast. They confirm the structure of the SNHG25 gene as a single non-coding RNA precursor hosting two snoRNAs, one of which is intronic. Importantly, these findings show that the predicted stem-loop structure contains conserved elements and is subject to endonucleolytic cleavage. Human DROSHA, an RNase III enzyme, could be responsible for this processing step.

      The authors provide examples of similarly organized snoRNA clusters from human, mouse and rat, but the examples are not homologous to each other. Does this mean these snoRNA clusters are not conserved, even between mammals? Are the examples identified in Arabidopsis conserved in other plants? If there is no conservation, wouldn't that indicate that this snoRNA cluster organization offers no benefit?

      We noticed during this revision that the human SNHG25 locus is actually very well conserved in mice at the GM36220 locus, where both snoRNAs (SNORD104 and SNORA50C/GM221711) are similarly arranged. Although the murine host gene, GM36220, also contains an intron in the UCSC annotation, it is intronless in the Ensembl annotation we used to screen for mixed snoRNA clusters, which explains why it was not part of our initial list of candidates (Supplementary Table 1). Importantly, sequence elements in SNHG25, close to the splice sites and cleavage sites in exon 2, are also well conserved in mice and other vertebrates (Supplementary Figure 4D). Therefore, it is reasonable to think that the mechanism described for SNHG25 in humans may also apply in mice and other vertebrates.

      That being said, snoRNAs are highly mobile genetic elements. For example, it is well established that even between relatively closely related species (e.g., mouse and human), the positions of intronic snoRNAs within their host genes are not strictly conserved, even when both the snoRNAs and their host genes are. In the constrained drift model of snoRNA evolution (Hoeppner et al., BMC Evolutionary Biology, 2012; doi: 10.1186/1471-2148-12-183), it is proposed that snoRNAs are mobile and “may occupy any genomic location from which expression satisfies phenotype.”

      Therefore, a low level of conservation in mixed snoRNA clusters is generally expected and does not necessarily imply that is offers no benefit. Despite the limited conservation of snoRNA identity across species, mixed snoRNA clusters consistently display two recurring features: (1) non-intronic snoRNAs often follow intronic snoRNAs, and (2) the predicted secondary structure tends to span the last exon–exon junction. These enriched features support the idea that enforcing sequential processing of mixed snoRNA clusters may confer a selective advantage. We now explicitly discuss these points in the revised manuscript.

      Supplemental Figure 4 shows some evidence that the S. pombe gene organization is conserved within the Schizosaccharomyces genus, but could be enhanced further by showing what sequences/features are conserved. Presumably the U14 sequence is conserved, but snR107 is not indicated. Is it not conserved? Is the stem-loop more conserved than neighboring sequences? Are there any compensatory mutations that change the sequence but maintain the structure? Is there evidence for conservation outside the Schizosaccharomyces genus?

      We thank the reviewer for these excellent suggestions, which helped us significantly improve Supplementary Figure 4. In the revised version, we now include an additional species—S. japonicus, which is more evolutionarily distant—and show that the intronic snR107 is conserved across the Schizosaccharomyces genus (Supplementary Figure 4A). The distance between conserved elements (splice sites, snoRNAs, and RNA structures) varies, indicating that surrounding sequences are less conserved compared to these functionally constrained features

      We also performed a detailed alignment of the sequences corresponding to the predicted RNA secondary structures. This revealed that the apical regions are less conserved than the base, particularly near the splice and cleavage sites. In these regions, we observe compensatory or base-pair-neutral mutations (e.g., U-to-C or C-to-U, which both pair with G), suggesting structural conservation through evolutionary constraint (Supplementary Figures 4B–C). These observations are now described in greater detail in the revised manuscript, along with a discussion of the specific features likely to be under selective pressure at this locus.

      Conservation outside the Schizosaccharomyces genus is less clear. As already noted in the manuscript, the S. cerevisiae locus retains synteny between snR107 and snoU14, but the polycistronic precursor encompassing both is intronless and processed by RNase III (Rnt1) between the cistrons. Similarly, in Ashbya gossypii and a few other fungal species, synteny is preserved, but no intron appears to be present in the presumed common precursor. Notably, secondary structure predictions for the A. gossypii locus (not shown) suggest the formation of a stable stem-loop encompassing the first snoRNA in a large apical loop. This could reflect a distinct mode of snoRNA maturation, possibly analogous to pri-miRNA processing, where cleavage by an RNase III enzyme contributes to both 5′ and 3′ end formation. In Candida albicans, snoU14 is annotated within an intron of a host gene, but no homolog of snR107 is annotated. Other cases either resemble one of the above scenarios or are inconclusive due to the lack of a clearly conserved snoRNA (or possibly due to incomplete annotation). Although these examples are potentially interesting, we have chosen not to elaborate on them in the manuscript in order to maintain focus and avoid speculative interpretation in the absence of stronger evidence.

      The authors suggest that snoRNAs can be processed from the exons of protein coding genes, but snoRNA processing would destroy the mRNA. Thus snoRNAs processing and mRNA function seem to be alternative outcomes that are mutually exclusive. Can the authors comment?

      In theory, we agree with reviewer on the mutually exclusive nature of mRNA and snoRNA expression for putative snoRNA hosted in the exon of protein coding genes. However, we want to clarify that the specific examples of snoRNA precursor (or host) developed in the manuscript (mamRNA-snoU14 in S.pombe and, in this resubmission, SNHG25 in H. sapiens) are non-coding. So although we do not exclude that our model of sequential processing through splicing and endonucleolytic cleavage could apply to coding snoRNA precursors, it is not something we want to insist on, especially given the lack of experimental evidence for these cases.

      It is possible that the use of the term "exonic snoRNA" in the first version of the manuscript lead to the reviewer's impression that we explicitly meant that snoRNA processing can be processed from the exon of protein coding genes, which was not what we meant (although we do not exclude it). If that was the case, we apologize for the confusion. We have now clarified the issue (see next point).

      Minor comments:

      The term "exonic snoRNA" is confusing. Isn't any snoRNA by definition an exon?

      We agree that this term can be confusing, a sentiment that was also shared by reviewer 3. We replaced the problematic term by either "non-intronic snoRNA", "snoRNA" or "snoRNA gene located in exon" depending on the context, which are more unambiguous in conveying our intended meaning.

      The methods section does not include how similar snoRNA clusters were identified in other species

      We have now corrected this omission in the method section ('Identification of mixed snoRNA clusters' subsection): "To identify mixed snoRNA clusters, we downloaded the latest genome annotation from Ensembl and selected snoRNAs co-hosted within the same precursor, with at least one being intronic and at least one being non-intronic. We filtered out ambiguous cases where snoRNAs overlapped exons defined as 'retained introns', reasoning that in these cases the snoRNA is more likely to be intronic than not."

      In the discussion the authors argue that a previously published observation that S. pombe U14 does not complement a S. cerevisiae mutation can be explained because "was promoter elements... were simply not included in the transgene sequence". However, even if promoter elements were included, the dsRNA structure of S. pombe would not be cleaved by the S. cerevisiae RNase III. I doubt that missing promoter elements are the full explanation, and the authors provide insufficient data to support this conclusion.

      We agree with the reviewer that, given the substantial divergence in substrate specificity between Pac1 and Rnt1, it is unlikely that S. pombe snoU14 would be efficiently processed from its precursor in S. cerevisiae. We did not intend to suggest otherwise, and we have now removed this part of the discussion. As the experiment reported by Samarsky et al. did not detect expression of the S. pombe snoU14 precursor (even its unprocessed form), it remains inconclusive with respect to the conservation (or lack thereof) of snoU14 processing mechanisms.

      For the record, we had originally included this discussion to point out that the lack of cryptic promoter activity (or at least none that S. cerevisiae can use) within the S. pombe snoU14 precursor supports the idea that transcription initiates solely upstream of the mamRNA precursor. However, we recognize that this argument is speculative and potentially confusing. We have therefore removed it from the revised manuscript to maintain clarity and focus.

      **Referees cross-commenting**

      I agree with the other 2 reviewers but think the thiouracil pulse labeling reviewer 2 suggests would take considerable work and if snoRNA processing is very fast might not be as conclusive as the reviewer suggests.

      We are grateful to the reviewer for this comment, which helped us perform this reviewing in a timely manner.

      Reviewer #1 (Significance (Required)):

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      __ __The manuscript presents a novel mode of processing for polycistronic snoRNAs in the yeast Saccharomyces pombe. The authors demonstrate that the processing sequence of a transcription unit containing U14, intronic snR107, and an overlapping non-coding mamRNA is determined by secondary structures recognized by RNase III (Pac1). Specifically, the formation of a stem structure over the mamRNA exon-exon junction facilitates the processing of terminal exonic-encoded U14. Consequently, U14 maturation occurs only after the mamRNA intron (containing snR107) is spliced out. This mechanism prevents the accumulation of unspliced, truncated mamRNA.

      1.The first section describing the processing steps is challenging to follow due to the unusual organization of the locus and maturation pathway. If the manuscript is intended for a broad audience, I recommend simplifying this section and presenting it in a more accessible manner. A larger diagram illustrating the transcription unit and processing intermediates would be beneficial. Additionally, introducing snR107 earlier in the text would improve clarity.

      We thank the reviewer for these excellent suggestions. In the previous version of the manuscript, we were cautious in how we introduced the locus, as snR107 and the associated intron had not yet been published. This is no longer the case, as the locus is now described in Leroy et al. (2025). Accordingly, we now introduce the complete locus at the beginning of the manuscript and have improved the corresponding diagram (new Figure 1A). We believe these changes enhance clarity and make the section more accessible to a broader audience.

      2.Evaluation of some results is difficult due to the overexposure of Northern blot signals in Figures 1 and 2. The unspliced and spliced precursors appear as a single band, making it hard to distinguish processing intermediates. Would the authors consider presenting these results similarly to Figure 3, where bands are more clearly resolved? Or presenting both overexposed and underexposed blots?

      For all blots (probes A, B, and C), we selected an exposure level that allows detection of precursor forms under wild-type (WT) conditions. This necessarily results in some overexposure of the accumulating precursors in mutant conditions, due to their broad dynamic range of accumulation. To address this, we now provide an additional supplementary "source data" file containing all uncropped blots with both low and high exposures.

      For example, a lower exposure version of the blot in new Figure 1.B (included in the source data file) confirms the consistent accumulation of the spliced precursor when Pac1 activity is compromised. The unspliced precursor also shows slight accumulation in the Pac1-ts mutant, although to a much lesser extent than the spliced precursor. This observation is consistent with our qPCR results (new Figure 1.C).

      Importantly, because this effect is not observed in neither the Pac1-AA or the steam-dead (SD) mutants, we interpret it as an indirect effect—possibly reflecting a mild growth defect in the Pac1-ts strain, even under growth-permissive conditions. We now explicitly address this point in the revised manuscript.

      3.Additionally, I noticed a discrepancy in U14 detection: Probe B gives a strong signal for U14 in Figure 3B, whereas in Figures 1 and 2, U14 appears as faint bands. Could the authors clarify this inconsistency?

      We thank the reviewer for pointing out this discrepancy. The variation in U14 signal intensity is most likely due to technical differences in UV crosslinking efficiency during the Northern blot procedure. This step can differentially affect the membrane retention of RNA species depending on their length, as previously reported (PMID: 17405769). Because U14 is a relatively abundant snoRNA, the fainter signal observed in Figure 1 (relative to the accumulating precursor) likely reflects suboptimal crosslinking of shorter RNAs in that particular blot.

      Importantly, this technical variability does not impact the conclusions of our study, as we do not compare RNA species of different lengths directly. To increase transparency, we now provide a supplementary "source data" file that includes all uncropped blots from our Northern blot experiments. These include examples—such as the uncropped blot for Figure 1B—where U14 retention is more consistent.

      4.Furthermore, ethidium bromide (EtBr) staining of rRNA is used as a loading control, but overexposed signals from the gel may not accurately reflect RNA amounts on the membrane. This could affect the interpretation of mature RNA species' relative abundance.

      We thank the reviewer for pointing this out and have now measured rRNAs loading on the same northern blot membrane from probes complementary to mature rRNA. We updated new Figures 1B, 2B, 3B, S1B, and S3A accordingly.

      5.To further support the sequential processing model, the authors could use pulse-labeling thiouracil to test the accumulation of newly transcribed RNAs and accumulation of individual species. Additionally, it could help determine whether U14 can be processed through alternative, less efficient pathways. Would the authors consider incorporating this approach?

      We thank the reviewer for this pertinent suggestion. We actually plan to investigate the putative alternative U14 maturation pathway in future work, and the suggested approach will definitely be instrumental for that. However, to keep the present manuscript focused, and also to keep the review timely (successful pulse-chase experiments are likely to take time to optimize – as also suggested by the other reviewers in their cross-commenting section), we prefer not to perform this experiment for this reviewing.

      7.In the final section, the authors propose that this processing mechanism is conserved across species, identifying 12 similar genetic loci in different organisms. This is very interesting finding. In my opinion, providing any experimental evidence would greatly strengthen this claim and the manuscript's significance. Even preliminary validation would add substantial value!

      We thank the reviewer for his/her enthusiasm and are glad to provide some preliminary validation to the final section of our manuscript. Specifically, we focused on the SNHG25 gene in H. sapiens, which hosts two snoRNAs—one intronic, as previously shown in Figure 4B, and one non-intronic. We substantiated our predictions through the re-analysis of multiple sequencing datasets in human cell lines, as outlined below:

      I.Analysis of CAGE-seq and nano-COP datasets indicates a single major transcription initiation site at the SNHG25 locus. Both the intronic and non-intronic snoRNAs are present within the same nascent precursor transcripts (Supplementary Figure 4D).

      II.Degradome-seq experiments in human cell lines reveal that the predicted splicing-dependent stem-loop structure within the SNHG25 gene is subject to endonucleolytic cleavage (Supplementary Figure 4D). The cleavage sites are located at the apical loop and flanking the stem, displaying a staggered symmetry characteristic of RNase III activity (Figure 4C). Importantly, the nucleotide sequence surrounding the 3' cleavage site and the 3' splice-site are conserved in other vertebrates (Supplementary Figure 4.D).

      III. fCLIP experiments demonstrate that DROSHA associates with the spliced SNHG25 transcript (Supplementary Figure 4D).

      Together, these analyses support the generalizability of our model beyond fission yeast. They confirm the structure of the SNHG25 gene as a single non-coding RNA precursor hosting two snoRNAs, one of which is intronic. Importantly, these findings unambiguously show that the predicted stem-loop structure is subject to endonucleolytic cleavage, and they are consistent with DROSHA, an RNase III enzyme, being responsible for this processing step.

      **Referees cross-commenting**

      The other two reviewers' comments are justified.

      Reviewer #2 (Significance (Required)):

      The authors describe an interesting novel mode of snoRNA procseeimg form the host transcript. The results appear sound and intriguing, especially if the proposed mechanism can be confirmed across different organisms. Including such validation would significantly enhance the impact and make this work of broad audience interest.

      My expertise: transcription, non-coding RNAs

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      The manuscript by Migeot et al., focuses on a new Pac1-mediated snoRNA processing pathway for intron-encoded snoRNA pairs in yeast Schizosaccharomyces pombe. The novelty of the findings described in MS is the report of an unusual and relatively rare genomic organization and sequential processing of a few snoRNA genes in S. pombe and other eukaryotic organisms. It appears that in the case of snoRNA pairs, hosted in pre-mRNA in the intron and exon, respectively, the release of separate pre-snoRNAs from the host gene relies first on splicing to free the intron-encoded snoRNA, followed by endonucleolytic cleavage by RNase III (Pac1 in S. pombe) to produce snoRNA present in the mRNA exon. The sequential processing pathway, ensuring proper maturation of two snoRNAs, was demonstrated and argued in an elegant and clear way. The main message of the MS is straightforward, most experiments are properly conducted and specific conclusions based on the data are justified and valid. The text is clearly written and well-presentded.

      But there are some shortcomings.

      1.First of all, the title of the MS and general conclusions regarding the Pac1-mediated sequential release of snoRNA pairs hosted within the intron are definitely an overstatement. Especially the title suggests that this genomic organization and unusual processing mode of these snoRNAs is widespread. Later in the discussion the authors themselves admit that such mixed exonic-intronic snoRNAs are rare, although their presence may be underestimated due to annotation problems. It is likely that such snoRNA arrangement and processing is conserved, but the evidence is missing and only unique cases were identified based on bioinformatics mining and their processing has not been assayed. This makes the generalization impossible based on a single documented mamRNA/snoU14 example, no matter how carefully examined.

      We thank the reviewer for clearly articulating this concern. In response, we now provide additional evidence supporting conservation of the proposed mechanism in other species:

      • Conservation within the Schizosaccharomyces genus (Figures S4A–C) has been further analyzed, as suggested by Reviewer 1. This expanded analysis highlights conserved features—such as splice sites and cleavage sites within the predicted stem-loop structure—indicating that these elements are under selective constraint.

      • Conservation in mammals is now supported by experimental data, as detailed in our responses to point #7 of Reviewer 2 and major comment #1 of Reviewer 1. Specifically, we show that for the SNHG25 gene in H. sapiens (Figure S4D):

      (1) nascent transcription give rise to a single non-coding RNA precursor that hosts two snoRNAs, one of which is intronic;

      (2) the predicted stem-loop structure contains conserved elements and is subject to endonucleolytic cleavage;

      (3) the RNase III enzyme DROSHA associates with the spliced SNHG25 precursor.

      Together, these analyses strengthen the evidence for the evolutionary conservation of the mechanism and support the general conclusions and title of the manuscript.

      Another interesting observation is that, similarly to other intron-encoded snoRNA in other species, there is a redundant pathway to produce mature U14 in addition to Pac1-mediated cleavage. In the case of intronic snoRNAs in S. cerevisiae, their release could be performed either by splicing/debranching or Rnt1 cleavage, but there is also a third alternative option, that is processing following transcription termination downstream of the snoRNA gene, which at the same time interferes with the expression of the host gene. Is such a scenario possible as an alternative pathway for U14? Are there any putative, or even cryptic, terminators downstream of the U14 gene? The authors did not consider or attempt to inspect this possibility.

      We thank the reviewer for this interesting and thoughtful comment. First, we would like to clarify that snoU14 is not intron-encoded; rather, it is located on the exon downstream of the intron-encoded snR107.

      Regarding the possibility of transcription termination-based processing: downstream of snoU14, we identified a non-consensus polyadenylation signal (AUUAAA) preceded by a U-rich tract, followed by three consensus polyadenylation signals (AAUAAA) within a 500-nt window. These elements likely contribute to robust and redundant transcription termination at this highly expressed locus. However, since all these sites are located downstream of snoU14, they do not provide an alternative 5′-end processing mechanism for this snoRNA –they reflect normal termination.

      If we correctly understood the reviewer’s suggestion (apologies if not), they may have been referring to the possibility of a cryptic or alternative polyadenylation site between snR107 and snoU14 instead. If cleavage were to occur in this inter-snoRNA region while transcription continued past snoU14, it could, in principle, allow for alternative processing of snoU14. We have indeed considered this scenario. However, we currently do not find strong support for it: there are no identifiable polyadenylation signals motifs between the two snoRNAs, aside from a weakly conserved and questionable AAUAAU hexamer that does not appear to be used as polyA site at least in WT conditions (DOI: 10.4161/rna.25758). Given the lack of evidence, we chose not to explore this hypothesis further in the present manuscript, though it remains an interesting possibility for future investigation.

      I also have some concerns or comments related to the presented research, which are no major, but are mainly related to data quatification, but have to be addressed.

      • In Pac1-ts and Pac1-AA strains the level of mature U14 seems upregulated compared to respective WT (Figure 1A). At the same time mature 25S and 18S rRNAs are less abundant. But there is no quantification and it is not mentioned in the text. What could be the reason for these effects?

      We thank the reviewer for this observation. As reviewer 2 also noted, ethidium bromide staining of mature rRNAs is not a reliable quantitative loading control. In response to this concern, we have now reprobed all northern blots with radiolabeled rRNA probes. These provide a more accurate and consistent loading for our blots (new Figures 1B, 2B, 3B, S1B, S3A).

      Using these improved loading controls, it is evident that snoU14, snR107, and the unspliced precursor are all slightly upregulated in the Pac1-ts strain, although to a much lesser extent than the spliced precursor, which accumulates dramatically. We do not observe this effect in either the Pac1-AA or stem-dead (SD) mutants. We therefore interpret the modest upregulation as an indirect effect, possibly linked to the physiological state of the Pac1-ts mutant, which exhibits slower growth even at growth-permissive temperatures. We now explicitly discuss this in the revised manuscript.

      Regarding the suggestion to include quantification of the northern blot signal: we opted not to include this in the figures for the following reasons. First, the accumulation of the spliced precursor—the central focus of our analysis—is large and highly reproducible across all replicates and conditions. Second, northern blot quantification by pixel intensity remains semi-quantitative, particularly for comparisons across RNAs of highly different abundance. Finally, we support our conclusions with additional quantitative data from RT-qPCR and RNA-seq, which provide more robust measures of RNA accumulation.

      • Processing of the other snoRNA from the mamRNA/snoU14 precursor is largely overlooked in the MS. It is commented on only in the context of mutants expressing constitutive mamRNA-CS constructs (Figure 3B). Its level was checked in Pac1-ts and Pac1-AA (Supplementary Figure 1), but the authors conclude that "its expression remained largely unaffected by Pac1 inactivation", which is clearly not true. Similarly to U14, also snR170 is increased in Pac1-ts and Pac1-AA strains, at least judged "by eye" because the loading control or quantification is not provided. This matter should be clarified.

      We thank the reviewer for pointing this out. We have now included appropriate loading controls for Supplementary Figure 1 to clarify the interpretation. As discussed in our response to the previous comment, we observe a general upregulation of the mamRNA locus in the Pac1-ts strain, which likely contributes to the increased levels of both snR107 and snoU14. However, because this upregulation is not observed in the Pac1-AA or stem-dead (SD) mutants, we interpret it as an indirect effect, possibly related to the altered physiological state of the Pac1-ts strain (e.g., slightly reduced growth rate even at the permissive temperature). This interpretation has now been clearly explained in the revised manuscript.

      We also identified and corrected a labeling error in the previous version of Supplementary Figure 1, where the Pac1-ts and Pac1-AA strains were inadvertently swapped. We sincerely apologize for the confusion this may have caused and have now ensured that all figure panels are correctly labeled and consistent with the text.

      Other minor comments:

      Minor points:

      1. Page 1, Abstract. The sentence "The hairpin recruits the RNase III Pac1 that cleaves and destabilizes the precursor transcript while participating in the maturation of the downstream exonic snoRNA, but only after splicing and release of the intronic snoRNA" is not entirely clear and should be simplified, maybe split into two sentences. This message is clear after reading the MS and learning the data, but not in the abstract.

      We thank the reviewer for pointing this out and have now clarified the abstract following the suggestion to split and simplify the problematic sentence : "... the sequence surrounding an exon-exon junction within their precursor transcript folds into a hairpin after splicing of the intron. This hairpin recruits the RNase III ortholog Pac1, which participates in the maturation of the downstream snoRNA by cleaving the precursor."

      Page 1, Introduction. I am not convinced by the need to use the term "exonic snoRNA" for all snoRNA that are not intronic, which is misleading, and is rather associated per se with snoRNA encoded in the mRNA exon. It has been used before in the review about snoRNAs by Michelle Scott published in RNA Biol (2024), but it does not justify its common use.

      We thank the reviewer for raising this important point. We agree that the term “exonic snoRNA” can be misleading, as it was previously used to specifically refer to snoRNAs embedded within exons of mRNA transcripts—an rare and potentially artifactual scenario, as very cautiously discussed by Michelle Scott and colleagues in their review published in RNA Biol (2024).

      In the previous version of our manuscript, we actually used “exonic snoRNA” in a broader sense to denote any snoRNA not encoded within an intron, primarily for convenience in contrasting the processing of intronic snR107 with that of non-intronic/exonic snoU14. However, we recognize that this usage is non-standard and risks confusion due to the ambiguity surrounding the term’s definition in the literature.

      In light of this, and in agreement with reviewer 1 who raised a similar concern, we have revised the manuscript to remove the term “exonic snoRNA” entirely. Depending on the context, we now refer more precisely to “non-intronic snoRNA,” “snoRNA gene located in exon,” or simply “snoRNA.”

      Supplementary Figure 3. It is difficult to assess whether the level of mature rRNAs is unchanged in the mutants based on EtBr staining and without calculations. Northern blotting should be performed and the levels properly calculated.

      As suggested, we performed northern blotting on mature 18S and 25S, quantified the signal and observed no significant differences (new Supplementary Figure 3).

      **Referees cross-commenting**

      I also agree that 4sU labeling may require too much work with a questionable result.

      We are grateful to the reviewer for this comment, which helped us perform this reviewing in a timely manner.

      Reviewer #3 (Significance (Required)):

      Strengths: 1. Novelty of the described genomic arrangement of snoRNA/ncRNA genes and their processing in a sequential and regulated manner.

      Potential conservation of this pathways across eukaryotic organisms. Well designed and performed experiments followed by proper conclusions.

      Limitations: 1. Insufficient evidence to support generalization of the study results.

      Moderate overall impact of the study

      Advance: This research can be placed within publications describing specific processing pathways for various non-coding RNAs, including for example unusual chimeric species such as sno-lncRNAs. In this context, the presented results do advance the knowledge in the field by providing mechanistic evidence for a tightly controlled and coordinated maturation of selected ncRNAs.

      Audience: Basic research and specialized. The interest in this research will rather be limited to a specific field.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Migeot et al., focuses on a new Pac1-mediated snoRNA processing pathway for intron-encoded snoRNA pairs in yeast Schizosaccharomyces pombe.

      The novelty of the findings described in MS is the report of an unusual and relatively rare genomic organization and sequential processing of a few snoRNA genes in S. pombe and other eukaryotic organisms. It appears that in the case of snoRNA pairs, hosted in pre-mRNA in the intron and exon, respectively, the release of separate pre-snoRNAs from the host gene relies first on splicing to free the intron-encoded snoRNA, followed by endonucleolytic cleavage by RNase III (Pac1 in S. pombe) to produce snoRNA present in the mRNA exon. The sequential processing pathway, ensuring proper maturation of two snoRNAs, was demonstrated and argued in an elegant and clear way. The main message of the MS is straightforward, most experiments are properly conducted and specific conclusions based on the data are justified and valid. The text is clearly written and well-presentded.

      But there are some shortcomings. First of all, the title of the MS and general conclusions regarding the Pac1-mediated sequential release of snoRNA pairs hosted within the intron are definitely an overstatement. Especially the title suggests that this genomic organization and unusual processing mode of these snoRNAs is widespread. Later in the discussion the authors themselves admit that such mixed exonic-intronic snoRNAs are rare, although their presence may be underestimated due to annotation problems. It is likely that such snoRNA arrangement and processing is conserved, but the evidence is missing and only unique cases were identified based on bioinformatics mining and their processing has not been assayed. This makes the generalization impossible based on a single documented mamRNA/snoU14 example, no matter how carefully examined. Another interesting observation is that, similarly to other intron-encoded snoRNA in other species, there is a redundant pathway to produce mature U14 in addition to Pac1-mediated cleavage. In the case of intronic snoRNAs in S. cerevisiae, their release could be performed either by splicing/debranching or Rnt1 cleavage, but there is also a third alternative option, that is processing following transcription termination downstream of the snoRNA gene, which at the same time interferes with the expression of the host gene. Is such a scenario possible as an alternative pathway for U14? Are there any putative, or even cryptic, terminators downstream of the U14 gene? The authors did not consider or attempt to inspect this possibility.

      I also have some concerns or comments related to the presented research, which are no major, but are mainly related to data quatification, but have to be addressed. In Pac1-ts and Pac1-AA strains the level of mature U14 seems upregulated compared to respective WT (Figure 1A). At the same time mature 25S and 18S rRNAs are less abundant. But there is no quantification and it is not mentioned in the text. What could be the reason for these effects? Processing of the other snoRNA from the mamRNA/snoU14 precursor is largely overlooked in the MS. It is commented on only in the context of mutants expressing constitutive mamRNA-CS constructs (Figure 3B). Its level was checked in Pac1-ts and Pac1-AA (Supplementary Figure 1), but the authors conclude that "its expression remained largely unaffected by Pac1 inactivation", which is clearly not true. Similarly to U14, also snR170 is increased in Pac1-ts and Pac1-AA strains, at least judged "by eye" because the loading control or quantification is not provided. This matter should be clarified.

      Other minor comments:

      Minor points:

      1. Page 1, Abstract. The sentence "The hairpin recruits the RNase III Pac1 that cleaves and destabilizes the precursor transcript while participating in the maturation of the downstream exonic snoRNA, but only after splicing and release of the intronic snoRNA" is not entirely clear and should be simplified, maybe split into two sentences. This message is clear after reading the MS and learning the data, but not in the abstract.
      2. Page 1, Introduction. I am not convinced by the need to use the term "exonic snoRNA" for all snoRNA that are not intronic, which is misleading, and is rather associated per se with snoRNA encoded in the mRNA exon. It has been used before in the review about snoRNAs by Michelle Scott published in RNA Biol (2024), but it does not justify its common use.
      3. Supplementary Figure 3. It is difficult to assess whether the level of mature rRNAs is unchanged in the mutants based on EtBr staining and without calculations. Northern blotting should be performed and the levels properly calculated.

      Referees cross-commenting

      I also agree that 4sU labeling may require too much work with a questionable result.

      Significance

      Strengths:

      1. Novelty of the described genomic arrangement of snoRNA/ncRNA genes and their processing in a sequential and regulated manner.
      2. Potential conservation of this pathways across eukaryotic organisms.
      3. Well designed and performed experiments followed by proper conclusions.

      Limitations:

      1. Insufficient evidence to support generalization of the study results.
      2. Moderate overall impact of the study

      Advance:

      This research can be placed within publications describing specific processing pathways for various non-coding RNAs, including for example unusual chimeric species such as sno-lncRNAs. In this context, the presented results do advance the knowledge in the field by providing mechanistic evidence for a tightly controlled and coordinated maturation of selected ncRNAs.

      Audience:

      Basic research and specialized. The interest in this research will rather be limited to a specific field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript presents a novel mode of processing for polycistronic snoRNAs in the yeast Saccharomyces pombe. The authors demonstrate that the processing sequence of a transcription unit containing U14, intronic snR107, and an overlapping non-coding mamRNA is determined by secondary structures recognized by RNase III (Pac1). Specifically, the formation of a stem structure over the mamRNA exon-exon junction facilitates the processing of terminal exonic-encoded U14. Consequently, U14 maturation occurs only after the mamRNA intron (containing snR107) is spliced out. This mechanism prevents the accumulation of unspliced, truncated mamRNA.

      The first section describing the processing steps is challenging to follow due to the unusual organization of the locus and maturation pathway. If the manuscript is intended for a broad audience, I recommend simplifying this section and presenting it in a more accessible manner. A larger diagram illustrating the transcription unit and processing intermediates would be beneficial. Additionally, introducing snR107 earlier in the text would improve clarity.

      Evaluation of some results is difficult due to the overexposure of Northern blot signals in Figures 1 and 2. The unspliced and spliced precursors appear as a single band, making it hard to distinguish processing intermediates. Would the authors consider presenting these results similarly to Figure 3, where bands are more clearly resolved? Or presenting both overexposed and underexposed blots?

      Additionally, I noticed a discrepancy in U14 detection: Probe B gives a strong signal for U14 in Figure 3B, whereas in Figures 1 and 2, U14 appears as faint bands. Could the authors clarify this inconsistency? Furthermore, ethidium bromide (EtBr) staining of rRNA is used as a loading control, but overexposed signals from the gel may not accurately reflect RNA amounts on the membrane. This could affect the interpretation of mature RNA species' relative abundance.

      To further support the sequential processing model, the authors could use pulse-labeling thiouracil to test the accumulation of newly transcribed RNAs and accumulation of individual sopecies. Additionally, it could help determine whether U14 can be processed through alternative, less efficient pathways. Would the authors consider incorporating this approach?

      In the final section, the authors propose that this processing mechanism is conserved across species, identifying 12 similar genetic loci in different organisms. This is very interesting finding. In my opinion, providing any experimental evidence would greatly strengthen this claim and the manuscript's significance. Even preliminary validation would add substantial value!

      Referees cross-commenting

      The other two reviewers' comments are justified.

      Significance

      The authors describe an interesting novel mode of snoRNA procseeimg form the host transcript. The results appear sound and intriguing, especially if the proposed mechanism can be confirmed across different organisms. Including such validation would significantly enhance the impact and make this work of broad audience interest.

      My expertise: transcription, non-coding RNAs

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors describe a novel pattern of ncRNA processing by Pac1. Pac1 is a RNase III family member in S. pombe that has previously been shown to process pre-snoRNAs. Other RNase III family members, such as Rnt1 in S. cerevisiae and Dosha in human, have similar roles in cleaving precursors to ncRNAs (including miRNA, snRNA, snoRNA, rRNA). All RNAse III family members share that they recognize and cleave dsRNA regions, but differ in their exact sequence and structure requirement. snoRNAs can be processed from their own precursor, a polycistronic pre-cursor, or the intron of a snoRNA host gene. After the intron is spliced out, the snoRNA host gene can either encode an protein or be a non-functional by product.

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. snR107 is encoded within an intron and processed from the spliced out intron, similar to a typical intron-encoded snoRNA. What is different is that upon splicing, the host gene can adopt a new secondary structure that requires base-pairing between exon 1 and exon2, generating a Pac1 recognition site. This site is recognized, resulting in cleaving of the RNA and further processing of the 3' cleavage product into U14 snoRNA. In addition, the 5' cleavage product is processed into a ncRNA named mamRNA. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA. They conclude that making the formation of the Pac1 binding site dependent on previous splicing ensures that both snoRNAs are produced in the correct order and amount. Some of the authors findings are further confirmed by a different pre-print (reference 19), but the other reprint did not reveal the involvement of Pac1.

      While the analysis on the mamRNA/snR107/U14 precursor is convincing, as a single example the impact of these findings is uncertain. In Figure 4 and supplemental table 1, the authors use bioinformatic searches and identify other candidate loci in plans and animals that may be processed similarly. Each of these loci encode a putative precursor that results in one snoRNA processed from an intron, a different snoRNA processed from an exon, and a double stranded structure that can only form after splicing. While is potentially interesting, it is also the least developed and could be discussed and developed further as detailed below.

      Major comments:

      1. The proposal that plant and animal pre-snoRNA clusters are processed similarly is speculative. the authors provide no evidence that these precursors are processed by an RNase III enzyme cutting at the proposed splicing-dependent structure. This should not be expected for publication, but would greatly increase the interest.
      2. The authors provide examples of similarly organized snoRNA clusters from human, mouse and rat, but the examples are not homologous to each other. Does this mean these snoRNA clusters are not conserved, even between mammals? Are the examples identified in Arabidopsis conserved in other plants? If there is no conservation, wouldn't that indicate that this snoRNA cluster organization offers no benefit?
      3. Supplemental Figure 4 shows some evidence that the S. pombe gene organization is conserved within the Schizosaccharomyces genus, but could be enhanced further by showing what sequences/features are conserved. Presumably the U14 sequence is conserved, but snR107 is not indicated. Is it not conserved? Is the stem-loop more conserved than neighboring sequences? Are there any compensatory mutations that change the sequence but maintain the structure? Is there evidence for conservation outside the Schizosaccharomyces genus?
      4. The authors suggest that snoRNAs can be processed from the exons of protein coding genes, but snoRNA processing would destroy the mRNA. Thus snoRNAs processing and mRNA function seem to be alternative outcomes that are mutually exclusive. Can the authors comment?

      Minor comments:

      1. The term "exonic snoRNA" is confusing. Isn't any snoRNA by definition an exon?
      2. The methods section does not include how similar snoRNA clusters were identified in other species
      3. In the discussion the authors argue that a previously published observation that S. pombe U14 does not complement a S. cerevisiae mutation can be explained because "was promoter elements... were simply not included in the transgene sequence". However, even if promoter elements were included, the dsRNA structure of S. pombe would not be cleaved by the S. cerevisiae RNase III. I doubt that missing promoter elements are the full explanation, and the authors provide insufficient data to support this conclusion.

      Referees cross-commenting

      I agree with the other 2 reviewers but think the thiouracil pulse labeling reviewer 2 suggests would take considerable work and if snoRNA processing is very fast might not be as conclusive as the reviewer suggests.

      Significance

      In the current manuscript the authors show that in S. pombe snoRNA snR107 and U14 are processed from a common precursor in a way that has not previously been described. The experiments describing this processing are thorough and convincing, and include RNAseq, degradome sequencing, northern blotting, qRT-PCR and the analysis of mutations that disrupt various secondary structures in figures 1, 2, and 3. The authors thereby describe a previously unknown gene design where both the exon and the intron are processed into a snoRNA.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Response to reviewers

      We would like to thank the reviewers for carefully reading our manuscript and for their valuable comments in support for the publication of our investigation of rapid promoter evolution of accessory gland genes between Drosophila species and hybrids. We are glad to read that the reviewers find our work interesting and that it provides valuable insights into the regulation and divergence of genes through their promoters. We are encouraged by their acknowledgement of the overall quality of the work and the importance of our analyses in advancing the understanding of cis-regulatory changes in species divergence.

      2. Point-by-point description of the revisions

      Reviewer #

      Reviewer Comment

      Author Response/Revision

      Reviewer 1

      The authors test the hypothesis that promoters of genes involved in insect accessory glands evolved more rapidly than other genes in the genome. They test this using a number of computational and experimental approaches, looking at different species within the Drosophila melanogaster complex. The authors find an increased amount of sequence divergence in promoters of accessory gland proteins. They show that the expression levels of these proteins are more variable among species than randomly selected proteins. Finally, they show that within interspecific hybrids, each copy of the gene maintains its species-specific expression level.

      We thank Reviewer 1 for their detailed review and positive feedback on our manuscript, and for their helpful suggestions. We have now fully addressed the points raised by Reviewer 1 and have provided the suggested clarifications and revisions to improve the flow, readability, and presentation of the data, which we believe have improved the manuscript significantly.

      The work is done with expected standards of controls and analyses. The claims are supported by the analysis. My main criticism of the manuscript has to do not with the experiments or conclusion themselves but with the presentation. The manuscript is just not very well written, and following the logic of the arguments and results is challenging.

      The problem begins with the Abstract, which is representative of the general problems with the manuscript. The Abstract begins with general statements about the evolution of seminal fluid proteins, but then jumps to accessory glands and hybrids, without clarifying what taxon is being studied, and what hybrids they are talking about. Then, the acronym Acp is introduced without explanation. The last two sentences of the Abstract are very cumbersome and one has to reread them to understand how they link to the beginning of the Abstract.

      More generally, if this reviewer is to be seen as an "average reader" of the paper, I really struggled through reading it, and did not understand many of the arguments or rationale until the second read-through, after I had already read the bottom line. The paragraph spanning lines 71-83 is another case in point. It is composed of a series of very strongly worded sentences, almost all starting with a modifier (unexpectedly, interestingly, moreover), and supported by citations, but the logical flow doesn't work. Again, reading the paragraph after I knew where the paper was going was clearer, but on a first read, it was just a list of disjointed statements.

      Since most of the citations are from the authors' own work, I suspect they are assuming too much prior understanding on the part of the reader. I am sure that if the authors read through the manuscript again, trying to look through the eyes of an external reader, they will easily be able to improve the flow and readability of the text.

      We thank the reviewer for their detailed feedback and are glad that they acknowledge our work fully supports the claims of our manuscript. We also appreciate their helpful suggestions for improving the readability of the manuscript and have done our best to re-write the abstract and main text where indicated. In particular, the paragraph between lines 71-83 have been rewritten and we have taken care to write to non-expert readers.

      1) In the analysis of expression level differences, it is not clear what specific stage / tissue the levels taken from the literature refer to. Could it be that the source of the data is from a stage or tissue where seminar fluid proteins will be expressed with higher variability in general (not just inter-specifically) and this could be skewing the results? Please add more information on the original source of the data and provide support for their validity for this type of comparison.

      These were taken from publicly available adult male Drosophila datasets, listed in the data availability statement and throughout the manuscript. We have provided more detail on the tissue used for analysis of Acp gene expression levels.

      2) The sentence spanning lines 155-157 needs more context.

      We have added more context to lines 155-157.

      3) Line 203-204: What are multi-choice enhancers?

      We replaced the sentence with "... such as rapidly evolving enhancers or nested epistasis enhancer networks"

      4) Figure 1: The terminology the authors use, comparing the gene of interest to "Genome" is very confusing. They are not comparing to the entire genome but to all genes in the genome, which is not the same.

      We have changed the word "genome" to "all genes in the genome" on the reviewer's suggestion.

      5) Figure 2: Changes between X vs. Y is redundant (either changes between X and Y or changes in X vs. Y).

      We assume that the reviewer is referring to Fig. 2B, which does not measure changes between X and Y, but changes in distribution between Acps and the control group. We have explained this in the figure legend.

      The manuscript addresses a general question in evolutionary biology - do control regions diverge more quickly protein coding regions. The answer is that yes, they do, but this is actually not very surprising. The work is probably thus of more interest to people interested in the copulatory proteins or in the evolution of mating systems, than to people interested in broader evolutionary questions.

      We appreciate this reviewer's recognition of the significance of our work and would like to point out that there are very few studies looking at promoter evolution as detailed in the introduction. Of particular relevance, our study using Acp genes allows us to directly test the impact of promoter mutations on the expression by comparing two alleles in male accessory glands of Drosophila hybrids. Male accessory glands consist of only two secretory cell types allowing us to study evolution of gene expression in a single cell type (Acps are either expressed in main cells or secondary cells). Amid this unique experimental set up we can conclude that promoter mutations can act dominant, in contrast to mutations in protein coding regions, which are generally recessive. Thus, our study is unique in pointing out a largely overseen aspect of gene evolution.

      Reviewer 2

      This manuscript explores promoter evolution of genes encoding seminal fluid proteins expressed in the male accessory gland of Drosophila and finds cis-regulatory changes underlie expression differences between species. Although these genes evolve rapidly it appears that the coding regions rarely show signs of positive selection inferring that changes in their expression and hence promoter sequences can underlie the evolution of their roles within and among species.

      We thank Reviewer 2 for their thorough review, positive feedback on the importance of our work, and suggestions for improving the manuscript. We have addressed all points raised by the reviewer, including analysis of Acp coding region evolution, additional analyses of hybrid expression data, and improved the clarity of the text.

      Figure 1 illustrates evidence that the promoter regions of these gene have accumulated more changes than other sampled genes from the Drosophila genome. While this convinces that the region upstream of the transcription start site has diverged considerably in sequence (grey line compared to black line), Figure 1A also suggests the "Genespan" region which includes the 5'UTR but presumably also part of the coding region is also highly diverged. It would be useful to see how the pattern extends into the coding region further to compare further to the promoter region (although Fig 1H does illustrate this more convincingly).

      The reviewer raises an interesting point, and certainly all parts of genes evolve. Fig. 1A shows the evolutionary rates of Acps compared to the genome average from phyloP27way scores calculated from 27 insect species. Since these species are quite distant it is unsurprising that they show divergence in coding regions as well as promoter regions. In fact, we addressed whether promoter regions evolve fast in closely related Drosophila species in Fig. 1H compared to coding regions. We have included an additional analysis of coding region evolution in Figure 1B.

      Figure 2 presents evidence for significant changes in (presumably levels of) expression of male accessory gland protein (AcP) genes and ribosomal proteins genes between pairs of species, which is reflected in the skew of expression compared to randomly selected genes.

      Correct, we have rephrased the statement for clarity.

      Figure 3 shows detailed analysis for 3 selected AcP genes with significantly diverged expression. The authors claim this shows 'substitution' hotspots in the promoter regions of all 3 genes but this could be better illustrated by extending the plots in B-D further upstream and downstream to compare to these regions.

      We picked the 300-nucleotide promoter region for this analysis as it accumulated significant changes as shown in Fig. 1E-H, and extending the G plots (Fig. 3B-D) to regions with lower numbers of sequence changes would not substantially change the conclusion. Specifically, this analysis identifies sequence change hotspots within fast-evolving promoter regions, rather than comparing promoter regions to other genomic regions, as we previously addressed. The plot is based on a cumulative distribution function and the significant positive slope in the upstream region where promoters are located identifies a hotspot for accumulation of substitutions. There could be other hotspots, but the point being made is that significant hotspots consistently appear in the promoter region of these three genes.

      Figure 4 shows the results of expression analysis in parental lines of each pair of species and F1 hybrids. However the results are very difficult to follow in the figure and in the relevant text. While the schemes in A, C. E and G are helpful, the gel images are not the best quality and interpretations confusing. An additional scheme is needed to illustrate hypothetical outcomes of trans change, cis change and transvection to help interpret the gels. On line 169 (presumably referring to panels D and F although C and D are cited on the next line) the authors claim that Obp56f and CG11598 'were more expressed in D. melanogaster compared to D. simulans' but in the gel image the D. sim band is stronger for both genes (like D. sechellia) compared to the D. mel band. The authors also claim that the patterns of expression seen in the F1s are dominant for one allele and that this must be because of transvection. I agree this experiment is evidence for cis-regulatory change. However the interpretation that it is caused by transvection needs more explanation/justification and how do the authors rule out that it is not a cis X trans interaction between the species promoter differences and differences in the transcription factors of each species in the F1? Also my understanding is that transvection is relatively rare and yet the authors claim this is the explanation for 2/4 genes tested.

      We appreciate the reviewer's comments on Figure 4 and the opportunity to improve its clarity. To address these concerns, we have carefully checked the figure citations and corrected any inconsistencies.

      The reviewer raises an important point about our interpretation of transvection. We have expanded our discussion of this result to consider why transvection is a plausible explanation for the observed dominance patterns and also consider cis x trans interactions between species-specific promoters and transcription factor binding. While rare, transvection likely has more relevance in hybrid regulatory contexts involving homologous chromosome pairing which we discuss this in the revised text.

      Line 112 states that the melanogaster subgroup contains 5 species - this is incorrect - while this study looked at 5 species there are more species in this subgroup such as mauritiana and santomea.

      We have corrected the statement about the number of species in the melanogaster subgroup.

      Lines 131-134 could explain better what the conservation scores and their groupings mean and the rationale for this approach.

      We have clarified what the conservation scores and their groupings mean and the rationale for this approach.

      Line 162 - the meaning of the sentence starting on this line is unclear - it sounds very circular.

      We have rephrased the statement for more clarity.

      Line 168 should cite Fig 4 H instead of F.

      We have amended citation of Fig 4F to H.

      Reviewer 3

      In this study, McQuarrie et al. investigate the evolution of promoters of genes encoding accessory gland proteins (Acps) in species within the D. melanogaster subgroup. Using computational analyses and available genomic and transcriptomic datasets, they demonstrate that promoter regions of Acp genes are highly diverse compared to the promoters of other genes in the genome. They further show that this diversification correlates with changes in gene expression levels between closely related species. Complementing these computational analyses, the authors conduct experiments to test whether differences in expression levels of four Acp genes with highly diverged promoter regions are maintained in hybrids of closely related species. They find that while two Acp genes maintain their expression level differences in hybrids, the other two exhibit dominance of one allele. The authors attribute these findings to transvection. Based on their data, they conclude that rapid evolution of Acp gene promoters, rather than changes in trans, drives changes in Acp gene expression that contribute to speciation.

      We thank Reviewer 3 for their thorough review and suggestions. We further thank the reviewer for acknowledging the importance of our findings and for pointing out that it contributes to our understanding of speciation. We have thoroughly addressed all comments from the reviewer and significantly revised the manuscript. We believe that this has greatly improved the manuscript.

      Unfortunately, the presented data are not sufficient to fully support the conclusions. While many of the concerns can be addressed by revising the text to moderate the claims and acknowledge the methodological limitations, some key experiments require repetition with more controls, biological replicates, and statistical analyses to validate the findings.

      Specifically, some of the main conclusions heavily rely on the RT-PCR experiments presented in Figure 4, which analyze the expression of four Acp genes in hybrid flies. The authors use PCR and RFLP to distinguish species-specific alleles but draw quantitative conclusions from what is essentially a qualitative experiment. There are several issues with this approach. First, the experiment includes only two biological replicates per sample, which is inadequate for robust statistical analysis. Second, the authors did not measure the intensity of the gel fragments, making it impossible to quantify allele-specific expression accurately. Third, no control genes were used as standards to ensure the comparability of samples.

      The gold standard for quantifying allele-specific expression is using real-time PCR methods such as TaqMan assays, which allow precise SNP genotyping. To address this major limitation, the authors should ideally repeat the experiments using allele-specific real-time PCR assays. This would provide a reliable and quantitative measurement of allele-specific expression.

      If the authors cannot implement real-time PCR, an alternative (though less rigorous) approach would be to continue using their current method with the following adjustments:

      • Include a housekeeping gene in the analysis as an internal control (this would require identifying a region distinguishable by RFLP in the control).

      • Quantify the intensity of the PCR products on the gel relative to the internal standard, ensuring proper normalization.

      • Increase the sample size to allow for robust statistical analysis.

      These experiments could be conducted relatively quickly and would significantly enhance the validity of the study's conclusions.

      We thank the reviewer for their detailed suggestions for improving the conclusions in Fig. 4. Indeed, incorporating a housekeeping gene as a control supports our results for qualitative analysis of gene expression in hybrids assessing each allele individually (Fig 4), and improves interpretation for non-experts. We have also included an additional analysis in the new Fig. 5 which analyses RNA-seq expression changes in D. melanogaster x D. simulans hybrid male accessory glands. We believe these additions have significantly improved the manuscript and its conclusions.

      While the following comments are not necessarily minor, they can be addressed through revisions to the text without requiring additional experimental work. Some comments are more conceptual in nature, while others concern the interpretation and presentation of the experimental results. They are provided in no particular order.

      1. A key limitation of this study is the use of RNA-seq datasets from whole adult flies for interspecies gene expression comparisons. Whole-body RNA-seq inherently averages gene expression across all tissues, potentially masking tissue-specific expression differences. While Acp genes are likely restricted to accessory glands, the non-Acp genes and the random gene sets used in the analysis may have broader expression profiles. As a result, their expression might be conserved in certain tissues while diverging in others- an aspect that whole-body RNA-seq cannot capture. The authors should acknowledge that tissue-specific RNA-seq analyses could provide a more precise understanding of expression divergence and potentially reveal reduced conservation when considering specific tissues independently.

      We have added a section discussing the limitations in gene expression analysis in the discussion. In addition, we have included an additional Figure analysing gene expression in hybrid male accessory glands (Fig. 5).

      1. The statement in line 128, "Consistent with this model," does not accurately reflect the findings presented in Figures 2A and B. Specifically, the data in Figure 2A show that Acp gene expression divergence is significantly different from the divergence of non-Acp genes or a random sample only in the comparison between D. melanogaster and D. simulans. However, when these species are compared to D. yakuba, Acp gene expression divergence aligns with the divergence patterns of non-Acp genes or random samples. In contrast, Figure 2B shows that the distribution of expression changes is skewed for Acp genes compared to random control samples when D. melanogaster or D. simulans are compared to D. yakuba. However, this skew is absent when the two D. melanogaster and D. simulans are compared. Therefore, the statement in line 128 should be revised to accurately reflect these nuanced results and the trends shown in Figure 2A and B.

      We have updated the statement for clarity. Here, the percentage of Acps showing significant gene expression changes is greater between more closely related species, but the distribution of expression changes increases between more distantly related species.

      1. The statement in lines 136-138, "Acps were enriched for significant expression changes in the faster evolving group across all species," while accurate, overlooks a key observation. This trend was also observed in other groups, including those with slower evolving promoters, in some of the species' comparisons. Therefore, the enrichment is not unique to Acps with rapidly evolving promoters, and this should be explicitly acknowledged in the text.

      This is a valid point, and we have updated this statement as suggested.

      1. It would be helpful for the authors to explain the meaning of the d score at the beginning of the paragraph starting in line 131, to ensure clarity for readers unfamiliar with this metric.

      This scoring method is described in the methods sections, and we have now included reference to thorough explanation of how d was calculated at the indicated section.

      1. In Figure 2C-E - the title of the Y-axis does not match the text. If it represents the percentage of genes with significant expression changes, as in Figure 2A, the discrepancies between the percentages in this figure and those in Figure 2A need to be addressed.

      We have updated the method used to categorise significant changes in gene expression in the text and the figure legend for clarity.

      1. The experiment in Figure 3 needs a better explanation in the text. What is the analysis presented in Figure 3B-D. How many species were compared?

      We have added additional details in the results section and an explanation of how sequence change hotspots were calculated in the results section is available.

      1. The concept of transvection should be omitted from this manuscript. First, the definition provided by the authors is inaccurate. Second, even if additional experiments were to convincingly show that one allele in hybrid animals is dominant over the other, there are alternative explanations for this phenomenon that do not involve transvection. The authors may propose transvection as a potential model in the discussion, but they should do so cautiously and explicitly acknowledge the possibility of other mechanisms.

      We have updated the text to more conservatively discuss transvection, moving this to the discussion section with additional possibilities discussed.

      1. The statement at the end of the introduction is overly strong and would benefit from more cautious phrasing. For instance, it could be reworded as: "These findings suggest that promoter changes, rather than genomic background, play a significant role in driving expression changes, indicating that promoter evolution may contribute to the rise of new species."

      We have reworded this line following the reviewer's suggestion.

      1. Line 32 of the abstract: The term "Acp" is introduced without explaining what it stands for. Please define it as "Accessory gland proteins (Acp)" when it first appears.

      We have updated the manuscript to define Acp where it is first mentioned.

      1. Line 61: The phrase "...through relaxed,..." is unclear. Specify what is relaxed (e.g., "relaxed selective pressures").

      We have included description of relaxed selective pressures.

      1. The sentence in lines 74-76, starting in "Interestingly,...." Needs revision for clarity.

      We have removed the word interestingly.

      1. Line 112: Revise "we focused on the melanogaster subgroup which is made up of five species" to: "we focused on the melanogaster subgroup, which includes five species."

      We have made this change in the text.

      1. In line 144 use the phrase "promoter conservation" instead of "promoter evolution"

      We have updated the phrasing.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, McQuarrie et al. investigate the evolution of promoters of genes encoding accessory gland proteins (Acps) in species within the D. melanogaster subgroup. Using computational analyses and available genomic and transcriptomic datasets, they demonstrate that promoter regions of Acp genes are highly diverse compared to the promoters of other genes in the genome. They further show that this diversification correlates with changes in gene expression levels between closely related species. Complementing these computational analyses, the authors conduct experiments to test whether differences in expression levels of four Acp genes with highly diverged promoter regions are maintained in hybrids of closely related species. They find that while two Acp genes maintain their expression level differences in hybrids, the other two exhibit dominance of one allele. The authors attribute these findings to transvection. Based on their data, they conclude that rapid evolution of Acp gene promoters, rather than changes in trans, drives changes in Acp gene expression that contribute to speciation.

      Major comments:

      Unfortunately, the presented data are not sufficient to fully support the conclusions. While many of the concerns can be addressed by revising the text to moderate the claims and acknowledge the methodological limitations, some key experiments require repetition with more controls, biological replicates, and statistical analyses to validate the findings.

      Specifically, some of the main conclusions heavily rely on the RT-PCR experiments presented in Figure 4, which analyze the expression of four Acp genes in hybrid flies. The authors use PCR and RFLP to distinguish species-specific alleles but draw quantitative conclusions from what is essentially a qualitative experiment. There are several issues with this approach. First, the experiment includes only two biological replicates per sample, which is inadequate for robust statistical analysis. Second, the authors did not measure the intensity of the gel fragments, making it impossible to quantify allele-specific expression accurately. Third, no control genes were used as standards to ensure the comparability of samples.

      The gold standard for quantifying allele-specific expression is using real-time PCR methods such as TaqMan assays, which allow precise SNP genotyping. To address this major limitation, the authors should ideally repeat the experiments using allele-specific real-time PCR assays. This would provide a reliable and quantitative measurement of allele-specific expression.

      If the authors cannot implement real-time PCR, an alternative (though less rigorous) approach would be to continue using their current method with the following adjustments:

      • Include a housekeeping gene in the analysis as an internal control (this would require identifying a region distinguishable by RFLP in the control).
      • Quantify the intensity of the PCR products on the gel relative to the internal standard, ensuring proper normalization.
      • Increase the sample size to allow for robust statistical analysis. These experiments could be conducted relatively quickly and would significantly enhance the validity of the study's conclusions.

      Minor comments

      While the following comments are not necessarily minor, they can be addressed through revisions to the text without requiring additional experimental work. Some comments are more conceptual in nature, while others concern the interpretation and presentation of the experimental results. They are provided in no particular order. 1. A key limitation of this study is the use of RNA-seq datasets from whole adult flies for interspecies gene expression comparisons. Whole-body RNA-seq inherently averages gene expression across all tissues, potentially masking tissue-specific expression differences. While Acp genes are likely restricted to accessory glands, the non-Acp genes and the random gene sets used in the analysis may have broader expression profiles. As a result, their expression might be conserved in certain tissues while diverging in others- an aspect that whole-body RNA-seq cannot capture. The authors should acknowledge that tissue-specific RNA-seq analyses could provide a more precise understanding of expression divergence and potentially reveal reduced conservation when considering specific tissues independently. 2. The statement in line 128, "Consistent with this model," does not accurately reflect the findings presented in Figures 2A and B. Specifically, the data in Figure 2A show that Acp gene expression divergence is significantly different from the divergence of non-Acp genes or a random sample only in the comparison between D. melanogaster and D. simulans. However, when these species are compared to D. yakuba, Acp gene expression divergence aligns with the divergence patterns of non-Acp genes or random samples. In contrast, Figure 2B shows that the distribution of expression changes is skewed for Acp genes compared to random control samples when D. melanogaster or D. simulans are compared to D. yakuba. However, this skew is absent when the two D. melanogaster and D. simulans are compared. Therefore, the statement in line 128 should be revised to accurately reflect these nuanced results and the trends shown in Figure 2A and B. 3. The statement in lines 136-138, "Acps were enriched for significant expression changes in the faster evolving group across all species," while accurate, overlooks a key observation. This trend was also observed in other groups, including those with slower evolving promoters, in some of the species' comparisons. Therefore, the enrichment is not unique to Acps with rapidly evolving promoters, and this should be explicitly acknowledged in the text. 4. It would be helpful for the authors to explain the meaning of the d score at the beginning of the paragraph starting in line 131, to ensure clarity for readers unfamiliar with this metric. 5. In Figure 2C-E - the title of the Y-axis does not match the text. If it represents the percentage of genes with significant expression changes, as in Figure 2A, the discrepancies between the percentages in this figure and those in Figure 2A need to be addressed. 6. The experiment in Figure 3 needs a better explanation in the text. What is the analysis presented in Figure 3B-D. How many species were compared? 7. The concept of transvection should be omitted from this manuscript. First, the definition provided by the authors is inaccurate. Second, even if additional experiments were to convincingly show that one allele in hybrid animals is dominant over the other, there are alternative explanations for this phenomenon that do not involve transvection. The authors may propose transvection as a potential model in the discussion, but they should do so cautiously and explicitly acknowledge the possibility of other mechanisms. 8. The statement at the end of the introduction is overly strong and would benefit from more cautious phrasing. For instance, it could be reworded as: "These findings suggest that promoter changes, rather than genomic background, play a significant role in driving expression changes, indicating that promoter evolution may contribute to the rise of new species."

      Text edits:

      Throughout the manuscripts there are incomplete sentences and sentences that are not clear. Below is a list of corrections:

      1. Line 32 of the abstract: The term "Acp" is introduced without explaining what it stands for. Please define it as "Accessory gland proteins (Acp)" when it first appears.
      2. Line 61: The phrase "...through relaxed,..." is unclear. Specify what is relaxed (e.g., "relaxed selective pressures").
      3. The sentence in lines 74-76, starting in "Interestingly,...." Needs revision for clarity.
      4. Line 112: Revise "we focused on the melanogaster subgroup which is made up of five species" to: "we focused on the melanogaster subgroup, which includes five species."
      5. In line 144 use the phrase "promoter conservation" instead of "promoter evolution"

      Significance

      This study addresses an important question in evolutionary biology: how seminal fluid proteins achieve rapid evolution despite showing limited adaptive changes in their coding regions. By focusing on accessory gland proteins (Acps) and examining their promoter regions, the authors suggest promoter-driven evolution as a potential mechanism for rapid seminal fluid protein diversification. While this hypothesis is intriguing and can contribute to our understanding of speciation, more rigorous analysis and experimental validation would be needed to support the conclusions. The revised manuscript can be of interest to fly geneticists and to scientists in the fields of gene regulation and evolution.

      Keywords for my expertise: Enhancers, transcriptional regulation, development, evolution, Drosophila.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This manuscript explores promoter evolution of genes encoding seminal fluid proteins expressed in the male accessory gland of Drosophila and finds cis-regulatory changes underlie expression differences between species. Although these genes evolve rapidly it appears that the coding regions rarely show signs of positive selection inferring that changes in their expression and hence promoter sequences can underlie the evolution of their roles within and among species.

      Major comments

      Figure 1 illustrates evidence that the promoter regions of these gene have accumulated more changes than other sampled genes from the Drosophila genome. While this convinces that the region upstream of the transcription start site has diverged considerably in sequence (grey line compared to black line), Figure 1A also suggests the "Genespan" region which includes the 5'UTR but presumably also part of the coding region is also highly diverged. It would be useful to see how the pattern extends into the coding region further to compare further to the promoter region (although Fig 1H does illustrate this more convincingly).

      Figure 2 presents evidence for significant changes in (presumably levels of) expression of male accessory gland protein (AcP) genes and ribosomal proteins genes between pairs of species, which is reflected in the skew of expression compared to randomly selected genes.

      Figure 3 shows detailed analysis for 3 selected AcP genes with significantly diverged expression. The authors claim this shows 'substitution' hotspots in the promoter regions of all 3 genes but this could be better illustrated by extending the plots in B-D further upstream and downstream to compare to these regions.

      Figure 4 shows the results of expression analysis in parental lines of each pair of species and F1 hybrids. However the results are very difficult to follow in the figure and in the relevant text. While the schemes in A, C. E and G are helpful, the gel images are not the best quality and interpretations confusing. An additional scheme is needed to illustrate hypothetical outcomes of trans change, cis change and transvection to help interpret the gels. On line 169 (presumably referring to panels D and F although C and D are cited on the next line) the authors claim that Obp56f and CG11598 'were more expressed in D. melanogaster compared to D. simulans' but in the gel image the D. sim band is stronger for both genes (like D. sechellia) compared to the D. mel band. The authors also claim that the patterns of expression seen in the F1s are dominant for one allele and that this must be because of transvection. I agree this experiment is evidence for cis-regulatory change. However the interpretation that it is caused by transvection needs more explanation/justification and how do the authors rule out that it is not a cis X trans interaction between the species promoter differences and differences in the transcription factors of each species in the F1? Also my understanding is that transvection is relatively rare and yet the authors claim this is the explanation for 2/4 genes tested.

      Minor comments

      Line 112 states that the melanogaster subgroup contains 5 species - this is incorrect - while this study looked at 5 species there are more species in this subgroup such as mauritiana and santomea.

      Lines 131-134 could explain better what the conservation scores and their groupings mean and the rationale for this approach.

      Line 162 - the meaning of the sentence starting on this line is unclear - it sounds very circular.

      Line 168 should cite Fig 4 H instead of F.

      Significance

      This paper is generally well written although some sections would benefit from more explanation. The paper demonstrates cis-regulatory changes between the promoters of orthologs of male accessory gland genes underlie expression differences but that the species differences are not always reflected in hybrids, which the authors interpret as being caused by transvection although there could be other explanations. Overall this provides new insights into the regulation and divergence of these interesting genes. The paper does not explore the consequences of these changes in gene expression although this is discussed to some extent in the Discussion section.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors test the hypothesis that promoters of genes involved in insect accessory glands evolved more rapidly than other genes in the genome. They test this using a number of computational and experimental approaches, looking at different species within the Drosophila melanogaster complex. The authors find an increased amount of sequence divergence in promoters of accessory gland proteins. They show that the expression levels of these proteins are more variable among species than randomly selected proteins. Finally, they show that within interspecific hybrids, each copy of the gene maintains its species-specific expression level.

      The work is done with expected standards of controls and analyses. The claims are supported by the analysis. My main criticism of the manuscript has to do not with the experiments or conclusion themselves but with the presentation. The manuscript is just not very well written, and following the logic of the arguments and results is challenging. The problem begins with the Abstract, which is representative of the general problems with the manuscript. The Abstract begins with general statements about the evolution of seminal fluid proteins, but then jumps to accessory glands and hybrids, without clarifying what taxon is being studied, and what hybrids they are talking about. Then, the acronym Acp is introduced without explanation. The last two sentences of the Abstract are very cumbersome and one has to reread them to understand how they link to the beginning of the Abstract.

      More generally, if this reviewer is to be seen as an "average reader" of the paper, I really struggled through reading it, and did not understand many of the arguments or rationale until the second read-through, after I had already read the bottom line. The paragraph spanning lines 71-83 is another case in point. It is composed of a series of very strongly worded sentences, almost all starting with a modifier (unexpectedly, interestingly, moreover), and supported by citations, but the logical flow doesn't work. Again, reading the paragraph after I knew where the paper was going was clearer, but on a first read, it was just a list of disjointed statements.

      Since most of the citations are from the authors' own work, I suspect they are assuming too much prior understanding on the part of the reader. I am sure that if the authors read through the manuscript again, trying to look through the eyes of an external reader, they will easily be able to improve the flow and readability of the text.

      More specific comments:

      1. In the analysis of expression level differences, it is not clear what specific stage / tissue the levels taken from the literature refer to. Could it be that the source of the data is from a stage or tissue where seminar fluid proteins will be expressed with higher variability in general (not just inter-specifically) and this could be skewing the results? Please add more information on the original source of the data and provide support for their validity for this type of comparison.
      2. The sentence spanning lines 155-157 needs more context.
      3. Line 203-204: What are multi-choice enhancers?
      4. Figure 1: The terminology the authors use, comparing the gene of interest to "Genome" is very confusing. They are not comparing to the entire genome but to all genes in the genome, which is not the same.
      5. Figure 2: Changes between X vs. Y is redundant (either changes between X and Y or changes in X vs. Y).

      Significance

      The manuscript addresses a general question in evolutionary biology - do control regions diverge more quickly protein coding regions. The answer is that yes, they do, but this is actually not very surprising. The work is probably thus of more interest to people interested in the copulatory proteins or in the evolution of mating systems, than to people interested in broader evolutionary questions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Reviewer #1 Evidence, reproducibility and clarity:

        Summary:

      In this manuscript, authors demonstrated the role of ECM-dependent MEK/ERK/MITF signaling pathway that influences the plasticity of MCs (melanocytes) through their interactions with the environment. The findings emphasize the essential role of the extracellular matrix (ECM) in controlling MC function and differentiation, highlighting a critical need for further research to understand the complex interactions between mechanical factors and ECM components in the cellular microenvironment. Overall, the manuscript is concise, written well and shed light on a complex relationship between ECM protein types and substrate stiffness that affects MC mechanosensation. However, understanding detailed molecular mechanisms involved, especially the roles of MITF and other key regulators, is crucial for comprehending MC function and related pathologies. Authors need to clarify some minor queries to be considered for publication.

      We thank this reviewer for the time and caution taken to assess our work. To provide a better understanding of the molecular mechanisms involved in MITF modulation and MC function in response to ECM proteins, we substantially revised the manuscript and now included e.g. bulk RNA sequencing, pharmacological inhibition of FAK and ERK (in addition to MEK inhibition), and MITF depletion.

      Major comments to the Authors:

        • Authors have chosen ERK signaling pathways to test and draw their conclusion based on existing knowledge in the field, as several studies previously reported the role of ECM to modulate the ERK signaling pathway but it would be interesting to test other signaling pathways unbiasedly; e.g. ECM can also regulate Wnt signaling (PMID: 29454361) and connection of MITF and its target gene TYR expression is also regulated by Wnt in context of melanocyte. (PMID: 29454361, PMID: 34878101, PMID: 38020918).*

      The new transcriptome analysis (line 258 ff., revised fig. 5, new fig. 6, new suppl. fig. S5) indeed showed that some components of the Wnt signaling pathway are differentially expressed in response to ECM proteins (new fig. 6B). In comparison, however, the expression of genes involved in MAPK/ERK signaling was more prominently affected by the specific ECM types (new fig. 6C, D), congruent with the biochemical results we presented in the original manuscript. We therefore focused our mechanistic analyses on this pathway, and we consolidated our initial findings with additional pharmacological inhibition experiments. Specifically, like MEK inhibition, ERK inhibition (new fig. 6J-L) increased both MITF nuclear localization and melanin production in MCs exposed to FN, reinforcing the relevance of this pathway in control of MC functions in the model used.

      We agree that an additional contribution of Wnt signaling to ECM-dependent regulation of MC phenotypes is possible, including Mitf and Tyr expression. Next to the new Wnt-related transcriptome data (line 323 ff., new fig. 6B), we therefore now included a short discussion on that aspect (line 478 ff.). However, we feel that a comprehensive comparison of the individual contributions of Wnt vs. ERK signaling is beyond the scope of the current manuscript.

      • Discussion line 340-344. Please provide the data as it is directly connected to the study, and it would be crucial to interpret data better. As FAK is upregulated and FAK inhibitor did not reduce pERK, is there any possibility that other kinases might involve. Please discuss. Again, authors should check Wnt activation as FAK can activate Wnt signaling in response to matrix stiffness as well. (PMID 29454361).*

      We agree with the reviewer that the FAK data required further investigation. In the revised version, we re-examined the potential role of FAK as an upstream regulator of ERK activation using the FAK inhibitor Ifebemtinib, rather than Defactinib as used in our original experiments. Our previous conclusion-that ERK activation was independent of FAK-was likely influenced by limitations associated with Defactinib, which did not properly reduce p-FAK levels despite lowering focal adhesion numbers, accompanied with an increase of ERK phosphorylation alongside a decrease of nuclear MITF levels. In contrast, Ifebemtinib treatment led to a more effective inhibition of FAK, as evidenced by a marked reduction in both p-FAK levels and focal adhesion number (new suppl. fig. 6B,C). Importantly, this was accompanied by a significant decrease in p-ERK levels (new fig. 6M,N), suggesting that FAK contributes to ERK activation in response to ECM molecules in our model. Furthermore, FAK inhibition similar to MEK and ERK inhibition, led to increased melanin production in MCs cultured on FN (new fig. 6O). These new data are now included in the revised version of the manuscript (line 360 ff., new fig. 6M-O, new suppl. fig. 6).

      Nonetheless, this does not exclude the possibility that additional kinases and pathways, including Wnt signaling, may also be involved. We acknowledge this possibility in the revised discussion (lines 478-488).

      • Rationale for selecting MITF for the study is very weak. Please justify in the discussion why authors have chosen to study MITF/ERK axis with a more logistic approach.*

      We have focused central aspects of our analyses on MITF because it is a central regulator of MC differentiation, pigmentation, and survival, and its activity has previously been reported to be modulated by ERK. Considering the observed changes in pigmentation, proliferation, and gene expression in response to distinct ECM molecules, we hypothesized that MITF acts as a key integrator of these ECM-dependent signals. Our data indeed support this rationale: we detected ECM-type-dependent MITF levels and localization, and manipulating the ERK pathway altered MITF activity and associated functional outputs. Moreover, siRNA-mediated downregulation of MITF in MCs cultured on COL I led to a marked reduction in melanin content (revised fig. 4D). Together, these data emphasize that the ERK/MITF axis serves as a pathway that responds to extracellular cues and links these to MC behavior. For clarity, we have included an additional explanation on our rationale in the revised manuscript (lines 146-152).

      • It is suggested to check for the changes in the transcriptomic profile of melanocytes upon culturing on different matrix to get a more comprehensive view associated with the molecular mechanisms involved.*

      We fully agree with the reviewer on the importance of assessing the ECM-dependent transcriptomes of MCs. Therefore, we have now performed RNA sequencing to compare the transcriptomic profiles of MCs cultured on COL IV-, COL I- and FN-coated stiff substrates (line 258 ff. and revised fig. 5, new fig. 6, new suppl. fig. S5). This analysis provided a broader view of the molecular responses of MCs to ECM molecules and complemented our previous molecular and phenotypes analyses. The obtained transcriptomes confirmed significant modulation of genes associated with MC differentiation and pigmentation, as well as genes involved in signaling pathways such as MAPK/ERK and Wnt (see also answers to points 1-3). These findings help contextualize the ECM-dependent phenotypic changes and strengthen the mechanistic insights presented in the study.

      • Please provide the protein expression of genes involved in cell cycle progression and/or apoptosis to support the data in Fig. 3D-E.*

      To support the observations presented in original fig. 3, we employed immunostaining to assess the protein expression of Ki67, which is both a well-established marker and a protein involved in cell cycle progression (PMID: 28630280). In revised figure 3E, a significant reduction in the proportion of Ki67-positive cells on FN compared to COL I was observed, reinforcing our initial findings derived from BrdU incorporation assays and direct microscopic monitoring of cell division (revised fig. 3D,F).

      In addition, global gene expression analysis revealed differentially expressed genes related to cell cycle regulation and apoptosis (revised fig. 5C,D), in line with the reduced proliferation observed. Notably, FN also triggered the differential expression of genes associated with cellular senescence (revised fig. 5E). Together, these data suggest that adhesion to FN induces a transcriptional and phenotypic shift in MCs toward a less-proliferative state that is associated with differential cell cycle modulation and signs of senescence.

      Minor comment to the Authors:

        • Discussion line 358-359, using term synergy is an overstatement as the collective data do not support the claim. Very little role of matrix stiffness is demonstrated by experimental data.*

      We thank the reviewer for this comment and agree that the term "synergy" may overstate the conclusions drawn from the current dataset. We have therefore removed this term from the revised version of the manuscript to more accurately reflect the data.

      • Method section, BrdU assay and BrdU assay-cell proliferation can be combined in method section.*

      We have combined the descriptions of the BrdU assay and BrdU-based cell proliferation assay into a single, unified section in the Methods.

      • What trigger melanocytes to respond to different microenvironment. Please discuss.*

      To address this question, we have added the following paragraph to the Discussion (lines 377-380): "Our study identifies ECM components as critical environmental triggers that instruct MC behavior. Through dynamic interactions with the ECM, MCs engage adhesion-dependent signaling pathways, such as FAK activation, enabling them to decode contextual ECM inputs and adapt their phenotype accordingly."

      • Fig 3C and 5D Tyr mRNA expression is tested. Authors should also test for the protein expression in the similar set of studies.*

      We thank the reviewer for this suggestion and agree that assessing TYR protein expression would be valuable. However, we have encountered difficulties with the currently available antibodies and detection methods, which in our hands appeared unreliable for consistently detecting endogenous TYR protein levels in MCs under these conditions. For this reason, we relied on Tyr mRNA expression as a robust and reproducible readout and complemented this with functional assays such as melanin content measurement as a read-out that indirectly reflects TYR enzymatic activity. Of note, our transcriptomic analysis also revealed Tyr and other melanogenesis genes as differentially expressed genes when comparing MCs grown on COLI vs FN (revised fig. 5A,B).

      • Line 217-218, Authors claim stiffness mediated increase of MITF nuclear localization in Col I, however Fig. 4A-B does not represent that claim. Please justify.*

      Fig. 4A shows representative images of MCs cultured on stiff substrates coated with different ECM types, while the original figure 4B included the comparison across substrate stiffnesses for each ECM condition. We have now generated additional datasets to assess global MITF levels as well as nuclear localization across stiffness conditions in the presence of the different ECM types, demonstrating that nuclear MITF is significantly higher in cells cultured on stiff vs. soft or intermediate stiffness (revised fig. 4B,C). Of note, we do not detect a significant difference between soft and intermediate substrate stiffness, which could hint to a threshold of MITF dynamics in stiffness sensitivity. We have updated the figure legend and corresponding text to ensure the data presentation accurately supports our conclusions.

      Significance:

      Overall, the study is well-planned, the experiments are well-designed and executed with appropriate use of statistical analysis. However, a more in-depth analysis of the molecular mechanisms is necessary to clarify how the extracellular matrix (ECM) regulates ERK or MITF nuclear translocation.

      We agree and feel that the additional data in the revised manuscript that explored transcriptional changes and the FAK/MEK/ERK/MITF axis in response to ECM proteins provide improved insights into ECM-mediated regulation of ERK and MC pigmentation.

      This study enhances our existing knowledge by linking the well-established role of the extracellular matrix (ECM) in regulating ERK signaling to ERK's involvement in controlling MITF, a key regulator of melanocyte differentiation. It further establishes the ECM's role in controlling melanocyte function and differentiation.

      This study will interest readers working in the field of the tumor microenvironment, as it explores the role of the extracellular matrix and its complexity and stiffness in disease progression, not only in melanoma but also in other types of cancer.

      1. Reviewer #2 Evidence, reproducibility and clarity:

      Summary:

      In their manuscript, Luthold et al describe the behaviour of immortalized mouse melanocytes cultured on various extracellular matrix (ECM) proteins and substrates of different stiffness. They found that fibronectin, collagen IV and collagen I have different effects on melanocyte morphology, migration, and proliferation. They further link these differential effects to MITF localization and MEK/ERK signalling. This work shows that fibronectin supports melanocyte migration, which was associated with a dendritic morphology and correlated with increased MEK/ERK signalling and decreased MITF nuclear localization. In contrast, collagen I promoted melanocyte proliferation with low MEK/ERK signalling, enhanced MITF nuclear localization and high melanin production.

      While this study is well designed and the data adequately presented and interpreted, the impact of its conclusions is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions. To strengthen their manuscript, the authors should consider the following comments:

      We also wish to thank this reviewer for the efforts made to assess our work and help us improve the study. We substantially revised the manuscript and now included e.g. bulk RNA sequencing and various loss-of-function approaches to better delineate the signaling pathways involved in ECM-dependent control of MCs.

      Major comments to the Authors:

        • Characterization of observed phenotypes:*
      • *The link between matrix-sensing and intracellular signalling is missing. Which types of integrins are expressed by iMCs? *

      This is indeed an interesting point. Our RNA sequencing analysis indicates that MCs express integrins known to mediate adhesion to COL I and FN, including Itga2, Itga3, Itga5, Itgav, Itgb1, and Itgb3 (revised fig. 5K). Importantly, the expression of these integrins remains relatively consistent across ECM conditions (COL I, COL IV, and FN), suggesting that the phenotypic differences observed may not be directly explained by variations in integrin expression.

      • Are any of these integrins required for the observed phenotypes?

      To assess a functional involvement, we conducted a pilot experiment blocking β1-integrin in MCs seeded on COL I and observed a marked reduction in MC adhesion (see associated graph 1, provided to this reviewer). However, the compromised cell spreading and resulting widespread detachment introduced confounding effects, making it difficult to interpret downstream events such as MITF nuclear localization. Since such readouts can be indirectly influenced by the overall adhesion state and associated signaling pathways such as FAK, we chose not to pursue further mechanistic analysis using this approach. Targeted strategies (e.g., inducible knockdown, acute protein degradation) will be needed in the future to dissect the precise role of individual integrins in mediating ECM-specific signaling responses in MCs.

      Graph 1: Effect of β1-integrin blocking on MC adhesion. iMCs were detached using PBS-EDTA (10 min, 37 {degree sign}C) and incubated for 15 min on ice with either 10 μg/mL β1-integrin-blocking antibody (CD29, clone TS2/16; Invitrogen, #AB_1210468) or 10 μg/mL IgG isotype control. Cells (5,000 per well) were then seeded on COL I-coated substrates. After 1 h, non-adherent cells were gently washed off with PBS, and adherent cells were fixed with 4% PFA. Cell adhesion was quantified by counting the number of attached cells per µm² under a microscope.

      • The phenotypic changes described here are interesting but only partially analysed. Transcriptomic studies would yield a more complete view of cell state transitions (optional). At a minimum, could the authors detect any changes in cadherin expression, or in other genes classically involved in phenotype switching, such as twist1, snail or zeb1?

      We thank the reviewer for this important suggestion, which helped to improve this manuscript. We have now performed bulk RNA sequencing to analyze global gene expression changes in MCs cultured on different ECM substrates (revised fig. 5, new suppl. fig. 5). Among these, we explored gene expression programs associated with MC plasticity and differentiation (revised fig. 5F-H): MCs cultured on FN exhibited reduced expression of melanocytic differentiation markers and upregulation of genes linked to plasticity, dedifferentiation, and neural crest-like features, suggesting a shift toward a less differentiated state, reflecting aspects of a phenotypic switch.

      Nonetheless, as part of this analysis (but not included in the manuscript), we found that Zeb2, Snai2, and Zeb1 were expressed at similar levels across ECM conditions. Similarly, among the cadherins, Cdh1 and Cdh2 were not differentially expressed, albeit the overall low expression of Cdh1 showed a trend towards a reduction on COL I. Finally, Snai1, Twist1, and Twist2 were detected at very low levels and not significantly regulated as well. These data suggest that, at the chosen experimental conditions, while a clear adaptive phenotypic cell plasticity is observed, classical EMT-like programs are not prominently activated. However, we cannot exclude the possibility that longer culture durations or additional cues could induce such transitions.

      • Lines 235-236, the authors write that ECM proteins regulate melanocyte behaviour "likely through modulation of MITF localization and activity". Could the authors support the role of MITF experimentally? Genetic experiments using different MITF mutants could address this question.

      To experimentally support the role of MITF, we now performed melanin assays following siRNA-mediated knockdown of MITF in MCs grown on COL I or FN. On COL I-coated substrates, MITF depletion led to a marked reduction in melanin content, supporting the conclusion that ECM-dependent regulation of pigmentation in our culture model involves MITF activity. These findings are now included in the revised manuscript (lines 244-245, revised fig. 4D, new fig. S4B).

      • *Additionally, how does MEK/ERK signalling control MITF activity in these melanocytes? The trametinib experiment should be consolidated with other inhibitors (including ERK inhibitors) and/or genetic manipulation. *

      To address this comment, we complemented our former Trametinib experiments with ERK inhibition using Ravoxertinib (new fig. 6J-L). ERK inhibition led to increased nuclear localization of MITF and elevated melanin production, supporting the involvement of MEK/ERK in restraining MITF activity in MCs in response to ECM molecules. These new data are now included in the revised manuscript (line 354 ff. and new fig. 6J-L).

      • Did the authors also measure the effect of trametinib on cell proliferation in Figure 5?

      Overall, compared to the observed pronounced phenotypes like ECM-dependent cell morphology, melanin production and others, the differences in cell proliferation of MCs grown at different ECM conditions were statistically significant but not very large. We therefore refrained from additionally assessing the effect of trametinib on the observed ECM-dependent MC behaviour. Given the well-established role of ERK signaling in promoting cell proliferation, we indeed expect that MEK inhibition can reduce MC proliferation in our system, though it remains open whether there is an ECM-specific aspect to this.

      • Parallels with physiological conditions:*
      • *Most experiments shown were performed with immortalized melanocytes even though authors mention the use of primary cells (pMCs, line 148). Were similar results obtained in primary melanocytes? Do human melanocytes in culture behave similarly? *

      While we have not assessed human MCs, original fig. S2 (__revised fig. S3) __provides data using primary murine MCs (freshly isolated from newborn mice), confirming a similar behavior of primary cells compared to immortalized MCs in terms of cell area, p-FAK levels, number of FAs, melanin production, and MITF nuclear localization.

      • Are some of these observations also true in vivo, for example in mouse skin (optional)?

      The current manuscript focuses on the behavior of MCs in culture, as it was important to use a reductionist model system that can uncouple the effect of distinct ECM types as well as substrate stiffness. However, as a perspective and beyond the scope of this manuscript, we indeed plan to translate our in vitro findings to mouse skin, taking different biophysical and biochemical cues into account. Data from the present in vitro study provides valuable insights into which parameters and which anatomical areas to study in vivo.

      • How do the authors reconciliate their findings that collagen IV induced melanocyte migration and decreased proliferation and melanin production with the fact that melanocytes in human skin are generally in contact with the collagen IV-rich basement membrane?

      We indeed regarded the use of collagen IV (COL IV) as a physiological reference condition, and considered MC migration, proliferation, and melanin production on COL as baseline levels. Relative to COL IV, COL I reduced migration and increased melanin production, while FN led to increased migration, and a decrease of proliferation and melanin production. This suggests that ECM composition can selectively modulate distinct aspects of MC behavior compared to attachment to COL IV. The intermediate state observed on COL IV would be in line with a model in which this abundant basement membrane molecule enables MCs to maintain high flexibility in their phenotype, e.g. to further increase melanin production upon external stimuli other than ECM (UV, inflammation etc.). The perhaps unexpected, opposing response of MCs to FN and COL I, respectively, opens the possibility that under specific (patho)physiological conditions, the then abundant ECM can direct MC behaviour. Both plasma- and cellular-derived FN is deposited upon skin injury and instructs various cell types to promote skin repair. Taking our observations in vitro into account, it is tempting to speculate that this FN-enriched tissue enables MCs to quickly migrate into wound sites to re-establish protection to UV. Conversely, increased COL I levels-as observed in fibrotic conditions such as scleroderma-might favor a more differentiated, pigment-producing phenotype. Interestingly, cases of localized hyperpigmentation have been reported in scleroderma patients, possibly reflecting such matrix-driven MC reprogramming. Though requiring further investigation, these observations open new avenues to explore how dynamic changes in ECM composition contribute to MC behavior in tissue homeostasis and repair.

      We now extended our original discussion to better emphasize the physiological relevance of our findings (lines 383-391) and hypothesize how ECM remodeling may contribute to the dynamic regulation of MC plasticity-not only during tissue homeostasis, but also in response to injury and in fibrotic conditions such as scleroderma (lines 393-406).

      Minor comments to the Authors:

      The evidence that FAK is not responsible for MEK/ERK activation could be presented in the main text rather than in the discussion.

      We thank the reviewer for highlighting this important point. Our initial conclusion-that ERK activation was independent of FAK-likely stemmed from limitations of the previously used FAK inhibitor (Defactinib). In those earlier experiments, while FAK inhibition reduced focal adhesion numbers, p-FAK levels were not properly decreased, and paradoxically, ERK phosphorylation increased alongside decreased nuclear MITF levels. Based on this initial discrepancy and because of this reviewer's comment, we performed additional experiments using another selective FAK inhibitor, Ifebemtinib, which achieved an effective reduction in both p-FAK levels and focal adhesion number (new suppl. fig. S6B, C). In the revised version, we present new experiments using Ifebemtinib, demonstrating that FAK inhibition in fact does reduce p-ERK levels (new fig. 6M-N), thus supporting the notion that FAK contributes to ECM-dependent ERK pathway activation in our model. These findings are now shown in the results section (lines 357-364).

      Significance:

      General assessment: This study establishes the cellular impact of different types of extracellular matrix proteins and stiffness conditions relevant to skin biology on the behaviour of untransformed mouse melanocytes. In particular, it shows opposite effects of fibronectin and collagen I on cell proliferation and migration, which could prove relevant to certain skin conditions in human. However, the scope of these results is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions.

      Advance: The systematic comparison of different microenvironmental conditions on normal melanocyte behaviour is novel and opens perspectives to understand the role of melanocytes in some human skin pathologies.

      Audience: The comparison of different environmental conditions on melanocyte behaviour is of interest to the melanocyte biology community and could have implications for basic and clinical understanding of some skin diseases.

      My expertise is in melanoma biology, including the impact of the microenvironment on tumour cell behaviours.

      1. Reviewer #3 Evidence, reproducibility and clarity:

      In this manuscript Luthold et al. describe how extracelluar matrix proteins and mechanosensation affect melanocyte differentiation. In particular, they show that ECM proteins and surface stiffness lead to effects on the MEK/ERK pathway, thus affecting the MITF transcription factor. The manuscript is interesting, well written and the data presented in a clear and easy-to-follow manner. The data are nicely quantitated and largely convincing.

        • However, the discussion of the nuclear location of MITF (Figure 4A) is not convincing. The images presented show that upon exposure to ColI, there is a lot of MITF in the nucleus, a lot less so upon ColIV and none upon FN exposure. However, we only see a snapshot of the cells and thus we do not know if we are witnessing effects on MITF protein synthesis, degradation or nuclear localization (the least likely scenario since M-MITF, the isoform present in melanocytes is predominantly nuclear anyway). Was there a cytoplasmic signal detected? Upon FN treatment, there is no MITF protein visible in the cells. Does this mean that the protein is not made, that it is degraded or present at such low levels that the antibody does not detect it? The claim of the authors that this affects nuclear localization of MITF needs more corroboration. *

      We thank the reviewer for raising this important point regarding the interpretation of MITF localization. We agree that the data as represented in the original figure 4 cannot distinguish whether changes reflect differences in MITF expression, stability, or subcellular distribution.

      To better address this, we now included a quantitative analysis of both nuclear and cytoplasmic MITF signals (revised fig. 4B). These data show that MITF is detectable in both compartments at all conditions tested. While total MITF levels were not reduced on FN, nuclear MITF was markedly decreased and cytoplasmic MITF was even increased compared to COL I. This indicates that the reduced nuclear signal on FN compared to COL I is not due to an overall loss of MITF protein but rather reflects a shift in its subcellular distribution. These findings support the idea that ECM composition influences MITF localization, consistent with functional changes in its activity and with the observed phenotypic changes.

      • Also, the authors need to show immunocytochemical images for the effects on MITF nuclear localization for the images presented in Figure 5C. *

      As requested, we now provide representative micrographs illustrating the effects on MITF nuclear localization corresponding to the conditions shown in Fig. 5C. These images have been included in the revised version of the manuscript (new fig. 6G), further supporting the quantitative data presented.

      • It seems that the authors quantitated immune-reactivity for both MITF and YAP. What was the control and how was the data normalized? *

      MITF and YAP immunodetection were performed in separate experiments and were not analyzed in the same cells. For both stainings, secondary antibody controls were included (secondary antibody alone without primary antibody), which showed no detectable signal. For MITF and YAP quantifications (revised fig. 4B,F), nuclear (for both) and cytoplasmic (for MITF) intensity values were normalized within each independent experiment by dividing each individual measurement by the mean nuclear intensity across all conditions. This approach allowed us to deal with total signal variability between experiments while preserving relative differences between ECM conditions. For the percentage of nuclear MITF no normalization was applied. We have added this description to the revised methods section.

      • Similarly, the blots and data shown in Figure 5 are not consistent with the text as described in the results section. The differences observed are minor and the only set that is likely to be significant is the FN-set; the differences between soft, intermediate and stiff of the FN-set do not look significantly different. The description of this in the results section should be toned down accordingly.*

      To strengthen the conclusions drawn from the original Fig. 5 (now fig. 6), we performed additional immunoblot experiments to increase the number of replicates. These extended results now show a statistically significant increase in pERK levels in MCs cultured on FN compared to COL I. However, consistent with the reviewer's observation, no significant differences were detected across the stiffness conditions within FN. We have revised the Results section accordingly to tone down the interpretation and to better reflect the revised data (revised fig. 6E, lines 339-355).

      Significance:

      Upon improvement, this paper will provide an early characterization of the effects of the ECM on melanocyte differentiation. If the link to MITF holds, this will be the first time that mechanosensation has been shown to mediate effects on this transcription factor.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript Luthold et al. describe how extracelluar matrix proteins and mechanosensation affect melanocyte differentiation. In particular, they show that ECM proteins and surface stiffness lead to effects on the MEK/ERK pathway, thus affecting the MITF transcription factor. The manuscript is interesting, well written and the data presented in a clear and easy-to-follow manner. The data are nicely quantitated and largely convincing. However, the discussion of the nuclear location of MITF (Figure 4A) is not convincing. The images presented show that upon exposure to ColI, there is a lot of MITF in the nucleus, a lot less so upon ColIV and none upon FN exposure. However, we only see a snapshot of the cells and thus we do not know if we are witnessing effects on MITF protein synthesis, degradation or nuclear localization (the least likely scenario since M-MITF, the isoform present in melanocytes is predominantly nuclear anyway). Was there a cytoplasmic signal detected? Upon FN treatment, there is no MITF protein visible in the cells. Does this mean that the protein is not made, that it is degraded or present at such low levels that the antibody does not detect it? The claim of the authors that this affects nuclear localization of MITF needs more corroboration. Also, the authors need to show immunocytochemical images for the effects on MITF nuclear localization for the images presented in Figure 5C. It seems that the authors quantitated immune-reactivity for both MITF and YAP. What was the control and how was the data normalized? Similarly, the blots and data shown in Figure 5 are not consistent with the text as described in the results section. The differences observed are minor and the only set that is likely to be significant is the FN-set; the differences between soft, intermediate and stiff of the FN-set do not look significantly different. The description of this in the results section should be toned down accordingly.

      Significance

      Upon improvement, this paper will provide an early characterization of the effects of the ECM on melanocyte differentiation. If the link to MITF holds, this will be the first time that mechanosensation has been shown to mediate effects on this transcription factor.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In their manuscript, Luthold et al describe the behaviour of immortalized mouse melanocytes cultured on various extracellular matrix (ECM) proteins and substrates of different stiffness. They found that fibronectin, collagen IV and collagen I have different effects on melanocyte morphology, migration, and proliferation. They further link these differential effects to MITF localization and MEK/ERK signalling. This work shows that fibronectin supports melanocyte migration, which was associated with a dendritic morphology and correlated with increased MEK/ERK signalling and decreased MITF nuclear localization. In contrast, collagen I promoted melanocyte proliferation with low MEK/ERK signalling, enhanced MITF nuclear localization and high melanin production.

      While this study is well designed and the data adequately presented and interpreted, the impact of its conclusions is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions. To strengthen their manuscript, the authors should consider the following comments:

      Major comments

      1. Characterization of observed phenotypes: The link between matrix-sensing and intracellular signalling is missing. Which types of integrins are expressed by iMCs? Are any of these integrins required for the observed phenotypes? The phenotypic changes described here are interesting but only partially analysed. Transcriptomic studies would yield a more complete view of cell state transitions (optional). At a minimum, could the authors detect any changes in cadherin expression, or in other genes classically involved in phenotype switching, such as twist1, snail or zeb1? Lines 235-236, the authors write that ECM proteins regulate melanocyte behaviour "likely through modulation of MITF localization and activity". Could the authors support the role of MITF experimentally? Genetic experiments using different MITF mutants could address this question. Additionally, how does MEK/ERK signalling control MITF activity in these melanocytes? The trametinib experiment should be consolidated with other inhibitors (including ERK inhibitors) and/or genetic manipulation. Did the authors also measure the effect of trametinib on cell proliferation in Figure 5?
      2. Parallels with physiological conditions: Most experiments shown were performed with immortalized melanocytes even though authors mention the use of primary cells (pMCs, line 148). Were similar results obtained in primary melanocytes? Do human melanocytes in culture behave similarly? Are some of these observations also true in vivo, for example in mouse skin (optional)? How do the authors reconciliate their findings that collagen IV induced melanocyte migration and decreased proliferation and melanin production with the fact that melanocytes in human skin are generally in contact with the collagen IV-rich basement membrane?

      Minor comment

      The evidence that FAK is not responsible for MEK/ERK activation could be presented in the main text rather than in the discussion.

      Significance

      General assessment: This study establishes the cellular impact of different types of extracellular matrix proteins and stiffness conditions relevant to skin biology on the behaviour of untransformed mouse melanocytes. In particular, it shows opposite effects of fibronectin and collagen I on cell proliferation and migration, which could prove relevant to certain skin conditions in human. However, the scope of these results is limited by the incomplete mechanistic characterization of the observed phenotypes and by the lack of parallels with physiological conditions.

      Advance: The systematic comparison of different microenvironmental conditions on normal melanocyte behaviour is novel and opens perspectives to understand the role of melanocytes in some human skin pathologies.

      Audience: The comparison of different environmental conditions on melanocyte behaviour is of interest to the melanocyte biology community and could have implications for basic and clinical understanding of some skin diseases.

      My expertise is in melanoma biology, including the impact of the microenvironment on tumour cell behaviours.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, authors demonstrated the role of ECM-dependent MEK/ERK/MITF signaling pathway that influences the plasticity of MCs (melanocytes) through their interactions with the environment. The findings emphasize the essential role of the extracellular matrix (ECM) in controlling MC function and differentiation, highlighting a critical need for further research to understand the complex interactions between mechanical factors and ECM components in the cellular microenvironment. Overall, the manuscript is concise, written well and shed light on a complex relationship between ECM protein types and substrate stiffness that affects MC mechanosensation. However, understanding detailed molecular mechanisms involved, especially the roles of MITF and other key regulators, is crucial for comprehending MC function and related pathologies. Authors needs to clarify some minor queries to be considered for publication.

      Major comment:

      1. Authors have chosen ERK signaling pathways to test and draw their conclusion based on existing knowledge in the field, as several studies previously reported the role of ECM to modulate the ERK signaling pathway but it would be interesting to test other signaling pathways unbiasedly; e.g. ECM can also regulate Wnt signaling (PMID: 29454361) and connection of MITF and its target gene TYR expression is also regulated by Wnt in context of melanocyte. (PMID: 29454361, PMID: 34878101, PMID: 38020918).
      2. Discussion line 340-344. Please provide the data as it is directly connected to the study, and it would be crucial to interpret data better. As FAK is upregulated and FAK inhibitor did not reduce pERK, is there any possibility that other kinases might involve. Please discuss. Again, authors should check Wnt activation as FAK can activate Wnt signaling in response to matrix stiffness as well. (PMID 29454361).
      3. Rationale for selecting MITF for the study is very weak. Please justify in the discussion why authors have chosen to study MITF/ERK axis with a more logistic approach.
      4. It is suggested to check for the changes in the transcriptomic profile of melanocytes upon culturing on different matrix to get a more comprehensive view associated with the molecular mechanisms involved.
      5. Please provide the protein expression of genes involved in cell cycle progression and/or apoptosis to support the data in Fig. 3D-E.

      Minor comment:

      1. Discussion line 358-359, using term synergy is an overstatement as the collective data do not support the claim. Very little role of matrix stiffness is demonstrated by experimental data.
      2. Method section, BrdU assay and BrdU assay-cell proliferation can be combined in method section.
      3. What trigger melanocytes to respond to different microenvironment. Please discuss.
      4. Fig 3C and 5D Tyr mRNA expression is tested. Authors should also test for the protein expression in the similar set of studies.
      5. Line 217-218, Authors claim stiffness mediated increase of MITF nuclear localization in Col I, however Fig. 4A-B does not represent that claim. Please justify.

      Significance

      Overall, the study is well-planned, the experiments are well-designed and executed with appropriate use of statistical analysis. However, a more in-depth analysis of the molecular mechanisms is necessary to clarify how the extracellular matrix (ECM) regulates ERK or MITF nuclear translocation.

      This study enhances our existing knowledge by linking the well-established role of the extracellular matrix (ECM) in regulating ERK signaling to ERK's involvement in controlling MITF, a key regulator of melanocyte differentiation. It further establishes the ECM's role in controlling melanocyte function and differentiation.

      This study will interest readers working in the field of the tumor microenvironment, as it explores the role of the extracellular matrix and its complexity and stiffness in disease progression, not only in melanoma but also in other types of cancer.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study aims to classify prognostic and subtype-specific eRNAs in breast cancer, highlighting their potential as biomarkers. Data was analysed using existing machine learning algorithms, Data analysis is superficial and it is hard to understand the key significant findings

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers. They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

      Weaknesses include: Most of the findings are purely correlation-based and also based on a reanalysis of published datasets; it would benefit from experimental validation to support their findings. Differential expression analysis of large datasets likely yields some differences in the transcriptome. How significant are these changes? Does the expression of eRNAs affect the expression of genes in cis? Although this analysis would provide some associated gene expression differences, it can also provide some insights into subtype-specific differences in gene expression programs. If the authors find experimental validations are not feasible, I recommend validating the eRNA signature in an independent dataset.

      Here are major points; addressing these points in the revised version is important.

      From Figure 1B, what eRNAs were identified for LumB using log2MC?

      Page 8 However, sensitivity and F-measure .... It would help to include the metrics for the number of patients in each subtype. The ratio of eRNAs/number of cases in each subtype would inform if the number of eRNAs is an outcome of no. of cases or subgroup-specific.

      Page 9 "Altogether, both measurements classify eRNAs efficiently based on subtypes, InfoGain allowed us to distinguish further samples based on high and low expression of eRNAs for basal subtype and performed better in statistical metrics" Based on statistical metrics, both models seem to be performing similarly except for Her2. In Fig. 1B, the F-measure metrics are wrong for basal LogMC, as it is 0.94 rather than 0.54, which could lead to a misinterpretation of the model.

      Many genome browser figures, including Figure S3. TFBS is not at the same site as eRNAs detected. Is there CAGE data to show that binding these TFs at these sites leads to the expression of eRNAs? That will give direct evidence that the eRNAs are transcribed due to these TFs

      Page 10, There were 30 Her2-specific eRNA regions.... Do the same enhancers also regulate these genes as those from which eRNAs are transcribed? Is it cis-effect, or could these affect the trans-regulating of other genes?

      Minor comments:

      Page 8 "InfoGain meausure..." Fig. S2A also shows high and low expressed eRNAs for the basal group

      Page 11, Our analyses also identified the role of another..... The statement is misleading as it is the enrichment of these TFs with the eRNAs

      Page 13, "Around 90% of eRNAs are bidirectional and non-polyadenylated [53]. TCGA expression datasets are based on RNA-seq assays, which capture only non-polyadenylated RNAs. Thus, analysing the expression of eRNAs on mRNA-seq datasets might not be adequate". It is very confusing, please check

      Significance

      This is an important topic and a highly relevant approach to identifying RNA-based biomarkers. They analyse published RNAseq datasets by focusing on molecular subtype-specific eRNAs, enhancing clinical relevance and thereby addressing the heterogeneity of the cancer type (strength of the study).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Enhancer RNAs (eRNAs) are early indicators of transcription factor (TF) activity and can identify distinct molecular subtypes and pathological outcomes in breast cancer. In this study, Patel et al. analysed 302,951 polyadenylated eRNA loci from 1,095 breast cancer patients using RNA-seq data, applying machine learning (ML) to classify eRNAs associated with specific molecular subtypes and survival. They discovered subtype-specific eRNAs that implicate both established and novel regulatory pathways and TFs, as well as prognostic eRNAs -specifically, LumA and HER2-survival- that distinguish favorable from poor survival outcomes. Overall, this ML-based approach illustrates how eRNAs reveal the molecular grammar and pathological implications underlying breast cancer heterogeneity.

      Major comments

      1. The authors define 302,951 eRNA loci based on RNA-seq data, yet it is widely known that many enhancers reside in proximity to promoters or within intronic regions (examples presented in Fig. 3B and S3). Consequently, it seems likely that reads mapped to these regions might not truly represent eRNA signals but include mRNA contamination. Could the authors clarify how they ensured that the identified eRNAs were not confounded by mRNA reads? What fraction of these enhancer loci is promoter proximal or intronic? How does H3K4me3, a well-established and standardized active promoter histone mark, behave on these loci? The reviewer considers it important to confirm that the identified eRNAs are indeed of enhancer origin rather than promoter transcripts.
      2. In Fig. 1B, the F measure (0.540) of the Basal subtype using the Logmc method contradicts its extremely high precision (1.000) and sensitivity (0.890). The authors need to clarify the exact formula or method used to compute F1 and the discrepancy in the reported metrics for this subtype and perhaps other subtypes as well.
      3. As shown in Fig. 4C, S4B, and most, if not all, tracks of Fig. S3, ER binding regions are not annotated as eRNA loci. It seems, in this reviewer's opinion, very unlikely that this is because they generally lack eRNA expression, but rather they do not express polyadenylated eRNA (typically 1D eRNA), which is captured in this dataset. The reviewer posits that these enhancers produce more transient, non-polyadenylated 2D eRNA. It has been widely documented in prior studies that ER-bound enhancers exhibit bimodal eRNA expression patterns [e.g., Li, W. et al. Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation. Nature 498, 516-520 (2013)]. Could the authors address this opinion and elaborate on how the restriction to polyadenylated transcripts might underrepresent enhancers regulated by ER and other TFs and whether this bias impacts the overall findings?
      4. Despite the unsatisfied performance of the ML approach on classifying Her2 subtypes, the hierarchical clustering performed in Fig. 2A and S2A appears to show a reasonable separation of Her2 subtypes, showing as a clustered green band. Could the authors quantitatively assess how effective this clustering results and compare that to the ML outcome? (OPTIONAL)
      5. In Fig. 4 and S4, the authors reported to have enriched binding or motif of TFs, e.g., FOXA1, AP-2, and E2A, specifically at enhancer loci with low eRNA level, which conflicts with their established roles as transcriptional activators. The reviewer asks for an address as to why these factors would be associated with basal low-eRNA regions and whether any additional data might clarify their functional role in these contexts.
      6. Regarding Fig. 4B, the authors state that "ER binding occupies only the strongest ssDNA and GRO-seq-positive sites". Firstly, the GRO-seq data quality is poor with indiscernible peaks. This may be insufficient for a qualified representation of nascent eRNA expression. More importantly, it appears each heatmap is ranked independently, so top loci for ssDNA are not necessarily top loci for GRO-seq, ER, Pol-II, or H3K27ac. The reviewer requests clarification on how the authors plot these heatmaps and questions whether the statement is supported by the analysis as presented.
      7. In Fig. S4B and the third plot of 4C, the averaged histogram of ER binding appears in multiple sharp peaks with drastic asymmetric positioning around the enhancer centre, which is highly atypical of most published ER ChIP-seq profiles. Could the authors discuss possible "spatial syntax" or directional patterns of ER binding in relation to eRNA loci and cite any literature showing a similar pattern? Further evidence is required to substantiate these observations, as they are remarkably unique.

      Minor comments

      1. When introducing eRNAs, the reviewer recommends mentioning that 1) eRNA levels correlate with enhancer activity and 2) eRNA expression precedes target gene transcription, thus reflecting upstream regulatory events. Relevant references include: Arner, E. et al. Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells. Science 347, 1010-1014 (2015); Carullo, N. V. N. et al. Enhancer RNAs predict enhancer-gene regulatory links and are critical for enhancer function in neuronal systems. Nucleic Acids Res. 48, 9550-9570 (2020); Kaikkonen, Minna U. et al. Remodeling of the Enhancer Landscape during Macrophage Activation Is Coupled to Enhancer Transcription. Mol. Cell 51, 310-325 (2013).
      2. H3K27ac is used initially to define these regulatory loci, and like eRNAs, H3K27ac also varies among patients. Which H3K27ac dataset(s) were used initially, and could this approach potentially overlook patient-specific enhancers? (OPTIONAL)
      3. In addition to the overall metrics displayed in Fig. 2B, could the authors provide precision and sensitivity values for LumA and LumB separately under the Logmc method, given the observation in Fig. 2E that LumA and LumB are not well separated in the UMAP projection?
      4. Could the author elaborate, in the discussion session, on why there is a substantial difference in ML performance depending on whether InfoGain or Logmc is used?
      5. How does the expression pattern of Basal high, Basal low, Her2, and Lum eRNA clusters behave differentially in Basal, Her2, and LumA/B subtypes? Are Basal high eRNAs downregulated in Her2 or Lum subtypes, and vice versa? Since many downstream analyses rely on these eRNA clusters, it is suggested to include a heatmap and/or boxplot that displays how each eRNA category is expressed in each subtype to confirm that these definitions are consistent.

      Referee cross-commenting

      I share Reviewer #1's opinion that the manuscript should assess whether mRNA or eRNA is the stronger predictor of breast cancer subtypes and clinical outcomes. It will greatly improve the novelty if eRNA is shown to be a better indicator for cancer characterization.

      Also, I strongly concur with Reviewer #3 that the current informatics approach is superficial and that several conclusions are contentious. The authors need to resolve the inconsistencies in their ML statistics and the potentially misleading interpretations of the ChIP‑seq and motif‑enrichment results.

      It is further recommended that, building Reviewer #3's comment, the study integrate eRNA signatures with their proximal genes to address 1) whether genes located near these enhancers are differentially expressed-and correlated with enhancer activity-across cancer subtypes, and 2) whether it provides insights into understanding the enhancer-gene regulatory architecture in a subtype-specific context.

      Significance

      General Assessment

      This study provides insights into the potential use of eRNA to classify breast cancer subtypes and refine prognostic markers. A strength is the integration of large-scale RNA-seq data with machine learning to identify eRNA signatures in biologically-meaningful patient samples, revealing both established and novel TF networks. The study also discovered eRNA clusters that correlate with the survival of patients, thus providing strong clinical implications. However, the ML approach yields several inconsistencies-for instance, unsatisfactory classification results for the Her2 subtype as well as the confused statistical metrics in the results. Furthermore, the ML model struggles to differentiate more nuanced molecular classes (e.g., LumA vs. LumB) and higher-level histological subtypes (e.g., lobular vs. ductal), thus limiting its power to dissect more delicate pathological and molecular mechanisms. Another limitation worth noting of this ML approach is the exclusive use of only polyadenylated eRNAs via RNA-seq, which excludes perhaps the more prominent 2D eRNA expressed in regulatory enhancers. Moreover, certain datasets appear to be of suboptimal quality, leading to assertions that would benefit from additional supporting evidence. Altogether, while the study offers a promising angle on eRNA-based tumor stratification, more robust experimental validations are needed to resolve inconsistencies and clarify the mechanistic underpinnings.

      Advance

      Conceptually, the study highlights the potential for eRNA-based signatures to capture regulatory variation beyond classical markers. However, the utility of these signatures is constrained by the focus on polyadenylated transcripts alone, likely underrepresenting key enhancer regions, and certain evidence presented in this study is not substantial enough to support some statements. While the work adds an important dimension to the understanding of enhancer biology in breast cancer, the resulting insights are partly hampered by limitations in data coverage and quality.

      Audience

      The primary audience includes cancer epigenetics, functional genomics, and bioinformatics researchers who are interested in leveraging eRNAs as biomarkers and dissecting complex regulatory networks in breast cancer. Clinically oriented scientists focusing on molecular diagnostics may also find relevance in the authors' approach to stratify subtypes and outcomes. The research is most relevant to a specialized audience within basic and translational cancer genomics, as well as computational biology groups interested in eRNA analysis.

      Field of Expertise

      I evaluate this manuscript as a researcher specializing in cancer epigenetics, functional genomics, and NGS-based data analysis. Parts of the manuscript touching on clinical outcome measures may require additional review from practicing oncologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study assesses eRNA activity as a classifier of different subtypes of breast cancer and as a prognosis tool. The authors take advantage of previously published RNA-seq data from human breast cancer samples and assess it more deeply, considering the cancer subtype of the patient. They then apply two machine learning approaches to find which eRNAs can classify the different breast cancer subtypes. While they do not find any eRNA that helps distinguish ductal vs. lobular breast cancers, their approach helps identify eRNAs that distinguish luminal A, B, basal and Her2+ cancers. They also use motif enrichment analysis and ChIP-seq datasets to characterize the eRNA regions further. Through this analysis, they observe that those eRNAs where ER binds strongest are associated with a poor patient prognosis.

      Major comments

      • Part of the rationale for this study is the previous observation that eRNAs are less associated with the prognosis of breast cancer patients in comparison to mRNAs and they claim that the high heterogeneity between breast cancer subtypes would mask the importance of eRNAs. In this study, the authors solely focus on eRNAs as a classification of breast cancer subtypes and prognostic tool and do not answer whether eRNAs or mRNAs are a better predictor of cancer subtypes and of prognosis. Since the answer and the tools are already in their hands, it would be important to also see a comparative analysis where they assess which of the two (mRNAs or eRNAs) is a better predictor.
      • The authors run the umaps of Fig. 1C only taking the predictor eRNAs. It is then somewhat expected to observe a separation. Coming from a single-cell omics field, what I would suggest is to take the eRNA loci and compute a umap with the highly variable regions, perform clustering on it and assess how the cancer subtypes are structured within the data. This would give a first overview of how much segregation and structure one can have with this data. Having a first step of data exploration would also strengthen the paper. If the authors have tried it, could the authors comment on it?
      • 'neither measures could classify any distinct eRNAs for invasive ductal vs lobular cancer samples' S1B. Just by eye, I can see a potential enrichment of ductal on the left and on the right while lobular stays in the center. This suggests to me that, while perhaps each eRNA alone does not have the power to classify the lobular vs ductal subtype, perhaps there is a difference - which could result from a cooperative model of eRNA influence - that would need further exploration. Would a PCA also show enrichments of ductal vs. lobular in specific parts of the plot? It may be worth exploring the PC loadings to see which eRNAs could play an influence. In this regard, a more unbiased visual examination, as suggested in my previous point, could help clarify whether there could be an association of certain eRNAs that cannot be captured by ML.
      • "we employed machine learning approaches on 302,951 eRNA loci identified from RNA-seq datasets from 1,095 breast cancer patient samples from previous studies" - the previous studies from which the authors take the data [11,12] highlight the presence of ~60K enhancers in the human genome and they use less than that in their analysis. Could the authors please clarify the differences in numbers with previous studies and give a reasoning? Also, from the methods section, they discard many patient samples due to low QC, so, from what I understand, the number of samples analyzed in the end is 975 and not 1,095.

      Minor comments

      • Can the authors please state the parameters of the umap in methods? Although it could be intrinsic to the dataset, data points are grouped in a way that makes me think that the granularity is too forced. Could the authors please show how the umap would behave with more lenient parameters? Or even with PCA?
      • 'Majority of the basal' -> The majority of the basal.

      Significance

      This is a paper relevant in the cancer field, particularly for breast cancer research. The significance of the paper lies in digging into the breast cancer samples, taking the different existing subtypes into account to assess the contribution of eRNAs as a classifier and as a prognostic tool. The data is already available but it has not been studied to this degree of detail. It highlights the importance of characterizing cancer samples in more depth, considering its intrinsic heterogeneity, as averaging across different subtypes would mask biology. My expertise lies in gene regulation and single-cell omics. My contribution will therefore be more focused on the analysis and extraction of biological information. The extent of its specific relevance in cancer research falls beyond my expertise.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors have done cryo-electron tomography of the manchette, a microtubule-based structure important for proper sperm head formation during spermatogenesis. They also did mass-spectrometry of the isolated structures. Vesicles, actin and their linkers to microtubules within the structure are shown.

      __We thank the reviewer for the critical reading of our manuscript; we have implemented the suggestions as detailed below, which we believe indeed improved the manuscript. __

      Major:

      The data the conclusions are based on seem very limited and sometimes overinterpreted. For example, only one connection between actin and microtubules was observed, and this is thought to be MACF1 simply based on its presence in the MS.

      __We regret giving the impression that the data is limited. We in fact collected >100 tilt series from 3 biological replicas for the isolated manchette. __

      __In the revised version, we added data from in-situ studies showing vesicles interacting with the manchette (as requested below, new Fig. 1). __

      Specifically, for the interaction of actin with microtubule we added more examples (Revised Fig. 6) and we toned down the discussion related to the relevance of this interaction (lines 193-194, 253-255). MACF1 is mentioned only as a possible candidate in the discussion (line 254).

      Another, and larger concern, is that the authors do a structural study on something that has been purified out of the cell, a process which is extremely disruptive. Vesicles, actin and other cellular components could easily be trapped in this cytoskeletal sieve during the purification process and as such, not be bona fide manchette components. This could create both misleading proteomics and imaging. Therefore, an approach not requiring extraction such as high-pressure freezing, sectioning and room-temperature electron tomography and/or immunoEM on sections to set aside this concern is strongly recommended. As an additional bonus, it would show if the vesicles containing ATP synthase are deformed mitochondria.

      __We recognise the concern raised by the reviewer. __

      __To alleviate this concern, we added imaging data of manchettes in-situ that show vesicles, mitochondria and filaments interacting with the manchette (new Fig. 1), essentially confirming the observations that were made on the isolated manchette. __

      __The benefits of imaging the isolated manchette were better throughput (being able to collect more data) and reaching higher resolution allowing to resolve unequivocally the dynein/dynactin and actin filaments. __

      Minor: Line 99: "to study IMT with cryo-ET, manchettes were isolated ...(insert from which organism)..."

      __Added in line 102 in the revised version. __

      Line 102 "...demonstrating that they can be used to study IMT".. can the authors please clarify?

      This paragraph was revised (lines 131-137), we hope it is now more clear.

      Line 111 "densities face towards the MT plus-end" How can a density "face" anywhere? For this, it needs to have a defined front and back.

      Microtubule motor proteins (kinesin and dynein) are often attached to the microtubules with an angle and dynactin and cargo on one side (plus end). We rephrased this part and removed the word “face” in the revised version to make it more clear (lines 161-162).

      Line 137: is the "perinuclear ring" the same as the manchette?

      The perinuclear ring is the apical part of the manchette that connects it to the nucleus. We added to the revised version imaging of the perinuclear ring with observations on how it changes when the manchette elongates (new Fig. 2).

      Figure 2B: How did the authors decide not to model the electron density found between the vesicle and the MT at 3 O'clock? Is there no other proteins with a similar lollipop structure as ATP synthase, so that this can be said to be this protein with such certainty?

      __The densities connecting the vesicles to the microtubules shown in (now) Fig. 4D are not consistent enough to be averaged. __

      __The densities resembling ATP synthase are inside the vesicles. Nevertheless, we have decided to remove the averaging of the ATP synthases from the revised manuscipt as they are not of great importance for this manuscript. Instead, the new in-situ data clearly show mitochondria (with their characteristic double membrane and cristae) interacting with manchette microtubule (new Fig 1C). __

      Line 189: "F-actin formed organized bundles running parallel to mMTs" - this observation needs confirming in a less disrupted sample.

      __Phalloidin (actin marker) was shown before to stain the manchette (PMID: 36734600). As actin filaments are very thin (7 nm) they are very hard to observe in plastic embedded EM. __

      In the in-situ data we added to the revised manuscipt (new Fig 1D), we observe filaments with a diameter corresponding to actin. In addition, we added more examples of microtubules interacting with actin in isolated manchette (new Fig. 6 E-K).

      Line 242 remove first comma sign.

      Removed.

      Line 363 "a total of 2 datasets" - is this manuscript based on only two tilt-series? Or two datasets from each of the 4 grids? In any case, this is very limited data.

      We apologise for not clearly providing the information about the data size in the original manuscipt. The data is based on three biological replicas (3 animals). We collected more than 100 tomograms of different regions of the manchettes. As such, we would argue that the data is not limited per se.

      Reviewer #1 (Significance (Required)):

      The article is very interesting, and if presented together with the suggested controls, would be informative to both microtubule/motorprotein researchers as well as those trying studying spermatogenesis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manchette appears as a shield-like structure surrounding the flagellar basal body upon spermiogenesis. It consists of a number of microtubules like a comb, but actin (Mochida et al. 1998 Dev. Biol. 200, 46) and myosin (Hayasaka et al. 2008 Asian J. Androl. 10, 561) were found, suggesting transportation inside the manchette. Detailed structural information and functional insight into the manchette was still awaited. There is a hypothesis called IMT (intra-machette transport) based on the fact that machette and IFT (intraflagellar transport) share common components (or homologues) and on their transition along the stages of spermiogenesis. While IMT is considered as a potential hypothesis to explain delivery of centrosomal and flagellar components, no one has witnessed IMT at the same level as IFT. IMT has never been purified, visualized in motion or at high resolution. This study for the first time visualized manchette using high-end cryo-electron tomography of isolated manchettes, addressing structural characterization of IMT. The authors successfully microtubular bundles, vesicles located between microtubules and a linker-like structure connecting the vesicle and the microtubule. On multilamellar membranes in the vesicles they found particles and assigned them to ATPase complexes, based on intermediate (~60A) resolution structure. They further identified interesting structures, such as (1) particles on microtubules, which resemble dynein and (2) filaments which shows symmetry of F-actin. All the molecular assignments are consistent with their proteomics of manchettes.

      __We thank the reviewer for highlighting the novelty of our study.____ __

      Their assignment of ATPase will be strengthened by MS data, if it proves absence of other possible proteins forming such a membrane protein complex.

      All the ATPase components were indeed found in our proteomics data. Nevertheless, we have decided to remove the averaging of the ATPase as it does not directly relate to IMT, the focus of this manuscript.

      They discussed possible role of various motor proteins based on their abundance (Line 134-151, Line 200). This makes sense only with a control. Absolute abundance of proteins would not necessarily present their local importance or roles. This reviewer would suggest quantitative proteomics of other organelles, or whole cells, or other fractions obtained during manchette isolation, to demonstrate unique abundance of KIF27 and other proteins of their interest.

      We agree with the reviewer that absolute abundance does not necessarily indicate importance or a role. As such, we removed this part of the discussion from the revised manuscript.

      A single image from a tomogram, Fig.6B, is not enough to prove actin-MT interaction. A gallery and a number (how many such junctions were found from how many MTs) will be necessary.

      We agree that one example is not enough. In the new Fig. 6E-K, we provide a gallery of more examples. We have revised the text to reflect the point that these observations are still rare and more data will be needed to quantify this interaction (Lines 253-254).

      Minor points: Their manchette purification is based on Mochida et al., which showed (their Fig.2) similarity to the in vivo structure (for example, Fig.1 of Kierszenbaum 2001 Mol. Reproduc. Dev. 59, 347). Nevertheless, since this is not a very common prep, it is helpful to show the isolated manchette’s wide view (low mag cryo-EM or ET) to prove its intactness.

      We thank the reviewer for this suggestion, in the revised version, new Fig. 2 provides a cryo-EM overview of purified manchette from different developmental stages.

      Line 81: Myosin -> myosin (to be consistent with other protein names)

      Corrected.

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

      Reviewer #2 (Significance (Required)):

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      ->Summary:

      The manchette is a temporary microtubule (MT)-based structure essential for the development of the highly polarised sperm cell. In this study, the authors employed cryo-electron tomography (cryo-ET) and proteomics to investigate the intra-manchette transport system. Cryo-EM analysis of purified rat manchette revealed a high density of MTs interspersed with actin filaments, which appeared either bundled or as single filaments. Vesicles were observed among the MTs, connected by stick-like densities that, based on their orientation relative to MT polarity, were inferred to be kinesins. Subtomogram averaging (STA) confirmed the presence of dynein motor proteins. Proteomic analysis further validated the presence of dynein and kinesins and showed the presence of actin crosslinkers that could bundle actin filaments. Proteomics data also indicated the involvement of actin-based transport mediated by myosin. Importantly, the data indicated that the intraflagellar transport (IFT) system is not part of the intra-manchette transport mechanism. The visualisation of motor proteins directly from a biological sample represents a notable technical advancement, providing new insights into the organisation of the intra-manchette transport system in developing sperm.

      We thank the reviewer for summarising the novelty of our observations.

      -> Are the key conclusions convincing? Below we comment on three main conclusions. MT and F-actin bundles are both constituents of the manchette While the data convincingly shows that MT and F-actin are part of the manchette, one cannot conclude from it that F-actin is an integral part of the manchette. The authors would need to rephrase so that it is clear that they are speculating.

      We have rephrased our statements and replaced “integral” with ‘actin filaments are associated’. Of note previous studies suggested actin are part of the manchette including staining with phalloidin (PMID: 36734600, PMID: 9698455, PMID: 18478159) and we here visualised the actin in high resolution.

      The transport system employs different transport machinery on these MTs Proteomics data indicates the presence of multiple motor proteins in the manchette, while cryo-EM data corroborates this by revealing morphologically distinct densities associated with the MTs. However, the nature of only one of these MT-associated densities has been confirmed-specifically, dynein, as identified through STA. The presence of kinesin or myosin in the EM data remains unconfirmed based on just the cryo-ET density, and therefore it is unclear whether these proteins are actively involved in cargo transport, as this cannot be supported by just the proteomics data. In summary, we recommend that the authors rephrase this conclusion and avoid using the term "employ".

      We agree that our cryo-ET only confirmed the motor protein dynein. As such, we removed the term employ and rephrased our claims regarding the active transport and accordingly changed the title.

      Dynein mediated transport (Line 225-227) The data shows that dynein is present in the manchette; however, whether it plays and active role in transport cannot be determined from the cryo-ET data provided in the manuscript, as it does not clearly display a dynein-dynactin complex attached to cargo. The attachment to cargo is also not revealed via proteomics as no adaptor proteins that link dynein-dynactin to its cargo have been shown.

      A list of cargo adaptor proteins were found in our proteomics data but we agree that cryo-ET and proteomics alone cannot prove active transport. As such we toned down the discussion about active transport (lines 212-220).

      -> Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      F-actin • In the abstract, the authors state that F-actin provides tracks for transport as well as having structural and mechanical roles. However, the manuscript does not include experiments demonstrating a mechanical role. The authors appear to base this statement on literature where actin bundles have been shown to play a mechanical role in other model systems. We suggest they clarify that the mechanical role the authors suggest is speculative and add references if appropriate.

      __ ____We removed the claim about the mechanical role of the actin from the abstract and rephrased this in the discussion to suggest this role for the F-actin (lines 242-243).__

      • Lines 15,92, 180 and 255: The statement "Filamentous actin is an integral part of the manchette" is misleading. While the authors show that F-actin is present in their purified manchette structures, whether it is integral has not been tested. Authors should rephrase the sentence.

      We removed the word integral.

      • To support the claim that F-actin plays a role in transport within the manchette, the authors present only one instance where an unidentified density is attached to an actin filament. This is insufficient evidence to claim that it is myosin actively transporting cargo. Although the proteomics data show the presence of myosin, we suggest the authors exercise more caution with this claim.

      We agree that our data do not demonstrate active transport as such we removed that claim. We mention the possibility of cargo transport in the discussion (lines 250-255).

      • The authors mention the presence of F-actin bundles but do not show direct crosslinking between the F-actin filaments. They could in principle just be closely packed F-actin filaments that are not necessarily linked, so the term "bundle" should be used more cautiously.

      We do not assume that a bundle means that the F-actin filaments are crosslinked. A bundle simply indicates the presence of multiple F-actin filaments together. We rephrased it to call them actin clusters.

      Observations of dynein • Relating to Figure 2B: From the provided image it is not clear whether the density corresponds to a dynein complex, as it does not exhibit the characteristic morphological features of dynein or dynactin molecules.

      We indeed do not claim that the densities in this figure are dynein or dynactin. __We revised this paragraph and hope that it is now more clear (lines 135-137). __

      • Lines 171-172 and Figure 4: It is well established that dynein is a dimer and should always possess two motor domains. The authors have incorrectly assumed they observed single motor heads, except possibly in Figure 4A (marked by an arrow). In all other instances, the dynein complexes show two motor domains in proximity, but these have not been segmented accurately. Furthermore, the "cargos" shown in grey are more likely to represent dynein tails or the dynactin molecule, based on comparisons with in vitro structures of these complexes (see references 1-3).

      We thank the reviewer for this correction. We improved the annotations in the figure and revised the text to clarify that we identified dimers of dynein motor heads (lines 140-144). We further added a projection of a dynein dynactin complex to compare to the observation on the manchette (new Fig. 5E). We further changed claims on the presence of protein cargo to the presence of dynein/dynactin that allows cargo tethering based on the presence of cargo adaptors in the proteomics data.

      • Lines 21, 173, and 233 mention cargos, but as noted above, it seems to be parts of the dynein complex the authors are referring to.

      This was corrected as mentioned above.

      • Panel 4B appears to show a dynein-dynactin complex, but whether there is a cargo is unclear and if there is it should be labelled accordingly. To assessment of whether there is any cargo bound to the dynein-dynactin complex a larger crop of the panel would be helpful In summary, we recommend that the authors revisit their segmentations in Figures 2B and 4, revise their text based on these observations, and perform quantification of the data (as suggested in the next section).

      We thank the reviewers for sharing their expertise on dynein-dynactin complexes. We have revised the text as detailed above and excluded the assignment of any cargo, as we cannot (even from larger panels) see a clear association of cargo. We have made clear that we only refer to dynein dynactin with the capability of linking cargo based on the presence of proteomics data. We have removed claims on active transport with dynein.

      Dynein versus kinesin-based transport The calculation presented in lines 147-151 does not account for the fact that both the dynein-dynactin complex and kinesin proteins require cargo adaptors to transport cargo. Additionally, the authors overlook the possibility that multiple motors could be attached to a single cargo. If the authors did not observe this, they should explicitly mention it to support their argument. In short, the calculations are based on an incorrect premise, rendering the comparison inaccurate. Unless the authors have identified any dynein-dynactin or kinesin cargo adaptors in their proteomics data which could be used for such a comparison, we believe the authors lack sufficient data to accurately estimate the "active transport ratio" between dynein and kinesin.

      Even though we detect cargo adaptors in our proteomics, we agree that calculating relative transport based only on the proteomics can be inaccurate as such we removed absolute quantification and comparison between dynein and kinesin-based IMT.

      • Would additional experiments be essential to support the claims of the paper?

      F-actin distance and length distribution • To support the claim that F-actin is bundled (line 189), could the authors provide the distance between each F-actin filament and its neighbours? Additionally, could they compare the average distance to the length of actin crosslinkers found in their proteomics data, or compare it to the distances between crosslinked F-actin observed in other research studies?

      We measured distances between the actin filaments and added a plot to new Fig 6.

      • While showing that F-actin is important for the manchette would require cellular experiments, authors could provide quantification of how frequently these actin structures are observed in comparison to MTs to support their claims that these actin filaments could be important for the manchette structure.

      We agree that claims on the role and function of actin in the manchette require cellular experiments that are beyond the scope of this study. Absolute quantification of the ratio between MTs and actin from cryoET is very hard and will be inaccurate as the manchette cannot be imaged as a whole due to its size and thickness. The ratio we have is based on the relative abundance provided by the proteomics (Fig. 5F).

      • In line 193, the authors claim that the F-actin in bundles appears too short for transport. Could they provide length distributions for these filaments? This might provide further support to their claim that individual F-actin filaments can serve as transport tracks (line 266).

      __In addition to the limitation mentioned in the previous point, quantification of length from high magnification imaging will likely be inaccurate as the length of the actin in most cases is bigger than the field of view that is captured. Nevertheless, we removed the claim about the actin being too short for transport. __

      • Could the authors also quantify the abundance of individual F-actin filaments observed, compared to MTs and F-actin bundles, to support the idea that they could play a role in transport?

      As explained for the above points absolute quantification of the ratio between MTs and actin is not feasible from cryoET data that cannot capture all of the manchette in high enough resolution to resolve the actin.

      • In the discussion, the authors mention "interactions between F-actin singlets and mMTs" (line 269), yet they report observing only one instance of this interaction (lines 210 and 211). Given the limited data, they should refer to this as a single interaction in the discussion. The scarcity of data raises questions about how representative this event truly is.

      We agree that one example is not enough. In the new Fig. 6E-K, we provide a gallery of more examples as also requested by reviewers 1 and 2. We have also revised the text to reflect the point that these observations are still rare (Lines 190-194).

      Quantifications for judgement of representativity The authors should quantify how often they observed vesicles with a stick-like connection to MTs (lines 106-107); this would strengthen the interpretation of the density, as currently only one example is shown in the manuscript (Figure 4A). If possible, they could show how many of them are facing towards the MT plus end.

      __As mentioned in the text (lines 135-137), the linkers connecting vesicles to MTs were irregular and so we could not interpret them further this is in contrast to dynein that were easily recognisable but were not associated with vesicles. __

      Dynein quantifications • The authors are recommended to quantify how many dynein molecules per micron of MT they observe and how often they are angled with their MT binding domain towards the minus-end.

      As the manchette is large and highly dense any quantification will likely be biased towards parts of the manchette that are easier to image, for example the periphery. As such we do not think quantifying the dynein density will yield meaningful insight.

      • Could the authors quantify how many dynein densities they found to be attached to a (vesicle) cargo, if any (line 175)? They could show these observations in a supplementary figure.

      We did not observe any case of a connection between a vesicle and dynein motors, we edited this sentence to be more clear on that.

      • For densities that match the size and location of dynein but lack clear dynein morphology (as seen in Figure 2B), could the authors quantify how many are oriented towards the MT minus end?

      We had many cases where the connection did not have a clear dynein morphology, and as the morphology is not clear, it is impossible to make a claim about whether they are oriented towards the minus end.

      Artefacts due to purification: Authors should discuss if the purification could have effects on visualizing components of the manchette. For example, if it has effect on the MTs and actin structure or the abundance/structure of the motor protein complexes (bound to cargo or isolated).

      We have followed a protocol that was published before and showed the overall integrity of the manchette. Nevertheless, losing connections between manchette and other cellular organelles are expected. To address this point, we added in-situ data (new Fig 1) showing manchette in intact spermatids interacting with vesicles and mitochondria, as well as overviews of manchettes (new Fig 2), the text was revised accordingly.

      • Are the experiments adequately replicated and statistical analysis adequate? The cryo-ET data presented in the manuscript is collected using two separate sample preparations. Along with the quantifications of the different observations suggested above which will help the reader assess how abundant and representative these observations are, the authors could further strengthen their claims by acquiring data from a third sample preparation and then analysing how consistent their observations are between different purifications. This however could be time consuming so it is not a major requirement but recommended if possible within a short time frame.

      We regret not explicitly mentioning our data set size, it was added now to the revised version. In essence, the data is based on three biological replicas (3 animals). We collected more than 100 tomograms of different regions of the manchettes. We provided in the revised version more observations (new Fig 1, 2, 4B-C and 6E-K).

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Most of the comments deal with either modifying the text or analysing the data already presented, so the revision could be done with 1-3 months.


      Minor comments: - Specific experimental issues that are easily addressable. 1) Could the authors state how many tilt series were collected for each dataset/independent sample preparation? We recommend that they upload their raw data or tomograms to EMPAIR.

      We added this information in the material and methods.

      2) It is not clear to me if the same sample was used for cryo-ET and proteomics. Could the authors clarify how comparable the sample preparation for the cryo-ET and proteomics data is or if the same sample was used for both. If there is a discrepancy between these preparations, they would need to discuss how this can affect comparing observations from cryo-ET and mass spectrometry. Ideally both samples should be the same.

      After sample preparation the manchettes were directly frozen on grids. The rest of the samples was used for proteomics. Consequently, EM and MS data were acquired on the same samples. We clarified this in the text (lines 327-328).

      • Are prior studies referenced appropriately? We recommend including additional references to support the claim that F-actin has a mechanical role (line 242). Could the authors compare their proteomics data to other mass spectrometry studies conducted on the Manchette (for example, see reference 4)?

      We added the comparison but it is important to point out that in reference 4 the manchettes were isolated from mice testes.

      • Are the text and figures clear and accurate? Text: We do not see the necessity of specifying the microtubules (MTs) in the data as "manchette MTs" or "mMTs" rather than simply "MTs". However, we recommend that the authors use either "MT" or "mMT" consistently throughout the manuscript.

      We changed to only MTs.

      The authors appear to refer to both dynein-1 (cytoplasmic dynein) and dynein-2 (axonemal dynein or IFT dynein). To avoid confusion, it is important that the authors clearly specify which dynein they are referring to throughout the text. This is particularly relevant as the study aims to demonstrate that IFT is not part of the manchette transport system.

      • Introduction: In the third paragraph (lines 59-75), the authors should specify that they are referring to dynein-2, which is distinct from cytoplasmic dynein discussed in the previous paragraph (lines 44-58).

      We specify the respective dyneins in the text (line 66,140-141,145).

      • Figure 4D: The authors could fit a dynein-1 motor domain instead of a dynein-2 into the density to stay consistent with the fact that the density belongs to cytoplasmic dynein-1.

      __We changed the figure and fitted a cytosolic dynein-1 structure (5nvu) instead. __

      Figures: • Figure 2B: The legend mentions a large linker complex; however, this may correspond to two or three separate densities.

      We have addressed this and changed the wording.

      • Figure 4: please revisit the segmentation of this whole figure based on previous comments.

      __We revised as suggested. __

      • Figures 1, 2, 4, 5, and 6: It would be helpful to state in the legends that the tomograms are denoised. There are stripe-like densities visible in the images (e.g., in the vesicle in Figure 2B). Do these artefacts also appear in the raw data?

      As stated in the Methods section, tomograms were generally denoised with CryoCare for visualisation purposes. The “stripe-like densities” are artefacts of the gold fiducials used for tomogram alignment and appear in the raw data (before denoising).

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? We suggest revising the paragraph title "Dynein-mediated cargo along the manchette" (line 165) to "Dynein-mediated cargo transport along the manchette".

      __We have changed this in the revised version. __

      We recommend that the authors provide additional evidence to support the interpretation that the observed EM densities correspond to motor proteins. Specifically: • Include scale bars or reference lines indicating the known dimensions of motor proteins, based on previous data, to demonstrate that the observed densities match the expected size.

      The dynein structure is provided for reference. We also added the cytosolic dynein–dynactin as a reference (Fig 5E).

      • Make direct comparisons to existing EM data and highlight morphological similarities.

      We have added a comparison to existing data (Fig 5E).

      In the discussion (lines 249-254), the authors could speculate on alternative roles for the IFT components in the manchette, particularly if they are not part of the IFT trains. We also suggest rephrasing the claim in line 266 to make it more speculative in tone.

      __We have addressed this in the revised version (lines 221-230). __

      Finally, a schematic overview of the manchette ultrastructure in a spermatid would greatly aid the reader in understanding the material presented.

      We now include a graphical abstract and overviews of isolated manchettes on cryo-EM grids.

      References: 1. Chowdhury, S., Ketcham, S., Schroer, T. et al. Structural organization of the dynein-dynactin complex bound to microtubules. Nat Struct Mol Biol 22, 345-347 (2015). https://doi.org/10.1038/nsmb.2996

      1. Grotjahn, D.A., Chowdhury, S., Xu, Y. et al. Cryo-electron tomography reveals that dynactin recruits a team of dyneins for processive motility. Nat Struct Mol Biol 25, 203-207 (2018). https://doi.org/10.1038/s41594-018-0027-7

      2. Chaaban, S., Carter, A.P. Structure of dynein-dynactin on microtubules shows tandem adaptor binding. Nature 610, 212-216 (2022).https://doi.org/10.1038/s41586-022-05186-y

      3. W. Hu, R. Zhang, H. Xu, Y. Li, X. Yang, Z. Zhou, X. Huang, Y. Wang, W. Ji, F. Gao, W. Meng, CAMSAP1 role in orchestrating structure and dynamics of manchette microtubule minus-ends impacts male fertility during spermiogenesis, Proc. Natl. Acad. Sci. U.S.A. 120 (45) e2313787120, https://doi.org/10.1073/pnas.2313787120 (2023).

      Reviewer #3 (Significance (Required)):

      This study employs cryo-electron tomography (cryo-ET) and proteomics to elucidate the architecture of the manchette. It advances our understanding of the components involved in intracellular transport within the manchette and introduces the following technical and conceptual innovations:

      a) Technical Advances: The authors have visualized the manchette at high resolution using cryo-ET. They optimized a purification pipeline capable of retaining, at least partially, the transport machinery of the manchette. Notably, they observed dynein and putative kinesin motors attached to microtubules-a significant achievement that, to our knowledge, has not been reported previously.

      b) Conceptual Advances: This study provides novel insights into spermatogenesis. The findings suggest that intraflagellar transport (IFT) is unlikely to play a role at this stage of sperm development while shedding light on alternative transport systems. Importantly, the authors demonstrate that actin filaments organize in two distinct ways: clustering parallel to microtubules or forming single filaments.

      This work is likely to be of considerable interest to researchers in sperm development and structural biology. Additionally, it may appeal to scientists studying motor proteins and the cytoskeleton.

      We thank the reviewers for appreciating the significance and novelty of our study.

      The reviewers possess extensive expertise in in situ cryo-electron tomography and single-particle microscopy, including work on dynein-based complexes. Collectively, they have significant experience in the field of cytoskeleton-based transport.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manchette is a temporary microtubule (MT)-based structure essential for the development of the highly polarised sperm cell. In this study, the authors employed cryo-electron tomography (cryo-ET) and proteomics to investigate the intra-manchette transport system. Cryo-EM analysis of purified rat manchette revealed a high density of MTs interspersed with actin filaments, which appeared either bundled or as single filaments. Vesicles were observed among the MTs, connected by stick-like densities that, based on their orientation relative to MT polarity, were inferred to be kinesins. Subtomogram averaging (STA) confirmed the presence of dynein motor proteins. Proteomic analysis further validated the presence of dynein and kinesins and showed the presence of actin crosslinkers that could bundle actin filaments. Proteomics data also indicated the involvement of actin-based transport mediated by myosin. Importantly, the data indicated that the intraflagellar transport (IFT) system is not part of the intra-manchette transport mechanism. The visualisation of motor proteins directly from a biological sample represents a notable technical advancement, providing new insights into the organisation of the intra-manchette transport system in developing sperm.

      Are the key conclusions convincing?

      Below we comment on three main conclusions.

      MT and F-actin bundles are both constituents of the manchette While the data convincingly shows that MT and F-actin are part of the manchette, one cannot conclude from it that F-actin is an integral part of the manchette. The authors would need to rephrase so that it is clear that they are speculating.

      The transport system employs different transport machinery on these MTs Proteomics data indicates the presence of multiple motor proteins in the manchette, while cryo-EM data corroborates this by revealing morphologically distinct densities associated with the MTs. However, the nature of only one of these MT-associated densities has been confirmed-specifically, dynein, as identified through STA. The presence of kinesin or myosin in the EM data remains unconfirmed based on just the cryo-ET density, and therefore it is unclear whether these proteins are actively involved in cargo transport, as this cannot be supported by just the proteomics data. In summary, we recommend that the authors rephrase this conclusion and avoid using the term "employ".

      Dynein mediated transport (Line 225-227) The data shows that dynein is present in the manchette; however, whether it plays and active role in transport cannot be determined from the cryo-ET data provided in the manuscript, as it does not clearly display a dynein-dynactin complex attached to cargo. The attachment to cargo is also not revealed via proteomics as no adaptor proteins that link dynein-dynactin to its cargo have been shown.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      F-actin

      • In the abstract, the authors state that F-actin provides tracks for transport as well as having structural and mechanical roles. However, the manuscript does not include experiments demonstrating a mechanical role. The authors appear to base this statement on literature where actin bundles have been shown to play a mechanical role in other model systems. We suggest they clarify that the mechanical role the authors suggest is speculative and add references if appropriate.
      • Lines 15,92, 180 and 255: The statement "Filamentous actin is an integral part of the manchette" is misleading. While the authors show that F-actin is present in their purified manchette structures, whether it is integral has not been tested. Authors should rephrase the sentence.
      • To support the claim that F-actin plays a role in transport within the manchette, the authors present only one instance where an unidentified density is attached to an actin filament. This is insufficient evidence to claim that it is myosin actively transporting cargo. Although the proteomics data show the presence of myosin, we suggest the authors exercise more caution with this claim.
      • The authors mention the presence of F-actin bundles but do not show direct crosslinking between the F-actin filaments. They could in principle just be closely packed F-actin filaments that are not necessarily linked, so the term "bundle" should be used more cautiously.

      Observations of dynein

      • Relating to Figure 2B: From the provided image it is not clear whether the density corresponds to a dynein complex, as it does not exhibit the characteristic morphological features of dynein or dynactin molecules.
      • Lines 171-172 and Figure 4: It is well established that dynein is a dimer and should always possess two motor domains. The authors have incorrectly assumed they observed single motor heads, except possibly in Figure 4A (marked by an arrow). In all other instances, the dynein complexes show two motor domains in proximity, but these have not been segmented accurately. Furthermore, the "cargos" shown in grey are more likely to represent dynein tails or the dynactin molecule, based on comparisons with in vitro structures of these complexes (see references 1-3).
      • Lines 21, 173, and 233 mention cargos, but as noted above, it seems to be parts of the dynein complex the authors are referring to.
      • Panel 4B appears to show a dynein-dynactin complex, but whether there is a cargo is unclear and if there is it should be labelled accordingly. To assessment of whether there is any cargo bound to the dynein-dynactin complex a larger crop of the panel would be helpful In summary, we recommend that the authors revisit their segmentations in Figures 2B and 4, revise their text based on these observations, and perform quantification of the data (as suggested in the next section).

      Dynein versus kinesin-based transport

      The calculation presented in lines 147-151 does not account for the fact that both the dynein-dynactin complex and kinesin proteins require cargo adaptors to transport cargo. Additionally, the authors overlook the possibility that multiple motors could be attached to a single cargo. If the authors did not observe this, they should explicitly mention it to support their argument. In short, the calculations are based on an incorrect premise, rendering the comparison inaccurate. Unless the authors have identified any dynein-dynactin or kinesin cargo adaptors in their proteomics data which could be used for such a comparison, we believe the authors lack sufficient data to accurately estimate the "active transport ratio" between dynein and kinesin.

      Would additional experiments be essential to support the claims of the paper?

      F-actin distance and length distribution

      • To support the claim that F-actin is bundled (line 189), could the authors provide the distance between each F-actin filament and its neighbours? Additionally, could they compare the average distance to the length of actin crosslinkers found in their proteomics data, or compare it to the distances between crosslinked F-actin observed in other research studies?
      • While showing that F-actin is important for the manchette would require cellular experiments, authors could provide quantification of how frequently these actin structures are observed in comparison to MTs to support their claims that these actin filaments could be important for the manchette structure.
      • In line 193, the authors claim that the F-actin in bundles appears too short for transport. Could they provide length distributions for these filaments? This might provide further support to their claim that individual F-actin filaments can serve as transport tracks (line 266).
      • Could the authors also quantify the abundance of individual F-actin filaments observed, compared to MTs and F-actin bundles, to support the idea that they could play a role in transport?
      • In the discussion, the authors mention "interactions between F-actin singlets and mMTs" (line 269), yet they report observing only one instance of this interaction (lines 210 and 211). Given the limited data, they should refer to this as a single interaction in the discussion. The scarcity of data raises questions about how representative this event truly is.

      Quantifications for judgement of representativity

      The authors should quantify how often they observed vesicles with a stick-like connection to MTs (lines 106-107); this would strengthen the interpretation of the density, as currently only one example is shown in the manuscript (Figure 4A). If possible, they could show how many of them are facing towards the MT plus end.

      Dynein quantifications

      • The authors are recommended to quantify how many dynein molecules per micron of MT they observe and how often they are angled with their MT binding domain towards the minus-end.
      • Could the authors quantify how many dynein densities they found to be attached to a (vesicle) cargo, if any (line 175)? They could show these observations in a supplementary figure.
      • For densities that match the size and location of dynein but lack clear dynein morphology (as seen in Figure 2B), could the authors quantify how many are oriented towards the MT minus end?

      Artefacts due to purification: Authors should discuss if the purification could have effects on visualizing components of the manchette. For example, if it has effect on the MTs and actin structure or the abundance/structure of the motor protein complexes (bound to cargo or isolated).

      Are the experiments adequately replicated and statistical analysis adequate?

      The cryo-ET data presented in the manuscript is collected using two separate sample preparations. Along with the quantifications of the different observations suggested above which will help the reader assess how abundant and representative these observations are, the authors could further strengthen their claims by acquiring data from a third sample preparation and then analysing how consistent their observations are between different purifications. This however could be time consuming so it is not a major requirement but recommended if possible within a short time frame.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Most of the comments deal with either modifying the text or analysing the data already presented, so the revision could be done with 1-3 months.

      Minor comments:

      Specific experimental issues that are easily addressable.

      1. Could the authors state how many tilt series were collected for each dataset/independent sample preparation? We recommend that they upload their raw data or tomograms to EMPAIR.
      2. It is not clear to me if the same sample was used for cryo-ET and proteomics. Could the authors clarify how comparable the sample preparation for the cryo-ET and proteomics data is or if the same sample was used for both. If there is a discrepancy between these preparations, they would need to discuss how this can affect comparing observations from cryo-ET and mass spectrometry. Ideally both samples should be the same.

      Are prior studies referenced appropriately?

      We recommend including additional references to support the claim that F-actin has a mechanical role (line 242). Could the authors compare their proteomics data to other mass spectrometry studies conducted on the Manchette (for example see reference 4)?

      Are the text and figures clear and accurate?

      Text: We do not see the necessity of specifying the microtubules (MTs) in the data as "manchette MTs" or "mMTs" rather than simply "MTs". However, we recommend that the authors use either "MT" or "mMT" consistently throughout the manuscript.

      The authors appear to refer to both dynein-1 (cytoplasmic dynein) and dynein-2 (axonemal dynein or IFT dynein). To avoid confusion, it is important that the authors clearly specify which dynein they are referring to throughout the text. This is particularly relevant as the study aims to demonstrate that IFT is not part of the manchette transport system.

      • Introduction: In the third paragraph (lines 59-75), the authors should specify that they are referring to dynein-2, which is distinct from cytoplasmic dynein discussed in the previous paragraph (lines 44-58).
      • Figure 4D: The authors could fit a dynein-1 motor domain instead of a dynein-2 into the density to stay consistent with the fact that the density belongs to cytoplasmic dynein-1. Figures:
      • Figure 2B: The legend mentions a large linker complex; however, this may correspond to two or three separate densities.
      • Figure 4: please revisit the segmentation of this whole figure based on previous comments.
      • Figures 1, 2, 4, 5, and 6: It would be helpful to state in the legends that the tomograms are denoised. There are stripe-like densities visible in the images (e.g., in the vesicle in Figure 2B). Do these artefacts also appear in the raw data?

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      We suggest revising the paragraph title "Dynein-mediated cargo along the manchette" (line 165) to "Dynein-mediated cargo transport along the manchette".

      We recommend that the authors provide additional evidence to support the interpretation that the observed EM densities correspond to motor proteins. Specifically:

      • Include scale bars or reference lines indicating the known dimensions of motor proteins, based on previous data, to demonstrate that the observed densities match the expected size.
      • Make direct comparisons to existing EM data and highlight morphological similarities. In the discussion (lines 249-254), the authors could speculate on alternative roles for the IFT components in the manchette, particularly if they are not part of the IFT trains. We also suggest rephrasing the claim in line 266 to make it more speculative in tone. Finally, a schematic overview of the manchette ultrastructure in a spermatid would greatly aid the reader in understanding the material presented.

      References:

      1. Chowdhury, S., Ketcham, S., Schroer, T. et al. Structural organization of the dynein-dynactin complex bound to microtubules. Nat Struct Mol Biol 22, 345-347 (2015). https://doi.org/10.1038/nsmb.2996
      2. Grotjahn, D.A., Chowdhury, S., Xu, Y. et al. Cryo-electron tomography reveals that dynactin recruits a team of dyneins for processive motility. Nat Struct Mol Biol 25, 203-207 (2018). https://doi.org/10.1038/s41594-018-0027-7
      3. Chaaban, S., Carter, A.P. Structure of dynein-dynactin on microtubules shows tandem adaptor binding. Nature 610, 212-216 (2022). https://doi.org/10.1038/s41586-022-05186-y
      4. W. Hu, R. Zhang, H. Xu, Y. Li, X. Yang, Z. Zhou, X. Huang, Y. Wang, W. Ji, F. Gao, W. Meng, CAMSAP1 role in orchestrating structure and dynamics of manchette microtubule minus-ends impacts male fertility during spermiogenesis, Proc. Natl. Acad. Sci. U.S.A. 120 (45) e2313787120, https://doi.org/10.1073/pnas.2313787120 (2023).

      Significance

      This study employs cryo-electron tomography (cryo-ET) and proteomics to elucidate the architecture of the manchette. It advances our understanding of the components involved in intracellular transport within the manchette and introduces the following technical and conceptual innovations:

      a) Technical Advances:

      The authors have visualized the manchette at high resolution using cryo-ET. They optimized a purification pipeline capable of retaining, at least partially, the transport machinery of the manchette. Notably, they observed dynein and putative kinesin motors attached to microtubules-a significant achievement that, to our knowledge, has not been reported previously.

      b) Conceptual Advances:

      This study provides novel insights into spermatogenesis. The findings suggest that intraflagellar transport (IFT) is unlikely to play a role at this stage of sperm development while shedding light on alternative transport systems. Importantly, the authors demonstrate that actin filaments organize in two distinct ways: clustering parallel to microtubules or forming single filaments.

      This work is likely to be of considerable interest to researchers in sperm development and structural biology. Additionally, it may appeal to scientists studying motor proteins and the cytoskeleton.

      The reviewers possess extensive expertise in in situ cryo-electron tomography and single-particle microscopy, including work on dynein-based complexes. Collectively, they have significant experience in the field of cytoskeleton-based transport.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manchette appears as a shield-like structure surrounding the flagellar basal body upon spermiogenesis. It consists of a number of microtubules like a comb, but actin (Mochida et al. 1998 Dev. Biol. 200, 46) and myosin (Hayasaka et al. 2008 Asian J. Androl. 10, 561) were found, suggesting transportation inside the manchette. Detailed structural information and functional insight into the manchette was still awaited. There is a hypothesis called IMT (intra machette transport) based on the fact that machette and IFT (intraflagellar transport) share common components (or homologues) and on their transition along the stages of spermiogenesis. While IMT is considered as a potential hypothesis to explain delivery of centrosomal and flagellar components, no one has witnessed IMT at the same level as IFT. IMT has never been purified, visualized in motion or at high resolution.

      This study for the first time visualized manchette using high-end cryo-electron tomography of isolated manchettes, addressing structural characterization of IMT. The authors successfully microtubular bundles, vesicles located between microtubules and a linker-like structure connecting the vesicle and the microtubule. On multilamellar membranes in the vesicles they found particles and assigned them to ATPase complexes, based on intermediate (~60A) resolution structure. They further identified interesting structures, such as (1) particles on microtubules, which resemble dynein and (2) filaments which shows symmetry of F-actin. All the molecular assignments are consistent with their proteomics of manchettes.

      Their assignment of ATPase will be strengthened by MS data, if it proves absence of other possible proteins forming such a membrane protein complex.

      They discussed possible role of various motor proteins based on their abundance (Line 134-151, Line 200). This makes sense only with a control. Absolute abundance of proteins would not necessarily present their local importance or roles. This reviewer would suggest quantitative proteomics of other organelles, or whole cells, or other fractions obtained during manchette isolation, to demonstrate unique abundance of KIF27 and other proteins of their interest.<br /> A single image from a tomogram, Fig.6B, is not enough to prove actin-MT interaction. A gallery and a number (how many such junctions were found from how many MTs) will be necessary.

      Minor points:

      Their manchette purification is based on Mochida et al., which showed (their Fig.2) similarity to the in vivo structure (for example, Fig.1 of Kierszenbaum 2001 Mol. Reproduc. Dev. 59, 347). Nevertheless, since this is not a very common prep, it is helpful to show the wide view (low mag cryo-EM or ET) of the isolated manchette to prove its intactness. Line 81: Myosin -> myosin (to be consistent with other protein names)

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

      Significance

      This work is a significant step toward the understanding of manchettes. While the molecular assignment of dynein and ATPase is not fully decisive, due to limitation of resolution (this reviewer thinks the assignment of actin filament is convincing, based on its helical symmetry), their speculative model still deserves publication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript the authors have done cryo-electron tomography of the manchette, a microtubule-based structure important for proper sperm head formation during spermatogenesis. They also did mass-spectrometry of the isolated structures. Vesicles, actin and their linkers to microtubules within the structure are shown.

      Major:

      The data the conclusions are based on seem very limited and sometimes overinterpreted. For example, only one connection between actin and microtubules was observed, and this is thought to be MACF1 simply based on its presence in the MS.

      Another, and larger concern, is that the authors do a structural study on something that has been purified out of the cell, a process which is extremely disruptive. Vesicles, actin and other cellular components could easily be trapped in this cytoskeletal sieve during the purification process and as such, not be bona fide manchette components. This could create both misleading proteomics and imaging. Therefore, an approach not requiring extraction such as high-pressure freezing, sectioning and room-temperature electron tomography and/or immunoEM on sections to set aside this concern is strongly recommended. As an additional bonus, it would show if the vesicles containing ATP synthase are deformed mitochondria.

      Minor:

      Line 99: "to study IMT with cryo-ET, manchettes were isolated ...(insert from which organism)..."

      Line 102 "...demonstrating that they can be used to study IMT".. can the authors please clarify?

      Line 111 "densities face towards the MT plus-end" How can a density "face" anywhere? For this, it needs to have a defined front and back.

      Line 137: is the "perinuclear ring" the same as the manchette?

      Figure 2B: How did the authors decide to not model the electron density found between the vesicle and the MT at 3 O'clock? Is there no other proteins with a similar lollipop structure as ATP synthase, so that this can be said to be this protein with such certainty?

      Line 189: "F-actin formed organized bundles running parallel to mMTs" - this observation needs confirming in a less disrupted sample.

      Line 242 remove first comma sign

      Line 363 "a total of 2 datasets" - is this manuscript based on only two tilt-series? Or two datasets from each of the 4 grids? In any case, this is very limited data.

      Significance

      The article is very interesting, and if presented together with the suggested controls, would be informative to both microtubule/motorprotein researchers as well as those trying studying spermatogenesis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility, and clarity

      The work by Pinon et al describes the generation of a microvascular model to study Neisseria meningitidis interactions with blood vessels. The model uses a novel and relatively high throughput fabrication method that allows full control over the geometry of the vessels. The model is well characterized. The authors then study different aspects of Neisseria-endothelial interactions and benchmark the bacterial infection model against the best disease model available, a human skin xenograft mouse model, which is one of the great strengths of the paper. The authors show that Neisseria binds to the 3D model in a similar geometry that in the animal xenograft model, induces an increase in permeability short after bacterial perfusion, and induces endothelial cytoskeleton rearrangements. Finally, the authors show neutrophil recruitment to bacterial microcolonies and phagocytosis of Neisseria. The article is overall well written, and it is a great advancement in the bioengineering and sepsis infection field, and I only have a few major comments and some minor.

      Major comments:

      Infection-on-chip. I would recommend the authors to change the terminology of "infection on chip" to better reflect their work. The term is vague and it decreases novelty, as there are multiple infection on chips models that recapitulate other infections (recently reviewed in https://doi.org/10.1038/s41564-024-01645-6) including Ebola, SARS-CoV-2, Plasmodium and Candida. Maybe the term "sepsis on chip" would be more specific and exemplify better the work and novelty. Also, I would suggest that the authors carefully take a look at the text and consider when they use VoC or to current term IoC, as of now sometimes they are used interchangeably, with VoC being used occasionally in bacteria perfused experiments.

      We thank Reviewer #1 for this suggestion. Indeed, we have chosen to replace the term "Infection-on-Chip" by "infected Vessel-on-chip" to avoid any confusion in the title and the text. Also, we have removed all the terms "IoC" which referred to "Infection-on-Chip" and replaced with "VoC" for "Vessel-on-Chip". We think these terms will improve the clarity of the main text.

      Fig 3 and Suppmentary 3: Permeability. The authors suggest that early 3h infection with Neisseria do not show increase in vascular permeability in the animal model, contrary to their findings in the 3D in vitro model. However, they show a non-significant increase in permeability of 70 KDa Dextran in the animal xenograft early infection. This seems to point that if the experiment would have been done with a lower molecular weight tracer, significant increases in permeability could have been detected. I would suggest to do this experiment that could capture early events in vascular disruption.

      Comparing permeability under healthy and infected conditions using Dextran smaller than 70 kDa is challenging. Previous research [1] has shown that molecules below 70 kDa already diffuse freely in healthy tissue. Given this high baseline diffusion, we believe that no significant difference would be observed before and after N. meningitidis infection and these experiments were not carried out. As discussed in the manuscript, bacteria induced permeability in mouse occurs at later time points, 16h post infection as shown previoulsy [2]. As discussed in the manuscript, this difference between the xenograft model and the chip likely reflect the absence in the chip of various cell types present in the tissue parenchyma.

      The authors show the formation of actin of a honeycomb structure beneath the bacterial microcolonies. This only occurred in 65\% of the microcolonies. Is this result similar to in vitro 2D endothelial cultures in static and under flow? Also, the group has shown in the past positive staining of other cytoskeletal proteins, such as ezrin in the ERM complex. Does this also occur in the 3D system?

      We thank the Reviewer #1 for this suggestion. - According to this recommendation, we imaged monolayers of endothelial cells in the flat regions of the chip (the two lateral channels) using the same microscopy conditions (i.e., Obj. 40X N.A. 1.05) that have been used to detect honeycomb structures in the 3D vessels in vitro. We showed that more than 56% of infected cells present these honeycomb structures in 2D, which is 13% less than in 3D, and is not significant due to the distributions of both populations. Thus, we conclude that under both in vitro conditions, 2D and 3D, the amount of infected cells exhibiting cortical plaques is similar. We have added the graph and the confocal images in Figure S4B and lines 418-419 of the revised manuscript. - We recently performed staining of ezrin in the chip and imaged both the 3D and 2D regions. Although ezrin staining was visible in 3D (Fig. 1 of this response), it was not as obvious as other markers under these infected conditions and we did not include it in the main text. Interpretation of this result is not straight forward as for instance the substrate of the cells is different and it would require further studies on the behaviour of ERM proteins in these different contexts.

      One of the most novel things of the manuscript is the use of a relatively quick photoablation system. I would suggest that the authors add a more extensive description of the protocol in methods. Could this technique be applied in other laboratories? If this is a major limitation, it should be listed in the discussion.

      Following the Reviewer's comment, we introduced more detailed explanations regarding the photoablation: - L157-163 (Results): "Briefly, the chosen design is digitalized into a list of positions to ablate. A pulsed UV-LASER beam is injected into the microscope and shaped to cover the back aperture of the objective. The laser is then focused on each position that needs ablation. After introducing endothelial cells (HUVEC) in the carved regions,.." - L512-516 (Discussion): "The speed capabilities drastically improve with the pulsing repetition rate. Given that our laser source emits pulses at 10kHz, as compared to other photoablation lasers with repetitions around 100 Hz, our solution could potentially gain a factor of 100. Also,..." - L1082-1087 (Materials and Methods): "…, and imported in a python code. The control of the various elements is embedded and checked for this specific set of hardware. The code is available upon request."

      Adding these three paragraphs gives more details on how photoablation works thus improving the manuscript.

      Minor comments:

      Supplementary Fig 2. The reference to subpanels H and I is swapped.

      The references to subpanels H and I have been correctly swapped back in the reviewed version.

      Line 203: I would suggest to delete this sentence. Although a strength of the submitted paper is the direct comparison of the VoC model with the animal model to better replicate Neisseria infection, a direct comparison with animal permeability is not needed in all vascular engineering papers, as vascular permeability measurements in animals have been well established in the past.

      The sentence "While previously developed VoC platforms aimed at replicating physiological permeability properties, they often lack direct comparisons with in vivo values." has been removed from the revised text.

      Fig 3: Bacteria binding experiments. I would suggest the addition of more methodological information in the main results text to guarantee a good interpretation of the experiment. First, it would be better that wall shear stress rather than flow rate is described in the main text, as flow rate is dependent on the geometry of the vessel being used. Second, how long was the perfusion of Neisseria in the binding experiment performed to quantify colony doubling or elongation? As per figure 1C, I would guess than 100 min, but it would be better if this information is directly given to the readers.

      We thank Reviewer #1 for these two suggestions that will improve the text clarity (e.g., L316). (i) Indeed, we have changed the flow rate in terms of shear stress. (ii) Also, we have normalized the quantification of the colony doubling time according to the first time-point where a single bacteria is attached to the vessel wall. Thus, early adhesion bacteria will be defined by a longer curve while late adhesion bacteria by a shorter curve. In total, the experiment lasted for 3 hours (modifications appear in L318 and L321-326).}

      Fig 4: The honeycomb structure is not visible in the 3D rendering of panel D. I would recommend to show the actin staining in the absence of Neisseria staining as well.

      According to this suggestion, a zoom of the 3D rendering of the cortical plaque without colony had been added to the figure 4 of the revised manuscript.

      Line 421: E-selectin is referred as CD62E in this sentence. I would suggest to use the same terminology everywhere.

      We have replaced the "CD62E" term with "E-selectin" to improve clarity.}

      Line 508: "This difference is most likely associated with the presence of other cell types in the in vivo tissues and the onset of intravascular coagulation". Do the authors refer to the presence of perivascular cells, pericytes or fibroblasts? If so, it could be good to mention them, as well as those future iterations of the model could include the presence of these cell types.

      By "other cell types", we refer to pericytes [3], fibroblasts [4], and perivascular macrophages [5], which surround endothelial cells and contribute to vessel stability. The main text was modified to include this information (Lines 548 and 555-570) and their potential roles during infection disussed.

      Discussion: The discussion covers very well the advantages of the model over in vitro 2D endothelial models and the animal xenograft but fails to include limitations. This would include the choice of HUVEC cells, an umbilical vein cell line to study microcirculation, the lack of perivascular cells or limitations on the fabrication technique regarding application in other labs (if any).

      We thank Reviewer #1 for this suggestion. Indeed, our manuscript may lack explaining limitations, and adding them to the text will help improve it: - The perspectives of our model include introducing perivascular cells surrounding the vessel and fibroblasts into the collagen gel as discussed previously and added in the discussion part (L555-570). - Our choice for HUVEC cells focused on recapitulating the characteristics of venules that respect key features such as the overexpression of CD62E and adhesion of neutrophils during inflammation. Using microvascular endothelial cells originating from different tissues would be very interesting. This possibility is now mentioned in the discussion lines 567-568. - Photoablation is a homemade fabrication technique that can be implemented in any lab harboring an epifluorescence microscope. This method has been more detailed in the revised manuscript (L1085-1087).

      Line 576: The authors state that the model could be applied to other systemic infections but failed to mention that some infections have already been modelled in 3D bioengineered vascular models (examples found in https://doi.org/10.1038/s41564-024-01645-6). This includes a capillary photoablated vascular model to study malaria (DOI: 10.1126/sciadv.aay724).

      Thes two important references have been introduced in the main text (L84, 647, 648).}

      Line 1213: Are the 6M neutrophil solution in 10ul under flow. Also, I would suggest to rewrite this sentence in the following line "After, the flow has been then added to the system at 0.7-1 μl/min."

      We now specified that neutrophils are circulated in the chip under flow conditions, lines 1321-1322.

      Significance

      The manuscript is comprehensive, complete and represents the first bioengineered model of sepsis. One of the major strengths is the carful characterization and benchmarking against the animal xenograft model. Its main limitations is the brief description of the photoablation methodology and more clarity is needed in the description of bacteria perfusion experiments, given their complexity. The manuscript will be of interest for the general infection community and to the tissue engineering community if more details on fabrication methods are included. My expertise is on infection bioengineered models.

      Reviewer #2

      Evidence, reproducibility, and clarity

      Summary The authors develop a Vessel-on-Chip model, which has geometrical and physical properties similar to the murine vessels used in the study of systemic infections. The vessel was created via highly controllable laser photoablation in a collagen matrix, subsequent seeding of human endothelial cells and flow perfusion to induce mechanical cues. This vessel could be infected with Neisseria meningitidis, as a model of systemic infection. In this model, microcolony formation and dynamics, and effects on the host were very similar to those described for the human skin xenograft mouse, which is the current gold standard for these studies, and were consistent with observations made in patients. The model could also recapitulate the neutrophil response upon N. meningitidis systemic infection.

      Major comments:

      I have no major comments. The claims and the conclusions are supported by the data, the methods are properly presented and the data is analyzed adequately. Furthermore, I would like to propose an optional experiment could improve the manuscript. In the discussion it is stated that the vascular geometry might contribute to bacterial colonization in areas of lower velocity. It would be interesting to recapitulate this experimentally. It is of course optional but it would be of great interest, since this is something that can only be proven in the organ-on-chip (where flow speed can be tuned) and not as much in animal models. Besides, it would increase impact, demonstrating the superiority of the chip in this area rather than proving to be equal to current models.

      We have conducted additional experiments on infection in different vascular geometries now added these results figure 3/S3 and lines 288-305. We compared sheared stress levels as determined by Comsol simulation and experimentally determined bacterial adhesion sites. In the conditions used, the range of shear generated by the tested geometries do not appear to change the efficiency of bacterial adhesion. These results are consistent with a previous study from our group which show that in this range of shear stresses the effect on adhesion is limited [6] . Furthermore, qualitative observations in the animal model indicate that bacteria do not have an obvious preference in terms of binding site.

      Minor comments:

      I have a series of suggestions which, in my opinion, would improve the discussion. They are further elaborated in the following section, in the context of the limitations.

      • How to recapitulate the vessels in the context of a specific organ or tissue? If the pathogen is often found in the luminal space of other organs after disseminating from the blood, how can this process be recapitulated with this mode, if at all?

      • For reasons that are not fully understood, postmortem histological studies reveal bacteria only inside blood vessels but rarely if ever in the organ parenchyma. The presence of intravascular bacteria could nevertheless alter cells in the tissue parenchyma. The notable exception is the brain where bacteria exit the bacterial lumen to access the cerebrospinal fluid. The chip we describe is fully adapted to develop a blood brain barrier model and more specific organ environments. This implies the addition of more cell types in the hydrogel. A paragraph on this topic has been added (Lines 548 and 552-570).

      • Similarly, could other immune responses related to systemic infection be recapitulated? The authors could discuss the potential of including other immune cells that might be found in the interstitial space, for example.

      • This important discussion point has been added to the manuscript (L623-636). As suggested by Reviewer #2, other immune cells respond to N. meningitis and can be explored using our model. For instance, macrophages and dendritic cells are activated upon N. meningitis infection, eliminate the bacteria through phagocytosis, produce pro-inflammatory cytokines and chemokines potentially activating lymphocytes [7]. Such an immune response, yet complex, would be interesting to study in our model as skin-xenograft mice are deprived of B and T lymphocytes to ensure acceptance of human skin grafts.

      • A minor correction: in line 467 it should probably be "aspects" instead of "aspect", and the authors could consider rephrasing that sentence slightly for increased clarity.

      • We have corrected the sentence with "we demonstrated that our VoC strongly replicates key aspects of the in vivo human skin xenograft mouse model, the gold standard for studying meningococcal disease under physiological conditions." in lines 499-503.

        Strengths and limitations

      The most important strength of this manuscript is the technology they developed to build this model, which is impressive and very innovative. The Vessel-on-Chip can be tuned to acquire complex shapes and, according to the authors, the process has been optimized to produce models very quickly. This is a great advancement compared with the technologies used to produce other equivalent models. This model proves to be equivalent to the most advanced model used to date, but allows to perform microscopy with higher resolution and ease, which can in turn allow more complex and precise image-based analysis. However, the authors do not seem to present any new mechanistic insights obtained using this model. All the findings obtained in the infection-on-chip demonstrate that the model is equivalent to the human skin xenograft mouse model, and can offer superior resolution for microscopy. However, the advantages of the model do not seem to be exploited to obtain more insights on the pathogenicity mechanisms of N. meningitidis, host-pathogen interactions or potential applications in the discovery of potential treatments. For example, experiments to elucidate the role of certain N. meningiditis genes on infection could enrich the manuscript and prove the superiority of the model. However, I understand these experiments are time-consuming and out of the scope of the current manuscript. In addition, the model lacks the multicellularity that characterizes other similar models. The authors mention that the pathogen can be found in the luminal space of several organs, however, this luminal space has not been recapitulated in the model. Even though this would be a new project, it would be interesting that the authors hypothesize about the possibilities of combining this model with other organ models. The inclusion of circulating neutrophils is a great asset; however it would also be interesting to hypothesize about how to recapitulate other immune responses related to systemic infection.

      We thank Reviewer #2 for his/her comment on the strengths and limitations of our work. The difficulty is that our study opens many futur research directions and applications and we hope that the work serves as the basis for many future studies but one can only address a limited set of experiments in a single manuscript. - Experiments investigating the role of N. meningitidis genes require significant optimization of the system. Multiplexing is a potential avenue for future development, which would allow the testing of many mutants. The fast photoablation approach is particularly amenable to such adaptation. - Cells and bacteria inside the chambers could be isolated and analyzed at the transcriptomic level or by flow cytometry. This would imply optimizing a protocol for collecting cells from the device via collagenase digestion, for instance. This type of approach would also benefit from multiplexing to enhance the number of cells. - As mentioned above, the revised manuscript discusses the multicellular capabilities of our model, including the integration of additional immune cells and potential connections to other organ systems. We believe that these approaches are feasible and valuable for studying various aspects of N. meningitidis infection.

      Advance

      The most important advance of this manuscript is technical: the development of a model that proves to be equivalent to the most complex model used to date to study meningococcal systemic infections. The human skin xenograft mouse model requires complex surgical techniques and has the practical and ethical limitations associated with the use of animals. However, the Infection-on-chip model is completely in vitro, can be produced quickly, and allows to precisely tune the vessel's geometry and to perform higher resolution microscopy. Both models were comparable in terms of the hallmarks defining the disease, suggesting that the presented model can be an effective replacement of the animal use in this area.

      Other vessel-on-chip models can recapitulate an endothelial barrier in a tube-like morphology, but do not recapitulate other complex geometries, that are more physiologically relevant and could impact infection (in addition to other non-infectious diseases). However, in the manuscript it is not clear whether the different morphologies are necessary to study or recapitulate N. meningitidis infection, or if the tubular morphologies achieved in other similar models would suffice.

      We thank Reviewer #2 for his/her comment, also raised by reviewer 1. To answer this question, we have now infected vessel-on-chips of different geometries, to dissect the impact of flow distribution in N. meningitidis infection (Figures 3 and S3, explained in lines 288-307). In this range of shear stress, we show that bacterial infection is not strongly affected by geometry-induced shear stress variation. These observations are constistent with observations in flow chambers and qualitative observations of human cases and in the xenograft model [6].

      Audience

      This manuscript might be of interest for a specialized audience focusing on the development of microphysiological models. The technology presented here can be of great interest to researchers whose main area of interest is the endothelium and the blood vessels, for example, researchers on the study of systemic infections, atherosclerosis, angiogenesis, etc. Thus, the tool presented (vessel-on-chip) can have great applications for a broad audience. However, even when the method might be faster and easier to use than other equivalent methods, it could still be difficult to implement in another laboratory, especially if it lacks expertise in bioengineering. Therefore, the method could be more of interest for laboratories with expertise in bioengineering looking to expand or optimize their toolbox. Alternatively, this paper present itself as an opportunity to begin collaborations, since the model could be used to test other pathogen or conditions.

      Field of expertise: Infection biology, organ-on-chip, fungal pathogens.

      I lack the expertise to evaluate the image-based analysis.

      References:

      1. Gyohei Egawa, Satoshi Nakamizo, Yohei Natsuaki, Hiromi Doi, Yoshiki Miyachi, and Kenji Kabashima. Intravital analysis of vascular permeability in mice using two-photon microscopy. Scientific Reports, 3(1):1932, Jun 2013. ISSN 2045-2322. doi: 10.1038/srep01932.

      2. Valeria Manriquez, Pierre Nivoit, Tomas Urbina, Hebert Echenique-Rivera, Keira Melican, Marie-Paule Fernandez-Gerlinger, Patricia Flamant, Taliah Schmitt, Patrick Bruneval, Dorian Obino, and Guillaume Duménil. Colonization of dermal arterioles by neisseria meningitidis provides a safe haven from neutrophils. Nature Communications, 12(1):4547, Jul 2021. ISSN 2041-1723. doi:10.1038/s41467-021-24797-z.

      3. Mats Hellström, Holger Gerhardt, Mattias Kalén, Xuri Li, Ulf Eriksson, Hartwig Wolburg, and Christer Betsholtz. Lack of pericytes leads to endothelial hyperplasia and abnormal vascular morphogenesis. Journal of Cell Biology, 153(3):543–554, Apr 2001. ISSN 0021-9525. doi: 10.1083/jcb.153.3.543.

      4. Arsheen M. Rajan, Roger C. Ma, Katrinka M. Kocha, Dan J. Zhang, and Peng Huang. Dual function of perivascular fibroblasts in vascular stabilization in zebrafish. PLOS Genetics, 16(10):1–31, 10 2020. doi: 10.1371/journal.pgen.1008800.

      5. Huanhuan He, Julia J. Mack, Esra Güç, Carmen M. Warren, Mario Leonardo Squadrito, Witold W. Kilarski, Caroline Baer, Ryan D. Freshman, Austin I. McDonald, Safiyyah Ziyad, Melody A. Swartz, Michele De Palma, and M. Luisa Iruela-Arispe. Perivascular macrophages limit permeability. Arteriosclerosis, Thrombosis, and Vascular Biology, 36(11):2203–2212, 2016. doi: 10.1161/ATVBAHA. 116.307592.

      6. Emilie Mairey, Auguste Genovesio, Emmanuel Donnadieu, Christine Bernard, Francis Jaubert, Elisabeth Pinard, Jacques Seylaz, Jean-Christophe Olivo-Marin, Xavier Nassif, and Guillaume Dumenil. Cerebral microcirculation shear stress levels determine Neisseria meningitidis attachment sites along the blood–brain barrier . Journal of Experimental Medicine, 203(8):1939–1950, 07 2006. ISSN 0022-1007. doi: 10.1084/jem.20060482.

      7. Riya Joshi and Sunil D. Saroj. Survival and evasion of neisseria meningitidis from macrophages. Medicine in Microecology, 17:100087, 2023. ISSN 2590-0978. doi: https://doi.org/10.1016/j.medmic.2023.100087.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors develop a Vessel-on-Chip model, which has geometrical and physical properties similar to the murine vessels used in the study of systemic infections. The vessel was created via highly controllable laser photoablation in a collagen matrix, subsequent seeding of human endothelial cells and flow perfusion to induce mechanical cues. This vessel could be infected with Neisseria meningitidis, as a model of systemic infection. In this model, microcolony formation and dynamics, and effects on the host were very similar to those described for the human skin xenograft mouse, which is the current gold standard for these studies, and were consistent with observations made in patients. The model could also recapitulate the neutrophil response upon N. meningitidis systemic infection.

      Major comments

      I have no major comments. The claims and the conclusions are supported by the data, the methods are properly presented and the data is analyzed adequately. Furthermore, I would like to propose an optional experiment could improve the manuscript. In the discussion it is stated that the vascular geometry might contribute to bacterial colonization in areas of lower velocity. It would be interesting to recapitulate this experimentally. It is of course optional but it would be of great interest, since this is something that can only be proven in the organ-on-chip (where flow speed can be tuned) and not as much in animal models. Besides, it would increase impact, demonstrating the superiority of the chip in this area rather than proving to be equal to current models.

      Minor comments

      I have a series of suggestions which, in my opinion, would improve the discussion. They are further elaborated in the following section, in the context of the limitations. - How to recapitulate the vessels in the context of a specific organ or tissue? If the pathogen is often found in the luminal space of other organs after disseminating from the blood, how can this process be recapitulated with this mode, if at all? - Similarly, could other immune responses related to systemic infection be recapitulated? The authors could discuss the potential of including other immune cells that might be found in the interstitial space, for example. A minor correction: in line 467 it should probably be "aspects" instead of "aspect", and the authors could consider rephrasing that sentence slightly for increased clarity.

      Referee cross-commenting

      I agree with the rest of the comments, and also agree that the manuscript is already complete and could be published as it is.

      Significance

      Strengths and limitations

      The most important strength of this manuscript is the technology they developed to build this model, which is impressive and very innovative. The Vessel-on-Chip can be tuned to acquire complex shapes and, according to the authors, the process has been optimized to produce models very quickly. This is a great advancement compared with the technologies used to produce other equivalent models. This model proves to be equivalent to the most advanced model used to date, but allows to perform microscopy with higher resolution and ease, which can in turn allow more complex and precise image-based analysis. However, the authors do not seem to present any new mechanistic insights obtained using this model. All the findings obtained in the infection-on-chip demonstrate that the model is equivalent to the human skin xenograft mouse model, and can offer superior resolution for microscopy. However, the advantages of the model do not seem to be exploited to obtain more insights on the pathogenicity mechanisms of N. meningitidis, host-pathogen interactions or potential applications in the discovery of potential treatments. For example, experiments to elucidate the role of certain N. meningiditis genes on infection could enrich the manuscript and prove the superiority of the model. However, I understand these experiments are time consuming and out of the scope of the current manuscript. In addition, the model lacks the multicellularity that characterizes other similar models. The authors mention that the pathogen can be found in the luminal space of several organs, however, this luminal space has not been recapitulated in the model. Even though this would be a new project, it would be interesting that the authors hypothesize about the possibilities of combining this model with other organ models. The inclusion of circulating neutrophils is a great asset; however it would also be interesting to hypothesize about how to recapitulate other immune responses related to systemic infection.

      Advance

      The most important advance of this manuscript is technical: the development of a model that proves to be equivalent to the most complex model used to date to study meningococcal systemic infections. The human skin xenograft mouse model requires complex surgical techniques and has the practical and ethical limitations associated with the use of animals. However, the Infection-on-chip model is completely in vitro, can be produced quickly, and allows to precisely tune the vessel's geometry and to perform higher resolution microscopy. Both models were comparable in terms of the hallmarks defining the disease, suggesting that the presented model can be an effective replacement of the animal use in this area. Other vessel-on-chip models can recapitulate an endothelial barrier in a tube-like morphology, but do not recapitulate other complex geometries, that are more physiologically relevant and could impact infection (in addition to other non-infectious diseases). However, in the manuscript it is not clear whether the different morphologies are necessary to study or recapitulate N. meningitidis infection, or if the tubular morphologies achieved in other similar models would suffice.

      Audience

      This manuscript might be of interest for a specialized audience focusing on the development of microphysiological models. The technology presented here can be of great interest to researchers whose main area of interest is the endothelium and the blood vessels, for example, researchers on the study of systemic infections, atherosclerosis, angiogenesis, etc. Thus, the tool presented (vessel-on-chip) can have great applications for a broad audience. However, even when the method might be faster and easier to use than other equivalent methods, it could still be difficult to implement in another laboratory, especially if it lacks expertise in bioengineering. Therefore, the method could be more of interest for laboratories with expertise in bioengineering looking to expand or optimize their toolbox. Alternatively, this paper present itself as an opportunity to begin collaborations, since the model could be used to test other pathogen or conditions.

      Field of expertise: infection biology, organ-on-chip, fungal pathogens

      I lack the expertise to evaluate the image-based analysis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The work by Pinon et al describes the generation of a microvascular model to study Neisseria meningitidis interactions with blood vessels. The model uses a novel and relatively high throughput fabrication method that allows full control over the geometry of the vessels. The model is well characterized. The authors then study different aspects of Neisseria-endothelial interactions and benchmark the bacterial infection model against the best disease model available, a human skin xenograft mouse model, which is one of the great strengths of the paper. The authors show that Neisseria binds to the 3D model in a similar geometry that in the animal xenograft model, induces an increase in permeability short after bacterial perfusion, and induces endothelial cytoskeleton rearrangements. Finally, the authors show neutrophil recruitment to bacterial microcolonies and phagocytosis of Neisseria. The article is overall well written, and it is a great advancement in the bioengineering and sepsis infection field, and I only have a few major comments and some minor.

      Major comments:

      Infection-on-chip. I would recommend the authors to change the terminology of "infection on chip" to better reflect their work. The term is vague and it decreases novelty, as there are multiple infection on chips models that recapitulate other infections (recently reviewed in https://doi.org/10.1038/s41564-024-01645-6) including Ebola, SARS-CoV-2, Plasmodium and Candida. Maybe the term "sepsis on chip" would be more specific and exemplify better the work and novelty. Also, I would suggest that the authors carefully take a look at the text and consider when they use VoC or to current term IoC, as of now sometimes they are used interchangeably, with VoC being used occasionally in bacteria perfused experiments.

      Fig 3 and Suppmentary 3: Permeability. The authors suggest that early 3h infection with Neisseria do not show increase in vascular permeability in the animal model, contrary to their findings in the 3D in vitro model. However, they show a non-significant increase in permeability of 70 KDa Dextran in the animal xenograft early infection. This seems to point that if the experiment would have been done with a lower molecular weight tracer, significant increases in permeability could have been detected. I would suggest to do this experiment that could capture early events in vascular disruption.

      The authors show the formation of actin of a honeycomb structure beneath the bacterial microcolonies. This only occurred in 65% of the microcolonies. Is this result similar to in vitro 2D endothelial cultures in static and under flow? Also, the group has shown in the past positive staining of other cytoskeletal proteins, such as ezrin in the ERM complex. Does this also occur in the 3D system?

      One of the most novel things of the manuscript is the use of a relatively quick photoablation system. I would suggest that the authors add a more extensive description of the protocol in methods. Could this technique be applied in other laboratories? If this is a major limitation, it should be listed in the discussion.

      Minor comments:

      Supplementary Fig 2. The reference to subpanels H and I is swapped.

      Line 203: I would suggest to delete this sentence. Although a strength of the submitted paper is the direct comparison of the VoC model with the animal model to better replicate Neisseria infection, a direct comparison with animal permeability is not needed in all vascular engineering papers, as vascular permeability measurements in animals have been well established in the past.

      Fig 3: Bacteria binding experiments. I would suggest the addition of more methodological information in the main results text to guarantee a good interpretation of the experiment. First, it would be better that wall shear stress rather than flow rate is described in the main text, as flow rate is dependent on the geometry of the vessel being used. Second, how long was the perfusion of Neisseria in the binding experiment performed to quantify colony doubling or elongation? As per figure 1C, I would guess than 100 min, but it would be better if this information is directly given to the readers.

      Fig 4: The honeycomb structure is not visible in the 3D rendering of panel D. I would recommend to show the actin staining in the absence of Neisseria staining as well.

      Line 421: E-selectin is referred as CD62E in this sentence. I would suggest to use the same terminology everywhere.

      Line 508: "This difference is most likely associated with the presence of other cell types in the in vivo tissues and the onset of intravascular coagulation". Do the authors refer to the presence of perivascular cells, pericytes or fibroblasts? If so, it could be good to mention them, as well as those future iterations of the model could include the presence of these cell types.

      Discussion: The discussion covers very well the advantages of the model over in vitro 2D endothelial models and the animal xenograft but fails to include limitations. This would include the choice of HUVEC cells, an umbilical vein cell line to study microcirculation, the lack of perivascular cells or limitations on the fabrication technique regarding application in other labs (if any).

      Line 576: The authors state that the model could be applied to other systemic infections but failed to mention that some infections have already been modelled in 3D bioengineered vascular models (examples found in https://doi.org/10.1038/s41564-024-01645-6). This includes a capillary photoablated vascular model to study malaria ( DOI: 10.1126/sciadv.aay724).

      Line 1213: Are the 6M neutrophil solution in 10ul under flow. Also, I would suggest to rewrite this sentence in the following line "After, the flow has been then added to the system at 0.7-1 μl/min."

      Referee cross-commenting

      I agree with the other reviewer's comments. The manuscript is already very complete could be published without the addition of other experiments, but the ones I proposed could validate even more the in vitro model. For example the permeability with lower molecular weight tracers, could show that the changes in vessel permeability might already exist at early timepoints in the xenograft model, similarly than in the in vitro model.

      Significance

      The manuscript is comprehensive, complete and represents the first bioengineered model of sepsis. One of the major strengths is the carful characterization and benchmarking against the animal xenograft model. Its main limitations is the brief description of the photoablation methodology and more clarity is needed in the description of bacteria perfusion experiments, given their complexity. The manuscript will be of interest for the general infection community and to the tissue engineering community if more details on fabrication methods are included.

      My expertise is on infection bioengineered models.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their very thoughtful and insightful reviews. We have performed several new experiments and addressed their points in a revised manuscript, which has significantly improved the manuscript. Our detailed responses follow.


      Responses to Reviewer 1

      Major comments

      1. __ In the supplementary figures when looking at the KC vs KPC mice and the trichrome staining (S2) or looking at the muc5a and mucin levels in figure 2, the KPC mice appear to have a larger amount of PANIN formation than the KC mice which is usually indicative of further tumour progression that can occur in p53 null tumours. Due to the further progression of the tumour in the KPC mice, drugs such as nutlin-3a may have inhibitory effect on PANIN formation and nutlin results might therefore not be indicative of p53 dependence. This should at least be mentioned and discussed.__

      A: We appreciate the reviewer’s insightful comment. Indeed, in the KPC model, where PDAC progression is accelerated, Nutlin-3a may exhibit p53-independent effects. We now address this consideration in the revised manuscript on page 6, when describing our PanIN and Alcian blue results in KPC mice.

      __ When looking at how p53 controls the expression of acinar cell identity genes the authors look at MEFs when performing their GSEA (Figure 4a, b). The MEFs are used as a model for neoplastic cells but it would also be beneficial to test this in pancreatic cell lines. When looking at Bhlha15 expression (figure 4c) there is a decrease observed in the p53 containing vs knockout mice, this is from a data set GSE94566, it would be beneficial to test this in the KC and KPC mice the authors generated to see if the results can be validated. Results in 4d re mist expression should also be evaluated in KPC mice to prove p53 dependence. Finally, murine and human fibroblasts are treated with doxorubicin (figure 4e-f; figure s4b) to show Bhlha15 is upregulated, it would also be useful to show this either in pancreatic cell lines with/without p53 or from the murine tissue of the KC and KPC mice.__

      A: We appreciate the reviewer’s insightful comment. We recognize that the way we originally described the GSEA analysis may have inadvertently suggested that it was performed in RNA-seq from MEFs. To clarify, the GSEA analysis in Figure 4a is derived from RNA-seq of sorted precursor lesions from p53-proficient (KC) and p53-deficient (KPC) mice, not MEFs. We have revised the Results section that describes Figure 4 __to more clearly reflect this distinction. Additionally, we acknowledge the importance of confirming p53’s regulatory role over Bhlha15 expression in the pancreas. To support the findings from the GSE94566 dataset, which was generated using sorted precursor lesions from KC and KPC mice (Mello et al., 2017), we present a boxplot of Bhlha15 expression (__Figure 4c).In response to the reviewer’s suggestion, we incorporated Mist1 immunohistochemistry and quantification in KPC mice treated with Nutlin-3a or vehicle control (Figure 4b) to further validate the p53-dependent regulation of Mist1 expression. To strengthen the conclusions from Figures 4c–d, we also conducted complementary experiments in mouse-derived pancreatic cancer cell lines either proficient (KIC1 and KIC2, derived from Kras+/G12D; Pdx1-Cre; Cdkn2afl/fl mice) or deficient (KPC, derived from Kras+/G12D; Pdx1-Cre; Trp53fl/fl mice) for p53. These experiments aim to further substantiate the regulatory role of p53 in controlling Mist1 expression.

      __ It is unclear why varying numbers of mice have been used. For the majority of experiments, the authors use n=6 mock treated mice and n=3 or 4 nutlin-3a treated mice. For KPC mice n=4 and n=4 was used. N=3 for mock and nutlin-3a treated mice were used. Did some mice die unexpectedly during the experiment? It would be good to report this. Also, the smaller amount of animal models used even though it was n=3/the disparity between the control and nutlin treated may raise some question. Usually, 6 vs 3. Possibly testing this with some lesson common Kras mutants and increasing the time in which nutlin-3a is studied in the pancreatic tumours, can it constantly prevent tumour formation.__

      A: We appreciate the reviewer’s concern regarding the variation in cohort sizes across experiments. Several factors contributed to these differences. In some cases, mice were excluded due to health issues such as malocclusion and poor general condition. Additionally, a subset of animals was misgenotyped and later confirmed to lack the KrasG12D allele, necessitating their exclusion from the study. The KPC model in particular is challenging to breed due to the requirement for four specific alleles and its rapid progression toward pancreatic cancer, which can limit survival and experimental flexibility. Despite these limitations, our key experimental groups, such as those evaluating Amylase rescue upon Nutlin-3a treatment, Mist1 induction in ADM, and lineage tracing studies, maintained a statistical power of at least 80% based on our cohort sizes. We have now clarified these details in the Supplemental Materials and Methods to ensure transparency regarding animal exclusions and sample size variability. We also agree with the reviewer that assessing Nutlin-3a at later stages of tumorigenesis would strengthen our findings. To this end, we treated aging KC mice (6 months old), which accumulate ADM and PanINs due to chronic Kras activation (rather than pancreatitis), with Nutlin-3a for a week and analyzed them at 8 months. Treated mice showed increased normal acinar tissue and reduced high-grade PanINs. This new data, presented in Figure 5, highlights the sustained tumor-suppressive effect of p53 activation and suggests that it could delay or prevent PDAC onset.

      Minor comments:

      1. __ The text is mostly clear, apart from the results section around figure 4, where it is not always clear which material has been used for analysis when referring to a previous paper. The quantification graphs should be wider as they seem squished sometimes. Changing the colours to be darker would make these more easily identifiable as the pale blue/red are sometimes difficult to see.__

      A: We appreciate the reviewer’s feedback and agree with the reviewer that the results section for Figure 4 leaves some margin for misinterpretation. To address this, we have revised the text to improve clarity and ensure that the source of each dataset, condition, and cell line is explicitly stated. Additionally, we have added a Supplementary Materials and Methods section that provides detailed information about the datasets and experimental conditions used in Figure 4. We have also adjusted the quantification graphs to be wider, preventing them from appearing compressed, and modified the color scheme, using gray and white tones to improve visibility and contrast, making the data easier to interpret.

      __ a) Figure 4a and 4b can be moved to the supplementary figures. b). For the figure legend of S1 on the separate file for the supplemental figures there is no S1e mentioned but it is in the paper. c) Figure S1a needs a scale bar.__

      A: We appreciate the reviewer’s suggestions and have implemented all the requested changes:

      1. a) Figures 4a and 4b have been moved to the Supplementary Figures section.
      2. b) The Figure S1 legend in the supplemental file has been updated to include S1e, ensuring consistency with the main text.
      3. c) A scale bar has been added to Figure S1a for clarity.

      Responses to Reviewer 2

      Major comments

      1. __ The study implies that pharmacologically engaging wild-type p53 (for example, through Nutlin-3a) may serve as a strategy to prevent or significantly delay the onset of PDAC by preserving the normal acinar cell phenotype and blocking early metaplastic changes. Doing a search in Pubmed search, no such findings has been previously published. It is a very important findings as it paves the way to clinical trial. The data are of excellent quality and would support the conclusions but the experiments need additional control experiments to strengthen the conclusions.__

      A: We thank the reviewer for the positive assessment of our work.

      __ For all immunohistochemistry quantification: The authors should explain better how the scoring was performed. - the authors should present a range of positive staining (negative, Weak, medium, high). The authors should state the number of sections analysed and how many cells or nuclei in total were counted per section or ROI to define the percentage of positive cells/nuclei.__

      A: We appreciate the reviewer’s suggestion and have addressed this point by adding a detailed description of our immunohistochemistry quantification methodology in the Supplementary Materials and Methods. This includes a clear explanation of how positive staining was defined. Specifically, we did not use a categorical intensity scoring system (e.g., weak, medium, strong); instead, positive staining was determined based on signal levels clearly distinguishable from background noise, enabling reliable automated detection by the analysis software that we employed. Regarding sample size and quantification scope, we analyzed one representative section per individual in the cohort. For each section, either the entire tissue or specific ROIs, such as ADM or PanIN lesions, were annotated and quantified. The number of nuclei or cells evaluated per section varied depending on tissue size and ROI, and this is now described in the Supplementary Materials and Methods.

      __ In material methods: the antibodies concentration must be indicated in ug/ml.__

      A: We appreciate the reviewer’s suggestion and have updated the Materials and Methods section to include antibody concentrations in µg/ml.

      __ a) Figure1b, 1c must present the following control staining in addition to presented data: i) staining of non-treated pancreas (as negative control); ii) staining of pancreas treated with Nutlin only (not-treated with cerulein) to assess the effect of Nutlin in absence of Cerulein. b) Figure 4d: the authors should repeat the experiment in p53fl/fl mice to assess nutlin off-target effect. c) Figure S1 e) there is no legend for it. d) Figure S4: which p53 exon has been deleted by CRISPr. The sequences of the sgRNA are not indicated.__

      A: We appreciate the reviewer’s suggestions and have addressed all the requested changes. For Figures 1b and 1c, we have added the necessary control stainings, including (a) staining of non-treated pancreas as a negative control and (b) staining of pancreas treated with Nutlin-3a only (without cerulein) to assess the effect of Nutlin-3a in the absence of Cerulein (Figure S1c). For Figure 4d (now Figure 4b), we have included sections from p53-deficient (KPC) mice stained for Mist1 to evaluate potential off-target effects of Nutlin-3a. Our results show no Mist1 expression in the absence of p53, suggesting that Nutlin-3a-mediated upregulation of Mist1 in ADM is p53-dependent. Additionally, we have added a legend for Figure S1e. For Figure S4, we clarified that CRISPR interference (CRISPRi) was used in this experiment rather than gene deletion. As such, the sgRNA is not designed against a specific exon, but instead targets the promoter region of the TP53 gene to suppress its transcription. We have now included the sgRNA sequence used in Figure S4d for clarity.

      Responses to Reviewer 3


      1. __ The authors suggested, based on their data, that Mist1 may be transactivated by p53 "presumably directly, across distinct cell types and in different contexts, such as oncogenic stress and DNA damage." This statement is too speculative and that is noteworthy because the experiments to get at those potential functional details (including, e.g., gene interference, biochemical assays) are not particularly difficult and would significantly improve the manuscript.__

      A: We appreciate the reviewer’s feedback. Our original statement aimed to accurately reflect our findings without overinterpretation, as we identified a conserved p53 binding site in the Bhlha15 locus, observed p53 occupancy in published ChIP-seq datasets, and demonstrated p53-dependent expression of Mist1 at both RNA and protein levels. To further support this relationship, we expanded our analysis to include p53-proficient and p53-deficient mouse PDAC cell lines, confirming the dependency of Mist1 expression on p53 (Figure 4e). Additionally, we now show that Mist1 protein was detected in lesions of Nutlin-3a–treated KC mice, but not in KPC mice, further indicating that Mist1 induction is p53-dependent in vivo (Figure 4b). While we acknowledge that direct functional testing of the p53 binding site would further strengthen the mechanistic insight, the Bhlha15 locus contains multiple p53 ChIP-seq peaks, making it difficult to isolate the contribution of individual sites. For this reason, we believe that dissecting the precise binding events underlying p53-mediated regulation of Bhlha15 goes beyond the scope of the current study, but we agree it is a valuable direction for future work.

      __ The study did not explore a novel concept beside showing that ADM can be reversed by inhibiting p53, which though may sound novel is intuitive (the focus on Mist1 alone appear narrow too).__

      A: We respectfully disagree with the reviewer’s assessment. The prevailing view in the literature is that p53 suppresses pancreatic cancer primarily by preventing the progression from PanINs to PDAC, largely through the induction of senescence in precursor lesions (Caldwell et al. Oncogene 2012; Morton et al. PNAS 2010). However, whether p53 also plays a tumor-suppressive role at earlier stages, particularly in ADM, remains unclear. Our study provides evidence that p53 regulates ADM plasticity and acinar cell identity, expanding its known functions beyond senescence induction. Additionally, the role of p53 in maintaining tissue homeostasis through the regulation of differentiation programs is an emerging and underexplored concept. Our focus on Mist1 is well justified, as we observed a significant overlap between gene expression changes in p53-proficient (KC) and p53-deficient (KPC) precursor lesions and known Mist1-regulated genes (Fig. 4a), highlighting its potential as a key mediator of p53-dependent acinar cell identity maintenance. While Mist1 is a focal point of our study, the broader implication is that p53 plays an active role in controlling acinar cell fate, which challenges the conventional view of its function solely in later-stage tumor suppression.

      __ The RNA-seq and ChIP data may provide several opportunities to get at how p53 mediates the proposed effect on ADM and it would be worthwhile to leverage those data.__

      A: We appreciate the reviewer’s suggestion. Like the reviewer, we recognize that p53 has a vast downstream network, and while additional pathways may contribute to p53-mediated cell differentiation, we believe that investigations of other mechanisms involved in this process extends beyond the scope of this manuscript and would dilute the central message rather than strengthen it.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Twardowski et al. showed that the treatment of pancreatic cancer mice models with Nutlin-3a, a drug that allows p53 stabilization, limits tumor initiation by reversing acinar-to-ductal metaplasia (ADM), which is an early event in pancreatic cancer. This study is well done, and the figures are quite convincing. It also uses a combination of state-of-the-art mouse models, most notably the inducible model that allowed cell lineage tracing. However, it is generally descriptive and provides no mechanistic detail beside suggesting that p53 potentially drives the upregulation of the transcription factor Mist1 (Bhlha15) as part of the ADM to acinar differentiation process. The authors also suggested, based on their data, that Mist1 may be transactivated by p53 "presumably directly, across distinct cell types and in different contexts, such as oncogenic stress and DNA damage." This statement is too speculative and that is noteworthy because the experiments to get at those potential functional details (including, e.g., gene interference, biochemical assays) are not particularly difficult and would significantly improve the manuscript. Besides the above comments, the study did not explore a novel concept beside showing that ADM can be reversed by inhibiting p53, which though may sound novel is intuitive (the focus on Mist1 alone appear narrow too). The RNA-seq and ChIP data may provide several opportunities to get at how p53 mediates the proposed effect on ADM and it would be worthwhile to leverage those data. The study in its current state is descriptive and appears too preliminary.

      Significance

      Besides the above comments, the study did not explore a novel concept beside showing that ADM can be reversed by inhibiting p53, which though may sound novel is intuitive (the focus on Mist1 alone appear narrow too). The RNA-seq and ChIP data may provide several opportunities to get at how p53 mediates the proposed effect on ADM and it would be worthwhile to leverage those data. The study in its current state is descriptive and appears too preliminary.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript entitled:" Drug-Induced p53 Activation Limits Pancreatic Cancer Initiation. " Twardowski et al., investigate using mouse animal models the impact of pharmacological stabilization of the wild-type p53 protein on the formation of acinar-to-ductal metaplasia (ADM) in a KrasG12D-driven mouse model of Pancreatic ductal adenocarcinoma (PDAC). The authors mostly performed immunohistochemistry to assess the differentiation status in response to treatment. The authors claims that they demonstrate that p53 stabilisation via Nutlin-3a treatment, an inhibitor of its ubiquitin ligase MDM2, significantly reduces both ADM and the formation of precursor lesions, such as pancreatic intraepithelial neoplasia (PanIN) by promoting the differentiation of ADM into acinar cells. The authors claim that the differentiation is concomitant with p53-dependent induction of the transcription factor Mist1 (also named Bhlha15), a critical inducer of acinar cell formation. The authors conclude that their data reveal a role for p53 in promoting the re-differentiation of ductal metaplasia in healthy acinar cells, preventing ductal-metaplasia to progress to Pancreatic ductal adenocarcinoma.

      Significance

      The study implies that pharmacologically engaging wild-type p53 (for example, through Nutlin-3a) may serve as a strategy to prevent or significantly delay the onset of PDAC by preserving the normal acinar cell phenotype and blocking early metaplastic changes. Doing a search in Pubmed search, no such findings has been previously published. It is a very important findings as it paves the way to clinical trial.

      The data are of excellent quality and would support the conclusions but the experiments need additional control experiments to strengthen the conclusions.

      Here are the major points that preclude publication as it is

      • For all immunohistochemistry quantification: The authors should explain better how the scoring was performed. - the authors should present a range of positive staining (negative, Weak, medium, high). The authors should state the number of sections analysed and how many cells or nuclei in total were counted per section or ROI to define the percentage of positive cells/nuclei.
      • In material methods: the antibodies concentration must be indicated in ug/ml.
      • Figure1b, 1c must present the following control staining in addition to presented data

      a. staining of non-treated pancreas (as negative control)

      b. staining of pancreas treated with Nutlin only (not-treated with cerulein) to assess the effect of Nutlin in absence of Cerulein - Figure 4d: the authors should repeat the experiment in p53fl/fl mice to assess nutlin off-target effect - Figure S1 e) there is no legend for it - Figure S4: which p53 exon has been deleted by CRISPr. The sequences of the sgRNA are not indicated.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The key findings in this paper are that using nutlin-3a to stabilise p53 a reduction of the formation of ADM and PanIN in the KrasG12D driven mouse model of PDAC are observed. They show that as p53 is stabilised by nutlin-3a ADM cells are differentiated into acinar cells, corresponding with a p53 dependent upregulation of Mist1. To show these results the authors utilised multiple mouse models and induced pancreatic damage/oncogenic stress via the injection of cerulein. Histological sections of the pancreas in the various mouse models were stained and quantified to allow the authors to come to their conclusions. The methods are presented sufficiently. The statistical analysis was adequate for this work. References are appropriate.

      Major comments:

      1. In the supplementary figures when looking at the KC vs KPC mice and the trichrome staining (S2) or looking at the muc5a and mucin levels in figure 2, the KPC mice appear to have a larger amount of PANIN formation than the KC mice which is usually indicative of further tumour progression that can occur in p53 null tumours. Due to the further progression of the tumour in the KPC mice, drugs such as nutlin-3a may have inhibitory effect on PANIN formation and nutlin results might therefore not be indicative of p53 dependence. This should at least be mentioned and discussed.
      2. When looking at how p53 controls the expression of acinar cell identity genes the authors look at MEFs when performing their GSEA (Figure 4a, b). The MEFs are used as a model for neoplastic cells but it would also be beneficial to test this in pancreatic cell lines. When looking at Bhlha15 expression (figure 4c) there is a decrease observed in the p53 containing vs knockout mice, this is from a data set GSE94566, it would be beneficial to test this in the KC and KPC mice the authors generated to see if the results can be validated. Results in 4d re mist expression should also be evaluated in KPC mice to prove p53 dependence. Finally, murine and human fibroblasts are treated with doxorubicin (figure 4e-f; figure s4b) to show Bhlha15 is upregulated, it would also be useful to show this either in pancreatic cell lines with/without p53 or from the murine tissue of the KC and KPC mice.
      3. It is unclear why varying numbers of mice have been used. For the majority of experiments, the authors use n=6 mock treated mice and n=3 or 4 nutlin-3a treated mice. For KPC mice n=4 and n=4 was used. N=3 for mock and nutlin-3a treated mice were used. Did some mice die unexpectedly during the experiment? It would be good to report this.

      Minor comments:

      1. The text is mostly clear, apart from the results section around figure 4, where it is not always clear which material has been used for analysis when referring to a previous paper. The quantification graphs should be wider as they seem squished sometimes. Changing the colours to be darker would make these more easily identifiable as the pale blue/red are sometimes difficult to see.
      2. Figure 4a and 4b can be moved to the supplementary figures.
      3. For the figure legend of S1 on the separate file for the supplemental figures there is no S1e mentioned but it is in the paper.
      4. Figure S1a needs a scale bar.

      Significance

      The study is a conscience investigation into how p53 is involved in pancreatic cancer initiation and how this can be reduced by over activation of p53. A strong point of the study is the generation of the genetic lineage mouse model. This allowed the authors to persistently label ADM cells and trace their progeny. This experiment provided strong evidence that nutlin-3a treatment can indeed reverse acini to ADM formation and prevent PanIN formation. Some of the limitations of the study involve relying on mainly immunohistochemistry to show changes in protein level in mouse tissue, western blotting could be used in conjunction with this to further validate the claims put forward in the paper. Also, the smaller amount of animal models used even though it was n=3/the disparity between the control and nutlin treated may raise some question. Usually, 6 vs 3. Possibly testing this with some lesson common Kras mutants and increasing the time in which nutlin-3a is studied in the pancreatic tumours, can it constantly prevent tumour formation.

      Other work in this area has looked at nutlin-3a and its effect on NSCLC with a Kras mutant (https://pubmed.ncbi.nlm.nih.gov/38093368/) and has shown that nutlin-3a is able to induce cell death in Kras mutant NSCLC cells. This paper also builds on work by (https://pmc.ncbi.nlm.nih.gov/articles/PMC5730340/#S9) who looks at NRF2-mediated induction of MDM2 and accumulation of p62 leading to PDAC and how inhibition of MDM2 by nutlin-3a may reduce this progression and this is shown in the present paper. The study for review advances using MDM2 inhibitors such at nutlin-3a in a clinical manner by looking at how it affects the progression of PDAC in mice and starts to elucidate the interactions which cause this to happen.

      The research present is specialised research that will hopefully be able to be translated to the clinic, if the use of nutlin-3a is able to prevent progression to PDAC in a mouse it would be useful to see if this also possible in patient derived primary cell lines to further elucidate the mechanism of this work.

      My expertise: P53, mouse work, lung cancer.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewer for their constructive comments and the fair and interesting discussion between reviewers.

      __Reviewer #1 __

      We are delighted to read that the reviewer finds the manuscript “very clear and of immediate impact […] and ready for publication” regarding this aspect. We have toned down the conclusion, proposing rather than concluding that “the incapacitation of Cmg2[KO] intestinal stem cells to function properly […] is due to their inability to transduce Wnt signals”.

      We have addressed the 3 points that were raised as well as the minor comments.

      Point #1

      The mouse mutant is just described as 'KO', referring to the previous work by the authors. The cited work simply states that this is a zygotic deletion of exon 3, which somehow leads to a decrease in protein abundance that is almost total in the lung but not so clear in the uterus. Exon 3 happens to be 72 bp long [https://www.ncbi.nlm.nih.gov/nuccore/NM_133738], so its deletion (assuming there are no cryptic splicing sites used) leads to an internal in-frame deletion of 24 amino acids. So, at best, this 'KO' is not a null, but a hypomorphic allele of context-dependent strength.

      Unfortunately, neither the previous work nor this paper (unless I have missed it!) contains information provided about the expression levels of Cmg2 in the intestine of KO mice - nor which cell types usually express it (see below). I think that using anti Cmg2 in WB and immunohistofluorescence of with ISC markers with intestine homogenate/sections of wild-type and mutant mice would be necessary to set the stage for the rest of the work.

      We now provide and explanation and characterization the Cmg2KO mice. Exon 3 indeed only encodes a short 24 amino acid sequence. This exon however encodes a ß-strand that is central to the vWA domain of CMG2, and therefore critical for the folding of this domain. As now shown in Fig. S1c, CMG2Dexon3 is produced in cells but cleared by the ER associated degradation pathway, therefore it is only detectable in cells treated with the proteasome inhibitor MG132, at a slightly lower molecular weight than the full-length protein. This is consistent, and was inspired by the fact that multiple Hyaline Fibromatosis missense mutations that map to the vWA domain lead to defective folding of CMG2, further illustrating that this domain is very vulnerable to modifications. In Fig. S1c, we moreover now show immunoprecipitation of Cmg2 from colonic tissue of wild-type (WT) and knockout (KO) mice, which confirm the absence of Cmg2 protein in Cmg2KO samples.

      Point #2

      Connected to the previous point, the expression pattern of Cmg2 in the intestine is not described. Maybe this is already established in the literature, but the authors do not refer to the data. This is important when considering that the previous work of the authors suggests that Cmg2 might contribute to Wnt signalling transduction through physical, cis interactions with the Wnt co-receptor LRP6. Therefore, one would expect that Cmg2 would be cell-autonomously required in the intestinal stem cells.

      The expression pattern of Cmg2 in the gut has not been characterized and is indeed essential to understanding its function. To address this gap, we now added a figure (Fig. 1) providing data from publicly available RNA-seq datasets and from our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      In the RESULTS section we now mention:

      To investigate Cmg2 expression in the gut, we first analyzed publicly available spatial and scRNA-seq datasets to identify which cell types express Cmg2 across different gut regions. Spatial transcriptomic data from the mouse small intestine and colon revealed that Cmg2 is broadly expressed throughout the gut, including in the muscular, crypt, and epithelial layers (Fig. 1A–C). To validate these findings, we performed RNAscope in situ hybridization targeting Cmg2 in the duodenum and colon of wild-type mice. The expression pattern observed was consistent with the spatial transcriptomics data (Fig. 1D–E). We then analyzed scRNA-seq data from the same dataset to assess cell-type-specific expression in the mouse colon. Cmg2 was detected at varying levels across multiple cell types, including enterocytes and intestinal stem cells, as well as mesenchymal cells, notably fibroblasts.

      Of note for the reviewer, not mentioned in the manuscript, this wide-spread distribution of Cmg2 across the different cell types is not true for all organs. We have recently investigated the expression of Cmg2 in muscle and found that it is almost exclusively expressed in fibroblasts (so-called fibro-adipocyte progenitors) and very little in any other muscle cells, in particular fibers.

      Interestingly also, as now mentioned in the manuscript and shown in Fig. S1,the ANTXR1 protein, which is highly homologous to Cmg2 at the protein level and share its function of anthrax toxin receptor, displayed a much more restricted expression pattern, being confined primarily to fibroblasts and mural cells, and notably absent from epithelial cells. This differential expression highlights a potentially unique and epithelial-specific role for Cmg2 in maintaining intestinal homeostasis.

      Point #3

      The authors establish that the regenerating crypts of Cmg2[KO] mice are unable to transduce Wnt signalling, but it is not clear whether this situation is provoked by the DSS-induce injury or existed all along. Can Cmg2[KO] intestinal stem cells transduce Wnt signalling before the DSS challenge? If they were, it might suggest that the 'context-dependence' of the Cmg2 role in Wnt signalling is contextual not only because of the tissue, but because of the history of the tissue or its present structure. It would also suggest that Cmg2 mutant mice, unless reared in a germ-free facility for life, would eventually lose intestinal homeostasis, and maybe suggest the level of intervention/monitoring that HFS patients would require. It might also provide an explanation in case Cmg2 was not expressed in ISCs - if the state of the tissue was as important as the presence of the protein, then the effect on Wnt transduction could be indirect and therefore it might not be required cell-autonomously.

      We agree that understanding whether Cmg2KO intestinal stem cells are intrinsically unable to transduce Wnt signals, or whether this defect is contextually induced following injury (such as DSS treatment), is a critical point.

      As a first line of evidence, we show than under homeostatic condition, Wnt signaling appears largely intact in Cmg2KO crypts, with comparable levels of ß-catenin and expression levels of canonical Wnt target genes (e.g., Axin2, Lgr5) to those observed in WT animals (Figs. S1j-l and S3d-e). This indicates that Cmg2 is not essential for basal Wnt signaling under steady-state conditions.

      These findings thus support the idea that the requirement for Cmg2 in Wnt signal transduction is context-dependent—not only at the tissue level but also temporally, being specifically required during regenerative processes or in altered microenvironments such as during inflammation or epithelial damage. This context-dependence may reflect changes in the composition or accessibility of Wnt ligands, receptors, or matrix components during repair, where Cmg2 could play a scaffolding or stabilizing role.

      These aspects are now discussed in the text.

      I think points 1 and 2 are absolutely fundamental in a reverse genetics investigation. Point 3 would be nice to know but the outcome would not change the tenet of the paper. I believe that the work needed to deal these points can be performed on archival material. I do not think the mechanism proposed can be taken from 'plausible' to 'proven' without proposing substantial additional investigation, so I will not suggest any of it, as it could well be another paper.

      We have addressed points 1 and 2, and provided evidence and discussion for Point 3.

      __Minor points __

      1- Figure 1 legend says "In (c), results are mean {plus minus} SEM" - this seems applicable to (d) as (c) does not show error whiskers.

      We thank the reviewer for picking up this error. We modified : “In (c), results are median” and “In (d, f and g) Results are mean ± SEM.”

      2- Figure 1 legend says "(d) Body weight loss, (f) the aspect of the feces and presence of occult blood were monitored and used for the (e) DAI. Results are mean {plus minus} SEM. Each dot represents the mean of n = 12 mice per genotype". This part looks like has suffered some rearrangement of words. The first instance of (f) should be (e), I guess, and I am not sure what "(e) DAI" means. And for (e), "mean {plus minus} SEM" does not seem applicable. This needs some light revision.

      The legend was clarified as followed : “(d) __Body weight loss, and (e) aspect of the feces and presence of occult blood were monitored and used to evaluate Disease activity index in (f).__

      3 - Figure 1H legend does not say which statistical test was made in the survival experiment in (h) - presumably log-rank? A further comment on the survival statistics: euthanised animals should not be counted towards true mortality when that is what is recorded as an 'event'. They should be right-censored. However, in this case, reaching the euthanasia criterion is just as good an indicator of health as mortality itself. So, simply by changing the Y axis from 'survival' to 'event-free survival' (or something to that effect), where 'events' are either death or reaching the euthanasia criterion, leaves the analysis as it is, and authors do not need to clarify that figure 1H shows "apparent mortality", as it is straightforward "complication-free survival" (just not entirely orthogonal to weight loss).

      The Y axis was changed from 'survival' to “percentage of mice not reaching the euthanasia criterion”.

      4 - Some density measurements are made unnecessarily on arbitrary units (per field of view) - this should be simple to report in absolute measures (i.e. area of tissue screened or, better still, length of epithelium screened).

      Because the aera of tissue can vary significantly between damages, regenerating and undamaged tissue, we reported the length of epithelium screened as suggested : “per 800um tissue screened” in Fig S1c and Fig 2b.

      5 - Figure 2E should read "percent involvement"

      This has been corrected.

      6 - Figure 2J should read "lipocalin..."

      This has been corrected.

      7 - In section "CMG2 Is Dispensable for YAP/TAZ-Mediated Reprogramming to Fetal-Like Stem Cells", the authors write ""We measured the mRNA levels of two additional YAP target genes, Cyr61 and CTGF...". I presume the "additional" is because Ly6a is also a target of YAP/TAZ, but if the reader does not know, it is puzzling. I would suggest to make this link explicit.

      We added : “In addition to the fetal-like stem cell marker Ly6a, which is a YAP/TAZ target gene, we measured the mRNA levels of two others YAP target genes, Cyr61 and CTGF”

      8 - In Figures S2, 3 and S3, I think that the measures expressed as "% of homeostatic X in WT" really mean "% of average homeostatic X in WT". This should be made clear somewhere.

      We added: “Dotted line represents the average homeostatic levels of Cmg2 WT” in figure legends

      9 - In panel C, the nature of the data is not entirely clear. First, the corresponding part of the legend says "Representative images of n=4 mice per genotype" which I presume should refer to panel B. Then, the graph plots 4 data points, which suggests that they correspond to 4 mice - but how many fields of view? Also, the violin plot outline is not described - I presume it captures all the data points from the coarse-grained pixel analysis, but it should be clarified.

      It was modified as suggested : “(c) Results are presented as violin plot of the Ly6a mean intensity of all data points from the coarse-grain analysis. Each symbol represents the mean per mice of n=4 mice per condition. Results are mean ± SEM. Dotted line represents the average homeostatic levels of Cmg2WT. P values obtained by two-tailed unpaired t test.”

      10 - In Figure 3H and 3I, I would suggest to add the 7+3 timepoint where the data come from.

      We unfortunately do not understand the suggestion of the reviewer, given that these panels show the 7+3 time point.

      11 - In section "CMG2 Is Critical for Restoring the Lgr5+ Intestinal Stem Cell Pool", the authors say "...The mRNA levels of ... LRP6, β-catenin (Fig. S3a-b), and Wnt ligands (Wnt5a, 5b, and 2b) were comparable between the colons of Cmg2WT and Cmg2KO mice (Fig. S3c)..." without clarifying in which context - one needs to read the figure legend to realise this is "timepoint 7+3". I suggest to add "in the recovery phase" or "in regenerating colons" or something shorter, just to guide the reader.

      We added : “Initially, we quantified the expression of key molecular components involved in Wnt signaling in mice colon 3 days after DSS withdrawal using qPCR.”

      12 - Like with the previous point, it is not clear when the immunohistofluorescence of B-catenin is made - not even in the legend, as far as I could see. The only hint is that authors say "the nuclei of cells in the atrophic crypts of Cmg2KO..." with 'atrophic' probably indicating again the 7+3 timepoint.

      We have changed the text and now mention “Next, we analyzed β-catenin activation in the colon of Cmg2WT and Cmg2KO mice during the recovery phase.”

      13 - A typo in the discussion: tunning for tuning.

      This has been corrected.

      14 - In the discussion, the authors talk about the 'CMG2' protein (all caps - formatting convention for human proteins) but before they were referring to 'Cmg2' (formatting convention for mouse proteins). That is fine but some of the statements where "CMG2" is used clearly refer to observations made in the mouse.

      We have now used Cmg2, whenever referring to the mouse protein.

      15 - Typos in methods: "antigen retrieval by treating [with] Proteinase K"; "Image acquisition and analyze [analysis]"; "All details regarding code used for immunofluorescence analysis”.

      This has been corrected.

      __Reviewer #2 __

      We are very pleased to read that the reviewer found the study “overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted”.

      For this reader, one additional thought comes to mind. If I understand the field correctly it would be informative to know with greater confidence where - in what cell type, epithelial or mesenchymal - the CMG2-LRP6-WNT interaction occurs.

      This point was also raised by Reviewer I, and we have now added a new Figure 1, that describes Cmg2 expression in the gut, based both on from publicly available RNA-seq datasets and our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      After injury the CMG2-KO mouse epithelium exhibits defective WNT signal transduction - as evidenced by failure of b-catenin to translocate into the nucleus. At first glance, this result is a disconnect with the paper by van Rijin that claims the defect in Hyaline Fibromatosis Syndrome cannot be due to loss of CMG2 expression/function in the barrier epithelial cell - a claim based on the mostly normal phenotypes of human CMG2 KO duodenal organoids. But the human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated.

      We thank the reviewer for raising this point and we fully agree. We now specify in the text that the human CMG2-KO duodenoids showed blisters, indeed consistent with a defect in epithelial structure/function, and that they were grown on high Wnt media which likely obscure the CMG2 requirement.

      I think it would be informative to prepare colon organoids (and duodenoids) from WT and CMG2-KO mice to quantify their WNT dependency during establishment and maintenance of the stem-like (and WNT-dependent) state. If CMG2 acts within the epithelial cell to affect WNT signaling (regardless of WNT source), organoids prepared from colons of CMG2-KO mice would require more WNT in culture media to establish and maintain the stem cell proliferative state - when compared to organoids prepared from WT mice. This can be quantified (and confirmed molecularly by transgene expression if successful). Enhanced dependency of high concentrations of exogenous WT would be evidence for a primary defect in WNT-(LRP2)-CMG2 signal transduction localized to the epithelial barrier cell - thus addressing the apparent discrepancy with the van Rijin paper - and for my part, advancing the field. And the discovery of a defect in the epithelium itself for WNT signal transduction would implicate a biologically most plausible mechanism for development of protein losing enteropathy.

      By no means do I consider these experiments to be required for publication (especially if considered to be incremental or already defined - WNT-CMG2 is not my field of research). This study already makes a meaningful contribution to the field as I state above. But in the absence of new experimentation, the issue should probably be discussed in greater depth.

      We are working out conditions to grow colon organoids that from WT and Cmg2 KO mice, indeed playing around with the concentrations of Wnt in the various media to identify those that would best mimic the regeneration conditions. This is indeed a study in itself. We have however included a discussion on this point in the manuscript as suggested.

      __Reviewer #3: __

      We thank the reviewer for her/his insightful comments.

      The premise is that the causative germline mutated gene, CMG2/ANTRX2, may have a functional role in colonic epithelium in addition to controlling the ECM composition. There is little background information but one study has shown no primary defect in epithelial organoids grown from patients with the syndrome. This leads the authors to wonder if non-homeostatic, conditions might reveal a function role for the gene in regeneration.

      Reviewer 2 commented on the fact that “human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated”.

      We have now added a discussion on this point in the manuscript.

      The authors' approach to test the hypothesis is to use a mouse germline knockout model and to induce colitis and regeneration by the established protocol of introducing dextran sodium sulfate (DSS) into the drinking water for five days. In brief there is no phenotype apparent in the untreated knockout (KO) but these animals show a more severe response to DSS that requires them to be killed by 10 days after the start of treatment. This effect following phenotypic characterisation of the colonic epithelium is interpreted as showing the CMG2 is a Wnt modifier required for the restoration of the intestinal stem cell population in the final stages of repair.

      The experiment and analysis seem reasonably well executed - although a few specific comments follow below. The narrative is simple and easy to understand. However, there are significant caveats that cast doubts on the interpretation made that loss of CMG2 impairs the transition of colonic epithelial cells from a fetal like state to adult ISCs.

      First there is only a single approach and single type of experiment performed. There is a lack of independent validation of the phenotype and how it is mediated.

      We do not fully understand what type of independent validation of the phenotype the reviewer would have liked to see. Is it the induction of intestinal damage using a stress other than DSS?

      The DSS dose in this kind of experiment is often determined empirically in individual units. Here the 3% used is within published range but at upper end. The control animals show a typical response with symptoms of colitis worsening for 2-3 days after the removal of DSS and then recovery commonly over another 5-7 days. Here the CMG2 KO mice fail to recover and are killed by 9 or 10 days. The authors attempt to exploit the time course by identifying normal initial (7days) and defective late (10days) repair phases in KO animals when compared to controls. It is from this comparison that conclusions are drawn. However, the alternative interpretation might be that the epithelium of KO animals is so badly damaged, and indeed non-existent (from viewing Fig2a), that it is incapable of mounting any other response other than death and that the profiling shown is of an epithelium in extremis. The repair capability and dynamics of the KO would have been better tested under more moderate DSS challenge, if this experiment had been regarded as a pilot rather than as definitive.

      The choice of 3% DSS was in fact based on a pilot experiment. As now shown in Fig. S4, we tested different concentrations and found that 3% DSS was the lowest concentration that reliably induced the full spectrum of colitis-associated symptoms, including significant body weight loss, diarrhea, rectal bleeding (summarized in the Disease Activity Index), as well as macroscopic signs such as colon shortening and spleen enlargement. Based on these criteria, we selected 3% DSS for the study described in the manuscript.

      In this model, WT mice showed a typical progression: body weight stabilized rapidly after DSS withdrawal, with resolution of diarrhea and rectal bleeding. Histological analysis at day 9 revealed signs of epithelial regeneration, including hypertrophic crypts and increased epithelial proliferation.

      In contrast, Cmg2KO mice failed to initiate this recovery phase. Clinical signs such as weight loss, diarrhea, and bleeding persisted after DSS withdrawal, ultimately necessitating euthanasia at day 9–10 due to humane endpoint criteria. Unfortunately, this prevented us from exploring later timepoints to determine whether regeneration was delayed or completely abrogated in the absence of Cmg2.

      Regarding the severity of epithelial damage, as raised by Reviewer 1, we now provide detailed histological scoring in the supplementary data. This analysis shows that the severity of inflammation and crypt damage was similar between WT and KO animals, as were inflammatory markers such as Lipocalin-2. The key difference lies in the extent of tissue involvement. While the lesions in WT mice were more localized, Cmg2KO mice displayed widespread and diffuse damage with no sign of regeneration as shown by the absence of hypertrophic crypts and a marked reduction in both epithelial coverage and proliferative cells. Importantly, at day 7, the percentage of epithelial and proliferating cells was comparable between genotypes, further supporting the idea that Cmg2KO mice failed to initiate this recovery phase and present a defective repair response.

      The animals used were young (8 weeks) and lacked any obvious defect in collagen deposition. Does this change with treatment? Even if not, is it possible that there is a defect in peristalsis or transit time of gut contents, resulting in longer dwell times and higher effective dose of DSS to the KO epithelium?

      Collagen deposition, particularly of collagen VI, is known to increase in response to intestinal injury and plays a critical role in promoting tissue repair following DSS-induced damage (Molon et al., PMID: 37272555). As suggested, we investigated whether Cmg2KO mice exhibit abnormal collagen VI accumulation following DSS treatment.

      Our results show that, consistent with published data, WT mice exhibit a marked increase in collagen VI expression during the acute phase of colitis, with levels returning toward baseline following DSS withdrawal. A similar expression pattern was observed in Cmg2KO mice, with no significant differences in Col6a1 mRNA levels between WT and KO animals throughout the entire time course of the experiment. This observation was further confirmed at the protein level by western blot and immunohistochemistry analyses, suggesting that the impaired regenerative capacity observed in Cmg2KO mice is independent of Collagen VI.

      Regarding the possibility of altered peristalsis or intestinal transit time contributing to increased DSS exposure in KO mice, this is indeed a possibility. Although we did not directly measure gut motility in this study, we did not observe any signs of intestinal obstruction or fecal retention in Cmg2KO mice. Indeed, during the experiment, animals were single caged for 30min in order to collect feces and no difference in the amount of feces collected was observed between WT and KO mice, arguing against a substantial difference in transit time (see figure below). The possible altered peristalsis and these observations are now mentioned in the discussion.

      Is CMG2 RNA and protein expressed in the colonic epithelium? It is not indicated or tested in the submitted manuscript. This reviewer struggled to find evidence, notably it did not seem to be referenced in the organoid paper they reference in introduction (ref 13).

      This very valid point was also raised by Reviewers 1 and 2. The expression pattern of Cmg2 in the gut has indeed not been characterized and is essential to understanding its function. To address this gap, we added a figure (Fig. 1) providing data from publicly available RNA-seq datasets and from our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      __Specific comments: __

      Figure 3 c-e and associated text are confusing. In c the Y scale seems inappropriate to show percentages up to 15,000%.

      In this graph values are normalized to homeostatic level of WT mice which represent 100%

      In d and e the use of percentages may by correct. However, it is claimed in text that Cty61 and CTFG are upregulated in the KO. That is not what the plots appear to show as the compare to WT untreated cells, in which case the KO have not downregulated these genes in the way the controls have.

      As clarified in the text, under regenerative conditions, a transient activation of YAP signaling is crucial to induce a fetal-like reversion of intestinal stem cells. However, in a subsequent phase, the downregulation of YAP and the reactivation of Wnt signaling are necessary to complete intestinal regeneration. Several studies have highlighted a strong interplay between the Wnt and YAP pathways, suggesting that their coordinated regulation is essential for effective gut repair. Nevertheless, the precise mechanisms governing this interaction remain incompletely understood.

      In our model, this critical transition—YAP downregulation and Wnt reactivation—appears to be impaired. CMG2 may either hinder Wnt reactivation directly, or lead to sustained YAP signaling, which in turn suppresses activation of the Wnt pathway. Further studies, using in-vivo model and organoid models, will be necessary to understand the mechanistic role of Cmg2 in this regulatory process.

      A precision of the figure has been updated as followed: both of which were significantly upregulated in the injured colons of Cmg2KO mice compared to DSS-injured Cmg2WT mice

      __**Referees cross-commenting** __

      Rev2 Points 1 and 2 made by Referee 1 (and point 4 of Referee 3) appear most reasonable, and if not already done should be.

      We have indeed addressed these 2 points.

      I also noted the more severe morphology of DSS damaged epithelium shown in Fig 2a noted by Referee 3 - and this I agree is a confounding factor. […] For my part, the concern is understandable but likely not operating in a confounding way. And the evidence for the reprogramming of the damaged epithelium into "fetal-like stem cells" (the 1st step in restitution of lost stem cells) occurs in both WT and KO mice - and these data are strong. For this reader, the block convincingly shows up for KO mouse at the WNT dependent step

      The representative image has been updated, and a transverse section has been added to better illustrate that, although both epithelium and crypt structures can be present, the epithelial morphology differs significantly. Indeed, the regenerating epithelium of Cmg2WT mice displays a thick epithelial layer with well-polarized epithelial cells, whereas in cmg2KO mice, the epithelium appears atrophic, characterized by a thinner epithelial layer and elongated epithelial cells.

      __Rev 3 __

      This reviewer remains sceptical. I agree the authors performed the experiment well to confirm that DSS dosing was as equivalent as possible across the study. But DSS acts to induce colitis because it is concentrated in the colonic lumen as water is absorbed. Also ECM responses and remodelling are a central part of colitis models. And my concern is that the actual exposure in the KO group is influenced by transit of faeces/DSS is secondary to the known action of CMG2 on collagen deposition. The consequence of this being a protracted damage phase in which a restoration of adult stem cells would not be expected and leading to epithelial failure.

      However, we differ. I might propose that the authors are asked to investigate and confirm expression of CMG2 in the epithelium and to repeat the analysis of collagen levels they performed on untreated CMG2 KO mice on colons from CMG2 KO mice having received DSS to see if these differ from controls.

      This has now been done.

      __Rev 1 __

      Both reviewer #2 and reviewer #3 make relevant points, from the point of view of extracting as much biological knowledge as we can from the observations reported in the manuscript.

      Reviewer #2 suggestion to use Cmg2[KO] organoids to investigate the dependence of Wnt transduction on Cmg2 is the type of experiments I refrained to propose. However, I think the "skeleton" of the mechanism is there and is reasonably solid. Fleshing it out may well be another paper.

      I agree with Reviewer #3 objections to the timing and severity of the DSS damage. However, I am not sure how much they invalidate the main tenet of the paper:

      • DSS may affect Cmg2[KO] more severely, but the overall disease score is comparable during the DSS treatment. If this severity was enough to be the main driver of the phenotype, it should have left a mark in the Histological and Disease activity scores. In this regard, I think it would be helpful if the authors provided an expanded version of Figure 2A with examples of the different levels of "Crypt damage" scored, and the proportions for each. This could be in the supplementary material and would balance the impressions induced by a single image.

      As suggested, we included a detail of histological score including the crypt damage score in Supplementary Fig 3i showing no significant differences in crypt damage between Cmg2WT and Cmg2KO mice.

      • If DSS affected the recovery, this would also be compatible with having a more severe histological phenotype (which is not shown overall, just in Fig 2A) because one would also expect the tissue to attempt regeneration during the 7 days of DSS treatment.

      This is an interesting point, and we now allude to this aspect in the manuscript.

      • The only objection that I find difficult to argue is the effective duration of the treatment. If indeed peristalsis is affected, it may be that during the 'recovery' phase there is still DSS in the intestine. This could be perhaps verified using a DS detection assay (e.g. https://arxiv.org/pdf/1703.08663) on the intestinal contents or the faeces of the mice during the 3-day recovery period.

      We have attempted to obtain and purchase Heparin Red to perform this assay. Unfortunately, we have not obtained the reagent, which has never been delivered. We now also mention the following in the Discussion:

      One could envision that Cmg2KO mice have a defect in peristalsis resulting in longer dwell times and possibly higher effective dose of DSS to the KO epithelium. We however did not observe any signs of intestinal obstruction or fecal retention in Cmg2KO mice. Animals were single-caged for 30 min to collect feces. We did not observe any difference in amounts collected from WT and KO mice, arguing against a substantial difference in transit time of gut contents. Moreover, if DSS affected the recovery, one would have expected a more severe histological phenotype in the colon of Cmg2KO since the tissue likely already attempts regeneration during the 7 days of DSS treatment. But this was not the case. Therefore, while we cannot formally rule out the presence of residual DSS in Cmg2KO mice during the DSS withdrawal phase, there is currently no indication that this was the case.

      I think of what the aim of scholarly publication is, with this paper, and I find myself going back to a statement of the authors' discussion - that this work suggests that infants risking death may be offered (compassionate, I guess) IBD treatment. What does this hinge upon? I think, on the basic observation that diarrhoea (in the mouse model) is not intrinsic but caused by an inflammation-promoting insult. Is this substantiated? I think it is. Could we learn more biology from this disease model, about Wnt and about how ECM affects tissue regeneration? Certainly. Can this learning wait? I believe it can.

      We thank the reviewer for this statement.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript has a good rationale in trying to understand why infants with an inherited condition, Hyaline Fibromatosis Syndrome, that is primarily associated with turnover and deposition of extracellular collagen also develop severe diarrhoea that can contribute to their premature death. The premise is that the causative germline mutated gene, CMG2/ANTRX2, may have a functional role in colonic epithelium in addition to controlling the ECM composition. There is little background information but one study has shown no primary defect in epithelial organoids grown from patients with the syndrome. This leads the authors to wonder if non-homeostatic, conditions might reveal a function role for the gene in regeneration.

      The authors' approach to test the hypothesis is to use a mouse germline knockout model and to induce colitis and regeneration by the established protocol of introducing dextran sodium sulfate (DSS) into the drinking water for five days. In brief there is no phenotype apparent in the untreated knockout (KO) but these animals show a more severe response to DSS that requires them to be killed by 10 days after the start of treatment. This effect following phenotypic characterisation of the colonic epithelium is interpreted as showing the CMG2 is a Wnt modifier required for the restoration of the intestinal stem cell population in the final stages of repair.

      The experiment and analysis seem reasonably well executed - although a few specific comments follow below. The narrative is simple and easy to understand. However, there are significant caveats that cast doubts on the interpretation made that loss of CMG2 impairs the transition of colonic epithelial cells from a fetal like state to adult ISCs.

      Significance

      1. First there is only a single approach and single type of experiment performed. There is a lack of independent validation of the phenotype and how it is mediated.
      2. The DSS dose in this kind of experiment is often determined empirically in individual units. Here the 3% used is within published range but at upper end. The control animals show a typical response with symptoms of colitis worsening for 2-3 days after the removal of DSS and then recovery commonly over another 5-7 days.

      Here the CMG2 KO mice fail to recover and are killed by 9 or 10 days. The authors attempt to exploit the time course by identifying normal initial (7days) and defective late (10days) repair phases in KO animals when compared to controls. It is from this comparison that conclusions are drawn.

      However, the alternative interpretation might be that the epithelium of KO animals is so badly damaged, and indeed non-existent (from viewing Fig2a), that it is incapable of mounting any other response other than death and that the profiling shown is of an epithelium in extremis. The repair capability and dynamics of the KO would have been better tested under more moderate DSS challenge, if this experiment had been regarded as a pilot rather than as definitive. 3. The animals used were young (8 weeks) and lacked any obvious defect in collagen deposition. Does this change with treatment? Even if not, is it possible that there is a defect in peristalsis or transit time of gut contents, resulting in longer dwell times and higher effective dose of DSS to the KO epithelium? 4. Is CMG2 RNA and protein expressed in the colonic epithelium? It is not indicated or tested in the submitted manuscript. This reviewer struggled to find evidence, notably it did not seem to be referenced in the organoid paper they reference in introduction (ref 13).

      Specific comments:

      Figure 3 c-e and associated text are confusing. In c the Y scale seems inappropriate to show percentages up to 15,000%. In d and e the use of percentages may by correct. However, it is claimed in text that Cty61 and CTFG are upregulated in the KO. That is not what the plots appear to show as the compare to WT untreated cells, in which case the KO have not downregulated these genes in the way the controls have.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper uses mice lacking Capillary Morphogenesis Gene 2 (CMG2- KO) mice to investigate the pathogenic mechanism underlying the protein losing enteropathy seen in children with severe Hyaline Fibromatosis Syndrome. Significance of the work is further enhanced as the intestinal phenotype induced by CMG2-KO provided a model system (with robust validated tools) for testing newly emerging (and paradigm shifting) ideas in mechanisms of tissue regeneration after injury - generalizable to tissue restitution beyond the intestine.

      The study shows that in the mouse colon CMG2 plays a critical role in recovery from mucosal/epithelial damage chemically induced by dextran-sulfate-sodium (DSS). Mice lacking CMG2 failed to recover from DSS colitis with no evidence for restitution of the DSS-damaged epithelium. WT mice recovered after DSS removal.

      The first step in restitution of epithelial damage in the intestine, when the epithelial stem-cell populations are depleted as in this model of DSS colitis, occurs by the transformation of surviving differentiating/differentiated epithelial cells back into a stem-cell-like (fetal-cell-like) state. This step in the process was found to occur normally in the CMG2 KO mouse. The block in restitution was located to the step where de-differentiated (fetal-cell-like) colonocytes are induced back into their WNT-dependent proliferative state - thus replenishing the normally proliferating stem (LGR5+) cells of the colonic crypt. The reason for this failure is explained by a defect in WNT signaling in the injured colons of CMG2 KO mice, as assessed by failure of -catenin translocation into the nucleus of barrier epithelial cells - a down-stream effect of WNT signaling and consistent with the dependence on CMG2 for WNT signaling in other experimental systems.

      The study is overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted. The paper makes a meaningful contribution to the field. It models an experiment of nature to test, delineate, and verify disease pathogenesis and a newly revised mechanism for mucosal tissue repair.

      For this reader, one additional thought comes to mind. If I understand the field correctly it would be informative to know with greater confidence where - in what cell type, epithelial or mesenchymal - the CMG2-LRP6-WNT interaction occurs.

      After injury the CMG2-KO mouse epithelium exhibits defective WNT signal transduction - as evidenced by failure of -catenin to translocate into the nucleus. At first glance, this result is a disconnect with the paper by van Rijin that claims the defect in Hyaline Fibromatosis Syndrome cannot be due to loss of CMG2 expression/function in the barrier epithelial cell - a claim based on the mostly normal phenotypes of human CMG2 KO duodenal organoids. But the human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated.

      I think it would be informative to prepare colon organoids (and duodenoids) from WT and CMG2-KO mice to quantify their WNT dependency during establishment and maintenance of the stem-like (and WNT-dependent) state. If CMG2 acts within the epithelial cell to affect WNT signaling (regardless of WNT source), organoids prepared from colons of CMG2-KO mice would require more WNT in culture media to establish and maintain the stem cell proliferative state - when compared to organoids prepared from WT mice. This can be quantified (and confirmed molecularly by transgene expression if successful). Enhanced dependency of high concentrations of exogenous WT would be evidence for a primary defect in WNT-(LRP2)-CMG2 signal transduction localized to the epithelial barrier cell - thus addressing the apparent discrepancy with the van Rijin paper - and for my part, advancing the field. And the discovery of a defect in the epithelium itself for WNT signal transduction would implicate a biologically most plausible mechanism for development of protein losing enteropathy.

      By no means do I consider these experiments to be required for publication (especially if considered to be incremental or already defined - WNT-CMG2 is not my field of research). This study already makes a meaningful contribution to the field as I state above.

      But in the absence of new experimentation, the issue should probably be discussed in greater depth.

      Significance

      The study is overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted. The paper makes a meaningful contribution to the field. It models an experiment of nature to test, delineate, and verify disease pathogenesis and a newly revised mechanism for mucosal tissue repair.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this work, Bracq and colleagues provide clear evidence that the persistent diarrhoea seen in a mouse model of Hyaline Fibromatosis Syndrome is related to the inability of their intestinal epithelium to properly regenerate. This is very clear and of immediate impact. This aspect of the paper, I think, is ready for publication, and would merit immediate dissemination on its own. It is great that the manuscript is in bioRxiv already.

      I am not so thoroughly convinced about the mechanism that the author propose to explain the incapacitation of Cmg2[KO] intestinal stem cells to function properly. The authors propose that it is due to their inability to transduce Wnt signals, and while this is plausible, I think there are few things that the paper should contain before this can be proposed firmly:

      Point #1

      The mouse mutant is just described as 'KO', referring to the previous work by the authors. The cited work simply states that this is a zygotic deletion of exon 3, which somehow leads to a decrease in protein abundance that is almost total in the lung but not so clear in the uterus. Exon 3 happens to be 72 bp long [https://www.ncbi.nlm.nih.gov/nuccore/NM_133738], so its deletion (assuming there are no cryptic splicing sites used) leads to an internal in-frame deletion of 24 amino acids. So, at best, this 'KO' is not a null, but a hypomorphic allele of context-dependent strength. Unfortunately, neither the previous work nor this paper (unless I have missed it!) contains information provided about the expression levels of Cmg2 in the intestine of KO mice - nor which cell types usually express it (see below). I think that using anti Cmg2 in WB and immunohistofluorescence of with ISC markers with intestine homogenate/sections of wild-type and mutant mice would be necessary to set the stage for the rest of the work.

      Point #2

      Connected to the previous point, the expression pattern of Cmg2 in the intestine is not described. Maybe this is already established in the literature, but the authors do not refer to the data. This is important when considering that the previous work of the authors suggests that Cmg2 might contribute to Wnt signalling transduction through physical, cis interactions with the Wnt co-receptor LRP6. Therefore, one would expect that Cmg2 would be cell-autonomously required in the intestinal stem cells.

      Point #3

      The authors establish that the regenerating crypts of Cmg2[KO] mice are unable to transduce Wnt signalling, but it is not clear whether this situation is provoked by the DSS-induce injury or existed all along. Can Cmg2[KO] intestinal stem cells transduce Wnt signalling before the DSS challenge? If they were, it might suggest that the 'context-dependence' of the Cmg2 role in Wnt signalling is contextual not only because of the tissue, but because of the history of the tissue or its present structure. It would also suggest that Cmg2 mutant mice, unless reared in a germ-free facility for life, would eventually lose intestinal homeostasis, and maybe suggest the level of intervention/monitoring that HFS patients would require. It might also provide an explanation in case Cmg2 was not expressed in ISCs - if the state of the tissue was as important as the presence of the protein, then the effect on Wnt transduction could be indirect and therefore it might not be required cell-autonomously.

      I think points 1 and 2 are absolutely fundamental in a reverse genetics investigation. Point 3 would be nice to know but the outcome would not change the tenet of the paper. I believe that the work needed to deal these points can be performed on archival material. I do not think the mechanism proposed can be taken from 'plausible' to 'proven' without proposing substantial additional investigation, so I will not suggest any of it, as it could well be another paper.

      A few minor points picked along the way:

      1. Figure 1 legend says "In (c), results are mean {plus minus} SEM" - this seems applicable to (d) as (c) does not show error whiskers.
      2. Figure 1 legend says "(d) Body weight loss, (f) the aspect of the feces and presence of occult blood were monitored and used for the (e) DAI. Results are mean {plus minus} SEM. Each dot represents the mean of n = 12 mice per genotype". This part looks like has suffered some rearrangement of words. The first instance of (f) should be (e), I guess, and I am not sure what "(e) DAI" means. And for (e), "mean {plus minus} SEM" does not seem applicable. This needs some light revision.
      3. Figure 1H legend does not say which statistical test was made in the survival experiment in (h) - presumably log-rank? A further comment on the survival statistics: euthanised animals should not be counted towards true mortality when that is what is recorded as an 'event'. They should be right-censored. However, in this case, reaching the euthanasia criterion is just as good an indicator of health as mortality itself. So, simply by changing the Y axis from 'survival' to 'event-free survival' (or something to that effect), where 'events' are either death or reaching the euthanasia criterion, leaves the analysis as it is, and authors do not need to clarify that figure 1H shows "apparent mortality", as it is straightforward "complication-free survival" (just not entirely orthogonal to weight loss).
      4. Some density measurements are made unnecessarily on arbitrary units (per field of view) - this should be simple to report in absolute measures (i.e. area of tissue screened or, better still, length of epithelium screened).
      5. Figure 2E should read "percent involvement"
      6. Figure 2J should read "lipocalin..."
      7. In section "CMG2 Is Dispensable for YAP/TAZ-Mediated Reprogramming to Fetal-Like Stem Cells", the authors write ""We measured the mRNA levels of two additional YAP target genes, Cyr61 and CTGF...". I presume the "additional" is because Ly6a is also a target of YAP/TAZ, but if the reader does not know, it is puzzling. I would suggest to make this link explicit.
      8. In Figures S2, 3 and S3, I think that the measures expressed as "% of homeostatic X in WT" really mean "% of average homeostatic X in WT". This should be made clear somewhere.
      9. In panel C, the nature of the data is not entirely clear. First, the corresponding part of the legend says "Representative images of n=4 mice per genotype" which I presume should refer to panel B. Then, the graph plots 4 data points, which suggests that they correspond to 4 mice - but how many fields of view? Also, the violin plot outline is not described - I presume it captures all the data points from the coarse-grained pixel analysis, but it should be clarified.
      10. In Figure 3H and 3I, I would suggest to add the 7+3 timepoint where the data come from.
      11. In section "CMG2 Is Critical for Restoring the Lgr5+ Intestinal Stem Cell Pool", the authors say "...The mRNA levels of ... LRP6, β-catenin (Fig. S3a-b), and Wnt ligands (Wnt5a, 5b, and 2b) were comparable between the colons of Cmg2WT and Cmg2KO mice (Fig. S3c)..." without clarifying in which context - one needs to read the figure legend to realise this is "timepoint 7+3". I suggest to add "in the recovery phase" or "in regenerating colons" or something shorter, just to guide the reader.
      12. Like with the previous point, it is not clear when the immunohistofluorescence of B-catenin is made - not even in the legend, as far as I could see. The only hint is that authors say "the nuclei of cells in the atrophic crypts of Cmg2KO..." with 'atrophic' probably indicating again the 7+3 timepoint.
      13. A typo in the discussion: tunning for tuning.
      14. In the discussion, the authors talk about the 'CMG2' protein (all caps - formatting convention for human proteins) but before they were referring to 'Cmg2' (formatting convention for mouse proteins). That is fine but some of the statements where "CMG2" is used clearly refer to observations made in the mouse.
      15. Typos in methods: "antigen retrieval by treating [with] Proteinase K"; "Image acquisition and analyze [analysis]"; "All details regarding code[s] used for immunofluorescence analysis"

      Referees cross-commenting

      *this session contains comments from ALL the reviewers"

      Rev2

      Points 1 and 2 made by Referee 1 (and point 4 of Referee 3) appear most reasonable, and if not already done should be.

      I also noted the more severe morphology of DSS damaged epithelium shown in Fig 2a noted by Referee 3 - and this I agree is a confounding factor. But overall, multiple lines of evidence were assembled to show that the KO mice and WT mice suffered DSS-induced colitis with equal severity - and with closely equal severity of damage to the intestinal epithelium (though the image in Fig 2a is disturbing). For my part, the concern is understandable but likely not operating in a confounding way. And the evidence for the reprogramming of the damaged epithelium into "fetal-like stem cells" (the 1st step in restitution of lost stem cells) occurs in both WT and KO mice - and these data are strong. For this reader, the block convincingly shows up for KO mouse at the WNT dependent step

      Rev 3 This reviewer remains sceptical. I agree the authors performed the experiment well to confirm that DSS dosing was as equivalent as possible across the study. But DSS acts to induce colitis because it is concentrated in the colonic lumen as water is absorbed. Also ECM responses and remodelling are a central part of colitis models. And my concern is that the actual exposure in the KO group is influenced by transit of faeces/DSS is secondary to the known action of CMG2 on collagen deposition. The consequence of this being a protracted damage phase in which a restoration of adult stem cells would not be expected and leading to epithelial failure.

      However, we differ. I might propose that the authors are asked to investigate and confirm expression of CMG2 in the epithelium and to repeat the analysis of collagen levels they performed on untreated CMG2 KO mice on colons from CMG2 KO mice having received DSS to see if these differ from controls.

      Rev 1 Both reviewer #2 and reviewer #3 make relevant points, from the point of view of extracting as much biological knowledge as we can from the observations reported in the manuscript.

      Reviewer #2 suggestion to use Cmg2[KO] organoids to investigate the dependence of Wnt transduction on Cmg2 is the type of experiments I refrained to propose. However, I think the "skeleton" of the mechanism is there and is reasonably solid. Fleshing it out may well be another paper.

      I agree with Reviewer #3 objections to the timing and severity of the DSS damage. However, I am not sure how much they invalidate the main tenet of the paper:

      • DSS may affect Cmg2[KO] more severely, but the overall disease score is comparable during the DSS treatment. If this severity was enough to be the main driver of the phenotype, it should have left a mark in the Histological and Disease activity scores. In this regard, I think it would be helpful if the authors provided an expanded version of Figure 2A with examples of the different levels of "Crypt damage" scored, and the proportions for each. This could be in the supplementary material and would balance the impressions induced by a single image.

      • If DSS affected the recovery, this would also be compatible with having a more severe histological phenotype (which is not shown overall, just in Fig 2A) because one would also expect the tissue to attempt regeneration during the 7 days of DSS treatment.

      • The only objection that I find difficult to argue is the effective duration of the treatment. If indeed peristalsis is affected, it may be that during the 'recovery' phase there is still DSS in the intestine. This could be perhaps verified using a DS detection assay (e.g. https://arxiv.org/pdf/1703.08663) on the intestinal contents or the faeces of the mice during the 3-day recovery period.

      I think of what the aim of scholarly publication is, with this paper, and I find myself going back to a statement of the authors' discussion - that this work suggests that infants risking death may be offered (compassionate, I guess) IBD treatment. What does this hinge upon? I think, on the basic observation that diarrhoea (in the mouse model) is not intrinsic but caused by an inflammation-promoting insult. Is this substantiated? I think it is. Could we learn more biology from this disease model, about Wnt and about how ECM affects tissue regeneration? Certainly. Can this learning wait? I believe it can.

      Significance

      In this work, Bracq and colleagues provide clear evidence that the persistent diarrhoea seen in a mouse model of Hyaline Fibromatosis Syndrome is related to the inability of their intestinal epithelium to properly regenerate. This is very clear and of immediate impact. For instance, the authors themselves point at the possibility of applying treatments for Inflammatory Bowel Disease to HFS patients. While what happens in a mouse model is not necessarily the same as in human patients, the fact that persistent diarrhoea is a life-threatening symptom in HFS make this proposal, at least in compassionate use of the therapies and until its efficacy is disproven, very plausible. This is a clear gap of knowledge that addresses an unmet medical need.

      I find that the work shows clearly that HFS mouse model subjects have normal intestinal function until challenged with a standard chemically-induced colitis. Then, the histological and health deterioration of the HFS mouse model is clear in comparison with normal mice, which can regenerate appropriately. This is shown with a multiplicity of orthogonal techniques spanning molecular, histological and organismal, which are standard and very well reported in the paper.

      The authors propose a specific cellular and molecular mechanism to explain the incapacity of the intestinal epithelium in the mouse model of HFS to regenerate. According to this mechanism, the protein Cmg2, whose mutation causes HFS in humans, would be necessary for intestinal stem cells to transduce the signal of Wnt ligands and therefore support their behaviour as regenerative cells. This mechanism is plausible, but more basic and advanced work would be needed to take it as proven.

      This work would be of interest to both the clinical, biomedical, and basic research communities interested in rare diseases, the gastrointestinal system, collagen and extracellular matrix, and Wnt signalling.

      My general expertise is in developmental and stem cell biology using reverse genetics, transgenesis and immunohistological and molecular methods of data production, and lineage tracing, digital imaging and bioinformatic analytical methods; I work with Drosophila melanogaster and its adult gastrointestinal system.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript entitled " The hepatitis E virus capsid protein ORF2 counteracts cell-intrinsic antiviral responses to enable persistence in hepatocytes ", Ann-Kathrin Mehnert interrogated that HEV pORF2 can inhibit host antiviral response. They found interaction of HEV ORF2 and TBK1. The finding is interesting and echoed with some previous studies that ORF2 can inhibit innate immunity.

      The study could benefit from the consideration of some major and specific points, as indicated below:

      Major issues:

      1. The researchers used p6, a cell-adapted clone, which was isolated form a chronic HEV patient. As previous studies suggested, p6 may behave differently than wild-type strains. Did the authors tried other HEV strains, as they used ips-induced model that was reported supportive to wild-type HEV?
      2. Figure 1F, ORF2 can interact with TBK1 as showed. But the prediction from Alphafold is weak. Also, could the author more evidence than the co-IP?
      3. Figure 2C and 2D, at 5 dpi, one can observed a stronger antiviral response, but at 7 dpi, no obvious difference was observed. Could the authors comment on this? 4.Figure 2H and 2I, detailed description of how the authors measured the positive cells should be provided. Did the authors selected whole plate of cells for counting? As showed in Figure 2H, the signals of IF were stronger at 5 and 7 dpi when compared at 3 dpi, but why the proportion of positive cells was reduced in Figure 2I?
      4. The study emphasized the function of ORF2 on HEV "persistence". However, this cannot be fully supported by cell models. In future, study on chronic HEV infection animal models may be conducted.
      5. The authors study ORF2 in whole. It will be of benefit to the readers that the authors could specified the function of secreted ORF2 and ORF2 capsid in the current study.

      Minor issues:

      1. Figure 3A, this is an elegant design. More data may provide for the validation of the formation of the virions.
      2. Figure 1, data should be provided for the successful expression of HEV-1 or HEV-3 ORF2, and ORF3.
      3. line 219, the current evidence that supported this statement is weak, especially for ORF2.
      4. Suppl Figure 3F-3H, statistical analysis is needed
      5. Suppl Figure 3F-3H, it seems that when no treatment was admistrated, the level of ISG15 in ΔORF2 group was higher than those of the WT and ΔORF3 group. Could the authors comment?
      6. Figure 3D and 3E, the starting time of the detection is not aligned.
      7. Figure 3F, scale bar is missing.
      8. In M&M, statistical method should be provided with more details and cover all the experiments used.

      Significance

      In the manuscript entitled " The hepatitis E virus capsid protein ORF2 counteracts cell-intrinsic antiviral responses to enable persistence in hepatocytes ", Ann-Kathrin Mehnert interrogated that HEV pORF2 can inhibit host antiviral response. They found interaction of HEV ORF2 and TBK1. The finding is interesting and echoed with some previous studies that ORF2 can inhibit innate immunity.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: Authors described the protective mechanism mediated by ORF2 that protects viral replication from the antiviral responses. They have utilized the advanced single-cell RNA sequencing to decipher the dampened antiviral responses in the presence of ORF2 HEV. I believe the study is important for the HEV literature and believe that the manuscript can be considered for publication after authors (1) rewrites the results and discussions separately until the journal wants it to be together. (2) answer the below questions.

      Minor comments:

      Line 69, 71 - I have never seen in any paper including reference in this way!

      Line 72 and 73 - missing reference!

      Line 92, 93 - missing reference!

      Line 95 to 99 - missing references!

      Major comments:

      I would like the authors to answer few questions: 1. Did the authors study only the P6 HEV genome? Have they done anything comparative with the other strain to understand if the proposed mechanism is not the strain specific? 2. Can the authors explain why we do not see any band in the Fig. 1F B-actin?

      Significance

      The paper uses advanced technique as single cell RNA seq to understand the mechanism of ORF2 assisting in the HEV replication.

      The study is well designed.

      This study will add up to understand some of the persistence infection seen in solid organ transplant patients. This study gives a mechanistic overview of HEV avoidance of antiviral response.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors investigated how HEV ORF2 interferes with host antiviral responses to sustain viral infection. They employed several models, including pattern recognition receptors (PRRs) KO cell lines, immunodeficient cells and stem cell-derived models, to prove that: 1) ORF2 is essential for viral replication and 2) ORF2 dampens the interferon and inflammatory signaling pathways. They confirmed the interaction between ORF2 and TBK1, a central mediator of innate immune responses and identified residues in ORF2 that affect its interaction with TBK1. Finally, through single cell RNAseq, they demonstrated that ORF2 is a viral antagonist that inhibits host ISG expression in both infected and bystander cells. Interestingly, the sets of genes that are upregulated in WT vs ORF2-deficient virus infected cells are not entirely identical, suggesting that ORF2 may also modulate host gene expression in addition to suppressing the immune response. This research confers new immune antagonism mechanisms mediated by HEV capsid for sustainable HEV replication in host cells and provides potential therapeutic targets for HEV treatments.

      Major comments:

      1. The authors conclude from Figure 1 that the HEV ORF2 protein antagonizes both antiviral and inflammatory signaling pathways. The authors comprehensively investigated PRRs-mediated activation of type I interferon by viruses or poly(I:C) through overexpression of MDA5, RIG-I and TLR3. However, they only investigated the impact of ORF2 on host inflammatory response through evaluating the levels of TNFAIP3 RNA in the presence of MDA5 overexpression. It would be informative if the authors also check for NFkB activation/phosphorylation and expression of classical pro-inflammatory cytokines such as IL-1b and IL6. Interestingly, changes in IFNB secretion after ORF2 overexpression appear more dramatic compared to changes in IFNB1 RNA levels (compare Figure 1A-C with Supplementary Figure 1A and C). Are the IFN-beta protein expression changes statistically significant in Supplementary Figure 1?
      2. Changes in the IFN response do not always translate into changes in the viral RNA levels. In Figure 2B-D, the authors attributed the higher induction of IFNL1 and ISG15 on day 5 to the absence of ORF2 inhibition. However, the expression of these two genes drops to the same levels as the ones in WT viral RNA-electroporated cells on day 7, which is strange as ORF2-deficient viral RNA levels continue to be inhibited on day 7. This is different from the stem cell derived hepatocytes infected with the trans-complementation viruses in Figure 3G-H where there are significant differences in ISG15 levels between WT and ORF2-deficient virus infected cells on both days 5 and 7. To support their hypothesis, the authors need to further confirm the sudden upregulated antiviral activity on day 5 in electroporated HepG2/C3A cells by testing JAK/STAT phosphorylation and type I interferon secretion.
      3. The authors used different hepatocyte systems coupled with viral RNA electroporation or trans-complementation virus infection to investigate ORF2-mediated interference of the IFN pathway, which is highly complementary. However, while the electroporation of viral RNA into HepG2/C3A (Figure 2B-D) and infection of stem cell-derived hepatocytes with trans-complementation viruses (Figure 3F-H) result in similar upregulation of ISG expression on day 5, that wasn't observed in HepG2/C3A cells infected with trans-complementation viruses (Figure 3C-E) on day 5. The authors need to discuss the discrepancy among these different systems. Since the ORF2-deficient trans-complementation virus still brings in ORF2 proteins from the producer cells but cannot generate new ORF2 proteins, do ORF2 proteins from these two different sources have different functions in different hepatocyte systems? In addition, other than the data points that are shown to be not significantly different in Figure 3D-E, are any of the other data points significantly different?
      4. The single cell RNAseq data are very informative and revealed two interesting groups of genes. First, the ISGs that are further induced in the cells infected with ORF2-deficient HEV compared to cells infected with WT HEV (Figure 4N) are likely suppressed by ORF2. Second, the ISGs that are uniquely induced in the absence of ORF2 are different from the genes that are uniquely induced by WT HEV (Supplementary Table 2), suggesting that ORF2 may also modulate host gene expression. The authors can further characterize these two groups of ISGs by performing gene knockdown or knockout and investigating whether ORF2 directly interacts with these ISG products to determine the functional consequences of their upregulation. Related to that, are there other gene expression changes beyond ISG signatures which would suggest that ORF2 can regulate host gene expression? Figure 4A-C only shows comparisons for WT or ORF2-deficient vs. uninfected cells. The authors can perform GO and KEGG analyses to see if certain biological processes/pathways are enriched among the WT vs ORF2-deficient HEV induced genes. Further characterization of these genes (ISGs or not) would shed light on the novel roles of ORF2 in both immune antagonism and gene regulation and greatly increase the significance of the study.
      5. In Supplementary Figure 3F-H, the authors used BX795 to inhibit TBK1 (a target of ORF2) and found decreases in IFNL1 and ISG15 expression whether cells are electroporated with WT, ORF2-deficient, or ORF3-deficient viral RNA. However, this does not correlate with the data in Figure 2E-G where TBK1 inhibition results in significant differences in viral RNA levels only in the absence of ORF2 or ORF3. These results would suggest that the effects of TBK1 inhibition on viral RNA levels is independent of changes in the IFN/ISG expression levels.

      Significance

      The study addresses a long-standing question in the field about the immune antagonism activities of HEV ORF2 and ORF3 which previous studies have conflicting results on. The strength of this study is the use of complementary approaches such as ORF2 trans complementation system and single cell sequencing, and more relevant models such as stem cell derived hepatocytes to rigorously dissect the role of newly synthesized ORF2 protein in immunocompetent cell context. The manuscript is well written and would appeal to researchers in the HEV and innate immunity fields. However, the significance of the study is dampened by changes in the IFN response not always correlate with the inconsistency of ORF2-mediated inhibitory effects in different models and the still poorly defined mechanism of ORF2 suppression of the IFN pathway. The study would make conceptual advance if the authors can address the discrepancies in their findings and perform additional characterization to determine the functional consequences of ORF2-mediated immune suppression and gene regulation.

      My expertise is in innate immunity and host-virus interactions.