
May 2025
www.biorxiv.org

Rapid and Inducible Mislocalization of Endogenous TDP43 in a Novel Human Model of Amyotrophic Lateral Sclerosis

1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 TDP-43 mislocalization is a key feature of some neurodegenerative diseases, but cellular models are lacking. The authors endogenously-tagged TDP-43 with a C-terminal GFP tag in human iPSCs, followed by expression of an intrabody-NES that targeted GFP to the cytosol. They convincingly report physical mislocalization and functional depletion of TDP-43, as measured by microscopy and RNAseq. This method will be valuable to investigators studying the biological consequences of TDP-43 mislocalization and the methodology is in line with the current state-of-the-art.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 TDP-43 mislocalization occurs in nearly all ALS cases, roughly half of FTD cases, and as a co-pathology in roughly half of AD cases. Both gain-of-function and loss-of-function mechanisms associated with this mislocalization likely contribute to disease pathogenesis.
 
 Here, the authors describe a new method to induce TDP-43 mislocalization in cellular models. They endogenously-tagged TDP-43 with a C-terminal GFP tag in human iPSCs. They then expressed an intrabody - fused with a nuclear export signal (NES) - that targeted GFP to the cytosol. Expression of this intrabody-NES in human iPSC derived neurons induced nuclear depletion of homozygous TDP-43-GFP, caused its mislocalization to the cytosol, and at least in some cells appeared to cause cytosolic aggregates. This mislocalization was accompanied by induction of cryptic exons in well characterized transcripts known to be regulated by TDP-43, a hallmark of functional TDP-43 loss and consistent with pathological nuclear TDP-43 depletion. Interestingly, in heterozygous TDP-43-GFP neurons, expression of intrabody-NES appeared to also induce the mislocalization of untagged TDP-43 in roughly half of the neurons, suggesting that this system can also be used to study effects on untagged endogenous TDP-43 as well as TDP-43-GFP fusion protein.
 
 Strengths:
 
 A clearer understanding of how TDP-43 mislocalization alters cellular function, as well as pathways that mitigate clearance of TDP-43 aggregates, is critical. But modeling TDP-43 mislocalization in disease-relevant cellular systems has proven to be challenging. High levels of overexpression of TDP-43 lacking an NES can drive endogenous TDP-43 mislocalization, but such overexpression has direct and artificial consequences on certain cellular features (e.g. altered exon skipping) not seen in diseased patients. Toxic small molecules such as MG132 and arsenite can induce TDP-43 mislocalization, but co-induce myriad additional cellular dysfunctions unrelated to TDP-43 or ALS. TDP-43 binding oligonucleotides can cause cytosolic mislocalization as well. Each system has pros and cons, and additional ways to induce TDP-43 mislocalization would be useful for the field. The method described in this manuscript could provide researchers with a powerful way to study the combined biology of cytosolic TDP-43 mislocalization and nuclear TDP-43 depletion, with additional temporal control that is lacking in current methods. Indeed, the authors see some evidence of differences in RNA splicing caused by pure TDP-43 depletion versus their induced mislocalization model. Finally, their method may be especially useful in determining how TDP-43 aggregates are cleared by cells, potentially revealing new biological pathways that could be therapeutically targeted.
 
 Weaknesses:
 
 The method and supporting data have some limitations.
 
 • Tagging of TDP-43 with a bulky GFP tag may alter its normal physiological functions, for example, phase separation properties and functions within complex ribonucleoprotein complexes. The authors show that normal splicing function of GFP-TDP-43 is maintained, suggesting that physiology is largely preserved, but other functions and properties of TDP-43 that were not directly tested could be altered.
 
 • Differences in splicing and microRNAs between TDP-43 knockdown and TDP-43 mislocalization are potentially interesting. However, different patterns of dysregulated RNA splicing can occur at different levels of TDP-43 knockdown and can differ between batches of experiments, so it is difficult to assess whether the changes observed in this paper are due to mislocalization per se, or rather just reflect differences in nuclear TDP-43 abundance or batch effects.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the previous reviews
 
 Public Reviews:
 
 Reviewer #1 (Public Review):
 
 Summary:
 
 Nuclear depletion and cytoplasmic mislocalization/aggregation of the DNA and RNA binding protein TDP-43 are pathological hallmarks of multiple neurodegenerative diseases. Prior work has demonstrated that depletion of TDP-43 from the nucleus leads to alterations in transcription and splicing. Conversely, cytoplasmic mislocalization/aggregation can contribute to toxicity by impairing mRNA transport and translation as well as miRNA dysregulation. However, to date, models of TDP-43 proteinopathy rely on artificial knockdown- or overexpression-based systems to evaluate either nuclear loss or cytoplasmic gain of function events independently. Few model systems authentically reproduce both nuclear depletion and cytoplasmic mislocalization/aggregation events. In this manuscript, the authors generate novel iPSC-based reagents to manipulate the localization of endogenous TDP-43. This is a valuable resource for the field to study pathological consequences of TDP-43 proteinopathy in a more endogenous and authentic setting. However, in the current manuscript, there are a number of weaknesses that should be addressed to further validate the ability of this model to replicate human disease pathology and demonstrate utility for future studies.
 
 Strengths:
 
 The primary strength of this paper is the development of a novel in vitro tool.
 
 Weaknesses:
 
 There are a number of weaknesses detailed below that should be addressed to thoroughly validate these new reagents as more authentic models of TDP-43 proteinopathy and demonstrate their utility for future investigations.
 
 (1) The authors should include images of their engineered TDP-43-GFP iPSC line to demonstrate TDP-43 localization without the addition of any nanobodies (perhaps immediately prior to addition of nanobodies). Additionally, it is unclear whether simply adding a GFP tag to endogenous TDP-43 impacts its normal function (nuclear-cytoplasmic shuttling, regulation of transcription and splicing, mRNA transport, etc.).
 
 We have included images of the untransduced day 20 MNs derived from the engineered TDP43-GFP iPSC lines and the unedited line (Supplementary Fig. 1B).
 
 We acknowledge the reviewer’s concern about the potential impact of the GFP tag on TDP43's normal function. To address this, we have validated the functionality of TDP43 by assessing the inclusion of cryptic exons in highly sensitive targets such as UNC13A and STMN2, both of which are known to be directly regulated by TDP43.
 
 We compared MNs derived from the unedited parent line with the TDP43-GFP MNs prior to nanobody addition. As measured by qPCR, cryptic exon inclusion in UNC13A and STMN2 was not observed in the unedited or edited TDP43-GFP MNs (Supplementary Fig. 1C), confirming that the tagging does not induce splicing defects by itself. Cryptic exon inclusion in UNC13A and STMN2 was only observed in TDP43-GFP MNs expressing the NES nanobody (Supplementary Fig. 2D). These findings were further supported by our next-generation sequencing data, which also showed that cryptic exon inclusion was specific to the TDP43 mislocalization condition (Supplementary Figs. 3 and 4).
 
 Thus, we have strong evidence that the GFP-tagged TDP43 behaves similarly to the wild-type protein and does not interfere with its function in our model.
 
 (2) Can the authors explain why there is a significant discrepancy in time points selected for nanobody transduction and immunostaining or cell lysis throughout Figure 1 and 2? This makes interpretation and overall assessment of the model challenging.
 
 For the phenotypic data shown in Fig. 1, we added the AAVs at day 18 or 20 and analyzed the cells at day 40. For the phosphorylated TDP43 western blot (revised Fig. 3D), cells were treated with doxycycline at day 20 to induce nanobody expression and samples were harvested at day 40. Thus, cells were harvested 20 or 22 days after adding the nanobodies. Transgene expression from AAVs in neurons typically has a slow onset. We observed TDP43 mislocalization in fewer than 50% of the neurons 7 days post-transduction; mislocalization peaked 10-12 days after addition of the nanobodies, when more than 80% of the cells displayed it. Hence, we do not believe that a two-day difference significantly alters the interpretation of the data.
 
 The decision to harvest neurons at day 30 for the qPCR data was taken to investigate whether the splicing changes seen at day 40 from the transcriptomics analysis can be detected well before the phenotypes observed at day 40.
 
 (3) The authors should further characterize their TDP-43 puncta. TDP-43 immunostaining is typically punctate so it is unclear if the puncta observed are physiologic or pathologic based on the analyses carried out in the current version of this manuscript. Additionally, do these puncta co-localize with stress granule markers or RNA transport granule markers? Are these puncta phosphorylated (which may be more reminiscent of end-stage pathologic observations in humans)?
 
 We tried immunostaining neurons for phosphorylated TDP43, but our attempts were unsuccessful. Depending on the antibody, we either saw no signal (antibody from Cosmo Bio, TIP-PTD-M01A) or even the control neurons displayed detectable phosphorylation within the nucleus (antibody from Proteintech, 22309-1-AP). Consequently, we performed western blot analysis using the Cosmo Bio antibody (TIP-PTD-M01A), which clearly shows hyperphosphorylation of TDP43 in whole cell lysates (Fig. 3D, E). Hence, we have referred to these structures as puncta and not aggregates (Page 4).
 
 To assess co-localization of the puncta with stress granules, we immunostained for the stress granule marker G3BP1. This was done in MNs that were treated with sodium arsenite (SA) or PBS as a control. In the PBS treated control MN cultures, TDP43 mislocalization alone did not induce stress granule formation. G3BP1+ stress granules were only observed following SA stress (0.5 mM, 60 minutes). Further, only a subset of TDP43 puncta overlapped with these stress granules (Supplementary Fig. 7) (Page 6).
 
 (4) The authors should include multiple time points in their evaluation of TDP-43 loss of function events and aggregation. Does loss of function get worse over time? Is there a time course by which RNA misprocessing events emerge or does everything happen all at once? Does aggregation get worse over time? Do these neurons die at any point as a result of TDP-43 proteinopathy?
 
 We agree that a time course to analyze TDP43 mislocalization and its consequences would be ideal. However, the mislocalization of TDP43 across neurons is not a coordinated process. At any given time, neurons display varying levels of TDP43 mislocalization. Answering the questions raised by the reviewer would require tracking individual neurons in real time in a controlled environment over weeks. Unfortunately, we currently do not have the hardware to run these experiments. However, we do observe increased levels of cleaved caspase 3 in MNs expressing the NES nanobody, indicating that these neurons indeed undergo apoptosis by day 40 (Fig. 1).
 
 We have, however, analyzed changes in splicing using qPCR for 12 genes over a time course starting as early as 4 hours after inducing mislocalization. We detect time-dependent cryptic splicing events in all genes as early as 8 hours after doxycycline addition, coinciding with the appearance of TDP43 mislocalization (Fig. 4A, B).
 
 (5) Can the authors please comment on whether or not their model is "tunable"? In real human disease, not every neuron displays complete nuclear depletion of TDP-43. Instead there is often a gradient of neurons with differing magnitudes of nuclear TDP-43 loss. Additionally, very few neurons (5-10%) harbor cytoplasmic TDP-43 aggregates at end-stage disease. These are all important considerations when developing a novel authentic and endogenous model of TDP-43 proteinopathy which the current manuscript fails to address.
 
 As shown in Fig. 1, the neurons expressing the NES-nanobody display a wide range of mislocalization, as assessed by the % of nuclear TDP43 present. By titrating the amount of AAVs added to the culture, the model can be tuned to achieve a wide gradient of TDP43 mislocalization.
 
 We calculated the size and percentage of neurons displaying TDP43 puncta. The size and the number of puncta vary across the neurons that display TDP43 mislocalization. Around 50% of the neurons displayed small (1 µm²) puncta while large puncta (> 5 µm²) were observed in <10% of the cells, similar to observations in patient tissue (Fig. 1F).
 
 Reviewer #2 (Public Review):
 
 Summary:
 
 TDP-43 mislocalization occurs in nearly all ALS cases, roughly half of FTD cases, and as a co-pathology in roughly half of AD cases. Both gain-of-function and loss-of-function mechanisms associated with this mislocalization likely contribute to disease pathogenesis.
 
 Here, the authors describe a new method to induce TDP-43 mislocalization in cellular models. They endogenously tagged TDP-43 with a C-terminal GFP tag in human iPSCs. They then expressed an intrabody - fused with a nuclear export signal (NES) - that targeted GFP to the cytosol. Expression of this intrabody-NES in human iPSC-derived neurons induced nuclear depletion of homozygous TDP-43-GFP, caused its mislocalization to the cytosol, and at least in some cells appeared to cause cytosolic aggregates. This mislocalization was accompanied by induction of cryptic exons in well characterized transcripts known to be regulated by TDP-43, a hallmark of functional TDP-43 loss and consistent with pathological nuclear TDP-43 depletion. Interestingly, in heterozygous TDP-43-GFP neurons, expression of intrabody-NES appeared to also induce the mislocalization of untagged TDP-43 in roughly half of the neurons, suggesting that this system can also be used to study effects on untagged endogenous TDP-43 as well as TDP-43-GFP fusion protein.
 
 Strengths:
 
 A clearer understanding of how TDP-43 mislocalization alters cellular function, as well as pathways that mitigate clearance of TDP-43 aggregates, is critical. But modeling TDP-43 mislocalization in disease-relevant cellular systems has proven to be challenging. High levels of overexpression of TDP-43 lacking an NES can drive endogenous TDP-43 mislocalization, but such overexpression has direct and artificial consequences on certain cellular features (e.g. altered exon skipping) not seen in diseased patients. Toxic small molecules such as MG132 and arsenite can induce TDP-43 mislocalization, but co-induce myriad additional cellular dysfunctions unrelated to TDP-43 or ALS. TDP-43 binding oligonucleotides can cause cytosolic mislocalization as well. Each system has pros and cons, and additional ways to induce TDP-43 mislocalization would be useful for the field. The method described in this manuscript could provide researchers with a powerful way to study the combined biology of cytosolic TDP-43 mislocalization and nuclear TDP-43 depletion, with additional temporal control that is lacking in current methods. Indeed, the authors see some evidence of differences in RNA splicing caused by pure TDP-43 depletion versus their induced mislocalization model. Finally, their method may be especially useful in determining how TDP-43 aggregates are cleared by cells, potentially revealing new biological pathways that could be therapeutically targeted.
 
 Weaknesses:
 
 The method and supporting data have limitations, outlined below, and in their current form the findings are rather preliminary.
 
 (1) Tagging of TDP-43 with a bulky GFP tag may alter its normal physiological functions, for example phase separation properties and functions within complex ribonucleoprotein complexes. In addition, alternative isoforms of TDP-43 (e.g. "short" TDP-43) would not be GFP tagged, and therefore these species could not be directly manipulated or visualized with the tools currently employed in the manuscript.
 
 With reference to our answer above, we have confirmed using qPCR and RNA-seq analysis that adding a GFP tag to the C-terminus of TDP43 does not result in an appreciable loss of functionality. We do not observe any cryptic exon inclusion in STMN2 and UNC13A. Cryptic exon inclusion in these genes, especially STMN2, has been recognized as a very sensitive indicator of TDP43 loss of function (Supplementary Fig. 1C, Supplementary Fig. 2D, Fig. 3, Fig. 4).
 
 We acknowledge that truncated, alternatively spliced versions of TDP43 cannot be manipulated with our system: because the GFP tag is positioned at the C-terminus, it is lost in these isoforms. These isoforms, if present, should nevertheless be detectable using the Proteintech antibody against total TDP43, which recognizes N-terminal TDP43 epitopes. However, western blot analysis, even 20 days after inducing TDP43 mislocalization, showed no truncated fragments. This suggests that TDP43 mislocalization alone is insufficient to generate significant levels of truncated isoforms. We have added this section to the Limitations paragraph (page 9).
 
 (2) The data regarding potential mislocalization of endogenous TDP-43 in the heterozygous TDP-43-GFP lines is especially intriguing and important, yet very little characterization was done. Does untagged TDP-43 co-aggregate with the tagged TDP-43? Is localization of TDP-43 immunostaining the same as the GFP signal in these cells?
 
 The purpose of the heterozygous experiments was to see whether mislocalized TDP43 could potentially trap the untagged TDP43. If this was not the case, we would have seen a maximum of 50% of the TDP43 signal mislocalized to the cytoplasm. The fact that a sizeable proportion of cells had significantly higher levels of TDP43 loss from the nucleus, indicates that mislocalized TDP43 can indeed trap the untagged protein fraction. We used GFP immunostaining to identify the tagged TDP43 while an antibody against the endogenous TDP43 protein was used to detect total TDP43 levels. In the cells that show near complete loss of nuclear TDP43, the total TDP43 signal coincides with the GFP (tagged TDP43) signal. We are unable to distinguish the untagged fraction selectively as we do not have an antibody that can detect this directly.
 
 But we agree with the reviewer that these observations need further detailed follow-up that we are unable to provide currently. Hence, we have removed this figure from the manuscript.
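 
 The 50% ceiling argument in this response can be written as simple bookkeeping. The sketch below is illustrative only: the function name, the assumption that the tagged and untagged alleles contribute equal amounts of protein (a tagged share of 0.5), and the example fractions are hypothetical and do not come from the manuscript.

```python
def nuclear_fraction(tagged_share: float,
                     tagged_mislocalized: float,
                     untagged_mislocalized: float = 0.0) -> float:
    """Fraction of total TDP43 that remains in the nucleus.

    tagged_share          -- share of the TDP43 pool carrying the GFP tag
    tagged_mislocalized   -- fraction of tagged protein moved to the cytoplasm
    untagged_mislocalized -- fraction of untagged protein moved to the cytoplasm
    """
    for value in (tagged_share, tagged_mislocalized, untagged_mislocalized):
        if not 0.0 <= value <= 1.0:
            raise ValueError("all inputs must be fractions between 0 and 1")
    cytoplasmic = (tagged_share * tagged_mislocalized
                   + (1.0 - tagged_share) * untagged_mislocalized)
    return 1.0 - cytoplasmic


# Assumed heterozygous line: half of the TDP43 pool carries the GFP tag.
# Case 1: every tagged molecule is exported, untagged protein stays nuclear.
floor_without_trapping = nuclear_fraction(0.5, 1.0)
# Case 2: hypothetical 60% of the untagged pool is trapped as well.
with_trapping = nuclear_fraction(0.5, 1.0, 0.6)
```

 Under these assumptions, nuclear TDP43 cannot fall below half of the total unless untagged protein also leaves the nucleus, which is the inference drawn from cells showing near-complete nuclear loss.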
 
 (3) The experiments in which dox was used to induce the nanobody-NES, then withdrawn to study potential longer-lasting or self-perpetuating induction of aggregation, are potentially interesting. However, the nanobody was only measured at the RNA level. We know that protein half-lives can be very long in neurons, and therefore residual nanobody could be present at these delayed time points. The key measurement to make would be at the protein level of the nanobody if any conclusions are to be made from this experiment.
 
 The reviewer has highlighted an important point. To address this issue, we tagged the nanobodies with a V5 tag that allowed us to directly measure nanobody levels within cells. We indeed observed significant expression of the nanobody within cells even two weeks after Dox withdrawal. Extending the time point to three weeks allowed complete loss of the nanobody in most neurons. In contrast to our observations at two weeks, this loss was accompanied by a reversal of TDP43 mislocalization in these neurons at three weeks (Fig. 5).
 
 Surprisingly, in less than 10% of the neurons, we observed >80% of the total TDP43 still mislocalized to the cytoplasm, despite nearly undetectable levels of the nanobody. Super-resolution microscopy further revealed persistent cytoplasmic TDP43 in these neurons that did not overlap with residual nanobody signal. This suggests that in these neurons, the nanobody was no longer required to maintain TDP43 mislocalization (Fig. 5, page 7).
 
 (4) Differences in splicing and microRNAs between TDP-43 knockdown and TDP-43 mislocalization are potentially interesting. However, different patterns of dysregulated RNA splicing can occur at different levels of TDP-43 knockdown, thus it is difficult to assess whether the changes observed in this paper are due to mislocalization per se, or rather just reflect differences in nuclear TDP-43 abundance.
 
 This is a fair point. It is possible that microRNA dysregulation might require a greater loss of nuclear TDP43 and may be more resilient to TDP43 loss as compared to splicing. We have acknowledged this in the discussion section (page 9).
 
 Recommendations for the authors:
 
 Reviewer #1 (Recommendations For The Authors):
 
 (1) It would be helpful to include nuclear vs cytoplasmic ratios of TDP-43 instead of simply "% nuclear TDP-43"
 
 We have used % nuclear TDP43 as these values have biologically meaningful upper and lower bounds, which makes it easier to compare across experiments. We found that using a ratio of nuclear vs cytoplasmic TDP43 intensities displayed higher variability and a wider range.
 
 We have re-labelled the y-axis as “% Nuclear TDP43 / soma TDP43” to make our quantification clearer. The conversion from % nuclear TDP43 to N/C is straightforward. If the % nuclear TDP43 is X, then the N/C ratio can be calculated as X / (100-X). For example, a % nuclear TDP43 of 80% would amount to an N/C ratio of 80/20 = 4.
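 
 The conversion stated above, and its inverse, can be sketched in a few lines. The function names are hypothetical, and returning infinity when no cytoplasmic signal is present is an assumption made here for illustration, not something specified in the response.

```python
import math


def percent_nuclear_to_nc_ratio(percent_nuclear: float) -> float:
    """Convert % nuclear TDP43 (nuclear signal / whole-soma signal) to N/C."""
    if not 0.0 <= percent_nuclear <= 100.0:
        raise ValueError("percent_nuclear must lie between 0 and 100")
    if percent_nuclear == 100.0:
        # Assumption: with no cytoplasmic signal the ratio is unbounded.
        return math.inf
    return percent_nuclear / (100.0 - percent_nuclear)


def nc_ratio_to_percent_nuclear(nc_ratio: float) -> float:
    """Inverse conversion: N/C ratio back to % nuclear TDP43."""
    if nc_ratio < 0.0:
        raise ValueError("nc_ratio must be non-negative")
    if math.isinf(nc_ratio):
        return 100.0
    return 100.0 * nc_ratio / (1.0 + nc_ratio)


# Worked example from the response: 80% nuclear corresponds to 80/20.
example_ratio = percent_nuclear_to_nc_ratio(80.0)
```

 Because the ratio grows without bound as the nuclear percentage approaches 100, small measurement differences near fully nuclear cells translate into large differences in N/C, which is consistent with the wider range the authors report for the ratio.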
 
 (2) The axis descriptions in Figure 1D are very unclear. While this is described better in the figure legend, it would be beneficial to have a more descriptive y-axis title in the figure (which may mean increasing the number of graphs).
 
 Axis descriptions and figures changed as recommended.
 
 (3) In Figure 1, the time points at which iPSNs were transduced with nanobody and/or fixed for immunostaining are somewhat inconsistent across panels. This hinders interpretation of the figure as a whole. The authors should use the same transduction and immunostaining time points for consistency, or demonstrate that the same phenotype is observed regardless of transduction and immunostaining day as long as the time in between (time of nanobody expression) is consistent. Subsequently, in Figure 2, a different set of time points is used.
 
 Please see our response in the public comments above
 
 (4) In Figure 1, please show individual data points for each independent differentiation to demonstrate the level of reproducibility from batch to batch.
 
 Data points have been shown per replicate (Supplementary Fig. 2)
 
 We have refined our approach for phenotypic analysis to improve consistency across different clones. Previously, we set thresholds on % nuclear TDP43 to distinguish MNs with nuclear versus mislocalized TDP43. This was done by ranking all cells based on % nuclear TDP43 and applying quantile-based thresholds, designating the top 25% as control and the bottom 25% as mislocalized, which ensured an equal number of cells per category. However, we observed significant variability in thresholds across clones. For instance, the E8 clone had thresholds of 96% and 29%, while the E5 clone had 93% and 40%.
 
 To address this, we reanalysed the data using a standardized three-bin approach:
 
 (1) Control: MNs expressing the control nanobody.
 
 (2) Low-Moderate Mislocalization: MNs expressing the NES nanobody with > 40% nuclear TDP43.
 
 (3) Severe Mislocalization: MNs expressing the NES nanobody with < 40% nuclear TDP43.
 
 This approach ensured a more reliable comparison of TDP43 mislocalization effects across experiments. The conclusions remain the same.
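 
 The difference between the earlier quantile-based thresholds and the standardized three-bin scheme can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function names and labels are hypothetical, and assigning a cell with exactly 40% nuclear TDP43 to the low-moderate bin is an assumption, since the response only specifies > 40% and < 40%.

```python
from statistics import quantiles

SEVERE_CUTOFF = 40.0  # % nuclear TDP43, the fixed cutoff given in the response


def quartile_thresholds(percent_nuclear_values):
    """Earlier scheme: data-dependent cutoffs at the bottom and top 25%."""
    lower, _median, upper = quantiles(percent_nuclear_values, n=4)
    return lower, upper


def classify_neuron(nanobody: str, percent_nuclear: float) -> str:
    """Revised scheme: bin by nanobody type, then by a fixed 40% cutoff."""
    if nanobody == "control":
        return "control"
    if nanobody != "NES":
        raise ValueError("nanobody must be 'control' or 'NES'")
    if percent_nuclear < SEVERE_CUTOFF:
        return "severe"
    # Assumption: exactly 40% is treated as low-moderate.
    return "low_moderate"


# Hypothetical per-cell measurements for one clone.
cells = [("control", 95.0), ("NES", 72.0), ("NES", 38.0), ("NES", 12.0)]
labels = [classify_neuron(nanobody, value) for nanobody, value in cells]
```

 The quartile cutoffs move with each clone's distribution, which is how values such as 96%/29% and 93%/40% can arise for different clones, whereas the fixed cutoff applies the same definition of severe mislocalization to every experiment.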
 
 (5) In Figure 2, please show individual data points.
 
 Data points for all the qPCR analyses in the paper have been included as a supplementary text file.
 
 (6) In Figure 3, please show individual data points.
 
 Data points for the western blot data have been included as a supplementary data file.
 
 All other comments are within the public review.
 
 Reviewer #2 (Recommendations For The Authors):
 
 (1) In general, more robust quantification of many of the described phenotypes is necessary. In particular, no apparent quantification of cytosolic mislocalization was performed in Figure 1, or quantification of mislocalization in Figure 3F. It is unclear in the western blot in Fig 1G if TDP-43 signal was normalized to total protein, and of note it seems that expression of the intrabody-NES reduced total proteins in the western blots that were shown. No quantification or measurement of the insoluble material was done or shown.
 
 We have quantified cytosolic mislocalization of TDP43 (Fig. 1C). The y-axis indicates the total TDP43 signal observed in the nucleus as a percentage of the total signal observed in the soma (including the nucleus). This value has the advantage of ranging from 100% (perfectly nuclear) to 0% (complete nuclear loss). The boxplots indicate that expression of the NES-nanobody results in a range of cytosolic mislocalization, with a median value of around 40% of the TDP43 remaining in the nucleus.
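 
 The metric described above is a bounded ratio of two per-cell measurements. A minimal sketch, assuming the inputs are integrated TDP43 intensities for the nucleus and for the whole soma (nucleus included); the function name and the example numbers are hypothetical and do not reproduce the authors' image-analysis pipeline.

```python
from statistics import median


def percent_nuclear_tdp43(nuclear_signal: float, soma_signal: float) -> float:
    """Nuclear TDP43 signal as a percentage of whole-soma signal."""
    if soma_signal <= 0.0:
        raise ValueError("soma_signal must be positive")
    if not 0.0 <= nuclear_signal <= soma_signal:
        # The soma mask includes the nucleus, so nuclear signal cannot exceed it.
        raise ValueError("nuclear_signal must lie between 0 and soma_signal")
    return 100.0 * nuclear_signal / soma_signal


# Hypothetical (nuclear, soma) intensity pairs for three cells.
measurements = [(90.0, 100.0), (40.0, 100.0), (10.0, 100.0)]
per_cell = [percent_nuclear_tdp43(n, s) for n, s in measurements]
summary = median(per_cell)
```

 Because the soma signal includes the nucleus, the value is confined to the 0-100% range, which is what makes it comparable across experiments.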
 
 Western blot data in previous Fig. 1G was normalized to alpha-tubulin. We were unable to get a good signal for the insoluble fraction. From the alpha-tubulin alone, it cannot be concluded that NES-nanobody results in a decrease in total protein levels. In the revised western blot for phosphorylated TDP43 (Fig. 3D, E), we have quantified total and phosphorylated TDP43. Here, we observe a six-fold increase in the levels of phosphorylated TDP43 without a significant change in total TDP43 protein levels.
 
 To avoid potential mis-interpretation of our results, we have now removed the previous Fig. 1G.
 
 (2) Additional images of nearly all microscopy data at higher magnifications would be required to better evaluate TDP-43 localization. Ideally including images for each channel in addition to merged images, and especially for key figures such as Figure 1B, 3B, 3F.
 
 Better images have been provided.
 
 (3) No control images were shown for Figure 1F and 3F. It is unclear what the bright punctate spots of cytoplasmic TDP-43 GFP signal represent. Are these true aggregates? If so, additional characterization would be required before such conclusions can be made, beyond the relatively superficial western blot analysis that was done in Figure 1.
 
 Control images have now been provided (Figure 1E). As we mentioned above, immunostaining analysis to characterize whether the aggregates are phosphorylated failed to provide a clear signal. However, we have now confirmed that the mislocalized TDP43 is indeed hyper-phosphorylated (Figure 3D, E). We have acknowledged this in the main text, and have referred to these as puncta reminiscent of aggregates (Page 4, Page 6).
 
 AuthorResponse

URL

biorxiv.org/content/10.1101/2023.10.24.563760v3
www.biorxiv.org

Bridging verbal coordination and neural dynamics

1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This paper reports on an important study that aims to move beyond current experimental approaches in speech production by (1) investigating speech in the context of a fully interactive task and (2) employing advanced methodology to record intracranial brain activity. Together these allow for examination of the unfolding temporal dynamics of brain-behaviour relationships during interactive speech. The approach and the analyses presented provide convincing evidence in support of the authors' claims.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 This paper reports an intracranial SEEG study of speech coordination, where participants synchronize their speech output with a virtual partner that is designed to vary its synchronization behavior. This allows the authors to identify electrodes throughout the left hemisphere of the brain that have activity (both power and phase) that correlates with the degree of synchronization behavior. They find that high-frequency activity in secondary auditory cortex (superior temporal gyrus) is correlated to synchronization, in contrast to primary auditory regions. Furthermore, activity in inferior frontal gyrus shows a significant phase-amplitude coupling relationship that is interpreted as compensation for deviation from synchronized behavior with the virtual partner.
 
 Strengths:
 
 (1) The development of a virtual partner model trained for each individual participant, which can dynamically vary its synchronization to the participant's behavior in real time, is novel and exciting.
 
 (2) Understanding real-time temporal coordination for behaviors like speech is a critical and understudied area.
 
 (3) The use of SEEG provides the spatial and temporal resolution necessary to address the complex dynamics associated with the behavior.
 
 (4) The paper provides some results that suggest a role for regions like IFG and STG in the dynamic temporal coordination of behavior both within an individual speaker and across speakers performing a coordination task.
 
 Weaknesses:
 
 (1) The main weakness of the paper is that the results are presented in a largely descriptive and vague manner. For instance, while the interpretation about predictive coding and error correction is interesting, it is not clear how the experimental design or analyses specifically support such a model, or how they differentiate that model from the alternatives. It's possible that some greater specificity could be achieved by a more detailed examination of this rich dataset, for example by characterizing the specific phase relationships (e.g., positive vs negative lags) in areas that show correlations with synchronization behavior. However, as written, it is difficult to understand what these results tell us about how coordination behavior arises.
 
 (2) In the results section, there's a general lack of quantification. While some of the statistics reported in the figures are helpful, there are also claims that are stated without any statistical test. For example, in the paragraph starting on line 342, it is claimed that there is an inverse relationship between rho-value and frequency band, "possibly due to the reversed desynchronization/synchronization process in low and high frequency bands". Based on Figure 3, the first part of this statement appears to be true qualitatively, but is not quantified, and is therefore impossible to assess in relation to the second part of the claim. Similarly, the next paragraph on line 348 describes optimal clustering, but statistics of the clustering algorithm and silhouette metric are not provided. More importantly, it's not entirely clear what is being clustered - is the point to identify activity patterns that are similar within/across brain regions? Or to interpret the meaning of the specific patterns? If the latter, this is not explained or explored in the paper.
 
 (3) Given the design of the stimuli, it would be useful to know more about how coordination relates to specific speech units. The authors focus on the syllabic level, which is understandable. But as far as the results relate to speech planning (an explicit point in the paper), the claims could be strengthened by determining whether the coordination signal (whether error correction or otherwise) is specifically timed to e.g., the consonant vs the vowel. If the mechanism is a phase reset, does it tend to occur on one part of the syllable?
 
 (4) In the discussion the results are related to a previously described speech-induced suppression effect. However, it's not clear what the current results have to do with SIS, since the speaker's own voice is present and predictable from the forward model on every trial. Statements such as "Moreover, when the two speech signals come close enough in time, the patient possibly perceives them as its own voice" are highly speculative and apparently not supported by the data.
 
 (5) There are some seemingly arbitrary decisions made in the design and analysis that, while likely justified, need to be explained. For example, how were the cutoffs for moderate coupling vs phase-shifted coupling (k ~0.09) determined? This is noted as "rather weak" (line 212), but it's not clear where this comes from. Similarly, the ROI-based analyses are only done on regions "recorded in at least 7 patients" - how was this number chosen? How many electrodes total does this correspond to? Is there heterogeneity within each ROI?
 
 Comments on revisions:
 
 The authors have generally responded to the critiques from the first round of review, and have provided additional details that help readers to understand what was done.
 
 In my opinion, the paper still suffers from a lack of clarity about the interpretation, which is partly due to the fact that the results themselves are not straightforward. For example, the heterogeneity across individual electrodes that is obvious from Fig 3 makes it hard to justify the ROI-based approach. And even the electrode clustering, while more data-driven, does not substantially help the fact that the effects appear to be less spatially-organized than the authors may want to claim.
 
 I recognize the value of introducing this new mutual adaptation paradigm, which is the main strength of the paper. However, the conclusions that can be drawn from the data presented here seem incomplete at best.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 This paper investigates the neural underpinnings of an interactive speech task requiring verbal coordination with another speaker. To achieve this, the authors recorded intracranial brain activity from the left (and to a lesser extent, the right) hemisphere in a group of drug-resistant epilepsy patients while they synchronised their speech with a 'virtual partner'. Crucially, the authors were able to manipulate the degree of success of this synchronisation by programming the virtual partner to either actively synchronise or desynchronise their speech with the participant, or else to not vary its speech in response to the participant (making the synchronisation task purely one-way). Using such a paradigm, the authors identified different brain regions that were either more sensitive to the speech of the virtual partner (primary auditory cortex), or more sensitive to the degree of verbal coordination (i.e. synchronisation success) with the virtual partner (left secondary auditory cortex and bilateral IFG). Such sensitivity was measured by (1) calculating the correlation between the index of verbal coordination and mean power within a range of frequency bands across trials, and (2) calculating the phase-amplitude coupling between the behavioural and brain signals within single trials (using the power of high-frequency neural activity only). Overall, the findings help to elucidate some of the brain areas involved in interactive speaking behaviours, particularly highlighting high-frequency activity of the bilateral IFG as a potential candidate supporting verbal coordination.
 
 Strengths:
 
 This study provides the field with a convincing demonstration of how to investigate speaking behaviours in more complex situations that share many features with real-world speaking contexts e.g. simultaneous engagement of speech perception and production processes, the presence of an interlocutor and the need for inter-speaker coordination. The findings thus go beyond previous work that has typically studied solo speech production in isolation, and represent a significant advance in our understanding of speech as a social and communicative behaviour. It is further an impressive feat to develop a paradigm in which the degree of cooperativity of the synchronisation partner can be so tightly controlled; in this way, this study combines the benefits of using pre-recorded stimuli (namely, the high degree of experimental control) with the benefits of using a live synchronisation partner (allowing the task to be truly two-way interactive, an important criticism of other work using pre-recorded stimuli). A further key strength of the study lies in its employment of stereotactic EEG to measure brain responses with both high temporal and spatial resolution, an ideal method for studying the unfolding relationship between neural processing and this dynamic coordination behaviour.
 
 Weaknesses:
 
 One limitation of the current study is the relatively sparse coverage of the right hemisphere by the implanted electrodes (91 electrodes in the right compared to 145 in the left). Of course, electrode location is solely clinically motivated, and so the authors did not have control over this. In a previous version of this article, the authors therefore chose not to include data from the right hemisphere in reported analyses. However, after highlighting previous literature suggesting that the right hemisphere likely has high relevance to verbal coordination behaviours such as those under investigation here, the authors have now added analyses of the right hemisphere data to the results. These confirm an involvement of the right hemisphere in this task, largely replicating left hemisphere results. Some hemispheric differences were found in responses within the STG; however, interpretation should be tempered by an awareness of the relatively sparse coverage of the right hemisphere meaning that some regions have very few electrodes, resulting in reduced statistical power.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Public Reviews:
 
 Reviewer #1 (Public Review):
 
 Summary:
 
 This paper reports an intracranial SEEG study of speech coordination, where participants synchronize their speech output with a virtual partner that is designed to vary its synchronization behavior. This allows the authors to identify electrodes throughout the left hemisphere of the brain that have activity (both power and phase) that correlates with the degree of synchronization behavior. They find that high-frequency activity in the secondary auditory cortex (superior temporal gyrus) is correlated to synchronization, in contrast to primary auditory regions. Furthermore, activity in the inferior frontal gyrus shows a significant phase-amplitude coupling relationship that is interpreted as compensation for deviation from synchronized behavior with the virtual partner.
 
 Strengths:
 
 (1) The development of a virtual partner model trained for each individual participant, which can dynamically vary its synchronization to the participant's behavior in real-time, is novel and exciting.
 
 (2) Understanding real-time temporal coordination for behaviors like speech is a critical and understudied area.
 
 (3) The use of SEEG provides the spatial and temporal resolution necessary to address the complex dynamics associated with the behavior.
 
 (4) The paper provides some results that suggest a role for regions like IFG and STG in the dynamic temporal coordination of behavior both within an individual speaker and across speakers performing a coordination task.
 
 We thank the Reviewer for their positive comments on our manuscript.
 
 Weaknesses:
 
 (1) The main weakness of the paper is that the results are presented in a largely descriptive and vague manner. For instance, while the interpretation of predictive coding and error correction is interesting, it is not clear how the experimental design or analyses specifically support such a model, or how they differentiate that model from the alternatives. It's possible that some greater specificity could be achieved by a more detailed examination of this rich dataset, for example by characterizing the specific phase relationships (e.g., positive vs negative lags) in areas that show correlations with synchronization behavior. However, as written, it is difficult to understand what these results tell us about how coordination behavior arises.
 
 We understand the reviewer’s comment. It is true that this work, being the first in the field to combine real-time adaptive synchronous speech with intracerebral neural data, is descriptive; we hope it will pave the way for further studies. We have now added more statistical analyses (see point 2) to go beyond a descriptive approach, and we have also rewritten the discussion to clarify how this work can help to disentangle different models of language interaction. Most importantly, we have also run new analyses taking into account the specific phase relationship, as suggested.
 
 We already had an analysis using instantaneous phase difference in the phase-amplitude coupling approach, that bridges phase of behaviour to neural responses (amplitude in the high-frequency range). However, this analysis, as the reviewer noted, does not distinguish between positive and negative lags, but rather uses the continuous fluctuations of coordinative behaviour. Following the reviewer’s suggestion, we have now run a new analysis estimating the average delay (between virtual partner speech and patient speech) in each trial, using a cross-correlation approach. This gives a distribution of delays across trials that can then be “binned” as positive or negative. We have thus rerun the phase-amplitude coupling analyses on positive and negative trials separately, to assess whether the phase amplitude relationship depends upon the anticipatory (negative lags) or compensatory (positive lags) behaviour. Our new analysis (now in the supplementary, see figure below) does not reveal significant differences between positive and negative lags. This lack of difference, although not easy to interpret, is nonetheless interesting because it seems to show that the IFG does not have a stronger coupling for anticipatory trials. Rather the IFG seems to be strongly involved in adjusting behaviour, minimizing the error, independently of whether this is early or late.
 
 We have updated the “Coupling behavioural and neurophysiological data” section in Materials and methods as follows:
 
 “In the third approach, we assessed whether the phase-amplitude relationship (or coupling) depends upon the anticipatory (negative delays) or compensatory (positive delays) behaviour between the VP and the patients’ speech. We computed the average delay in each trial using a cross-correlation approach on speech signals (between patient and VP) with the MATLAB function xcorr. A median split (patient-specific; average median split = 0 ms, average sd = 24 ms) was applied to conserve a sufficient amount of data, classifying trials below the median as “anticipatory behaviour” and trials above the median as “compensatory behaviour”. Then we conducted the phase-amplitude coupling analyses on positive and negative trials separately.”
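
 The delay estimate and median split described in this passage can be sketched as follows. This is a minimal Python illustration, not the authors' MATLAB code: it assumes each trial supplies two speech envelopes sampled at the same rate, uses NumPy's `np.correlate` in place of `xcorr`, and adopts the convention that a positive lag means the patient trails the VP. The function names `trial_delay_ms` and `median_split` are hypothetical.

```python
import numpy as np


def trial_delay_ms(patient_env, vp_env, fs):
    """Lag (in ms) at which the cross-correlation of the two envelopes peaks.

    Assumed sign convention: positive = patient speech lags the VP.
    """
    a = np.asarray(patient_env, dtype=float)
    b = np.asarray(vp_env, dtype=float)
    a = a - a.mean()
    b = b - b.mean()
    xc = np.correlate(a, b, mode="full")
    # Lags corresponding to each entry of the full cross-correlation.
    lags = np.arange(-(len(b) - 1), len(a))
    return 1000.0 * lags[np.argmax(xc)] / fs


def median_split(delays):
    """Boolean masks for trials below / above the per-patient median delay.

    Trials exactly at the median fall in neither group in this sketch.
    """
    delays = np.asarray(delays, dtype=float)
    med = np.median(delays)
    return delays < med, delays > med
```

 Under these assumptions, the two masks returned by `median_split` would select the "anticipatory" and "compensatory" trials on which the coupling analysis is then run separately.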
 
 We also added a paragraph on this finding in the Discussion:
 
 “Our results highlight the involvement of the inferior frontal gyrus (IFG) bilaterally, in particular the BA44 region, in speech coordination. First, trials with weak verbal coordination (VCI) are accompanied by more prominent high frequency activity (HFa, Fig.4; Fig.S4). Second, when considering the within-trial time-resolved dynamics, the phase-amplitude coupling (PAC) reveals a tight relation between the low frequency behavioural dynamics (phase) and the modulation of high-frequency neural activity (amplitude, Fig.5B; Fig.S5). This relation is strongest when considering the phase adjustments rather than the phase of speech of the VP per se: larger deviations in verbal coordination are accompanied by an increase in HFa. Additionally, we also tested for potential effects of different asynchronies (i.e., temporal delay) between the participant's speech and that of the virtual partner but found no significant differences (Fig.S6). While the lack of a delay effect does not allow conclusions about the sensitivity of BA44 to the absolute timing of the partner’s speech, its neural dynamics are linked to the ongoing process of resolving phase deviations and maintaining synchrony.”
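
 To make the phase-amplitude coupling idea concrete, the sketch below computes a mean-vector-length coupling index between a behavioural phase series and a neural amplitude series. The estimator actually used in the paper is not specified in this response, so the mean-vector-length choice is an assumption, and `mean_vector_length` is a hypothetical name. Inputs are assumed to be equal-length arrays, with phase in radians.

```python
import numpy as np


def mean_vector_length(phase, amplitude):
    """Coupling index: magnitude of the amplitude-weighted mean phase vector.

    Near 0 when amplitude is unrelated to phase; larger when amplitude is
    systematically higher at a preferred phase.
    """
    phase = np.asarray(phase, dtype=float)
    amplitude = np.asarray(amplitude, dtype=float)
    return np.abs(np.mean(amplitude * np.exp(1j * phase)))
```

 In practice such an index would be compared against surrogate data (for example, time-shifted amplitude series) before being interpreted; that step is omitted here.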
 
 (2) In the results section, there's a general lack of quantification. While some of the statistics reported in the figures are helpful, there are also claims that are stated without any statistical test. For example, in the paragraph starting on line 342, it is claimed that there is an inverse relationship between rho-value and frequency band, "possibly due to the reversed desynchronization/synchronization process in low and high frequency bands". Based on Figure 3, the first part of this statement appears to be true qualitatively, but is not quantified, and is therefore impossible to assess in relation to the second part of the claim. Similarly, the next paragraph on line 348 describes optimal clustering, but statistics of the clustering algorithm and silhouette metric are not provided. More importantly, it's not entirely clear what is being clustered - is the point to identify activity patterns that are similar within/across brain regions? Or to interpret the meaning of the specific patterns? If the latter, this is not explained or explored in the paper.
 
 The reviewer is right. We have now added statistical analyses showing that:
 
 (1) the ratio between synchronization and desynchronization evolves across frequencies (as often reported in the literature).
 
 (2) the sign of rho values also evolves across frequencies.
 
 (3) the clustering does indeed differ when taking into account behaviour. We have also clarified the use of clustering and the reasoning behind it.
 
 We have updated the Materials and methods section as follows:
 
 “The statistical difference between spatial clustering in global effect and brain-behaviour correlation was estimated with linear model using the R function lm (stat package), post-hoc comparisons were corrected for multiple comparisons using the Tukey test (lsmeans R package ; Lenth, 2016). The statistical difference between clustering in global effect and behaviour correlation across the number of clusters was estimated using permutation tests (N=1000) by computing the silhouette score difference between the two conditions.”

 We have updated the Results section as follows:
 
 (1) “This modulation between synchronization and desynchronization across frequencies was significant (F(5) = 6.42, p < .001 ; estimated with linear model using the R function lm).”
 
 (2) “The first observation is a gradual transition in the direction of correlations as we move up frequency bands, from positive correlations at low frequencies to negative ones at high frequencies (F(5) = 2.68, p = .02). This effect, present in both hemispheres, mimics the reversed desynchronization/synchronization process in low and high frequency bands reported above.”
 
 (3) “Importantly, compared to the global activity (task vs rest, Fig 3A), the neural spatial profile of the behaviour-related activity (Fig 3B) is more clustered, in the left hemisphere. Indeed, silhouette scores are systematically higher for behaviour-related activity compared to global activity, indicating greater clustering consistency across frequency bands (t(106) = 7.79, p < .001, see Figure S3). Moreover, silhouette scores are maximal, in particular for HFa, for five clusters (p < .001), located in the IFG BA44, the IPL BA 40 and the STG BA 41/42 and BA22 (see Figure S3).”
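
 The permutation test on silhouette-score differences mentioned in the updated methods can be illustrated with a short sketch. It assumes the silhouette scores for the two conditions are already available as arrays and that condition labels are shuffled across the pooled scores; the response does not say whether the authors permuted this way, and `permutation_pvalue` is a hypothetical name.

```python
import numpy as np


def permutation_pvalue(scores_a, scores_b, n_perm=1000, seed=0):
    """Two-sided permutation test on the difference in mean score.

    Returns the observed difference (a minus b) and a p-value estimated by
    shuffling condition labels across the pooled scores.
    """
    rng = np.random.default_rng(seed)
    a = np.asarray(scores_a, dtype=float)
    b = np.asarray(scores_b, dtype=float)
    observed = a.mean() - b.mean()
    pooled = np.concatenate([a, b])
    count = 0
    for _ in range(n_perm):
        perm = rng.permutation(pooled)
        diff = perm[: len(a)].mean() - perm[len(a):].mean()
        if abs(diff) >= abs(observed):
            count += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (count + 1) / (n_perm + 1)
```

 With N = 1000 permutations, the smallest p-value this estimator can return is 1/1001, which is compatible with a reported threshold of p < .001.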
 
 (3) Given the design of the stimuli, it would be useful to know more about how coordination relates to specific speech units. The authors focus on the syllabic level, which is understandable. But as far as the results relate to speech planning (an explicit point in the paper), the claims could be strengthened by determining whether the coordination signal (whether error correction or otherwise) is specifically timed to e.g., the consonant vs the vowel. If the mechanism is a phase reset, does it tend to occur on one part of the syllable?
 
 Thank you for this thoughtful feedback. We agree that the relationship between speech coordination and specific speech units, such as consonants versus vowels, is an intriguing question. However, in our study, both interlocutors (the participant and the virtual partner) are adapting their speech production in real-time. This interactive coordination makes it difficult to isolate neural signatures corresponding to precise segments like consonants or vowels, as the adjustments occur in a continuous and dynamic context.
 
 The VP's ability to adapt depends on its sensitivity to spectral cues, such as the transition from one phonetic element to another. This is likely influenced by the type of articulation, with certain transitions being more salient (e.g., between a stop consonant like "p" and a vowel like "a") and others being less distinct (e.g., between nasal consonants like "m" and a vowel). Thus, the VP’s spectral adaptation tends to occur at these transitions, which are more prominent in some cases than in others.
 
 For the participants, previous studies have shown a greater sensitivity during the production of stressed vowels (Oschkinat & Hoole, 2022; Li & Lancia, 2024), which may reflect a heightened attentional or motor adjustment to stressed syllables.
 
 Here, we did not specifically address the question of coordination at the level of individual linguistic units. Moreover, even if we attempted to focus on this level, it would be challenging to relate neural dynamics directly to specific speech segments. The question of how synchronization at the level of individual linguistic units might relate to neural data is complex. The lack of clear, unit-specific predictions makes it difficult to parse out distinct neural signatures tied to individual segments, particularly when both interlocutors are continuously adjusting their speech in relation to one another.
 
 Therefore, while we recognize the potential importance of examining synchronization at the level of individual phonetic elements, the design of our task and the nature of the coordination in this interactive context (real-time bidirectional adaptation) led us to focus more broadly on the overall dynamics of speech synchronization at the syllabic level, rather than on specific linguistic units.
 
 We now state at the end of the Discussion section:
 
 “It is worth noting that the influence of specific speech units, such as consonants versus vowels, on speech coordination remains to be explored. In non-interactive contexts, participants show greater sensitivity during the production of stressed vowels, possibly reflecting heightened attentional or motor adjustments (Oschkinat & Hoole, 2022; Li & Lancia, 2024). In this study, the VP’s adaptation relies on sensitivity to spectral cues, particularly phonetic transitions, with some (e.g., formant transitions) being more salient than others. However, how these effects manifest in an interactive setting remains an open question, as both interlocutors continuously adjust their speech in real time. Future studies could investigate whether coordination signals, such as phase resets, preferentially align with specific parts of the syllable.”

 References cited:
 
 – Oschkinat, M., & Hoole, P. (2022). Reactive feedback control and adaptation to perturbed speech timing in stressed and unstressed syllables. Journal of Phonetics, 91, 101133.
 
 – Li, J., & Lancia, L. (2024). A multimodal approach to study the nature of coordinative patterns underlying speech rhythm. In Proc. Interspeech, 397-401.
 
 (4) In the discussion the results are related to a previously-described speech-induced suppression effect. However, it's not clear what the current results have to do with SIS, since the speaker's own voice is present and predictable from the forward model on every trial. Statements such as "Moreover, when the two speech signals come close enough in time, the patient possibly perceives them as its own voice" are highly speculative and apparently not supported by the data.
 
 We thank the reviewer for raising thoughtful concerns about our interpretation of the observed neural suppression as related to speaker-induced suppression (SIS). We agree that our study lacks a passive listening condition, which limits direct comparisons to the original SIS effect, traditionally defined as the suppression of neural responses to self-produced speech compared to externally-generated speech (Meekings & Scott, 2021).
 
 In response, we have reconsidered our terminology and interpretation. In the revised Discussion section, we refer to our findings as a "SIS-related phenomenon specific to the synchronous speech context". Unlike classic SIS paradigms, our interactive task involves simultaneous monitoring of self- and externally-generated speech, introducing additional attentional and coordinative demands.
 
 The revised Discussion also incorporates findings by Ozker et al. (2022, 2024), which link SIS and speech monitoring, suggesting that suppressing responses to self-generated speech facilitates error detection. We propose that the decrease in high-frequency activity (HFa) as verbal coordination increases reflects reduced error signals due to closer alignment between perceived and produced speech. Conversely, HFa increases with reduced coordination may signify greater prediction error.
 
 Additionally, we relate our findings to the "rubber voice" effect (Zheng et al., 2011; Lind et al., 2014; Franken et al., 2021), where temporally and phonetically congruent external speech can be perceived as self-generated. We speculate that this may occur in synchronous speech tasks when the participant's and VP's speech signals closely align. However, this interpretation remains speculative, as no subjective reports were collected to confirm this perception. Future studies could include participant questionnaires to validate this effect and relate subjective experience to neural measures of synchronization.
 
 Overall, our findings extend the study of SIS to dynamic, interactive contexts and contribute to understanding internal forward models of speech production in more naturalistic scenarios.
 
 We have now added these points to the discussion as follows:
 
 “The observed negative correlation between verbal coordination and high-frequency activity (HFa) in STG BA22 suggests a suppression of neural responses as the degree of behavioural synchrony increases. This result is reminiscent of findings on speaker-induced suppression (SIS), where neural activity in auditory cortex decreases during self-generated speech compared to externally-generated speech (Meekings & Scott, 2021; Niziolek et al., 2013). However, our paradigm differs from traditional SIS studies in two critical ways: (1) the speaker's own voice is always present and predictable from the forward model, and (2) no passive listening condition was included. Therefore, our findings cannot be directly equated with the original SIS effect.
 
 Instead, we propose that the suppression observed here reflects a SIS-related phenomenon specific to the synchronous speech context. Synchronous speech requires simultaneous monitoring of self- and externally-generated speech, a task that is both attentionally demanding and coordinative. This aligns with evidence from Ozker et al. (2024, 2022), showing that the same neural populations in STG exhibit SIS and heightened responses to feedback perturbations. These findings suggest that SIS and speech monitoring are related processes, where suppressing responses to self-generated speech facilitates error detection. In our study, suppression of HFa as coordination increases may reflect reduced prediction errors due to closer alignment between perceived and produced speech signals. Conversely, increased HFa during poor coordination may signify greater mismatch, consistent with prediction error theories (Houde & Nagarajan, 2011; Friston et al., 2020). Furthermore, when self- and externally-generated speech signals are temporally and phonetically congruent, participants may perceive external speech as their own. This echoes the "rubber voice" effect, where external speech resembling self-produced feedback is perceived as self-generated (Zheng et al., 2011; Lind et al., 2014; Franken et al., 2021). While this interpretation remains speculative, future studies could incorporate subjective reports to investigate this phenomenon in more detail.”

 References cited:
 
 – Franken, M. K., Hartsuiker, R. J., Johansson, P., Hall, L., & Lind, A. (2021). Speaking With an Alien Voice: Flexible Sense of Agency During Vocal Production. Journal of Experimental Psychology-Human perception and performance, 47(4), 479-494. https://doi.org/10.1037/xhp0000799
 
 – Houde, J. F., & Nagarajan, S. S. (2011). Speech production as state feedback control. Frontiers in human neuroscience, 5, 82.
 
 – Lind, A., Hall, L., Breidegard, B., Balkenius, C., & Johansson, P. (2014). Speakers' acceptance of real-time speech exchange indicates that we use auditory feedback to specify the meaning of what we say. Psychological Science, 25(6), 1198-1205. https://doi.org/10.1177/0956797614529797
 
 – Meekings, S., & Scott, S. K. (2021). Error in the Superior Temporal Gyrus? A Systematic Review and Activation Likelihood Estimation Meta-Analysis of Speech Production Studies. Journal of Cognitive Neuroscience, 33(3), 422-444. https://doi.org/10.1162/jocn_a_01661
 
 – Niziolek, C. A., Nagarajan, S. S., & Houde, J. F. (2013). What does motor efference copy represent? Evidence from speech production. Journal of Neuroscience, 33, 16110–16116.

 – Ozker, M., Doyle, W., Devinsky, O., & Flinker, A. (2022). A cortical network processes auditory error signals during human speech production to maintain fluency. PLoS Biology, 20.
 
 – Ozker, M., Yu, L., Dugan, P., Doyle, W., Friedman, D., Devinsky, O., & Flinker, A. (2024). Speech-induced suppression and vocal feedback sensitivity in human cortex. eLife, 13, RP94198. https://doi.org/10.7554/eLife.94198
 
 – Zheng, Z. Z., MacDonald, E. N., Munhall, K. G., & Johnsrude, I. S. (2011). Perceiving a Stranger's Voice as Being One's Own: A 'Rubber Voice' Illusion? PLOS ONE, 6(4), e18655.
 
 (5) There are some seemingly arbitrary decisions made in the design and analysis that, while likely justified, need to be explained. For example, how were the cutoffs for moderate coupling vs phase-shifted coupling (k ~0.09) determined? This is noted as "rather weak" (line 212), but it's not clear where this comes from. Similarly, the ROI-based analyses are only done on regions "recorded in at least 7 patients" - how was this number chosen? How many electrodes total does this correspond to? Is there heterogeneity within each ROI?
 
 The reviewer is correct, and we apologize for this missing information. We now specify that the coupling values were empirically determined on the basis of a pilot experiment in order to induce more or less synchronization, while keeping the phase-shifted coupling at a rather implicit level.
 
 Concerning the definition of coupling as weak, one should consider that, in the Kuramoto model, the strength of coupling (k) is relative to the spread of the natural frequencies (Δω) in the system. In our study, the natural frequencies of syllables range approximately from 2 Hz to 10 Hz, resulting in a frequency spread of Δω = 8 Hz. For coupling to strongly synchronize oscillators across such a wide range, k must be comparable to or exceed Δω. Since k = 0.1 is far smaller than Δω, it is classified as weak coupling.
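
 A two-oscillator reduction makes this scaling argument concrete. The sketch below assumes one-way coupling (the VP adapts to the participant), for which the phase difference φ obeys dφ/dt = Δω − k·sin(φ) and can lock only if |Δω| ≤ |k|. The reduction, the rad/s units, and the function name `phase_drift` are illustrative assumptions and do not describe the actual VP implementation.

```python
import numpy as np


def phase_drift(delta_omega, k, duration=20.0, dt=1e-3):
    """Euler-integrate d(phi)/dt = delta_omega - k * sin(phi) from phi = 0.

    delta_omega: difference in natural frequencies (rad/s, assumed units).
    k: coupling strength (same units).
    Returns the phase difference (rad) reached after `duration` seconds.
    """
    phi = 0.0
    for _ in range(round(duration / dt)):
        phi += dt * (delta_omega - k * np.sin(phi))
    return phi


# Strong coupling relative to the detuning: phi settles at arcsin(delta_omega / k).
locked = phase_drift(1.0, 2.0)

# Weak coupling relative to an 8 Hz spread: phi keeps growing (no phase locking).
drifting = phase_drift(2 * np.pi * 8, 0.1)
```

 In the first call the phase difference stays bounded, whereas in the second it accumulates many cycles, which is the sense in which a coupling much smaller than the frequency spread is "weak".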
 
 We have now modified the Materials and methods section as follows:
 
 “More precisely, for a third of the trials the VP had a neutral behaviour (close to zero coupling: k = +/- 0.01). For a third it had a moderate coupling, meaning that the VP synchronised more to the participant speech (k = -0.09). And for the last third of the trials the VP had a moderate coupling but with a phase shift of pi/2, meaning that it moderately aimed to speak in between the participant syllables (k = + 0.09). The coupling values were empirically determined on the basis of a pilot experiment in order to induce more or less synchronization but keeping the phase-shifted coupling at a rather implicit level. In other terms, while participants knew that the VP would adapt, they did not necessarily know in which direction the coupling went.”
 
 Regarding the criterion of including regions recorded in at least 7 patients, our goal was to balance data completeness with statistical power. Given our total sample of 16 patients, this threshold ensures that each included region is represented in at least ~44% of the cohort, reducing the likelihood of spurious findings due to extremely small sample sizes. This choice also aligns with common neurophysiological analysis practices, where a minimum number of subjects (at least 2 in extreme cases) is required to achieve meaningful interindividual comparisons while avoiding excessive data exclusion. Additionally, this threshold maintains a reasonable tradeoff between maximizing patient inclusion and ensuring that statistical tests remain robust.
 
 We have now added more information in the Results section “Spectral profiles in the language network are nuanced by behaviour” on this point as follows:
 
 “To balance data completeness and statistical power, we included only brain regions recorded in at least 7 patients (~44% of the cohort) for the left hemisphere and at least 5 patients for the right hemisphere (~31% of the cohort), ensuring sufficient representation while minimizing biases due to sparse data.”
 
 Reviewer #2 (Public Review):
 
 Summary:
 
 This paper investigates the neural underpinnings of an interactive speech task requiring verbal coordination with another speaker. To achieve this, the authors recorded intracranial brain activity from the left hemisphere in a group of drug-resistant epilepsy patients while they synchronised their speech with a 'virtual partner'. Crucially, the authors were able to manipulate the degree of success of this synchronisation by programming the virtual partner to either actively synchronise or desynchronise their speech with the participant, or else to not vary its speech in response to the participant (making the synchronisation task purely one-way). Using such a paradigm, the authors identified different brain regions that were either more sensitive to the speech of the virtual partner (primary auditory cortex), or more sensitive to the degree of verbal coordination (i.e. synchronisation success) with the virtual partner (secondary auditory cortex and IFG). Such sensitivity was measured by (1) calculating the correlation between the index of verbal coordination and mean power within a range of frequency bands across trials, and (2) calculating the phase-amplitude coupling between the behavioural and brain signals within single trials (using the power of high-frequency neural activity only). Overall, the findings help to elucidate some of the left hemisphere brain areas involved in interactive speaking behaviours, particularly highlighting the high-frequency activity of the IFG as a potential candidate supporting verbal coordination.
 
 Strengths:
 
 This study provides the field with a convincing demonstration of how to investigate speaking behaviours in more complex situations that share many features with real-world speaking contexts e.g. simultaneous engagement of speech perception and production processes, the presence of an interlocutor, and the need for inter-speaker coordination. The findings thus go beyond previous work that has typically studied solo speech production in isolation, and represent a significant advance in our understanding of speech as a social and communicative behaviour. It is further an impressive feat to develop a paradigm in which the degree of cooperativity of the synchronisation partner can be so tightly controlled; in this way, this study combines the benefits of using pre-recorded stimuli (namely, the high degree of experimental control) with the benefits of using a live synchronisation partner (allowing the task to be truly two-way interactive, an important criticism of other work using pre-recorded stimuli). A further key strength of the study lies in its employment of stereotactic EEG to measure brain responses with both high temporal and spatial resolution, an ideal method for studying the unfolding relationship between neural processing and this dynamic coordination behaviour.
 
 We sincerely appreciate the Reviewer's thoughtful and positive feedback on our manuscript.
 
 Weaknesses:
 
 One major limitation of the current study is the lack of coverage of the right hemisphere by the implanted electrodes. Of course, electrode location is solely clinically motivated, and so the authors did not have control over this. However, this means that the current study neglects the potentially important role of the right hemisphere in this task. The right hemisphere has previously been proposed to support feedback control for speech (likely a core process engaged by synchronous speech), as opposed to the left hemisphere which has been argued to underlie feedforward control (Tourville & Guenther, 2011). Indeed, a previous fMRI study of synchronous speech reported the engagement of a network of right hemisphere regions, including STG, IPL, IFG, and the temporal pole (Jasmin et al., 2016). Further, the release from speech-induced suppression during a synchronous speech reported by Jasmin et al. was found in the right temporal pole, which may explain the discrepancy with the current finding of reduced leftward high-frequency activity with increasing verbal coordination (suggesting instead increased speech-induced suppression for successful synchronisation). The findings should therefore be interpreted with the caveat that they are limited to the left hemisphere, and are thus likely missing an important aspect of the neural processing underpinning verbal coordination behaviour.
 
 We have now included, in the supplementary materials, data from the right hemisphere, although the coverage is a bit sparse (Figures S2, S4, S5, see our responses in the ‘Recommendation for the authors’ section, below). We have also revised the Discussion section to add the putative role of right temporal regions (see below as well).
 
 A further limitation of this study is that its findings are purely correlational in nature; that is, the results tell us how neural activity correlates with behaviour, but not whether it is instrumental in that behaviour. Elucidating the latter would require some form of intervention such as electrode stimulation, to disrupt activity in a brain area and measure the resulting effect on behaviour. Any claims therefore as to the specific role of brain areas in verbal coordination (e.g. the role of the IFG in supporting online coordinative adjustments to achieve synchronisation) are therefore speculative.
 
 We appreciate the reviewer’s observation regarding the correlational nature of our findings and agree that this is a common limitation of neuroimaging studies. While elucidating causal relationships would indeed require intervention techniques such as electrical stimulation, our study leverages the unique advantages of intracerebral recordings, offering the best available spatial and temporal resolution alongside a high signal-to-noise ratio. These attributes ensure that our data accurately reflect neural activity and its temporal dynamics, providing a robust foundation for understanding the relationship between neural processes and behaviour. Therefore, while causal claims are beyond the scope of this study, the precision of our methodology allows us to make well-supported observations about the neural correlates of synchronous speech tasks.
 
 Recommendations for the authors:
 
 Reviewing Editor Comment:
 
 After joint consultation, we are seeing the potential for the report to be strengthened and the evidence here to be deemed ultimately at least 'solid': to us (editors and reviewers) it seems that this would require both (1) clarifying/acknowledging the limitations of not having right hemisphere data, and (2) running some of the additional analyses the reviewers suggest, which should allow for richer examination of the data e.g. phase relationships in areas that correlate with synchronisation.
 
 We have now added data on the right hemisphere (RH) that we did not previously report due to a rather sparse sampling of the RH. These results are now reported in the Results section as well as in the Supplementary section, where we put all right hemisphere figures for all analyses (Figure S2, S4, S5). We have also run additional analyses digging into the phase relationship in areas that correlate with synchronisation (Figure S6). These additional analyses allowed us to improve the Discussion section as well.
 
 Reviewer #1 (Recommendations For The Authors):
 
 In some sections, the writing is a bit unclear, with both typos and vague statements that could be fixed with careful proofreading.
 
 We thank the reviewer for pointing out areas where the writing could be improved. We carefully proofread the manuscript to address typos and clarify any vague statements. Specific sections identified as unclear have been rephrased for better precision and readability.
 
 In Figure 1, the colors repeat, making it impossible to tell patients apart.
 
 We have now updated the Figure 1 colormap to avoid redundancy and added the right hemisphere.
 
 Line 132: "16 unilateral implantations (9 left, 7 bilateral implantations)". Should this say 7 right hemisphere? If so, the following sentence stating that there was "insufficient cover [sic] of the right hemisphere" is unclear, since the number of patients between LH and RH is similar.
 
 The confusion was due to the fact that the lateralization refers to the presence/absence of electrodes in the Heschl’s gyrus (left : H’ ; right : H) exclusively.
 
 We have thus changed this section as follows:
 
 “16 patients (7 women, mean age 29.8 y, range 17 - 50 y) with pharmacoresistant epilepsy took part in the study. They were included if their implantation map covered at least partially the Heschl's gyrus and had sufficiently intact diction to support relatively sustained language production.” The relevant part (previously line 132) now states:
 
 “Sixteen patients with a total of 236 electrodes (145 in the left hemisphere) and 2395 contacts (1459 in the left hemisphere, see Figure 1). While this gives a rather sparse coverage of the right hemisphere, we decided, due to the rarity of this type of data, to report results for both hemispheres, with figures for the left hemisphere in the main text and figures for the right hemisphere in the supplementary section.”
 
 Reviewer #2 (Recommendations For The Authors):
 
 (1) To address the concern regarding the absence of data from the right hemisphere, I would advise the authors to directly acknowledge this limitation in their Discussion section, citing relevant work suggesting that the right hemisphere has an important role to play in this task (e.g. Jasmin et al., 2016). You should also make this clear in your abstract e.g. you could rewrite the sentence in line 40 to be: "Then, we recorded the intracranial brain activity of the left hemisphere in 16 patients with drug-resistant epilepsy...".
 
 We are grateful to the reviewer for this comment that incited us to look into the right hemisphere data. We have now included results in the right hemisphere, although the coverage is a bit sparse. We have also revised the Discussion section to add the putative role of right temporal regions. Interestingly, our results show, as suggested by the reviewer, a clear involvement of the RH in this task.
 
 First, the full brain analyses show that the RH is involved in a very similar way to the LH (see Figure below). We have now added in the Results section:
 
 “As expected, the whole language network is strongly involved, including both dorsal and ventral pathways (Fig 3A). More precisely, in the left temporal lobe the superior, middle and inferior temporal gyri, in the left parietal lobe the inferior parietal lobule (IPL) and in the left frontal lobe the inferior frontal gyrus (IFG) and the middle frontal gyrus (MFG). Similar results are observed in the right hemisphere, neural responses being present across all six frequency bands with medium to large modulation in activity compared to baseline (Figure S2A) in the same regions. Desynchronizations are present in the theta, alpha and beta bands while the low gamma and HFa bands show power increases.”
 
 Compared to the left hemisphere, assessing brain-behaviour correlations in the right hemisphere does not provide the same statistical power, because some anatomical regions have very few electrodes. Nonetheless, we observe a strong correlation in the right IFG, similar to the one we previously reported in the left hemisphere, and we now report in the Results section:
 
 “The decrease in HFa along the dorsal pathway is replicated in the right hemisphere (Figure S4). However, while both the right STG BA41/42 and STG BA22 present a power increase compared to baseline (stronger for the STG BA41/42), neither shows a significant correlation with verbal coordination (t(45)=-1.65, p=.1 ; t(8)=-0.67, p=.5 ; Student’s T test, FDR correction). By contrast, results in the right IFG BA44 are similar to those observed in the left hemisphere, with a significant power increase associated with a negative brain-behaviour correlation (t(17) = -3.11, p = .01 ; Student’s T test, FDR correction).”
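 The statistics in the passage above can be illustrated with a minimal sketch. It assumes that each test is a one-sample Student's t-test of per-contact brain-behaviour correlation values against zero (which would make t(17) correspond to 18 values), and that the FDR correction is the Benjamini-Hochberg procedure. Neither assumption is stated in the text, and the function names are illustrative only.

```python
import math


def one_sample_t(values, popmean=0.0):
    """Return (t statistic, degrees of freedom) for a one-sample t-test."""
    n = len(values)
    mean = sum(values) / n
    # Unbiased sample variance (divides by n - 1).
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    t = (mean - popmean) / math.sqrt(var / n)
    return t, n - 1


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values, in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical correlation values for one region (not real data).
t_stat, dof = one_sample_t([-0.30, -0.12, -0.25, 0.05, -0.18])
print(t_stat, dof)
print(benjamini_hochberg([0.01, 0.04, 0.03, 0.5]))
```

 In practice the adjusted p-values would be compared with the chosen alpha level across all regions tested together, which is why sparsely sampled right-hemisphere regions lose power.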
 
 Interestingly, the phase-amplitude coupling analysis yields very similar results in both hemispheres (exception made for BA22). We have thus updated the Results section as follows:
 
 “Notably, when comparing – within the regions of interest previously described – the PAC with the virtual partner speech and the PAC with the phase difference, the coupling relationship changes when moving along the dorsal pathway: a stronger coupling in the auditory regions with the speech input, no difference between speech and coordination dynamics in the IPL and a stronger coupling for the coordinative dynamics compared to speech signal in the IFG (Figure 5B ). When looking at the right hemisphere, we observe the same changes in the coupling relationship when moving along the dorsal pathway, except that no difference between speech and coordination dynamics is present in the right secondary auditory regions (STG BA22; Figure S5).”
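 To make the phase-amplitude coupling (PAC) comparison more concrete, here is a minimal sketch of one common PAC estimate, the mean vector length. The text does not say which PAC metric the authors used, so this choice is an assumption, and the signals below are synthetic. The phase series stands in for either the virtual partner's speech or the phase difference, and the amplitude series for high-frequency activity.

```python
import cmath
import math


def mean_vector_length(phases, amplitudes):
    """Mean vector length: |mean(amplitude * exp(i * phase))|."""
    if len(phases) != len(amplitudes) or not phases:
        raise ValueError("phases and amplitudes must be non-empty and equal in length")
    total = sum(a * cmath.exp(1j * p) for a, p in zip(amplitudes, phases))
    return abs(total) / len(phases)


# Synthetic example: one full cycle of a slow phase signal.
n = 200
phases = [2 * math.pi * k / n for k in range(n)]

# Amplitude that peaks at phase 0 (coupled) versus constant amplitude (uncoupled).
coupled_amp = [1 + math.cos(p) for p in phases]
flat_amp = [1.0] * n

mvl_coupled = mean_vector_length(phases, coupled_amp)
mvl_flat = mean_vector_length(phases, flat_amp)
print(mvl_coupled, mvl_flat)
```

 A larger value for one phase signal than for the other, within the same region, is the kind of contrast the passage describes along the dorsal pathway.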
 
 We also included the right hemisphere results in the Discussion section, mentioning previous work by Guenther and by Jasmin. In the section “Left secondary auditory regions are more sensitive to coordinative behaviour” one can read:
 
 “Furthermore, the absence of correlation in the right STG BA22 (Figure S4) seems at first glance to challenge influential speech production models (e.g. Guenther & Hickok, 2016) that propose that the right hemisphere is involved in feedback control. However, one needs to consider that the task at stake heavily relied upon temporal mismatches and adjustments. In this context, the left-lateralized sensitivity to verbal coordination is reminiscent of the work of Floegel and colleagues (2020, 2023) suggesting that both hemispheres are involved depending on the type of error: the right auditory association cortex monitoring preferentially spectral speech features and the left auditory association cortex monitoring preferentially temporal speech features. Nonetheless, the right temporal pole seems to be sensitive to speech coordinative behaviour, confirming previous findings using fMRI (Jasmin et al., 2016) and thus showing that the right hemisphere has an important role to play in this type of task.”
 
 References cited:
 
 – Floegel, M., Fuchs, S., & Kell, C. A. (2020). Differential contributions of the two cerebral hemispheres to temporal and spectral speech feedback control. Nature Communications, 11(1), 2839.
 
 – Floegel, M., Kasper, J., Perrier, P., & Kell, C. A. (2023). How the conception of control influences our understanding of actions. Nature Reviews Neuroscience, 24(5), 313-329.
 
 – Guenther, F. H., & Hickok, G. (2016). Neural models of motor speech control. In Neurobiology of language (pp. 725-740). Academic Press.
 
 (2) When discussing previous work on alignment during synchronous speech, you may wish to include a recently published paper by Bradshaw et al (2024); this manipulated the acoustics of the accompanist's voice during a synchronous speech task to show interactions between speech motor adaptation and phonetic convergence/alignment.
 
 We thank the reviewer for pointing to this recent and interesting paper. We added the article as a reference as follows:
 
 “Furthermore, synchronous speech favors the emergence of alignment phenomena, for instance of the fundamental frequency or the syllable onset (Assaneo et al., 2019 ; Bradshaw & McGettigan, 2021 ; Bradshaw et al., 2023; Bradshaw et al., 2024).”
 
 (3) Line 80: "Synchronous speech resembles to a certain extent to delayed auditory feedback tasks"- I think you mean "altered auditory feedback tasks" here.
 
 In the case of synchronous speech it is more about timing than altered speech signals, that is why the comparison is done with delayed and not altered auditory feedback. Nonetheless, we understand the Reviewer’s point and we have now changed the sentence as follows:
 
 “Synchronous speech resembles to a certain extent to delayed/altered auditory feedback tasks”
 
 (4) When discussing superior temporal responses during such altered feedback tasks, you may also want to cite a review paper by Meekings and Scott (2021).
 
 We thank the reviewer for this suggestion, indeed this was a big oversight!
 
 The paper is now quoted in the introduction as follows:
 
 “Previous studies have revealed increased responses in the superior temporal regions compared to normal feedback conditions (Hirano et al., 1997 ; Hashimoto & Sakai, 2003 ; Takaso et al., 2010 ; Ozker et al., 2022 ; Floegel et al., 2020 ; see Meekings & Scott, 2021 for a review of error-monitoring and feedback control in the STG during speech production).”
 
 Furthermore, we updated the discussion part concerning the speaker-induced suppression phenomenon (see below our response to the point 10).
 
 (5) Line 125: "The parameters and sound adjustment were set using an external low-latency sound card (RME Babyface Pro Fs)". Can you please report the total feedback loop latency in your set-up? Or at the least cite the following paper which reports low latencies with this audio device.
 
 Kim, K. S., Wang, H., & Max, L. (2020). It's About Time: Minimizing Hardware and Software Latencies in Speech Research With Real-Time Auditory Feedback. Journal of Speech, Language, and Hearing Research, 63(8), 2522-2534. https://doi.org/10.1044/2020_JSLHR-19-00419
 
 We now report the total feedback loop latency (~5ms) and also cite the relevant paper (Kim et al., 2020).
 
 (6) Line 127 "A calibration was made to find a comfortable volume and an optimal balance for both the sound of the participant's own voice, which was fed back through the headphones, and the sound of the stimuli." What do you mean here by an 'optimal balance'? Was the participant's own voice always louder than the VP stimuli? Can you report roughly what you consider to be a comfortable volume in dB?
 
 This point was indeed unclear. We have now changed the text as follows:
 
 “A calibration was made to find a comfortable volume and an optimal balance for both the sound of the participant's own voice, which was fed back through the headphones, and the sound of the stimuli. The aim of this procedure was that the patient would subjectively perceive their voice and the VP-voice in equal measure. VP voice was delivered at approximately 70dB.”
 
 (7) Relatedly, did you use any noise masking to mask the air-conducted feedback from their own voice (which would have been slightly out of phase with the feedback through the headphones, depending on your latency)?
 
 Considering the low-latency condition allowed with the sound card (RME Babyface Pro Fs), we did not use noise masking to mask the air-conducted feedback from the self-voice of the patients.
 
 (8) Line 141: "four short sentences were pre-recorded by a woman and a man." Did all participants synchronise with both the man and woman or was the VP gender matched to that of the participant/patient?
 
 We thank the reviewer for pointing out this important missing detail. We have now changed the text as follows:
 
 “Four stimuli corresponding to four short sentences were pre-recorded by both a female and a male speaker. This allowed us to adapt to the natural gender differences in fundamental frequency (i.e. so that the VP gender matched that of the patients). All stimuli were normalised in amplitude.”
 
 (9) Can you clarify what instructions participants were given regarding the VP? That is, were they told that this was a recording or a real live speaker? Were they naïve to the manipulation of the VP's coupling to the participant?
 
 We have now added this information to the task description as follows:
 
 “Participants, comfortably seated in a medical chair, were instructed that they would perform a real-time interactive synchronous speech task with an artificial agent (Virtual Partner, henceforth VP, see next section) that can modulate and adapt to the participant’s speech in real time.”
 
 “The third step was the actual experiment. This was identical to the training but consisted of 24 trials (14s long, speech rate ~3Hz, yielding ~1000 syllables). Importantly, the VP varied its coupling behaviour to the participant. More precisely, for a third of the sequences the VP had a neutral behaviour (close to zero coupling : k = +/- 0.01). For a third it had a moderate coupling, meaning that the VP synchronised more to the participant speech (k = - 0.09). And for the last third of the sequences the VP had a moderate coupling but with a phase shift of pi/2, meaning that it moderately aimed to speak in between the participant syllables (k = + 0.09). The coupling values were empirically determined on the basis of a pilot experiment in order to induce more or less synchronization, but keeping the phase-shifted coupling at a rather implicit level. In other terms, while participants knew that the VP would adapt, they did not necessarily know in which direction the coupling went.”
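 The trial structure described above can be checked with a short worked calculation. The trial count, duration, speech rate and coupling values are taken from the quoted passage; the condition labels and the dictionary layout are illustrative assumptions, and the sign of the neutral coupling (given as +/- 0.01 in the text) is stored as a magnitude.

```python
# Values from the quoted task description.
N_TRIALS = 24
TRIAL_DURATION_S = 14
SPEECH_RATE_HZ = 3          # approximate syllable rate

# Coupling parameter k per condition; labels are illustrative, not the authors'.
CONDITIONS = {
    "neutral": {"k_magnitude": 0.01, "phase_shift": 0.0},
    "in_phase": {"k": -0.09, "phase_shift": 0.0},
    "phase_shifted": {"k": 0.09, "phase_shift": "pi/2"},
}

# Each condition covers a third of the sequences.
trials_per_condition = N_TRIALS // len(CONDITIONS)

# Approximate number of syllables produced over the whole experiment.
total_syllables = N_TRIALS * TRIAL_DURATION_S * SPEECH_RATE_HZ

print(trials_per_condition, total_syllables)
```

 The product 24 x 14 x 3 = 1008 matches the "~1000 syllables" stated in the passage, with 8 trials per coupling condition.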
 
 (10) The paragraph from line 438 entitled "Secondary auditory regions are more sensitive to coordinative behaviour" includes an interesting discussion of the relation of the current findings to the phenomenon of speech-induced suppression (SIS). However, the authors appear to equate the observed decrease in high-frequency activity as speech coordination increases with the phenomenon of SIS (in lines 456-457), which is quite a speculative leap. I would encourage the authors to temper this discussion by referring to SIS as a potentially related phenomenon, with a need for more experimental work to determine if this is indeed the same phenomenon as the decreases in high-frequency power observed here. I believe that the authors are arguing here for an interpretation of SIS as reflecting internal modelling of sensory input regardless of whether this is self-generated or other-generated; if this is indeed the case, I would ask the authors to be more explicit here that these ideas are not a standard part of the traditional account of SIS, which only includes internal modelling of self-produced sensory feedback.
 
 As stated in the public review, we thank both reviewers for raising thoughtful concerns about our interpretation of the observed neural suppression as related to speaker-induced suppression (SIS). We agree that our study lacks a passive listening condition, which limits direct comparisons to the original SIS effect, traditionally defined as the suppression of neural responses to self-produced speech compared to externally-generated speech (Meekings & Scott, 2021).
 
 In response, we have reconsidered our terminology and interpretation. In the revised discussion, we refer to our findings as a "SIS-related phenomenon specific to the synchronous speech context." Unlike classic SIS paradigms, our interactive task involves simultaneous monitoring of self- and externally-generated speech, introducing additional attentional and coordinative demands.
 
 The revised discussion also incorporates findings by Ozker et al. (2024, 2022), which link SIS and speech monitoring, suggesting that suppressing responses to self-generated speech facilitates error detection. We propose that the decrease in high-frequency activity (HFa) as verbal coordination increases reflects reduced error signals due to closer alignment between perceived and produced speech. Conversely, HFa increases with reduced coordination may signify greater prediction error.
 
 Additionally, we relate our findings to the "rubber voice" effect (Zheng et al., 2011; Lind et al., 2014; Franken et al., 2021), where temporally and phonetically congruent external speech can be perceived as self-generated. We speculate that this may occur in synchronous speech tasks when the participant's and VP's speech signals closely align. However, this interpretation remains speculative, as no subjective reports were collected to confirm this perception. Future studies could include participant questionnaires to validate this effect and relate subjective experience to neural measures of synchronization.
 
 Overall, our findings extend the study of SIS to dynamic, interactive contexts and contribute to understanding internal forward models of speech production in more naturalistic scenarios.
 
 We have now added these points to the discussion as follows:
 
 “The observed negative correlation between verbal coordination and high-frequency activity (HFa) in STG BA22 suggests a suppression of neural responses as the degree of synchrony increases. This result aligns with findings on speaker-induced suppression (SIS), where neural activity in auditory cortex decreases during self-generated speech compared to externally-generated speech (Meekings & Scott, 2021; Niziolek et al., 2013). However, our paradigm differs from traditional SIS studies in two critical ways: (1) the speaker's own voice is always present and predictable from the forward model, and (2) no passive listening condition was included. Therefore, our findings cannot be directly equated with the original SIS effect.
 
 Instead, we propose that the suppression observed here reflects a SIS-related phenomenon specific to the synchronous speech context. Synchronous speech requires simultaneous monitoring of self- and externally generated speech, a task that is both attentionally demanding and coordinative. This aligns with evidence from Ozker et al. (2024, 2022), showing that the same neural populations in STG exhibit SIS and heightened responses to feedback perturbations. These findings suggest that SIS and speech monitoring are related processes, where suppressing responses to self-generated speech facilitates error detection.
 
 In our study, suppression of HFa as coordination increases may reflect reduced prediction errors due to closer alignment between perceived and produced speech signals. Conversely, increased HFa during poor coordination may signify greater mismatch, consistent with prediction error theories (Houde & Nagarajan, 2011; Friston et al., 2020).”
 
 (11) Within this section, you also speculate in line 460 that "Moreover, when the two speech signals come close enough in time, the patient possibly perceives them as its own voice." I would recommend citing studies on the 'rubber voice' effect to back up this claim (e.g. Franken et al., 2021; Lind et al., 2014; Zheng et al., 2011).
 
 We are grateful to the Reviewer for this interesting suggestion. Directly following the previous comment, the section now states:
 
 “Furthermore, when self- and externally-generated speech signals are temporally and phonetically congruent, participants may perceive external speech as their own. This echoes the "rubber voice" effect, where external speech resembling self-produced feedback is perceived as self-generated (Zheng et al., 2011; Lind et al., 2014; Franken et al., 2021). While this interpretation remains speculative, future studies could incorporate subjective reports to investigate this phenomenon in more detail.”
 
 (12) As noted in my public review, since your methods are correlational, you need to be careful about inferring the causal role of any brain areas in supporting a specific aspect of functioning e.g. line 501-504: "By contrast, in the inferior frontal gyrus, the coupling in the high-frequency activity is strongest with the input-output phase difference (input of the VP - output of the speaker), a metric that reflects the amount of error in the internal computation to reach optimal coordination, which indicates that this region optimises the predictive and coordinative behaviour required by the task." I would argue that the latter part of this sentence is a conclusion that, although consistent with, goes beyond the current data in this study, and thus needs tempering.
 
 We agree with the Reviewer and changed the sentence as follows:
 
 “By contrast, in the inferior frontal gyrus, the coupling in the high-frequency activity is strongest with the input-output phase difference (input of the VP - output of the speaker), a metric that could possibly reflect the amount of error in the internal computation to reach optimal coordination. This indicates that this region could be implicated in the optimisation of the predictive and coordinative behaviour required by the task.”
 
 AuthorResponse

Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.04.23.590817v2
www.biorxiv.org

Upstream open reading frames buffer translational variability during Drosophila evolution and development

3
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study reveals the important role of upstream open reading frames (uORFs) in limiting the translational variability of downstream coding sequences. Through a combination of computational simulations, comparative analyses of translation efficiency across different developmental stages in two closely related Drosophila species, and manipulative, experimental validation of translation buffering by an uORF for a gene, the authors provide convincing evidence supporting their conclusions. This work will be of broad interest to molecular biologists and geneticists.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors set out to explore the role of upstream open reading frames (uORFs) in stabilizing protein levels during Drosophila development and evolution. By utilizing a modified ICIER model for ribosome translation simulations and conducting experimental validations in Drosophila species, the study investigates how uORFs buffer translational variability of downstream coding sequences. The findings reveal that uORFs significantly reduce translational variability, which contributes to gene expression stability across different biological contexts and evolutionary timeframes.
 
 Strengths:
 
 (1) The study introduces a sophisticated adaptation of the ICIER model, enabling detailed simulation of ribosomal traffic and its implications for translation efficiency. (2) The integration of computational predictions with empirical data through knockout experiments and translatome analysis in Drosophila provides a compelling validation of the model's predictions. (3) By demonstrating the evolutionary conservation of uORFs' buffering effects, the study provides insights that are likely applicable to a wide range of eukaryotes.
 
 Weaknesses:
 
 (1) Although the study is technically sound, it does not clearly articulate the mechanisms through which uORFs buffer translational variability. A clearer hypothesis detailing the potential molecular interactions or regulatory pathways by which uORFs influence translational stability would enhance the comprehension and impact of the findings. (2) The study could be further improved by a discussion regarding the evolutionary selection of uORFs. Specifically, it would be beneficial to explore whether uORFs are favored evolutionarily primarily for their role in reducing translation efficiency or for their capability to stabilize translation variability. Such a discussion would provide deeper insights into the evolutionary dynamics and functional significance of uORFs in genetic regulation.
 
 Comments on revisions:
 
 The authors have adequately addressed my previous concerns.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 uORFs, short open reading frames located in the 5' UTR, are pervasive in genomes. However, their roles in maintaining protein abundance are not clear. In this study, the authors propose that uORFs act as a "molecular dam", limiting the fluctuation of the translation of downstream coding sequences. First, they performed in silico simulations using an improved ICIER model, and demonstrated that uORF translation reduces CDS translational variability, with buffering capacity increasing in proportion to uORF efficiency, length, and number. Next, they analysed the translatome between two related Drosophila species, revealing that genes with uORFs exhibit smaller fluctuations in translation between the two species and across different developmental stages within the same species. Moreover, they identified that bicoid, a critical gene for Drosophila development, contains a uORF with substantial changes in translation efficiency. Deleting this uORF in Drosophila melanogaster significantly affected its gene expression, hatching rates, and survival under stress conditions. Lastly, by leveraging public Ribo-seq data, the authors showed that the buffering effect of uORFs is also evident between primates and within human populations. Collectively, the study significantly advances our understanding of how uORFs regulate the translation of downstream coding sequences at the genome-wide scale, as well as during development and evolution. It would be particularly interesting to explore whether similar buffering functions are conserved in other organisms, and whether their regulatory effects could be harnessed for practical applications, such as improving crop traits or benefiting human health.
 
 Comments on revisions:
 
 The authors have fully addressed all of my concerns, and the revisions have substantially improved the manuscript. I have no further comments.
 
 Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.11.13.623404v3
www.biorxiv.org

Prediction tendency, eye movements, and attention in a unified framework of neural speech tracking

4
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 These are valuable findings for those interested in how neural signals reflect auditory speech streams, and in understanding the roles of prediction, attention, and eye movements in this tracking. However, the evidence as it stands is incomplete. Further analyses are needed to clarify how the observed results relate to the relevant theoretical claims.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 This study aimed to replicate two previous findings: (1) a link between prediction tendencies and neural speech tracking, and (2) that eye movements track speech. The main findings were replicated, which supports the robustness of these results. The authors also investigated interactions between prediction tendencies and ocular speech tracking, but the data did not reveal clear relationships. The authors propose a framework that integrates the findings of the study and describes how eye movements and prediction tendencies shape perception.
 
 Strengths:
 
 This is a well-written paper that addresses interesting research questions, bringing together two subfields that are usually studied separately: auditory speech and eye movements. The authors aimed to replicate findings from two of their previous studies; this was successful overall and speaks to the robustness of the findings. The overall approach is convincing, methods and analyses appear to be thorough, and results are compelling.
 
 Weaknesses:
 
 Eye movement behavior could have been presented in more detail, and the authors could have attempted to understand whether a particular component of eye movement behavior (e.g., blinks, microsaccades) drives the observed effects.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary
 
 Schubert et al. recorded MEG and eye tracking activity while participants were listening to stories in single-speaker or multi-speaker speech. In a separate task, MEG was recorded while the same participants were listening to four types of pure tones in either structured (75% predictable) or random (25%) sequences. The MEG data from this task were used to quantify individual 'prediction tendency': the amount by which the neural signal is modulated by whether or not a repeated tone was (un)predictable, given the context. In a replication of earlier work, this prediction tendency was found to correlate with 'neural speech tracking' during the main task. Neural speech tracking is quantified as the multivariate relationship between MEG activity and the speech amplitude envelope. Prediction tendency did not correlate with 'ocular speech tracking' during the main task. Neural speech tracking was further modulated by local semantic violations in the speech material and by whether or not a distracting speaker was present. The authors suggest that part of the neural speech tracking is mediated by ocular speech tracking. Story comprehension was negatively related to ocular speech tracking.
 
 Strengths
 
 This is an ambitious study, and the authors' attempt to integrate the many reported findings related to prediction and attention in one framework is laudable. The data acquisition and analyses appear to have been done with great attention to methodological detail. Furthermore, the experimental paradigm is more naturalistic than in previous, similar setups (i.e., stories instead of sentences).
 
 Weaknesses
 
 While the analysis pipeline is outlined in much detail, some analysis choices appear ad-hoc and could have been more uniform and/or better motivated (other than: this is what was done before).
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 I thank the authors for their extensive revision of this paper, and I found some elements greatly improved. In particular, the authors do embrace a somewhat more speculative tone in the current version, which I think is fitting for this work, as the data seem (to me) to be not fully conclusive. The data set collected here is clearly valuable and unique (and I would encourage the authors to make it publicly available!), however, my overall impression is that the specific analyses reported here might not fully
 
 Despite the revised description of methods, results and figures, I still have trouble understanding many of the results and the authors' conclusive interpretation of them. These are my main reservations:
 
 (1) Regarding "individual prediction tendency" - thank you for adding clarifying methodological details and showing the data in a new Figure (#2). Honestly, however, I still can't say that I fully understand the result. For example, why is there a significant response in the random condition as well? And how do you interpret the interesting time-course (with a peak ~200 ms prior to the stimulus, and a reduction over time from there)? Also (I may have missed this, but...) what neural data were used to train the classifier and derive the "prediction tendency" index? Was it just the broadband neural response? Is there a way to know which sensors contributed to this metric (e.g., are they predominantly auditory? Frontal?)? And is there a way to establish the statistical significance of this metric (e.g., how good the decoder actually was at predicting behavioral sensitivity)? I don't see any statistics in the results section describing the individual prediction tendency.
 
 (2) Regarding the TRF analysis - Thanks for clarifying the approach used to obtain 2-second long "segments" of speech tracking. This is an interesting approach, but I think it is quite new(?), and for me it raises a whole new set of questions and calls for additional controls and data that I would have liked to see in order to be convinced that the results are significant. I will elaborate:
 
 - Do I understand correctly that you segment the real and predicted neural response into 2-second long segments and then calculate Pearson's correlation between them to assess the goodness of the model? This is very unclear, since in the methods section you state only that "the same" analysis was performed as for the full data - but what exactly? Clearly, values will be very different when using such short segments. I feel that additional details are still required (and perhaps data shown) to fully understand the "semantic violation" analysis of TRFs.
 
 - I would like to reiterate my previous comment regarding the use of permutation tests to verify the validity of the TRF-based measures derived. This would be especially important when using new approaches (such as the segmentation used here). The authors argue that this is not needed since this was not done in their previously published study. However, this sounds a bit like a "two wrongs make a right" argument... why not just do it, and let us know that this 2-second segmentation approach allows estimating reliable speech tracking?
 
 - Following up on my previous comment about defining "clusters" as at least two neighboring channels (Figure 3) - the fact that this is a default in Fieldtrip is by no means sufficient justification! This seems quite liberal to me, especially given the many comparisons performed. Here too, permutations can help to determine the necessary data-driven threshold for corrections. This is of course critical for interpreting the results shown in Figures 3E&G, which are critical "take home messages" of the paper - i.e., that the prediction-index from the first part of the experiment is related to speech tracking in the second part of the experiment. To my eyes, this does not look extremely convincing, but perhaps the authors can show more conclusive data to support this (e.g., scatter plots of the betas across participants?).

 - A similar point can be made for the effect of semantic violations (though here the scalp-level result is somewhat more clustered). The authors point out that the semantic effect is a "replication" of their result reported in Schubert et al. 2023, but if I am not mistaken the results there were somewhat different (as was the manipulation). It would be nice to explicitly discuss the similarity/difference between these effects.
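
 To make the questions under point (2) concrete, the following is a minimal sketch of what a segment-wise goodness-of-fit measure with a permutation control could look like. It is not the authors' pipeline: the function names, the use of non-overlapping segments, and the choice of null (shuffling which predicted segment is paired with which observed segment) are all assumptions made for illustration.

```python
import random
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    dx = [a - mx for a in x]
    dy = [b - my for b in y]
    denom = (sum(a * a for a in dx) * sum(b * b for b in dy)) ** 0.5
    if denom == 0:
        return 0.0
    return sum(a * b for a, b in zip(dx, dy)) / denom


def split_segments(signal, sfreq, seg_sec=2.0):
    """Cut a signal into consecutive, non-overlapping segments of seg_sec seconds."""
    n = int(round(sfreq * seg_sec))
    return [signal[i:i + n] for i in range(0, len(signal) - n + 1, n)]


def segment_correlations(observed, predicted, sfreq, seg_sec=2.0):
    """Correlate observed and model-predicted responses within each segment."""
    obs = split_segments(observed, sfreq, seg_sec)
    pred = split_segments(predicted, sfreq, seg_sec)
    return [pearson_r(o, p) for o, p in zip(obs, pred)]


def permutation_p_value(observed, predicted, sfreq, seg_sec=2.0, n_perm=200, seed=0):
    """One-sided p-value for the mean segment correlation.

    Assumed null: predicted segments are randomly re-paired with observed segments,
    which breaks the temporal alignment while keeping each segment intact.
    """
    rng = random.Random(seed)
    obs = split_segments(observed, sfreq, seg_sec)
    pred = split_segments(predicted, sfreq, seg_sec)
    true_stat = mean(pearson_r(o, p) for o, p in zip(obs, pred))
    exceed = 0
    for _ in range(n_perm):
        shuffled = pred[:]
        rng.shuffle(shuffled)
        null_stat = mean(pearson_r(o, p) for o, p in zip(obs, shuffled))
        if null_stat >= true_stat:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)
```

 With short segments, each correlation rests on few samples, so individual values are noisy; comparing the observed statistic against a shuffled-pairing null is one way to show that the segment-wise measure is above chance.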
 
 (3) Regarding the ocular-TRFs -
 
 - Maybe this is just me, but I believe that effects that are robust should be clearly visible in the data, without the need for fancy "black-box" statistical models. In the case of the ocular TRFs, it is hard for me to see how these time-courses are not just noise (and, again, a permutation test would have helped to convince me...). The inconsistent results for horizontal and vertical eye-movements vis-à-vis the experimental conditions (single vs. multi-speaker conditions) don't help either, despite the authors' argument that these are "independent" - but why should this be the case, especially if there is nothing really to look at in this task?

 - I remain sceptical about the mediation portion of the analysis as well... But perhaps replications from other groups or making the data public will help shed further light on this in the future.
 
 Minor - Thanks for adding information about the creation of semantic-violation stimuli. Since the violations and lexical-controls were taken from different audio recordings, it would have been nice to verify that differences between neural responses cannot be attributed to differences in articulations (e.g., by comparing their spectro-temporal properties).
 
 Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.27.546746v4
www.biorxiv.org

Post-fertilization transcription initiation in an ancestral LTR retrotransposon drives lineage-specific genomic imprinting of ZDBF2

5
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 The authors' analyses describe a novel mechanism by which a retrotransposon-derived LTR may be involved in genomic imprinting and demonstrate imprinting of the ZDBF2 locus in rabbits and Rhesus macaques using allele-specific expression analysis. This imprinting of the ZDBF2 locus correlates with transcription of GPR1-AS orthologs. The accompanying genomic analysis is very well executed and supports the conclusions reached in the manuscript. The revisions made at the request of the reviewers in this important manuscript strengthen the evidence from the genomic analyses, and as a result, the evidence is now convincing and will be informative to the genomics and developmental biology communities.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The study tests the conservation of imprinting of the ZDBF2 locus across mammals. ZDBF2 is known to be imprinted in mouse, human and rat. The locus has a unique mechanism of imprinting: although imprinting is conferred by a germline DMR methylated in oocytes, the DMR is upstream of ZDBF2 (at GPR1) and monoallelic methylation of the gDMR does not persist beyond early developmental stages. Instead, a lncRNA (GPR1-AS, also known as Liz in mouse) initiating at the gDMR is expressed transiently in embryos and sets up a secondary DMR (by mechanisms not fully elucidated) that then confers monoallelic expression of ZDBF2 in somatic tissues.
 
 In this study, the authors first interrogate existing placental RNA-seq datasets from multiple mammalian species, and detect GPR1-AS1 candidate transcripts in human, baboon, macaque and mouse, but not in about a dozen other mammals. Because of the varying depth, quality and nature of these RNA-seq libraries, the ability to definitively detect the GPR1-AS1 lncRNA is not guaranteed; therefore, they generate their own deep, directional RNA-seq data from tissues/embryos from five species, finding evidence of GPR1-AS in rabbit and chimpanzee, but not in bovine, pig or opossum. From these surveys, the authors conclude that the lncRNA is present only in Euarchontoglires mammals. To test the association between GPR1-AS and ZDBF2 imprinting, they perform RT-PCR and sequencing in tissue from wallabies and cattle, finding biallelic expression of ZDBF2 in these species, which also lack a detected GPR1-AS transcript. From inspection of the genomic location of the GPR1-AS first exon, the authors identify an overlap with a solo LTR of the MER21C retrotransposon family in those species in which the lncRNA is observed, except for some rodents, including mouse. However, they do detect a degree of homology (46%) to the MER21C consensus at the first exon of Liz in mouse. Finally, the authors explore public RNA-seq datasets to show that GPR1-AS is expressed transiently during human preimplantation development, an expression dynamic that would be consistent with the induction of monoallelic methylation of a somatic DMR at ZDBF2 and consequent monoallelic expression.
 
 Strengths:
 
 The analysis uncovers a novel mechanism by which a retrotransposon-derived LTR may be involved in genomic imprinting. The genomic analysis is very well executed. The authors also provide new directional, deeply sequenced RNA-seq datasets from placenta or trophectoderm of five mammalian species and from marsupial embryos, which will be of value to the community.
 
 Weaknesses:
 
 Although the genomic analysis is very strong, the study remains entirely correlative. All of the data are descriptive, and much of the analysis is performed on RNA-seq and other datasets from the public domain; a small amount of primary data is generated by the authors. Evidence that the residual LTR in mouse is functionally relevant for Liz lncRNA expression is lacking.
 
 Comments on revision:
 
 The authors have responded very constructively to all points raised by me and the other reviewers. For example, the authors have gone to further, extensive efforts in seeking to identify an LTR at the mouse Liz locus - which is not found - but additional multiple genome alignments provide evidence for sequence conservation consistent with retention of a functional relic of the MER21C in rodent genomes. Moreover, they demonstrate the promoter activity of this mouse sequence region in transfections. They have also demonstrated imprinted expression of ZDBF2 in two additional species - rabbit and rhesus macaque - consistent with their model.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 This work concerns the evolution of ZDBF2 imprinting in mammalian species via initiation of GPR1 antisense (AS) transcription from a lineage-specific long-terminal repeat (LTR) retrotransposon. It extends previous work describing the mechanism of ZDBF2 imprinting in mice and humans by demonstrating conservation of GPR1-AS transcripts in rabbits and non-human primates. By identifying the origin of GPR1-AS transcription as the LTR MER21C, the authors claim to account for how imprinting evolved in these species but not in those lacking the MER21C insertion. This illustrates the principle of LTR co-option as a means of evolving new gene regulatory mechanisms, specifically to achieve parent-of-origin allele specific expression (imprinting). Examples of this phenomenon have been described previously, but usually involve initiation of transcription during gametogenesis rather than post-fertilization, as in this work. The findings of this paper are therefore relevant to biologists studying imprinted genes or interested more generally in the evolution of gene regulatory mechanisms.
 
 Strengths:
 
 (1) The authors convincingly demonstrate the existence of GPR1-AS orthologs in specific mammalian lineages using high quality RNA-seq libraries collected from diverse mammalian species.
 
 (2) The authors demonstrate imprinting of the ZDBF2 locus in rabbits and Rhesus macaques using allele-specific expression analysis. The transcription of GPR1-AS orthologs therefore correlates with imprinting of the ZDBF2 locus.
 
 Weaknesses:
 
 (1) Experimental evidence directly linking GPR1-AS transcription to ZDBF2 imprinting in rabbits and non-human primates is lacking. Consideration should be given to the challenges associated with studying non-model species and manipulating repeat sequences. Further, this mechanism is established in humans and mice, so the authors' model is arguably sufficiently supported merely by the existence of GPR1-AS orthologs in other mammalian lineages.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Kobayashi et al. identify MER21C as a common promoter of GPR1-AS/Liz in Euarchontoglires, which establishes a somatic DMR that controls ZDBF2 imprinting. In mice, MER21C appears to have diverged significantly from its primate counterparts and is no longer annotated as such.
 
 The authors used high-quality cross-species RNA-seq data to characterise GPR1-AS-like transcripts, which included generating new data in five different species. The association between MER21C/B elements and the promoter of GPR1-AS in most species is clear and convincing. The expression pattern of MER21C/B elements overall further supports their role in enabling correct temporal expression of GPR1-AS during embryonic development.
 
 In the revised version of the manuscript the authors provided additional support for the common evolutionary origin of the GPR1-AS/Liz promoter between primates and rodents. They also showed a more extensive concordance between the presence of GPR1-AS-like transcripts and ZDBF2 imprinting.
 
 Altogether, these findings robustly support the conclusions of the paper, shedding light on the events underlying the evolution of imprinting at the ZDBF2 locus.
 
 Review 3
5. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Recommendations For The Authors:
 
 Reviewer #1 (Recommendations For The Authors):
 
 Recommendations Analysis:
 
 (1) Given that a MER21B/C LTR was not immediately identified at the start site of the Liz lncRNA in the mouse, and its match is only 46%, this raises the question of whether an analogous LTR would be identified at the homologous location in other species on deeper analysis. The authors need to argue that what has been conserved in the LTR alone in mouse is the essential element conferring the ability to initiate transcription of Liz. A transient reporter assay might be sufficient to do this.
 
 We believe that the 46% identity between the first exon of mouse Liz and the consensus sequence of MER21C is so low that any trace of MER21C is too degraded to be detected by standard in silico analyses, such as homology searches. For instance, when pairwise alignments are performed between the first exon of mouse Liz and the consensus sequences of solo-LTRs other than MER21C, MER21C does not emerge as the most similar sequence (Figure 5 – figure supplement 1). This is in stark contrast to similar analyses involving the first exon of human and rabbit GPR1-AS (which overlaps with MER21C), where MER21C is identified as the most similar sequence. [pages: 26, 31-32]
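
 As an illustration of the kind of comparison described here, the sketch below computes percent identity from a simple global (Needleman-Wunsch) alignment and ranks candidate consensus sequences by that value. The scoring scheme, the definition of identity (matches divided by alignment length, gaps included), and the toy sequences are assumptions for illustration only; the alignment tool and parameters actually used in the manuscript are not stated in this response.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of alignment length (gaps count)."""
    al_a, al_b = global_align(a, b)
    if not al_a:
        return 0.0
    matches = sum(x == y and x != "-" for x, y in zip(al_a, al_b))
    return 100.0 * matches / len(al_a)


def most_similar(query, consensus_by_name):
    """Name of the consensus sequence with the highest percent identity to the query."""
    return max(consensus_by_name, key=lambda name: percent_identity(query, consensus_by_name[name]))
```

 Because percent identity depends on the scoring scheme and on how gaps are counted, a value such as 46% is only comparable across sequences when the same settings are used throughout.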
 
 The positions of these LTRs were initially annotated using RepeatMasker. To ensure robust analysis, we performed additional searches with RepeatMasker under more sensitive conditions, adjusting search engines (e.g., RMblast to HMMER or Cross-match) and sensitivity settings. Nevertheless, MER21C or closely related LTRs were still undetectable in mouse, rat, and hamster (Figure 4 – figure supplement 1). However, a multiple genome alignment generated by Cactus/UCSC revealed a syntenic region corresponding to the first exon of human GPR1-AS, overlapping with LTR21C, in the genomes of mice, as well as rats and hamsters (Figure 4 – figure supplement 2). Although RepeatMasker did not annotate MER21C at the GPR1 locus in these species, homologous regions were observed across all selected Euarchontoglires. Due to the limitations of the Cactus alignment track in delineating precise homologous boundaries across species, extracting sequences for evolutionary tree construction was not feasible. Nevertheless, these findings support the hypothesis that the first exon of GPR1-AS (Liz in mice) originated from a MER21C insertion in the common ancestor of Euarchontoglires. [pages: 21, 24-25]
 
 A combination of traditional annotation of repetitive elements using RepeatMasker and the reconstruction of ancestral genomes through multiple genome alignment can reveal highly degenerated LTR relics. This approach is likely to point to significant future directions for research. This point is further elaborated in the discussion section. [page 42]
 
 Furthermore, in response to the reviewer's suggestion, we investigated the promoter activity of the GPR1-AS and Liz first exons, which are hypothesized to have originated from the same MER21C insertion. Using a dual reporter assay, we demonstrated that the first exon of mouse Liz exhibits promoter activity in a human cell line comparable to that of the human GPR1-AS promoter. Thus, despite the relatively low sequence similarity between the Liz first exon and the MER21C consensus sequence (46% as determined by pairwise alignment, Figure 5 – figure supplement 2), the promoter activity remains functionally conserved. We further discuss the potential functional motifs within the putative MER21C LTR-derived sequences in Figure 4B-D. Taken together, these findings suggest that despite a high level of degeneracy of the promoter region in rodents, including mice, the most parsimonious explanation for the origin of this regulatory element in rodents is the presence of the same LTR relic detectable in humans/primates, which is essential for robust transcription initiation of Liz and GPR1-AS, respectively. [pages: 27, 32]
 
 (2) Imprinting will depend on an initiating mechanism in the germline, in addition to events in the embryo that induce the secondary DMR at ZDBF2. The authors should therefore examine as far as possible the presence of a gDMR in the species with/without GPR1-AS1 and ZDBF2 imprinting. Whole-genome bisulphite sequencing data from oocytes and sperm should exist for some of the relevant species (e.g., pig, cow: Ivanova et al. 2020 PMID: 32393379; Lu et al. 2012 PMID: 34818044).
 
 As the reviewer noted, the presence of a gDMR is essential for the establishment of imprinting. Following another reviewer's suggestion, we have now demonstrated that the ZDBF2 gene in rhesus monkeys is also subject to imprinting (see Figure 3C-D). We also acquired whole genome bisulfite sequencing data for rhesus monkey sperm and oocytes, identified DMRs between them, and discovered an oocyte-specifically methylated gDMR in the first exon of GPR1-AS (which overlaps with MER21C) (Figure 3 – figure supplement 1A). This finding is consistent with observations in humans and mice. Conversely, we obtained similar sequencing data for porcine and bovine sperm and oocytes and conducted the same analysis (Figure 3 – figure supplement 1A,B). However, we did not detect any oocyte-specific methylated gDMRs in the GPR1 intragenic region (where GPR1-AS is transcribed from an intron of GPR1) in these species of the Laurasiatheria superorder. These results support the hypothesis that ZDBF2 is not imprinted in lineages outside the Euarchontoglires, the superorder which includes both rodents and primates. We have included these important DMR results as a supplement to Figure 3. [pages 16-21]
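
 For readers unfamiliar with how an oocyte-specific gDMR might be called from bisulfite data, here is a deliberately simplified sketch. It assumes per-CpG methylation fractions are already available for both gametes and flags runs of consecutive CpGs where oocyte methylation exceeds sperm methylation by a fixed margin. The thresholds and the function name are hypothetical, and real DMR callers additionally model coverage and statistical uncertainty.

```python
def call_oocyte_dmrs(positions, oocyte, sperm, min_delta=0.5, min_cpgs=3):
    """Return (start, end) for runs of >= min_cpgs consecutive CpGs
    where (oocyte methylation - sperm methylation) >= min_delta.

    positions: CpG coordinates in ascending order
    oocyte, sperm: methylation fractions (0..1) at those CpGs
    """
    dmrs, run = [], []
    for pos, o, s in zip(positions, oocyte, sperm):
        if o - s >= min_delta:
            run.append(pos)
            continue
        # The run is broken: keep it only if it is long enough.
        if len(run) >= min_cpgs:
            dmrs.append((run[0], run[-1]))
        run = []
    if len(run) >= min_cpgs:
        dmrs.append((run[0], run[-1]))
    return dmrs
```

 Under this scheme, a locus without an oocyte-specific gDMR is simply one for which the function returns no interval overlapping the region of interest.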
 
 Presentation:
 
 (1) The first section of the Introduction would benefit from the inclusion of some additional general references on genomic imprinting.
 
 We have added two review articles, Tucci et al. (2019) and Kobayashi (2021), as references in the first section of the Introduction. [page 5]
 
 (2) Introduction statement: "....nearly 200 imprinted genes have been identified in mice and humans. However, less than half of these genes overlapped in both species." This was the conclusion of one study (Tucci et al. 2016), so it would be better to provide a caveat to the statement "However, one comparative analysis suggested that fewer than half of these genes overlapped in both species".
 
 The point being that the actual number of imprinted genes is still a matter of debate (see Edwards et al. 2023 PMID: 36916665), and the extent of overlap will depend on the strength of the evidence for each gene in the human and mouse imprinted gene lists. So, it is very difficult to put an accurate figure on the extent of overlap - but the authors' point is valid that there are species- or lineage-specific imprinted genes.
 
 We have revised this sentence following reviewer #1's suggestion. [page 5]
 
 (3) Introduction statement: "The establishment of species-specific imprinting.....can be driven by various evolutionary events, including.....differences in the function of DNA methyltransferases". I am not aware that this has been described as an evolutionary event causing species-specific imprinting - without supporting evidence, I recommend to remove this suggestion.
 
 We thank the reviewer for this comment and realize that we should have been more explicit here. We were referring to DNMT3C, a rodent-specific member of the DNMT3 family, which is responsible for the paternal methylation imprinting of Rasgrf1 in mice (Barau et al., Science, 2016), in association with the piRNA pathway and targeting of a specific retrotransposon within the DMR (Watanabe et al. Science, 2011). The Rasgrf1 gene is imprinted in mice but not considered imprinted in humans (though some conflicting data exist). While it is likely that the emergence of DNMT3C was a pre-requisite to the establishment of Rasgrf1 imprinting in evolutionary terms, clear evidence is lacking. Following the reviewer’s suggestion, we have removed the phrase "differences in the function of DNA methyltransferases" from the text. However, we have reintroduced this point in the Introduction section as a potential mechanism that may contribute to the establishment of species-specific imprinted genes, alongside the roles of ZNF445 and ZFP57, which regulate the maintenance of imprinting with partially divided roles between humans and mice. [page 6]
 
 (4) It would be very useful for readers to have a schema of the Gpr1/Zdbf2 locus that indicates the locations of the germline and somatic DMRs and their relationship to the Liz transcript.
 
 (5) There is a summary figure amongst the Supplementary Figures (Suppl. Fig. 7) - it would be beneficial to readers to have this summary figure in the main text rather than the supplement.
 
 Following reviewer #1’s suggestion, we have moved the regulatory system schema at the Gpr1/Zdbf2 locus, originally shown in Supplementary Figure 7, to the main text as Figure 7. In addition, in response to comment 4, we have revised the figure to explicitly depict the relationship between the Liz transcript and the establishment of the somatic DMR (sDMR), enhancing the clarity of the regulatory interactions at this locus. [page 38]
 
 (6) With a focus of the study on LTRs as cis-regulatory elements having been co-opted in genomic imprinting mechanisms - whether in the female germline (as in Bogutz et al. 2019) or in the current study as an activating element post-fertilisation - it is a real omission that the authors do not refer to the role of tissue-specific LTRs as the candidate regulatory elements in non-canonical imprinting (see Hanna et al. 2019 PMID: 31665063). Please include in Introduction and/or Discussion.
 
 We added a sentence explaining canonical and non-canonical imprinting and the cases where LTRs act as regulatory elements in non-canonical imprinting, with reference to the study of Hanna et al., as suggested. [page 6]
 
 (7) Discussion statement: "Two paternally expressed imprinted genes, PEG10/SIRH1 and PEG11/RTL1/SIRH2 have been identified in mammals. They encode GAG-POL proteins of sushi-ichi LTR retrotransposons and are essential for mammalian placenta formation and maintenance."
 
 These sentences should be combined: "Two paternally expressed imprinted genes, PEG10/SIRH1, and PEG11/RTL1/SIRH2, that encode GAG-POL proteins of sushi-ichi LTR retrotransposons have been identified in mammals and are essential for mammalian placenta formation and maintenance."
 
 We have revised this sentence according to reviewer #1's suggestion. [page 41]
 
 Reviewer #2 (Recommendations For The Authors):
 
 When showing assembled GPR1-AS transcripts via genome browser tracks, it would be valuable to add normalized counts of reads mapping to each strand, in order to more convincingly demonstrate the existence of these transcripts. I ask for this because in my experience Stringtie will assemble transcripts that are only marginally supported by reads.
 
 In response to Reviewer #2's suggestion, FPKM and TPM values for all StringTie-predicted GPR1-AS-like transcripts are now included in Figure 6. Each of these transcripts has a TPM value greater than 1, supporting their validity. [page 35]
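
 For reference, the two abundance measures mentioned here can be computed from read (or fragment) counts and transcript lengths as in the sketch below. This uses the standard textbook definitions with plain annotated lengths; StringTie derives its own estimates from coverage, so the numbers it reports may differ, and the cutoff of 1 is the threshold quoted in the response rather than a general rule.

```python
def fpkm(counts, lengths_bp):
    """Fragments per kilobase of transcript per million mapped fragments."""
    total = sum(counts)
    return [c * 1e9 / (total * length) for c, length in zip(counts, lengths_bp)]


def tpm(counts, lengths_bp):
    """Transcripts per million: length-normalised rates rescaled to sum to 1e6."""
    rates = [c / length for c, length in zip(counts, lengths_bp)]
    total = sum(rates)
    return [r * 1e6 / total for r in rates]


def passes_tpm_cutoff(counts, lengths_bp, cutoff=1.0):
    """Flag transcripts whose TPM exceeds the cutoff."""
    return [value > cutoff for value in tpm(counts, lengths_bp)]
```

 TPM values always sum to one million within a sample, which makes them easier to compare across libraries of different depth than FPKM.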
 
 Reviewer #3 (Recommendations For The Authors):
 
 (1) The tree in Figure 5A is one of the main arguments supporting the divergence of the mouse Liz promoter from a common MER21C element, but this contains only a handful of species, making it difficult to appreciate the full extent of its evolution. Presumably its faster mutation rate in mouse would also be supported by other closely related rodents, which would solidify the conclusion that the Liz promoter is derived from an ancient MER21C insertion. So my suggestion is to expand this tree substantially to other species, comparing sequences syntenic to the GPR1-AS/Liz promoter.
 
 (2) It may also be worth trying different TE/LTR annotation tools and/or running RepeatMasker with different parameters, to see if an MER21C element is detected in mouse using a more sensitive approach.
 
 In response to this suggestion, we performed computational analyses with RepeatMasker under various settings (e.g., switching search engines from RMblast to HMMER or Crossmatch, adjusting speed/sensitivity settings from default to slow). Despite these modifications, a MER21C element was not detected near the mouse Liz promoter. However, a multiple genome alignment track generated by Cactus/UCSC revealed a syntenic region, corresponding to the first exon of human GPR1-AS, which overlaps with LTR21C, also present in the genomes of mouse, rat, and hamster (Figure 4 – figure supplement 1). While RepeatMasker did not identify MER21C at the GPR1 locus in these species, homologous regions were observed across all selected Euarchontoglires. Although the Cactus alignment track does not delineate the exact boundaries of homologous regions across species (relative to humans) and thus precludes extracting each homologous sequence to construct an evolutionary tree, these findings support the hypothesis that the first exon of GPR1-AS (referred to as Liz in mice) originated from an ancient MER21C insertion in the common ancestor of Euarchontoglires. [pages: 21, 24-25]
 
 (3) According to Dfam, MER21C is not common to all eutherians, but specific to Boroeutheria, whilst MER21B is presumably specific to Euarchontoglires. To clarify MER21C/B evolution, it would be useful to show the number of elements present in select species from each group (including an outgroup).
 
 (7) In Figure 4 it is hard to distinguish between red and purple.
 
 Initially, we referenced Repbase (e.g., MER21C: Origin/Eutheria), but, as Reviewer #3 noted, Dfam should be the primary reference. We have now included the copy numbers of MER21C and MER21B for each genome in Figure 4, providing a clearer understanding of their evolutionary appearance (MER21C appears specific to Boroeutheria, while MER21B is specific to Euarchontoglires). Additionally, we adjusted the MER21B position color from purple to dark purple to improve visibility. Furthermore, we have also underlined the copy number of MER21C or MER21B located within the GPR1 region in each species. For example, in the Treeshrew genome, the LTR overlapping with GPR1-AS is annotated as MER21B, so we underlined the copy number of MER21B (2,305). These changes now clearly indicate whether homologous sequences to the first exon of GPR1-AS are annotated as MER21C or MER21B in each genome. [page 22]
 
 (4) Could the imprinting status of ZDBF2 not be determined in chimpanzees and rabbits? Or is it already known? Either way, a clarification would be useful to further support the concordance between GPR1-AS-like transcripts and ZDBF2 imprinting.
 
 The imprinting status of ZDBF2 had not previously been reported in chimpanzees, rhesus macaques, or rabbits, where GPR1-AS-like transcripts were identified. Therefore, we conducted allele-specific expression analysis of ZDBF2 using blood samples from rhesus macaques and rabbits. As expected, paternal-allele-specific expression of ZDBF2 was observed in both species, consistent with findings in humans and mice. These results have been added to Figure 3. Although we did not analyze the imprinting status in chimpanzees, we believe the existing data sufficiently support our hypothesis. [pages: 16, 19-20]
 
 (5) The authors briefly discuss the role of KRAB-ZFPs in controlling TE expression. An interesting addition would be to analyse the expression of the main KRAB-ZFP that binds to MER21C (ZFP789, according to data from PMID 28273063). This could be linked to the temporal control of MER21C expression.
 
 In response to Reviewer #3's suggestion, we focused on the expression pattern of ZNF789 (noted by the reviewer as ZFP789), the primary KRAB-ZFP known to bind MER21C, as identified by Didier Trono’s group (PMID 28273063). Strikingly, our analysis reveals that ZNF789 is specifically downregulated at the 4-cell stage, which aligns with the timing of MER21C reactivation. While it remains to be determined whether this downregulation directly influences MER21C reactivation or the initiation of GPR1-AS expression, this finding is significant and consistent with our model. We have incorporated this information in Figure 5 – figure supplement 3. [pages: 33]
 
 (6) The sentence "Liz directs DNA methylation at the somatic DMR, which competes with ZDBF2 to repress the paternal allele" (introduction) was confusing to me.
 
 This sentence has been revised to be more accurate, as follows: Liz transcription counteracts the H3K27me3-mediated repression of Zdbf2 by promoting the deposition of antagonistic DNA methylation at the secondary DMR. [page 7]
 
 (8) In Figure 5 I take it that 'consensus motif' refers to ELF1/2? Maybe change the legend.
 
 To clarify potential confusion around the term 'consensus motif,' which may have been mistaken for 'consensus MER21C' (the consensus sequence of MER21C-LTR from the Dfam database), we have revised the figure legend. We now refer to the motif as the "common motif," indicating the sequence common to all MER21C-derived sequences and overlapping with the first exon of GPR1-AS. [page 29]
 
 AuthorResponse

Tags

Summary

Review 1

Review 3

AuthorResponse

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.10.30.564869v3
www.biorxiv.org www.biorxiv.org

Expansion-assisted selective plane illumination microscopy for nanoscale imaging of centimeter-scale tissues

4
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 The ExA-SPIM methodology developed here and characterized and supported by convincing evidence is an important development for the field of light sheet microscopy as the new technology provides an impressive field of view making it possible to image the entire expanded mouse brain at cellular and subcellular resolution.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public Review):
 
 Summary:
 
 Glaser et al present ExA-SPIM, a light-sheet microscope platform with large volumetric coverage (field of view 85 mm^2, working distance 35 mm), designed to image expanded mouse brains in their entirety. The authors also present an expansion method optimized for whole mouse brains, and an acquisition software suite. The microscope is employed in imaging an expanded mouse brain, the macaque motor cortex and human brain slices of white matter.
 
 This is impressive work, and represents a leap over existing light-sheet microscopes. As an example, it offers a ~fivefold higher resolution than mesoSPIM (https://mesospim.org/), a popular platform for imaging large cleared samples. Thus while this work is rooted in optical engineering, it manifests a huge step forward and has the potential to become an important tool in the neurosciences.
 
 Strengths:
 
 -ExA-SPIM features an exceptional combination of field of view, working distance, resolution and throughput.
 
 -An expanded mouse brain can be acquired with only 15 tiles, lowering the burden on computational stitching. That the brain does not need to be mechanically sectioned is also seen as an important capability.
 
 -The image data is compelling, and tracing of neurons has been performed. This demonstrates the potential of the microscope platform.
 
 Review of the revised manuscript:
 
 The authors have carefully addressed my previous concerns and suggestions.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public Review):
 
 In this manuscript, Glaser et al. describe a new selective plane illumination microscope designed to image a large field of view that is optimized for expanded and cleared tissue samples. For the most part, the microscope design follows a standard formula that is common among many systems (e.g. Keller PJ et al Science 2008, Pitrone PG et al. Nature Methods 2013, Dean KM et al. Biophys J 2015, and Voigt FF et al. Nature Methods 2019). The primary conceptual and technical novelty is to use a detection objective from the metrology industry that has a large field of view and a large area camera. The authors characterize the system resolution, field curvature, and chromatic focal shift by measuring fluorescent beads in a hydrogel and then show example images of expanded samples from mouse, macaque, and human brain tissue.
 
 Glaser et al. have responded to the reviewer comments by removing some of the overstated claims from the prior manuscript and editing portions of the manuscript text to enhance the clarity. Although the manuscript would be stronger if the authors had been able to provide data that justified the original high-impact claims from the initial publication (e.g. that the images could be used for robust and automated neuronal tracing across large volumes), the amended manuscript text now more closely matches the supporting data. As with the initial submission, I believe that the microscope design and characterization is a useful contribution to the field and the data are quite stunning.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the previous reviews
 
 Public Reviews:
 
 Reviewer #1 (Public Review):
 
 Summary:
 
 Glaser et al present ExA-SPIM, a light-sheet microscope platform with large volumetric coverage (Field of view 85mm^2, working distance 35mm), designed to image expanded mouse brains in their entirety. The authors also present an expansion method optimized for whole mouse brains and an acquisition software suite. The microscope is employed in imaging an expanded mouse brain, the macaque motor cortex, and human brain slices of white matter.
 
 This is impressive work and represents a leap over existing light-sheet microscopes. As an example, it offers a fivefold higher resolution than mesoSPIM (https://mesospim.org/), a popular platform for imaging large cleared samples. Thus while this work is rooted in optical engineering, it manifests a huge step forward and has the potential to become an important tool in the neurosciences.
 
 Strengths:
 
 - ExA-SPIM features an exceptional combination of field of view, working distance, resolution, and throughput.
 
 - An expanded mouse brain can be acquired with only 15 tiles, lowering the burden on computational stitching. That the brain does not need to be mechanically sectioned is also seen as an important capability.
 
 - The image data is compelling, and tracing of neurons has been performed. This demonstrates the potential of the microscope platform.
 
 Weaknesses:
 
 - There is a general question about the scaling laws of lenses, and expansion microscopy, which in my opinion remained unanswered: In the context of whole brain imaging, a larger expansion factor requires a microscope system with larger volumetric coverage, which in turn will have lower resolution (Figure 1B). So what is optimal? Could one alternatively image a cleared (non-expanded) brain with a high-resolution ASLM system (Chakraborty, Tonmoy, Nature Methods 2019, potentially upgraded with custom objectives) and get a similar effective resolution as the authors get with expansion? This is not meant to diminish the achievement, but it was unclear if the gains in resolution from the expansion factor are traded off by the scaling laws of current optical systems.
 
 Paraphrasing the reviewer: Expanding the tissue requires imaging larger volumes and allows lower optical resolution. What has been gained?
 
 The answer to the reviewer’s question is nuanced and contains four parts.
 
 First, optical engineering requirements are more forgiving for lenses with lower resolution. Lower resolution lenses can have much larger fields of view (in real terms: the number of resolvable elements, proportional to ‘etendue’) and much longer working distances. In other words, it is currently more feasible to engineer lower resolution lenses with larger volumetric coverage, even when accounting for the expansion factor.
 
 Second, these lenses are also much better corrected compared to higher resolution (NA) lenses. They have a flat field of view, negligible pincushion distortions, and constant resolution across the field of view. We are not aware of comparable performance for high NA objectives, even when correcting for expansion.
 
 Third, although clearing and expansion render tissues ‘transparent’, there still exist refractive index inhomogeneities which deteriorate image quality, especially at larger imaging depths. These effects are more severe for higher optical resolutions (NA), because the rays entering the objective at higher angles have longer paths in the tissue and will see more aberrations. For lower NA systems, such as ExA-SPIM, the differences in paths between the extreme and axial rays are relatively small and image formation is less sensitive to aberrations.
 
 Fourth, aberrations are proportional to the index of refraction inhomogeneities (dn/dx). Since the index of refraction is roughly proportional to density, scattering and aberration of light decrease as 1/M^3, where M is the expansion factor. In contrast, the imaging path length through the tissue increases only as M. This produces a huge win for imaging larger samples with lower resolutions.
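 
The scaling argument in the fourth point can be sketched numerically. This is only a heuristic reading, not the authors' calculation: it assumes that accumulated aberration is simply the product of inhomogeneity strength (falling as 1/M^3) and path length (growing as M), and the function name and expansion factors below are illustrative.

```python
def relative_aberration(expansion_factor: float) -> float:
    """Accumulated aberration relative to unexpanded tissue (heuristic).

    Assumption: inhomogeneity strength scales as 1/M**3, path length
    scales as M, and the two contributions simply multiply.
    """
    m = expansion_factor
    inhomogeneity = m ** -3  # per unit path length, relative to M = 1
    path_length = m          # relative to M = 1
    return inhomogeneity * path_length  # equals 1 / M**2


# Illustrative expansion factors, not values taken from the manuscript.
for m in (1, 2, 3, 4):
    print(f"M = {m}: relative aberration ~ {relative_aberration(m):.3f}")
```

Under these assumptions the net effect falls as 1/M^2, which is one way to read the "huge win" described above.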
 
 To our knowledge there are no convincing demonstrations in the literature of diffraction-limited ASLM imaging at a depth of 1 cm in cleared mouse brain tissue, which would be equivalent to the ExA-SPIM imaging results presented in this manuscript.
 
 In the discussion of the revised manuscript we discuss these factors in more depth.
 
 - It was unclear if 300 nm lateral and 800 nm axial resolution is enough for many questions in neuroscience. Segmenting spines, distinguishing pre- and postsynaptic densities, or tracing densely labeled neurons might be challenging. A discussion about the necessary resolution levels in neuroscience would be appreciated.
 
 We have previously shown good results in tracing the thinnest (100 nm thick) axons over cm scales with 1.5 um axial resolution. It is the contrast (SNR) that matters, and the ExA-SPIM contrast exceeds the block-face 2-photon contrast, not to mention imaging speed (> 10x).
 
 Indeed, for some questions, like distinguishing fluorescence in pre- and postsynaptic structures, higher resolutions will be required (0.2 um isotropic; Rah et al Frontiers Neurosci, 2013). This could be achieved with higher expansion factors.
 
 This is not within the intended scope of the current manuscript. As mentioned in the discussion section, we are working towards ExA-SPIM-based concepts to achieve better resolution through the design and fabrication of a customized imaging lens that maintains a high volumetric coverage with increased numerical aperture.
 
 - Would it be possible to characterize the aberrations that might be still present after whole brain expansion? One approach could be to image small fluorescent nanospheres behind the expanded brain and recover the pupil function via phase retrieval. But even full width half maximum (FWHM) measurements of the nanospheres' images would give some idea of the magnitude of the aberrations.
 
 We have now included a supplementary figure highlighting images of small axon segments within distal regions of the brain.
 
 Reviewer #2 (Public Review):
 
 Summary:
 
 In this manuscript, Glaser et al. describe a new selective plane illumination microscope designed to image a large field of view that is optimized for expanded and cleared tissue samples. For the most part, the microscope design follows a standard formula that is common among many systems (e.g. Keller PJ et al Science 2008, Pitrone PG et al. Nature Methods 2013, Dean KM et al. Biophys J 2015, and Voigt FF et al. Nature Methods 2019). The primary conceptual and technical novelty is to use a detection objective from the metrology industry that has a large field of view and a large area camera. The authors characterize the system resolution, field curvature, and chromatic focal shift by measuring fluorescent beads in a hydrogel and then show example images of expanded samples from mouse, macaque, and human brain tissue.
 
 Strengths:
 
 I commend the authors for making all of the documentation, models, and acquisition software openly accessible and believe that this will help assist others who would like to replicate the instrument. I anticipate that the protocols for imaging large expanded tissues (such as an entire mouse brain) will also be useful to the community.
 
 Weaknesses:
 
 The characterization of the instrument needs to be improved to validate the claims. If the manuscript claims that the instrument allows for robust automated neuronal tracing, then this should be included in the data.
 
 The reviewer raises a valid concern. Our assertion that the resolution and contrast are sufficient for robust automated neuronal tracing is overstated based on the data in the paper. We are hard at work on automated tracing of datasets from the ExA-SPIM microscope. We have demonstrated full reconstruction of axonal arbors encompassing >20 cm of axonal length. But including these methods and results is out of the scope of the current manuscript.
 
 The claims of robust automated neuronal tracing have been appropriately modified.
 
 Recommendations for the authors:
 
 Reviewer #1 (Recommendations For The Authors):
 
 Smaller questions to the authors:
 
 - Would a multi-directional illumination and detection architecture help? Was there a particular reason the authors did not go that route?
 
 Despite the clarity of the expanded tissue, and the lower numerical aperture of the ExA-SPIM microscope, image quality still degrades slightly towards the distal regions of the brain relative to both the excitation and detection objective. Therefore, multi-directional illumination and detection would be advantageous. Since the initial submission of the manuscript, we have undertaken re-designing the optics and mechanics of the system. This includes provisions for multi-directional illumination and detection. However, this new design is beyond the scope of this manuscript. We now mention this in L254-255 of the Discussion section.
 
 - Why did the authors not use the same objective for illumination and detection, which would allow isotropic resolution in ASLM?
 
 The current implementation of ASLM requires an infinity corrected objective (i.e. conjugating the axial sweeping mechanism to the back focal plane). This is not possible due to the finite conjugate design of the ExA-SPIM detection lens.
 
 More fundamentally, pushing the excitation NA higher would result in a shorter light sheet Rayleigh length, which would require a smaller detection slit (shorter exposure time, lower signal to noise ratio). For our purposes an excitation NA of 0.1 is an excellent compromise between axial resolution, signal to noise ratio, and imaging speed.
 
 For other potentially brighter biological structures, it may be possible to design a custom infinity corrected objective that enables ASLM with NA > 0.1.
 
 - Have the authors made any attempt to characterize distortions of the brain tissue that can occur due to expansion?
 
 We have not systematically characterized the distortions of the brain tissue pre and post expansion. Imaged mouse brain volumes are registered to the Allen CCF regardless of whether or not the tissue was expanded. It is beyond the scope of this manuscript to include these results and processing methods, but we have confirmed that the ExA-SPIM mouse brain volumes contain only modest deformation that is easily accounted for during registration to the Allen CCF.
 
 - The authors state that a custom lens with NA 0.5-0.6 can be designed, featuring similar specifications. Is there a practical design? Wouldn't such a lens be more prone to field curvature?
 
 This custom lens has already been designed and is currently being fabricated. The lens maintains a similar space bandwidth product as the current lens (increased numerical aperture but over a proportionally smaller field of view). Over the designed field of view, field curvature is <1 µm. However, including additional discussion or results of this customized lens is beyond the scope of this manuscript.
 
 Reviewer #2 (Recommendations For The Authors):
 
 System characterization:
 
 - Please state what wavelength was used for the resolution measurements in Figure 2.
 
 An excitation wavelength of 561 nm was used. This has been added to the manuscript text.
 
 - The manuscript highlights that a key advance for the microscope is the ability to image over a very large 13 mm diameter field of view. Can the authors clarify why they chose to characterize resolution over an 8 mm diameter field rather than the full area?
 
 The 13 mm diameter field of view refers to the diagonal of the 10.6 x 8.0 mm field of view. The results presented in Figure 1c are with respect to the horizontal x direction and vertical y direction. A note indicating that the 13 mm is with respect to the diagonal of the rectangular imaging field has been added to the manuscript text. The results were presented in this way to present the axial and lateral resolution as a function of y (the axial sweeping direction).
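 
As a quick consistency check on the numbers quoted here, the diagonal and area of a 10.6 x 8.0 mm rectangle can be computed directly. The variable names are illustrative; only the two side lengths come from the response.

```python
import math

# Rectangular field of view quoted in the author response (mm).
width_mm, height_mm = 10.6, 8.0

diagonal_mm = math.hypot(width_mm, height_mm)  # length of the diagonal
area_mm2 = width_mm * height_mm                # area of the rectangle

print(f"diagonal = {diagonal_mm:.1f} mm")  # prints "diagonal = 13.3 mm"
print(f"area = {area_mm2:.1f} mm^2")       # prints "area = 84.8 mm^2"
```

Both values agree with the rounded figures used elsewhere in the reviews (a 13 mm field and a field of view of about 85 mm^2).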
 
 - The resolution estimates seem lower than I would expect for a 0.30 NA lens (which should be closer to ~850 nm for 515 nm emission). Could the authors clarify the discrepancy? Is this predicted by the Zemax model and due to using the lens in immersion media, related to sampling size on the camera, or something else? It would be helpful if the authors could overlay the expected diffraction-limited performance together with the plots in Figure 2C.
 
 As mentioned previously, the resolution measurements were performed with 561 nm excitation and an emission bandpass of ~573 – 616 nm (595 nm average). Based on this we would expect the full width half maximum resolution to be ~975 nm. The resolution is in fact limited by sampling on the camera. The 3.76 µm pixel size, combined with the 5.0X magnification, results in a sampling of 752 nm. Based on the Nyquist criterion, the resolution is limited to ~1.5 µm. We have added clarifying statements to the text.
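 
The sampling arithmetic in this response can be reproduced in a few lines. The sketch below uses the pixel size, magnification, and expected optical FWHM stated above; taking the larger of the optical and Nyquist figures as the effective limit is a simplifying assumption, and the function name is hypothetical.

```python
def sampling_limits_um(pixel_size_um: float, magnification: float):
    """Return (object-space pixel size, Nyquist-limited resolution) in microns."""
    sampling = pixel_size_um / magnification  # pixel size projected into the sample
    nyquist = 2.0 * sampling                  # two pixels per resolvable period
    return sampling, nyquist


sampling_um, nyquist_um = sampling_limits_um(pixel_size_um=3.76, magnification=5.0)
optical_fwhm_um = 0.975  # expected optical FWHM quoted in the response

# Assumption: the coarser of the two figures sets the effective resolution.
effective_um = max(optical_fwhm_um, nyquist_um)

print(f"sampling  = {sampling_um * 1000:.0f} nm")  # prints "sampling  = 752 nm"
print(f"Nyquist   = {nyquist_um:.2f} um")          # prints "Nyquist   = 1.50 um"
print(f"effective = {effective_um:.2f} um")        # prints "effective = 1.50 um"
```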
 
 - I'm confused about the characterization of light sheet thickness and how it relates to the measured detection field curvature. The authors state that they "deliver a light sheet with NA = 0.10 which has a width of 12.5 mm (FWHM)." If we estimate that light fills the 0.10 NA, it should have a beam waist (2wo) of ~3 microns (assuming Gaussian beam approximations). Although field curvature is described as "minimal" in the text, it is still ~10-15 microns at the edge of the field for the emission bands for GFP and RFP proteins. Given that this is 5X larger than the light sheet thickness, how do the authors deal with this?
 
 The generated light sheet is flat, with a thickness of ~ 3 µm. This flat light sheet will be captured in focus over the depth of focus of the detection objective. The stated field curvature is within 2.5X the depth of focus of the detection lens, which is equivalent to the “Plan” specification of standard microscope objectives.
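 
The ~3 µm light sheet thickness can be checked against the paraxial Gaussian beam relation w0 = lambda / (pi * NA). The sketch below assumes that relation holds for this system and uses two illustrative excitation wavelengths (561 nm is mentioned in the response; 488 nm is added only for comparison).

```python
import math


def gaussian_waist_diameter_um(wavelength_um: float, na: float) -> float:
    """Full beam waist 2*w0 of a Gaussian beam, paraxial approximation.

    wavelength_um is the vacuum wavelength; na is the numerical aperture.
    """
    w0 = wavelength_um / (math.pi * na)
    return 2.0 * w0


excitation_na = 0.10  # light sheet NA stated in the reviews
for wavelength_um in (0.488, 0.561):  # illustrative wavelengths
    d = gaussian_waist_diameter_um(wavelength_um, excitation_na)
    print(f"lambda = {wavelength_um * 1000:.0f} nm: 2*w0 ~ {d:.1f} um")
```

Both wavelengths give a waist diameter between 3 and 4 µm, in line with the approximate figure used by the reviewer and the authors.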
 
 - In Figure 2E, it would be helpful if the authors could list the exposure times as well as the total voxels/second for the two-camera comparison. It's also worth noting that the Sony chip used in the VP151MX camera was released last year whereas the Orca Flash V3 chosen for comparison is over a decade old now. I'm confused as to why the authors chose this camera for comparison when they appear to have a more recent Orca BT-Fusion that they show in a picture in the supplement (indicated as Figure S2 in the text, but I believe this is a typo and should be Figure S3).
 
 This is a useful addition, and we have added exposure times to the plot. We have also added a note that the Orca Flash V3 is an older generation sCMOS camera and that newer variants exist, including the Orca BT-Fusion. The BT-Fusion has a read noise of 1.0 e- rms versus 1.6 e- rms, and a peak quantum efficiency of ~95% vs. 85%. Based on the discussion in Supplementary Note S1, we do not expect that these differences in specifications would dramatically change the data presented in the plot. In addition, the typo in Figure S2 has been corrected to Figure S3.
 
 - In Table S1, the authors note that they only compare their work to prior modalities that are capable of providing <= 1 micron resolution. I'm a bit confused by this choice given that Figure 2 seems to show the resolution of ExA-SPIM as ~1.5 microns at 4 mm off center (1/2 their stated radial field of view). It also excludes a comparison with the mesoSPIM project which at least to me seems to be the most relevant prior to this manuscript. This system is designed for imaging large cleared tissues like the ones shown here. While the original publication in 2019 had a substantially lower lateral resolution, a newer variant, Nikita et al bioRxiv (which is cited in general terms in this manuscript, but not explicitly discussed) also provides 1.5-micron lateral resolution over a comparable field of view.
 
 We have updated the table to include the benchtop mesoSPIM from Nikita et al., Nature Communications, 2024. Based on this published version of the manuscript, the lateral resolution is 1.5 µm and axial resolution is 3.3 µm. Assuming the Iris 15 camera sensor, with the stated 2.5 fps, the volumetric rate (megavoxels/sec) is 37.41.
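 
The quoted volumetric rate follows from the sensor pixel count multiplied by the frame rate. The sketch below assumes a 5056 x 2960 pixel sensor for the Iris 15, which is not stated in the response, and treats each camera pixel in each frame as one voxel.

```python
# Assumed sensor format for the Iris 15 camera (not given in the response).
sensor_width_px, sensor_height_px = 5056, 2960
frames_per_second = 2.5  # frame rate stated in the response

voxels_per_second = sensor_width_px * sensor_height_px * frames_per_second
megavoxels_per_second = voxels_per_second / 1e6

print(f"{megavoxels_per_second:.2f} megavoxels/sec")  # prints "37.41 megavoxels/sec"
```

Under that assumption the result matches the 37.41 megavoxels/sec reported for the benchtop mesoSPIM.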
 
 - The authors state that, "We systematically evaluated dehydration agents, including methanol, ethanol, and tetrahydrofuran (THF), followed by delipidation with commonly used protocols on 1 mm thick brain slices. Slices were expanded and examined for clarity under a macroscope." It would be useful to include some data from this evaluation in the manuscript to make it clear how the authors arrived at their final protocol.
 
 Additional details on the expansion protocol may be included in another manuscript.
 
 General comments:
 
 There is a tendency in the manuscript to use negative qualitative terms when describing prior work and positive qualitative terms when describing the work here. Examples include:
 
 - "Throughput is limited in part by cumbersome and error-prone microscopy methods". While I agree that performing single neuron reconstructions at a large scale is a difficult challenge, the terms cumbersome and error-prone are qualitative and lacking objective metrics.
 
 We have revised this statement to be more precise, stating that throughput is limited in part by the speed and image quality of existing microscopy methods.
 
 - The resolution of the system is described in several places as "near-isotropic" whereas prior methods were described as "highly anisotropic". I agree that the ~1:3 lateral to axial ratio here is more isotropic than the 1:6 ratio of the other cited publications. However, I'm not sure I'd consider 3-fold worse axial resolution than lateral to be considered "near" isotropic.
 
 We agree that the term near-isotropic is ambiguous. We have modified the text accordingly, removing the term near-isotropic and where appropriate stating that the resolution is more isotropic than that of other cited publications.
 
 - In the manuscript, the authors describe the photobleaching in their imaging conditions as "negligible". Figure S5 seems to show a loss of 60% fluorescence after 2000 exposures (which in the caption is described as "modest"). I'd suggest removing these qualitative terms and just stating the values.
 
 We agree and have changed the text accordingly.
 
 - The results section for Figure 5 is titled "Tracing axons in human neocortex and white matter". Although this section states "larger axons (>1 um) are well separated... allowing for robust automated and manual tracing" there is no data for any tracing in the manuscript. Although I agree that the images are visually impressive, I'm not sure that this claim is backed by data.
 
 We have now removed the text in this section referring to automated and manual tracing.
 
 AuthorResponse

Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.08.544277v5
www.biorxiv.org www.biorxiv.org

Four Individually Identified Paired Dopamine Neurons Signal Taste Punishment in Larval Drosophila

5
1. Public_Reviews 12 May 2025
  
  in eLife
  
  eLife Assessment
  
  This comprehensive study presents important findings that delineate how specific dopaminergic neurons (DANs) instruct aversive learning in Drosophila larvae exposed to high salt through an integration of behavioral experiments, imaging, and connectomic analysis. The work reveals how a numerically minimal circuit achieves remarkable functional complexity, with redundancies and synergies within the DL1 cluster that challenge our understanding of how few neurons generate learning behaviors. By establishing a framework for sensory-driven learning pathways, the study makes a compelling and substantial contribution to understanding associative conditioning while demonstrating conservation of learning mechanisms across Drosophila developmental stages.
  
  Summary
2. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  In this paper Weber et al. investigate the role of 4 dopaminergic neurons of the Drosophila larva in mediating the association between an aversive high-salt stimulus and a neutral odor. The 4 DANs belong to the DL1 cluster and innervate non-overlapping compartments of the mushroom body, distinct from those involved in appetitive associative learning. Using specific driver lines for individual neurons, the authors show that activation of the DAN-g1 is sufficient to mimic an aversive memory and it is also necessary to form a high-salt memory of full strength, although optogenetic silencing of this neuron has only a partial phenotype. The authors use calcium imaging to show that the DAN-g1 is not the only DAN responding to salt. DAN-c1 and d1 also respond to salt, but they seem to play no role for the associative memory. DAN-f1, which does not respond to salt, is able to lead to the formation of a memory (if optogenetically activated), but it is not necessary for the salt-odor memory formation in normal conditions. However, when silenced together with DAN-g1, it enhances the memory deficit of DAN-g1. Overall, this work brings evidence of a complex interaction between DL1 DANs in both the encoding of salt signals and their teaching role in associative learning, with none of them being individually necessary and sufficient for both functions.
  
  Overall, the manuscript contributes interesting results that are useful for understanding the organization and function of the dopaminergic system. The behavioral role of the specific DANs is assessed using specific driver lines, which allow their function to be tested individually and in pairs. Moreover, the authors perform calcium imaging to test whether DANs are activated by salt, a prerequisite for inducing a negative association to it. Proper genetic controls are carried out throughout the manuscript.
  
  Review 1
3. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this work the authors show that dopaminergic neurons (DANs) from the DL1 cluster in Drosophila larvae are required for the formation of aversive memories. DL1 DANs complement pPAM cluster neurons which are required for the formation of attractive memories. This shows the compartmentalized network organization of how an insect learning center (the mushroom body) encodes memory by integrating olfactory stimuli with aversive or attractive teaching signals. Interestingly, the authors found that the 4 main dopaminergic DL1 neurons act partially redundant, and that single cell ablation did not result in aversive memory defects. However, ablation or silencing of a specific DL1 subset (DAN-f1,g1) resulted in reduced salt aversion learning, which was specific to salt but no other aversive teaching stimuli tested. Importantly, activation of these DANs using an optogenetic approach was also sufficient to induce aversive learning in the presence of high salt. Together with the functional imaging of salt and fructose responses of the individual DANs and the implemented connectome analysis of sensory (and other) inputs to DL1/pPAM DANs this represents a very comprehensive study linking the structural, functional and behavioral role of DL1 DANs. This provides fundamental insight into the function of a simple yet efficiently organized learning center which displays highly conserved features of integrating teaching signals with other sensory cues via dopaminergic signaling.
  
  Strengths:
  
  This is a very careful, precise and meticulous study identifying the main larval DANs involved in aversive learning using high salt as a teaching signal. This is highly interesting because it allows the cellular substrates and pathways of aversive learning to be defined down to the single-cell level in a system without much redundancy. It therefore sets the basis to conduct even more sophisticated experiments and, together with the neat connectome analysis, opens the possibility to unravel different sensory processing pathways within the DL1 cluster and integration with the higher order circuit elements (Kenyon cells and MBONs). The authors' claims are well substantiated by the data and balanced, putting their data in the appropriate context. The authors also implemented neat pathway analyses using the larval connectome data to its full advantage, thus providing network pathways that contribute towards explaining the obtained results.
  
  Weaknesses:
  
  Previous comments were fully addressed by the authors.
  
  Review 2
4. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  The study of Weber et al. provides a thorough investigation of the roles of four individual dopamine neurons for aversive associative learning in the Drosophila larva. They focus on the neurons of the DL-1 cluster which already have been shown to signal aversive teaching signals. But the authors go beyond the previous publications and test whether each of these dopamine neurons responds to salt or sugar, is necessary for learning about salt, bitter, or sugar, and is sufficient to induce a memory when optogenetically activated. In addition, previously published connectomic data is used to analyze the synaptic input to each of these dopamine neurons. The authors conclude that the aversive teaching signal induced by salt is distributed across the four DL-1 dopamine neurons, with two of them, DAN-f1 and DAN-g1, being particularly important. Overall, the experiments are well designed and performed, support the authors' conclusions, and deepen our understanding of the dopaminergic punishment system.
  
  Strengths:
  
  (1) This study provides, at least to my knowledge, the first in vivo imaging of larval dopamine neurons in response to tastants. Although the selection of tastants is limited, the results close an important gap in our understanding of the function of these neurons.
  
  (2) The authors performed a large number of experiments to probe for the necessity of each individual dopamine neuron, as well as combinations of neurons, for associative learning. This includes two different training regimens (1 or 3 trials), three different tastants (salt, quinine and fructose) and two different effectors, one ablating the neuron, the other acutely silencing it. This thorough work is highly commendable, and the results prove that it was worth it. The authors find that only one neuron, DAN-g1, is partially necessary for salt learning when acutely silenced, whereas a combination of two neurons, DAN-f1 and DAN-g1, is necessary for salt learning when either ablated or silenced.
  
  (3) In addition, the authors probe whether any of the DL-1 neurons is sufficient for inducing an aversive memory. They found this to be the case for two of the neurons, largely confirming previous results obtained by a different learning paradigm, parameters and effector.
  
  (4) This study also takes into account connectomic data to analyze the sensory input that each of the dopamine neurons receives. This analysis provides a welcome addition to previous studies and helps to gain a more complete understanding. The authors find large differences in inputs that each neuron receives, and little overlap in input that the dopamine neurons of the "aversive" DL-1 cluster and the "appetitive" pPAM cluster seem to receive.
  
  (5) Finally, the authors try to link all the gathered information in order to describe an updated working model of how aversive teaching signals are carried by dopamine neurons to the larva's memory center. This includes important comparisons both between two different aversive stimuli (salt and nociception) and between the larval and adult stages.
  
  Review 3
5. Public_Reviews 12 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the previous reviews
  
  Public Reviews:
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this paper Weber et al. investigate the role of 4 dopaminergic neurons of the Drosophila larva in mediating the association between an aversive high-salt stimulus and a neutral odor. The 4 DANs belong to the DL1 cluster and innervate non-overlapping compartments of the mushroom body, distinct from those involved in appetitive associative learning. Using specific driver lines for individual neurons, the authors show that activation of the DAN-g1 is sufficient to mimic an aversive memory and it is also necessary to form a high-salt memory of full strength, although optogenetic silencing of this neuron has only a partial phenotype. The authors use calcium imaging to show that the DAN-g1 is not the only DAN responding to salt. DAN-c1 and d1 also respond to salt, but they seem to play no role for the associative memory. DAN-f1, which does not respond to salt, is able to lead to the formation of a memory (if optogenetically activated), but it is not necessary for the salt-odor memory formation in normal conditions. However, when silenced together with DAN-g1, it enhances the memory deficit of DAN-g1. Overall, this work brings evidence of a complex interaction between DL1 DANs in both the encoding of salt signals and their teaching role in associative learning, with none of them being individually necessary and sufficient for both functions.
  
  Strengths:
  
  Overall, the manuscript contributes interesting results that are useful for understanding the organization and function of the dopaminergic system. The behavioral role of the specific DANs is assessed using specific driver lines, which allow their function to be tested individually and in pairs. Moreover, the authors perform calcium imaging to test whether DANs are activated by salt, a prerequisite for inducing a negative association to it. Proper genetic controls are carried out throughout the manuscript.
  
  Weaknesses:
  
  The authors use two different approaches to silence dopaminergic neurons: optogenetics and induction of apoptosis. The results are not always consistent, but the authors discuss these differences appropriately. In general, the optogenetic approach is more appropriate as developmental compensations are not of major interest for the question investigated.
  
  The physiological data would suggest the role of a certain subset of DANs in salt-odor association, but a different partially overlapping set is necessary in behavioral assays (with a partial phenotype). No manipulation completely abolishes the salt-odor association, leaving important open questions on the identity of the neural circuits involved in this behavior.
  
  The EM data analysis reveals a non-trivial organization of sensory inputs into DANs, but it is difficult to extrapolate a link to the functional data presented in the paper.
  
  We would like to once again thank Reviewer 1 for the positive assessment of our work and for the valuable suggestions provided on the first revision of the manuscript. In this second revision, we have addressed the linguistic issues and most of the minor comments as recommended. We now hope that the current version of our manuscript meets the reviewer’s expectations both in terms of language and content.
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this work the authors show that dopaminergic neurons (DANs) from the DL1 cluster in Drosophila larvae are required for the formation of aversive memories. DL1 DANs complement pPAM cluster neurons, which are required for the formation of attractive memories. This shows the compartmentalized network organization of how an insect learning center (the mushroom body) encodes memory by integrating olfactory stimuli with aversive or attractive teaching signals. Interestingly, the authors found that the 4 main dopaminergic DL1 neurons act in a partially redundant manner, and that single cell ablation did not result in aversive memory defects. However, ablation or silencing of a specific DL1 subset (DAN-f1,g1) resulted in reduced salt aversion learning, which was specific to salt and not seen with the other aversive teaching stimuli tested. Importantly, activation of these DANs using an optogenetic approach was also sufficient to induce aversive learning in the presence of high salt. Together with the functional imaging of salt and fructose responses of the individual DANs and the implemented connectome analysis of sensory (and other) inputs to DL1/pPAM DANs this represents a very comprehensive study linking the structural, functional and behavioral role of DL1 DANs. This provides fundamental insight into the function of a simple yet efficiently organized learning center which displays highly conserved features of integrating teaching signals with other sensory cues via dopaminergic signaling.
  
  Strengths:
  
  This is a very careful, precise and meticulous study identifying the main larval DANs involved in aversive learning using high salt as a teaching signal. This is highly interesting because it allows the cellular substrates and pathways of aversive learning to be defined down to the single-cell level in a system without much redundancy. It therefore sets the basis for even more sophisticated experiments and, together with the neat connectome analysis, opens the possibility of unraveling different sensory processing pathways within the DL1 cluster and their integration with the higher-order circuit elements (Kenyon cells and MBONs). The authors' claims are well substantiated by the data and balanced, putting their data in the appropriate context. The authors also implemented neat pathway analyses using the larval connectome data to its full advantage, thus providing network pathways that contribute towards explaining the obtained results.
  
  Weaknesses:
  
  Previous comments were fully addressed by the authors.
  
  We sincerely thank Reviewer 2 for the positive evaluation of our work. We are glad that our responses in the first revision addressed the previous concerns and appreciate the reviewer’s constructive feedback once again.
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The study of Weber et al. provides a thorough investigation of the roles of four individual dopamine neurons for aversive associative learning in the Drosophila larva. They focus on the neurons of the DL-1 cluster, which have already been shown to carry aversive teaching signals. The authors go beyond the previous publications, however, and test whether each of these dopamine neurons responds to salt or sugar, is necessary for learning about salt, bitter, or sugar, and is sufficient to induce a memory when optogenetically activated. In addition, previously published connectomic data is used to analyze the synaptic input to each of these dopamine neurons. The authors conclude that the aversive teaching signal induced by salt is distributed across the four DL-1 dopamine neurons, with two of them, DAN-f1 and DAN-g1, being particularly important. Overall, the experiments are well designed and performed, support the authors' conclusions, and deepen our understanding of the dopaminergic punishment system.
  
  Strengths:
  
  (1) This study provides, at least to my knowledge, the first in vivo imaging of larval dopamine neurons in response to tastants. Although the selection of tastants is limited, the results close an important gap in our understanding of the function of these neurons.
  
  (2) The authors performed a large number of experiments to probe for the necessity of each individual dopamine neuron, as well as combinations of neurons, for associative learning. This includes two different training regimens (1 or 3 trials), three different tastants (salt, quinine and fructose) and two different effectors, one ablating the neuron, the other acutely silencing it. This thorough work is highly commendable, and the results prove that it was worth it. The authors find that only one neuron, DAN-g1, is partially necessary for salt learning when acutely silenced, whereas a combination of two neurons, DAN-f1 and DAN-g1, is necessary for salt learning when either ablated or silenced.
  
  (3) In addition, the authors probe whether any of the DL-1 neurons is sufficient for inducing an aversive memory. They found this to be the case for two of the neurons, largely confirming previous results obtained by a different learning paradigm, parameters and effector.
  
  (4) This study also takes into account connectomic data to analyze the sensory input that each of the dopamine neurons receives. This analysis provides a welcome addition to previous studies and helps to gain a more complete understanding. The authors find large differences in inputs that each neuron receives, and little overlap in input that the dopamine neurons of the "aversive" DL-1 cluster and the "appetitive" pPAM cluster seem to receive.
  
  (5) Finally, the authors try to link all the gathered information in order to describe an updated working model of how aversive teaching signals are carried by dopamine neurons to the larva's memory center. This includes important comparisons both between two different aversive stimuli (salt and nociception) and between the larval and adult stages.
  
  We would also like to thank Reviewer 3 for the positive assessment of our work. Many of the constructive comments provided were incorporated into the first revision, contributing significantly to the improved clarity and overall quality of the manuscript.
  
  Recommendations for the authors:
  
  Reviewer #1 (Recommendations for the authors):
  
  Here are some minor comments (and some semantics that could be addressed to improve the manuscript)
  
  Title: is the title correct given that c1 and d1 do not really signal punishment?
  
  We think the title is correct and would like to keep it as it is.
  
  L72 striatum misspelled
  
  We have corrected the error.
  
  L74 constitute instead of provide?
  
  We made the suggested modification in the text.
  
  L129: "But can these four individual DANs also process other sensory modalities?" Other than what? What was used before?
  
  We have made the required change, which now allows us to contrast somatosensory and chemosensory information.
  
  L172: (Please refer to the discussion regarding the partial reduction of the memory); it would be more natural to explain this briefly here, or to add a sentence before this parenthesis that points to the effect.
  
  We made the requested change in the manuscript and added a short sentence before the parenthesis.
  
  L182: "DL1 neurons convey a dopaminergic aversive teaching signal" you cannot make this statement from just TH-GAL4!
  
  We agree. We have therefore completely revised the sentence, restricted it further, and now also refer to additional published larval and adult data.
  
  L264: "possible redundancy among" I don't think you are testing a redundancy here, it is more likely a developmental compensation.
  
  We made the requested change in the sentence and added a potential developmental compensation as an interpretation of our results.
  
  L296: "to determine if the activation of individual DL1 DANs signals aspects of the natural high salt punishment," - how can the optogenetic activation tell something about aspects of the natural salt punishment? I understand the fact that salt is present, but still I find it inaccurate
  
  Our approach is based on the framework established by Bertram Gerber and colleagues over the past two decades in larval Drosophila research. According to this logic, memory recall is dependent on the specific properties of the test context, particularly the type and concentration of the stimulus presented on the test plate. Aversive memory retrieval occurs only when the test conditions closely match those of the training stimulus. Consequently, the larva's behavior on the test plate serves as an indicator of the memory content being recalled. We therefore adhere to this established methodology (Gerber & Hendel, 2006; Schleyer et al., 2011; Schleyer et al., 2015).
  
  L307 "DAN-f1 and DAN-g1 encode aspects of the natural aversive high salt teaching" you cannot conclude that given that f1 does not even respond to salt. I understand the logic of the salt during test, but I think it is still a stretched interpretation
  
  We agree and thus have deleted the sentence.
  
  L310 "Individual DL1 DANs are acutely necessary" this is too general, it seems that only one is
  
  We have changed the title and now clearly state that this is only one DAN of the DL1 cluster.
  
  Reviewer #2 (Recommendations for the authors):
  
  In Fig.6 the text flow could be optimized as the authors first mention Fig. 6E,F before they follow up with Fig. 6A-D.
  
  Thanks for bringing this up – we changed it in the revised version of the manuscript. Now 6A-D is mentioned first.
  
  In Fig.6 the finding that optogenetic inactivation but not ablation of DAN-g1 slightly but significantly reduces aversive salt learning suggests that there is an individual contribution of this DAN in this paradigm. The authors emphasize redundancy of DL1 DANs although the effect size seems comparable between DAN-g1 and DAN-f1,g1 silencing.
  
  In response to this concern and the one of reviewer 2, we have revised the section title and removed the final sentence of the section before to avoid placing emphasis on the potential redundancy of DL1 DANs within this results section.
  
  Reviewer #3 (Recommendations for the authors):
  
  The authors replied to each issue I raised, and revised their manuscript accordingly. In particular, regarding my major concern (the sufficiency of the neurons for salt-"specific" memories), I think the authors found a good solution.
  
  I have no further comments.
  
  We sincerely thank the reviewer for the positive feedback on our revision. We are pleased that the revised manuscript meets the expectations and appreciate the time and effort invested in the review process.
  
  AuthorResponse

Tags

Summary

Review 1

Review 3

AuthorResponse

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.07.26.550661v3
www.biorxiv.org

Acquisition of auditory discrimination mediated by different processes through two distinct circuits linked to the lateral striatum

3
1. Public_Reviews 12 May 2025
  
  in eLife
  
  eLife Assessment
  
  This study provides an important understanding of the contribution of different striatal subregions, the anterior Dorsal Lateral Striatum (aDLS) and the posterior Ventrolateral Striatum (pVLS), to auditory discrimination learning. The authors have included robust behavior combined with multiple observational and perturbation techniques. The data provided are convincing of the relevance of task-related activity in these two subregions during learning.
  
  Summary
2. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this study, Setogawa et al. employ an auditory discrimination task in freely moving rats, coupled with small animal imaging, electrophysiological recordings, and pharmacological inhibition/lesioning experiments to better understand the role of two striatal subregions, the anterior Dorsal Lateral Striatum (aDLS) and the posterior Ventrolateral Striatum (pVLS), during auditory discrimination learning. Attempting to better understand the contribution of different striatal subregions to sensory discrimination learning strikes me as a highly relevant and timely question, and the data presented in this study are certainly of major interest to the field. The authors have set up a robust behavioral task and systematically tackled the question of a striatal role in learning with multiple observational and manipulative techniques. Additionally, the structured approach the authors take by using neuroimaging to inform their pharmacological manipulation experiments and electrophysiological recordings is a strength.
  
  Comments on revisions:
  
  The authors have addressed some concerns raised in the initial review, but some remain. In particular, it is still unclear what conclusions can be drawn about task-related activity from scans that are performed 30 minutes after the behavioral task. I continue to think that a reorganization/analysis of the data according to event type would be useful and easier to interpret across the two brain areas, but the authors chose not to do this. Finally, I am convinced that switching the cue-response association would help to strengthen this study.
  
  Review 1
3. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  The study by Setogawa et al. aims to understand the role that different striatal subregions belonging to parallel brain circuits have in associative learning and discrimination learning (S-O-R and S-R tasks). Strengths of the study are the use of multiple methodologies to measure and manipulate brain activity in rats, from microPET imaging to excitotoxic lesions and multielectrode recordings across anterior dorsolateral (aDLS), posterior ventral lateral (pVLS) and dorsomedial (DMS) striatum.
  
  The main conclusions are that the aDLS promotes stimulus-response association and suppresses response-outcome associations. The pVLS is engaged in the formation and maintenance of the stimulus-response association. A lot of work has been done and there are some interesting findings; however, the manuscript can be improved by clarifying the presentation and reasoning. The inclusion of important controls will enhance the rigor of the data interpretation and conclusions.
  
  Comments on revisions:
  
  The authors have made important revisions to the manuscript and it has improved in clarity. They also added several figures in the rebuttal letter to answer questions by the reviewers. I would ask that these figures are also made public as part of the authors' response or if not, included in the manuscript.
  
  Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.10.19.563198v4
www.medrxiv.org

Causal associations between plasma proteins and prostate cancer: a Proteome-Wide Mendelian Randomization

4
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents a valuable meta-analysis of two independent genome-wide association studies (GWASs) elucidating the role of plasma proteins as biomarkers for improving early detection of prostate cancer (PCa). The evidence supporting novel protein biomarkers of PCa risk is solid, although exploration of how these markers may also be shared with other prostate diseases would have strengthened the study. The work will be of interest to the field for elucidating novel variants of prostate cancer risk.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 In Causal associations between plasma proteins and prostate cancer: a Proteome-Wide Mendelian Randomization, the authors present a manuscript which seeks to identify novel markers for prostate cancer through analysis of large biobank-based datasets and to extend this analysis to potential therapeutic targets for drugs. This is an area that is already extensively researched, but remains important, due to the high burden and mortality of prostate cancer globally.
 
 Strengths:
 
 The main strengths of the manuscript are the identification and use of large biobank data assets, which provide large numbers of cases and controls, essential for achieving statistical power. The databases used (deCODE, FinnGen, and the UK Biobank) allow for robust numbers of cases and controls. The analytical method chosen, Mendelian Randomization, is appropriate to the problem. Another strength is the integration of multi-omic datasets, here using protein data as well as GWAS sources to integrate genomic and proteomic data.
 
 Weaknesses:
 
 The main weaknesses of the manuscript relate to the following areas:
 
 (1) The failure of the study to analyse the data in the context of other closely related conditions such as benign prostatic hyperplasia (BPH) or lower urinary tract symptoms (LUTS), which have some pathways and biomarkers in common, such as inflammatory pathways (including complement) and specific markers such as KLK3. As a consequence, it is not possible for readers to know whether the findings are specific to prostate cancer or whether they are generic to prostate dysfunction. Given the prevalence of prostate dysfunction (half of men reaching their sixth decade), the potential for false positives and overtreatment from non-specific biomarkers is a major problem, resulting in the evidence presented in this manuscript being weak. Other researchers have addressed this issue using the same data sources as presented here, for example, in this paper, looking at BPH in the UK Biobank population. https://www.nature.com/articles/s41467-018-06920-9
 
 (2) There is no discussion of Gleason scores with regard to either biomarkers or therapies, and a general lack of discussion around indolent disease as compared with more aggressive variants. These are crucial issues with regard to the triage and identification of genomically aggressive localized prostate cancers. See, for example, the work set out in: https://doi.org/10.1038/nature20788
 
 (3) An additional issue is that the field of PCa research is fast-moving. The manuscript cites ~80 references, but too few of these are from recent studies, and many important and relevant papers are not included. The manuscript would be much stronger if it compared and contrasted its findings with more recent studies of PCa biomarkers and targets, especially those concerned with multi-omics and those including BPH.
 
 (4) The Methods section provides no information on how the Controls were selected. There is no Table providing cohort data to allow the reader to know whether there were differences in age, BMI, ethnic grouping, social status or deprivation, or smoking status, between the Cases and Controls. These types of data are generally recorded in Biobank data, so this sort of analysis should be possible, or if not, the authors' inability to construct an appropriately matched set of Controls should be discussed as a Limitation.
 
 Assessing impact:
 
 Because of the weaknesses of the approach identified above, without further additions to the manuscript, the likely impact of the work on the field is minimal. There is no significant utility of the methods and data to the community, because the data are pre-existing and are not newly introduced to the community in this work, and Mendelian randomization is a well-described approach in common use, and therefore, the assets and methods described in the manuscript are not novel. With regard to the authors achieving their aims, without assessing specificity and without setting their findings in the context of the latest literature, the authors (and readers) cannot know or assess whether the biomarkers identified or the druggable targets will be useful in the clinic.
 
 In conclusion, adding additional context and analysis to the manuscript would both help readers interpret and understand the work and would also greatly enhance its significance. For example, the UK Biobank includes data on men with BPH / LUTS, as analysed in this paper, for example, https://doi.org/10.1038/s41467-018-06920-9. By extending this analysis to identify which biomarkers and druggable targets are specific to PCa, and which are generic to prostate dysfunction, the authors would substantially reduce the risks of diagnostic false positives. This would help to manage the risks of inappropriate treatment or overtreatment.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 This is potentially interesting work, but the analyses are attempted in a rather scattergun way, with little evident critical thought. The structure of the work (Results before Methods) can work in some manuscripts, but it is not ideal here. The authors discuss results before we know anything about the underlying data that the results come from. It gives the impression that the authors regard data as a resource to be exploited, without really caring where the data comes from. The methods can provide meaningful insights if correctly used, but while I don't have reasons to doubt that the analyses were conducted correctly, findings are presented with little discussion or interpretation. No follow-up analyses are performed.
 
 In summary, there are likely some gems here, but the whole manuscript is essentially the output from an analytic pipeline.
 
 Taking the researchers' aims in turn:
 
 (1) Meta-GWAS - while combining two datasets together can provide additional insights, the contribution of this analysis above existing GWAS is not clear. The PRACTICAL consortium has already reported the GWAS of 70% of these data. What additional value does this analysis provide? (Likely some, but it's not clear from the text.) Also, the presentation of results is unclear - authors state that only 5 gene regions contained variants at p < 5×10⁻⁸, but Figure 1 shows dozens of hits above 5×10⁻⁸. Also, the red line in Figure 1 (supposedly at 5×10⁻⁸) is misplaced.
 
 (2) Cross-phenotype analysis. It is not really clear what this analysis is, or why it is done. What is the iCPAGdb? A database? A statistical method? Why would we want to know cross-phenotype associations? What even are these? It seems that the authors have taken data from an online resource and have written a paragraph based on this existing data with little added value.
 
 (3) PW-MR. I can see the value of this work, but many details are unclear. Was this a two-sample MR using PRACTICAL + FinnGen data for the outcome? How many variants were used in key analyses? Again, the description of results is sparse and gives little added value.
 
 (4) Colocalization - seems clear to me.
 
 (5) Additional post-GWAS analyses (pathway + druggability) - again, the analyses seem to be performed appropriately, although they offer little additional insight beyond reporting the output from the methods.
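
For readers less familiar with the two-sample MR setup queried in point (3), the following is a minimal sketch of a fixed-effect inverse-variance-weighted (IVW) estimate computed from per-variant summary statistics. It is not the authors' pipeline: the function name `ivw_estimate` and all numbers are hypothetical and chosen only for illustration. It does show why the number of variants matters, since with a single variant the estimate reduces to the Wald ratio (outcome association divided by exposure association).

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance-weighted (IVW) MR estimate.

    Each list holds one entry per genetic variant (instrument):
    beta_exposure: variant-protein associations
    beta_outcome:  variant-disease associations (e.g. log odds ratios)
    se_outcome:    standard errors of beta_outcome
    """
    n = len(beta_exposure)
    if n == 0 or len(beta_outcome) != n or len(se_outcome) != n:
        raise ValueError("inputs must be non-empty and of equal length")

    # Weight each variant by the precision of its outcome association.
    weights = [1.0 / se ** 2 for se in se_outcome]
    numerator = sum(
        w * bx * by for w, bx, by in zip(weights, beta_exposure, beta_outcome)
    )
    denominator = sum(w * bx * bx for w, bx in zip(weights, beta_exposure))

    estimate = numerator / denominator
    std_error = math.sqrt(1.0 / denominator)
    return estimate, std_error


# Hypothetical summary statistics for three variants (illustration only).
bx = [0.1, 0.2, 0.3]
by = [0.05, 0.10, 0.15]
se = [0.01, 0.01, 0.01]
estimate, std_error = ivw_estimate(bx, by, se)
print(f"{len(bx)} variants, IVW estimate {estimate:.3f} (SE {std_error:.4f})")
```

Reporting the variant count alongside the estimate, as the print statement does, is the kind of detail the reviewer asks for.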
 
 Minor points:
 
 (6) The stated motivation for this work is "early detection". But causality isn't necessary for early detection. If the authors are interested in early detection, other analysis approaches are more appropriate.
 
 (7) The authors state "193 proteins were associated with PCa risk", but they are looking at MR results - these analyses test for disease associations of genetically-predicted levels of proteins, not proteins themselves.
 
 Strengths:
 
 The data and methods used are state-of-the-art.
 
 Weaknesses:
 
 The reader will have to provide their own translational insight.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 Public Reviews:
 
 Reviewer #1 (Public review):
 
 Summary:
 
 In Causal associations between plasma proteins and prostate cancer: a Proteome-Wide Mendelian Randomization, the authors present a manuscript which seeks to identify novel markers for prostate cancer through analysis of large biobank-based datasets and to extend this analysis to potential therapeutic targets for drugs. This is an area that is already extensively researched, but remains important, due to the high burden and mortality of prostate cancer globally.
 
 Strengths:
 
 The main strengths of the manuscript are the identification and use of large biobank data assets, which provide large numbers of cases and controls, essential for achieving statistical power. The databases used (deCODE, FinnGen, and the UK Biobank) allow for robust numbers of cases and controls. The analytical method chosen, Mendelian Randomization, is appropriate to the problem. Another strength is the integration of multi-omic datasets, here using protein data as well as GWAS sources to integrate genomic and proteomic data.
 
 Thank you for your positive feedback regarding the overall quality of our work. We greatly appreciate the time and effort you invested in reviewing our manuscript.
 
 Weaknesses:
 
 The main weaknesses of the manuscript relate to the following areas:
 
 (1) The failure of the study to analyse the data in the context of other closely related conditions such as benign prostatic hyperplasia (BPH) or lower urinary tract symptoms (LUTS), which have some pathways and biomarkers in common, such as inflammatory pathways (including complement) and specific markers such as KLK3. As a consequence, it is not possible for readers to know whether the findings are specific to prostate cancer or whether they are generic to prostate dysfunction. Given the prevalence of prostate dysfunction (half of men reaching their sixth decade), the potential for false positives and overtreatment from non-specific biomarkers is a major problem, resulting in the evidence presented in this manuscript being weak. Other researchers have addressed this issue using the same data sources as presented here, for example, in this paper, looking at BPH in the UK Biobank population. https://www.nature.com/articles/s41467-018-06920-9
 
 Thank you for your valuable comment. We fully agree that biomarker development must prioritize specificity to avoid overtreatment. Our study is a foundational step toward identifying potential therapeutic targets or complementary biomarkers for prostate cancer (PCa), not a direct endorsement of these proteins for standalone clinical diagnosis. Mendelian randomization (MR) analysis strengthens causal inference by design, and we further ensured robustness through sensitivity analyses (e.g. MR-Egger regression for pleiotropy, Bonferroni correction for multiple testing). These methods help distinguish true causal effects from nonspecific associations. Importantly, while PSA’s lack of specificity is widely recognized, its role in reducing PCa mortality underscores the value of biomarker-driven screening. Our findings align with the need to integrate multiple markers (e.g. combining a novel protein with PSA) to improve diagnostic precision. Translating these causal insights into clinical tools remains challenging but represents a necessary next step, and we emphasize that this work provides a rigorous starting point for future validation studies.
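
As a rough illustration of the two checks named in the response above, the sketch below computes a Bonferroni-adjusted significance threshold and the MR-Egger intercept and slope from per-variant summary statistics. It is a simplified sketch under stated assumptions, not the authors' code: the function names and all inputs are hypothetical, and the standard errors and p-value for the intercept that a real analysis would report are omitted. A non-zero MR-Egger intercept is conventionally read as a sign of directional pleiotropy.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold after Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def mr_egger(beta_exposure, beta_outcome, se_outcome):
    """MR-Egger point estimates via weighted linear regression with an intercept.

    Returns (intercept, slope). The slope is the causal estimate; the
    intercept estimates the average pleiotropic effect across variants.
    """
    n = len(beta_exposure)
    if n < 3 or len(beta_outcome) != n or len(se_outcome) != n:
        raise ValueError("need at least 3 variants and equal-length inputs")

    # Orient every variant so its exposure association is positive.
    xs, ys = [], []
    for bx, by in zip(beta_exposure, beta_outcome):
        sign = -1.0 if bx < 0 else 1.0
        xs.append(sign * bx)
        ys.append(sign * by)

    # Inverse-variance weights from the outcome standard errors.
    ws = [1.0 / se ** 2 for se in se_outcome]
    sw = sum(ws)
    swx = sum(w * x for w, x in zip(ws, xs))
    swy = sum(w * y for w, y in zip(ws, ys))
    swxx = sum(w * x * x for w, x in zip(ws, xs))
    swxy = sum(w * x * y for w, x, y in zip(ws, xs, ys))

    denom = sw * swxx - swx ** 2
    if denom == 0:
        raise ValueError("exposure associations must vary across variants")

    slope = (sw * swxy - swx * swy) / denom
    intercept = (swy - slope * swx) / sw
    return intercept, slope


# Hypothetical inputs (illustration only): 1000 proteins tested at alpha = 0.05.
print(bonferroni_threshold(0.05, 1000))
print(mr_egger([0.1, -0.2, 0.3], [0.07, -0.12, 0.17], [0.01, 0.02, 0.01]))
```

Note that Bonferroni correction controls the false-positive rate across the proteins tested and MR-Egger probes pleiotropy. Neither addresses whether a protein is specific to PCa rather than to BPH or LUTS, which is the reviewer's underlying concern.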
 
 (2) There is no discussion of Gleason scores with regard to either biomarkers or therapies, and a general lack of discussion around indolent disease as compared with more aggressive variants. These are crucial issues with regard to the triage and identification of genomically aggressive localized prostate cancers. See, for example, the work set out in: https://doi.org/10.1038/nature20788
 
 Thank you for pointing this out. We acknowledge that our original analysis did not directly address this critical issue due to a key data limitation: the publicly available GWAS summary statistics for PCa (from openGWAS and FinnGen) do not provide genetic associations stratified by phenotypic severity or molecular subtypes. This limitation precluded MR analysis of proteins specifically linked to aggressive disease. To partially bridge this gap, we integrate evidence from recent studies in the revised Discussion section to explore the relevance of potential biomarkers to aggressive PCa.
 
 (3) An additional issue is that the field of PCa research is fast-moving. The manuscript cites ~80 references, but too few of these are from recent studies, and many important and relevant papers are not included. The manuscript would be much stronger if it compared and contrasted its findings with more recent studies of PCa biomarkers and targets, especially those concerned with multi-omics and those including BPH.
 
 Thank you for your professional comments. We have rigorously updated the manuscript to include more recent publications and we systematically compare and contrast our findings with these recent studies in the revised Discussion section.
 
 (4) The Methods section provides no information on how the Controls were selected. There is no Table providing cohort data to allow the reader to know whether there were differences in age, BMI, ethnic grouping, social status or deprivation, or smoking status, between the Cases and Controls. These types of data are generally recorded in Biobank data, so this sort of analysis should be possible, or if not, the authors' inability to construct an appropriately matched set of Controls should be discussed as a Limitation.
 
 We thank the reviewer for raising this important methodological concern. We have expanded the Limitations section to address this point.
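 The sensitivity analyses named in the response to point (1) can be illustrated with a minimal sketch. It assumes per-variant summary statistics (effect on the exposure `bx`, effect on the outcome `by`, and the outcome standard error `se_y`) and uses made-up numbers; the function names are hypothetical and this is not the authors' pipeline. MR-Egger adds an intercept to the inverse-variance weighted (IVW) regression, and a non-zero intercept is read as a sign of directional pleiotropy.

```python
def ivw_estimate(bx, by, se_y):
    """IVW estimate: weighted regression of by on bx through the origin."""
    w = [1.0 / s ** 2 for s in se_y]
    num = sum(wi * x * y for wi, x, y in zip(w, bx, by))
    den = sum(wi * x * x for wi, x in zip(w, bx))
    return num / den


def mr_egger(bx, by, se_y):
    """MR-Egger: weighted regression of by on bx with an intercept.

    Assumes variants are oriented so that every bx is positive.
    Returns (intercept, slope).
    """
    w = [1.0 / s ** 2 for s in se_y]
    sw = sum(w)
    mx = sum(wi * x for wi, x in zip(w, bx)) / sw
    my = sum(wi * y for wi, y in zip(w, by)) / sw
    sxx = sum(wi * (x - mx) ** 2 for wi, x in zip(w, bx))
    sxy = sum(wi * (x - mx) * (y - my) for wi, x, y in zip(w, bx, by))
    slope = sxy / sxx
    return my - slope * mx, slope


def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold after Bonferroni correction."""
    return alpha / n_tests


# Illustrative data only: every variant carries the same pleiotropic
# offset (0.02) on top of a true causal slope of 0.5.
bx = [0.10, 0.20, 0.30, 0.40]
by = [0.02 + 0.5 * x for x in bx]
se_y = [0.01, 0.02, 0.01, 0.02]

intercept, slope = mr_egger(bx, by, se_y)
print("MR-Egger intercept:", intercept, "slope:", slope)
print("IVW estimate:", ivw_estimate(bx, by, se_y))
print("Bonferroni threshold for 1000 tests:", bonferroni_threshold(0.05, 1000))
```

 In this toy case the IVW estimate is pulled above the true slope by the shared offset, while the MR-Egger slope recovers it. Neither method addresses the reviewer's concern about specificity to PCa versus BPH or LUTS.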
 
 Reviewer #2 (Public review):
 
 This is potentially interesting work, but the analyses are attempted in a rather scattergun way, with little evident critical thought. The structure of the work (Results before Methods) can work in some manuscripts, but it is not ideal here. The authors discuss results before we know anything about the underlying data that the results come from. It gives the impression that the authors regard data as a resource to be exploited, without really caring where the data comes from. The methods can provide meaningful insights if correctly used, but while I don't have reasons to doubt that the analyses were conducted correctly, findings are presented with little discussion or interpretation. No follow-up analyses are performed.
 
 In summary, there are likely some gems here, but the whole manuscript is essentially the output from an analytic pipeline.
 
 We thank the reviewer for the thoughtful evaluation of our work.
 
 Taking the researchers' aims in turn:
 
 (1) Meta-GWAS - while combining two datasets together can provide additional insights, the contribution of this analysis above existing GWAS is not clear. The PRACTICAL consortium has already reported the GWAS of 70% of these data. What additional value does this analysis provide? (Likely some, but it's not clear from the text.) Also, the presentation of results is unclear - authors state that only 5 gene regions contained variants at p<5×10⁻⁸, but Figure 1 shows dozens of hits above 5×10⁻⁸. Also, the red line in Figure 1 (supposedly at 5×10⁻⁸) is misplaced.
 
 Thank you very much for your feedback. Although the PRACTICAL consortium data constituted the majority of the PCa GWAS data, our meta-analysis integrating FinnGen data enhanced statistical power, enabling more robust detection of variants with low minor allele frequencies. Moreover, FinnGen's Finnish ancestry (a genetic isolate) helps distinguish population-specific effects. The results as presented showed the top 5 gene regions containing variants at p < 5×10⁻⁸. We apologize for not noticing that the red line was not displayed correctly in the original figures included in the manuscript. We have updated it in the revised manuscript.
 
 (2) Cross-phenotype analysis. It is not really clear what this analysis is, or why it is done. What is the iCPAGdb? A database? A statistical method? Why would we want to know cross-phenotype associations? What even are these? It seems that the authors have taken data from an online resource and have written a paragraph based on this existing data with little added value.
 
 We thank you for raising this issue. The iCPAGdb (interactive Cross-Phenotype Analysis of GWAS database) is an integrative platform that systematically identifies cross-phenotype associations and evaluates genetic pleiotropy by leveraging LD-proxy associations from the NHGRI-EBI GWAS Catalog. The pathogenesis and progression of prostate cancer constitute a complex pathophysiological continuum characterized by dynamic multisystem interactions, extending beyond singular molecular pathway dysregulation to encompass coordinated disruptions across endocrine regulation, immune microenvironment remodeling, and metabolic reprogramming. Therefore, cross-phenotype analysis is indispensable for discriminating primary pathogenic drivers from secondary compensatory responses, ultimately informing the development of precision therapeutic strategies.
 
 (3) PW-MR. I can see the value of this work, but many details are unclear. Was this a two-sample MR using PRACTICAL + FinnGen data for the outcome? How many variants were used in key analyses? Again, the description of results is sparse and gives little added value.
 
 We thank you for raising this issue. Two-sample MR refers to an analytical design where genetic instruments for the exposure (plasma proteins) and genetic associations with the outcome (PCa) are derived from non-overlapping populations. This ensures complete sample independence between exposure and outcome datasets to avoid confounding biases, regardless of whether the outcome data originate from single or multiple cohorts. The meta-analysis of PRACTICAL and FinnGen GWAS generates 27,210 quality-controlled variants (p < 5×10⁻⁸, MAF ≥ 1%, LD-clumped r² < 0.1) used in key analyses.
 
 (4) Colocalization - seems clear to me.
 
 (5) Additional post-GWAS analyses (pathway + druggability) - again, the analyses seem to be performed appropriately, although little additional insight other than the reporting of output from the methods.
 
 The post-MR druggability and pathway analyses serve two primary scientific purposes: (1) therapeutic prioritization - systematically evaluating which MR-identified proteins represent tractable drug targets (either through existing FDA-approved agents or compounds in clinical development) with direct relevance to cancer or PCa management, and (2) mechanistic hypothesis generation - mapping these candidate proteins to coherent biological pathways to guide future functional validation studies investigating their causal roles in prostate carcinogenesis.
 
 Minor points:
 
 (6) The stated motivation for this work is "early detection". But causality isn't necessary for early detection. If the authors are interested in early detection, other analysis approaches are more appropriate.
 
 We appreciate your insightful feedback. While early detection is one motivation for this work, our primary goal extends to identifying causally implicated proteins that may serve as intervention targets for PCa prevention or therapy. Establishing causality is critical for distinguishing biomarkers that drive disease pathogenesis from those that are secondary to disease progression, as the former holds greater specificity for early detection and prioritization of therapeutic targets. While we acknowledge that validation for early detection may require additional methodologies, MR analysis provides a foundational step by prioritizing candidate proteins with causal links to disease. This approach ensures that downstream efforts focus on biomarkers and targets with the greatest potential to alter disease trajectories, rather than merely correlative markers.
 
 (7) The authors state "193 proteins were associated with PCa risk", but they are looking at MR results - these analyses test for disease associations of genetically-predicted levels of proteins, not proteins themselves.
 
 In MR, the exposure of interest is the lifelong effect of genetically predicted protein levels. This approach is designed to infer causality while avoiding confounding and reverse causation, as genetic variants are fixed at conception and unaffected by disease processes. When we state “193 proteins were associated with PCa risk,” we specifically refer to proteins whose genetically predicted levels (based on instrument SNPs from protein QTLs) show causal links to PCa. Importantly, MR does not measure the direct association between observed protein concentrations and disease. Instead, it estimates the lifelong causal effect of protein levels predicted by genetics. This distinction is critical for disentangling cause from consequence. For example, a protein elevated due to tumor progression would not be identified as causal in MR if its genetic predictors are unrelated to PCa risk.
 
 We acknowledge that clinical translation requires further validation of these proteins in observational studies measuring actual protein levels. However, MR provides a robust first step by prioritizing candidates with causal roles, thereby reducing the risk of investing in biomarkers confounded by disease processes.
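 The distinction drawn in point (7) between measured protein levels and genetically predicted levels can be made concrete with the single-instrument Wald ratio, which is a standard MR estimator. The sketch below uses made-up numbers and a hypothetical function name, and approximates the standard error by ignoring uncertainty in the variant-protein association; it is not taken from the manuscript.

```python
from math import exp


def wald_ratio(beta_exposure, beta_outcome, se_outcome):
    """Wald ratio for one instrument (e.g. a protein QTL).

    beta_exposure: effect of the variant on the protein level (SD per allele)
    beta_outcome:  effect of the variant on disease (log-odds per allele)
    se_outcome:    standard error of beta_outcome
    Returns (estimate, approximate standard error); the error term from
    beta_exposure is ignored (first-order approximation).
    """
    estimate = beta_outcome / beta_exposure
    se = abs(se_outcome / beta_exposure)
    return estimate, se


# Illustrative numbers only.
estimate, se = wald_ratio(beta_exposure=0.4, beta_outcome=0.08, se_outcome=0.02)
print("log-odds of disease per SD of genetically predicted protein:", estimate)
print("approximate standard error:", se)
print("odds ratio per SD:", exp(estimate))
```

 The estimate is a ratio of two genetic associations, so it describes the effect of genetically predicted protein levels rather than an association with measured concentrations.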
 
 AuthorResponse
URL

medrxiv.org/content/10.1101/2024.09.17.24312688v1
www.medrxiv.org www.medrxiv.org

Five-Year Survival Outcomes for Breast Cancer Patients Across Continental Africa: A Contemporary Review of Literature with Meta Analysis

4
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents a valuable meta-analysis that highlights low and highly variable breast cancer survival rates across Africa, emphasizing the pressing need for public health in Africa. The evidence supporting the claims of the authors is solid, although a clarification of the crude 5-year survival rates would have strengthened the study. The work will be of interest to scientists working in the field of public health and breast cancer.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 This meta-analysis synthesized data from 79 studies across 22 African countries, encompassing over 27,000 breast cancer patients, to estimate 5-year survival rates. The pooled survival rate was 48%, with substantial regional variation, ranging from 64% in Northern Africa to 32% in Western Africa. Survival outcomes were associated with socioeconomic indicators such as education level, Human Development Index (HDI), and Socio-demographic Index (SDI). Although no significant differences in survival were observed between sexes, non-Black Africans had better outcomes. Despite global advances in cancer care, breast cancer survival in Africa has largely stagnated since the early 2010s, underscoring the need for improved healthcare infrastructure, early detection, and equitable access to treatment.
 
 Strengths:
 
 The study has several strengths. It features a comprehensive literature search, adherence to the PRISMA reporting guideline, and prospective registration on PROSPERO, including documentation of protocol deviations. The authors employed rigorous meta-analytic techniques, including subgroup analyses and meta-regression, allowing for a nuanced investigation of potential effect modifiers.
 
 Weaknesses:
 
 Analyses of crude 5-year survival rates are inherently difficult to interpret, particularly in the absence of key clinical variables such as stage at diagnosis or whether cancers were detected through screening programs. This omission raises concerns about lead time bias, where earlier diagnosis (e.g., via screening) may falsely appear to improve survival without affecting actual mortality. The higher survival seen in North Africa, for example, may reflect earlier diagnosis rather than improved prognosis or care quality. In this context, the age of the study population is another important aspect.
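 Lead time bias can be shown with a small worked example. The sketch below uses invented ages and a hypothetical helper: the age at death is identical in both scenarios, and only the age at diagnosis moves earlier by an assumed two-year lead time.

```python
def five_year_survival(diagnosis_ages, death_ages, horizon=5.0):
    """Fraction of patients still alive `horizon` years after diagnosis."""
    alive = sum(
        1 for dx, death in zip(diagnosis_ages, death_ages) if death - dx > horizon
    )
    return alive / len(diagnosis_ages)


# Illustrative cohort: ages at death do not change between scenarios.
death_ages = [58.0, 60.0, 62.0, 64.0]

# Scenario A: diagnosis when symptoms appear.
symptomatic_dx = [54.0, 56.0, 56.0, 60.0]

# Scenario B: screening detects each cancer two years earlier (assumed lead time).
lead_time = 2.0
screened_dx = [age - lead_time for age in symptomatic_dx]

print("5-year survival, symptomatic diagnosis:",
      five_year_survival(symptomatic_dx, death_ages))
print("5-year survival, screen-detected:",
      five_year_survival(screened_dx, death_ages))
```

 Crude 5-year survival rises from 25% to 100% although no patient lives any longer, which is why stage at diagnosis and mode of detection matter when comparing regions.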
 
 Relatedly, the representativeness of the included study populations is unclear. The data sources for individual studies - whether from national cancer registries or single tertiary hospitals - are not systematically reported. This distinction is crucial, as survival outcomes differ significantly between population-based and hospital-based cohorts. Without this contextual information, the generalizability of the findings is limited.
 
 The meta-regression analyses further raise concerns. The authors use study-level covariates (e.g., national HDI, average years of schooling) to explain variation in survival, yet they do not acknowledge the risk of ecological bias. Inferring individual-level effects from aggregated data is methodologically flawed, and the authors' causal interpretation of these associations is inappropriate, especially given the potential for confounding by unmeasured variables at both the individual and study levels.
 
 The assessment of publication bias is similarly problematic. While funnel plot asymmetry and a significant Egger's test are interpreted as evidence of bias, such methods are unreliable in meta-analyses of observational studies. Smaller studies may differ meaningfully from larger ones, not due to selective reporting, but because they may recruit patients from specialized tertiary centers where outcomes are poorer. The observed relationship between study size and survival may therefore reflect true differences in patient populations, not publication bias.
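 The regression behind Egger's test can be sketched as follows: the standardized effect (effect divided by its standard error) is regressed on precision (one over the standard error), and an intercept away from zero indicates funnel plot asymmetry. The example below is a simplified illustration with invented data and a hypothetical function name; it returns only the coefficients and omits the significance test on the intercept.

```python
def egger_regression(effects, std_errors):
    """Ordinary least squares of effect/se on 1/se.

    Returns (intercept, slope). The intercept measures funnel plot
    asymmetry; the slope estimates the underlying effect.
    """
    z = [e / s for e, s in zip(effects, std_errors)]
    precision = [1.0 / s for s in std_errors]
    n = len(z)
    mean_p = sum(precision) / n
    mean_z = sum(z) / n
    sxx = sum((p - mean_p) ** 2 for p in precision)
    sxy = sum((p - mean_p) * (zi - mean_z) for p, zi in zip(precision, z))
    slope = sxy / sxx
    return mean_z - slope * mean_p, slope


std_errors = [0.02, 0.04, 0.08, 0.10]

# Case 1: every study estimates the same value, so the funnel is symmetric.
homogeneous = [0.5 for _ in std_errors]

# Case 2: smaller studies (larger standard errors) report lower values,
# here because they are assumed to recruit sicker patients. Every study
# is "published", so no selective reporting is involved.
small_study_shift = [0.5 - 1.0 * s for s in std_errors]

print("symmetric funnel:", egger_regression(homogeneous, std_errors))
print("small-study shift:", egger_regression(small_study_shift, std_errors))
```

 The second case yields a clearly non-zero intercept even though nothing was withheld from publication, which is the reviewer's point: asymmetry alone cannot separate publication bias from real differences between small and large studies.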
 
 Despite claiming to search for gray literature via Google Scholar, no such studies appear in the PRISMA flowchart. This is a missed opportunity. Gray literature - especially reports from cancer registries - could have enhanced the quality and completeness of the data. While cancer registration systems are not available in all African countries, several do exist, and the authors should have made greater efforts to incorporate routine surveillance data where available. Mortality data from vital statistics systems, available in some countries, could also have provided useful context or validation.
 
 The study's approach to quality assessment is limited. The scoring tool, adapted from Ssentongo et al., conflates completeness of reporting with risk of bias and fails to address key domains such as study population representativeness, selection bias, and lead time bias. Rather than calculating an overall quality score, the authors should have used a structured tool that evaluates risk of bias across defined domains, such as ROBINS-I, ROBINS-E, or tools developed for prevalence studies (e.g., Tonia et al., BMJ Mental Health, 2023). Cochrane guidance and the textbook by Egger, Higgins, and Davey Smith (DOI:10.1002/9781119099369) provide valuable resources for this purpose.
 
 The cumulative meta-analysis is not particularly informative, considering the massive heterogeneity in survival rates. It would be more meaningful to stratify the analysis by calendar period. In general, with such important heterogeneity, the calculation of an overall estimate does not add much.
 
 The authors spend quite some time discussing differences in survival between men and women and between the current and the 2018 estimates, despite the fact that the survival rates are similar, with widely overlapping confidence intervals. The use of a Z-test in this context is inappropriate as it ignores the heterogeneity between studies.
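 Why between-study heterogeneity matters for such comparisons can be seen by pooling the same studies with and without a between-study variance term. The sketch below implements the DerSimonian-Laird random-effects estimator, a common choice that the manuscript may or may not use, on invented survival proportions with equal within-study variances; the function name and data are illustrative assumptions.

```python
from math import sqrt


def pool(effects, variances):
    """Fixed-effect and DerSimonian-Laird random-effects pooling."""
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * y for wi, y in zip(w, effects)) / sw

    # Cochran's Q and the DerSimonian-Laird between-study variance tau^2.
    q = sum(wi * (y - fixed) ** 2 for wi, y in zip(w, effects))
    df = len(effects) - 1
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c)

    w_re = [1.0 / (v + tau2) for v in variances]
    sw_re = sum(w_re)
    random_effects = sum(wi * y for wi, y in zip(w_re, effects)) / sw_re

    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return {
        "fixed": fixed,
        "se_fixed": sqrt(1.0 / sw),
        "random": random_effects,
        "se_random": sqrt(1.0 / sw_re),
        "tau2": tau2,
        "i2": i2,
    }


# Illustrative 5-year survival proportions from four hypothetical studies.
survival = [0.30, 0.45, 0.60, 0.70]
variances = [0.001, 0.001, 0.001, 0.001]

result = pool(survival, variances)
for name, value in result.items():
    print(name, round(value, 4))
```

 With these numbers the random-effects standard error is several times larger than the fixed-effect one, so a Z-test built on the narrower standard error overstates how precisely two pooled estimates can be told apart.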
 
 Minor point:
 
 The terms retrospective and prospective are not particularly helpful - every longitudinal study of survival is retrospective. What the authors mean is whether or not the data were collected within a study designed to address this question, or whether existing data were used that were collected for another purpose. See also DOI: 10.1136/bmj.302.6771.249.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The study provides an updated literature review and meta-analysis for the 5-year survival estimates in breast cancer patients across continental Africa. The findings reveal substantial disparities between regions and other factors, highlighting the disadvantaged areas in Africa and the urgent need to address these inequities across the continent.
 
 Strengths:
 
 The main strengths of this study include: (1) the thorough literature search with an increasing number of included studies that enhances result reliability; (2) standard and appropriate statistical methods with clear reporting; (3) a comprehensive discussion.
 
 Overall, the paper is well-structured, clearly presented, and provides useful insights.
 
 Weaknesses:
 
 However, I have a few concerns that I would like the authors to address.
 
 (1) The conclusion "A country-wise comparison with 2018 estimates suggests a declining survival tendency, with WHO AFRO countries reporting the poorest estimates among other WHO regions." appears to have been drawn from the comparisons across both different regions and different time periods, which is incorrect! As shown in Figure 8, survival in Africa has increased from below 30% (WHO AFRO 2017) to around 50% (AFRICA 2024, presumably the current study). Section 3.5 is confusing and headed in the wrong direction. The key message in Figure 8 is that the current survival estimate in Africa is still lower than that of other WHO regions from a few years ago, highlighting the urgent need to improve survival in Africa.
 
 (2) The previous review by Ssentongo et al. classified countries into North Africa and sub-Saharan Africa (SSA), regions divided by the Sahara Desert. This classification is not only geographically based, but also accounts for the significant differences in ethnicity, health system, and socioeconomic factors. North Africa (especially Egypt, Tunisia, Morocco) has better cancer registries, earlier detection, more treatment access, and therefore better survival outcomes (as shown in Figure 2). SSA tends to have worse outcomes, due to later-stage diagnosis, limited pathology, and access barriers. Given that the survival in women with breast cancer is among the lowest for several SSA countries, the study would benefit from an additional comparison between pooled estimates of North Africa and SSA, and comparisons with previous pooled estimates.
 
 (3) The authors classified studies under the female group. Females constituted at least 80% of the sample population, and subgroup analysis revealed only a marginal discrepancy in survival rates between the two sexes. However, most of the breast cancer patients and related studies consist predominantly of females. Given the non-negligible differences in various aspects between females and males, sensitivity analyses restricted to studies among females (as in Figure 2-3) would be informative for future research in breast cancer patients.
 
 (4) Stage at diagnosis and treatment are the strongest prognostic factors for breast cancer survival. Though data regarding these variables are not available for all studies, and it's complicated to compare or pool the results from different studies (as mentioned in the limitation), could the authors perform the regression analyses regarding early vs. late stages, and the percentage of treatment received? These two factors are too significant to overlook in studies on breast cancer survival.
 
 (5) The authors reported that studies published before 2019 had a higher survival than those conducted from 2019 onwards, which could be misleading and requires further explanation. As the authors noted ("the year of publication may not be a reliable measure of the effect in question"), a better approach would be to use the year of inclusion, i.e., the year the studies were conducted.
 
 (6) Northern and Western Africa both have the highest incidence of breast cancer in Africa, yet their 5-year survival estimates differ substantially. However, the authors have discussed the survival disparities without considering their similarly higher incidence rates. Could this disparity reflect different contributing factors, with the higher incidence rate in Northern Africa resulting from better screening programs (leading to more detections), while that in Western Africa stems from other epidemiological factors despite lower screening participation? Though the incidence rate is not the primary focus of this study, briefly exploring this dichotomy would enhance the discussion and provide valuable insights for readers.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 We thank the reviewing editors, senior editors, and reviewers for their time, efforts, and constructive feedback. We believe the points raised are addressable and we would like to proceed with a revised submission for further review. Specifically, we plan the following revisions:
 
 Editor’s Comments
 
 We will clarify study definitions to ensure the meaning of "5-year crude overall survival time" is explicit for readers.
 
 Reviewer 1 Comments
 
 - Clarify and supplement the work with detailed sources of study origin (cancer registries or single-center cohorts).
 
 - Conduct a multi-level hierarchical meta-analysis to address concerns of ecological fallacy in interpreting results.
 
 - Perform an ecological sensitivity analysis and clarify findings regarding small study effects.
 
 - Expand the search base significantly by including African local databases; preliminary searches have identified over 50 potentially eligible doctoral theses, dissertations, local journal articles, and gray literature, potentially adding data from five or more additional countries.
 
 Reviewer 2 Comments
 
 - Conduct subgroup analyses by sex and assess the influence of the percentage of males in mixed cohorts.
 
 - Enhance the limited meta-analysis and provide supplementary full forest plots for all analyses.
 
 - Clarify phrasing in sections identified by the reviewer.
 
 Additional Planned Clarifications and Analyses
 
 - Elucidate the role of cumulative meta-analysis in mitigating lead-time bias.
 
 - Include supplementary cumulative meta-analysis based on the year of investigation (instead of publication year).
 
 - Perform subgroup analyses by clinical staging, TNM grading, and treatment modalities where data from ≥10 studies is available.
 
 - Expand discussion on the merits of quality assessment versus risk of bias evaluation in large scale epidemiological and observational studies, in line with other studies of this scale.
 
 - Condense the comparison with 2018 estimates, as per reviewer suggestions.
 
 Clarification Regarding SSA vs. AU Classification
 
 We do not intend to compare survival between "Sub-Saharan Africa" (SSA) and North Africa, as this binary classification is historically rooted and does not reflect current African Union (AU) administrative or policy groupings. Our regional analyses will adhere to the AU’s contemporary regional framework to better reflect political, cultural, and healthcare system realities.
 
 On Registry Data
 
 We will clarify that we will not extract raw registry data, as such data is typically unprocessed and does not provide 5-year overall survival metrics. Extracting raw, individual-level data from registries or vital statistics systems therefore falls outside the methodological scope of a meta-analysis. Meta-analyses are designed to synthesize published survival estimates or those available from reports where survival analyses have already been conducted. Utilizing raw surveillance data would require primary data processing and survival analysis, effectively creating new data rather than synthesizing existing results. This would represent a distinct study design, such as a pooled analysis or original cohort study, rather than a meta-analysis. Where registry reports present summary survival estimates (e.g., 5-year overall survival) in a format compatible with meta-analysis, we will certainly include them.
 
 All planned additional analyses will depend on data quality, consistency, and feasibility for pooling using state-of-the-art statistical techniques. Where pooling is not possible, we will transparently report limitations.
 
 AuthorResponse
URL

medrxiv.org/content/10.1101/2025.01.03.25319952v1
www.biorxiv.org www.biorxiv.org

Multiphase separation in postsynaptic density regulated by membrane geometry via interaction valency and volume

5
1. Public_Reviews 12 May 2025
  
  in eLife
  
  eLife Assessment
  
  This valuable study provides a conceptual advance in our understanding of how membrane geometry modulates the balance between specific and non-specific molecular interactions, reversing multiphase morphologies in postsynaptic protein assemblies. Using a mesoscale simulation framework grounded in experimental binding affinities, the authors successfully recapitulate key experimental observations in both solution and membrane-associated systems, providing novel mechanistic insight into how spatial constraints regulate postsynaptic condensate organization. While the evidence supporting the conclusions is largely solid, a few aspects of the analysis and model proposed remain incomplete.
  
  Summary
2. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This study uses mesoscale simulations to investigate how membrane geometry regulates the multiphase organization of postsynaptic condensates. It reveals that dimensionality shifts the balance between specific and non-specific interactions, thereby reversing domain morphology observed in vitro versus in vivo.
  
  Strengths:
  
  The model is grounded in experimental binding affinities, reproduces key experimental observations in 3D and 2D contexts, and offers mechanistic insight into how geometry and molecular features drive phase behavior.
  
  Weaknesses:
  
  The model omits other synaptic components that may influence domain organization and does not extensively explore parameter sensitivity or broader physiological variability.
  
  Review 1
3. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  This is a timely and insightful study aiming to explore the general physical principles for the sub-compartmentalization--or lack thereof--in the phase separation processes underlying the assembly of postsynaptic densities (PSDs), especially the markedly different organizations in three-dimensional (3D) droplets on one hand and the two-dimensional (2D) condensates associated with a cellular membrane on the other. Simulation of a highly simplified model (one bead per protein domain) is carefully executed. Based on a thorough consideration of various control cases, the main conclusion regarding the trade-off between repulsive excluded volume interactions and attractive interactions among protein domains in determining the structures of 3D vs 2D model PSD condensates is quite convincing. The results in this manuscript are novel; however, as it stands, there is substantial room for improvement in the presentation of the background and the findings of this work. In particular, (i) conceptual connections with prior works should be better discussed, (ii) essential details of the model should be clarified, and (iii) the generality and limitations of the authors' approach should be better delineated. Specifically, the following items should be addressed (with the additional references mentioned below cited and discussed):
  
  (1) Excluded volume effects are referred to throughout the text by various terms and descriptions such as "repulsive force according to the volume" (e.g., in the Introduction), "nonspecific volume interaction", and "volume effects" in this manuscript. This is somewhat curious and not conducive to clarity, because these terms have alternate meanings or connotations (e.g., in biomolecular modeling, repulsive interactions usually refer to those with longer spatial ranges, such as that between like charges). It will be much clearer if the authors simply refer to excluded volume interactions as excluded volume interactions (or effects).
  
  (2) Regarding the impact of excluded volume effects on subcompartmentalization of condensates ("multiple phases" in the authors' terminology), it has been demonstrated by both coarse-grained molecular dynamics and field-theoretic simulations that excluded volume is conducive to demixing of molecular species in condensates [Pal et al., Phys Rev E 103:042406 (2021); see especially Figures 4-5 of this reference]. This prior work bears directly on the authors' observation. Its relationship with the present work should be discussed.
  
  (3) In the present model setup, activation of the CaMKII kinase affects only its binding to GluN2Bc. This approach is reasonable and leads to model predictions that are essentially consistent with the experiment. More broadly, however, do the authors expect activation of the CaMKII kinase to lead to phosphorylation of some of the molecular species involved with PSDs? This may be of interest since biomolecular condensates are known to be modulated by phosphorylation [Kim et al., Science 365:825-829 (2019); Lin et al., eLife 13:RP100284 (2025)].
  
  (4) The forcefield for confinement of AMPAR/TARP and NMDAR/GluN2Bc to 2D should be specified in the main text. Have the authors explored the sensitivity of their 2D findings on the strength of this confinement?
  
  (5) Some of the labels in Figure 1 are confusing. In Figure 1A, the structure labeled as AMPAR has the same shape as the structure labeled as TARP in Figure 1B, but TARP is labeled as one of the smaller structures (like small legs) in the lower part of AMPAR in Figure 1A. Does the TARP in Figure 1B correspond to the small structures in the lower part of AMPAR? If so, this should be specified (and better indicated graphically), and in that case, it would be better not to use the same structural drawing for the overall structure and a substructure. The same issue is seen for NMDAR in Figure 1A and GluN2Bc in Figure 1B.
  
  (6) In addition to clarifying Figure 1, the authors should clarify the usage of AMPAR vs TARP and NMDAR vs GluN2Bc in other parts of the text as well.
  
  (7) The physics of the authors' model will be much clearer if they provide an easily accessible graphical description of the relative interaction strengths between different domain-representing spheres (beads) in their model. For this purpose, a representation similar to that given by Feric et al., Cell 165:1686-1697 (2016) (especially Figure 6B in this reference) of the pairwise interactions among the beads in the authors' model should be provided as an additional main-text figure. Different interaction schemes corresponding to inactive and activated CaMKII should be given. In this way, the general principles (beyond the PSD system) governing 3D vs 2D multiple-component condensate organization can be made much more apparent.
  
  (8) Can the authors' rationalization of the observed difference between 3D and 2D model PSD condensates be captured by an intuitive appreciation of the restriction on favorable interactions by steric hindrance and the reduction in interaction cooperativity in 2D vs 3D?
  
  (9) In the authors' model, the propensity to form 2D condensates is quite weak. Is this prediction consistent with the experiment? Real PSDs do form 2D condensates around synapses.
  
  (10) More theoretical context should be provided in the Introduction and/or Discussion by drawing connections to pertinent prior works on physical determinants of co-mixing and de-mixing in multiple-component condensates (e.g., amino acid sequence), such as Lin et al., New J Phys 19:115003 (2017) and Lin et al., Biochemistry 57:2499-2508 (2018).
  
  (11) In the discussion of the physiological/neurological significance of PSD in the Introduction and/or Discussion, for general interest it is useful to point to a recently studied possible connection between the hydrostatic pressure-induced dissolution of model PSD and high-pressure neurological syndrome [Lin et al., Chem Eur J 26:11024-11031 (2020)].
  
  (12) It is more accurate to use "perpendicular to the membrane" rather than "vertical" in the caption for Figure 3E and other such descriptions of the orientation of the CaMKII hexagonal plane in the text.
  
  Review 2
4. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  In this work, Yamada, Brandani, and Takada have developed a mesoscopic model of the interacting proteins in the postsynaptic density. They have performed simulations, based on this model and using the software ReaDDy, to study the phase separation in this system in 2D (on the membrane) and 3D (in the bulk). They have carefully investigated the reasons behind different morphologies observed in each case, and have looked at differences in valency, specific/non-specific interactions, and interfacial tension.
  
  Strengths:
  
  The simulation model is developed very carefully, with strong reliance on binding valency and geometry, experimentally measured affinities, and physical considerations like the hydrodynamic radii. The presented analyses are also thorough, and great effort has been put into investigating different scenarios that might explain the observed effects.
  
  Weaknesses:
  
  The biggest weakness of the study, in my opinion, has to do with a lack of more in-depth physical insight about phase separation. For example, the authors express surprise about similar interactions between components resulting in different phase separation in 2D and 3D. This is not surprising at all, as in 3D, higher coordination numbers and more available volume translate to lower free energy, which easily explains phase separation. The role of entropy is also significantly missing from the analyses. When interaction strengths are small, entropic effects play major roles.
  
  In the introduction, the authors present an oversimplified view of associative and segregative phase transitions based on the attractive and repulsive interactions, and I'm afraid that this view, in which all the observed morphologies should have clear pairwise enthalpic explanations, diffuses throughout the analysis. Meanwhile, I believe the authors correctly identify some relevant effects, where they consider specific/non-specific interactions, or when they investigate the reduced valency of CaMKII in the 2D system.
  
  Also, I sense some haste in comparing the findings with experimental observations. For example, the authors mention that "For the current four component PSD system, the product of concentrations of each molecule in the dilute phase is in good agreement with that of the experimental concentrations (Table S2)." But the data used here is the dilute phase, which is the remnant of a system prepared at very high concentrations and allowed to phase separate. The errors reported in Table S2 already cast doubt on this comparison. Or while the 2D system is prepared via confining the particles to the vicinity of the membrane, the different diffusive behavior in the membrane, in contrast to the bulk (i.e., the Saffman-Delbrück model), is not considered. This would thus make it difficult to interpret the results of a coupled 2D/3D system and compare them to the actual system.
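The reviewer's point about membrane versus bulk diffusion can be made concrete with a minimal sketch that compares the Saffman-Delbrück lateral diffusion coefficient with the bulk Stokes-Einstein value. The membrane viscosity, membrane thickness, temperature, and inclusion radii below are illustrative assumptions, not values taken from the manuscript, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23          # Boltzmann constant, J/K
EULER_GAMMA = 0.5772156649  # Euler-Mascheroni constant


def saffman_delbruck_d(radius_m, temp_k=310.0, eta_membrane=0.1,
                       eta_water=1e-3, thickness_m=4e-9):
    """Lateral diffusion coefficient (m^2/s) of a cylindrical membrane inclusion.

    Uses D = kT / (4 pi eta_m h) * [ln(eta_m h / (eta_w a)) - gamma].
    All default parameter values are assumed, order-of-magnitude numbers.
    """
    sd_length = eta_membrane * thickness_m / eta_water
    if radius_m >= sd_length:
        # The logarithmic formula only holds for inclusions much smaller
        # than the Saffman-Delbruck length.
        raise ValueError("radius must be well below eta_m * h / eta_w")
    prefactor = K_B * temp_k / (4.0 * math.pi * eta_membrane * thickness_m)
    return prefactor * (math.log(sd_length / radius_m) - EULER_GAMMA)


def stokes_einstein_d(radius_m, temp_k=310.0, eta_water=1e-3):
    """Bulk diffusion coefficient (m^2/s) of a sphere: D = kT / (6 pi eta a)."""
    return K_B * temp_k / (6.0 * math.pi * eta_water * radius_m)


if __name__ == "__main__":
    for a in (2e-9, 4e-9):
        print(a, saffman_delbruck_d(a), stokes_einstein_d(a))
```

Under these assumed parameters, the membrane value is far smaller than the bulk value and depends only logarithmically on the radius, which is why applying one mobility rule to both the 2D and 3D components could distort the relative time scales of a coupled system.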
  
  Review 3
5. Public_Reviews 12 May 2025
  
  in eLife
  
  Author response:
  
  We thank all the reviewers for their thoughtful comments on our submitted manuscript.
  
  The main points made by all three reviewers were: to discuss the synaptic components omitted from the model and to explore parameter sensitivity and broader physiological variability; to provide deeper physical insights into phase separation; and to clarify terminology and provide better presentation and context in relation to previous studies.
  
  We fully agree with the first point, that parameter sensitivity and broader physiological variability should be explored. Our model omits scaffold proteins such as GKAP, Shank and Homer, which are present at the bottom of the PSD hierarchy. In addition, there are many other interactions in PSDs whose affinity is altered by phosphorylation, and the phase separation state of the condensate is likely to be affected by ionic concentration and other environmental factors. We will include a more detailed discussion of these environmental factors and of the limitations of our study in the Discussion section. Furthermore, regarding the sensitivity of the parameters, the reviewer is right that the membrane potential parameter is important, since it directly regulates the difference between the 3D and 2D systems. We plan to verify this by varying the strength of the membrane potential and rerunning the simulations to see how much it affects the morphology of the condensates.
  
  The second point is that we should provide deeper physical insight into phase separation in different dimensions. It would not be straightforward to directly estimate the entropy of the system due to the nature of the model. However, as pointed out, the difference in phase behavior can be rationalized through simplified theories such as the lattice model. In this context, the reduced coordination number in 2D systems compared to 3D systems, and the decreased pseudo-attractive force due to the depletion effect, can offer explanations. We would like to add some theoretical discussion of these aspects with equations.
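As a rough illustration of the lattice-model argument mentioned here, the following is a standard mean-field (regular-solution) sketch for a symmetric binary mixture. It is not the authors' derivation; the symbols $\phi$, $z$, $\Delta\varepsilon$, and $\chi$ are introduced only for this example.

```latex
% Free energy per lattice site for volume fraction \phi of one component
\frac{f(\phi)}{k_B T} = \phi \ln \phi + (1-\phi)\ln(1-\phi) + \chi\,\phi(1-\phi),
\qquad
\chi = \frac{z\,\Delta\varepsilon}{k_B T},
\quad
\Delta\varepsilon = \varepsilon_{AB} - \tfrac{1}{2}\left(\varepsilon_{AA} + \varepsilon_{BB}\right)

% Spinodal condition
\frac{\partial^2}{\partial \phi^2}\,\frac{f}{k_B T}
  = \frac{1}{\phi} + \frac{1}{1-\phi} - 2\chi = 0

% Critical point at \phi_c = 1/2
\chi_c = 2
\;\;\Longrightarrow\;\;
k_B T_c = \frac{z\,\Delta\varepsilon}{2}

% Same pair energies, different coordination number
\frac{T_c^{\mathrm{2D}}}{T_c^{\mathrm{3D}}} = \frac{z_{\mathrm{2D}}}{z_{\mathrm{3D}}} = \frac{4}{6} = \frac{2}{3}
\quad \text{(square vs. simple cubic lattice)}
```

With identical pairwise energies, the lower coordination number alone reduces the mean-field critical temperature in 2D, so a mixture that demixes in 3D may remain mixed in 2D. Mean-field theory neglects fluctuations, which lower the true critical temperature further and more strongly in 2D.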
  
  Third, we will clarify terminology and provide better explanation in relation to previous studies. In some parts of the manuscript, such as the descriptions of complexes containing receptors, the terminology was inconsistent and figure annotations were lacking. We will improve the wording and visualization in the text for further clarity and add relevant references, as suggested by the reviewers.
  
  Also, as additionally suggested, scripts for the simulation and analysis together with the initial structure obtained will be deposited to Zenodo or GitHub.
  
  AuthorResponse
URL

biorxiv.org/content/10.1101/2025.02.19.638939v1
www.biorxiv.org

MorphoNet 2.0: An innovative approach for qualitative assessment and segmentation curation of large-scale 3D time-lapse imaging datasets

5
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This work presents an important technical advancement with the release of MorphoNet 2.0, a user-friendly, standalone platform for 3D+T segmentation and analysis in biological imaging. The authors provide convincing evidence of the tool's capabilities through illustrative use cases, though broader validation against current state-of-the-art tools would strengthen its position. The software's accessibility and versatility make it a resource that will be of value for the bioimaging community, particularly in specialized subfields.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The authors present a substantial improvement to their existing tool, MorphoNet, intended to facilitate assessment of 3D+t cell segmentation and tracking results, and curation of high-quality analysis for scientific discovery and data sharing. These tools are provided through a user-friendly GUI, making them accessible to biologists who are not experienced coders. Further, the authors have re-developed this tool to be a locally installed piece of software instead of a web interface, making the analysis and rendering of large 3D+t datasets more computationally efficient. The authors evidence the value of this tool with a series of use cases, in which they apply different features of the software to existing datasets and show the improvement to the segmentation and tracking achieved.
 
 While the computational tools packaged in this software are familiar to readers (e.g., cellpose), the novel contribution of this work is the focus on error correction. The MorphoNet 2.0 software helps users identify where their candidate segmentation and/or tracking may be incorrect. The authors then provide existing tools in a single user-friendly package, lowering the threshold of skill required for users to get maximal value from these existing tools. To help users apply these tools effectively, the authors introduce a number of unsupervised quality metrics that can be applied to a segmentation candidate to identify masks and regions where the segmentation results are noticeably different from the majority of the image.
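To illustrate what an unsupervised quality metric of this kind might look like, here is a minimal sketch that flags segmented objects whose volume deviates strongly from the majority, using a median/MAD-based modified z-score. This is an assumed, generic outlier rule and not MorphoNet's actual implementation; the function name, the input format, and the 3.5 threshold are hypothetical choices.

```python
from statistics import median


def flag_outlier_labels(volumes, threshold=3.5):
    """Return sorted labels whose volume is an outlier among all objects.

    volumes: dict mapping object label -> volume (e.g. voxel count).
    Uses the modified z-score 0.6745 * |v - median| / MAD, which is robust
    to the outliers it is trying to detect.
    """
    values = list(volumes.values())
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # All volumes (or at least half of them) are identical: no robust
        # scale is available, so nothing is flagged.
        return []
    return sorted(
        label
        for label, v in volumes.items()
        if 0.6745 * abs(v - med) / mad > threshold
    )


if __name__ == "__main__":
    example = {1: 100, 2: 105, 3: 98, 4: 102, 5: 400, 6: 101}
    print(flag_outlier_labels(example))
```

A rule like this only reports that an object is atypical; whether it is a segmentation error or a genuinely unusual cell still has to be decided by inspection.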
 
 This work is valuable to researchers who are working with cell microscopy data that requires high-quality segmentation and tracking, particularly if their data are 3D time-lapse and thus challenging to segment and assess. The MorphoNet 2.0 tool that the authors present is intended to make the iterative process of segmentation, quality assessment, and re-processing easier and more streamlined, combining commonly used tools into a single user interface.
 
 One of the key contributions of the work is the unsupervised metrics that MorphoNet 2.0 offers for segmentation quality assessment. These metrics are used in the use cases to identify low-quality instances of segmentation in the provided datasets, so that they can be improved with plugins directly in MorphoNet 2.0. However, not enough consideration is given to demonstrating that optimizing these metrics leads to an improvement in segmentation quality. For example, in Use Case 1, the authors report their metrics of interest (Intensity offset, Intensity border variation, and Nuclei volume) for the uncurated silver truth, the partially curated and fully curated datasets, but this does not evidence an improvement in the results. Additional plotting of the distribution of these metrics on the Gold Truth data could help confirm that the distribution of these metrics now better matches the expected distribution.
 
 Similarly, in Use Case 2, visual inspection leads us to believe that the segmentation generated by the Cellpose + Deli pipeline (shown in Figure 4d) is an improvement, but a direct comparison of agreement between segmented masks and masks in the published data (where the segmentations overlap) would further evidence this.
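The direct comparison suggested here could be computed as a per-object intersection-over-union (IoU) between the curated segmentation and a reference. The sketch below is a minimal, assumed version that works on flattened label sequences (one integer label per voxel, with 0 as background); the function name and input convention are hypothetical.

```python
from collections import Counter


def per_label_iou(predicted, reference, background=0):
    """For each reference label, return the best IoU with any predicted label.

    predicted, reference: equal-length flat sequences of integer labels.
    Reference objects with no overlapping predicted object get an IoU of 0.0.
    """
    if len(predicted) != len(reference):
        raise ValueError("label sequences must have the same length")
    pred_size = Counter(predicted)
    ref_size = Counter(reference)
    # Count voxels shared by each (reference label, predicted label) pair.
    overlap = Counter(zip(reference, predicted))
    best = {r: 0.0 for r in ref_size if r != background}
    for (r, p), inter in overlap.items():
        if r == background or p == background:
            continue
        union = ref_size[r] + pred_size[p] - inter
        best[r] = max(best[r], inter / union)
    return best


if __name__ == "__main__":
    ref = [0, 1, 1, 1, 1, 2, 2, 0]
    pred = [0, 5, 5, 5, 0, 7, 7, 7]
    print(per_label_iou(pred, ref))
```

For 3D label images stored as arrays, the voxels would first be flattened into sequences in the same order for both inputs. Comparing these scores before and after curation gives an assessment that is independent of the metrics used to guide the correction.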
 
 We would appreciate the authors addressing the risk of decreasing the quality of the segmentations by applying circular logic with their tool; MorphoNet 2.0 uses unsupervised metrics to identify masks that do not fit the typical distribution. A model such as StarDist can be trained on the "good" masks to generate more masks that match the most common type. This leads to a more homogeneous segmentation quality, without consideration for whether these metrics actually optimize the segmentation.
 
 In Use case 5, the authors include details that the errors were corrected by "264 MorphoNet plugin actions ... in 8 hours actions [sic]". The work would benefit from explaining whether this is 8 hours of human work, trying plugins and iteratively improving, or 8 hours of compute time to apply the selected plugins.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 This article presents MorphoNet 2.0, software designed to visualise and curate segmentations of 3D and 3D+t data. The authors demonstrate its capabilities on five published datasets, showcasing how even small segmentation errors can be automatically detected, easily assessed, and corrected by the user. This allows for more reliable ground truths, which will in turn be very valuable for analysis and for training deep learning models. MorphoNet 2.0 offers intuitive 3D inspection and functionalities accessible to a non-coding audience, thereby broadening its impact.
 
 Strengths:
 
 The work proposed in this article is expected to be of great interest to the community by enabling easy visualisation and correction of complex 3D(+t) datasets. Moreover, the article is clear and well written, making MorphoNet more likely to be used. The goals are clearly defined, addressing an undeniable need in the bioimage analysis community. The authors use a diverse range of datasets, successfully demonstrating the versatility of the software.
 
 We would also like to highlight the great effort that was made to clearly explain which computer configurations are necessary to run the different datasets and how to find the appropriate documentation according to the user's needs. The authors have clearly thought carefully about these two important problems and came up with very satisfactory solutions.
 
 Weaknesses:
 
 There is still one concern: the quantification of the improvement of the segmentations in the use cases and, therefore, the quantification of the potential impact of the software. While it appears hard to quantify the quality of the correction, the proposed work would be significantly improved if such metrics could be provided.
 
 The authors show some distributions of metrics before and after segmentations to highlight the changes. This is a great start, but there seem to be two shortcomings: first, the comparison and interpretation of the different distributions does not appear to be trivial. It is therefore difficult to judge the quality of the improvement from these. Maybe an explanation in the text of how to interpret the differences between the distributions could help. A second shortcoming is that the before/after metrics displayed are the metrics used to guide the correction, so, by design, the scores will improve, but does that accurately represent the improvement of the segmentation? It seems to be the case, but it would be nice to maybe have a better assessment of the improvement of the quality.
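One way to make the comparison of metric distributions easier to interpret is to reduce each comparison to a single number, for example the two-sample Kolmogorov-Smirnov statistic between a metric's distribution and the same metric measured on a reference (gold) segmentation. The sketch below is an assumed illustration of that idea rather than the authors' analysis; the function name is hypothetical.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Largest absolute gap between the empirical CDFs of two samples.

    Returns a value in [0, 1]: 0 means identical empirical distributions,
    1 means the samples do not overlap at all.
    """
    a = sorted(sample_a)
    b = sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    largest_gap = 0.0
    # Both empirical CDFs are step functions that only change at observed
    # values, so checking every observed value is sufficient.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        largest_gap = max(largest_gap, abs(cdf_a - cdf_b))
    return largest_gap


if __name__ == "__main__":
    gold = [1, 2, 3, 4]
    before_curation = [3, 4, 5, 6]
    after_curation = [1, 2, 3, 5]
    print(ks_statistic(before_curation, gold), ks_statistic(after_curation, gold))
```

If the statistic against the reference shrinks after curation, the curated distribution has moved closer to the expected one. This still measures agreement on the chosen metric only, so it complements rather than replaces a direct mask-overlap comparison.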
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 A very thorough technical report of a new standalone, open-source software for microscopy image processing and analysis (MorphoNet 2.0), with a particular emphasis on automated segmentation and its curation to obtain accurate results even with very complex 3D stacks, including timelapse experiments.
 
 Strengths:
 
 The authors did a good job of explaining the advantages of MorphoNet 2.0, as compared to its previous web-based version and to other software with similar capabilities. What I found particularly useful for envisaging these claimed advantages are the five examples used to illustrate the power of the software (based on a combination of Python scripting and the 3D game engine Unity). These examples, from published research, are very varied in both types of information and image quality, and all have their complexities, making them inherently difficult to segment. I strongly recommend that readers carefully watch the accompanying videos, which show (although not thoroughly) how the software is actually used in these examples.
 
 Weaknesses:
 
 Being a technical article, the only possible comments are on how methods are presented, which is generally adequate, as mentioned above. In this regard, and in spite of the presented examples (chosen by the authors, who clearly gave them a deep thought before showing them), the only way in which the presented software will prove valuable is through its use by as many researchers as possible. This is not a weakness per se, of course, but just what is usual in this sort of report. Hence, I encourage readers to download the software and give it time to test it on their own data (which I will also do myself).
 
 In conclusion, I believe that this report is fundamental because it will be the main way of initially promoting the use of MorphoNet 2.0 among its target audience. The software itself holds the promise of being very impactful for the microscopists' community.
 
 Review 3
5. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 eLife Assessment
 
 This work presents an important technical advancement with the release of MorphoNet 2.0, a user-friendly, standalone platform for 3D+T segmentation and analysis in biological imaging. The authors provide convincing evidence of the tool's capabilities through illustrative use cases, though broader validation against current state-of-the-art tools would strengthen its position. The software's accessibility and versatility make it a resource that will be of value for the bioimaging community, particularly in specialized subfields.
 
 We would like to thank the editors and reviewers for their careful and constructive evaluation of our manuscript “MorphoNet 2.0: An innovative approach for qualitative assessment and segmentation curation of large-scale 3D time-lapse imaging datasets”. We are grateful for the positive assessment of MorphoNet 2.0 as a valuable and accessible tool for the bioimaging community, and for the recognition of its technical advancements, particularly in the context of complex 3D+t segmentation tasks.
 
 The reviewers have highlighted several important points that we will address in the revised manuscript. These include:
 
 - The need for a clearer demonstration that improvements in unsupervised quality metrics correspond to actual improvements in segmentation quality. In response, we will provide comparisons with gold standard annotations where available and clarify how to interpret metric distributions.
 - The potential risk of circular logic when using unsupervised metrics to guide model training. We now explicitly discuss this limitation and emphasize the importance of external validation and expert input.
 - The value of comparing MorphoNet 2.0 to other tools such as FIJI and napari. We will include a comparative table to help readers understand MorphoNet’s positioning and complementarity.
 - The importance of clearer documentation and terminology. We will overhaul the help pages, standardize plugin naming, and add a glossary-style table to the manuscript.
 - Suggestions for future developments, such as mesh export and interoperability with napari, which we will explore for the revision.
 
 We appreciate the detailed feedback on both scientific and editorial aspects, including corrections to figures and text, and we will integrate all suggested revisions to improve the manuscript’s clarity and impact. We are confident that these changes will strengthen the manuscript and enhance the utility of MorphoNet 2.0 for the community.
 
 Public Reviews:
 
 Reviewer #1 (Public review):
 
 The authors present a substantial improvement to their existing tool, MorphoNet, intended to facilitate assessment of 3D+t cell segmentation and tracking results, and curation of high-quality analysis for scientific discovery and data sharing. These tools are provided through a user-friendly GUI, making them accessible to biologists who are not experienced coders. Further, the authors have re-developed this tool to be a locally installed piece of software instead of a web interface, making the analysis and rendering of large 3D+t datasets more computationally efficient. The authors evidence the value of this tool with a series of use cases, in which they apply different features of the software to existing datasets and show the improvement to the segmentation and tracking achieved.
 
 While the computational tools packaged in this software are familiar to readers (e.g., cellpose), the novel contribution of this work is the focus on error correction. The MorphoNet 2.0 software helps users identify where their candidate segmentation and/or tracking may be incorrect. The authors then provide existing tools in a single user-friendly package, lowering the threshold of skill required for users to get maximal value from these existing tools. To help users apply these tools effectively, the authors introduce a number of unsupervised quality metrics that can be applied to a segmentation candidate to identify masks and regions where the segmentation results are noticeably different from the majority of the image.
 
 This work is valuable to researchers who are working with cell microscopy data that requires high-quality segmentation and tracking, particularly if their data are 3D time-lapse and thus challenging to segment and assess. The MorphoNet 2.0 tool that the authors present is intended to make the iterative process of segmentation, quality assessment, and re-processing easier and more streamlined, combining commonly used tools into a single user interface.
 
 We sincerely thank the reviewer for their thorough and encouraging evaluation of our work. We are grateful that they highlighted both the technical improvements of MorphoNet 2.0 and its potential impact for the broader community working with complex 3D+t microscopy datasets. We particularly appreciate the recognition of our efforts to make advanced segmentation and tracking tools accessible to non-expert users through a user-friendly and locally installable interface, and for pointing out the importance of error detection and correction in the iterative analysis workflow. The reviewer’s appreciation of the value of integrating unsupervised quality metrics to support this process is especially meaningful to us, as this was a central motivation behind the development of MorphoNet 2.0. We hope the tool will indeed facilitate more rigorous and reproducible analyses, and we are encouraged by the reviewer’s positive assessment of its utility for the community.
 
 One of the key contributions of the work is the unsupervised metrics that MorphoNet 2.0 offers for segmentation quality assessment. These metrics are used in the use cases to identify low-quality instances of segmentation in the provided datasets, so that they can be improved with plugins directly in MorphoNet 2.0. However, not enough consideration is given to demonstrating that optimizing these metrics leads to an improvement in segmentation quality. For example, in Use Case 1, the authors report their metrics of interest (Intensity offset, Intensity border variation, and Nuclei volume) for the uncurated silver truth, the partially curated and fully curated datasets, but this does not evidence an improvement in the results. Additional plotting of the distribution of these metrics on the Gold Truth data could help confirm that the distribution of these metrics now better matches the expected distribution.
 
 Similarly, in Use Case 2, visual inspection leads us to believe that the segmentation generated by the Cellpose + Deli pipeline (shown in Figure 4d) is an improvement, but a direct comparison of agreement between segmented masks and masks in the published data (where the segmentations overlap) would further evidence this.
 
 We agree that demonstrating the correlation between metric optimization and real segmentation improvement is essential. We will add new analysis comparing the distributions of the unsupervised metrics with the gold truth data before and after curation. Additionally, we will provide overlap scores where ground truth annotations are available, confirming the improvement. We will also explicitly discuss the limitation of relying solely on unsupervised metrics without complementary validation.
 
 We would appreciate the authors addressing the risk of decreasing the quality of the segmentations by applying circular logic with their tool; MorphoNet 2.0 uses unsupervised metrics to identify masks that do not fit the typical distribution. A model such as StarDist can be trained on the "good" masks to generate more masks that match the most common type. This leads to a more homogeneous segmentation quality, without consideration for whether these metrics actually optimize the segmentation.
 
 We thank the reviewer for this important and insightful comment. It raises a crucial point regarding the risk of circular logic in our segmentation pipeline. Indeed, relying on unsupervised metrics to select “good” masks and using them to train a model like StarDist could lead to reinforcing a particular distribution of shapes or sizes, potentially filtering out biologically relevant variability. This homogenization may improve consistency with the chosen metrics, but not necessarily with the true underlying structures.
 
 We fully agree that this is a key limitation to be aware of. We will revise the manuscript to explicitly discuss this risk, emphasizing that while our approach may help improve segmentation quality according to specific criteria, it should be complemented with biological validation and, when possible, expert input to ensure that important but rare phenotypes are not excluded.
 
 In Use case 5, the authors include details that the errors were corrected by "264 MorphoNet plugin actions ... in 8 hours actions [sic]". The work would benefit from explaining whether this is 8 hours of human work, trying plugins and iteratively improving, or 8 hours of compute time to apply the selected plugins.
 
 We will clarify that the “8 hours” refer to human interaction time, including exploration, testing, and iterative correction using plugins.
 
 Reviewer #2 (Public review):
 
 Summary:
 
 This article presents MorphoNet 2.0, software designed to visualise and curate segmentations of 3D and 3D+t data. The authors demonstrate its capabilities on five published datasets, showcasing how even small segmentation errors can be automatically detected, easily assessed, and corrected by the user. This allows for more reliable ground truths, which will in turn be very valuable for analysis and for training deep learning models. MorphoNet 2.0 offers intuitive 3D inspection and functionalities accessible to a non-coding audience, thereby broadening its impact.
 
 Strengths:
 
 The work proposed in this article is expected to be of great interest to the community by enabling easy visualisation and correction of complex 3D(+t) datasets. Moreover, the article is clear and well written, making MorphoNet more likely to be used. The goals are clearly defined, addressing an undeniable need in the bioimage analysis community. The authors use a diverse range of datasets, successfully demonstrating the versatility of the software.
 
 We would also like to highlight the great effort that was made to clearly explain which computer configurations are necessary to run the different datasets and how to find the appropriate documentation according to the user's needs. The authors have clearly thought carefully about these two important problems and came up with very satisfactory solutions.
 
 We would like to sincerely thank the reviewer for their positive and thoughtful feedback. We are especially grateful that they acknowledged the clarity of the manuscript and the potential value of MorphoNet 2.0 for the community, particularly in facilitating the visualization and correction of complex 3D(+t) datasets. We also appreciate the reviewer’s recognition of our efforts to provide detailed guidance on hardware requirements and access to documentation—two aspects we consider crucial to ensuring the tool is both usable and widely adopted. Their comments are very encouraging and reinforce our commitment to making MorphoNet 2.0 as accessible and practical as possible for a broad range of users in the bioimage analysis community.
 
 Weaknesses:
 
 There is still one concern: the quantification of the improvement of the segmentations in the use cases and, therefore, the quantification of the potential impact of the software. While it appears hard to quantify the quality of the correction, the proposed work would be significantly improved if such metrics could be provided.
 
 The authors show some distributions of metrics before and after segmentations to highlight the changes. This is a great start, but there seem to be two shortcomings: first, the comparison and interpretation of the different distributions does not appear to be trivial. It is therefore difficult to judge the quality of the improvement from these. Maybe an explanation in the text of how to interpret the differences between the distributions could help. A second shortcoming is that the before/after metrics displayed are the metrics used to guide the correction, so, by design, the scores will improve, but does that accurately represent the improvement of the segmentation? It seems to be the case, but it would be nice to maybe have a better assessment of the improvement of the quality.
 
 We thank the reviewer for this constructive and important comment. We fully agree that assessing the true quality improvement of segmentation after correction is a central and challenging issue. While we initially focused on changes in the unsupervised quality metrics to illustrate the effect of the correction, we acknowledge that interpreting these distributions may not be straightforward, and that relying solely on the metrics used to guide the correction introduces an inherent bias in the evaluation.
 
 To address the first point, we will revise the manuscript to provide clearer guidance on how to interpret the changes in metric distributions before and after correction, with additional examples to make this interpretation more intuitive.
 
 Regarding the second point, we agree that using independent, external validation is necessary to confirm that the segmentation has genuinely improved. To this end, we will include additional assessments using complementary evaluation strategies on selected datasets where ground truth is accessible, to compare pre- and post-correction segmentations with an independent reference. These results reinforce the idea that the corrections guided by unsupervised metrics generally lead to more accurate segmentations, but we also emphasize their limitations and the need for biological validation in real-world cases.
 
 Reviewer #3 (Public review):
 
 Summary:
 
 A very thorough technical report of a new standalone, open-source software for microscopy image processing and analysis (MorphoNet 2.0), with a particular emphasis on automated segmentation and its curation to obtain accurate results even with very complex 3D stacks, including timelapse experiments.
 
 Strengths:
 
 The authors did a good job of explaining the advantages of MorphoNet 2.0, as compared to its previous web-based version and to other software with similar capabilities. What I found particularly useful for envisaging these claimed advantages are the five examples used to illustrate the power of the software (based on a combination of Python scripting and the 3D game engine Unity). These examples, from published research, are very varied in both types of information and image quality, and all have their complexities, making them inherently difficult to segment. I strongly recommend that readers carefully watch the accompanying videos, which show (although not thoroughly) how the software is actually used in these examples.
 
 We sincerely thank the reviewer for their thoughtful and encouraging feedback. We are particularly pleased that the reviewer appreciated the comparative analysis of MorphoNet 2.0 with both its earlier version and existing tools, as well as the relevance of the five diverse and complex use cases we selected. Demonstrating the software’s versatility and robustness across a variety of challenging datasets was a key goal of this work, and we are glad that this aspect came through clearly. We also appreciate the reviewer’s recommendation to watch the accompanying videos, which we designed to provide a practical sense of how the tool is used in real-world scenarios. Their positive assessment is highly motivating and reinforces the value of combining scripting flexibility with an interactive 3D interface.
 
 Weaknesses:
 
 Being a technical article, the only possible comments are on how methods are presented, which is generally adequate, as mentioned above. In this regard, and in spite of the presented examples (chosen by the authors, who clearly gave them deep thought before showing them), the only way in which the presented software will prove valuable is through its use by as many researchers as possible. This is not a weakness per se, of course, but just what is usual in this sort of report. Hence, I encourage readers to download the software and take the time to test it on their own data (which I will also do myself).
 
 We fully agree that the true value of MorphoNet 2.0 will be demonstrated through its practical use by a wide range of researchers working with complex 3D and 3D+t datasets. In this regard, we will improve the user documentation and provide a set of example datasets to help new users quickly familiarize themselves with the platform. We are also committed to maintaining and updating MorphoNet 2.0 based on user feedback to further support its usability and impact.
 
 In conclusion, I believe that this report is fundamental because it will be the major way of initially promoting the use of MorphoNet 2.0 among its target audience. The software itself holds the promise of being very impactful for the microscopists' community.
 
 AuthorResponse

Tags

Summary

Review 1

Review 3

AuthorResponse

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.21.639560v1
www.biorxiv.org www.biorxiv.org

Making plant tissue accessible for cryo-electron tomography

4
1. Public_Reviews 12 May 2025
  
  in eLife
  
  eLife Assessment
  
  Thick multicellular plant samples provide unique challenges when it comes to cryo-preservation, which has resulted in limited successful examples for structural studies using in situ cryo-electron tomography. To address this deficiency, this important study describes procedures for high-pressure-freezing, focused ion-beam milling, and cryo-electron tomography imaging of certain plant types. The results described in the paper provide solid evidence for the usefulness of the methods described, although some reservations remain about the applicability of the methods to a wider range of plant cell types.
  
  Summary
2. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This in situ cryo-ET workflow of selected plant structures provides several detailed strategies using plunge-freezing and the HPF waffle method and lift-out for notoriously difficult samples (compared to cell culture, yeast, and algae, which are far more prevalent in the literature).
  
  Strengths:
  
  The authors take on a very difficult challenge and demonstrate successful vitrification of selected plants/structures using waffle and lift-out approaches for cryo-ET. Because there are relatively few examples of multicellular plant cryo-ET in the literature, it is important for the scientific community to be motivated and to have demonstrated strategies showing that it is achievable. This manuscript has a number of very helpful graphics and videos to guide researchers interested in undertaking this work, which should help shorten the learning curve of admittedly tedious and complex workflows. This is a slow and tedious process, but you have to start somewhere, and I applaud the authors for sharing their experiences with others; I expect this will help other early adopters come up to speed sooner.
  
  Weaknesses:
  
  While important, the specific specimens and cell types that were successful with this approach (perhaps other plant specimens and tissues were tried unsuccessfully and thus not reported) did not demonstrate that it is broadly applicable to other, much more prevalent, interesting, and intensively studied areas of plant biology and plant structures (some mentioned in more detail below).
  
  This manuscript is essentially a protocol paper and, in its paragraph form and even with great graphics, will definitely be difficult to follow and reproduce for a non-expert. Also, considering the use of 3 different FIB-SEM platforms and 2 different cryo-FLM platforms, I wonder if a master graphic of the full workflow(s) could be prepared as a supplementary document that walks through the major steps and points to the individual figures at the critical steps to make it more accessible to the broader readership.
  
  Multiple times in the manuscript, important workflow details seemed to point to and be dependent on two "unpublished" manuscripts:
  
  (1) Lines 583, 755, 790, 847-848 (Poge et al., will soon be published as a protocol).
  
  (2) Lines 140, 695, 716 (Capitanio et al., will soon be described in a manuscript).
  
  It is not clear if/when these would be publicly available. It may be important to wait until these papers can be included in published form.
  
  Review 1
3. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Poge et al. present a workflow for studying plant tissue by combining high-pressure freezing, cryo-fluorescence microscopy, FIB milling, and cryo-electron tomography (cryo-ET). They tested various plant tissues, including Physcomitrium patens, Arabidopsis thaliana, and Limonium bicolor. The authors successfully produce thin lamellae suitable for cryo-ET studies. Using sub-tomogram averaging, they determined the Rubisco structure at subnanometer resolution, demonstrating the potential of this workflow for plant tissue studies.
  
  Strengths:
  
  This manuscript is likely the first to systematically apply FIB milling and cryo-ET to plant tissue samples. It provides a detailed methodological description, which is not only valuable for plant tissue studies but also adaptable to a broader range of biological tissue samples. The study compares the plunge freezing method with a high-pressure freezing method, demonstrating that high-pressure freezing can vitrify thick tissues while preserving their native state. Additionally, the authors explore two methods for plant tissue sample preparation: the "waffle" method and in-carrier high-pressure freezing combined with the "lift-out" approach. The "waffle" method is suitable for samples less than 25 µm thick, while the in-carrier high-pressure freezing method can process samples up to 100 µm thick.
  
  Weaknesses:
  
  The described workflow is very complicated and requires special expertise. The success rate of this workflow is not very high, particularly for high-pressure freezing and lift-out technology. Further improvements are needed for automation and increasing throughput.
  
  Review 2
4. Public_Reviews 12 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The authors aimed to improve cryo-TEM workflows for plant cells. The authors present details on high-pressure-freezing protocols to vitrify, ion-mill, and image certain plant cell types.
  
  Strengths:
  
  Clear step-by-step outline on how to preserve and image cryo samples derived from plants.
  
  Weaknesses:
  
  A general current weakness of cryo-TEM is the problem of vitrifying cells that are embedded in tissues. The vast majority of cells in the plant body are currently not accessible to this technology. This is not a weakness of this specific manuscript but a general problem.
  
  The manuscript is well organized and well written, and the discussion covers practically all questions I had while reading the results section. I only have a few comments, all of which I consider minor.
  
  Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.14.638237v1
osf.io osf.io

Disinformation elicits learning biases

4
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study provides a valuable extension of credibility-based learning research by showing how feedback reliability can distort reward-learning biases in a disinformation-like bandit task. Although the paradigm is well controlled and the computational modelling rigorous, the evidential support is incomplete: key claims about learning from 50%-credible feedback and heightened positivity bias at low credibility hinge on a single dataset, specific parameter definitions, and modelling assumptions not fully validated across studies. Clearer reporting of the discovery-study null result, behavioural tests of positivity bias, and standard information-criterion model comparisons are needed to solidify the conclusions and enhance generalizability.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This is a well-designed and very interesting study examining the impact of imprecise feedback on outcomes in decision-making. I think this is an important addition to the literature, and the results here, which provide a computational account of several decision-making biases, are insightful and interesting.
 
 I do not believe I have substantive concerns related to the actual results presented; my concerns are more related to the framing of some of the work. My main concern is regarding the assertion that the results prove that non-normative and non-Bayesian learning is taking place. I agree with the authors that their results demonstrate that people will make decisions in ways that demonstrate deviations from what would be optimal for maximizing reward in their task under a strict application of Bayes' rule. I also agree that they have built reinforcement learning models that do a good job of accounting for the observed behavior. However, the Bayesian models included are rather simple, per the authors' descriptions, applications of Bayes' rule with either fixed or learned credibility for the feedback agents. In contrast, several versions of the RL models are used, each modified to account for different possible biases. However, more complex Bayes-based models exist, notably active inference, but even the hierarchical Gaussian filter. These formalisms are able to accommodate more complex behavior, such as affect and habits, which might make them more competitive with RL models. I think it is entirely fair to say that these results demonstrate deviations from an idealized and strict Bayesian context; however, the equivalence here of Bayesian and normative is, I think, misleading or at least requires better justification/explanation. This is because a great deal of work has been done to show that Bayes optimal models can generate behavior or other outcomes that are clearly not optimal to an observer within a given context (consider hallucinations for example) but which make sense in the context of how the model is constructed as well as the priors and desired states the model is given.
 
 As such, I would recommend that the language be adjusted to carefully define what is meant by normative and Bayesian and to recognize that work that is clearly Bayesian could potentially still be competitive with RL models if implemented to model this task. An even better approach would be to directly use one of these more complex modelling approaches, such as active inference, as the comparator to the RL models, though I would understand if the authors would want this to be a subject for future work.
 
 Abstract:
 
 The abstract is lacking in some detail about the experiments done, but this may be a limitation of the required word count. If word count is not an issue, I would recommend adding details of the experiments done and the results. One comment is that there is an appeal to normative learning patterns, but this suggests that learning patterns have a fixed optimal nature, which may not be true in cases where the purpose of the learning (e.g. to confirm the feeling of safety of being in an in-group) may not be about learning accurately to maximize reward. This can be accommodated in a Bayesian framework by modelling priors and desired outcomes. As such, the central premise that biased learning is inherently non-normative or non-Bayesian, I think, would require more justification. This is true in the introduction as well.
 
 Introduction:
 
 As noted above, the conceptualization of Bayesian learning as equivalent to normative learning, I think, requires further justification. Bayesian belief updating can be biased and non-optimal from an observer perspective, while being optimal within the agent doing the updating if the priors/desired outcomes are set up to advantage these "non-optimal" modes of decision making.
 
 Results:
 
 I wonder why the agent was presented before the choice, since the agent is only relevant to the feedback after the choice is made. I wonder if that might have induced any false association between the agent identity and the choice itself. This is by no means a critical point, but it would be interesting to get the authors' thoughts.
 
 The finding that positive feedback increases learning is one that has been shown before and depends on valence, as the authors note. They expanded their reinforcement learning model to include valence, but they did not modify the Bayesian model in a similar manner. This lack of a valence or recency effect might also explain the failure of the Bayesian models in the preceding section, where the contrast effect is discussed. It is not unreasonable to imagine that if humans do employ Bayesian reasoning that this reasoning system has had parameters tuned based on the real world, where recency of information does matter; affect has also been shown to be incorporable into Bayesian information processing (see the work by Hesp on affective charge and the large body of work by Ryan Smith). It may be that the Bayesian models chosen here require further complexity to capture the situation, just like some of the biases required updates to the RL models. This complexity, rather than being arbitrary, may be well justified by decision-making in the real world.
 
 The methods mention several symptom scales; it would be interesting to have the results of these and any interesting correlations noted. It is possible that some of the individual variability here could be related to these symptoms, which could introduce precision parameter changes in a Bayesian context and things like reward sensitivity changes in an RL context.
 
 Discussion:
 
 (For discussion, not a specific comment on this paper): One wonders also about participants' beliefs about the experiment or the intent of the experimenters. I have often had participants tell me they were trying to "figure out" a task or find patterns even when this was not part of the experiment. This is not specific to this paper, but it may be relevant in the future to try and model participant beliefs about the experiment especially in the context of disinformation, when they might be primed to try and "figure things out".
 
 As a general comment, in the active inference literature, there has been discussion of state-dependent actions, or "habits", which are learned in order to help agents more rapidly make decisions, based on previous learning. It is also possible that what is being observed is that these habits are at play, and that they represent the cognitive biases. This is likely especially true given, as the authors note, the high cognitive load of the task. It is true that this would mean that full-force Bayesian inference is not being used in each trial, or in each experience an agent might have in the world, but this is likely adaptive on the longer timescale of things, considering resource requirements. I think in this case you could argue that we have a departure from "normative" learning, but that is not necessarily a departure from any possible Bayesian framework, since these biases could potentially be modified by the agent or eschewed in favor of more expensive full-on Bayesian learning when warranted.
 
 Indeed, in their discussion on the strategy of amplifying credible news sources to drown out low-credibility sources, the authors hint at the possibility of longer-term strategies that may produce optimal outcomes in some contexts, but which were not necessarily appropriate to this task. As such, the performance on this task, and the consideration of true departure from Bayesian processing, should be considered in this wider context.
 
 Another thing to consider is that Bayesian inference is occurring, but that priors present going in produce the biases, or these biases arise from another source, for example, factoring in epistemic value over rewards when the actual reward is not large. This again would be covered under an active inference approach, depending on how the priors are tuned. Indeed, given the benefit of social cohesion in an evolutionary perspective, some of these "biases" may be the result of adaptation. For example, it might be better to amplify people's good qualities and minimize their bad qualities in order to make it easier to interact with them; this entails a cost (in this case, not adequately learning from feedback and potentially losing out sometimes), but may fulfill a greater imperative (improved cooperation on things that matter). Given the right priors/desired states, this could still be a Bayes-optimal inference at a social level and, as such, may be ingrained as a habit that requires effort to break at the individual level during a task such as this.
 
 The authors note that this task does not relate to "emotional engagement" or "deep, identity-related issues". While I agree that this is likely mostly true, it is also possible that just being told one is being lied to might elicit an emotional response that could bias responses, even if this is a weak response.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 This valuable paper studies the problem of learning from feedback given by sources of varying credibility. The solid combination of experiment and computational modeling helps to pin down properties of learning, although some ambiguity remains in the interpretation of results.
 
 Summary:
 
 This paper studies the problem of learning from feedback given by sources of varying credibility. Two bandit-style experiments are conducted in which feedback is provided with uncertainty, but from known sources. Bayesian benchmarks are provided to assess normative facets of learning, and alternative credit assignment models are fit for comparison. Some aspects of normativity appear, in addition to deviations such as asymmetric updating from positive and negative outcomes.
 
 Strengths:
 
 The paper tackles an important topic, with a relatively clean cognitive perspective. The construction of the experiment enables the use of computational modeling. This helps to pinpoint quantitatively the properties of learning and formally evaluate their impact and importance. The analyses are generally sensible, and parameter recovery analyses help to provide some confidence in the model estimation and comparison.
 
 Weaknesses:
 
 (1) The approach in the paper overlaps somewhat with various papers, such as Diaconescu et al. (2014) and Schulz et al. (forthcoming), which also consider the Bayesian problem of learning and applying source credibility, in terms of theory and experiment. The authors should discuss how these papers are complementary, to better provide an integrative picture for readers.
 
 Diaconescu, A. O., Mathys, C., Weber, L. A., Daunizeau, J., Kasper, L., Lomakina, E. I., ... & Stephan, K. E. (2014). Inferring the intentions of others by hierarchical Bayesian learning. PLoS computational biology, 10(9), e1003810. Schulz, L., Schulz, E., Bhui, R., & Dayan, P. Mechanisms of Mistrust: A Bayesian Account of Misinformation Learning. https://doi.org/10.31234/osf.io/8egxh
 
 (2) It isn't completely clear what the "cross-fitting" procedure accomplishes. Can this be discussed further?
 
 (3) The Credibility-CA model seems to fit the same as the free-credibility Bayesian model in the first experiment and barely better in the second experiment. Why not use a more standard model comparison metric like the Bayesian Information Criterion (BIC)? Even if there are advantages to the bootstrap method (which should be described if so), the BIC would help for comparability between papers.
 
 (4) As suggested in the discussion, the updating based on random feedback could be due to the interleaving of trials. If one is used to learning from the source on most trials, the occasional random trial may be hard to resist updating from. The exact interleaving structure should also be clarified (I assume different sources were shown for each bandit pair). This would also relate to work on RL and working memory: Collins, A. G., & Frank, M. J. (2012). How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis. European Journal of Neuroscience, 35(7), 1024-1035.
 
 (5) Why does the choice-repetition regression include "only trials for which the last same-pair trial featured the 3-star agent and in which the context trial featured a different bandit pair"? This could be stated more plainly.
 
 (6) Why apply the "Truth-CA" model and not the Bayesian variant that it was motivated by?
 
 (7) "Overall, the results from this study support the exact same conclusions (See SI section 1.2) but with one difference. In the discovery study, we found no evidence for learning based on 50%-credibility feedback when examining either the feedback effect on choice repetition or CA in the credibility-CA model (SI 1.2.3)" - this seems like a very salient difference, when the paper reports the feedback effect as a primary finding of interest, though I understand there remains a valence-based difference.
 
 (8) "Participants were instructed that this feedback would be 'a lie 50% of the time' but were not explicitly told that this meant it was random and should therefore be disregarded." - I agree that this is a possible explanation for updating from the random source. It is a meaningful caveat.
 
 (9) "Future studies should investigate conditions that enhance an ability to discard disinformation, such as providing explicit instructions to ignore misleading feedback, manipulations that increase the time available for evaluating information, or interventions that strengthen source memory." - there is work on some of this in the misinformation literature that should be cited, such as the "continued influence effect". For example: Johnson, H. M., & Seifert, C. M. (1994). Sources of the continued influence effect: When misinformation in memory affects later inferences. Journal of experimental psychology: Learning, memory, and cognition, 20(6), 1420.
 
 (10) Are the authors arguing that choice-confirmation bias may be at play? Work on choice-confirmation bias generally includes counterfactual feedback, which is not present here.
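For reference on point (3), a minimal sketch of a BIC comparison follows. The log-likelihoods, parameter counts, and trial count are hypothetical placeholders chosen for illustration, not values from the paper; the model names are used only as labels.

```python
import math


def bic(log_likelihood, n_params, n_obs):
    """Bayesian Information Criterion: k * ln(n) - 2 * ln(L). Lower is better."""
    return n_params * math.log(n_obs) - 2.0 * log_likelihood


# Hypothetical fit summaries (NOT taken from the paper).
fits = {
    "credibility_ca": {"log_likelihood": -410.0, "n_params": 6},
    "free_credibility_bayes": {"log_likelihood": -415.0, "n_params": 4},
}
n_trials = 600  # hypothetical number of choices per participant

scores = {
    name: bic(fit["log_likelihood"], fit["n_params"], n_trials)
    for name, fit in fits.items()
}
best = min(scores, key=scores.get)
print(scores, best)
```

With these made-up numbers, the model with the slightly better likelihood loses once its two extra parameters are penalized, which is the kind of outcome a bootstrap-only comparison could leave unclear.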
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary
 
 This paper investigates how disinformation affects reward learning processes in the context of a two-armed bandit task, where feedback is provided by agents with varying reliability (with lying probability explicitly instructed). They find that people learn more from credible sources, but also deviate systematically from optimal Bayesian learning: They learned from uninformative random feedback, learned more from positive feedback, and updated too quickly from fully credible feedback (especially following low-credibility feedback). Overall, this study highlights how misinformation could distort basic reward learning processes, without appeal to higher-order social constructs like identity.
 
 Strengths
 
 (1) The experimental design is simple and well-controlled; in particular, it isolates basic learning processes by abstracting away from social context.
 
 (2) Modeling and statistics meet or exceed the standards of rigor.
 
 (3) Limitations are acknowledged where appropriate, especially those regarding external validity.
 
 (4) The comparison model, Bayes with biased credibility estimates, is strong; deviations are much more compelling than e.g., a purely optimal model.
 
 (5) The conclusions are interesting, in particular the finding that positivity bias is stronger when learning from less reliable feedback (although I am somewhat uncertain about the validity of this conclusion)
 
 Weaknesses
 
 (1) Absolute or relative positivity bias?
 
 In my view, the biggest weakness in the paper is that the conclusion of greater positivity bias for less credible feedback (Figure 5) hinges on the specific way in which positivity bias is defined. Specifically, we only see the effect when normalizing the difference in sensitivity to positive vs. negative feedback by the sum. I appreciate that the authors present both and add the caveat whenever they mention the conclusion (with the crucial exception of the abstract). However, what we really need here is an argument that the relative definition is the *right* way to define asymmetry....
 
 Unfortunately, my intuition is that the absolute difference is a better measure. I understand that the relative version is common in the RL literature; however, previous studies have used standard TD models, whereas the current model updates based on the raw reward. The role of the CA parameter is thus importantly different from a traditional learning rate - in particular, it's more like a logistic regression coefficient (as described below) because it scales the feedback but *not* the decay. Under this interpretation, a difference in positivity bias across credibility conditions corresponds to a three-way interaction between the exponentially weighted sum of previous feedback of a given type (e.g., positive from the 75% credible agent), feedback positivity, and condition (dummy coded). This interaction corresponds to the non-normalized, absolute difference.
 
 Importantly, I'm not terribly confident in this argument, but it does suggest that we need a compelling argument for the relative definition.
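To make the distinction concrete, here is a minimal sketch of the two definitions. The CA values are hypothetical and chosen only to show that the two measures can disagree about which condition is "more biased"; they are not estimates from the paper.

```python
def absolute_bias(ca_pos, ca_neg):
    """Non-normalized asymmetry: difference in sensitivity to positive vs. negative feedback."""
    return ca_pos - ca_neg


def relative_bias(ca_pos, ca_neg):
    """Normalized asymmetry: the same difference divided by the sum."""
    return (ca_pos - ca_neg) / (ca_pos + ca_neg)


# Hypothetical credit-assignment parameters (NOT from the paper).
high_cred = {"ca_pos": 2.0, "ca_neg": 1.0}
low_cred = {"ca_pos": 0.6, "ca_neg": 0.2}

for label, ca in (("high", high_cred), ("low", low_cred)):
    print(label, absolute_bias(**ca), relative_bias(**ca))
```

In this example the relative bias is larger in the low-credibility condition while the absolute bias is larger in the high-credibility condition, so the direction of the credibility effect depends entirely on the chosen definition.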
 
 (2) Positivity bias or perseveration?
 
 A key challenge in interpreting many of the results is dissociating perseveration from other learning biases. In particular, a positivity bias (Figure 5) and perseveration will both predict a stronger correlation between positive feedback and future choice. Crucially, the authors do include a perseveration term, so one would hope that perseveration effects have been controlled for and that the CA parameters reflect true positivity biases. However, with finite data, we cannot be sure that the variance will be correctly allocated to each parameter (cf. collinearity in regressions). The fact that CA- is fit to be negative for many participants (a pattern shown more strongly in the discovery study) suggests that this might be happening. A priori, the idea that you would ever increase your value estimate after negative feedback is highly implausible, which suggests that the parameter might be capturing variance beyond what it is intended to capture.
 
 The best way to resolve this uncertainty would involve running a new study in which feedback was sometimes provided in the absence of a choice - this would isolate positivity bias. Short of that, perhaps one could fit a version of the Bayesian model that also includes perseveration. If the authors can show that this model cannot capture the pattern in Figure 5, that would be fairly convincing.
 
 (3) Veracity detection or positivity bias?
 
 The "True feedback elicits greater learning" effect (Figure 6) may be simply a re-description of the positivity bias shown in Figure 5. This figure shows that people have higher CA for trials where the feedback was in fact accurate. But, assuming that people tend to choose more rewarding options, true-feedback cases will tend to also be positive-feedback cases. Accordingly, a positivity bias would yield this effect, even if people are not at all sensitive to trial-level feedback veracity. Of course, the reverse logic also applies, such that the "positivity bias" could actually reflect discounting of feedback that is less likely to be true. This idea has been proposed before as an explanation for confirmation bias (see Pilgrim et al, 2024 https://doi.org/10.1016/j.cognition.2023.105693 and much previous work cited therein). The authors should discuss the ambiguity between the "positivity bias" and "true feedback" effects within the context of this literature....
 
 The authors get close to this in the discussion, but they characterize their results as differing from the predictions of rational models, the opposite of my intuition. They write:
 
 Alternative "informational" (motivation-independent) accounts of positivity and confirmation bias predict a contrasting trend (i.e., reduced bias in low- and medium credibility conditions) because in these contexts it is more ambiguous whether feedback confirms one's choice or outcome expectations, as compared to a full-credibility condition.
 
 I don't follow the reasoning here at all. It seems to me that the possibility for bias will increase with ambiguity (or perhaps will be maximal at intermediate levels). In the extreme case, when feedback is fully reliable, it is impossible to rationally discount it (illustrated in Figure 6A). The authors should clarify their argument or revise their conclusion here.
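A minimal sketch of the confound described in this point, under simplifying assumptions that are mine rather than the paper's: rewards are binary, the chosen option pays off with probability `reward_prob`, and the agent reports the true outcome with probability `credibility` and otherwise flips it, independently of the outcome.

```python
def p_true_given_feedback(reward_prob, credibility):
    """Posterior probability that feedback is truthful, split by feedback valence.

    Assumes binary outcomes and lies that flip the outcome independently of it.
    """
    pos_true = reward_prob * credibility                   # real reward, reported honestly
    pos_false = (1.0 - reward_prob) * (1.0 - credibility)  # no reward, reported as reward
    neg_true = (1.0 - reward_prob) * credibility           # no reward, reported honestly
    neg_false = reward_prob * (1.0 - credibility)          # real reward, reported as none
    return {
        "positive": pos_true / (pos_true + pos_false),
        "negative": neg_true / (neg_true + neg_false),
    }


# Hypothetical values: a frequently rewarded option and a 75%-credible agent.
print(p_true_given_feedback(reward_prob=0.75, credibility=0.75))
# Fully credible agent: no room for rational discounting.
print(p_true_given_feedback(reward_prob=0.75, credibility=1.0))
```

Under these assumptions, positive feedback about a good option is more likely to be true than negative feedback whenever credibility is below 1, so a "true feedback" effect and a positivity bias are hard to tell apart; at full credibility both posteriors equal 1 and the asymmetry vanishes.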
 
 (4) Disinformation or less information?
 
 Zooming out, from a computational/functional perspective, the reliability of feedback is very similar to reward stochasticity (the difference is that reward stochasticity decreases the importance/value of learning in addition to its difficulty). I imagine that many of the effects reported here would be reproduced in that setting. To my surprise, I couldn't quickly find a study asking that precise question, but if the authors know of such work, it would be very useful to draw comparisons. To put a finer point on it, this study does not isolate which (if any) of these effects are specific to *disinformation*, rather than simply _less information._ I don't think the authors need to rigorously address this in the current study, but it would be a helpful discussion point.
 
 (5) Over-reliance on analyzing model parameters
 
 Most of the results rely on interpreting model parameters, specifically, the "credit assignment" (CA) parameter. Exacerbating this, many key conclusions rest on a comparison of the CA parameters fit to human data vs. those fit to simulations from a Bayesian model. I've never seen anything like this, and the authors don't justify or even motivate this analysis choice. As a general rule, analyses of model parameters are less convincing than behavioral results because they inevitably depend on arbitrary modeling assumptions that cannot be fully supported. I imagine that most or even all of the results presented here would have behavioral analogues. The paper would benefit greatly from the inclusion of such results. It would also be helpful to provide a description of the model in the main text that makes it very clear what exactly the CA parameter is capturing (see next point).
 
 (6) RL or regression?
 
 I was initially very confused by the "RL" model because it doesn't update based on the TD error. Consequently, the "Q values" can go beyond the range of possible reward (SI Figure 5). These values are therefore *not* Q values, which are defined as expectations of future reward ("action values"). Instead, they reflect choice propensities, which are sometimes notated $h$ in the RL literature. This misuse of notation is unfortunately quite common in psychology, so I won't ask the authors to change the variable. However, they should clarify when introducing the model that the Q values are not action values in the technical sense. If there is precedent for this update rule, it should be cited.
 
 Although the change is subtle, it suggests a very different interpretation of the model.
 
 Specifically, I think the "RL model" is better understood as a sophisticated logistic regression, rather than a model of value learning. Ignoring the decay term, the CA term is simply the change in log odds of repeating the just-taken action in future trials (the change is negated for negative feedback). The PERS term is the same, but ignoring feedback. The decay captures that the effect of each trial on future choices diminishes with time. Importantly, however, we can re-parameterize the model such that the choice at each trial is a logistic regression where the independent variables are an exponentially decaying sum of feedback of each type (e.g., positive-cred50, positive-cred75, ... negative-cred100). The CA parameters are simply coefficients in this logistic regression.
 
 Critically, this is not meant to "deflate" the model. Instead, it clarifies that the CA parameter is actually not such an assumption-laden model estimate. It is really quite similar to a regression coefficient, something that is usually considered "model agnostic". It also recasts the non-standard "cross-fitting" approach as a very standard comparison of regression coefficients for model simulations vs. human data. Finally, using different CA parameters for true vs false feedback is no longer a strange and implausible model assumption; it's just another (perfectly valid) regression. This may be a personal thing, but after adopting this view, I found all the results much easier to understand.
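 The equivalence described in point (6) can be checked numerically. The following is a minimal sketch, not the authors' model: it assumes a two-action task, feedback coded as +1/-1, one CA coefficient per credibility level (the review also allows separate coefficients for positive and negative feedback), and an update that first decays all propensities and then bumps the chosen one. All names and parameter values are hypothetical.

```python
import math
import random

def recursive_propensities(trials, ca, pers, decay):
    """Trial-by-trial update: decay every propensity, then bump the chosen one."""
    h = [0.0, 0.0]
    history = []
    for action, feedback, cred in trials:
        history.append(tuple(h))  # propensities available when this trial's choice is made
        h = [(1.0 - decay) * v for v in h]
        h[action] += ca[cred] * feedback + pers
    return history

def regression_propensities(trials, ca, pers, decay):
    """Same quantity written as coefficients times exponentially decayed regressors."""
    history = []
    for t in range(len(trials)):
        h = [0.0, 0.0]
        for a in (0, 1):
            x = {c: 0.0 for c in ca}  # decayed feedback sum per credibility level
            p = 0.0                   # decayed count of past choices of action a
            for s in range(t):
                action, feedback, cred = trials[s]
                if action == a:
                    w = (1.0 - decay) ** (t - 1 - s)
                    x[cred] += w * feedback
                    p += w
            h[a] = sum(ca[c] * x[c] for c in ca) + pers * p
        history.append(tuple(h))
    return history

def p_choose_first(h):
    """Logistic choice rule on the propensity difference."""
    return 1.0 / (1.0 + math.exp(-(h[0] - h[1])))

rng = random.Random(0)
trials = [(rng.randint(0, 1), rng.choice([-1, 1]), rng.choice([50, 75, 100]))
          for _ in range(40)]
ca = {50: 0.2, 75: 0.6, 100: 1.0}  # hypothetical CA coefficients
rec = recursive_propensities(trials, ca, pers=0.3, decay=0.1)
reg = regression_propensities(trials, ca, pers=0.3, decay=0.1)
max_gap = max(abs(r - g) for hr, hg in zip(rec, reg) for r, g in zip(hr, hg))
print(max_gap)  # the two formulations agree up to floating-point error
```

 Under these assumptions the propensities are linear in the CA and PERS parameters once the decay is fixed, which is why the CA values behave like coefficients in a logistic regression on decayed feedback histories.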
 
 Review 3

Annotators

Public_Reviews

URL

osf.io/preprints/osf/st4kg_v1
www.biorxiv.org www.biorxiv.org

Cellular and synaptic organization of the Octopus vertical lobe

5
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This important study of the inhibitory complex amacrines (CAM) in the vertical lobe of Octopus vulgaris delivers a solid standard for the structural characterization of an anatomical region likely to be key for memory processing in this unconventional but complex organism, as well as a helpful classification of CAM subtypes. This work will be of broad relevance to the fields of memory and evolutionary neuroscience.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The authors identified five complex amacrine cell (CAM) subtypes based on their morphology and synaptic connectivity. It's suggested that the differences in structure may be directly correlated with different functional roles. The authors also describe synaptic compartmentalization in the SFL tract relating to three types of CAM input regions, again implying a specialized role for these cells. The authors also identified neural progenitor cells, which suggests that the octopus's vertical lobe can undergo neurogenesis throughout its life.
 
 The work presented here is valuable and convincing. Below are some suggestions the authors may wish to incorporate:
 
 a) Quantitative measurements to define the CAM subtypes

 I think the categorization of the CAMs into five subtypes is convincing; however, I wonder how easily these categories could be identified by other researchers. Would it be possible for the authors to include additional quantitative measurements of these cell types to make their categorization less qualitative and more quantitative? For example, density, volume, and orientation of their dendritic fields?
 
 b) The definition of the neuritic backbone is included in the methods, but I found the term confusing when I first encountered it in the results, so I would suggest adding the definition to the results too.
 
 c) The authors wrote, 'Note that given the pronounced difference in diameters between the neuritic backbones (208.27 +/-87.95 nm) and axons (121.55 +/- 21.28 nm)'. What figure is this in?
 
 d) I am slightly confused about how the authors decided on the specific cubes to reflect the different synaptic compartments in the SFL tract. Is this organisation arranged/repeated vertically or horizontally throughout the SFL tract? The location of the cubes looks to me to be chosen at random, so more information here would be helpful.
 
 e) In Figure 2, could the authors plot the number of synapses per cube to make the result clearer, so that cube 1 has the lowest synaptic density and cube 2 has the highest?
 
 f) SAMs are ACh and excitatory

 The authors refer to SAMs as excitatory cholinergic. They should provide more detailed explanations/citations to back up this claim. Could SAMs be synthesizing any other neurotransmitters? Could there be a subpopulation of inhibitory SAMs?
 
 g) CAMs are GABA and inhibitory
 
 The 5 subtypes of CAMs described here have never been directly confirmed to be GABAergic. Could CAMs be synthesizing any other neurotransmitters? Could a subpopulation of CAMs be excitatory? I believe the authors should make this clearer to readers when referring to CAMs, perhaps by saying, 'hypothesized to be inhibitory neurons', or 'putative inhibitory neurons'.
 
 h) Fast neurotransmitters and neuromodulators

 The authors refer to neuromodulatory connections in their summary in Figure 4; however, cephalopod receptors have yet to be extensively functionally characterized, so the role different molecules play as neurotransmitters or neuromodulators is not yet known. For example, many invertebrates are known to have functional diversity in their receptors: C. elegans has both excitatory and inhibitory receptors for a range of neurotransmitters, and anionic ACh- and glutamate-gated channels and cationic peptide-gated channels have also been identified in some molluscs. The authors should therefore be cautious in speculating about how a particular transmitter/modulator acts in the octopus brain.
 
 i) In the methods, the authors refer to "an adult Octopus". What age and size was it? I also know this is Octopus vulgaris, but it would be good to specify it here.
 
 j) A general comment about all figures. All panels should have a letter associated with them to make it easier to refer to them in the text. For example, in Figure 4, please also add letters to the main schematic, the CAM subtypes, and the VL wiring diagram. In addition, D and E are missing boxes on the main schematic. It's also not immediately obvious that A-E are zooms of the larger schematic; perhaps this could be made clearer with colours or arrows. Please also add names to the CAM subtypes.
 
 a) Typo: 'Additionally, the unique characteristics of LTP in the octopus VL, such as its reliance on a NO-dependent mechanism, independent of de novo protein synthesis, persistent activation of (Turchetti-Maia et al., 2018).'
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The paper examines the diversity of complex amacrine neurons in the vertical lobe of the adult octopus brain, a structure involved in learning and memory. The work builds on a recent paper by the authors that described the connectivity of the much larger population of simple amacrine (SAM) interneurons from the same pioneering EM volume.
 
 Strengths:
 
 While the EM volume only provides a snapshot of a tiny fraction of an adult octopus' brain, the authors can make specific conclusions and formulate precise hypotheses about neuron function, synaptic pathways, and developmental trajectories. One example is the reconstruction of a putative maturation sequence for the SAM neuronal lineage, based on the correlation of soma position and the number of synapses, uncovering a plausible developmental sequence of cell morphologies, with interesting parallels to vertebrate neurogenesis.
 
 Weaknesses:
 
 The weakness of the study is that it is examining a relatively small volume (260 × 390 × 27 µm), and several neurons are only incompletely reconstructed. It also remains unclear approximately how many neurons remain to be reconstructed from this volume.
 
 To improve the presentation, the authors should consider showing videos with the volumetric reconstructions of the different types with their partners/synapses and their relation to the SFL tract and SAMs. Such videos would help the reader to appreciate the morphological differences between the cell types. The authors could also consider carrying out further morphological analyses to strengthen their cell-type classification, including Sholl value, radial density of input and output synapses, the number of branch nodes, and similar measures.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 (1) The authors described "the excitatory glutamatergic SFL axons and cholinergic SAM inputs". However, compelling evidence of their transmitter specificity was neither provided nor discussed in the context of the study.
 
 (2) Specific inferences about inhibitory or excitatory synapses, whether based on EM or on other studies, must be detailed and elaborated.
 
 (3) Different local microcircuits (submodules) referred to in the text should be better described and more specifically defined.
 
 (4) I would recommend incorporating a more detailed description of synapses and, especially, synaptic vesicles, clarifying their diversity and similarity across cell subtypes. Are there any differences between cholinergic and glutamatergic synaptic vesicles, postsynaptic densities, or other features? It would be good, if possible, to explicitly clarify: how many vesicles are there per synapse of each type? How many synapses per neuron of each type? How many inputs and outputs per neuron?
 
 (5) The authors discuss retrograde messengers like NO. Is there any identifiable morphological type of neuron(s) or synapses that might be nitrergic?
 
 (6) It would be good to provide separate illustrations showing the detailed organization of any glial cell or different types of glial cells they identified in this study. The authors mainly discuss glial processes but refer to "recognized glial types, such as radial glia and astrocyte-like glia" without specific illustrations, which could be derived from their EM data. What is the vesicular organization within the different types of glial cells?
 
 (7) The authors also discuss "supervising inputs of inhibitory (pain) and neuromodulatory (supervising) signals", without any details. It would be important to provide these details in the discussion. Specifically, I suggest incorporating comments about differences/similarities of transmitters and morphology between pain and modulatory pathways/signaling/circuits.
 
 Review 3
5. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #4 (Public review):
 
 Summary:
 
 The authors present a follow-up to their initial publication of a volume EM reconstruction of a part of the Octopus vulgaris vertical lobe (VL) (Bidel, Meirovitch et al., eLife 2023). In their previous study, they presented a swath of novel observations pertaining to the neuron types making up the VL and their synaptic connectivity. Here, the authors present an extension of those findings in which they (1) demonstrate that the Complex Amacrine cells (CAMs), which they identified previously, can be grouped into at least 5 distinct subclasses; (2) show that there appear to be distinct compartments in the SFL tract that contain specific synapse types; and (3) present morphological evidence that there may be a neurogenic niche in the VL. The findings are intriguing, advance our understanding of memory circuitry in octopus and across the phylogenetic tree, and open new avenues for deeper investigation.
 
 Strengths:
 
 A deeper dissection of the morphologies of CAMs and their distinct complements of synapse types is valuable. The identification of multiple categories of CAMs makes it clearer how the very simple SFL-to-SAM connectivity is likely enriched by a population of diverse interneurons.
 
 The observation that synapse types may be compartmentalized in the superior frontal lobe tract is an intriguing one, and invites more extensive segmentation and future anatomical studies to further characterize the precise architecture of these compartments.
 
 Finally, the evidence of the possibility of a neurogenic niche in the VL is exciting as it suggests that ongoing neurogenesis may be a common feature of memory circuitry, perhaps contributing to keeping the representation space of the circuit flexible and adequately sparse.
 
 Weaknesses:
 
 A key weakness is the reconstruction and grouping of the CAMs:
 
 (1) CAMs are relatively few in number compared to SAMs, and as such, only 53 are reconstructed in this study. Of those 53 cells, 18 were not classified into one of the 5 categories the authors designate, raising the question of how robust those categories are.
 
 (2) Related to (1), in Figure 1B, the proportions given in the bar graph are given cumulatively across the entire population of each category. The proportions should be presented as means within each category to adequately capture the variability of the small sample sizes.
 
 (3) While the xy dimensions of the serial section EM volume are adequate to capture relatively whole cells and neuronal arbors, the volume is only 27µm thick. Thus, many neurite branches are likely truncated in the z-dimension. This may have contributed to ~1/3 of CAMs eluding categorization. However, it is hard to estimate the effect this may have had without knowing the extent of the truncation. It may be worth the authors' time to count the proportion of CAM neurites that are cut off at the edges of the volume.
 
 (4) The authors state that CAMs appear to have axons and dendrites based on neurite widths. This is an interesting finding, given that amacrine cells are generally thought to possess only one type of neurite, which both sends and receives synaptic potentials, and therefore deserves more attention. Is the distribution of neurite widths indeed bimodal? Can the axons and dendrites be differentiated by examining the presence and absence of synaptic vesicle pools, respectively?
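 The distinction raised in point (2) between proportions pooled across a category and means of per-cell proportions can be illustrated with a small sketch. The counts below are invented for illustration only and do not come from the manuscript.

```python
import statistics

# Hypothetical per-cell counts for one CAM category:
# (synapses of the type of interest, total synapses on that cell)
cells = [(2, 4), (90, 100), (1, 10)]

# Pooled ("cumulative") proportion: dominated by the cell with the most synapses
pooled = sum(k for k, _ in cells) / sum(n for _, n in cells)

# Per-cell proportions: each cell contributes equally, and the spread is visible
per_cell = [k / n for k, n in cells]
mean_prop = statistics.mean(per_cell)
sd_prop = statistics.stdev(per_cell)

print(f"pooled={pooled:.3f}, mean of cells={mean_prop:.3f}, sd={sd_prop:.3f}")
```

 With small samples, reporting the per-cell mean together with its spread shows how much individual cells within a category differ, which a single pooled bar hides.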
 
 In Figure 2, the compartmentalization of synapse types is intriguing; however, due to the 3D nature of the data, it is difficult to appreciate clearly from the panels presented. This is particularly true for the suggestion that glia may be forming a barrier around these compartments. This could be rectified by providing Neuroglancer links for these specific reconstructions (neurites, synapses, and glia).
 
 Lastly, although the identification of a putative neurogenic niche is tantalizing, morphological data alone is only an initial hint. Although the chances are slim, it would be more convincing if the authors could identify any actively dividing cells in the proposed niche. More likely, further work, for instance, immunofluorescence, which the lab has previously shown to be viable in octopus, will be needed to add weight to the claim.
 
 Review 4

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.01.29.635406v1
www.biorxiv.org www.biorxiv.org

A Stimulus-Computable Model for Audiovisual Perception and Spatial Orienting in Mammals

3
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This is an important study introducing a stimulus-computable model of multisensory perception that extends an existing framework to accept raw, stimulus-level inputs (i.e., image- and soundscape-computable). The author demonstrates how low-level correlation detection can drive both illusions and cue integration, and the model bridges diverse stimuli, behaviors, and species. The model and evidence provided are deemed generally convincing and of broad applicability, potentially impacting areas across neuroscience, psychology, and computational cognitive science. There are, however, certain aspects of the work considered incomplete, particularly as they relate to explaining details pertinent to model fitting.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 Parise presents another instantiation of the Multisensory Correlation Detector model that can now accept stimulus-level inputs. This is a valuable development as it removes researcher involvement in the characterization/labeling of features and allows analysis of complex stimuli with a high degree of nuance that was previously unconsidered (i.e., spatial/spectral distributions across time). The author demonstrates the power of the model by fitting data from dozens of previous experiments, including multiple species, tasks, behavioral modalities, and pharmacological interventions.
 
 Strengths:
 
 One of the model's biggest strengths, in my opinion, is its ability to extract complex spatiotemporal co-relationships from multisensory stimuli. These relationships have typically been manually computed or assigned based on stimulus condition and often distilled to a single dimension or even a single number (e.g., "-50 ms asynchrony"). Thus, many models of multisensory integration depend heavily on human preprocessing of stimuli, and these models miss out on complex dynamics of stimuli; the lead modality distribution apparent in Figures 3b and c is provocative. I can imagine the model revealing interesting characteristics of the facial distribution of correlation during continuous audiovisual speech that have up to this point been largely described as "present" and almost solely focused on the lip area.
 
 Another aspect that makes the MCD stand out among other models is the biological inspiration and generalizability across domains. The model was developed to describe a separate process - motion perception - and in a much simpler organism - Drosophila. It could then describe a very basic neural computation that has been conserved across phylogeny (which is further demonstrated in the ability to predict rat, primate, and human data) and brain area. This aspect makes the model likely able to account for much more than what has already been demonstrated with only a few tweaks akin to the modifications described in this and previous articles from Parise.
 
 What allows this potential is that, as Parise and colleagues have demonstrated in those papers since our (re)introduction of the model in 2016, the MCD model is modular - both in its ability to interface with different inputs/outputs and its ability to chain MCD units in a way that can analyze spatial, spectral, or any other arbitrary dimension of a stimulus. This fact leaves wide open the possibilities for types of data, stimuli, and tasks a simplistic, neurally inspired model can account for.
 
 And so it's unsurprising (but impressive!) that Parise has demonstrated the model's ability here to account for such a wide range of empirical data from numerous tasks (synchrony/temporal order judgement, localization, detection, etc.) and behavior types (manual/saccade responses, gaze, etc.) using only the stimulus and a few free parameters. This ability is another of the model's main strengths that I think deserves some emphasis: it represents a kind of validation of those experiments, especially in the context of cross-experiment predictions (but see some criticism of that below).
 
 Finally, what is perhaps most impressive to me is that the MCD (and the accompanying decision model) does all this with very few (sometimes zero) free parameters. This highlights the utility of the model and the plausibility of its underlying architecture, but also helps to prevent extreme overfitting if fit correctly (but see a related concern below).
 
 Weaknesses:
 
 There is an insufficient level of detail in the methods about model fitting. As a result, it's unclear what data the models were fitted and validated on. Were models fit individually or on average group data? Each condition separately? Is the model predictive of unseen data? Was the model cross-validated? Relatedly, the manuscript mentions a randomization test, but the shuffled data produces model responses that are still highly correlated to behavior despite shuffling. Could it be that any stimulus that varies in AV onset asynchrony can produce a psychometric curve that matches any other task with asynchrony judgements baked into the task? Does this mean all SJ or TOJ tasks produce correlated psychometric curves? Or more generally, is Pearson's correlation insensitive to subtle changes here, considering psychometric curves are typically sigmoidal? Curves can be non-overlapping and still highly correlated if one is, for example, scaled differently. Would an error term such as mean-squared or root mean-squared error be more sensitive to subtle changes in psychometric curves? Alternatively, perhaps if the models aren't cross-validated, the high correlation values are due to overfitting?
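 The concern about Pearson's correlation can be made concrete with a minimal sketch. It assumes logistic psychometric curves sampled at hypothetical asynchronies; the slopes, the compression, and all names are illustrative and are not taken from the manuscript.

```python
import math

def sigmoid(x, slope=1.0, midpoint=0.0):
    """Logistic psychometric function."""
    return 1.0 / (1.0 + math.exp(-slope * (x - midpoint)))

def pearson(xs, ys):
    """Pearson correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def rmse(xs, ys):
    """Root mean-squared error between two curves."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs))

soas = [-400 + 50 * i for i in range(17)]            # asynchronies in ms (hypothetical)
observed = [sigmoid(s, slope=0.02) for s in soas]    # stand-in for behavioural data
compressed = [0.25 + 0.5 * p for p in observed]      # same shape, squeezed into 0.25-0.75
steeper = [sigmoid(s, slope=0.04) for s in soas]     # twice the slope

r_compressed, err_compressed = pearson(observed, compressed), rmse(observed, compressed)
r_steeper, err_steeper = pearson(observed, steeper), rmse(observed, steeper)
print(r_compressed, err_compressed)
print(r_steeper, err_steeper)
```

 Because correlation is invariant to linear rescaling, the compressed curve correlates essentially perfectly with the original even though the two curves do not overlap, whereas an error term such as RMSE registers the mismatch.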
 
 While the model boasts incredible versatility across tasks and stimulus configurations, fitting behavioral data well doesn't mean we've captured the underlying neural processes, and thus, we need to be careful when interpreting results. For example, the model produces temporal parameters fitting rat behavior that are 4x faster than when fitting human data. This difference in slope and a difference at the tails were interpreted as differences in perceptual sensitivity related to general processing speeds of the rat, presumably related to brain/body size differences. While rats no doubt have these differences in neural processing speed/integration windows, it seems reasonable that a lot of the differences in human and rat psychometric functions could be explained by the (over)training and motivation of rats to perform on every trial for a reward - increasing attention/sensitivity (slope) - and a tendency to make mistakes (compression evident at the tails). Was there an attempt to fit these data with a lapse parameter built into the decisional model as was done in Equation 21? Likewise, the fitted parameters for the pharmacological manipulations during the SJ task indicated differences in the decisional (but not the perceptual) process and the article makes the claim that "all pharmacologically-induced changes in audiovisual time perception" can be attributed to decisional processes "with no need to postulate changes in low-level temporal processing." However, those papers discuss actual sensory effects of pharmacological manipulation, with one specifically reporting changes to response timing. Moreover, and again contrary to the conclusions drawn from model fits to those data, both papers also report a change in psychometric slope/JND in the TOJ task after pharmacological manipulation, which would presumably be reflected in changes to the perceptual (but not the decisional) parameters.
 
 The case for the utility of a stimulus-computable model is convincing (as I mentioned above), but its framing as mission-critical for understanding multisensory perception is overstated, I think. The line for what is "stimulus computable" is arbitrary and doesn't seem to be followed in the paper. A strict definition might realistically require inputs to be, e.g., the patterns of light and sound waves available to our eyes and ears, while an even more strict definition might (unrealistically) require those stimuli to be physically present and transduced by the model. A reasonable looser definition might allow an abstract and low-dimensional representation of the stimulus, such as the stimulus envelope (which was used in the paper), to be an input. Ultimately, some preprocessing of a stimulus does not necessarily confound interpretations about (multi)sensory perception. And on the flip side, the stimulus-computable aspect doesn't necessarily give the model supreme insight into perception. For example, the MCD model was "confused" by the stimuli used in our 2018 paper (Nidiffer et al., 2018; Parise & Ernst, 2025). In each of our stimuli (including catch trials), the onset and offset drove strong AV temporal correlations across all stimulus conditions, but were irrelevant to participants performing an amplitude modulation detection task. The to-be-detected amplitude modulations, set at individual thresholds, were not a salient aspect of the physical stimulus, and thus only marginally affected stimulus correlations. The model was, of course, able to fit our data by "ignoring" the on/offsets (i.e., requiring human intervention), again highlighting that the model is tapping into a very basic and ubiquitous computational principle of (multi)sensory perception. But it does reveal a limitation of such a stimulus-computable model: that it is (so far) strictly bottom-up.
 
 The manuscript rightly chooses to focus a lot of the work on speech, fitting the MCD model to predict behavioral responses to speech. The range of findings from AV speech experiments that the MCD can account for is very convincing. Given the provided context that speech is "often claimed to be processed via dedicated mechanisms in the brain," a statement claiming a "first end-to-end account of multisensory perception," and findings that the MCD model can account for speech behaviors, it seems the reader is meant to infer that energetic correlation detection is a complete account of speech perception. I think this conclusion misses some facets of AV speech perception, such as integration of higher-order, non-redundant/correlated speech features (Campbell, 2008) and also the existence of top-down and predictive processing that aren't (yet!) explained by MCD. For example, one important benefit of AV speech is interactions on linguistic processes - how complementary sensitivity to articulatory features in the auditory and visual systems (Summerfield, 1987) allows constraint of linguistic processes (Peelle & Sommers, 2015; Tye-Murray et al., 2007).
 
 References
 
 Campbell, R. (2008). The processing of audio-visual speech: empirical and neural bases. Philosophical Transactions of the Royal Society B: Biological Sciences, 363(1493), 1001-1010. https://doi.org/10.1098/rstb.2007.2155

 Nidiffer, A. R., Diederich, A., Ramachandran, R., & Wallace, M. T. (2018). Multisensory perception reflects individual differences in processing temporal correlations. Scientific Reports, 8(1), 1-15. https://doi.org/10.1038/s41598-018-32673-y

 Parise, C. V., & Ernst, M. O. (2025). Multisensory integration operates on correlated input from unimodal transient channels. eLife, 12. https://doi.org/10.7554/ELIFE.90841

 Peelle, J. E., & Sommers, M. S. (2015). Prediction and constraint in audiovisual speech perception. Cortex, 68, 169-181. https://doi.org/10.1016/j.cortex.2015.03.006

 Summerfield, Q. (1987). Some preliminaries to a comprehensive account of audio-visual speech perception. In B. Dodd & R. Campbell (Eds.), Hearing by Eye: The Psychology of Lip-Reading (pp. 3-51). Lawrence Erlbaum Associates.

 Tye-Murray, N., Sommers, M., & Spehar, B. (2007). Auditory and Visual Lexical Neighborhoods in Audiovisual Speech Perception. Trends in Amplification, 11(4), 233-241. https://doi.org/10.1177/1084713807307409
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 Building on previous models of multisensory integration (including their earlier correlation-detection framework used for non-spatial signals), the author introduces a population-level Multisensory Correlation Detector (MCD) that processes raw auditory and visual data. Crucially, it does not rely on abstracted parameters, as is common in normative Bayesian models, but rather works directly on the stimulus itself (i.e., individual pixels and audio samples). By systematically testing the model against a range of experiments spanning human, monkey, and rat data, the author shows that the MCD population approach robustly predicts perception and behavior across species with a relatively small (0-4) number of free parameters.
 
 Strengths:
 
 (1) Unlike prior Bayesian models that used simplified or parameterized inputs, the model here is explicitly computable from full natural stimuli. This resolves a key gap in understanding how the brain might extract "time offsets" or "disparities" from continuously changing audio-visual streams.
 
 (2) The same population MCD architecture captures a remarkable range of multisensory phenomena, from classical illusions (McGurk, ventriloquism) and synchrony judgments, to attentional/gaze behavior driven by audio-visual salience. This generality strongly supports the idea that a single low-level computation (correlation detection) can underlie many distinct multisensory effects.
 
 (3) By tuning model parameters to different temporal rhythms (e.g., faster in rodents, slower in humans), the MCD explains cross-species perceptual data without reconfiguring the underlying architecture.
 
 Weaknesses:
 
 (1) The authors show how a correlation-based model can account for the various multisensory integration effects observed in previous studies. However, a comparison of how the two accounts differ would shed light on whether the correlation model is an implementation of the Bayesian computations (different levels in Marr's hierarchy) or makes testable predictions that can distinguish between the two frameworks. For example, the Bayesian model predicts that the variance of the cue-combined estimate is half the harmonic mean of the unimodal variances, and is therefore lower than either of them. How the MCD framework predicts this reduced uncertainty could be one potential difference (or similarity) to the Bayesian model.
 
 (2) The authors show a good match for cue combination involving 2 cues. While Bayesian accounts provide a direct extension to more cues (also seen empirically, e.g., in Hecht et al. 2008), a discussion of how the MCD model extends to more cues would benefit readers.
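 The Bayesian predictions referred to in both points can be sketched with the textbook reliability-weighted (maximum-likelihood) combination rule, assuming independent Gaussian noise on each cue. This is the standard formula, not code from the manuscript, and the numerical values are hypothetical.

```python
def combine_cues(estimates, variances):
    """Reliability-weighted average of independent Gaussian cues.

    Each cue is weighted by its inverse variance; the combined variance is the
    inverse of the summed reliabilities, so it is never larger than the smallest
    unimodal variance. The same rule applies to any number of cues.
    """
    reliabilities = [1.0 / v for v in variances]
    total = sum(reliabilities)
    mean = sum(r * e for r, e in zip(reliabilities, estimates)) / total
    return mean, 1.0 / total

# Two cues, e.g. auditory and visual location estimates (hypothetical values)
mean_2, var_2 = combine_cues([0.0, 10.0], [4.0, 1.0])

# Adding a third cue lowers the combined variance further
mean_3, var_3 = combine_cues([0.0, 10.0, 6.0], [4.0, 1.0, 2.0])

print(mean_2, var_2)
print(mean_3, var_3)
```

 For two cues the combined variance reduces to the product of the unimodal variances divided by their sum, which is the quantity a comparison with the MCD framework would need to reproduce.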
 
 Likely Impact and Usefulness:
 
 The work offers a compelling unification of multiple multisensory tasks - temporal order judgments, illusions, Bayesian causal inference, and overt visual attention - under a single, fully stimulus-driven framework. Its success with natural stimuli should interest computational neuroscientists, systems neuroscientists, and machine learning scientists. This paper thus makes an important contribution to the field by moving beyond minimalistic lab stimuli, illustrating how raw audio and video can be integrated using elementary correlation analyses.
 
 Review 2
Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.12.29.573621v2
www.biorxiv.org www.biorxiv.org

Drug combination prediction for cancer treatment using disease-specific drug response profiles and single-cell transcriptional signature

3
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 The study conducted by Hurtado et al. offers important insights and solid evidence regarding the prediction of drug combinations for cancer treatment. By leveraging disease-specific drug response profiles and single-cell transcriptional signatures, this research not only demonstrates a novel and effective approach to identifying potential drug synergies but it also enhances our understanding of the underlying mechanisms of drug response prediction.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 Identifying drugs that target specific disease phenotypes remains a persistent challenge. Many current methods are only applicable to well-characterized small molecules, such as those with known structures. In contrast, methods based on transcriptional responses offer broader applicability because they do not require prior information about small molecules. Additionally, they can be rapidly applied to new small molecules. One of the most promising strategies involves the use of "drug response signatures": specific sets of genes whose differential expression can serve as markers for the response to a small molecule. By comparing drug response signatures with expression profiles characteristic of a disease, it is possible to identify drugs that modulate the disease profile, indicating a potential therapeutic connection.
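 The general idea of comparing drug response signatures with a disease profile can be sketched as ranking compounds by how strongly their signature anti-correlates with the disease signature. This is a generic illustration, not the method implemented in the tool under review; the gene names, values, and the choice of Pearson correlation as the score are all assumptions.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient (assumes neither input is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def rank_by_reversal(disease_profile, drug_signatures):
    """Sort compounds from most reversing (most negative correlation) to least."""
    genes = sorted(disease_profile)
    disease = [disease_profile[g] for g in genes]
    scores = {
        drug: pearson(disease, [signature.get(g, 0.0) for g in genes])
        for drug, signature in drug_signatures.items()
    }
    return sorted(scores.items(), key=lambda item: item[1])

# Hypothetical log fold changes (disease vs. healthy, and treated vs. control)
disease_profile = {"G1": 2.0, "G2": -1.5, "G3": 0.5, "G4": -0.8}
drug_signatures = {
    "drug_A": {"G1": -1.8, "G2": 1.2, "G3": -0.4, "G4": 0.9},  # opposes the disease profile
    "drug_B": {"G1": 1.5, "G2": -1.0, "G3": 0.2, "G4": -0.5},  # mimics the disease profile
    "drug_C": {"G1": 0.1, "G2": 0.3, "G3": -0.2, "G4": 0.0},   # weak, mixed effect
}

ranked = rank_by_reversal(disease_profile, drug_signatures)
for drug, score in ranked:
    print(drug, round(score, 3))
```

 A compound at the top of such a list is only a candidate: the score says its expression changes oppose the disease profile, not that it is effective, which is why the benchmarking and experimental validation discussed below matter.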
 
 This study aims to prioritize potential drug candidates and to forecast novel drug combinations that may be effective in treating triple-negative breast cancer (TNBC). Large consortia, such as the LINCS-L1000 project, offer transcriptional signatures across various time points after exposing numerous cell lines to hundreds of compounds at different concentrations. While this data is highly valuable, its direct applicability to pathophysiological contexts is constrained by the challenges in extracting consistent drug response profiles from these extensive datasets. The authors use their method to create drug response profiles for three different TNBC cell lines from LINCS. To create a more precise, cancer-specific disease profile, the authors highlight the use of single-cell RNA sequencing (scRNA-seq) data. They focus on TNBC epithelial cells collected from 26 diseased individuals compared to epithelial cells collected from 10 healthy volunteers. The authors are further leveraging drug response data to develop inhibitor combinations.
 
 Strengths:
 
 The authors of this study contribute to an ongoing effort to develop automated, robust approaches that leverage gene expression similarities across various cell lines and different treatment regimens, aiming to predict drug response signatures more accurately. There remains a gap in computational methods for inferring drug responses at the cell subpopulation level, which the authors are trying to address.
 
 Weaknesses:
 
 The major deficiencies in this revised manuscript are a lack of benchmarking against established methods, clarification of method limitations, and experimental validation.
 
 (1) The manuscript still lacks a direct comparison between the retriever tool and well-established methods. How does it perform compared to metaLINCS? Evaluating its performance relative to existing approaches is essential to demonstrate its added value and robustness.
 
 (2) The study remains limited by the absence of experimental validation. Are there supporting data from biological models or clinical trials? Figure 5F is important as this is the validation of the identified compounds in three cell lines. In the previous review, it was noted that the identified drugs had only a modest effect on cell viability. Furthermore, the efficacy of QL-XII-47 and GSK-690693 was found to be cell-line specific, showing activity against BT20 (the cell line used for LINCS transcriptional signature generation) but not against CAL120 and DU4475, which were not included in the signature derivation process. This raises concerns about the tool's ability to predict effective drugs. Additionally, the combination may have an effect because the drugs were tested at high concentrations. How does this effect compare in non-TNBC or normal immortalized breast cell lines? Finally, the DU4475 data were not reproducible, and the experiment must be repeated to ensure reliable comparisons.
 
 (3) A previous review requested a discussion on the limitations of the retriever tool, but the authors instead focused on the well-documented constraints of the LINCS dataset. Clearly defining limitations of the retriever will be critical for evaluating its potential applications and reliability.
 
 (4) Description of the database that the authors used should be corrected. Two examples are below:
 
 "The LINCS-L1000 project published transcriptional profiles of several cell lines." Exploring LINCS metadata will help to introduce the reader to this impressive catalog.
 
 "The portal then returns a ranked list of compounds that are likely to have an inverse effect on disease-associated gene expression levels". When selecting small molecules for use in LINCS-L1000 platform, no link was established between the compounds and disease-associated gene expression levels.
 
 (5) Fig. 3 presents data on differentially expressed genes. However, without indicating whether these genes are up- or downregulated, it is difficult to assess their relevance to TNBC phenotypes and cancer burden. Additionally, presenting the new Biological Process Gene Ontology analysis in a format similar to Fig. 3C would be beneficial. The statement that these processes are closely related to cancer deregulation is somewhat vague. Instead, the findings may be discussed in relation to each enriched pathway, specifically in the context of TNBC biology and available treatments.
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In their study, Osorio and colleagues present 'retriever,' an innovative computational tool designed to extract disease-specific transcriptional drug response profiles from the LINCS-L1000 project. This tool has been effectively applied to TNBC, leveraging single-cell RNA sequencing data to predict drug combinations that may effectively target the disease. The public review highlights the significant integration of extensive pharmacological data with high-resolution transcriptomic information, which enhances the potential for personalized therapeutic applications.
 
 Strengths:
 
 A key finding of the study is the prediction and validation of the drug combination QL-XII-47 and GSK-690693 for the treatment of TNBC. The methodology employed is robust, with a clear pathway from data analysis to experimental confirmation.
 
 Comments on revisions:
 
 I commend the authors for their thorough and thoughtful revisions, which have significantly strengthened the manuscript. The expanded discussion on the limitations of the LINCS-L1000 dataset and the inherent challenges of imputation techniques provides critical context for interpreting the tool's predictive accuracy. The addition of clinical implications, including strategies for integrating retriever into clinical trial design and its broader applicability to other diseases, enhances the translational relevance of the work. Addressing drug resistance mechanisms in the context of combination therapy further underscores the biological rationale for the approach.
 
 The transparency regarding computational requirements and ethical considerations (particularly data privacy, bias mitigation, and model validation) demonstrates a responsible and forward-thinking approach to computational biology. These additions not only improve the manuscript's rigor but also set a precedent for ethical practices in personalized medicine research.
 
 With these revisions, the authors have effectively addressed prior concerns and elevated the impact of their work. The manuscript now presents a compelling case for the retriever as a valuable tool in precision oncology.
 
 Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.03.31.486602v4
arxiv.org

Optimal information gain at the onset of habituation to repeated stimuli

4
1. Public_Reviews 12 May 2025
 
 in eLife
 
 eLife Assessment
 
 This manuscript presents a valuable minimal model of habituation which is quantified by information theoretic measures. The results here could be of use in interpreting habituation behavior in a range of biological systems. The evidence presented is solid, and uses simulations of the minimal model to recapitulate several hallmarks of habituation from a simple model.
 
 Summary
2. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 In this study, the authors aim to investigate habituation, the phenomenon of increasing reduction in activity following repeated stimuli, in the context of its information theoretic advantage. To this end, they consider a highly simplified three-species reaction network where habituation is encoded by a slow memory variable that suppresses the receptor and therefore the readout activity. Using analytical and numerical methods, they show that in their model the information gain, the difference between the mutual information between the signal and readout after and before habituation, is maximal for intermediate habituation strength. Furthermore, they demonstrate that the Pareto front corresponding to an optimization strategy that maximizes the mutual information between signal and readout in the steady-state and minimizes dissipation in the system also exhibits similar intermediate habituation strength. Finally, they briefly compare predictions of their model to whole-brain recordings of zebrafish larvae under visual stimulation.
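 
 As a reading aid, the information gain described above can be sketched for a discrete toy case: compute the mutual information between signal and readout from a joint probability table, once before and once after habituation, and take the difference. The joint tables below are invented for illustration and are not taken from the manuscript, whose model is continuous-time and stochastic.

```python
from math import log2


def mutual_information(joint):
    """Mutual information (in bits) of a discrete joint distribution.

    `joint[i][j]` is P(signal = i, readout = j); entries must sum to 1.
    """
    p_signal = [sum(row) for row in joint]
    p_readout = [sum(col) for col in zip(*joint)]
    mi = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0.0:  # terms with p = 0 contribute nothing
                mi += p * log2(p / (p_signal[i] * p_readout[j]))
    return mi


def information_gain(joint_before, joint_after):
    """Difference in mutual information after versus before habituation."""
    return mutual_information(joint_after) - mutual_information(joint_before)


# Hypothetical joint distributions (rows: signal low/high, columns: readout low/high).
joint_before = [[0.30, 0.20], [0.20, 0.30]]  # readout weakly tracks the signal
joint_after = [[0.45, 0.05], [0.05, 0.45]]   # readout tracks the signal more closely

print(f"I before: {mutual_information(joint_before):.3f} bits")
print(f"I after:  {mutual_information(joint_after):.3f} bits")
print(f"gain:     {information_gain(joint_before, joint_after):.3f} bits")
```

 A positive gain here only says that the second table is more informative than the first. Whether habituation produces such a change, and for which habituation strengths, is the question the manuscript addresses.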
 
 The authors' simplified model serves as a good starting point for understanding habituation in different biological contexts as the model is simple enough to allow for some analytic understanding but at the same time exhibits most basic properties of habituation in sensory systems. Furthermore, the authors' finding of maximal information gain for intermediate habituation strength via an optimization principle is, in general, interesting. However, the following points remain unclear:
 
 (1) How general is their finding that the optimal Pareto front coincides with the region of maximal information gain? For instance, what happens if the signal H_st (H_max) isn't very strong? Does it matter that in this case, H_st only has a minor influence on delta Q_R? In the binary switching case, what happens if H_max is rather different from H_st (and not just 20% off)? Or in a case where the adapted value corresponds to the average of H_max and H_min?
 
 (2) The comparison to experimental data isn't very convincing. For instance, is PCA performed simultaneously on both the experimental data set and on the model or separately? What are the units of the PCs in Fig. 6(b,c)? Given that the model parameters are chosen so that the activity decrease in the model is similar to the one in the data (i.e., that they show similar habituation in terms of the readout), isn't it expected that the dynamics in the PC1/2 space look very similar?
 
 Review 1
3. Public_Reviews 12 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 The authors use a generic model framework to study the emergence of habituation and its functional role from information-theoretic and energetic perspectives. Their model features a receptor, readout molecules, and a storage unit, and as such, can be applied to a wide range of biological systems. Through theoretical studies, the authors find that habituation (reduction in average activity) upon exposure to repeated stimuli should occur at intermediate degrees to achieve maximal information gain. Parameter regimes that enable these properties also result in low dissipation, suggesting that intermediate habituation is advantageous both energetically and for the purpose of retaining information about the environment.
 
 A major strength of the work is the generality of the studied model. The presence of three units (receptor, readout, storage) operating at different time scales and executing negative feedback can be found in many domains of biology, with representative examples well discussed by the authors (e.g. Figure 1b). A key takeaway demonstrated by the authors that has wide relevance is that large information gain and large habituation cannot be attained simultaneously. When energetic considerations are accounted for, large information gain and intermediate habituation appear to be the favorable combination.
 
 Comments on the revision:
 
 The authors have adequately addressed the points I raised during the initial review. The text has been clarified at multiple instances, and the treatment of energy expenditure is now more rigorous. The manuscript is much improved both in terms of readability and scientific content.
 
 Review 2
4. Public_Reviews 12 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Reviewer #1 (Public Review):
 
 Summary:
 
 The manuscript by Nicoletti et al. presents a minimal model of habituation, a basic form of non-associative learning, addressing from both dynamical and information-theoretic perspectives how habituation can be realized. The authors identify that negative feedback provided with a slow storage mechanism is sufficient to explain habituation.
 
 Strengths:
 
 The authors combine the identification of the dynamical mechanism with information-theoretic measures to determine the onset of habituation and provide a description of how the system can gain maximum information about the environment.
 
 We thank the reviewer for highlighting the strength of our work and for their comments, which we believe have been instrumental in significantly improving our work and its scope. Below, we address all their concerns.
 
 Weaknesses:
 
 I have several main concerns/questions about the proposed model for habituation and its plausibility. In general, habituation does not only refer to a decrease in the responsiveness upon repeated stimulation but as Thompson and Spencer discussed in Psych. Rev. 73, 16-43 (1966), there are 10 main characteristics of habituation, including (i) spontaneous recovery when the stimulus is withheld after response decrement; dependence on the frequency of stimulation such that (ii) more frequent stimulation results in more rapid and/or more pronounced response decrement and more rapid spontaneous recovery; (iii) within a stimulus modality, the less intense the stimulus, the more rapid and/or more pronounced the behavioral response decrement; (iv) the effects of repeated stimulation may continue to accumulate even after the response has reached an asymptotic level (which may or may not be zero, or no response). This effect of stimulation beyond asymptotic levels can alter subsequent behavior, for example, by delaying the onset of spontaneous recovery.
 
 These are only a subset of the conditions that have been experimentally observed and therefore a mechanistic model of habituation, in my understanding, should capture the majority of these features and/or discuss the absence of such features from the proposed model.
 
 We are really grateful to the reviewer for pointing out these aspects of habituation that we overlooked in the previous version of our manuscript. Indeed, our model is able to capture most of these 10 observed behaviors, specifically: 1) habituation; 2) spontaneous recovery; 3) potentiation of habituation; 4) frequency sensitivity; 5) intensity sensitivity; 6) subliminal accumulation. Here, we are following the same terminology employed in Eckert et al., Current Biology 34, 5646–5658 (2024), the paper highlighted by the reviewer. We have dedicated a section of the revised version of the manuscript to these hallmarks, substantiating the validity of our framework as a minimal model to have habituation. We remark that these are the sole hallmarks that can be discussed by considering one single external stimulus and that can be identified without ambiguity in a biochemical context. This observation is again in line with Eckert et al., Current Biology 34, 5646–5658 (2024).
 
 In the revised version, we employ the same strategy of the aforementioned work to determine when the system can be considered “habituated”. Indeed, we introduce a response threshold that is now discussed in the manuscript. We also included a note in the discussions stating that, since any biochemical model will eventually reach a steady state, subliminal accumulation, for example, can only be seen with the use of a threshold. The introduction of different storage mechanisms, ideally more detailed at a molecular level, can shed light on this conceptual gap. This is an interesting direction of research.
 
 Furthermore, the habituated response in steady-state is approximately 20% less than the initial response, which seems to be achieved already after 3-4 pulses. The subsequent change in response amplitude seems to be negligible, although the authors state "after a large number of inputs, the system reaches a time-periodic steady-state". How do the authors justify these minimal decreases in the response amplitude? Does this come from the model parametrization and is there a parameter range where more pronounced habituation responses can be observed?
 
 The reviewer is correct, but this is solely a consequence of the specific set of parameters we selected. We made this choice solely for visualization purposes in the previous version. In the revised version, in the section discussing the hallmarks of habituation, we also show other parameter choices for which the response decrement is more pronounced. Moreover, we remark that the contour plot of \Delta⟨U⟩ clearly shows that the decrement can largely exceed the 20% threshold presented in the previous version.
 
 In the revised version, also in light of the works highlighted by the reviewer, we decided to move the focus of the manuscript to the information-theoretic advantage of habituation. As such, we modified several parts of the main text. Also, in the region of optimal information gain, habituation is at an intermediate level. For this reason, we decided to keep the same parameter choice as the previous version in Figure 2.
 
 We stated that the time-periodic steady-state is reached “after a large number of stimuli” from a mathematical perspective. However, by using a habituation threshold, as done in Eckert et al., Current Biology 34, 5646–5658 (2024), we can state that the system is habituated after a few stimuli for each set of parameters. This aspect is highlighted in the revised version of the manuscript (see also the point above).
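 
 The threshold idea can be illustrated with a deliberately crude discrete-pulse toy, in which a slowly decaying storage variable suppresses the response to each pulse. This is not the authors' three-species model or the criterion of Eckert et al.: the update rule, the parameter names (`kappa`, `decay`, `gain`), and their values are assumptions chosen only to show how a threshold turns an asymptotic approach to steady state into a finite "habituated after n pulses" statement.

```python
def pulse_responses(n_pulses, stimulus=1.0, kappa=1.0, decay=0.9, gain=0.5):
    """Responses of a toy negative-feedback system to identical pulses.

    Each response is suppressed by the current storage level; the storage
    then decays slightly and is replenished in proportion to the response.
    """
    storage = 0.0
    responses = []
    for _ in range(n_pulses):
        response = stimulus / (1.0 + kappa * storage)
        responses.append(response)
        storage = decay * storage + gain * response
    return responses


def pulses_to_habituate(responses, threshold=0.5):
    """1-based index of the first pulse whose response falls below
    `threshold` times the first response, or None if that never happens."""
    for index, response in enumerate(responses, start=1):
        if response < threshold * responses[0]:
            return index
    return None


responses = pulse_responses(n_pulses=30)
print("first five responses:", [round(r, 3) for r in responses[:5]])
print("habituated at pulse:", pulses_to_habituate(responses))
```

 In this toy, the response keeps decreasing toward a steady value and never reaches it exactly, yet the thresholded criterion is met after a handful of pulses, which is the distinction drawn in the response above.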
 
 The same is true for the information content (Figure 2f): already at the first pulse, I_{U,H} ~ 0.7, and it increases only negligibly afterwards. In my understanding, during learning, the mutual information between the input and the internal state increases over time and the system extracts from these predictions about its responses. In the model presented by the authors, it seems the system already carries information about the environment which hardly changes with repeated stimulus presentation. The complexity of the signal is also limited, and it is very hard to clarify from the presented results, whether the proposed model can actually explain basic features of habituation, as mentioned above.
 
 As for the response decrement of the readout, we can certainly choose a set of parameters for which the information gain is higher. In the revised version, we also report the information at the first stimulation and when the system is habituated to give a better idea of the range of these quantities. At any rate, as the referee correctly points out, it is difficult to give an intuitive interpretation of the information in our minimal model.
 
 It is also important to remark that, since the readout population and the receptor both undergo fast dynamics (with appropriate timescales as discussed in the text), we are not observing the transient gain of information associated with the first stimulus. As such, the mutual information presents a discontinuous behavior that resembles the dynamics of the readout, thereby starting at a non-zero value already at the first stimulus.
 
 Additionally, there have been two recent models on habituation and I strongly suggest that the authors discuss their work in relation to recent works (bioRxiv 2024.08.04.606534; arXiv:2407.18204).
 
 We thank the reviewer for pointing out these relevant references. In the revised version, we highlighted that we discuss the information-theoretic aspects of habituation, while the aforementioned references focus on the dynamics of this phenomenon.
 
 Reviewer #1 (Recommendations for the authors):
 
 I would also like to note here the simplification of the proposed biological model, in particular that the receptor can be in an active/passive state, as well as the proposal of the NF-κB signaling module as a possible molecular realization. Generally, a large number of cell surface receptors, including RTKs or GPCRs, have much more complex dynamics, including autocatalytic activation that generally leads to bistability, and NF-κB has been demonstrated to have oscillatory and even chaotic dynamics (works of Savaş Tay, Mogens Jensen and others). Considering this, the authors should at least discuss under which conditions TNF-alpha signaling could potentially serve as a molecular realization for habituation.
 
 We thank the reviewer for bringing this to our attention. In the previous version, we reported the TNF signaling network only to show a similar coarse-grained modular structure. However, following a suggestion of reviewer #2, we decided to change Figure 1 to include a simplified molecular scheme of chemotaxis rather than TNF signaling, to avoid any source of confusion about this issue.
 
 Also, a minor point: Figures 2d-e are cited before 2a-c.
 
 We apologize for the oversight. The structure of the Figures and their order is now significantly different, and they are now cited in the correct order.
 
 Reviewer #2 (Public review):
 
 In this study, the authors aim to investigate habituation, the phenomenon of increasing reduction in activity following repeated stimuli, in the context of its information-theoretic advantage. To this end, they consider a highly simplified three-species reaction network where habituation is encoded by a slow memory variable that suppresses the receptor and therefore the readout activity. Using analytical and numerical methods, they show that in their model the information gain, the difference between the mutual information between the signal and readout after and before habituation, is maximal for intermediate habituation strength. Furthermore, they demonstrate that the Pareto front corresponds to an optimization strategy that maximizes the mutual information between signal and readout in the steady state, minimizes some form of dissipation, and also exhibits similar intermediate habituation strength. Finally, they briefly compare predictions of their model to whole-brain recordings of zebrafish larvae under visual stimulation.
 
 The authors' simplified model might serve as a solid starting point for understanding habituation in different biological contexts as the model is simple enough to allow for some analytic understanding but at the same time exhibits all basic properties of habituation in sensory systems. Furthermore, the authors' finding of maximal information gain for intermediate habituation strength via an optimization principle is, in general, interesting. However, the following points remain unclear or are weakly explained:
 
 We thank the reviewer for deeming our work interesting and for considering it a solid starting point for understanding habituation in biological systems.
 
 (1) It is unclear what the finding of maximal information gain for intermediate habituation strength means for biological systems. Why is information gain as defined in the paper a relevant quantity for an organism/cell? For instance, why is a system with low mutual information after the first stimulus and intermediate mutual information after habituation better than one with consistently intermediate mutual information? Or, in other words, couldn't the system try to maximize the mutual information acquired over the whole time series, e.g., the time series mutual information between the stimulus and readout?
 
 This is a delicate aspect to discuss and we thank the referee for the comment. In the revised version, we report information gain, initial and final information, highlighting that both gain and final information are higher in regions where habituation is present. They have qualitatively similar behavior and highlight a clear information-theoretic advantage of this dynamical phenomenon. An important point is that, to determine the optimal Pareto front, we consider a prolonged stimulus and its associated steady-state information. Therefore, from the optimization point of view, there is no notion of “information gain” or “final information”, which are intrinsically dynamical quantities. As a result, the fact that the optimal curve lies in the region of optimal information gain is not expected a priori and hints at the potentially crucial role of this feature. In the revised version, we elucidate this aspect with several additional analyses.
 
 We would like to add that, from a naive perspective, while the first stimulation will necessarily trigger a certain (non-zero) mutual information, multiple observations of the same stimulus have to reflect into accumulated information that consequently drives the onset of observed dynamical behaviors, such as habituation.
 
 (2) The model is very similar to (or a simplification of) previous models for adaptation in living systems, e.g., for adaptation in chemotaxis via activity-dependent methylation and demethylation. This should be made clearer.
 
 We apologize for having missed this point. Our choice has been motivated by the fact that we wanted to avoid confusion between the usual definition of (perfect) adaptation and habituation. However, we now believe that this is not the case for the revised manuscript, and we now include chemotaxis as an example in Figure 1.
 
 (3) It remains unclear why this optimization principle is the most relevant one. While it makes sense to maximize the mutual information between stimulus and readout, there are various choices for what kind of dissipation is minimized. Why was \delta Q_R chosen and not, for instance, \dot{\Sigma}_int or the sum of both? How would the results change in that case? And how different are the results if the mutual information is not calculated for the strong stimulation input statistics but for the background one?
 
 We thank the reviewer for the suggestion. We agree that a priori, there is no reason to choose \delta Q_R or a function of the internal energy flux J_int (that, in the revised version, we are using in place of \dot\Sigma_int following the suggestion of reviewer #3). The rationale was to minimize \delta Q_R since this dissipation is unavoidable and stems from the presence of the storage inhibiting the receptor through the internal pathway. Indeed, considering the existence of two different pathways implementing sensing and feedback, the presence of any input will result in a dissipation produced by the receptor. This energy consumption is reflected in \delta Q_R.
 
 In the revised version, we now include in the optimization principle two energy contributions (see Eq. (14) of the revised manuscript): \delta Q_R and E_int, which is the energy consumption associated with the driven storage production per unit energy. All Figures have been updated accordingly. The results remain similar, as \delta Q_R still represents the main contribution, especially at high \beta.
 
 Furthermore, in the revised version, we include examples of the Pareto optimization for different values of input strength. As detailed both in the main text and the Supplementary Information, changing the value of ⟨H⟩ moves the Pareto frontier in the (\beta, \sigma) space, since the signal needs to be strong enough for the system to distinguish it from the intrinsic thermal noise (controlled by \beta). We also show that if the system is able to tune the inhibition strength \kappa, the Pareto frontiers at different ⟨H⟩ collapse into a single curve. This shows that, although the values of, e.g., the mutual information, depend on ⟨H⟩, the qualitative behavior of the system in this regime is effectively independent of it. We also added more details about this in the Supplementary Information.
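 
 For readers unfamiliar with the term, a Pareto front for this kind of two-objective problem (maximize information, minimize dissipation) can be sketched as a simple non-dominated filter over candidate parameter sets. The candidate values below are invented, and the brute-force filter is only an illustration of the concept, not the optimization procedure used in the manuscript.

```python
def pareto_front(candidates):
    """Return the non-dominated (information, dissipation) pairs.

    A candidate is dominated if another one has at least as much
    information and at most as much dissipation, and is strictly
    better in at least one of the two.
    """
    front = []
    for i, (info_i, diss_i) in enumerate(candidates):
        dominated = any(
            info_j >= info_i
            and diss_j <= diss_i
            and (info_j > info_i or diss_j < diss_i)
            for j, (info_j, diss_j) in enumerate(candidates)
            if j != i
        )
        if not dominated:
            front.append((info_i, diss_i))
    return front


# Hypothetical (information, dissipation) values for five parameter choices.
candidates = [(0.2, 0.1), (0.5, 0.3), (0.4, 0.5), (0.8, 0.9), (0.7, 0.9)]
front = pareto_front(candidates)
print(front)
```

 Each point on the front represents a trade-off that cannot be improved in one objective without worsening the other. In the manuscript, each such point would correspond to a location in the (\beta, \sigma) space.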
 
 (4) The comparison to the experimental data is not too strong of an argument in favor of the model. Is the agreement between the model and the experimental data surprising? What other behavior in the PCA space could one have expected in the data? Shouldn't the 1st PC mostly reflect the "features", by construction, and other variability should be due to progressively reduced activity levels?
 
 The agreement between data and model is not surprising - we agree on this - since the data exhibit habituation. However, we believe that the fact that our minimal model is able to capture the features of a complex neural system just by looking at the PCs, without any explicit biological details, is non-trivial. We also stress that the 1st PC only reflects the feature that captures most of the variance of the data and, as such, it is difficult to have a-priori expectations on what it should represent. In the case of the data generated from the model, most of the variance of the activity comes from the switching signal, and similar considerations can be made for the looming stimulations in the data. We updated the manuscript to clarify this point.
 
 Reviewer #2 (Recommendations for the authors):
 
 (1) The abstract makes it sound like a new finding is that habituation is due to a slow, negative feedback mechanism. But, as mentioned in the introduction, this is a well-known fact.
 
 We agree with the reviewer. We have revised the abstract.
 
 (2) Figure 2c: Why does the range of Delta Delta I_f include negative values if the corresponding region is shaded (right-tilted stripes)?
 
 The negative values in the range are those attained in the shaded region with right-tilted stripes. We decided to include them in the colorbar for clarity, since Delta Delta I_f is also plotted in the region where it attains negative values.
 
 (3) What does the Pareto front look like if the optimization is done for input statistics given by ⟨H⟩_min?
 
 In the revised version, we include examples of the Pareto optimization for different values of input strength. As detailed both in the main text and the Supplementary Information, changing the value of ⟨H⟩ moves the Pareto frontier in the (\beta, \sigma) space, since the strength of the signal is crucial for the system to discriminate input and thermal noise (see also the answers above).
 
 In particular, in Figure 4 we explicitly compare the results of the Pareto optimization (which is done with a static input of a given statistics) with the dynamics of the model for different values of ⟨H⟩ in two scenarios, i.e., adaptive and non-adaptive inhibition strength (see answers above for details).
 
 We also remark that ⟨H⟩_min represents the background signal that the system is not trying to capture, which is why we never used it for optimization.
 
 (4) From the main text, it is rather difficult to understand how the comparison to the experimental data was performed. How was the PCA done exactly? What are the "features" of the evoked neural response?
 
 The PCA on data is performed starting from the single-neuron calcium dynamics. To perform a fair comparison, we reconstruct a similar but extremely simplified dynamics using our model, as explained in Methods, and perform the PCA on analogous simulated data. We added a comment on this in the revised version. While these components capture most of the variance in the data, their specific interpretation is usually out of reach and we believe that it lies beyond the scope of this theoretical work. We also remark that the model does not contain all these biological details - a strong aspect in our opinion - and, as such, it cannot capture specific biological features.
 
 Reviewer #3 (Public review):
 
 The authors use a generic model framework to study the emergence of habituation and its functional role from information-theoretic and energetic perspectives. Their model features a receptor, readout molecules, and a storage unit, and as such, can be applied to a wide range of biological systems. Through theoretical studies, the authors find that habituation (reduction in average activity) upon exposure to repeated stimuli should occur at intermediate degrees to achieve maximal information gain. Parameter regimes that enable these properties also result in low dissipation, suggesting that intermediate habituation is advantageous both energetically and for the purpose of retaining information about the environment.
 
 A major strength of the work is the generality of the studied model. The presence of three units (receptor, readout, storage) operating at different time scales and executing negative feedback can be found in many domains of biology, with representative examples well discussed by the authors (e.g. Figure 1b). A key takeaway demonstrated by the authors that has wide relevance is that large information gain and large habituation cannot be attained simultaneously. When energetic considerations are accounted for, large information gain and intermediate habituation appear to be a favorable combination.
 
 We thank the reviewer for this positive assessment of our work and its generality.
 
 While the generic approach of coarse-graining most biological detail is appealing and the results are of broad relevance, some aspects of the conducted studies, the problem setup, and the writing lack clarity and should be addressed:
 
 (1) The abstract can be further sharpened. Specifically, the "functional role" mentioned at the end can be made more explicit, as it was done in the second-to-last paragraph of the Introduction section ("its functional advantages in terms of information gain and energy dissipation"). In addition, the abstract mentions the testing against experimental measurements of neural responses but does not specify the main takeaways. I suggest the authors briefly describe the main conclusions of their experimental study in the abstract.
 
 We thank the reviewer for raising this point. In the revised version, we have changed the abstract to reflect the reviewer’s points and the new structure and results of the manuscript.
 
 (2) Several clarifications are needed on the treatment of energy dissipation.
 
 - When substituting the rates in Eq. (1) into the definition of δQ_R above Eq. (10), "σ" does not appear on the right-hand side. Does this mean that one of the rates in the lower pathway must include σ in its definition? Please clarify.
 
 We apologize to the reviewer for this typo. Indeed, \sigma sets the energy scale of feedback and, as such, it appears in the energetic driving given by the feedback on the receptor, i.e., in Eq. (1) together with \kappa. This typo has been corrected in the revised manuscript, and all subsequent equations are consistent.
 
 - I understand that the production of storage molecules has an associated cost σ and hence contributes to dissipation. The dependence of receptor dissipation on ⟨H⟩, however, is not fully clear. If the environment were static and the memory block was absent, the term with ⟨H⟩ would still contribute to dissipation. What would be the nature of this dissipation?
 
 In the spirit of building a paradigmatic minimal model with a thermodynamic meaning, we considered H to act as an external thermodynamic driving. Since this driving acts on a different pathway with respect to the one affected by the storage, the receptor is driven out of equilibrium by its presence.
 
 By eliminating the memory block, we would also be necessarily eliminating the presence of the pathway associated with the storage effect (“internal pathway” in the manuscript), since its presence is solely due to the existence of a storage population. Therefore, in this case, the receptor would be a 2-state, 1-pathway system and, as such, it would always satisfy an effective detailed balance. As a consequence, the definition of \delta Q_R reported in the manuscript would not hold anymore and the receptor would not exhibit any dissipation. Thus, in a static environment and without a memory block, no receptor dissipation would be present. We would also like to stress that our choice to model two different pathways has been motivated by the observation that the negative feedback acts along a different pathway in several biochemical and biological examples. We made some changes to the model description in the revised version and we hope that this aspect has been clarified.
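
 The detailed-balance argument above can be illustrated with a minimal sketch. It is not the model of the manuscript: the function name `entropy_production` and all rate values are hypothetical, and the only assumption is a generic two-state system (passive/active) whose states are connected by one or more pathways. With a single pathway the stationary flux vanishes and so does the entropy production; with two pathways carrying different drivings, a net cycle current and a positive dissipation appear.

```python
import math

def entropy_production(pathways):
    """Stationary entropy production rate (in units of k_B per unit time)
    of a two-state system connected by one or more pathways.

    pathways: list of (k_up, k_down) rate pairs, one pair per pathway
              (hypothetical values, chosen only for illustration).
    """
    total_up = sum(k_up for k_up, _ in pathways)
    total_down = sum(k_down for _, k_down in pathways)
    # Stationary occupation probabilities of the two states
    p_active = total_up / (total_up + total_down)
    p_passive = 1.0 - p_active
    ep_rate = 0.0
    for k_up, k_down in pathways:
        forward = p_passive * k_up    # probability flux passive -> active
        backward = p_active * k_down  # probability flux active -> passive
        ep_rate += (forward - backward) * math.log(forward / backward)
    return ep_rate

# One pathway: effective detailed balance, hence no dissipation
one_pathway = entropy_production([(2.0, 0.5)])

# Two pathways with different rate ratios: the system is driven out of equilibrium
two_pathways = entropy_production([(2.0, 0.5), (0.3, 1.2)])

print(one_pathway, two_pathways)
```

 In this sketch the two-pathway result equals the cycle current times the logarithm of the ratio of the two pathways' rate ratios, so it vanishes only when both pathways share the same ratio of forward to backward rates.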
 
 - Similarly, in Eq. (9) the authors use the ratio of the rates Γ_{s → s+1} and Γ_{s+1 → s} in their expression for internal dissipation. The first rate corresponds to the synthesis reaction of memory molecules, while the second corresponds to a degradation reaction. Since the second reaction is not the microscopic reverse of the first, what would be the physical interpretation of the log of their ratio? Since the authors already use σ as the energy cost per storage unit, why not use σ times the rate of producing S as a metric for the dissipation rate?
 
 We agree with the referee that the reverse reaction we considered is not the microscopic reverse of the storage production. In the case of a fast readout population, we employed a coarse-grained view to compute this entropy production. To be more precise, we gladly welcomed the referee’s suggestion in the revised version and modified the manuscript accordingly. As suggested, we now employ the energy flux associated with the storage production to estimate the internal dissipation (see new Fig. 3).
 
 In the revised version, we also use this quantity in the optimization procedure in combination with \delta Q_R (see new Fig. 4) to have a complete characterization of the system’s energy consumption. The conclusions are qualitatively identical to before, but we believe that now they are more solid from a theoretical perspective. For this important advance in the robustness and quality of our work, we are profoundly grateful to the referee.
 
 (3) Impact of the pre-stimulus state. The plots in Figure 2 suggest that the environment was static before the application of repeated stimuli. Can the authors comment on the impact of the pre-stimulus state on the degree of habituation and its optimality properties? Specifically, would the conclusions stay the same if the prior environment had stochastic but aperiodic dynamics?
 
 The initial stimulus is indeed stochastic with an average constant in time and mimics the background (small) signal. We apply the (strong) stimulation when the system has already reached a stationary state with respect to the background. As can be appreciated in Fig. 2 of the revised version, the model response depends on the pre-stimulus level, since it sets the storage concentration before the stimulation arrives and, as such, the subsequent habituation dynamics. This dependence is important from a dynamical perspective. The information-theoretic picture has been developed, as said above, by letting the system relax before the first stimulus. This eliminates this arbitrary dependence and provides a clearer idea of the functional advantages of habituation. Moreover, the optimization procedure is performed in a completely different setting, with no pre-stimulus at all, since we only have one prolonged stimulation. We hope that the revised version is clearer on all these points.
 
 (4) Clarification about the memory requirement for habituation. Figure 4 and the associated section argue for the essential role that the storage mechanism plays in habituation. Indeed, Figure 4a shows that the degree of habituation decreases with decreasing memory. The graph also shows that in the limit of vanishingly small Δ⟨S⟩, the system can still exhibit a finite degree of habituation. Can the authors explain this limiting behavior; specifically, why does habituation not vanish in the limit Δ⟨S⟩ -> 0?
 
 We apologize for the lack of clarity and we thank the reviewer for spotting this issue. In Figure 4 (now Figure 5 in the revised manuscript) Δ⟨S⟩ is not exactly zero, but equal to 0.15% at the final point. It appeared as 0% in the plot due to an unwanted rounding in the plotting function that we missed. This has been fixed in the revised version, thank you.
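
 The kind of rounding artifact described here is easy to reproduce. The sketch below is only an illustration: the authors' actual plotting function is not shown in the response, so the helper `percent_label` and its `decimals` argument are hypothetical. It shows how formatting a value of 0.15% with zero decimal places yields a label that reads as 0%.

```python
def percent_label(value, decimals=0):
    """Format a percentage for an axis label with a fixed number of decimals.

    Hypothetical helper; not the plotting function used in the manuscript.
    """
    return f"{value:.{decimals}f}%"

final_point = 0.15  # value quoted in the response, in percent

print(percent_label(final_point))     # zero decimals: the label reads "0%"
print(percent_label(final_point, 2))  # two decimals: the label reads "0.15%"
```

 Keeping enough decimals in the label, or plotting the raw value, avoids suggesting that the quantity vanishes exactly.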
 
 Reviewer #3 (Recommendations for the authors):
 
 (1) Page 2 | "Figure 1b-e" should be "Figure 1b-d" since there is no panel (e) in Figure 1.
 
 (2) Figure 1a | In the top schematic, the symbol "k" is used, while in the rest of the text, the proportionality constant is denoted by κ.
 
 We thank the reviewer for pointing this out. Figure 1 has been revised and the panels are now consistent. The proportionality constant (the inhibition strength) has also been fixed.
 
 (3) Figure 1a | I find the upper part of the schematic for Storage hard to perceive. I understand the lower part stands for the degradation reaction for storage molecules. The upper part stands for the synthesis reaction catalyzed by the readout population. I think the bolded upper arrow would explain it sufficiently well; the left/right arrows, together with the crossed green circle make that part of the figure confusing. Consider simplifying.
 
 We decided to remove the left/right arrows, as suggested by the reviewer, as we agree that they were unnecessarily complicating the schematic. We hope that the revised version will be easier to understand.
 
 (4) Page 3 | It would be helpful to state what the temporal statistics of the input signal $p_H(h,t)$ are, i.e. <h(t) h(t')>. Looking at the example trajectory in Figure 1a, consecutive signal values do not seem correlated.
 
 We agree with the reviewer that this is an important detail and worth mentioning. We now explicitly state that consecutive values are not correlated, for simplicity.
 
 (5) Figure 2 | I believe the label "EXTERNAL INPUT" refers to the *average* external input, not one specific realization (similar to panels (d) and (e) that report on average metrics). I suggest you indicate this in the label, or, what may be even better, add one particular realization of the stochastic input to the same graph.
 
 We thank the reviewer for spotting this. We now write that what we show is the average external signal. We prefer this solution rather than showing a realization of the stochastic input, since it is more consistent with the rest of the plots, where we always show average quantities. We also note that Figure 2 is now Figure 3 in the revised manuscript.
 
 (6) Figure 2d | The expression of Δ⟨U⟩ is the negative of the definition in Eq. (5). It should be corrected.
 
 In the revised version, both the definitions in Figure 2 (now Figure 3) and in the text (now Eq. (11)) are consistent.
 
 (7) Figure 3(d-e) caption | "where ⟨U⟩ starts to be significantly smaller than zero." There, it should be Δ⟨U⟩ instead of ⟨U⟩.
 
 Thanks again, we corrected this typo.
 
 AuthorResponse
Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

arxiv.org/abs/2301.12812v6
www.biorxiv.org

The robust, high-throughput, and temporally regulated roxCre and loxCre reporting systems for genetic modifications in vivo

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This study presents an important set of new tools to facilitate Cre or Dre-mediated recombination in mice. The characterization of these new tools was done using solid and validated methodology. The work convincingly demonstrates the efficient gene knockout capability of these models and will progress the field.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  This is a simple and potentially valuable approach to reduce Cre leak in amplified systems designed to improve CreER use across alleles. The revised work is improved with a direct comparison to the Benedito iSure-Cre line, providing some practical guidance for investigators. The authors do not address the issue of Cre toxicity or mosaic efficiency with low Tamoxifen use.
  
  The major improvement in my mind is the inclusion of Supp Fig 7 where the authors compare their loxCre to iSureCre. The discussion is somewhat improved, but still fails to discuss significant issues such as Cre toxicity in detail. As noted by most reviewers, without a biological question, the paper is entirely a technical description of a couple of new tools. Whether and to what extent journals such as eLife should publish every new technical innovation without rigorous functional comparison to prior tools is an important question raised by this study. There is already a plethora of available techniques, most of which look better on paper than they function in mice.
  
  However, I do feel that these tools will be of potential use to the field.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  This work presents new genetic tools for enhanced Cre-mediated gene deletion and genetic lineage tracing. The authors optimise and generate mouse models that convert temporally controlled CreER or DreER activity to constitutive Cre expression, coupled with the expression of tdT reporter for the visualizing and tracing of gene-deleted cells. This was achieved by inserting a stop cassette into the coding region of Cre, splitting it into N- and C-terminal segments. Removal of the stop cassette by Cre-lox or Dre-rox recombination results in the generation of modified Cre that is shown to exhibit similar activity to native Cre. The authors further demonstrate efficient gene knockout in cells marked by the reporter using these tools, including intersectional genetic targeting of pericentral hepatocytes.
  
  The new models offer several important advantages. They enable tightly controlled and highly effective genetic deletion of even alleles that are difficult to recombine. By coupling Cre expression to reporter expression, these models reliably report Cre-expressing i.e. gene-targeted cells and circumvent false positives that can complicate analyses in genetic mutants relying on separate reporter alleles. Moreover, the combinatorial use of Dre/Cre permits intersectional genetic targeting, allowing for more precise fate mapping.
  
  The study and the new models also have limitations. The efficient deletion of multiple floxed alleles in a mosaic fashion, a scenario where the lines would demonstrate their full potential compared to already existing models, has not been tested in the current study. Mosaic genetics is increasingly recognized as a key methodology for assessing cell-autonomous gene functions. The challenge lies in performing such experiments, as low doses of tamoxifen needed for inducing mosaic gene deletion may not be sufficient to efficiently recombine multiple alleles in individual cells while at the same time accurately reporting gene deletion. In addition, as discussed by the authors, a limitation of this line is the constitutive expression of Cre, which is associated with toxicity in some cases.
  
  Comments on revisions: I have no further comments.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Shi et al describe a new set of tools to facilitate Cre or Dre-recombinase-mediated recombination in mice. The strategies are not completely novel but have been pursued previously by the lab, which is world-leading in this field, and by others. The authors report a new version of the iSuRe-Cre approach, which was originally developed by Rui Benedito's group in Spain. Shi et al describe that their approach shows reduced leakiness compared to the iSuRe-Cre line. Furthermore, a new R26-roxCre-tdT mouse line was established after extensive testing, which enables efficient expression of the Cre recombinase after activation of the Dre recombinase. The authors carefully evaluated efficiency and leakiness of the new line and demonstrated the applicability by marking peri-central hepatocytes in an intersectional genetics approach. The paper represents the result of enormous, carefully executed efforts. Although I would have preferred to see a study which uses the wonderful new tools to address a major biological question, carefully conducted technical studies have an enormous value for the scientific community, clearly justifying publication.
  
  The new mouse lines generated in this study will enhance the precision of genetic manipulation in distinct cell types and greatly facilitate future work in numerous laboratories. The authors expertly eradicated weaknesses from initial submissions. Remaining open questions regarding potential toxicity of expressing multiple recombinases and fluorescence reporters were convincingly answered.
  
  Review 3
Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.04.23.590680v3
www.biorxiv.org

Planar cell polarity coordination in a cnidarian embryo provides clues to animal body axis evolution

4
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This analysis of the formation of the oral-aboral body axis in cnidarians, the sister group of bilaterians, is a significant and fundamental contribution to the field of Wnt signalling and planar cell polarity, particularly to our understanding of gradient formation, non-canonical Wnt signalling and Wnt-Frizzled interactions in cnidarians. The evidence supporting the conclusions is compelling and has the potential to contribute to a deeper understanding of the origin and evolution of Wnt signalling in cnidarians and metazoans in general. These findings, which are presented in a thoughtful and scholarly manner, will be of broad interest to developmental and evolutionary biologists.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 This noteworthy paper examines the role of planar cell polarity and Wnt signalling in body axis formation of the hydrozoan Clytia. In contrast to the freshwater polyp Hydra or the sea anemone Nematostella, Clytia represents a cnidarian model system with a complete life cycle (planula larva-polyp-medusa). In this species, classical experiments have demonstrated that a global polarity is established from the oral end of the embryos (Freeman, 1981). Prior research has demonstrated that Wnt3 plays a role in the formation of the oral organiser in Clytia and other cnidarians, acting in an autocatalytic feedback-loop with β-catenin. However, the question of whether and to what extent an oral-aboral gradient of Wnt activity is established remained unanswered. This gradient is thought to control both tissue differentiation and tissue polarity. The planar cell polarity (PCP) pathway has been linked to this polarity, although it is generally considered to be β-catenin independent.
 
 Comments on major strengths and weaknesses:
 
 Beautiful and solid experiments to clarify the role of canonical Wnt signalling and PCP core factors in coordinating planar cell polarity of Clytia. The authors have conducted a series of sophisticated experiments utilising morpholinos, mRNA microinjections and immunofluorescent visualisation of PCP. The objective of these experiments was to address the function of Wnt3, β-catenin and PCP core proteins in the coordination of the global polarity of Clytia embryos. The authors conclude that PCP plays a role in regulating polarity along the oral-aboral axis of embryos and larvae. This offers a conceivable explanation for how polarity information is established and distributed globally during Clytia embryogenesis, with implications for our understanding of axis formation in cnidarians and the evolution of Wnt signalling in general. - While the experiments are well-designed and executed, there are some criticisms, questions or suggestions that should be addressed.
 
 (i) Wnt3 cue and global PCP. PCP has been described in detail in a previous paper on Clytia (Momose et al, 2012): its orientation along the oral-aboral body axis (ciliary basal body positioning studies), and its function in directional polarity during gastrulation (Stbm-, Fz1-, and Dsh-MO experiments). I wonder if this part could be shortened. What is new, however, are the knockdown and Wnt3-mRNA rescue experiments, which provide a deeper insight into the link between Wnt3 function in the blastopore organiser as a source or cue for axis formation. These experiments demonstrate that the Wnt3 knockdown induces defects equivalent to PCP factor knockdown, but can be rescued by Wnt3-mRNA injection, even at a distance of 200 µm away from the Wnt-positive area. The experimental set-up of these new molecular experiments follows in important aspects those of Freeman's experiments of 1981 (who in turn was motivated to re-examine Teissier's work of 1931/1933 ...). Freeman did not use the term "global polarity" but the concept of an axis-inducing source and a long-range tissue polarity can be traced back to both researchers.
 
 (ii) PCP propagation and β-catenin. The central but unanswered question in this study focuses on the interaction between Wnt3 and PCP and the propagation of PCP. Wnt3 has been described in cnidarians but also in vertebrates and insects as a canonical Wnt interacting with β-catenin in an autocatalytic loop. The surprising result of this study is that the action of Wnt3 on PCP orientation is not inhibited in the presence of a dominant-negative form of CheTCF (dnTCF), ruling out a potential function of β-catenin in PCP. This was supported by studies with constitutively active β-catenin (CA-β-cat) mRNA, which was unable to restore either PCP coordination or elongation of Wnt3-depleted embryos but did restore β-catenin-dependent gastrulation. Based on these data, the authors conclude that Wnt3 has two independent roles: Wnt/β-catenin activation and initial PCP orientation (two-step model for PCP formation). However, the molecular basis for the interaction of Wnt3 with the PCP machinery and how the specificity of Wnt3 for both pathways is regulated at the level of Wnt-receiving cells (Fz-Dsh) remains unresolved. - Also, with respect to PCP propagation, there is no answer with respect to the underlying mechanisms. The authors found that PCP components are expressed in the mid-blastula stage, but without any further indication of how the signal might be propagated, e.g., by a wavefront of local cell alignment. Here, it is necessary to address the underlying possible cellular interactions more explicitly.
 
 (iii) The proposed two-step model for PCP formation has important evolutionary implications in that it excludes the current alternate model according to which a long-range Wnt3-gradient orients PCP ("Wnt/β-catenin-first"). Nevertheless, the initial PCP orientation by Wnt3 - as proposed in the two-step model - is not explained at all on the molecular level. Another possible, but less well discussed and studied option for linking Wnt3 with PCP action could be a role of other Wnt pathways. The authors convincingly show that Wnt3 is the most highly expressed Wnt in Clytia at all stages of development (Fig. S1). However, Wnt7 is also more highly expressed, which makes it a candidate for signal transduction from canonical Wnts to PCP Wnts. An involvement of Wnt7 in PCP regulation has been described in vertebrates (http://dx.doi.org/10.1016/j.celrep.2013.12.026). This would challenge the entire discussion and speculation on the evolutionary implications according to which PCP Wnt signaling comes first ("PCP-first scenario") and canonical Wnt signaling later in metazoan evolution.
 
 (iv) The discussion, including Figure 6, is strongly biased towards the traditional evolutionary scenario postulating a choanozoan-sponge ancestry of metazoans. Chromosome-linkage data of pre-metazoans and metazoans (Schulz et al., 2023; https://doi.org/10.1038/s41586-023-05936-6) now indicate a radically different scenario according to which ctenophores represent the ancestral form and are sister to sponges, cnidarians and bilaterians (the Ctenophora-sister hypothesis). This also has implications for the evolution of Wnt signalling, as discussed in the recent Nature Reviews Genetics article by Holzem et al. (2024) (https://doi.org/10.1038/s41576-024-00699-w). Furthermore, it calls into question the hypothesis of a filter-feeding multicellular gastrula-like ancestor as proposed by Haeckel (Maegele et al., 2023). These papers have not yet been referenced, but they would provide a more robust discussion.
 
 General appraisal:
 
 The authors have carefully addressed all important points raised in this review. Aims and results support their conclusions.
 
 Impact of the work, utility of methods and data:
 
 As mentioned above, this will have a major impact on our understanding of the role of Wnt signalling in gradient formation, particularly the role of non-canonical Wnt signalling. It will also be important to better understand the role of Wnt-Frizzled interactions in these basal organisms, as cnidarians have a smaller repertoire of Frizzled receptors compared to the relatively complete repertoire of Wnt subfamilies. This may imply that Wnt3 is active in both canonical and PCP signalling.
 
 Additional context:
 
 With regard to the question of the evolution of the body plan and Wnt signalling, it would be helpful and important for readers unfamiliar with cnidarians to know that the Hydrozoa/Medusozoa, to which Clytia belongs, are an "evolutionary derived group" within the Cnidaria, as opposed to the Anthozoa (e.g. sea anemone Nematostella). Hydrozoans possess planula larvae that are devoid of a mouth and any form of feeding mechanism, relying instead on the yolk of a fertilised egg for sustenance. The substantial divergence between the Anthozoa and Medusozoa was accompanied by significant gene reductions within the Medusozoa, which likely exerts an influence on the evolution of Wnt signalling in this group as well. This should not detract from the value of the work, but may help to put it in perspective.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 Canonical Wnt signaling has previously been shown to be responsible for correct patterning of the oral-aboral axis as well as germ layer formation in several cnidarians. The post-gastrula stage, the planula larva, is not only elongated but also has a specific swimming direction due to the decentralized cellular positioning and slanted anchoring of the cilia. This, in turn, is in most other animals the result of a Wnt-planar cell polarity (PCP) pathway. This paper by Uveira et al investigates the role of Wnt3 signaling in serving as a local cue for the PCP pathway, which then is responsible for the orientation of the cilia and elongation of the planula larva of the hydrozoan Clytia hemisphaerica. Wnt3 was shown before to activate the canonical pathway via β-catenin and to act as an axial organizer. The authors provide compelling evidence for this somewhat unusual direct link between the pathways through the same signaling molecule, Wnt3. In conclusion, they propose a two-step model: 1) local orientation by Wnt3 secretion 2) global propagation by the PCP pathway over the whole embryo.
 
 Strengths:
 
 In a series of elegant and also seemingly sophisticated experiments, they show that Wnt3 activates the PCP pathway directly, as it happens in the absence of canonical Wnt signaling (e.g. through co-expression of dnTCF). Conversely, constitutively active β-catenin was not able to rescue PCP coordination upon Wnt3 depletion, yet restored gastrulation. This uncouples the effect of Wnt3 on axis specification and morphogenetic movements from the elongation via PCP. Through transplantation of single blastomeres providing a local source of Wnt3, they also demonstrate the reorganization of cellular polarity immediately adjacent to the Wnt3 expressing cell patch. These transplantation experiments also uncover that mechanical cues can trigger the polarization, suggesting a mechanotransduction or direct influence on subcellular structures, e.g. actin fiber orientation.
 
 This is a beautiful and elegant study addressing an important question. The results have significant implications also for our understanding of the evolutionary origin of axis formation and the link of these two ancient pathways, which in most animals are controlled by distinct Wnt ligands and Frizzled receptors. The quality of the data is stunning and the paper is written in a clear and succinct manner. This paper has the potential to become a widely cited milestone paper.
 
 Weaknesses:
 
 I cannot detect any major weaknesses. The work only raises a few more follow-up questions, which the authors are invited to comment on.
 
 I acknowledge the revisions made by the authors. Some open questions remain that need to be addressed in future work, and I accept the limitations of this study, as argued by the authors. Besides the elegant and high-quality experiments, I also appreciate the thoughtful and inspiring discussion.
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Reviewer #1 (Public review):
 
 (1) Wnt3 cue and global PCP. PCP has been described in detail in a previous paper on Clytia (Momose et al, 2012): its orientation along the oral-aboral body axis (ciliary basal body positioning studies), and its function in directional polarity during gastrulation (Stbm-, Fz1-, and Dsh-MO experiments). I wonder if this part could be shortened. What is new, however, are the knockdown and Wnt3-mRNA rescue experiments, which provide a deeper insight into the link between Wnt3 function in the blastopore organiser as a source or cue for axis formation. These experiments demonstrate that the Wnt3 knockdown induces defects equivalent to PCP factor knockdown, but can be rescued by Wnt3-mRNA injection, even at a distance of 200 µm away from the Wnt-positive area. The experimental set-up of these new molecular experiments follows in important aspects those of Freeman's experiments of 1981 (who in turn was motivated to re-examine Teissier's work of 1931/1933 ...). Freeman did not use the term "global polarity" but the concept of an axis-inducing source and a long-range tissue polarity can be traced back to both researchers.
 
 We appreciate the reviewer’s insightful comments for evolutionary biology and cnidarian developmental biology.
 
 Concerning the presentation of the basic PCP structure of Clytia embryo epidermal cells, we prefer to retain this section unless there is a strict limit on manuscript length. These experiments provide background information necessary to establish the biological system for the readers. The structures of cells, notably cell adhesion, cilia, and the cytoskeleton, are essential components of this system.
 
 We have restored sentences concerning the historical contributions of Freeman and Teissier from a previous version of the manuscript.
 
 Freeman’s work offered two key insights. The first is the concept that cell polarity spreads and self-organizes over the distances revealed by the tissue orientation of aggregate embryonic cells (Freeman, 1981 https://doi.org/10.1007/BF00867804), which was termed “global polarity” in a review by Primus and Freeman (2004 https://doi.org/10.1002/bies.20031). This concept closely resembles the modern understanding of PCP coordination mechanisms mediated by core PCP interactions. Remarkably, Freeman proposed this idea in the early 1980s, at the same time as the first characterization of PCP mutants in Drosophila (Gubb and Garcia-Bellido 1982). The second is the role of egg polarity in defining the axis. Freeman demonstrated through a series of sophisticated experiments that the position of the first cleavage furrow predicts the oral-aboral axis. This was a milestone for the studies of cnidarian body axis development.
 
 However, some of Freeman’s interpretations were misleading. In the 1981 paper, he stated:
 
 "Polarity
 
 Other work that I have done has established that the anterior-posterior axis of the planula is set up at the time of the first cleavage; the site where cleavage is initiated specifies the posterior pole of this axis (Freeman 1980). The experiment reported here in which embryos were cut into halves and each half regulated to form a normal planula with the same polarity properties as the embryo it is from provides evidence that these polarity properties are remarkably stable at all developmental stages tested ranging from 4 cell to postgastrula embryos. "
 
 Freeman hypothesised that cell polarity at the 2- or 4-cell stage, referred to as the “polarity of first cell cleavage,” is directly inherited as the global polarity observed in later developmental stages.
 
 In the review by Primus and Freeman (2004), two hypotheses were introduced: (1) maternally localised factors, such as mRNA, determine the axis, and (2) the cell polarity of cleavage furrow formation is inherited by later stages and determines the axis. Freeman described these two hypotheses as mutually exclusive. However, we now know that cell polarity at early cleavage stages does not directly contribute to global polarity/PCP. Instead, Wnt/β-catenin signaling is regionally activated by maternally localised mRNAs distributed along the egg polarity (Momose, 2007; Momose, 2008), which maintain Wnt3 localisation and direct morphological axis patterning. The study presented in this article unifies these hypotheses.
 
 On the second point, as the reviewer noted, Freeman indeed revisited the work of Georges Teissier (Teissier, 1931), who conducted similar experiments on Amphisbetia embryos. It was Teissier who first described how the egg polarity is preserved in later stages and defines the axis. Teissier, however, carefully avoided asserting continuity between egg and blastula polarities, allowing for the possibility of “rétablissement” (re-establishment). As Teissier stated:
 
 "…On constate, en second lieu, que la polarité de l’œuf se conserve dans chacun de se fragment et que le maintien ou le rétablissement de cette polarité sont indispensables à un développement normal. Un fragment d’œuf ou de morula n’a aucune partie ni aucun blastomère qui soit rigoureusement déterminé comme endoderme, mais possède, par contre, un pôle antérieur et un pôle postérieur bien définis.…
 
 Mais cette proposition, qui ne semble pourtant guère dépasser la simple constatation des faits, soulève de grave difficulté. Elle donne en effet à la polarité, propriété encore bien mystérieuse, un rôle morphogénétique de premier ordre et implique des conséquences trop importantes pour qu’on puisse l’accepter sans un très sérieux examen.
 
 Comme je ne pense pas que les questions relatives à la nature des localisation germinales, à l’existence et au fonctionnement des organisateurs de l’œuf des Cœlentérés, puissant, dans l’état actuel de nos connaissances, être discutées utilement, je ne veux voir dans la proposition précédente qu’une façons commode et tout provisoire de systématiser les faits."
 
 English translation:
 
 “We note also that the polarity of the egg is preserved in each fragment and that the maintenance or re-establishment of this polarity is essential for normal development. A fragment of egg or morula has no part or blastomere that is rigorously determined as endoderm, but has, on the other hand, a well-defined anterior and posterior pole....
 
 But this proposition, which hardly seems to go beyond the simple observation of facts, raises serious difficulties. It gives polarity, still a mysterious property, a morphogenetic role of the first order, and implies consequences too important to be accepted without very serious examination.
 
 As I do not believe that questions concerning the nature of germinal localisation, or the existence and functioning of the egg organisers in Coelenterates, can, in the present state of our knowledge, be usefully discussed, I prefer only to see in the foregoing proposition a convenient and very provisional way of systematising the facts.”
 
 Teissier, G. (1931). Étude Expérimentale du Développement de Quelques Hydraires. Ann. Sc. Nat. Zool 14, 5–59.
 
 Teissier's interpretation and caution were reasonable.
 
 Our work connects recent molecular research on axis specification mechanisms in cnidarians with the classic experimental studies of Freeman and Teissier. We believe it is essential to present and acknowledge their conceptual contributions. We have updated the Discussion to include these points.
 
 (2) PCP propagation and β-catenin. The central but unanswered question in this study focuses on the interaction between Wnt3 and PCP and the propagation of PCP. Wnt3 has been described in cnidarians but also in vertebrates and insects as a canonical Wnt interacting with β-catenin in an autocatalytic loop. The surprising result of this study is that the action of Wnt3 on PCP orientation is not inhibited in the presence of a dominant-negative form of CheTCF (dnTCF) ruling out a potential function of β-catenin in PCP. This was supported by studies with constitutively active β-catenin (CA-β-cat) mRNA which was unable to restore PCP coordination nor elongation of Wnt3-depleted embryos but did restore β-catenin-dependent gastrulation. Based on these data, the authors conclude that Wnt3 has two independent roles: Wnt/β-catenin activation and initial PCP orientation (two-step model for PCP formation). However, the molecular basis for the interaction of Wnt3 with the PCP machinery and how the specificity of Wnt3 for both pathways is regulated at the level of Wnt-receiving cells (Fz-Dsh) remain unresolved. Also, with respect to PCP propagation, there is no answer with respect to the underlying mechanisms. The authors found that PCP components are expressed in the mid-blastula stage, but without any further indication of how the signal might be propagated, e.g., by a wavefront of local cell alignment. Here, it is necessary to address the underlying possible cellular interactions more explicitly.
 
 The question of how Wnt3 interacts with the core PCP complex remains open for future investigation. An obvious hypothesis is that one of the Frizzled receptors binds Wnt3 ligands. For additional details, please refer to the response to Reviewer 2’s comment. Regarding other non-classic Wnt receptors, studies in the developing mouse limb have demonstrated that a Wnt5a gradient controls PCP polarisation via ROR receptors and graded Strabismus phosphorylation (Gao et al., 2011, https://doi.org/10.1016/j.devcel.2011.01.001). However, in this context, the Wnt5a gradient influences the frequency of polarised cells rather than PCP orientation. In Clytia, we performed gene knockdown experiments targeting ROR and RYK receptors using Morpholinos but did not observe any effect on axial patterning, suggesting that these receptors are unlikely to be involved in Wnt3 interaction.
 
 Concerning PCP propagation mechanisms, these are well-characterized in both Drosophila and vertebrates and conserved across taxa. The localised Fz-Fmi complex at the apical cortex of a cell interacts with the oppositely localised Stbm-Fmi complex in neighbouring cells, enabling coordination of PCP between directly adjacent cells. This interaction provides a comprehensive explanation for PCP propagation mechanisms. In Drosophila, the “domineering non-autonomy” effect is a well-documented phenomenon where PCP orientation autonomously propagates from core PCP mutant mosaic patches. Overall, PCP propagation is a conserved and robust mechanism across metazoans.
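
 The neighbour-to-neighbour coordination described above can be illustrated with a deliberately simple sketch. It is a toy abstraction, not a molecular model of the Fz-Fmi/Stbm-Fmi interaction: each cell in a one-dimensional row carries a single polarity angle and repeatedly re-aligns with the mean direction of itself and its adjacent cells. The function name `propagate_polarity`, the `cue_angle` parameter (a hypothetical stand-in for a fixed orienting cue at one end of the row) and all numerical values are assumptions made for illustration only.

```python
import math

def propagate_polarity(initial_angles, n_steps=2000, cue_angle=None):
    """Toy 1D row of cells; each cell relaxes toward the mean direction
    of itself and its directly adjacent neighbours (angles in radians)."""
    angles = list(initial_angles)
    if cue_angle is not None:
        angles[0] = cue_angle  # hypothetical fixed orienting cue at one end
    n = len(angles)
    for _ in range(n_steps):
        updated = angles[:]
        for i in range(n):
            if cue_angle is not None and i == 0:
                continue  # the cue cell keeps its orientation
            group = [angles[j] for j in (i - 1, i, i + 1) if 0 <= j < n]
            # Average directions as unit vectors so that angles wrap correctly.
            x = sum(math.cos(a) for a in group)
            y = sum(math.sin(a) for a in group)
            updated[i] = math.atan2(y, x)
        angles = updated
    return angles

if __name__ == "__main__":
    ramp = [0.15 * i for i in range(10)]
    print(propagate_polarity(ramp, cue_angle=0.0))   # row aligns with the cue
    start = [1.0 + 0.1 * i for i in range(10)]
    print(propagate_polarity(start))                 # row aligns, no preset direction
```

 In this sketch local coupling alone is enough to bring the whole row to a common angle, while the fixed cell at one end determines which angle that is. This mirrors, in a very reduced form, the distinction between PCP coordination between adjacent cells and an orienting cue.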
 
 (3) The proposed two-step model for PCP formation has important evolutionary implications in that it excludes the current alternate model according to which a long-range Wnt3-gradient orients PCP ("Wnt/β-catenin-first"). Nevertheless, the initial PCP orientation by Wnt3 - as proposed in the two-step model - is not explained at all on the molecular level. Another possible, but less well-discussed and studied option for linking Wnt3 with PCP action could be the role of other Wnt pathways. The authors convincingly show that Wnt3 is the most highly expressed Wnt in Clytia at all stages of development (Figure S1). However, Wnt7 is also more highly expressed, which makes it a candidate for signal transduction from canonical Wnts to PCP Wnts. An involvement of Wnt7 in PCP regulation has been described in vertebrates (http://dx.doi.org/10.1016/j.celrep.2013.12.026). This would challenge the entire discussion and speculation on the evolutionary implications according to which PCP Wnt signaling comes first ("PCP-first scenario") and canonical Wnt signaling later in metazoan evolution.
 
 First of all, we apologise that the expression profile of Wnt7 originally provided in Figure S1 was incorrect; Wnt7 is not expressed in the embryonic stage. The error arose because the accession number XLOC_034538 is assigned to two transcripts, Wnt7 and Ataxin10, in the published genome assembly. Once the expression profile is revised in this light, the data are consistent with the in situ hybridisation data published in Momose et al. (2012, https://doi.org/10.1242/dev.084251). Wnt3 is the only Wnt ligand detectable between egg and gastrula stages. We appreciate the reviewer highlighting this issue and have corrected Figure S1.
 
 If we understand correctly, the reviewer raises the possibility that Wnt3's downstream canonical Wnt/β-catenin pathway activates the expression of other Wnt genes, which in turn orient the PCP. Indeed, we showed that the expression of Wnt1 (previously called WntX2), Wnt2 (WntX1A), Wnt5 and Wnt6 (Wnt9) all becomes undetectable at the planula stage following Wnt3-MO injection (Momose et al., 2012). So, it is a reasonable concern.
 
 This possibility can be excluded because the canonical pathway activation by CA-β-cat does not restore PCP in Wnt3-MO-injected embryos and Wnt3 can orient PCP without Wnt/β-catenin pathway activity in the presence of dominant negative TCF (dnTCF). Concerning Wnt1b and Wnt11b, these transcripts are maternally stored and even more abundant than Wnt3. However, we can conclude that these do not have any role in axis patterning based on the complete axis loss in Wnt3-MO morphants.
 
 Lastly, it should of course be remembered that the chronological order of characters appearing in a developmental process does not necessarily reflect their appearance in evolution from ancestral to modern.
 
 (4) The discussion, including Figure 6, is strongly biased towards the traditional evolutionary scenario postulating a choanozoan-sponge ancestry of metazoans. Chromosome-linkage data of pre-metazoans and metazoans (Schulz et al., 2023; https://doi.org/10.1038/s41586-023-05936-6) now indicate a radically different scenario according to which ctenophores represent the ancestral form and are sister to sponges, cnidarians and bilaterians (the Ctenophora-sister hypothesis). This also has implications for the evolution of Wnt signalling, as discussed in the recent Nature Reviews Genetics article by Holzem et al. (2024) (https://doi.org/10.1038/s41576-024-00699-w). Furthermore, it calls into question the hypothesis of a filter-feeding multicellular gastrula-like ancestor as proposed by Haeckel (Maegele et al., 2023). These papers have not yet been referenced, but they would provide a more robust discussion.
 
 I overlooked the excellent work of Holzem and colleagues. I appreciate this suggestion. The work, unfortunately, focusses mainly on the Wnt/β-catenin pathway. The PCP pathway consists of not only core PCP (Fmi, Stbm, Pk, Dgo, Fz and Dsh) but many other components, such as Rho GTPase, which are all dealt with as “PCP” in this review. While the full set of core PCP is present only in the phylum Cnidaria and bilaterians, Pk and Dgo are present in choanoflagellates, and Rho GTPase or ROCK are present even in Fungi (Lapébie et al., 2011, DOI 10.1002/bies.201100023). Holzem et al. described PCP as absent in ctenophores, likely based on the lack of Fmi/Stbm, while claiming its presence in fungi based on Rho GTPase and ROCK. This led to their argument that the Wnt/β-catenin pathway is more ancestral, supported by the absence of PCP components in ctenophores alongside the ctenophore-sister hypothesis.
 
 This likely reflects the limited attention given to PCP in the metazoan evolutionary biology community. Our work sheds light on the importance of PCP regulation in metazoan evolution. In the revised Discussion, we emphasise this point together with the importance of cell biology studies in basal metazoans and compare them based on functional studies.
 
 The observation of Aiptasia’s predatory “gastrula-like” larvae is indeed fascinating. Understanding how early metazoan ancestors obtained nutrients is a key to uncovering the origins of metazoans. However, the relevance of this work to metazoan evolution remains unclear. Predatory nutrient uptake is common among cnidarians, and the findings of Maegele et al. could suggest that, within Cnidaria, the predatory gastrula-like state is ancestral and the symbiotic state derived, but they do not particularly support this for Metazoa as a whole. It also has to be clarified how predation is defined. Fundamentally, there is little distinction between filter-feeding and predatory feeding regarding heterotrophy; both feeding types require digestive machinery. If active feeding behaviour is the essence of predation, this would be better addressed as an evolutionary neurobiology or neuroscience question. Another mystery is what the metazoan ancestors took as food if they were predatory; non-predatory metazoans would already have had to be present to serve as food. Overall, Maegele’s work seems premature to be incorporated into the metazoan evo-devo discussion. In either case, the standard approach would involve comparative studies across taxa. It will be interesting to see follow-up work on comparative and functional genomics of predatory/digestive machinery within the phylum Cnidaria and across metazoans, including sponges and ctenophores.
 
 Reviewer #2 (Recommendations for the authors):
 
 We appreciate the reviewer’s expertise and recommendations regarding Wnt and PCP signalling. It would be our great pleasure if our work is seen and referenced by the cell biology community using model animals.
 
 (1) According to the 2-step model, one would expect that there is a temporal gradient in the spreading of the PCP from oral to aboral. Is there any indication for this?
 
 The best indication of a spatial and temporal gradient of PCP establishment observed so far is at the blastula stage (Fig.2B). PCP gradually becomes coordinated starting at 9 hpf, when PCP is slightly better organised close to the Wnt3-positive area (oral) compared to distal (aboral) areas. We did live imaging with tagged Poc1 to track the positions of centrioles in each cell (Fig. 2E), but this did not provide any further information about the spreading of the PCP. We hypothesise that there is a delay between PCP polarisation—established through the subcellular localisation of core PCP components—and its structural manifestation as ciliary positioning and orientation. This delay likely varies between cells, preventing the formation of a precise spatial PCP wave. We hope in the future to address this temporal aspect by live-imaging of core PCP proteins labelled with fluorescent proteins.
 
 (2) PCP is likely to be an all-or-nothing effect, while axial patterning is dose-dependent. Is there a critical dose of Wnt3 level required to kick off the PCP pathway?
 
 We agree that the PCP phenotype is all-or-nothing. Although we did not perform a quantitative test, we have not seen any intermediate phenotypes in Wnt3-rescue experiments. In our experimental condition (100 ng/µl mRNA), Wnt3 mRNA injection into a blastomere consistently restores the body axis (via PCP) of Wnt3-MO-injected embryos. No axis restoration was observed at 1 ng/µl. At 10 ng/µl, some embryos showed a restored elongated axis, while others showed no axis. The volume of injection is not precisely controllable and can easily vary two-fold, so we assume the threshold is somewhere around 10 ng/µl. This contrasts with endoderm rescue via Wnt/β-catenin activation by GSK3β inhibitors (alsterpaullone) or the constitutively active β-catenin (CA-β-cat), which occurs in a dose-dependent manner (e.g. Supplementary Figure S2).
 
 (3) The key question left unaddressed is whether Wnt3 signals through one or two different Frizzled receptors? Which Frizzled receptors are candidates for this? Could they be knocked down to see which pathway (or both) is affected?
 
 How Wnt3 orientates the PCP system is an extremely interesting question that needs to be answered, and we plan to address this in the future. In Clytia, four Frizzled genes have been identified in the genome: CheFz1 (vertebrate counterpart of Fz1, 2, 3, 6 and 7), CheFz2 (Fz5 and 8), CheFz3 (Fz9/10) and CheFz4 (Fz4). Knockdown of CheFz1, hereafter called Fz1, by Morpholino showed a PCP phenotype (Momose 2012, supplementary data). For a long time, we have suspected that the most likely candidate for PCP mediation is CheFz1. The Wnt3-rescue experiment in a CheFz1-blocked background (similar experiment to Figure 3E, F) could potentially have answered this question. No PCP orientation would be expected even near the Wnt3-mRNA injected area if CheFz1 was the Wnt3 receptor for PCP orientation. Unfortunately, no reliable PCP phenotype was observed in this experiment, so this experiment was not included in the manuscript. We initially thought this was due to incomplete suppression of CheFz1 mRNA translation by the Morpholino when used at sub-toxic doses. But we now favour the alternative explanation that Fz1 does not mediate the Wnt3 signal responsible for initiating PCP orientation. We have previously shown that Fz1 is required for the Wnt/β-catenin pathway (indicated by nuclear β-catenin localisation; Momose 2007), which is then required to maintain Wnt3 expression. We cannot rule out that the PCP phenotype obtained previously following Fz1 knockdown (supplementary data in Momose 2012) is an indirect effect of Wnt3 downregulation.
 
 In future work, we plan to test the PCP involvement of the other Clytia Frizzleds, notably CheFz2 and CheFz4, which are not present as maternal mRNAs but are zygotically expressed in the early gastrula stage. CheFz3 is unlikely to be a candidate because it is aborally localised and acts as a negative receptor for the Wnt/β-catenin pathway (Momose 2007). Lastly, in unpublished experiments, no axial phenotype was obtained with ROR and RYK knockdown by Morpholino (T. Momose unpublished).
 
 Based on these considerations, our current working hypothesis is that Wnt3 somehow stabilises or activates one of the Frizzled receptors acting as a core PCP protein in a polarised manner, likely at the oral side of each cell (Stbm is localised at the aboral side), which breaks the PCP symmetry and is propagated across the body axis.
 
 A few lines have been added to the discussion regarding this point.
 
 (4) Is there also PCP within the Wnt3 expressing domain? In other words, (and linked to question 2), does PCP require a certain concentration of Wnt3 or a gradient of Wnt3 in order to provide an orientation?
 
 In the context of a simple Wnt3-MO rescue experiment, PCP is coordinated within the Wnt3-positive area. But this could be because PCP can propagate in both orientations, so it does not answer the question. In the Wnt3-rescue experiments in Fmi-MO and Stbm-MO embryos, PCP seemed better oriented close to the boundary between Wnt3-positive and -negative areas, in particular outside the Wnt3-positive area, and rather uncoordinated deep in the middle of the Wnt3-RNA-positive area.
 
 If Wnt3 expression is uniform across an embryo, as achieved by Wnt3-mRNA injection into the egg, the axis will be lost entirely (Momose 2008). We interpret these observations as indicating that Wnt3 expression "contrasts" (or steep gradients) act as the PCP orientation cue, rather than Wnt3 acting in a permissive manner.
 
 In normal development, mRNA expression detected by in situ hybridisation has a slight gradient, but we do not have any information about the endogenous protein distribution.
 
 We greatly appreciate the reviewer’s insightful comments. A few sentences addressing points (2) and (4) have been added. The graphical models in Figures 4 and 6A have been updated. While these are relatively minor changes to the manuscript, they significantly impact future perspectives.
 
 Minor comments:
 
 (1) Labeling in some of the figures is too small and not legible, e.g. Figures 4E-H. Please check and improve.
 
 Agreed. Some labelling was way too small (2.5 points). This has been corrected. The minimum font size is now 6-point for most labelling in the revised Figures.
 
 (2) Page 13: ...and allow us to novel scenarios for PCP-driven axis symmetry breaking... seems to lack the verb "propose"
 
 Corrected.
 
 AuthorResponse
URL

biorxiv.org/content/10.1101/2024.09.05.609312v3
www.biorxiv.org

Tissue resident memory CD4+ T cells are sustained by site-specific levels of self-renewal and continuous replacement

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This paper provides a compelling and rigorous quantitative analysis of the turnover and maintenance of CD4+ tissue-resident memory T cell clones, in the skin and the lamina propria. It provides a fundamental advance in our understanding of CD4 T cell regulation. Interestingly, in both tissues, maintenance involves an influx from progenitors on the time scale of months. The evidence that is based on fate mapping and mathematical inference is strong, although open questions on the interpretation of the Ki67-based fate mapping remain.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  Compelling and clearly described work that combines two elegant cell fate reporter strains with mathematical modelling to describe the kinetics of CD4+ TRM in mice. The aim is to investigate the cell dynamics underlying maintenance of CD4+TRM.
  
  The main conclusions are that 1) CD4+ TRM are not intrinsically long-lived 2) even clonal half lives are short: 1 month for TRM in skin, even shorter (12 days) for TRM in lamina propria 3) TRM are maintained by self-renewal and circulating precursors.
  
  Strengths:
  
  (1) Very clearly and succinctly written. Though in some places too succinctly! See suggestions below for areas I think could benefit from more detail.
  
  (2) Powerful combination of mouse strains and modelling to address questions that are hard to answer with other approaches.
  
  (3) The modelling of different modes of recruitment (quiescent, neutral, division linked) is extremely interesting and often neglected (for simpler neutral recruitment).
  
  Comments on revised version: This reviewer is satisfied with the author responses and the changes made in the manuscript.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  This manuscript addresses a fundamental problem of immunology - the persistence mechanisms of tissue-resident memory T cells (TRMs). It introduces a novel quantitative methodology, combining the in vivo tracing of T cell cohorts with rigorous mathematical modeling and inference. Interestingly, the authors show that immigration plays a key role for maintaining CD4+ TRM populations in both skin and lamina propria (LP), with LP TRMs being more dependent on immigration than skin TRMs. This is an original and potentially impactful manuscript.
  
  Comments on revised version: This reviewer is satisfied with the author responses and the changes made in the manuscript.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews
  
  Reviewer #1 (Public review):
  
  Summary:
  
  Compelling and clearly described work that combines two elegant cell fate reporter strains with mathematical modelling to describe the kinetics of CD4+ TRM in mice. The aim is to investigate the cell dynamics underlying the maintenance of CD4+TRM.
  
  The main conclusions are that:
  
  (1) CD4+ TRM are not intrinsically long-lived.
  
  (2) Even clonal half-lives are short: 1 month for TRM in skin, and even shorter (12 days) for TRM in lamina propria.
  
  (3) TRM are maintained by self-renewal and circulating precursors.
  
  Strengths:
  
  (1) Very clearly and succinctly written. Though in some places too succinctly! See suggestions below for areas I think could benefit from more detail.
  
  (2) Powerful combination of mouse strains and modelling to address questions that are hard to answer with other approaches.
  
  (3) The modelling of different modes of recruitment (quiescent, neutral, division linked) is extremely interesting and often neglected (for simpler neutral recruitment).
  
  Weaknesses/scope for improvement:
  
  (1) The authors use the same data set that they later fit for generating their priors. This double use of the same dataset always makes me a bit squeamish as I worry it could lead to an underestimate of errors on the parameters. Could the authors show plots of their priors and posteriors to check that the priors are not overly-influential? Also, how do differences in priors ultimately influence the degree of support a model gets (if at all)? Could differences in priors lead to one model gaining more support than another?
  
  We now show the priors and posteriors overlaid in Figure S2. The posteriors lie well within the priors, giving us confidence that the priors are not overly influential.
  
  (2) The authors state (line 81) that cells were "identified as tissue-localised by virtue of their protection from short-term in vivo labelling (Methods; Fig. S1B)". I would like to see more information on this. How short is short term? How long after labelling do cells need to remain unlabelled in order to be designated tissue-localised (presumably label will get to tissue pretty quickly -within hours?). Can the authors provide citations to defend the assumption that all label-negative cells are tissue-localised (no false negatives)?
  
  And conversely that no label-positive cells can be found in the tissue (no false positives)? I couldn't actually find the relevant section in the methods and Figure S1B didn't contain this information.
  
  We did describe the in vivo labeling in the first section of Methods (it was for 3 mins before sacrifice). The two aims of Fig S1B were to show the gating strategy (label-positive and negatives from tissue samples were clearly separated) and to address the false-positive issue. Less than 3% of cells in our tissue samples were positive; therefore, at most 3% of truly tissue-resident cells acquired the i.v. label, and likely less. Excluding those (as we did) therefore makes little difference to our analyses in terms of cell numbers. False negative rates are expected to be extremely low; labeling within circulating cells is typically >99% (see refs in Methods).
  
  (3) Are the target and precursor populations from the same mice? If so is there any way to reflect the between-individual variation in the precursor population (not captured by the simple empirical fit)? I am thinking particularly of the skin and LP CD4+CD69- populations where the fraction of cells that are mTOM+ (and to a lesser extent YFP+) spans virtually the whole range. Would it be nice to capture this information in downstream predictions if possible?
  
  This is a great point. We do indeed isolate all populations from each mouse. We are very aware of the advantages of using this grouping of information to reduce within-mouse uncertainty – we employ this as often as we can. The issue here was that the label content within the tissue (target) at any time depends on the entire trajectory of the label frequency in the precursor, in that mouse, up to that point. We can’t identify this curve for each animal individually – so we are obliged to use a population average.
  
  To mitigate this lack of pairing we do take a very conservative approach and fit this empirical function describing the trajectories of YFP and mTom in precursors at the same time as the label kinetics in the target; that is, we account for uncertainty in label influx in our fits and parameter estimates.
  
  Another issue is that to be sure that we are performing model selection appropriately, we only use the distribution of the likelihood on the target observations when comparing support for different precursors with LOO-IC. If we had been able to pair the precursor and target data in some way, the two would then be entangled and model comparison across precursors would not be possible.
  
  We’ve added some of this to the discussion.
  
  (4) In Figure 3, estimates of kinetics for cells in LP appear to be more dependent on the input model (quiescent/neutral/division-linked) than the same parameters in the skin. Can the authors explain intuitively why this is the case?
  
  This is a nice observation and it has a fairly straightforward explanation. As we pointed out in the paper, estimated rates of self renewal become more sensitive to the mode of recruitment the greater the rate of influx. If immigrants are quiescent, all Ki67 in the tissue has to be explained by self renewal. If all new immigrants are Ki67 high, the estimate of the rate of self renewal within the tissue will be lower. Across the board, the estimated rates of influx into gut were consistently higher than those in skin, and so the sensitivity of parameters to the mode of recruitment was much more obvious at that site.
  
  The importance of this trade-off for the division linked model can also be seen when you look at the neutral and quiescent models; they give similar parameter estimates because the Ki67 levels within all precursor populations were all less than 25% and so those two modes of recruitment are difficult to distinguish.
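
  The trade-off described above can be made concrete with a minimal steady-state sketch. It is not the model fitted in the paper. It assumes a tissue population of constant size, a per-capita influx rate `theta`, a self-renewal (division) rate `rho`, a fraction `f_in` of immigrants that are Ki67-high on entry, loss of Ki67 at a constant rate `beta` after division, two Ki67-high daughters per division, and a loss rate that does not depend on Ki67 status. Under these assumptions the Ki67-high fraction is k = (f_in·theta + 2·rho) / (beta + theta + 2·rho), which can be inverted to give the self-renewal rate implied by an observed k. All symbols, function names and numerical values are illustrative assumptions.

```python
def ki67_fraction(rho, theta, f_in, beta):
    """Steady-state Ki67-high fraction in the toy model (assumed form)."""
    return (f_in * theta + 2.0 * rho) / (beta + theta + 2.0 * rho)

def implied_self_renewal(k_obs, theta, f_in, beta):
    """Self-renewal rate needed to explain an observed Ki67-high fraction."""
    return (k_obs * (beta + theta) - f_in * theta) / (2.0 * (1.0 - k_obs))

if __name__ == "__main__":
    beta = 1.0 / 3.5   # assumed Ki67 loss rate per day (illustrative)
    k_obs = 0.3        # assumed observed Ki67-high fraction (illustrative)
    # Two hypothetical influx rates per day: a low-influx and a high-influx site.
    for label, theta in (("low influx", 0.02), ("high influx", 0.1)):
        quiescent = implied_self_renewal(k_obs, theta, 0.0, beta)
        division_linked = implied_self_renewal(k_obs, theta, 1.0, beta)
        print(label, round(quiescent, 4), round(division_linked, 4))
```

  In this sketch the gap between the quiescent and division-linked estimates of `rho` equals theta / (2·(1 − k)), so it widens in direct proportion to the influx rate. This is the qualitative behaviour described in the response: the higher the influx, the more the inferred self-renewal rate depends on the assumed mode of recruitment.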
  
  (5) Can the authors include plots of the model fits to data associated with the different strengths of support shown in Figure 4? That is, I would like to know what a difference in the strength of say 0.43 compared with 0.3 looks like in "real terms". I feel strongly that this is important. Are all the fits fantastic, and some marginally better than others? Are they all dreadful and some are just less dreadful? Or are there meaningful differences?
  
  This is another good point (and from the author recommendations list, is your most important concern).
  
  We find that a fairly common issue is that models that are clearly distinguished by information criteria or LRTs can often give visually quite similar fits. Our experience is that this is partly due to the fact that models are usually fit on transformed scales (e.g. log for cell counts, logit for fractions) to normalise residuals, and this uncertainty is compressed when one looks at fits on the observed scale (e.g. linear). Another issue in our case is that for each model (precursor, target, and mode of recruitment) we fit 6 time courses simultaneously. Visual comparisons of fits of different models can then be a little difficult or misleading; apparently small differences in each fitted timecourse can add up to quite significant changes in the combined likelihood. We added this to the Discussion.
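
  Both effects mentioned above can be illustrated with a short sketch, using made-up numbers rather than any data from the study. It assumes Gaussian residuals on the logit scale with a common standard deviation `sigma`, ten observations per time course, and six time courses per model; the function names and all values are hypothetical.

```python
import math

def logit(p):
    return math.log(p / (1.0 - p))

def inv_logit(x):
    return 1.0 / (1.0 + math.exp(-x))

def gaussian_loglik(residuals, sigma):
    """Log-likelihood of residuals under a zero-mean normal error model."""
    n = len(residuals)
    return (-0.5 * n * math.log(2.0 * math.pi * sigma ** 2)
            - sum(r * r for r in residuals) / (2.0 * sigma ** 2))

def combined_loglik(timecourses, sigma):
    """Joint log-likelihood: independent time courses simply add."""
    return sum(gaussian_loglik(res, sigma) for res in timecourses)

if __name__ == "__main__":
    # A residual of 0.3 on the logit scale, seen on the linear scale near 0.9.
    print(inv_logit(logit(0.9) + 0.3) - 0.9)

    # Two hypothetical models whose residuals differ only slightly per point.
    sigma = 0.3
    model_a = [[0.25] * 10 for _ in range(6)]
    model_b = [[0.30] * 10 for _ in range(6)]
    print(gaussian_loglik(model_a[0], sigma) - gaussian_loglik(model_b[0], sigma))
    print(combined_loglik(model_a, sigma) - combined_loglik(model_b, sigma))
```

  The first printed value shows how a residual that is substantial on the fitting scale shrinks to a few percentage points when plotted as a fraction near 0.9. The last two show that a modest log-likelihood difference within one time course becomes six times larger when six time courses are fitted jointly.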
  
  The number of models is combinatorial (Fig. 4) so showing them all seems a bit cumbersome. But now in the supporting information (Fig. S3), for each target we show the best, second best, and the worst model fits overlaid, to give a sense of the dynamic range of the models we considered. As you will now see, visual differences among the most strongly supported models were not huge (but refer to our point just above). Measures of out-of-sample prediction error (LOO-IC) discriminated between these models reasonably well, though (weights shown in Fig. 4).
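
  To give a sense of what a difference in model weights means in information-criterion terms, the sketch below converts values on the deviance scale into Akaike-style weights, w_i ∝ exp(−Δ_i/2). This is one common weighting scheme and is used here as an assumption; the response does not state which scheme produced the weights in Figure 4, and the numbers are illustrative. The function names are hypothetical.

```python
import math

def ic_weights(ic_values):
    """Akaike-style weights from information-criterion values (lower is better)."""
    best = min(ic_values)
    relative = [math.exp(-0.5 * (ic - best)) for ic in ic_values]
    total = sum(relative)
    return [r / total for r in relative]

def ic_gap_for_weight_ratio(w_high, w_low):
    """IC difference that yields a given ratio of weights under this scheme."""
    return 2.0 * math.log(w_high / w_low)

if __name__ == "__main__":
    # IC gap corresponding to weights of 0.43 versus 0.30 (illustrative).
    gap = ic_gap_for_weight_ratio(0.43, 0.30)
    print(gap)
    print(ic_weights([100.0, 100.0 + gap, 104.0]))
```

  Under this assumed scheme, weights of 0.43 and 0.30 correspond to an information-criterion difference of less than one unit, which is consistent with the statement that the most strongly supported models give visually similar fits.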
  
  It’s also worth mentioning here that we have substantially greater confidence in the identity of the precursors than in the precise modes of recruitment - you can see this clearly in the groupings of weights in Figure 4A. We did comment on this in the text but now emphasise it more.
  
  (6) Figure 4 left me unclear about exactly which combinations of precursors and targets were considered. Figure 3 implies there are 5 precursors but in Figure 4A at most 4 are considered. Also, Figure 4B suggests skin CD69- were considered a target. This doesn't seem to be specified anywhere.
  
  Thanks for pointing this out. When we were considering CD4+ EM in bulk as target, this population includes CD69- cells; in those fits, therefore, we couldn't use CD69- as a precursor. We now clarify this in the caption. Thanks also for the observation about Figure 4B; we didn’t consider CD69- cells as a target, so we’ve also made that clearer.
  
  Reviewer #2 (Public review):
  
  This manuscript addresses a fundamental problem of immunology - the persistence mechanisms of tissue-resident memory T cells (TRMs). It introduces a novel quantitative methodology, combining the in vivo tracing of T-cell cohorts with rigorous mathematical modeling and inference. Interestingly, the authors show that immigration plays a key role in maintaining CD4+ TRM populations in both skin and lamina propria (LP), with LP TRMs being more dependent on immigration than skin TRMs. This is an original and potentially impactful manuscript. However, several aspects were not clear and would benefit from being explained better or worked out in more detail.
  
  (1) The key observations are as follows:
  
  a) When heritably labeling cells due to CD4 expression, CD4+ TRM labeling frequency declines with time. This implies that CD4+ TRMs are ultimately replenished from a source not labeled, hence not expressing CD4. Most likely, this would be DN thymocytes.
  
  That’s correct.
  
  b) After labeling by Ki67 expression, labeled CD4+ TRMs also decline - This is what Figure 1B suggests. Hence they would be replaced by a source that was not in the cell cycle at the time of labeling. However, is this really borne out by the experimental data (Figure 2C, middle row)? Please clarify.
  
  (2) For potential source populations (Figure 2D): Please discuss these data critically. For example, CD4+ CD69- cells in skin and LP start with a much lower initial labeling frequency than the respective TRM populations. Could the former then be precursors of the latter?
  
  A similar question applies to LN YFP+ cells. Moreover, is the increase in YFP labeling in naïve T cells a result of their production from proliferative thymocytes? How well does the quantitative interpretation of YFP labeling kinetics in a target population work when populations upstream show opposite trends (e.g., naïve T cells increasing in YFP+ frequency but memory cells in effect decreasing, as, at the time of labeling, non-activated = non-proliferative T cells (and hence YFP-) might later become activated and contribute to memory)?
  
  These are good (and related) points. We've added some text to the discussion, paragraphs 2 and 3; we reproduce it here, slightly expanded.
  
  Fig 1B was a schematic but did faithfully reflect the impact of any waning of YFP in a precursor on its kinetics in the targets. However, in our experiments, as you noted, the kinetics of YFP in most of the precursor populations were quite flat. This was due in part to memory subsets being sustained by the increasing levels of YFP within naïve cells from the cohort of thymocytes labeled during treatment. There is also likely some residual permanent labeling of lymphocyte progenitor populations. We discussed this in Lukas Front Imm 2023. (The latter is not a problem; all that matters for our analysis is that we generate a reasonable empirical description of the label kinetics in naive cells, however it arises). YFP is therefore not cleanly washed out in the periphery; and so for models with circulating memory as the tissue precursor, the flatness of their YFP curves leads to rather flat curves in the tissues.
  
  The mTom labelling was more informative as it was clearly diluted out of all peripheral populations by mTom-negative descendants of thymically-derived cells, as you point out in (a).
  
  Regarding (2), re: interpreting the initial levels of labels in precursors and targets. The important point here is that YFP and mTom were induced quickly in all populations we studied; therefore our inferences regarding precursors and targets aren’t informed by the initial levels of label in each. (Imagine a slow precursor feeding a rapidly dividing target; YFP levels in the former would start lower than those in the latter). The causal issue that we think you’re referring to would matter if one expects the targets to begin with no label at all; for instance, in our busulfan chimeric mouse model (e.g. Hogan PNAS 2015) new, thymically derived ‘labelled’ (donor) cells progressively infiltrate replete ‘unlabelled’ (host) populations. In that case, one can immediately reject certain differentiation pathways by looking at the sequence of accrual of donor cells in different subsets.
  
  The trends in YFP and mTom frequencies after treatment do matter for pathway inference, though, because precursor kinetics must leave an imprint on the target. For the case you mentioned, with opposite trends in label kinetics, such models would be unlikely to be supported strongly; indeed, we never saw strong support for naïve cells (strongly increasing YFP) as a direct precursor of TRM (fairly flat).
  
  We’ve added a condensed version of this to the Discussion.
  
  (3) Please add a measure of variation (e.g., suitable credible intervals) to the "best fits" (solid lines in Figure 2).
  
  Added.
  
  (4) Could the authors better explain the motivation for basing their model comparisons on the Leave-One-Out (LOO) cross-validation method? Why not use Bayesian evidence instead?
  
  Bayes factors are very sensitive to priors and are either computationally unstable if calculated with importance sampling methods, or very expensive to calculate if one uses the more stable bridge sampling method. (We also note that fitting just a single model here takes a substantial amount of time). Further, using BF can be unreliable unless one of the models is close to the 'true' data generating model; though they seem to work well, we can be sure that none of our models are! For us, a more tractable and real-world selection criterion is based on the usefulness of a model, for which predictive performance is a reasonable proxy. In this case the mean out-of-sample prediction error (which LOO-IC reflects) is a well-established and valid means of ascribing support to different models.
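As a rough illustration of how out-of-sample prediction error can be turned into relative support, the sketch below converts LOO-IC values into normalised weights using the Akaike-style formula w_i ∝ exp(-ΔLOO-IC_i / 2). This weighting scheme is an assumption made for illustration; the response does not state how the weights in Fig. 4 were computed, and the model names and LOO-IC values here are made up.

```python
import math

def looic_weights(looic):
    """Convert LOO-IC values (lower is better) into normalised weights.

    Uses w_i proportional to exp(-0.5 * (LOOIC_i - min LOOIC)); this is an
    illustrative choice, not necessarily the scheme used in the study.
    """
    best = min(looic.values())
    raw = {name: math.exp(-0.5 * (value - best)) for name, value in looic.items()}
    total = sum(raw.values())
    return {name: r / total for name, r in raw.items()}

# Hypothetical LOO-IC values for three candidate models.
example = {"model_A": 100.0, "model_B": 102.0, "model_C": 110.0}
weights = looic_weights(example)
```

With this formula, a LOO-IC difference of 2 between two models corresponds to a weight ratio of e (about 2.7), so closely ranked models share support while clearly worse ones receive almost none.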
  
  AuthorResponse
URL

biorxiv.org/content/10.1101/2024.09.26.615039v3
www.biorxiv.org

Drosophila Hamlet mediates epithelial tissue assembly of the reproductive system

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important study addresses an essential morphogenetic process, epithelial fusion, by identifying the transcription factor Hamlet as a potential master regulator. Using a combination of genetic, cell biological, and omics approaches, including a comprehensive RNAi screen and high-quality imaging, the authors provide compelling evidence for Hamlet's role in coordinating cell fate and differentiation. The findings are robust and of broad interest to developmental biologists and geneticists.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  Wang et al. identify Hamlet, a PR-containing transcription factor, as a master regulator of reproductive development in Drosophila. Specifically, it is required for the fusion between the gonad and genital disc that is necessary for development of continuous testis and seminal vesicle tissue essential for fertility. To do so, the authors generate novel Hamlet null mutants by CRISPR/Cas9 gene editing and characterize the morphological, physiological, and gene expression changes of the mutants using immunofluorescence, RNA-seq, cut-tag, and in-situ analysis. Thus, Hamlet is discovered to regulate a unique expression program, which includes Wnt2 and Tl, that is necessary for testis development and fertility.
  
  Strengths:
  
  This is a rigorous and comprehensive study that identifies the Hamlet-dependent gene expression program mediating reproductive development in Drosophila. The Hamlet transcription targets are further characterized by Gal4/UAS-RNAi, confirming their role in reproductive development. Finally, the study points to a role for Wnt2 and Tl as well as other Hamlet transcriptionally regulated genes in epithelial tissue fusion.
  
  Weaknesses:
  
  None noted.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Strengths:
  
  Wang and colleagues successfully uncovered an important function of the Drosophila PRDM16/PRDM3 homolog Hamlet (Ham) - a PR domain containing transcription factor with known roles in the nervous system in Drosophila. To do so, they generated and analyzed new mutants lacking the PR domain, and also employed diverse preexisting tools. In doing so, they made a fascinating discovery: They found that PR-domain containing isoforms of ham are crucial in the intriguing development of the fly genital tract. Wang and colleagues found three distinct roles of Ham: (1) Specifying the position of the testis terminal epithelium within the testis, (2) allowing normal shaping and growth of the anlagen of the seminal vesicles and paragonia and (3) enabling the crucial epithelial fusion between the seminal vesicle and the testis terminal epithelium. The mutant blocks fusion even if the parts are positioned correctly. The last finding is especially important, as there are few models allowing one to dissect the molecular underpinnings of heterotypic epithelial fusion in development. Their data suggest that they found a master regulator of this collective cell behavior. Further, they identified some of the cell biological players downstream of Ham, like for example E-Cadherin and Crumbs. In a holistic approach, they performed RNAseq and intersected them with the CUT&TAG-method, to find a comprehensive list of downstream factors directly regulated by Ham. Their function in the fusion process was validated by a tissue-specific RNAi screen. Meticulously, Wang and colleagues performed multiplexed in situ hybridization and analyzed different mutants, to gain a first understanding of the most important downstream-pathways they characterized - which are Wnt2 and Toll.
  
  This study pioneers a completely new system. It is a model for exploring a process crucial in morphogenesis across animal species, yet not well-understood. Wang and colleagues not only identified a crucial regulator of heterotypic epithelial fusion but took on the considerable effort of meticulously pinning down functionally important downstream effectors by using many state-of-the-art methods. This is especially impressive, as dissection of pupal genital discs before epithelial fusion is a time-consuming and difficult task. This promising work will be the foundation future studies build on, to further elucidate how this epithelial fusion works, for example on a cell biological and biomechanical level.
  
  Weaknesses:
  
  The developing testis-genital disc system has many moving parts. Myotube migration was previously shown to be crucial for testis shape. This means that there is the potential of non-tissue autonomous defects upon knockdown of genes in the genital disc or the terminal epithelium, affecting myotube behavior which in turn affects epithelial fusion, as myotubes might create the first "bridge" bringing the two epithelia together. Nevertheless, this is outside the scope of this work and could be addressed in the future.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews
  
  Reviewer #1 (Public review):
  
  Summary:
  
  Wang et al. identify Hamlet, a PR-containing transcription factor, as a master regulator of reproductive development in Drosophila. Specifically, the fusion between the gonad and genital disc is necessary for the development of continuous testes and seminal vesicle tissue essential for fertility. To do this, the authors generate novel Hamlet null mutants by CRISPR/Cas9 gene editing and characterize the morphological, physiological, and gene expression changes of the mutants using immunofluorescence, RNA-seq, cut-tag, and in-situ analysis. Thus, Hamlet is discovered to regulate a unique expression program, which includes Wnt2 and Tl, that is necessary for testis development and fertility.
  
  Strengths:
  
  This is a rigorous and comprehensive study that identifies the Hamlet-dependent gene expression program mediating reproductive development in Drosophila. The Hamlet transcription targets are further characterized by Gal4/UAS-RNAi confirming their role in reproductive development. Finally, the study points to a role for Wnt2 and Tl as well as other Hamlet transcriptionally regulated genes in epithelial tissue fusion.
  
  We appreciate that the reviewer thinks our study is rigorous.
  
  Weaknesses:
  
  The image resolution and presentation of figures is a major issue in this study. As a nonexpert, it is nearly impossible to see the morphological changes as described in the results. Quantification of all cell biological phenotypes is also lacking, therefore reducing the impact of this study to those familiar with tissue fusion events in Drosophila development.
  
  In the revised version, we have improved the image presentation and resolution. For all the images with more than two channels, we included single-channel images, changed the green color to lime and the red to magenta, and highlighted the testis (TE) and seminal vesicles to make morphological changes more visible.
  
  We had quantification for marker gene expression in the original version, and have now also included quantification for cell biological phenotypes, which generally show 100% penetrance.
  
  Reviewer #2 (Public review):
  
  Strengths:
  
  Wang and colleagues successfully uncovered an important function of the Drosophila PRDM16/PRDM3 homolog Hamlet (Ham) - a PR domain-containing transcription factor with known roles in the nervous system in Drosophila. To do so, they generated and analyzed new mutants lacking the PR domain, and also employed diverse preexisting tools. In doing so, they made a fascinating discovery: They found that PR-domain containing isoforms of ham are crucial in the intriguing development of the fly genital tract. Wang and colleagues found three distinct roles of Ham: (1) specifying the position of the testis terminal epithelium within the testis, (2) allowing normal shaping and growth of the anlagen of the seminal vesicles and paragonia and (3) enabling the crucial epithelial fusion between the seminal vesicle and the testis terminal epithelium. The mutant blocks fusion even if the parts are positioned correctly. The last finding is especially important, as there are few models allowing one to dissect the molecular underpinnings of heterotypic epithelial fusion in development. Their data suggest that they found a master regulator of this collective cell behavior. Further, they identified some of the cell biological players downstream of Ham, like for example E-Cadherin and Crumbs. In a holistic approach, they performed RNAseq and intersected them with the CUT&TAG-method, to find a comprehensive list of downstream factors directly regulated by Ham. Their function in the fusion process was validated by a tissue-specific RNAi screen. Meticulously, Wang and colleagues performed multiplexed in situ hybridization and analyzed different mutants, to gain a first understanding of the most important downstream pathways they characterized, which are Wnt2 and Toll.
  
  This study pioneers a completely new system. It is a model for exploring a process crucial in morphogenesis across animal species, yet not well understood. Wang and colleagues not only identified a crucial regulator of heterotypic epithelial fusion but took on the considerable effort of meticulously pinning down functionally important downstream effectors by using many state-of-the-art methods. This is especially impressive, as the dissection of pupal genital discs before epithelial fusion is a time-consuming and difficult task. This promising work will be the foundation future studies build on, to further elucidate how this epithelial fusion works, for example on a cell biological and biomechanical level.
  
  We appreciate that the reviewer thinks our study is original and important.
  
  Weaknesses:
  
  The developing testis-genital disc system has many moving parts. Myotube migration was previously shown to be crucial for testis shape. This means that there is the potential of non-tissue autonomous defects upon knockdown of genes in the genital disc or the terminal epithelium, affecting myotube behavior which in turn affects fusion, as myotubes might create the first "bridge" bringing the epithelia together. The authors clearly showed that their driver tools do not cause expression in myoblasts/myotubes, but this does not exclude non-tissue autonomous defects in their RNAi screen. Nevertheless, this is outside the scope of this work.
  
  We thank the reviewer for considering non-tissue autonomous defects upon gene knockdown. The driver, hamRSGal4, drives reporter gene expression mainly in the RS epithelia, but we did observe weak expression of the reporter in the myoblasts before they differentiate into myotubes. Thus, we could not rule out a non-tissue autonomous effect in the RNAi screen. We have therefore included a statement in the Results: “Given that the hamRSGal4 driver is highly expressed in the TE and SV epithelia, we expect highly effective knockdown occurs only in these epithelial cells. However, hamRSGal4 also drives weak expression in the myoblasts before they differentiated into myotubes (Supplementary Fig. 5B), which may result in a non-tissue autonomous effect when knocking down the candidate genes expressed in myoblasts.”
  
  However, one point that could be addressed in this study: the RNAseq and CUT&TAG experiments would profit from adding principal component analyses, elucidating similarities and differences of the diverse biological and technical replicates.
  
  Thanks for the suggestion. We now have included the PCA analyses in supplementary figure 6A-B and the corresponding description in the text. The PCA graphs validated the consistency between biological replicates of the RNA-seq samples. The Cut&Tag graphs confirm the consistency between the two biological replicates from the GFP samples, but show a higher variability between the w1118 replicates. Importantly, we only considered the overlapped peaks pulled by the GFP antibody from the ham_GFP genotype and the Ham antibody from the wildtype (w1118) sample as true Ham binding sites.
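The peak-intersection step described above can be sketched as follows. This is a minimal illustration that assumes peaks are (chromosome, start, end) tuples with half-open coordinates and that "overlapped" means sharing at least one base pair; the response does not specify the overlap criterion or the tool used, and the function name and coordinates are hypothetical.

```python
def overlapping_peaks(peaks_a, peaks_b):
    """Return the peaks in peaks_a that overlap at least one peak in peaks_b.

    Each peak is a (chrom, start, end) tuple with half-open coordinates,
    so two peaks that merely touch end-to-start do not count as overlapping.
    """
    # Group the second peak set by chromosome for lookup.
    by_chrom = {}
    for chrom, start, end in peaks_b:
        by_chrom.setdefault(chrom, []).append((start, end))

    kept = []
    for chrom, start, end in peaks_a:
        if any(start < b_end and b_start < end
               for b_start, b_end in by_chrom.get(chrom, [])):
            kept.append((chrom, start, end))
    return kept

# Made-up coordinates standing in for the two antibody peak sets.
gfp_peaks = [("2R", 100, 200), ("2R", 500, 600), ("3L", 50, 80)]
ham_peaks = [("2R", 150, 250), ("3L", 80, 120), ("X", 10, 20)]
shared = overlapping_peaks(gfp_peaks, ham_peaks)
```

Requiring support from both antibodies in this way is conservative: a peak seen with only one antibody, such as one arising from a more variable replicate, is dropped.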
  
  Recommendations for the authors:
  
  Reviewer #1 (Recommendations for the authors):
  
  Major Concern:
  
  (1) The image resolution and presentation of figures (Figures 2, 5, 6, and 7) is a major issue in this study. As a non-expert, it is nearly impossible to see the morphological changes as described in the results. Images need to be captured at higher resolution and zoomed in with arrows denoting changes as described. Individual channels, particularly for intensity measurement, need to be shown in black and white in addition to merged images. Images also need to be pseudo-colored for color-blind individuals (i.e. no red-green staining).
  
  The images were captured at a high resolution, but somehow the resolution was dramatically reduced in the bioRxiv PDF. We tried to overcome this by directly submitting the PDF in the eLife submission system. In the revised version, we have included single-channel images and changed the green and red colors to lime and magenta for color blindness. We also highlighted the testis (TE) and seminal vesicle structures in the images to make morphological changes more visible.
  
  (2) The penetrance of morphological changes observed in RT development is also unclear and needs to be rigorously quantified for data in Figures 2, 5, and 7.
  
  We have now included quantification for cell biological phenotypes, which generally show 100% penetrance. The penetrance percentage and the number of animals used are indicated in each corresponding image.
  
  Reviewer #2 (Recommendations for the authors):
  
  Major Points
  
  (1) Lines 193- 220 I would strongly suggest pointing out the obvious shape defects of the testes visible in Figure 2A ("Spheres" instead of "Spirals"). These are probably a direct consequence of a lack in the epithelial connection that myotubes require to migrate onto the testis (in a normal way) as depicted in the cartoons, allowing the testis to adopt a spiral shape through myotube-sculpting (Bischoff et al., 2021), further confirming the authors' findings!
  
  Good point. In the revised text, we have added more description of the testis shape defects and pointed out a potential contribution from compromised myotube migration.
  
  (2) Line 216: "Often separated from each other". Here it would be important to mention how often. If the authors cannot quantify that from existing data, I suggest carrying it out in adult/pharate adult genital tracts (if there is no strong survivor bias due to the lethality of stronger affected animals), as this is much easier than timing prepupae. This should be a quick and easy experiment.
  
  Because it is hard to tell whether the separation of the SV and TE was caused by developmental defects or sometimes could be due to technical issues (bad dissection), we have now changed the description to, “control animals always showed connected TE and SV, whereas ham mutant TE and SV tissues were either separated from each other, or appeared contacted but with the epithelial tubes being discontinuous (Fig. 2B).” Additionally, we quantified the disconnection phenotype, which shows 100% penetrance in 18 mutant animals. This quantification is now included in the figure.
  
  (3) Lines 289-305, Figure 3. I could only find how many replicates were analyzed in the RNAseq/CUT&Tag experiments in the Material & Methods section. I would add that at least in the figure legends, and perhaps even in the main text. Most importantly, I would add a Principal Component Analysis (one for RNAseq and one for the CUT&TAG experiment), to demonstrate the similarity of biological replicates (3x RNaseq, 4x Cut&Tag) but also of the technical replicates (RNAseq: wt & wt/dg, ham/ham & ham/df, GD & TE; CUT&TAG: Antibody & GFP-Antibody, TG&TE...). This should be very easy with the existing data, and clearly demonstrate similarities & differences in the different types of replicates and conditions.
  
  Principal component analysis and its description are now added to Supplementary Fig 6 and the main text respectively.
  
  (4) Line 321; Supplementary Table 1: In the table, I cannot find which genes are down- or upregulated - something that I think is very important. I would add that, and remove the "color" column, which does not add any useful information.
  
  In Supplementary table 1, the first sheet includes upregulated genes while the second sheet includes downregulated genes. We removed the column “color” as suggested.
  
  (5) Line 409: SCRINSHOT was carried out with candidate genes from the screen. One gene I could not find in that list was the potential microtubule-actin crosslinker shot. If shot knockdown caused a phenotype, then I would clearly mention and show it. If not, I would mention why shot is important, nonetheless.
  
  shot is one of the candidate target genes selected from our RNA-seq and Cut&Tag data. However, in the RNAi screen, knocking down shot with the available RNAi lines did not cause any obvious phenotype. This could be due to inefficient RNAi knockdown or redundancy with other factors. We nevertheless wanted to examine the shot expression pattern in the developing RS, given the important role of shot in epithelial fusion (Lee S., 2002). Using SCRINSHOT, we could detect epithelial-specific expression of shot, implying its potential function in this context. We have now revised the text to clarify this point.
  
  Minor points
  
  (1) Cartoons in Figure 1: The cartoons look like they were inspired by the cartoon from Kozopas et al., 1998 Fig. 10 or Rothenbusch-Fender et al., 2016 Fig 1. I think the manuscript would greatly profit from better cartoons, that are closer to what the tissue really looks like (see Figure 1H, 2G), to allow people to understand the somewhat complicated architecture. The anlagen of the seminal vesicles/paragonia looks like a butterfly with a high columnar epithelium with a visible separation between paragonia/seminal vesicles (upper/lower "wing" of the "butterfly"). Descriptions like "unseparated" paragonia/seminal vesicle anlagen, would be much easier to understand if the cartoons would for example reflect this separation. It would even be better to add cartoons of the phenotypic classes too, and to put them right next to the micrographs. (Another nitpick with the cartoons: pigment cells are drastically larger and fewer in number (See: Bischoff et al., 2021 Figure 1E & MovieM1).)
  
  Thanks for the suggestion. We have updated Figure 1 by adding additional illustrations showing the accessory gland and seminal vesicle structures in the pupal stage and changing the size of pigment cells.
  
  (2) Line 95-121 I would also briefly introduce PR domains, here.
  
  We have added a brief description of the PR domains.
  
  (3) Line 152, 158, 160, 162. When first reading it, I was a bit confused by the usage of the word sensory organ. I would at least introduce that bristles are also known as external mechanosensory organs.
  
  We have now revised the description to “mechano-sensory organ”.
  
  E.g. Line 184, 194, and many more. Most times, the authors call testis muscle precursors "myoblasts". This is correct sometimes, but only when referring to the stage before myoblast-fusion, which takes place directly before epithelial fusion (28 h APF). Post-myoblast-fusion (e.g. during migration onto the testis), these cells should be called myotubes or nascent myotubes, as the fly muscle community defined the term myoblast as the single-nuclei precursors to myotubes.
  
  We have now revised the description accordingly.
  
  (4) Line 217/Figure 2B. It looks like there is a myotube bridge between the testis and the genital disc. I would point that out if it's true. If the authors have a larger z-stack of this connection, I suggest creating an MIP, and checking if there are little clusters of two/three/four nuclei packed together. This would clearly show that the cells in between are indeed myotubes (granted that loss of ham does not introduce myoblast-fusion-defects).
  
  We do not have a Z-stack of this connection, and thus cannot confirm whether the cells in this image are myotubes. However, we found that myotubes can migrate onto the testis and form the muscular sheet in the ham mutant despite reduced myotube density. At the junction there are myotubes, suggesting that loss of ham does not introduce myoblast-fusion defects. These results are now included in the revised manuscript, supplementary Fig. 5 C-D.
  
  (5) Line 231/Supplementary Fig. 3C-G: I would add to the cartoons, where the different markers are expressed.
  
  We have added marker gene expression in the cartoons.
  
  (6) Line 239. I don't see what Figure 1A/1H refers to, here. I would perhaps just remove it.
  
  Yes, we have removed it.
  
  (7) Line 232. I would rephrase the beginning of the sentence to: Our data suggest Ham to be...
  
  Yes, we have revised it.
  
  (8) Line 248-250/Figure 2F. Clonal analyses are great, but I think single channels should be shown in black and white. Also, a version without the white dashed line should be shown, to clearly see the differences between wt and ham-mutant cells.
  
  Now single channel images from the green and red images are presented in Supplementary Figures. This particular one is in Supplementary Figure 3B.
  
  (9) Line 490. The Toll-9 phenotype was identified on the sterility effect/lack-of-sperm phenotype alone, and it was deduced that this suggests connection defects. By showing the right focus plane in Fig S8B (lower right), it should be easy to directly show whether there is a connection defect or not. Also, one would expect clearer testis-shaping defects, like in ham-mutants, as a loss of connection should also affect myotube migration to shape the testis. This is just a minor point, as it only affects supplementary data with no larger impact on the overall findings, even if Toll-9 is shown not to have a defect, after all.
  
  We find that scoring defects at the junction site at the adult stage is difficult and may not always be accurate. Instead, we score the presence of sperm in the SV, which indirectly but firmly suggests successful connection between the TE and SV. We have now included a quantification graph, showing the penetrance of the phenotype in the new Supplementary Fig. 14C. There were indeed morphological defects of TE in Toll-9 RNAi animals. We have now included the image and quantification in the new Supplementary Fig. 14B.
  
  AuthorResponse
URL

biorxiv.org/content/10.1101/2024.04.05.588322v3
www.biorxiv.org

PDZ-directed substrate recruitment is the primary determinant of specific 4EBP1 dephosphorylation by PP1-Neurabin

5
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This important study reports on a basis for neurabin-mediated specification of substrate choice by protein phosphatase-1. The data from the comprehensive approach using structural, biochemical, and computational methods are compelling. This paper is broadly relevant to those investigating various cellular signaling cascades that entail phosphorylation as the main mechanism.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 In this manuscript, Treisman and colleagues address the question of how protein phosphatase 1 (PP1) regulatory subunits (or PP1-interacting proteins (PIPs)) confer specificity on the PP1 catalytic subunit, which by itself possesses little substrate specificity. In prior work the authors showed that the PIP Phactrs confers specificity by remodelling a hydrophobic groove immediately adjacent to the PP1 catalytic site through residues within the RVxF- ø ø -R-W string of Phactrs. Specifically, the residues proximal to and including the 'W' of the RVxF- ø ø -R-W string remodel the hydrophobic groove. The other residues of the RVxF- ø ø -R-W string (i.e. the RVxF- ø ø -R) are not involved in this remodelling.
 
 The authors suggest that the RVxF- ø ø -R-W string is a conserved feature of many PIPs including PNUTS, Neurabin/spinophilin and R15A. However, from a sequence and structural perspective only the RVxF- ø ø -R- is conserved. The W is not conserved in most PIPs, and in the R15A structure (PDB:7NZM) the Trp side chain points away from the hydrophobic channel - this could be a questionable interpretation due to model building into the low resolution cryo-EM map (4 Å).
 
 In this paper the authors convincingly show that Neurabin confers substrate specificity through interactions of its PDZ domain with the PDZ domain-binding motif (PBM) of 4E-BP. They show the PBM motif is required for Neurabin to increase PP1 activity towards 4E-BP and a synthetic peptide modelled on 4E-BP and also a synthetic peptide based on IRSp53 with a PBM added. The PBM of 4E-BP1 confers high affinity binding to the Neurabin PDZ domain. A crystal structure of a PP1-4E-BP1 fusion with Neurabin shows that the PBM of 4E-BP interacts with the PDZ domain of Neurabin. No interactions of 4E-BP and the catalytic site of PP1 are observed. Cell biology work showed that Neurabin-PP1 regulates the TOR signalling pathway by dephosphorylating 4E-BPs.
 
 Strengths:
 
 This work demonstrates convincingly, using a variety of cell biology, proteomics, biophysics and structural biology approaches, that the PP1-interacting protein Neurabin confers specificity on PP1 through an interaction of its PDZ domain with a PDZ-binding motif of 4E-BP1 proteins. Remodelling of the hydrophobic groove of the PP1 catalytic subunit is not involved in Neurabin-dependent substrate specificity, in contrast to how Phactrs confers specificity on PP1. The active site of the Neurabin/PP1 complex does not recognise residues in the vicinity of the phospho-residue, thus allowing for multiple phospho-sites on 4E-BP to be dephosphorylated by Neurabin/PP1. This contrasts with substrate specificity conferred by the Phactrs PIP that confers specificity of Phactrs/PP1 towards its substrates in a sequence-specific context by remodelling the hydrophobic groove immediately adjacent to the catalytic site. The structural and biochemical insights are used to explore the role of Neurabin/PP1 in dephosphorylating 4E-BPs in vivo, showing that Neurabin/PP1 regulates the TOR signalling pathway, specifically mTORC1-dependent translational control.
 
 Weaknesses:
 
 The only weakness is the suggestion that a conserved RVxF- ø ø -R-W string exists in PIPs. The 'W' is not conserved in sequence and 3-dimensions in most of the PIPs discussed in this manuscript. The lack of conservation of the W would be consistent with the finding based on multiple PP1-PIP structures that apart from Phactrs, no other PIP appears to remodel the PP1 hydrophobic channel.
 
 Comments on revisions:
 
 The authors have addressed my comments.
 
 One aspect of the manuscript and response to reviewers is misleading regarding the statement: 'Like many PIPs, they interact with PP1 using the previously defined "RVxF", "ΦΦ", and "R" motifs (Choy et al, 2014).' This statement, and similar in the authors' response, implies that Choy et al discovered the "RVxF" and "ΦΦ" motifs. The Choy et al, 2014 paper reports the discovery of the "R" motif. The "RVxF" and "ΦΦ" motifs were discovered and reported in earlier papers not cited in the authors' manuscript. Perhaps the authors can correct this.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 This manuscript explores the molecular mechanisms that are involved in substrate recognition by the PP1 phosphatase. The authors previously showed that the PP1-interacting protein (PIP), Phactr1, conferred substrate specificity by remodelling the PP1 hydrophobic substrate groove. In this work, the authors aimed to understand the key determinant of how other PIPs, Neurabin and Spinophilin, mediate substrate recognition.
 
 The authors generated a few PP1-PIP fusion constructs, undertook TMT phosphoproteomics and validated their method using PP1-Phactr1/2/3/4 fusion constructs. Using this method, the authors identified phosphorylation sites controlled by PP1-Neurabin and focussed their work on 4E-BP1, thereby linking PP1-Neurabin to mTORC1 signalling. Upon validating that PP1-Neurabin dephosphorylates 4E-BP1, they determined that the 4E-BP1 PBM binds to the PDZ domain of Neurabin with an affinity that was greater than 30-fold as compared to other substrates. PP1-Neurabin dephosphorylated 4E-BP1WT and IRSp53WT with a catalytic efficiency much greater than PP1 alone. However, PP1-Neurabin bound to 4E-BP1 and IRSp53 mutants lacking the Neurabin PDZ domain with a catalytic efficiency lower than that observed with 4E-BP1WT. These results indicate the involvement of the PDZ domain in facilitating substrate recruitment by PP1-Neurabin. Interestingly, PP1-Phactr1 dephosphorylation of 4E-BP1 phenocopies PP1 alone, while PP1-Phactr1 dephosphorylates IRSp53 to a much higher extent than PP1 alone. These results highlight the importance of the PDZ domain and also shed light on how different PP1-PIP holoenzymes mediate substrate recognition using distinct mechanisms. The authors also show that the remodelling of the hydrophobic PP1 substrate groove, which is essential for substrate recognition by PP1-Phactr1, was not required by PP1-Neurabin. Additionally, the authors also resolved the structure of a PP1-4E-BP1 fusion with the PDZ-containing C-terminal of Neurabin and observed that the Neurabin/PP1-4E-BP1 complex structure was oriented at 21° to that in the unliganded Spinophilin/PP1 complex (resolved by Ragusa et al., 2010) owing to a slight bend in the C-terminal section that connects it to the RVxF-ΦΦ-R-W string. Since no interaction was observed with the remodelled PP1-Neurabin hydrophobic groove, the authors utilised AlphaFold3 to further answer this. They observed a high confidence of interaction between the groove and phosphorylated substrate and a low confidence of interaction between the groove and unphosphorylated substrate, thereby suggesting that the hydrophobic groove remodelling is not involved in PP1-Neurabin recognition and dephosphorylation of 4E-BP1.
 
 In this work, the authors provide novel insights into how Neurabin depends on the interaction between its PDZ domain and PBM domains of potential substrates to mediate its recruitment by PP1. Additionally, they uncover a novel PP1-Neurabin substrate, 4E-BP1. They systematically employ phosphoproteomics, biochemical and structural methods to investigate substrate specificity in a robust fashion. Furthermore, the authors also compare the interactions between PP1-Neurabin to 4E-BP1 and IRSp53 (PP1-Phactr1 substrate) with PP1-Phactr1, to showcase the specificity of the mode of action employed by these complexes in mediating substrate specificity. The authors do employ an innovative PP1-PIP fusion strategy previously explored by Oberoi et al., 2016 and the authors themselves in Fedoryshchak et al., 2020. This method allows for a more controlled investigation of the interactions between PP1-PIPs and its substrates. Furthermore, the authors have substantially characterised the importance of the PDZ domain using their fusion constructs; however, I believe that a further exploration into either structural or AlphaFold3 modelling of PBM domain substrate mutants, or a Neurabin PDZ-domain mutant might further strengthen this claim. Overall, the paper makes a substantial contribution to understanding substrate recognition and specificity in PP1-PIP complexes. The study's innovative methods, biological relevance, and mechanistic insights are strengths, but whether this mechanism occurs in a physiological context is unclear.
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Protein Phosphatase 1 (PP1), a vital member of the PPP superfamily, drives most cellular serine/threonine dephosphorylation. Despite PP1's low intrinsic sequence preference, its substrate specificity is finely tuned by over 200 PP1-interacting proteins (PIPs), which employ short linear motifs (SLIMs) to bind specific PP1 surface regions. By targeting PP1 to cellular sites, modifying substrate grooves, or altering surface electrostatics, PIPs influence substrate specificity. Although many PIP-PP1-substrate interactions remain uncharacterized, the Phactr family of PIPs uniquely imposes sequence specificity at dephosphorylation sites through a conserved "RVxF-ΦΦ-R-W" motif. In Phactr1-PP1, this motif forms a hydrophobic pocket that favors substrates with hydrophobic residues at +4/+5 in acidic contexts (the "LLD motif"), a specificity that endures even in PP1-Phactr1 fusions. Neurabin/Spinophilin remodel PP1's hydrophobic groove in distinct ways, creating unique holoenzyme surfaces, though the impact on substrate specificity remains underexplored. This study investigates Neurabin/Spinophilin specificity via PDZ domain-driven interactions, showing that Neurabin/PP1 specificity is governed more by PDZ domain interactions than by substrate sequence, unlike Phactr1/PP1.
 
 A significant strength of this work is the use of PP1-PIP fusion proteins to effectively model intact PP1•PIP holoenzymes by replicating the interactions that remodel the PP1 interface and confer site-specific substrate specificity. When combined with proteomic analyses to assess phospho-site depletion in mammalian cells, these fusions offer critical insights into holoenzyme specificity, revealing new candidate substrates for Neurabin and Spinophilin. The studies present compelling evidence that the PDZ domain of PP1-Neurabin directs its specificity, with the remodeled PP1 hydrophobic groove interactions having minimal impact. This mechanism is supported by structural analysis of the PP1-4E-BP1 substrate fusion bound to a Neurabin construct, highlighting the 4E-BP1/PDZ interaction. This work delivers crucial insights into PP1-PIP holoenzyme function, combining biochemical, proteomic, and structural approaches. It validates the PP1-PIP fusion protein model as a powerful tool, suggesting it may extend to studying additional holoenzymes. While an extremely useful model, it must be considered unlikely the PP1-PIP fusions fully recapitulate the specificity and regulation of the holoenzyme.
 
 Review 3
5. Public_Reviews 09 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Response to the public reviews:
 
 We are very pleased to see these positive reviews of our preprint.
 
 Reviewers 1 and 3 raise issues around PIP-PP1 interactions.
 
 (1) Role of the “RVxF-ΦΦ-R-W string”
 
 Most PIPs interact with the globular PP1 catalytic core through short linear interaction motifs (SLiMs), and Choy et al (PNAS 2014) previously showed that many PIPs interact with PP1 through a conserved trio of SLiMs, RVxF-ΦΦ-R, which is also present in the Phactrs.
 
 Previous structural analysis showed that the trajectory of the PPP1R15A/B, Neurabin/Spinophilin (PPP1R9A/B), and PNUTS (PPP1R10) PIPs across the PP1 surface encompasses not only the RVxF-ΦΦ-R trio, but also additional sequences C-terminal to it (Chen et al, eLife, 2015). This extended trajectory is maintained in the Phactr1-PP1 complex (Fedoryshchak et al, eLife, 2020). Based on structural alignment we proposed the existence of an additional hydrophobic “W” SLiM that interacts with the PP1 residues I133 and Y134.
 
 The extended “RVxF-ΦΦ-R-W” interaction brings sequences C-terminal to the “W” SLiM into the vicinity of the hydrophobic groove that adjoins the PP1 catalytic centre. In the Phactr1/PP1 complex, these sequences remodel the groove, generating a novel pocket that facilitates sequence-specific substrate recognition.
 
 This raises the possibility that sequences C-terminal to the extended “RVxF-ΦΦ-R-W string” in the other complexes also confer sequence-specific substrate recognition, and our study aims to test this hypothesis. Indeed, the hydrophobic groove structures of the Neurabin/Spinophilin/PP1 and Phactr1/PP1 complexes differ significantly (Ragusa et al, 2010; see Fedoryshchak et al 2020, Fig2 FigSupp1).
 
 (2) Orientation of the W side chain
 
 Reviewer 1 points out that in the substrate-bound PP1/PPP1R15A/Actin/eIF2 pre-dephosphorylation complex the W sidechain is inverted with respect to its orientation in the PP1-PPP1R15B complex (Yan et al, NSMB 2021). The authors proposed that this may reflect the role of actin in assembly of the quaternary complex. This does not necessarily invalidate the notion that sequences C-terminal to the “W” motif might play a role in actin-independent substrate recognition, and we therefore consider our inclusion of the R15A/B fusions in our analysis to be reasonable.
 
 (3) Conservation of W
 
 The motif ‘W’ does not mandate tryptophan - Phactrs and PPP1R15A/B indeed have W at this position but Neurabin/spinophilin contain VDP, which makes similar interactions. Similarly the “RVxF” motifs in Phactr1, Neurabin/Spinophilin, PPP1R15A/B and PNUTS are LIRF, KIKF, KV(R/T)F and TVTW respectively.
 
 In our revision, we will present comparisons of the differentially remodelled/modified PP1 hydrophobic groove in the various complexes, and discuss the different orientations of the tryptophan in the previously published PPP1R15A/PP1 and PPP1R15B/PP1 structures. We will also address the other issues raised by the referees.
 
 Recommendations for the authors:
 
 Reviewer #1 (Recommendations for the authors):
 
 Comments and suggestions for revisions
 
 (1) The authors do not provide strong evidence that the interaction of the 'W' of the RVxF- øø -R-W string with the hydrophobic groove of PP1 is conserved in PIPs. Whereas the RVxF motif is well conserved and validated since its discovery in 1997, as are the øø - (an extension of the RVxF motif), and the 'R', the Trp residue in the RVxF-øø-R-W string is not conserved.
 
 We did not mean to imply that the W motif is conserved amongst all PIPs.
 
 Most PIPs interact with the globular PP1 catalytic core through short linear interaction motifs (SLiMs). Choy et al (PNAS 2014) previously showed that many PIPs interact with PP1 through a conserved trio of SLiMs, RVxF-ΦΦ-R, which is also present in the Phactrs.
 
 Previous structural analysis showed that the PPP1R15A/B, Neurabin/Spinophilin (PPP1R9A/B), and PNUTS (PPP1R10) PIPs share a trajectory across the PP1 surface that encompasses not only the RVxF-ΦΦ-R SLIMs, but also additional sequences C-terminal to the R SLIM (Chen et al, eLife, 2015). This trajectory is also shared by the Phactr1-PP1 complex (Fedoryshchak et al, eLife, 2020). Based on this structural alignment we proposed the existence of an additional hydrophobic “W” SLiM that interacts with the PP1 residues I133 and Y134 (See Fedoryshchak et al, 2020, Figure 1 figure supplement 2).
 
 Introduction, paragraph 2 is rewritten to make this clearer.
 
 The sequence and positions of W differ in amino acid type and position relative to the RVxF-øø-R string.
 
 The motif ‘W’ does not mandate tryptophan; it is our name for a common structurally aligned motif: although the Phactrs and PPP1R15A/B indeed have W at this position, Neurabin and spinophilin contain VDP, which nevertheless makes similar interactions. Similarly the “RVxF” motifs in Phactr1, Neurabin/Spinophilin, PPP1R15A/B and PNUTS are LIRF, KIKF, KV(R/T)F and TVTW respectively.
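 As a rough illustration of this point, the four RVxF-type sequences listed in the response (with KV(R/T)F expanded to KVRF and KVTF) all carry V or I at the second position and F or W at the fourth. The pattern below is only a sketch fitted to these quoted examples; the pattern itself and the helper name `matches_rvxf_like` are assumptions made for illustration, not a published RVxF consensus and not the authors' analysis.

```python
import re

# Illustrative pattern fitted only to the sequences quoted above:
# any residue, then V or I, then any residue, then F or W.
# This is an assumption for illustration, not a published consensus.
RVXF_LIKE = re.compile(r"^.[VI].[FW]$")

# RVxF-type sequences as listed in the response; KV(R/T)F is expanded
# into its two variants.
motifs = {
    "Phactr1": ["LIRF"],
    "Neurabin/Spinophilin": ["KIKF"],
    "PPP1R15A/B": ["KVRF", "KVTF"],
    "PNUTS": ["TVTW"],
}


def matches_rvxf_like(seq: str) -> bool:
    """Return True if a four-residue sequence fits the illustrative pattern."""
    return RVXF_LIKE.match(seq) is not None


for pip, seqs in motifs.items():
    for seq in seqs:
        print(pip, seq, matches_rvxf_like(seq))
```

 All five sequences fit the pattern even though only one of them ends in W and none reads literally "RVxF", which is the sense in which the motif names are labels for structurally aligned positions rather than fixed residues.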
 
 In the Discussion the authors state that the hydrophobic groove of PP1 is remodelled by Neurabin. However, details of this are not described or shown in the manuscript.
 
 The shared trajectory determined by the RVxF-øø-R-W string brings the sequences C-terminal to the W SLIM into the vicinity of the PP1 hydrophobic groove. In the Phactr1/PP1 holoenzyme this generates a novel pocket required for substrate recognition (Fedoryshchak et al, 2020). These observations raised the possibility that sequences C-terminal to the “W” motif in the other RVxF-øø-R-W PIPs also play a role in substrate recognition.
 
 Introduction paragraph 3 now cites a new Figure 1-S2, which shows how the hydrophobic groove is remodelled in the various different PIP/PP1 complexes. A revised Figure 1A now indicates the hydrophobic residues defining the hydrophobic groove by grey shading.
 
 (2) To add to the confidence of the structure, the authors should include a 2Fo-Fc simulated annealing omit map, perhaps showing the R and W interactions of the RVxF-øø-R-W string.
 
 This is now included as new Figure 6 Figure supplement 1. Note that in Neurabin, the W motif is VDP, where the valine and proline sidechains interact similarly to the tryptophan (see also new Figure 1-S2G,H).
 
 We also add a new supplementary Figure 6-S1 comparing our PBM-liganded Neurabin PDZ domain with the previously published unliganded structure (Ragusa et al 2010).
 
 (3) Page 16. The authors state that spinophilin remodels the PP1 hydrophobic groove differently from Phactrs. Arguably spinophilin does not remodel the PP1 hydrophobic groove at all. There are no contacts between spinophilin and the PP1 hydrophobic groove in the spinophilin-PP1 structure, correlating with the absence of 'W' in the RVxF-øø-R-W string in spinophilin.
 
 The VDP sequence corresponding to the W motif in spinophilin and neurabin makes analogous contacts to those made by the W in Phactr1 (see Fedoryshchak et al 2020).
 
 Remodelling is meant in the sense of altering the structure of the major groove by bringing new sequences into its vicinity rather than necessarily directly interacting with it. The spinophilin/PP1 and Phactr/PP1 hydrophobic grooves are compared in new Figure 1-S2 (see also Fedoryshchak et al 2020, Figure 2 figure supplement 1)
 
 (4) Page 8. For the cell-based/proteomics-dephosphorylation assay in Figure 2, it isn't clear why there were no dephosphorylation sites detected for the PPP1R15A/B-PP1 fusion (except PPP6R1 S531 for PPP1R15B). One might have expected a correlation with PP1 alone. Does this imply that PPP1R15A/B are inhibiting PP1 catalytic activity? Was the activity tested in vitro?
 
 The R15A/B data are compared to average abundance of all the phosphosites in the dataset, including those of PP1.
 
 We have not tested for a general inhibitory effect of R15A/B on PP1 activity. Many PIPs including R15A/B do occlude one or more of the PP1 substrate grooves and therefore generally act as inhibitors of PP1 activity against some potential substrates, while enhancing activities against others.
 
 Other points
 
 (5) Figure S1: Colour sequence similarities/identities.
 
 Done
 
 (6) Figures: Structure figures lacked labels:
 
 Figure 1A, label PP1, Phactrs etc.
 
 Done
 
 Figure 6, label PP1, Neurabin, previous Neurabin structure (Fig. 6C), hydrophobic groove, PDZ domain, etc.
 
 Done
 
 (7) Statistical analysis. p values should be shown for data in:
 
 Figure 5.
 
 To avoid cluttering the Figure, a new sheet, “statistical significance” has been added to Supplementary Table 3, summarizing the analysis.
 
 Figure 1.
 
 Figure amended (now figure 1-S1).
 
 (8) Some inconsistency with labels, eg '34-WT' used in Fig. 5C, whereas '34A-WT' (better) in Methods.
 
 Now changed to 34A etc where used.
 
 (9) Page 6. PPP1R9A/B is not shown in Figure 1A and Figure S1A.
 
 PPP1R9A/B are Neurabin and spinophilin - now clarified in Introduction paragraph 2, Results paragraph 1, Discussion paragraph 1.
 
 (10) Page 7: lines 4, 'site' not 'side'.
 
 Done
 
 (11) Page 9: DTL and CAMSAP3 were found to be dephosphorylated in the PP1-Neurabin/spinophilin screen. Are these PDZ-binding proteins?
 
 Neither DTL nor CAMSAP3 contain C-terminal hydrophobic residues characteristic of classical PBMs. Sentence added in Discussion, paragraph 5
 
 (12) Page 12 and Figure 5 and S5: The synthetic p4E-BP1 and IRSp53WT peptides with PBM should be given more specific names to indicate the presence of the PBM.
 
 We have renamed 4E-BP1WT and IRSp53WT to 4E-BP1PBM and IRSp53PBM respectively, emphasising the inclusion of the wildtype or mutated PBM from 4E-BP1 on these peptides.
 
 Text, Figure 5, and Figure S5 all revised accordingly.
 
 (13) Give PDB code for spinophilin-PP1 complex coordinates shown in Figure 6C.
 
 PDB codes for the various PIP/PP1 complexes now given in new Figure 1-S2 and revised Figure 6C.
 
 Reviewer #2 (Recommendations for the authors):
 
 The work undertaken by the authors is extensive and robust, however, I believe that some improvement in the writing and some detailed explanation of certain results sections would help with the presentation of the work and clarity for the readers.
 
 (1) The introduction should contain more information about the interaction between PP1 and Neurabin, given that this is the focus of the paper. This would give the reader the necessary background required to follow the paper.
 
 Introduction paragraph 2 revised to describe the different SLIMs in more detail. New Figure 1-S2 shows detail of the different remodelled hydrophobic grooves in the various PIP/PP1 complexes.
 
 (2) More information on PP1-IRSp53L460A has to be added before discussing results in S1B.
 
 Sentence explaining that IRSp53 L460 docks with the remodelled PP1 hydrophobic groove in the Phactr1/PP1 holoenzyme added in Results paragraph 2.
 
 (3) Page 6: "as expected, the +5 residue L460A mutation, which impairs dephosphorylation by the intact Phactr1/PP1 holoenzyme, impaired sensitivity to all the fusions, indicating that they recognise phosphorylated IRSp53 in a similar way (Figure S1B)". Statistics between IRSp53 and IRSp53L460A across PP1-PIPs need to be conducted before concluding the above. From the graph and the images, the impairment to dephosphorylation is not convincing.
 
 For each of the four PP1-Phactr fusions, the IRSp53 L460A peptide shows significantly less reactivity than the IRSp53WT peptide (p<0.05 for each fusion).
 
 Since the proteomics studies in Figure 2 show that the substrate specificity of the four PP1-Phactr1 fusions is virtually identical, we combined the data for the four different fusions. The IRSp53 L460A peptide shows significantly less reactivity than the IRSp53WT peptide in this analysis (p< 0.0001). This result is shown in revised Figure S1B and legend.
 
 (4) mCherry-4E-BP1(118+A), in which an additional C-terminal alanine should still allow TOS-mediated phosphorylation, but prevent PDZ interaction. Does 4E-BP1(118+A) actually prevent interaction between PP1-Neurabin? This interaction needs to be validated, especially since spinophilin was shown to bind to multiple regions of PP1.
 
 It is not clear what the referee is asking for here. The biochemical analysis in Figure 4C shows that the C-terminus of 4E-BP1 constitutes a classical PBM. The X-ray crystallography in Figure 6 confirms this, demonstrating H-bond interactions between the 4E-BP1 C-terminal carboxylate and main chain amides of L514, G515 and I516.
 
 We consider the possibility that the 4E-BP1(118+A) mutant inhibits the activity of PP1-Neurabin via a mechanism other than directly blocking the 4E-BP1/PDZ interaction to be unlikely for the following reasons:
 
 (1) Addition of a C-terminal alanine will disrupt the PBM interaction because the extra residue sterically blocks access to the PBM-binding groove. This is the most parsimonious explanation, and is based on our solid structural and biochemical evidence that the 4E-BP1 C-terminus is a classical PBM.
 
 (2) AlphaFold3 modelling predicts Neurabin PDZ / 4E-BP1 PBM interaction with high confidence (shown in Figure 6-S2E), but it does not predict any PDZ interaction with 4E-BP1(118+A). Note added in Figure 6-S2 legend.
 
 (3) Recognition of the 4E-BP1(118+A) mutation without loss of binding affinity would require that the mutant be capable of binding in a manner formally equivalent to recognition of an “internal” PDZ-binding peptide. Recognition of such “internal peptides” is dependent on their adopting a specifically constrained conformation, which typically requires reorganisation of the PDZ carboxylate-binding GLGF loop. Such “internal site” recognition typically involves more than one residue C-terminal to the conventional PDZ “0” position (see Penkert et al NSMB 2004, doi:10.1038/nsmb839; Gee et al JBC 1998, DOI: 10.1074/jbc.273.34.21980; Hillier et al 1999, Science PMID: 10221915).
 
 (5) It is nice to see that the various PP1-Phactr fusions have around 60% substrate overlap between them. Would it be possible to compare these results with previously published mass spec data of Phactr1XXX from the group? There is mention of some substrates being picked up, but a comparison much like in Figure 2E would be more informative about the extent to which the described method captures relevant information.
 
 This is difficult to do directly as the PP1-Phactr fusion data are from human cells while that in Fedoryshchak et al 2020 is from mouse.
 
 However, manual curation shows that of the 28 top hits seen in our previous analysis of Phactr1XXX in NIH3T3 cells, 18 were also detectable in the HEK293 system; of these, 13 were also detected as PP1-Phactr fusion hits. Data summarised in new Figure 2-S1C. Text amended in Results, “Proteomic analysis...”, paragraph 2.
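 To put the counts quoted above on a common scale, the overlap can be expressed as simple fractions. The short sketch below uses only the three numbers given in the response (28, 18 and 13); the variable and function names are hypothetical, and the calculation is offered as a reading aid rather than as part of the authors' analysis.

```python
# Counts taken directly from the response above.
top_hits_nih3t3 = 28     # top Phactr1XXX hits in the earlier NIH3T3 analysis
detectable_hek293 = 18   # of those, detectable in the HEK293 system
fusion_hits = 13         # of the detectable ones, also PP1-Phactr fusion hits


def fraction(part: int, whole: int) -> float:
    """Return part/whole as a plain fraction."""
    return part / whole


detectable_fraction = fraction(detectable_hek293, top_hits_nih3t3)
recovered_fraction = fraction(fusion_hits, detectable_hek293)

print(f"Detectable in HEK293: {detectable_fraction:.0%} of previous top hits")
print(f"Recovered as fusion hits: {recovered_fraction:.0%} of detectable hits")
```

 On these numbers, roughly 64% of the earlier top hits were detectable in the human system, and roughly 72% of those detectable sites were recovered as PP1-Phactr fusion hits.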
 
 (6) Figure 3D Why are the levels of pT70, pT37/46 and total protein in vector controls much lower as compared to 0nM Tet in PP1-Neurabin conditions? It is also weird that given total protein is so low, why are the pS65/101 levels high compared to the rest?
 
 We think it likely these phenomena reflect low-level expression of PP1-Neurabin in uninduced cells. Now noted in Figure 3D legend, basal PP1-Neurabin expression shown in new Figure 3-S1C. This alters the relative levels of the different species detected by the total 4E-BP1 antibody in favour of the faster migrating forms, which are less phosphorylated than the slower ones, and the total amount increases about 2-fold (Figure 3D, compare 0nM Tet lanes).
 
 The altered pS65/101-pT70 ratio is also likely to reflect the leaky PP1-Neurabin expression, since the relative intensities of the various phosphorylated species are dependent on both the relative rates of phosphorylation and dephosphorylation. Expression of a phosphatase would therefore be expected to differentially affect the phosphorylation levels of different sites according to their reactivity.
 
 (7) Figure 3E: Does inhibiting mTORC further reduce translation when PP1-Neurabin is expressed? If this is the case, this might suggest that they might not necessarily be mTORC inhibitors?
 
 We have not done this experiment. Since Rapamycin cannot be guaranteed to completely block 4E-BP1 phosphorylation, and PP1-Neurabin cannot be guaranteed to completely dephosphorylate 4E-BP1, any further reduction upon their combination would be hard to interpret.
 
 (8) Substrate interactions with the remodelled PP1 hydrophobic groove do not affect PP1-Neurabin specificity. Is there evidence that PP1-Neurabin remodels the hydrophobic groove? Is it not possible that Neurabin does not remodel the PP1 groove to begin with and hence there is no effect observed with the various mutants? If this is not the case, it should be explained in a bit more detail.
 
 Comparison of the Neurabin/PP1 and Phactr1/PP1 structures shows that the hydrophobic groove is remodelled differently in the two complexes. Now shown in new Figure 1-S2B,C,G.
 
 (9) Figure 5B has a lot of interesting information, which I believe has not been discussed at all in the results section.
 
 To help interpretation of the enzymology in Figure 5 we have renamed 4E-BP1WT and IRSp53WT to 4E-BP1PBM and IRSp53PBM respectively, emphasising the inclusion of the wildtype or mutated PBM from 4E-BP1 on these peptides. Text in Results, “PDZ domain interaction…”, paragraph 1, and Figures 5 and S5 revised accordingly.
 
 Why does the 4E-BP1Mut affect catalytic efficiency of PP1 alone when compared with WT, while no difference is observed with IRSp53WT and mutant?
 
 We do not understand the basis for the differential reactivity of 4E-BP1PBM and 4E-BP1MUT with PP1 alone; we suspect that it reflects the hydrophobicity change resulting from the MDI -> SGS substitution. However, this is unlikely to be biologically significant as PP1 is sequestered in PIP-PP1 complexes.
 
 Importantly, the two PP1 fusion proteins behave consistently in this assay – the presence of the intact PBM increases reactivity with PP1-Neurabin, but has no effect on dephosphorylation by PP1-Phactr1.
 
 Why does PP1 alone not have a difference between IRSp53WT and mutant, while PP1-Neurabin does have a difference?
 
 This is due to the presence of the PBM in IRSp53WT (now renamed IRSp53PBM), which increases affinity for PP1-Neurabin, but not PP1 alone. Likewise, PP1-Phactr1, which does not possess a PDZ domain, is also unaffected by the integrity of the PBM.
 
 (7) “Strikingly, alanine substitutions at +1 and +2 in 4E-BP1WT increased catalytic efficiency by both fusions, perhaps reflecting changes at the catalytic site itself (Figure 5E, Figure S5E)”. This could be expanded upon, because this suggests a mechanism that makes the substrate refractory to PDZ/hydrophobic groove remodelling?
 
 We favour the idea that this reflects a requirement to balance dephosphorylation rates between the multiple 4E-BP1 phosphorylation sites, especially if multiple rounds of dephosphorylation occur for each PBM-PDZ interaction. Additional sentences added in Discussion paragraph 7.
 
 (8) Typographical errors and minor comments:
 
 a) PIPs can target PP1 to specific subcellular locations, and control substrate specificity through autonomous substrate-binding domains, occupation or extension of the substrate grooves, or modification of PP1 surface electrostatics.
 
 b) Phosphophorylation side site abundances within triplicate samples from the same cell line were comparable between replicates (Figure 2B).
 
 c) While the alanine substitutions had little effect, conversion of +4 to +6 to the IRSp534E-BP1 sequence LLD increased catalytic efficiency some 20-fold (Figure 5C, Figure S5C).
 
 d) Figure 3E labels are not clear. The graph can be widened to make the labels of the conditions clearer.
 
 All corrected
 
 Reviewer #3 (Recommendations for the authors):
 
 This was a very well-written manuscript.
 
 However, I was looking for a summary mechanistic figure or cartoon to help me navigate the results.
 
 I noted a few typos in the text.
 
 New summary Figure 5-S2 added, cited in results, and discussed in Discussion paragraph 6,7.
 
 AuthorResponse

URL

biorxiv.org/content/10.1101/2024.09.23.614477v2
ecoevorxiv.org ecoevorxiv.org

An illusion of a macroecological law, abundance-occupancy relationships

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife assessment
  
  This study offers a useful discussion of the well-accepted abundance-occupancy relationship in macroecology. While using the large eBird dataset to revisit the theme is interesting, multiple unresolved confounding factors exist, leaving the results inadequate to overturn the repeatedly confirmed abundance-occupancy relationship.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This article presents an analysis that challenges established abundance-occupancy relationships (AORs) by utilizing the largest known bird observation database. The analysis yields contentious outcomes, raising the question of whether these findings could potentially refute AORs.
  
  Strengths:
  
  The study employed an extensive aggregation of datasets to date to scrutinize the abundance-occupancy relationships (AORs).
  
  Weaknesses:
  
  The authors should thoroughly address the correlation between checklist data and global range data, ensuring that the foundational assumptions and potential confounding factors are explicitly examined and articulated within the study's context.
  
  In the revision, the authors have refined their findings to birds and provided additional clarifications and discussion. However, the primary concerns raised by reviewers remain inadequately addressed. My main concern continues to be whether testing AOR at a global scale is meaningful given the numerous confounding factors involved. With the current data and analytical approach, these confounders appear inseparable. The study would be significantly strengthened if the authors identified specific conditions under which AORs are valid.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews
  
  Public Reviews:
  
  Reviewer #1 (Public Review):
  
  Summary:
  
  This article presents a meta-analysis that challenges established abundance-occupancy relationships (AORs) by utilizing the largest known bird observation database. The analysis yields contentious outcomes, raising the question of whether these findings could potentially refute AORs.
  
  We thank the Reviewer for their positive comments.
  
  Strengths:
  
  The study employed an extensive aggregation of datasets to date to scrutinize the abundance-occupancy relationships (AORs).
  
  We thank the Reviewer for their positive comments.
  
  Weaknesses:
  
  While the dataset employed in this research holds promise, a rigorous justification of the core assumptions underpinning the analytical framework is inadequate. The authors should thoroughly address the correlation between checklist data and global range data, ensuring that the foundational assumptions and potential confounding factors are explicitly examined and articulated within the study's context.
  
  We thank the Reviewer for these comments. We agree that the core assumptions that form the foundation of our methods need more justification and transparency. In our revised version, we have taken the following steps to achieve this:
  
  - Altered the title to be more explicit about the core assumptions, which now reads: “Local-scale relative abundance is decoupled from global range size”
  
  - We have added more details on why and how we treat global range size as a measure of ‘occupancy.’
  
  - We have added a section that discusses the limitations of using eBird relative abundance.
  
  Reviewer #2 (Public Review):
  
  Summary:
  
  The goal is to ask if common species when studied across their range tend to have larger ranges in total. To do this the authors examined a very large citizen science database which gives estimates of numbers, and correlated that with the total range size, available from Birdlife. The average correlation is positive but close to zero, and the distribution around zero is also narrow, leading to the conclusion that, even if applicable in some cases, there is no evidence for consistent trends in one or other direction.
  
  We thank the Reviewer for these comments.
  
  Strengths:
  
  The study raises a dormant question, with a large dataset.
  
  We thank the Reviewer for these comments. We intended to take a longstanding question and attempt to apply novel datasets that were not available mere decades ago. While we do not imply that we have ‘solved’ the question, we hope this work highlights the potential for further interrogation using these large datasets.
  
  Weaknesses:
  
  This study combines information from across the whole world, with many different habitats, taxa, and observations, which surely leads to a quite heterogeneous collection.
  
  We agree that there is a heterogeneous collection of data across many habitats, taxa, and observations. However, rather than as a weakness, we see this as a significant strength. Our work assumes we are averaging over this variability to assess a large-scale pattern in the relationship - something that was potentially a limitation of previous work, as these large datasets were often focused on particular contexts (e.g., much work focused solely on the UK), which we believe could limit some of the generalizability of the previous work. However, the reviewer makes a fair point in regard to the heterogeneity of data collection. We have now added some text in the discussion which is explicit about this - see the new section named “Potential limitations of current work and future work”: “Although our findings challenge some long-held assumptions about the consistency of the abundance-occupancy relationship, our work only deals with interspecific AORs among birds, synthesizing observations of potentially heterogeneous locations, context and quality.”
  
  First, scale. Many of the earlier analyses were within smaller areas, and for example, ranges are not obviously bounded by a physical barrier. I assume this study is only looking at breeding ranges; that should be stated, as 40% of all bird species migrate, and winter limitation of populations is important. Also are abundances only breeding abundances or are they measured through the year? Are alien distributions removed?
  
  Second, consider various reasons why abundance and range size may be correlated (sometimes positively and sometimes negatively) at large scales. Combining studies across such a large diversity of ecological situations seems to create many possibilities to miss interesting patterns. For example:
  
  (1) Islands are small and often show density release.
  
  See comment below.
  
  (2) North temperate regions have large ranges (Rapoport's rule) and higher population sizes than the tropics.
  
  See comment below.
  
  (3) Body size correlates with global range size (I am unsure if this has recently been tested but is present in older papers) and with density. For example, cosmopolitan species (barn owl, osprey, peregrine) are relatively large and relatively rare.
  
  See comment below.
  
  (4) In the consideration of alien species, it certainly looks to me as if the law is followed, with pigeon, starling, and sparrow both common and widely distributed. I guess one needs to make some sort of statement about anthropogenic influences, given the dramatic changes in both populations and environments over the past 50 years.
  
  See comment below. We also added a sentence in the methods that highlights that we did not remove alien ranges and provides the reasons why. Still, we do acknowledge the dramatic changes in populations and environments over the past 50 years (see the new section “Potential limitations of current work and future work”).
  
  (5) Wing shape correlates with ecological niche and range size (e.g. White, American Naturalist). Aerial foraging species with pointed wings are likely to be easily detected, and several have large ranges reflecting dispersal (e.g. barn swallow).
  
  We agree that all of the points above are interesting data explorations. As said above, our main purpose was to highlight the potential for further interrogation using these large datasets. However, we have added some additional text in the discussion that explicitly mentions/encourages these additional data explorations. We hope people will pick up on the potential for these data and explore them further.
  
  Third, biases. I am not conversant with eBird methodology, but the number appearing on checklists seems a very poor estimate of local abundance. As noted in the paper, common species may be underestimated in their abundance. Flocking species must generate large numbers, skulking species few. The survey is often likely to be in areas favorable to some species and not others. The alternative approach in the paper comes from an earlier study, based on eBird but then creating densities within grids, and surely comes with similar issues.
  
  We agree that if we were interested in the absolute abundance of a given species, the local number on an eBird checklist would be a poor representation. However, our study aims not to estimate absolute abundance but to examine relative abundance among species on each checklist. By focusing on relative abundance, we leverage eBird data's strengths in detecting the presence and frequency of species across diverse locations and times, thereby capturing community composition trends that can provide meaningful insights despite individual checklist biases. This approach allows us to assess the comparative prominence of species in the community as reported by the observer, providing a consistent metric of relative abundance. Despite detectability biases, the structure of eBird checklists reflects the observer’s encounter rates with each species under similar conditions, offering a valuable snapshot of relative species composition across sites and times. The key to our assumption is that these biases discussed are not directional and, therefore, random throughout the sampling process, which would translate to no ‘real’ bias in our effect size of interest.
  
  Range biases are also present. Notably, tropical mountain-occupying species have range sizes overestimated because holes in the range are not generally accounted for (Ocampo-Peñuela et al., Nature Communications). These species are often quite rare, too.
  
  We thank the reviewer for pointing to this issue and reference. We included a discussion on these biases in our limitations section and reference Ocampo-Peñuela et al. to emphasize the need for improved spatial resolution in range data for more accurate AOR assessments: “More precise range-size estimates would also improve the accuracy of AOR assessments, since species range data are often overestimated due to the failure to capture gaps in actual distributions.”
  
  Fourth, random error. Random error in eBird assessments is likely to be large, with differences among observers, seasons, days, and weather (e.g. Callaghan et al. 2021, PNAS). Range sizes also come with many errors, which is why occupancy is usually seen as the more appropriate measure.
  
  If we consider both range and abundance measurements to be subject to random error in any one species list, then the removal of all these errors will surely increase the correlation for that list (the covariance shouldn't change but the variances will decrease). I think (but am not sure) that this will affect the mean correlation because more of the positive correlations appear 'real' given the overall mean is positive. It will definitely affect the variance of the correlations; the low variance is one of the main points in the paper. A high variance would point to the operation of multiple mechanisms, some perhaps producing negative correlations (Blackburn et al. 2006).
  
  We agree random errors can affect estimates, but as we wrote above, random errors, regardless of magnitudes, would not bias estimates. After accounting for sampling error (a part of random errors), little variance is left to be explained as we have shown in the MS. This suggests that many of the random errors were part of the sampling errors. And this is where meta-analysis really shines.
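  The reviewer's argument about random error in both abundance and range size can be illustrated with the classical attenuation relation for correlations. The following is only a minimal sketch, assuming independent, additive Gaussian measurement errors on standardised variables; the function name `attenuation_demo` and all parameter values are hypothetical and are not taken from the manuscript or its code.

```python
import numpy as np

def attenuation_demo(n_species=20000, true_r=0.3, noise_sd=1.0, seed=0):
    """Compare the correlation of error-free and error-laden measurements.

    All parameter values are illustrative assumptions, not estimates from the study.
    """
    rng = np.random.default_rng(seed)
    cov = [[1.0, true_r], [true_r, 1.0]]
    # Error-free (standardised) abundance and range size for each species.
    true_abund, true_range = rng.multivariate_normal([0.0, 0.0], cov, size=n_species).T
    # Add independent random measurement error to both variables.
    obs_abund = true_abund + rng.normal(0.0, noise_sd, n_species)
    obs_range = true_range + rng.normal(0.0, noise_sd, n_species)
    r_clean = np.corrcoef(true_abund, true_range)[0, 1]
    r_noisy = np.corrcoef(obs_abund, obs_range)[0, 1]
    # Classical attenuation: r_obs = r_true * sqrt(rel_x * rel_y),
    # where reliability = true variance / (true variance + error variance).
    reliability = 1.0 / (1.0 + noise_sd ** 2)
    r_expected = true_r * np.sqrt(reliability * reliability)
    return r_clean, r_noisy, r_expected

r_clean, r_noisy, r_expected = attenuation_demo()
print(f"error-free r = {r_clean:.3f}, noisy r = {r_noisy:.3f}, expected = {r_expected:.3f}")
```

  In this toy setting the covariance is unchanged by the added error while both variances grow, so the observed correlation shrinks toward zero. Whether the same mechanism operates in the eBird data depends on assumptions the sketch does not test.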
  
  On P.80 it is stated: "Specifically, we can quantify how AOR will change in relation to increases in species richness and sampling duration, both of which are predicted to reduce the magnitude of AORs" I haven't checked the references that make this statement, but intuitively the opposite is expected? More species and longer durations should both increase the accuracy of the estimate, so removing them introduces more error? Perhaps dividing by an uncertain estimate introduces more error anyway. At any rate, the authors should explain the quoted statement in this paper.
  
  It would be of considerable interest to look at the extreme negative and extreme positive correlations: do they make any biological sense?
  
  Extremely high correlations would not make any biological sense if these observations were based on large sample sizes. However, as shown in Figure 2, all extreme correlations come from small sample sizes (i.e., low precision), as sampling theory expects (in fact, our Fig 2 is a textbook example of the funnel shape). Therefore, we do not need to invoke any biological explanations here.
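  The funnel shape mentioned in this response follows from the sampling variance of a correlation coefficient, which for Fisher's z is approximately 1/(n - 3). The sketch below simulates it under simple assumptions: bivariate normal data, a true correlation of zero, and hypothetical checklist sizes of 5 and 100 species. The function name and settings are illustrative and do not come from the authors' repository.

```python
import numpy as np

def simulate_correlations(n_species, n_checklists=2000, true_r=0.0, seed=1):
    """Pearson correlations from many simulated checklists of a fixed size."""
    rng = np.random.default_rng(seed)
    cov = [[1.0, true_r], [true_r, 1.0]]
    rs = np.empty(n_checklists)
    for i in range(n_checklists):
        # One simulated checklist: abundance and range size for n_species species.
        x, y = rng.multivariate_normal([0.0, 0.0], cov, size=n_species).T
        rs[i] = np.corrcoef(x, y)[0, 1]
    return rs

small = simulate_correlations(n_species=5)    # short checklists, low precision
large = simulate_correlations(n_species=100)  # long checklists, high precision

print("SD of r, 5 species:  ", round(float(np.std(small)), 3))
print("SD of r, 100 species:", round(float(np.std(large)), 3))
# Fisher's z has approximate sampling SD 1 / sqrt(n - 3).
print("SD of z, 100 species:", round(float(np.std(np.arctanh(large))), 3),
      "vs theory", round(1 / np.sqrt(100 - 3), 3))
```

  Plotting each simulated correlation against its checklist size would give the wide-to-narrow funnel: extreme values appear only among the small checklists, even though the true correlation is the same everywhere.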
  
  Discussion:
  
  I can see how publication bias can affect meta-analyses (addressed in the Gaston et al. 2006 paper) but less easily see how confirmation bias can. It seems to me that some of the points made above must explain the difference between this study and Blackburn et al. 2006's strong result.
  
  We agree. We have now extended our explanation of why confirmation bias could result in positive AORs. We also point out that confirmation bias is a very common phenomenon, for which we cited relevant references in the original MS. The only way to avoid confirmation bias is to conduct a study blind, but this is often not possible in ecological work.
  
  “Meta-research on behavioural ecology identified 79 studies on nestmate recognition, 23 of which were conducted blind. Non-blind studies confirmed a hypothesis of no aggression towards nestmates nearly three times more often. It is possible that confirmation bias was at play in earlier AOR studies.”
  
  Certainly, AOR really does seem to be present in at least some cases (e.g. British breeding birds) and a discussion of individual cases would be valuable. Previous studies have also noted that there are at least some negative and some non-significant associations, and understanding the underlying causes is of great interest (e.g. Kotiaho et al. Biology Letters).
  
  We agree. And yes, we pointed these out in our introduction.
  
  Reviewer #3 (Public Review):
  
  Summary:
  
  This paper claims to overturn the longstanding abundance occupancy relationship.
  
  Strengths:
  
  (1) The above would be important if true.
  
  (2) The dataset is large.
  
  We have clarified this point by changing the title to emphasize that we do not suggest overturning AORs entirely but instead provide a refined view of the relationship at a global scale. Our results suggest a weaker and more context-dependent AOR than previously documented. We hope our revised title and additional clarifications in the text convey our intent to contribute to a more nuanced understanding rather than a whole overturning of the AOR framework.
  
  Weaknesses:
  
  (1) The authors are not really measuring the abundance-occupancy relationship (AOR). They are measuring abundance-range size. The AOR typically measures patches in a metapopulation, i.e. at a local scale. Range size is not an interchangeable notion with local occupancy.
  
  We have refined this in our revision to be more explicitly focused on global range size. However, we note that the classic paper by Bock and Ricklefs (1983, Am Nat) also refers to global (species’ entire) range size in the context of the AOR. Importantly, Bock and Ricklefs pointed out the importance of using species’ entire ranges; otherwise, sampling artifacts will create positive AORs when arbitrary geographical ranges are used, as they were in some studies of AORs. So we highlight that our work is well in line with the previous work, allowing us to question the longstanding macroecological work. One of the issues with AORs has been how to define occupancy; global range size provides a relatively unambiguous measure, which is why we used it.
  
  (2) eBird is a poor dataset for this. The sampling unit is non-standard. So abundance can at best be estimated by controlling for sampling effort. Comparisons across space are also likely to be highly heterogeneous. They also threw out checklists in which abundances were too high to be estimated (reported as "X"). As evidence of the biases in using eBird for this pattern, the North American Breeding Bird Survey, which has a very similar taxonomic and geographic scope but a consistent sampling protocol across space, does show clear support for the AOR.
  
  Yes, we agree the sampling unit is non-standard. However, this is a significant strength in that it samples across much heterogeneity (as discussed in response to Reviewer 2, above). We were interested in relative abundance and not direct absolute abundance per se, which is accurate, especially since we did control for sampling effort.
  
  We appreciate the reviewer’s attention to our data selection criteria. We excluded checklists containing ‘X’ entries to minimize biases in our abundance estimates. The 'X' notation is often used for the most common species, reflecting the observer's identification of presence without specifying a count. This approach was chosen to avoid disproportionately inflating presence data for these abundant species, which could distort the relative abundance calculations in our analysis. By excluding such checklists, we aimed to retain consistency and ensure that local abundance estimates were representative across all species on each checklist. We have revised our manuscript to clarify this methodological choice and hope this explanation addresses the reviewer’s concern. We modified our text in the methods to make the entries ‘X’ clearer (see the Method section).
  
  (3) In general, I wonder if a pattern demonstrated in thousands of data sets can be overturned by findings in one data set. It may be a big dataset but any biases in the dataset are repeated across all of those observations.
  
  Overturning a major conclusion requires careful work. This paper did not rise to this level.
  
  We appreciate the reviewer’s caution regarding broad conclusions based on a single dataset, even one as large as eBird. Our intention was not to definitively overturn the abundance-occupancy relationship (AOR) but to re-evaluate it with the most extensive and globally representative dataset currently available. We recognise that potential biases in citizen science data, such as observer variation, may influence our findings, and we have taken steps to address these in our methodology and limitations sections. We see this work as a contribution to an ongoing discourse, suggesting that AOR may be less universally consistent than previously believed, particularly when tested with large-scale citizen science data. We hope this study will encourage additional research that tests AORs using other expansive datasets and approaches, further refining our understanding of this classic macroecological relationship. However, we have retained our broad message about instigating a credible revolution and re-examining ecological laws.
  
  Recommendations for the authors:
  
  Reviewer #1 (Recommendations For The Authors):
  
  (1) The investigation focuses solely on interspecific relationships among birds; thus, the extrapolation of these conclusions to broader ecological contexts requires further validation.
  
  We have now added this point to our new section: “Although our findings challenge some long-held assumptions about the consistency of the abundance-occupancy relationship, our work only deals with interspecific AORs among birds, so we hope this work serves as a foundation for further investigations that utilize such comprehensive datasets.”
  
  (2) The rationale for combining data from eBird - a platform predominantly representing individual observations from urban North America - with the more globally comprehensive BirdLife International database needs to be substantiated. The potential underrepresentation of global abundance in the eBird checklist data could introduce a sampling bias, undermining the foundational premises of AORs.
  
  We agree with the limitation of eBird sampling coverage, but it should not bias our results. In statistical definitions, bias is directional; if it is not directional, it becomes statistical noise, which makes the signal harder to detect. In fact, our meta-analyses adjust for what statisticians call sampling bias, and this is a strength of meta-analysis.
  
  (3) In the full mixed-effect model, checklist duration and sampling variance (inversely proportional to sample size N) are treated as fixed effects. However, these variables are likely to be negatively correlated, which could introduce multicollinearity, inflating standard errors and diminishing the statistical significance of other factors, such as the intercept. This calls into question the interpretation of insignificance in the results.
  
  Multicollinearity is an issue with sample sizes. For example, with small datasets, correlations of 0.5 could be an issue, and such an issue would usually show up as a large SE. We do not have such an issue with ~ 17 million data points. Please refer to this paper.
  
  Freckleton, Robert P. "Dealing with collinearity in behavioural and ecological data: model averaging and the problems of measurement error." Behavioral Ecology and Sociobiology 65 (2011): 91-101.
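  The point that collinearity inflates standard errors mainly in small samples can be sketched with ordinary least squares. This is only an illustration under stated assumptions: two predictors correlated at 0.5, a plain linear model instead of the authors' mixed-effect meta-analytic model, and hypothetical sample sizes of 50 and 1,000,000. The helper `slope_se` is made up for this example.

```python
import numpy as np

def slope_se(n, predictor_r=0.5, seed=2):
    """Standard error of the first slope in an OLS fit with two correlated predictors."""
    rng = np.random.default_rng(seed)
    cov = [[1.0, predictor_r], [predictor_r, 1.0]]
    X = rng.multivariate_normal([0.0, 0.0], cov, size=n)
    # Illustrative response with small true effects and unit residual variance.
    y = 0.1 * X[:, 0] + 0.1 * X[:, 1] + rng.normal(0.0, 1.0, n)
    design = np.column_stack([np.ones(n), X])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ beta
    sigma2 = resid @ resid / (n - design.shape[1])
    # Var(beta) = sigma^2 (X'X)^{-1}; collinearity enters through (X'X)^{-1}.
    cov_beta = sigma2 * np.linalg.inv(design.T @ design)
    return float(np.sqrt(cov_beta[1, 1]))

se_small = slope_se(n=50)
se_large = slope_se(n=1_000_000)
# Variance inflation factor for a predictor correlation of 0.5.
vif = 1.0 / (1.0 - 0.5 ** 2)
print(f"SE at n=50: {se_small:.3f}, SE at n=1e6: {se_large:.5f}, VIF = {vif:.2f}")
```

  The variance inflation factor is the same at both sample sizes, but the standard error itself scales roughly as the square root of VIF/n. The same correlation between predictors therefore costs far less precision in a very large dataset.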
  
  (4) The observed low heterogeneity may stem from discrepancies in sampling for abundance versus occupancy, compounded by uncertainties in reporting behavior.
  
  If we assume everybody underreports common species or overreports rare species, this could happen. However, such an assumption is unlikely. If some people report accurately (but not others), we should see high heterogeneity, which we do not observe. We have touched upon this point in our original MS.
  
  (5) The contribution and implementation of phylogenetic comparative analysis remain ambiguous and were not sufficiently clarified within the study.
  
  We have added more explanation for the global abundance analysis:

  “To statistically test whether there was an effect of abundance and occupancy at the macro-scale, we used phylogenetic comparative analysis. This analysis also addresses the issue of positive interspecific AORs potentially arising from not accounting for phylogenetic relatedness among species examined.”
  
  (6) The use of large N checklists could skew the perceived rarity or commonality of species, potentially diminishing the positive correlation observed in AORs. A consistent observer effect could lead to a near-zero effect with high precision.
  
  Regardless of the number of species (N) in checklists (seen in Fig 2), correlations are distributed around zero. This means there is nothing special about large N checklists.
  
  (7) The study should acknowledge and discuss any discrepancies or deviations from previous literature or expected outcomes.
  
  We felt we had already done this as we discussed the previous meta-analysis and what we expected from this meta-analysis. Nevertheless, we have added some relevant sentences in the new version of MS.
  
  In addition to these major points, there are several minor concerns:
  
  (1) Figure 2B lacks discussion, and the metric for the number of observations is not clarified. Furthermore, the labeling of the y-axis appears to be incorrect.
  
  Thank you very much for pointing out this shortcoming. Now, the y-axis label has been fixed and we mention 2B in the main text.
  
  (2) The study should provide a clear, mathematical expression of the multilevel random effect models for greater transparency.
  
  Many thanks for this point, and now we have added relevant mathematical expressions in Table S6.
  
  (3) On Line 260, the term "number of species" should be refined to "number of species in a checklist," ideally represented by a formula for precision.
  
  This ambiguity has been mended as suggested.
  
  Please provide the data and R code linked to the outputs.
  
  The referee must have missed the link (https://github.com/itchyshin/AORs) in our original MS. In addition to our GitHub repository link, we now have added a link to our Zenodo repository (https://doi.org/10.5281/zenodo.14019900).
  
  Reviewer #3 (Recommendations For The Authors):
  
  The authors cite Rabinowitz's 7 forms of rarity paper as a suggestion that previous findings also break the AOR. In fact empirical studies of the 7 forms of rarity typically find that all three forms of rareness vs commonness are heavily correlated (e.g. Yu & Dobson 2000).
  
  We thank the reviewer for drawing attention to Yu & Dobson (2000) and similar studies that find positive correlations among the axes of rarity. Reviewer 3 is correct in that Rabinowitz’s (1981) framework does not require that local abundance and geographic range size be uncorrelated for every species; instead, it highlights conceptual scenarios where a species may be common locally yet have a restricted distribution (or vice versa).
  
  Empirical analyses such as Yu & Dobson (2000) show that, on average, these axes can be correlated, which may align with conventional AOR findings in some taxonomic groups. However, Rabinowitz’s key insight was that exceptions do occur, and these exceptions demonstrate that strong positive AORs may not be universally applicable. Our results do not claim that Rabinowitz’s framework “breaks” the AOR outright; instead, we use it to underscore that local abundance can, in principle, be “decoupled” from global occupancy. Whether the correlation found by Yu & Dobson (2000) implies a positive AOR requires a detailed simulation study, which is an interesting avenue for future research.
  
  Thus, citing Rabinowitz serves to highlight the potential heterogeneity and complexity of abundance–occupancy relationships rather than to refute every positive correlation reported in the literature. Our findings suggest that when examined at large spatiotemporal scales (with unbiased sampling), the overall AOR signal may be less robust than traditionally believed. This is consistent with Rabinowitz’s view that local abundance and global range can vary along independent axes. We have now added:
  
  “Although studies using her framework found positive correlations between species range and local abundance.”
  
  AuthorResponse
Tags

Summary

Review 1

AuthorResponse

Annotators

Public_Reviews

URL

ecoevorxiv.org/repository/view/6190/
arxiv.org arxiv.org

Collective epithelial migration mediated by the unbinding of hexatic defects

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important theoretical study shows that active hexatic topological defects in epithelia play a crucial role in enabling collective cell flows. While the use of coarse-grained hydrodynamic models to describe cell-scale behavior has limitations, the study provides solid evidence supporting its claims. These findings will interest both biophysicists studying collective cell behaviors and biologists investigating epithelial flows during development.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This paper investigates the physical mechanisms underlying cell intercalation, which then enables collective cell flows in confluent epithelia. The authors show that T1 transitions (the topological transitions responsible for cell intercalation) correspond to the unbinding of groups of hexatic topological defects. Defect unbinding, and hence cell intercalation and collective cell flows, are possible when active stresses in the tissue are extensile. This result helps to rationalize the observation that many epithelial cell layers have been found to exhibit extensile active nematic behavior.
  
  Strengths:
  
  The authors obtain their results based on a combination of active hexanematic hydrodynamics and a multiphase field (MPF) model for epithelial layers, whose connection is a strength of the paper. With the hydrodynamic approach, the authors find the active flow fields produced around hexatic topological defects, which can drive defect unbinding. Using the MPF simulations, the authors show that T1 transitions tend to localize close to hexatic topological defects.
  
  Weaknesses:
  
  Citations are sometimes not comprehensive. Cases of contractile behavior found in collective cell flows, which would seemingly contradict some of the authors' conclusions, are not discussed.
  
  I encourage the authors to address the comments and questions below.
  
  (1) In Equation 1, what do the authors mean by the cluster's size \ell? How is this quantity defined? The calculations in the Methods suggest that \ell indicates the distance between the p-atic defects and the center of the T1 cell cluster, but this is not clearly defined.
  
  (2) The multiphase field model was developed and reviewed already, before the Loewe et al. 2020 paper that the authors cite. Earlier papers include Camley et al. PNAS 2014, Palmieri et al. Sci. Rep. 2015, Mueller et al. PRL 2019, and Peyret et al. Biophys. J. 2019, as reviewed in Alert and Trepat. Annu. Rev. Condens. Matter Phys. 2020.
  
  (3) At what time lag is the mean-squared displacement in Figure 3f calculated? How does the choice of a lag time affect these data and the resulting conclusions?
  
  (4) The authors argue that their results provide an explanation for the extensile behavior of cell layers. However, there are also examples of contractile behavior, such as in Duclos et al., Nat. Phys., 2017 and in Pérez-González et al., Nat. Phys., 2019. In both cases, collective cell flows were observed, which in principle require cell intercalations. How would these observations be rationalized with the theory proposed in this paper? Can these experiments and the theory be reconciled?
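  Question (3) above, on the lag time behind the mean-squared displacement, can be made concrete with a short sketch. It assumes cell-centre trajectories stored as an array of shape (frames, cells, 2) and uses a synthetic 2D random walk in place of the multiphase field output. The function names, step size and lag values are hypothetical and are not taken from the paper.

```python
import numpy as np

def msd_at_lag(positions, lag):
    """Mean-squared displacement at one lag, averaged over cells and start times.

    positions: array of shape (n_frames, n_cells, 2) holding cell-centre coordinates.
    """
    if lag < 1 or lag >= positions.shape[0]:
        raise ValueError("lag must satisfy 1 <= lag < n_frames")
    disp = positions[lag:] - positions[:-lag]
    return float(np.mean(np.sum(disp ** 2, axis=-1)))

def random_walk(n_frames=2000, n_cells=200, step_sd=0.1, seed=3):
    """Synthetic trajectories: independent Gaussian steps in x and y per frame."""
    rng = np.random.default_rng(seed)
    steps = rng.normal(0.0, step_sd, size=(n_frames, n_cells, 2))
    return np.cumsum(steps, axis=0)

traj = random_walk()
msd_10 = msd_at_lag(traj, 10)
msd_100 = msd_at_lag(traj, 100)
# For purely diffusive motion in 2D, MSD(lag) = 4 * D * lag, so MSD / (4 * lag)
# gives an effective diffusivity that does not depend on the chosen lag.
d_eff_10 = msd_10 / (4 * 10)
d_eff_100 = msd_100 / (4 * 100)
print(f"MSD(10) = {msd_10:.3f}, MSD(100) = {msd_100:.3f}")
print(f"D_eff from lag 10 = {d_eff_10:.4f}, from lag 100 = {d_eff_100:.4f}")
```

  For diffusive motion the raw MSD grows with the lag while the effective diffusivity stays constant, which is why the lag (or the use of a diffusivity) needs to be stated when MSD is plotted against another variable. For non-diffusive motion the two lags would give different effective diffusivities.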
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This paper studies the role of hexatic defects in the collective migration of epithelia. The authors emphasize that epithelial migration is driven by cell intercalation events and not just isolated T1 events, and analyze this through the lens of hexatic topological defects. Finally, the authors study the effect of active and passive forces on the dynamics of hexatic defects using analytical results, and numerical results in both continuum and phase-field models.
  
  The results are very interesting and highlight new ways of studying epithelial cell migration through the analysis of the binding and unbinding of hexatic defects.
  
  Strengths:
  
  (1) The authors convincingly argue that intercalation events are responsible for collective cell migration, and that these events are accompanied by the formation and unbinding of hexatic topological defects.
  
  (2) The authors clearly explain the dynamics of hexatic defects during T1 transitions, and demonstrate the importance of active and passive forces during cell migration.
  
  (3) The paper thoroughly studies the T1 transition through the viewpoint of hexatic defects. A continuum model approach to study T1 transitions in cell layers is novel and can lead to valuable new insights.
  
  Weaknesses:
  
  (1) The authors could expand on the dynamics of existing hexatic defects during epithelial cell migration, in addition to how they are created during T1 transitions.
  
  (2) The different terms in the MPF model used to study cell layer dynamics are not fully justified. In particular, it is not clear why the model includes self-propulsion and rotational diffusion in addition to nematic and hexatic stresses, and how these quantities are related to each other.
  
  (3) The authors could provide some physical intuition on what an active extensile or contractile term in the hexatic order parameter means, and how this is related to extensility and contractility in active nematics and/or for cell layers.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  In this manuscript, the authors discuss epithelial tissue fluidity from a theoretical perspective. They focus on the description of topological transitions whereby cells change neighbors (T1 transitions). They explain how such transitions can be described by following the fate of hexatic defects. They first focus on a single T1 transition and the surrounding cells using a hydrodynamic model of active hexatics. They show that successful T1 intercalations, which promote tissue fluidity, require a sufficiently large extensile hexatic activity in the neighborhood of the cells attempting a T1 transition. If such activity is contractile or not sufficiently extensile, the T1 is reversed, hexatic defects annihilate, and the epithelial network configuration is unchanged. They then describe a large epithelium, using a phase field model to describe cells. They show a correlation between T1 events and hexatic defects unbinding, and identify two populations of T1 cells: one performing T1 cycles (failed T1), and not contributing to tissue migration, and one performing T1 intercalation (successful T1) and leading to the collective cell migration.
  
  Strengths:
  
  The manuscript is scientifically sound, and the variety of numerical and analytical tools they use is impressive. The approach and results are very interesting and highlight the relevance of hexatic order parameters and their defects in describing tissue dynamics.
  
  Weaknesses:
  
  (1) Goal and message of the paper.
  
  a) In my opinion, the article is mainly theoretical and should be presented as such. For instance, their conclusions and the consequences of their analysis in terms of biology are not extremely convincing, although they would be sufficient for a theory paper oriented to physicists or biophysicists. The choice of journal and potential readership should be considered, and I am wondering whether the paper structure should be re-organized, in order to have side-by-side the methods and the results, for instance (see also below).
  
  b) Currently, the two main results sections are somewhat disconnected, because they use different numerical models, and because the second section only marginally uses the results from the first section to identify/distinguish T1 (see also below).
  
  (2) Quite surprisingly, the authors use a cell-based model to describe the macroscopic tissue-scale behavior, and a hydrodynamic model to describe the cell-based events. In particular, their hydrodynamic description (the active hexatic model) is supposed to be a coarse-grained description, valid to capture the mesoscopic physics, and yet, they use it to describe cell-scale events (T1 transitions). For instance, what is the meaning of the velocity field they are discussing in Figure 2? This makes me question the validity of the results of their first part.
  
  (3) The quality of the numerical results presented in the second part (phase field model) could be improved.
  
  a) In terms of analysis of the defects. It seems that they have all the tools to compare their cell-resolved simulations and their predictions about how a T1 event translates into defects unbinding. However, their analysis in Figure 3e is relatively minimal: it shows a correlation between T1 cells and defects. But it says nothing about the structure and evolution of the defects, which, according to their first section, should be quite precise. I believe it should be possible to identify and quantify more precisely the unbinding or annihilation of the defects and hence to characterize more precisely the T1 events.
  
  b) In terms of clarity of the presentation. For instance, in Figure 3f, they plot the mean-square displacement as a function of a defect density. I thought that MSD was a time-dependent quantity: they must therefore consider MSD at a given time, or averaged over time (in that case, what they are showing is rather an effective diffusivity). They should, in any case, be explicit about what their definition of this quantity is.
  
  c) In terms of statistics. For instance, Figure 3g is used to study the role of rotational diffusion on the average time between T1s. The error bars in this figure are huge and make their claims hardly supported. It is, for instance, hard to believe that the dynamics of T1 cycles are unaffected by D_r. In the limit where D_r vanishes, for instance, there should be no T1 and the period of a T1 cycle should diverge, which is not observed. Their claim of a "monotonic decay" of the average time between intercalations is also not fully supported given their statistics.
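To make the point in (3b) concrete: a mean-square displacement is defined per lag time, so plotting it against defect density requires either fixing the lag or converting it into an effective diffusivity. The Python sketch below is only an illustration. It assumes 2D cell-centre trajectories sampled at unit time steps, and it uses the relation MSD = 4 D t, which holds for normal diffusion in two dimensions. The function names are hypothetical and nothing here is taken from the manuscript.

```python
def msd(trajectories, lag):
    """Mean-square displacement at one lag, averaged over cells and time origins.

    trajectories: list of per-cell lists of (x, y) positions at unit time steps.
    lag: positive integer number of time steps.
    """
    if lag < 1:
        raise ValueError("lag must be a positive integer")
    total, count = 0.0, 0
    for traj in trajectories:
        for t in range(len(traj) - lag):
            dx = traj[t + lag][0] - traj[t][0]
            dy = traj[t + lag][1] - traj[t][1]
            total += dx * dx + dy * dy
            count += 1
    if count == 0:
        raise ValueError("lag is too long for the supplied trajectories")
    return total / count


def effective_diffusivity(trajectories, lag):
    """Assumes normal 2D diffusion, where MSD(lag) = 4 * D * lag."""
    return msd(trajectories, lag) / (4.0 * lag)
```

Reporting either `msd(..., lag)` at a stated lag or `effective_diffusivity(...)` would remove the ambiguity raised above.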
  
  Review 3
Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

arxiv.org/abs/2307.12956
osf.io osf.io

Effects of experiencing the COVID-19 pandemic on optimistically biased belief updating

4
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This important study addresses the question of how large-scale events such as the COVID-19 pandemic can change people's beliefs and their updates. Using a well-validated task, the authors find that belief updating becomes less optimistically biased during COVID-19 compared to before it. In this revision, due to the addition of more model-based analyses and power calculations, they have generated convincing evidence for their primary claim that the pandemic significantly impacted people's belief updating away from optimistic belief updating. As with many manipulations outside the experimenters' control, it remains unclear which psychological factor impacted by the pandemic drives the group differences, and sample sizes are, by necessity, on the smaller side as data cannot readily be acquired. However, the authors are commended for doing power analyses, showing their sensitivity, and recognizing the limitations of their study.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This manuscript uses a well-validated behavioural estimation task to investigate the degree to which optimistic belief updating was attenuated during the 2020 global pandemic. Online participants estimated how likely different negative life events were to happen to them in the future and were given statistics about these events. Belief updating (measured as the degree to which estimations changed after viewing the statistics) was less optimistically biased during the pandemic (compared to outside of it). This resulted from reduced updating from "good news" (better than expected information). Computational models were used to try to unpack how statistics were integrated and used to revise beliefs. Two families of models were compared - an RL set of models where "estimation errors" (analogous to prediction errors in classic RL models) predict belief change and a Bayesian set of models where an implied likelihood ratio was calculated (derived from participants' estimations of their own risk and estimation of the base rate risk) and used to predict belief change. The authors found evidence that the former set of models accounted for updating better outside of the pandemic, but the latter accounted for updating during the pandemic. In addition, the RL model provides evidence that learning was asymmetrically positively biased outside of the pandemic but symmetric during it (as a result of reduced learning rates from good news estimation errors).
 
 Strengths
 
 Understanding whether biases in learning are fixed modes of information processing or flexible and adapt in response to environmental shocks (like a global pandemic or economic recession) is an important area of research relevant to a wide range of fields, including cognitive psychology, behavioural economics, and computational psychiatry. The study uses a well-validated task, and the authors conduct a power analysis to show that the sample sizes are appropriate. Furthermore, the authors test that their results hold in both a between-group analysis (the focus of the main paper) and a within-group analysis (mainly in the supplemental).
 
 The finding that optimistic biases are reduced in response to acute stress, perceived threat, and depression has been shown before using this task both in the lab (social stress manipulation), in the real world (firefighters on duty), and clinical groups (patients with depression). However, the work does extend these findings here in important ways:
 
 (1) Examining the effect of a new real-world adverse event (the pandemic). (2) The reduction in optimistic updating here arises due to reduced updating from positive information (previously, in the case of environmental threat, this reduction mainly arose from increased sensitivity to negative information). (3) Leveraging new RL-inspired computational approaches, demonstrating that the bias - and its attenuation - can be captured using trial-by-trial computational modelling with separate learning rates for positive and negative estimation errors.
 
 The authors now take great care to caveat that the findings cannot directly attribute the observed lack of optimistically biased belief updating during lockdown to psychological causes such as heightened anxiety and stress.
 
 The authors have added model recovery results. Whilst there are some cases within a family (RL or Bayesian) of models where they can be confused (e.g., Bayesian model 10-the winning model during the pandemic-sometimes gets confused with Bayesian model 9), there is no confusion between families of models (RL models don't get confused with Bayesian models and vice versa), which is reassuring.
 
 Weaknesses
 
 The authors now conduct model recovery (SI Figure 5) and show how the behaviour of the two best-fitting models (Rational Bayesian model and optimistically biased RL-like model) approximates the actual data observed by showing them alongside each other (Figure 1b). It seems from Figure 1b that the 2 models predict similar behaviour for bad news but diverge for good news, with the optimistically biased RL-like model predicting greater updates than the rational Bayesian model. However, it is difficult to tell from the figure (partly because of the y-axis scale) how much of a divergence this is and how distinctive a pattern relative to the other models. I think the interpretation could be improved further by a clearer sense of the behavioural signatures of each model, enabling them to be reliably teased apart from one another in the model recovery.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 The authors investigated how experiencing the COVID-19 pandemic affected optimism bias in updating beliefs about the future. They ran a between-subjects design testing participants on cognitive tasks before, during and after the lift of the sanitary state of emergency during the pandemic. The authors show that optimism bias varied depending on the context in which it was tested. Namely, it disappeared during COVID-19 and it re-emerged at the time of lift of sanitary emergency measures. Via advanced computational modelling they are able to thoroughly characterise the nature of such alterations, pinpointing specific mechanisms underlying the lack of optimistic bias during the pandemic.
 
 Strengths pertain to the comprehensive assessment of the results via computational modelling, and from a theoretical point of view, the notion that environmental factors can affect cognition. Power analysis was conducted to ensure that the study was powered to observe the effect of interest despite the relatively small sample size.
 
 As the authors also noted, a major impediment to interpreting the findings pertains to the lack of additional measures. While information on, for example, risk perception or need for social interaction was collected from participants during the pandemic, the fact that these measures could not be included in the analysis hindered the interpretation of findings. While the interpretation of the findings remains challenging, this work offers an example of the influence of real-life conditions on the belief-updating process.
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Reviewer #1:
 
 Summary:
 
 This manuscript uses a well-validated behavioral estimation task to investigate how optimistic belief updating was attenuated during the 2020 global pandemic. Online participants recruited during and outside of the pandemic estimated how likely different negative life events were to happen to them in the future and were given statistics about these events happening. Belief updating (measured as the degree to which estimations changed after viewing the statistics) was less optimistically biased during the pandemic (compared to outside of it). This resulted from reduced updating from "good news" (better than expected information). Computational models were used to try to unpack how statistics were integrated and used to revise beliefs. Two families of models were compared - an RL set of models where "estimation errors" (analogous to prediction errors in classic RL models) predict belief change and a Bayesian set of models where an implied likelihood ratio was calculated (derived from participants' estimations of their own risk and estimation of the base rate risk) and used to predict belief change. The authors found evidence that the former set of models accounted for updating better outside of the pandemic, but the latter accounted for updating during the pandemic. In addition, the RL model provides evidence that learning was asymmetrically positively biased outside of the pandemic but symmetric during it (as a result of reduced learning rates from good news estimation errors).
 
 Strengths:
 
 Understanding whether biases in learning are fixed modes of information processing or flexible and adapt in response to environmental shocks (like a global pandemic or economic recession) is an important area of research relevant to a wide range of fields, including cognitive psychology, behavioral economics, and computational psychiatry. The study uses a well-validated task, and the authors conduct a power analysis to show that the sample sizes are appropriate. Furthermore, the authors test that their results hold in both a between-group analysis (the focus of the main paper) and a within-group analysis (mainly in the supplemental).
 
 The finding that optimistic biases are reduced in response to acute stress, perceived threat, and depression has been shown before using this task both in the lab (social stress manipulation), in the real world (firefighters on duty), and clinical groups (patients with depression). However, the work does extend these findings here in important ways:
 
 (1) Examining the effect of a new real-world adverse event (the pandemic). (2) The reduction in optimistic updating here arises due to reduced updating from positive information (previously, in the case of environmental threat, this reduction mainly arose from increased sensitivity to negative information). (3) Leveraging new RL-inspired computational approaches, demonstrating that the bias - and its attenuation - can be captured using trial-by-trial computational modeling with separate learning rates for positive and negative estimation errors.
 
 Weaknesses:
 
 Some interpretation and analysis (the computational modeling in particular) could be improved.
 
 On the interpretation side, while the pandemic was an adverse experience and stressful for many people (including myself), the absence of any measures of stress/threat levels limits the conclusions one can draw. Past work that has used this task to examine belief updating in response to adverse environmental events took physiological (e.g., SCR, cortisol) and/or self-report (questionnaires) measures of mood. In SI Table 1, the authors possibly had some questionnaire measures along these lines, but this might be for the participants tested during the pandemic.
 
 Thank you for this review.
 
 We agree that the lack of physiological and self-report measures of stress, threat, and perceived uncertainty limits the interpretation of findings regarding potential psychological factors. Some self-reported anxiety and perceived risk measures experienced during the lockdowns were collected in a subset of participants (n=40, counting n=21 tested before and during the 1st strict lockdown, and n=19 tested solely during the 1st lockdown). These reports were given retrospectively at the time of release of the 1st lockdown in summer 2020 when the pandemic was still unfolding (SI Table 1).
 
 Exploratory correlations revealed some noteworthy trends. We found that participants who reported to have perceived a bigger risk of death due to contagion were also those who were less optimistically biased when updating their beliefs about adverse future life risks during the first strict COVID-19-related lockdown (Pearson’s r = -0.36, p = 0.02).
 
 Moreover, parameter estimates from the computational models of belief updating showed associations with specific survey responses: The rational Bayesian model’s scaling parameter correlated positively with adherence to distancing measures (r = 0.41, p = 0.01) and negatively with the need for social contact (r = -0.37, p = 0.02). This result indicated that participants who were updating their beliefs faster were more likely to follow preventive guidelines and felt less social craving. Meanwhile, the asymmetry parameter correlated negatively with mask wearing (r = -0.41, p = 0.01), positively with physical contact with close others (r = 0.32, p = 0.04) and satisfaction with social interactions (r = 0.33, p = 0.04). This suggests that participants who displayed some asymmetry in belief updating during the COVID-19 pandemic were less likely to comply with mask-wearing rules and more likely to engage in social interactions.
 
 However, these results did not survive correction for multiple comparisons, and the sample size for the correlational analyses is in the lower range. The subjective measures of anxiety and fear of contagion did not significantly correlate with the updating bias or with any other variable measured by the belief updating task (e.g., estimation error, updating magnitude).
 
 We now further discuss on page 12 the limitation, which reads:
 
 “We did not collect physiological measures of stress or information about the COVID-19 infection status of participants, which precludes a direct exploration of the immediate effects of experiencing the infection on belief-updating behavior and the potential interaction with anxiety and stress levels. Although subjective ratings of the perceived risk of death from COVID-19 correlated negatively to the beliefs updating bias measured during the pandemic, this result was obtained retrospectively in a subset of participants (SI section 4). We thus cannot directly attribute the observed lack of optimistically biased belief updating during the lockdown to psychological causes such as heightened anxiety and stress. This limitation is noteworthy, as the impact of experiencing the pandemic on belief updating about the future could differ between those who directly experienced infection and those who remained uninfected. It is also important to acknowledge that our study was timely and geographically limited to the context of the COVID-19 outbreak in France. Cultural variations and differences in governmental responses to contain the spread of SARS-CoV-2 may have impacted the optimism biases in belief updating differently.”
 
 On the analysis side, it was unclear what the motivation was for the different sets of models tested. Both families of models test asymmetric vs symmetric learning (which is the main question here) and have similar parameters (scaling and asymmetry parameters) to quantify these different aspects of the learning process. Conceptually, the different behavioral patterns one could expect from the two families of models needed to be clarified.
 
 Thank you for raising this point. We agree that a clearer conceptual distinction between the two model families can help strengthen the interpretation of our findings. We have added the following considerations to the introduction on pages 2–3, which now reads:
 
 “The underlying mechanism of optimistically biased belief updating involves an asymmetry in learning from positive and negative belief-disconfirming information[2,3,4], which can unfold in two ways following Reinforcement learning (RL) or Bayes rule[5].
 
 Conceptually, Reinforcement learning (RL) and Bayesian models of belief updating are complementary but make different assumptions about the hidden process humans may use to adjust their beliefs when faced with information that contradicts them. The RL models assume belief updating is proportional to the estimation error. The key idea of the estimation error expresses the difference between how much someone believes they will experience a future life event and the actual prevalence of the event in the general population. This difference can be positive or negative. A scaling and an asymmetry parameter quantify the propensity to consider the estimation error magnitude and its valence, respectively. These two free parameters form the learning rate, which indicates how fast and biased participants update their beliefs.
 
 In contrast, Bayesian models assume that following Bayes’ rule the posterior, updated belief is a new hypothesis, formed by pondering prior knowledge with new evidence. The prior knowledge consists in information about the prevalence of life events in the general population. The new evidence comprises various alternative hypotheses. It examines how likely a specific event is to occur or not occur for oneself, compared to the likelihood that it will happen or not happen to others. This probabilistic adjustment of beliefs about future life events can be considered as an approximation of a participant’s confidence in the future. The two free parameters of the Bayesian belief updating model scale how much the initial belief deviates from the updated, posterior belief (i.e., scaling parameter) and the propensity to consider the valence of this deviance (i.e., asymmetry parameter).
 
 Although RL-like and Bayesian updating models make different assumptions about the updating strategy, they are complementary and powerful formalizations of human reasoning. Both models provide insight into hidden, latent variables of the updating process. Most notably, the learning rate and its components, the scaling and asymmetry parameters, which can vary between individuals and contexts and, through this variance, offer possible explanations for the idiosyncrasy in belief-updating behavior and its cognitive biases.”
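 As an illustration of the RL-like account described in the quoted passage, the Python sketch below assumes one common parameterization in which the learning rate is the scaling parameter plus or minus the asymmetry parameter, depending on the valence of the estimation error. The exact functional form used in the manuscript is not given in this response, so the function name, the parameter names, and the sign convention for the estimation error are all assumptions.

```python
def rl_update(first_estimate, base_rate, scaling, asymmetry):
    """Predicted belief update (percentage points) under an assumed RL-like rule.

    The estimation error is taken as first_estimate - base_rate, so a positive
    error means the base rate is lower than expected (good news for an
    adverse event). This sign convention is an assumption.
    """
    estimation_error = first_estimate - base_rate
    good_news = estimation_error > 0
    # Assumed form: learning rate = scaling + asymmetry for good news,
    # and scaling - asymmetry for bad news.
    learning_rate = scaling + asymmetry if good_news else scaling - asymmetry
    return learning_rate * abs(estimation_error)
```

 Under this assumed form, an asymmetry of zero yields equal updates for good and bad news of the same magnitude, and a positive asymmetry yields larger updates for good news, which is the optimistic pattern discussed above.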
 
 Do the "winning" models produce the main behavioral patterns in Figure 1, and are they in some way uniquely able to do so, for instance? How would updating look different for an optimistic RL learner versus an optimistic Bayesian RL learner?
 
 We now show that the winning models can reproduce the main behavioral patterns (revised Figure 1b).
 
 Moreover, we plotted estimated and observed average belief updating for each participant (n=123) using the overall best-fitting asymmetrical RL-like updating model shown in SI Figure 6.
 
 Would the asymmetry parameter in the former be correlated with the asymmetry parameter in the latter? Moreover, crucially, would one be able to reliably distinguish the models from one another under the model estimation and selection criteria that the authors have used here (presenting robust model recovery could help to show this)?
 
 The asymmetry parameter estimated with the optimistically biased RL- and Bayesian models did correlate (r = 0.735; p < 0.001).
 
 However, we argue that while the observed updating behavior and estimated free parameters are similar for RL-like and Bayesian learners, the underlying assumed cognitive processes differed and are identifiable. To test this assumption, we have added a model recovery analysis now reported in the supplement section 2c and main manuscript’s methods section pages 24–25.
 
 As shown in the confusion matrix in SI Figure 5, there is evidence for strong recovery of nearly all models, and importantly of the two winning models: the optimistically biased RL-like model and the rational Bayesian model of belief updating. This analysis thus rules out that the two model families were confused and mitigates concerns about the validity of the model selection.
 
 Note, one exception was observed. The RL-like and Bayesian updating models that assumed no scaling and asymmetry were best recovered by their respective models that estimated the asymmetry parameter. Many factors could explain this. For example, it could be that the models, which assumed asymmetry, but no scaling, may have captured some bias in updating due to noise generated by the zero parameter models.
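 The logic of a model recovery analysis can be sketched in a few lines of Python. This is a toy illustration and not the procedure reported in the supplement: it uses two hypothetical learners (symmetric versus asymmetric learning rates), least-squares fitting, and BIC as a swapped-in selection criterion, whereas the response reports estimated frequencies and exceedance probabilities. All parameter values are invented for the example.

```python
import math
import random


def fit_slope(xs, ys):
    """Least-squares slope through the origin; returns (slope, residual sum of squares)."""
    sxx = sum(x * x for x in xs)
    slope = sum(x * y for x, y in zip(xs, ys)) / sxx
    rss = sum((y - slope * x) ** 2 for x, y in zip(xs, ys))
    return slope, rss


def bic(rss, n_obs, n_params):
    """BIC for a Gaussian error model, up to an additive constant."""
    return n_obs * math.log(rss / n_obs) + n_params * math.log(n_obs)


def simulate(model, rng, n_trials=40, noise_sd=5.0):
    """Simulate (good_news, |estimation error|, update) trials from a toy learner."""
    trials = []
    for i in range(n_trials):
        good_news = i % 2 == 0
        abs_error = rng.uniform(5.0, 40.0)
        if model == "symmetric":
            rate = 0.45
        else:
            rate = 0.6 if good_news else 0.3
        update = rate * abs_error + rng.gauss(0.0, noise_sd)
        trials.append((good_news, abs_error, update))
    return trials


def best_model(trials):
    """Fit both candidate models and return the name of the one with the lower BIC."""
    n = len(trials)
    _, rss_sym = fit_slope([t[1] for t in trials], [t[2] for t in trials])
    rss_asym = 0.0
    for flag in (True, False):
        subset = [t for t in trials if t[0] == flag]
        _, rss = fit_slope([t[1] for t in subset], [t[2] for t in subset])
        rss_asym += rss
    scores = {"symmetric": bic(rss_sym, n, 1), "asymmetric": bic(rss_asym, n, 2)}
    return min(scores, key=scores.get)


def confusion_matrix(n_sims=200, seed=0):
    """Rows: generating model. Columns: proportion of simulations won by each fitted model."""
    rng = random.Random(seed)
    models = ("symmetric", "asymmetric")
    counts = {g: {f: 0 for f in models} for g in models}
    for generating in models:
        for _ in range(n_sims):
            counts[generating][best_model(simulate(generating, rng))] += 1
    return {g: {f: counts[g][f] / n_sims for f in models} for g in models}
```

 Large diagonal entries in the returned matrix indicate that each generating model is usually identified as the best-fitting one, which is the pattern a recovery analysis looks for.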
 
 A justification is also needed to focus on the "RL-like updating model with an asymmetry and scaling learning rate component" in Figure 3. As I understand it, this model fits best outside of the pandemic, but another model - the Rational Bayesian Model - does worse (and does the best during the pandemic). What model best combines the groups (outside and inside the pandemic)?
 
 We thank the reviewer for highlighting the need to justify our focus on the biased RL-like updating model in Figure 3. The model chosen for parameter comparison was selected based on a model comparison procedure conducted across all 12 models, including data from all participants (both those tested outside and during the pandemic, n=123). This model comparison revealed that Model 1 — the RL model with both asymmetry and scaling learning rate parameters estimated — provided the best fit across the entire dataset (Ef = 0.40, pxp = 0.99). As such, we focused on this model for parameter comparisons in Figure 3 to ensure consistency with the model comparison results and to interpret the parameters in the context of the overall best-fitting model. We added this information on top of the model parameter comparison results on page 8. Moreover, SI Figure 6 in the supplements shows how this model reproduces the observed belief updating in each of the 123 participants.
 
 Why do the authors use absolute belief updating (|UPD|) in the first linear mixed effects model (equation iv)? Since an update is calculated differently depending on whether information calls for an update in an upward or downward direction, I do not understand the need to do this (and it means that updates that go in the wrong direction - away from the information - are counted as positive)
 
 Thank you for drawing our attention to this point. The ‘absolute belief updating’ note was incorrect, and we apologize for the confusion. To be precise, we did not use absolute updating values in our analyses. Belief updating was assumed on each trial to go either toward the base rate (e.g., Update = E2 – E1) for negative estimation errors or away from it for positive estimation errors (e.g., Update = E1 – E2). Updates that went in the wrong direction, further away from the base rate, were thus counted and included in the analysis with their negative sign. We have corrected this important point in equation iv of the methods section on page 19.
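 The sign convention can be made explicit with a short Python sketch that applies the two formulas quoted in this response. The definition of the estimation error as first estimate minus base rate is an assumption, as are the function and argument names.

```python
def signed_update(first_estimate, second_estimate, base_rate):
    """Signed belief update using the two formulas quoted in the response.

    Positive values mean the second estimate moved in the direction indicated
    by the base rate; negative values mean it moved the wrong way.
    """
    # Assumed convention: estimation error = first estimate - base rate.
    estimation_error = first_estimate - base_rate
    if estimation_error > 0:
        # Base rate is below the first estimate: Update = E1 - E2.
        return first_estimate - second_estimate
    # Base rate is at or above the first estimate: Update = E2 - E1.
    return second_estimate - first_estimate
```

 For example, with a first estimate of 50% and a base rate of 30%, a second estimate of 40% gives an update of +10, while a second estimate of 60% gives -10 and therefore enters the analysis with its negative sign.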
 
 Figure 4: The task schema does not show a confidence rating for base rates.
 
 Thank you for catching this. We have now added the confidence ratings for base rates to the task in Figure 4b in the revised version of the manuscript. We have furthermore corrected a typo in Figure 4a: The sample size for group 3, tested in May 2021, now indicates 31.
 
 The authors report that base rates are uniformly distributed - this is quite different to other instances of the task where base rates are normally distributed (ideally around the midpoint of the scale). Why this deviation in the design?
 
 We used life events and base rates like those used in past studies of belief updating (Garrett and Sharot 2017, Sharot et al. 2011, Garrett et al. 2017, Korn et al. 2017), which were normal to uniformly distributed (W = 0.952, p = 0.088, Shapiro-Wilk test). The base rates ranged between 10% and 70%, with a mean of 40%. Participants rated their estimates between 3% and 77%, which ensured that for most likely (base rate = 70%) and most unlikely events (base rate = 10%) there was the same space (7%) to update beliefs toward the base rates. Moreover, all statistical models included the absolute estimation errors as a control for variance potentially explained by different estimation error magnitude[42,43]. We added this extra base rate information to the methods section’s task description on page 16.
 
 The task comprises only negative life events, which arguably hinders the generalizability of the results. The authors could mention this as a limitation (there has been a significant quantity of debate about this point in relation to this task: see the work from Ulrike Hahn's lab).
 
 We have added a paragraph to the discussion on page 13 to provide a rationale for using only adverse events. This paragraph now reads:
 
 “In this study we tested how actual adverse experiences affect the updating of negative future outlooks in healthy participants and in analogy to studies conducted in depressed patients[19,20,24] following the cognitive model of depression[37]. One open question is whether findings were specific to the adverse event framing[38,39,40]. We argue that under normal, non-adverse contexts belief updating should also be optimistically biased for positive life events, as shown by previous research[41,42]. However, how context such as experiencing a challenging or favorable situation influence the updating of beliefs about positive and negative outlooks remains an open question.”
 
 It would be useful to show the parameter recovery for all parameters (not just the learning rates) and the correlation between parameters (both in simulations and in the fitted parameters).
 
 We apologize for being unclear on this part. The models included two free parameters that were the components of the learning rates: The scaling and the asymmetry parameter. We now have added parameter recovery analyses for the scaling and asymmetry components of the learning rates for (1) the Bayesian model of belief updating during the pandemic, and (2) the RL-like model of belief updating outside the pandemic to the supplement (SI section 2b, SI Figure 4).
 
 Reviewer #2:
 
 The authors investigated how experiencing the COVID-19 pandemic affected optimism bias in updating beliefs about the future. They ran a between-subjects design testing participants on cognitive tasks before, during, and after the lifting of the sanitary state of emergency during the pandemic. The authors show that optimism bias varied depending on the context in which it was tested. Namely, it disappeared during COVID-19 and re-emerged at the time of lift of sanitary emergency measures. Through advanced computational modeling, they are able to thoroughly characterize the nature of such alterations, pinpointing specific mechanisms underlying the lack of optimistic bias during the pandemic.
 
 Strengths pertain to the comprehensive assessment of the results via computational modeling and from a theoretical point of view to the notion that environmental factors can affect cognition. However, the relatively small sample size for each group is a limitation.
 
 Thank you for this review.
 
 We acknowledge that sample sizes in each group are on the lower side, especially when breaking down the participant sample into four sub-samples tested in the different contexts. To mitigate concerns, we checked the power of the observed context by valence interaction on belief updating. To this aim, we simulated new belief updates using the parameters from the best-fitting optimistic RL-like model of observed belief updating outside the pandemic, and the rational Bayesian model of observed belief updating during the pandemic. At each iteration, we performed a linear mixed effects model analysis of the simulated belief updates[44], analogous to equation iv in the main text. The frequency across 1000 iterations with which the LMEs detected a significant interaction of valence by context on simulated belief updating was 75%. This frequency indicates the power of the valence by context interaction on observed belief updating. In other words, the probability of a false negative (a type II error, i.e., failing to reject the null hypothesis when the effect is present) was 25%. We have added these extra analyses to the main manuscript’s results section on page 4 and methods section on page 20.
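 The simulation-based power estimate described in this response can be illustrated with a simplified Python sketch. It is not the reported analysis: in place of the linear mixed effects model, it uses a two-sample z test (normal approximation) on a per-participant updating bias, defined here as the good-news update minus the bad-news update. The group sizes, means, and standard deviation are hypothetical values chosen only for the example.

```python
import math
import random
import statistics


def simulate_bias(rng, n, mean_bias, sd):
    """Per-participant updating bias (good-news minus bad-news update)."""
    return [rng.gauss(mean_bias, sd) for _ in range(n)]


def interaction_detected(outside, during, z_crit=1.96):
    """Two-sample z test on the group difference in bias (stand-in for the LME)."""
    se = math.sqrt(statistics.variance(outside) / len(outside)
                   + statistics.variance(during) / len(during))
    z = (statistics.mean(outside) - statistics.mean(during)) / se
    return abs(z) > z_crit


def simulated_power(bias_outside, bias_during, sd=8.0,
                    n_outside=60, n_during=60, n_iter=1000, seed=1):
    """Fraction of simulated datasets in which the group difference is significant."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_iter):
        outside = simulate_bias(rng, n_outside, bias_outside, sd)
        during = simulate_bias(rng, n_during, bias_during, sd)
        if interaction_detected(outside, during):
            hits += 1
    return hits / n_iter
```

 The returned fraction plays the role of the 75% figure reported above: one minus this value is the estimated type II error rate under the simulated effect size.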
 
 A major impediment to interpreting the findings is the need for additional measures. While information on, for example, risk perception or the need for social interaction was collected from participants during the pandemic, the fact that these measures could not be included in the analysis hinders the interpretation of findings, which is now generally based on data collected during the pandemic, for example, reporting increased stress. While the authors suggest an interpretation in terms of uncertainty of real-life conditions, it is currently difficult to know if that factor drove the effect. Many concurrent elements might have accounted for the findings. This limits understanding of the underlying mechanisms related to changes in optimism bias.
 
 We agree with the reviewer on the limitation arising from the lack of physiological and self-report measures of stress, threat, and perceived uncertainty. To address this point and a similar point raised by reviewer 1 we have added a section to the supplement (SI section 4) that now reports explorative correlations between questionnaire responses of subjective perceptions of risk and anxiety, behavior (e.g. mask wearing, social distancing) and belief updating measured during the 1st strict lockdown.
 
 We now also further discuss this limitation on page 12 of the main text’s discussion.
 
 I recommend that the authors spend more time on explaining the belief-updating task in the presentation of the experiment.
 
 Thank you for this advice. We now provide a clearer and more detailed description of the belief-updating task in the main manuscript’s methods section and have updated Figure 4b to display the confidence rating event in the task schema.
 
 The task description now reads:
 
 “As illustrated in Figure 4b, each of the 40 trials began with presenting an adverse life event. Participants estimated their own risk and the risk of someone else their age and gender. Then the base rate of the event occurring in the general population was displayed on the computer screen. Participants rated their confidence in the accuracy of the presented base rate. Finally, they re-estimated their risk for experiencing the event now informed by the base rate.”
 
 The experimental task seems to include a self-other dimension, which is completely disregarded in the analysis. It would be interesting to explore whether the effect of diluted optimism bias during the pandemic is specific to information about self vs. other.
 
 We appreciate the reviewer's observation regarding the self-versus-other dimension in the belief updating task design. As now shown in SI Figure 2, the participants indeed displayed an optimism bias: they estimated that adverse events are more likely to happen to others than to themselves (ß = 3.02, SE = 0.86, t (232) = 3.53, p = 5.09e-04, 95% CI [1.33 – 4.71]; SI Figure 2; SI Table 18). This effect was observed across all participants. The pandemic context had no significant effect (ß = -1.91, SE = 3.00, t (232) = -0.64, p = 0.52, 95% CI [-7.82 – 4.00]; SI Table 18). Moreover, following previous studies of optimistically biased belief updating, we tested the effect of estimation errors (EE) calculated as the difference between the estimate for someone else (eBR) and the base rate (BR): EE = eBR – BR[4,5,25,26]. When categorizing trials as good news or bad news based on this alternative EE calculation, the context-by-EE valence interaction remained significant (SI Table 6).
 
 We conclude from these additional analyses that experiencing the pandemic specifically influenced belief updating but did not affect optimism biases in initial beliefs about the future.
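The alternative estimation-error calculation described above, EE = eBR – BR, can be sketched as follows. The formula is taken from the response; the sign convention used to label trials (for adverse events, a base rate below the estimate counts as good news) and the function names are assumptions made for illustration, not details confirmed by the source.

```python
def estimation_error(estimate_other, base_rate):
    """EE = eBR - BR, with both values given as percentages."""
    return estimate_other - base_rate


def classify_trial(estimate_other, base_rate):
    """Label a trial by the sign of EE.

    Assumed convention for adverse events: if the base rate is lower than
    the estimate (EE > 0), the feedback is good news; if it is higher
    (EE < 0), the feedback is bad news.
    """
    ee = estimation_error(estimate_other, base_rate)
    if ee > 0:
        return "good"
    if ee < 0:
        return "bad"
    return "neutral"


if __name__ == "__main__":
    # Hypothetical trials: (estimate for someone else, base rate)
    for e_br, br in [(40, 25), (10, 30), (20, 20)]:
        print(e_br, br, classify_trial(e_br, br))
```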
 
 Please provide an English translation of the instructions for the task.
 
 We now provide an English translation of the task instructions in the Supplement section 5.
 
 AuthorResponse

Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

osf.io/preprints/psyarxiv/6mauz_v2
www.researchsquare.com www.researchsquare.com

The role of GABA in semantic memory and its neuroplasticity

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  Jung et al. present valuable work on the relationship between gamma-aminobutyric acid (GABA) levels within the anterior temporal lobes (ATL) and semantic memory while accounting for inter-individual differences. They provide solid evidence suggesting that inhibitory continuous theta burst transcranial magnetic stimulation (cTBS TMS) increased GABA concentration and decreased the blood-oxygen-level-dependent (BOLD) signal during a semantic task. The results will be of interest to researchers studying the neurobiology of semantic cognition.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  This study presents valuable findings on the GABA and BOLD changes induced by continuous theta burst stimulation (cTBS) and on the relationships between ATL GABA level and performance in a semantic task. However, I'm afraid that the current results are incomplete to support some primary claims of the paper, for example, the purported inverted-U-shaped relationship between GABA levels in the ATL and semantic task performance. The influence of practice effects also complicates the interpretation of the results. Additional concerns include potential double dipping in the analysis depicted in Figure 3A and the use of inconsistent behavioral measures (IE and accuracy) across various analyses.
  
  The authors have made two beneficial revisions in this round: (1) acknowledging the insufficient data points supporting the inverted U-shaped curve; (2) attempting to control for practice effects. However, I believe unresolved issues remain:
  
  (1) The authors have not addressed my specific concern about Figure 4D - the analysis attempts to fit an inverted U-shaped curve to the data without distinguishing between data points influenced by practice effects and those unaffected, rendering its reliability questionable.
  
  (2) The authors appear to have misunderstood my question regarding Figure 3A. This issue is unrelated to practice effects. My point was that even if we randomly generated pre- and post-test data points and grouped/analyzed them according to the authors' methodology, we would still likely reproduce the pattern in Figure 3A due to the double dipping problem. Thus, this statistical analysis and its conclusions currently lack methodological validity.
  
  (3) Regarding the inconsistency in behavioral measures, the authors' explanation fails to remove my concerns. If the authors argue that accuracy is the most appropriate behavioral dependent variable for this study, why did they employ inverse efficiency in some of their analyses? My understanding is that a study should either consistently use the single most suitable measure or report multiple measures while providing adequate discussion of inconsistent results.
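The double-dipping concern in point (2) can be illustrated with a small simulation. This is a generic sketch of circular analysis, not a reconstruction of the authors' Figure 3A procedure, which is not described here: it assumes that pre- and post-test values are independent random numbers with no true effect and that groups are defined by a median split on the pre-test value. All names and parameters are hypothetical.

```python
import random
from statistics import fmean, median


def simulate_changes(n=1000, seed=1):
    """Return the mean post-minus-pre change in the low-pre and high-pre groups.

    Pre and post values are independent draws, so there is no real effect.
    Grouping by the pre-test value and then analysing the change that
    includes that same value still produces opposite-signed group changes
    (regression to the mean).
    """
    rng = random.Random(seed)
    pre = [rng.gauss(0.0, 1.0) for _ in range(n)]
    post = [rng.gauss(0.0, 1.0) for _ in range(n)]
    cut = median(pre)
    low = [b - a for a, b in zip(pre, post) if a < cut]
    high = [b - a for a, b in zip(pre, post) if a >= cut]
    return fmean(low), fmean(high)


if __name__ == "__main__":
    low_change, high_change = simulate_changes()
    print(f"low-baseline group change:  {low_change:+.2f}")
    print(f"high-baseline group change: {high_change:+.2f}")
```

Because the grouping variable is part of the change score, the low-baseline group appears to increase and the high-baseline group appears to decrease even though the data contain no effect.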
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  As a result of a number of rounds of reviews and consultations between reviewers, Jung et al. present important work on the relationship between gamma-aminobutyric acid (GABA) levels within the anterior temporal lobes (ATL) and semantic memory while accounting for inter-individual differences. They provide solid evidence suggesting that inhibitory continuous theta burst transcranial magnetic stimulation (cTBS TMS) increased GABA concentration and decreased the blood-oxygen-level-dependent (BOLD) signal during a semantic task.
  
  The authors fully addressed my comments from the first and second rounds of reviews, and I do not have additional concerns. I have, however, scaled down my short assessment, given the concerns of reviewers 1 and 2.
  
  Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

researchsquare.com/article/rs-3026480/v3
www.biorxiv.org www.biorxiv.org

Neocortical Layer-5 tLTD Relies on Non-Ionotropic Presynaptic NMDA Receptor Signaling

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife assessment
  
  Using an elegant and thorough experimental design, Thomazeau et al show that, in the developing mouse visual cortex, presynaptic NMDA receptors at layer 5 neocortical synapses mediate spike-timing dependent LTD via JNK2, non-ionotropic signaling. These fundamental findings shed light on how NMDA receptors can tune synaptic function without acting as coincidence detectors. The experiments are supported by compelling evidence, gathered through mouse transgenics and quadruple patch clamp recordings from cortical slices.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The results offer compelling evidence that L5-L5 tLTD depends on presynaptic NMDARs, a concept that has previously been somewhat controversial.
  
  It documents the novel finding that presynaptic NMDARs facilitate tLTD through their metabotropic signaling mechanism.
  
  Strengths:
  
  The experimental design is clever and clean.
  
  The approach of comparing the results in cell pairs where NMDA is deleted either presynaptically or postsynaptically is technically insightful and yields decisive data.
  
  The MK801 experiments are also compelling.
  
  Weaknesses:
  
  No major weaknesses were noted by this reviewer.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The study characterized the dependence of spike-timing-dependent long-term depression (tLTD) on presynaptic NMDA receptors and the intracellular cascade after NMDAR activation possibly involved in the observed decrease in glutamate release probability at L5-L5 synapses of the visual cortex in mouse brain slices.
  
  Strengths:
  
  The genetic and electrophysiological experiments are thorough. The experiments are well-reported and mainly support the conclusions. This study confirms and extends current knowledge by elucidating additional plasticity mechanisms at cortical synapses, complementing existing literature.
  
  Weaknesses:
  
  While one of the main conclusions (preNMDARs mediating presynaptic LTD) is resolved by a very convincing genetic approach, the second main conclusion of the manuscript (non-ionotropic preNMDARs) relies on the use of a high concentration of extracellular blockers (MK801, 2 mM; 7-chlorokynurenic acid, 100 µM), but no controls for the specific actions of these compounds are shown. In addition, no direct testing for ions passing through preNMDARs has been performed.
  
  It is not known whether the results can be extrapolated to the adult brain, as the data were obtained from slices of 11-18-day-old mice, a period during which synapses are still maturing and the cortex is highly plastic.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  In this manuscript, "Neocortical Layer-5 tLTD Relies on Non-Ionotropic Presynaptic NMDA Receptor Signaling", Thomazeau et al. seek to determine the role of presynaptic NMDA receptors and the mechanism by which they mediate expression of frequency-independent timing-dependent long-term depression (tLTD) between layer-5 (L5) pyramidal cells (PCs) in the developing mouse visual cortex. By utilizing sophisticated methods, including sparse Cre-dependent deletion of the GluN1 subunit via neonatal iCre-encoding viral injection, in vitro quadruple patch clamp recordings, and pharmacological interventions, the authors elegantly show that L5 PC->PC tLTD is (1) dependent on presynaptic NMDA receptors, (2) mediated by non-ionotropic NMDA receptor signaling, and (3) reliant on JNK2/Syntaxin-1a (STX1a) interaction (but not RIM1αβ) in the presynaptic neuron. The study elegantly and pointedly addresses a long-standing conundrum regarding the lack of frequency dependence of tLTD.
  
  Strengths:
  
  The authors did a commendable job presenting a very polished piece of work with high-quality data that this Reviewer feels enthusiastic about. The manuscript has several notable strengths. Firstly, the methodological approach used in the study is highly sophisticated and technically challenging, and it successfully produced high-quality data that are easily accessible to a broader audience. Secondly, the pharmacological interventions used in the study targeted specific players and their mechanistic roles, unveiling the mechanism in question step-by-step. Lastly, the manuscript is written in a well-organized manner that is easy to follow. Overall, the study provides a compelling body of evidence that leads to a clear mechanistic understanding.
  
  I have a couple of small items below, which the authors can address in a minor revision if they so wish.
  
  Minor comments:
  
  (1) For the broad readership, a brief description of JNK2-mediated signaling cascade underlying tLTD, including its intersection with CB1 receptor signaling may be desired.
  
  (2) The authors used juvenile mice, P11 to P18 of age. It is a typical age range used for plasticity experiments, but it is also true that this age range spans before and after eye-opening in mice (~P13) and is a few days before the onset of the classical critical period for ocular dominance plasticity in the visual cortex. Given the mechanistic novelty reported in the study, can authors comment on whether this signaling pathway may be age-dependent?
  
  Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.07.637179v1
www.biorxiv.org www.biorxiv.org

P66Shc Mediates SUMO2-induced Endothelial Dysfunction

5
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study offers valuable insights into the role of post-translational modifiers, specifically SUMO2ylation at K81 in p66Shc, and its impact on endothelial function through reactive oxygen species. A series of compelling experiments demonstrated that lysine 81 of p66Shc is the site of SUMO2 conjugation, which is crucial for mitochondrial localization and essential for S36 phosphorylation, leading to specific pathological effects. The combination of cell overexpression and animal studies provides solid data supporting this mechanistic link.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors describe a role of sumoylation at K81 in p66Shc which affects endothelial dysfunction. This explores a new mechanism for understanding the role of PTMs in cellular processes.
 
 Strengths:
 
 The experiments are well planned and the results are well represented. Vascular tonality experiments were carried out nicely, given the amount of time and effort one needs to put in to get clean results from these experiments.
 
 Weaknesses:
 
 (1) The production of ROS has been measured in a very superficial way. The term "ROS" covers a plethora of chemical species which exert different physiological effects in different cells and situations. Mitochondria are one source, but not the only source, of ROS production. Measuring ROS with MitoSOX alone does not reflect the cellular ROS status in a specific condition. I would suggest the authors consider doing IF for oxidative-stress-specific markers, such as carbonyl groups, and also, maybe, Amplex Red for determining average oxidative stress and ROS production in the cells.
 
 (2) The 8-OHG signal seems very confusing in Figure 7E. 8-OHG is supposed to be mainly in the nucleus and to some extent in mitochondria. The signal is very diffuse in the images. I would suggest higher-magnification and better-resolution images for 8-OHG. Also, the VWF signal is pretty weak, whereas it should be strong given that the staining is in aorta. The authors should redo the experiments.
 
 (3) The PCA analysis is not quite clear. Why is there a convergence among the plots? The authors should explain. Also, I would suggest that the authors redo the analysis in Figure 8B with R-based packages. IPA, though user-friendly, mostly does not yield meaningful results, and the statistics carried out are not accurate. The authors should redo the analysis in R or Python, whichever is suitable for them.
 
 (4) The MS analysis part seems pretty vague in the methods. Please rewrite.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The article builds on earlier work showing that both p66Shc and SUMOylation are essential for nitric oxide (NO)-based development of endothelial vasculature (PMID: 10580504; 28760777 and 35187108). The current manuscript brings forward the finding that SUMO2ylation of p66Shc mediates ROS production, which is essential for endothelial cells. They further identify that lysine 81 of p66Shc is the residue which is conjugated to SUMO2 and is crucial for mitochondrial localization. They further show that K81 SUMO2ylation is essential for S36 phosphorylation.
 
 Strengths:
 
 Convincingly shows that p66Shc is SUMO2ylated on lysine 81 in cells and also shows that the phosphorylation (serine 36) reduces upon loss of this critical SUMOylation site.
 
 Weaknesses:
 
 All the experiments performed here are in an overexpression background; therefore, it would be crucial to show that p66Shc is SUMO2ylated at physiological levels.
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 The authors set out to determine how SUMO2 impairs endothelial function through direct modification of the protein p66Shc. p66Shc is known to promote reactive oxygen species production, and here the authors demonstrate that SUMO2 modifies p66Shc at lysine-81, resulting in increased phosphorylation and mitochondrial translocation. These are proposed to mediate the detrimental effects of SUMO2 in a mouse model of hyperlipidemia.
 
 Strengths:
 
 A major strength of this work is the multi-pronged approach combining biochemical assays, proteomic analyses, and a genetically modified mouse model expressing a SUMOylation resistant mutant of p66Shc. These experiments comprehensively illustrate that lysine-81 SUMOylation of p66Shc is necessary for the observed endothelial dysfunction in hyperlipidemic conditions.
 
 Weaknesses:
 
 One notable weakness is that the link between the observed cellular changes and the ultimate in vivo phenotype remains only partially explored. While the authors successfully show that p66ShcK81R knockin mice are protected from endothelial dysfunction in a hyperlipidemic context, additional experiments characterizing the broader tissue-specific roles, or examining further endothelial assays in vivo, would strengthen the mechanistic conclusions. It would also be beneficial to see more direct evaluations of p66Shc subcellular localization in the protective knockin mice to complement the proteomic findings.
 
 Despite these gaps, the data broadly support the authors' main conclusions. The authors lay out a plausible mechanistic pathway for how hyperlipidemia and increased global SUMOylation can converge on the oxidative stress pathway to provoke vascular dysfunction.
 
 The likely impact of this work on the field is noteworthy. Beyond clarifying how a single post-translational modification event can influence the pathophysiology of endothelial cells, the study provides a model for investigating broader roles of SUMO2 in other cardiovascular conditions and highlights the importance of identifying additional SUMOylation sites and their downstream impact.
 
 In conclusion, by demonstrating the direct SUMOylation of p66Shc at lysine-81 and linking that modification to endothelial dysfunction in a hyperlipidemic mouse model, this paper offers valuable insights into how broadly acting post-translational modifiers can evoke specific pathological effects.
 
 Review 3
5. Public_Reviews 09 May 2025
 
 in eLife
 
 Author response:
 
 Public Reviews:
 
 Reviewer #1 (Public review):
 
 (1) The production of ROS has been measured in a very superficial way.
 
 The term "ROS" covers a plethora of chemical species which exert different physiological effects in different cells and situations.
 
 Mitochondria are one source, but not the only source, of ROS production. Measuring ROS with MitoSOX alone does not reflect the cellular ROS status in a specific condition. I would suggest the authors consider doing IF for oxidative-stress-specific markers, such as carbonyl groups, and also, maybe, Amplex Red for determining average oxidative stress and ROS production in the cells.
 
 We agree with the reviewer that a detailed analysis of ROS production and its markers would strengthen the manuscript. Accordingly, we will perform the Amplex Red assay for Figure 1.
 
 (2) The 8-OHG signal seems very confusing in Figure 7E. 8-OHG is supposed to be mainly in the nucleus and to some extent in mitochondria. The signal is very diffuse in the images. I would suggest higher-magnification and better-resolution images for 8-OHG. Also, the VWF signal is pretty weak, whereas it should be strong given that the staining is in aorta. The authors should redo the experiments.
 
 The reviewer’s comment is correct regarding the expected signal. We will repeat the assays. However, we would like to note that the flat morphology of the endothelial cell monolayer on the aortic surface may limit the visualization of subcellular signal differentiation when transversely sectioned.
 
 (3) The PCA analysis is not quite clear. Why is there a convergence among the plots? The authors should explain. Also, I would suggest that the authors redo the analysis in Figure 8B with R-based packages. IPA, though user-friendly, mostly does not yield meaningful results, and the statistics carried out are not accurate. The authors should redo the analysis in R or Python, whichever is suitable for them.
 
 Thank you for your valuable feedback. We acknowledge the concern regarding the PCA analysis and the convergence observed in the plots. In the revised manuscript, we will revise our interpretation to clarify this observation.
 
 Additionally, we appreciate your suggestion to use R-based packages for pathway analysis. We will make efforts to regenerate the analysis presented in Figure 8B using R to enhance the statistical robustness and reproducibility of our results.
 
 (4) The MS analysis part seems pretty vague in methods. Please rewrite.
 
 We will revise the methods section to improve the legibility.
 
 Reviewer #2 (Public review):
 
 All the experiments performed here are in an overexpression background; therefore, it would be crucial to show that p66Shc is SUMO2ylated at physiological levels.
 
 To address this concern, we will attempt to assess p66Shc-SUMO2 levels under physiological conditions. However, we would like to highlight a technical limitation: the currently available antibodies do not distinguish p66Shc from other isoforms, nor SUMO2 from SUMO3. Therefore, enriching for the endogenous p66Shc-SUMO2 adduct will require novel tools and techniques, which we are actively exploring.
 
 Reviewer #3 (Public review):
 
 One notable weakness is that the link between the observed cellular changes and the ultimate in vivo phenotype remains only partially explored. While the authors successfully show that p66ShcK81R knockin mice are protected from endothelial dysfunction in a hyperlipidemic context, additional experiments characterizing the broader tissue-specific roles, or examining further endothelial assays in vivo, would strengthen the mechanistic conclusions. It would also be beneficial to see more direct evaluations of p66Shc subcellular localization in the protective knockin mice to complement the proteomic findings.
 
 That is an excellent suggestion. We will determine the tissue specific distribution of endogenous p66ShcK81R.
 
 Despite these gaps, the data broadly support the authors' main conclusions. The authors lay out a plausible mechanistic pathway for how hyperlipidemia and increased global SUMOylation can converge on the oxidative stress pathway to provoke vascular dysfunction.
 
 The likely impact of this work on the field is noteworthy. Beyond clarifying how a single post-translational modification event can influence the pathophysiology of endothelial cells, the study provides a model for investigating broader roles of SUMO2 in other cardiovascular conditions and highlights the importance of identifying additional SUMOylation sites and their downstream impact.
 
 In conclusion, by demonstrating the direct SUMOylation of p66Shc at lysine-81 and linking that modification to endothelial dysfunction in a hyperlipidemic mouse model, this paper offers valuable insights into how broadly acting post-translational modifiers can evoke specific pathological effects.
 
 AuthorResponse

Tags

Summary

Review 1

Review 3

AuthorResponse

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.01.24.577109v3
www.biorxiv.org www.biorxiv.org

Image Correlation Spectroscopy is a Robust Tool to Quantify Cellular DNA Damage Response

5
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This valuable paper shows image correlation spectroscopy (ICS) as a new tool to analyze the clustering of proteins involved in DNA damage response (DDR). The convincing evidence presented demonstrates that ICS is more sensitive than traditional foci counting. This new method provides an alternative tool to quantify immunostained foci for researchers in the fields of DDR and cell biology.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This manuscript assesses the utility of spatial image correlation spectroscopy (ICS) for measuring physiological responses to DNA damage. ICS is a long-established (~1993) method, similar to fluorescence correlation spectroscopy, for deriving information about the fluorophore density that underlies the intensity distributions of images.
  
  The revisions to the current manuscript have improved the understanding of the strengths and limitations of the spatial ICS method. In particular, since the measurements are obtaining complementary information to traditional focus counting, one does not expect a simple linear relationship between the quantities obtained by ICS and by immunostaining. The explanations are satisfactory to me and, I expect, to the interested reader.
  
  Additionally, I am satisfied with the code availability now that it is placed on Github.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  This valuable study presents image correlation spectroscopy (ICS) as an alternative to foci counting for quantitatively measuring the recruitment of DNA damage response-associated proteins to chromatin following exposure of cells to various genotoxic agents. The evidence presented to demonstrate that this method is more sensitive than traditional foci counting is convincing, although the two methods provide similar results for many of the comparisons. This work will be of interest to scientists using immunostaining to study DNA repair.
  
  Comments on revisions:
  
  The authors adequately addressed the comments raised and improved the manuscript. The authors accurately state that there is subjectivity in foci counting, e.g., different thresholds and/or algorithms produce different absolute counts. In addition, the conditions for pre-extraction also introduce variability, and any pre-extraction may inadvertently remove meaningful signal. Yet it is unclear whether these differences in absolute counts impact the conclusions that can be drawn from these experiments, which do not usually make a claim about the absolute number of foci, but rather a comparison between two different conditions with the same pre-extraction conditions and the same threshold/counting algorithm applied, with appropriate controls. Moreover, when the authors compared ICS to foci counting, the results were largely similar, although ICS was superior in a few instances. Overall, how transitioning from the widely-used foci counting method to ICS will offer a major advantage is unclear.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  This paper describes a new tool called Image Correlation Spectroscopy (ICS) to detect clustered fluorescence signals such as foci in the nucleus (or any other cellular structures). The authors compared ICS DA (degree of aggregation) data with Imaris Spots data (and ImageJ Find Maxima data) and found comparable results between the two analyses, with ICS sometimes producing a better quantification than the Imaris software. Moreover, the authors extended the application of ICS to detect cell-cycle stages by analyzing the DAPI image of cells. This is a useful tool that avoids the subjective bias of researchers and provides novel quantitative values in cell biology.
  
  Strengths:
  
  The authors developed a new tool to detect and quantify the aggregates of immunofluorescent signals, which is central to modern cell biology, including the field of DNA damage responses (DDR) and DNA repair. This new method could detect the "invisible" signal in cells without pre-extraction, which avoids the effect of extracting material on the pre-assembled ensembles that are the target of detection. This would be an alternative to conventional methods for the quantification of fluorescent signals.
  
  Comments on revisions:
  
  The authors addressed previous comments properly.
  
  Review 3
5. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews
  
  Public Reviews:
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This manuscript assesses the utility of spatial image correlation spectroscopy (ICS) for measuring physiological responses to DNA damage. ICS is a long-established (~1993) method similar to fluorescence correlation spectroscopy, for deriving information about the fluorophore density that underlies the intensity distributions of images. The authors first provide a technical but fairly accessible background to the theory of ICS, then compare it with traditional spot-counting methods for its ability to analyze the characteristics of γH2AX staining. Based on the degree of aggregation (DA) value, the authors then survey other markers of DNA damage and uncover some novel findings, such as that RPA aggregation inversely tracks the sensitivity to PARP inhibitors of different cell lines.
  
  The need for a more objective and standardized tool for analyzing DNA damage has long been felt in the field and the authors argue convincingly for this. The data in the manuscript are in general well-supported and of high quality, and show promise of being a robust alternative to traditional focus counting. However, there are a number of areas where I would suggest further controls and explanations to strengthen the authors' case for the robustness of their ICS method.
  
  Strengths:
  
  The spatial ICS method the authors describe and demonstrate is easy to perform and applicable to a wide variety of images. The DDR was well-chosen as an arena to showcase its utility due to its well-characterized dose-responsiveness and known variability between cell types. Their method should be readily useable by any cell biologist wanting to assess the degree of aggregation of fluorescent tags of interest.
  
  Weaknesses:
  
  The spatial ICS method, though of longstanding history, is not as intuitive or well-known as spot-based quantitation. While the Theory section gives a standard mathematical introduction, it is not as accessible as it could be. Additionally, the values of TNoP and DA shown in the Results are not discussed sufficiently with regard to their physical and physiological interpretation.
  
  We agree that a major limitation in adoption of this approach is the lack of a deeper understanding of the theory and results. We have updated the theory section to include further discussion (Page 4 line 132).
  
  The correlation of TNoP with γH2AX foci is high (Figure 2) and suggestive that the ICS method is suitable for measuring the strength of the DDR. The authors correctly mention that the number of spots found using traditional means can vary based on the parameters used for spot detection. They contrast this with their ICS detection method; however, the actual robustness of spatial ICS is not given equal consideration.
  
  We found it difficult to give equal consideration of robustness to ICS. The major limitation of traditional approaches is proper selection of an intensity threshold that is necessary to define and separate foci from background intensity. However, ICS does not employ a threshold, therefore we could not test different thresholding applications in ICS as we did with traditional methods. In our view the absence of the need for a threshold is profoundly advantageous. The only inputs we employ in the ICS analysis are used to segment cell nuclei, yet these have no impact on the ICS calculation and are necessary for any analysis of the DDR.
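The contrast between threshold-based counting and a threshold-free ICS-style measure can be sketched with a toy example. This is a simplified illustration under stated assumptions, not the authors' pipeline: it treats an image as a flat list of pixel intensities, approximates the zero-lag spatial autocorrelation amplitude g(0,0) by the normalised intensity variance (a real ICS analysis would typically fit the spatial autocorrelation function and account for noise and the point spread function), and takes the degree of aggregation as g(0,0) multiplied by the mean intensity. All function names and pixel values are hypothetical.

```python
from statistics import fmean


def ics_amplitude(pixels):
    """Crude g(0,0) estimate: intensity variance divided by squared mean intensity."""
    mean = fmean(pixels)
    variance = fmean([(p - mean) ** 2 for p in pixels])
    return variance / (mean * mean)


def degree_of_aggregation(pixels):
    """DA sketch: g(0,0) multiplied by the mean intensity."""
    return ics_amplitude(pixels) * fmean(pixels)


def count_above(pixels, threshold):
    """Threshold-based stand-in for focus detection: count pixels above a cutoff."""
    return sum(1 for p in pixels if p > threshold)


if __name__ == "__main__":
    # Two toy "images" with the same mean intensity.
    diffuse = [10] * 100                 # signal spread evenly
    clustered = [0] * 90 + [100] * 10    # signal concentrated in a few pixels

    print("DA diffuse:  ", degree_of_aggregation(diffuse))
    print("DA clustered:", degree_of_aggregation(clustered))

    # The threshold-based count changes with the chosen cutoff.
    for threshold in (50, 150):
        print("threshold", threshold, "->", count_above(clustered, threshold))
```

In this toy case the aggregation measure separates the two images without any user-chosen cutoff, whereas the count for the clustered image depends entirely on where the threshold is set.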
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Immunostaining of chromatin-associated proteins and visualization of these factors through fluorescence microscopy is a powerful technique to study molecular processes such as DNA damage and repair, their timing, and their genetic dependencies. Nonetheless, it is well-established that this methodology (sometimes called "foci-ology") is subject to biases introduced during sample preparation, immunostaining, foci visualization, and scoring. This manuscript addresses several of the shortcomings associated with immunostaining by using image correlation spectroscopy (ICS) to quantify the recruitment of several DNA damage response-associated proteins following various types of DNA damage.
  
  The study compares automated foci counting and fluorescence intensity to image correlation spectroscopy degree of aggregation to study the recruitment of DNA repair proteins to chromatin following DNA damage. After validating image correlation spectroscopy as a reliable method to visualize the recruitment of γH2AX to chromatin following DNA damage in two separate cell lines, the study demonstrates that this new method can also be used to quantify RPA1 and Rad51 recruitment to chromatin following DNA damage. The study further shows that RPA1 signal as measured by this method correlates with cell sensitivity to Olaparib, a widely-used PARP inhibitor.
  
  Strengths:
  
  Multiple proof-of-concept experiments demonstrate that using image correlation spectroscopy degree of aggregation is typically more sensitive than foci counting or foci intensity as a measure of recruitment of a protein of interest to a site of DNA damage. The sensitivity of the SKOV3 and OVCA429 cell lines to MMS and the PARP inhibitors Olaparib and Veliparib as measured by cell viability in response to increasing amounts of each compound is a valuable correlate to the image correlation spectroscopy degree of aggregation measurements.
  
  Weaknesses:
  
  The subjectivity of foci counting has been well-recognized in the DNA repair field, and thus foci counts are usually interpreted relative to a set of technical and biological controls and across a meaningful time period. As such:
  
  (1) A more detailed description of the numerous prior studies examining the immunostaining of proteins such as γH2AX, RAD51, and RPA is needed to give context to the findings presented herein.
  
  We apologize for not providing enough detail. We have added further references and discussion. γH2AX foci counting, in particular, has been used in thousands of previous studies (page 18, lines 513 and 517).
  
  (2) The benefits of adopting image correlation spectroscopy should be discussed in comparison to other methods, such as super-resolution microscopy, which may also offer enhanced sensitivity over traditional microscopy.
  
  Thank you for raising this point. We have added this discussion (page 19, line 553). The limiting factor that ICS addresses is the partition coefficient of signal in a focus or cluster versus outside the cluster. Super-resolution will not necessarily improve this unless it is resolved down to single-molecule counting. However, one would still need to evaluate how to define a cluster or focus against a background of non-clustered signal.
  
  (3) Additional controls demonstrating the specificity of their antibodies to detection of the proteins of interest should be added, or the appropriate citations validating these antibodies included.
  
  We have added text stating that we only use validated antibodies (page 6, line 193). One thing to note is that we are measuring differences between treatment conditions; thus, if an antibody has non-specific labeling of proteins or cellular structures that do not change upon treatment, our approach would overcome this limitation.
  
  Reviewer #3 (Public review):
  
  Summary:
  
  This paper describes a new tool called "Image Correlation Spectroscopy" (ICS) to detect clustering fluorescence signals such as foci in the nucleus (or any other cellular structures). The authors compared ICS DA (degree of aggregation) data with Imaris Spots data (and ImageJ Find Maxima data) and found a comparable result between the two analyses and that the ICS sometimes produced a better quantification than the Imaris. Moreover, the authors extended the application of ICS to detect cell-cycle stages by analyzing the DAPI image of cells. This is a useful tool without the subjective bias of researchers and provides novel quantitative values in cell biology.
  
  Strengths:
  
  The authors developed a new tool to detect and quantify the aggregates of immunofluorescent signals, which is a center of modern cell biology, such as the fields of DNA damage responses (DDR), including DNA repair. This new method could detect the "invisible" signal in cells without pre-extraction, which could prevent the effect of extracted materials on the pre-assembled ensembles, a target for the detection. This would be an alternative method for the quantification of fluorescent signals relative to conventional methods.
  
  Recommendations for the authors:
  
  Reviewer #1 (Recommendations for the authors):
  
  Major comments:
  
  (1) The ICS theory section is essential and based on an excellent review from one of the authors. It would benefit greatly from a diagram showing where the quantities 𝒈(𝟎, 𝟎), 𝝎𝟎, and 𝒈inf come from in the 2D Gaussian fit, ideally for two cases where these quantities differ (i.e., how they correspond to different DA or TNoP values). In my opinion, this addition would greatly increase the manuscript's accessibility for DDR researchers. The citation of the review at the beginning would also be a plus.
  
  We have added the review citation at the front of the theory section (page 3, line 87). We have highlighted where g(0,0), the most critical measurement for determination of TNoP and DA, derives from in Figure 2D. However, it is difficult to describe all the curve fit parameters in an image, as they have some interdependency on each other; labeling one in a single image would therefore not independently capture how they might be observed in a different curve fit.
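
For orientation, the three fit parameters named in this comment are usually defined as below in the general spatial ICS literature. This is a sketch of the textbook form, assuming the manuscript follows the same conventions; the symbols ⟨N⟩ (mean number of independent particles per beam area) and ⟨i⟩ (mean intensity in the analyzed region) are introduced here for illustration, and the manuscript's own definition of TNoP is deliberately not reproduced.

```latex
g(\xi,\eta) = g(0,0)\,\exp\!\left(-\frac{\xi^{2}+\eta^{2}}{\omega_{0}^{2}}\right) + g_{\infty},
\qquad
g(0,0) \approx \frac{1}{\langle N \rangle},
\qquad
\mathrm{DA} = \langle i \rangle \, g(0,0)
```

In this form g(0,0) is the fitted zero-lag amplitude, ω0 the fitted width (set largely by the point spread function), and g_inf the offset at large lags, which is why the three are not independent within a single fit.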
  
  (2) The TNoP measured in Figure 2 is a quantity about 2000-3000 times greater than the number of "traditionally detected" foci by both methods and the linear relations have very low Y intercepts. Can the authors comment explicitly on the physical interpretation of this number - are 2 to 3 thousand independent particles present within each "focus" detected by traditional means? If so, then what might one "particle" correspond to? (a single secondary antibody or fluorophore? a nucleosome?). In a similar vein, the X intercepts lie at around 25 foci, meaning that in images with fewer than that number of foci detected by ImageJ or Imaris, the ICS method should detect zero TNoP - is this in line with the authors' predictions? Is it possible that a first-order line fit is not the most appropriate relation between the two methods?
  
  We apologize for our brevity here. Since DA proved to be a more useful metric we did not spend much effort discussing TNoP. TNoP correlates to the number of clustered particles, or non-diffuse fluorophores. TNoP is the inverse of the number of individual particles per nucleus, but the value is not a direct measure of foci. If a sample had no clustering at all, the number of individual particles would be at a maximum and the TNoP would be at a minimum. However, as fluorophores cluster, the number of individual particles (i.e. non-clustered fluorophores) decreases, which increases the TNoP value. Therefore, TNoP has a correlation to the number of foci detected through traditional measurements, as we found here. Yet, TNoP is a relative measurement and cannot be compared across different conditions. Similar to foci counting, TNoP is unable to factor the size or intensity of each cluster, thus DA is a more appropriate quantification of the DNA damage response.
  
  The value of TNoP is dependent on the fitted point spread function and the area of the nucleus. The y=0 intercept of TNoP is defined by the optical setup and is not expected to necessarily go through x=0. Intriguingly, other groups have found that some foci identified through traditional measurements are actually clusters of multiple smaller foci, thus the concept of what a focus represents is difficult to interpret. Thus, here we aimed to show a general correlation of TNoP with foci count through traditional methods to reflect how ICS is similar to foci counting, then employed DA to overcome the limitations of defining a focus.
  
  We have tried to clarify this in the text (page 8, line 266).
  
  (3) Some suggestions to address the robustness of ICS:
  
  For a given sample (i.e. one segmented nucleus), the calculation of DA and TNoP should be similar between different images of that same nucleus taken at different times, similar to how the number of traditionally detected foci would be fairly invariant. In particular, it should be shown that these values are not just scaling with the higher normalized intensity seen in stronger DDR responses. In the same vein, the linear relationship between TNoP and "foci" should not change even if the confocal settings are slightly different (i.e., higher/lower illumination intensity) as long as the condition stipulated by the authors in the Discussion holds ("ICS can be implemented on any fluorescence image as long as the square relative fluorescence intensity fluctuations are detectable above noise fluctuations."). To show, as the title states, that spatial ICS is a robust tool, it would be desirable to demonstrate this with a series of images of the same cell at the same or varying excitation intensities.
  
  Thank you for your suggestions. Indeed, the calculation will be the same over sequential images of the same cell. Observations of dose dependent DA that does not correlate with intensity for RPA1 and RAD51 results (Fig. S5) directly demonstrates that DA does not just scale with intensity.
  
  We would not expect the TNoP to change with confocal setting, however we show in Figure 1 that the number of foci does indeed change with intensity settings as captured by thresholds. Therefore, any interpretation of TNoP vs. foci count would be very difficult to make at different microscope settings. To ensure we are fairly comparing ICS to existing analysis we keep the settings the same and measure changes between conditions.
  
  (4) More information is needed on how intensity normalization was performed. The Methods states "Measurements across experiments were normalized by the control in each dataset." The DMSO (0mM drug) plots all appear to have a mean of 1.0, so it appears the values for each set of control nuclei were divided by their own mean, and then the values for each set of experimental nuclei were divided by the mean value of all 3 controls as an aggregate; is this correct?
  
  We apologize for not being more clear. Thank you for raising this point. We normalized data to a control from each experimental group. Thus, in figures 3,4 and 5 data were collected over multiple experiments with one control per experiment and each treatment condition included in each experiment. Therefore, we normalized each result to the corresponding control from that imaging session. However, in Figure 8 we ran experiments at much higher throughput with multiple controls per experiment, thus the data were normalized to the overall average of the controls, which is why the control averages are not all at a value of 1. We have clarified this in the text. (Page 7 line 218).
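
The two normalization schemes described in this response can be sketched as follows. This is an illustrative example only, not the authors' code; the function names, dictionary layout, and numbers are hypothetical.

```python
from statistics import mean


def normalize_to_own_control(experiments):
    """Divide every value by the mean of the control from the same experiment.

    `experiments` maps an experiment name to {condition: [values]}, where one
    condition is named "control" (hypothetical layout for this sketch).
    """
    normalized = {}
    for name, groups in experiments.items():
        ref = mean(groups["control"])
        normalized[name] = {
            cond: [v / ref for v in vals] for cond, vals in groups.items()
        }
    return normalized


def normalize_to_pooled_controls(groups, control_names):
    """Divide every value by the mean over all control wells pooled together."""
    pooled = [v for name in control_names for v in groups[name]]
    ref = mean(pooled)
    return {cond: [v / ref for v in vals] for cond, vals in groups.items()}


# One control per experiment: the normalized control mean is 1 by construction.
per_experiment = normalize_to_own_control(
    {"exp1": {"control": [2.0, 4.0], "10uM": [6.0]}}
)

# Several controls per experiment: individual control means need not equal 1.
pooled = normalize_to_pooled_controls(
    {"ctrl_a": [1.0, 1.0], "ctrl_b": [3.0, 3.0], "drug": [4.0]},
    control_names=["ctrl_a", "ctrl_b"],
)
```

With pooled controls, `ctrl_a` and `ctrl_b` end up with means of 0.5 and 1.5, which matches the observation that the control averages are not all at a value of 1.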
  
  (5) Some more information about the ICS analysis should be given if the full code is not provided - in particular, how the nucleus mask was implemented on the "signal" channel (were the edges abruptly set to zero or was a window function introduced to avoid edge effects in the discrete FFT?).
  
  Thank you for raising this point. We have added the code to GitHub - github.com/dubachLab/ics. The signal region was established by simply applying the nuclear mask from the DAPI channel to the IF channel. Each region is padded with the average intensity value at the edges for 2x the dimensions of the ROI to remove edge effects in the FFT.
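
The effect of mean-value padding can be illustrated with a small dependency-free sketch. This is not the authors' MATLAB code: the function names and the toy 2x2 image are hypothetical, the pad width is a free parameter here, and a direct circular sum is used in place of an FFT (it gives the same values an FFT-based circular correlation would).

```python
def pad_with_mean(image, mask, pad):
    """Embed the masked ROI in a larger array filled with the in-mask mean."""
    rows, cols = len(image), len(image[0])
    inside = [image[r][c] for r in range(rows) for c in range(cols) if mask[r][c]]
    mean = sum(inside) / len(inside)
    padded = [[mean] * (cols + 2 * pad) for _ in range(rows + 2 * pad)]
    for r in range(rows):
        for c in range(cols):
            if mask[r][c]:
                padded[r + pad][c + pad] = image[r][c]
    return padded, mean, len(inside)


def autocorr(padded, mean, n_inside, dx, dy):
    """Circular autocorrelation of intensity fluctuations at lag (dx, dy).

    Normalized by the number of in-mask pixels and the squared mean, so
    padded pixels (zero fluctuation) contribute nothing to the sum.
    """
    rows, cols = len(padded), len(padded[0])
    total = 0.0
    for r in range(rows):
        for c in range(cols):
            a = padded[r][c] - mean
            b = padded[(r + dy) % rows][(c + dx) % cols] - mean
            total += a * b
    return total / (n_inside * mean ** 2)


image = [[1.0, 3.0], [3.0, 1.0]]
mask = [[True, True], [True, True]]

# With padding, lags shorter than the pad width never wrap around the ROI.
padded, mean, n = pad_with_mean(image, mask, pad=2)
g00 = autocorr(padded, mean, n, 0, 0)
g10_padded = autocorr(padded, mean, n, 1, 0)

# Without padding, the circular correlation pairs opposite edges of the ROI.
unpadded, mean0, n0 = pad_with_mean(image, mask, pad=0)
g10_wrapped = autocorr(unpadded, mean0, n0, 1, 0)
```

In this toy case the zero-lag value is unchanged by padding, while the lag-1 value differs between the padded and wrapped calculations, which is the edge effect the padding is meant to remove.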
  
  Minor comments:
  
  (1) Figure 3, 4, 5: I think it would aid figure readability if channels were labeled in the images themselves, not just in the legend.
  
  Thank you for the suggestion. We tried doing this and struggled to fit a label within the layout of the images. We were also concerned about interpretation of data in each column and the potential to assign data to each figure if they were so prominently labeled.
  
  (2) Supplemental Figures are mislabeled; the order given in the legends is S1, S2, S3, S2, S3. S4 is called out in the main text where it should be S5.
  
  Thank you for catching this error. We have made the necessary corrections. S4 contains data on cellular response to the drugs, while S5 contains intensity data in response to MMS.
  
  (3) It should be stated for each Figure what kind of microscopy was performed - I assume that it is confocal for everything except when widefield is explicitly stated, but for clarity please add this information.
  
  Indeed, this is correct, we have indicated which microscopy was used for each figure.
  
  (4) The MATLAB code and full (uncropped) Western blots should be provided as supplemental data if possible.
  
  We have included a GitHub link for the code and un-cropped western blots.
  
  (5) The p values from significance tests should indicate whether multiple comparisons correction was necessary (if suggested by Prism) and performed.
  
  Apologies for the lack of clarity. This was not necessary, because significance was calculated vs. the next lower dose (e.g. 10 micromolar vs. 1 micromolar). We have clarified this in the methods (page 7, line 221).
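
The adjacent-dose comparison scheme described here can be sketched as follows. The response does not say which statistical test was used, so the Welch t statistic below is an assumption chosen only for illustration; the function names and data are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(lower, higher):
    """Welch's t statistic for two independent samples (assumed test)."""
    se = sqrt(variance(lower) / len(lower) + variance(higher) / len(higher))
    return (mean(higher) - mean(lower)) / se


def adjacent_dose_comparisons(by_dose):
    """Compare each dose only with the next lower dose.

    `by_dose` maps a dose to a list of measurements. With k doses this
    yields k - 1 comparisons, rather than all pairwise combinations.
    """
    doses = sorted(by_dose)
    return [
        (lo, hi, welch_t(by_dose[lo], by_dose[hi]))
        for lo, hi in zip(doses, doses[1:])
    ]


comparisons = adjacent_dose_comparisons(
    {0: [1.0, 2.0, 3.0], 1: [2.0, 3.0, 4.0], 10: [5.0, 6.0, 7.0]}
)
```

Each tuple holds the lower dose, the higher dose, and the statistic for that pair; converting the statistic to a p value would need a t distribution from a statistics package.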
  
  Reviewer #2 (Recommendations for the authors):
  
  Major points:
  
  In addition to the weaknesses noted above, to encourage widespread adoption of this method, the authors should make the tools that they used for their analysis publicly available. In a few instances (e.g., compare Figures 3J and 3L), other methods outperform DA. It would be meaningful to discuss when especially DA may be a better measure than others (such as intensity or number of foci).
  
  We have made the code available on GitHub. We expect that results such as those in Figures 3J and 3L, where intensity is significantly higher at the highest concentration but DA is not, are reflective of the underlying biology, and this may be interpreted differently under different experimental conditions. Imaris Spots (Fig. 3K) also does not capture a significant increase at the highest dose of olaparib, suggesting that intensity may rise without generating more foci. These results are likely highly dependent on the mechanism of olaparib at such a high concentration and on the DDR. We are hesitant to draw biological conclusions from these results and instead would like to highlight the capacity of ICS to evaluate the DDR; therefore, we don’t want to make any broad comments about different applications.
  
  Minor points:
  
  (1) Pg. 12: "We used MMS to induce DNA damage in SKOV3 and OVCA429 cells. As expected, normalized intensity for RPA1 and RAD51 values (Figure S5) did not display a dose dependence on MMS concentration."
  
  Please provide a citation for the claim that RPA1 and RAD51 normalized intensities do not display a dose dependence on MMS concentration.
  
  These were data that we generated. We were not expecting an intensity change as that would presumably require increased protein generation in response to MMS, compared to gH2AX where the phospho-specific H2AX is generated in the DDR.
  
  (2) Pg. 12: "Similar to RPA1, RAD51 does not form distinguishable foci in the nuclei in cells without preextraction (Fig. 5)." Please provide a citation for this claim.
  
  We did not do pre-extraction and our results don’t produce changes in distinguishable foci. We provided citations discussing how, without pre-extraction, foci formation for these proteins is not obvious (REF 38 and 39).
  
  (3) I noted that the authors cite one paper [38] apparently showing that RPA and Rad51 do not always form foci, however, this is in the C. elegans germline in response to micro irradiation, therefore I am not sure that it is applicable to human cells.
  
  We apologize for referencing a paper on C. elegans. Most papers looking at RPA and RAD51 in the DDR use pre-extraction, as it seems necessary to observe foci. Therefore, we could not find as many papers that do not use pre-extraction. Reference 39 is in HeLa cells.
  
  Reviewer #3 (Recommendations for the authors):
  
  Major points:
  
  (1) Page 8, the second paragraph: In the Result section, it is better to describe how the authors carried out immuno-staining (without pre-extract subtraction) and ICS briefly, although the method is described in detail in the Method section.
  
  Thank you for the suggestion. We have added this description (page 8, line 259).
  
  (2) In Figure 5K-P: The authors analyzed "invisible" RAD51 foci on the image (Fig. 5L, M, O, and P) without pre-extraction. As a control experiment, it is useful to check whether pre-extraction would provide "visible" RAD51 foci and to examine the similar MMS concentration dependency shown in Figure 5R (or 5T). This would strengthen the power of the ICS analysis.
  
  Thank you for the suggestion. In our hands, pre-extraction is extremely subjective. We have tried performing pre-extraction but find highly variable results depending on conditions. Therefore, we did not include any pre-extraction here. We expect that performing these experiments may or may not agree with results in Figure 5 largely because we are unable to achieve repeatable pre-extraction foci counting.
  
  (3) Figure 6D (and 6C) looks very interesting. It would be important to show the interpretation of this correlation shown in the graph. Although the authors argued that ICS analysis results shown in the graph could provide new insight into the DDR (page 14, last line 5), as shown in another part, it is important to carry out the same analysis by using Imaris Spots. Moreover, it is interesting to apply the analysis to RAD51 foci (shown in Figure 5), given that the PARPi effect is enhanced in the absence of RAD51-mediated recombination.
  
  We completely agree that this analysis may generate interesting results to help interpret the DDR to PARP inhibition. These experiments are part of an ongoing follow-up study where we extend the use of ICS to other parts of the DDR and investigate protein clustering across several proteins with impact on PARPi response. Therefore, since the focus of this manuscript is introducing ICS as a tool to study the DDR, we believe that omitting those data here does not detract from the central points of the manuscript. We included the results in Figure 6 because we wanted to show how ICS could impact DDR research. Furthermore, combined with our advances shown in Figures 7 and 8, we are currently working on adapting ICS to be high-throughput and much simpler than Imaris Spots for handling the large datasets needed to generate results like those in Figure 6.
  
  Minor points:
  
  (1) Figure 1I, blue arrows: These showed an area with a higher background. Because of a low magnification, it is very hard to see the difference from the other areas of the background. It is better to show a magnified image of the representative region with a higher background.
  
  We hope that readers can see the higher intensity in the diffuse area. We attempted to construct a zoomed-in area, but that either blocked a significant portion of the non-zoomed image or added complexity to the figure. We have noted that images in Figure S1 are larger and more obviously capture an increase in background intensity.
  
  (2) Figure 2 legend, line 5, the same as "A)": This should be "B".
  
  Here, the number of independent particle clusters is intended to be the same as A, the difference is that the independent particles are clusters in C and individual fluorophores in A.
  
  (3) Page 9, the first paragraph, last line, foci formation, and foci composition: These should be "focus formation and focus composition".
  
  We have changed this.
  
  (4) Page 15, the first paragraph, line 5, palbociclib, camptothecin, or etoposide: please explain what kinds of the drugs are.
  
  We have added that these drugs cause cells to stall at different cell cycle stages. Explaining the drugs would take considerable room in the text.
  
  (5) Page 16, the first paragraph, line 1, bleomycin: Please explain what this drug is.
  
  Similar to above, we have stated that this drug causes DNA damage, going into detail would take several sentences.
  
Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.08.05.606697v2
www.biorxiv.org www.biorxiv.org

Integrated Analysis of Single-Cell and Bulk RNA-Seq Data reveals that Ferroptosis-Related Genes Mediated the Tumor Microenvironment predicts Prognosis, and guides Drug Selection in Triple-Negative Breast Cancer

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This study presents a useful finding for the ferroptosis-mediated tumor microenvironment (TME) in triple-negative breast cancer (TNBC) using public single-cell RNA sequencing (scRNA-seq) and bulk RNA sequencing data. The data were collected and analyzed using solid and validated methodology and can be used as a starting point for functional studies of TME in TNBC. The work will be of interest to medical biologists working in the field of TNBC.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This study aims to explore the ferroptosis-related immune landscape of TNBC through the integration of single-cell and bulk RNA sequencing data, followed by the development of a risk prediction model for prognosis and drug response. The authors identified key subpopulations of immune cells within the TME, particularly focusing on T cells and macrophages. Using machine learning algorithms, the authors constructed a ferroptosis-related gene risk score that accurately predicts survival and the potential response to specific drugs in TNBC patients.
  
  Strengths:
  
  The study identifies distinct subpopulations of T cells and macrophages with differential expression of ferroptosis-related genes. The clustering of these subpopulations and their correlation with patient prognosis is highly insightful, especially the identification of the TREM2+ and FOLR2+ macrophage subtypes, which are linked to either favorable or poor prognoses. The risk model thus holds potential not only for prognosis but also for guiding treatment selection in personalized oncology.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews
  
  Public Reviews:
  
  Reviewer #1 (Public review):
  
  Summary:
  
  Triple-negative breast cancer (TNBC) accounts for approximately 15-20% of all breast cancers. Compared to other types of breast cancer, TNBC exhibits highly aggressive clinical characteristics, a greater likelihood of metastasis, poorer clinical outcomes, and lower survival rates. Immunotherapy is an important treatment option for TNBC, but there is significant heterogeneity in treatment response. Therefore, it is crucial to accurately identify immunosuppressive patients before treatment and actively seek more effective therapeutic approaches for TNBC patients.
  
  Strengths:
  
  In this work, the authors collected and integrated single-cell and bulk RNA sequencing (RNA-seq) data to analyze the TME landscape mediated by ferroptosis-related genes. On this basis, a prediction model of prognosis and treatment response of 131 patients was constructed using a machine learning algorithm, which helps provide individualized and precise treatment guidance for breast cancer patients.
  
  Thank you for your appreciation of our work. We are encouraged by your positive feedback and will continue to explore new avenues in personalized medicine for breast cancer.
  
  Weaknesses:
  
  However, there are still some issues that need to be clarified:
  
  (1) The description of the research background is too brief and concise, and it is necessary to add some information about the limitations of existing methods and the differences and advantages of this study compared with other published relevant studies, so as to better highlight the necessity and research value of this study.
  
  Thank you for your suggestions. We have supplemented the research background and compared the differences between this study and other studies, further highlighting the research value of our study.
  
  (2) This study is a retrospective analysis of a public data set and lacks experimental validation and prospective experiments to support the results of bioinformatics analysis. This should be added to the acknowledgment of limitations in the study.
  
  Thank you for the constructive feedback. We acknowledge that the lack of experimental evidence is one of the limitations of this study. We therefore plan to conduct in vivo and in vitro experiments in future research to support the findings of our bioinformatics analysis, and we have added the relevant content to the limitations section of the Discussion.
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This study aims to explore the ferroptosis-related immune landscape of TNBC through the integration of single-cell and bulk RNA sequencing data, followed by the development of a risk prediction model for prognosis and drug response. The authors identified key subpopulations of immune cells within the TME, particularly focusing on T cells and macrophages. Using machine learning algorithms, the authors constructed a ferroptosis-related gene risk score that accurately predicts survival and the potential response to specific drugs in TNBC patients.
  
  Strengths:
  
  The study identifies distinct subpopulations of T cells and macrophages with differential expression of ferroptosis-related genes. The clustering of these subpopulations and their correlation with patient prognosis is highly insightful, especially the identification of the TREM2+ and FOLR2+ macrophage subtypes, which are linked to either favorable or poor prognoses. The risk model thus holds potential not only for prognosis but also for guiding treatment selection in personalized oncology.
  
  Thank you for your thorough review and insightful comments.
  
  Weaknesses:
  
  The study has a relatively small sample size, with only 9 samples analyzed by scRNA-seq. Given the typically high heterogeneity of the tumor microenvironment (TME) in cancer patients, this may affect the accuracy of the conclusions. The scRNA-seq analysis focuses on the expression of ferroptosis-related genes in various cells within the TME. In contrast, bulk RNA sequencing uses data from tumor samples, and the results between the two analyses are not consistent. The bulk RNA sequencing results may not accurately capture the changes happening in the microenvironment.
  
  Thank you for your constructive feedback. Although this study only included 9 samples, given the limited availability of scRNA-seq datasets for untreated TNBC in public databases, we chose to utilize a dataset that contains a relatively larger number of untreated TNBC samples. We are fully aware of the complexity and high heterogeneity of the TME. Despite the limited sample size, we first conducted rigorous quality control on the data and, based on this, preliminarily revealed the landscape of the TME mediated by ferroptosis-related genes. These findings provide a new perspective for understanding the biological mechanisms underlying the onset and progression of breast cancer. To enhance the reliability and generalizability of our research results, we plan to strive to expand the sample size in future work and consider integrating other omics technologies, such as proteomics and metabolomics, with scRNA-seq data for a more in-depth exploration of the complex interactions within the TME.
  
  We also agree with your viewpoint that scRNA-seq data reveals gene expression within individual cells, while bulk RNA-seq data reveals the average gene expression in tumor tissues, and there are differences in data acquisition and processing methods between the two. However, we believe that there are also some close connections between them in terms of gene expression levels. By comparing the expression specificity of marker genes for specific cell types in breast cancer tissues, we found that they are correlated with patient prognosis, and the results have been validated in both internal and external validation sets. Thank you once again for your valuable suggestions, which will play an important guiding role in our subsequent research.
  
  Reviewer #1 (Recommendations for the authors):
  
  (1) The breast cancer scRNA-seq dataset files of GSE176078 include 10 TNBC primary tumors (DOI:10.1016/j.compbiomed.2023.107066). However, in this study, only 9 cases were listed, please explain the reason for the data exclusion.
  
  Thank you for your questions. Although it was clearly stated in the original paper that "To elucidate the cellular architecture of breast cancers, we analyzed 26 primary pre-treatment tumors, including 11 ER+, 5 HER2+ and 10 TNBCs, by scRNA-Seq (Supplementary Table 1)," upon downloading and carefully examining the patient information in Supplementary Table 1, we only included 9 patients explicitly labeled as TNBC in our study (https://pmc.ncbi.nlm.nih.gov/articles/PMC9044823/#SD1).
  
  (2) The description of the technique in the methods section should be more detailed, such as parameter settings, quality control standards, etc.
  
  Thank you for your valuable suggestions. We have already supplemented the relevant content in the methods section.
  
  (3) Please check and correct formatting errors to improve readability, such as lines 176 and 177.
  
  We are very sorry for our careless mistakes, and thank you for the reminder. We have changed “Pseudotime analysis with scRNA-seq data helps to obtain an approximate landscape of gene expression dynamics” to “Pseudotime analysis of scRNA-seq snapshot data helps to provide an approximate landscape of gene expression dynamics”. We have also rechecked the manuscript and corrected its formatting errors.
  
  Reviewer #2 (Recommendations for the authors):
  
  (1) In multiple sections of the paper, abbreviations are used without being defined when first mentioned.
  
  We are very sorry for our careless mistakes, and thank you for the reminder. We have added definitions for the abbreviations in both the abstract and the main text.
  
  (2) The authors should analyze whether the transcription factors in Figure 2 are correlated with the expression of ferroptosis-related genes.
  
  Thank you for your valuable feedback. Some transcription factors in Figure 2 correlate with the expression of ferroptosis-related genes, which we have supplemented in the Discussion.
  
  (3) Figures 3d and 4e lack explanations for the axis values, and for Figure 4e, is the unit of the y-axis labeled "survival" in days?
  
  Thank you for your valuable feedback. We apologize for the lack of explanations for the axis values in Figures 3d and 4e and we have made revisions to both figures accordingly. We have noted that the unit "survival" on the y-axis of Figure 4e is in years, and we have already made the necessary supplement to clarify this. Thank you very much for your reminder.
  
  (4) The authors conducted their analysis using public databases but did not cite the original literature, nor did they discuss the similarities and differences between their findings and those in the original studies.
  
  Thank you for your valuable suggestions, and we deeply apologize for our carelessness. We have supplemented the original literature in the references and discussed the differences between this study and the original literature in the Discussion.
  
  (5) Some figures, particularly those involving heatmaps and t-SNE plots (e.g., Figures 1 and 3), present dense and complex data that may be challenging for readers to interpret. The heatmaps (Figure 1e-f and 3d) include many genes, but it is unclear how these genes were selected, and the scale of gene expression differences is difficult to interpret. Simplifying these figures by focusing on the most differentially expressed and clinically relevant genes (e.g., those with prognostic value) would improve readability.
  
  Thank you for your valuable suggestions. The t-SNE plots in Figures 1 and 3 primarily serve as a dimensionality reduction technique to visually present the clustering of multiple cells or samples based on gene expression, aiding readers in quickly identifying cell subpopulations. The heatmaps, on the other hand, are mainly used to showcase the differential expression of ferroptosis-related genes across different clinicopathological classifications and cell subpopulations, with varying shades of color helping readers quickly recognize gene expression differences among different cell subpopulations. The genes included in the heatmaps (Figures 1e-f and 3d) are sourced from the FerrDb website. We have uploaded the list of ferroptosis-related genes used in this study as Supplementary Table 1 and added the relevant steps in Method 2.3.
  
  (6) The study analyzes the expression of ferroptosis-related genes in different immune cells within the TME. The authors should discuss how these changes in gene expression may impact the function and behavior of immune cells.
  
  Thank you for your valuable feedback. We have supplemented the discussion with detailed effects of the main differential genes (FOLR2 and TREM2) on the tumor immune response.
  
  (7) The authors analyzed the expression of ferroptosis-related genes in immune cells using single-cell sequencing data. However, they subsequently applied the selected genes to perform a risk factor analysis in tumor cells. Is the expression and function of these genes the same in immune cells and tumor cells? This seems questionable.
  
  Thank you very much for your suggestion. We also believe that there may be differences in the expression and function of genes between immune cells and tumor cells. However, some genes may exhibit similarities in their expression and function in immune cells and tumor cells, especially within the tumor immune microenvironment, due to the complex and tight interactions between immune cells and tumor cells (as shown in Figures 1d and 2h), and their expression levels can be related to the onset, progression, and prognosis of tumors.
  
  (8) While the risk score model based on ferroptosis-related genes is promising, it lacks experimental validation, which weakens the strength of the conclusions. The authors should consider conducting in vitro or in vivo experiments. These functional studies would provide essential evidence to support the model's predictive capability.
  
  Thank you for the constructive feedback. We fully recognize the importance of conducting functional studies to substantiate the predictive capability of the model. Therefore, we plan to conduct in vitro and in vivo experiments in our future research to provide the necessary evidence and further validate the model's effectiveness.
  
  (9) The manuscript predicts sensitivity to 27 drugs based on the risk score, but it lacks mechanistic insight into why patients in the high-risk group might be more responsive to certain drugs. Including a more detailed discussion of the molecular mechanisms underlying this drug sensitivity, particularly linking ferroptosis-related genes to drug metabolism or efficacy, would provide a stronger rationale for the clinical application of these findings.
  
  Thank you very much for your valuable suggestions. In the discussion, we thoroughly analyzed the mechanism of action of the drugs (ABT-263 and erlotinib) with the greatest difference in sensitivity between high-risk and low-risk groups, as well as their correlation with ferroptosis.
  
  AuthorResponse

Tags

Summary

Review 1

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.07.04.602021v2
www.biorxiv.org

Testosterone-Induced Metabolic Changes in Seminal Vesicle Epithelial Cells Alter Plasma Components to Enhance Sperm Motility

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important work shows the biological processes and detailed mechanisms involving testosterone's influence on seminal plasma metabolites in mice. Evidence supporting the upregulation of metabolic enzymes and the role of ACLY is solid, though the precise contributions of fatty acids to sperm motility require further elucidation.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this revised report, Yamanaka and colleagues investigate a proposed mechanism by which testosterone modulates seminal plasma metabolites in mice. Based on limited evidence in previous versions of the report, the authors softened the claim that oleic acid derived from seminal vesicle epithelium strongly affects linear progressive motility in isolated cauda epididymal sperm in vitro. However, the report still contains somewhat ambiguous references to the strength of the relationship between fatty acids and sperm motility.
  
  Strengths:
  
  Often, reported epididymal sperm from mice have lower percent progressive motility compared with sperm retrieved from the uterus or by comparison with human ejaculated sperm. The findings in this report may improve in vitro conditions to overcome this problem, as well as add important physiological context to the role of reproductive tract glandular secretions in modulating sperm behaviors. The strongest observations are related to the sensitivity of seminal vesicle epithelial cells to testosterone. The revisions include the addition of methodological detail, modified language to reflect the nuance of some of the measurements, as well as re-performed experiments with more appropriate control groups. The findings are likely to be of general interest to the field by providing context for follow-on studies regarding the relationship between fatty acid beta oxidation and sperm motility pattern.
  
  Weaknesses:
  
  The connection between media fatty acids and sperm motility pattern remains inconclusive.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Using a combination of in vivo studies with testosterone-inhibited and aged mice with lower testosterone levels as well as isolated mouse and human seminal vesicle epithelial cells, the authors show that testosterone induces an increase in glucose uptake. They find that testosterone induces a difference in gene expression with a focus on metabolic enzymes. Specifically, they identify increased expression of enzymes regulating cholesterol and fatty acid synthesis, leading to increased production of 18:1 oleic acid. The revised version strengthens the role of ACLY as the main regulator of seminal vesicle epithelial cell metabolic programming. The authors propose that fatty acids are secreted by seminal vesicle epithelial cells and are taken up by sperm, positively affecting sperm function. A lipid mixture mimicking the lipids secreted by seminal vesicle epithelial cells, however, only has a small and mostly non-significant effect on sperm motility, suggesting the authors were not able to pinpoint the seminal vesicle fluid component that positively affects sperm function.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the previous reviews
  
  Public Reviews:
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this revised report, Yamanaka and colleagues investigate a proposed mechanism by which testosterone modulates seminal plasma metabolites in mice. The authors identify oleic acid as a particularly important metabolite, derived from seminal vesicle epithelium, that stimulates linear progressive motility in isolated cauda epididymal sperm in vitro. The authors provide additional experimental evidence of a testosterone-dependent mechanism of oleic acid production by the seminal vesicle epithelium.
  
  Strengths:
  
  Often, reported epididymal sperm from mice have lower percent progressive motility compared with sperm retrieved from the uterus or by comparison with human ejaculated sperm. The findings in this report may improve in vitro conditions to overcome this problem, as well as add important physiological context to the role of reproductive tract glandular secretions in modulating sperm behaviors. The strongest observations are related to the sensitivity of seminal vesicle epithelial cells to testosterone. The revisions include addition of methodological detail, modified language to reflect the nuance of some of the measurements, as well as re-performed experiments with more appropriate control groups. The findings are likely to be of general interest to the field by providing context for follow-on studies regarding the relationship between fatty acid beta oxidation and sperm motility pattern.
  
  Thank you for summarizing and your positive evaluation of our study.
  
  Weaknesses:
  
  Support for the proposed mechanism is stronger in this revised report than in the previous report, but there are many challenges in measuring sperm metabolism and its direct relationship with motility patterns. This study is no exception and largely relies on correlations between various experiments in lieu of direct testing. Additionally, the discussion is framed from a human pre-clinical perspective, and it should be noted that the reproductive physiology between mice and humans is very different.
  
  Thank you for pointing out the challenges in our paper. We appreciate your comment on the limited evidence supporting the direct relationship between sperm metabolism and motility patterns under current experimental conditions. Based on your and Reviewer 2's suggestions, we have decided to remove the experiments and discussion on the “effects of OA on sperm metabolism, motility and fertility (Fig. 7, Supplemental Figure 5A and C-F)” and the corresponding parts of the Discussion section from the paper (see also Reviewer 2's main comment). These data mainly show correlations and do not provide direct evidence of causality. Instead, we added a new experiment to the manuscript, in which a lipid mixture that mimics the fatty acid profile secreted by seminal vesicle epithelial cells in a testosterone-dependent manner was added to the sperm culture medium (New Supplemental Figure 5, Lines 259-268). In this experiment, motility parameters were measured using CASA. This experiment evaluates the direct effects of lipid exposure on sperm motility. With these revisions, we are able to focus on the metabolic changes caused by testosterone in seminal vesicle epithelial cells, which are the central focus of our research. We have added a short statement acknowledging the potential importance of OA and our intention to investigate the role of OA in sperm function more rigorously in subsequent studies (Lines 402-407).
  
  Furthermore, we have revised the text to clearly state the limitations arising from species differences and to clarify that the translational aspects to humans are speculative (Lines 383-384, 395-397, 408-410).
  
  We appreciate your guidance. We believe that these changes will strengthen our research.
  
  Reviewer #2 (Public review):
  
  Using a combination of in vivo studies with testosterone-inhibited and aged mice with lower testosterone levels as well as isolated mouse and human seminal vesicle epithelial cells the authors show that testosterone induces an increase in glucose uptake. They find that testosterone induces a difference in gene expression with a focus on metabolic enzymes. Specifically, they identify increased expression of enzymes regulating cholesterol and fatty acid synthesis, leading to increased production of 18:1 oleic acid. The revised version strengthens the role of ACLY as the main regulator of seminal vesicle epithelial cell metabolic programming. 18:1 oleic acid is secreted by seminal vesicle epithelial cells and taken up by sperm, inducing an increase in mitochondrial respiration. The difference in sperm motility and in vivo fertilization in the presence of 18:1 oleic acid and the absence of testosterone, however, is small. Additional experiments should be included to further support that oleic acid positively affects sperm function.
  
  Thank you very much for carefully reading the manuscript and for your comments. We appreciate your understanding that the role of ACLY in metabolic programming of seminal vesicle epithelial cells has been strengthened in the revised version. On the other hand, we agree with your view that the increase in sperm motility and fertilization rate by oleic acid is minimal under the current experimental conditions. We agree that further evidence is needed to support our conclusion regarding the positive effects of oleic acid on sperm function. Based on your comments and our re-evaluation of the data, we have decided to remove the experiments and discussion on “OA and sperm motility” from the current paper (Fig. 7, Supplemental Figure 5A and C-F). In the revised paper, we have significantly toned down the claims on the previous role of oleic acid and instead focused on the metabolic regulatory mechanisms of seminal vesicle epithelial cells.
  
  We hope that these revisions address your concerns and improve the overall clarity of the manuscript.
  
  Recommendations for the authors:
  
  Note from the reviewing editor: The reviewers agree that the revised manuscript is significantly improved and view the work as important. Both reviewers agree that the evidence for testosterone effects on seminal vesicle epithelial cells to support fatty acid synthesis is strong and suggest that the authors tone down their conclusion about the effect of oleic acid on sperm motility, as the effect is very small. With these minor changes, the evidence to support the conclusion of the study is viewed as solid.
  
  Thank you for recognizing the improvements that we have made to our manuscript and for appreciating the importance of our research. We also appreciate your assessment that the evidence for the effect of testosterone on seminal vesicle epithelial cells that support fatty acid synthesis is solid.
  
  On the other hand, we agree with the two reviewers that the effect of oleic acid on sperm motility is limited and that the relevant data do not measure a direct relationship. Therefore, we have decided to withdraw the data set on the effect of oleic acid on sperm (Fig. 7, Supplemental Figure 5A and C-F) and focus this paper on seminal vesicle epithelial cells (in response to Reviewer 2's suggestion). Given that testosterone-induced lipid (fatty acid) synthesis in seminal vesicle epithelial cells is a key aspect of our study, we have included additional experiments in the revised manuscript to show how lipids affect sperm (New Supplemental Figure 5, Lines 259-263).
  
  With these revisions, the manuscript emphasizes the importance of testosterone-dependent fatty acid synthesis in seminal vesicle epithelial cells and the fact that this includes oleic acid. The title has also been partially revised in line with these revisions.
  
  Reviewer #1 (Recommendations for the authors):
  
  Minor Comments:
  
  (1) The authors indicate in the methods that extracellular flux analysis was normalized to cell count. However, the y-axis units in Figs 4, 8, 9 and SFig 9 are not normalized.
  
  (2) The OA label appears to be missing from Fig 7A. Additionally, the scale bar is offset in one of the images and the length of the scale bar does not appear to be mentioned in the figure legend.
  
  Thank you for raising these points. We have corrected them.
  
  Fig. 7 has been withdrawn in response to Reviewer 2's suggestion.
  
  Reviewer #2 (Recommendations for the authors):
  
  With the experiments included in their revised version, the authors strengthen their conclusions about testosterone-induced metabolic reprogramming in seminal vesicle cells resulting in reduced proliferation. The experiments surrounding ACLY are well-designed and give insights into the underlying molecular mechanisms. For other parts, the manuscript became less clear and it is often hard to follow the authors' line of thought for their conclusions.
  
  Based on the experiments shown in the manuscript this reviewer is still not convinced that OA positively affects sperm function. The changes in linear motility are minor, blastocyst levels are lower and the authors do not show that OA alone positively affects cleavage rate during AI. Without additional experiments that show a stronger effect on sperm function, the authors should consider focusing the manuscript exclusively on seminal vesicle epithelial cells.
  
  Thank you for your constructive comments on our paper. We thank the reviewer for pointing out that the effect of oleic acid (OA) on sperm function is limited in our current experiments. As Reviewer 1 also pointed out, we agree that further experiments and improved methodology are needed to reliably demonstrate the functional effects of OA on sperm. Because the strength of the data on the direct relationship between fatty acids in seminal fluid and improved sperm function is currently insufficient, we have removed the data set for oleic acid and sperm motility (Fig. 7, Supplemental Figure 5A and C-F) and focused on “the mechanism of metabolic regulation of testosterone in seminal vesicle epithelial cells”. We have consistently narrowed the focus of the paper to the theme of “how testosterone changes energy metabolism in seminal vesicle epithelial cells”. In accordance with this change, the structure of the paper has also been partially revised (red text in the manuscript). With these revisions, the main point of the paper focuses on the mechanism by which testosterone regulates metabolic pathways in the seminal vesicle epithelial cells.
  
  For more detailed revisions, please see the responses to your comments below.
  
  (1) 45-55 still need major revision. It will not become clear to the reader what the authors mean by epididymal maturation. 'Ability to fertilize in in vitro?' Epididymal sperm are moving linearly in the absence of seminal vesicle fluid. Increased progressive motility, hyperactivation, and the ability to undergo the acrosome reaction are induced upon exposure to seminal vesicle fluid. The authors should introduce the concept of capacitation and that capacitation can be induced in vitro by exposure to bicarbonate and a cholesterol acceptor.
  
  Thank you for pointing out the ambiguity of epididymal maturation, the need to clarify the concept of capacitation, and the role of seminal plasma in this context. The revised text explains that epididymal maturation only gives sperm their potential ability to fertilize. It also explains that it is the subsequent capacitation process (inducible in vitro by incubation with bicarbonate and cholesterol acceptors) that gives full fertilization potential. On the other hand, we emphasize that in vivo, seminal plasma, which contains both capacitation-promoting and decapacitation factors, plays a key role in fine-tuning the timing of capacitation, ensuring that sperm acquire fertilization competence at the appropriate moment. We hope that these revisions clarify our intended meaning and strengthen the overall message of the paragraph. (lines 42-54)
  
  “Sperm that have completed spermatogenesis in the testis acquire their potential to fertilize while maturing in the epididymis (5–7). The physiological change of sperm during fertilization process are collectively referred to as “capacitation”. This change includes a large amplitude of flagella (called hyperactivation) and developing the capacity to undergo the acrosome reaction, and can be induced by culturing sperm collected from the epididymis in a medium containing bicarbonate and cholesterol acceptors (8, 9). However, once capacitation is complete, sperm cannot maintain that state for a long time. Therefore, even if epididymal sperm that have not been exposed to seminal plasma are artificially inseminated into the cervix or uterus, the fertilization rate remains low (10–12). That is because, in vivo, during ejaculation, exposure of epididymal sperm to seminal plasma masks the unintended capacitation as they pass through the female reproductive tract and ensures fertilization of sperm that reach the oviduct (13). In other words, seminal plasma plays an important role in fine-tuning the timing of sperm capacitation and in maintaining the sustained sperm motility needed to reach the oviduct.”
  
  (2) 81: As in their rebuttal, the authors should further elaborate on the connection between fructose, citrate, and testosterone. It is still not clear. Based on the authors' explanation in the rebuttal, why are citrate and fructose levels higher when the animals are castrated?
  
  We thank you for the opportunity to clarify our statement regarding the relationship between fructose, citrate, and testosterone. Our original explanation was intended to reflect the fact that testosterone from the testes has a stimulating effect on the accessory reproductive glands, and to report that the concentrations of fructose and citric acid were higher in the non-castrated (control) animals than in the castrated animals. In castrated animals, the absence of testosterone leads to decreased activity of these glands and, consequently, lower levels of these metabolites. To make this clear, we have revised the manuscript as follows. (lines 76-82)
  
  “Several specific factors produced by the male accessory glands that contribute to seminal plasma and impact male fertility have been elucidated. For example, surgical removal of seminal vesicles in male mice and rats was associated with infertility (17, 22, 23). The observations that fructose (24) and citric acid (25) concentrations in seminal plasma of control mice and rats are higher than in castrated animals suggest that the specific metabolism of the accessory glands might be affected by testosterone derived from the testes, which activate intracellular androgen receptors (AR; NR3C4) required for gene regulation of transcription.”
  
  (3) 111: This reviewer does not understand the authors' obsession with reporting linear motility. Sperm are moving linearly when isolated from the epididymis. Again, increase of progressive motility is a well-defined hallmark of capacitation and primarily used in the field when discussing changes in sperm motility during capacitation. This reviewer is assuming that the changes in progressive vs linear motility in Fig. 7 are not significant because the data is more scattered. The % increase seems to be approximately the same. The same is true for Fig. 8. The increase in LIN is so small and not dose-dependent that this reviewer is not comfortable making that one of the main conclusions of the manuscript.
  
  Our claim is based on the observation that seminal vesicle secretions significantly improve the linear motility (VSL and LIN) of sperm even in an environment that does not contain capacitation-inducing factors such as BSA. We interpret this as a survival strategy for sperm to pass through the female reproductive tract efficiently. Therefore, we believe that “progressive motility” in the context of conventional capacitation does not have the same meaning as the progressive motility observed in seminal plasma.
  
  However, we agree with the reviewer's point that the current data set does not sufficiently establish what the minor increase in linear motility caused by oleic acid means. Therefore, we have decided to withdraw the dataset on the effect of oleic acid on sperm motility (Fig. 7, Supplemental Figure 5A and C-F) and have revised the conclusion. (Lines 406-410)
  
  (4) 128: For the mitochondrial membrane potential measurements the authors should mention that they included antimycin as a control. The manuscript would benefit from including scatter plots with unloaded controls to support their gating strategy. In its current stage, the gating between low and high membrane potential seems arbitrary.
  
  Thank you for pointing this out. We have included an explanation of antimycin as a control in the main text (Lines 920-921). In addition, we have added some reference scatter plots and also added an explanation of the gating strategy between low and high membrane potentials (Supplemental Figure 1C and D, Lines 1101-1104). We hope this change will make the manuscript clearer.
  
  (5) 190: What do the authors mean by: 'However, there was no difference in the Oligomycin-sensitive ECAR, indicating that testosterone may increase glucose metabolism but does not enhance the expression of a group of enzymes involved in the glycolytic pathway.'
  
  Our original intention was to state that testosterone probably increases basal glycolytic flux via increased glucose uptake (as supported by the GLUT4 translocation data), but does not increase maximal glycolytic capacity, as indicated by the lack of difference in oligomycin-sensitive ECAR.
  
  However, as Reviewer 1 previously pointed out, we agree that the assay conditions themselves, such as the use of oligomycin to inhibit oxidative mitochondria, may create non-physiological conditions and not fully reflect the energy distribution in vivo. Under these conditions, there is a possibility that the flow of glycolysis will increase artificially as a compensatory reaction, and parameters such as “maximum glycolytic capacity” should have been interpreted with caution.
  
  Therefore, we have revised the manuscript to clarify that our data are a single-time point under defined experimental conditions and do not necessarily provide direct insight into changes in expression or activity of individual glycolytic enzymes.
  
  “These data indicate that testosterone enhances glucose utilization. This leads to the interpretation that testosterone increases the flow of glycolysis by increasing glucose uptake and alters metabolic flux distribution.” (Lines 186-188)
  
  (6) 205: Could the authors elaborate further on how they came to this conclusion: 'These results suggest that testosterone does not reduce transient enzyme activity in mitochondria but rather weakens the metabolic pathway of the mitochondrial TCA cycle and/or the electron transport chain due to the changes in gene expression patterns in seminal vesicle epithelial cells.' Based on their results at this point the authors have no insights about changes in enzyme activity or gene expression that might explain the phenotype.
  
  Our statement is based on the following observations. In testosterone-treated cells, the addition of glucose increased ECAR, suggesting an increase in glycolytic flux due to an increase in glucose uptake. On the other hand, mitochondrial respiratory parameters (basal respiration, oligomycin-sensitive respiration, FCCP-uncoupled respiration, and reserve respiratory capacity) were significantly decreased under testosterone treatment.
  
  From these results, it was speculated that testosterone promotes the redistribution of metabolic flux, directing it away from mitochondrial oxidative phosphorylation and towards the glycolytic pathway and, possibly, lipid synthesis. However, as the reviewers correctly point out, at this point, we have not directly measured changes in the activity or expression of individual enzymes in the TCA cycle or ETC. Therefore, in the next experiment, we extracted mRNA from the cells and performed gene expression analysis using real-time PCR. To make this clear, we have revised the manuscript as follows.
  
  “Overall, these data indicate that testosterone promotes the redistribution of metabolic flux. In other words, testosterone increased glycolysis in seminal vesicle epithelial cells while decreasing mitochondrial respiration. To determine whether these changes were accompanied by changes in gene expression of specific metabolic-related enzymes, we analyzed gene expression levels.” (Lines 201-205)
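  For readers less familiar with extracellular flux assays, the respiratory parameters named in this response (basal respiration, oligomycin-sensitive respiration, FCCP-uncoupled respiration, and reserve respiratory capacity) are usually derived from the oxygen consumption rate (OCR) in each phase of a mitochondrial stress test. The following is a minimal sketch under the conventional definitions, not the authors' analysis code: the class and function names are hypothetical, the use of an antimycin-type ETC inhibitor to estimate non-mitochondrial OCR is an assumption, and the division by cell count simply illustrates the normalization that Reviewer #1 asks about.

```python
from dataclasses import dataclass


@dataclass
class OcrTrace:
    """Mean OCR (e.g. pmol O2/min) per assay phase. Hypothetical container."""
    baseline: float          # before any injection
    post_oligomycin: float   # after ATP synthase inhibition
    post_fccp: float         # after uncoupling with FCCP
    post_etc_inhibitor: float  # after ETC inhibition (assumed non-mitochondrial OCR)


def respiration_parameters(trace: OcrTrace, cell_count: float) -> dict:
    """Derive conventional stress-test parameters, normalized to cell count."""
    if cell_count <= 0:
        raise ValueError("cell_count must be positive")

    non_mito = trace.post_etc_inhibitor
    basal = trace.baseline - non_mito                     # basal respiration
    oligo_sensitive = trace.baseline - trace.post_oligomycin  # ATP-linked respiration
    maximal = trace.post_fccp - non_mito                  # FCCP-uncoupled respiration
    reserve = maximal - basal                             # reserve (spare) capacity

    raw = {
        "basal": basal,
        "oligomycin_sensitive": oligo_sensitive,
        "fccp_uncoupled": maximal,
        "reserve_capacity": reserve,
    }
    # Normalize every parameter to the number of cells in the well.
    return {name: value / cell_count for name, value in raw.items()}


if __name__ == "__main__":
    # Purely illustrative numbers, not data from the manuscript.
    example = OcrTrace(baseline=100.0, post_oligomycin=40.0,
                       post_fccp=180.0, post_etc_inhibitor=20.0)
    print(respiration_parameters(example, cell_count=10.0))
```

  Under these definitions, a drop in all four parameters, as the response describes for testosterone-treated cells, indicates lower mitochondrial respiration overall, but it does not by itself show whether enzyme activity or gene expression changed.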
  
  (7) 219: Characterizing ACLY as an enzyme of the ETC is misleading. ACLY is a cytosolic enzyme that connects the TCA cycle with fatty acid synthesis.
  
  We would like to thank you for pointing out that the description of the function of ACLY could be misunderstood. We agree that characterizing ACLY as an enzyme of the ETC could be misleading. Therefore, we have revised the sentence to clearly indicate that ACLY is a cytosolic enzyme that links the TCA cycle with fatty acid synthesis. The revised text is as follows:
  
  "Interestingly, testosterone significantly increased the expression of Acly, which encodes a cytoplasmic enzyme that converts citrate transported from the TCA cycle into acetyl-CoA, a substrate that is essential for fatty acid synthesis." (lines 216-218)
  
  (8) 228: Which results support that ETC proteins were upregulated by flutamide?
  
  We thank the reviewer for raising this point. In preliminary experiments, we analyzed ETC protein expression using real-time qPCR. Our data show that treatment with flutamide significantly upregulates the expression of genes involved in mitochondrial ETC, such as mtND6, while decreasing the expression of the lipogenic genes Acly and Acc. These additional data are now presented in Supplementary Figure S3B (lines 223-226).
  
  (9) 245: Aren't the authors showing in Fig. 5 that glut4 expression is reduced in seminal vesicle epithelial cells upon testosterone treatment? How does that fit into the authors' hypothesis?
  
  Thank you for pointing this out. We have already responded to a similar comment from Reviewer 3 in a previous revision. Please refer to our response to Reviewer 3 in a previous version.
  
  (10) 285: Based on the authors' results, OA increases the oocyte cleavage rate but then reduces the rate of blastocyst to cleaved oocyte. Doesn't that mean OA negatively affects early development?
  
  We thank the reviewer for the insightful comment. The one-hour pre-treatment is designed to reflect the transient exposure of sperm to the seminal plasma during ejaculation. In this context, it is unlikely that such a short exposure would impair the overall developmental potential of the embryo. However, although pre-conditioning with oleic acid does not ultimately affect the development of the offspring, it may lead to a decrease in the blastocyst rate at a certain point (approximately 96-120 hours after fertilization). We agree that additional research is needed to demonstrate this.
  
  Therefore, because the experiments related to the effects of oleic acid on sperm and fertilization are currently incomplete, we have decided to withdraw them for future research.
  
  (11) 305: What happens to pyruvate and lactate levels when ACLY expression is reduced?
  
  We appreciate the reviewer’s question regarding the fate of pyruvate and lactate when ACLY expression is reduced. In the absence of testosterone (Ctrl), the expression level of ACLY decreases. At this time, the concentration of pyruvate in the culture medium increased compared with that under testosterone treatment (Testo; Fig. 4D,E). This is probably a reflection of the fact that when the expression of ACLY is suppressed, the rate at which the products of the glycolytic pathway are converted to the fat-producing pathway (i.e., the conversion of citrate to acetyl-CoA) decreases.
  
  On the other hand, lactate levels did not change significantly. This suggests that the flow of lactate production via lactate dehydrogenase is relatively constant, independent of metabolic reprogramming by ACLY.
  
  Therefore, our data suggest that a decrease in ACLY expression leads to a decrease in pyruvate demand, while lactate production is maintained. We interpret these findings as supporting the idea that ACLY is important for directing the carbon produced by the glycolytic pathway to lipid synthesis (by transporting citrate from the mitochondria).
  
  We hope that this explanation clarifies the interpretation of the data.
  
  Minor revision:
  
  189: ECAR: extracellular acidification rate. Please correct.
  
  We have corrected this. (Lines 184-185)
  
  199: Pyruvate is not synthesized, it is metabolized from PEP. Please correct.
  
  The following corrections have been made. “pyruvate is metabolized from phosphoenolpyruvic acid through glycolysis”. (Lines 194-195)
  
  In addition, minor revisions were made to improve the clarity of the overall text.
  
  AuthorResponse
Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.01.16.575926v6
www.biorxiv.org

AACDB: Antigen-Antibody Complex Database — a Comprehensive Database Unlocking Insights into Interaction Interface

4
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This useful manuscript provides a newly curated database (termed AACDB) of antibody-antigen structural information, alongside annotations that are either taken from the PDB or added de-novo. Sequences, structures, and annotations can be easily downloaded from the AACDB website, speeding up the development of structure-based algorithms and analysis pipelines to characterize antibody-antigen interactions. The methodology presented for this data curation is solid. The curated dataset will be of broad interest and value to researchers interested in antibody-antigen interactions.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This work introduces and describes a useful curation pipeline of antibody-antigen structures downloaded from the PDB database. The antibody-antigen structures are presented in a new database called AACDB - with associated website - alongside annotations that were either corrected from those present in the PDB database, or added de-novo with solid methodology. Sequences, structures and annotations can be very easily downloaded from the AACDB website, speeding up the development of structure-based algorithms and analysis pipelines to characterize antibody-antigen interactions. However, AACDB is missing some important annotations that I believe would greatly enhance its usefulness, such as binding affinity annotations.
 
 I think the potentially most significant contribution of this database is the manual data curation to fix errors present in the PDB entries, by cross-referencing with the literature. The authors also seem to describe, whenever possible, the procedures they took to correct the annotations.
 
 I have personally verified some of the examples presented by the authors, and found that SAbDab appears to fix the mistakes related to mis-identification of antibody chains, but not other annotations.
 
 "(1) the species of the antibody in 7WRL was incorrectly labeled as "SARS coronavirus B012" in both PDB and SabDab" → I have verified the mistake and fix, and that SAbDab does not fix it, just uses the PDB annotation.
 
 "(2) 1NSN, the resolution should be 2.9 Å, but it was incorrectly labeled as 2.8" → I have verified the mistake and fix, and that SAbDab does not fix it, just uses the PDB annotation.
 
 "(3) mislabeling of antibody chains as other proteins (e.g. in 3KS0, the light chain of B2B4 antibody was misnamed as heme domain of flavocytochrome b2)" → SAbDab fixes this as well in this case.
 
 "(4) misidentification of heavy chains as light chains (e.g. both two chains of antibody were labeled as light chain in 5EBW)" → SAbDab fixes this as well in this case.
 
 I believe the splitting of the pdb files is a valuable contribution as it standardizes the distribution of antibody-antigen complexes. Indeed, there is great heterogeneity in how many copies of the same structure are present in the structure uploaded to the PDB, generating potential artifacts for machine learning applications to pick up on. That being said, I have two thoughts both for the authors and the broader community. First, in the case of multiple antibodies binding to different epitopes on the same antigen, one should not ignore the potentially stabilizing effect that the binding of one antibody has on the complex, thereby enabling the binding of the second antibody. In general, I urge the community to think about what is the most appropriate spatial context to consider when modeling the stability of interactions from crystal structure data. Second, and in a similar vein, some antigens occur naturally as homomultimers - e.g. influenza hemagglutinin is a homotrimer. Therefore, to analyze the stability of a full-antigen-antibody structure, I believe it would be necessary to consider the full homo-trimer, whereas in the current curation of AACDB with the proposed data splitting, only the monomers are present.
 
 I think the annotation of interface residues is a very useful addition to structural datasets.
 
 I am, however, not convinced of the utility of *change* in SASA as a useful metric for identifying interacting residues, beyond what is already identified via pairwise distances between the antibody and antigen residues. If we had access to the unbound conformation of most antibodies and antigens, then we could analyze the differences in structural conformations upon binding, which can be in part quantified by change in SASA. However, as only bound structures are usually available, one is usually forced to approximate a protein's unbound structure by computationally removing its binding partner, as it seems to me the authors of this work are doing.
 
 Some obvious limitations of AACDB in its current form include:
 
 AACDB only contains entries with protein-based antigens of at least 50 amino acids in length. This excludes non-protein-based antigens, such as carbohydrate- and nucleotide-based, as well as short peptide antigens.
 
 AACDB does not include annotations of binding affinity, which are present in SAbDab and have been proven useful both for characterizing drivers of antibody-antigen interactions (cite https://www.sciencedirect.com/science/article/pii/S0969212624004362?via%3Dihub) and for benchmarking antigen-specific antibody-design algorithms (cite https://www.biorxiv.org/content/10.1101/2023.12.10.570461v1).
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 Antibodies, thanks to their high binding affinity and specificity to cognate protein targets, are increasingly used as research and therapeutic tools. In this work, Zhou et al. have created, curated and made publicly available a new database of antibody-antigen complexes to support research in the field of antibody modelling, development and engineering.
 
 Strengths:
 
 The authors have performed a manual curation of antibody-antigen complexes from the Protein Data Bank, rectifying annotation errors; they have added two methods to estimate paratope-epitope interfaces; they have produced a web interface capable of effective visualisation and of summarising the key useful information in one page. The database is also cross-linked to other databases that contain information relevant to antibody developability and therapeutic applications.
 
 Weaknesses:
 
 The database does not import all the experimental information from PDB and contains only complexes with large protein targets.
 
 Comments on revisions: I thank the authors for having incorporated my feedback and I look forward to the next releases of this database.
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Public reviews:
 
 Reviewer #1:
 
 (1) This manuscript introduces a useful curation pipeline of antibody-antigen structures downloaded from the PDB database. The antibody-antigen structures are presented in a new database called AACDB, alongside annotations that were either corrected from those present in the PDB database or added de-novo with a solid methodology. Sequences, structures, and annotations can be very easily downloaded from the AACDB website, speeding up the development of structure-based algorithms and analysis pipelines to characterize antibody-antigen interactions. However, AACDB is missing some key annotations that would greatly enhance its usefulness.
 
 Here are detailed comments regarding the three strengths above:
 
 I think potentially the most significant contribution of this database is the manual data curation to fix errors present in the PDB entries, by cross-referencing with the literature. However, as a reviewer, validating the extent and the impact of these corrections is hard, since the authors only provided a few anecdotal examples in their manuscript.
 
 I have personally verified some of the examples presented by the authors and found that SAbDab appears to fix the mistakes related to the misidentification of antibody chains, but not other annotations.
 
 (a) "the species of the antibody in 7WRL was incorrectly labeled as "SARS coronavirus B012" in both PDB and SabDab" → I have verified the mistake and fix, and that SAbDab does not fix it, just uses the PDB annotation.
 
 (b) "1NSN, the resolution should be 2.9 Å, but it was incorrectly labeled as 2.8" → I have verified the mistake and fix, and that SAbDab does not fix it, just uses the PDB annotation.
 
 (c) "mislabeling of antibody chains as other proteins (e.g. in 3KS0, the light chain of B2B4 antibody was misnamed as heme domain of flavocytochrome b2)" → SAbDab fixes this as well in this case.
 
 (d) "misidentification of heavy chains as light chains (e.g. both two chains of antibody were labeled as light chain in 5EBW)" → SAbDab fixes this as well in this case.
 
 I personally believe the authors should make public the corrections made, and describe the procedures - if systematic - to identify and correct the mistakes. For example, what was the exact procedure (e.g. where were sequences found, how were the sequences aligned, etc.) to find mutations? Was the procedure run on every entry?
 
 We appreciate the reviewer’s valuable feedback. Our correction procedures combined manual curation with systematic sequence analysis. While most metadata discrepancies were resolved through cross-referencing original literature, we implemented a structured approach for identifying mutations in specific cases. For PDB entries labeled as variants (e.g., "Bevacizumab mutant" or "Ipilimumab variant Ipi.106") where the "Mutation(s)" field was annotated as "NO," we retrieved the canonical therapeutic antibody sequence from Thera-SAbDab, then performed pairwise sequence alignment against the PDB entry using the BLAST program to identify mutated residues.
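 As a rough illustration of the comparison step described in this response, the sketch below lists the differences between a reference sequence and a variant. It is not the authors' pipeline: it assumes the two sequences have already been aligned (for example by BLAST) and padded with `-` to equal length, and the function name `find_mutations` and the output notation are hypothetical.

```python
def find_mutations(reference: str, variant: str) -> list[str]:
    """List differences between two pre-aligned, equal-length sequences.

    Assumption: gaps are written as '-' and positions are numbered
    along the ungapped reference sequence, starting at 1.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be aligned to the same length")

    mutations = []
    ref_pos = 0  # 1-based position in the ungapped reference
    for ref_aa, var_aa in zip(reference, variant):
        if ref_aa != "-":
            ref_pos += 1
        if ref_aa == var_aa:
            continue
        if ref_aa == "-":
            # Residue present only in the variant (insertion after ref_pos)
            mutations.append(f"ins{ref_pos}{var_aa}")
        elif var_aa == "-":
            # Residue present only in the reference (deletion)
            mutations.append(f"{ref_aa}{ref_pos}del")
        else:
            # Substitution, e.g. V5A
            mutations.append(f"{ref_aa}{ref_pos}{var_aa}")
    return mutations
```

 An empty result would correspond to a "Mutation(s): NO" annotation being consistent with the reference; any other result would flag the entry for manual review.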
 
 This procedure was not applied to all entries, as mutations are context-dependent. Therapeutic antibodies have well-defined reference sequences, enabling systematic alignment. For antibodies lacking unambiguous wild-type references (e.g., research-grade or non-therapeutic antibodies), mutation annotations were directly inherited from the PDB or literature.
 
 All corrections have been publicly archived in AACDB. We have added a detailed discussion of this issue in the section “2.3 Metadata” of the revised manuscript.
 
 (2) I believe the splitting of the pdb files is a valuable contribution as it standardizes the distribution of antibody-antigen complexes. Indeed, there is great heterogeneity in how many copies of the same structure are present in the structure uploaded to the PDB, generating potential artifacts for machine learning applications to pick up on. That being said, I have two thoughts both for the authors and the broader community. First, in the case of multiple antibodies binding to different epitopes on the same antigen, one should not ignore the potentially stabilizing effect that the binding of one antibody has on the complex, thereby enabling the binding of the second antibody. In general, I urge the community to think about what is the most appropriate spatial context to consider when modeling the stability of interactions from crystal structure data. Second, and in a similar vein, some antigens occur naturally as homomultimers - e.g. influenza hemagglutinin is a homotrimer. Therefore, to analyze the stability of a full-antigen-antibody structure, I believe it would be necessary to consider the full homo-trimer, whereas, in the current curation of AACDB with the proposed data splitting, only the monomers are present.
 
 We sincerely appreciate the reviewer’s insightful comments regarding the splitting of PDB files, and the opportunity to address these thoughtful concerns.
 
 Firstly, regarding two antibodies that bind distinct epitopes on the same antigen, we would like to clarify that this scenario can be divided into two cases based on the experimental context.
 
 Case 1: The two antibodies bind distinct epitopes on the same antigen, and their complexes are determined in separate structures. For example, SAR650984 (PDB: 4CMH) and daratumumab (PDB: 7DHA) target CD38 at non-overlapping epitopes. These two antibody-antigen complexes were determined independently, and their structures do not influence each other.
 
 Case 2: The crystal structure contains a ternary complex with two antibodies and an antigen, as in the example of 6OGE discussed in Section 2.2 of our manuscript. The original literature for 6OGE reports experiments confirming that the order of Fab binding does not affect the formation of the ternary complex, and that the binding of one antibody does not enhance the binding of the other. This supports the rationale for splitting 6OGE into two separate structures. However, we acknowledge that not all ternary complexes in the PDB provide such detailed experimental descriptions in their original literature. We agree with the reviewer that in some cases, one antibody may stabilize the structure to facilitate the binding of a second antibody. For instance, in 3QUM, the 5D5A5 antibody stabilizes the structure, enabling the binding of the 5D3D11 antibody to human prostate-specific antigen. Such sandwich complexes are indeed valuable for identifying true epitopes and paratopes. Importantly, splitting the structure does not alter the interaction sites.
 
 Secondly, we fully agree with the reviewer that for antigens that naturally exist as homomultimers (e.g., influenza hemagglutinin as a homotrimer), the full multimeric structure should be considered when analyzing stability. In such cases, users can directly utilize the original PDB structures provided in their multimeric form. Our splitting approach is intended to provide an additional option for cases where monomeric analysis is sufficient or preferred, but it does not preclude the use of the original multimeric structures when necessary.
 
 (3) I think the manuscript is lacking in justification about the numbers used as cutoffs (1 Å² for change in SASA and 5 Å for maximum distance for contact). The authors just cite other papers applying these two types of cutoffs, but the underlying physico-chemical reasons are not explicit even in these papers. I think that, if the authors want AACDB to be used globally for benchmarks, they should provide direct sources of explanations of the cutoffs used, or provide multiple cutoffs. Indeed, different cutoffs are often used (e.g. ATOM3D uses 6 Å instead of 5 Å to determine contact between a protein and a small molecule https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/c45147dee729311ef5b5c3003946c48f-Abstract-round1.html). I think the authors should provide a figure with statistics pertaining to the interface atoms. I think showing any distribution differences between interface atoms determined according to either strategy (number of atoms, correlation between change in SASA and distance...) would be fundamental to understanding the two strategies. I think other statistics would constitute an enhancement as well (e.g. proportion of heavy vs. light chain residues).
 
 Some obvious limitations of AACDB in its current form include:
 
 AACDB only contains entries with protein-based antigens of at least 50 amino acids in length. This excludes non-protein-based antigens, such as carbohydrate- and nucleotide-based, as well as short peptide antigens.
 
 AACDB does not include annotations of binding affinity, which are present in SAbDab and have been proven useful both for characterizing drivers of antibody-antigen interactions (cite https://www.sciencedirect.com/science/article/pii/S0969212624004362?via%3Dihub) and for benchmarking antigen-specific antibody-design algorithms (cite https://www.biorxiv.org/content/10.1101/2023.12.10.570461v1).
 
 We thank the reviewer for raising this critical point about the cutoff values used in AACDB. The thresholds in the current study were not chosen arbitrarily: they are summarized from the existing literature, and we have added further literature support to the manuscript. In established tools, the criteria for defining interacting amino acids typically use a ΔSASA cutoff no larger than 1 Å² and a distance cutoff no larger than 6 Å. While our manuscript emphasizes widely accepted thresholds for consistency with prior benchmarks, AACDB explicitly provides raw ΔSASA and distance values for all annotated residues. Users can therefore filter the downloaded files by excluding entries that exceed their preferred thresholds (e.g., selecting 5 Å instead of 6 Å), which ensures adaptability to diverse research needs. In the revised version, we reset the distance threshold to 6 Å and recalculated the interacting amino acids in order to give users a wider range of choices. In the section “3.2 Database browse and search” of the revised manuscript, we describe this flexible choice of thresholds for practical use.
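 The user-side filtering mentioned in this response could look roughly like the sketch below. The record layout is an assumption made for illustration: the keys `residue`, `distance`, and `delta_sasa` are hypothetical and do not reflect the actual column names in AACDB's downloadable files.

```python
def filter_by_distance(rows, max_distance=5.0):
    """Keep residues whose closest inter-chain atom distance is below the cutoff (Å)."""
    return [row for row in rows if row["distance"] < max_distance]


def filter_by_delta_sasa(rows, min_delta_sasa=1.0):
    """Keep residues whose buried surface area exceeds the cutoff (Å^2)."""
    return [row for row in rows if row["delta_sasa"] > min_delta_sasa]


# Hypothetical records standing in for rows parsed from a downloaded file.
example_rows = [
    {"residue": "H:TYR33", "distance": 3.2, "delta_sasa": 41.7},
    {"residue": "H:SER52", "distance": 5.4, "delta_sasa": 0.6},
    {"residue": "L:ASN92", "distance": 5.9, "delta_sasa": 2.3},
]

strict = filter_by_distance(example_rows, max_distance=5.0)
loose = filter_by_distance(example_rows, max_distance=6.0)
buried = filter_by_delta_sasa(example_rows, min_delta_sasa=1.0)
```

 Because the release is computed with the looser 6 Å cutoff, a stricter cutoff can always be applied afterwards, whereas the reverse would require recomputing contacts from the coordinates.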
 
 Furthermore, distance and ΔSASA are two distinct metrics for evaluating interactions. Distance directly quantifies spatial proximity between atoms, reflecting physical contacts such as van der Waals interactions or hydrogen bonds, and is ideal for identifying direct spatial adjacency. ΔSASA, on the other hand, measures changes in solvent accessibility of residues during binding, capturing the contribution of buried surfaces to binding free energy. Even for residues not in direct contact, reduced SASA due to conformational changes may indicate indirect functional roles.
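 To make the distance criterion concrete, here is a minimal sketch of an any-atom contact search between two sets of atoms. It is an illustration under stated assumptions, not AACDB's code: the input format (residue label plus coordinates) and the function name `contact_residues` are hypothetical, and the ΔSASA criterion is not shown because it needs a surface-area calculator.

```python
import math


def contact_residues(antibody_atoms, antigen_atoms, cutoff=6.0):
    """Return (paratope, epitope) residue sets under an any-atom distance cutoff.

    Each atom is given as (residue_label, (x, y, z)) with coordinates in Å.
    A residue pair is in contact if any of their atoms lie closer than `cutoff`.
    """
    paratope, epitope = set(), set()
    for ab_residue, ab_xyz in antibody_atoms:
        for ag_residue, ag_xyz in antigen_atoms:
            if math.dist(ab_xyz, ag_xyz) < cutoff:
                paratope.add(ab_residue)
                epitope.add(ag_residue)
    return paratope, epitope


# Toy coordinates: only H:TYR33 and A:LYS10 are within 6 Å (exactly 5 Å apart).
antibody = [("H:TYR33", (0.0, 0.0, 0.0)), ("H:SER52", (20.0, 0.0, 0.0))]
antigen = [("A:LYS10", (3.0, 4.0, 0.0)), ("A:GLU50", (0.0, 0.0, 30.0))]
paratope, epitope = contact_residues(antibody, antigen, cutoff=6.0)
```

 The double loop is quadratic in the number of atoms, which is acceptable for a single complex; a spatial index would be the usual choice for processing a whole database.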
 
 As demonstrated through comparisons on the detailed information pages, the sets of interacting amino acids defined by these two methods differ by only a few residues, with no significant variation in their overall distributions. However, since interaction patterns vary significantly across different complexes, analyzing residue distributions across all structures using both criteria is not feasible.
 
 We thank the reviewer for highlighting these limitations. AACDB currently focuses on protein-based antigens ≥50 amino acids to prioritize structural consistency, which excludes non-protein antigens and shorter peptides. While affinity annotations are critical for benchmarking antibody design tools, these data were not integrated in this release due to insufficient data verification caused by internal team constraints. We acknowledge these gaps and plan to expand antigen diversity and incorporate affinity metrics in future updates.
 
 Reviewer #2:
 
 Summary:
 
 Antibodies, thanks to their high binding affinity and specificity to cognate protein targets, are increasingly used as research and therapeutic tools. In this work, Zhou et al. have created, curated, and made publicly available a new database of antibody-antigen complexes to support research in the field of antibody modelling, development, and engineering.
 
 Strengths:
 
 The authors have performed a manual curation of antibody-antigen complexes from the Protein Data Bank, rectifying annotation errors; they have added two methods to estimate paratope-epitope interfaces; they have produced a web interface that is capable of both effective visualisation and of summarising the key useful information in one page. The database is also cross-linked to other databases that contain information relevant to antibody developability and therapeutic applications.
 
 Weaknesses:
 
 The database does not import all the experimental information from PDB and contains only complexes with large protein targets.
 
 Thank you for the valuable feedback. As noted in our response to Reviewer 1, due to limitations within our team, comprehensive data integration from PDB has not been achieved in the current version. We acknowledge the significance of expanding the database to encompass a broader range of experimental information and complexes with diverse target sizes. Regrettably, immediate updates to address these limitations are not feasible at this time. Nevertheless, we are committed to enhancing the database in upcoming upgrades to provide users with a more comprehensive and inclusive resource.
 
 Recommendations for the authors:
 
 Reviewer #1:
 
 (1) Line 194: "produce" → "produced"
 
 We thank the reviewer for the feedback. We have checked the grammar and spelling carefully in the revised manuscript.
 
 (2) As mentioned in the public review, I think adding binding affinity annotations would greatly enhance the use cases for the database.
 
 We thank the reviewer for the suggestion. As stated in our response in the public review, due to team constraints these data are not integrated into this release but are being collated. We recognize these gaps and plan to expand antigenic diversity and incorporate affinity metrics in future updates.
 
 (3) I think adding a visualization of interface atoms and contacts on an entry's webpage would be useful for someone exploring specific entries. It also would be useful if the authors provided a pymol command to select interface residues since that's a procedure any structural biologist is likely to do.
 
 We sincerely appreciate the reviewer’s constructive suggestions. In response to the request for enhanced visualization and accessibility of interface residue information, we have implemented the following improvements: (1) Web Interface Visualization. On the entry-specific webpage, we have added an interactive visualization window that highlights the antigen-antibody interaction interface using distinct colors. The interaction interface visualization has been incorporated into Figure 5 of the revised manuscript, with a detailed description. (2) PyMOL Command Accessibility. The “Help” page now provides step-by-step PyMOL commands to select and visualize interface residues.
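 The commands on the AACDB “Help” page are not reproduced here. As a generic illustration only, the sketch below builds a PyMOL `select` command from a list of interface residues using standard PyMOL selection syntax (`chain`, `resi`, and `+`-separated residue numbers); the helper name `pymol_select_command` and its input format are hypothetical.

```python
def pymol_select_command(name, residues_by_chain):
    """Build a PyMOL 'select' command string for the given residues.

    residues_by_chain maps a chain identifier to an iterable of residue numbers,
    e.g. {"H": [33, 52], "L": [92]}.
    """
    parts = []
    for chain, residue_numbers in sorted(residues_by_chain.items()):
        resi = "+".join(str(number) for number in sorted(residue_numbers))
        parts.append(f"(chain {chain} and resi {resi})")
    return f"select {name}, " + " or ".join(parts)


command = pymol_select_command("paratope", {"H": [33], "L": [92, 91]})
```

 The resulting string can be pasted into the PyMOL command line after loading the structure; a purely geometric alternative in PyMOL is a selection of the form `byres (chain A within 6 of chain H)`.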
 
 (4) I think the authors should provide headers to the files containing interface residues according to the change-in-SASA criterion, as they do for those computed according to contact. This would avoid unnecessary confusion - however slight - and make parsing easier. I was initially confused by the meaning of the last column, though after a minute I understood it to be the change in SASA.
 
 We thank the reviewer for the detailed feedback and the suggestion. We have provided headers for the files of the interacting residues defined by ΔSASA.
 
 (5) Line 233: "AACDB's data processing pipeline supports mmCIF files" → The meaning and implications of this statement are not obvious to me, and are mentioned nowhere else in the paper. Do you mean that in AACDB there are structure entries that the RCSB PDB database only has in mmCIF file format, and not .pdb format? So, effectively, there are some entries in AACDB that are not in any other antibody-specific database? I checked and, as of Dec 3rd, 2024, there are 41 structures in AACDB that are NOT in SAbDab. Manually checking 5 of those 41 structures, none are mmCIF-only structures.
 
 We thank the reviewer for the valuable comment. Some entries contain structures so large that they cannot be represented in a single PDB-format file, because they exceed the number of atoms and polymer chains that the format allows. As a result, the PDB stores these structures only as mmCIF files. In AACDB, 47 entries, such as 7SOF, 7NKT, 7B27, and 6T9D, are only available in mmCIF format from the PDB. The “.pdb” and “.cif” files contain atomic coordinates in distinct text formats, and the segmentation of these structure files is automatically conducted based on manually annotated antibody-antigen chains. To accommodate this, we have incorporated these considerations into our file processing pipeline, thereby enabling a fully automated file segmentation process. Additionally, we employed Naccess to calculate interatomic distances. However, since this software only accepts .pdb format files as input, we also converted all split .cif files into .pdb format within our fully automated pipeline. We apologize for the lack of clarity in the original manuscript and have included a more detailed explanation in the "2.2 PDB Splitting" section of the revised manuscript.
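 For the fixed-column “.pdb” format, chain-based splitting can be sketched as below. This is a simplified illustration, not the authors' pipeline: the function name `extract_chains` is hypothetical, it relies only on the PDB format placing the chain identifier in column 22 of ATOM/HETATM records, and it does not handle mmCIF files, which need a dedicated parser.

```python
def extract_chains(pdb_lines, chain_ids):
    """Return the ATOM/HETATM records that belong to the requested chains.

    In PDB-format coordinate records the chain identifier is a single
    character in column 22 (index 21 when counting from zero).
    """
    wanted = set(chain_ids)
    return [
        line
        for line in pdb_lines
        if line.startswith(("ATOM", "HETATM"))
        and len(line) > 21
        and line[21] in wanted
    ]


# Minimal stand-in records: only the record name and chain column are filled in.
records = [
    "ATOM".ljust(21) + "A",    # antigen chain
    "ATOM".ljust(21) + "H",    # antibody heavy chain
    "HETATM".ljust(21) + "L",  # hetero atom on the light chain
    "REMARK".ljust(21) + "H",  # not a coordinate record
]
antibody_records = extract_chains(records, "HL")
```

 The single-character chain column is one reason very large assemblies cannot be written as one “.pdb” file, which is consistent with the conversion step described in this response being applied only after splitting.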
 
 Reviewer #2:
 
 (1) In SabDab and PDB, experimental binding affinities are also reported: could the authors comment on whether they also imported this information and double-checked it against the original paper? If it wasn't imported, that might discourage some users and should be considered as an extension for the future.
 
 We thank the reviewer for the comment and the suggestion. As stated in our response in the public review, due to current resource constraints, quantitative affinity data have not been incorporated into this release but are undergoing systematic curation. We explicitly recognize these limitations and propose a two-pronged strategy for future iterations: (1) broadening antigen diversity coverage through expanded structural sampling, and (2) integrating quantitative binding affinity measurements. In the Discussion section, we have included a description outlining the planned enhancements.
 
 (2) Line 49-50: the references mentioned in connection to deep learning methods for antibody-antigen predictions seem a bit limited given the amount of articles in this field, with 3 of 4 references on one method only (SEPPA), could the authors expand this list to reflect a bit more the state of the art?
 
 We thank the reviewer for the suggestion. We agree that more relevant studies should be listed and therefore more references are provided in the revised manuscript.
 
 When mentioning the limitations of the existing databases, it feels a bit that the criticism is not fully justified. For instance:
 
 Line 52-53: could the authors elaborate on the reasons why such an identification is challenging? (Isn't it possible to make an efficient database-filtered search? Or rather, should one highlight that a more focussed resource is convenient and why?)
 
 Thank you for the feedback. In this study, the keywords "antibody complex," "antigen complex," and "immunoglobulin complex" were employed during data collection. The PDB returned over 30,000 results, of which only one-tenth met our criteria after rigorous filtering. This demonstrates that keyword searches, while useful, inherently limit result precision and introduce substantial redundancy, likely due to the PDB's search mechanism. This is why we emphasized the significant challenges in identifying antibody-antigen complexes among the general protein structures in the PDB.
 
 Line 55: reading the website http://www.abybank.org/abdb/, it would be fairer to say that the web interface lacks updates, as the database and the code have gone through some updates. Could the authors provide a concrete example of the reason why: 'The AbDb database currently lacks proper organization and management of this valuable data.'?
 
 We thank the reviewer for highlighting this issue. In our original manuscript, the statement that the AbDb database "lacks proper organization and management" was based on the absence of an explicit statement regarding data updates on its official website at the time of submission, even though internal updates to its content may have occurred. We fully respect the long-standing contributions of AbDb to antibody structural research, and our comments were solely directed at the specific state of the database at that time. As the reviewer noted, following the release of our preprint, we have also taken note of AbDb's recent updates. To reflect the latest developments and avoid potential misinterpretation, we have revised the original statement in the revised manuscript.
 
 Also 'this rapid updating process may inadvertently overlook a significant amount of information that requires thorough verification,': it's difficult for me to understand what this means in practice. Could the authors clarify if they simply mean that SabDab collects information from PDB and therefore tends to propagate annotation errors from there? If yes, I think it's enough to state it in these terms, and for sure I agree that the reason is that correcting these annotation errors requires a substantial amount of work.
 
 We thank the reviewer for providing such detailed feedback on the manuscript. We acknowledge that SabDab represents a highly valuable contribution to the field, and its rapid update mechanism has significantly advanced related research areas. However, as stated by the reviewer, we aim to clarify that SabDab primarily relies on automated metadata extraction from the PDB for annotation, and its rapid update process inherits raw data from upstream sources. According to their paper, manual curation is only applied when the automated pipeline fails to resolve structural ambiguities. This workflow, which depends on PDB annotations with limited manual verification, may propagate errors present in the PDB. Examples include species misannotation and mutation status misinterpretation. We fully agree with the reviewer's observation that correcting errors in such cases necessitates labor-intensive manual curation, which is a core motivation for our study.
 
 Line 86: why 'Structures that consisted solely of one type of antibody were excluded'? Why exclude complexes with antigens shorter than 50 amino acids? These complexes are genuine antibody-antigen complexes.
 
 We thank the reviewer for the valuable question. The AACDB database is dedicated to curating structural data of antigen-antibody complexes. Structures featuring only a single antibody type are classified as free antibodies and systematically excluded from the database due to the absence of protein-bound partners. During data screening, we retained sequences shorter than 50 amino acids by categorizing them as peptides rather than eliminating them outright. The current release exclusively encompasses complexes with protein-based antigens. Meanwhile, complexes involving peptide, hapten, and nucleic acid antigens are undergoing systematic curation, with planned inclusion in future updates to broaden antigen category representation.
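 The screening rule described in this response can be summarized in a few lines. The sketch below is an interpretation for illustration only: the function name `classify_entry`, the category labels, and the assumption that an entry is classified by its longest antigen chain are all hypothetical.

```python
def classify_entry(antigen_sequences, min_protein_length=50):
    """Classify a structure by its antigen chains.

    - no antigen chain at all: treated as a free antibody (excluded)
    - longest antigen chain shorter than the cutoff: peptide antigen (held back)
    - otherwise: protein antigen (included in the current release)
    """
    if not antigen_sequences:
        return "free_antibody"
    longest = max(len(sequence) for sequence in antigen_sequences)
    return "protein" if longest >= min_protein_length else "peptide"


labels = [
    classify_entry([]),
    classify_entry(["ACDEFGHIK"]),
    classify_entry(["A" * 120]),
]
```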
 
 Line 96 needs a capital letter at the beginning.
 
 Line 107: 'this would generate' → 'this generates' (given it is something that has been implemented, correct?).
 
 Line 124: missing an 'of'.
 
 Line 163: inspiring by -> inspired by.
 
 Thank you for the feedback. All of the above grammatical and spelling errors have been corrected in the manuscript.
 
 Line 109-111: apart from the example, it would be good to spell out the general rule applied to anti-idiotypic antibodies.
 
 We thank the reviewer for the valuable feedback. For anti-idiotypic antibody complexes, the partner antibody is treated as a dual-chain antigen, necessitating individual evaluation of heavy chain and light chain interactions with the anti-idiotypic component. We have given a general rule for anti-idiotypic antibodies in section “2.2 PDB splitting” of the revised manuscript.
 
 Line 155-159: could the authors provide references for the two choices (based on sasa and any-atom distance) that they adopted to define interacting residues?
 
 We thank the reviewer for the comment and the suggestion. As in our response to Reviewer #1 in the public review, the definition of interacting residues and the thresholds chosen in the manuscript are based on existing literature. We have added additional references for support in section “1. Introduction”. Our resource does not provide a fixed amino acid list. Instead, all interacting residues are explicitly documented alongside their corresponding ΔSASA (solvent-accessible surface area changes) and intermolecular distances, allowing researchers to flexibly select residue pairs based on customized thresholds from downloadable datasets. Furthermore, to align with widely adopted criteria in current literature, where interactions are defined by ΔSASA >1 Å² and atomic distances <6 Å, we have recalibrated our analysis in the revised version. Specifically, we replaced the previous 5 Å distance threshold with a 6 Å cutoff to recalculate interacting residues.
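
 The threshold-based selection described in this response can be sketched as follows. This is an illustration only: the record layout, field names, and example values are hypothetical and do not reflect the actual AACBD download format. Requiring both criteria at once is also an assumption, so the sketch exposes a flag for accepting either criterion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResiduePair:
    """One antibody-antigen residue pair (hypothetical record layout)."""
    antibody_residue: str
    antigen_residue: str
    delta_sasa: float    # change in solvent-accessible surface area, in Å²
    min_distance: float  # closest any-atom distance between the residues, in Å


def select_interacting(pairs, min_delta_sasa=1.0, max_distance=6.0, require_both=True):
    """Keep pairs passing the ΔSASA and distance thresholds.

    Defaults follow the criteria quoted above (ΔSASA > 1 Å², distance < 6 Å).
    With require_both=False a pair passes if either criterion holds.
    """
    selected = []
    for pair in pairs:
        buried = pair.delta_sasa > min_delta_sasa
        close = pair.min_distance < max_distance
        if (buried and close) if require_both else (buried or close):
            selected.append(pair)
    return selected


# Made-up values, for illustration only.
example_pairs = [
    ResiduePair("H:TYR33", "A:ASP101", 12.4, 3.1),
    ResiduePair("H:SER52", "A:LYS55", 0.4, 5.6),
    ResiduePair("L:ASN92", "A:GLU17", 3.0, 5.5),
    ResiduePair("L:GLY68", "A:THR20", 2.2, 7.9),
]

interacting = select_interacting(example_pairs)
stricter = select_interacting(example_pairs, max_distance=5.0)
```

 Passing `max_distance=5.0` mimics the earlier 5 Å cutoff, which shows how a user could apply a customized threshold to the same records.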
 
 Line 176-178: could the authors re-phrase this sentence to clarify what they mean by 'change in the distribution'?
 
 We thank the reviewer for the suggestion. Our search was conducted with an end date of November 2023. However, Figure 3B includes an entry dated 2024. Upon reviewing this record, we identified that the discrepancy arises from the supersession of the 7SIX database entry (originally released in December 2022) by the 8TM1 version in January 2024. This version update explains the apparent chronological inconsistency. We regret any lack of clarity in our original description and have revised the corresponding section in the manuscript to explicitly clarify this change in the database entry.
 
 Caption Figure 3: please spell out all the acronyms in the figure. Provide the date when the last search was performed (i.e., the date of the last update of these statistics).
 
 We thank the reviewer for the comment. We have systematically expanded all acronyms and included update dates for statistics in the legend of Figure 3. Corresponding changes have also been made to the statistical pages on the website.
 
 Finally, it would be advisable to do a general check on the use of the English language (e.g. I noted a few missing articles). In Figure 5 DrugBank contains typos.
 
 We sincerely appreciate the reviewer's meticulous attention to linguistic precision. We have corrected the typographical error in Figure 5 and conducted a comprehensive review of the entire manuscript to ensure accuracy and clarity.
 
 AuthorResponse
URL

biorxiv.org/content/10.1101/2024.11.12.623267v2
www.biorxiv.org www.biorxiv.org

Mitochondrial ETF insufficiency drives neoplastic growth by selectively optimizing cancer bioenergetics

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  The authors present an important set of data implicating ETFDH as an epigenetically suppressed gene in cancer with tumor suppressive functions. The evidence is solid, with the authors demonstrating that ETFDH suppression results in accumulation of amino acids that impact metabolism via hyperactive mTORC1.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  In their manuscript, Papadopoli et al explore the role of ETFDH in transformation. They note that ETFDH protein levels are decreased in cancer, and that deletion of ETFDH in cancer cell lines results in increased tumorigenesis, elevated OXPHOS and glycolysis, and a reduction in lipid and amino acid oxidation. The authors attribute these effects to increased amino acid levels stimulating mTORC1 signaling and driving alterations in BCL6 and EIF4EBP1. They conclude that ETFDH is epigenetically silenced in a proportion of neoplasms, suggesting a tumor-suppressive function. Overall, the authors logically present clear data and perform appropriate experiments to support their hypotheses. I only have a few minor points related to the semantics of a few of the authors' statements.
  
  Minor Points
  
  Authors state, "we identified ETF dehydrogenase (ETFDH) as one of the most dispensable metabolic genes in neoplasia." Surely there are thousands of genes that are dispensable for neoplasia. Perhaps the authors can revise this sentence and similar sentiments in the text.
  
  Authors state, "These findings show that ETFDH loss elevates glutamine utilization in the CAC to support mitochondrial metabolism." While elevated glutamine-to-CAC flux is consistent with this statement, the authors have not measured the effect of restoring glutamine utilization to baseline on mitochondrial metabolism. Thus, the causality implied by the authors can only be inferred based on the data presented. Indeed, the increased glutamine consumption may be linked to the increase in ROS, as glutamate efflux via system xCT is a major determinant of glutamine catabolism in vitro.
  
  Authors state that the mechanism described is an example of "retrograde signaling". However, the mechanism seems to be related to a reduction in BCAA catabolism, suggesting that the observed effects may be a consequence of altered metabolic flux rather than a direct signaling pathway. The data presented do not delineate whether the observed effects stem from disrupted mitochondrial communication or from shifts in nutrient availability and metabolic regulation.
  
  The authors should discuss which amino acids that are ETFDH substrates might affect mTORC1 activity, or consider whether other ETFDH substrates might also affect mTORC1 in their discussion. Along these lines, the authors might consider discussing why amino acids that are not ETFDH substrates are increased upon ETFDH loss.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The altered metabolism of tumors enables their growth and survival. Classically, tumor metabolism often involves increased activity of a given pathway in intermediary metabolism to provide energy or substrates needed for growth. Papadopoli et al. investigate the converse - the role of mitochondrial electron transfer flavoprotein dehydrogenase (ETFDH) in cancer metabolism and growth. The authors present compelling evidence that ETFDH insufficiency, which is detrimental in non-malignant tissues, paradoxically enhances bioenergetic capacity and accelerates neoplastic growth in cancer cells in spite of the decreased metabolic fuel flexibility that this affords tumor cells. This is achieved through the retrograde activation of the mTORC1/BCL-6/4E-BP1 axis, leading to metabolic and signaling reprogramming that favors tumor progression.
  
  Strengths:
  
  This review focuses primarily on the cancer metabolism aspects of the manuscript.
  
  The study provides robust evidence linking ETFDH insufficiency to enhanced cancer cell bioenergetics and tumor growth.
  
  The use of multiple cancer cell lines and in vivo models strengthens the generalizability of the findings.
  
  The mechanistic insights into the mTORC1/BCL-6/4E-BP1 axis and its role in metabolic reprogramming are of general interest within and outside the immediate field of tumor metabolism.
  
  Weaknesses:
  
  The ETFDH knockout experiments are well-controlled by the addback of sgRNA-resistant ETFDH, but do not determine if the catalytic activity of this enzyme is required for the phenotypes induced by ETFDH loss.
  
  Although this is not critical, it would be nice to see if the increased labeled aspartate pools result in higher nucleotide pools to support tumor growth.
  
  Conclusion:
  
  This manuscript provides significant insights into the role of ETFDH insufficiency in cancer metabolism and growth. The findings highlight the potential of targeting the mTORC1/BCL-6/4E-BP1 axis in ETFDH-deficient cancers. The compelling data support the conclusions presented in the manuscript, which will be valuable to the cancer metabolism community.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  We are highly appreciative of your constructive criticism and that you found our findings to be of interest and significance. Based on your helpful suggestions, we plan to revise the paper as follows:
  
  (1) Although ETFDH is reduced but not mutated across neoplasia, we appreciate your point regarding the catalytic activity of ETFDH. To this end, in the revision we are planning to compare the effects of rescues using wild type ETFDH or one of the MADD-associated mutants with compromised catalytic activity.
  
  (2) We intend to measure steady-state nucleotide levels as a function of ETFDH status in the cell. If time and/or funding allow, we will also perform appropriate labelling experiments.
  
  (3) We will revise the text of the manuscript to address the minor points raised by the reviewers.
  
  Again, we would like to thank you for your helpful comments, which we aim to address as outlined above, and which we hope will further improve our report.
  
  AuthorResponse
URL

biorxiv.org/content/10.1101/2024.10.25.620155v3
www.biorxiv.org www.biorxiv.org

Genetic parallels in biomineralization of the calcareous sponge Sycon ciliatum and stony corals

5
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important paper reports the discovery of calcarins, a protein family that seems to be involved in calcification in the calcareous sponge Sycon ciliatum, significantly enhancing our understanding of the molecular and cellular mechanisms underlying spicule formation in sponges and the evolution of carbonate biomineralization. The conclusions are supported by compelling evidence based on an integrated analysis that combines transcriptomics, genomics, proteomics, and precise in situ hybridization. These findings will be of broad interest to cell biologists, biochemists, and evolutionary biologists.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  To elucidate the mechanisms and evolution of animal biomineralization, Voigt et al. focused on the sponge phylum - the earliest branching extant metazoan lineages exhibiting biomineralized structures - with a particular emphasis on deciphering the molecular underpinnings of spicule formation. This study centered on calcareous sponges, specifically Sycon ciliatum, as characterized in previous work by Voigt et al. In S. ciliatum, two morphologically distinct spicule types are produced by a set of two different types of cells that secrete extracellular matrix proteins, onto which calcium carbonate is subsequently deposited. Comparative transcriptomic analysis between a region with active spicule formation and other body regions identified 829 candidate genes involved in this process. Among these, the authors focused on the calcarin gene family, which is analogous to the Galaxins, the matrix proteins known to participate in coral calcification. The authors performed three-dimensional structure prediction using AlphaFold, examined mRNA expression of Calcarin genes in spicule-forming cell types via in situ hybridization, conducted proteomic analysis of matrix proteins isolated from purified spicules, and carried out chromosome arrangement analysis of the Calcarin genes.
  
  Based on these analyses, it was revealed that the combination of Calcarin genes expressed during spicule formation differs between the founder cells (responsible for producing diactines and triactines) and the thickener cells that differentiate from them, underscoring the necessity for precise regulation of Calcarin gene expression in proper biomineralization. Furthermore, the observation that 4 Calcarin genes are arranged in tandem arrays on the chromosome suggests that two rounds of gene duplication followed by neofunctionalization have contributed to the intricate formation of S. ciliatum spicules. Additionally, similar subtle spatiotemporal expression patterns and tandem chromosomal arrangements of Galaxins during coral calcification indicate parallel evolution of biomineralization genes between S. ciliatum and aragonitic corals.
  
  Strengths:
  
  (1) An integrative research approach, encompassing transcriptomic, genomic, and proteomic analyses as well as detailed FISH.
  
  (2) High-quality FISH images of Calcarin genes, along with a concise summary clearly illustrating their expression patterns, are appreciated.
  
  (3) It was suggested that thickener cells originate from founder cells. To the best of my knowledge, this is the first study to demonstrate trans-differentiation of sponge cells based on the cell-type-specific gene expression, as determined by in situ hybridization.
  
  (4) The comparison between the Calcarins of calcitic sponges and the Galaxins of aragonitic corals from various perspectives, including protein tertiary structure predictions, gene expression profiling during calcification, and chromosomal sequence analysis, reveals significant similarities between them.
  
  (5) The conclusions of this paper are generally well supported by the data; however, some FISH images require clearer indication or explanation.
  
  (6) Figure S2 (B, C, D): The fluorescent signals in these images are difficult to discern. If the authors choose to present signals at such low magnification, enhancing the fluorescence signals would improve clarity. Additionally, incorporating Figure S2A as an inset within Figure S2E may be sufficient to convey the necessary information about signal localization.
  
  (7) Figure S3A: The claim that Cal2-expressing spherical cells are closely associated with the choanoderm at the distal end of the radial tube is difficult to follow. Are these Cal2-expressing spherical cells interspersed among choanoderm cells, or are they positioned along the basal surface of the choanoderm? Clarifying their precise localization and indicating it in the image would strengthen the interpretation.
  
  (8) To further highlight the similarities between S. ciliatum and aragonitic corals in the molecular mechanisms of calcification, consider including a supplementary figure providing a concise depiction of the coral calcification process. This would offer valuable context for readers.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This paper reports on the discovery of calcarins, a protein family that seems involved in calcification in the sponge Sycon ciliatum, based on specific expression in sclerocytes and detection by mass spectrometry within spicules. Two aspects stand out: (1) the unexpected similarity between Sycon calcarins and the galaxins of stony corals, which are also involved in mineralization, suggesting a surprising, parallel co-option of similar genes for mineralization in these two groups; (2) the impressively cell-type-specific expression of specific calcarins, many of which are restricted to either founder or thickener cells, and to either diactines, triactines, or tetractines. The finding that calcarins likely diversified at least partly by tandem duplications (giving rise to gene clusters) is a nice bonus.
  
  Strengths:
  
  I enjoyed the thoroughness of the paper, with multiple lines of evidence supporting the hypothesized role of calcarins: spatially and temporally resolved RNAseq, mass spectrometry, and whole-mount in situ hybridization using CISH and HCR-FISH (the images are really beautiful and very convincing). The structural predictions and the similarity to galaxins are very surprising and extremely interesting, as they suggest parallel evolution of biomineralization in sponges and cnidarians during the Cambrian explosion by co-option of the same "molecular bricks".
  
  Weaknesses:
  
  I did not detect any major weakness, beyond those inherent to working with sponges (lack of direct functional inhibition of these genes) or with fast-evolving gene families with complex evolutionary histories (lack of a phylogenetic tree that would clarify the history of galaxins/calcarins and related proteins).
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The study explores the extent to which the biomineralization process in the calcitic sponge Sycon ciliatum resembles aragonitic skeleton formation in stony corals. To investigate this, the authors performed transcriptomic, genomic, and proteomic analyses on S. ciliatum and examined the expression patterns of biomineralization-related genes using in situ hybridization. Among the 829 differentially expressed genes identified in sponge regions associated with spicule formation, the authors focused on calcarin genes, which encode matrix proteins analogous to coral galaxins. The expression patterns of calcarins were found to be diverse but specific to particular spicule types. Notably, these patterns resemble those of galaxins in stony corals. Moreover, the genomic organization of calcarin genes in S. ciliatum closely mirrors that of galaxin genes in corals, suggesting a case of parallel evolution in carbonate biomineralization between calcitic sponges and aragonitic corals.
  
  Strengths:
  
  The manuscript is well written, and the figures are of high quality. The study design and methodologies are clearly described and well-suited to addressing the central research question. Particularly noteworthy is the authors' integration of various omics approaches with molecular and cell biology techniques. Their results support the intriguing conclusion that there is a case of parallel evolution in skeleton-building gene sets between calcitic sponges and aragonitic corals. The conclusions are well supported by the data and analyses presented.
  
  Weaknesses:
  
  The manuscript is strong, and I have not identified any significant weaknesses in its current form.
  
  Review 3
5. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  We sincerely thank all reviewers for their thoughtful, detailed, and supportive evaluations of our manuscript. We are very pleased that the reviewers appreciated the integrative approach of our study, the quality of the imaging and analyses, and the insights provided into the parallel evolution of biomineralization mechanisms in sponges and corals.
  
  We are carefully considering all the suggestions made, including those regarding the improvement of figure clarity and the clarification of certain image interpretations. These comments are extremely valuable, and we are preparing a detailed point-by-point reply to accompany our revised manuscript.
  
  It was also brought to our attention that the links to the Zenodo repository were incorrect. We apologize for this oversight and any inconvenience it may have caused and will update the links in our revised manuscript. In the meantime, the correct Zenodo repositories can be accessed using the following links:
  
  https://zenodo.org/records/14755899
  
  https://zenodo.org/records/13847772
  
  We again thank the reviewers for their constructive feedback, which will help us to further strengthen the manuscript.
  
  AuthorResponse
URL

biorxiv.org/content/10.1101/2025.02.06.636789v2
www.biorxiv.org www.biorxiv.org

Krüppel Regulates Cell Cycle Exit and Limits Adult Neurogenesis of Mushroom Body Neural Progenitors in Drosophila

5
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study provides important insights into the regulation of neuroblast lifespan and proliferation in the Drosophila mushroom body, identifying Krüppel (Kr) as a key transcription factor promoting timely termination of these neuroblasts by repressing Imp expression, and proposes an antagonistic role of Krüppel homolog 1 (Kr-h1), whose overexpression leads to prolonged mushroom body neuroblast proliferation and tumor-like expansion. The findings are impactful for researchers interested in temporal patterning and neural development, and the methods and data analysis are solid; however, the precise regulatory interactions between Kr and Kr-h1 and their modes of action remain incompletely tested. Further experiments would be required to fully elucidate the mechanistic interplay between the factors involved.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 In this manuscript, the authors investigated factors required for neural progenitors to exit the cell cycle before the adult stage. They first show that Kr is turned on in pupal stage MBNBs, and depletion of Kr from pupal stage NBs leads to retention of MBNBs into the adult stage. Then they demonstrate that these retained NBs maintain the expression of Imp, and co-depletion of Imp abolishes the extended neurogenesis. Further, they show that co-depletion of Kr-h1 significantly reduces the retained MBNBs caused by loss of Kr, suggesting antagonistic genetic interactions between these two. In addition, they demonstrate that over-expressing Kr-h1 leads to the striking phenotype of tumor-like neuroblast overgrowth in adult brains.
 
 Strengths:
 
 (1) The authors leveraged well-controlled, powerful genetic tools (including temporal control of RNAi knockdown using the Gal80ts system), and provided strong evidence that Kr expression in pupal stage MBNBs is required to repress Imp and promote the end of neurogenesis. Similarly, the experimental result of co-depleting Kr-h1 and Kr, and the striking phenotype upon Kr-h1 mis-expression, support the antagonistic roles played by Kr-h1 and Kr in this process.
 
 (2) The sample sizes, quantification methods, and p-values are well documented for all experiments. In most parts, the data presented strongly support their conclusions.
 
 (3) Identification of two transcription factors with opposite roles in controlling cell cycle exit, and their possible interactions with the Imp/Syp axis, is highly significant for the study on how the proliferation of neural progenitors is regulated and limited before the adult stage.
 
 Weaknesses:
 
 (1) The nature of the KrIf-1 allele is not clear. It is mentioned that this allele leads to misexpression of Kr in various tissues. However, it is not clear if Kr is mis-expressed or lost in MBNBs in the KrIf-1 mutant. If Kr is mis-expressed in MBNBs in the KrIf-1 mutant, then it would be difficult to explain why both loss of Kr and mis-expression of Kr in MBNBs lead to the same NB retention phenotype. The authors should examine Kr expression in MBNBs in the KrIf-1 mutant.
 
 (2) Some parts of the regulations and interactions between Kr, Kr-h1, Imp, Syp, and E93 are not well-defined. For example, the data suggest that Kr is turned on in the pupal stage MBNBs, and is required to end neurogenesis through repressing Imp and Kr-h1. To further support this conclusion, the authors can examine if Kr-h1 expression is up-regulated in Kr-RNAi. The authors suggested that Kr-h1 may act upstream or in parallel to Imp/Syp, but also suggested that Kr-h1 may repress E93. The expression of Imp, Syp, and E93 can be examined in brains with Kr-h1 mis-expression to determine where Kr-h1 acts. If Imp expression is elevated when Kr-h1 is mis-expressed, then Kr-h1 may act upstream of Imp. If Imp/Syp expression does not change, then Kr-h1 may act on the E93 level.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In this paper, the authors study the role of Kruppel in regulating the survival of mushroom body neuroblasts. They first confirm that adult wild-type brains have no proliferation and report that Kruppel mutants and Kruppel RNAi in neuroblasts show a few proliferative clones; they show that these proliferative clones are localized in the mushroom body. They then show that Kruppel is expressed mostly during pupal stages and acts by downregulating the expression of Imp, which has been shown to positively regulate neuroblast proliferation and survival. Expectedly, this also affects neuronal diversity in the mushroom body, which is enriched in gamma neurons that are born during the Imp-expression window. Finally, they show that Kr acts antagonistically to Kr-h1, which is expressed predominantly in larval stages.
 
 Strengths:
 
 The main strength of this paper is that it identified a novel regulator of Imp expression in the mushroom body neuroblasts. Imp is a conserved RNA-binding protein that has been shown to regulate neural stem cell proliferation and survival in different animals.
 
 Weaknesses:
 
 (1) The main weakness of the paper is that the authors want to test adult neurogenesis in a system where no adult neurogenesis exists. To achieve this, they force neuroblasts to survive in adulthood by altering the genetic program that prevents them from terminating their proliferation. If this were reminiscent of "adult neurogenesis", the authors should at least show how adult neurons incorporate into the mushroom body even if they are born much later. On the contrary, this more likely resembles a tumorigenic phenotype, when stem cells divide way past their appropriate timing.
 
 (2) Moreover, the figures are, in many cases, hard to understand, and the interpretation of the figures doesn't always match what one sees. The manuscript would benefit from better figures; for example, in Figure 2C, Miranda expression in insc>GFP in KrIf-1 is not visible.
 
 (3) The authors describe a targeted genetic screen, but they don't describe which genes were tested, how they were chosen, and why Kruppel was finally selected.
 
 (4) The authors argue that Kr does not behave as a typical tTF in MBNBs. However, they show no expression in the embryo, limited expression in the larva and early pupa, and a peak around P24-P48. This sounds like a temporally regulated expression of a transcription factor. Importantly, they mentioned that they tested their observations against different datasets (FlyAtlas2, modENCODE, and MBNB-lineage-specific RNA-seq data), but they don't provide the data.
 
 (5) Finally, the contribution of Kr to the neuronal composition of the mushroom body is expected (since Imp is known to regulate neuronal diversity in the MB), but the presentation in the paper is very incomplete.
 
 Unfortunately, based on the above, I am not convinced that the authors can use this framework to infer anything about adult neurogenesis. Therefore, the impact of this work is limited to the role of Kruppel in regulating Imp, which has already been shown to regulate the extent of neuroblast division, as well as the neuronal types that are born at different temporal windows.
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 Drosophila neuroblasts (NBs) serve as a well-established model for studying neural stem cell biology. The intrinsic genetic programs that control their mitotic potential throughout development have been described in remarkable detail, highlighting a series of sequentially expressed transcription factors and RNA-binding proteins that together constitute the temporal patterning system.
 
 However, the mechanisms that limit the number of NB divisions remain largely unknown in a specific subset of NBs known as mushroom body neuroblasts (MB NBs). Unlike other NBs, which terminate proliferation before or shortly after the onset of metamorphosis, MB NBs continue dividing until the end of metamorphosis, ceasing only just before adulthood. In this study, the authors identify the transcription factor Krüppel (Kr), a member of the conserved Krüppel-like family, as temporally regulated in MB NBs. They demonstrate that Kr knockdown during pupal stages maintains expression of the RNA-binding protein Imp and results in prolonged MB NB proliferation into adulthood. Their data suggest that Kr contributes to the timely silencing of Imp during metamorphosis. The authors further identify Kr-h1, a related transcription factor, as a potential antagonist. While Kr-h1 appears dispensable for the timely termination of MB NBs under normal conditions, its overexpression leads to their continued proliferation and tumor-like expansion in adults.
 
 This work provides the first evidence for a transcription factor-driven temporal regulation mechanism in MB NBs, offering new insight into the control of neural stem cell self-renewal. Given the evolutionary conservation of Krüppel-like factors, this study may have broader implications for the neural stem cell field.
 
 Strengths:
 
 (1) The study possibly identifies a new series of temporal transcription factors that are specific for mushroom body neuroblasts.
 
 (2) The mechanism could be conserved in vertebrates.
 
 Weaknesses:
 
 Some proposed regulatory interactions, particularly between Kr, Kr-h1, and other temporal factors like Imp, Chinmo, and E93, have not been thoroughly investigated, which weakens the support for the proposed model. Additional experimental validation is needed to confirm these relationships and strengthen the mechanistic framework.
 
 Review 3
5. Public_Reviews 09 May 2025
 
 in eLife
 
 Author response:
 
 We thank the editors and reviewers for their thoughtful and constructive evaluation of our manuscript, “Krüppel Regulates Cell Cycle Exit and Limits Adult Neurogenesis of Mushroom Body Neural Progenitors in Drosophila.” We are pleased that all reviewers recognised the novelty and significance of identifying Krüppel (Kr) as a key transcription factor promoting timely termination of mushroom body neuroblast (MBNB) proliferation, and the potential antagonistic function of Kr-h1.
 
 We appreciate the helpful suggestions aimed at improving the mechanistic clarity and presentation of our findings. Below, we outline how we plan to address the major points raised in the full revision.
 
 (1) Characterisation of the KrIf-1 allele and Kr expression
 
 We agree that clarifying the nature of the KrIf-1 allele is important. In response to this concern, we will examine Kr expression in KrIf-1 mutant larval, pupal, and adult brains using immunostaining and available reporter lines. These experiments will help determine whether the observed neuroblast retention phenotype correlates with altered Kr expression in MBNBs.
 
 (2) Regulatory relationships between Kr, Kr-h1, Imp, Syp, Chinmo, and E93
 
 We are currently performing additional experiments to clarify the interactions among these temporal factors. For instance, we are testing whether Kr-h1 overexpression alters the expression of Imp, Syp, and E93. We have obtained a published E93 antibody from Dr Chris Doe (Syed et al., 2017) and will include E93 expression analysis in our revised manuscript.
 
 While Chinmo is of interest, its expression is well established to be regulated downstream of Imp/Syp via mRNA stability (Liu et al., 2015; Ren et al., 2017). Given that we currently lack reliable tools to assess Chinmo levels, we will focus primarily on Imp, Syp, and E93 as readouts for Kr/Kr-h1 function. If we succeed in obtaining Chinmo antibodies or reporter lines in time, we will include corresponding data.
 
 (3) Expression of Kr-h1 in MBNBs
 
 We fully agree that direct evidence for Kr-h1 expression in MBNBs is important. To address this, we have obtained the Kr-h1::GFP BAC transgenic line (BDSC #96786) and are currently using it to assess Kr-h1 expression in MBNBs. We also tested an anti–Kr-h1 antibody previously reported by Kang et al. (2017), developed in the context of fat body studies, but it did not yield clear signals in larval MBNBs. However, previous work by Shi et al. (2007) clearly demonstrated Kr-h1 expression in the developing MB, including MBNBs, using a custom antibody developed by their lab. We also contacted the Lee lab to request this antibody, but unfortunately, it is no longer available. We will include the results obtained using the GFP BAC line in the revised manuscript and, if needed, pursue RNA in situ hybridisation to further validate Kr-h1 expression in MBNBs.
 
 (4) Temporal Kr knockdown and MARCM analysis
 
 We appreciate the suggestion to validate our RNAi-based temporal knockdown results using MARCM. We plan to perform MBNB-specific MARCM analysis following the strategy described by Rossi et al. (2020). However, this approach requires additional time due to the logistics of acquiring the necessary fly stocks, generating appropriate genetic combinations, and conducting clonal analyses. While we will make every effort to include these data, we note that RNAi-based knockdown offers the advantage of temporal reversibility and has been essential for assessing stage-specific requirements in our current study.
 
 (5) Details of the targeted genetic screen
 
 Kr was initially identified as part of a broader, ongoing effort to screen for candidate transcription factors and cell cycle regulators involved in neuroblast cell cycle exit and/or quiescence. As this screen is still preliminary and incomplete, we prefer not to include the full dataset at this stage. Instead, we will revise the manuscript to clarify that Kr was prioritised for further investigation based on the striking MBNB-specific phenotype observed upon RNAi-mediated knockdown and in the KrIf-1 mutant, rather than through a completed screening process.
 
 (6) Clarifying the model (Figure 6D) and interactions
 
 We will revise the proposed model to distinguish between experimentally supported interactions and speculative ones. As noted above, we will primarily focus on the Imp/Syp and E93 axis in relation to Kr and Kr-h1 activity. Chinmo will be omitted from the model unless further data become available to support its inclusion.
 
 (7) Clarifications on figures and data presentation
 
 We appreciate the feedback on figure clarity. We will revise figures such as 1B, 2C, and 3A to improve legibility and presentation. We will also correct typographical errors and figure references, and clarify the activity patterns of the GAL4 drivers. Specifically, while UASmCD8::GFP expression driven by OK107-GAL4 is markedly weaker in MBNBs than in their neuronal progeny (as seen, for example, in Figure S3C), the driver remains active and functionally relevant in MBNBs. We believe the weak expression in MBNBs likely explains the absence of a NB retention phenotype in OK107>KrIR adult brains (see main text, Lines 374–376). As suggested by the reviewer, we will clarify this point earlier in the manuscript and can include additional data showing OK107>GFP expression patterns in pupal MB lineages as supplementary material.
 
 (8) Analysis of public datasets
 
 We will include results from our analysis of publicly available datasets such as FlyAtlas2, modENCODE, and a time-course RNA-seq dataset specific to MBNBs (Liu et al., 2015). While the spatial resolution of FlyAtlas2 and modENCODE is limited, the MBNB dataset provides valuable temporal information up to 36 h after puparium formation (APF). From this dataset, we observe that Kr expression remains consistently low throughout development, with only a modest increase at 84 h ALH (mean TPM ~11) and 36 h APF (~7), suggesting it does not undergo strong transcriptional regulation in MBNBs. In contrast, Kr-h1 is highly expressed during early larval stages (24–84 h ALH; mean TPM ~55–60) and shows a marked suppression by 36 h APF (mean TPM ~2), consistent with its proposed role in promoting MBNB proliferation. Importantly, Eip93F (E93) exhibits a reciprocal pattern to Kr-h1—with minimal expression until 84 h ALH (mean TPM ~24), followed by a substantial induction at 36 h APF (mean TPM ~104), aligning with its known role in triggering neuroblast termination. These temporal expression dynamics support our model that Kr-h1 and E93 function in opposition during the transition from proliferative to terminating neuroblast states. We will summarise these findings in the revised manuscript, along with appropriate discussion of dataset limitations.
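 As a minimal illustration of the comparison described above, the approximate mean TPM values quoted in this response can be converted to log2 fold changes between 84 h ALH and 36 h APF. The gene names, time points and TPM values come from the text. The pseudocount of 1, the dictionary layout, and the use of 57.5 as the midpoint of the quoted ~55–60 range for Kr-h1 are assumptions made only for this sketch, not part of the authors' analysis.

```python
import math

# Approximate mean TPM values as quoted in the response (MBNB dataset, Liu et al., 2015).
# Kr-h1 at 84 h ALH is quoted only as a range (~55-60); 57.5 is an assumed midpoint.
tpm = {
    "Kr":    {"84h_ALH": 11.0, "36h_APF": 7.0},
    "Kr-h1": {"84h_ALH": 57.5, "36h_APF": 2.0},
    "E93":   {"84h_ALH": 24.0, "36h_APF": 104.0},
}

def log2_fold_change(early, late, pseudocount=1.0):
    """Return log2(late / early) after adding a pseudocount to both values.

    The pseudocount (an assumption of this sketch) keeps ratios stable
    when one of the TPM values is close to zero.
    """
    return math.log2((late + pseudocount) / (early + pseudocount))

# Negative values mean lower expression at 36 h APF than at 84 h ALH.
lfc = {
    gene: log2_fold_change(values["84h_ALH"], values["36h_APF"])
    for gene, values in tpm.items()
}

for gene, value in lfc.items():
    print(f"{gene}: log2 fold change (36 h APF vs 84 h ALH) = {value:+.2f}")
```

 Under these assumptions, Kr-h1 and E93 change in opposite directions and by a much larger margin than Kr, which matches the reciprocal pattern described in the text.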
 
 We hope this provisional response conveys our strong commitment to thoroughly addressing the reviewers’ concerns and improving the manuscript. We are currently carrying out additional experiments and will submit a revised version with new data and enhanced clarity in due course.
 
 References:
 
 Kang et al., 2017. Sci Rep. 7(1):16369. doi: 10.1038/s41598-017-16638-1.
 
 Shi et al., 2007. Dev Neurobiol. 67(11):1614–1626. doi: 10.1002/dneu.20537.
 
 Rossi et al., 2020. eLife. 9:e58880. doi: 10.7554/eLife.58880.
 
 Liu et al., 2015. Science. 350(6258):317–320. doi: 10.1126/science.aad1886.
 
 Ren et al., 2017. Curr Biol. 27(9):1303–1313. doi: 10.1016/j.cub.2017.03.018.
 
 Syed et al., 2017. eLife. 6:e26287. doi: 10.7554/eLife.26287.
 
 AuthorResponse

Tags

Summary

Review 1

Review 3

AuthorResponse

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.24.645006v1
www.biorxiv.org

The alternative oxidase reconfigures the larval mitochondrial electron transport system to accelerate growth and development in Drosophila melanogaster

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  The findings in this manuscript are important because they demonstrate the key role of metabolism in insect development. The data were collected and analyzed using solid and validated methodologies, but the evidence is incomplete, as the extent of the involvement of AOX activity in vivo and in physiological conditions is not addressed. This manuscript will be of interest for the fields of mitochondrial bioenergetics, metabolism and development.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The manuscript by Garcia et al. describes how the expression of a respiratory chain alternative oxidase (AOX) from the tunicate Ciona intestinalis, capable of transferring electrons directly from reduced coenzyme Q (CoQ) to oxygen, is able to induce an increase in the mass of Drosophila melanogaster larvae and an accelerated development, especially when the larvae are kept at low temperatures. In order to explain this phenomenon, the paper addresses the modifications in the activity and levels of the 'canonical' electron transfer system (ETS), i.e., complexes I-IV and of the ATP synthase. In addition, the abundance of different metabolites as well as the NAD+/NADH ratios are measured, finding significant differences between the larvae.
  
  Strengths:
  
  The observations of differences in growth, body mass and food intake in the wt D. melanogaster larvae vs. those expressing the AOX transgene are solid. The evidence that mild uncoupling of the ETS might accelerate development of the fly larvae is convincing.
  
  Weaknesses:
  
  Some of the observations, especially those concerning the origin of the metabolic remodelling in AOX-expressing larvae, are left unexplained, and the argumentation is somewhat speculative. What the authors mean by "reconfiguration" of the mitochondrial electron transfer system is not clear. If this implies that there is an actual change in ETS function and/or structural organisation in the presence of AOX, this conclusion is not supported by the experimental data. In addition, the influence of AOX activity in the mitochondrial ETS system is tested in vitro in the presence of saturating concentrations of substrates. The real degree to which AOX activity is actually influencing ETS activity in vivo remains unknown.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This manuscript presents intriguing findings about the role of alternative oxidase (AOX) from the tunicate Ciona intestinalis in accelerating growth and development when expressed in Drosophila melanogaster.
  
  Strengths:
  
  The study is overall well-constructed, including appropriate analysis. Likewise, the manuscript is written clearly and supported by high-quality figures. The present study provides valuable insights into AOX's role in Drosophila development. The paper attempts to explore a unique mechanism by which AOX influences Drosophila development, providing insights into mitochondrial respiration and its physiological effects. This is relevant for understanding mitochondrial dysfunction and potential therapeutic applications. The study employs a variety of approaches, including calorimetry, infrared thermography, and genetic analyses, to investigate AOX's impact on metabolism and development.
  
  Weaknesses:
  
  There are a number of methodological limitations and substantial gaps in the interpretation of the data presented, which reduce the strength of the conclusions. For instance, there is a misunderstanding of the non-proton motive nature of AOX: it does not uncouple respiration but merely decouples it, as it neither contributes to nor dissipates the proton motive force, in contrast to chemical uncouplers or proton uncouplers such as UCPs. The authors need to reassess their data in light of the above.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The manuscript by Garcia et al. describes how the expression of a respiratory chain alternative oxidase (AOX) from the tunicate Ciona intestinalis, capable of transferring electrons directly from reduced coenzyme Q (CoQ) to oxygen, is able to induce an increase in the mass of Drosophila melanogaster larvae and an accelerated development, especially when the larvae are kept at low temperatures. In order to explain this phenomenon, the paper addresses the modifications in the activity and levels of the 'canonical' electron transfer system (ETS), i.e., complexes I-IV and of the ATP synthase. In addition, the abundance of different metabolites as well as the NAD+/NADH ratios are measured, finding significant differences between the larvae.
  
  Strengths:
  
  The observations of differences in growth, body mass and food intake in the wt D. melanogaster larvae vs. those expressing the AOX transgene are solid. The evidence that mild uncoupling of the ETS might accelerate development of the fly larvae is convincing.
  
  We appreciate the reviewer’s attention to our results and hope we can improve the manuscript to address all criticism appropriately.
  
  Weaknesses:
  
  Some of the observations, especially those concerning the origin of the metabolic remodelling in AOX-expressing larvae, are left unexplained, and the argumentation is somewhat speculative. What the authors mean by "reconfiguration" of the mitochondrial electron transfer system is not clear. If this implies that there is an actual change in ETS function and/or structural organisation in the presence of AOX, this conclusion is not supported by the experimental data. In addition, the influence of AOX activity in the mitochondrial ETS system is tested in vitro in the presence of saturating concentrations of substrates. The real degree to which AOX activity is actually influencing ETS activity in vivo remains unknown.
  
  Indeed, the term “reconfiguration” may seem a little too strong. However, we do have preliminary structural data on larval mitochondria indicating that the term is adequate in this context. We plan to work on obtaining concrete data to sustain our claims that AOX imparts significant functional and structural remodeling of the organelle, which would be consistent with our respirometry and BN-PAGE data. If the data turns out not to be robust enough, we will consider replacing the term with one that better reflects our findings.
  
  We also realize that the in vivo data we are presenting (body mass, mobility, food intake) are indirect measurements of metabolism and that a more direct approach is necessary to assess the real degree to which AOX influences ETS activity in vivo. To address this issue, we plan to expand our pharmacological treatments of the larval development and to measure whole larval oxygen consumption.
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This manuscript presents intriguing findings about the role of alternative oxidase (AOX) from the tunicate Ciona intestinalis in accelerating growth and development when expressed in Drosophila melanogaster.
  
  Strengths:
  
  The study is overall well-constructed, including appropriate analysis. Likewise, the manuscript is written clearly and supported by high-quality figures. The present study provides valuable insights into AOX's role in Drosophila development. The paper attempts to explore a unique mechanism by which AOX influences Drosophila development, providing insights into mitochondrial respiration and its physiological effects. This is relevant for understanding mitochondrial dysfunction and potential therapeutic applications. The study employs a variety of approaches, including calorimetry, infrared thermography, and genetic analyses, to investigate AOX's impact on metabolism and development.
  
  We sincerely thank the reviewer for recognizing the strengths and acknowledging the novelty of our study.
  
  Weaknesses:
  
  There are a number of methodological limitations and substantial gaps in the interpretation of the data presented, which reduce the strength of the conclusions. For instance, there is a misunderstanding of the non-proton motive nature of AOX: it does not uncouple respiration but merely decouples it, as it neither contributes to nor dissipates the proton motive force, in contrast to chemical uncouplers or proton uncouplers such as UCPs. The authors need to reassess their data in light of the above.
  
  The reviewer is absolutely right about the non-proton motive nature of AOX. We will reassess our data considering that AOX decouples respiration and, if necessary and possible, we will add new experiments to address the methodological limitations raised by the reviewer.
  
  AuthorResponse

Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.20.639223v1
www.biorxiv.org

Genetic evidence: zebrafish hoxba and hoxbb clusters are essential for the anterior-posterior positioning of pectoral fins

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important study advances our understanding of vertebrate forelimb development, specifically the contribution of Hox genes to zebrafish pectoral fin formation. While there are reservations about some of the descriptions and interpretations of the data, the results are mostly convincing. The authors have employed a robust and extensive genetic approach to tackle a key and unresolved question. The findings will be of broad interest to developmental and evolutionary biologists.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors have used gene deletion approaches in zebrafish to investigate the function of genes of the hox clusters in pectoral fin "positioning" (but perhaps more accurately pectoral fin "formation").
  
  Strengths:
  
  The authors have employed a robust and extensive genetic approach to tackle an important and unresolved question.
  
  The results are largely presented in a very clear way.
  
  Weaknesses:
  
  The Abstract suggests that no genetic evidence exists in model organisms for a role of Hox genes in limb positioning. There are, however, several examples in mouse and other models (both classical genetic and other) providing evidence for a role of Hox genes in limb position, which is elaborated on in the Introduction.
  
  It would perhaps be more accurate to state that several lines of evidence in a range of model organisms (including the mouse) support a role for Hox genes in limb positioning. The authors' work is not weakened by a more inclusive introduction that cites the current literature more comprehensively.
  
  It would be helpful for the authors to make a clear distinction between "positioning" of the limb/fin and whether a limb/fin "forms" at all, independent of the relative position of this event along the body axis.
  
  It would be helpful to discuss why the zebrafish is sensitive to Hoxb loss with respect to the fin, whereas mouse Hoxb mutants do make a limb.
  
  Is this down to exclusive expression of Hoxbs in the zebrafish pectoral fin forming region rather than a specific functional role of the protein? This is important as it has implications for the interpretation of results throughout the paper and could explain some apparently conflicting results.
  
  Why is Hoxba more potent than Hoxbb? Is this because Hoxba has Hox4/5 present, while Hoxbb has only Hoxb5? (The hoxba locus has retained many more Hox genes in its cluster than hoxbb; therefore, one might expect to see greater redundancy in this locus.)
  
  Deletion of either Hoxa or Hoxd in the background of the Hoxba mutant does have some effect. Is this a reflection of protein function or expression dynamics of Hoxa/Hoxd genes?
  
  Can we really be confident that there is a "transformation of pectoral fin progenitor cells into cardiac cells"?
  
  The failure to repress Nkx2.5 in the posterior (pelvic fin) domain is clear, but have these cells actually acquired cardiac identity? They would be expected to express Tbx5a (or b) as cardiac precursors, but this domain does not broaden. There is no apparent expansion of the heart (field)/domain or progenitors beyond the 16 somite stage. The claimed "migration" of heart precursors in the mutant is not clear. The heart/cardiac domain that does form in the mutant is not clearly expanded in the mutant. The domain of cmlc2 looks abnormal in the mutant, but I am not convinced it is "enlarged" as claimed by the authors. The authors have not convincingly shown that "the cells that should form the pectoral fin instead differentiate into cardiac cells."
  
  The only clear conclusion is the loss of pectoral fin-forming cells rather than these fin-forming cells being "transformed" into a new identity. It would be interesting to know what has happened to the cells of the pectoral fin-forming region in these double mutants.
  
  It is not clear what the authors mean by a "converse" relationship between forelimb/pectoral fin and heart formation. The embryological relationship between these two populations is distinct in amniotes.
  
  The authors show convincing data that RA cannot induce Tbx5a in the absence of Hoxb clusters, but I am not convinced by the interpretation of this result. The results shown would still be consistent with RA acting directly upstream of tbx5a, but merely that RA acts in concert with hox genes to activate tbx5a. In the absence of one or the other, Tbx5a would not be expressed. It is not necessary that RA and hoxbs act exclusively in a linear manner (i.e., RA regulates hoxb that in turn regulates tbx5a).
  
  The authors have carried out a functional test for the function of hoxb6 and hoxb8 in the hemizygous hoxb mutant background. What is lacking is any expression analysis to demonstrate whether Hoxb6b or Hoxb8b are even expressed in the appropriate pectoral fin territory to be able to contribute to pectoral fin development, either in this assay or in normal pectoral fin development.
  
  (The term "compensate" used in this section is confusing/misleading.)
  
  The authors' confounding results described in Figures 6-7 are consistent with the challenges faced in other model organisms in trying to explore the function of genes in the hox cluster and the known redundancy that exists across paralogous groups and across individual clusters.
  
  Given the experimental challenges in deciphering the actual functions of individual or groups of hox genes, a discussion of the normal expression pattern of individual and groups of hox genes (and how this may change in different mutant backgrounds) could be helpful to make conclusions about likely normal function of these genes and compensation/redundancy in different mutant scenarios.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The authors of this manuscript performed a fascinating set of zebrafish mutant analyses on hox cluster deletion and pinpointed the cause of the pectoral fin loss in one combinatorial hox cluster mutant of Hoxba and Hoxbb.
  
  Strengths:
  
  The study is based on a variety of existing experimental tools that enabled the authors' past construction of hox cluster mutants, and is well-designed. The manuscript is well written to report the authors' findings on the mechanism that positions the pectoral fin.
  
  Weaknesses:
  
  The study does not address hox clusters other than ba and bb, and is confined to the use of zebrafish and to comparison with existing reports from mouse experiments.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Author response:
  
  We appreciate the reviewers' positive feedback on our paper. We especially thank them for their evaluation of the genetic analysis, which required a significant amount of time. We acknowledge that several aspects of our interpretation and description of the results need correction, as noted by both reviewers. Additionally, we recognize the importance of providing a more comprehensive overview of previous findings, including those conducted in mice, in the manuscript. In the revised version, we will thoroughly address the reviewers' concerns.
  
  Both reviewers emphasized the need for further validation to ascertain whether the specific requirement of Hox genes in the Hoxba and Hoxbb clusters for pectoral fin bud formation is due to their expression patterns or the functional roles of Hox proteins. This consideration has been on our agenda for some time; however, our submitted paper does not sufficiently address this aspect. In the revised manuscript, we will conduct a comprehensive analysis of the expression patterns of Hox genes in zebrafish to draw informed conclusions on this matter.
  
  AuthorResponse

Tags

Summary

Review 1

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.03.636262v2
www.biorxiv.org

Epitope Sequence and Modification Fingerprints of Anti-Aβ Antibodies

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  Antibodies that selectively bind distinct amyloid-beta variants are vital tools for Alzheimer's disease research. This valuable manuscript aims to delineate the epitope specificity in a panel of anti-amyloid-beta antibodies, including some with clinical relevance. The experiments were rigorously conducted, employing an interesting combination of established and state-of-the-art methodologies, yielding mostly robust findings. While the data regarding antibody sequence preferences for distinct amyloid-beta regions and aggregation states are convincing, a thorough revision of the manuscript would help to highlight the key results.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  The manuscript by Ivan et al aimed to identify epitopes on the Abeta peptide for a large set of anti-Abeta antibodies, including clinically relevant antibodies. The experimental work was well done and required a major experimental effort, including peptide mutational scanning, affinity determinations, molecular dynamics simulations, IP-MS, WB, and IHC. Therefore, it is of clear interest to the field. The first part of the work is mainly based on an assay in which peptides (15-18-mers) based on the human Abeta sequence, including some containing known PTMs, are immobilized, thus preventing aggregation. Although some results are in agreement with previous experimental structural data (e.g. for 3D6), and some responses to disease-associated mutations were different when compared to wild-type sequences (e.g. in the case of Aducanumab) - which may have implications for personalized treatment - I have concerns about the lack of consideration of the contribution of conformation (as in small oligomers and large aggregates) in antibody recognition patterns. The second part of the study used full-length Abeta in monomeric or aggregated forms to further investigate the differential epitope interaction between Aducanumab, donanemab, and lecanemab (Figures 5-7). Interestingly, these results confirmed the expected preference of these antibodies for aggregated Abeta, thus reinforcing my concerns about the conclusions drawn from the results obtained using shorter and immobilized forms of Abeta. Overall, I understand that the work is of interest to the field and should be published without the need for additional experimental data. However, I recommend a thorough revision of the structure of the manuscript in order to make it more focused on the results with the highest impact (second part).
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  This paper investigates binding epitopes of different anti-Abeta antibodies. Background information on the clinical outcome of some of the antibodies in the paper, which might be important for readers to know, is lacking. There are no references to clinical outcomes from antibodies that have been in clinical trials. This paper would be much more complete if the status of the antibodies were included. The binding characteristics of aducanumab, donanemab, and lecanemab should be compared with data from clinical phase 3 studies.
  
  Aducanumab was identified at Neurimmune in Switzerland and licensed to Biogen and Eisai. Aducanumab was withdrawn from the market due to a very high frequency of the side-effect amyloid-related imaging abnormalities-edema (ARIA-E). Gantenerumab was developed by Roche and had two failed phase 3 studies, mainly due to a high frequency of ARIA-E and low efficacy of Abeta clearance. Lecanemab was identified at Uppsala University, humanized by BioArctic, and licensed to Eisai, who performed the clinical studies. Eisai and Biogen are now marketing lecanemab as Leqembi on the world market. Donanemab was developed by Eli Lilly and is sold in the US as Kisunla.
  
  Limitations:
  
  (1) Conclusions are based on Abeta antigens that may not be the primary targets for some conformational antibodies like aducanumab and lecanemab. There is an absence of binding data for soluble aggregated species.
  
  (2) Quality controls and characterization of different Abeta species are missing. The authors need to verify if monomers remain monomeric in the blocking studies for Figures 5 and 6.
  
  (3) The authors should discuss the limitations of studying synthetic Abeta species and how aggregation might hide or reveal different epitopes.
  
  (4) The authors should elaborate on the differences between synthetic Abeta and patient-derived Abeta. There is a potential for different epitopes to be available.
  
  Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.26.640323v1
www.biorxiv.org

Targeted Protein Degradation by KLHDC2 Ligands Identified by High Throughput Screening

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This valuable study aims to advance the toolkit of small molecules used for approaches to targeted protein degradation for research and therapeutic applications. The authors provide solid data demonstrating the use of a high-throughput screen of small molecules to target a specific E3 ligase, KLHDC2 (Kelch-like homology domain containing protein 2); the resulting compounds then form the basis for new PROTAC (proteolysis targeting chimera) reagents. The strength of the work lies in expanding the PROTAC reagent inventory. The current work would be strengthened further by confirming that the PROTAC's activity is dependent on KLHDC2 and by a more thorough examination of off-target effects in cellular applications.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The manuscript "Targeted Protein Degradation by KLHDC2 Ligands Identified by High Throughput Screening" by Zhou, H. et al. describes the development of a high-throughput FP-based screen and the identification of a KLHDC2 ligand from a small molecule library. A counter screen and other filtering criteria led to the identification of lead compounds that contained a tetrahydroquinoline scaffold. Commercially available analogs (52 compounds) that shared this scaffold were characterized by a KLHDC2 competitive binding assay. Optimized compounds were obtained that demonstrated improved potency and increased binding affinity by SPR. Docking of a lead candidate (compound 6) suggested it bound at a distal lipophilic site within the SelK binding pocket of KLHDC2. Based on this model, the authors then synthesized PROTACs that linked the KLHDC2 binder to a BRD4-binding molecule, JQ1. These PROTAC candidates possessed different linker configurations, and PROTAC 8 was able to cause BRD4 degradation in cells, with a half-maximal degradation concentration (DC50) of 80 nM. The authors demonstrate the identification and characterization of small-molecule KLHDC2 ligands that can be used to generate PROTACs that result in BRD4 degradation in cells.
  
  Strengths:
  
  The study by Zhou, H. et al. expands the E3 ligase toolkit by targeting KLHDC2 to identify ligands for PROTAC development, which has predominantly relied on VHL and CRBN. This was accomplished using a described FP-based high-throughput screening strategy (high Z' values in 1536 well format). Both target-specific and counter-specific assays were performed, along with subsequent stringent follow-up assays designed to address non-specific binding/specificity concerns. Label-free direct binding validations by SPR were used to determine binding affinity/kinetics. A strength of the study is the characterization of the interaction between candidate compounds and KLHDC2 versus related KEAP1.
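  For readers unfamiliar with the Z' statistic mentioned above, it is conventionally computed from the means and standard deviations of the positive and negative control wells as Z' = 1 - 3(sd_pos + sd_neg) / |mean_pos - mean_neg|, with values closer to 1 indicating a wider separation between controls. The following is a minimal sketch of that calculation; the function name and the control readings are hypothetical illustrations and are not data or code from the study.

```python
from statistics import mean, stdev

def z_prime(positive, negative):
    """Z'-factor: 1 - 3 * (sd_pos + sd_neg) / |mean_pos - mean_neg|.

    Uses the sample standard deviation of each control group.
    """
    separation = abs(mean(positive) - mean(negative))
    if separation == 0:
        raise ValueError("control means are identical; Z' is undefined")
    return 1.0 - 3.0 * (stdev(positive) + stdev(negative)) / separation

# Hypothetical fluorescence polarization readings (mP), for illustration only.
bound_controls = [200.0, 204.0, 196.0, 202.0, 198.0]  # tracer bound to protein
free_controls = [50.0, 52.0, 48.0, 51.0, 49.0]        # tracer fully displaced

score = z_prime(bound_controls, free_controls)
print(f"Z' = {score:.3f}")
```

  A Z' above roughly 0.5 is commonly treated as indicating a robust assay window, which is the sense in which the high Z' values are cited here as a strength.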
  
  Structural insight into the potential mode of binding was inferred by computational docking studies of the newly discovered KLHDC2 ligands. This was performed to identify where the identified scaffolds could be modified by linker incorporation for the design of PROTACs. The computational predictions were evaluated by linking a solvent-exposed site on the KLHDC2 ligand to JQ1. Three linkers were tested, and two compounds were found to result in BRD4 degradation in cells by HiBiT degradation assay and western blot. These findings demonstrate the feasibility of these compounds for the design of PROTAC-based degraders.
  
  The authors present compelling KLHDC2 binding data for their lead compounds and demonstrate degradation of a target using a PROTAC strategy. Accordingly, the screening approach and compounds identified are likely to be of interest to the field and are likely to be generalizable to other PROTAC targets of interest.
  
  Weaknesses:
  
  The specificity of compounds for KLHDC2 was assessed by using a counter screen against KEAP1 and in vitro binding assays. However, off-target effects might occur in a cellular context, which weren't fully explored in the study. Notably, the authors do not demonstrate that the degradation induced by their PROTACs in cells is KLHDC2-dependent. A requirement for KLHDC2-mediated degradation could be evaluated, for example, by using knockout/knockdown of KLHDC2, or other means, to demonstrate specificity. Addressing specificity is deemed important to evaluate the proposed PROTAC mechanism of action in a cellular context that results in the degradation of BRD4. Specificity is important when considering the utility of these new compounds for PROTAC design.
  
  Additional rationale behind the selection of linkers used to generate candidate PROTACs would be informative and would benefit from additional discussion and/or citation. The reasons for the lack of activity, such as for compound 9, were not fully explored or discussed, such as whether complex assembly is potentially affected by linker choice. Perhaps related to this point, the authors note that a trifluoromethoxy group increased the binding affinity of compound 6. However, the subsequent docking analysis revealed this moiety to be solvent-exposed. The relationship between this site of functionalization, linker selection, and the resulting binding affinity or effect on DC50 was not clear and/or could be developed further.
  
  Minor issues related to the presentation of the manuscript include sections that would benefit from either additional citation and/or description, such as the KI-696 inhibitor used and the BRD4 HiBiT degradation assay that was used to assess PROTAC potency. Figure captions should be reviewed to ensure that the number of independent experiments is indicated, along with what the data points and error bars represent, as these are not indicated in several figures. BRD4 levels were quantified in Figure 4E; however, error/reproducibility (n) is not indicated.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  PROTACs are a class of small molecules that induce an interaction between a target protein and a ubiquitin ligase, thereby leading to the target protein's ubiquitination and subsequent proteasomal degradation. Given that the vast majority of PROTACs rely on the cereblon and VHL ubiquitin ligases, a major goal within this field has been to identify and develop ligands for additional ubiquitin ligases, in particular those whose expression affords tissue or subcellular specificity or those whose structure allows them to degrade targets that are otherwise incompatible with cereblon or VHL.
  
  In this work, Zhou and colleagues from the Bollong group at Scripps utilize a high-throughput fluorescence polarization screen of >350,000 compounds to identify and optimize a novel ligand for KLHDC2, a ubiquitin ligase which had previously been discovered to be capable of proximity-induced degradation of target proteins. Zhou et al. go on to show that this ligand can be used as the basis for PROTACs capable of degrading BRD4 in a cell line. Of note, prior to this paper, three other groups had also developed ligands to KLHDC2 and used them to generate active PROTACs. Interestingly, docking studies by Zhou et al. suggest that their compound may bind to a different region of KLHDC2's kelch domain.
  
  The major strengths of this work are its brevity and the clarity of the writing and figures. Their claim that they have discovered a ligand for KLHDC2, which can be used to develop BRD4-degrading PROTACs, is well-supported by their findings from the screen, SPR, and cellular assays. The weakness of the work, then, is not so much relevant to the paper at hand but rather stems from the fact that their story leaves me wanting to know more. Indeed, there are a number of interesting experiments that we need as a field in order to assess 1) how generalizable their findings are across cell lines and targets, and 2) how this new KLHDC2 ligand stacks up against the other recently discovered ligands for KLHDC2 as well as the existing standards, cereblon and VHL.
  
  Nonetheless, Zhou and colleagues provide a valuable addition to the emerging repertoire of KLHDC2 ligands, and I'm certain that with time, we will come to understand what ligands work best for KLHDC2-based PROTACs and how they compare to the growing set of ubiquitin ligases in our armamentarium.
  
  Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.31.646306v1
www.biorxiv.org www.biorxiv.org

Endothelial Slit2 guides the Robo1-positive sympathetic innervation during heart development

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This study presents a valuable finding on the role of Slit-Robo signaling in cardiac innervation. The evidence supporting the main claims of the authors is solid. The use of several mouse models, including constitutive and cell type-specific knockout models, makes the findings more robust. The scope of the presented studies is somewhat limited, as they primarily focus on evaluating the phenotypic changes in cardiac innervation following the loss of various Slit or Robo genes.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  The study aims to determine the role of Slit-Robo signaling in the development and patterning of cardiac innervation, a key process in heart development. Despite the well-studied roles of Slit axon guidance molecules in the development of the central nervous system, their roles in the peripheral nervous system are less clear. Thus, the present study addresses an important question. The study uses genetic knockout models to investigate how Slit2, Slit3, Robo1, and Robo2 contribute to cardiac innervation.
  
  Using constitutive and cell type-specific knockout mouse models, they show that the loss of endothelial-derived Slit2 reduces cardiac innervation. Additionally, Robo1 knockout, but not Robo2 knockout, recapitulated the Slit2 knockout effect on cardiac innervation, leading to the conclusion that Slit2-Robo1 signaling drives sympathetic innervation in the heart. Finally, the authors also show a reduction in isoproterenol-stimulated heart rate but not basal heart rate in the absence of endothelial Slit2.
  
  The conclusions of this paper are mostly well supported by the data, but some should be modified to account for the study's limitations and discussed in the context of previous literature.
  
  (1) It is well established that Slit ligands undergo proteolytic cleavage, generating N- and C-terminal fragments with distinct biological functions. Full-length Slit proteins and their fragments differ in cell association, with the N-terminal fragment typically remaining membrane-bound, while the C-terminal fragment is more diffusible. This distinction is crucial when evaluating the role of Slit proteins secreted by different cell types in the heart. However, this study does not examine or discuss the specific contributions of different Slit2 fragments, limiting its mechanistic insight into how Slit2 regulates cardiac innervation.
  
  (2) The endothelial-specific deletion of Slit2 leads to its loss in endothelial cells across various organs and tissues in the developing embryo. Therefore, the phenotypes observed in the heart may be influenced by defects in other parts of the embryo, such as the CNS or sympathetic ganglia, and this possibility cannot be ruled out.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  The aims of investigating Slit-Robo signaling in cardiac innervation were achieved by the experiments designed. While questions remain regarding signal regulation, the interplay between established axon guidance signals, and the further role of other Slit ligands and Robo expression in endothelium, the results strongly support the conclusions drawn.
  
  The writing and presentation are easy to follow and well structured. Appropriate controls are used, statistical analysis is applied appropriately, and the experiments directly test the aims, following a logical story.
  
  The authors demonstrate a novel mechanism for Slit-Robo signaling in cardiac sympathetic innervation. The data establishes a framework for future studies.
  
  Recommendations:
  
  Further assessment of interplay between Slit ligands as well as other signaling pathways (Semaphorin, NGF, etc) could be investigated. Is it possible to rescue the phenotype by modulation of other signaling pathways? Can combined Slit2/Slit3 KO rescue? Additionally, as the authors state, conditional Robo1 knockouts will be important to validate the findings of constitutive knockout.
  
  Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.01.22.634222v1
www.biorxiv.org www.biorxiv.org

Ubiquitination-activated TAB–TAK1–IKK–NF-kB axis modulates gene expression for cell survival in the lysosomal damage response

4
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents the important finding that lysosomal damage triggers inflammatory signaling through ubiquitination and the TAB-TAK1-IKK-NF-kB axis. The data obtained from the unbiased transcriptomic and proteomic analyses are convincing and provide invaluable information to the field. Although further experiments will be required to clarify how TAB2/3 are activated, this work will be of interest to researchers in the fields of organelle biology and inflammation.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 Lysosomal damage is commonly found in many conditions, including normal aging and age-related diseases. However, the transcriptional programs activated by lysosomal damage have not been thoroughly characterized. This study aimed to investigate the major transcriptional responses induced by lysosomal damage and their underlying signaling basis. The authors have convincingly shown that lysosomal damage activates a ubiquitination-dependent signaling axis involving TAB, TAK1, and IKK, which culminates in the activation of NF-kB and subsequent transcriptional upregulation of pro-inflammatory genes and pro-survival genes. Overall, the major aims of this study were successfully achieved.
 
 Strengths:
 
 This study is well-conceived and strictly executed, leading to clear and well-supported conclusions. Through unbiased transcriptomics and proteomics screens, the authors identified NF-kB as a major transcriptional program activated upon lysosome damage. TAK1 activation by lysosome damage-induced ubiquitination was found to be essential for NF-kB activation and MAP kinase signaling. The transcriptional and proteomic changes were shown to be largely driven by TAK1 signaling. Finally, the TAK1-IKK signaling was shown to provide resistance to apoptosis during lysosomal damage response. The main signaling axis of this pathway was convincingly demonstrated.
 
 Weaknesses:
 
 One weakness was the claim of K63-linked ubiquitination in lysosomal damage-induced NF-kB activation. While it was clear that K63 ubiquitin chains were present on damaged lysosomes, no evidence was shown in the current study to demonstrate the specific requirement of K63 ubiquitin chains in the signaling axis being studied. Clarifying the roles of K63-linked versus other types of ubiquitin chains in lysosomal damage-induced NF-kB activation may improve the mechanistic insights and overall impact of this study.
 
 Another weakness was that the main conclusions of this study were all dependent on an artificial lysosomal damage agent. It will be beneficial to confirm key findings in other contexts involving lysosomal damage.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 Endo et al. investigate the novel role of ubiquitin response upon lysosomal damage in activating cellular signaling for cell survival. The authors provide a comprehensive transcriptome and proteome analysis of aging-related cells experiencing lysosomal damage, identifying transcription factors involved in transcriptome and proteome remodeling with a focus on the NF-κB signaling pathway. They further characterized the K63-ubiquitin-TAB-TAK1-NF-κB signaling axis in controlling gene expression, inflammatory responses, and apoptotic processes.
 
 Strengths:
 
 In the aging-related model, the authors provide a comprehensive transcriptome and characterize the K63-ubiquitin-TAB-TAK1-NF-κB signaling axis. Through compelling experiments and advanced tools, they elucidate its critical role in controlling gene expression, inflammatory responses, and apoptotic processes.
 
 Weaknesses:
 
 The study lacks deeper connections with previous research, particularly:
 
 • The established role of TAB-TAK1 in AMPK activation during lysosomal damage
 
 • The potential significance of TBK1 in NF-κB signaling pathways
 
 Review 2
4. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 The response to lysosomal damage is a fast-moving and timely field. Besides repair and degradation pathways, increasing interest has focused on damage-induced signaling. The authors conducted both transcriptomics and proteomics to characterize the cellular response to lysosomal damage. They identify a signaling pathway leading to activation of NFkappaB. Based on this and supported by Western blot and microscopy data, the authors nicely show that TAB2/3 and TAK1 are activated at damaged lysosomes and kick off the pathway to alter gene expression, which induces cytokines and protects from cell death. TAB2/3 activation is proposed to occur through K63 ubiquitin chain formation. Generally, this is a careful and well-conducted study that nicely delineates the pathway under lysosomal stress. The "omics" data serves as a valuable resource for the field. More work should be invested into how TAB2/3 are activated at the damaged lysosomes, also to increase novelty in light of previous reports.
 
 Strengths:
 
 Generally, this is a careful and well-conducted study that nicely delineates the pathway under lysosomal stress. The "omics" data serves as a valuable resource for the field.
 
 Weaknesses:
 
 More work should be invested into how TAB2/3 are activated at the damaged lysosomes, also to increase novelty in light of previous reports. Moreover, different damage types should be tested to probe relevance for different pathophysiological conditions.
 
 Suggestions:
 
 (1) A recent paper claims that NFkappaB is activated by Otulin/M1 chains upon lysosome damage through TBK1 (PMID: 39744815). In contrast, Endo et al. nicely show that ubiquitylation is needed (shown by TAK-243) for NFkB activation but only have correlative data to link it specifically to K63 chains. On page 15, line 11, the authors even argue a "potential" involvement of K63. This point should be better dealt with. Can the authors specifically block K63 formation? K63R overexpression or swapping would be one way. Is the K63 ligase ITCH involved (PMID: 38503285) or any other NEDD4-like ligase? This could be compared to LUBAC inhibition. Also, the point needs to be dealt with more controversially in the discussion as these are alternative claims (M1 vs K63, TAB vs TBK1).
 
 (2) It would be interesting to know what the trigger is that induces the pathway. Lipid perturbation by LLOMe is a good model, but does activation also occur with GPN (osmotic swelling) or lipid peroxidation (oxidative stress) that may be more broadly relevant in a pathophysiological way? Moreover, what damage threshold is needed? Does loss of protons suffice? Can activation be induced with a Ca2+ agonist in the absence of damage?
 
 (3) The authors nicely define JNK and p38 activation. This should be emphasized more, possibly also in the abstract, as it may contribute to the claim of increased survival fitness.
 
 Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.26.645624v1
www.biorxiv.org www.biorxiv.org

Blue-shifted ancyromonad channelrhodopsins for multiplex optogenetics

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important study describes newly identified light-gated ion channel homologs (channelrhodopsins, ChRs) in several protist species, with a primary focus on the biophysical characterization of ChRs of ancyromonads. The authors employed a powerful combination of bioinformatics, manual and automated patch-clamp electrophysiology, absorption spectroscopy, and flash photolysis. Additionally, they evaluated the applicability of the newly discovered anion-conducting ChRs in cortical neurons of mouse brain slices and in living C. elegans worms. The evidence supporting most of the claims is convincing and this work will be of interest to the microbial rhodopsin community and neuro- and cardioscientists utilizing optogenetics in their research.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This work by Govorunova et al. identified three naturally blue-shifted channelrhodopsins (ChRs) from ancyromonads, namely AnsACR, FtACR, and NlCCR. The phylogenetic analysis places the ancyromonad ChRs in a distinct branch, highlighting their unique evolutionary origin and potential for novel applications in optogenetics. Further characterization revealed the spectral sensitivity, ionic selectivity, and kinetics of the newly discovered AnsACR, FtACR, and NlCCR. This study also offers valuable insights into the molecular mechanism underlying the function of these ChRs, including the roles of specific residues in the retinal-binding pocket. Finally, this study validated the functionality of these ChRs in both mouse brain slices (for AnsACR and FtACR) and in vivo in Caenorhabditis elegans (for AnsACR), demonstrating the versatility of these tools across different experimental systems.
  
  In summary, this work provides a potentially valuable addition to the optogenetic toolkit by identifying and characterizing novel blue-shifted ChRs with unique properties.
  
  Strengths:
  
  This study provides a thorough characterization of the biophysical properties of the ChRs and demonstrates the versatility of these tools in different ex vivo and in vivo experimental systems. The mutagenesis experiments also revealed the roles of key residues in the photoactive site that can affect the spectral and kinetic properties of the channel.
  
  Weaknesses:
  
  While the novel ChRs identified in this work are spectrally blue-shifted, there still seems to be some spectral overlap with other optogenetic tools. The authors should provide more evidence to support the claim that they can be used for multiplex optogenetics and help potential end-users assess if they can be used together with other commonly applied ChRs. Additionally, further engineering or combination with other tools may be required to achieve truly orthogonal control in multiplexed experiments.
  
  In the C. elegans experiments, partial recovery of pharyngeal pumping was observed after prolonged illumination, indicating potential adaptation. This suggests that the effectiveness of these ChRs may be limited by cellular adaptation mechanisms, which could be a drawback in long-term experiments. A thorough discussion of this challenge in the application of optogenetics tools would prove very valuable to the readership.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Govorunova et al present three new anion opsins that have potential applications in silencing neurons. They identify new opsins by scanning numerous databases for sequence homology to known opsins, focusing on anion opsins. The three opsins identified are uncommonly fast, potent, and are able to silence neuronal activity. The authors characterize numerous parameters of the opsins.
  
  Strengths:
  
  This paper follows the tradition of the Spudich lab, presenting and rigorously characterizing potentially valuable opsins. Furthermore, they explore several mutations of the identified opsins that may make them even more useful for the broader community. The opsins AnsACR and FtACR are particularly notable, having extraordinarily fast onset kinetics that could have utility in many domains. Furthermore, the authors show that AnsACR is usable in multiphoton experiments, having a peak photocurrent at a commonly used wavelength. Overall, the authors' detailed measurements and characterization make for an important resource, both presenting new opsins that may be important for future experiments, and providing characterizations to expand our understanding of opsin biophysics in general.
  
  Weaknesses:
  
  First, while the authors frequently reference GtACR1, a well-used anion opsin, there is no side-by-side data comparing these new opsins to the existing state-of-the-art. Such comparisons are very useful for researchers deciding whether to adopt new opsins.
  
  Next, multiphoton optogenetics is a promising emerging field in neuroscience, and I appreciate that the authors began to evaluate this approach with these opsins. However, a few additional comparisons are needed to establish the user viability of this approach, principally the photocurrent evoked using the 2P process, for given power densities. Comparison across the presented opsins and GtACR1 would allow readers to assess if these opsins are meaningfully activated by 2P.
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The authors aimed to develop Channelrhodopsins (ChRs), light-gated ion channels, with high potency and blue action spectra for use in multicolor (multiplex) optogenetics applications. To achieve this, they performed a bioinformatics analysis to identify ChR homologues in several protist species, focusing on ChRs from ancyromonads, which exhibited the highest photocurrents and the most blue-shifted action spectra among the tested candidates. Within the ancyromonad clade, the authors identified two new anion-conducting ChRs and one cation-conducting ChR. These were characterized in detail using a combination of manual and automated patch-clamp electrophysiology, absorption spectroscopy, and flash photolysis. The authors also explored sequence features that may explain the blue-shifted action spectra and differences in ion selectivity among closely related ChRs.
  
  Strengths:
  
  A key strength of this study is the high-quality experimental data, which were obtained using well-established techniques such as manual patch-clamp and absorption spectroscopy, complemented by modern automated patch-clamp approaches. These data convincingly support most of the claims. The newly characterized ChRs expand the optogenetics toolkit and will be of significant interest to researchers working with microbial rhodopsins, those developing new optogenetic tools, as well as neuro- and cardioscientists employing optogenetic methods.
  
  Weaknesses:
  
  This study does not exhibit major methodological weaknesses. The primary limitation of the study is that it includes only a limited number of comparisons to known ChRs, which makes it difficult to assess whether these newly discovered tools offer significant advantages over currently available options. Additionally, although the study aims to present ChRs suitable for multiplex optogenetics, the new ChRs were not tested in combination with other tools. A key requirement for multiplexed applications is not just spectral separation of the blue-shifted ChR from the red-shifted tool of interest but also sufficient sensitivity and potency under low blue-light conditions to avoid cross-activation of the respective red-shifted tool. Future work directly comparing these new ChRs with existing tools in optogenetic applications and further evaluating their multiplexing potential would help clarify their impact.
  
  Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.24.639930v1
www.biorxiv.org www.biorxiv.org

map3k1 is required for spatial restriction of progenitor differentiation in planarians

3
1. Public_Reviews 09 May 2025
 
 in eLife
 
 eLife Assessment
 
 This valuable study examines the role of map3k1, a MAP3K family member that has both kinase and ubiquitin ligase domains, in the differentiation of progenitors in the flatworm Planaria. The convincing analyses demonstrate that map3k1 acts within progenitors to restrict their premature differentiation and to prevent formation of teratomas. This work would be of interest to researchers in the fields of regeneration, developmental biology, and aging.
 
 Summary
2. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors assess the role of map3k1 in adult Planaria through whole body RNAi for various periods of time. The authors' prior work has shown that neoblasts (stem cells that can regenerate the entire body) for various tissues are intermingled in the body. Neoblasts divide to produce progenitors that migrate within a "target zone" to the "differentiated target tissues" where they differentiate into a specific cell type. Here the authors show that map3k1-i animals have ectopic eyes that form along the "normal" migration path of eye progenitors (Fig. 1), ectopic neurons and glands along the AP axis (Fig. 2) and pharynx in ectopic anterior positions (Fig. 3). The rest of the study shows that positional information is largely unaffected by loss of map3k1 (Fig. 4,5). However, loss of map3k1 leads to premature differentiation of progenitors along their normal migratory route (Fig. 6). They also show that an ill-defined "long-term" whole body depletion of map3k1 results in mis-specified organs and teratomas.
 
 Strengths:
 
 (1) The study has appropriate controls, sample sizes and statistics.
 
 (2) The work appears to be high-quality.
 
 (3) The conclusions are supported by the data.
 
 (4) Planaria is a good system to analyze the function of map3k1, which exists in mammals but not in other invertebrates.
 
 Weaknesses:
 
 (1) The paper is largely descriptive with no mechanistic insights.
 
 (2) Given the severe phenotypes of long-term depletion of map3k1, it is important that this exact timepoint is provided in the methods, figures, figure legends and results.
 
 (3) In Fig. 1C, the ectopic eyes are difficult to see; please add arrows.
 
 (4) Line 217: why does the n=2/12 animals not match the values in Fig. 3B, which are 11/12 and 12/12? The numbers don't add up. Please correct/explain.
 
 (5) Figure panels do not match what is written in the results section. There is no Fig. 6E. Please correct.
 
 Review 1
3. Public_Reviews 09 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The flatworm planarian Schmidtea mediterranea is an excellent model for understanding cell fate specification during tissue regeneration and adult tissue maintenance. Planarian stem cells, known as neoblasts, are continuously deployed to support cellular turnover and repair tissues damaged or lost due to injury. This reparative process requires great precision to recognize the location, timing, and cellular fate of a defined number of neoblast progeny. Understanding the molecular mechanisms driving this process could have important implications for regenerative medicine and enhance our understanding of how form and function are maintained in long-lived organisms such as humans. Unfortunately, the molecular basis guiding cell fate and differentiation remains poorly understood.
 
 In this manuscript, Canales et al. identified the role of the map3k1 gene in mediating the differentiation of progenitor cells at the proper target tissue. The map3k1 function in planarians appears evolutionarily conserved as it has been implicated in regulating cell proliferation, differentiation, and cell death in mammals. The results show that the downregulation of map3k1 with RNAi leads to spatial patterning defects in different tissue types, including the eye, pharynx, and the nervous system. Intriguingly, long-term map3k1-RNAi resulted in ectopic outgrowths consistent with teratomas in planarians. The findings suggest that map3k1 mediates signaling, regulating the timing and location of cellular progenitors to maintain correct patterning during adult tissue maintenance.
 
 Strengths:
 
 The authors provide an entry point to understanding molecular mechanisms regulating progenitor cell differentiation and patterning during adult tissue maintenance.
 
 The diverse set of approaches and methods applied to characterize map3k1 function strengthens the case for conserved evolutionary mechanisms in a selected number of tissue types. The creativity using transplantation experiments is commendable, and the findings with the teratoma phenotype are intriguing and worth characterizing.
 
 Weaknesses:
 
 The article presents a provocative idea related to the importance of positional control for organs and cells, which is at least in part regulated by map3k1. Nonetheless, the role of map3k1 or its potential interaction with regulators of the anterior-posterior, mediolateral axes, and PCGs is somewhat superficial. The authors could elaborate or even speculate more in the discussion section and the different scenarios incorporating these axial modulators into the map3k1 model presented in Figure 8.
 
 The article can be improved by addressing inconsistencies and adding details to the results, including the main figures and supplements. This represents one of the most significant weaknesses of this otherwise intriguing manuscript. Below are some examples of a few figures, but the authors are expected to pay close attention to the remaining figures in the paper.
 
 Details associated with the number of animals per experiment, statistical methods used, and detailed descriptions of figures appear inconsistent or lacking in almost all figures. In some instances, the percentage of animals affected by the phenotype is shown without detailing the number of animals in the experiment or the number of repeats. Figures and their legends throughout the paper lack details on what is represented and sometimes are mislabeled or unrelated. Specifically, the arrows in Figure 1A are different colors, but no reasoning is given for this, and in the same figure, the top side (1A) shows the percentages and the number of animals below. Conversely, in Figures 1B, C, and D, no details on the number of animals or percentages are shown, nor an explanation of why opsin was used in Figure 1A but not 1B. Is Figure 1B missing an image for the respective control? Figure 1C needs details regarding what the two smaller boxes underneath are. Figure 1C could use an AP labeling map at 10 days (e.g., AP6 has one optic cup present). Figure 1C and F counts do not match. In Figure 1C, we do not know the number of animals tested, the controls used, the scale bar sizes in the first two images, or the degree of magnification used, despite the pharynx region appearing magnified in the second image. Figure 1C is also shown out of chronological order; 36 days post RNAi is shown before 10 days post RNAi. Moreover, the legends for Figures 1C and 1D are swapped.
 
 Additionally, Figure 1F and many other figures throughout the paper lack overall statistical considerations. Furthermore, Figure 1F has three components, but only one is labeled. Labeling each of them individually and describing them in the corresponding figure legend may be more appropriate.
 
 Figure 2C shows images of gene expression for two genes, but the counts are shown for only one in Figure 2D. It is challenging to follow the authors' conclusions without apparent reasoning and by only displaying quantitative considerations for one case but not the other. These inconsistencies are also observed in different figures. In Figure 2D, 24/24 animals were reported to show the phenotype, but only eight were counted (is there a reason for this?). In Figure 2E, the expression for three genes is shown, with some displaying anterior and posterior regions while others only show the anterior picture. Is there a particular reason for this? Also, in Figure 2F, the counts are shown for only the posterior region of two genes out of the three displayed in Figure 2E. It is unclear why the authors do not show counts for the anterior areas considered in Figure 2E. Furthermore, the legend for Figure 2D is missing, and the legend for 2F is mislabeled as a description for Figure 2D.
 
 Supplement Figure 1B reports data up to 6 weeks, but no text in the manuscript or supplement mentions any experiment going up to 6 weeks. There are no statistics for data in Supplement Figure 1E. Any significance between groups is unclear.
 
 Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.04.641450v1
www.biorxiv.org www.biorxiv.org

A tissue boundary orchestrates the segregation of inner ear sensory organs

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important study is a first report investigating the boundary formation between sensory and non-sensory tissues of the inner ear, which has broad relevance to the developmental field in general. All three reviewers thought the results and data analyses presented are solid. However, the causal relationship between the morphological evidence and the role of Lmx1a is not well supported by the results. The mechanism linking Lmx1a to ROCK is also incomplete, considering ROCK is involved in so many processes.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This manuscript investigated the mechanism underlying boundary formation necessary for proper separation of vestibular sensory end organs. In both chick and mouse embryos, it was shown that a population of cells abutting the sensory (marked by high Sox2 expression)/nonsensory (marked by Lmx1a expression) cell populations undergoes apical expansion, elongation, alignment and basal constriction to separate the lateral crista (LC) from the utricle. Using Lmx1a mouse mutants, organ cultures, and pharmacological and viral-mediated Rock inhibition, it was demonstrated that the Lmx1a transcription factor and Rock-mediated actomyosin contractility are required for boundary formation and LC-utricle separation.
  
  Strengths:
  
  Overall, the morphometric analyses were done rigorously and revealed novel boundary cell behaviors. The requirement of Lmx1a and Rock activity in boundary formation was convincingly demonstrated.
  
  Weaknesses:
  
  However, the precise roles of Lmx1a and Rock in regulating cell behaviors during boundary formation were not clearly fleshed out. For example, phenotypic analysis of Lmx1a was rather cursory; it is unclear how Lmx1a, expressed in half of the boundary domain, controls boundary cell behaviors and prevents cell mixing between Lmx1a+ and Lmx1a- compartments. Well-established mechanisms and molecules for boundary formation were not investigated (e.g. differential adhesion via cadherins, cell repulsion via ephrin-Eph signaling). Moreover, within the boundary domain, it is unclear whether apical multicellular rosettes and basal constrictions are drivers of boundary formation, as the boundary can still form when these cell behaviors are inhibited. Involvement of other cell behaviors, such as radial cell intercalation and oriented cell division, also warrants consideration. With these lingering questions, the mechanistic advance of the present study is somewhat incremental.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Chen et al. describe the mechanisms that separate the common pan-sensory progenitor region into individual sensory patches, which presage the formation of the sensory epithelium in each of the inner ear organs. By focusing on the separation of the anterior and then lateral cristae, they find that long supra-cellular cables form at the interface of the pan-sensory domain and the forming cristae. They find that at these interfaces, the cells have a larger apical surface area, due to basal constriction, and Sox2 is down-regulated. Through analysis of Lmx1 mutants, the authors suggest that while Lmx1 is necessary for the complete segregation of the sensory organs, it is likely not necessary for the initial boundary formation, and the down-regulation of Sox2.
  
  Strengths:
  
  The manuscript adds to our knowledge and provides valuable mechanistic insight into sensory organ segregation. Of particular interest are the cell biological mechanisms: The authors show that contractility directed by ROCK is important for the maintenance of the boundary and segregation of sensory organs.
  
  Weaknesses:
  
  The manuscript would benefit from a more in-depth look at contractility - the current images of PMLC are not very convincing. Can the authors look at p or ppMLC expression in an apical view? Are they expressed in the boundary along the actin cables? Does Y-27632 inhibit this expression?
  
  The authors suggest that one role for ROCK is the basal constriction. I was a little confused about basal constriction. Are these the initial steps in the thinning of the intervening non-sensory regions between the sensory organs? What happens to the basally constricted cells as this process continues?
  
  The steps the authors explore happen after boundaries are established. This correlates with a down-regulation of Sox2, and the formation of a boundary. What is known about the expression of molecules that may underlie the apparent interfacial tension at the boundaries? Is there any evidence for differential adhesion or for Eph-Ephrin signalling? Is there a role for Notch signalling or a role for Jag1 as detailed in the group's 2017 paper?
  
  A comment on whether cellular intercalation/rearrangements may underlie some of the observed tissue changes.
  
  The change in the long axis appears to correlate with the expression of Lmx1a (Fig 5d). The authors could discuss this more. Are these changes associated with altered PCP/Vangl2 expression?
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  Lmx1a is an orthologue of apterous in flies, which is important for dorsal-ventral border formation in the wing disc. Previously, this research group has described the importance of the chicken Lmx1b in establishing the boundary between sensory and non-sensory domains in the chicken inner ear. Here, the authors described a series of cellular changes during border formation in the chicken inner ear, including alignment of cells at the apical border and concomitant constriction basally. The authors extended these observations to the mouse inner ear and showed that these morphological changes occurred at the border of Lmx1a positive and negative regions, and these changes failed to develop in Lmx1a mutants. Furthermore, the authors demonstrated that the ROCK-dependent actomyosin contractility is important for this border formation and blocking ROCK function affected epithelial basal constriction and border formation in both in vitro and in vivo systems.
  
  Strengths:
  
  The morphological changes described during border formation in the developing inner ear are interesting. Linking these changes to the function of Lmx1a and to ROCK-dependent actomyosin contractility is provocative.
  
  Weaknesses:
  
  There are several outstanding issues that need to be clarified before one can conclude that the observed morphological changes are causal to border formation and that Lmx1a and ROCK are involved.
  
  Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.03.03.482809v2
www.biorxiv.org

Identification and classification of ion-channels across the tree of life: Insights into understudied CALHM channels

3
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  The first part of this manuscript describes an interdisciplinary approach to mine the human channelome and discover further ion channel orthologues across diverse organisms. Although the findings and data curation enabled by the new approach are valuable to the ion channel community, as well as to those interested in improved methods for mining sequence space for their protein of interest, this part of the work is incomplete because critical methodological information is missing. Further validation of the improvements this approach shows over others is needed. The second part of the manuscript utilizes the approach described in the first part to delineate co-conserved amino acid patterns in CALHM channels, but the evidence provided to support the role of the identified residues in channel gating is currently inadequate.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this manuscript, Taujale et al describe an interdisciplinary approach to mine the human channelome and further discover orthologues across diverse organisms, culminating in delineating co-conserved patterns in an example ion channel: CALHM. Overall, this paper comes in two sections, one where 419 human ion channels and 48,000+ channels from diverse organisms are found through a multidisciplinary data mining approach, and a second where this data is used to find co-conserved sequences, whose functional significance is validated via experiments on CALHM1 and CALHM6. Overall, this is an intriguing data-first approach to better understand even understudied ion channels like CALHM6. However, more needs to be done to pull this story together into a single, coherent narrative.
  
  Strengths:
  
  This manuscript takes advantage of modern-day LLM tools to better mine the literature for ion channel sequences in humans and other species with orthologous ion channel sequences. The authors explore the 'dark channelome' of understudied ion channels to better reveal the information evolution has to tell us about our own proteins, and illustrate the information this provides access to in experimental studies in the final section of the paper. Finally, they provide a wealth of information in the supplementary tables (in the form of Excel spreadsheets) for others to explore. Overall, this is a creative approach to a wide-reaching problem that can be applied to other families of proteins.
  
  Weaknesses:
  
  Overall, while a considerable amount of work has been done for this manuscript, the presentation, both in terms of writing and figures, leaves much to be desired. One can imagine a story that clearly describes the need for a better-curated sequence database of ion channels, and clearly describes how existing resources fall short, but here this is not very clearly illustrated.
  
  One question that arises with the part of the manuscript that discusses the identification and classification of ion channels is whether they plan to make these sequences available to the wider public. For the 419 human sequences, making a small database to share this result so that these sequences can be easily searched and downloaded would be desirable. There are a variety of acceptable formats for this: GitHub/figshare/zenodo/university website that allows a wider community to access their hard work. The authors have included enough information in the supplementary tables that this could be done by a motivated reader, but providing such a resource would greatly expand the impact of this paper. The same question can be asked of the 48,000+ ion channels from diverse organisms. For these, one even worries that they may not be properly sequenced genes. What checks have been done to confirm this? UniProt contains a good deal of unreviewed sequences, especially from single-celled organisms. Potentially, this is covered in the sentence in the Methods: "Finally, the results obtained from both the full-length and pore domains were retained as true orthologous relationships to remove extraneous hits." But this process could be discussed in more detail, clearly illustrating that the risk of gene duplicates and fragments in this final set of ion channel orthologues has been avoided. Related to this, does this analysis include or exclude isoforms?
  
  Another aspect of the identification and classification of ion channel genes that could be improved is the figures for this section. One is relatively used to seeing trees as shown in Figures 3 and 4, which show relationships between genes as distances or evolutionary relationships. The decision to show the families of ion channels in Figure 1 as pie charts within a UMAP embedding is intriguing but somewhat non-intuitive and difficult to understand. Illustrating these results with a standard tree-like visualization of the relationship of these channels to each other would be preferred.
  
  One aspect of the pie-chart/UMAP visualization that works well is the highlighting of the 'dark' ion channels according to the status as designated by IDG, which highlights a strength of this whole paper. However, throughout the paper, this could be emphasized more as the key advantage of this approach and how this or similar approaches could be used for other families of proteins. Specifically, in the initial statement describing 'light' vs 'dark channels', the importance of this distinction and the historical preference in science to study that which has already been studied can be discussed more, even including references to other studies that take this kind of approach. An example of a relevant reference here is to the Structural Genomics Consortium and its goals to achieve structures of proteins for which functions may not be well-characterized. Furthermore, this initial statement mentioning 'light channels' was initially confusing -- does this mean light-sensing channels? As one reads on this is clearly not the case, but for such an important central focus of this paper, these kinds of misunderstandings do not serve the authors well. Clarifying these motivations throughout the entire paper would strengthen it considerably.
  
  Additionally, since the authors have generated this UMAP visualization, it would be interesting to understand how the human vs orthologue gene sets compare in this space. Furthermore, Figure 1, for just the human analysis, should say more clearly that this is an analysis of the human gene set and include more of the information in the text: 419 human ion channel sequences, 75 sequences previously unidentified, 4 major groups and 55 families, 62 outliers, etc. Clearer visualizations of these categories and numbers within the UMAP (and newly included tree) visualization would help guide the reader to better understand these results.
  
  One of the most peculiar aspects of this paper is that it feels like two papers, one about better documenting the ion channel genes across species, and another with well-executed experiments on CALHM channels. One suggestion for how to link these two sections together better is to show that previous methods to analyze conserved residues in CALHM were significantly lacking. What results would that give? Why was this not enough? Were there just not enough identified CALHM orthologues to give strong signals in conservation analysis?
  
  Some of the analysis pipeline is unclear. Specifically, the RAG analysis seems critical, but it is unclear how this works - is it on top of the GPT framework and recursively inquires about the answer to prompts? Some example prompts would be useful to understand this. Furthermore, the existence of 76 auxiliary non-pore containing 'ion channel' genes in this analysis is a little confusing, as it seems a part of the pipeline is looking for pore-lining residues. Furthermore, how many of these are picked up in the larger orthologues search? Are these harder to perform checks on to ensure that they are indeed ion channel genes? A further discussion of the choice to include these auxiliary sequences would be relevant. This could just be further discussion of the literature that has decided to do this in the past.
  
  Overall, this manuscript is a valuable contribution to the field, but it requires a few main things to make it truly useful. Namely, how has this approach really improved the ability to identify conserved residues over a less-involved approach? A better description of their methods and results is required in the first section of the paper, as well as some cosmetic improvements.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this paper, the authors defined the "channelome," consisting of 419 predicted human ion channels as well as 48,000 ion channel orthologs from other organisms. Using this information, the ion channels were clustered into groups, which can potentially be used to make predictions about understudied ion channels in the groups. The authors then focused on the CALHM ion channel family, mutating conserved residues and assessing channel function.
  
  Strengths:
  
  The curation of the channelome provides an excellent resource for researchers studying ion channels. Supplemental Table 1 is well organized with an abundance of useful information.
  
  Weaknesses:
  
  There are substantial concerns regarding the analysis of the CALHM channels as detailed below.
  
  (1) There are significant problems with the methodology used for the electrophysiology studies. The pulse protocol used to assess the current-voltage relationship (-100 to +140 mV) extends far beyond the physiological range; currents for the mutant channels were only assessed at +120 mV. It is also unclear why a holding potential of 0 mV was used for CALHM6 recordings; the channel is already open at this voltage (and in Figure 4, only n = 3 for CALHM6). Further, proper controls were not performed. Inhibitors such as Gd3+ can be used to ensure that only CALHM currents are being measured.
  
  (2) In line 334, the authors state that "expression levels of wild-type proteins and mutants are comparable." However, Western blots showing CALHM protein abundance (Supplementary Figure 3) are not of acceptable quality - in the top blot, WT CALHM1 can't even be seen. Representative blots were not shown for all mutants, and there was no effort to determine if levels were statistically significant compared to the wild-type control. Even if there is more or less protein, what does this mean? The protein could be in an intracellular compartment and not at the plasma membrane. In mammalian cells, CALHM6 is localized to intracellular compartments and only translocates to the plasma membrane upon activating stimulus (Danielli et al, EMBO J, 2023). Thus, if CALHM6 is only intracellular, the protein amount would not change, but the measured current would. Abundant intracellular CALHM1 has also been observed in mammalian cells transfected with this protein (Dreses-Werringloer et al., Cell, 2008). The best way to determine if mutations impact CALHM channel localization is to express GFP-tagged constructs in Xenopus oocytes and look for surface expression.
  
  (3) Since the authors have not definitively shown that there are no defects in localization, they cannot make the claim in lines 346-356 that the mutations "either abolished or markedly reduced channel activity." Further, from their data, there is speculation regarding how these residues impact conformational changes during channel opening and closing. Line 404 - again, there is no concrete evidence that any of these residues play a role in gating function. Lines 406-433 - this entire paragraph is speculation without data to back it up. There is also a lack of specificity with statements such as "all mutants showed either reduced or completely abolished activity." What is meant by activity? Do the authors mean conductance?
  
  (4) Line 303 - 13 aligned amino acids were conserved across all CALHM homologs - are these also aligned in the related connexin and pannexin families? It is likely that the cysteines and proline in TM2 are, since CALHM channels overall share a lot of similarities with connexins and pannexins (Siebert et al, JBC, 2013). As in line 207, it would be expected that pannexins, connexins, and CALHM channel families would group together. Related to this, see Line 406 - in connexins, there is also a proline kink in TM2 that may play a role in mediating conformational changes between channel states (Ri et al, Biophysical Journal, 1999).
  
  Review 2

Tags

Summary

Review 1

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.10.637530v1
www.biorxiv.org

Domain Coupling in Allosteric Regulation of SthK Measured Using Time-Resolved Transition Metal Ion FRET

4
1. Public_Reviews 09 May 2025
  
  in eLife
  
  eLife Assessment
  
  This useful work employs transition-metal FRET (tmFRET) to study the cyclic nucleotide binding domain (CNBD) of a bacterial ion channel. The authors employ lifetime measurements of fluorescence to extend their own prior study and observe distance changes within the CNBD domains of a full-length channel; they base these measurements on changes in lifetimes due to tmFRET between a metal at an introduced chelator site and a fluorescent non-canonical amino acid at another site within the channel sequence. This allows the authors to show that coupling of the CNBDs to the rest of the channel stabilizes the CNBDs in their active state relative to an isolated CNBD construct. The data are compelling and of high quality, and support the authors' conclusions.
  
  Summary
2. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This useful work extends a prior study from the authors to observe distance changes within the CNBD domains of a full-length CNG channel based on changes in single photon lifetimes due to tmFRET between a metal at an introduced chelator site and a fluorescent non-canonical amino acid at another site. The data are excellent and convincingly support the authors' conclusions. The methodology is of general use for other proteins. The authors also show that coupling of the CNBDs to the rest of the channel stabilizes the CNBDs in their active state, relative to an isolated CNBD construct.
  
  Strengths:
  
  The manuscript is very well written and clear.
  
  Review 1
3. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  The manuscript "Domain Coupling in Allosteric Regulation of SthK Measured Using Time-Resolved Transition Metal Ion FRET" by Eggan et al. investigates the energetics of conformational transitions in the cyclic nucleotide-gated (CNG) channel SthK. This lab pioneered transition metal FRET (tmFRET), which has previously provided detailed insights into ion channel conformational changes. Here, the authors analyze tmFRET fluorescence lifetime measurements in the time domain, yielding detailed insights into conformational transitions within the cyclic nucleotide binding domains (CNBDs) of the channel. The integration of tmFRET with time-correlated single-photon counting (TCSPC) represents an advancement of this technique.
  
  The results summarize known conformational transitions of the C-helix and provide distance distributions that agree with predicted values based on available structures. The authors first validated their TCSPC approach using the isolated CNBD construct previously employed for similar experiments. They then study the more complex full-length SthK channel protein. The findings agree with earlier results from this group, demonstrating that the C-helix is more mobile in the closed state than static structures reflect. Upon adding the activating ligand cAMP, the C-helix moves closer to the bound ligand, as indicated by a reduced fluorescence lifetime, suggesting a shorter distance between the donor and acceptor. The observed effects depend on the cAMP concentration, with affinities comparable to functional measurements. Interestingly, a substantial amount of CNBDs appear to be in the activated state even in the absence of cAMP (Figure 6E and F, fA2 ~ 0.4).
  
  This may be attributed to cooperativity among the CNBDs, which the authors could elaborate on further. In this context, the major limitation of this study is that distance distributions are observed only in one domain. While inter-subunit FRET is detected and accounted for, the results focus exclusively on movements within one domain. Thus, the resulting energetic considerations must be assessed with caution. In the absence of the activator, the closed state is favored, while the presence of cAMP favors the open state. This quantifies the standard assumption; otherwise, an activator would not effectively activate the channel. However, the numerical values of approximately 3 kcal/mol are limited by the fact that only one domain is observed in the experiment, and only one distance (C-helix relative to the CNBD) is probed. Additional conformational changes leading to pore opening (including rotation and upward movement of the CNBD, and radial dilation of the tetrameric assembly) are not captured by the current experiments. These limitations should be taken into account when interpreting the results.
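  For readers who want to relate the state fractions and free energies mentioned above, the following is a minimal sketch of the standard two-state relation ΔG = -RT ln(f / (1 - f)). It is not the authors' analysis: the two-state assumption, the temperature of 298.15 K, and the function names are all illustrative choices made here.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL = 1.987204e-3


def delta_g_from_fraction(f_active, temp_k=298.15):
    """Free energy (kcal/mol) of the resting -> active transition.

    Assumes a simple two-state equilibrium (an assumption of this sketch),
    where f_active is the fraction of domains in the active state.
    Positive values mean the resting state is favored.
    """
    if not 0.0 < f_active < 1.0:
        raise ValueError("f_active must lie strictly between 0 and 1")
    k_eq = f_active / (1.0 - f_active)
    return -R_KCAL * temp_k * math.log(k_eq)


def fraction_from_delta_g(delta_g, temp_k=298.15):
    """Inverse relation: active-state fraction for a given free energy."""
    k_eq = math.exp(-delta_g / (R_KCAL * temp_k))
    return k_eq / (1.0 + k_eq)


if __name__ == "__main__":
    # An active fraction of ~0.4 corresponds to a small positive free energy.
    print(round(delta_g_from_fraction(0.4), 2))
    # A stabilization of 3 kcal/mol corresponds to a nearly saturated state.
    print(round(fraction_from_delta_g(-3.0), 3))
```

  Under these assumptions, an active fraction of about 0.4 corresponds to roughly +0.24 kcal/mol, whereas a free energy of -3 kcal/mol corresponds to more than 99% occupancy of the favored state. The caveats raised above about probing a single domain and a single distance apply equally to any such conversion.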
  
  Review 2
4. Public_Reviews 09 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  This is a lucidly written manuscript describing the use of transition-metal FRET to assess distance changes during functional conformational changes in a CNG channel. The experiments were performed on an isolated C-terminal nucleotide binding domain (CNBD) and on a purified full-length channel, with FRET partners placed at two positions in the CNBD.
  
  Strengths:
  
  The data and quantitative analysis are exemplary, and they provide a roadmap for use of this powerful approach in other proteins.
  
  Weaknesses/Comments:
  
  A ~3x lower Kd for nucleotide is seen for the detergent-solubilized full-length channel, compared to electrophysiological experiments. This is worth a comment in the Discussion, particularly in the context of the effect of the pore domain on the CNBD energetics.
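  To put the roughly threefold difference in perspective, a ratio of dissociation constants can be converted to a binding free-energy difference with ΔΔG = RT ln(Kd_a / Kd_b). The sketch below assumes 298.15 K and treats both values as true dissociation constants, which may not hold for an apparent affinity derived from electrophysiology; the function name is hypothetical.

```python
import math

# Gas constant in kcal/(mol*K).
R_KCAL = 1.987204e-3


def ddg_from_kd_ratio(kd_ratio, temp_k=298.15):
    """Binding free-energy difference (kcal/mol) for a ratio Kd_a / Kd_b.

    Negative values mean condition a binds more tightly than condition b.
    """
    if kd_ratio <= 0.0:
        raise ValueError("kd_ratio must be positive")
    return R_KCAL * temp_k * math.log(kd_ratio)


if __name__ == "__main__":
    # A threefold lower Kd (ratio of 1/3) under the stated assumptions.
    print(round(ddg_from_kd_ratio(1.0 / 3.0), 2))
```

  Under these assumptions, a threefold lower Kd corresponds to about -0.65 kcal/mol, a modest shift compared with the approximately 3 kcal/mol discussed by Reviewer #2.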
  
  Review 3

Tags

Summary

Review 1

Review 2

Review 3

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.31.646362v1
www.biorxiv.org

Memory at your fingertips: how viscoelasticity affects tactile neuron signaling

4
1. Public_Reviews 08 May 2025
 
 in eLife
 
 eLife Assessment
 
 The fundamental findings reported here provide insight into how the viscoelasticity of the fingertip skin influences the activity of mechanoreceptive afferents and thus the neural coding of force in humans. The basic principle studied was whether and to what extent the previous applied force directions impact the firing of FA-1, SA-1 and SA-2 neurons during the current applied force directions. The data and analyses are compelling and will be helpful for modeling the neural representations of force in the context of object grasping and manipulation.
 
 Summary
2. Public_Reviews 08 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The authors investigate how the viscoelasticity of the fingertip skin can affect the firing of mechanoreceptive afferents and they find a clear effect of recent physical skin state (memory), which is different between afferents. The manuscript is extremely well-written and well-presented. It uses a large dataset of low threshold mechanoreceptive afferents in the fingertip, where it is particularly noteworthy that the SA-2s have been thoroughly analyzed and play an important role here. They point out in the introduction the importance of the non-linear dynamics of the event when an external stimulus contacts the skin, to the point at which this information is picked up by receptors. Although clearly correlated, these are different processes, and it has been very well-explained throughout. I have some comments and ideas that the authors could think about that could further improve their already very interesting paper. Overall, the authors have more than achieved their aims, where their results very much support the conclusions and provoke many further questions. This impact of the previous dynamics of skin affecting current state can be explored further in so many ways and may help us in understanding skin aging and the effects of anatomical changes of the skin better.
 
 Comments on revised submission:
 
 The authors have taken all my considerations into account and provided excellent responses to them. They have modified their paper accordingly, which improves its clarity even more. Very interesting work and I have no further comments.
 
 Review 1
3. Public_Reviews 08 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The authors sought to identify the impact skin viscoelasticity has on neural signalling of contact forces that are representative of those experienced during normal tactile behaviour. The evidence presented in the analyses indicates there is a clear effect of viscoelasticity on the imposed skin movements from a force-controlled stimulus. Both skin mechanics and evoked afferent firing were affected by prior stimulation, which has not previously been thoroughly explored. This study outlines that viscoelastic effects have an important impact on encoding in the tactile system, which should be considered in the design and interpretation of future studies. Viscoelasticity was shown to affect the mechanical skin deflections and stresses/strains imposed by previous and current interaction force, and also the resultant neuronal signalling. The result of this was an impaired coding of contact forces based upon previous stimulation. The authors may be able to strengthen their findings by using the existing data to further explore the link between skin mechanics and neural signalling, giving a clearer picture than demonstrating shared variability. This is not a critical addition, but I believe it would strengthen the work and make it more generally applicable.
 
 Strengths:
 
 - Elegant design of the study. Direct measurements have been made from the tactile sensory neurons to give detailed information on touch encoding. Experiments have been well designed and the forces/displacements have been thoroughly controlled and measured to give accurate measurements of global skin mechanics during a set of controlled mechanical stimuli.

 - Analytical techniques used. Analysis of fundamental information coding and information representation in the sensory afferents reveals dynamic coding properties to develop putative models of the neural representation of force. This advanced analysis method has been applied to a large dataset to study neural encoding of force, the temporal dynamics of this, and the variability in this.

 Weaknesses:

 - Lack of exploration of the variation in neural responses. Although there is a viscoelastic effect which produces variability in the stimulus effects based on prior stimulation, it is a shame that the variability in neural firing and force induced skin displacements have been presented, and are similarly variable, but there has been no investigation of a link between the two. I believe with these data the authors can go beyond demonstrating shared variability. The force per se is clearly not faithfully represented in the neural signal, being masked by stimulation history, and it is of interest if the underlying resultant contact mechanics are.
 
 Validity of conclusions:
 
 The authors have succeeded in demonstrating that skin viscoelasticity has an impact on skin contact mechanics at a given force and that this affects the resultant neural coding of force. Their study has been well designed and the results support their conclusions. The importance and scope of the work are adequately outlined for readers to interpret the results and significance.
 
 Impact:
 
 This study will have important implications for future studies performing tactile stimulation and evaluating tactile feedback during motor control tasks. In detailed studies of tactile function, it illustrates the necessity to measure skin contact dynamics to properly understand the effects of a force stimulus on the skin and mechanoreceptors.
 
 Review 2
4. Public_Reviews 08 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Public Reviews:
 
 Reviewer #1 (Public Review):
 
 The authors investigate how the viscoelasticity of the fingertip skin can affect the firing of mechanoreceptive afferents and they find a clear effect of recent physical skin state (memory), which is different between afferents. The manuscript is extremely well-written and well-presented. It uses a large dataset of low threshold mechanoreceptive afferents in the fingertip, where it is particularly noteworthy that the SA-2s have been thoroughly analyzed and play an important role here. They point out in the introduction the importance of the non-linear dynamics of the event when an external stimulus contacts the skin, to the point at which this information is picked up by receptors. Although clearly correlated, these are different processes, and it has been very well-explained throughout. I have some comments and ideas that the authors could think about that could further improve their already very interesting paper. Overall, the authors have more than achieved their aims, where their results very much support the conclusions and provoke many further questions. This impact of the previous dynamics of the skin affecting the current state can be explored further in so many ways and may help us to better understand skin aging and the effects of anatomical changes of the skin.
 
 At the beginning of the Results, it states that FA-2s were not considered as stimuli did not contain mechanical events with frequency components high enough to reliably excite them. Was this really the case, did the authors test any of the FA-2s from the larger dataset? If FA-2s were not at all activated, this is also relevant information for the brain to signal that it is not a relevant Pacinian stimulus (as they respond to everything). Further, afferent receptive fields that were more distant to the stimulus were included, which likely fired very little, like the FA-2s, so why not consider them even if their contribution was low?
 
 Thank you for bringing this up, we have now clarified in the text that while FA-2s did respond at a low rate during the experiment, their responses were not reliably driven by the force stimuli. In the Methods section we have included the following text:
 
 “Initially, 10 FA-2 neurons were also included in the analysis. But their responsiveness during the experiment was remarkably low, and unlike the other neuron types, their responses were rarely affected by force stimuli. Specifically, only one of the observed FA-2 neurons responded during the force protraction phases. Due to the lack of clear stimulus-driven responses, FA-2 neurons were subsequently excluded from further analysis.”
 
 One question that I wondered throughout was whether you have looked at further past history in stimulation, i.e. not just the preceding stimulus, but 2 or 3 stimuli back? It would be interesting to know if there is any ongoing change that can be related back further. I do not think you would see anything as such here, but it would be interesting to test and/or explore in future work (e.g. especially with sticky, forceful, or sharp indentation touch). However, even here, it could be that certain directions gave more effects.
 
 This is a very interesting question! A discernible effect from the previous stimulus could persist at the end of the current stimulation (see Figure 4C), potentially influencing the next one—a 2-stimuli-back effect. Unfortunately, our experimental design did not allow for rigorous testing of this effect. While all possible pairs of stimulus directions were included in immediately consecutive trials, this was not the case for pairs separated by additional trials. Hence, the combination of a likely weak effect and limited variation in history precluded a thorough analysis of a 2-stimuli-back effect. Future work should delve into the time course of the viscoelastic effect in greater detail.
 
 Did the authors analyze or take into account the difference between receptive field locations? For example, did afferents more on the sides have lower responses and a lesser effect of history?
 
 An investigation into the potential impact of the relationship between the receptive field location on the fingertip skin and the primary contact site of the stimulus surface revealed no discernible influence for SA-1 and SA-2 neurons. In contrast, FA-1 neurons, particularly those predominantly sensitive to the previous stimulation or displaying mixed sensitivity, exhibited a tendency to terminate near the primary stimulation site. We have added these observations to the text:
 
 “We found no straightforward relationship between a neuron's sensitivity to current and previous stimulation and its termination site in fingertip skin. Specifically, there was no statistically significant effect of the distance between a neuron's receptive field center and the primary contact site of the stimulus surface on whether neurons signaled current, prior, or mixed information for SA-1 (Kruskal-Wallis test H(2)=3.86, p= 0.15) or SA-2 neurons (H(2)=0.75, p=0.69). However, a significant difference emerged for FA-1 neurons (H(2)=8.66, p=0.01), indicating that neurons terminating closer to the stimulation site on the flat part of the fingertip were more likely to signal past or mixed information.”
 
 Was there anything different in the firing patterns between the spontaneous and non-spontaneously active SA-2s? For example, did the non-spontaneous show more dynamic responses?
 
 The firing patterns of both spontaneously and non-spontaneously active SA-2 neurons shared similarities in terms of adaptation and range of firing rate modulation in response to force stimuli, i.e., ‘dynamic response’. The distinction lay in the pattern of modulation of the firing rate associated with stimulus presentations. For spontaneously active SA-2 neurons, this modulation occurred around a significant background discharge, implying that a force stimulus could either decrease or increase the firing rate, depending on how it deformed the fingertip. This characteristic is well illustrated by the firing pattern of the neuron depicted in the lower panels of Figure 3D. Conversely, in non-spontaneously active SA-2 neurons, a force stimulus could only induce an increase in the firing rate or no change. Although the neuron depicted in the upper panels of Figure 3D exhibited some background activity, it serves to exemplify this characteristic. In the text, we have elucidated the dynamics of the SA-2 neuron response by highlighting that force stimulation can either decrease or increase the firing rate in neurons with spontaneous activity through the following addition/change:
 
 “This increased variability was most evident during the force protraction phase where most neurons exhibited the most intense responses. Increased variability was also observed in instances where the dynamic response to force stimulation involved a decrease in the firing rate (lower panels of Figure 3D). This phenomenon was observed in SA-2 neurons that maintained an ongoing discharge during intertrial periods (cf. Fig. 2A). In these cases, the response to a force stimulus constituted a modulation of the firing rate around the background discharge, signifying that a force stimulus could either decrease or increase the firing rate depending on the prevailing stimulus direction.”
 
 Were the spontaneously active SA-2 afferents firing all the time or did they have periods of rest - and did this relate to recent stimulation? Were the spontaneously active SA-2s located in a certain part of the finger (e.g. nail) or were they randomly spread throughout the fingertip? Any distribution differences could indicate a more complicated role in skin sensing.
 
 SA-2 neurons, in general, are well-known for undergoing significant post-stimulation depression (e.g., Knibestöl and Vallbo, 1970; Chambers et al., 1972; Burgess and Perl, 1973). In our force stimulations, this post-excitatory depression manifested as a reduced or absent response during the latter part of the stimulus retraction period for stimuli in directions that markedly excited the neuron. The excitability recovered when the fingertip relaxed during the subsequent intertrial period, and for "spontaneously active" neurons, the firing resumed (see examples in Figure 7A). Furthermore, some “spontaneously active” neurons could be silenced or exhibit a near-silent period during force stimulation for certain force directions, while the spontaneous firing returned during the upcoming intertrial period when the fingertip shape recovered (for example, see responses to stimulation in the proximal and especially ulnar directions in the top panel in Figure 7A).
 
 Regarding the location of the receptive field centres of spontaneously active and non-spontaneously active SA-2 neurons on the fingertip we did not observe any obvious spatial segregation. To illustrate this, we have revised Figure 1A by color-marking SA-2 neurons that exhibited ongoing activity in intertrial periods, and the figure caption has been modified accordingly:
 
 “Figure 1. Experimental setup. A. Receptive field center locations shown on a standardized fingertip for all first-order tactile neurons included in the study, categorized by neuron type. Purple symbols denote spontaneously active SA-2 neurons exhibiting ongoing activity without external stimulation.”
 
 Did the authors look to see if the spontaneous firing in SA-2s between trials could predict the extent to which the type 1 afferents encode the proceeding stimulus? Basically, does the SA-2 state relate to how the type 1 units fire?
 
 We found no clear indications that the responses of FA-1 and SA-1 could be readily anticipated based on the firing patterns of SA-2 neurons.
 
 In the discussion, it is stated that "the viscoelastic memory of the preceding loading would have modulated the pattern of strain changes in the fingertip differently depending on where their receptor organs are situated in the fingertip". Can the authors expand on this or make any predictions about the size of the memory effect and the distance from the point of stimulation?
 
 We have explored this topic further in the text, referring to recent studies modeling essential aspects of fingertip mechanics. However, in our view, current models lack the capability to predict the specific nature sought by the reviewer. These models should include a detailed understanding of the intricate networks of collagen fibers anchoring the pulp tissue at the distal phalangeal bone and the nail. They should also consider potential inherent directional preferences of the receptor organs, attributed to their microanatomy. The text modifications are as follows:
 
 “In addition to the receptor organ locations, the variation in sensitivity among neurons to fingertip deformations in response to both previous and current loadings would stem from the fingertip’s geometry and its complex composite material properties. Possible inherent directional preferences of the receptor organs, attributed to their microanatomy, could also be significant. However, mechanical anisotropy, particularly within the viscoelastic subcutaneous tissue of the fingertip induced by intricately oriented collagen fiber strands forming fat columns in the pulp (Hauck et al., 2004), is likely to play a crucial role. This anisotropy would shape the dynamic pattern of strain changes at neurons' receptor sites, intricately influencing a neuron's sensitivity not only to current but also to preceding loadings. Indeed, recent modeling efforts suggest that such mechanical anisotropy strongly influences the spatiotemporal distribution of stresses and strains across the fingertip (Duprez et al., 2024).”
 
 Relatedly, we have included additional text to provide a more comprehensive explanation of the “bulk deformation” of the fingertip that occurs during the loadings:
 
 “As pressure increases in the pulp, the pulp tissue bulges at the end and sides of the fingertip. Simultaneously, the tangential force component amplifies the bulging in the direction of the force while stretching the skin on the opposite side.”
 
 In the discussion, it would be good if the authors could briefly comment more on the diversity of the mechanoreceptive afferent firing and why this may be useful to the system.
 
 The diversity in responses among neurons is instrumental in enhancing the information transmitted to the brain by averting redundancy in information acquisition. This diversity thereby contributes to an overall increase in information. We've included a brief statement, along with several references, underscoring this concept:
 
 "The resulting diversity in the sensitivities of neurons might enhance the overall information collected and relayed to the brain by the neuronal population, facilitating the discrimination between tactile stimuli or mechanical states of the fingertip (see Rongala et al., 2024; Corniani et al., 2022; Tummala et al., 2023, for more extensive explorations of this idea)."
 
 Also, the authors could briefly discuss why this memory (or recency) effect occurs - is it useful, does it serve a purpose, or it is just a by-product of our skin structure? There are examples of memory in the other senses where comparisons could be drawn. Is it like stimulus adaptation effects in the other senses (e.g. aftereffects of visual motion)?
 
 We have expanded the concluding paragraph of the discussion, specifically delving into the question of whether the mechanical memory effect serves a deliberate purpose or is simply an incidental byproduct of our skin structure:
 
 “In any case, the viscoelastic deformability of the fingertips plays a pivotal role in supporting the diverse functions of the fingers. For example, it allows for cushioned contact with objects featuring hard surfaces and allows the skin to conform to object shapes, enabling the extraction of tactile information about objects' 3D shapes and fine surface properties. Moreover, deformability is essential for the effective grasping and manipulation of objects. This is achieved, among other benefits, by expanding the contact surface, thereby reducing local pressure on the skin under stronger forces and enabling tactile signaling of friction conditions within the contact surface for control of grasp stability. Throughout, continuous acquisition of information about various aspects of the current state of the fingertip and its skin by tactile neurons is essential for the functional interaction between the brain and the fingers. In light of this, the viscoelastic memory effect on tactile signaling of fingertip forces can be perceived as a by-product of an overall optimization process within prevailing biological constraints.”
 
 One point that would be nice to add to the discussion is the implications of the work for skin sensing. What would you predict for the time constant of relaxation of fingertip skin, how long could these skin memory effects last? Two main points to address here may be how the hydration of the skin and anatomical skin changes related to aging affect the results. If the skin is less viscoelastic, what would be the implications for the firing of mechanoreceptors?
 
 It is likely that the time constant depends to some extent on mechanical factors of the skin, which will likely change due to age or environmental factors. However, while these questions are intriguing, they fall outside the scope of the current study and we are not aware of studies that have addressed these issues directly in experiments either.
 
 How long does it take for the effect to end? Again, this will likely depend on the skin's viscoelasticity. However, could the authors use it in a psychophysical paradigm to predict whether participants would be more or less sensitive to future stimuli? In this way, it would be possible to test whether the direction modifies touch perception.
 
 Time constants for tissue viscoelasticity have been estimated to extend up to several seconds (see citations in the introduction). While direct perceptual effects could indeed be explored through psychophysical experimental paradigms, we are currently unaware of any studies specifically addressing the type of effect described in this study. In addition to the statement that, concerning manipulation and haptic tasks, "to our knowledge, a possible influence of fingertip viscoelasticity on task performance has not been systematically investigated," we have now also addressed tactile psychophysical tasks conducted during passive touch with the following sentence in the text:
 
 “Similarly, there is a lack of systematic investigation of potential effects of fingertip viscoelasticity on performance in tactile psychophysical tasks conducted during passive touch.”
 
 Reviewer #2 (Public Review):
 
 Summary:
 
 The authors sought to identify the impact skin viscoelasticity has on neural signalling of contact forces that are representative of those experienced during normal tactile behaviour. The evidence presented in the analyses indicates there is a clear effect of viscoelasticity on the imposed skin movements from a force-controlled stimulus. Both skin mechanics and evoked afferent firing were affected based on prior stimulation, which has not previously been thoroughly explored. This study outlines that viscoelastic effects have an important impact on encoding in the tactile system, which should be considered in the design and interpretation of future studies. Viscoelasticity was shown to affect the mechanical skin deflections and stresses/strains imposed by previous and current interaction force, and also the resultant neuronal signalling. The result of this was an impaired coding of contact forces based on previous stimulation. The authors may be able to strengthen their findings, by using the existing data to further explore the link between skin mechanics and neural signalling, giving a clearer picture than demonstrating shared variability. This is not a critical addition, but I believe would strengthen the work and make it more generally applicable.
 
 Strengths:
 
 - Elegant design of the study. Direct measurements have been made from the tactile sensory neurons to give detailed information on touch encoding. Experiments have been well designed and the forces/displacements have been thoroughly controlled and measured to give accurate measurements of global skin mechanics during a set of controlled mechanical stimuli.
 
 - Analytical techniques used. Analysis of fundamental information coding and information representation in the sensory afferents reveals dynamic coding properties to develop putative models of the neural representation of force. This advanced analysis method has been applied to a large dataset to study neural encoding of force, the temporal dynamics of this, and the variability in this.
 
 Weaknesses:
 
 - Lack of exploration of the variation in neural responses. Although there is a viscoelastic effect that produces variability in the stimulus effects based on prior stimulation, it is a shame that the variability in neural firing and force-induced skin displacements have been presented, and are similarly variable, but there has been no investigation of a link between the two. I believe with these data the authors can go beyond demonstrating shared variability. The force per se is clearly not faithfully represented in the neural signal, being masked by stimulation history, and it is of interest if the underlying resultant contact mechanics are.
 
 Thank you for this suggestion. We have added a new section investigating the link between skin deformation and neural firing in more depth via a simple neural model. Please see our answer below in the ‘Recommendations’ section for further details.
 
 Validity of conclusions:
 
 The authors have succeeded in demonstrating skin viscoelasticity has an impact on skin contact mechanics with a given force and that this impacts the resultant neural coding of force. Their study has been well-designed and the results support their conclusions. The importance and scope of the work is adequately outlined for readers to interpret the results and significance.
 
 Impact:
 
 This study will have important implications for future studies performing tactile stimulation and evaluating tactile feedback during motor control tasks. In detailed studies of tactile function, it illustrates the necessity to measure skin contact dynamics to properly understand the effects of a force stimulus on the skin and mechanoreceptors.
 
 Recommendations for the authors:
 
 Reviewer #1 (Recommendations For The Authors):
 
 (Very) minor comments
 
 - The authors say at the beginning of the Results that, "The fourth type of tactile neurons in the human glabrous skin, fast adapting type II neurons...". Although generally written that there are four types of afferent in the glabrous skin, it would be better to state that these are low-threshold A-beta myelinated mechanoreceptive afferents, at least one time, as there are other types of afferent in the glabrous skin that respond to mechanical stimulation (e.g. low and high threshold C-fibers).
 
 This is now clarified at the start of the Results section:
 
 “We recorded action potentials in the median nerve of individual low-threshold A-beta myelinated first-order human tactile neurons innervating the glabrous skin of the fingertip…”
 
 - Fig. 3: Could you add '(N)' as the measurement of force for Fig. 3A for Fx, Fy, and Fz? Also, please change 'Data was recorded' to 'Data were recorded' in the legend.
 
 Fixed.
 
 - At the beginning of the Methods, you say that your study conforms to the Declaration of Helsinki, which actually requires pre-registration in a database. If you did not pre-register your study, please can you add '... in accordance with the Declaration of Helsinki, apart from pre-registration in a database'.
 
 Thanks for making us aware of this. We have added the suggested qualifier to the ethics statement.
 
 Reviewer #2 (Recommendations For The Authors):
 
 The neural representation/encoding of the actual displacement vectors would be a useful addition to the analyses. These vectors have been demonstrated to systematically change with the condition in the irregular series (Figure 2E) and will thus significantly act on the dynamics of induced mechanical changes in the skin with a given interaction force. Thus, it could be examined how the neurons code the magnitude of displacements as well as their direction. An evaluation of the extent to which the imposed displacement magnitudes are encoded in the neural responses would be a useful addition in explaining the signalling of the force events and how the central nervous system decodes these. Evaluating an alternative displacement encoding for comparison to pure force encoding may reveal more about how contact events are represented in the tactile system, which must decode these variable afferent signals to reconstruct a percept of the interaction. It could then be explored how the central nervous system may then scale the dynamic afferent responses based on the background viscoelastic state likely to be present in the SA-II afferent signals (Figure 7) for a context in which to evaluate the dynamic contact forces. This may of course be a complex relationship for the type-I afferents, where the underlying mechanical events evoking the firing (microslips not represented in global forces) have not been measured here. Such a model could be more widely applicable, as the skin viscoelasticity and displacement magnitudes are a straightforward measurement metric and could perhaps be used as a better proxy for neural signalling. This would allow the investigation of a wider variety of forces, and the study of the timing of the viscoelastic effect, both of which have been fixed here. This would give the work a broader impact, rather than just highlighting that this effect produces variability, it could reveal if this mechanical feature is structured in the neural representation. 
The categorical encoding/decoding tested here is specific to the stimuli used (magnitudes, intervals), but there is the possibility that this may be more generally applicable (within the bounds of forces/speeds) if the underlying basis of the variability in the signalling produced by the viscoelasticity is identified. Since the time course of the viscoelasticity has not been measured here (fixed forces and intervals), further study is required to fully understand the implications this has for a wider variety of situations.
 
 We agree that a better understanding of how the mechanical deformations are reflected in the resulting spike trains would be valuable. While ultimately a full understanding will need precise measurements of skin deformation across the whole fingertip to account for mechanical propagation to mechanoreceptor locations, relating the deformations at the contact location with neural firing patterns directly can provide useful hints into which aspects of deformation are encoded and how. To this end, we ran a new analysis that aimed to predict the time-varying neural responses directly from the recorded mechanical movements of the contactor.
 
 Below we have reproduced the new results and methods text along with the additional figures for this analysis. Note that we have also added text in the Discussion to interpret these findings in the context of our other results.
 
 New section in Results titled Predicting neural responses from contactor movements: “The similarity in the history-dependent variation in neural firing and fingertip deformation at a given force stimulus suggests that neuronal firing is determined by how the fingertip deforms rather than the applied force itself. However, this similarity does not clarify the relationship between fingertip deformation dynamics and neural signaling. To investigate further, we fit cross-validated multiple linear regression models to evaluate how well distinct aspects of contactor movement could predict the time-varying firing rates of individual neurons during the protraction phases of the irregular sequence. The models used predictors based on (1) the three-dimensional position of the contactor, (2) its three-dimensional velocity, (3) a combination of position and velocity signals, and, finally, (4) position and velocity signals along with all possible two-way interactions between them, capturing potentially complex relationships between fingertip deformations and neural signaling.
 
 Comparing the variance explained (R2) by each regression model for each neuron type revealed clear differences between the models (Figure 5A). A two-way mixed design ANOVA, with regression model as within-group effects and neuron type as a between-group effect revealed a main effect of model on variance explained (F(3,462) = 815.5, p < 0.001, ηp2 = 0.84). Model prediction accuracy overall increased with the number of predictors, with the two-way interaction model outperforming all others (p < 0.001 for all comparisons, Tukey’s HSD). Additionally, a significant main effect of neuron type (F(2,154) = 29.8, p < 0.001, ηp2 = 0.28) and a significant interaction between regression model and neuron type were observed (F(6,462) = 50.8, p < 0.001, ηp2 = 0.40).
 
 For neuron type, model predictions were most accurate for SA-2 neurons, followed by SA-1 neurons, with FA-1 neurons showing the lowest accuracy (p < 0.003 for all comparisons, Tukey’s HSD). The interaction between model and neuron type revealed distinct patterns. For SA-1 and SA-2 neurons, position-only and velocity-only models had similar prediction accuracy (p ≥ 0.996, Tukey’s HSD) with no significant differences between these neuron types (p ≥ 0.552, Tukey’s HSD). FA-1 neurons performed poorly with the position-only model but showed higher accuracy with the velocity-only model (p < 0.001, Tukey’s HSD) and better than SA-1 neurons (p = 0.006, Tukey’s HSD). Models combining position and velocity predictors (without interactions) surpassed both position-only and velocity-only models for SA-1 and SA-2 neurons (p < 0.001, Tukey’s HSD). Overall, the differences between neuron types broadly match their tuning to static and dynamic stimulus properties.
 
 The two-way interaction model, accounting for most variance in neural responses, produced mean R2 values of 0.75 for FA-1, 0.88 for SA-1, and 0.91 for SA-2 neurons (Figure 5A). To evaluate the contribution of the different predictors, we ranked them using the permutation feature importance method, focusing on the six most important ones. Regression analyses using only these variables explained almost all of the variance explained by the full model, with a median R2 reduction of just 0.055 across all neurons. Across all neuron types, at least half included all three velocity components (dPx, dPy, dPz) among the top six, with FA-1 neurons showing the highest prevalence (Figure 5B). Interactions between normal position (Pz) and each velocity component were also frequently observed, while interactions involving tangential position and velocity components were less common. Interactions among velocity components were relatively well represented, followed by interactions limited to position components. Position signals were generally less represented, except for normal position (Pz) in slowly adapting neurons, where it appeared in 50% of SA-1 and 68% of SA-2 neurons. Despite these broad trends, important predictors varied widely across ranks even within a given neuron class (see Figure 5-figure supplement 1), and even the most frequent variables appeared in only a subset of cases, suggesting broad variability in sensitivity across neurons.”
 
 New methods paragraph titled Predicting time-varying firing rates from skin deformations:
 
 “This analysis was conducted in Python (v3.13) with pandas for data handling, numpy for numerical operations, and scikit-learn for model fitting and evaluation.
 
 To assess how well individual neurons' time-varying firing rates could be predicted from simultaneous contactor movements, we fitted multiple linear regression models (see Khamis et al., 2015, for a similar approach). This analysis focused on the force protraction phase of the irregular sequence, where neurons were most responsive and sensitive to stimulation history. Data from 100 ms before to 100 ms after the protraction phase (between -0.100 s and 0.225 s relative to protraction onset) were included for each trial. Neurons were included if they fired at least two action potentials during the force protraction phase and the following 100 ms in at least five of the 25 trials. This ensured sufficient variability in firing rates for meaningful regression analysis, resulting in 68 SA-1, 38 SA-2, and 51 FA-1 neurons being included.
 
 Contactor position signals digitized at 400 Hz were linearly interpolated to 1000 Hz. Instantaneous firing rates, derived from action potentials sampled at 12.8 kHz, were resampled at 1000 Hz to align with position signals. A Gaussian filter (σ = 10 ms, cutoff ~16 Hz) was applied to the firing rate as well as to the position signals before differentiation. To account for axonal conduction (8–15 ms) and sensory transduction delays (1–5 ms), firing rates were advanced by 15 ms to align approximately with independent variables.
 
 Regressions were performed using scikit-learn's Ridge and RidgeCV regressors, which apply L2 regularization to mitigate overfitting. Hyperparameter tuning for the regularization parameter (alpha) was performed using GridSearchCV with a predefined range (0.001–1000.0), incorporating five-fold cross-validation to select the best value. To minimize overfitting risks, model performance was further validated with independent five-fold cross-validation (KFold), and R2 scores were computed using cross_val_score.
 
 We constructed four linear regression models with increasing complexity: (1) Position-only, using three-dimensional contactor positions (Px, Py, Pz); (2) Velocity-only, using three-dimensional velocities (dPx, dPy, dPz); (3) Combined, including all position and velocity signals (6 predictors); and (4) Interaction, including all signals and their two-way interactions (21 predictors). All features were standardized using StandardScaler to improve regularization and model convergence. PolynomialFeatures generated second-order interaction terms for the interaction model. Feature importance was evaluated with permutation_importance, and simpler models were built using the most important features. These models were validated through cross-validation to assess retained explanatory power.”
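
 The signal preparation steps named in the quoted Methods (interpolation from 400 Hz to 1000 Hz, Gaussian smoothing with σ = 10 ms, differentiation, and a 15 ms advance of the firing rate) can be sketched with numpy alone. This is an illustrative reconstruction under stated assumptions, not the authors' pipeline: the function names, the kernel truncation at four standard deviations, the edge padding, and the use of central differences are all hypothetical choices. The quoted cutoff of about 16 Hz is consistent with the frequency-domain standard deviation 1/(2πσ) of a 10 ms Gaussian, which the last line computes.

```python
import numpy as np

FS_POS, FS_OUT = 400.0, 1000.0  # sampling rates in Hz, from the Methods text
SIGMA_S = 0.010                 # Gaussian sigma, 10 ms
LAG_S = 0.015                   # firing rate advanced by 15 ms


def gaussian_smooth(x, sigma_s, fs):
    """Smooth x with a unit-area Gaussian kernel truncated at +/- 4 sigma."""
    sigma = sigma_s * fs
    half = int(np.ceil(4 * sigma))
    t = np.arange(-half, half + 1)
    kernel = np.exp(-0.5 * (t / sigma) ** 2)
    kernel /= kernel.sum()
    padded = np.pad(x, half, mode="edge")  # assumed edge handling
    return np.convolve(padded, kernel, mode="valid")


def preprocess(t_pos, pos, t_rate, rate):
    """Return (X, y): six kinematic predictors and the time-advanced firing rate.

    pos is (n, 3) contactor position sampled at t_pos (400 Hz);
    rate is the firing rate already resampled on the 1 kHz grid t_rate.
    """
    # Linear interpolation of each position axis onto the 1 kHz grid.
    pos_1k = np.column_stack([np.interp(t_rate, t_pos, pos[:, i]) for i in range(3)])
    # Smooth before differentiating, as stated in the Methods.
    pos_s = np.column_stack([gaussian_smooth(pos_1k[:, i], SIGMA_S, FS_OUT) for i in range(3)])
    vel = np.gradient(pos_s, 1.0 / FS_OUT, axis=0)
    rate_s = gaussian_smooth(rate, SIGMA_S, FS_OUT)
    # Advance the rate: the sample at t + 15 ms is paired with kinematics at t.
    shift = int(round(LAG_S * FS_OUT))
    return np.hstack([pos_s, vel])[:-shift], rate_s[shift:]


# Toy usage (synthetic): a 2 mm/s ramp in Pz and a single brief burst of firing.
t_pos = np.arange(160) / FS_POS
t_rate = np.arange(390) / FS_OUT
pos = np.column_stack([np.zeros(160), np.zeros(160), 2.0 * t_pos])
rate = np.zeros(390)
rate[100] = 1000.0
X, y = preprocess(t_pos, pos, t_rate, rate)

# Frequency-domain standard deviation of the Gaussian filter, in Hz.
sigma_f = 1.0 / (2.0 * np.pi * SIGMA_S)
```

 The arrays X and y returned here have the shape expected by the regression models listed in the Methods, with columns ordered Px, Py, Pz, dPx, dPy, dPz.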
 
 Minor:
 
 - It would be useful to add a brief description of the material aspects of the contactor tip to the methods (as per Birznieks 2001).
 
 We have added the following statement:
 
 “To ensure that friction between the contactor and the skin was sufficiently high to prevent slips, the surface was coated with silicon carbide grains (50–100 μm), approximating the finish of smooth sandpaper.”
 
 - The axes labelling on Figure 3A and legend description is ambiguous, probably placing the Px, Py, and Pz labels on the far left axes and the Fx, Fy, and Fz on the right side of the far right axes would make this clearer.
 
 Label placement has been improved along with some other minor fixes.
 
 - For the quasi-static phase analysis, the phrase "absence of loading" used in reference to the interstimulus period and SA-II afferents does not seem to be a correct description. The finger is still loaded (at least in the normal direction), with a magnitude of imposed displacement that counteracts the viscoelastic force exerted by the skin mechanics of the fingertip. Although there is a zero net-force load, a mechanical stimulus is still being actively applied to the skin.
 
 We have changed the wording throughout the text and now consistently refer either to the “interstimulus period” directly or to an “absence of externally applied stimulation” to avoid confusion.
 
 AuthorResponse
Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.05.15.540820v2
www.biorxiv.org www.biorxiv.org

Mutations that prevent phosphorylation of the BMP4 prodomain impair proteolytic maturation of homodimers leading to lethality in mice

5
1. Public_Reviews 08 May 2025
  
  in eLife
  
  eLife Assessment
  
  This fundamental work presents two clinically relevant BMP4 mutations that contribute to vertebrate development. The compelling evidence, both from wet-lab experiments and AI-generated predictions, supports the conclusion that site-specific cleavage at the BMP4 pro-domain precisely regulates its function, and it provides mechanistic insight into how homodimers and heterodimers behave differently. The work will be of broad interest to researchers working on growth factor signaling mechanisms and vertebrate development.
  
  Summary
2. Public_Reviews 08 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors demonstrate that two human preproprotein mutations in the BMP4 gene cause a defect in proprotein cleavage and BMP4 mature ligand formation, leading to hypomorphic phenotypes in mouse knock-in alleles and in Xenopus embryo assays.
  
  Strengths:
  
  They provide compelling biochemical and in vivo analyses supporting their conclusions, showing reduced processing of the proprotein and a concomitant reduction in mature BMP4 ligand protein in, impressively, mouse embryonic lysates. They perform excellent analysis of the embryo and post-natal phenotypes demonstrating the hypomorphic nature of these alleles. Interesting phenotypic differences between the S91C and E93G mutants are shown with excellent hypotheses for the differences. Their results support that BMP4 heterodimers act predominantly throughout embryogenesis whereas BMP4 homodimers play essential roles at later developmental stages.
  
  Weaknesses:
  
  In the revision the authors have appropriately addressed the previous minor weaknesses.
  
  Review 1
3. Public_Reviews 08 May 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The revised paper by Kim et al. reports two disease mutations in proBMP4, S91C and E93G, disrupt the FAM20C phosphorylation site at Ser91, blocking the activation of proBMP4 homodimers, while still allowing BMP4/7 heterodimers to function. Analysis of DMZ explants from Xenopus embryos expressing the proBMP4 S91C or E93G mutants showed reduced expression of pSmad1 and tbxt1. The expert amphibian tissue transplant studies were expanded to in vivo studies in Bmp4S91C/+ and Bmp4E93G/+ mice, highlighting the impact of these mutations on embryonic development, particularly in female mice, consistent with patient studies. Additionally, studies in mouse embryonic fibroblasts (MEFs) demonstrated that the mutations did not affect proBMP4 glycosylation or ER-to-Golgi transport but appeared to inhibit the furin-dependent cleavage of proBMP4 to BMP4. Based on these findings and AI modeling using AlphaFold of proBMP4, the authors speculate that pSer91 influences access of furin to its cleavage site at Arg289AlaLysArg292 in a new "Ideas and Speculation" section. Overall, the authors addressed the reviewers' comments, improving the presentation.
  
  Strengths:
  
  The strengths of this work continue to lie in the elegant Xenopus and mouse studies that elucidate the impact of the S91C and E93G disease mutations on BMP signaling and embryonic development. Including an "Ideas and Speculation" subsection for mechanistic ideas reduces some shortcomings regarding the analysis of the underlying mechanisms.
  
  Review 2
4. Public_Reviews 08 May 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The authors describe important new biochemical elements in the synthesis of a class of critical developmental signaling molecules, BMP4. They also present a highly detailed description of developmental anomalies in mice bearing known human mutations at these specific elements.
  
  Strengths:
  
  This paper presents exceptionally detailed descriptions of pathologies occurring in BMP4 mutant mice. Novel findings are shown regarding the interaction of propeptide phosphorylation and convertase cleavage, both of which will move the field forward. Lastly, a provocative hypothesis regarding furin access to cleavage sites is presented, supported by AlphaFold predictions.
  
  Review 3
5. Public_Reviews 08 May 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the previous reviews
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The revised paper by Kim et al. reports two disease mutations in proBMP4, S91C and E93G, disrupt the FAM20C phosphorylation site at Ser91, blocking the activation of proBMP4 homodimers, while still allowing BMP4/7 heterodimers to function. Analysis of DMZ explants from Xenopus embryos expressing the proBMP4 S91C or E93G mutants showed reduced expression of pSmad1 and tbxt1. The expert amphibian tissue transplant studies were expanded to in vivo studies in Bmp4S91C/+ and Bmp4E93G/+ mice, highlighting the impact of these mutations on embryonic development, particularly in female mice, consistent with patient studies. Additionally, studies in mouse embryonic fibroblasts (MEFs) demonstrated that the mutations did not affect proBMP4 glycosylation or ER-to-Golgi transport but appeared to inhibit the furin-dependent cleavage of proBMP4 to BMP4. Based on these findings and AI modeling using AlphaFold of proBMP4, the authors speculate that pSer91 influences access of furin to its cleavage site at Arg289AlaLysArg292 in a new "Ideas and Speculation" section. Overall, the authors addressed the reviewers' comments, improving the presentation.
  
  Strengths:
  
  The strengths of this work continue to lie in the elegant Xenopus and mouse studies that elucidate the impact of the S91C and E93G disease mutations on BMP signaling and embryonic development. Including an "Ideas and Speculation" subsection for mechanistic ideas reduces some shortcomings regarding the analysis of the underlying mechanisms.
  
  Weaknesses:
  
  (1) (Minor) In Figure S1 and lines 165-174 and 179-180, the authors should consider that, unlike the wild-type protein (Ser), which can be reversibly phosphorylated or dephosphorylated, phosphomimic mutations are locked into mimicking either the phosphorylated state (Asp) or the non-phosphorylated state (Ala). Consequently, if the S91D mutant exhibits lower activity than WT, it could imply that S91D interferes with other regulatory constraints, as the authors suggest. However, it may also be inhibiting activation. Therefore, caution is warranted when comparing S91D with S91C to conclude that Ser91 phosphorylation increases BMP4 activity. While additional experiments are not necessary, further consideration is essential.
  
  (Minor) In lines 394-399, the authors cleverly speculate that pS91 interacts with Arg289, the essential P4 arginine for furin processing. If so, this interaction could hinder the cleavage of proBMP4, as indicated by the results in Figure S1. The discussion would benefit from considering that, contrary to their favored model, dephosphorylation at Ser91 might actually facilitate cleavage.
  
  We have added a paragraph raising this possibility but explaining why it is unlikely and inconsistent with our in vivo data. The S91D construct was a simple control that was tested in ectopic expression assays and not in vivo. We can make no conclusions about whether this construct resembles the phosphorylated state or whether it hinders or facilitates cleavage in vivo. The conclusion that dephosphorylation promotes BMP4 cleavage or activity is not compatible with the finding that two mutations associated with birth defects in humans (p.S91C or p.E93G) that are predicted to prevent FAM20C-mediated phosphorylation of the BMP4 prodomain lead to impaired proteolytic maturation of endogenous BMP4 and reduced BMP activity in vivo.
  
  (2) In Figure 4, panels A, E, and I, the proBMP bands in the mouse embryonic lysates and MEFs expressing the mutations show a clear size shift. Are these shifts a cause or a consequence of the lack of cleavage? Regardless, the size shifts should be explicitly noted.
  
  These intriguing shifts were observed in some but not all biological replicates. When present, the shifts were not reversed by treatment with phosphatases or deglycosylases, and the shifts were never observed in epitope-tagged wild-type controls. We have added a paragraph noting the shifts and our tests of whether they might be due to glycosylation, phosphorylation or epitope tags.
  
  (3) (Minor) In line 314, the authors should consider modifying the wording to: "is required for modulating proprotein convertase..."
  
  The original wording (“Collectively, our findings are consistent with a model in which FAM20C-mediated phosphorylation of the BMP4 prodomain is not required for folding or exit of the precursor protein from the ER, but is required for proprotein convertase recognition and/or for trafficking to post-TGN compartment(s) where BMP4 is cleaved”) more accurately reflects the model that is supported by our findings. Stating that “phosphorylation … is required to modulate proprotein convertase recognition and/or trafficking” is vague and leaves open the possibility that it modulates in either direction, which our data do not support, as described in point 1 above.
  
  AuthorResponse
Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.10.08.617306v4
www.biorxiv.org www.biorxiv.org

Glucose-stimulated KIF5B-driven microtubule sliding organizes microtubule networks in pancreatic beta cells

4
1. Public_Reviews 08 May 2025
 
 in eLife
 
 eLife Assessment
 
 In their valuable study, Bracey et al. investigate how microtubule organization within pancreatic islet beta cells supports optimal insulin secretion. Using a combination of live imaging and photo-kinetic assays in an in vitro culture system, they provide convincing evidence that kinesin-1-mediated microtubule sliding, which plays critical roles in neurons and embryos, also plays a critical role in forming the sub-membranous microtubule band in response to glucose in beta cells. This work will be of interest to cell biologists studying cytoskeletal dynamics and organelle trafficking, as well as to translational biologists focused on diabetes.
 
 Summary
2. Public_Reviews 08 May 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This study investigates the role of microtubules (MT) in regulating insulin secretion from pancreatic islet beta cells. This is of great importance considering that controlled secretion of insulin is essential to prevent diabetes. Previously, it has been shown that KIF5B plays an essential role in insulin secretion by transporting insulin granules to the plasma membrane. High glucose activates KIF5B to increase insulin secretion, resulting in cellular uptake of glucose. In order to prevent hypoglycemia, insulin secretion needs to be tightly controlled. Notably, it is known that KIF5B plays a role in MT sliding. This is important, as the authors described previously that beta cells establish a peripheral sub-membrane MT array, which is critical for withdrawal of excessive insulin granules from the secretion sites. At high glucose, the sub-membrane MT array is destabilized to allow for robust insulin secretion. Here the authors aim to answer the question of how the peripheral array is formed. Based on the previously published data, the authors hypothesize that KIF5B organizes the sub-membrane MT array via microtubule sliding.
 
 General comment: This manuscript provides data that indicate that KIF5B, like in many other cells, mediates MT sliding in beta cells to establish a non-radial sub-membrane MT array. This study is based mainly on in vitro assays and one cell line. To demonstrate the importance of KIF5B in vivo/under physiological conditions, the MT pattern and directionality in beta cells within whole isolated pancreatic islets from KIF5B KO mice were analyzed in comparison to their WT littermates. While the presented effects often appear rather small, it is important to note that small changes in MT configuration can have strong effects. However, the authors provide no link to insulin secretion and glucose uptake. Finally, it remains unclear whether a KIF5B-dependent mechanism regulating microtubule sliding plays a major role in controlling insulin secretion.
 
 Specific comments:
 
 (1) It is difficult to appreciate that there is a "peripheral sub-membrane microtubule array" as it is not well defined in the manuscript. This reviewer assumes that the term is clear within the field. Yet, while it is appreciated that there is an increased amount of MTs close to the cytoplasmic membrane, the densities appear very variable along the membrane. Please provide a clear description in the Introduction of what is meant by "peripheral sub-membrane microtubule array".
 
 (2) The authors described a "consistent presence of a significant peripheral array in the C57BL/6J control mice, while the KO counterparts exhibited a partial loss of this peripheral bundle. Specifically, the measured tubulin intensity at the cell periphery was significantly reduced in the KO mice compared to their wild-type counterparts". In vitro "control cells had convoluted non-radial MTs with a prominent sub-membrane array, typical for β cells (Fig. 2A), KIF5B-depleted cells featured extra-dense MTs in the cell center and sparse receding MTs at the periphery (Fig. 2B,C)". Please comment on or discuss why there are no "extra-dense MTs in the cell center" in vivo.
 
 (3) The authors should include in the Discussion a paragraph discussing the fact that small changes in MT configuration can have strong effects.
 
 Review 1
3. Public_Reviews 08 May 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 This elegant study provides significant and impactful insights into the factors contributing to the distinct arrangement of sub-membrane microtubules within mouse β-cells of the pancreas. The authors propose that in these cells, the motor protein KIF5B plays a crucial role in sliding existing microtubules toward the cell periphery and aligning them with one another along the plasma membrane. Furthermore, similar to other physiological features of β-cells, high glucose levels enhance this microtubule sliding process. A precise arrangement of microtubules beneath the cell membrane in β-cells is vital for the regulated secretion of pancreatic enzymes and hormones; thus, KIF5B has a significant role in pancreatic activity in both healthy conditions and diseases. The authors support their model by demonstrating that the levels of KIF5B mRNA in MIN6 cells are higher than those of other known kinesins. They show that microtubule sliding becomes less efficient when KIF5B is genetically silenced using two different short hairpin RNAs (shRNAs). Additionally, silencing of KIF5A in the same cells results in a general reorganization of microtubules throughout the cell. Specifically, while control cells exhibit a convoluted and non-radial arrangement of microtubules near the cell membrane, KIF5B-depleted cells display a sparse and less dense sub-membrane array of microtubules. Based on these findings, the authors conclude that the loss of KIF5B strongly affects the localization of microtubules to the cell periphery. Using a dominant-negative approach, the authors also demonstrate that KIF5B facilitates the sliding of microtubules by binding to cargo microtubules through the kinesin-1 tail binding domain. They present evidence suggesting that KIF5B-mediated microtubule sliding is glucose-dependent, similar to the activity levels of kinesin-1, which increase in the presence of glucose. Lastly, they show that this is glucose-dependent.
 
 Strengths:
 
 This study unveils a previously unexplained mechanism that regulates the specific rearrangement of microtubules beneath the cell membrane in pancreatic β-cells. The findings have significant implications because the precise regulation of the microtubule array at the secretion zone plays a critical role in controlling pancreatic function in both healthy and diseased states. The provided data supports the authors' conclusions well, and the study demonstrates the use of state-of-the-art methodologies, including quantification techniques and elegant dominant-negative experiments.
 
 Weaknesses: None
 
 Review 2
4. Public_Reviews 08 May 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews
 
 Public Reviews:
 
 Reviewer #1 (Public Review):
 
 This study investigates the role of microtubules in regulating insulin secretion from pancreatic islet beta cells. This is of great importance considering that controlled secretion of insulin is essential to prevent diabetes. Previously, it has been shown that KIF5B plays an essential role in insulin secretion by transporting insulin granules to the plasma membrane. High glucose activates KIF5B to increase insulin secretion resulting in the cellular uptake of glucose. In order to prevent hypoglycemia, insulin secretion needs to be tightly controlled. Notably, it is known that KIF5B plays a role in microtubule sliding. This is important, as the authors described previously that beta cells establish a peripheral sub-membrane microtubule array, which is critical for the withdrawal of excessive insulin granules from the secretion sites. At high glucose, the sub-membrane microtubule array is destabilized to allow for robust insulin secretion. Here the authors aim to answer the question of how the peripheral array is formed. Based on the previously published data the authors hypothesize that KIF5B organizes the sub-membrane microtubule array via microtubule sliding.
 
 General comment:
 
 This manuscript provides data that indicate that KIF5B, like in many other cells, mediates microtubule sliding in beta cells. This study is limited to in vitro assays and one cell line. Furthermore, the authors provide no link to insulin secretion and glucose uptake and the overall effects described are moderate. Finally, the overall effect of microtubule sliding upon glucose stimulation is surprisingly low considering the tight regulation of insulin secretion. Moreover, the authors state "the amount of MT polymer on every glucose stimulation changes only slightly, often undetectable…. In fact, we observe a prominent effect of peripheral MT loss only after a long-term kinesin depletion (three-four days)". This challenges the view that a KIF5B-dependent mechanism regulating microtubule sliding plays a major role in controlling insulin secretion.
 
 (1) Our initial study was indeed done in a cell line, which is a normal approach to addressing molecular mechanisms of a phenomenon in a challenging cell model: primary pancreatic beta cells are prone to rapidly dedifferentiate outside of the organism and are hard to genetically modify. To address this reviewer’s comment, in the revised manuscript we now confirm the phenotype in beta cells within intact pancreatic islets from a KIF5B KO mouse model (New Figure 2 – Supplemental Figure 1).
 
 (2) We agree that testing the effect of microtubule sliding on insulin secretion is an important question. Unfortunately, the experimental design needed to accomplish this task is not straightforward. Importantly, besides microtubule sliding, KIF5B is heavily engaged in insulin granule transport, and GSIS deficiency upon KIF5B inactivation is well documented (e.g. Varadi et al 2002). In this study, we choose not to repeat this GSIS assay because of ample existing data. However, this reported GSIS deficiency could result from a combination of lack of insulin granule delivery to the periphery (previous data) and from the depletion of insulin granules from the periphery due to the loss of the submembrane MT bundle (this study and Bracey et al 2020). In order to exclusively test the role of MT sliding in secretion, a significant investment in mutant tool development would be needed. Ideally, a new mutant mouse model where insulin granule transport is allowed but MT sliding is blocked must be developed to specifically address this question. To conclude, answering this question will be the subject for another, follow-up study.
 
 (3) We respectfully disagree with the reviewer’s opinion that the effect of MT sliding in beta cells is moderate. As MT networks go, even a slight change in MT configuration often has dramatic consequences. For example, in mitotic spindles, a tiny overgrowth of microtubule ends during metaphase, which causes them to attach to both kinetochores rather than just one, is very significant for the efficiency of chromosome segregation, causing aneuploidy and cancer. The changes in beta-cell MT networks that we are reporting are much stronger: the effect on the peripheral MT network accumulated over three days of KIF5B depletion is dramatic (Fig 2 B, C). Short-term changes in gross MT network configuration after a single glucose stimulation are harder to detect, but MTs at the cell periphery are, in fact, destabilized and fragmented, as we and others have previously reported (Ho et al 2020, Mueller et al 2021). Preventing this MT rearrangement completely blocks GSIS (Zhu et al 2015, Ho et al 2020).
 
 One of the most fascinating features of insulin secretion regulation is that the amount of generated insulin granules significantly exceeds the normal physiological needs for insulin secretion (~100 times more than needed). At the same time, even slightly facilitated glucose depletion can be devastating. Accordingly, the excessive insulin content of a beta cell resulted in the development of multiple levels of control, preventing excessive secretion. Our previous data suggest that the peripheral MT array provides one of those mechanisms. This study indicates that microtubule sliding is necessary to form the proper peripheral network in the long term. Short-term glucose-induced changes in the peripheral MT array likely need to be subtle to prevent over-secretion. Thus, we are not surprised that a dramatic effect of sliding inhibition is only detectable by our approaches after the changes in the MT network accumulate over time. In the revised paper, we now discuss the potential impact of peripheral MT sliding on positive and negative regulation of secretion and add a schematic model illustrating these processes.
 
 Specific comments:
 
 (1) Notably, the authors have previously reported that high glucose-induced remodeling of microtubule networks facilitates robust glucose-stimulated insulin secretion. This remodeling involves the disassembly of old microtubules and the nucleation of new microtubules. Using real-time imaging of photoconverted microtubules, they report that high levels of glucose induce rapid microtubule disassembly preferentially in the periphery of individual β-cells, and this process is mediated by the phosphorylation of microtubule-associated protein tau. Here, they state that the sub-membrane microtubule array is destabilized via microtubule sliding. What is the relevance of the different processes?
 
 In this comment, the summary of our previous conclusions is correct, but the conclusion of this current study is re-stated incorrectly. Indeed, we have previously shown that in high glucose, MTs are destabilized at the cell periphery and nucleated in the cell interior. However, this current paper does not state that “the sub-membrane microtubule array is destabilized via microtubule sliding”. To answer this reviewer’s question, our data support a model where, during glucose stimulation, MT sliding within the peripheral bundle might move fragments of MTs severed by other mechanisms. Importantly, we propose that MT sliding restores the partially destabilized peripheral bundle by delivery of MTs that are nucleated at the cell interior and incorporating them into that bundle. In our overall model, three processes (destabilization, nucleation, and sliding to restore the bundle) are coordinated to maintain beta cell fitness on each GSIS cycle.
 
 (2) On one hand the authors describe how KIF5B depletion prevents sliding and the transport of microtubules to the plasma membrane to form the sub-membrane microtubule array. This indicates KIF5B is required to form this structure. On the other hand, they describe that at high glucose concentration, KIF5B promotes microtubule sliding to destabilize the sub-membrane microtubule array to allow robust insulin secretion. This appears contradictory.
 
 We never intended to give the impression that MT sliding destabilizes the sub-membrane bundle. We apologize if our wording caused this misunderstanding of our model. We propose that while the bundle is destabilized downstream of glucose signaling (e.g. due to tau phosphorylation, please see Ho et al Diabetes 2020), MT sliding remodels the bundle and thereafter rebuilds it to prevent over-secretion. In the revised manuscript, we have double-checked the whole text to make sure that such a misunderstanding is avoided.
 
 (3) Previously, it has been shown that KIF5B induces tubulin incorporation along the microtubule shaft in a concentration-dependent manner. Moreover, running KIF5B increases microtubule rescue frequency and unlimited growth of microtubules. Notably, KIF5B regulates microtubule network mass and organization in cells (PMID: 34883065). Consequently, it appears possible that the here observed phenomena of changes in the microtubule network might be due to alterations in these processes.
 
 We thank the reviewer for proposing this alternative explanation for the observed change in microtubule networks after KIF5B depletion. We have now directly tested this possibility. Namely, we have re-expressed the kinesin-1 motor domain in MIN6 cells depleted of KIF5B. This motor domain construct by itself is not capable of driving microtubule sliding because it lacks the tail domain. At the same time, it is known to move very efficiently along microtubules and should provide the effects reported in the article cited by the reviewer. We found that the re-expression of the kinesin motor domain does not rescue microtubule network defects in beta cells (see new Figure 2 – Supplemental Figure 2). Thus, we conclude that the effects of kinesin depletion on the microtubule network in beta cells are due to the lack of microtubule sliding, as reported here.
 
 (4) The authors provide data that indicate that microtubule sliding is enhanced upon glucose stimulation. They conclude that these data indicate that microtubule sliding is an integral part of glucose-triggered microtubule remodeling. Yet, the authors fail to provide any evidence that this process plays a role in insulin secretion or glucose uptake.
 
 We would like to point out that we do not “fail” but rather choose not to overload our study by repeating insulin secretion assays in KIF5B-inactivated cells because this would not have been very informative. It has been found previously that kinesin-1 inactivation or knockout significantly attenuates insulin secretion because kinesin-1 is actively transporting insulin granules and kinesin-1 activity is enhanced under high glucose conditions (e.g. Varadi et al 2002, Cui et al., 2011, Donelan et al, 2002). That said, our current finding is very much in line with these previous data. When kinesin is depleted, two things would be happening at the same time: in the absence of sub-membrane microtubule bundle pre-existing insulin granules would be over-secreted, and new insulin would not be delivered to the periphery, both decreasing GSIS. Unfortunately, we do not have tools yet that would allow us to dissect which part of the insulin secretion defect is due to prior over-secretion (the consequence of deficient MT sliding) and which part is due to the lack of new granule delivery. We plan to develop such tools in the future and elaborate on them in a follow-up study. Here, our goal is to understand microtubule organization principles in beta cells, and we choose not to extend the scope of the current study to metabolic assays.
 
 (5) The authors speculate that the sub-membrane microtubule array prevents the over-secretion of insulin. Would one not expect in this case a change in the distribution of insulin granules at the plasma membrane when this array is affected? Or after glucose stimulation? Notably, it has been reported that "the defects of β-cell function in KIF5B mutant mice were not coupled with observable changes in islet morphology, islet cell composition, or β-cell size" and "the subcellular localization of insulin vesicles was found to not be affected significantly by the decreased Kif5b level. The cytoplasm of both wild-type and mutant β-cells was filled with insulin vesicles. Insulin vesicle numbers per square μm were determined by counting all insulin vesicles in randomly photographed β-cells. More insulin granules were found in Kif5b knockout β-cells compared with control cells. This phenomenon is consistent with the observation that insulin secretion by β-cells is affected" whereby "Insulin vesicles (arrowheads) were distributed evenly in both mutant and control cells" (PMID: 20870970).
 
 Quantitative analyses in the study cited by the reviewer do not include assays that would be relevant to our study. Particularly, in that study neither the amount of insulin granules at the cell periphery nor the ratio between the number of granules at the periphery and the beta cell interior has been analyzed. In addition, in our preliminary observations not shown here, insulin content in beta cells in KIF5B KO mice is highly heterogeneous, with a subpopulation of cells severely depleted of insulin. This opens a new avenue of investigation into beta cell heterogeneity, which is out of the scope of this current study. Thus, we chose to restrict this current study to microtubule organization data.
 
 (6) Does the sub-membrane microtubule array exist in primary beta cells (in vitro and/or in vivo) and how it is affected in KIF5B knockout mice?
 
 Yes, it does exist. In fact, we first reported it in mouse islets (Bracey et al 2020, Ho et al 2020). Now, we report that the sub-membrane bundle is defective, and microtubules are misaligned in KIF5B KO mice (new Figure 2 – Supplemental Figure 1).
 
 Reviewer #2 (Public Review):
 
 In this article, Bracey et al. provide insights into the factors contributing to the distinct arrangement observed in sub-membrane microtubules (MTs) within mouse β-cells of the pancreas. Specifically, they propose that in clonal mouse pancreatic β-cells (MIN6), the motor protein KIF5B plays a role in sliding existing MTs towards the cell periphery and aligning them with each other along the plasma membrane. Furthermore, similar to other physiological features of β-cells, this process of MT sliding is enhanced by a high glucose stimulus. Because a precise alignment of MTs beneath the cell membrane in β-cells is crucial for the regulated secretion of pancreatic enzymes and hormones, KIF5B assumes a significant role in pancreatic activity, both in healthy conditions and during diseases.
 
 The authors provide evidence in support of their model by demonstrating that the levels of KIF5B mRNA in MIN6 cells are higher compared to other known KIFs. They further show that when KIF5B is genetically silenced using two different shRNAs, the MT sliding becomes less efficient. Additionally, silencing of KIF5A in the same cells leads to a general reorganization of MTs throughout the cell. Specifically, while control cells exhibit a convoluted and non-radial arrangement of MTs near the cell membrane, KIF5B-depleted cells display a sparse and less dense sub-membrane array of MTs. Based on these findings, the Authors conclude that the loss of KIF5B strongly affects the localization of MTs to the periphery of the cell. Using a dominant-negative approach, the authors also demonstrate that KIF5B facilitates the sliding of MTs by binding to cargo MTs through the kinesin-1 tail binding domain. Additionally, they present evidence suggesting that KIF5B-mediated MT sliding is dependent on glucose, similar to the activity levels of kinesin-1, which increase in the presence of glucose. Notably, when the glucose concentrations in the culturing media of MIN6 cells are reduced from 20 mM to 5 mM, a significant decrease in MT sliding is observed.
 
 Strengths:
 
 This study unveils a previously unexplained mechanism that regulates the specific rearrangement of MTs beneath the cell membrane in pancreatic β-cells. The findings of this research have implications and are of significant interest because the precise regulation of the MT array at the secretion zone plays a critical role in controlling pancreatic function in both healthy and diseased states. In general, the authors' conclusions are substantiated by the provided data, and the study demonstrates the utilization of state-of-the-art methodologies, including quantification techniques and elegant dominant-negative experiments.
 
 Weaknesses:
 
 A few relatively minor issues are present and related to data interpretation and the conclusions drawn in the study. Namely, some inconsistencies between what appears to be the overall and sub-membrane MT array in scramble vs. KIF5B-depleted cells, the lack of details about the sub-cellular localization of KIF5B in these cells and the physiological significance of the effect of glucose levels in beta-cells of the pancreas.
 
 We thank the reviewer for this insightful review. In the revised version, we provided re-worded and extended interpretations and conclusions to prevent any issues or misunderstandings. We trust that while some noted apparent inconsistencies may reflect the intrinsic heterogeneity of the beta cell population, all data presented here indicate the same trend in phenotypes. In the revised version, we have provided additional cell views and, in places, alternative representative images and videos, to clear up any apparent inconsistencies. We also would like to point out that we in fact reported KIF5B localization: not surprisingly, KIF5B predominantly localized to insulin granules and the punctate staining fills the whole cytoplasm (Figure 2A, bottom panel). However, as pointed out in detail in our response to reviewer 1, we chose to leave an extensive study of the physiological and metabolic consequences of the reported microtubule network dynamics to a follow-up study.
 
 Reviewer #3 (Public Review):
 
 Prior work from the Kaverina lab and others had determined that beta-cells build a microtubule network that differs from the canonical radial organization typical in most mammalian cell types and that this organization facilitates the regulated secretion of insulin-containing secretory granules (IGs). In this manuscript, the authors tested the hypothesis that kinesin-driven microtubule sliding is an underlying mechanism that establishes a sub-membranous microtubule array that regulates IG secretion. They employed knock-down and dominant-negative strategies to convincingly show microtubule sliding does, in fact, drive the assembly of the sub-membranous microtubule band. They also used live cell imaging assays to demonstrate that kinesin-mediated microtubule sliding in beta-cells is triggered by extracellular high glucose. Overall, this is an interesting and important study that relates microtubule dynamics to an important physiological process. The experiments were rigorous and well-controlled.
 
 We truly appreciate this reviewer’s opinion.
 
 Recommendations for the authors:
 
 Reviewer #1 (Recommendations For The Authors):
 
 Figures:
 
 (1) Figure 1:
 
 a) Why can one not see here, and in most following images, the peripheral sub-membrane microtubule array? One can also not see an accumulation of microtubules in the cell interior.
 
 The microtubule pattern in beta cells is variable, and the sub-membrane array is seen in the whole population to a variable extent (see directionality histogram in Figure 2E for statistics). In fact, an array of peripheral MTs parallel to the cell border is present in the example shown in Figure 1 and in all following control images. To make it clearer, we now show the pre-bleach images in Figure 1 D-F at a lower magnification, so that the differences in MT density at the cell periphery and cell center are more clearly seen: MTs are lacking at the periphery in KIF5B-depleted cells but not in the control cells.
 
 b) 5 min appears to be a long time and enough time to polymerize a significant number of new microtubules.
 
 We interpret this comment as the reviewer’s concern that in FRAP assays, fluorescently-labeled MTs moving into the bleached area might be newly polymerizing MTs rather than preexisting MTs relocated into that area. However, this is not the case because newly polymerized MTs contain predominantly quenched “dark” tubulin molecules and only a small percent of fluorescent tubulin. These dim MTs are not included in MT sliding assay analysis, where a threshold for bright MTs is introduced. Now, we added more details for the quantification of these data to the Materials and Methods section.
 
 c) The overall effects appear minor. It is unclear how Fig. 1-Suppl-Fig.1, where no significant difference is shown, is translated into Figure 1 J and K showing a significant difference.
 
 With all due respect, we do not agree that the effect is minor. Please see our response to the Public Review where we discuss the major consequences of MT defects in detail.
 
 To answer this specific comment, we show that there are significant differences in the number of rapidly moving MTs (5-sec displacement over 0.3 µm) and in the amount of stationary MTs (5-sec displacement below 0.15 µm). There is no significant difference in the amount of slightly displaced MTs (displacements between 0.15 and 0.3 µm; the central part of the histogram). This might indicate that these slight displacements do not depend on the kinesin-1 motor but rather are caused by experimental noise, pushing by moving organelles, and/or myosin-dependent forces in the cell. In the revised manuscript, we have detailed this quantification more clearly in Methods and included it in the Figure legends.
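 As an illustration only, the three displacement classes described above can be sketched in a few lines of Python. This is a minimal sketch, not the authors' analysis pipeline: the function names and the handling of values exactly at 0.15 µm or 0.3 µm are assumptions made here, and only the two cutoffs come from the response.

```python
from collections import Counter

# Cutoffs quoted in the response (5-sec displacement, in micrometers).
STATIONARY_MAX_UM = 0.15
RAPID_MIN_UM = 0.30


def classify_displacement(d_um):
    """Assign one 5-sec displacement to a class.

    Boundary handling is an assumption: values exactly at a cutoff
    are placed in the middle ("slight") class.
    """
    if d_um < STATIONARY_MAX_UM:
        return "stationary"
    if d_um > RAPID_MIN_UM:
        return "rapid"
    return "slight"


def class_fractions(displacements_um):
    """Return the fraction of displacements falling in each class."""
    if not displacements_um:
        raise ValueError("need at least one displacement")
    counts = Counter(classify_displacement(d) for d in displacements_um)
    total = len(displacements_um)
    return {
        name: counts.get(name, 0) / total
        for name in ("stationary", "slight", "rapid")
    }


if __name__ == "__main__":
    # Hypothetical displacements, for demonstration only.
    print(class_fractions([0.05, 0.10, 0.20, 0.40]))
```

 Comparing the "stationary" and "rapid" fractions between conditions, rather than the mean displacement, mirrors the point made above that the middle of the histogram is not expected to depend on kinesin-1.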
 
 d) The authors utilize single molecule tracking to further strengthen their conclusion that KIF5B promotes microtubule sliding. The observed effects are weaker than the data obtained from photobleaching experiments. The videos clearly show that there is still significant movement also in KIF5B-depleted cells. If K560RigorE236A binds irreversibly to a microtubule and this microtubule is growing (not only by the addition of tubulin dimers to the plus end; see PMID: 34883065) wouldn't that also result in movement of the tagged K560RigorE236A? As KIF5B is also required in the transport of insulin granules, it should also label "interior microtubules". And in Video 2 it appears that pretty much all "labeled" microtubules are moving.
 
 K560RigorE236A forms fiducial marks along the whole MT lattice, as previously shown in (Tanenbaum et al., 2014). When it is bound to the MT lattice, K560RigorE236A moves with the whole MT if it is being relocated. The mechanism described in (PMID: 34883065) appears to be absent or minor in beta cells (see Figure 2- Supplemental Figure 2); thus, even if this mechanism could displace already polymerized MTs, this is not happening in this cell type.
 
 The reviewer is correct, K560RigorE236A does mark all MTs throughout a beta cell. All MTs are moving slightly in a living cell because they are pushed around by moving organelles, actin contractility, etc. MTs may also be slid by other MT-dependent motors (dynein against the membrane and such). So, it is not surprising that the MT network is “breezing,” and kinesin-dependent sliding is only a part of MT movement. What we show here is that the KIF5B-dependent MT sliding is responsible for a relatively “long-distance” relocation of MTs manifested in long, directional displacement of fiducial marks. This does not exclude other movements. This makes extraction of kinesin-dependent MT movements somewhat challenging, of course, that is why we needed to do those extensive analyses.
 
 e) Figure 1 G to K is misleading, at least in the context of the provided videos. There are several microtubules that move extensively in shRNA#2-treated cells and overall there appears more movement in this cell as in the control cell. Figure 1I is clearly not representative of the movement shown in Video 2.
 
 We apologize if our selection of representative movies/figures for this experiment was imperfect. Indeed, in all depleted cells, SunTag puncta still move to a certain extent, either due to incomplete depletion or to alternative intracellular forces dislocating microtubules. However, there is a clear difference in the fraction of persistently moving puncta (please see Figure 1K and histogram in Figure 1 - Supplemental Figure 1B). Unfortunately, when the number of SunTag puncta per cell is variable, it sometimes prevents a good visual perception of the actual distribution of moving versus stationary microtubules. We now show an alternative representative movie for Figure 1I and the corresponding Video 2, with the goal of comparing cells with more consistent numbers of SunTag puncta.
 
 (2) Figure 2A.
 
 a) This is the only image that clearly shows the existence of a sub-membrane microtubule array and the concentration of microtubules in the cell interior. The differences are unclear between the experimental setups including the length of cultivation and knockdown of KIF5B or expression of mutants.
 
 We now provide a more detailed description of each image acquisition and processing in Materials and Methods. In brief, while the morphology of MT patterns is intrinsically variable in beta cells, all control cells have populated peripheral MTs that exhibit a more parallel configuration as compared to depletions and mutants.
 
 b) The authors state "While control cells had convoluted non-radial MTs with a prominent sub-membrane array, typical for beta cells (Fig. 2A), KIF5B-depleted cells featured extra-dense MTs in the cell center and sparse reseeding MTs at the periphery (Fig. 2B, C)". Could that not be explained with the observation that "Kinesin-1 controls microtubule length" (PMID: 34883065)?
 
 Thank you for this interesting alternative idea. It does not appear to be the case for beta cells. Please see Figure 2-Supplemental Figure 2 and our response to Public Review Comment #3.
 
 Also, our apologies for the typo in the original manuscript: this is “receding”, not “reseeding”.
 
 (3) Figure 3:
 
 a) This is an elegant way to determine whether KIF5B is involved in microtubule sliding independent of the fact that the effect appears very small.
 
 Thank you!
 
 b) The assay depends on ectopic expression of a dominant negative mutant. It appears important to show that KIFDNwt is high enough expressed to indeed block the binding of endogenous KIF5B. The authors need to provide a control for this. Furthermore, authors need to provide evidence that other functions of KIF5B are not impaired such as transport of insulin granules and tubulin incorporation or microtubule stability and length.
 
 Expression of cargo-binding motor domains routinely causes a dominant-negative effect on their cargo transport. This exact construct has been used for the purpose of dominant-negative action previously (Ravindran et al., 2017). It does prevent the membrane cargo binding of KIF5B (Ravindran et al., 2017), thus the transport of insulin granules is also impaired in overexpressing cells. Confirming this fact would not influence our study conclusions, so we chose not to repeat these assays for the sake of time.
 
 c) N-numbers should be similar. The data for KIFDNmut are difficult to interpret with possibly 2 experiments showing little to no displacement and 3 showing displacement.
 
 In the revised manuscript, additional data have been added to increase N-numbers.
 
 (4) Figure 4 and supplements: The morphology of the KIFDNwt cells is greatly affected and this makes it difficult to say whether the effect on microtubules at the cell periphery is a direct or indirect effect.
 
 Yes, these cells often have a less spread appearance, obscuring visual perception of MT distribution. We have now replaced the image of the KIFDNwt cell (Figure 4, Supplemental Figure 1 A) with a more visually representative example.
 
 Things to do:
 
 (1) Notably, the authors have previously reported that high glucose-induced remodeling of microtubule networks facilitates robust glucose-stimulated insulin secretion. This remodeling involves the disassembly of old microtubules and the nucleation of new microtubules. Here, they state that the sub-membrane microtubule array is destabilized via microtubule sliding. What is the relevance of the different processes? Please discuss these in the manuscript.
 
 Thank you, we have now extended our discussion of these points and our prior findings. We have also added a schematic model figure for clarity (Figure 7).
 
 (2) 5 min appears to be a long time and enough time to polymerize a significant number of new microtubules. Do the authors have any information about the speed of MT formation in MIN6 cells? Can the authors repeat this experiment by preventing MT polymerization? Or repeat the experiment with EB1/EB3 reporter to visualize microtubule growth in the same experimental setting?
 
 While some MT polymerization will happen in this timeframe, newly polymerized MTs contain predominantly quenched “dark” tubulin molecules and only a small percent of fluorescent tubulin. These dim MTs are not included in MT sliding assay analysis, where a threshold for bright MTs is introduced. We apologize for initially omitting certain details from the FRAP assay analysis. Now these details have been added.
 
 Are the microtubules shown on the cell surface (TIRF microscopy) or do we see here all microtubules?
 
 Please see Materials and Methods for microscopy methods and image processing for each figure. Specifically, FRAP assays show a maximum intensity projection of spinning disk confocal stacks over 2.4µm in height (approximately the ventral half of a cell).
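 For readers unfamiliar with the term, a maximum intensity projection keeps, for every x-y pixel, the brightest value found across the z-planes of the stack. The following is a minimal sketch of that operation only, using plain nested lists; the function name and the toy data are hypothetical, and the actual number of planes and z-step used in the study are not stated here. With NumPy arrays the same result is obtained with `stack.max(axis=0)`.

```python
def max_intensity_projection(stack):
    """Collapse a z-stack into one 2D image.

    `stack` is a list of z-planes; each plane is a list of rows and
    each row is a list of pixel intensities. All planes are assumed
    to share the same dimensions.
    """
    if not stack:
        raise ValueError("stack must contain at least one z-plane")
    # zip(*stack) groups the rows at the same y across all planes;
    # zip(*rows) then groups the pixels at the same x across planes.
    return [[max(pixels) for pixels in zip(*rows)] for rows in zip(*stack)]


if __name__ == "__main__":
    # Two hypothetical 2x2 planes, for demonstration only.
    toy_stack = [
        [[1, 5], [0, 2]],
        [[3, 4], [7, 1]],
    ]
    print(max_intensity_projection(toy_stack))
```

 Because the projection discards depth information, a microtubule visible in the projected image may lie anywhere within the projected slab, which is why the response specifies the slab height.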
 
 (3) Previously, it has been shown that KIF5B induces tubulin incorporation along the microtubule shaft in a concentration-dependent manner. Moreover, running KIF5B increases microtubule rescue frequency and unlimited growth of microtubules. Notably, KIF5B regulates microtubule network mass and organization in cells (PMID: 34883065). Consequently, it appears possible that the here observed phenomena of changes in the microtubule network might be due to alterations in these processes. Authors need to exclude these possibilities and discuss them.
 
 Thank you for this interesting alternative idea. It does not appear to be the case for beta cells. Please see Figure 2-Supplemental Figure 2 and our response to Public Review Comment #3.
 
 (4) It is important that the authors describe in the text and possibly in the figure legends the differences between the experimental set-ups including the length of cultivation and knock down of KIF5B or expression of mutants.
 
 Thank you, please see these details in the text (Materials and Methods section).
 
 (5) Figure 5: Does KIF5B depletion rescue the kinesore-induced defects
 
 Thank you for suggesting this control. We have now conducted corresponding experiments. The answer is yes, it does. Kinesore does not induce detectable changes in MT patterns in KIF5B-depleted cells (new Figure 5-Supplemental Figure 2).
 
 (6) Can the authors block kinesin-1 resulting in microtubule accumulation in the cell center and then release the block, and best inhibiting microtubule formation, to see whether the microtubules accumulated in the cell center will be transported to the periphery?
 
 This proposed experiment would have been a nice illustration for the study; however, it has proven to be too challenging. Unfortunately, we have to leave it for future studies. However, the experiments already included in the paper are sufficient to support our conclusions.
 
 Minor comments:
 
 (1) The English needs to be improved. Often it is unclear what the authors try to convey. The manuscript is difficult to read and contains several overstatements.
 
 The revised manuscript has been through several rounds of proof-reading for clarity.
 
 (2) It is important to describe in more detail in the introduction what is known about KIF5B in beta cells. Previously, it has been demonstrated that silencing, or inactivation by a dominant negative form of KIF5B, blocks the sustained phase of glucose-stimulated insulin secretion (PMID: 9112396, PMID: 12356920, PMID: 20870970).
 
 Yes, this is of course very important and has been cited in the original manuscript. Now, we have expanded the discussion on the matter.
 
 (3) Figure 1B and Fig. 1 Suppl Fig.1: Please provide band sizes and provide information on the size of KIF5B.
 
 We have replaced Fig. 1B and Suppl Fig 1A with quantitative analysis of KIF5B depletion, now found in new Fig. 1B and Suppl Fig. 1A-C.
 
 (4) It is important to state the used glucose concentrations in Figure 1D (based on the methods section it is probably 25 mM glucose) and all subsequent experiments. Is this correct and comparable to Figure 6A or B? For the non-specialized reader, more information should be provided on why initial glucose starvation is performed.
 
 Cell culture models of pancreatic beta cells are routinely maintained at glucose levels that are considered “high”, or stimulatory for secretion. This is needed to prevent the loss of the cells’ capacity to respond to glucose stimulation over generations. In order to test GSIS, cells need to be equilibrated at low (fasting, standardly 2.8 mM) glucose levels for several hours, so that they are capable of secreting insulin upon glucose addition. 25 mM glucose is normally used to stimulate GSIS in cell culture models of beta cells, like MIN6. This is a higher concentration than what is needed to stimulate primary beta cells in islets.
 
 Reviewer #2 (Recommendations For The Authors):
 
 I have the following specific questions that pertain to data interpretation and the conclusions drawn.
 
 (1) The morphology of the overall MT array before the bleach treatment in both control cells and KIF5B-KD cells depicted in Figure 1D-F and Figure 2A-C appears to be distinct. In Figure 1, it seems that the absence of KIF5B results in a general augmentation of MT mass, whereas the arrangement presented in Figure 2 indicates the contrary. Even in the sub-membrane areas, this phenomenon appears to hold true. However, the images used in this study, which depict entire cells or a significant portion of cells, may not be ideal for visualizing the sub-membrane regions.
 
 It would be beneficial if the author could offer some explanations for this apparent inconsistency.
 
 While the beta cell population is intrinsically heterogeneous, all data presented here indicate the same trend in phenotypes. Possibly, some apparent inconsistency between Figures 1 and 2 appeared because in the original manuscript we did not show the pre-bleach whole-cell overview in Figure 1. In the revised version, we now show the whole cells for pre-bleach so that MT organization at the cell periphery can be assessed. Please note that in the control cell, MTs are more or less equally distributed over the cell, while in KIF5B depletions the cell periphery is significantly less populated than the cell center. Furthermore, we did not detect MT mass augmentation or increase in KIF5B depletions. One possible explanation for the reviewer’s impression from Figure 2 is that Figure 2 F-H shows thresholded images where the threshold was adjusted to highlight peripheral MTs in each cell. Please note that this is not the same threshold for each cell (see Figure 2 - Supplemental Figure 2 and 3). Thus, KIF5B-depleted cells that have fewer MTs at the periphery appear brighter in these thresholded images. For the true comparison of MT intensity, please see Figure 2 A-C (grayscale image, not the threshold).
 
 (2) It would be helpful if the author could provide a visual representation or comment on the sub-cellular localization of KIF5B in MIN6 cells. Is it predominantly localized in the submembrane region, or is it more evenly distributed throughout the cytoplasm?
 
 Please see Fig 2A, lower panel. KIF5B is seen across the cell as a punctate staining, in agreement with previous findings that it mostly localizes at IGs.
 
 (3) The alteration in microtubule (MT) organization and sliding in the absence of KIF5B seems to initiate in proximity to the apparent microtubule organizing center (MTOC) depicted in Figure 2A, and then "simply" extends towards the sub-membrane region. Although the authors acknowledge it, it would be advantageous for the readers to have a clearer indication that the sub-membrane microtubule (MT) reorganization in the absence of KIF5B is a result of a broader MT reorganization rather than a specific occurrence restricted to the sub-membrane regions.
 
 Thank you for this comment. We now extend our discussion to state our conclusions and interpretations of this point more clearly. We also have added a schematic Figure 7 as an illustration.
 
 (4) Regarding the "glucose experiments," it is common to add 20-25 mM glucose to culture media, but physiological concentrations of glucose typically hover around 5 mM. Therefore, it is somewhat unclear what the implications are when investigating the impact of KIF5B depletion on MT sliding at 2.8 mM of glucose. It would be helpful if the authors could provide some commentary on this matter, particularly in relation to physiological and pathological conditions.
 
 2.8 mM glucose is a standard low glucose condition used to model glucose deprivation/fasting. For functional primary beta cells within pancreatic islets, GSIS can be triggered by glucose stimulation as low as 8-12 mM glucose. However, for glucose stimulation of cultured beta cells such as MIN6 used in this paper, 20-25 mM glucose is standardly used because these cell lines have a higher threshold of stimulation compared to primary beta cells and whole islets.
 
 (5) In supplementary Figure 1A, it would be helpful if the lanes in the WB were marked indicating what is what. In my observation, it appears that Supplementary Figure 1A, particularly lanes #2, 3, and 4, display the GAPDH protein (MW 36 kDa) (or is it alpha-tubulin, as mentioned in the Material and Methods section and indicated in lane #409?) relative to Figure 1A. I am curious about KIF5B (MW 108 kDa). Is it represented by the upper band? Did the author probe the same membrane simultaneously with two different primary antibodies? This should be clarified, and the author should indicate the molecular weight of the ladder.
 
 Indeed, in the original WB two antibodies were used together, due to a challenge in collecting a sufficient number of shRNA-expressing beta cells. This caused confusion and improper interpretation of the loading control. We thank the reviewer for catching this. We have now replaced old Fig. 1B and Suppl. Fig. 1A with quantitative analysis of KIF5B depletion based on single-cell immunofluorescent staining. It is now found in new Fig. 1B and Suppl Fig. 1A-C.
 
 Reviewer #3 (Recommendations For The Authors):
 
 In all of the figures that present microtubule orientations (e.g. Figure 2E) the error bars obscure the vertical bins making them difficult to read or interpret. If they were rendered at a larger scale, it would be easier to read and interpret these results.
 
 Thank you for pointing this out. We now show these histograms with a different format of error bars and without outliers that obscure the view. A variant with outliers is now shown in the supplement.
 
 Some of the callouts to the videos in the paper are inaccurate. Perhaps the authors reordered sections of the paper but failed to correctly renumber the video citations?
 
 Thank you for this comment, we have corrected all callouts now.
 
URL

biorxiv.org/content/10.1101/2023.06.25.546468v2
www.biorxiv.org www.biorxiv.org

mirror determines the far posterior domain in butterfly wings

2
1. Public_Reviews 08 May 2025
  
  in eLife
  
  eLife Assessment
  
  This important study provides evidence of a deeply conserved role for the gene Mirror in providing positional identity in the posterior part of butterfly and fly wings, despite increased morphological complexity of butterfly wings. The findings are solid for the field of evo-devo. However, the tools in butterflies are more limited than in Drosophila and it is more difficult to determine which specific cells are mutant and whether the effect of mutation is cell-intrinsic. The work will be of interest to evolutionary and developmental biologists working on insect wing evolution and the evolution of patterning more generally.
  
  Summary
2. Public_Reviews 08 May 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This short report shows that the transcription factor gene mirror is specifically expressed in the posterior region of the butterfly wing imaginal disk, and uses CRISPR mosaic knock-outs to show it is necessary to specify the morphological features (scales, veins, and surface) of this area.
  
  Strengths:
  
  The data and figures support the conclusions. The article is swiftly written and makes an interesting evolutionary comparison to the function of this gene in Drosophila. Based on the data presented, it can now be established that mirror likely has a similar selector function for posterior-wing identity in a plethora of insects.
  
  Comments on revisions:
  
  The revision is satisfactory. I agree with the authors that this article provides interesting insights on the evolution of insect wings. Of note, butterfly and fly wing imaginal disks differ in their mode of development: while fly wing disks grow as epithelial sacs that evaginate during metamorphosis, butterfly wing disks develop as relatively flat epithelial sheets that expand and differentiate progressively. This makes the similar role of mirror all the more interesting.
  
  The revised text appropriately discusses how selector genes like mirror regionalize the wing during larval and pupal development. This article makes reasonable use of CRISPR mosaic knockouts and uses contralateral controls to show the nature of the phenotypic transformations.
  
URL

biorxiv.org/content/10.1101/2024.02.15.580576v2

Public_Reviews

Annotations: 10,000

Joined: March 17, 2021
