10,000 Matching Annotations

Aug 2025
www.biorxiv.org www.biorxiv.org

Testosterone-Induced Metabolic Changes in Seminal Vesicle Epithelial Cells Alter Plasma Components to Enhance Sperm Motility

1
1. Public_Reviews 01 Aug 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Using a combination of in vivo studies with testosterone-inhibited and aged mice with lower testosterone levels, as well as isolated mouse and human seminal vesicle epithelial cells, the authors demonstrate that testosterone induces an increase in glucose uptake. The study reveals that testosterone triggers differential gene expression, particularly focusing on metabolic enzymes. They specifically identify increased expression of enzymes regulating cholesterol and fatty acid synthesis, leading to heightened production of 18:1 oleic acid. The revised version of the manuscript significantly strengthens the role of ACLY as a central regulator of seminal vesicle epithelial cell metabolic programming. The authors suggest that fatty acids secreted by seminal vesicle epithelial cells are taken up by sperm, resulting in a positive impact on sperm function. While the lipid mixture mimicking the lipids secreted by seminal vesicle epithelial cells shows marginal positive effect on sperm motility, the authors have made considerable progress in refining their conclusions. The revised manuscript acknowledges the complexity of pinpointing the specific seminal vesicle fluid component that potentially positively affects sperm function, providing a more measured and credible interpretation of their findings.
  
  Review 2
Visit annotations in context

Tags

Review 2

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.01.16.575926v7
Jul 2025
www.biorxiv.org www.biorxiv.org

The electrogenicity of the Na+/K+-ATPase poses challenges for computation in highly active spiking cells

4
1. Public_Reviews 31 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This important study provides new insights into the lesser-known effects of the sodium-potassium pump on how nerve cells process signals, particularly in highly active cells like those of weakly electric fish. The computational methods used to establish the claims in this work are compelling and can be used as a starting point for further studies.
 
 Summary
2. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors aim to explore the effects of the electrogenic sodium-potassium pump (Na+/K+-ATPase) on the computational properties of highly active spiking neurons, using the weakly-electric fish electrocyte as a model system. Their work highlights how the pump's electrogenicity, while essential for maintaining ionic gradients, introduces challenges in neuronal firing stability and signal processing, especially in cells that fire at high rates. The study identifies compensatory mechanisms that cells might use to counteract these effects, and speculates on the role of voltage dependence in the pump's behavior, suggesting that Na+/K+-ATPase could be a factor in neuronal dysfunctions and diseases
 
 Strengths:
 
 (1) The study explores a less-examined aspect of neural dynamics-the effects of Na+/K+-ATPase electrogenicity. It offers a new perspective by highlighting the pump's role not only in ion homeostasis but also in its potential influence on neural computation.
 
 (2) The mathematical modeling used is a significant strength, providing a clear and controlled framework to explore the effects of the Na+/K++-ATPase on spiking cells. This approach allows for the systematic testing of different conditions and behaviors that might be difficult to observe directly in biological experiments.
 
 (3) The study proposes several interesting compensatory mechanisms, such as sodium leak channels and extracellular potassium buffering, which provide useful theoretical frameworks for understanding how neurons maintain firing rate control despite the pump's effects.
 
 Weaknesses:
 
 (1) While the modeling approach provides valuable insights, the lack of experimental data to validate the model's predictions weakens the overall conclusions.
 
 (2) The proposed compensatory mechanisms are discussed primarily in theoretical terms without providing quantitative estimates of their impact on the neuron's metabolic cost or other physiological parameters.
 
 Comments on revisions:
 
 The revised manuscript is notably improved.
 
 Review 1
3. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The paper by Weerdmeester, Schleimer, and Schreiber uses computational models to present the biological constraints under which electrocytes - specialized, highly active cells that facilitate electro-sensing in weakly electric fish-may operate. The authors suggest potential solutions that these cells could employ to circumvent these constraints.
 
 Electrocytes are highly active or spiking (greater than 300Hz) for sustained periods (for minutes to hours), and such activity is possible due to an influx of sodium and efflux of potassium ions into these cells after each spike. The resulting ion imbalance must be restored, which in electrocytes, as with many other biological cells, is facilitated by the Na-K pumps at the expense of biological energy, i.e., ATP molecules. For each ATP molecule the pump uses, three positively charged sodium ions from the intracellular space are exchanged for two positively charged potassium ions from the extracellular space. This creates a net efflux of positive ions into the extracellular space, resulting in hyperpolarized potentials for the cell over time. For most cells, this does not pose an issue, as their firing rate is much slower, and other compensatory mechanisms and pumps can effectively restore the ion imbalances. However, in the electrocytes of weakly electric fish, which spike at exceptionally high rates, the net efflux of positive ions presents a challenge. Additionally, these cells are involved in critical communication and survival behaviors, underscoring their essential role in reliable functioning.
 
 In a computational model, the authors test four increasingly complex solutions to the problem of counteracting the hyperpolarized states that occur due to continuous NaK pump action to sustain baseline activity. First, they propose a solution for a well-matched Na leak channel that operates in conjunction with the NaK pump, counteracting the hyperpolarizing states naturally. Their model shows that when such an orchestrated Na leak current is not included, quick changes in the firing rates could have unexpected side effects. Secondly, they study the implications of this cell in the context of chirps-a means of communication between individual fish. Here, an upstream pacemaking neuron entrains the electrocyte to spike, which ceases to produce a so-called chirp - a brief pause in the sustained activity of the electrocytes. In their model, the authors demonstrate that including the extracellular potassium buffer is necessary to obtain a reliable chirp signal. Thirdly, they tested another means of communication in which there was a sudden increase in the firing rate of the electrocyte, followed by a decay to the baseline. For this to occur reliably, the authors emphasize that a strong synaptic connection between the pacemaker neuron and the electrocyte is necessary. Finally, since these cells are energy-intensive, they hypothesize that electrocytes may have energy-efficient action potentials, for which their NaK pumps may be sensitive to the membrane voltages and perform course correction rapidly.
 
 Strengths:
 
 The authors extend an existing electrocyte model (Joos et al., 2018) based on the classical Hodgkin and Huxley conductance-based models of sodium and potassium currents to include the dynamics of the sodium-potassium (NaK) pump. The authors estimate the pump's properties based on reasonable assumptions related to the leak potential. Their proposed solutions are valid and may be employed by weakly electric fish. The authors explore theoretical solutions to electrosensing behavior that compound and suggest that all these solutions must be simultaneously active for the survival and behavior of the fish. This work provides a good starting point for conducting in vivo experiments to determine which of these proposed solutions the fish employ and their relative importance. The authors include testable hypotheses for their computational models.
 
 Weaknesses:
 
 The model for action potential generation simplifies ion dynamics by considering only sodium and potassium currents, excluding other ions like calcium. The ion channels considered are assumed to be static, without any dynamic regulation such as post-translational modifications. For instance, a sodium-dependent potassium pump could modulate potassium leak and spike amplitude (Markham et al., 2013).
 
 This work considers only the sodium-potassium (NaK) pumps to restore ion gradients. However, in many cells, several other ion pumps, exchangers, and symporters are simultaneously present and actively participate in restoring ion gradients. When sodium currents dominate action potentials, and thus when NaK pumps play a critical role, such as the case in Eigenmannia virescens, the present study is valid. However, since other biological processes may find different solutions to address the pump's non-electroneutral nature, the generalizability of the results in this work to other fast-spiking cell types is limited. For example, each spike could include a small calcium ion influx that could be buffered or extracted via a sodium-calcium exchanger.
 
 Review 2
4. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors aim to explore the effects of the electrogenic sodium-potassium pump (Na+/K+-ATPase) on the computational properties of highly active spiking neurons, using the weakly-electric fish electrocyte as a model system. Their work highlights how the pump's electrogenicity, while essential for maintaining ionic gradients, introduces challenges in neuronal firing stability and signal processing, especially in cells that fire at high rates. The study identifies compensatory mechanisms that cells might use to counteract these effects, and speculates on the role of voltage dependence in the pump's behavior, suggesting that Na+/K+-ATPase could be a factor in neuronal dysfunctions and diseases
 
 Strengths:
 
 (1) The study explores a less-examined aspect of neural dynamics-the effects of (Na+/K+-ATPase) electrogenicity. It offers a new perspective by highlighting the pump's role not only in ion homeostasis but also in its potential influence on neural computation.
 
 (2) The mathematical modeling used is a significant strength, providing a clear and controlled framework to explore the effects of the Na+/K+-ATPase on spiking cells. This approach allows for the systematic testing of different conditions and behaviors that might be difficult to observe directly in biological experiments.
 
 (3) The study proposes several interesting compensatory mechanisms, such as sodium leak channels and extracellular potassium buffering, which provide useful theoretical frameworks for understanding how neurons maintain firing rate control despite the pump's effects.
 
 Weaknesses:
 
 (1) While the modeling approach provides valuable insights, the lack of experimental data to validate the model's predictions weakens the overall conclusions.
 
 (2) The proposed compensatory mechanisms are discussed primarily in theoretical terms without providing quantitative estimates of their impact on the neuron's metabolic cost or other physiological parameters.
 
 We thank the reviewer for their concise and accurate summary and appreciate the constructive feedback on the article’s strengths and weaknesses. Experimental work is beyond the scope of our modeling-based study. However, we would like our work to serve as a framework for future experimental studies into the role of the electrogenic pump current (and its possible compensatory currents) in disease, and its role in evolution of highly specialized excitable cells (such as electrocytes).
 
 Quantitative estimates of metabolic costs in this study are limited to the ATP that is required to fuel the pump. By integrating the net pump current over time and dividing by one elemental charge, one can find the rate of ATP that is consumed by the Na+/K+pump for either compensatory mechanism. The difference in net pump current is thus proportional to ATP consumption, which allows for a direct comparison of the cost efficiency of the Na+/K+ pump for each proposed compensatory mechanism. The Na+/K+ pump is, however, not the only ATP-consuming element in the electrocyte, and some of the compensatory mechanisms induce other costs related to cell
 
 ‘housekeeping’ or presynaptic processes. We now added a section in the appendix titled
 
 ‘Considerations on metabolic costs of compensatory mechanisms’ (section 11.4), where we provide ballpark estimates for the influence of the compensatory mechanisms on the total metabolic costs of the cell and membrane space occupation. Although we argue that according these estimates, the impact of discussed compensatory mechanisms could be significant, due to the absence of more detailed experimental quantification, a plausible quantitative cost approximation on the whole cell level remains beyond the scope of this article.
 
 Reviewer #1 (Recommendations for the authors):
 
 (1) For the f-I curves in Figures 1 and 6, the firing rate increases as the input current increases. I am curious to know: (a) whether the amplitudes of the action potentials (APs) vary with increased input current; (b) whether the waveform of APs (such as in Fig. 1I) transitions into smaller amplitude oscillations at higher input currents; and (c) if the waveform does change at higher input currents, how do the "current contributions," "current," and "ion exchanges per action potential" in Figures 1HJ and 6AB respond?
 
 To fully answer these questions, we added a supplemental figure with accompanied text in section 11.1 (Fig. A1). We also added a reference to this figure in the main text (section 4.1). Here, it is shown that, as previously illustrated in [1], AP amplitude decreases when the input current increases (Fig. A1 A, left). This effect remains upon addition of either a pump with constant pump rate and co-expressed sodium leak channels (Fig. A1 A, center), or a voltage-dependent pump (Fig. A1 A, right). Interestingly, even though the shape of the current contributions (Fig. A1 B) and the APs (Fig. A1 C) look very different for low (Fig. A1 C, top) and high inputs (Fig. A1 C, bottom), the total sodium and potassium displacement per AP, and thus the pump rate, is roughly the same (Fig. A1 D). Under the assumption that voltage-gated sodium channel (NaV) expression is adjusted to facilitate fixed-AP amplitudes, however, (as in [1]) more NaV channels would be expressed in fish with higher synaptic drives. This would then result in an additional sodium influx per AP and result in higher energetic requirements per AP for electrocytes with higher firing rates (also shown in [1]).
 
 (2) Could the authors clarify what the vertical dashed line represents in Figures 1B and 1F? Does it correspond to an input current of 0.63uA?
 
 (Reviewer comment refers to Fig. 1C and 1F in new version): Yes, it corresponds to the input current that is also used in figures 1D and 1G. We clarified this by adding an additional tick label on the x-axis in 1F. The current input of 0.63uA was chosen as a representative input for this cell as follows: we first modeled an electrocyte with a periodic synaptic drive as in [1]. The frequency of this drive was set to 400 Hz, which is an intermediate value in the range of reported EODfs (and thus presumably pacemaker firing rates) of 200-600Hz [2]. Then, acetylcholine receptor currents IAChRNa and IAChRNa were summed and averaged to obtain the average input current of 0.63uA. This is now also explained in new Methods section 6.2.1.
 
 (3) What input current was used for Figures 1H, 1I, and 1J?
 
 Response: In a physiological setting, where the electrocyte is electrochemically coupled to the pacemaker nucleus, stimulation of the electrocyte occurs through neurotransmitter release in the synaptic cleft, which then leads to the opening of acetylcholine receptor channels. As figures 1H-J concern different ion fluxes, we aimed to also include currents stemming from acetylcholine receptor channels. We therefore did not stimulate the electrocyte with a constant input current as in Fig. 1C and F, but simulated elevated constant neurotransmitter levels in the synaptic cleft, which then leads to elevated acetylcholine receptor currents. In the model, this neurotransmitter level, or ‘synaptic drive’ is represented by parameter synclamp. A physiologically relevant value for synclamp was deduced by averaging the synaptic drive during a 400 Hz pacemaker stimulus. This is now also explained in new Methods section 6.2.1.
 
 (4) In Figure 4A, there is a slight delay between the PN spikes (driver) and the EO (receiver), and no EO spikes occur without PN spikes. However, the firing rate of EO (receiver) appears to decrease before the chirp initiations in Fig 4B; and this delay seems to disappear in Fig 4C. Could the authors explain these observations?
 
 As shown in the bottom right of figure 4A, when plotting the instantaneous firing rate as one over the inter-spike-interval (1/ISI), the firing rate of a cell is only plotted at the end of every ISI. Therefore, even though the PN drives the electrocyte and thus spikes earlier in time than the electrocyte, when it initiates chirps, these will only be plotted as an instantaneous firing rate at the end of the chirp. If the electrocyte fires spontaneously within this chirp, its instantaneous firing rate will appear earlier in time than the initiation of the chirp of the PN. The PN did, however, initiate the chirp before that and causality between the PN and electrocyte is not disturbed.
 
 (5) Regarding Figure 6, could the authors specify the input current used in Figures 6A and 6B?
 
 Figure 6A and 6B have the same synaptic drive as Fig. 1 H, I and J (synclamp=0.13).
 
 (6) In Section 6, I would recommend that the authors provide a table of parameters and their corresponding values for clarity.
 
 Thank you for your suggestion. We now reorganized the method section and added two tables with parameters for clarity. Table 1 (see Methods 6.1) includes all parameters that differ from the parameters reported in [1], and parameters that arise from the additionally modeled equations to simulate ion concentration dynamics and pump. We also added the parameters used to simulate the different stimulus protocols (and corresponding tuned parameters) that are presented in the article in Table 2 (see Methods 6.2).
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The paper 'The electrogenicity of the Na+/K+-ATPase poses challenges for computation in highly active spiking cells' by Weerdmeester, Schleimer, and Schreiber uses computational models to present the biological constraints under which electrocytes-specialized highly active cells that facilitate electro-sensing in weakly electric fish-may operate. The authors suggest potential solutions these cells could employ to circumvent these constraints.
 
 Electrocytes are highly active or spiking (greater than 300Hz) for sustained periods (for minutes to hours), and such activity is possible due to an influx of sodium and efflux of potassium ions into these cells for each spike. This ion imbalance must be restored after each spike, which in electrocytes, as with many other biological cells, is facilitated by the Na-K pumps at the expense of biological energy, i.e., ATP molecules. For each ATP molecule the pump uses, three positively charged sodium ions from the intracellular space are exchanged for two positively charged potassium ions from the extracellular volume. This creates a net efflux of positive ions into the extracellular space, resulting in hyperpolarized potentials for the cell over time. This does not pose an issue in most cells since the firing rate is much slower, and other compensatory mechanisms and other pumps can effectively restore the ion imbalances. In electrocytes of weakly electric fish, however, that operate under very different circumstances, the firing rate is exceptionally high. On top of this, these cells are also involved in critical communication and survival behaviors, emphasizing their reliable functioning.
 
 In a computation model, the authors test four increasingly complex solutions to the problem of counteracting the hyperpolarized states that occur due to continuous NaK pump action to sustain baseline activity. First, they propose a solution for a well-matched Na leak channel that operates in conjunction with the NaK pump, counteracting the hyperpolarizing states naturally. Additionally, their model shows that when such an orchestrated Na leak current is not included, quick changes in the firing rates could have unexpected side effects. Secondly, they study the implication of this cell in the context of chirps - a means of communication between individual fishes. Here, an upstream pacemaking neuron entrains the electrocyte to spike, which ceases to produce a so-called chirp - a brief pause in the sustained activity of the electrocytes. In their model, the authors show that it is necessary to include the extracellular potassium buffer to have a reliable chirp signal. Thirdly, they tested another means of communication in which there was a sudden increase in the firing rate of the electrocyte followed by a decay to the baseline. For reliable occurrence of this, they emphasize that a strong synaptic connection between the pacemaker neuron and the electrocyte is warranted. Finally, since these cells are energy-intensive, they hypothesize that electrocytes may have energyefficient action potentials, for which their NaK pumps may be sensitive to the membrane voltages and perform course correction rapidly.
 
 Strengths:
 
 The authors extend an existing electrocyte model (Joos et al., 2018) based on the classical Hodgkin and Huxley conductance-based models of Na and K currents to include the dynamics of the NaK pump. The authors estimate the pump's properties based on reasonable assumptions related to the leak potential. Their proposed solutions are valid and may be employed by weakly electric fish. The authors explore theoretical solutions that compound and suggest that all these solutions must be simultaneously active for the survival and behavior of the fish. This work provides a good starting point for exploring and testing in in vivo experiments which of these proposed solutions the fish use and their relative importance.
 
 Weaknesses:
 
 The modeling work makes assumptions and simplifications that should be listed explicitly. For example, it assumes only potassium ions constitute the leak current, which may not be true as other ions (chloride and calcium) may also cross the cell membrane. This implies that the leak channels' reversal potential may differ from that of potassium. Additionally, the spikes are composed of sodium and potassium currents only and no other ion type (no calcium). Further, these ion channels are static and do not undergo any post-translational modifications. For instance, a sodium-dependent potassium pump could fine-tune the potassium leak currents and modulate the spike amplitude (Markham et al., 2013).
 
 This model considers only NaK pumps. In many cell types, several other ion pumps/exchangers/symporters are simultaneously present and actively participate in restoring the ion gradients. It may be true that only NaK pumps are expressed in the weakly electric fish Eigenmannia virescens. This limits the generalizability of the results to other cell types. While this does not invalidate the results of the present study, biological processes may find many other solutions to address the non-electroneutral nature of the NaK pump. For example, each spike could include a small calcium ion influx that could be buffered or extracted via a sodium-calcium exchanger.
 
 Finally, including testable hypotheses for these computational models would strengthen this work.
 
 We thank the reviewer for the detailed summary and the identified weaknesses according to which we improved our article. Our model assumptions and simplifications are now mentioned in more detail in the introduction of the article (section 3), and justified in the Methods (section 6.1).
 
 Furthermore, we added a discussion section (section 5.1) where we outline the conditions under which the present study can be extended to other cell types. We now also state more clearly that the pump current will be present for any excitable cell with significant sodium flux (assuming that the NaK pump carries out the majority of its active transport), but that compensatory mechanisms (if employed at all in a particular cell) could also be implemented via other ionic currents and transporters. We furthermore now highlight the testable hypotheses that we put forward with our computational study on the weakly electric fish electrocyte more explicitly in the first paragraph of the discussion.
 
 Reviewer #2 (Recommendations for the authors):
 
 Main text
 
 Please explicitly state this model's assumptions in the introduction and elaborate on them in the discussion if necessary. For example, some assumptions that I find relevant to mention are: - The Na and K channels are classic HH conductance-based channels, with no post-translational modifications or beta subunit modifications as seen in other high-frequency firing cells (10.1523/JNEUROSCI.23-12-04899.2003).
 
 Neither calcium nor chloride ions are considered in the spike generation. Nor are Na-dependent K channels (10.1152/jn.00875.2012).
 
 Only the Na-K pump (and not the Na-Ca exchanger, Ca-pump, or Cl pumps) is modeled,
 
 Calmodulin, which can buffer calcium, is highly expressed in electric eels, but it is not considered. If some of these assumptions have valid justifications in weakly electric fish electrocytes, please state so with the citations. I recognize that including these in your models is beyond the scope of the current paper.
 
 We thank the reviewer for pointing out this issue. We now specified in the introduction that the model only contains sodium and potassium ions and only classic HH conductance-based channels. We there also explicitly specify the details on the Na+/K+-ATPase: it is the only active transporter in this model, thus solely responsible for maintaining ionic homeostasis; its activity is only modulated by intracellular sodium and extracellular potassium concentrations. In the discussion (6.1), we now elaborate on how ion-channel-related aspects (i.e., the addition of resurgent Na+ or Na+ -dependent K+ channels), additional ion fluxes (including some not relevant for the electrocyte but for other excitable cells), and additional active transporters and pumps would influence the results presented in the article.
 
 In addition, there might be other factors that the authors and the reviewers have yet to consider. The model is a specific case study about the weakly electric fish electrocyte with high-frequency firing. It is almost guaranteed that biology will find other compensatory ways in different cell types, systems, and species (auditory nerve, for example). Given this, it would be prudent to use phrases such as 'this model suggests,' 'perhaps,' 'could,' 'may,' and 'eludes to,' etc., to accommodate other possible solutions to ion homeostasis in rapidly spiking neurons. The solutions the authors are proposing are some of many.
 
 We rephrased some of the statements to highlight more the hypothetical nature of the compensatory mechanisms in specific cells and to draw attention to the fact that there can be many more such factors. This fact is now also explicitly mentioned in discussion section 5.2.
 
 Figures
 
 Some of my comments on the figures are stylistic, others are to improve clarity, and some are critical for accuracy.
 
 The research problem concerns weakly electric fish E. virescens. I suggest introducing a picture of an electric fish in the beginning (such as that in Figure 3, but not exactly; see specific comments on this fish figure) along with a schema of the research question.
 
 We agree, and added an overview schema in Fig. 1A.
 
 Font sizes change between the panels in all the figures. Please maintain consistency. The figure panel titles and axis labels should start with a capital letter.
 
 Thank you for pointing this out, both issues have been resolved in the new version of the article.
 
 Figure 1:
 
 Please rearrange the figure - BCFG belong together and should appear in the same order. The x-axis labels could be better placed.
 
 Consider using fewer pump current f-I curves (B, D, E, F). Five is sufficient to make the point. Having 10 curves adds to the clutter. The placement of the color bar could be better. Similarly, the placement of the panel titles 'without co-expression' and 'with co-expression' and the panel labeling (BCFG) makes it confusing. The panel labels should be above the panel title.
 
 Response (C, D, F, G in new version): We improved the layout of figure 1. Panels B, C, F, G are now C, D, F, G. We opted to include panel E before panels F and G, because it shows the coexpression mechanism before its effect on the tuning curve. We did move the colorbar, added x-axis labels to B and C, and adjusted the location of the panel labels for clarity. We also plotted fewer pump currents.
 
 B, F: What does the dashed line indicate?
 
 Response (C, F in new version): The dashed line indicates the input current that was used in figures 1D and 1G. We now clarified this by adding this value on the x-axis.
 
 C: Any reason not to show the lower firing rates?
 
 Response (B in new version): In the previous version of the article, pump currents were estimated for electrocytes that were stimulated with the mean synaptic drive that stems from periodic stimulation in the 200-600 Hz regime. We now extended the range of synaptic inputs to obtain lower (and higher) firing rates. The linear relationship between firing rate and pump current also holds for these additional firing rates.
 
 D: There is no difference between the curves at the top and the bottom. One fills the area between the curve and the zero line; the other shows the curve itself. Please use only one of the two representations.
 
 Response (panel I in new version): In the previous version, the difference between the plots was that one showed the absolute values of the currents (the curves), and the other plot showed the contributions of the currents to the total (area between the curves). We now only depict the current contributions.
 
 The I and H orders can be swapped.
 
 Thank you, they are now swapped.
 
 The colors used for Na and K are very dull (light blue and pink).
 
 We now use darker colors in the new version of the article.
 
 Figure 2:
 
 Please verify that without the synaptic input perturbations (i.e., baseline in A, D), the firing rate (B, E) and pump current (C, F) converge to the baseline. There is a noticeable drift (downward for firing rate and upward for pump currents) at the 10-second time point.
 
 Thanks to you noticing, we identified a version mismatch in the code that estimates the pump current required for ionic homeostasis (see Methods 6.1.2). We have now corrected the code and made sure to start the simulation in the steady state so that there is no drift at baseline firing. We also used this corrected code to present tuned parameters for different stimulus protocols in Table 2 (Methods 6.2).
 
 Figure 3:
 
 A. The dipole orientation with respect to the fish in panel B needs to be corrected. Consider removing this as this work is not about the dipole.
 
 This panel has been removed.
 
 B. This figure has already been overused in multiple papers; please redraw it. Localized expressions of different pumps and ion channels are present within each electrocyte, which generates the dipole. Either show this correctly or don't at all (the subfigure pointed out by the red arrow).
 
 This panel has been moved to Fig. 1A. We opted to remove the localized expressions.
 
 C and D belong together; please place them next to each other. Consider introducing panel D first since it follows a similar protocol to the last figure.
 
 Response (A in new version): Panel placement has been adjusted. We opted to maintain the order to maintain the flow of the text, but we do now combine them in one panel.
 
 E and F are very similar in that they are swapped on the x and y axes. Either that or I have severely misunderstood something, in which case it needs to be shown better.
 
 Response (B and C in new version): We adjusted the placement of these panels. They are not the same, panel B shows the mean of physiological periodic inputs, and figure C shows that when this mean is fed to the electrocyte, it also induces tonic firing. The range of mean currents that result from periodic synaptic stimulation in the physiological regime (panel B, y-axis) is now indicated in panel C by a grey box along the x-axis.
 
 G. Why show the lines with double arrow ends? The curves are diverging - that's enough.
 
 Good point, we updated this panel accordingly (now panel D).
 
 Figure 4
 
 Please verify the time units in these plots. Something seems amiss. B and D lower plots-perhaps this is seconds? B could use an inset box/ background gray color (t1, t2) indicating the plots of the C panel (left, right). Likewise, for D (t1, t2), connect to E (left, right).
 
 You are right, the x-axes were supposed to be in seconds, we updated this. We indicated the relations between D-C and D-E by gray backgrounds and by adding the corresponding panel label on the x-axis.
 
 A: Indicate the perturbation in the schematic, i.e., extracellular K buffer.
 
 The perturbation is now indicated.
 
 D: Even with the extracellular K buffer, there is a decay (slower than in B) of the pump current over time. Please verify (you do not have to show in your paper) that this decay saturates.
 
 After the ten chirps are initiated, pacemaker firing goes back to baseline. In both cases (panel B and panel D), the pump current goes back to baseline after some time. With extracellular potassium buffering, this happens more slowly due to a decreased reaction speed of the pump to changes in firing rate (in comparison to the case without extracellular potassium buffer).
 
 The decrease in reaction speed however merely delays the effects of changes in firing rates on the pump current in time. Therefore, even with an extracellular potassium buffer, when more chirps are initiated in a short period of time, the pump current can still decrease to an extent that impairs entrainment. Using the same protocol as in panel B and D, we increased the number of chirps and found that with an extracellular potassium buffer, a maximum of 13 chirps could be encoded without entrainment failure (as opposed to 2 chirps without the buffer as shown in panel B).
 
 Figure 5
 
 Please verify the time units in these plots, as for Figure 4. B and E lower plots-perhaps this is seconds? B could use an inset box/ background gray color (t1, t2) indicating the plots of the panels C and D. Likewise, for E (t1, t2), connect to F and G.
 
 The time axis in this figure was indeed also in seconds, which we corrected here. The relations between plots B-C/D and E-F/G are now indicated through gray backgrounds and corresponding panel references on the x-axis.
 
 A: Indicate the perturbation in the schematic, i.e., the synapse's strength. There is no need to include the arrow or to mention freq. rise. The placement of the time scale can be misinterpreted as a current clamp. Instead, plot it as a zoomed inset.
 
 The arrow is removed and we now also show a zoomed inset. Also, the perturbation is now indicated.
 
 E: Verify that the pump current in the strong synapse case already starts at 1.25
 
 We verified this and noticed that the pump current in the strong synapse case is indeed lower than that in the weak synapse case. This is because to ensure a fair comparison for this stimulation protocol, voltage-gated sodium channel conductance was tuned to maintain a spike amplitude of 13 mV in both cases (see Methods 6.2). In this case, a weak synapse leads to a lower influx of sodium via AChR channels, but a higher influx via voltage-gated sodium channels. The total sodium influx in this case is larger than that for a stronger synapse with relatively less voltage-gated sodium currents, and thus a larger pump current. In the previous version of the article, this was wrongly commented on in the figure captions, and we removed the erroneous statement.
 
 This is not critical, but because the R-value here can be obtained as a continuous value, it would be appropriate to show it for the whole duration of the weak and strong synapses in B and E. Maybe consider including a schema that shows how R is calculated in panel A.The caption has a typo, 'during frequency rises before (D) and after (E)'. It should be before C) and after (D) instead.
 
 The caption typo has been corrected. The R-value for the whole duration of the weak and strong synapses in B and E is 1.000. This is because the R-value is the variance of all phase relations between the PN and the electrocyte, and for the entire duration of the stimulus protocol, there are only a few outliers in phase relations at the maxima of the frequency rises. We decided to include this R-value to show that in general, synchronization between the PN and the electrocyte is very stable. The schema that explains how R is calculated has not been included in favor of not overcrowding the figure. We did add a reference in the figure caption to the methods section in which the calculation of R is explained.
 
 Figure 6:
 
 A: The top and bottom plots are redundant. Use one of the two. They show the same thing. It may be better to plot Na, K, pump, and net currents on the top panels and the Na leak, which is of smaller magnitude, in a different panel.
 
 We now only show current contributions.
 
 B: Please change the color schema. It is barely visible on my prints.
 
 D: Pump current, instantaneous case, is barely visible
 
 Color schemes were adjusted.
 
 Figure A1: It's all good.
 
 Methods:
 
 Please provide some internal citations for where specific equations were used in the results/figures. You do this for sections 6.2.3, referencing Figure 5 (c,d,e,g), and 6.2.4, referencing Fig 5 C-E.
 
 There are now internal references in each methods section to where in the figures they were used. We also included a table with stimulus parameters for each figure with a stimulus protocol (Table 2).
 
 Also, the methods could be ordered in the same order as the results are presented. Please consider if some details in the methods could be moved to the appendix.
 
 The ordering of the methods has now been changed to separately explain the model expansions (6.1) and the stimulus protocols (6.2). Both sections are in corresponding order of the figures presented in the article. We opted to maintain all details in the methods.
 
 6.1.1 Please cite 26 after the first line. Where was this used? In Figure 3C, 4, 5?
 
 We added the citation. The effects of co-expressed leak channels are shown in Fig. 1 EG, and were used to compensate for pump currents at baseline firing in figures 1 D, H-J (left, with pump), 2, 4, 5, and 6 A-B (left), C (top). This is now also added to the text for clarity.
 
 Traditionally (Hodgkin, A. L. and Huxley, A. F. (1952). J. Physiol. (Lond.), 117:500-544. Table 3; & Hodgkin, A. L. and Huxley, A. F. (1952). J. Physiol. (Lond.), 116:473-496 Table 5 and the paragraph around it), leak potential is set such that it accounts for all leak from all ions. While in your work, this potential is equal to the reversal of potassium - it need not be so in the animal. There may be leaks from other ions as well, particularly sodium and chloride. Please verify that assuming the leak reversal is the same as that of potassium (Ek, in Equation 3) does not lead to having to model Na leak currents separately.
 
 In the original model [1], it was assumed that the reversal potential of the leak was the same as that of potassium, which contains the implicit assumption that only potassium ions contribute to the leak. In our article, we also assume that sodium ions contribute to the leak. This can be modeled by adjusting the leak reversal potential accordingly, or by adding an additional leak current that solely models the sodium leak. We opted for the latter in order to track all sodium and potassium ions separately so that ion concentration dynamics could also be modeled properly. Chloride ions were neglected in this study; in our model they do not contribute to the leak. If one were to also model chloride currents and chloride concentration dynamics, it would be beneficial to model these as an additional separate leak current.
 
 The notation of I_pump_0 needs to be more convenient. Please consider another notation instead of the _0 (pump at baseline). Similarly for [Na+]_in_0 [Na+]_out_0 and [K+]_in_0 and [K+]_out_0
 
 We changed the notation for baseline similarly to [3], with ‘0’ as a superscript instead of a subscript.
 
 Equation 11: Please mention why AChRs do not let calcium ions through. Please cite a justification for this. If this is an assumption of the model, please state this explicitly.
 
 The AChR channels that were found in the E. virescence electrocytes are muscle-type acetylcholine nicotinic receptors [4], which are non-selective cation channels that could indeed support calcium flux [5]. No calcium currents were, however, modeled in the original electrocyte model [1], presumably due to the lack of significant contributions of calcium currents or extracellular calcium concentrations to electrocyte action potentials of a similar weakly electric electrogenic wave-type fish Sternopygus macrurus [6].
 
 Due to the lack of calcium currents in the original electrocyte model, and due to the limitation of this study to sodium and potassium ions, we chose not to include calcium currents stemming from AChR channels. This assumption is now explicitly stated in Methods 6.1.
 
 Equation 12, V_in, where the intracellular volume. If possible, avoid the notation of 'V' - you already use a small v for membrane potential.
 
 We changed the notation for volume to ‘ω’ similarly to [3]. As we previously used ω as a notation for the firing rate, we changed the notation for firing rate to ‘r’.
 
 Equation 17: Does this have any assumptions? Would the I_AchRNa, and thus Sum(mean(I_Na))) not change depending on the synaptic drive?
 
 The assumptions of this equations are the following (now also mentioned in Methods 6.1.2):
 
 The sum of all sodium currents also includes sodium currents through acetylcholine channels (I_AChRNa).
 
 All active sodium transport (from intra- to extracellular space) is carried out by the Na+/K+-ATPase, and active sodium transport through additional transporters and pumps is negligible.
 
 The time-average of sodium currents is either taken in a tonic firing regime where the timeinterval that is averaged over is a multiple of the spiking period, nT, or if it is taken for a more variable firing regime, the size of the averaging window should be sufficiently large to properly sample all firing statistics.
 
 Under these assumptions, Eq. 17 can be used to compute suitable pump currents for different synaptic drives (as Sum(mean(I_Na))) and thus I_pump0 indeed change with the synaptic drive, see Table 2 in Methods 6.2).
 
 6.2: Please rewrite the first sentence of this paragraph.
 
 The first sentence of this paragraph, which has been moved to section 6.2.2 for improved structuring of the text, has been rewritten.
 
 6.2.1: The text section could use a rewrite.
 
 Please elaborate on what t_p is. If it is not time, please do not use 't.' What is p here? What are the units of the equation (22), t_p < 0.05 (?)
 
 This section has now also been moved to 6.2.2. It has been rewritten to improve clarity and t_p has been renamed to t_pn (as it does reflect time, which is now better explained). The units have now also been added to the equation (which is now Eq. 26).
 
 6.2.4: Please rewrite this.
 
 This section has been rewritten (and has been moved to section 6.1.4).
 
 Bibliography
 
 Some references are omitted (left anonymous) or inconsistent on multiple occasions.
 
 Thank you for pointing this out! It is now rectified.
 
 References used for author response
 
 (1) Joos B, Markham MR, Lewis JE, Morris CE. A model for studying the energetics of sustained high frequency firing. PLOS ONE. 2018 Apr;13:e0196508.
 
 (2) Hopkins CD. Electric communication: Functions in the social behavior of eigenmannia virescens. Behaviour. 1974;50(3-4):270–304.
 
 (3) Hübel N, Dahlem MA. Dynamics from seconds to hours in hodgkin-huxley model with time-dependent ion concentrations and buer reservoirs. PLoS computational biology.ff2014;10(12):e1003941.
 
 (4) BanY, Smith BE, Markham MR. A highly polarized excitable cell separates sodium channels from sodium-activated potassium channels by more than a millimeter. Journal of neurophysiology. 2015; 114(1):520–30.
 
 (5) Vernino S, Rogers M, Radcliffe KA, Dani JA. Quantitative measurement of calcium flux through muscle and neuronal nicotinic acetylcholine receptors. Journal of Neuroscience. 1994;14(9):5514-5524.
 
 (6) Ferrari M, Zakon H. Conductances contributing to the action potential of sternopygus electro-cytes. Journal of Comparative Physiology A. 1993;173:281–92.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.09.24.614486v4
www.biorxiv.org www.biorxiv.org

Deletion of sulfate transporter SUL1 extends yeast replicative lifespan via reduced PKA signaling instead of decreased sulfate uptake

4
1. Public_Reviews 31 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study offers a valuable contribution to the understanding of how inorganic nutrient transporters, particularly SUL1, influence yeast lifespan through signaling pathways rather than transport functions. The findings suggest a novel link between SUL1 deletion and extended replicative lifespan, supported by transcriptomic and stress-response data. However, the strength of the evidence remains incomplete, with key experiments-such as sulfate supplementation tests, functional autophagy validation, and transport assays-either missing or insufficiently described. As a result, while the manuscript presents promising insights, additional work is needed to robustly support its conclusions.
 
 Summary
2. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The manuscript by Long et al. focused on SUL1, a gene encoding a sulfate transporter with signaling roles in yeast. The authors claim that the deletion of SUL1, rather than SUL2 (encoding a similar transporter), extended yeast replicative lifespan independent of sulfate transport. They also show that SUL1 loss-of-function mutants display decreased PKA activity, indicated by stress-protective carbohydrate accumulation, relevant transcription factor relocalization (measured during aging in single cells), and changes in gene expression. Finally, they show that loss of SUL1 increases autophagy, which is consistent with the longer lifespan of these cells. Overall, this is an interesting paper, but additional work should strengthen several conclusions, especially for the role of sulfate transport. Specific points include the following:
 
 What prompted the authors to measure the RLS of sul1 mutants? Prior systematic surveys of RLS in the same strain background (which included the same sul1 deletion strain they used) did not report lifespan extension in sul1 cells (PMID: 26456335).
 
 Cells carrying a mutant Sul1 (E427Q), which was reported to be disrupted in sulfate transport, did not have a longer lifespan (Figure 1), leading them to conclude that "lifespan extension by SUL1 deletion is not caused by decreased sulfate uptake". They would need to measure sulfate uptake in the mutants they test to draw that conclusion firmly.
 
 Related to my previous point, another simple experiment would be to repeat the assays in Figure 1 with exogenous sulfur added to see if the lifespan extension is suppressed.
 
 There needs to be more information in the text or the methods about how they did the enrichment analysis in Figure 2B. P-values are typically insufficient, and adjusted FDR values are reported from standard gene ontology platforms (e.g., PANTHER).
 
 It is somewhat puzzling that relocalization of Msn2 was not seen in very old cells (past the 17th generation), but it was evident in younger cells. The authors could consider another possibility, that it was early and midlife experiences that made those cells live longer. Past that window, loss of Sul1 may have no impact on longevity. A conditional shutoff system to regulate SUL1 expression would be needed to test the above, albeit this is probably beyond the scope of this report.
 
 The connections between glucose restriction, autophagy, and sul1 (Figure 4) could be further tested by measuring the RLS of sul1 cells in glucose-restricted cells. If RLS is further extended by glucose restriction, then whatever effects they see should be independent of glucose restriction.
 
 They made and tested the double (sul1, msn2) mutants, but they should also test the sul1, msn4 combination since Msn4 functions similarly to Msn2.
 
 Comments on revisions:
 
 Overall, this is a somewhat improved manuscript, but some prior concerns about the validity of the conclusions remain unresolved.
 
 Review 1
3. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In this study, the authors find that deletion of a sulfate transporter in yeast, Sul1, leads to extension of replicative lifespan. They investigate mechanisms underlying this extension, and claim that the effects on longevity can be separated from sulfate transport, and are instead linked to a previously proposed transceptor function of the Sul1 transporter. Through RNA sequencing analysis, the authors find that Sul1 loss triggers activation of several stress response pathways, and conclude that deletion of two pathways, autophagy or Msn2/4, partially prevents lifespan extension in cells lacking Sul1. Overall, while it is well-appreciated that activation of Msn2/4 or autophagy is beneficial for lifespan extension in yeast, the results of this study would add an important new mechanism by which this could achieved, through perceived sulfate starvation. However, as described below, several of the experiments utilized to support the authors conclusion are not experimentally sound, and significant additional experimentation is required to support the authors claims throughout the manuscript.
 
 Strengths:
 
 The major strength of the study is the robust RNA-seq data that identified differentially expressed genes in cells lacking Sul1. This facilitated the authors focus on two of these pathways, autophagy and the Msn2/4 stress response pathway.
 
 Weaknesses:
 
 Several critical experimental flaws need to be addressed by the authors to more rigorously test their hypothesis.
 
 (1) The lifespan assays throughout the manuscript contain inconsistencies in the mean lifespan of the wild type strain, BY4741. For example, in Figure 1A, the lifespan of BY4741 is 24.3, and the extended lifespan of the sul1 mutant is 31. However, although all mutants tested in Figure 1B also have lifespans close to 30 cell divisions, the wild type control is also at 30 divisions in those experiments as well. This is problematic, as it makes it impossible to conclude anything about the lifespan extension of various mutants with the inconsistencies in the wild type lifespan. Additionally, the mutants analyzed in 1B are what the authors use to claim that loss of the transporter does not extend lifespan through sulfate limitation, but instead through a signaling function. Thus, it remains unclear whether loss of sul1 extends lifespan at all, and if it does, whether this is separable from cellular sulfate levels.
 
 (2) While the authors use mutants in Figure 1 that should have differential effects on sulfate levels in cells, the authors need to include experiments to measure sulfate levels in their various mutant cells to draw any conclusions about their data.
 
 (3) Similar to point 2, the authors focused their RNA sequencing analysis on deletion of sul1 and did not include important RNA seq analysis of the specific Sul1 mutation or other mutants in Figure 1B that do not exhibit lifespan extension. The prediction is that they should not see activation of stress response pathways in these mutants as they do not see lifespan extension, but this needs to be tested.
 
 (4) While the RNA-seq data is robust in Figure 2 as well as the follow up quantitative PCR and trehalose/glycogen assays in 2A-B, the follow-up imaging assays for Msn2/4 localization in Figure 2 are not robust and are difficult to interpret. The authors need to include more high-resolution imaging or at least a close up of the cells in Figure 3C.
 
 (5) The autophagy assays utilized in Figure 4 appear to all be done with a C-terminal GFP-tagged Atg8 protein. As C-terminal GFP is removed from Atg8 prior to conjugation to phosphatidylethanolamine, microscopy assays of this reporter cannot be utilized to report on autophagy activity or flux. Instead, the authors need to utilize N-terminally tagged Atg8, which they can monitor for vacuole uptake as an appropriate readout of autophagy levels. As it stands, the authors cannot draw any conclusions about autophagy activity in their studies.
 
 Comments on revisions:
 
 Their autophagy conclusions are weak at best. As was highlighted in the previous review, they need to use an N-terminal Atg8 fusion for these experiments.
 
 Review 2
4. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 In the revised manuscript, Long et al., showed that sul1∆ mutants have extended replicative lifespan in budding yeast. In comparison, other mutants that have sulfate transport deficiency did not show extended lifespan, suggesting SUL1 deletion extends lifespan independently of sulfate intake. The authors then explored the transcriptome of sul1∆ mutants by RNA-seq, which suggests that SUL1 deletion impacts common longevity pathways. Furthermore, the authors characterized how the PKA pathway is affected in sul1∆ mutants: SUL1 deletion promotes the nuclear localization of Msn2, as well as autophagy, indicating down-regulation of the PKA pathway.
 
 Strengths:
 
 This study raised an interesting point that inorganic transporters may impact cellular stress response pathways and affect lifespan. Some of the characterizations on the sul1∆ mutants, including the RNA-seq and MSN2 localization could provide valuable sources for people in related fields. Compared with the previous version, the writing is significantly improved, making the manuscript clearer.
 
 Weaknesses:
 
 Several critical flaws have not been revised. The claims are still not well supported by the data.
 
 (1) The revised manuscript still uses Atg8-EGFP, in which GFP is likely tagging at the C-terminus of Atg8. No strain information was provided for this strain, so it is unclear whether it is N- or C- terminal tagged. As pointed by reviewers of the previous version, C-terminal tagged Atg8 is not functional. As a result, the conclusions on autophagy (Figure 4) is questionable.
 
 (2) The nuclear localization of Msn2 is much more convincing after the authors updated Figure 3C. However, the rest of the microscopy images (e.g. Figure 3E, 4B, 4E) are still of low resolution. Again, I suggest to separate the DIC and GFP channels. It is really hard to tell where is the GFP signal from these figures.
 
 (3) In the Kankipati et al. 2015 paper, which is cited by the authors, SUL1E427Q is incorporated on a pRS316 (URA3) plasmic and expressed in sul1∆sul2∆ mutants. In this manuscript, the authors used SUL1E427Q mutants but did not give detailed information on how this construct is expressed. Is it endogenously mutated, incorporated into somewhere in the genome, or expressed from an extrachromosomal plasmid? In Figure 1B, they simply used BY4741 as a control for the SUL1E427Q mutant. This makes me thinking they are using a SUL1E427Q endogenous point mutation mutant. If so, the authors may want to include the information about this strain in their Supplementary table. Or if it is expressed from an extra copy on chromosomes or extrachromosomal plasmids, the authors would need to express this construct in sul1∆ mutant. In this case, the authors may want to use sul1∆ and sul1∆+empty vector as controls, instead of BY4741. As the authors mentioned in their rebuttal letter, lifespan experiments vary between each individual trials and are not comparable between different trials. Thus proper controls are essential to make the results convincing.
 
 (4) As suggested by reviewers of the previous version, the authors tested the sulfate uptake in different mutants within 10 minute of Na2SO4 addition (Figure 1B). The authors concluded from the data that wild type takes up sulfate faster than the mutants but they reach similar concentrations at the end point (as fast as 10 minutes). Are all these cells sulfate-starved before the experiment? If not, the experiment might be affected by the basal level of sulfate in each mutants.
 
 Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.11.11.566697v2
www.biorxiv.org www.biorxiv.org

An Evaluation of the Tumor Microenvironment through CALR, IL1R1, IFNB1, and IFNG to Assess Prognosis and Immunotherapy Response in Bladder Cancer Patients

3
1. Public_Reviews 31 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents useful findings that explore the prognostic and immunotherapeutic relevance of specific immune-related genes (CALR, IL1R1, IFNB1, and IFNG) in the bladder cancer tumor microenvironment. While the analysis highlights potentially meaningful associations with survival and treatment response, the strength of evidence is incomplete, as some claims lack sufficient experimental or mechanistic validation. Further refinement and validation of the predictive models would enhance the impact and generalizability of the conclusions.
 
 Summary
2. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The authors aimed to explore the prognostic and therapeutic relevance of immunogenic cell death (ICD)-related genes in bladder cancer, focusing on a risk-scoring model involving CALR, IL1R1, IFNB1, and IFNG. The research indicates that higher expression of certain ICD-related genes is associated with enhanced immune infiltration, prolonged survival, and improved responsiveness to PD1-targeted therapy in bladder cancer patients.
 
 Major strengths:
 
 • The establishment of an ICD-related gene risk model based on publicly available datasets (TCGA and GEO) and further validated through tissue arrays and preliminary single-cell RNA sequencing data provides potential but weak clinical guidance.
 
 • The integration of multi-dimensional data (gene expression, mutation burden, immune infiltration, and treatment responses) strengthens the clinical applicability of the model.
 
 Key limitations and concerns:
 
 (1) Gene Selection and Novelty:
 
 The selection of genes predominantly reflects known regulators of immune responses, somewhat limiting the novelty. Exploring less-characterized ICD markers or extending validation beyond bladder cancer could improve the model's innovative aspect and wider clinical relevance.
 
 (2) Reliance on RNA-Seq for Immune Infiltration:
 
 Immune infiltration analyses based primarily on bulk RNA-Seq data have inherent methodological limitations, such as inability to distinguish cell subsets accurately. Incorporation of robust single-cell sequencing would significantly enhance the reliability of these findings. Although the authors recognize this limitation, future studies should directly address it.
 
 (3) Drug Sensitivity and Immunotherapy Response Data:
 
 While the authors clarify that the drug sensitivity analysis was performed using established databases (TCGA via pRRophetic), the unexpected correlations between ICD-related genes and various targeted therapies need further mechanistic validation. The observed relationships may reflect indirect associations rather than direct biological relevance, which warrants cautious interpretation.
 
 (4) Presentation and Clarity Issues:
 
 Initially noted formatting inconsistencies across figures compromised professional presentation; these have been corrected by the authors. Additionally, the authors have now provided essential methodological details, including clear sample sizes and database versions, enhancing reproducibility.
 
 (5) Immunotherapy Response Evidence:
 
 Conclusions regarding differences in immunotherapy response rates between patient subgroups, although intriguing, remain based on retrospective database analyses with relatively limited demographic and clinical detail. Future prospective studies or more detailed patient characterization would be required to robustly confirm these associations.
 
 (6) Interpretation of ICD Gene Signatures:
 
 The ICD-related gene set includes many genes broadly associated with immune activation rather than specifically ICD. Although this was addressed by the authors, clearly distinguishing ICD-specific versus general immune-response genes in future studies would help clarify biological implications.
 
 Summary and Recommendations for Readers:
 
 Overall, this study presents an interesting and clinically relevant risk-scoring approach to stratify bladder cancer patients based on ICD-related gene expression profiles. It provides useful information about prognosis, immune infiltration, and potential immunotherapy responsiveness. However, readers should interpret the results within the context of its limitations, notably the need for broader validation and careful consideration of the biological significance underlying the observed associations. This work lays a valuable foundation for further investigation into the integration of ICD and immune response signatures in personalized cancer therapy.
 
 Review 1
3. Public_Reviews 31 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Recommendations for the authors):
 
 Thank you for your thorough review of our manuscript and your valuable suggestions. Here are our responses to each point you raised:
 
 (1) Novelty: Exploring the feasibility of extending the risk-scoring model to diverse cancer types could emphasize the broader impact of the research.
 
 Thank you so much for your thoughtful and insightful feedback. Your suggestion to explore extending the risk-scoring model to diverse cancer types is truly valuable and demonstrates your broad vision in this field. We deeply appreciate your interest in our research and the effort you put into providing such constructive input.
 
 After careful consideration, we have decided to focus our current study on the specific cancer type(s) we initially set out to explore. This decision was made to ensure that we can thoroughly address the research questions at hand, given our current resources, time constraints, and the complexity of the topic. By maintaining this focused approach, we aim to achieve more in-depth and reliable results that can contribute meaningfully to the understanding of this particular area.
 
 However, we fully recognize the potential significance of your proposed direction and firmly believe that it could be an excellent avenue for future research. We will definitely keep your suggestion in mind and may explore it in subsequent studies as our research progresses and evolves.
 
 (2) Improvement in Figure Presentation: The inconsistency in font formatting across figures, particularly in Figure 2 (A-D, E, F-H, I), Figure 3 (A-C, D-J, H, K), and the distinct style change in Figure 5, raises concerns about the professionalism of the visual presentation. It is recommended to standardize font sizes and styles for a more cohesive and visually appealing layout. This ensures that readers can easily follow and comprehend the graphical data presented in the article.
 
 The text in the picture has been revised as requested.
 
 (3) Enhancing Reliability of Immune Cell Infiltration Data: Address the potential limitations associated with relying solely on RNASeq data for immune cell infiltration analysis between ICD and ICD high groups in Figure 2. It is advisable to discuss the inherent challenges and potential biases in this methodology. To strengthen the evidence, consider incorporating bladder cancer single-cell sequencing data, which could provide a more comprehensive and reliable understanding of immune cell dynamics within the tumor microenvironment.
 
 Thank you very much for your meticulous review and the highly constructive suggestions. Your insight regarding the limitations of relying on RNASeq data for immune cell infiltration analysis and the proposal to incorporate bladder cancer single-cell sequencing data truly reflect your profound understanding of the field. We deeply appreciate your efforts in guiding our research and the valuable perspectives you've offered.
 
 After careful deliberation, given our current research scope, timeline, and available resources, we've decided to focus on further discussing and addressing the challenges and biases inherent in RNASeq-based immune cell infiltration analysis. By delving deeper into the methodological limitations and conducting more in-depth statistical validations, we aim to provide a comprehensive and reliable interpretation of the data within our study framework. This focused approach allows us to maintain the integrity of our original research design and deliver robust findings on the relationship between immune cell infiltration and ICD in the current context.
 
 However, we fully acknowledge the significant value of your proposed single-cell sequencing approach. It is indeed a powerful method that could offer more detailed insights into immune cell dynamics, and we believe it holds great promise for future research in this area. We will keep your suggestion in mind as an important direction for potential future studies, especially when we plan to expand and deepen our exploration of the tumor microenvironment.
 
 (4) Clarity in Data Sources and Interpretation of Figure 5: In the results section, provide a detailed and transparent explanation of the sources of data used in Figure 5. This includes specifying the databases or platforms from which the chemotherapy, targeted therapy, and immunotherapy data were obtained. Additionally, elucidate the rationale behind the chosen data sources and how they contribute to the overall interpretation of the study's findings. And, strangely, these immune-related genes are associated with cancer sensitivities to different targeted therapies.
 
 Thank you very much for your detailed and valuable feedback on Figure 5. We sincerely appreciate your careful review and insightful suggestions, which have provided us with important directions for improvement.
 
 Regarding the data sources in Figure 5, we used the pRRophetic algorithm to conduct a drug sensitivity analysis on the TCGA database. The reason for choosing these data sources is multi - faceted. Firstly, these databases and platforms are well - established and widely recognized in the field. They have strict data collection and verification processes, ensuring the accuracy and reliability of the data. For example, TCGA has a large - scale, long - term - accumulated chemotherapy case database, which can comprehensively reflect the clinical application and treatment effects of various chemotherapeutic drugs.
 
 Secondly, these data sources cover a wide range of cancer types and patient information, which can meet the requirements of our study's diverse sample size and variety. This comprehensiveness enables us to conduct a more in - depth and representative analysis of the relationships between different therapies and immune - related genes.
 
 In terms of the overall interpretation of the study's findings, the use of these data sources provides a solid foundation. The accurate chemotherapy, targeted therapy, and immunotherapy data help us clearly demonstrate the associations between immune - related genes and cancer sensitivities to different treatments. This allows us to draw more reliable conclusions and provides a scientific basis for understanding the complex mechanisms of cancer treatment from the perspective of immune - gene - therapy interactions.
 
 As for the unexpected association between immune - related genes and cancer sensitivities to different targeted therapies, this is indeed a fascinating discovery. In our analysis, we hypothesized that immune - related genes may affect the tumor microenvironment, thereby influencing the response of cancer cells to targeted therapies. Although this finding is currently beyond our initial expectations, it has opened up a new research direction for us. We will further explore and verify the underlying mechanisms in future research.
 
 Once again, thank you for your guidance. We will make corresponding revisions and improvements according to your suggestions to make our research more rigorous and complete.
 
 (5) Legends and Methods: Address the brevity and lack of crucial details in the figure legends and methods section. Expand the figure legends to include essential information, such as the number of samples represented in each figure. In the methods section, provide comprehensive details, including the release dates of databases used, versions of coding packages, and any other pertinent information that is crucial for the reproducibility and reliability of the study.
 
 We would like to express our sincere gratitude for your valuable feedback on the figure legends and methods section of our study. We highly appreciate your sharp observation of the issues regarding the brevity and lack of key details, which are crucial for further improving our research.
 
 We have supplemented the methods section with data including the number of samples, the release dates of the databases used, and the versions of the coding packages, etc. For TCGA samples: 421 tumor samples and 19 normal samples.Database release date: March 29, 2022, v36 versions.Coding package version: R version 4.1.1.We will immediately proceed to supplement these key details, making the research process and methods transparent. This will allow other researchers to reproduce our study more accurately and enhance the persuasiveness of our research conclusions.
 
 (6) Evidence Supporting Immunotherapy Response Rates: The importance of providing a robust foundation for the conclusion regarding lower immunotherapy response rates. Strengthen this section by offering a more detailed description of sample parameters, specifying patient demographics, and presenting any statistical measures that validate the observed trends in Figure 5Q-T. More survival data are required to conclude. Avoid overinterpretation of the results and emphasize the need for further investigation to solidify this aspect of the study.
 
 Thank you very much for your professional and meticulous feedback on the content related to immunotherapy response rates in our study! Your suggestions, such as providing a solid foundation for the conclusions and supplementing key information, are of great value in enhancing the quality of our research, and we sincerely appreciate them.
 
 The data in Figures 5Q to T are from the TCGA database, which has already been provided. The statistical measure used for Figures 5Q to T is the P-value, which has been marked in the figures. The survival data have been provided in Figure 3D.
 
 Reviewer #2 (Recommendations for the authors):
 
 Thank you for your thorough review of our manuscript and your valuable suggestions. Here are our responses to each point you raised:
 
 (1) There is no information on the samples studied. Are all TCGA bladder cancer samples studied? Are these samples all treatment naïve? Were any excluded? Even simply, how many samples were studied?
 
 Thank you so much for pointing out the lack of sample - related information. Your attention to these details has been extremely helpful in identifying areas for improvement in our study.
 
 All the samples in our study were sourced from the TCGA (The Cancer Genome Atlas) and TCIA (The Cancer Immunome Atlas) databases. It should be noted that the patient data in the TCIA database are originally from the TCGA database. Regarding whether the patients received prior treatment, this information was not specifically mentioned in our current report. Instead, we mainly relied on the scores of the prediction model for evaluation. Since all samples were obtained from publicly available databases, we understand the importance of clarifying their origin and characteristics.
 
 We sincerely apologize for the omission of the sample size and other relevant details. We will promptly supplement this crucial information in the revised version, including a detailed description of the sample sources and any relevant characteristics. This will ensure greater transparency and help readers better understand the basis of our research.
 
 For TCGA samples: 421 tumor samples and 19 normal samples.Database release date: March 29, 2022, v36 versions.Coding package version: R version 4.1.1.
 
 (2) What clustering method was used to divide patients into ICD high/low? The authors selected two clusters from their "unsupervised" clustering of samples with respect to the 34 gene signatures. A Delta area curve showing the relative change in area under the cumulative distribution function (CDF) for k clusters is omitted, but looking at the heatmap one could argue there are more than k=2 groups in that data. Why was k=2 chosen? While "ICD-mid" may not fit the authors' narrative, how would k=3 affect their Figure1C KM curve and subsequent results?
 
 Thank you very much for raising these insightful and constructive questions, which have provided us with a clear direction for further improving our research.
 
 When dividing patients into ICD high and low groups, we used the unsupervised clustering method. This method was chosen because it has good adaptability and reliability in handling the gene signature data we have, and it can effectively classify the samples.
 
 Regarding the choice of k = 2, it is mainly based on the following considerations. Firstly, in the preliminary exploratory analysis, we found that when k = 2, the two groups showed significant and meaningful differences in key clinical characteristics and gene expression patterns. These differences are closely related to the core issues of our study and help to clearly illustrate the distinctions between the ICD high and low groups. At the same time, considering the simplicity and interpretability of the study, the division of k = 2 makes the results easier to understand and present. Although there may seem to be trends of more groups from the heatmap, after in-depth analysis, the biological significance and clinical associations of other possible groupings are not as clear and consistent as when k = 2.
 
 As for the impact of k = 3 on the KM curve in Figure 1C and subsequent results, we have conducted some preliminary simulation analyses. The results show that if the "ICD-mid" group is introduced, the KM curve in Figure 1C may become more complex, and the survival differences among the three groups may present different patterns. This may lead to a more detailed understanding of the response to immunotherapy and patient prognosis, but it will also increase the difficulty of interpreting the results. Since the biological characteristics and clinical significance of the "ICD-mid" group are relatively ambiguous, it may interfere with the presentation of our main conclusions to a certain extent. Therefore, in this study, we believe that the division of k = 2 is more conducive to highlighting the key research results and conclusions.
 
 Thank you again for your valuable comments. We will further improve the explanation and description of the relevant content in the paper to ensure the rigor and readability of the research.
 
 (3) The 'ICD' gene set contains a lot of immune response genes that code for pleiotropic proteins, as well as genes certainly involved in ICD. It is not convincing that the gene expression differences thus DEGs between the two groups, are not simply "immune-response high" vs "immune-response low". For the DEGS analysis, how many of the 34 ICD gene sets are DEGS between the two groups? Of those, which markers of ICD are DEGs vs. those that are related to immune activation?
 
 a. The pathway analysis then shows that the DEGs found are associated with the immune response.
 
 b. Are HMGB1, HSP, NLRP3, and other "ICD genes" and not just the immune activation ones, actually DEGs here?
 
 c. Figures D, I-J are not legible in the manus.
 
 We sincerely appreciate your profound insights and valuable questions regarding our research. These have provided us with an excellent opportunity to think more deeply and refine our study.
 
 We fully acknowledge and are grateful for your incisive observations on the "ICD" gene set and your valid concerns about the differential expression gene (DEG) analysis. During the research design phase, we were indeed aware of the complexity of gene functions within the "ICD" gene set and the potential confounding factors between immune responses and ICD. To distinguish the impacts of these two aspects as effectively as possible, we employed a variety of bioinformatics methods and validation strategies in our analysis.
 
 Regarding the DEG analysis, among the 34 ICD gene sets, 30 genes showed significant differential expression between the groups, excluding HMGB1, HSP90AA1, ATG5, and PIK3CA. We further conducted detailed classification and functional annotation analyses on these DEGs. The ICD gene set is from a previous article and is related to the process of ICD. Relevant literature is in the materials section. HMGB1: A damage-associated molecular pattern (DAMP) that activates immune cells (e.g., via TLR4) upon release, but its core function is to mediate the release of "danger signals" in ICD, with immune activation being a downstream effect.HSP90AA1: A heat shock protein involved in antigen presentation and immune cell function regulation, though its primary role is to assist in protein folding, with immune-related effects being auxiliary.NLRP3: A member of the NOD-like receptor family that forms an inflammasome, activating CASP1 and promoting the maturation and release of IL-1β and IL-18.Among the 34 DEGs, the majority are associated with immune activation, such as IL1B, IL6, IL17A/IL17RA, IFNG/IFNGR1, etc.
 
 (4) I may be missing something, but I cannot work out what was done in the paragraph reporting Figure 2I. Where is the ICB data from? How has this been analysed? What is the cohort? Where are the methods?
 
 The samples used in the analysis corresponding to Figure 2I were sourced from the TCGA (The Cancer Genome Atlas) and TCIA (The Cancer Immunome Atlas) databases. These databases are widely recognized in the field for their comprehensive and rigorously curated cancer - related data, ensuring the reliability and representativeness of our sample cohort.
 
 Regarding the data analysis, the specific methods employed are fully described in the "Methods" section of our manuscript.
 
 (5) How were the four genes for your risk model selected? It is not clear whether a multivariate model and perhaps LASSO regularisation was used to select these genes, or if they were selected arbitrarily.
 
 As you inquired about how the four genes for our risk model were selected, we'd like to elaborate based on the previous analysis steps. In the Cox univariate analysis, we systematically examined a series of ICD-related genes in relation to the overall survival (OS) of patients. Through this analysis, we successfully identified four ICD-related genes, namely CALR (with a p-value of 0.003), IFNB1 (p = 0.037), IFNG (p = 0.022), and IF1R1 (p = 0.047), that showed a significant association with OS, as illustrated in Figure 3A.
 
 Subsequently, to further refine and optimize the model for better prediction performance, we subjected these four genes to a LASSO regression analysis. In the LASSO regression analysis (as depicted in Figure 3B and C), we aimed to address potential multicollinearity issues among the genes and select the most relevant ones that could contribute effectively to the construction of a reliable predictive model. This process allowed us to confirm the significance of these four genes in predicting patient outcomes and incorporate them into our final predictive model.
 
 (6) How related are the high-risk and ICD-high groups? It is not clear. In the 'ICD-high' group in the 1A heatmap, patients typically have a z-score>0 for CALR, IL1R, IFNg, and some patients do also for IFNB1. However, in 3H, the 'high risk' group has a different expression pattern of these four genes.
 
 Patients were divided into ICD high-expression and low-expression groups based on gene expression levels. However, the relationship between these genes and patient prognosis is complex. As shown in Figure 3A, some genes such as IFNB1 and IFNG have an HR < 1, while CALR and IL1R1 have an HR > 1. Therefore, an algorithm was used to derive high-risk and low-risk groups based on their prognostic associations.
 
 (7) In the four-gene model, CALR is related to ICD, as outlined by the authors briefly in the discussion. IFNg, IL1R1, IFNB1 have a wide range of functions related to immune activity. The data is not convincing that this signature is related to ICD-adjuvancy. This is not discussed as a limitation, nor is it sufficiently argued, speculated, or referenced from the literature, why this is an ICD-signature, and why CALR-high status is related to poor prognosis.
 
 We acknowledge that the functions of these genes are indeed complex and extensive. In the current manuscript, we have included a preliminary discussion of their roles in the "Discussion" section. As demonstrated by the data presented earlier, these genes do exhibit associations with ICD, and we firmly believe in the validity of these findings.
 
 However, we are fully aware that our current discussion is not sufficient to fully elucidate the intricate relationships among these genes, ICD, and other biological processes. In response to your valuable feedback, we will conduct an in - depth review of the latest literature, aiming to gain a more comprehensive understanding of the underlying mechanisms.
 
 (8) Score is spelt incorrectly in Figures 3F-J.
 
 Figures 3F-J have been revised as requested.
 
 (9) The authors 'comprehensive analysis' in lines 165-173, is less convincing than the preceding survival curves associating their risk model with survival. Their 'correlations' have no statistics.
 
 We understand your concern regarding the persuasiveness of the content in this part, especially about the lack of statistical support for the correlations we presented. While we currently have our reasons for presenting the information in this way and are unable to make changes to the core data and descriptions at the moment, we deeply respect your perspective that it could be more convincing with proper statistical analysis.
 
 (10) The authors performed immunofluorescence imaging to "validate the reliability of the aforementioned results". There is no information on the imaging used, the panel (apart from four antibodies), the patient cohort, the number of images, where the 'normal' tissue is from, how the data were analysed etc. This data is not interpretable without this information.
 
 a. Is CD39 in the panel? CD8, LAG3? It's not clear what this analysis is.
 
 The color of each antibody has been marked in Fig 2B. The cohort information and its source have been supplemented. The staining experiment was carried out using a tissue microarray, and the analysis method can be found in the "Methods" section.Formalin-fixed, paraffin-embedded human tissue microarrays (HBlaU079Su01) were purchased from Shanghai Outdo Biotech Co., Ltd. (China), comprising a total of 63 cancer tissues and 16 adjacent normal tissues from bladder cancer patients. Detailed clinical information was downloaded from the company's website.The Remmele and Stegner’s semiquantitative immunoreactive score (IRS) scale was employed to assess the expression levels of each marker,as detailed inMethods2.5.CD39, CD8, and LAG3 were also stained, but the results were not presented.
 
 (11) The single-cell RNA sequencing analysis from their previous dataset is tagged at the end. CALR expression in most identified cells is interesting. Not clear what this adds to the work beyond 'we did scRNA-seq'. How were these data analysed? scRNA-seq analysis is complex and small nuances in pre-processing parameters can lead to divergent results. The details of such analysis are required!
 
 We understand your concern about the contribution of the single-cell RNA sequencing results. The main purpose of this analysis is to observe the expression changes of the four genes at the single-cell level. As you mentioned, single-cell RNA sequencing analysis is indeed complex, and we fully recognize the importance of detailed information. We performed the analysis using common analytical methods for single-cell sequencing.It has been supplemented in the Methods section.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.01.24.577030v2
www.biorxiv.org www.biorxiv.org

Efficiency and localisation of AURKA degradation by PROTACs is modulated by deubiquitinases UCHL5 and target-selective OTUD6A

5
1. Public_Reviews 31 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study describes a genetic screen to identify deubiquitinases (DUBs) that counteract the activity of small-molecule degraders (PROTACs). The presented data are valuable, identifying OTUD6A and UCHL5 as DUBs that impact the efficacy and potency of PROTACs. While the conclusions are broadly supported and the methods employed are solid, the mechanistic depth and validation are incomplete. Overall, these findings merit further evaluation by the targeted protein degradation community when developing and optimizing PROTACs.
  
  Summary
2. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this study, the authors investigate the role of deubiquitinases (DUBs) in modulating the efficacy of PROTAC-mediated degradation of the cell-cycle kinase AURKA. Using a focused siRNA screen of 97 human DUBs, they identify UCHL5 and OTUD6A as negative regulators of AURKA degradation by PROTACs. They further offer a mechanistic explanation of enhanced AURKA degradation in the nucleus via OTUD6A expression being restricted to the cytosol, thereby protecting the cytoplasmic pool of AURKA. These findings provide important insight into how subcellular localization and DUB activity influence the efficiency of targeted protein degradation strategies, which could have implications for therapy.
  
  Strengths:
  
  (1) The manuscript is well-structured, with clearly defined objectives and well-supported conclusions.
  
  (2) The study employs a broad range of well-validated techniques - including live-cell imaging, proximity ligation assays, HiBiT reporter systems, and ubiquitin pulldowns - to dissect the regulation of PROTAC activity.
  
  (3) The authors use informative experimental controls, including assessment of cell-cycle progression effects, rescue experiments with siRNA-resistant constructs to confirm specificity, and the application of both AURKA-targeting PROTACs with different warheads and orthogonal degrader systems (e.g., dTAG-13 and dTAGv-1) to differentiate between target- and ligase-specific effects.
  
  (4) The identification of OTUD6A as a cytosol-restricted DUB that protects cytoplasmic but not nuclear AURKA is novel and may have therapeutic relevance for selectively targeting oncogenic nuclear AURKA pools.
  
  Weaknesses:
  
  (1) Although UCHL5 and OTUD6A are shown to limit AURKA degradation, direct physical interaction was not assessed.
  
  (2) Although the authors identify a correlation between DUB knockdown-induced cell cycle progression and enhanced PROTAC activity, only one DUB (USP36) is excluded on this basis. In addition, one DUB is shown in the correlation plot (Figure 3B) whose knockdown enhances PROTAC sensitivity without significantly altering cell cycle progression, but it is not identified/discussed.
  
  (3) While the authors suggest that combining PROTACs with DUB inhibition could enhance degradation, this was not experimentally tested.
  
  (4) The study identifies UCHL5 as a general antagonist of CRBN-recruiting PROTACs, yet the ubiquitin pulldown experiments (Figure 5G, H) show no change in AURKA ubiquitination upon UCHL5 knockdown. This raises questions about the precise step or mechanism by which UCHL5 exerts its protective effect.
  
  Review 1
3. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this study, the authors present a screening approach to identify deubiquitylases that may impact PROTAC efficacy/potency, specifically in this case using a previously reported AURKA PROTAC as an initial model. The authors claim that UCHL5 is able to control the level of degradation of both AURKA and dTAG when using CRBN-mediated PROTACs; however, VHL is not impacted by UCHL5 activity. They additionally claim that OTUD6A is able to control the extent of AURKA degradation in a target protein-specific manner and that this effect is specific to cytoplasm-located AURKA.
  
  Overall, whilst the endeavour is of interest and importance, we found that the claims made were overly generalised, the effects observed when knocking down the respective DUBs were very small, the systems used are highly artificial, and the data is not presented in a way that makes understanding absolute changes transparent.
  
  Strengths:
  
  The topic is of high interest and relevance and explores an underappreciated and understudied area of the PROTAC mechanism of action. If findings could be better supported, they would certainly bring value to the field.
  
  Weaknesses:
  
  The overall effects observed are sometimes limited in real terms. Even if statistically significant, the data presented does not fully support that changes in degradation due to UCHL5 activity represent changes of functional relevance. The data provided often omits the absolute changes in protein abundance observed. Data on endogenous/less engineered systems and/or with higher resolution read-outs would greatly strengthen some conclusions.
  
  Review 2
4. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  Cardno et al. "test the hypothesis that DUBs could oppose PROTAC-mediated degradation of cellular targets, using AURKA as a model target". A screen with a panel of siRNA that depleted 97 DUBs in the presence and absence of AURKA targeted PROTAC-D identified DUBs that regulated AURKA and those that affected the sensitivity of PROTAC-D. Validation studies with DUBs, UCHL5, and OTU6A yielded mixed results. UCHL5 not only affected PROTAC-mediated AURKA degradation but also affected CRBN-associated substrates, OTUD6A, more specifically, affected PROTAC-mediated AURKA degradation, and the effects of OTUD6A were associated with the localisation of AURKA. The findings are interesting; the impact of the findings would be strengthened if the key results are validated in one or more cancer cell lines that have not been modified.
  
  Review 3
5. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Author response:
  
  We therefore plan to make only a minor change to the manuscript to clarify a point raised by Reviewer 1: the DUB shown in the correlation plot in Fig 3B - whose knockdown enhances PROTAC sensitivity without significantly altering cell cycle progression - is BAP1. Since BAP1 subsequently showed no significant effect on endogenous AURKA levels (Fig 3E) it was excluded from further analysis.
  
  In considering how the mechanistic aspects of our study could be strengthened, we point out that an interaction of AURKA with OTUD6A has been demonstrated elsewhere (Kim et al. 2021). We also argue that an interaction of AURKA with UCHL5 would not be expected since UCHL5 is a proteasomal DUB shown to act on substrates recruited to the proteasome via capture of ubiquitin chains by the ubiquitin receptors of the proteasome lid. We agree that mechanistically we have not provided complete evidence for a direct deubiquitinating activity of UCHL5 on AURKA. We cannot explain why there is no change in AURKA ubiquitination upon UCHL5 knockdown in our ubiquitin pulldown experiment, but indeed there is considerable uncertainty in the scientific literature on the precise role of UCHL5 at the proteasome.
  
  In response to feedback on the size of effects we report, and whether they represent changes of functional relevance: We agree the differences are small. Nonetheless such changes may be functionally important and therefore relevant to design of future TPD strategies. Our previous characterization of PROTAC-D (Wang et al. 2021) provides evidence that differential degradation of subcellular pools can have functional relevance. We showed in our study that the lack of degradation of the centrosomal pool (even if this represents only a small fraction of the total pool) led to unexpected phenotypic consequences that were distinct from those observed upon treatment with ATP-competitive inhibitor or siRNA. Therefore we believe our specific finding of spatially restricted action of AURKA-selective OTUD6A to be of clear functional relevance to AURKA TPD strategies and of conceptual importance in establishing the paradigm of TPD modulation by DUBs.
  
  As Reviewer 1 notes, we do not directly test our hypothesis that combining PROTACs with DUB inhibition could enhance degradation. We would have done so had there been suitable small molecule inhibitors available for OTUD6A or UCHL5 at the time of our study. We plan a broader study of OTUD6A mechanisms and its role in PROTAC sensitivity in cancer cell lines, and appreciate Reviewer 3’s suggestion that the impact of our findings would be strengthened if key results were validated in one or more cancer cell lines. The scope of this new study means we plan to report it in a separate, future publication.
  
  AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.04.23.650020v1
www.biorxiv.org www.biorxiv.org

Concatenated Modular BK Channel Constructs Reveal Divergent Stoichiometry in Gating Control by LRRC26 (γ1), Pore, and Selectivity Filter

3
1. Public_Reviews 31 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  In this important contribution, Yan and colleagues describe a powerful and compelling strategy to generate concatamers of the BK channel and their fusion constructs with the auxiliary gamma subunits, which allows exploring contributions of individual subunits of the tetrameric channel to its gating and the study of heteromeric channel complexes of defined composition. Distinct examples are presented, which illustrate great diversity in the stoichiometric control of BK channel gating, depending on the site and nature of molecular perturbations. The molecular approaches could be extended to other membrane proteins whose N and C termini face opposite sides of the membrane.
  
  Summary
2. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  BK channels are widely distributed and involved in many physiological functions. They have also proven a highly useful tool for studying general allosteric mechanisms for gating and modulation by auxiliary subunits. Tetrameric BK channels are assembled from four separate alpha subunits, which would be identical for homozygous alleles and potentially of five different combinations for heterozygous alleles (Geng et al., 2023, https://doi.org/10.1085/jgp.202213302). Construction of BK channels with concatenated subunits in order to strictly control heteromeric subunit composition had not yet been used because the N-terminus in BK channels is extracellular, whereas the C-terminus is intracellular. In this new work, Chen, Li, and Yan devise clever methods to construct and assemble BK channels of known subunit composition, as well as to fix the number of γ1 axillary subunits per channel. With their novel molecular approaches, Chen, Li and Yan report that a single γ1 axillary subunit is sufficient to fully modulate a BK channel, that the deep conducting pore mutation L312A exhibited a graded effect on gating with each addition mutated subunit replacing a WT subunit in the channel adding an additional incremental left shift in activation, and that the V288A mutation at the selectivity filter must be present on all four alpha subunits in order to induce channel inactivation. Chen, Li, and Yan have been successful in introducing new molecular tools to generate BK channels of known stoichiometry and subunit composition. They validate their methods and provide three examples of their use with useful observations.
  
  Strengths:
  
  Powerful new molecular tools for the study of channel gating have been developed and validated in the study.
  
  Weaknesses:
  
  One example each of auxiliary, deep pore, and selectivity filter allosteric actions is presented, but this is sufficient for the purposes of the paper to establish their methods and present specific examples of applicability.
  
  Review 1
3. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This manuscript describes novel BK channel concatemers as a tool to study the stoichiometry of the gamma subunit and mutations in the modulation of the channel. Taking advantage of the modular design of the BK channel alpha subunit, the authors connected S1-S6/1st RCK as two- and four-subunit concatemers and coexpressed with S0-RCK2 to form normal function channels. These concatemers avoided the difficulty that the extracellular N-terminus of S0 was unable to connect with the cytosolic C-terminus of the gamma subunit, allowing a single gamma subunit to be connected to the concatemers. The concatemers also helped reveal the required stoichiometry of mutant BK subunits in modulating channel function. These include L312A in the deep pore region that altered channel function additively with each additional subunit harboring the mutation, and V288A at the selectivity filter that altered channel function cooperatively only when all four subunits were mutated. These results demonstrate that the concatemers are robust and effective in studying BK channel function and molecular mechanisms related to stoichiometry. The different requirement of the gamma subunit and the mutations stoichiometry for altering channel function is interesting, which may relate to the fundamental mechanism of how different motifs of the channel protein control function.
  
  Strengths:
  
  The manuscript presents well-designed experiments with high-quality data, which convincingly demonstrate the BK channel concatemers and their utility. The results are clearly presented.
  
  Weaknesses:
  
  This reviewer did not identify any major concerns with the manuscript.
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.06.26.546634v2
www.biorxiv.org www.biorxiv.org

Chromosome-scale genome assembly of the European common cuttlefish Sepia officinalis

4
1. Public_Reviews 31 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This manuscript reports a high-quality genome assembly of the European cuttlefish, Sepia officinalis, a representative species of the Cephalopod lineage. The data are based on current best practices for sequencing and genome assembly, including PacBio HiFi long reads and Hi-C chromatin conformation capture; the analysis is currently in parts incomplete, as further analyses are required to confirm the correct chromosome number. This genome will be a useful resource for the community of researchers interested in cuttlefish biology and comparative genomics in general.
  
  Summary
2. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This manuscript presents a high-quality, chromosome-level genome assembly of the European cuttlefish (Sepia officinalis), a representative species of the cephalopod lineage. Using state-of-the-art sequencing and scaffolding technologies -including PacBio HiFi long reads and Hi-C chromatin conformation capture - the authors deliver a genome assembly with exceptional contiguity and completeness, as evidenced by high BUSCO scores. This genome resource fills a significant gap in cephalopod genomics and offers a valuable foundation for studies in neurobiology, behavior, and evolutionary biology. However, there are several major aspects that need to be strengthened.
  
  Major Revisions Recommended:
  
  (1) Single-individual genome limitation
  
  The genome assembly is based on a single individual, which appears to be male. While this approach is common in genome projects, it does not capture the full genetic diversity of the species. As S. officinalis exhibits a wide geographical range and possible population structure, future efforts (or discussion in this manuscript) should consider re-sequencing multiple individuals - of both sexes and from diverse geographic origins - to characterize population-level variation, sex-linked features, and structural polymorphisms.
  
  (2) Limited experimental validation of chromosomal inferences
  
  The study reports chromosome-scale scaffolding using Hi-C data and proposes a revised karyotype for S. officinalis. However, these inferences would be significantly strengthened by orthogonal validation methods. In particular, fluorescence in situ hybridization (FISH) or karyotyping from cytogenetic preparations would provide direct confirmation of chromosome number and structural arrangements. The reliance solely on Hi-C contact maps for inferring chromosomal organization should be acknowledged as a limitation or supplemented with such validations.
  
  (3) Shallow discussion of chromosomal evolution
  
  The manuscript briefly mentions chromosomal number differences among cephalopods but does not explore their evolutionary or functional implications. A more thorough comparative analysis - linking chromosomal rearrangements (e.g., fusions, fissions) with ecological adaptation, life history, or neural complexity - would greatly enhance the impact of the findings. Referencing chromosomal dynamics in related taxa and possible links to behavioral innovations would contextualize these results more effectively.
  
  (4) Underdeveloped gene family and pathway analysis
  
  While the authors identify expansions in gene families such as protocadherins and C2H2 zinc finger transcription factors, the functional significance of these expansions remains speculative. The manuscript would benefit from:
  
  a) Functional enrichment analyses (e.g., GO, KEGG) targeting these gene families.
  
  b) Expression profiling across tissues or developmental stages to infer regulatory roles.
  
  c) Comparison with expression or expansion patterns in other cephalopods with known behavioral complexity (e.g., Octopus bimaculoides, Euprymna scolopes).
  
  d) Potential integration of transcriptomic or epigenomic data to support regulatory hypotheses.
  
  Review 1
3. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This paper concerns an interesting organism, Sepia officinalis. However, in the opinion of this reviewer, the paper reads somewhat like a genome report. The authors have used 23x PacBio HiFi in conjunction with relatively low coverage (11x) Hi-C to scaffold the genome into a karyotype of 47 chromosomes. They have used a combination of short and long read RNA seq to annotate the genome in what looks like a very good annotation. The paper offers basic analyses of the Busco evaluation, some descriptive analyses of gene family and repeat content, and a bit more focused analysis on synteny among sequenced squids. Generally, the data will be useful.
  
  Strengths:
  
  This is a high-quality annotation, and the data ultimately will be useful to other researchers. I appreciate trying to understand what's happening between assemblies of S. officinalis.
  
  Weaknesses:
  
  I don't believe the data at hand makes a strong case for the argument of 47 chromosomes. This is my biggest sticking point with the paper, and it is for a few reasons:
  
  (1) The authors point to assembly differences between the DToL assembly and the one presented in the manuscript and seem to claim that DToL is incorrect. However, the DToL assembly (xcSepOffi3.1) is based on much deeper HiFi and HiC coverage than the one at hand (51x and 80+x respectively). There are many things to try here, including:
  
  a) Downloading the DToL data and reassembling using a common pipeline.
  
  b) Downsampling the DToL data to similar coverage as what the authors have achieved.
  
  c) Combining your data and that of DToL for even deeper coverage (heterozygosity is low enough that I don't imagine this impeding things too badly).
  
  (2) Looking at Figure 1, there appears to be a misjoin at chromosome 42. Looking carefully at Figure S1, that misjoin does not appear on any of the panels - this is confusing. Given the size of that chromosome and the authors' chromosome numbering, I'm guessing this is a manual merge (as it's larger than most of the chromosomes numerically close (40, 41, 43, etc). Further, staring closely at Figure 1, there appear to be cross-scaffold contacts between 42 and 43 and 42 and 44. Secondarily there are contacts between 43 and 44. This bit of the assembly seems potentially problematic.
  
  Review 2
4. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  In this study, authors Simone Rencken and co-authors present and investigate the genome of the common cuttlefish Sepia officinalis.
  
  Strengths:
  
  The authors explain in a detailed yet concise manner the main steps for a genome assembly, with very robust methods for validation, and according to current best practices. In addition to the chromosomal assembly, the authors confirmed the presence of 47 chromosomes using Hi-C data and multiple species synteny. They also generated a comprehensive gene annotation, with assessments of gene completeness, providing a useful resource for the community of researchers interested in cuttlefish biology and comparative genomics.
  
  Weaknesses:
  
  While the study touches upon the subjects of gene content, TE activity, or species-level comparisons, the study does not provide in-depth investigations of these.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.04.22.649952v1
www.biorxiv.org www.biorxiv.org

General Trends in the Calnexin-Dependent Expression and Pharmacological Rescue of Clinical CFTR Variants

3
1. Public_Reviews 31 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This important study systematically investigates the effects of calnexin, an endoplasmic reticulum chaperone, on the drug response of approximately 230 disease-causing variants of the cystic fibrosis transmembrane conductance regulator (CFTR) protein. Through deep mutational scanning, interactome profiling, and functional assays, the findings provide convincing evidence that calnexin significantly influences both CFTR expression and the efficacy of corrector drugs in a variant-specific manner. These insights advance our understanding of how cellular quality control machinery shapes the pharmacological responsiveness of CFTR variants, which are broadly relevant for researchers in protein folding and genetic disease therapeutics.
  
  Summary
2. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This research investigates how the cellular protein quality control machinery influences the effectiveness of cystic fibrosis (CF) treatments across different genetic variants. CF is caused by mutations in the CFTR gene, with over 1,700 known disease-causing variants that primarily work through protein misfolding mechanisms. While corrector drugs like those in Trikafta therapy can stabilize some misfolded CFTR proteins, the reasons why certain variants respond to treatment while others don't remain unclear. The authors hypothesized that the cellular proteostasis network-the machinery that manages protein folding and quality control-plays a crucial role in determining drug responsiveness across different CFTR variants. The researchers focused on calnexin (CANX), a key chaperone protein that recognizes misfolded glycosylated proteins. Using CRISPR-Cas9 gene editing combined with deep mutational scanning, they systematically analyzed how CANX affects the expression and corrector drug response of 234 clinically relevant CF variants in HEK293 cells.
  
  In terms of findings, this study revealed that CANX is generally required for robust plasma membrane expression of CFTR proteins, and CANX disproportionately affects variants with mutations in the C-terminal domains of CFTR and modulates later stages of protein assembly. Without CANX, many variants that would normally respond to corrector drugs lose their therapeutic responsiveness. Furthermore, loss of CANX caused broad changes in how CF variants interact with other cellular proteins, though these effects were largely separate from changes in CFTR channel activity.
  
  This study has some limitations: the research was conducted in HEK293 cells rather than lung epithelial cells, which may not fully reflect the physiological context of CF. Additionally, the study only examined known disease-causing variants and used methodological approaches that could potentially introduce bias in the data analysis.
  
  How cellular quality control mechanisms influence the therapeutic landscape of genetic diseases is an emerging field. Overall, this work provides important cellular context for understanding CF mutation severity and suggests that the proteostasis network significantly shapes how different CFTR variants respond to corrector therapies. The findings could pave the way for more personalized CF treatments tailored to patients' specific genetic variants and cellular contexts.
  
  Strengths:
  
  (1) This work makes an important contribution to the field of variant effect prediction by advancing our understanding of how genetic variants impact protein function.
  
  (2) The study provides valuable cellular context for CFTR mutation severity, which may pave the way for improved CFTR therapies that are customized to patient-specific cellular contexts.
  
  (3) The research provides further insight into the biological mechanisms underlying approved CFTR therapies, enhancing our understanding of how these treatments work.
  
  (4) The authors conducted a comprehensive and quantitative analysis, and they made their raw and processed data as well as analysis scripts publicly available, enabling closer examination and validation by the broader scientific community.
  
  Weaknesses:
  
  (1) The study only considers known disease-causing variants, which limits the scope of findings and may miss important insights from variants of uncertain significance.
  
  (2) The cellular context of HEK293 cells is quite removed from lung epithelia, the primary tissue affected in cystic fibrosis, potentially limiting the clinical relevance of the findings.
  
  (3) Methodological choices, such as the expansion of sorted cell populations before genetic analysis, may introduce possible skew or bias in the data that could affect interpretation.
  
  (4) While the impact on surface trafficking is convincingly demonstrated, how cellular proteostasis affects CFTR function requires further study, likely within a lung-specific cellular context to be more clinically relevant.
  
  Review 1
3. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  In this work, the authors use deep mutational scanning (DMS) to examine the effect of the endogenous chaperone calnexin (CANX) on the plasma membrane expression (PME) and potential pharmacological stabilization cystic fibrosis disease variants. This is important because there are over 1,700 loss-of-function mutations that can lead to the disease Cystic Fibrosis (CF), and some of these variants can be pharmacologically rescued by small-molecule "correctors," which stabilize the CFTR protein and prevent its degradation. This study expands on previous work to specifically identify which mutations affect sensitivity to CFTR modulators, and further develops the work by examining the effect of a known CFTR interactor-CANX-on PME and corrector response.
  
  Overall, this approach provides a useful atlas of CF variants and their downstream effects, both at a basal level as well as in the context of a perturbed proteostasis. Knockout of CANX leads to an overall reduced plasma membrane expression of CFTR with CF variants located at the C-terminal domains of CFTR, which seem to be more affected than the others. This study then repeats their DMS approach, using PME as a readout, to probe the effect of either VX-445 or VX-455 + VX-661-which are two clinically relevant CFTR pharmacological modulators. I found this section particularly interesting for the community because the exact molecular features that confer drug resistance/sensitivity are not clear. When CANX is knocked out, cells that normally respond to VX-445 are no longer able to be rescued, and the DMS data show that these non-responders are CF variants that lie in the VX-445 binding site. Based on computational data, the authors speculate that NBD2 assembly is compromised, but that remains to be experimentally examined. Cells lacking CANX were also resistant to combinatorial treatment of VX-445 + VX-661, showing that these two correctors were unable to compensate for the lack of this critical chaperone.
  
  One major strength of this manuscript is the mass spectrometry data, in which 4 CF variants were profiled in parental and CANX KO cells. This analysis provides some explanatory power to the observation that the delF508 variant is resistant to correctors in CANX KO cells, which is because correctors were found not to affect protein degradation interactions in this context. Findings such as this provide potential insights into intriguing new hypothesis, such as whether addition of an additional proteostasis regulators, such as a proteosome inhibitor, would facilitate a successful rescue. Taken together, the data provided can be generative to researchers in the field and may be useful in rationalizing some of the observed phenotypes conferred by the various CF variants, as well as the impact of CANX on those effects.
  
  To complete their analysis of CF variants in CANX KO cells, the research also attempted to relate their data, primarily based on PME, to functional relevance. They observed that, although CANX KO results in a large reduction in PME (~30% reduction), changes in the actual activation of CFTR (and resultant quenching of their hYFP sensor) were "quite modest." This is an important experiment and caveat to the PME data presented above since changes in CFTR activity does not strictly require changes in PME. In addition, small molecule correctors also do not drastically alter CFTR function in the context of CANX KO. The authors reason that this difference is due to a sort of compensatory mechanism in which the functionally active CFTR molecules that are successfully assembled in an unbalanced proteostasis system (CANX KO) are more active than those that are assembled with the assistance of CANX. While I generally agree with this statement, it is not directly tested and would be challenging to actually test.
  
  The selected model for all the above experiments was HEK293T cells. The authors then demonstrate some of their major findings in Fischer rat thyroid cell monolayers. Specifically, cells lacking CANX are less sensitive to rescue by CFTR modulators than the WT. This highlights the importance of CANX in supporting the maturation of CFTR and the dependence of chemical correctors on the chaperone. Although this is demonstrated specifically for CANX in this manuscript, I imagine a more general claim can be made that chemical correctors depend on a functional/balanced proteostasis system, which is supported by the manuscript data. I am surprised by the discordance between HEK293T PME levels compared to the CTFR activity. The authors offer a reasonable explanation about the increase in specific activity of the mature CFTR protein following CANX loss.
  
  For the conclusions and claims relevant to CANX and CF variant surveying of PME/function, I find the manuscript to provide solid evidence to achieve this aim. The manuscript generates a rich portrait of the influence of CF mutations both in WT and CANX KO cells. While the focus of this study is a specific chaperone, CANX, this manuscript has the potential to impact many researchers in the broad field of proteostasis.
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.04.03.647093v1
www.biorxiv.org www.biorxiv.org

Structural evolution of nitrogenase enzymes over geologic time

4
1. Public_Reviews 31 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This valuable study presents computational analyses of over 5,000 predicted extant and ancestral nitrogenase structures. The data analyses are convincing, it offers unique insights into the relationship between structural evolution and environmental and biological phenotypes. The data generated in this study provide a vast resource that can serve as a starting point for studies of reconstructed and extant nitrogenases.
  
  Summary
2. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  This was a clearly written manuscript that did an excellent job summarizing complex data. In this manuscript, Cuevas-Zuviría et al. use protein modeling to generate over 5,000 predicted structures of nitrogenase components, encompassing both extant and ancestral forms across different clades. The study highlights that key insertions define the various Nif groups. The authors also examined the structures of three ancestral nitrogenase variants that had been previously identified and experimentally tested. These ancestral forms were shown in earlier studies to exhibit reduced activity in Azotobacter vinelandii, a model diazotroph.
  
  This work provides a useful resource for studying nitrogenase evolution. However, its impact is somewhat limited due to a lack of evidence linking the observed structural differences to functional changes. For example, in the ancestral nitrogenase structures, only a small set of residues (lines 421-431) were identified as potentially affecting interactions between nitrogenase components. Why didn't the authors test whether reverting these residues to their extant counterparts could improve nitrogenase activity of the ancestral variants?
  
  Additionally, the paper feels somewhat disconnected. The predicted nitrogenase structures discussed in the first half of the manuscript were not well integrated with the findings from the ancestral structures. For instance, do the ancestral nitrogenase structures align with the predicted models? This comparison was never explicitly made and could have strengthened the study's conclusions.
  
  Comments on revisions:
  
  I appreciate the authors responding to my comments. I think Fig. S10 helps put the structural data into more context. It would be helpful to make clearer in the legend what proteins are being compared, especially in 10C.
  
  Although I can see why the authors focus on the NifK extension and its potential connection to oxygen protection, I would point out that Vnf and Anf do not have this extension in their K subunit, and you find both Vnf and Anf in aerobic and facultative anaerobic diazotrophs. This is a minor point, but I think it is important to mention in the discussion.
  
  Review 1
3. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This work aims to study the evolution of nitrogenanses, understanding how their structure and function adapted to changes in environment, including oxygen levels and changes in metal availability.
  
  The study predicts > 3000 structures of nitrogenases, corresponding to extant, ancestral and alternative ancestral sequences. It is observed that structural variations in the nitrogenases correlate with phylogenetic relationships. The amount of data generated in this study represents a massive and admirable undertaking. The study also provides strong insight into how structural evolution correlates with environmental and biological phenotypes
  
  Review 2
4. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Recommendations for the authors):
  
  Line 122: There were a number of qualitative descriptors in the paper. For instance, if the authors want to say massive campaign, how massive? How rapid? These are relative terms in this context.
  
  We have revised the text to minimize qualitative descriptors and to provide concrete numbers where possible. The revised sentence (line 121) now reads “We began our structural investigation of nitrogenase evolutionary history by conducting on a large-scale structure prediction analysis of 5378 protein structures, a more than threefold increase compared to available nitrogenase structures in the PDB. We then analyzed our phylogenetic dataset to identify notable structural changes.”
  
  Line 179: "massively scale up" How massive?
  
  We agree with the reviewer’s observation, in response, we have removed the phrase “massively scale up” and revised the text.
  
  Line 182: "no compromise on alignment depth and negligible cost to prediction accuracy". How do you know this? Is this shown somewhere? Was there a comparison between known structures and the predicted structure for those nitrogenases that have structures?
  
  In response to this comment, we have made several clarifications and revisions in the manuscript:
  
  We modified Figure S1, which now shows the pLDDT (per-residue confidence metric from Alphafold) values of all our predictions. These scores are consistently high (over 90 for the D and K subunits, and approximetly 90 for the H subunits) regardless of whether the recycling protocol or the bona-fide protocol was used.
  
  The reviewer’s comment demonstrated to us that the Figure S1 needed to more clearly representing these values, we therefore updated it accordingly.
  
  To prevent any misinterpretation of our claims about the accuracy and cost of the method , we have revised the text at line 179, as follows:
  
  “In total, 2,689 unique extant and ancestral nitrogenase variants were targeted. All structures were generated in approximately 805 hours, including GPU computations and MMseqs2 alignments performed using two different protocols: one for extant or most likely ancestral sequences, and another for ancestral variants.”
  
  To support our analyses further, Figure S10A compares our model predictions with available PDB structures for nitrogenases.
  
  Additionally, Figure S10B compare our predicted structures with the experimental structures reported in this article. In all cases, we observe low RMSD values.
  
  Line 220: "fall within 2 angstroms" instead of "fall 2A"?
  
  We have updated it in the text.
  
  Line 315: It is not clear how the binding affinities and other measurements in Figure 4 and S6C were measured, and it is not discussed in the material and methods.
  
  We thank the reviewer for pointing out this lack of clarity. The binding affinity estimations were performed using Prodigy. We have updated the main text (see line 322) to explicitly state that binding affinities were estimated using Prodigy. In addition, we have expanded the Materials and Methods section to include additional information about the structure characterization methods (lines 745-749). Previously, these details were only noted in Supplementary Table S6.
  
  Line 510-511: "Subtle, modular structural adjustments away from the active site were key to the evolution and persistence of nitrogenases over geologic time". This seems like a bit of an overstatement. While the authors see structural differences in the ancestral nitrogenase and speculate these differences could be involved in oxygen protection, there is no evidence that the ancestral nitrogenase is more sensitive to oxygen than the extant nitrogenase.
  
  We appreciate the reviewer’s comment. Our intention was to emphasize that subtle, modular structural adjustments might have contributed to oxygen protection rather than to assert that ancestral nitrogenases are more oxygen-sensitive than their extant counterparts. We have revised the text to clarify.
  
  Reviewer #2 (Recommendations for the authors):
  
  What is the reference for the measured RMSDs in Fig 2A? What is the value on the y-axis? The range of 'Count' is unclear, given that there are 5000 structures predicted in the study.
  
  Figure 2A presents a histogram of RMSD values from all pairwise alignments among 769 structures (385 extant and 384 ancestral DDKK), totaling 591,361 comparisons. We excluded ancestral DDKK variants due to computational limitations.
  
  Similarly, what is the sequence identity in Figure 2B calculated relative to?
  
  In Figure 2B, sequence identities are derived from pairwise comparisons across all structures in our dataset. Each value represents the identity between two specific structures, rather than being measured against a single reference.
  
  The claim that 'structural analysis could reproduce sequence-based phylogenetic variation' should probably be tempered or qualified, given that the RMSD differences calculated are so low.
  
  We hope to have addressed the concerns about the low RMSD values in the previous comments. We have revised the text (line 204), which now reads: “it still strongly correlates with sequence identity (Figure 2B), indicating that even minor structural variations can recapitulate sequence-based phylogenetic distinctions.”
  
  How are binding affinities (Figure 4) calculated?
  
  We have now clarified the binding affinity calculations in the main text. The model used is now detailed at line 322, with additional information provided in the Methods section.
  
  Presumably, crystallized proteins (Anc1A, Anc1B, Anc2) were also among those whose structures were predicted with AF. A comparison should be provided of the predicted and crystallized structures, as this is an excellent opportunity to further comment on the reliability of AlphaFold.
  
  In the revised manuscript, Figure S10 now present structural comparisons between the crystallized proteins and their AlphaFold-predicted counterparts.
  
  The labels in Figure 5B are not clear. Are the 3rd and 4th panels also comparative RMSD values? But only one complex name is provided.
  
  We appreciate this feedback and now revised the Figure 5B for clarity.
  
  Page 9 line 220, missing word: 'varaints fall within/under 2angstroms'
  
  We thank the reviewer for the correction, we have updated the text.
  
  AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.11.18.623660v2
www.biorxiv.org www.biorxiv.org

Molecular architecture of thylakoid membranes within intact spinach chloroplasts

3
1. Public_Reviews 31 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  The macromolecular organization of photosynthetic complexes within the thylakoids of higher plant chloroplasts has been a topic of significant debate. Using in situ cryo-electron tomography, this study reveals the native thylakoid architecture of spinach thylakoid membranes with single-molecule precision. The experimental methods are unique and compelling, providing important information for understanding the structural features that impact photosynthetic regulation in vascular plants and addressing several long-standing questions about the organization and regulation of photosynthesis.
  
  Summary
2. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this study, the authors utilized in situ cryo-electron tomography (cryo-ET) to uncover the native thylakoid architecture of spinach chloroplasts and mapped the molecular organization of these thylakoids with single-molecule resolution. The obtained images show the detailed ultrastructural features of grana membranes and highlight interactions between thylakoids and plastoglobules. Interestingly, despite the distinct three-dimensional architecture of vascular plant thylakoids, their molecular organization closely resembles that of green algae. The pronounced lateral segregation of PSII and PSI was observed at the interface between appressed and non-appressed thylakoid regions, without evidence of a specialized grana margin zone where these complexes might intermix. Furthermore, unlike isolated thylakoid membranes, photosystem II (PSII) did not form a semi-crystalline array and was distributed uniformly within the membrane plane and across stacked grana membranes in intact chloroplasts. Based on the above observations, the authors propose a simplified two-domain model for the molecular organization of thylakoid membranes, which can be applied to both green algae and vascular plants. This study suggests that the general understanding of the functional separation of thylakoid membranes in vascular plants requires reconsideration.
  
  Strengths:
  
  By employing and refining AI-driven computational tools for the automated segmentation of membranes and identification of membrane proteins, this study successfully quantifies the spatial organization of photosynthetic complexes both within individual thylakoid membranes and across neighboring stacked membranes.
  
  Weaknesses:
  
  This study's weakness is that it requires the use of chloroplasts isolated from leaves and the need to freeze them on a grid for observation. However, the authors have correctly identified the limitations of this approach and have made some innovations, such as rapid sample preparation. The reliability of the interpretation of the results in light of previous results can be evaluated as high.
  
  Comments on revised version:
  
  The author has responded appropriately to the peer review comments and revised the paper.
  
  Review 1
3. Public_Reviews 31 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  For decades, the macromolecular organization of photosynthetic complexes within the thylakoids of higher plant chloroplasts has been a topic of significant debate. Using focused ion beam milling, cryo-electron tomography, and advanced AI-based image analysis, the authors compellingly demonstrate that the macromolecular organization in spinach thylakoids closely mirrors the patterns observed in their earlier research on Chlamydomonas reinhardtii. Their findings provide strong evidence challenging long-standing assumptions about the existence of a 'grana margin'-a region at the interface between grana and stroma lamellae domains that was thought to contain intermixed particles from both areas. Instead, the study establishes that this mixed zone is absent and reveals a distinct, well-defined boundary between the grana and stroma lamellae.
  
  Strengths:
  
  By situating high-resolution structural data within the broader cellular context, this work contributes valuable insights into the molecular mechanisms governing the spatial organization of photosynthetic complexes within thylakoid membranes.
  
  Comments on revised version:
  
  All reviewer comments have been fully addressed, and I have no further comments.
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.11.24.625035v2
www.medrxiv.org www.medrxiv.org

Forecasting the spatial spread of an Ebola epidemic in real-time: comparing predictions of mathematical models and experts

4
1. Public_Reviews 30 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This manuscript provides valuable evidence comparing the performance of mathematical models and opinions from experts engaged in outbreak response in forecasting the spatial spread of an Ebola epidemic. The evidence supporting the conclusions is convincing. It will be of interest to disease modellers, infectious disease epidemiologists, policy-makers, and those who need to inform policy-makers during an outbreak.
  
  Summary
2. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Munday, Rosello, and colleagues compared predictions from a group of experts in epidemiology with predictions from two mathematical models on the question of how many Ebola cases would be reported in different geographical zones over the next month. Their study ran from November 2019 to March 2020 during the Ebola virus outbreak in Democratic Republic of the Congo. Their key result concerned predicted numbers of cases in a defined set of zones. They found that neither the ensemble of models nor the group of experts produced consistently better predictions. Similarly, neither model performed consistently better than the other, and no expert's predictions were consistently better than the others'. Experts were also able to specify other zones in which they expected to see cases in the next month. For this part of the analysis, experts consistently outperformed the models. In March, the final month of the analysis, the models' accuracy was lower than in other months, and consistently poorer than the experts' predictions.
  
  A strength of the analysis is use of consistent methodology to elicit predictions from experts during an outbreak that can be compared to observations, and that are comparable to predictions from the models. Results were elicited for a specified group of zones, and experts were also able to suggest other zones that were expected to have diagnosed cases. This likely replicates the type of advice being sought by policymakers during an outbreak.
  
  A potential weakness is that the authors included only two models in their ensemble. Ensembles of greater numbers of models might tend to produce better predictions. The authors do not address whether a greater number of models could outperform the experts.
  
  The elicitation was performed in four months near the end of the outbreak. The authors address some of the implications of this. A potential challenge for the transferability of this result is that the experts' understanding of local idiosyncrasies in transmission may have improved over the course of the outbreak. The model did not have this improvement over time. The comparison of models to experts may therefore not be applicable to early stages of an outbreak when expert opinions may be less well-tuned.
  
  This research has important implications for both researchers and policy-makers. Mathematical models produce clearly-described predictions that will later be compared to observed outcomes. When model predictions differ greatly from observations, this harms trust in the models, but alternative forms of prediction are seldom so clearly articulated or accurately assessed. If models are discredited without proper assessment of alternatives then we risk losing a valuable source of information that can help guide public health responses. From an academic perspective, this research can help to guide methods for combining expert opinion with model outputs, such as considering how experts can inform models' prior distributions and how model outputs can inform experts' opinions.
  
  Comments on revisions:
  
  I am grateful to the authors for their responses to my previous comments. I think their updates have made the paper much clearer. I do not think the updates change the opinions already given in the public review so I have not modified it.
  
  Review 1
3. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  The manuscript by Munday et al. presents real-time predictions of geographic spread during an Ebola epidemic in north-eastern DRC. Predictions were elicited from individual experts engaged in outbreak response and from two mathematical models. The authors found comparable performance between experts and models overall, although the models outperformed experts in a few dimensions.
  
  Both individual experts and mathematical models are commonly used to support outbreak response, but the relative strengths of each information source are rarely quantified. The manuscript presents an in-depth analysis of the accuracy and decision-relevance of the information provided by each source individually and in combination for a real-time outbreak response effort.
  
  While this paper presents an important and unique comparison, forecast performance is known to be inconsistent and unpredictable across many dimensions such as pathogen, location, forecasting target, and phase of the outbreak. Thus, as the authors note, continuing to replicate such studies will be important for verifying the robustness of their conclusions in other contexts.
  
  Comments on revisions:
  
  I have no further comments. I commend the authors for an interesting and important contribution.
  
  Review 2
4. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public review):
  
  Munday, Rosello, and colleagues compared predictions from a group of experts in epidemiology with predictions from two mathematical models on the question of how many Ebola cases would be reported in different geographical zones over the next month. Their study ran from November 2019 to March 2020 during the Ebola virus outbreak in the Democratic Republic of the Congo. Their key result concerned predicted numbers of cases in a defined set of zones. They found that neither the ensemble of models nor the group of experts produced consistently better predictions. Similarly, neither model performed consistently better than the other, and no expert's predictions were consistently better than the others. Experts were also able to specify other zones in which they expected to see cases in the next month. For this part of the analysis, experts consistently outperformed the models. In March, the final month of the analysis, the models' accuracy was lower than in other months and consistently poorer than the experts' predictions.
  
  A strength of the analysis is the use of consistent methodology to elicit predictions from experts during an outbreak that can be compared to observations, and that are comparable to predictions from the models. Results were elicited for a specified group of zones, and experts were also able to suggest other zones that were expected to have diagnosed cases. This likely replicates the type of advice being sought by policymakers during an outbreak.
  
  A potential weakness is that the authors included only two models in their ensemble. Ensembles of greater numbers of models might tend to produce better predictions. The authors do not address whether a greater number of models could outperform the experts.
  
  The elicitation was performed in four months near the end of the outbreak. The authors address some of the implications of this. A potential challenge to the transferability of this result is that the experts' understanding of local idiosyncrasies in transmission may have improved over the course of the outbreak. The model did not have this improvement over time. The comparison of models to experts may therefore not be applicable to the early stages of an outbreak when expert opinions may be less welltuned.
  
  This research has important implications for both researchers and policy-makers. Mathematical models produce clearly-described predictions that will later be compared to observed outcomes. When model predictions differ greatly from observations, this harms trust in the models, but alternative forms of prediction are seldom so clearly articulated or accurately assessed. If models are discredited without proper assessment of alternatives then we risk losing a valuable source of information that can help guide public health responses. From an academic perspective, this research can help to guide methods for combining expert opinion with model outputs, such as considering how experts can inform models' prior distributions and how model outputs can inform experts' opinions.
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The manuscript by Munday et al. presents real-time predictions of geographic spread during an Ebola epidemic in north-eastern DRC. Predictions were elicited from individual experts engaged in outbreak response and from two mathematical models. The authors found comparable performance between experts and models overall, although the models outperformed experts in a few dimensions.
  
  Strengths:
  
  Both individual experts and mathematical models are commonly used to support outbreak response but rarely used together. The manuscript presents an in-depth analysis of the accuracy and decision-relevance of the information provided by each source individually and in combination.
  
  Weaknesses:
  
  A few minor methodological details are currently missing.
  
  We thank the reviewers for taking the time to consider our paper and for their positive reflections and suggestions for our study. We recognise and endorse their characterisation of the study in the public reviews and are greatful for their interest and support for this work.
  
  Reviewer #1 (Recommendations For The Authors):
  
  I initially found Table 1 difficult to interpret. In the final two columns, the rows relate to each other but in the other columns, rows within months don't relate to each other. Could this be made clearer?
  
  Thank you for your helpful suggestion. We agree that this is a little confusing and have now added vertical dividers to the table to indicate which parts of the table relate to each other.
  
  In Figure 1A, the colours are the same as in the colour-bar for Figure 1B but don't have the same meaning. Could different colours be used or could Figure 1A have its own colour-bar to aid clarity?
  
  Thank you for your query. The colours are not the same pallette, but we appreciate that they look very similar. To help the reader we have changed the colour palette of panel A and added a legend to the left.
  
  In Figure 3, can labels for each expert be aligned horizontally, rather than moving above and below the timeline each month?
  
  Thank you for your perspective on this. We made the concious dicision to desplay the experts in this way as it allows the timeline to be presented in a shorter horizontal space. We appreciate that others may prefer a different design, but we are happy with this one.
  
  On lines 292 and 293, the authors state that experts were less confident that case numbers would cross higher thresholds. It seems that this would be inevitable given the number of cases is cumulative. Could this be clarified, please?
  
  Thank you for raising this point. We agree that this wording is confusing. We have now reworked the entire section in response to another reviewer. The equivalent section now reads:
  
  Experts correctly identified Mabalako as the highest-risk HZ in December. They attributed an average 82% probability of exceeding 2 cases; Mabalako reported 38 cases that month, exceeding all thresholds, although the probability assigned to exceeding the higher thresholds was similar to that of Beni (3 cases)
  
  Reviewer #2 (Recommendations For The Authors):
  
  (1) Some methodological details seem to be missing. Most importantly, the results present multiple ensembles (experts, models, and both), but I can't seem to find anywhere in the Methods that details how these ensembles are calculated. Also, I think it would be useful to define the variables in each equation. It would have been easier to connect the equations to the description if the variables were cited explicitly in the text.
  
  Thank you for pointing out these omissions. We have included the following paragraph to detail how ensemble forecasts were calculated.
  
  “Enslemble forecasts
  
  Ensemble forecasts were calculated as an average of the probabilities attributed by the members of the ensemble. For the expert ensemble the arithmetic mean was calculated across all experts with equal weighting. Similarly the model ensemble used the unweighted mean of the model forecasts. For the mixed (model and expert) ensemble, the mean was weighted such that the combined weight of the experts forecasts and the combined weight of the models forecasts were equal.”
  
  (2) Overall, I think the results provide a strong analysis of model vs. expert performance. However, some sections were highly detailed (e.g., the text usually discusses results for every month and all health zones), which clouded my ability to see the salient points. For example, I found it difficult to follow all the details about expert/model predictions vs. observations in the "Expert panel and health zones..." subsection; instead, the graphical illustration of predictions vs. observations in Figure 4 was much easier to interpret. Perhaps some of these details could be trimmed or moved to the supplementary material.
  
  Thank you for your honest feedback on this point. We have shortened this section to highlight the key points that we feel are the most important. We have also simplified the text where we discuss the health zones nominated by experts.
  
  (3) Figure 5C is a nice visualization of the fallibility of relying on a single individual expert (or model). I wonder if it would be useful to summarize these results into the probability that a randomly selected expert outperforms a single model. Is it the case that a single expert is more unreliable than a single model? The discussion emphasizes the importance of ensembles and compares a single model to an ensemble of experts, but eliciting predictions from multiple experts may not always be possible.
  
  Thank you for raising this. We agree that this is an important point that eliciting expert opinions is not a trivial task and should not be taken for granted. We agree with the principle of your suggestion that it would be useful to understand how the models compare to indevidual experts. We don’t however believe that an additional analysis would add sufficiently more information than already shown in Figure 5, which already displays the full distribution of indevidual experts for each month and threshold. If you would like to try this analysis yourself, the relevant data (the indevidual score for each combination of expert, threshold, heal zone and month) is included in the github repo (https://github.com/epiforecasts/Ebola-Expert-Elicitation/blob/main/outputs/indevidual_results_with_scores.csv).
  
  Minor comments:
  
  (1) Figure 2: the color scales in each panel are meant to represent different places, correct? The figure might be easier to interpret if the colors used were different.
  
  Thank you for bringing this to our attention. We have now changed the palette of panel A to differ from panel B.
  
  (2) Equation 7: is o(c>c_thresh) meant to be the indicator function (i.e. 1 if c>c_thresh) and 0 otherwise)?
  
  Thanks for raising this. The function o is the same as in the previous equation – an observation count function. We appreciate that this is not immediately clear so have added a sentence to explain the notation after the equation.
  
  (3) Table 1: a brief description of the column headers would be useful.
  
  Thank you for the suggestion. We have now extended the table caption to include more description of the columns.
  
  “Table 1: Experts and health zones included in each round of the survey. The left part of the table details the experts interviewed (highlighted in green) the health zones included in the main survey in each month. In addition, the right part of the table details the health zones nominated by experts and the number of experts that nominated each one.”
  
  AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2024.03.14.24304285v2
www.biorxiv.org www.biorxiv.org

Multi-omics investigation of spontaneous T2DM macaque emphasizes gut microbiota could up-regulate the absorption of excess palmitic acid in the T2DM progression

2
1. Public_Reviews 30 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This important work substantially advances our understanding of the interaction among gut microbiota, lipid metabolism, and the host in type 2 diabetes. The evidence supporting the claims of the authors is convincing. The work will be of interest to medical biologists working on microbiota and diabetes.
  
  Summary
2. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors sought to identify the relationships between gut microbiota, lipid metabolites and the host in type 2 diabetes (T2DM) by using spontaneously developed T2DM in macaques, considered among the best human models.
  
  Strengths:
  
  The authors compared comprehensively the gut microbiota, plasma fatty acids between spontaneous T2DM and the control macaques, verifying the results with macaques in a high-fat diet-fed mice model.
  
  Comments on revisions:
  
  The authors responded to the comments raised, and the manuscript has been improved.
  
  Review 1
Visit annotations in context

Tags

Summary

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.10.17.618794v3
www.biorxiv.org www.biorxiv.org

Smed-pou4-2 regulates mechanosensory neuron regeneration and function in planarians

4
1. Public_Reviews 30 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This is a valuable study that explores the role of the conserved transcription factor POU4-2 in the maintenance, regeneration, and function of planarian mechanosensory neurons. The authors provide solid evidence provided by gene expression and functional studies to demonstrate that POU4-2 is required for the maintenance and regeneration of functional mechanosensory neurons in planarians. Furthermore, the authors identify conserved genes associated with human auditory and rheosensory neurons as potential targets of this transcription factor.
  
  Summary
2. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this manuscript, the authors explore the role of the conserved transcription factor POU4-2 in planarian maintenance and regeneration of mechanosensory neurons. The authors explore the role of this transcription factor and identify potential targets of this transcription factor. Importantly, many genes discovered in this work are deeply conserved, with roles in mechanosensation and hearing, indicating that planarians may be a useful model with which to study the roles of these key molecules. This work is important within the field of regenerative neurobiology, but also impactful for those studying the evolution of the machinery that is important for human hearing.
  
  Strengths:
  
  The paper is rigorous and thorough, with convincing support for the conclusions of the work.
  
  Weaknesses:
  
  Weaknesses are relatively minor and could be addressed with additional experiments or changes in writing.
  
  Review 1
3. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this manuscript, the authors investigate the role of the transcription factor Smed-pou4-2 in the maintenance, regeneration, and function of mechanosensory neurons in the freshwater planarian Schmidtea mediterranea. First, they characterize the expression of pou4-2 in mechanosensory neurons during both homeostasis and regeneration, and examine how its expression is affected by the knockdown of soxB1, 2, a previously identified transcription factor essential for the maintenance and regeneration of these neurons. Second, the authors assess whether pou4-2 is functionally required for the maintenance and regeneration of mechanosensory neurons.
  
  Strengths:
  
  The study provides some new insights into the regulatory role of pou4-2 in the differentiation, maintenance, and regeneration of ciliated mechanosensory neurons in planarians.
  
  Weaknesses:
  
  The overall scope is relatively limited. The manuscript lacks clear organization, and many of the conclusions would benefit from additional experiments and more rigorous quantification to enhance their strength and impact.
  
  Review 2
4. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Author response:
  
  (1) We will clarify statements comparing regeneration and developmental processes. Additionally, we will include a new supplemental figure with published data showing that the pou4-2 clone dd_Smed_v6_30562_0_1 (cross-referenced as SMED30002016) is expressed during stages corresponding to organ development in Schmidtea mediterranea (https://planosphere.stowers.org/feature/Schmidtea/mediterranea-sexual/transcript/SMED30002016).
  
  (2) We will reorganize the figures by combining Figures 3 and 4 for improved clarity.
  
  (3) We will address experimental and interpretive concerns regarding the role of atonal in the pou4-2 gene regulatory network.
  
  AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.15.654132v1
www.biorxiv.org www.biorxiv.org

The Anti-Inflammatory Role of GPNMB in Post-Traumatic Osteoarthritis

3
1. Public_Reviews 30 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study offers useful findings demonstrating the cartilage-protective effects of osteoactivin in inflammatory experimental models. The study provides compelling evidence that osteoactivin may serve as a promising therapeutic target for inflammatory joint diseases.
  
  Summary
2. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  While previous studies by this group and others have demonstrated the anti-inflammatory properties of osteoactivin, its specific role in cartilage homeostasis and disease pathogenesis remains unknown. Building on current knowledge, Asaad and colleagues investigated the functional role of this protein using both in vitro systems and an in vivo post-traumatic osteoarthritis model. In line with existing literature, the authors report that osteoactivin exerts inhibitory effects in these experimental settings. This study thus offers novel evidence supporting the cartilage-protective effects of osteoactivin in various experimental models.
  
  Strengths:
  
  Strengths of the study include its clinical relevance, given the lack of curative treatments for osteoarthritis, as well as the clarity of the narrative and the quality of most results.
  
  Weaknesses:
  
  A limitation of the study is the reliance on standard techniques; however, this is a minor concern that does not diminish the overall impact or significance of the work.
  
  Review 1
3. Public_Reviews 30 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This manuscript presents compelling evidence for a novel anti-inflammatory function of glycoprotein non-metastatic melanoma protein B (GPNMB) in chondrocyte biology and osteoarthritis (OA) pathology. Through a combination of in vitro, ex vivo, and in vivo models, including the destabilization of the medial meniscus (DMM) surgery in mice, the authors demonstrate that GPNMB expression is upregulated in OA-affected cartilage and that recombinant GPNMB treatment reduces the expression of key catabolic markers (MMPs, Adamts-4, and IL-6) without impairing anabolic gene expression. Notably, DBA/2J mice lacking functional GPNMB exhibit exacerbated cartilage degradation post-injury. Mechanistically, GPNMB appears to mitigate inflammation via the MAPK/ERK pathway. Overall, the work is thorough, methodologically sound, and significantly advances our understanding of GPNMB as a protective modulator in osteoarthritic joint disease. The findings could open pathways for therapeutic development.
  
  Strengths:
  
  (1) Clear hypothesis addressing a well-defined knowledge gap.
  
  (2) Robust and multi-modal experimental design: includes human, mouse, cell-line, explant, and surgical OA models.
  
  (3) Elegant use of DBA/2J GPNMB-deficient mice to mimic endogenous loss-of-function.
  
  (4) Mechanistic insight provided through MAPK signaling analysis.
  
  (5) Statistical analysis appears rigorous, and figures are informative.
  
  Weaknesses:
  
  (1) Clarify the strain background of the DBA/2J GPNMB+ mice: While DBA/2J GPNMB+ is described as a control, it would help to explicitly state whether these are transgenically rescued mice or another background strain. Are they littermates, congenic, or a separate colony?
  
  (2) Provide exact sample sizes and variance in all figure legends: Some figures (e.g., Figure 2 panels) do not consistently mention how many replicates were used (biological vs. technical) for each experimental group. Standardizing this across all panels would improve reproducibility.
  
  (3) Expand on potential sex differences: The DMM model is applied only in male mice, which is noted in the methods. It would be helpful if the authors added 1-2 lines in the discussion acknowledging potential sex-based differences in OA progression and GPNMB function.
  
  (4) Visual clarity in schematic (Figure 7): The proposed mechanism is helpful, but the text within the schematic is somewhat dense and could be made more readable with spacing or enlarged font. Also, label the MAPK/ERK pathway explicitly in panel B.
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.06.06.658389v1
www.researchsquare.com www.researchsquare.com

Stabilisation of HIF signalling extends epicardial activation and neonatal heart regeneration

4
1. Public_Reviews 30 Jul 2025
  
  in eLife (unscoped)
  
  eLife Assessment
  
  This valuable study investigates the role of HIF1a signaling in epicardial activation and neonatal heart regeneration in mice. Through a combination of genetic and pharmacological approaches, the authors show that stabilization of HIF1a enhances epicardial activation and extends the regenerative capacity of the heart beyond the typical neonatal window following myocardial infarction (MI). However, several aspects of the study remain incomplete and would benefit from further clarification and additional experimental support to solidify the conclusions.
  
  Summary
2. Public_Reviews 30 Jul 2025
  
  in eLife (unscoped)
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The manuscript by Gamen et al. analyzed the functional role of HIF signaling in the epicardium, providing evidence that stabilization of the hypoxia signaling pathway might contribute to neonatal heart regeneration. By generating different conditionally mouse mutants and performing pharmacological interventions, the authors demonstrate that stabilizing HIF signaling enhances cardiac regeneration after MI in P7 neonatal hearts.
  
  Strengths:
  
  The study presents convincing genetic and pharmacological approaches to the role of hypoxia signaling in enhancing the regenerative potential of the epicardium.
  
  Weaknesses:
  
  The major weakness is the lack of convincing evidence demonstrating the role of hypoxia signaling in EMT modulation in epicardial cells. Additionally, novel experimental approaches should be performed to allow for the translation of these findings to the clinical arena.
  
  Review 1
3. Public_Reviews 30 Jul 2025
  
  in eLife (unscoped)
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this study, Gamen et al. investigated the roles of hypoxia and HIF1a signaling in regulating epicardial function during cardiac development and neonatal heart regeneration. They found that WT1⁺ epicardial cells become hypoxic and begin expressing HIF1a from mid-gestation onward. During development, epicardial HIF1a signaling regulates WT1 expression and promotes coronary vasculature formation. In the postnatal heart, genetic and pharmacological upregulation of HIF1a sustained epicardial activation and improved regenerative outcomes.
  
  Strengths:
  
  HIF1a signaling was manipulated in an epicardium-specific manner using appropriate genetic tools.
  
  Weaknesses:
  
  There appears to be a discrepancy between some of the conclusions and the provided histological data. Additionally, the study does not offer mechanistic insight into the functional recovery observed.
  
  Review 2
4. Public_Reviews 30 Jul 2025
  
  in eLife (unscoped)
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The authors' research here was to understand the role of hypoxia and hypoxia-induced transcription factor Hif-1a in the epicardium. The authors noted that hypoxia was prevalent in the embryonic heart, and this persisted into neonatal stages until postnatal day 7 (P7). Hypoxic regions in the heart were noted in the outer layer of the heart, and expression of Hif-1a coincided with the epicardial gene WT1. It has been documented that at P7, the mouse heart cannot regenerate after myocardial infarction, and the authors speculated that the change in epicardial hypoxic conditions could play a role in regeneration. The authors then used genetic and pharmacological tools to increase the activity of Hif genes in the heart and noted that there was a significant improvement in cardiac function when Hif-1a was active in the epicardium. The authors speculated that the presence of Hif-1a improved cell survival.
  
  Strengths:
  
  A focus on hypoxia and its effects on the epicardium in development and after myocardial infarction. This study outlines the potential to extend the regenerative time window in neonatal mammalian hearts.
  
  Weaknesses:
  
  While the observations of improved cardiac function are clear, the exact mechanism of how increased Hif-1a activity causes these effects is not completely revealed. The authors mention improved myocardium survival, but do not include studies to demonstrate this.
  
  There is an indication that fibrosis is decreased in hearts where Hif activity is prolonged, but there are no studies to link hypoxia and fibrosis.
  
  Review 3
Visit annotations in context

Tags

Review 1

Review 2

Review 3

Summary

Annotators

Public_Reviews

URL

researchsquare.com/article/rs-2496938/'https://www.researchsquare.com/article/rs-2496938/v3
www.medrxiv.org www.medrxiv.org

Untitled document

4
1. Public_Reviews 29 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study introduces a useful method to estimate the probability that a malaria case is imported and to identify the geographic origin of parasites by using a Bayesian approach that integrates epidemiological, travel, and genetic data. The authors provide convincing evidence that the approach can reliably identify the main sources of malaria imports. This work will be of great interest to the area of genomic epidemiology and public health strategies aiming to eliminate malaria.
  
  Summary
2. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This study presents a new Bayesian approach to estimate importation probabilities of malaria, combining epidemiological data, travel history, and genetic data through pairwise IBD estimates. Importation is an important factor challenging malaria elimination, especially in low-transmission settings. This paper focuses on Magude and Matutuine, two districts in southern Mozambique with very low malaria transmission. The results show isolation-by-distance in Mozambique, with genetic relatedness decreasing with distances larger than 100 km, and no spatial correlation for distances between 10 and 100 km. But again, strong spatial correlation in distances smaller than 10 km. They report high genetic relatedness between Matutuine and Inhambane, higher than between Matutuine and Magude. Inhambane is the main source of importation in Matutuine, accounting for 63.5% of imported cases. Magude, on the other hand, shows smaller importation and travel rates than Matutuine, as it is a rural area with less mobility. Additionally, they report higher levels of importation and travel in the dry season, when transmission is lower. Also, no association with importation was found for occupation, sex, and other factors. These data have practical implications for public health strategies aiming for malaria elimination, for example, testing and treating travelers from Matutuine in the dry season.
  
  Strengths:
  
  The strength of this study lies in the combination of different sources of data - epidemiological, travel, and genetic data - to estimate importation probabilities, and the statistical analyses.
  
  Weaknesses:
  
  The authors recognize the limitations related to sample size and the biases of travel reports.
  
  Review 1
3. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Based on a detailed dataset, the authors present a novel Bayesian approach to classify malaria cases as either imported or locally acquired.
  
  Strengths:
  
  The proposed Bayesian approach for case classification is simple, well justified, and allows the integration of parasite genomics, travel history, and epidemiological data. The work is well-written, very organized, and brings important contributions both to malaria control efforts in Mozambique and to the scientific community. Understanding the origin of cases is essential for designing more effective control measures and elimination strategies.
  
  Weakness:
  
  While the authors aim to classify cases as imported or locally acquired, the work lacks a quantification of the contribution of each case type to overall transmission.
  
  The Bayesian rationale is sound and well justified; however, the formulation appears to present an inconsistency that is replicated in both the main text and the Supplementary Material.
  
  Review 2
4. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  The authors present an important approach to identify imported P. falciparum malaria cases, combining genetic and epidemiological/travel data. This tool has the potential to be expanded to other contexts. The data was analyzed using convincing methods, including a novel statistical model; although some recognized limitations can be improved. This study will be of interest to researchers in public health and infectious diseases.
  
  Strengths:
  
  The study has several strengths, mainly the development of a novel Bayesian model that integrates genomic, epidemiological, and travel data to estimate importation probabilities. The results showed insights into malaria transmission dynamics, particularly identifying importation sources and differences in importation rates in Mozambique. Finally, the relevance of the findings is to suggest interventions focusing on the traveler population to help efforts for malaria elimination.
  
  Weaknesses:
  
  The study also has some limitations. The sample collection was not representative of some provinces, and not all samples had sufficient metadata for risk factor analysis, which can also be affected by travel recall bias. Additionally, the authors used a proxy for transmission intensity and assumed some conditions for the genetic variable when calculating the importation probability for specific scenarios. The weaknesses were assessed by the authors.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2025.05.01.25326793v2
www.biorxiv.org www.biorxiv.org

Colony demographics shape nest construction in Camponotus fellah ants

4
1. Public_Reviews 29 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents an important finding that ant nest structure and digging behavior depend on ant age demographics for a ground-dwelling ant species (Camponotus fellah). By asking whether ants employ age-polyethism in excavation, the authors address a long-standing question about how individuals in collectives determine the overall state of the task they must perform, and their results may prove to be a key consideration for interpreting results from other studies in the field of social insect behavior. The experimental evidence that the age of the ants and the group composition affect the digging of tunnels is solid, although some aspects of the modeling and certain analyses may benefit from further clarification regarding their added value to the core findings.
 
 Summary
2. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This study investigates how ant group demographics influence nest structures and group behaviors of Camponotus fellah ants, a ground-dwelling carpenter ant species (found locally in Israel) that build subterranean nest structures. Using a quasi-2D cell filled with artificial sand, the authors perform two complementary sets of experiments to try to link group behavior and nest structure: first, the authors place a mated queen and several pupae into their cell and observe the structures that emerge both before and after the pupae eclose (i.e., "colony maturation" experiments); second, the authors create small groups (of 5,10, or 15 ants, each including a queen) within a narrow age range (i.e., "fixed demographic" experiments) to explore the dependence of age on construction. Some of the fixed demographic instantiations included a manually induced catastrophic collapse event; the authors then compared emergency repair behavior to natural nest creation. Finally, the authors introduce a modified logistic growth model to describe the time-dependent nest area. The modification introduced parameters that allow for age-dependent behavior, and the authors use their fixed demographic experiments to set these parameters, and then apply the model to interpret the behavior of the colony maturation experiments. The main results of this paper are that for natural nest construction, nest areas, and morphologies depend on the age demographics of ants in the experiments: younger ants create larger nests and angled tunnels, while older ants tend to dig less and build predominantly vertical tunnels; in contrast, emergency response seems to elicit digging in ants of all ages to repair the nest.
 
 The experimental results are solid, providing new information and important insights into nest and colony growth in a social insect species. As presented, I still have some reservations about the model's contribution to a deeper understanding of the system. Additional context and explanation of the model, implications, and limitations would be helpful for readers.
 
 Review 1
3. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 I enjoyed this paper and its examination of the relationship between overall density and age polyethism to reduce the computational complexity required to match nest size with population. I had some questions about the requirement that growth is infinite in such a solution, but these have been addressed by the authors in the responses and the updated manuscript. I also enjoyed the discussion of whether collective behaviour is an appropriate framework in systems in which agents (or individuals) differ in the behavioural rules they employ, according to age, location, or information state. This is especially important in a system like social insects, typically held as a classic example of individual-as-subservient to whole, and therefore most likely to employ universal rules of behaviour. The current paper demonstrates a potentially continuous age-related change in target behaviour (excavation), and suggests an elegant and minimal solution to the requirement for building according to need in ants, avoiding the invocation of potentially complex cognitive mechanisms, or information states that all individuals must have access to in order to have an adaptive excavation output.
 
 The authors have addressed questions I had in the review process and the manuscript is now clear in its communication and conclusions.
 
 The modelling approach is compelling, also allowing extrapolation to other group sizes and even other species. This to me is the main strength of the paper, as the answer to the question of whether it is younger or older ants that primarily excavate nests could have been answered by an individual tracking approach (albeit there are practical limitations to this, especially in the observation nest setup, as the authors point out). The analysis of the tunnel structure is also an important piece of the puzzle, and I really like the overall study.
 
 Review 2
4. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 This study investigates how ant group demographics influence nest structures and group behaviors of Camponotus fellah ants, a ground-dwelling carpenter ant species (found locally in Israel) that build subterranean nest structures. Using a quasi-2D cell filled with artificial sand, the authors perform two complementary sets of experiments to try to link group behavior and nest structure: first, the authors place a mated queen and several pupae into their cell and observe the structures that emerge both before and after the pupae eclose (i.e., "colony maturation" experiments); second, the authors create small groups (of 5,10, or 15 ants, each including a queen) within a narrow age range (i.e., "fixed demographic" experiments) to explore the dependence of age on construction. Some of the fixed demographic instantiations included a manually induced catastrophic collapse event; the authors then compared emergency repair behavior to natural nest creation. Finally, the authors introduce a modified logistic growth model to describe the time-dependent nest area. The modification introduces parameters that allow for age-dependent behavior, and the authors use their fixed demographic experiments to set these parameters, and then apply the model to interpret the behavior of the colony maturation experiments. The main results of this paper are that for natural nest construction, nest areas, and morphologies depend on the age demographics of ants in the experiments: younger ants create larger nests and angled tunnels, while older ants tend to dig less and build predominantly vertical tunnels; in contrast, emergency response seems to elicit digging in ants of all ages to repair the nest.
 
 We sincerely thank Reviewer #1 for the time and effort dedicated to our manuscript's detailed review and assessment. The revision suggestions were constructive, and we have provided a point-by-point response to address them.
 
 Reviewer #2 (Public review):
 
 I enjoyed this paper and the approach to examining an accepted wisdom of ants determining overall density by employing age polyethism that would reduce the computational complexity required to match nest size with population (although I have some questions about the requirement that growth is infinite in such a solution). Moreover, the realization that models of collective behaviour may be inappropriate in many systems in which agents (or individuals) differ in the behavioural rules they employ, according to age, location, or information state. This is especially important in a system like social insects, typically held as a classic example of individual-as-subservient to whole, and therefore most likely to employ universal rules of behaviour. The current paper demonstrates a potentially continuous age-related change in target behaviour (excavation), and suggests an elegant and minimal solution to the requirement for building according to need in ants, avoiding the invocation of potentially complex cognitive mechanisms, or information states that all individuals must have access to in order to have an adaptive excavation output.
 
 We sincerely thank reviewer #2 for the time and effort dedicated to our manuscript's detailed review and assessment. We have provided a point-by-point response to the reviewer's comments, which we have incorporated into the revised version of the manuscript.
 
 The only real reservation I have is in the question of how this relationship could hold in properly mature colonies in which there is (presumably) a balance between the birth and death of older workers. Would the prediction be that the young ants still dig, or would there be a cessation of digging by young ants because the area is already sufficient? Another way of asking this is to ask whether the innate amount of digging that young ants do is in any way affected by the overall spatial size of the colony. If it is, then we are back to a problem of perfect information - how do the young ants know how big the overall colony is? Perhaps using density as a proxy? Alternatively, if the young ants do not modify their digging, wouldn't the colony become continuously larger? As a non-expert in social insects, I may be misunderstanding and it may be already addressed in the citations used.
 
 We thank the reviewer for this interesting question. We find that the nest excavation is predominantly performed by the younger ants in the nest, and the nest area increase is followed by an increase in the population. However, if the young ants dig unrestricted, this could result in unnecessary nest growth as suggested by reviewer #2. Therefore, we believe that the innate digging behavior of ants could potentially be regulated by various cues such as;
 
 (a) Density-based: If the colony becomes less dense as its area expands, this could serve as a feedback signal for young ants to reduce or stop digging, as described in references (25, 29, 30).
 
 (b) Pheromone depositions: If the colony reaches a certain population density, pheromone signals could inhibit further digging by young ants, references (25, 29), or space usage as a proxy for the nest area.
 
 Thus, rather than perfect information, decentralized control, and digging-based local cues probably regulate the level of age-dependent digging, without the ants needing to estimate the overall colony size or nest area.
 
 In any case, this is an excellent paper. The modelling approach is excellent and compelling, also allowing extrapolation to other group sizes and even other species. This to me is the main strength of the paper, as the answer to the question of whether it is younger or older ants that primarily excavate nests could have been answered by an individual tracking approach (albeit there are practical limitations to this, especially in the observation nest setup, as the authors point out). The analysis of the tunnel structure is also an important piece of the puzzle, and I really like the overall study.
 
 We thank the reviewer for the comments. We completely agree that individual tracking of ants within our experimental setup would have been the ideal approach, but we were limited by technical and practical limitations of the setup, as pointed out by the reviewer, such as;
 
 (a) Continuous tracking of ants in our nests would have required a camera to be positioned at all times in front of the nest, which necessitates a light background. Since Camponotus fellah ants are subterranean, we aimed to allow them to perform nest excavation in conditions as close to their natural dark environment as possible. Additionally, implementing such a system in front of each nest would have reduced the sample sizes for our treatments.
 
 (b) The experimental duration of our colony maturation and fixed demographics experiments extended for up to six months (unprecedented durations in these kinds of measurements). These naturally limited our ability to conduct individual tracking while maintaining the identity of each ant based on the current design.
 
 These details are described in detail within the revised version of the manuscript.
 
 Reviewer #3 (Public review):
 
 Summary:
 
 In this study, Harikrishnan Rajendran, Roi Weinberger, Ehud Fonio, and Ofer Feinerman measured the digging behaviours of queens and workers for the first 6 months of colony development, as well as groups of young or old ants. They also provide a quantitative model describing the digging behaviours and allowing predictions. They found that young ants dig more slanted tunnels, while older ants dig more vertically (straight down). This finding is important, as it describes a new form of age polyethism (a division of labour based on age). Age polyethism is described as a "yes or no" mechanism, where individuals perform or not a task according to their age (usually young individuals perform in-nest tasks, and older ones foraging). Here, the way of performing the task is modified, not only the propensity to carry it or not. This data therefore adds in an interesting way to the field of collective behaviours and division of labour.
 
 The conclusions of the paper are well supported by the data. Measurements of the same individuals over time would have strengthened the claims.
 
 We sincerely thank reviewer #3 for the time and effort dedicated to our manuscript's detailed review and assessment. We completely agree with the reviewer’s comments on the measurements of the same individuals over time, however, we were limited by the technical and experimental limitations as described above and pointed out by reviewer #2.
 
 Strengths:
 
 I find that the measure of behaviour through development is of great value, as those studies are usually done at a specific time point with mature colonies. The description of a behaviour that is modified with age is a notable finding in the world of social insects. The sample sizes are adequate and all the information clearly provided either in the methods or supplementary.
 
 We thank reviewer #3 for this assessment.
 
 Weaknesses:
 
 I think the paper is failing to take into consideration or at least discuss the role of inter-individual variabilities. Tasks have been known to be undertaken by only a few hyper-active individuals for example. Comments on the choice to use averages and the potential roles of variations between individuals are in my opinion lacking. Throughout the paper wording should be modified to refer to the group and not the individuals, as it was the collective digging that was measured. Another issue I had was the use of "mature colony" for colonies with very few individuals and only 6 months of age. Comments on the low number of workers used compared to natural mature colonies would be welcome.
 
 Regarding the main comment 1
 
 We completely agree with the reviewer’s comment on considering inter-individual variability based on activity levels. We have discussed how individual morphological variability could influence digging behavior (references: 28, 31), and we will elaborate further on this aspect in future revisions.
 
 Regarding the main comment 2:
 
 The term ‘colony maturation’ in our study refers to the progressive development of colonies from a single queen, distinguishing it from experiments that begin with pre-established, demographically stable colonies. We provide a detailed explanation for this terminology in the revised version of the manuscript. We were practically limited by the continuation of the experiments for more than 6 months of age, predominantly due to the stability of nests, as they were made with a sand-soil mix. We also acknowledge that the colony sizes attained in our maturation experiments may be smaller than those of naturally matured colonies. This trend was observed generally in lab-reared colonies and could be attributed to differences in microclimatic conditions, foraging opportunities, space availability, and other factors. We have explicitly described these details in the revised version of the manuscript.
 
 Reviewer #1 (Recommendations for the authors):
 
 The experimental design is fantastic. The large quasi-2D should allow for the direct visualization of the movements of individuals and the creation of the nest, and the inclusion of non-workers (specifically, a mated queen and pupae) is new and important. However, I have some questions and concerns about the results, as outlined below. Also, I found the paper difficult to read, and the connections between the various experiments and the model were not always clear.
 
 We thank the reviewer for the time and effort dedicated to reviewing our manuscript. We have modified the manuscript substantially to address the comments and readability.
 
 The assumption that the digging rate is constant across ants may be a strong one. Previous work (see, for instance, Aguilar, et al, Science 2018) has demonstrated a very heterogeneous workload distribution among ants. I am not sure what implications that may have for the results here, but the authors should comment on this choice. Related to the point above, given a constant digging rate, the variation in digging is attributed to an age-dependent "desired target area". Can the authors comment on the implications of this, specifically in contrast to a variable digging rate? The distinction between digging rate differences and target area differences seems to be important for the authors. However, the way this is presented, it is difficult to fully understand or appreciate this importance and its implications. What is the consequence of this difference, and why is this important?
 
 We apologize to the reviewer for the confusion.
 
 Our model does not assume that the digging rate (da/dt, Equation 1) remains constant throughout the experiment. Instead, we only treat the basal digging rate (r) as a constant.
 
 The variable digging rate (da/dt, Equation 1) is derived by multiplying the basal rate constant (r) by the term (1 - a/aage), which accounts for deviations from the age-dependent target area that the ants aim to achieve. This makes the actual digging rate dynamic, as it responds to changes in excavated area (e.g., expansion or rapid collapse)
 
 For example, according to our model (Equation 1), two ants with the same basal digging rate (r) may exhibit markedly different actual digging rates at a given time if they differ in age. This occurs because the variable digging rate (da/dt) depends not only on ‘r’ but also on the age-dependent term (1 - a/aage). Also, we emphasize that the use of a basal digging rate constant aligns with prior studies (refs. 24, 29, 30).
 
 In our work, we demonstrate that after a collapse event, ants of all ages dig at rates comparable to those observed in the initial (pre-collapse) phase of the experiment. This occurs because the ants are far from their age-dependent target area, effectively resetting their digging behavior. By comparing maximum digging rates pre- and post-collapse, we provide strong empirical evidence that this rate is age-independent (SI Fig. 6A, 6B), supporting the conclusion that the basal digging rate constant (r) is a fundamental property of the ants' behavior, unaffected by age.
 
 We agree with the reviewer that individual tracking of ants within our experimental setup would have been the ideal approach. Then, we could have taken the inter-individual variability of the digging activity into account. However, we were limited to doing so by the technical and practical limitations of the setup, such as;
 
 (a) Continuous tracking of ants in our nests would have required a camera to be positioned at all times in front of the nest, which necessitates a light background. Since Camponotus fellah ants are subterranean, we aimed to allow them to perform nest excavation in conditions as close to their natural dark environment as possible. Additionally, implementing such a system in front of each nest would have reduced the sample sizes for our treatments.
 
 (b) The experimental duration of our colony maturation experiments extended for up to six months (unprecedented durations in these kinds of measurements). These naturally limited our ability to conduct individual tracking while maintaining the identity of each ant based on the current design.
 
 In light of these points, the following lines are added to the discussion (line numbers: 283-295), signifying the above points:
 
 “Our age-dependent model demonstrates that the digging behavior in Camponotus fellah is governed by a basal digging rate constant (r) modulated by the age-dependent feedback (1 − a/aage). Crucially, we show that after a collapse, the maximum digging rates return to their pre-collapse levels, suggesting that this basal rate ’r’ represents an age-independent ceiling on how fast ants can dig, regardless of age or context (SI Fig. 6 A, B). Previous studies have demonstrated both homogeneous and heterogeneous workload distribution, with varying digging rates among ants (24, 29, 30, 35). Studies showing heterogeneous workload distribution relied on continuous individual tracking of ants to quantify digging rates (35). However, this approach was not feasible in our current design due to the experimental durations of both our colony maturation and fixed demographics experiments. Additionally, sample size requirements naturally limited our ability to conduct continuous individual tracking during nest construction in our study. Thus, based on empirical measurements from our fixed-demographics experiments and supported by the age-independent post-collapse digging rates, we adopted a constant basal digging rate for simulating our age-dependent model—an assumption aligned with both prior literature and the collective dynamics observed in our system (24,29,30)”.
 
 Model: as presented, the model seems to lack independent validation. The model seems to have built-in that there is an age-dependent target area, and this is what is recovered from the model. I am failing to see what is learned from the model that the experiments do not already show. Also, the model has no ant interactions, though ants are eusocial and group size is known to have a large effect on behavior (this is acknowledged by the authors at the beginning of the discussion). Can the authors comment on this?My recommendation would be to remove the model from this paper or improve the text to address the above comments.
 
 We did not draw the conclusion of the age-dependent target area from our model. We used the fixed demographics experiments to quantify the age-dependent area target as a function of the age of individuals. We then used this age-dependent area target in our model to quantify the excavation dynamics of the colony maturation experiments, where ants span a variety of ages, as the nest population changes over time, resulting in natural variation in the ages of individuals within the nest. These results could not have been obtained by performing any of the individual experiments, whether colony maturation or the fixed demographics, young or old, on their own. The need for different age demographics was crucial to quantify the age-dependent effects in nest excavation, which were lacking in previous studies.
 
 First, the age-dependent model provides a very good estimate for the natural growth of the nest. More importantly, after fixing an age threshold of 56 days (mean + standard deviation of the young ant age), the model provides an estimate of which ants are doing the majority of the digging during natural nest expansion. This teaches us that during natural expansion, the older ants are far from their density target and therefore do not engage in any substantial digging, which is shown in Figure 4. C.
 
 On the other hand, the younger ants are close to their area targets and induced to dig. Indeed, the target area fitted for the age-independent model closely approximates the empirically measured age-dependent target when extrapolated to very young ants. This provides further support for the idea that, in the colony maturation experiments, the youngest ants are responsible for most of the digging.
 
 Our model is a simple analytical model, inspired by earlier models that used a fixed area target (such as density models) for nest construction. However, because we knew the precise age of workers in our experiments, we were able to obtain age-dependent area targets, thereby challenging the use of a constant area target (as employed in prior studies) in light of our findings from the fixed demographics of young and old colonies.
 
 Empirically Quantifiable Parameters: We wanted our model to have empirically quantifiable parameters. Since we did not continuously record the experiment, we could not quantify agent-agent interactions, pheromonal depositions, or similar factors.
 
 Minimal Model Design: We aimed to keep the model as minimal as possible, which is why we did not include complex interactions such as those found in continuous tracking experiments.
 
 However, the model does set up some interesting hypotheses that could easily be tested with the experimental setup (e.g., marking the ants / tracking individual activity levels). For instance, it is hypothesized that older ants dig less often, but when they do dig, they do so at the same rate. Given the 2D setup, the authors could track individual ants and test this hypothesis. Also, if the desired target area does decrease with age, the authors could verify this hypothesis by placing older ants into arenas with different-sized pre-formed nests to observe how structure is changed to achieve the desired area/ant.
 
 We thank the reviewer for this comment.
 
 We believe that the confusion with the usage of a constant basal digging rate is resolved now. To briefly reiterate, ants dig at variable rates that can be decomposed to a (constant on short time scales but age-dependent) basal rate times the (variable) distance from the density target. The suggested experiments are beyond the scope of our current study, and further studies could utilize the suggested experimental design with better time-resolved imaging for individual ant tracking that could verify the predictions from our model.
 
 Specific comments:
 
 Title:
 
 The title suggests a broad result, yet the study focuses on one ant species. Please modify the title to more accurately reflect the scope of the work.
 
 We thank the reviewer for the comment.
 
 The title is modified as “Colony demographics shape nest construction in Camponotus fellah ants.”
 
 Introduction:
 
 Important information and context are missing about this ant species. For instance, please add the following about this species in the introduction:
 
 What is their natural habitat and substrate? How does the artificial soil compare?
 
 What is their (rough) colony size? [later, discuss experiment group size choice and potential insights/limitations of results when applied to the natural system].
 
 The details have been added to the introduction (line numbers : 49-55) and the materials and methods section (Study species).
 
 “Camponotus fellah ants are native to the Near East and North Africa, particularly found in countries like Israel, Egypt, and surrounding arid and semi-arid regions, where they prefer to nest in moist, decaying wood, including tree trunks, branches, or stumps (49,50). The species lives in monogynous colonies with tens to thousands of individuals. Nests are commonly found in a sand-loamy mix, which is a combination of sand, soil, clay, or gravel, providing structural stability and moisture retention (51). They are typically found under rocks, in the crevices of dried vegetation, or dry, sandy soils, sometimes in areas with loose gravel, with a colony size ranging from tens to thousands of workers”.
 
 What is the natural life expectancy of a worker? A queen? [later, discuss fixed demographic age choices in this context and/or why were age ranges chosen for experiments?].
 
 The lifespan of ants, including both queens and workers, varies significantly based on caste, species, and environmental conditions.
 
 (1) Queen Longevity: From the literature, Camponotus fellah queens can live up to 20 years, with one documented case reaching 26 years (50).
 
 (2) Worker Longevity: In contrast to queens, the lifespan of workers is much shorter. Lab studies on Camponotus fellah (82) and other Camponotus species (83) suggest that workers can live for several months depending on environmental conditions, colony health, and caste-specific roles (e.g., minor vs. major workers)
 
 (3) Laboratory vs. Natural Conditions: Worker longevity is highly variable between laboratory and natural conditions
 
 Therefore, in the context of the old worker lifespan in our experiments, ~200 days (roughly 6–7 months), we strongly believe that the worker lifespan used in our experiments represents a substantial portion of a worker's expected life. While exact figures for C. fellah workers are unavailable, inferences from related species suggest that workers nearing 200 days are approaching the latter stages of their lifespan, making them meaningfully "old".
 
 The details are added to the main text (line numbers: 124-127) and discussion (line numbers: 278-282).
 
 Why was this species chosen? Convenience, or is there something special about this species that the readers should know? Specifically, is there something that might make the results more general or of broader interest?
 
 Camponotus fellah was chosen for this study because it is native to Israel, making it convenient to collect and maintain in the lab. Additionally, its nuptial flights occur close to the study location, ensuring a steady supply of colonies. We were able to provide them with a nesting substrate similar to what they naturally use, as their nests are typically found in a sand-loamy mix, similar to the sand-soil mix in our artificial nests. This was possible because we had the opportunity to observe their habitat and nesting behavior in the wild, allowing us to gather preliminary information on their natural nesting conditions.
 
 Results:
 
 Line 60: "several brood items" - how many exactly? Was this consistent across experiments? Do mated queens ever produce more pupae during the experiments?
 
 Yes, the number of brood items (5) was added consistently across the experiments. Additionally, the mated queen did produce pupae during the course of the experiments, which was evident from the noticeable increase in the number of workers in the nest. This was significantly higher than the number of brood items present at the start of the study.
 
 The above points are added to the section (line numbers : 68-69).
 
 Figure 1: Panel A - The food ports are never mentioned in the text. Are the ants fed during the experiments? If so, what? With what frequency? Is the water column replenished/maintained? If so, how and how often? panel C - how long did this experiment last?
 
 We thank the reviewer for pointing this out. We have now updated the nest maintenance section in the Materials and Methods (line numbers : 349-354) part to include all the necessary details and clarifications.
 
 “We provided food to the ants ad libitum through three separate tubes containing water, 20 % sucrose water, and protein food. The protein mixture included egg powder, tuna, prawns, honey, agar, and vitamins. Each of the three tubes was filled with 5 ml of their respective contents and sealed with a cotton stopper to prevent overflow. The tubes were positioned at a slight angle and connected using a custom-made plexiglass adapter to facilitate the flow of liquids. These tubes were replenished once depleted, and regularly replaced once the nest maintenance was carried out bi-weekly.”
 
 Line 76: "...excavation was commenced by the founding queen". How were the queen and pupae introduced into the system?
 
 We initiated colony maturation experiments by introducing a single mated queen and several brood items (pupae) at random positions on the soil layer of the nest (line numbers : 68-69)
 
 Line 87: Please provide bounds for 11cm2/ant value. Is there any biological or physical justification for this number?
 
 We thank the reviewer for the suggestion. We have now provided the bounds as requested (line numbers : 97-101).
 
 We were unable to pinpoint a specific biological justification based solely on this treatment. However, on extrapolating the age-dependent area fit we derived from the fixed demographics experiment, we found that at the age of 1 day, an ant has a target area of approximately 11.17 cm², which is the largest age-dependent area target possible within our experimental setup.
 
 From the colony maturation experiment, we obtained the value of 11.6 (±1.15) cm² as the area per ant. The consistency between the area per ant obtained from two completely different treatments across different colonies yielded similar results. We propose that under standardized conditions, a 1-day-old ant has a theoretical maximum target area of 11.17 cm²—the highest value observed in our experimental framework.
 
 Lines 98-99: "one straightforward possibility would be that newborn ants are the ones that dig". This statement contradicts the results presented in Figures 1 and S1 - the population increase seems to occur at least a few days before increased excavation in nearly all cases.
 
 We apologize for any confusion caused by our initial phrasing. To clarify, we proposed that a lag likely exists between population growth and nest area expansion. This lag could arise from two sequential processes: (1) newborn ants require time to mature and become active (first delay), and (2) digging to expand the nest takes additional time (second delay; estimated at ~10 days from the cross-correlation analysis). Thus, our results suggest that it is not the population that lags behind the area, but rather the area that lags behind the population, as demonstrated in Figures 2D and SI. Figure. S1.
 
 The sentence “one straightforward possibility would be that newborn ants are the ones that dig” is modified as below (line numbers : 112-119) to prevent further confusion.
 
 “One possible explanation is that, although all ants are capable of digging, it is primarily the newly emerged ants who perform this task. In this case, nest expansion would lag behind colony growth due to two delays: first, the time needed for young ants to mature enough to begin digging, and second, the physical time required to excavate additional space (e.g., around 10 days). This mechanism could eliminate the need for ants to assess overall colony density, as each new group of active workers simply enlarges the nest as they become ready. An alternative possibility is that all ants, regardless of age, respond to increased density by initiating excavation. In that scenario, nest expansion would follow more immediately after the emergence of new individuals, making delays less prominent (24, 29, 30)”.
 
 Line 105: How do group sizes compare to natural colony size? Line 106: How do "young" and "old" classifications compare to natural life expectancy?
 
 We have already addressed this question in an earlier comment. The details are added to the main text (line numbers: 124-127) and discussion (line numbers: 278-282).
 
 Line 118-119: How are nests artificially collapsed?
 
 We have added a new section in the Materials and Methods section that describes the nest collapsing procedure (Nest artificial collapse - line numbers : 386-399).
 
 Figure 2 Panel A: The white dotted line is nearly impossible to see. Please use a more visible color.
 
 We thank the reviewer for the comment.
 
 We changed the solid circles to violet and the dotted line color to continuous white.
 
 Figure 3: The use of circle markers as post-collapse recovery in young and old as well as old pre-collapse is confusing. Use different symbols for old pre-collapse vs young and old post-collapse.
 
 We thank the reviewer for pointing out the confusion. We have revised the figure markers as suggested and modified the main text accordingly.
 
 Young; pre-collapse : star
 
 Young; post-collapse : diamond
 
 Old; pre-collapse : circle
 
 Old; post-collapse: triangle.
 
 Figure 3 Panel C: Indicate that fixed demographic values here are pre-collapse. Also, as presented, it appears that there is a large group-size dependence that is not commented on. Previous results (Line 87 and Figure 2C) suggest a constant excavation area per ant of 11cm2/ant. Figure 3, panel C appears to suggest a group-size dependence. If these values are divided by group size, is excavated area per ant nearly constant across groups? How does the numerical value compare to the slope from Figure 2C?
 
 We thank the reviewer for their insightful comments.
 
 First, we would like to clarify that the area target of 11.1 (±1) cm²/ant, as described in Line 87, was obtained from the colony maturation experiments. In these experiments, we were unable to track the age of each individual ant, so the area target was calculated by normalizing the total excavated area by the number of ants.
 
 We normalized the excavated area by the group size for both young and old colonies as suggested, and found that the area per ant was not significantly different across the group sizes (see new SI Fig. 5A). This indicates that the excavated area per ant remains relatively constant within each demographic group. Moreover, this shows that the total excavated area is proportional to group size, in agreement with previous works (24, 29, and 30).
 
 We have explicitly described the above information in the line numbers: 142-146
 
 Regarding the slope comparisons, the slope of Figure 2C (10.71), from the colony maturation experiments, is the largest, followed by the area per ant from the short-term young (8.79 ± 0.98) cm²/ant, and short-term old experiments (5.16 ± 0.44) cm²/ant.
 
 Lines 128-129: "...younger ants aim to approach a higher target area". Seems hard to know what they "aim" to do... rephrase to report what they are observed to do.
 
 We thank the reviewer for the comment. The sentence is rephrased as suggested (line numbers : 158-161).
 
 “In the previous sections, we showed that in fixed-demographics experiments, younger ants excavated a significantly larger nest area compared to older ants (Fig. 3. C). This difference emerged despite similar temporal patterns in digging rates across age groups, with excavation activity peaking within the first 7 days before asymptotically decaying as nest expansion approached saturation (SI Fig. 8).”
 
 Lines 133-141: The model description is not clear. Specifically, what parameters are ant-dependent? How does A relate to a?
 
 We appreciate the reviewer's request for clarification. In our model:
 
 (1) Equation 1 describes the change in the excavated area due to the digging activity of a single ant. Here, the variable 'a' represents the area excavated by one ant. This formulation allows us to capture the individual digging behavior and its impact on the excavation process.
 
 (2) Equation 2 extends this concept to the total area excavated in the nest, denoted by 'A'. Specifically, 'A' is the sum of the areas excavated by all ants present in the nest. In other words, it aggregates the individual contributions of each ant, linking the microscopic digging behavior to the macroscopic excavation dynamics.
 
 Therefore, the relationship between 'a' and 'A' is as follows:
 
 ● 'a' = Area excavated by a single ant.
 
 ● 'A' = ∑ 'a' (Summed over all ants in the nest).
 
 We have explicitly mentioned this in the line numbers “ 161-179”, and describe the model assumptions and parameters in detail.
 
 Figure 4:
 
 Figure 4, Panel A: The equation quoted in the caption does not match the data in the figure. The equation has a positive slope and negative intercept, while the figure has a negative slope and a positive intercept. Please provide the correct equation and bounds on fit parameters.
 
 We thank the reviewer for spotting this typing mistake.
 
 The equation was already updated in the reviewed preprint published online. The correct equation and the fit bound are provided in the figure caption.
 
 “Target areas decrease linearly with the ant age (y = −0.032x + 11.22 , 95 % CI (Intercept : (-0.035,-0.027), Slope : (10.53,11.91)), R2 = 0.96 ).”
 
 Figure 4, Panel A: There seem to be three "fixed target area per ant values" in the paper: around 11cm2/ant (line 87), 11.6 cm2/ant (SI Figure 2), and linearly dependent value from fit to Figure 4A. The distinctions between these values and their significance are hard to keep track of. Can the authors add a discussion somewhere that helps the reader better understand? Is there a way to connect/rationalize/explain these different values in terms of demographics?
 
 We thank the reviewer for the suggestion.We have added a paragraph in the discussion (line numbers : 270-277) describing the area targets.
 
 “In our colony maturation experiments, we found that area per ant was highest when the workers were youngest, with values around 11.1–11.6 (±1–1.15). This aligns with observations from naturally growing nests, where newly eclosed ants dominate the population and nest volumes are relatively large. Supporting this, fixed-demographics experiments showed that the area excavated per ant declines linearly with worker age, indicating that the youngest ants contribute most to excavation. Notably, the target area we fit for the age-independent model (11.6 ± 1.15) closely matches the extrapolated value for very young workers (Fig. 4. A), reinforcing the idea that young ants are the primary excavators during early colony growth. In contrast, during events like collapses or displacement, when space is urgently needed, ants of all ages participate in excavation.”
 
 Figure 4, Panel A: What are various symbols and colors for data with error bars? If consistent with Figure 3, then this panel and subsequent model confound two factors: (1) the age dependence and (2) the behavioral differences pre- and post-collapse (structures are different pre-and post-collapse, according to SI Figure 6; line 120: "...colonies ceased digging when they recovered 93{plus minus}3% of the area lost by the manual collapse..."; lines 201-202: "We find significant quantitative and qualitative differences between nests constructed within this natural context and nests constructed in the context of an emergency") and behavior is different (according to SI Figure 7 and line 119: "...all ants dig after collapse...")). Therefore, without further supporting evidence, it does not seem that these data should be used to fit a single line that defines a model parameter a_age for each ant in equation 2.
 
 The symbols are the area per ant quantified from the fixed demographics of young, and old experiments. The symbols show the following;
 
 A. Star - Young, pre-collapse
 
 B. Diamond - Young, post-collapse
 
 C. Circle - Old, pre-collapse
 
 D. Triangle - Old, post-collapse.
 
 The details are clearly described in the figure caption.
 
 We apologize to the reviewer for the confusion. We argue that the data can be fit by a single line to quantify the parameter ‘a_age’ as follows.
 
 A. All data presented in Figure 4A were obtained from the same fixed-demographics experiments (containing only young and old ants) under experimental collapse conditions, pre- and post-collapse. These results, therefore, exclusively reflect emergency nest-building behaviors during emergency scenarios and do not include any observations from natural colony maturation processes.
 
 B. Age-dependent excavation differences: As correctly noted by the reviewer, the observed difference in excavated area before versus after collapse reflects the natural aging of ants in our experimental colonies. While colonies recovered >90% of lost area post-collapse, the residual variation was not negligible—instead, it systematically correlated with colony age structure. By tracking colonies across this demographic transition, we obtained additional data points spanning a broader developmental spectrum. This extended range strengthened our ability to detect and quantify the linear relationship between worker age and excavation output.
 
 C.The quoted sentence (lines 201-202, submitted version) refers to comparisons across all three experimental cases: (1) fixed-demographics young ants, (2) fixed-demographics old ants, and (3) the natural scenario (mixed-age colonies). Importantly, these comparisons are based on pre-collapse steady-state excavation areas, ensuring a consistent baseline across treatments. We highlight quantitative and qualitative differences between these distinct experimental groups, not between pre- and post-collapse phases within the same treatment. The pre- and post-collapse data within fixed-demographics groups were analyzed separately to avoid conflating aging effects with emergency responses.
 
 To avoid confusion, the whole paragraph in the discussion (line numbers : 253-260) is rephrased.
 
 In lines 201-202; “We find significant quantitative and qualitative differences between nests constructed within this natural context and nests constructed in the context of an emergency”.
 
 Here, by natural context, we mean the nests excavated in the colony maturation experiments. We believe that it could have been confusing, and the sentence is modified as answered for the previous question.
 
 Figure 4, Panel B: This uses the model with a_age determined by from Figure 4A and the life table (as shown in the supplemental), whereas the supplemental Figure SI 8 uses the fixed blue line a_age value for the model, which comes from the colony maturation experiments. The age-independent model in the supplemental fits the data better, yet the authors claim the supplemental model cannot be applied to the data because of their experimentally determined age-dependent target area. Given the age-independent target area model fits better, additional evidence/justification is needed to support the choice of the model.
 
 We agree with the reviewer that the age-independent model fits the data well. However, we believe that the fixed area target cannot be used to explain the excavation dynamics for the following reasons.
 
 We make an important assumption in our model: that the ants rely on local cues and that individual ants can not distinguish between the fixed demographics and colony maturation experiments (line numbers : 161-166). Given this assumption, the ants cannot change their behavior between experiments, meaning the same model should fit all of our results. However, the fixed demographics experiments revealed a significant difference in the areas excavated by young vs. old cohorts, despite having the same group size. If the ants regulated the excavated area based on an age-independent constant density target model, then the excavated area in the fixed demographics of young and old colonies would have been similar. This discrepancy indicates that the target area per ant is not constant, as assumed in the age-independent density model (SI. Fig. 8). We emphasize that while the age-independent model provides a better fit for the excavated area in colony maturation experiments, the age-dependence of excavation is empirically supported by fixed-demographics experiments. Therefore, we implemented this age-dependence through a variable target area within the age-dependent model framework to explain excavation dynamics in the colony maturation experiments.
 
 These details are explicitly mentioned in the main text (line numbers : 187 - 198)
 
 Figure 4, Panel C: Is this plot entirely from the model, or are the data points measured from experiments? Please label this more clearly.
 
 We apologize to the reviewer for the confusion.
 
 The Figure 4C is based on the age-dependent digging model. We applied the model to population data from the long-term experiments (n = 22). By setting an age threshold of 56 days (since ants used in the short-term young experiment had an average age of 40 ± 16 days), we categorized the ants into young and old groups. We then quantified the area dug by the young ants, the queen, and the old ants in terms of the percentage of the total area excavated. We hypothesized that, because young ants have a lower digging threshold, they would perform the majority of the digging. We indeed confirm this in Figure 4C.
 
 This information is added to the main text and described in detail (line numbers: 200 - 208).
 
 Lines 162-165: "...Furthermore, we quantified the area dug by each ant in the normal colony growth experiment as estimated from the age-dependent model and found that all ants excavated more or less the same amount...". Figure 4D shows a distribution with significant values ranges from 1-16 cm2... how is this interpreted as "more or less the same amount" and what is the significance of this?
 
 We apologise to the reviewer for the confusion.
 
 We quantified the percentage contribution to the excavated area of each histogram bin (provided in the new SI table: 4), and found that the area excavated between 5 cm² and 13 cm² accounts for 73.76% of the total excavated area. This indicates that most ants dug within this range rather than exhibiting extreme variations. Additionally, the mean excavation amount is 7.84 cm², with a standard deviation of 3.44 cm², meaning that most values fall between 4.4 cm² and 11.28 cm², which aligns well with the 5–13 cm² range. Since the majority of the excavation is concentrated within this narrow interval, and the mean is well centered within it, this suggests that ants excavated more or less the same amount, rather than forming distinct groups with highly different excavation behaviors.
 
 We have modified the main text (line numbers: 209-216) to include these points.
 
 The biological significance of this finding is that since all ants in the colony maturation experiments are born inside the nest, we hypothesize that they should excavate similar amounts. To test this, we quantified the area contribution of each ant over the entire duration of the experiment using the age-dependent digging model as described above and found that they indeed excavated more or less the same amount. From our analysis of fixed demographics experiments, we showed that the youngest ants excavate the largest area. Since the majority of the youngest ants participated in the colony maturation experiments, this further supports our hypothesis.
 
 Figure 5.
 
 Figure 5, Panels A-C: Please provide a scale bar.
 
 The scale bar is provided in the figure as suggested. The algorithm for the cutoffs for tunnel vs wide tunnels is described in detail in the section “Nest skeletonization, segmentation, and orientation.”
 
 Figure 5, Panel E: Why does the chamber error bar for 5 ants go to zero?
 
 In Figure 5, E, we plot the standard error, as described in the figure caption. In the experiments, the chamber area contributions were (0,0,39.94,0) respectively. The mean of the 4 numbers is 9.985, the standard deviation is 19.97, and the standard error is 9.985. So, the mean and the standard error are the same, so the lower error bar goes to zero, and the upper error bar goes to 19.97. This implies that in these experiments, the chamber area is often zero.
 
 Figure 5, Panel I: Why are there no chambers for young colonies in I when they are in the histogram in E?
 
 We apologize to the reviewer for the confusion. We initially missed adding the chamber orientation data of the young colonies to Panel I, but it has now been included.
 
 Line 212: "...densities of ants never become too high...". What is too high? Is there some connection to biological or physical constraints?
 
 Under normal growth conditions, nest volume is kept proportional to the number of ants, ensuring that the density remains within a specific range. This prevents overcrowding, which could otherwise lead to excessively high densities.
 
 Yes, we believe there is likely a connection to both biological and physical constraints. The proportional relationship between nest volume and the number of ants is likely driven by factors such as:
 
 (1) Biological Constraints:
 
 Ant Colony Size: Ants typically adjust their behavior and social structure to maintain an optimal population size relative to available resources and space.Overcrowding could lead to potentially a breakdown in colony function.
 
 Colony Health: High densities can lead to faster epidemic spread, leading to negative effects on reproduction, foraging efficiency, and overall colony health. By maintaining density within a specific range, the colony can thrive without these adverse effects.
 
 (2) Physical Constraints:
 
 Spatial Limitations: The physical space within the nest limits how many ants can occupy it before space becomes constrained. The nest’s structure and size must physically accommodate the ants, and the volume must be large enough to prevent overcrowding, and efficient resource distribution.
 
 Lines 272 and 302: How often were photos taken? These two statements seem to suggest different data collection rates.
 
 As stated in line 272, photos were taken every 1 to 3 days. During each photo session, four photos were taken, with each photo separated by 2 seconds, as mentioned in line 302. To avoid confusion, we rephrased the sentence (line numbers: 359-361).
 
 “We photographed the nest development every 1-3 days. During each photography session, four pictures of the nest were taken, with a 2-second interval between each.”
 
 Reviewer #2 (Recommendations for the authors):
 
 Some more minor points/questions/clarifications:
 
 This might be pedantic, but I don't think the nest serves as the skeleton of the superorganism, while it does change and grow, the analogy becomes weak beyond that point. The skeleton serves to protect the internal organs of the organism, facilitates movement and muscle attachment, and creates new blood cells. I would be more comfortable with a statement that the nest can grow or shrink according to need.
 
 We sincerely thank the reviewer for their time and effort in providing a detailed review and assessment of our manuscript. A point-by-point response to the comments is provided below.
 
 The analogy of treating a nest structure to the skeleton of a superorganism was based on the following points;
 
 (a) Protection: A nest protects the colony on a collective scale. This is analogous to protecting "organs" by a skeletal framework.
 
 (b) Organization and Division of Space: The skeletal structure organizes the body's internal layout, just as nest structures are organized into various spatial compartments for various colony functions, with specific regions designated for brood chambers, food storage, and waste disposal.
 
 Thus, we believe that the analogy can still be valid in a metaphorical way.
 
 Does this statement need justification with a citation, or is that information contained in the subsequent clause? "However, for more complex structures where ants congregate in specific chambers, workers are less likely to assess the overall nest density." The idea that workers do (or do not) assess overall density touches on many issues, including that of perfect information and adaptive responses, that it seems it needs to be well founded in previous work to be stated in such unequivocal terms.
 
 We thank the reviewer for this comment. The references for this argument are provided in the next sentence. We have now moved these references to the relevant sentence (reference number: 24, 29,30; line number : 30-31 )
 
 Can you give some more information on this statement? "Experiments were terminated either when the queen died or when she became irreversibly trapped after a structural collapse." Why was this collapse irreversible and therefore unlike treatment 2? Did the queen die in these instances? Was this event more likely than in natural colonies? And if so, was there something inherently different about your experiments that limit interpretation under natural conditions (e.g. the narrow nature of the observation setup? The consistency of the sand?)
 
 Our nest excavation experiments were terminated under two primary scenarios: (1) the queen died of natural causes, reflecting the baseline mortality expected when queens are brought into laboratory conditions, or (2) the nest experienced a structural collapse that left the queen irreversibly trapped. The second scenario is further elaborated below:
 
 Irreversible Collapses: These collapses were classified as irreversible because the queen could not be rescued alive. This occurred when the structural stability of the nest failed, burying the queen in a manner that prevented recovery. In some cases, the collapse resulted in the queen's immediate death, while in others, she was trapped beyond reach, and any rescue attempt risked further structural damage.
 
 Collapse and Experimental Context: These collapses were not uniquely associated with natural colonies or fixed-demographic experiments; rather, they occurred across various experimental setups.
 
 The sentence is modified as below to improve clarity (line numbers : 70-72 ).
 
 “In all instances where a collapse resulted in the queen's death or her being irreversibly trapped in the nest, the experiment was excluded from analysis starting from the point of the collapse, as such events did not reflect normal colony dynamics.”
 
 I want to make sure I understand the following statement: "Moreover, the area excavated by the young cohorts was similar to that excavated by naturally maturing colonies at the point in which they reached the same population size (Tukey's HSD; group size: 5; p = 0.61, group size: 10; p = 0.46, group size: 15; p = 0.20)." Do I have it right that this means a group of (e.g. 10) young ants excavates an area similar to that of a group of 10 naturally maturing ants at the same age as the young ants?
 
 Yes, the interpretation provided is correct. We apologize to the reviewer for the confusion. We have rephrased the sentence for better readability (line numbers : 146-148).
 
 “Furthermore, the area excavated by the young cohorts was comparable to that excavated by naturally maturing colonies when they reached the same population size (Tukey's HSD; group size: 5, p = 0.61; group size: 10, p = 0.46; group size: 15, p = 0.20)”
 
 How old do ants get? Is the 'old' demographic (~200 days) meaningfully old in the context of the overall worker lifespan? While the results certainly demonstrate there is an age effect, I would like to understand how rapid this is in terms of overall lifespan.
 
 The lifespan of ants, including both queens and workers, varies significantly based on caste, species, and environmental conditions.
 
 (1) Queen Longevity: From the literature, Camponotus fellah queens can live up to 20 years, with one documented case reaching 26 years. This remarkable longevity underscores the queen's central role in maintaining the colony.
 
 (2) Worker Longevity: In contrast to queens, the lifespan of workers is much shorter.
 
 However, specific data on worker longevity in Camponotus fellah colonies are lacking. Studies on other Camponotus species (50, 82) suggest that workers can live for several months depending on environmental conditions, colony health, and caste-specific roles (e.g., minor vs. major workers).
 
 (3) Laboratory vs. Natural Conditions: Worker longevity is highly variable between laboratory and natural conditions
 
 Therefore, in the context of the old worker lifespan in our experiments of, ~200 days (roughly 6–7 months) we strongly believe that the worker lifespan used in our experiments represents a substantial portion of a worker's expected life. While exact figures for C. fellah workers are unavailable, inferences from related species suggest that workers nearing 200 days are approaching the latter stages of their lifespan, making them meaningfully "old."
 
 These details are added to the main text (line numbers : 124 - 127) and to the discussion (line numbers : 278-282)
 
 Reviewer #3 (Recommendations for the authors):
 
 We sincerely thank the reviewer for their time and effort in providing a detailed review and assessment of our manuscript. A point-by-point response to the comments is provided below.
 
 L10: "fixed demographics": I find this term unclear, what does it mean, it should specify if the groups are with or without a queen.
 
 We thank the reviewer for the comment. The sentence is modified in the abstract, and definitions are later added in detail in the introduction (line numbers : 8-10) and the Materials and Methods section (Fixed demographics colonies).
 
 “We experimentally compared nest excavation in colonies seeded from a single mated queen and allowed to grow for six months to excavation triggered by a catastrophic event in colonies with fixed demographics, where the age of each individual worker, including the queen, is known”.
 
 The details of the “fixed demographics” treatments were explained in the later portion of the text (line numbers: 58-61).
 
 L36: I think it is documented that younger individuals are the ones who involved in nest construction in many species.
 
 Previous studies on nest construction were predominantly performed on mature colonies of specific age demographics or rather mixed demographics, where age was not considered as a factor influencing nest construction. Some studies have speculated that young ants could be the most probable ones to dig, but this has not been experimentally verified to the best of our knowledge.
 
 L50: I do not think the colony should be called mature after only 6 months, given that colonies reach thousands of workers.
 
 The sentence is changed as suggested (line numbers : 56-57).
 
 “The "Colony-Maturation" experiment observed the development of colonies up to six months, starting from a single fertile queen and progressing to colonies with established worker populations.”
 
 L60: Where was the queen introduced? It is specified in the Methods but a word here would be helpful.
 
 The detail is added as suggested (line numbers : 68-69).
 
 “We initiated colony maturation experiments by introducing a single mated queen and several brood items (n = 5, across all experiments) at random positions on the soil layer of the nest.”
 
 L106: Young vs Old workers 40 vs 171 days. Maybe cite a reference or provide a reason for the selection of those ages?
 
 Previous studies have shown that the Camponotus fellah queens can live up to 20 years, with one documented case reaching 26 years (50). To the best of our knowledge, specific data on worker longevity in Camponotus fellah colonies in natural conditions are lacking. Lab studies on Camponotus fellah (82) and other Camponotus species (50) suggest that workers can live for several months depending on environmental conditions, colony health, and caste-specific roles (e.g., minor vs. major workers).
 
 We intentionally selected workers from two distinct age groups: younger ants (40 ± 16 days old) and older ants (171.56 ± 20 days old). These ages represent functionally different life stages - the younger group had completed about 25% of their expected lifespan at the start of the experiment, while the older group had lived through most of theirs (50, 82). This 4-fold age difference allowed us to compare excavation behaviors across fundamentally different phases of adult life.
 
 Our experiments lasted for 60-90 days, during which all participating workers continued to age. To ensure all ants remained alive throughout the experiments, and given the constraints of the experimental timeline, we selected young and old workers within the specified age range.
 
 These details are added to the main text (line numbers : 124 -127), and the discussion (line numbers : 278-282)
 
 L122-123: But usually ants can vary highly in their behaviours. Can the authors comment on their choice to consider an average, implying that all ants of the same age had the same digging rates?
 
 We thank the reviewer for the comment.
 
 In our experiments, we could not track each worker's activity over time. As described in the methods, we took snapshots of the nest structure over days and recorded the population size of the nest. Thus, we could not capture the activity of single ants in the nest as described in the response to major comments in the reviewed preprint.
 
 We agree that individual tracking of ants within our experimental setup would have been the ideal approach. Then, we could have taken the inter-individual variability of the digging activity into account. However, we were limited to doing so by the technical and practical limitations of the setup, such as;
 
 (a) Continuous tracking of ants in our nests would have required a camera to be positioned at all times in front of the nest, which necessitates a light background. Since Camponotus fellah ants are subterranean, we aimed to allow them to perform nest excavation in conditions as close to their natural dark environment as possible. Additionally, implementing such a system in front of each nest would have reduced the sample sizes for our treatments.
 
 (b)The experimental duration of our colony maturation and fixed demographics experiments extended for up to six months (unprecedented durations in these kinds of measurements). These naturally limited our ability to conduct individual tracking while maintaining the identity of each ant based on the current design.
 
 To clarify this, we have added the following to the discussion (line numbers: 286-292).
 
 “Previous studies have demonstrated both homogeneous and heterogeneous workload distribution, with varying digging rates among ants (24,29,30,35). Studies showing heterogeneous workload distribution relied on continuous individual tracking of ants to quantify digging rates (35). However, this approach was not feasible in our current design due to the experimental durations of both our colony maturation and fixed demographics experiments. Additionally, sample size requirements naturally limited our ability to conduct continuous individual tracking during nest construction in our study.”
 
 L171: A line on how the nest structure was acquired and data extracted would be welcome here.
 
 The algorithm for the nest structure segmentation, data extraction, and analysis is added in detail to the SI section: Nest skeletonization, segmentation, and orientation. The line is modified (line numbers : 221-224) in the main text as suggested.
 
 “We compared nest architectures by segmenting raw nest images into chambers and tunnels (see SI Section: Nest Skeletonization, Segmentation, and Orientation). Chambers were identified as flat, horizontal structures, while tunnels were narrower and more vertical in orientation (see SI Fig. 9, SI Section: Nest Skeletonization, Segmentation, and Orientation)”.
 
 Figure 3: Where does the data of the mean in panel C come from: is it the mean of the first 30 days, before the collapse? How is it comparable with the rest?
 
 We apologize to the reviewer for the confusion.
 
 In panel C, the mean values (solid stars and circles) for fixed-demography colonies (young/old groups) represent pre-collapse excavation areas. For colony maturation experiments (where no collapses were induced), we instead plot the mean saturated excavation area for each group size. This allows direct comparison of mean excavated areas across experimental conditions at equivalent colony sizes.
 
 To improve readability, the following sentences are added to the main text (line numbers : 139 - 146 )
 
 “We compared the saturated excavation areas (pre-collapse) from fixed-demographics experiments (young and old groups) with those from colony maturation experiments of the same colony sizes (Fig. 3C). We find that, for a given age cohort (young or old), the saturation areas increase linearly with the colony size (GLMM, F(35,37); p < 0.0001) (Fig. 3 C, SI. Fig 7 A). The observed proportional scaling between excavated area and group size aligns with previous studies, even though those studies did not explicitly account for age demographics (24, 29, 30). After normalizing the pre-collapse excavated area by group size for both young and old colonies, we found no significant difference in area per ant across group sizes (SI Fig. 5. A). This indicates that the excavated area per ant remains relatively constant within each demographic group”.
 
 L209-210: I would be more parsimonious in saying that the results presented prove that the target area decreases with age, as the individual behaviour of the ants was not monitored. Suggestion: rephrase to "the target of the group decreases with age".
 
 The sentence is rephrased as suggested (line numbers : 265-266).
 
 “Our results reveal that this target area of the group decreases linearly with age, such that young ants are more sensitive to shortages in space.”
 
 L246: Are C.fellah colonies really found with such few workers?
 
 Previous studies have speculated that mature Camponotus fellah colonies are a monogynous species typically founded by a single queen following nuptial flights (50,51,82), and can range from tens to thousands of workers. However, during the founding stage (as in our experiments), colonies naturally pass through smaller developmental sizes comparable to the matured colonies.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.07.09.602713v3
www.biorxiv.org www.biorxiv.org

Phase-specific premotor inhibition modulates leech rhythmic motor output

3
1. Public_Reviews 29 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 The medicinal leech preparation is an amenable system in which to understand the neural basis of locomotion. Here a previously identified non-spiking neuron was studied in leech and found to alter the mean firing frequency of a crawl-related motoneuron, which fires during the contraction phase of crawling. The findings are valuable and the experiments were diligently done and generally solid; The results lay a foundation for additional studies in this system.
 
 Summary
2. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The medicinal leech preparation is an amenable system in which to understand how the underlying cellular networks for locomotion function. A previously identified non-spiking neuron (NS) was studied and found to alter the mean firing frequency of a crawl-related motoneuron (DE-3), which fires during the contraction phase of crawling. The data are mostly solid. Identifying upstream neurons responsible for crawl motor patterning is essential for understanding how rhythmic behavior is controlled.
 
 Review of Revision:
 
 Reviewer: On a positive note, the rationale for the study is clearer to me now after reading the authors' responses to both reviewers, but that information, as described in the authors' responses, is minimally incorporated into the current revised paper. Incorporating a discussion of previous work on the NS cell has, indeed, improved the paper.
 
 I suggested earlier that the paper be edited for clarity but not much text has been changed since the first draft. I will provide an example of the types of sentences that are confusing. The title of the paper is: "Phase-specific premotor inhibition modulates leech rhythmic motor output". Are the authors referring to the inhibition created by premotor neurons (e.g., on to the motoneurons) or the inhibition that the premotor neurons receive?
 
 I also find the paper still confusing with regard to the suggested "functional homology" with the vertebrate Renshaw cells. When the authors set up this expectation of homology (should be analogy) in the introduction and other sections of the paper, one would assume that the NS cell would be directly receiving excitation from a motoneuron (like DE-3) and, in turn, the motoneuron would then receive some sort of inhibitory input to regulate its firing frequency. Essentially, I have always viewed the Renshaw cells as nature's clever way to monitor the ongoing activity of a motoneuron while also providing recurrent feedback or "recurrent inhibition" to modify that cell's excitatory state. The authors present their initial idea below on line 62. Authors write: "These neurons are present as bilateral pairs in each segmental ganglion and are functional homologs of the mammalian Renshaw cells (Szczupak, 2014). These spinal cord cells receive excitatory inputs from motoneurons and, in turn, transmit inhibitory signals to the motoneurons (Alvarez and Fyffe, 2007)."
 
 [Reviewer (minor note): I suggest re-writing this last sentence as "these" is confusing. Change to: 'In the spinal cord, Renshaw interneurons receive excitatory inputs from motoneurons and, in turn, transmit inhibitory signals to them (Alvarez and Fyffe, 2007).']
 
 Reviewer: Furthermore, the authors note that (line 69 on): "In the context of this circuit the activity of excitatory motoneurons evokes chemically mediated inhibitory synaptic potentials in NS. Additionally, the NS neurons are electrically coupled......In physiological conditions this coupling favors the transmission of inhibitory signals from NS to motoneurons." Based on what is being conveyed here, I see a disconnect with the "functional homology" being presented earlier. I may be missing something, but the Renshaw analogy seems to be quite different compared to what looks like reciprocal inhibition in the leech. If the authors want to make the analogy to Renshaw cells clearer, then they should make a simple ball and stick diagram of the leech system and visually compare it to the Renshaw/motoneuron circuit with regard to functionality. This simple addition would help many readers.
 
 Reviewer: The Abstract, Authors write (line 19), "Specifically, we analyzed how electrophysiological manipulation of a premotor nonspiking (NS) neuron, that forms a recurrent inhibitory circuit (homologous to vertebrate Renshaw cells)...." First, a circuit would not be homologous to a cell, and the term homology implies a strict developmental/evolutionary commonality. At best, I would use the term functionally analogous but even then I am still not sure that they are functionally that similar (see comments above). Line 22: "The study included a quantitative analysis of motor units active throughout the fictive crawling cycle that shows that the rhythmic motor output in isolated ganglia mirrors the phase relationships observed in vivo." This sentence must be revised to indicate that not all of the extracellular units were demonstrated to be motor units. Revise to: "The study included a quantitative analysis of identified and putative motor units active throughout the fictive crawling cycle that shows.....'
 
 Line 187 regarding identifying units as motoneurons: Authors write, "While multiple extracellular recordings have been performed previously (Eisenhart et al., 2000), these results (Figure 4) present the first quantitative analysis of motor units activated throughout the crawling cycle in this type of recordings." The authors cannot assume that the units in the recorded nerves belong only to motoneurons. Based on their first rebuttal, the authors seem to be reluctant to accept the idea that the extracellularly recorded units might represent a different class of neurons. They admit that some sensory neurons (with somata located centrally) do, indeed, travel out the same nerves recorded, but go on to explain why they would not be active.
 
 The leech has a variety of sensory organs that are located in the periphery, and some of these sensory neurons do show rhythmic activity correlated with locomotor activity (see Blackshaw's early work). The numerous stretch receptors, in fact, have very large axons that pass through all the nerves recorded in the current paper. In Fig. 4, it is interesting that the waveforms of all the units recorded in the PP nerve exhibit a reversal in waveform as compared to those in the DP nerve, which might indicate (based on bipolar differential recording) that the units in the PP nerve are being propagated in the opposite direction (i.e., are perhaps afferent). Rhythmic presynaptic inhibition and excitation is commonly seen for stretch receptors within the CNS (see the work of Burrows) and many such cells are under modulatory control.
 
 Most likely, the majority of the units are from motoneurons, but we do not really know at this point. The authors should reframe their statements throughout the paper as: 'While multiple extracellular recordings have been performed previously (Eisenhart et al., 2000), these results (Figure 4) present the first quantitative analysis of multiple extracellular units, using spike sorting methods, which are activated throughout the crawling cycle.' In cases where the identity of the unit is known, then it is fine to state that, but when the identity of the unit is not known, then there should be some qualification and stated as 'putative motor units'
 
 Reviewer, the Methods section: needs to include the full parameters that were used to assess whether bursting activity was qualified in ways to be considered crawling activity or not. Typically, crawl-like burst periods of no more than 25 seconds have been the limit for their qualification as crawling activity. In Fig 2F, for example, the inter-burst period is over 35 seconds; that coupled with an average 5 second burst duration would bring the burst period to 40 seconds, which is substantially out of range for there to be bursting relevant to crawl activity. Simply put, long DE-3 burst periods are often observed but may not be indicative of a crawl state as the CV motoneurons are no longer out of phase with DE-3. A number of papers have adopted this criterion.
 
 Review 1
3. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The Szczupak lab published a very interesting paper in 2012 (Rodriquez et al. J Neurophysiol 107:1917-1924) on the effects of the segmentally-distributed non-spiking (NS) cell on crawl-related motoneurons. As far as I can tell, the working model presented in 2012, for how the non-spiking (NS) cell impacts the crawling motor pattern, is the same functional model presented in this new paper. Unfortunately, the Discussion does not address any of the findings in the previous paper or cite them in the context of NS alterations of fictive crawling. Aside from different-looking figures and some new analyses, the results and conclusions are the same.
 
 Reviewers #1 and #2 called our attention to our failure to cite the Rodriguez et al. 2012 article in the context of the main goal of the present work. We do now explain how the present study is framed by the published work. See lines 74-79.
 
 In Rodriguez et al. 2012, we hypothesized that the inhibitory signals onto NS were originated in the motoneuron firing. We now cite this reference in line 104. In the current manuscript we further investigated the connection between the inhibitory signals onto NS and the motoneuron activity (Figure 2) and proved that the hypothesis was wrong. Thus, the model presented here differs from the one proposed in Rodriguez et al. 2012.
 
 In Rodriguez et al. 2012, we speculated that the inhibitory signals received by NS were transmitted to the motoneurons, but an important control was missing in that study. In the current study depolarization of NS during crawling is tested against a control series that allows to properly examine the hypothesis (lines 138-147). But, most important, because NS is so widely connected with the layer of motoneurons it was necessary to test the effect on other motoneurons during the fictive crawling cycle. We now explain this rationale in lines 249-257.
 
 Strengths:
 
 The figures are well illustrated.
 
 Weaknesses:
 
 The paper is a mix of what appears to be two different studies and abruptly switches gears to examine how closely the crawl patterning is in the intact animal as compared to the fictive crawl patterning in the intact animal. Unfortunately, previous studies in other labs are not cited even though identical results have been obtained and similar conclusions were made. Thus, the novelty of the results is missing for those who are familiar with the leech preparation. The lack of appropriate citations and discussion of previous studies also deprives the scientific community of fully comprehending the impact of the data presented and the science it was built upon.
 
 The main aim of the manuscript is to learn the role of premotor NS neurons in the crawling motor pattern studied using spike sorting in extracellular nerve recordings. This readout allows to simultaneously monitor a larger number of units than in any previous study. This approach aims to determine whether and how a recurrent inhibitory peripheral circuit is involved in coordinating or modulating the rhythmic motor pattern.
 
 Our rationale was that the known effect of NS on one particular motoneuron (DE-3) may have overlooked a more general effect on crawling (lines 253-257). Moreover, we wanted to investigate whether this effect was due to the recurrent inhibitory circuit or if other elements were involved, and to study whether the modulation was mediated by the recurrent synapse between NS and the motoneurons.
 
 In the context of this aim we studied the rhythmic activity of cell DE-3, together with motoneurons that fire in-phase and anti-phase, in isolated ganglia (Figure 4). To reveal the effect of NS manipulation we applied a quantitative analysis that showed the phase-specific effect of NS (Figure 6).
 
 Given that this is the first study using a spike sorting algorithm to detect and describe the activity of motoneurons in nerve recordings we found it reasonable to compare these results with an in vivo study; thus, providing information to the general reader, that supports the correspondence between the ex vivo and the in vivo patterns.
 
 (1) Results, Lines 167-170: "While multiple extracellular recordings have been performed previously (Eisenhart et al., 2000), these results present the first quantitative analysis of motor units activated throughout the crawling cycle. The In-Phase units are expected to control the contraction stage by exciting or inhibiting the longitudinal or circular muscles, respectively, and the Anti-Phase units to control the elongation stage by exciting or inhibiting the circular or longitudinal muscles, respectively."
 
 Reviewer: The first line above is misleading. The study by Puhl and Mesce (2008, J. Neurosci, 28:4192- 420) contains a comprehensive analysis of the motoneurons active during fictive crawling with the aim of characterizing their roles and phase relationships and solidifying the idea that the oscillator for crawling resides in a single ganglion. Intracellular recordings from a number of key crawl-related motoneurons were made in combination with extracellular recordings of motoneuron DE-3, a key monitor of crawling. In their paper, it was shown that motoneurons AE, VE-4, DI-1, VI-2, and CV were all correlated with crawl activity, and fired repeatedly either in phase or out-of-phase with DE-3. They were shown to be either excitatory or inhibitory. At a minimum, the above paper should be cited.
 
 The sentence in the submitted manuscript explicitly refers to the quantitative analysis of extracellular recordings, but we recognize that it may lead to confusion. We have now added a clarification (lines 197-199).
 
 The article by Puhl and Mesce 2008 shows very nice intracellular recordings of the AE, CV, VE-4, DE-3, DI-1, and Vi-2, accompanied by extracellular recordings of DE-3 in the DP nerve. In all cases, there is only one intracellular recording paired with the DP nerve recording.
 
 While it is possible to perform up to 3-4 simultaneous intracellular recordings, these are technically challenging, and more so when the recordings have to last 10-20 minutes. Due to this difficulty, and because our objective was to record multiple units simultaneously in order to comprehensively describe the different crawling stages, we implemented the spike sorting analysis on multiple extracellular recordings. This approach enabled us to reliably obtain multiple units per experiment and thus execute a quantitative analysis of the activity of each identified unit.
 
 The article by Puhl and Mesce 2008 mentions several quantitative aspects of the neurons that fire in-phase or out-of-phase with DE-3, but, as far as we understand, there is no figure that summarizes activity levels and span in the way Figures 4 and 6 do in the current manuscript. To the best of our knowledge, no previous work renders this information.
 
 It is very important for us to emphasize that the work by Puhl and Mesce was seminal for our research. We cited it four times in the original manuscript and 10 times in the present version. But, like any important discovery, it sets the ground for further work that can refine certain measurements that in the original discovery were not central.
 
 This is why we believe that the cited sentence in our manuscript is not misleading. However, to comply with the requirement of Reviewer #1, we added a sentence preceding the mentioned paragraph (lines 185-187) that acknowledges the description made using intracellular recordings, and explains the need for implementing the approach we chose.
 
 The submitted paper would be strengthened if some of these previously identified motoneurons were again recorded with intracellular electrodes and concomitant NS cell stimulation. The power of the leech preparation is that cells can be identified as individuals with dual somatic (intracellular) and axonal recordings (extracellular).
 
 Most of the motoneurons mentioned by Reviewer #1 are located on the opposite side (dorsal) of the ganglion to NS (ventral), and therefore, simultaneous intracellular recordings in the context of fictive crawling are challenging.
 
 In the publication of Rodriguez et al. 2009, Mariano Rodriguez did manage to record NS from the dorsal side together with DE-3 and MN-L (!) and this led to the discovery that these motoneurons are electrically coupled, but the recurrent inhibitory circuit masks this interaction. Repeating this type of experiments during crawling, which requires stable recordings for around 15 minutes, is not a reasonable experimental setting.
 
 Rodriguez et al. 2012 shows intracellular recordings of motoneurons AE and CV during crawling in conjunction with NS, and their activity presented the expected correlation.
 
 The shortfall of this aspect of the study (Figure 5) is that the extracellular units have not been identified here.
 
 The Reviewer is right in that the extracellular units have not been identified in terms of cell identity. As we explained earlier, most motoneurons are on the opposite side (ventral/dorsal) of the ganglion relative to NS.
 
 However, we do characterize the units in terms of the nerve through which they project to the periphery and their activity phase. In lines 345-349 we use this information and, based on published work, we propose possible cellular identities of the different units.
 
 In xfact, these units might not even be motoneurons.
 
 We are surprised by this comment. The classical work of Ort and collaborators (1974) showed that spikes detected in extracellular nerve recordings were emitted by specific motoneurons, and several previous publications have validated extracellular nerve recordings as a means to study fictive motor patterns (Wittenberg & Kristan 1992, Shaw & Kristan 1997, Eisenhart et al. 2000).
 
 For further reassurance, we only took in consideration units whose activity was locked to DE3; any non-rhythmical activity was filtered out (see lines 433-435).
 
 They could represent activity from the centrally located sensory neurons, dopamine-modulated afferent neurons or peripherally projecting modulatory neurons.
 
 Peripheral nerves also contain axons from sensory neurons. However, in a previous article, we studied the activity of mechanosensory neurons (Alonso et al. 2020) and showed that they remain silent during crawling. Moreover, the low-threshold T sensory neurons are inhibited in phase with DE-3 bursts and NS IPSPs (Kearney et al. 2022). Alonso et al. 2000 showed that spiking activity of T cells affects the crawling motor pattern, revealing the relevance of keeping them silent.
 
 What does the Reviewer mean by “dopamine-modulated afferents”? We are not aware of this category of leech neurons.
 
 The neuromodulatory Rz neurons project peripherally through the recorded nerves, but intracellular recordings of these neurons from our lab show no rhythmic activity in those cells during dopamine-induced crawling.
 
 Essentially, they may not have much to do with the crawl motor pattern at all.
 
 Does the Reviewer consider that neurons engaged in a coherent rhythmic firing could be unrelated to the pattern? As indicated above, the units reported in our manuscript were selected because dopamine evoked their rhythmic activity, locked to DE-3.
 
 Does the Reviewer consider that dopamine could evoke spurious neuronal activity?
 
 (2) Results Lines 206-210: "with the elongation and contraction stages of in vivo behavior. However the isometric stages displayed in vivo have no obvious counterpart in the electrophysiological recordings. It is important to consider that the rhythmic movement of successive segments along the antero-posterior axis of the animal requires a delay signal that allows the appropriate propagation of the metachronal wave, and this signal is probably absent in the isolated ganglion."
 
 Reviewer: The so-called isometric stages, indeed, have an electrophysiological counterpart due in part to the overlapping activities across segments. This submitted paper would be considerably strengthened if it referred to the body of work that has examined how the individual crawl oscillators operate in a fully intact nerve cord, excised from the body but with all the ganglia (and cephalic ganglion) attached. Puhl and Mesce 2010 (J. Neurosci 30: 2373-2383) and Puhl et al. 2012 (J. Neurosci, 32:17646 -17657) have shown that "appropriate propagation of the metachronal wave" requires the brain, especially cell R3b-1. They also show that the long-distance projecting cell R3b-1 synapses with the CV motoneuron, providing rhythmic excitatory input to it.
 
 We would like to draw the Reviewer’s attention to the fact that Puhl and Mesce 2008, 2010 and Puhl et al. 2012 characterized crawling in intact (or nearly intact) animals considering the whole body. In our in vivo analysis, we studied the changes in length of the whole animal and of sections demarcated by the drawn points, as described in the Materials and Methods/Behavioral
 
 Experiments. Because of this different analysis, we defined “isometric” stages as those in which a given section of the animal does not change its length. We now clarify this (line 230).
 
 In the paragraph cited by the Reviewer, we intended to state that, in the context of our study, the intersegmental lag caused by the coordinating mechanisms has no counterpart “in the electrophysiological recordings of motoneurons in the isolated ganglia”. We have now completed this idea with the expression underlined in the previous sentence (line 231).
 
 As the Reviewer indicates, in the intact nerve cord the behavioral isometric stages correspond to the “waiting time” between segments. We did refer to the metachronal order but did not cite the articles by Puhl and Mesce 2010 and Puhl et al. 2012; we now do so (lines 234).
 
 For this and other reasons, the paper would be much more informative and exciting if the impacts of the NS cell were studied in a fully intact nerve cord. Those studies have never been done, and it would be exciting to see how and if the effects of NS cell manipulation deviated from those in the single ganglion.
 
 The Reviewer may consider that a systematic analysis of multiple nerves in several ganglia along the whole nerve cord would have been a different enterprise than the one we carried out. The Reviewer is right in recognizing the interest of such study, but in our opinion, the value of the present work lies in presenting a thorough quantitative analysis of multiple nerves to demonstrate its usefulness for the study of the network underlying leech crawling. In this manuscript, we used it to analyze the role of the premotor NS neuron. Without the recording of units firing in-phase and out-ofphase with DE-3, we would have been unable to assess the span of NS effects.
 
 (3) Discussion Lines 322-324. "The absence of descending brain signals and/or peripheral signals are assumed as important factors in determining the cycle period and the sequence at which the different behavioral stages take place."
 
 Reviewer: The authors could strengthen their paper by including a more complete picture of what is known about the control of crawling. For example, Puhl et al. 2012 (J Neurosci, 32:17646-17657) demonstrated that the descending brain neuron R3b-1 plays a major role in establishing the crawlcycle frequency. With increased R3b-1 cell stimulation, DE-3 periods substantially shortened throughout the entire nerve cord. Thus, the importance of descending brain inputs should not be merely assumed; empirical evidence exists.
 
 We now strengthen the concept using “known descending brain signals” (line 358) and cite Puhl et al. 2012. We believe that extending the discussion to cell R3b-1 does not contribute meaningfully to the focus of this manuscript.
 
 (4) Discussion Lines 325-327: "the sequence of events, and the proportion of the active cycle dedicated to elongation and contraction were remarkably similar in both experimental settings. This suggests that the network activated in the isolated ganglion is the one underlying the motor behavior."
 
 Reviewer: The results and conclusions drawn in the current manuscript mirror those previously reported by Puhl and Mesce (2008, J. Neurosci, 28:4192- 420) who first demonstrated that the essential pattern-generating elements for leech crawling were contained in each of the segmental ganglia comprising the nerve cord. Furthermore, the authors showed that the duty cycle of DE-3, in a single ganglion treated with dopamine, was statistically indistinguishable from the DE-3 duty cycle measured in an intact nerve cord showing spontaneous fictive crawling, in an intact nerve cord induced to crawl via dopamine, and in the intact behaving animal. What was statistically significant, however, was that the DE-3 burst period was greatly reduced in the intact animal (i.e., a higher crawl frequency), which was replicated in the submitted paper.
 
 There is no doubt that the article by Puhl and Mesce 2008 is seminal to the work we present here. The Reviewer seems to suggest that we do not recognize the value of this work. The contrary is true, all our related papers cite this important breakthrough. We cite the paper very early in the article in the Introduction (see lines 51 and 52-53). Likely, we would like the Reviewer to recognize the novelty of the current report. To clarify what has been shown and what is new in our manuscript, considerer the following:
 
 i. Figures 1-6 in Puhl and Mesce 2008 provide representative intracellular recordings that describe neurons that fire in phase and out of phase relative to DE-3. Some general measurements are given in the text, but none of these figures quantify the relative activity of neurons that fire in different stages; only DE-3 activity was quantified. A quantitative description of multiple units active in phase and out of phase with DE-3 is presented here for the first time, are we wrong? This quantification is particularly relevant when assessing how a treatment affects the function of the circuit.
 
 ii. Regarding the cycle period, we referred to the work from the Kristan lab, which reported this value long before the requested reference. We now cite Puhl and Mesce 2008 in lines 222 regarding in vivo measurements, and in line 221 regarding isolated ganglia.
 
 iii. Regarding the duty cycle:
 
 Puhl and Mesce 2008 measured the duty cycle of DE-3 in three configurations: a. spontaneous whole cord, b. DA-mediated whole cord and c. DA mediated single ganglion crawling. However, it does not report the duty cycle of neurons out-of-phase with DE-3. Our current manuscript carried out this analysis. One could argue that the silence between DE-3 bursts captures that value, but this is a speculation that needed a proper measure.
 
 Puhl and Mesce 2008 does not indicate the duty cycle of the contraction and elongation stages in vivo. Our current manuscript does.
 
 Therefore, the sentence cited by the Reviewer refers to data presented in this manuscript, and not in any prior manuscript. It is true that Puhl and Mesce 2008 inspire the intuition that the sentence is true, but does not present the data that the current manuscript does.
 
 Finally, our study focused only on the body sections corresponding to the same segmental range used in the ex vivo experiments, rather than the whole animal. The comparison was made only to validate that the duty cycles of neurons firing in phase and out of phase with DE-3 matched the dynamic stages in the studied sections of the leech (line 364).
 
 In my opinion, the novelty of the results reported in the submitted manuscript is diminished in the light of previously published studies. At a minimum, the previous studies should be cited, and the authors should provide additional rationale for conducting their studies. They need to explain in the discussion how their approach provided additional insights into what has already been reported.
 
 Throughout our reply, we have provided a detailed explanation of the rationale and necessity behind each experiment. Following the Reviewer’s suggestion, we have rephrased the research objectives, included what is known from our previously published work, and highlighted the substantial new data contributed by the present study. See lines 80-85.
 
 Additionally, we further cite our published article in lines 93, 104, 138, 146 and 250.
 
 Reviewer #2 (Public review):
 
 The paper is well-written overall. The findings are clearly presented, and the data seems solid overall. I do have, however, a few major and some minor comments representing some concerns.
 
 My major comments are below.
 
 (1) This may seem somewhat semantic, yet, it has implications on the way the data is presented and moreover on the conclusions drawn - a single ganglion cannot show fictive crawling. It can demonstrate rhythmic patterns of activity that may serve in the (fictive) crawling motor pattern. The latter is a result of the intrinsic within single-ganglion connectivity AND the inter-ganglia connections and interactions (coupling) among the sequential ganglia. It may be affected by both short-range and long-range connections (e.g., descending inputs) along the ganglia chain.
 
 Semantics is not a trivial issue in science communication. It entails metaphors that enter the bibliography as commonly used “shortcuts” to a complex concept that are adopted by a community of researchers. And yes, indeed, they can be misleading.
 
 However, if recording the activity in an isolated ganglion shows that a wide group of motoneurons, that control known muscle movements, presents a rhythmic output that maintains the appropriate cycle period and phase relationships, the “shortcut” is incomplete but could be valid (Puhl and Mesce 2008). If we were to include the phase lag component, a single ganglion cannot generate the fictive motor output.
 
 Because any new study builds knowledge on the basis of the cited bibliography, the way we name concepts is a sensitive point. Adopting the terminology used by previous publications (Puhl and Mesce 2008) seems important to allow readers to follow the development of knowledge. However, attending the observation made by Reviewer #2, we included a sentence clarifying that the concept “fictive crawling” does not include intersegmental connectivity (lines 54-57)
 
 (2) The point above is even more critical where the authors set to compare the motor pattern in single ganglia with the intact animals. It would have made much more sense to add a description of the motor pattern of a chain of interconnected ganglia. The latter would be expected to better resemble the intact animal. Furthermore, this project would have benefitted from a three-way comparison (isolated ganglion-interconnected ganglia-intact animal.
 
 As we answered to Reviewer #1, the present manuscript does not intend to present a thorough study on how the activity in the isolated nervous system compares with the animal behavior. To do so we would have needed to perform a completely different set of experiments. To better define the relevance of our comparison with the in vivo experiments we rephrased the objective of the behavioral analysis (lines 197-199).
 
 The main aim of the manuscript is to learn the role of premotor NS neurons in the crawling motor pattern studied using a readout (spike sorting in extracellular nerve recordings) that allows simultaneous screening of a larger number of units than in any previous study, in order to determine whether and how a recurrent inhibitory peripheral circuit is involved in coordinating or modulating the rhythmic motor pattern.
 
 Our rationale was that the known effect of NS on one particular motoneuron (DE-3) may have overlooked a more general effect on crawling (lines 253-257). Moreover, we wanted to investigate whether this effect was due to the recurrent inhibitory circuit or if other elements were involved, and to study whether the modulation was mediated by the recurrent synapse between NS and the motoneurons.
 
 In the context of this aim we studied the rhythmic activity of cell DE-3, together with motoneurons that fire in-phase and anti-phase, in isolated ganglia (Figure 4). To reveal the effect of NS manipulation we applied a quantitative analysis that showed the phase-specific effect of NS (Figure 6).
 
 Given that this is the first study using a spike sorting algorithm to detect and describe the activity of motoneurons in nerve recordings we found it reasonable to compare these results with an in vivo study; thus, providing information to the general reader, that supports the correspondence between the ex vivo and the in vivo patterns.
 
 (3) Two previous studies by the same group are repeatedly mentioned (Rela and Szczupak, 2003; Rodriguez et al., 2009) and serve as a basis for the current work. The aim of one of these previous studies was to assess the role of the NS neurons in regulating the function of motor networks. The other (Rodriguez et al., 2009) reported on a neuron (the NS) that can regulate the crawling motor pattern. LL 71-74 of the current report presents the aim of this study as evaluating the role of the known connectivity of the premotor NS neuron in shaping the crawling motor pattern. The authors should make it very clear what indeed served as background knowledge, what exactly was known about the circuitry beforehand, and what is different and new in the current study.
 
 Rela and Szczupak 2003 and Rodriguez et al. 2009 analyze the interactions of motoneurons with NS. We believe that Reviewer #2 refers here to Rodriguez et al. 2012. A similar observation was made by Reviewer #1. Below, we copy the answer previously stated:
 
 Following the Reviewer’s suggestion, we have rephrased the research objectives, included what is known from our previously published work, and highlighted the substantial new data contributed by the present study. See lines 80-85.
 
 Additionally, we further cite our published article in lines 93, 104, 138, 146 and 250.
 
 Reviewer #1 (Recommendations for the authors):
 
 Please edit for correct word usage.
 
 Reviewer #2 (Recommendations for the authors):
 
 Minor Concerns
 
 (1) LL33-36: These lines are somewhat vague and non-informative. Why is the functional organization of motor systems an open question? What are the mechanisms at the level of the nerve cord that are an open question? Maybe be more explicit?
 
 We did as suggested (lines 30-32).
 
 (2) L62: The homology between the NS neurons and the vertebrate Renshaw cells is mentioned already in the Abstract and here again. While a reference is provided (citing the lead author of this current work), the reader would benefit from some further short words of explanation regarding the alleged homology.
 
 We included a description of Renshaw cell connectivity (lines 64-65).
 
 (3) LL90-92: The NS recording in Figure 1 (similar to Figure 3 in Rodriguez et al.) demonstrates clear distinct IPSPs. Could these be correlated with DE-3 spikes?
 
 We investigated this correlation in detail and the answer is that there is no strictly a 1:1 DE-3 spike to IPSP correlation. NS receives inputs from other dorsal and ventral excitors of longitudinal muscles, and the NS trace is too “noisy” to reflect any short-term correlation. Originally we proposed that the NS IPSPs were due to the polysynaptic interaction between the MN and NS (Rodríguez et al. 2012). However, the present work demonstrates that the IPSPs in NS are caused by a source upstream from the MNs.
 
 (4) LL145-145: Do you mean - inhibitory signals FROM NS premotor neurons? Not clear.
 
 We see the confusion, and we rewrote the sentence (lines 164). We hope it is clearer now: “…inhibitory signals onto NS premotor neurons were transmitted to DE-3 motoneurons via rectifying electrical synapses and counteracted their excitatory drive during crawling, limiting their firing frequency.”
 
 (5) LL153-154: Why isn't AA included in Figure 4A?
 
 Reading our original text, the Reviewer #1 is right in expecting to see the AA recording. We changed the sentence: “we performed extracellular recordings of DP along with AA and/or PP root nerves” (lines 171-172).
 
 We dissected the three nerves but, unfortunately, we did not always obtain good recordings from the three of them.
 
 (6) LL237-238: The statistical significance (B- antiphase) is not clear. Furthermore, with N of 7-8, I'm not sure the parametric tests utilized are appropriate.
 
 Regarding the Reviewer's concern about the tests, please note that all the assumptions made for each model were tested (see now Materials and Methods lines 466-467).The information on each model is provided in Supplementary Table 2 under the column 'Model, random effect,' which specifies whether a Linear Mixed Model (LMM) or a Generalized Linear Mixed Model (GLMM) was implemented. For GLMMs, the corresponding distribution and link function are also specified. For the analysis of Max bFF of Anti-Phase motor units, we found a significant interaction between epoch and treatment, indicating a difference between treatments. This is indicated on the left of the y-axis (##). In control experiments, all three comparisons (pre-test, pre-post, test-post) show significant differences in Max bFF: this variable decreased (slightly but significantly) along the subsequent epochs, suggesting a change over time. We now corrected the text to indicate that these changes were small (line 268). In contrast, Max bFF in depo experiments remained stable between pre-test and pre-post, but significantly decreased between the depo and post epochs. Thus, in our view the comparison between control and the test supports the conclusion that NS depolarization was limited to counteracting this decrease (lines 270-273). Supplementary Table 2 provides the significance and modeled estimated ratio for each comparison in the column for pairwise simple contrasts.
 
 Thanks to this question, we realized that the nomenclature used in the table for the epochs (pre - depo - post) needed to be changed to pre - test - post, and we have now corrected it.
 
 (7) LL240-241: I fail to see a difference from Control.
 
 For the Relative HW of In-Phase units, we also found a significant interaction between epoch and treatment, indicating a difference between treatments, as denoted to the left of the y-axis (#). Then, the significance of the comparisons across epochs within each treatment are shown in the figure (*). What is important to notice is that obtaining the same significance for each treatment does not imply identical results, but we failed to describe this in our original text and we do now in lines 275-279.
 
 (8) LL244-245: I must admit that Table 2 is beyond me. Maybe add some detail or point out to the reader what is important (if at all).
 
 We have now clarified what each column of the tables indicates in the corresponding legends.
 
 Here, we also share an insight into how the experiments were designed and analyzed:
 
 To account for possible temporal drifts of the variables during the recordings that could mask or confuse the results, we compared two experimental series: one in which NS was subjected to depolarizing current pulses (depo), and another series (ctrl) in which the neurons were not depolarized.
 
 The statistical analysis was made using Linear Mixed Models (LMMs) or Generalized Linear Mixed Models (GLMMs). In these analyses treatments and epochs are used as explanatory variables to evaluate the interaction between these factors. These models allow us to determine whether changes in each variable across epochs differ depending on the treatment. For example, whether the variation in firing frequency from pre to test to post differs between control experiments and those in which NS was depolarized.
 
 A significant interaction between treatment and epoch indicates that NS depolarization affected the variable. In such cases, we performed pairwise comparisons between epochs (pre-test, test-post, pre-post) within each treatment. In contrast, the absence of a significant interaction can result from two possibilities: either the variable did not change across epoch in either treatment, or a similar temporal drift occurred in both cases.
 
 (9) LL245-256: Move this paragraph to the discussion.
 
 Because we introduced a rationale for the experiments described in Figure 6 (lines 282-284) the paragraph was mostly removed, but the part that supports the methodological approach was left.
 
 (10) LL259-260: see my second minor point above. This is explained in LL270-272 for the first time.
 
 We amended according to comment (2).
 
 (11) Figures: The quantitative analysis shown in Figure 3B is very useful. Why isn't this type of analysis utilized for the comparisons shown in Figures 4 and 6?
 
 We chose different ways of plotting the data based on their nature. In Figure 3B, we present data from an identified neuron (DE-3) recorded in different experiments. In contrast, in Figure 6 we analyze data from neurons classified into the same group based on their activity during the fictive crawling cycle, but their individual identity was not ascertained. Therefore, we consider it important to plot the results for each unit individually, to assess the effect of temporal drift and NS depolarization.
 
 (12) Figures: Figure 7 is meant to be compared to Figure 1C; the point being the addition of an inhibitory connection onto the NS neuron. Why are other details of the figure also different (different colored M)?
 
 While Figure 1C illustrates the known connection between NS and both DE-3 and CV motoneurons, Figure 7 shows the connections between NS and the different groups of motor units described in this study. The units are represented in the circuit using the same colors that identify them in Figures 4 and 6. Since the CV motoneuron was not recorded in this study, the circuit represents the AntiPhase neurons but does not identify them with CV. Figure 7 legend now clarifies what the colors represent, and Figure 1C has been updated to match the same color scheme.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.12.03.626557v2
www.biorxiv.org www.biorxiv.org

Social Experience Shapes Fighting Strategies for Reproductive Success

5
1. Public_Reviews 29 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 The paper presents a new behavioral assay for Drosophila aggression and demonstrates that social experience influences fighting strategies, with group-housed males favoring high-intensity but low-frequency tussling over aggressive lunging observed in isolated males. This paper is important for researchers studying the impact of social isolation on aggression, while the description of tussling behavior and the interpretation of the link between tussling and mating success are incomplete.
 
 Summary
2. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This work addresses an important question in the field of Drosophila aggression and mating. Prior social isolation is known to increase aggression in males, manifesting as increased lunging, which is suppressed by group housing (GH). However, it is also known that single housed (SH) males, despite their higher attempts to court females, are less successful. Here, Gao et al., develop a modified aggression assay to address this issue by recording aggression in Drosophila males for 2 hours, with a virgin female immobilized by burying its head in the food. They found that while SH males frequently lunge in this assay, GH males switch to higher intensity but very low frequency tussling. Constitutive neuronal silencing and activation experiments implicate cVA sensing Or67d neurons in promoting high frequency lunging, similar to earlier studies, whereas Or47b neurons promote low frequency but higher intensity tussling. Optogenetic activation revealed that three pairs of pC1SS2 neurons increase tussling. Cell-type-specific DsxM manipulations combined with morphological analysis of pC1SS2 neurons and side-by-side tussling quantification link the developmental role of DsxM to the functional output of these aggression-promoting cells. In contrast, although optogenetic activation of P1a neurons in the dark did not increase tussling, thermogenetic activation under visible light drove aggressive tussling. Using a further modified aggression assay, GH males exhibit increased tussling and maintain territorial control, which could contribute to a mating advantage over SH males, although direct measures of reproductive success are still needed
 
 Strengths:
 
 Through a series of clever neurogenetic and behavioral approaches, the authors implicate specific subsets of ORNs and pC1 neurons in promoting distinct forms of aggressive behavior, particularly tussling. They have devised a refined territorial control paradigm, which appears more robust than earlier assays using a food cup (Chen et al., 2002). This new setup is relatively clutter-free and could be amenable to future automation using computer vision approaches. The updated Figure 5, which combines cell-type-specific developmental manipulation of pC1SS2 neurons with behavioral output, provides a link between developmental mechanisms and functional aggression circuits. The manuscript is generally well written, and the claims are largely supported by the data.
 
 Weakness:
 
 Although most concerns have been addressed, the manuscript still lacks a rigorous, objective method for quantifying lunging and tussling. Because scoring appears to have been done manually and a single lunge in a 30 fps video spans only 2-3 frames, the 0.2 s cutoff seems arbitrary, and there are no objective criteria distinguishing reciprocal lunging from tussling. Despite this, the study offers valuable insights into the neural and behavioral mechanisms of Drosophila aggression.
 
 Review 1
3. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 Gao et al. investigated the change of aggression strategies by the social experience and its biological significance by using Drosophila. Two modes of inter-male aggression in Drosophila are known: lunging, high-frequency but weak mode, and tussling, low-frequency but more vigorous mode. Previous studies have mainly focused on the lunging. In this paper, the authors developed a new behavioral experiment system for observing tussling behavior and found that tussling is enhanced by group rearing, while lunging is suppressed. They then searched for neurons involved in the generation of tussling. Although olfactory receptors named Or67d and Or65a have previously been reported to function in the control of lunging, the authors found that these neurons do not function in the execution of tussling and another olfactory receptor, Or47b, is required for tussling, as shown by the inhibition of neuronal activity and the gene knockdown experiments. Further optogenetic experiments identified a small number of central neurons pC1[SS2] that induce the tussling specifically. These neurons express doublesex (dsx), a sex-determination factor, and knockdown of dsx strongly suppresses the induction of tussling. In order to further explore the ecological significance of the aggression mode change in group-rearing, a new behavioral experiment was performed to examine the territorial control and the mating competition. And finally, the authors found that differences in the social experience (group vs. solitary rearing) and the associated change in aggression strategy are important in these biologically significant competitions. These results add a new perspective to the study of aggression behavior in Drosophila. Furthermore, this study proposes an interesting general model in which the social experience modified behavioral changes play a role in reproductive success.
 
 Strengths:
 
 A behavioral experiment system that allows stable observation of tussling, which could not be easily analyzed due to its low-frequency, would be very useful. The experimental setup itself is relatively simple, just the addition of a female to the platform, so it should be applicable to future research. The finding about the relationship between the social experience and the aggression mode change is quite novel. Although the intensity of aggression changes with the social experience was already reported in several papers (Liu et al., 2011 etc), the fact that the behavioral mode itself changes significantly has rarely been addressed, and is extremely interesting. The identification of sensory and central neurons required for the tussling makes appropriate use of the genetic tools and the results are clear. A major strength of this study in neurobiology is the finding that another group of neurons (Or47b-expressing olfactory neurons and pC1[SS2] neurons), distinct from the group of neurons previously thought to be involved in low-intensity aggression (i.e. lunging), function in the tussling behavior. Furthermore, the results showing that the regulation of aggression by pC1[SS2] neurons is based on the function of the dsx gene will bring a new perspective to the field. Further investigation of the detailed circuit analysis is expected to elucidate the neural substrate of the conflict between the two aggression modes. The experimental systems examining the territory control and the reproductive competition in Fig. 6 are novel and have advantages in exploring their biological significance. It is important to note that in addition to showing the effects of age and social experience on territorial and mating behaviors, the authors experimentally demonstrated that altered fighting strategy has effects with respect to these behaviors.
 
 Review 2
4. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 In this revised manuscript, Gao et al. presented a series of well-controlled behavioral data showing that tussling, a form of high-intensity fighting among male fruit flies (Drosophila melanogaster) is enhanced specifically among socially experienced and relatively old males. Moreover, results of behavioral assays led authors to suggest that increased tussling among socially experienced males may increase mating success. They also concluded that tussling is controlled by a class of olfactory sensory neurons and sexually dimorphic central neurons that are distinct from pathways known to control lunges, a common male-type attack behavior.
 
 A major strength of this work is that it is the first attempt to characterize behavioral function and neural circuit associated with Drosophila tussling. Many animal species use both low-intensity and high-intensity tactics to resolve conflicts. High-intensity tactics are mostly reserved for escalated fights, which are relatively rare. Because of this, tussling in the flies, like high-intensity fights in other animal species, have not been systematically investigated. Previous studies on fly aggressive behavior have often used socially isolated, relatively young flies within a short observation duration. Their discovery that 1) older (14-days old) flies tend to tussle more often than younger (2 to 7-days-old) flies, 2) group-reared flies tend to tussle more often than socially isolated flies, and 3) flies tend to tussle at later stage (mostly ~15 minutes after the onset of fighting), are the result of their creativity to look outside of conventional experimental settings. These new findings are key for quantitatively characterizing this interesting yet under-studied behavior.
 
 Newly presented data have made several conclusions convincing. Detailed descriptions of methods to quantify behaviors help understand the basis of their claims by improving transparency. However, I remain concerned about authors' persistent attempt to link the high intensity aggression to reproductive success. The authors' effort to "tone down" the link between the two phenomena remains insufficient. There are purely correlational. I reiterate this issue because the overall value of the manuscript would not change with or without this claim.
 
 Review 3
5. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 This work addresses an important question in the field of Drosophila aggression and mating- prior social isolation is known to increase aggression in males by increased lunging, which is suppressed by group housing (GH). However, it is also known that single-housed (SH) males, despite their higher attempts to court females, are less successful. Here, Gao et al., developed a modified aggression assay, to address this issue by recording aggression in Drosophila males for 2 hours, over a virgin female which is immobilized by burying its head in the food. They found that while SH males frequently lunge in this assay, GH males switch to higher intensity but very low-frequency tussling. Constitutive neuronal silencing and activation experiments implicate cVA sensing Or67d neurons promoting high-frequency lunging, similar to earlier studies, whereas Or47b neurons promote low-frequency but higher intensity tussling. Using optogenetic activation they found that three pairs of pC1 neurons- pC1SS2 increase tussling. While P1a neurons, previously implicated in promoting aggression and courtship, did not increase tussling in optogenetic activation (in the dark), they could promote aggressive tussling in thermogenetic activation carried out in the presence of visible light. It was further suggested, using a further modified aggression assay that GH males use increased tussling and are able to maintain territorial control, providing them mating advantage over SI males and this may partially overcome the effect of aging in GH males.
 
 Strengths
 
 Using a series of clever neurogenetic and behavioral approaches, subsets of ORNs and pC1 neurons were implicated in promoting tussling behaviors. The authors devised a new paradigm to assay for territory control which appears better than earlier paradigms that used a food cup (Chen et al, 2002), as this new assay is relatively clutter-free, and can be eventually automated using computer vision approaches. The manuscript is generally well-written, and the claims made are largely supported by the data.
 
 Thank you for your precise summary of our study, and being very positive on the novelty and significance of the study.
 
 Weaknesses
 
 I have a few concerns regarding some of the evidence presented and claims made as well as a description of the methodology, which needs to be clarified and extended further.
 
 (1) Typical paradigms for assaying aggression in Drosophila males last for 20-30 minutes in the presence of nutritious food/yeast paste/females or all of these (Chen et al. 2002, Nilsen et al., 2004, Dierick et al. 2007, Dankert et al., 2009, Certel & Kravitz 2012). The paradigm described in Figure 1 A, while important and more amenable for video recording and computational analysis, seems a modification of the assay from Kravitz lab (Chen et al., 2002), which involved using a female over which males fight on a food cup. The modifications include a flat surface with a central food patch and a female with its head buried in the food, (fixed female) and much longer adaptation and recording times respectively (30 minutes, 2 hours), so in that sense, this is not a 'new' paradigm but a modification of an existing paradigm and its description as new should be appropriately toned down. It would also be important to cite these earlier studies appropriately while describing the assay.
 
 We now toned down the description of the paradigm and cited more related references.
 
 (2) Lunging is described as a 'low intensity' aggression (line 111 and associated text), however, it is considered a mid to high-intensity aggressive behavior, as compared to other lower-intensity behaviors such as wing flicks, chase, and fencing. Lunging therefore is lower in intensity 'relative' to higher intensity tussling but not in absolute terms and it should be mentioned clearly.
 
 We have modified the description as suggested.
 
 (3) It is often difficult to distinguish faithfully between boxing and tussling and therefore, these behaviors are often clubbed together as box, tussle by Nielsen et al., 2004 in their Markov chain analysis as well as a more detailed recent study of male aggression (Simon & Heberlein, 2020). Therefore, authors can either reconsider the description of behavior as 'box, tussle' or consider providing a video representation/computational classifier to distinguish between box and tussle behaviors.
 
 Indeed, we could not faithfully distinguish boxing and tussling. To address this concern, we now made textual changes in the result section we occasionally observed the high-intensity boxing and tussling behavior in male flies, which are difficult to distinguish and hereafter simply referred to as tussling.
 
 We also added this information in the Materials and Methods section Tussling is often mixed with boxing, in which both flies rear up and strike the opponent with forelegs. Since boxing is often transient and difficult to distinguish from tussling, we referred to the mixed boxing and tussling behavior simply as tussling.
 
 (4) Simon & Heberlein, 2020 showed that increased boxing & tussling precede the formation of a dominance hierarchy in males, and lunges are used subsequently to maintain this dominant status. This study should be cited and discussed appropriately while introducing the paradigm.
 
 We now cited this important study in both the Introduction and Discussion sections.
 
 (5) It would be helpful to provide more methodological details about the assay, for instance, a video can be helpful showing how the males are introduced in the assay chamber, are they simply dropped to the floor when the film is removed after 30 minutes (Figures 1-2)?
 
 We now provided more detailed description about behavioral assays and how we analyze them. For example All testers were loaded by cold anesthesia. After a 30-minute adaptation, the film was gently removed to allow the two males to fell into the behavioral chamber, and the aggressive behavior was recorded for 2 hours.
 
 (6) The strain of Canton-S (CS) flies used should be mentioned as different strains of CS can have varying levels of aggression, for instance, CS from Martin Heisenberg lab shows very high levels of aggressive lunges. Are the CS lines used in this study isogenized? Are various genetic lines outcrossed into this CS background? In the methods, it is not clear how the white gene levels were controlled for various aggression experiments as it is known to affect aggression (Hoyer et al. 2008).
 
 We used the wtcs flies from Baker lab in Janelia Research Campus, and are not sure where they are originated. We appreciate your concern on the use of wild-type strains as they may show different fighting levels, but this study mainly used wild-type strains to compare behavioral differences between SH and GH males. All flies tested in this study are in w+ background, based on w+ balancers flies but are not backcrossed. We have listed detailed genotypes of all tested flies in Table S1 in the revised manuscript.
 
 (7) How important it is to use a fixed female for the assay to induce tussling? Do these females remain active throughout the assay period of 2.5 hours? Is it possible to use decapitated virgin females for the assay? How will that affect male behaviors?
 
 We used a fixed female to restrict it in the center of food. These females remain active throughout the assay as their legs and abdomens can still move. Such design intends to combine the attractive effects from both female and food. One can also use decapitated females, but in this case, males can push the decapitated female into anywhere in the behavioral chamber. The logic to use fixed females has now been added in the Materials and Methods section of the revised manuscript.
 
 (8) Raster plots in Figure 2 suggest a complete lack of tussling in SH males in the first 60 minutes of the encounter, which is surprising given the longer duration of the assay as compared to earlier studies (Nielsen et al. 2004, Simon & Heberlein, 2020 and others), which are able to pick up tussling in a shorter duration of recording time. Also, the duration for tussling is much longer in this study as compared to shorter tussles shown by earlier studies. Is this due to differences in the paradigm used, strain of flies, or some other factor? While the bar plots in Figure 2D show some tussling in SH males, maybe an analysis of raster plots of various videos can be provided in the main text and included as a supplementary figure to address this.
 
 Indeed, tussling is very low in SH males in our paradigm, which may be due to different genetic backgrounds and behavioral assays. Since tussling behavior is a rare fighting form, it is not surprising to see variation between studies from different labs. Nevertheless, this study compared tussling behaviors in SH and GH males, and our finding that GH males show much more tussling behaviors is convincing. The longer duration of tussling in our paradigm may also be due to the modified behavioral paradigm, which also supports that tussling is a high-level fighting form.
 
 (9) Neuronal activation experiments suggesting the involvement of pC1SS2 neurons are quite interesting. Further, the role of P1a neurons was demonstrated to be involved in increasing tussling in thermogenetic activation in the presence of light (Figure 4, Supplement 1), which is quite important as the role of vision in optogenetic activation experiments, which required to be carried out in dark, is often not mentioned. However, in the discussion (lines 309-310) it is mentioned that PC1SS2 neurons are 'necessary and sufficient' for inducing tussling. Given that P1a neurons were shown to be involved in promoting tussling, this statement should be toned down.
 
 Thank you for this important comment. We now toned down the statement on pC1SS2 function.
 
 (10) Are Or47b neurons connected to pC1SS2 or P1a neurons?
 
 We conducted pathway analysis in the FlyWire electron microscopy database to investigate the connection between Or47b neurons and pC1 neurons. The results indicate that at least three levels of interneurons are required to establish a connection from Or47b neurons to pC1 neurons. Although the FlyWire database currently only contains neuronal data from female brains, they provide a reference for circuit connect in males.
 
 (11) The paradigm for territory control is quite interesting and subsequent mating advantage experiments are an important addition to the eventual outcome of the aggressive strategy deployed by the males as per their prior housing conditions. It would be important to comment on the 'fitness outcome' of these encounters. For instance, is there any fitness advantage of using tussling by GH males as compared to lunging by SH males? The authors may consider analyzing the number of eggs laid and eclosed progenies from these encounters to address this.
 
 Thank you for this suggestion. We agree with you and other reviewers that increased tussling behaviors correlate with better mating competition, but it is difficult for us to make a direct link between them. Thus, in the revised manuscript, we prefer to tone down this statement but not expanding on this part.
 
 Reviewer #2 (Public review):
 
 Summary
 
 Gao et al. investigated the change of aggression strategies by the social experience and its biological significance by using Drosophila. Two modes of inter-male aggression in Drosophila are known lunging, high-frequency but weak mode, and tussling, low-frequency but more vigorous mode. Previous studies have mainly focused on the lunging. In this paper, the authors developed a new behavioral experiment system for observing tussling behavior and found that tussling is enhanced by group rearing while lunging is suppressed. They then searched for neurons involved in the generation of tussling. Although olfactory receptors named Or67d and Or65a have previously been reported to function in the control of lunging, the authors found that these neurons do not function in the execution of tussling, and another olfactory receptor, Or47b, is required for tussling, as shown by the inhibition of neuronal activity and the gene knockdown experiments. Further optogenetic experiments identified a small number of central neurons pC1[SS2] that induce the tussling specifically. In order to further explore the ecological significance of the aggression mode change in group rearing, a new behavioral experiment was performed to examine territorial control and mating competition. Finally, the authors found that differences in the social experience (group vs. solitary rearing) are important in these biologically significant competitions. These results add a new perspective to the study of aggressive behavior in Drosophila. Furthermore, this study proposes an interesting general model in which the social experience-modified behavioral changes play a role in reproductive success.
 
 Strengths
 
 A behavioral experiment system that allows stable observation of tussling, which could not be easily analyzed due to its low frequency, would be very useful. The experimental setup itself is relatively simple, just the addition of a female to the platform, so it should be applicable to future research. The finding about the relationship between the social experience and the aggression mode change is quite novel. Although the intensity of aggression changes with the social experience was already reported in several papers (Liu et al., 2011, etc), the fact that the behavioral mode itself changes significantly has rarely been addressed and is extremely interesting. The identification of sensory and central neurons required for the tussling makes appropriate use of the genetic tools and the results are clear. A major strength of the neurobiology in this study is the finding that another group of neurons (Or47b-expressing olfactory neurons and pC1[SS2] neurons), distinct from the group of neurons previously thought to be involved in low-intensity aggression (i.e. lunging), function in the tussling behavior. Further investigation of the detailed circuit analysis is expected to elucidate the neural substrate of the conflict between the two aggression modes.
 
 Thank you for the acknowledgment of the novelty and significance of the study, and your suggestions for improving the manuscript.
 
 Weaknesses
 
 The experimental systems examining the territory control and the reproductive competition in Figure 5 are novel and have advantages in exploring their biological significance. However, at this stage, the authors' claim is weak since they only show the effects of age and social experience on territorial and mating behaviors, but do not experimentally demonstrate the influence of aggression mode change itself. In the Abstract, the authors state that these findings reveal how social experience shapes fighting strategies to optimize reproductive success. This is the most important perspective of the present study, and it would be necessary to show directly that the change of aggression mode by social experience contributes to reproductive success.
 
 We agree that our data did not directly show that it is the change of aggression mode that results in territory and reproductive advantages in GH males. To address the concern, we have toned down the statement throughout the manuscript. For example, we made textual changes in the abstract as following
 
 Moreover, shifting from lunging to tussling in socially enriched males is accompanied with better territory control and mating success, mitigating the disadvantages associated with aging. Our findings identify distinct sensory and central neurons for two fighting forms and suggest how social experience shapes fighting strategies to optimize reproductive success.
 
 In addition, a detailed description of the tussling is lacking. For example, the authors state that the tussling is less frequent but more vigorous than lunging, but while experimental data are presented on the frequency, the intensity seems to be subjective. The intensity is certainly clear from the supplementary video, but it would be necessary to evaluate the intensity itself using some index. Another problem is that there is no clear explanation of how to determine the tussling. A detailed method is required for the reproducibility of the experiment.
 
 Thank you for this important suggestion. We now analyzed duration of tussling and lunging, and found that a lunging event is often very short (less than 0.2s), while a tussling event may last from seconds to minutes. This new data is added as Figure 2G. In addition, we also provided more detailed methods regarding to tussling behavior
 
 . Reviewer #3 (Public review):
 
 In this manuscript, Gao et al. presented a series of intriguing data that collectively suggest that tussling, a form of high-intensity fighting among male fruit flies (Drosophila melanogaster) has a unique function and is controlled by a dedicated neural circuit. Based on the results of behavioral assays, they argue that increased tussling among socially experienced males promotes access to resources. They also concluded that tussling is controlled by a class of olfactory sensory neurons and sexually dimorphic central neurons that are distinct from pathways known to control lunges, a common male-type attack behavior.
 
 A major strength of this work is that it is the first attempt to characterize the behavioral function and neural circuit associated with Drosophila tussling. Many animal species use both low-intensity and high-intensity tactics to resolve conflicts. High-intensity tactics are mostly reserved for escalated fights, which are relatively rare. Because of this, tussling in the flies, like high-intensity fights in other animal species, has not been systematically investigated. Previous studies on fly aggressive behavior have often used socially isolated, relatively young flies within a short observation duration. Their discovery that 1) older (14-days-old) flies tend to tussle more often than younger (2-days-old) flies, 2) group-reared flies tend to tussle more often than socially isolated flies, and 3) flies tend to tussle at a later stage (mostly ~15 minutes after the onset of fighting), are the result of their creativity to look outside of conventional experimental settings. These new findings are keys for quantitatively characterizing this interesting yet under-studied behavior.
 
 Precisely because their initial approach was creative, it is regrettable that the authors missed the opportunity to effectively integrate preceding studies in their rationale or conclusions, which sometimes led to premature claims. Also, while each experiment contains an intriguing finding, these are poorly related to each other. This obscures the central conclusion of this work. The perceived weaknesses are discussed in detail below.
 
 Thank you for the precise summary of the key findings and novelty of the study, and your insightful suggestions.
 
 Most importantly, the authors' definition of "tussling" is unclear because they did not explain how they quantified lunges and tussling, even though the central focus of the manuscript is behavior. Supplemental movies S1 and S2 appear to include "tussling" bouts in which 2 flies lunge at each other in rapid succession, and supplemental movie S3 appears to include bouts of "holding", in which one fly holds the opponent's wings and shakes vigorously. These cases raise a concern that their behavior classification is arbitrary. Specifically, lunges and tussling should be objectively distinguished because one of their conclusions is that these two actions are controlled by separate neural circuits. It is impossible to evaluate the credibility of their behavioral data without clearly describing a criterion of each behavior.
 
 Thank you for this very important suggestion. We now provided more detailed description of the two fighting forms in the Materials and Methods section. See below
 
 Lunging is characterized by a male raising its forelegs and quickly striking the opponent, and each lunge typically lasts less than 0.2 seconds through detailed analysis. Tussling is characterized by both males using their forelegs and bodies to tumble over each other, and this behavior may last from seconds to minutes. Tussling is often mixed with boxing, in which both flies rear up and strike the opponent with forelegs. Since boxing is often transient and difficult to distinguish from tussling, we referred to the mixed boxing and tussling behavior simply as tussling. As we manually analyze tussling for 2 hours for each pair of males, it is possible that we may miss some tussling events, especially those quick ones.
 
 It is also confusing that the authors completely skipped the characterization of the tussling-controlling neurons they claimed to have identified. These neurons (a subset of so-called pC1 neurons labeled by previously described split-GAL4 line pC1SS2) are central to this manuscript, but the only information the authors have provided is its gross morphology in a low-resolution image (Figure 4D, E) and a statement that "only 3 pairs of pC1SS2 neurons whose function is both necessary and sufficient for inducing tussling in males" (lines 310-311). The evidence that supports this claim isn't provided. The expression pattern of pC1SS2 neurons in males has been only briefly described in reference 46. It is possible that these neurons overlap with previously characterized dsx+ and/or fru+ neurons that are important for male aggressions (measured by lunges), such as in Koganezawa et al., Curr. Biol. 2016 and Chiu et al., Cell 2020. This adds to the concern that lunge and tussling are not as clearly separated as the authors claim.
 
 Thank you very much for this important question. Indeed, there are many experiments that could do to better understand the function of pC1SS2 neurons, and we only provide the initial characterization of them due to the limited scope of this study. My lab has been focused on studying P1/pC1 function in both male and female flies and will continue to do so.
 
 To partially address your concern, we made the following revisions
 
 (1) We provided higher-resolution images of P1a and pC1SS2 (Figure 4C-4E). While their cell bodies are very close, they project to distinct brain regions, in addition to some shared ones.
 
 (2) By staining these neurons with GFP and co-staining with anti-FruM or anti-DsxM antibodies, we showed that P1a neurons are partially FruM-positive and partially DsxM-positive, while pC1SS2 neurons are DsxM-positive and FruM-negative (Figure 5A-5D).
 
 (3) As pC1SS2 neurons are DsxM-positive and FruM-negative, we also examined how DsxM regulates the development of these neurons. We found that knocking down DsxM expression in pC1SS2 neurons using RNAi significantly affected pC1 development regarding to both cell numbers (Figure 5G) and their projections (Figure 5H).
 
 (4) We further found that DsxM in pC1SS2 neurons is crucial for executing their tussling-promoting function, as optogenetic activation of these neurons with DsxM knockdown failed to induce tussling behavior in the initial activation period, and a much lower level of tussling in the second activation period compared to control males (Figure 5I-5K).
 
 (5) While it is very difficult to identify the upstream and downstream neurons of P1a and pC1SS2 neurons, we made an initial step by utilizing trans-tango and retro-Tango to visualize potential downstream and upstream neurons of P1a and pC1SS2 (Figure 4-figure supplement 2), which certainly needs future investigation.
 
 While their characterizations of tussling behaviors in wild-type males (Figures 1 and 2) are intriguing, the remaining data have little link with each other, making it difficult to understand what their main conclusion is. Figure 3 suggests that one class of olfactory sensory neurons (OSN) that express Or47b is necessary for tussling behavior. While the authors acknowledged that Or47b-expressing OSNs promote male courtship toward females presumably by detecting cuticular compounds, they provided little discussion on how a class of OSN can promote two different types of innate behavior. No evidence of a functional or circuitry relationship between the Or47b pathway and the pC1SS2 neurons was provided. It is unclear how these two components are relevant to each other.
 
 It has been previously found that Or47b-expressing ORNs respond to fly pheromones common to both sexes, and group-housing enhances their sensitivity. Regarding to how Or47b ORNs promotes two different types of innate behaviors, a simple explanation is that they act on multiple second-order and further downstream neurons to regulate both courtship and aggression, not mentioning that neural circuitries for courtship and aggression are partially shared. We did not include this in the discussion as we would like to focus on aggression modes, and how different ORNs (Or47b and Or67d) mediate distinct aggression modes.
 
 Regarding to the relationship between Or47b ORNs and pC1SS2 neurons, or in general ORNs to P1/pC1, it is interesting and important to explore, but probably in a separate study. We tried to conduct pathway connection analyses from Or47b to pC1 using the FlyWire database, and found that Or47b neurons can act on pC1 neurons via three layers of interneurons. Although the FlyWire database currently only contains neuronal data from female brains, they can provide a certain degree of reference. We hope the editor and reviewers would agree with us that identifying these intermediate neurons involved in their connection is beyond this study.
 
 Lastly, the rationale of the experiment in Figure 5 and the interpretation of the results is confusing. The authors attributed a higher mating success rate of older, socially experienced males over younger, socially isolated males to their tendency to tussle, but tussling cannot happen when one of the two flies is not engaged. If, for instance, a socially isolated 14-day-old male does not engage in tussling as indicated in Figure 2, how can they tussle with a group-housed 14-day-old male? Because aggressive interactions in Figure 5 were not quantified, it is impossible to conclude that tussling plays a role in copulation advantage among pairs as authors argue (lines 282-288).
 
 Indeed, we do not have direct evidence to show it is tussling that makes socially experienced males to dominate over socially isolated males. To address your concern, we have made following revisions
 
 (1) We toned down the statements about the relationship between fighting strategies and reproductive success throughout the manuscript. For example, in the abstract Moreover, shifting from lunging to tussling in socially enriched males is accompanied with better territory control and mating success.
 
 (2) Regarding to whether a SH male can engage in tussling with a GH male, we found that while two SH males rarely perform tussling, paired SH and GH males displayed similar levels of tussling like two GH males, although tussling duration from paired SH and GH males is significantly lower compared to that in two GH males (Figure 6-figure supplement 2).
 
 (3) To support the potential role of tussling in territory control and mating competition, we performed additional experiments to silence Or47b or pC1SS2 neurons that almost abolished tussling, and paired these males with control males. We found that males with Or47b or pC1SS2 neurons silenced cannot compete over control males, further suggesting the involvement of tussling in territory control and mating competition.
 
 Despite these weaknesses, it is important to acknowledge the authors' courage to initiate an investigation into a less characterized, high-intensity fighting behavior. Tussling requires the simultaneous engagement of two flies. Even if there is confusion over the distinction between lunges and tussling, the authors' conclusion that socially experienced flies and socially isolated flies employ distinct fighting strategies is convincing. Questions that require more rigorous studies are 1) whether such differences are encoded by separate circuits, and 2) whether the different fighting strategies are causally responsible for gaining ethologically relevant resources among socially experienced flies. Enhanced transparency of behavioral data will help readers understand the impact of this study. Lastly, the manuscript often mentions previous works and results without citing relevant references. For readers to grasp the context of this work, it is important to provide information about methods, reagents, and other key resources.
 
 Thank you very much for this comment and we almost totally agree.
 
 (1) Our results suggest the involvement of distinct sensory neurons and central neurons for lunging and tussling, but do not exclude the possibility that they may also utilize shared neurons. For example, activation of P1a neurons promotes both lunging and tussling in the presence of light.
 
 (2) We have now toned down the statements about the relationship between fighting strategies and reproductive success throughout the manuscript.
 
 (3) We provided more detailed methods, genotypes of flies to improve transparency of the manuscript.
 
 Reviewer #1 (Recommendations for the authors):
 
 (1) Figure 1 Supplement 1 shows that increased aging has a linear and inverse relationship with the number of lunges, this is in contrast to a previous study from Dierick lab (Chowdhury, 2021), where using Divider assays they showed that aggressive lunges increased up to day 10 and subsequently decreased in 30-day old flies. Given that this study did not use 14-day-old flies, it might be useful to comment on this.
 
 Thank you for this comment. Indeed, Chowdhury et al., suggested a decline of lunging after 10 days, which is not contradictory to our findings that lunging in 14d-old males is lower than that in 7d-old males. It is ideally to perform a time-series experiments to reveal the detailed relationship between ages and aggression (lunging or tussling) levels, but given our initial findings that 14d-old males showed stable tussling behavior, we prefer to use this time point for the rest of this study.
 
 (2) For Figure 3, do various manipulations also affect the duration of tussling and boxing besides frequency and latency?
 
 Thank you for this comment. We only analyzed latency and frequency, but not duration, as data analysis was performed manually rather than automatically on every fly pair for about 2 hours, which is very labor-consuming. We hope you could agree with us that the two parameters (frequency and latency) for tussling are representative for assaying this behavior.
 
 (3) For Figure 3 A-F, the housing status of the males is not clearly mentioned either in the main text or the figure. What is the status of the tussling and lunging status when this housing condition is reversed when Or47b neurons are silenced, or the gene is knocked down? Do these manipulations overcome the effect of housing conditions similar to what is seen in NaChBac-mediated activation experiments?
 
 Figure 3A-F used group-housed males and we have now added such information in the figure legends as well as Table S1.
 
 We appreciate your suggestion on using different housing conditions. As silencing Or47b neurons or knocking down Or47b reduced tussling, it is reasonable to use GH males (as we did in Figure 3A-F) that performed stable tussling behavior, but not SH males that rarely tussle.
 
 (4) The connections between Or47b neurons and pC1SS2 or P1a neurons can be addressed by available connectomic datasets or TransTango/GRASP approaches.
 
 Thank you for this important suggestion. We used the FlyWire electron microscope database to analyze the pathway connections between these two types of neurons. The results indicated that there are at least three levels of interneurons for connecting Or47b and pC1 neurons. Although the FlyWire database currently only contains neuronal data from female brains, they can provide a certain degree of reference for males.
 
 The lack of direct synaptic connection also suggests that it is challenging to resolve the connection between these two neuronal types using methods like trans-Tango/GRASP. To partially address this question, we utilized trans-Tango and retro-Tango techniques to visualize potential downstream and upstream neurons of P1a and pC1SS2 (Figure 4-figure supplement 2). Future investigations are certainly needed for clarifying functional connections between Or47b/Or67d and P1a/pC1SS2 neurons.
 
 (5) Figure 5, 'Winning index' and 'Copulation advance index' while described in Material and Methods, should be referred to in the main text.
 
 We now described these two indices briefly in the main manuscript, and in the Discussion section with more details.
 
 (6) Figure 6 shows comparisons for territorial control and mating outcomes where four different housing and aging conditions are organized in a hierarchical sequence. It is not clear from the data in Figure 5, how this conclusion was arrived at. A supplementary table with various outcomes with statistical analysis would help with this.
 
 We now added a supplementary table (Table S2) with various outcomes with statistical analysis.
 
 Minor Comments
 
 (1) Line 26 says that the courtship levels in SH and GH males are not different, however, unilateral wing extension is higher in SH males as compared to GH males (Pan & Baker, 2014; Inagaki et al., 2014), also it was shown that courtship attempts are higher in D. paulsitorium (Kim & Ehrman, 1998). It would be better to clarify this statement.
 
 Indeed, it is found in some cases that SH males court more vigorously than GH males. We have added more references on this matter in the introduction.
 
 (2) Figure 4, correct 'Tussing' to 'Tussling' or 'Box, Tussling' as appropriate.
 
 Corrected.
 
 (3) Duistermars, 2018 should be cited while discussing the role of vision in aggression (Figure 4). [A Brain Module for Scalable Control of Complex, Multi-motor Threat Displays]
 
 We now cited this reference and added more discussion in the revised manuscript.
 
 (4) Reviews on Drosophila aggression and social isolation can be cited in the introduction/discussion to incorporate recent literature e.g., Palavicino-Maggio, 2022 [The Neuromodulatory Basis of Aggression Lessons From the Humble Fruit Fly]; Yadav et al., 2024[Lessons from lonely flies Molecular and neuronal mechanisms underlying social isolation], etc.
 
 We now cited these references in both the introduction and discussion sections.
 
 (5) The concentration of apple juice agar should be mentioned in the methods.
 
 We added this and other necessary information for materials in the Materials and Methods section of the study.
 
 (6) Source of the LifeSongX software and, if available, a Github link would be helpful to include in the materials and methods section.
 
 We now provided the source of the LifesongY software (website https//sourceforge.net/projects/lifesongy/), which is a Windows version of LifesongX (Bernstein, Adam S.et al., 1992).
 
 Reviewer #2 (Recommendations for the authors):
 
 (1) Major comment 1
 
 As pointed out in the public review, the weakness of this study is that the relationship between the aggression strategy and reproductive success is an inference that is not based on experimental facts; I understand that the frequency of tussling is not so high, but at least tussling-like behavior can be observed in the territory control experiment shown in Video 3. Wouldn't it be possible to re-analyse data and examine the correlation between aggressive behavior and territory control? Even if the analysis of tussling itself in this setup is difficult, for example, additional experiments using Or47b knock-out fly or pC1[SS2]-inactivated fly could provide stronger support.
 
 Indeed, we can only make a correlation between the type of aggressive behavior and territory control. We now toned down this statement throughout the manuscript. For example, in the abstract, we changed our conclusions as following
 
 Moreover, shifting from lunging to tussling in socially enriched males is accompanied with better territory control and mating success. Our findings identify distinct sensory and central neurons for two fighting forms and suggest how social experience shapes fighting strategies to optimize reproductive success.
 
 To further address the concern, we now performed additional experiments to silence Or47b or pC1SS2 neurons that almost abolished tussling, and paired these males with control males. We found that males with Or47b or pC1SS2 neurons silenced cannot compete over control males (Figure 6-figure supplement 3), further suggesting the involvement of tussling in territory control and mating competition.
 
 In relation to the above, some of the text in the Abstract should be changed.Line 28 These findings "reveal" how social experience shapes fighting strategies to optimise reproductive success.
 
 "suggest" is more accurate at this stage.
 
 Changed as suggested.
 
 (2) Major comment 2
 
 The tussling is the central subject of this paper. However, neither the main text nor Materials and Methods section provides a clear explanation of how this aggression mode was detected. Did the authors determine this behavior manually? Or was it automatically detected by some kind of image analysis? In either case, the criteria and method for detecting the tussling should be clearly described.
 
 The behavioral data analysis in this study was performed manually. We now provided more detailed description of the two fighting forms in the Materials and Methods section. See below
 
 Lunging is characterized by a male raising its forelegs and quickly striking the opponent, and each lunge typically lasts less than 0.2 seconds through detailed analysis. Tussling is characterized by both males using their forelegs and bodies to tumble over each other, and this behavior may last from seconds to minutes. Tussling is often mixed with boxing, in which both flies rear up and strike the opponent with forelegs. Since boxing is often transient and difficult to distinguish from tussling, we referred to the mixed boxing and tussling behavior simply as tussling. As we manually analyze tussling for 2 hours for each pair of males, it is possible that we may miss some tussling events, especially those quick ones.
 
 For the experimental groups where tussling cannot be observed, the latency is regarded as 120 min, but this is a value depending on the observation time. While it is reasonable to use the latency to evaluate the behavior such as the lunging that is observed at relatively early times, care should be taken when using it to evaluate the tussling. Since similar trends to those obtained for the latency are observed for Number of tussles and % of males performing tussling, it may be better to focus on these two indices.
 
 We initially intended to provide all three statistical metrics. However, we found that using the "% of males performing tussling" would require a significantly larger sample size for subsequent statistical analysis (using chi-square tests), greatly increasing the workload. At the same time, we believe that the trend observed with "% of males performing tussling" is consistent with the other two indices, and the percentage information can also be derived from the individual sample scatter data of the other two metrics. Therefore, we opted to use "latency" and "numbers" as the statistical metrics, despite the caveat as you mentioned.
 
 The authors repeatedly mention that tussling is less frequent but more vigorous. The low frequency can be understood from the data in Fig. 1 and Fig. 2, but there are no measured data on the intensity. As the authors mention in line 125, each tussling event appears to be sustained for a relatively long period, as can be seen from the ethogram in Fig. 2. For example, it would be possible to evaluate the intensity by measuring the duration of the tussling event.
 
 Thank you for your valuable suggestion. We now analyzed duration of tussling and lunging, and found that a lunging event is often very short (less than 0.2s), while a tussling event may last from seconds to minutes, further supporting their relative intensities. This new data is added as Figure 2G.
 
 (3) Minor comments
 
 a) Line 117 How many flies were placed in one vial for group-rearing (GH)? Were males and females grouped together? Please specify in the Materials and Methods section.
 
 We have added this information in the Materials and Methods section. In brief, 30-40 virgin males were collected after eclosion and group-housed in each food vial.
 
 b) Line 174 The trans-Tango is basically a postsynaptic cell labeling technique. It is unlikely that the labeling intensity changes depending on neuronal activity. Do the authors want to say in this text the high activity of Or47b-expressing neurons under GH conditions? Or are they trying to show that the expression level of the Or47b gene, which is supposedly monitored by the expression of GAL4, is increased by GH conditions? The authors should clarify which is the case.
 
 Although the primary function of the trans-Tango technique is to label downstream neurons, the original literature indicates that the signal strength in downstream neurons depends on the use of upstream neurons evidenced by age-dependent trans-Tango signals. Therefore, the trans-Tango technique can indirectly reflect the usage of upstream neurons. Our findings that GH males showed broader Or47b trans-Tango signals than SH males can indirectly suggest that group-housing experience acts on Or47b neurons. We made textually changes to clarify this.
 
 c) Line 178 Which fly line labels the mushroom body; R19B03-GAL4?
 
 Yes, we now provided the detailed genotypes for all tested flies in the Table S1.
 
 d) Line 184 It was reported in Koganezawa et al., 2016 that some dsx-expressing pC1 neurons are involved in aggressive behavior. The authors should also refer to this paper as they include tussling in the observed aggressive behavior.
 
 Thank you for this comment, and we now cited this reference in the revised manuscript.
 
 e) Line 339 I think you misspelled fruM RNAi.
 
 Thank you for pointing this out. fruMi refers to microRNAi targeting fruM, and we have now clearly stated this information in the main text.
 
 f) Line 681 Is tussling time (%) the total duration of tussling occurrences during the observation time? Or is it the percentage of individuals observed tussling during the observation time? This needs to be clarified.
 
 It is the former one. We now clearly stated this definition in the Materials and Methods section
 
 Reviewer #3 (Recommendations for the authors):
 
 For authors to support their conclusion that enhanced tussling among socially experienced flies allows them to better retain resources, it is necessary to quantify aggressive behaviors (mainly tussling and lunging) in Figure 5.
 
 We agree that we can only make a correlation between enhanced tussling behavior and mating competition. We now toned down this statement throughout the manuscript. For example, in the abstract, we changed our conclusions as following Moreover, shifting from lunging to tussling in socially enriched males is accompanied with better territory control and mating success. Our findings identify distinct sensory and central neurons for two fighting forms and suggest how social experience shapes fighting strategies to optimize reproductive success.
 
 To further address the concern, we now performed additional experiments to silence Or47b or pC1SS2 neurons that almost abolished tussling, and paired these males with control males. We found that males with Or47b or pC1SS2 neurons silenced cannot compete over control males (Figure 6-figure supplement 3), further suggesting the involvement of tussling in territory control and mating competition.
 
 In contrast to the authors' data in Figure 4, movies in ref 36 clearly show instances of 2 flies exchanging lunges after the optogenetic activation of P1a neurons, like the examples shown in supplementary movies S1-S3. It is a clear discrepancy that requires discussion (and raises a concern about the lack of transparency about behavioral quantification).
 
 In our study, optogenetic activation of P1a neurons failed to induce obvious tussling behavior, and temperature-dependent activation of P1a neurons can only induce tussling in the presence of light. These data are different from Hoopfer et al., (2015), but are generally consistent with a new study (Sten et al., Cell, 2025), in which pC1SS2 neurons but not P1a neurons promote aggression. Such discrepancy has now been discussed in the revised manuscript.
 
 The authors often fail to cite relevant references while discussing previous results, which compromises the scholarship of the manuscript. Examples include (but are not limited to)
 
 (1) Line 85-86 Simon and Heberlein, J. Exp. Biol. 223 jeb232439 (2020) suggested that tussling is an important factor for flies to establish a dominance hierarchy.
 
 Reference added.
 
 (2) Line 142-143 Cuticular compounds such as palmitoleic acid are characterized to be the ligands of Or47b by ref #18.
 
 Reference added.
 
 (3) Line 185-187 pC1SS1 and pC1SS2 are first characterized by ref #46. Expression data of this paper also implies that pC1SS1 and pC1SS2 label different neurons in the male brain.
 
 We have now added this reference at the appropriate place in the revised manuscript. In addition, we have clarified that these two drivers exhibit sexually dimorphic expression patterns in the brain.
 
 (4) Line 196-199 Cite ref #36, which describes the behavior induced by the optogenetic activation of P1a neurons.
 
 Reference added.
 
 (5) Line 233-235 The authors' observation that control males do not form a clear dominance directly contradicts previous observations by others (Nilsen et al., PNAS 10112342 (2002); Yurkovic et al., PNAS 10317519 (2006); also see Trannoy et al., PNAS 1134818 (2016) and Simon and Heberlein above). The authors must at least discuss why their results are different.
 
 There is a misunderstanding here. We clearly state that there is a ‘winner takes all’ phenomenon. However, for wild-type males of the same age and housing condition, we calculated the winning index as (num. of wins by unmarked males – num. of wins by marked males)/10 encounters * 100%, which is roughly zero due to the randomness of marking.
 
 (6) Line 251-254 The authors' observation that aged males are less competitive than younger males contradicts the conclusion in ref #18. Discussion is required.
 
 We have now added a discussion on this matter. In brief, Lin et al., showed that 7d-old males are more competitive than 2d-old males, which is probably due to different levels of sexual maturity of males, but not a matter of age like our study that used up to 21d-old males.
 
 (7) Line 274-275 It is unclear which "previous studies" "have found that social isolation generally enhances aggression but decreases mating competition in animal models". Cite relevant references.
 
 Reference added.
 
 (8) Line 309-310 The evidence supporting the statement that "there are only three pairs of pC1SS2 neurons". If there is a reference, cite it. If it is based on the authors' observation, data is required.
 
 We have now provided additional data on the number of pC1SS2 neurons in Figure 5G of the revised manuscript.
 
 AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.10.25.620166v2
www.biorxiv.org www.biorxiv.org

Endothelin B receptor inhibition rescues aging-dependent neuronal regenerative decline

4
1. Public_Reviews 29 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This important study examines the role of endothelin signaling in nerve regeneration, providing convincing evidence that it functions as a default brake on axon regrowth. Inhibiting endothelin signaling with Bosentan promotes regeneration and counteracts the decline in regenerative potential caused by aging. Since Bosentan is an FDA-approved drug, these findings could have therapeutic value in clinical settings where peripheral nerve regeneration is not adequate or seriously impaired, as is often the case in older individuals.
  
  Summary
2. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  The manuscript by Feng et al. reported that Endothelin B receptor (ETBR) expressed by the satellite glial cells (SGCs) in the dorsal root ganglions (DRG) acted to inhibit sensory axon regeneration in both adult and aged mice. Thus, pharmacological inhibition of ETBR with specific inhibitors resulted in enhanced sensory axon regeneration in vitro and in vivo. In addition, sensory axon regeneration significantly reduces in aged mice and inhibition of ETBR could restore such defect in aged mice. Moreover, the study provided some evidence that the reduced level of gap junction protein connexin 43 might act downstream of ETBR to suppress axon regeneration in aged mice. Overall, the study revealed an interesting SGC-derived signal in the DRG microenvironment to regulate sensory axon regeneration. It provided additional evidence that non-neuronal cell types in the microenvironment function to regulate axon regeneration via cell-cell interaction.
  
  However, the molecular mechanisms by which ETBR regulates axon regeneration are unclear, and the structure of the manuscript is relatively not well organized, especially the last section. Some discussion and explanation about the data interpretation are needed to improve the manuscript.
  
  (1) The result showed that the level of ETBR was not changed after the peripheral nerve injury. Does it mean that its endogenous function is to limit the spontaneous sensory axon regeneration? In other words, the results suggest that SGCs expressing ETBR or vascular endothelial cells expressing its ligand ET-1 act to suppress sensory axon regeneration. Some explanation or discussion about this are necessary. Moreover, does the protein level of ETBR or its ligand change during aging?
  
  (2) In ex vivo experiments, NGF was added in the culture medium. Previous studies have shown that adult sensory neurons could initiate fast axon growth in response to NGF within 24 hours. In addition, dissociated sensory neurons could also initiate spontaneous regenerative axon growth without NGF after 48 hours. Some discussion or rationale is needed to explain the difference between NGF-induced or spontaneous axon growth of culture adult sensory neurons and the roles of ETBR and SGCs.
  
  (3) In cultured dissociated sensory neurons, inhibiting ETBR also enhanced axon growth, which meant the presence of SGCs surrounding the sensory neurons. Some direct evidence is needed to show the cellular relationship between them in culture.
  
  (4) In Figure 3, the in vivo regeneration experiments first showed enhanced axon regeneration either at 1 day or 3 days after the nerve injury. The study then showed that inhibiting ETBR could enhance sensory axon growth in vitro from uninjured naïve neurons or conditioning lesioned neurons. To my knowledge, in vivo sensory axon regeneration is relatively slow during the first 2 days after the nerve injury and then enter the fast regeneration mode in the 3rd day, representing the conditioning lesion effect in vivo. Some discussion is needed to compare the in vitro and the in vivo model of axon regeneration.
  
  (5) In Figure 5, the study showed that the level of connexin 43 increased after ETBR inhibition in either adult or aged mice, proposing an important role of connexin 43 in mediating the enhancing effect of ETBR inhibition on axon regeneration. However, in the study there was no direct evidence supporting that ETBR directly regulate connexin 43 expression in SGCs. Moreover, there was no functional evidence that connexin 43 acted downstream of ETBR to regulate axon regeneration.
  
  In the revised manuscript, most comments have been addressed with some new experiments or text revisions in the results or discussion. For representative images showing in vitro cultured DRG neurons, it would be much more convincing if several neurons in the same imaging field are shown, rather than a single neuron (Figure 2A, 3J).
  
  Review 1
3. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Feng and colleagues set out to investigate the effect of manipulating endothelin signaling on nerve regeneration, focusing on the crosstalk between endothelial cells (ECs) in dorsal root ganglia (DRG), which secrete ET-1, and satellite glial cells (SGCs), which express the ETBR receptor. ETBR signaling limits axon growth. Using in vitro explant assays coupled with pharmacological inhibition in mouse models of nerve injury, the authors demonstrate that the ETAR/ETBR antagonist Bosentan promotes axon regeneration, and that this effect is maintained in aged mice. Although Bosentan inhibits both endothelin receptors A and B, comparison with an ETAR-specific antagonist suggests primary involvement of the ET-1/ETBR pathway. In the DRG, ETBR is mostly expressed by SGCs, a cell type implicated in nerve regeneration. SGCs ensheath and couple with DRG neurons through gap junctions formed by Cx43. The pro-regenerative effects of ETBR inhibition are attributed in part to an increase in Cx43 levels, which are expected to enhance neuron-SGC coupling. snRNA sequencing and TEM analysis reveal a decline in SGC numbers, morphological changes, and transcriptional reprogramming that may impair their pro-regenerative capacity.
  
  Strengths:
  
  The study is well-executed, and the main conclusion (that ETBR signaling inhibits axon regeneration after nerve injury and contributes to the age-related decline in regenerative capacity) is well supported by the data. In addition, the study highlights the importance of vascular signals in nerve regeneration, a topic that has gained traction in recent years. Importantly, these results further emphasize the contribution of long-neglected SGCs to nerve tissue homeostasis and repair. Although the study does not provide a complete mechanistic understanding, the findings are robust and are likely to attract the interest of a broad readership.
  
  Weaknesses:
  
  While certain aspects could have been further addressed experimentally, these points were either technically challenging or considered beyond the scope of the current study, and are appropriately addressed in the Discussion.
  
  (1) It remains to be determined whether the accelerated axon regrowth observed after nerve injury depends on cellular crosstalk mediated by ET-1 at the lesion site. Are ECs along the nerve secreting ET-1? What cells are present in the nerve stroma that could respond and participate in the repair process? Would these interactions be sensitive to Bosentan? Dissecting these contributions would require cell-specific manipulations. The potential roles of ECs, fibroblast and SCs in the nerve are discussed.
  
  (2) It is suggested that the permeability of DRG vessels may facilitate the release of vascular-derived signals. The possibility that the ET-1/ETBR pathway modulates vascular permeability, and that this in turn contributes to the observed effects on regeneration, is discussed.
  
  (3) It cannot be excluded that ET-3 in fibroblasts is relevant for controlling SGC responses. The possibility that both ET-1 and ET-3 participate in ETBR- dependent effect on axon regeneration is discussed.
  
  (4) The discovery that ET-1/ETBR signaling in SGC curtails the growth capacity of axons at baseline raises questions about the physiological role of this pathway. This remains to be elucidated with cell type-specific knockout approaches.
  
  (5) The modulation of Cx43 expression by ET-1/ETBR is examined by immunostaining, but a complementary analysis by quantitative RT-PCR on sorted SGCs would have been a valuable addition. However, quantifying Cx43 on purified SGCs was not attainable due to technical complications.
  
  (6) The conclusion "that ETBR inhibition in SGCs contributes to axonal regeneration by increasing Cx43 levels, gap junction coupling or hemichannels and facilitating SGC-neuron communication" are consistent with previous studies (Procacci et al., 2008) but in apparent discrepancy with increased gap junctions and dye coupling in SGCs of aged mice (Huang et al., 2006). More experiments are required to clarify what distinguishes a beneficial increase in coupling after ETBR inhibition, from what is observed in aging.
  
  (7) The effect of Bosentan likely extends beyond the modulation of Cx43 levels. Cell type-specific knockout of Cx43 and ETBR, studies of SGCs-neuron coupling, and biochemical analysis of Cx43 functions would clarify the link between ETBR, Cx43 regulation, and axon regeneration. A discussion of alternative mechanisms is provided.
  
  Review 2
4. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public Review):
  
  The manuscript by Feng et al. reported that the Endothelin B receptor (ETBR) expressed by the satellite glial cells (SGCs) in the dorsal root ganglions (DRG) acted to inhibit sensory axon regeneration in both adult and aged mice. Thus, pharmacological inhibition of ETBR with specific inhibitors resulted in enhanced sensory axon regeneration in vitro and in vivo. In addition, sensory axon regeneration significantly reduces in aged mice and inhibition of ETBR could restore such defect in aged mice. Moreover, the study provided some evidence that the reduced level of gap junction protein connexin 43 might act downstream of ETBR to suppress axon regeneration in aged mice. Overall, the study revealed an interesting SGC-derived signal in the DRG microenvironment to regulate sensory axon regeneration. It provided additional evidence that non-neuronal cell types in the microenvironment function to regulate axon regeneration via cell-cell interaction.
  
  However, the molecular mechanisms by which ETBR regulates axon regeneration are unclear, and the manuscript's structure is not well organized, especially in the last section. Some discussion and explanation about the data interpretation are needed to improve the manuscript.
  
  We thank the reviewer for the positive comments. We agree that the mechanisms by which ETBR signaling functions as a brake on axon growth and regeneration remain to be elucidated. We believe that unraveling the detailed molecular pathways downstream of ETBR signaling in SGCs that promote axon regeneration is beyond the scope of this manuscript. Answering these questions would first require cell specific KO of ETBR and Cx43 to confirm that this pathway is operating in SGCs to control axon regeneration. We would also need to identify how SGCs communicate with neurons to regulate axon regeneration, which is a large area of ongoing research that remains poorly understood. Our data showing that pharmacological inhibition of ETBR with specific FDA-approved inhibitors enhances sensory axon regeneration provide not only new evidence for non-neuronal mechanisms in nerve repair, but also a new potential clinical avenue for therapeutic intervention.
  
  As suggested by the reviewer, we have extensively revised the organization of the manuscript, especially the last section of results. We have performed additional snRNAseq experiments to establish the impact of aging in DRG. We have also performed additional experiments to determine if blocking ETBR improves target tissue reinnervation. Following the reviewer’s suggestion, we have also expanded the Discussion section to discuss alternative mechanisms and o]er additional interpretation of our data. Below we describe how we address each point in detail.
  
  (1) The result showed that the level of ETBR did not change after the peripheral nerve injury. Does this mean that its endogenous function is to limit spontaneous sensory axon regeneration? In other words, the results suggest that SGCs expressing ETBR or vascular endothelial cells expressing its ligand ET-1 act to suppress sensory axon regeneration. Some explanation or discussion about this is necessary. Moreover, does the protein level of ETBR or its ligand change during aging?
  
  We thank the reviewer for this point. Our results indeed indicate that one endogenous function of ETBR is to limit the extent of sensory axon regeneration. This may be a part of a mechanism to limit spontaneous sensory axon growth or plasticity and maladaptive neural rewiring after nerve injury. While the increased growth capacity of damaged peripheral axons can lead to reconnection with their targets and functional recovery, the increased growth capacity can also lead to axonal sprouting of the central axon terminals of injured neurons in the spinal cord, and to pain (see for example Costigan et al 2010, PMID: 19400724). In the context of aging that we describe here, this protective mechanism may hinder beneficial recovery. Other mechanisms that slow axon regeneration have been reported, and include, for example, axonally synthesized proteins, which typically support nerve regeneration through retrograde signaling and local growth mechanisms. RNA binding proteins (RBP) are needed for this process. One such RBP, the RNA binding protein KHSRP is locally translated following nerve injury. Rather than promoting axon regeneration, KHSRP promotes decay of other axonal mRNAs and slows axon regeneration. Another example includes the Rho signaling pathway, which was shown to function as an inhibitory mechanism that slows the growth of spiral ganglion neurites in culture. We have now included these examples in the Discussion section.
  
  To address the reviewer’s second question, we have checked protein levels of ETBR and ET-1 in adult and aged DRG tissue. We observed a robust increase in ET-1 in aged DRG, while the levels of ETBR did not appear to change significantly. These results are now presented in Figure 4- Figure Supplement 1, and further support the notion that in aging, activation of the ETBR signaling hinders axon regeneration.
  
  (2) In ex vivo experiments, NGF was added to the culture medium. Previous studies have shown that adult sensory neurons could initiate fast axon growth in response to NGF within 24 hours. In addition, dissociated sensory neurons could also initiate spontaneous regenerative axon growth without NGF after 48 hours. Some discussion or rationale is needed to explain the di]erence between NGF-induced or spontaneous axon growth of culture adult sensory neurons and the roles of ETBR and SGCs.
  
  We appreciate the reviewer’s suggestion. In adult DRG explant or dissociated cultures, NGF is not typically required for survival or axon outgrowth. However, in dissociated culture, the addition of NGF to the medium stimulates growth from more neurons compared to controls (Smith and Skene 1997). In the DRG explant, NGF does not promote significant e]ects on axon growth, but stimulates glial cell migration (Klimovich et al 2020). We opted to included NGF in our explant assay to increase the potential of stimulating axon regeneration with pharmacological manipulations of ETBR. We have now clarified these considerations in the Method section.
  
  (3) In cultured dissociated sensory neurons, inhibiting ETBR also enhanced axon growth, which meant the presence of SGCs surrounding the sensory neurons. Some direct evidence is needed to show the cellular relationship between them in culture.
  
  We thank the reviewer for raising this point and have added new data, now presented in Figure 2B, to show that in mixed DRG cultures, SGCs labeled with Fabp7 are present in the culture in proximity to neurons labeled with TUJ1, but they do not fully wrap the neuronal soma. These results are consistent with prior findings reporting that as time in culture progresses, SGCs lose their adhesive contacts with neuronal soma and adhere to the coverslip (PMID: 22032231, PMID: 27606776). While in some cases SGCs can maintain their association with neuronal soma in the first day in culture after plating, in our hands, most SGCs have left the soma at the 24h time point we examined.
  
  (4) In Figure 3, the in vivo regeneration experiments first showed enhanced axon regeneration either 1 day or 3 days after the nerve injury. The study then showed that inhibiting ETBR could enhance sensory axon growth in vitro from uninjured naïve neurons or conditioning lesioned neurons. To my knowledge, in vivo sensory axon regeneration is relatively slow during the first 2 days after the nerve injury and then enters the fast regeneration mode on the 3rd day, representing the conditioning lesion e]ect in vivo. Some discussion is needed to compare the in vitro and the in vivo model of axon regeneration.
  
  We agree that axon growth is relatively slow the first 2 days and enters a fast growth mode on day 3. This has been elegantly demonstrated in Shin et al Neuron 2012 (PMID: 22726832), where an in vivo conditioning injury 3 days prior increases axon growth one day after injury. In vitro, similar e]ects have been described: a prior in vivo injury accelerates growth capacity within the first day in culture, but a similar growth mode occurs in naive adult neurons after 2-3 days in vitro (Smith and Skene 1996). We also know that the neurite growth in culture is stimulated by higher cell density, likely because non-neuronal cells can secrete trophic factors (Smith and Skene 1996). Our in vitro results thus suggest that blocking ETBR in SGCs in these mixed cultures may alter the media towards a more growth promoting state. In vivo, our data show that Bosentan treatment for 3 days partially mimics the conditioning injury and potentiate the e]ect of the conditioning injury. One possible interpretation is that inhibition of ETBR alters the release of trophic factors from SGCs. Future studies will be required to unravel how ETBR signaling influence the SGCs secretome and its influence on axon growth. We have now included these discussions points in the Results and Discussion Section.
  
  (5) In Figure 5, the study showed that the level of connexin 43 increased after ETBR inhibition in either adult or aged mice, proposing an important role of connexin 43 in mediating the enhancing e]ect of ETBR inhibition on axon regeneration. However, in the study, there was no direct evidence supporting that ETBR directly regulates connexin 43 expression in SGCs. Moreover, there was no functional evidence that connexin 43 acted downstream of ETBR to regulate axon regeneration.
  
  We thank the reviewer for this point and agree that we do not provide direct evidence that connexin 43 acts downstream of ETBR to regulate axon regeneration. To obtain such functional evidence would require selective KO of ETBR and Cx43 in SGCs, which we believe is beyond the scope of the current study. We have revised the Results and Discussion sections to emphasize that while we observe that ETBR inhibition increases Cx43 levels and Cx43 levels correlates with axon regeneration, whether Cx43 directly mediates the e]ect on axon regeneration remains to be established. We also discuss potential alternative mechanisms downstream of ETBR in SGCs that could contribute to the observed e]ects on axon regeneration. Specifically, we discuss the possibility that ETBR signaling may limit axon regeneration via regulating SGCs glutamate reuptake functions, because of the following reasons: 1) Similarly to astrocytes, glutamate uptake by SGCs is important to regulate neuronal function, 2) exposure of cultured cortical astrocytes to endothelin results in a decrease in glutamate uptake that correlates with a major loss of basal glutamate transporter expression (GLT-1 and1), 3) Both glutamate transporters are expressed in SGCs in sensory ganglia 4) GLAST and glutamate reuptake function is important for lesion-induced plasticity in the developing somatosensory cortex.
  
  Reviewer #2 (Public Review):
  
  Summary:
  
  In this interesting and original study, Feng and colleagues set out to address the e]ect of manipulating endothelin signaling on nerve regeneration, focusing on the crosstalk between endothelial cells (ECs) in dorsal root ganglia (DRG), which secrete ET-1 and satellite glial cells (SGCs) expressing ETBR receptor. The main finding is that ETBR signaling is a default brake on axon growth, and inhibiting this pathway promotes axon regeneration after nerve injury and counters the decline in regenerative capacity that occurs during aging. ET-1 and ETBR are mapped in ECs and SGCs, respectively, using scRNA-seq of DRGs from adult or aged mice. Although their expression does not change upon injury, it is modulated during aging, with a reported increase in plasma levels of ET-1 (a potent vasoconstrictive signal). Using in vitro explant assays coupled with pharmacological inhibition in mouse models of nerve injury, the authors demonstrate that ET-1/ETBR curbs axonal growth, and the ETAR/ETBR antagonist Bosentan boosts regrowth during the early phase of repair. In addition, Bosentan restores the ability of aged DRG neurons to regrow after nerve lesions. Despite Bosentan inhibiting both endothelin receptors A and B, comparison with an ETAR-specific antagonist indicates that the e]ects can be attributed to the ET-1/ETBR pathway. In the DRGs, ETBR is mostly expressed by SGCs (and a subset of Schwann cells) a cell type that previous studies, including work from this group, have implicated in nerve regeneration. SGCs ensheath and couple with DRG neurons through gap junctions formed by Cx43. Based on their own findings and evidence from the literature, the pro-regenerative e]ects of ETBR inhibition are in part attributed to an increase in Cx43 levels, which are expected to enhance neuron-SGC coupling. Finally, gene expression analysis in adult vs aged DRGs predicts a decrease in fatty acid and cholesterol metabolism, for which previous work by the authors has shown a requirement in SGCs to promote axon regeneration.
  
  Strengths:
  
  The study is well-executed and the main conclusion that "ETBR signaling inhibits axon regeneration after nerve injury and plays a role in age-related decline in regenerative capacity" (line 77) is supported by the data. Given that Bosentan is an FDA-approved drug, the findings may have therapeutic value in clinical settings where peripheral nerve regeneration is suboptimal or largely impaired, as it often happens in aged individuals. In addition, the study highlights the importance of vascular signals in nerve regeneration, a topic that has gained traction in recent years. Importantly, these results further emphasize the contribution of longneglected SGCs to nerve tissue homeostasis and repair. Although the study does not reach a complete mechanistic understanding, the results are robust and are expected to attract the interest of a broader readership.
  
  We thank the reviewer for the positive comments, especially in regard to the rigor and originality of our study.
  
  Weaknesses:
  
  Despite these positive comments provided above, the following points should be considered:
  
  (1) This study examines the contribution of the ET-1 pathway in the ganglia, and in vitro assays are consistent with the idea that important signaling events take place there. Nevertheless, it remains to be determined whether the accelerated axon regrowth observed in vivo depends also on cellular crosstalk mediated by ET-1 at the lesion site. Are ECs along the nerve secreting ET-1? What cells are present in the nerve stroma that could respond and participate in the repair process? Would these interactions be sensitive to Bosentan? It may be di]icult to dissect this contribution, but it should at least be discussed.
  
  We thank the reviewer for this important point and agree that the in vivo e]ects observed cannot rule out the contribution of ECs or SCs at the lesion site in the nerve. Dissecting the contribution of ETBR expressing cells in the nerve would require cell-specific manipulations that go beyond the scope of this manuscript. We have revised the Discussion section to highlight the potential contribution of ECs, fibroblast and SCs in the nerve.
  
  (2) It is suggested that the permeability of DRG vessels may facilitate the release of "vascularderived signals" (lines 82-84). Is it possible that the ET-1/ETBR pathway modulates vascular permeability, and that this, in turn, contributes to the observed e]ects on regeneration?
  
  We thank the reviewer for raising this interesting point. ET-1 can have an impact on vascular permeability. It was indeed shown that in high glucose conditions, increased trans-endothelial permeability is associated with increased Edn1, Ednra and Ednrb expression and augmented ET1 immunoreactivity (PMID: 10950122). It is thus possible that part of the e]ects observed results from altered vascular permeability. We have included this point in the Discussion section. Future experiments will be required to test how injury and age a]ects vascular permeability in the DRG.
  
  (3) Is the a]inity of ET-3 for ETBR similar to that of ET-1? Can it be excluded that ET-3 expressed by fibroblasts is relevant for controlling SGC responses upon injury/aging?
  
  We thank the reviewer for raising this point. ET-1 binds to ETAR and ETBR with the same a]inity, but ET3 shows a higher a]inity to ETBR than to ETAR (Davenport et al. Pharmacol. Rev 2016 PMID: 26956245). We attempted to examine ET-3 level in adult and aged DRG by western blot, but in our hands the antibody did not work well enough, and we could not obtain clear results. We thus cannot exclude the possibility that ET-3 released by fibroblasts contribute to the e]ects we observe on axon regeneration. Indeed, in cultured cortical astrocytes, application of either ET-1 or ET-3 leads to inhibition of Cx43 expression. We have revised the text in the Discussion section to highlight the possibility that both ET-1 and ET-3 could participate on the ETBRdependent e]ect on axon regeneration.
  
  (4) ETBR inhibition in dissociated (mixed) cultures uncovers the restraining activity of endothelin signaling on axon growth (Figure 2C). Since neurons do not express ET-1 receptors, based on scRNA-seq analysis, these results are interpreted as an indication that basal ETBR signaling in SGC curbs the axon growth potential of sensory neurons. For this to occur in dissociated cultures, however, one should assume that SGC-neuron association is present, similar to in vivo, or to whole DRG cultures (Figure 2C). Has this been tested?
  
  We thank the reviewer for this point. In dissociated DRG culture, neurons, SGCs and other nonneuronal cells are present, but SGCs do not retain the surrounding morphology as they do in vivo. Within 24 hours in culture, SGCs lose their adhesive contacts with neuronal soma and adhere to the coverslip (PMID: 22032231, PMID: 27606776). We have included new data in Figure 2B to show that in our culture conditions, SGCs are present, but do not wrap neurons soma as they do in vivo. We also know from prior studies that the density of the culture a]ects axon growth, an e]ect that was attributed to trophic factors released from non-neuronal cells (Smith and Skene 1997). Therefore, although SGCs do not surround neurons, the signaling pathway downstream of ETBR may be present in culture and contribute to the release of trophic factors that influence axon growth. We have revised the Results section to better explain our in vitro results and their interpretation.
  
  In both in vitro experimental settings (dissociated and whole DRG cultures) how is ETBR stimulated over up to 7 days of culture? In other words, where does endothelin come from in these cultures (which are unlikely to support EC/blood vessel growth)? Is it possible that the relevant ligand here derives from fibroblasts (see point #6)? Or does it suggest that ETBR can be constitutively active (i.e., endothelin-independent signaling)? Is there any chance that endothelin is present in the culture media or Matrigel?
  
  We thank the reviewer for raising this point. Our single-cell data indicate that ET-1 is expressed by endothelial cells and ET-3 by fibroblasts. In dissociated DRG culture at 24h time point, all DRGs cells are present, including endothelial cells and fibroblasts, and could represent the source of ET-1 or ET-3. In the explant setting, it is also possible that both ET-1 and ET-3 are released by endothelial cells and fibroblasts during the 7 days in culture. According to information for the suppliers, endothelin is not present neither in the culture media nor in the Matrigel. While mutations can facilitate the constitutive activity of the ETBR receptor, we are not aware of data showing that endogenous ETBR can be constitutively active. Because the molecular mechanisms governing ETBR -mediated signaling remain incompletely understood (see for example PMID: 39043181, PMID: 39414992) future studies will be required to elucidate the detailed mechanisms activating ETBR in SGCs and its downstream signaling mechanisms. We have now expanded the Results and discussion sections to clarify these points.
  
  (5) The discovery that ET-1/ETBR signaling in SGC curtails the growth capacity of axons at baseline raises questions about the physiological role of this pathway. What happens when ETBR signaling is prevented over a longer period of time? This could be addressed with pharmacological inhibitors, or better, with cell-specific knock-out mice. The experiments would certainly be of general interest, although not within the scope of this story. Nevertheless, it could be worth discussing the possibilities.
  
  We agree that this is an interesting point. As mentioned above in response to point #1 of reviewer 1, the physiological role of this pathway could be to limit plasticity and prevent maladaptive neural rewiring that can happen after injury (Costigan et al 2009, PMID: 19400724), but can also hinder beneficial recovery after injury. Other mechanisms that limit axon regeneration capacity have been described and involve local mRNA translation and Rho signaling. We have revised the Discussion section to include these points. We agree that understanding the consequence of blocking ETBR over longer time periods is beyond the scope of the current study, but we now discuss the possibility that blocking ETBR with a cell specific KO approach could unravel its physiological function on target innervation and behavior.
  
  (6) Assessing Cx43 levels by measuring the immunofluorescence signal (Figure 5E-F) is acceptable, particularly when the aim is to restrict the analysis to SGCs. The modulation of Cx43 expression by ET-1/ETBR plays an important part in the proposed model. Therefore, a complementary analysis of Cx43 expression by quantitative RT-PCR on sorted SGCs would be a valuable addition to the immunofluorescence data. Is this attainable?
  
  We agree and have attempted to perform these types of experiments but encountered technical di]iculties. We attempted to sorting SGCs from transgenic mice in which SGCs are fluorescently labeled. However, the cells did not survive the sorting process and died in culture. We think that increasing the viability of cells after sorting would require capillary- free fluorescent sorting approaches. However, we do not currently have access to such technology. We attempted this experiment with cultured SGCs, following a previously published protocol (Tonello et al. 2023 PMID: 38156033). In these experiments, SGCs are cultured for 8 days to obtain purity. We did not observe any di]erence in Cx43 protein or mRNA level upon treatment with ET-1 with or without BQ788. However, in these SGCs cultures, Cx43 displayed a di]use localization, rather than puncta as observed in vivo. Therefore, despite our multiple attempts, quantifying Cx43 on sorted or purified SGCs was not attainable.
  
  (7) The conclusions "We thus hypothesize that ETBR inhibition in SGCs contributes to axonal regeneration by increasing Cx43 levels, gap junction coupling or hemichannels and facilitating SGC-neuron communication" (lines 303-305) are consistent with the findings but seem in contrast with the e]ect of aging on gap junction coupling reported by others and cited in line 210: "the number of gap junctions and the dye coupling between these cells increases (Huang et al., 2006)". I am confused by what distinguishes a potential, and supposedly beneficial, increase in coupling after ETBR inhibition, from what is observed in aging.
  
  We agree that the aging impact of Cx43 level and gap junction number appears contradictory. Procacci et al 2008 reported that Cx43 expression in SGCs decreases in the aged mice. Huang et al 2006 report that both the number of gap junctions and the dye coupling between these cells were found to increase with aging. Procacci et al suggested as a possible explanation for this apparent discrepancy that additional connexin types other than Cx43 may contribute to the gap junctions between SGCs in aged mice. Our snRNAseq data did not allow us to verify this hypothesis, because there were less SGCs in aged mice compared to adult, and connexin genes were detected in only 20% or less of SGCs. Furthermore, our quantification did not look specifically at gap junctions, but just at Cx43 puncta. Cx43 can also form hemichannels in addition to gap junctions, and can also perform non-channel functions, such as protein interaction, cell adhesion, and intracellular signaling. Thus, more research examining the role of Cx43 in SGCs is necessary to address this discrepancy in the literature. We have expanded the Discussion section to include these points.
  
  (8) I find it di]icult to reconcile the results in Figure 5F with the proposed model since (1) injury increases Cx43 levels in both adult and aged mice, (2) the injured aged/vehicle group has a similar level to the uninjured adult group, (3) upon injury, aged+Bosentan is much lower than adult+Bosentan (significance not tested). It seems hard to explain the e]ect of Bosentan only through the modulation of Cx43 levels. Whether the increase in Cx43 levels following ETBR inhibition actually results in higher SGC-neuron coupling has not been assessed experimentally.
  
  We thank the reviewer for this point and agree that the e]ect of Bosentan is likely not exclusively through the modulation of Cx43 levels in SGCs, and that Cx43 levels may simply correlate with axon regenerative capacity. We have revised the manuscript to clarify this point. We have also added the missing significance test in Figure 5F.
  
  Cell specific KO of Cx43 and ETBR would allow to test this hypothesis directly but is beyond the scope of the current study. We have not tested SGCs-neuron coupling, as these experiments are currently beyond our area of expertise. Cx43 has also other functions beyond gap junction coupling, such as protein interaction, cell adhesion, and intracellular signaling. Investigating the precise function of Cx43 would require in depth biochemical and cell specific experiments that are beyond the scope of this study. Furthermore, as we now mentioned in response to reviewer #2 point 5, ETBR signaling may also have other downstream e]ects in SGCs, such as glutamate transporters expression, or a]ect other cells in the nerve during the regeneration process. We have revised the Discussion section to include these alternative mechanisms.
  
  Reviewer #3(Public Review):
  
  Summary:
  
  This manuscript suggests that inhibiting ETBR via the FDA-approved compound Bosentan can disrupt ET-1-ETBR signalling that they found detrimental to nerve regeneration, thus promoting repair after nerve injury in adult and aged mice.
  
  Strengths:
  
  (1) The clinical need to identify molecular and cellular mechanisms that can be targeted to improve repair after nerve injury.
  
  (2) The proposed mechanism is interesting.
  
  (3) The methodology is sound.
  
  We thank the reviewer for highlighting the strengths of our study
  
  Weaknesses:
  
  (1) The data appear preliminary and the story appears incomplete.
  
  We appreciate the reviewer’s point. We would like to emphasize that our results provide compelling evidence that ETBR signaling is a default brake on axon growth, and inhibiting this pathway promotes axon regeneration after nerve injury and counters the decline in regenerative capacity that occurs during aging. We also provide evidence that ETBR signaling regulates the levels of Cx43 in SGCs. Furthermore, our results document the use of an FDA approved compound to increase axon regeneration may be of interest to the broader readership, as there is currently no therapies to improve or accelerate nerve repair after injury. We agree that the detailed mechanisms operating downstream of ETBR will need to be elucidated. Answering these questions would first require cell specific KO of ETBR and Cx43 to confirm that this pathway is operating in SGCs to control axon regeneration. We would also need to identify how SGCs communicate with neurons to regulate axon regeneration, which is a large area of ongoing research that remains poorly understood. This extensive and highly complex set of experiments is beyond the scope of the current study. As we discussed in our response to reviewer #1 and #2 we attempted to perform numerous additional experiments to better define the role of ETBR signaling in SGCs in aging and have included additional results in Fig. 2B, Fig 3G-H, Fig 5A-E, and Figure 4- Figure Supplement 1and Figure 5- Figure Supplement 1. We have expanded the
  
  Discussion to acknowledge the limitation of our study and to discuss possible mechanisms.
  
  (2) Lack of causality and clear cellular and molecular mechanism. There are also some loose ends such as the role of connexin 43 in SGCs: how is it related to ET-1- ETBR signalling?
  
  We thank the reviewer for this point and agree that the molecular mechanisms downstream of ETBR remain to be elucidated. However, we believe that our manuscript reports an interesting potential of an FDA-approved compound in promoting nerve repair. We focused on Cx43 downstream of ETBR signaling because decreased Cx43 expression in SGCs in ageing was previously established, but the mechanisms were not elucidated. Furthermore, it was reported that ET1 signaling in cultured astrocytes, which share functional similarities with SGCs, leads to the closure of gap junctions and reduction in Cx43 expression. Our study thus provides a mechanism by which ETBR signaling in SGCs regulates Cx43 expression. Whether Cx43 directly impact axon regeneration remains to be tested. Cell specific KO of Cx43 and ETBR would be required to answer this question. We have revised the Introduction and Discussion section extensively to provide a link between ETBR and Cx43 and to acknowledge the lack of causality in Cx43 in SGCs, as well as to provide additional potential mechanisms by which ETBR inhibition may promote nerve repair.
  
  Reviewer #2 (Recommendations For The Authors):
  
  In addition to the points listed in the Public Review section, please consider the following comments:
  
  (1) ETAR, which is high in mural cells, does not seem to be implicated in the reported proregenerative e]ects. Even so, can vasoconstriction be ruled out as an underlying cause of the age-dependent decline in axon regrowth potential and, more generally, in the e]ects of ET-1 inhibition on regeneration? This could be discussed.
  
  We agree that we can’t exclude a role in vasoconstriction or e]ect on vascular permeability in the age-dependent decline in axon regrowth potential. However, our in vitro and ex vivo experiments, in which vascular related mechanisms are unlikely, suggest that vasoconstriction may not be a major contributor to the e]ects we observed.
  
  (2) The manuscript (e.g. line 287-288) would benefit from a discussion of the role that blood vessels play in the peripheral nervous system, and possibly CNS, repair. Vessels were shown to accompany regenerating fibers and instruct the reorganization of the nerve tissue to favor repair potentially through the release of pro-regenerative signals acting on stromal cells, glia, and other cellular components. Highlighting these processes will help put the current findings into perspective.
  
  We agree and have revised the Discussion section to better explain the role of blood vessels in orientating Schwann cells migration and guiding axon regeneration.
  
  (3) The vast majority of the cells that are sequenced and shown in the UMAP in Figure 1C are from adult (3-month-old) mice [16,923 out of 18,098]. It would be useful to include the UMAP split (or color-coded) by timepoint to appreciate changes in cell clustering that may occur with aging.
  
  We apologize for this misunderstanding, Figure 1C had all cells from all ages. However, the number of cells we obtained from the age group was insu]icient to perform in depth analysis of each cell type. We have thus revised this section and Figure 1, now only presenting the data from adult mice.
  
  It is not discussed why fewer cells were sequenced at later stages. Additionally, I do not know how to interpret the double asterisks next to the labeling "18,098 samples" in Figure 1C.
  
  Since our original sequencing of adult and aged mice using 10x yielded so few cells from the aged DRG, we tested and optimized a new technology for single cell preparation of DRG using Illumina Single Cell 3’ RNA Prep. This preparation creates templated emulsions using a vortex mixer to capture and barcode single-cell mRNA instead of a microfluidics system. This method yielded much better results for nuclei recovery from aged DRG, with more nuclei and better quality of nuclei. Thus, we now present in Figure 5 and Figure 5- Figure Supplement 1 the results from snRNA-sequencing of aged and adult DRG using the Illumina single cell kit. The results of the snRNA-sequencing show a decreased abundance of SGCs in aged mice, consistent with the results from our morphology analysis with EM. We were also able to perform SGCs-specific pathway analysis because of the increased number of nuclei captured in the aged SGCs, which we included in the manuscript.
  
  (4) The in vivo studies are designed to examine the e]ects of ETBR inhibition during the first phase of axon regrowth after nerve injury (1-3 days post-injury, dpi). Is there a reason why later stages have not been studied? It would be interesting to understand whether ETBR inhibition improves long-term recovery or is only e]ective at boosting the initial growth of axons through the lesion. It is possible that early inhibition will be enough for long-term recovery. If so, these experiments would define a sensitivity window with therapeutic value.
  
  We agree that assessing functional recovery requires proper behavioral tests or morphological evaluations of reinnervation. To determine if Bosentan treatment has long-term e]ects on recovery, we administered Bosentan or vehicle for 3 weeks (daily for 1 week, and then once a week for the subsequent 2 weeks) after sciatic nerve crush. At 24 days after SNC, we assessed intraepidermal nerve fiber density (IENFD) in the injured paw and saw a trend towards increased fibers/mm in the treated animals (new Figure 3G,H). Future studies will examine how long-term Bosentan treatment a]ects functional recovery and innervation at later time points. Additionally, behavior assays will be needed to determine if these morphological changes relate to behavioral improvements using IENFD and behavior assays.
  
  (5) I am unsure if the gene expression analysis shown in Figure 6 fits well into this story. It is interesting per se and in line with previous work from this group showing the relevance of fatty acid metabolism in SGCs for axon regeneration. Nevertheless, without a mechanistic link to endothelin signaling and Cx43/gap junction modulation, the observations derived from DEG analysis are not well integrated with the rest and may be more distracting than helpful. One limitation is that there is no cell-type information for the DEGs due to the small number of cells recovered from aged mice. For instance, if ETBR inhibition rescued gene downregulation associated with fatty acid/cholesterol metabolism, then the DGE results would become more relevant for understanding the cellular basis of the pro-regenerative e]ect, which at this point remains quite speculative (lines 264-265; lines 318-319).
  
  We agree and have added new snRNA sequencing data to replace these findings (see above response to point #4, new Figure 5 and Figure 5- Figure Supplement 1. The new data shows a decreased abundance of SGCs in aged mice, consistent with our TEM results. Pathway analysis revealed that aging triggers extensive transcriptional reprogramming in SGCs, reflecting heightened demands for structural integrity, cell junction remodeling, and glia–neuron interactions within the aged DRG microenvironment.
  
  (6) It would be interesting to determine whether Bosentan increases SGC coverage of neuronal cell bodies in aged mice (Figures 6A-C).
  
  We agree that this would be very interesting, but will require extensive EM analysis at di]erent time points and is beyond the scope of the current manuscript.
  
  (7) Finally, adding a summary model would help the readers.
  
  We agree and have made a summary model, now presented in Figure 6F.
  
  Reviewer #3 (Recommendations For The Authors):
  
  Longer time points post-injury and assessment of functional recovery after Bosentan would be of great value here.
  
  We agree that assessing functional recovery requires proper behavioral tests or morphological evaluations of reinnervation. To determine if Bosentan treatment has long-term e]ects on recovery, we administered Bosentan or vehicle for 3 weeks (daily for 1 week, and then once a week for the subsequent 2 weeks) after sciatic nerve crush. At 24 days after SNC, we assessed intraepidermal nerve fiber density in the injured paw and saw a trend towards increased fibers/mm in the treated animals (Fig 3). While the results do not reach significance, we decided to include this new data as it provides evidence that Bosentan treatment may also improves long term recovery. Future studies will be required examine how long-term Bosentan treatment a]ects functional recovery and innervation at later time points. Additionally, behavior assays will be needed to determine if these morphological changes relate to behavioral improvements.
  
  It would be important to know how ET-1- ETBR signalling axis promotes the regeneration of axons:this remains unaddressed. What are the cells that are specifically involved? Endothelial cellsSGC- neurons- SC? There are no experiments addressing the role of any of these?
  
  We agree that the molecular and cellular mechanisms by which ETBR signaling in SGCs promote axon regeneration remains to be elucidated. Answering these questions would first require cell specific KO of ETBR and Cx43 to confirm that this pathway is operating in SGCs to control axon regeneration. We would also need to identify how SGCs communicate with neurons to regulate axon regeneration, which is a large area of ongoing research that remains poorly understood. While these are important experiments, because of numerous technical and temporal constrains, we believe they are beyond the scope of the current manuscript.
  
  How does connexin 43 in SGCs related to ET-1- ETBR signalling?
  
  The relation between connexin 43 and ETBR signaling stems from observations made in astrocytes. ET1 signaling in cultured astrocytes, which share functional similarities with SGCs, was shown to lead to the closure of gap junctions and the reduction in Cx43 expression. Because Cx43 expression, a major connexin expressed in SGCs as in astrocytes, was previously shown to be reduced at the protein level in SGCs from aged mice, we decided to explore it this ETBR-Cx43 mechanism also operates in SGCs. We have revised the Introduction and Discussion section extensively to acknowledge the lack of causality in Cx43 expression SGCs and to provide additional potential mechanisms by which ETBR inhibition may promote nerve repair.
  
  AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.06.08.597928v3
www.biorxiv.org www.biorxiv.org

Non-equilibrium strategies for ligand specificity in signaling networks

3
1. Public_Reviews 29 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study presents a valuable finding about how receptor-ligand binding pathways with multi-site phosphorylation can show non-monotonic responses to increasing ligand affinity and to kinase activity. The authors provide convincing evidence through a simple ordinary differential equation model of such signaling networks with the key new ingredient of ligand-induced receptor degradation. The work will be of interest to physicists and biologists working on signal transduction and biological information processing.
  
  Summary
2. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors study the steady-state solutions of ODE models for molecular signaling involving ligand binding coupled to multi-site phosphorylation at saturating ligand concentrations. Although the results are in principle general, the work highlights the receptor tyrosine kinases (RTK) as model systems. After presenting previous ODE model solutions, the authors present their own "kinetic sorting" model, which is distinguished by ligand-induced phosphorylation-dependent receptor degradation and the property that every phosphorylation state is signaling competent. The authors show that this model recovers the two types of non-monotonicity experimentally reported for RTKs: maximum activity for intermediate ligand affinity and maximum activity for intermediate kinase activity.
  
  The main contribution of the work is in demonstrating that their model can capture both types of non-monotonicity, whereas previous models could at most capture non-monotonicity of ligand binding.
  
  Strengths:
  
  The question of how energy-dissipating, and thus non-equilibrium, molecular systems can achieve steady-state solutions not accessible to equilibrium systems is of fundamental importance in biomolecular information processing and self-organization. Although the authors do not address the energy requirements of their non-equilibrium model, their comparative analysis of different alternative non-equilibrium models provides insight into the design choices necessary to achieve non-monotonic control, a property that is inaccessible at equilibrium.
  
  The paper is succinctly written and easy to follow, and the authors achieve their aims by providing convincing numerical solutions demonstrating non-monotonicity over the range of parameter values encompassing the biologically relevant regime.
  
  Weaknesses:
  
  (1) A key motivating framework for this work is the argument that the ability to tune to recognize intermediate ligand affinities provides a control knob for signal selection that is available to non-equilibrium systems. As such, this seems like a compelling type of ligand selectivity, which is a question of broad interest. However, as the authors note in the results, the previously published "limited signaling model" already achieves such non-monotonicity in ligand binding affinity. The introduction and abstract do not clearly delineate the new contributions of the model.
  
  The novel benefit of the model introduced by the authors is that it also achieves a non-monotonic response to kinase activity. Because such non-monotonicity is observed for RTK, this would make the authors' model a better fit for capturing RTK behavior. However, the broad significance of achieving non-monotonicity to kinase activity is not motivated or supported by empirical evidence in the paper. As such, the conceptual significance of the modified model presented by the authors is not clear.
  
  (2) Whereas previous models used in the literature are schematized in Figure 1, the model proposed by the authors is missing (see line 97 of page 3). Without the schematic, the text description of the model is incomplete.
  
  (3) The authors use the activity of the first phosphorylation site as the default measure of activity. This choice needs to be justified. Why not use the sum of the activities at all sites?
  
  Review 1
3. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In classical models of signaling networks, the signaling activity increases monotonically with the ligand affinity. However, certain receptors prefer ligands of intermediate affinity. In the paper, the authors present a new minimal model to derive generic conditions for ligand specificity. In brief, this requires multi-site phosphorylation and that high-aﬃnity complexes be more prone to degrade. This particular type of kinetic discrimination allows for overcoming equilibrium constraints.
  
  Strengths:
  
  The model is simple, and it adds only a few parameters to classical generic models. Moreover, the authors vary these additional parameters in ranges based on experimental observations. They explain how the introduction of these new parameters is essential to ligand specificity. Their model quantitatively reproduces the ligand specificity of a certain receptor. Finally, they provide a testable prediction.
  
  Weaknesses:
  
  The naming of certain variables may be confusing to readers.
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.10.01.615884v2
www.biorxiv.org www.biorxiv.org

Atypical collective oscillatory activity in cardiac tissue uncovered by optogenetics

3
1. Public_Reviews 29 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This important work provides mechanistic insights into the development of cardiac arrhythmia and establishes a new experimental use case for optogenetics in studying cardiac electrophysiology. The agreement between computational models and experimental observations provides a convincing level of evidence that wave train-induced pacemaker activity can originate in continuously depolarized tissue, with the limitation that there may be differences between depolarization arising from constant optogenetic stimulation, as opposed to pathophysiological tissue depolarization. Future experiments in vivo and in other tissue preparations would extend the generality of these findings.
  
  Summary
2. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The study by Teplenin and coworkers assesses the combined effects of localized depolarization and excitatory electrical stimulation in myocardial monolayers. They study the electrophysiological behaviour of cultured neonatal rat ventricular cardiomyocytes expressing the light-gated cation channel Cheriff, allowing them to induce local depolarization of varying area and amplitude, the latter titrated by the applied light intensity. In addition, they used computational modeling to screen for critical parameters determining state transitions and to dissect the underlying mechanisms. Two stable states, thus bistability, could be induced upon local depolarization and electrical stimulation, one state characterized by a constant membrane voltage and a second, spontaneously firing, thus oscillatory state. The resulting 'state' of the monolayer was dependent on the duration and frequency of electrical stimuli, as well as the size of the illuminated area and the applied light intensity, determining the degree of depolarization as well as the steepness of the local voltage gradient. In addition to the induction of oscillatory behaviour, they also tested frequency-dependent termination of induced oscillations.
  
  Strengths:
  
  The data from optogenetic experiments and computational modelling provide quantitative insights into the parameter space determining the induction of spontaneous excitation in the monolayer. The most important findings can also be reproduced using a strongly reduced computational model, suggesting that the observed phenomena might be more generally applicable.
  
  Weaknesses:
  
  While the study is thoroughly performed and provides interesting mechanistic insights into scenarios of ventricular arrhythmogenesis in the presence of localized depolarized tissue areas, the translational perspective of the study remains relatively vague. In addition, the chosen theoretical approach and the way the data are presented might make it difficult for the wider community of cardiac researchers to understand the significance of the study.
  
  Review 1
3. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  In the presented manuscript, Teplenin and colleagues use both electrical pacing and optogenetic stimulation to create a reproducible, controllable source of ectopy in cardiomyocyte monolayers. To accomplish this, they use a careful calibration of electrical pacing characteristics (i.e., frequency, number of pulses) and illumination characteristics (i.e., light intensity, surface area) to show that there exists a "sweet spot" where oscillatory excitations can emerge proximal to the optogenetically depolarized region following electrical pacing cessation, akin to pacemaker cells. Furthermore, the authors demonstrate that a high-frequency electrical wave-train can be used to terminate these oscillatory excitations. The authors observed this oscillatory phenomenon both in vitro (using neonatal rat ventricular cardiomyocyte monolayers) and in silico (using a computational action potential model of the same cell type). These are surprising findings and provide a novel approach for studying triggered activity in cardiac tissue.
  
  The study is extremely thorough and one of the more memorable and grounded applications of cardiac optogenetics in the past decade. One of the benefits of the authors' "two-prong" approach of experimental preps and computational models is that they could probe the number of potential variable combinations much deeper than through in vitro experiments alone. The strong similarities between the real-life and computational findings suggest that these oscillatory excitations are consistent, reproducible, and controllable.
  
  Triggered activity, which can lead to ventricular arrhythmias and cardiac sudden death, has been largely attributed to sub-cellular phenomena, such as early or delayed afterdepolarizations, and thus to date has largely been studied in isolated single cardiomyocytes. However, these findings have been difficult to translate to tissue and organ-scale experiments, as well-coupled cardiac tissue has notably different electrical properties. This underscores the significance of the study's methodological advances: the use of a constant depolarizing current in a subset of (illuminated) cells to reliably result in triggered activity could facilitate the more consistent evaluation of triggered activity at various scales. An experimental prep that is both repeatable and controllable (i.e., both initiated and terminated through the same means).
  
  The authors also substantially explored phase space and single-cell analyses to document how this "hidden" bi-stable phenomenon can be uncovered during emergent collective tissue behavior. Calibration and testing of different aspects (e.g., light intensity, illuminated surface area, electrical pulse frequency, electrical pulse count) and other deeper analyses, as illustrated in Appendix 2, Figures 3-8, are significant and commendable.
  
  Given that the study is computational, it is surprising that the authors did not replicate their findings using well-validated adult ventricular cardiomyocyte action potential models, such as ten Tusscher 2006 or O'Hara 2011. This may have felt out of scope, given the nice alignment of rat cardiomyocyte data between in vitro and in silico experiments. However, it would have been helpful peace-of-mind validation, given the significant ionic current differences between neonatal rat and adult ventricular tissue. It is not fully clear whether the pulse trains could have resulted in the same bi-stable oscillatory behavior, given the longer APD of humans relative to rats. The observed phenomenon certainly would be frequency-dependent and would have required tedious calibration for a new cell type, albeit partially mitigated by the relative ease of in silico experiments.
  
  For all its strengths, there are likely significant mechanistic differences between this optogenetically tied oscillatory behavior and triggered activity observed in other studies. This is because the constant light-elicited depolarizing current is disrupting the typical resting cardiomyocyte state, thereby altering the balance between depolarizing ionic currents (such as Na+ and Ca2+) and repolarizing ionic currents (such as K+ and Ca2+). The oscillatory excitations appear to later emerge at the border of the illuminated region and non-stimulated surrounding tissue, which is likely an area of high source-sink mismatch. The authors appear to acknowledge differences in this oscillatory behavior and previous sub-cellular triggered activity research in their discussion of ectopic pacemaker activity, which is canonically expected more so from genetic or pathological conditions. Regardless, it is exciting to see new ground being broken in this difficult-to-characterize experimental space, even if the method illustrated here may not necessarily be broadly applicable.
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.04.22.650000v1
www.biorxiv.org www.biorxiv.org

Massively parallel reporter assay for mapping gene-specific regulatory regions at single nucleotide resolution

5
1. Public_Reviews 29 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This manuscript presents a valuable methodological approach for investigating context-dependent activity of cis-regulatory elements within defined genomic loci. The authors combine a locus-specific massively parallel reporter assay, enabling unbiased and high-coverage profiling of enhancer activity across large genomic regions, with a degenerate reporter assay to identify nucleotides critical for enhancer function. The data supporting the conclusions are solid, highlighted by the successful identification and characterization of both previously known and new regulatory elements across multiple developmental stages, cell types, and species; however, concerns regarding assay sensitivity, statistical rigor in distinguishing active regions, and limitations inherent to the design of the reporter assays remain to be addressed. With strengthened quantitative analysis, statistical validation, and additional functional experiments to directly establish regulatory element-gene relationships, this study will be of broad interest to researchers investigating gene regulation mechanisms in development and disease.
  
  Summary
2. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  MPRAs are a high-throughput and powerful tool for assaying the regulatory potential of genomic sequences. However, linking MPRA-nominated regulatory sequences to their endogenous target genes and identifying the more specific functional regions within these sequences can be challenging. MPRAs that tile a genomic region, and saturation mutagenesis-based MPRAs, can help to address these challenges. In this work, Tulloch et al. describe a streamlined MPRA system for the identification and investigation of the regulatory elements surrounding a gene of interest with high resolution. The use of BACs covering a locus of interest to generate MPRA libraries allows for an unbiased and high-coverage assessment of a particular region. Follow-up degenerate MPRAs, where each nucleotide in the nominated sequences is systematically mutated, can then point to key motifs driving their regulatory activity. The authors present this MPRA platform as straightforward, easily customizable, and less time- and resource-intensive than traditional MPRA designs. They demonstrate the utility of their design in the context of the developing mouse retina, where they first use the LS-MPRA to identify active regulatory elements for select retinal genes, followed by d-MPRA, which allowed them to dissect the functional regions within those elements and nominate important regulatory motifs. These assays were able to recapitulate some previously known cis-regulatory modules (CRMs), as well as identify some new potential regulatory regions. Follow-up experiments assessing co-localization of the gene of interest with the CRM-linked GFP reporter in the target cells, and CUT&RUN assays to confirm transcription factor binding to nominated motifs, provided support linking these CRMs to the genes of interest. Overall, this method appears flexible and could be an easy-to-implement tool for other investigators aiming to study their locus of interest with high resolution.
  
  Strengths:
  
  (1) The method of fragmenting BACs allows for high, overlapping coverage of the region of interest.
  
  (2) The d-MPRA method was an efficient way to identify key functional transcription factor motifs and nominate specific transcription factor-driven regulatory pathways that could be studied further.
  
  (3) Additional assays like co-expression analyses using the endogenous gene promoter, and use of the Notch inhibitor in the case of Olig2, helped correlate the activity of the CRMs to the expression of the gene of interest, and distinguish false positives from the initial MPRA.
  
  (4) The use of these assays across different time points, tissues, and even species demonstrated that they can be used across many contexts to identify both common and divergent regulatory mechanisms for the same gene.
  
  Weaknesses:
  
  The LS-MPRA assay most strongly identified promoters, which are not usually novel regulatory elements you would try to discover, and the signal-to-noise ratio for more TSS-distal, non-promoter regulatory elements was usually high, making it difficult to discriminate lower activity CRMs, like enhancers, from the background. For example, NR2 and NR3 in Figure 3 have very minimal activity peaks (NR3 seems non-existent). The ex vivo data in Figure 2 are similarly noisy. Is there a particular metric or calculation that was or could be used to quantitatively or statistically call a peak above the background? The authors mention in the discussion some adjustments that could reduce the noise, such as increased sequencing depth, which I think is needed to make these initial LS-MPRA results and the benchmarking of this assay more convincing and impactful.
  
  Review 1
3. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  In this study, Tulloch et al. developed two modified massively parallel reporter assays (MPRAs) and applied them to identify cis-regulatory modules (CRMs) - genomic regions that activate gene expression, controlling retinal gene expression. These CRMs usually function at specific developmental stages and in distinct cell types to orchestrate retinal development. Studying them provides insights into how retinal progenitor cells give rise to various retinal cell types.
  
  The first assay, named locus-specific MPRA (LS-MPRA), tests all genomic regions within 150-300 kb of the gene of interest, rather than relying on previously predicted candidate regulatory elements. This approach reduces potential bias introduced during candidate selection, lowers the cost of synthesizing a library of candidate sequences, and simplifies library preparation. The LS-MPRA libraries were electroporated into mouse retinas in vivo or ex vivo. To benchmark the method, the authors first applied LS-MPRA near stably expressed retinal genes (e.g., Rho, Cabp5, Grm6, and Vsx2), and successfully identified both known and novel CRMs. They then used LS-MPRA to identify CRMs in embryonic mouse retinas, near Olig2 and Ngn2, genes expressed in subsets of retinal progenitor cells. Similar experiments were conducted in chick retinas and postnatal mouse retinas, revealing some CRMs with conserved activity across species and developmental stages.
  
  Although the study identified CRMs with robust reporter activity in Olig2+ or Ngn2+ cells, the data do not provide sufficient evidence to support the claims that these CRMs regulate Olig2 or Ngn2, rather than other nearby genes, in a cell-type-specific manner. For example, the authors propose that three regions (NR1/2/3) regulate Olig2 specifically in retinal progenitor cells based on: (1) the three regions are close to Olig2, (2) increased Olig2 expression and NR1/2/3 activity upon Notch inhibition, and (3) reporter activity observed in Olig2+ cells (though also present in many Olig2- cells). While these are promising findings, they do not directly support the claims.
  
  The second assay, called degenerate MPRA (d-MPRA), introduces random point mutations into CRMs via error-prone PCR to assess the impact of sequence variations on regulatory activity. This approach was used on NR1/2/3 to identify mutations that alter CRM activity, potentially by influencing transcription factor binding. The authors inferred candidate transcription factors, such as Mybl1 and Otx2, through motif analysis, co-expression with Olig2 (based on single-cell RNA-seq), and CUR&RUN profiling. While some transcription factors identified in this way overlapped with the d-MPRA results, others did not. This raises questions about how well d-MPRA complements other methods for identifying transcriptional regulators.
  
  Strengths:
  
  (1) The study introduces two technically robust MPRA protocols that offer advantages over standard methods, such as avoiding reliance on predefined candidate regions, reducing cost and labor, and minimizing selection bias.
  
  (2) The identified regulatory elements and transcription factors contribute to our understanding of gene regulation in retinal development and may have translational potential for cell-type-specific gene delivery into developing retinas.
  
  Weaknesses:
  
  (1) The claims for gene-specific and cell type-specific CRMs would benefit from further validation using complementary approaches, such as CRISPR interference or Prime editing.
  
  Review 2
4. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  Use of reporter assays to understand the regulatory mechanisms controlling gene expression moves beyond simple correlations of cis-regulatory sequence accessibility, evolutionary sequence conservation, and epigenetic status with gene expression, instead quantifying regulatory sequence activity for individual elements. Tulloch et al., provide a systematic characterization of two new reporter assay techniques (LS-MPRA and d-MPRA) to comprehensively identify cis-regulatory sequences contained within genomic loci of interest during retinal development. The authors then apply LS-MPRA and d-MPRA to identify putative cis-regulatory sequences controlling Olig2 and Ngn2 expression, including potential regulatory motifs that known retinal transcription factors may bind. Transcription factor binding to regulatory sequences is then assessed via CUT&RUN. The broader utility of the techniques is then highlighted by performing the assays across development, across species, and across tissues.
  
  Strengths:
  
  (1) The authors validate the reporter assays on retinal loci for which the regulatory sequences are known (Rho, Vsx2, Grm6, Cabp5) mostly confirming known regulatory sequence activity but highlighting either limitations of the current technology or discrepancies of previous reporter assays and known biology. The techniques are then applied to loci of interest (Olig2 and Ngn2) to better understand the regulatory sequences driving expression of these transcription factors across retinal development within subsets of retinal progenitor cells, identifying novel regulatory sequences through comprehensive profiling of the region.
  
  (2) LS-MPRA provides broad coverage of loci of interest.
  
  (3) d-MPRA identifies sequence features that are important for cis-regulatory sequence activity.
  
  (4) The authors take into account transcript and protein stability when determining the correlation of putative enhancer sequence activity with target gene expression.
  
  Weaknesses:
  
  (1) In its current form, the many important controls that are standard for other MPRA experiments are not shown or not performed, limiting the interpretations of the utility of the techniques. This includes limited controls for basal-promoter activity, limited information about sequence saturation and reproducibility of individual fragments across different barcode sequences, limitations in cloning and assay delivery, and sequencing requirements. Additional quantitative metrics, including locus coverage and number of barcodes/fragments, would be beneficial throughout the manuscript.
  
  (2) There are no statistical metrics for calling a region/sequence 'active'. This is especially important given that NR3 for Olig2 seems to have a small 'peak' and has non-significant activity in Figure 4.
  
  (3) The authors present correlational data for identified cis-regulatory sequences with target gene expression. Additionally, the significance of transcription factor binding to the putative regulatory sequences is not currently tested, only correlated based on previous single-cell RNA-sequencing data. While putative regulatory sequences with potential mechanisms of regulation are identified/proposed, the lack of validation (and discrepancies with previous literature) makes it hard to decipher the utility of the techniques.
  
  (4) While the interpretations that Olig2 mRNA/protein expression is dynamically regulated improved the proportions of cells that co-expressed CRM-regulated GFP and Olig2, alternate explanations (some noted) are just as likely. First, the electroporation isn't specific to Olig2+ progenitors. Also, the tested, short CRM fragments may have activating signals outside of Olig2 neurogenic cells because chromatin conformation, histone modifications, and DNA methylation are not present on plasmids to precisely control plasmid activity. Alternatively, repressive elements that control Olig2 expression are not contained in the reporter vectors.
  
  (5) It is unclear as to why the d-MPRA uses a different barcoding strategy, placing a second copy of the cis-regulatory sequence in the 3' UTR. As acknowledged by the author, this will change the transcript stability by changing the 3' UTR sequence. Because of this, comparisons of sequence activity between the LS-MPRA and d-MPRA should not be performed as the experiments are not equivalent.
  
  (6) Furthermore, details of the mutational burden in d-MPRA experiments are not provided, limiting the interpretations of these results.
  
  (7) Many figures are IGV screenshots that suffer from low resolution. Many figures could be consolidated.
  
  Review 3
5. Public_Reviews 29 Jul 2025
  
  in eLife
  
  Author response:
  
  We thanks the Reviewers for their thorough reviews and helpful suggestions. We will provide additional quantification as requested for several aspects of the study.
  
  The methods that we developed were meant to provide candidates for regulatory elements for a gene of interest. These candidates could be used to further understand the regulation of a gene, a complex and difficult task, especially for dynamically regulated genes in the context of development. These candidates could also, or instead, be used to drive gene expression specifically in a target cell of interest for applications such as gene therapy or perturbations that need this type of specificity. In the first case, to use the candidates to understand the regulation of a gene, one would need to validate the candidates using the types of methods typically employed for this purpose, most rigorously in the in vivo genomic context. We did not pursue this level of validation as it would encompass a great deal of work outside the scope of the current study. However, by initially testing loci and CRMs which have been studied by several groups (Rho, Grm6, Vsx2, and Cabp5), and at least in the cases of Rho and Vsx2, shown to be relevant in the genomic context in vivo, we provide evidence that the LS-MPRA can identify relevant CRMs. These data show that the method is worth using for loci of interest, particularly when only one or a few loci are of interest, i.e. one does not need to use genome-wide approaches. It is also apparent that our methods are not perfect and that the LS-MPRA does not pick up all CRMs. We do not know of a method that has been shown to do so.
  
  Some of the statistical and quantitative data asked for by the Reviewers will be provided. However, it is important to note that the types of statistics using peak callers asked for regarding candidate choice will be of limited value. If one is testing a library in a single cell type in vitro, and/or running genome-wide assays, these statistics could aid in the choice of candidates. However, here we are electroporating a complex and dynamic set of cells, present at very different frequencies. In addition, at least for Olig2 and Ngn2, their expression is very transient, and each is expressed in only a small subset of cells. An additional confound is that the level of expression of each gene that one might test is variable. All of these variables render a statistical prediction of strong candidates to be less valuable than one might hope, and might lead one to miss those CRMs of interest. Instead, we suggest that one use one’s own level of interest and knowledge in choosing CRM candidates. We provide several examples of experimental, rather than purely statistical, approaches that might help in one’s choice of candidates. We used a functional read-out of CRM activity (Notch perturbation), carried out in the context of the entire LS-MPRA library, as one method. Co-expression in single cells of candidate regulators identified by the d-MPRA is another. One can of course use chromatin structure and sequence conservation, as used in many studies of regulatory regions, as other ways to narrow down candidates. The d-MPRA predictions also can be viewed in light of previous genetic studies, i.e. mutations in TFs that effect the cell type of interest or the regulation of the gene of interest, as we were able to do here for CRMs predicted to be regulated by Otx2.
  
  If one wishes to use a candidate CRM to drive gene expression in a targeted cell type, one needs to establish specificity. In particular, specificity needs to be established in the context of the vector that is being used. Non-integrated vs integrated vectors, different types of viral vectors with their own confounding regulatory sequences, and copy number can all effect specificity. We provided a double in situ hybridization method for the examination of specificity for some of the novel candidate CRMs. It was quite difficult in the case of Olig2 and Ngn2 as their RNAs and proteins are unstable. We would need to provide further evidence should we wish to use these candidate CRMs for directing expression specifically in Olig2- or Ngn2-expressing cells. We suggest that an investigator can choose the vector and method for establishing specificity depending upon the goals of the application.
  
  AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.13.653746v2
www.biorxiv.org www.biorxiv.org

An applicable and efficient retrograde monosynaptic circuit mapping tool for larval zebrafish

3
1. Public_Reviews 29 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This important study offers substantial technical advancements for neural circuit tracing in larval zebrafish, a model for systems and developmental neurobiology. The enhanced rabies virus-based retrograde transneuronal tracing improves efficiency and provides a method for combined structural and functional brain mapping. The supporting evidence is solid, and there is strong confidence in the technique's utility for neurobiologists working with zebrafish.
 
 Summary
2. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 (1) Presentation of Figures in the Response Letter
 
 I would like to note that the figures included in the response letter would benefit from improved organization. For example, Author response image 1 lacks clarity for experimental conditions. From the response letter, my understanding is that a "Labeling rate index", Rg−Rn, was calculated to represent the difference in the rate of increase in labeling between neurons and glial across two time intervals based on experiments shown in Figure 2-figure supplement 1C and G. It seems that a mean convergence index was calculated for each experimental condition at each time point for glial and neurons, and then the differences in mean convergence index increase between time intervals were calculated for glial and neurons. The legend needs more detail to enhance clarity.
 
 Furthermore, the manuscript should clearly distinguish between figures generated from re-analysis of existing data and those based on newly conducted experiments. This distinction should be explicitly stated in the figure legends and/or main text. I recommend that all response figures containing data integral to the authors' rebuttal be properly integrated into the manuscript's existing supplementary figure set, rather than remaining isolated in the response document. This would enhance clarity and ensure that key supporting data are fully accessible to readers. For instance, Author response image 1 can be integrated with Figure 2-figure supplement.
 
 (2) Glial Cell Labeling and Specificity of Trans-Synaptic Spread
 
 The authors provided a comprehensive and well-reasoned response to the concern regarding the labeling of radial glial cells. The inclusion of a dedicated section in the revised Discussion and response figures (possibly to be integrated with supplementary figures), strengthens the manuscript.
 
 The authors have made an interesting observation in Author response image 2 that glial labeling was frequently observed near the soma and dendrites of starter cells, suggesting that transneuronal labeled glial cells may be synaptically associated with the starter neurons. Also astroglia starter cells lead to infection of nearby TVA-negative astroglia, suggesting astroglia-to- astroglia transmission.
 
 I find the response scientifically satisfactory and appreciate the authors' transparency in addressing the limitations of their approach.
 
 (3) Temperature Effects and Larval Viability
 
 The authors' justification for raising larvae at 36C to improve labeling efficiency is reasonable. The supporting data indicating minimal impact on larval viability within the experimental timeframe are convincing. Referencing prior behavioral studies and including survival data under controlled conditions adds credibility to their claims. I find this issue satisfactorily addressed.
 
 (4) Viral Toxicity and Dosage Considerations, Secondary Starter Cells
 
 The authors present a well-reasoned explanation that viral cytotoxicity is primarily driven by replication and not by viral titer or injection volume. However, the inclusion of experimental data directly testing the effects of higher titer or volume on starter cell viability would have strengthened this point, particularly since such tests are relatively straightforward to perform.
 
 Regarding the potential contribution of secondary starter cells, the authors provide a convincing rationale for why such effects are unlikely under their sparse labeling conditions. However, in cases where TVA and G are broadly expressed-such as under the vglut2a promoter, as shown in Author response image 2-it would be valuable to directly evaluate this possibility experimentally. While the authors' interpretation is reasonable, empirical validation would further strengthen their conclusions.
 
 Review 1
3. Public_Reviews 29 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 The study by Chen, Deng et al. aims to develop an efficient viral transneuronal tracing method that allows efficient retrograde tracing in the larval zebrafish. The authors utilize pseudotyped-rabies virus that can be targeted to specific cell types using the EnvA-TvA systems. Pseudotyped rabies virus has been used extensively in rodent models and, in recent years, has begun to be developed for use in adult zebrafish. However, compared to rodents, the efficiency of spread in adult zebrafish is very low (~one upstream neuron labeled per starter cell). Additionally, there is limited evidence of retrograde tracing with pseudotyped rabies in the larval stage, which is the stage when most functional neural imaging studies are done in the field. In this study, the authors systematically optimized several parameters of rabies tracing, including different rabies virus strains, glycoprotein types, temperatures, expression construct designs, and elimination of glial labeling. The optimal configurations developed by the authors are up to 5-10 fold higher than more typically used configurations.
 
 The results are convincing and support the conclusions. There are some additional changes that are recommended:
 
 (1) The new data included in the response to reviewer's letter are important to support the main conclusions and should be included in the manuscript.
 
 (2) Line 357-362: This section should include all of the Author response image and associated details. Additionally, the Author response image 3 is at odds with Fig 2-supplement 1G. In Author response image 3, ~75% of glial cells labeled at 4 dpi loses their fluorescence by 10 dpi. However, Figure 2-supplement 1G shows that glial overall labeling increases ~2 fold from 4 dpi to 10 dpi. This would suggest that the de novo labeling rate for glia is much higher than the net labeling rate calculated from the convergence index. The authors should clarify these findings.
 
 Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.06.27.601104v2
www.biorxiv.org www.biorxiv.org

Scheduled feeding improves behavioral outcomes and reduces inflammation in a mouse model of Fragile X syndrome

4
1. Public_Reviews 28 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This manuscript presents solid experimental data using Fmr1 knockout mice to explore the fundamental role of Fmr1 in sleep regulation. The study supports the hypothesis that scheduled feeding can improve circadian rhythm and behavior in a mouse model of Fragile X syndrome. These findings may offer new insights into neurodevelopmental disorders and their potential treatment strategies.
 
 Summary
2. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The authors conducted a comprehensive investigation into sleep and circadian rhythm disturbances in Fmr1 knockout (KO) mice, a model for Fragile X Syndrome (FXS). They began by monitoring daily home cage behaviors to identify disruptions in sleep and circadian patterns, then assessed the mice's adaptability to altered light conditions through photic suppression and skeleton photoperiod experiments. To uncover potential mechanisms, they examined the connectivity between the retina and the suprachiasmatic nucleus. The study also included an analysis of social behavior deficits in the mutant mice and tested whether scheduled feeding could alleviate these issues. Notably, scheduled feeding not only improved sleep, circadian, and social behaviors but also normalized plasma cytokine levels. The manuscript is strengthened by its focus on a significant and underexplored area-sleep deficits in an FXS model-and by its robust experimental design, which integrates a variety of methodological approaches to provide a thorough understanding of the observed phenomena and potential therapeutic avenues.
 
 Review 1
3. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In the present study, the authors, using a mouse model of Fragile X syndrome, explore the intriguing hypothesis that restricting food access over the daily schedule will improve sleep patterns and subsequently enhanced behavioral capacities. By restricting food access from 12h to 6h over the nocturnal period (the active period for mice), they show, in these KO mice, an improvement in the sleep pattern accompanied by reduced systemic levels of inflammatory markers and improved behavior. These data, using a classical mouse model of neurodevelopmental disorder (NDD), suggest that modifying eating patterns might improve sleep quality, leading to reduced inflammation and enhanced cognitive/behavioral capacities in children with NDD.
 
 Overall, the paper is well-written and easy to follow. The rationale of the study is generally well introduced. Data are globally sound. The interpretation is overall supported by the provided data.
 
 Review 2
4. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors investigated sleep and circadian rhythm disturbances in Fmr1 KO mice. Initially, they monitored daily home cage behaviors to assess sleep and circadian disruptions. Next, they examined the adaptability of circadian rhythms in response to photic suppression and skeleton photic periods. To explore the underlying mechanisms, they traced retino-suprachiasmatic connectivity. The authors further analyzed the social behaviors of Fmr1 KO mice and tested whether a scheduled feeding strategy could mitigate sleep, circadian, and social behavior deficits. Finally, they demonstrated that scheduled feeding corrected cytokine levels in the plasma of mutant mice.
 
 Strengths:
 
 (1) The manuscript addresses an important topic-investigating sleep deficits in an FXS mouse model and proposing a potential therapeutic strategy.
 
 (2) The study includes a comprehensive experimental design with multiple methodologies, which adds depth to the investigation.
 
 We thank the reviewer for the positive comments.
 
 Weaknesses:
 
 (1) The first serious issue in the manuscript is the lack of a clear description of how they performed the experiments and the missing definitions of various parameters in the results.
 
 We thank the reviewer for pointing out lapses in the editing of the manuscript. We were trying to keep the descriptions of previously published methods brief but must have gone too far, the manuscript has been carefully checked for grammar and readability. Description of the experimental design has been refined and a graphical presentation has been added as Suppl Fig 3. The sleep and circadian parameters have been thoroughly explained in the methods and briefly in the figure legnds.
 
 (2) Although the manuscript has a relatively long Methods section, some essential information is missing. For instance, the definition of sleep bout, as described above, is unclear. Additional missing information includes
 
 Figure 2: "Rhythmic strength (%)" and "Cycle-to-cycle variability (min)."
 
 Figure 3: "Activity suppression."
 
 Figure 4: "Rhythmic power (V%)" (is this different from rhythmic strength (%)?) and "Subjective day activity (%)."
 
 We have provided definitions for the general audience of the terms used in the field of circadian rhythms, such as sleep bout, rhythm power, cycle-to-cycle, masking, and % of activity during the day in the methods and Fig legends. Most of the techniques used in this study, for example, the behavioral measurement of sleep or locomotor activity, are well established and have been used in multiple published works, including our own. We have made sure to include citations for interested readers.
 
 Figure 5: Clear labeling of the SCN's anatomical features and an explanation for quantifying only the ventral part instead of the entire SCN.
 
 We have added more landmarks (position of the third ventricle and optic chiasm) to Fig 5, and have outlined the shell and core of the SCN in two additional images of the ventral hypothalamus in Suppl fig 4.
 
 We had actually quantified the fluorescence in the whole SCN as well as in the ventral part.This was/is described in the methods as well as reported in the results section and Table 4 “Likewise, a subtle decrease in the intensity of the labelled fibers was found in the whole SCN (Table 4) of the Fmr1 KO mice as compared to WT.“
 
 Methods: ” Two methods of analyses were carried out on the images of 5 consecutive sections per animal containing the middle SCN. First, the relative intensity of the Cholera Toxin fluorescent processes was quantified in the whole SCN, both left and right separately, by scanning densitometry using the Fiji image processing package of the NIH ImageJ software (https://imagej.net). A single ROI of fixed size (575.99 μm x 399.9 μm, width x height) was used to measure the relative integrated density (mean gray values x area of the ROI) in all the images. The values from the left and right SCN were averaged per section and 5 sections per animal were averaged to obtain one value per animal………..”
 
 Since the retinal innervation of the SCN is strongest in the ventral aspect, where the retino-hypothalamic fibers reach the SCN and our goal was to identify differences in the input to the SCN, e.g. defects in the retino-SCN connectivity as suggested by some deficits in circadian behaviour; we also looked at intensity of Cholera Toxin in the fibers arriving to the ventral SCN from the retina.
 
 We have added a sentence in the methods about the rationale for measuring the intensity of the cholera toxin labelled fiber in the whole SCN and also just in the ventral part: “Second, the retinal innervation of the SCN is strongest in the ventral aspect, where the retino-hypothalamic fibers reach the SCN, hence, the distribution….”
 
 Figure 6: Inconsistencies in terms like "Sleep frag. (bout #)" and "Sleep bouts (#)." Consistent terminology throughout the manuscript is essential.
 
 We have now clearly explained that sleep bouts are a measure of sleep fragmentation throughout the manuscript and in the fig legends; in addition, we have corrected the figures, reconciled the terminology, which is now consistent throughout the results and methods.
 
 Methods: “Sleep fragmentation was determined by the number of sleep bouts, which were operationally defined as episodes of continuous immobility with a sleep count greater than 3 per minute, persisting for at least 60 secs.”
 
 (3) Figure 1A shows higher mouse activity during ZT13-16. It is unclear why the authors scheduled feeding during ZT15- 21, as this seems to disturb the rhythm. Consistent with this, the body weights of WT and Fmr1 KO mice decreased after scheduled feeding. The authors should explain the rationale for this design clearly.
 
 We have added to the rationale for the feeding schedule. This protocol was initially used by the Panda group to counter metabolic dysfunction (Hatori et al., 2012). We have used it for many years now (see citations below) in various mouse models presenting with circadian disruption to reset the clock and improve sleep. This study represents our first application/intervention in a mouse model of a neurodevelopmental disease.
 
 Hatori M, Vollmers C, Zarrinpar A, DiTacchio L, Bushong EA, Gill S, Leblanc M, Chaix A, Joens M, Fitzpatrick JA, Ellisman MH, Panda S. Time-restricted feeding without reducing caloric intake prevents metabolic diseases in mice fed a high-fat diet. Cell Metab. 2012 Jun 6;15(6):848-60. doi: 10.1016/j.cmet.2012.04.019. Epub 2012 May 17. PMID: 22608008; PMCID: PMC3491655.
 
 Chiem E, Zhao K, Dell'Angelica D, Ghiani CA, Paul KN, Colwell CS. Scheduled feeding improves sleep in a mouse model of Huntington's disease. Front Neurosci. 2024 18:1427125. doi: 10.3389/fnins.2024.1427125. PMID: 39161652.
 
 Whittaker DS, Akhmetova L, Carlin D, Romero H, Welsh DK, Colwell CS, Desplats P. Circadian modulation by time-restricted feeding rescues brain pathology and improves memory in mouse models of Alzheimer's disease. Cell Metab. 2023 35(10):1704- 1721.e6. doi: 10.1016/j.cmet.2023.07.014. PMID: 37607543
 
 Brown MR, Sen SK, Mazzone A, Her TK, Xiong Y, Lee JH, Javeed N, Colwell CS, Rakshit K, LeBrasseur NK, Gaspar-Maia A, Ordog T, Matveyenko AV. Time-restricted feeding prevents deleterious metabolic effects of circadian disruption through epigenetic control of β cell function. Sci Adv. 2021 7(51):eabg6856. doi: 10.1126/sciadv.abg6856. PMID: 34910509
 
 Whittaker DS, Loh DH, Wang HB, Tahara Y, Kuljis D, Cutler T, Ghiani CA, Shibata S, Block GD, Colwell CS. Circadian-based Treatment Strategy Effective in the BACHD Mouse Model of Huntington's Disease. J Biol Rhythms. 2018 33(5):535-554. doi: 10.1177/0748730418790401. PMID: 30084274.
 
 Wang HB, Loh DH, Whittaker DS, Cutler T, Howland D, Colwell CS. Time-Restricted Feeding Improves Circadian Dysfunction as well as Motor Symptoms in the Q175 Mouse Model of Huntington's Disease. eNeuro. 2018 Jan 3;5(1):ENEURO.0431-17.2017. doi: 10.1523/ENEURO.0431-17.2017.
 
 Loh DH, Jami SA, Flores RE, Truong D, Ghiani CA, O'Dell TJ, Colwell CS. Misaligned feeding impairs memories. Elife. 2015 4:e09460. doi: 10.7554/eLife.09460.
 
 (4) The interpretation of social behavior results in Figure 6 is questionable. The authors claim that Fmr1 KO mice cannot remember the first stranger in a three-chamber test, writing, "The reduced time in exploring and staying in the novelmouse chamber suggested that the Fmr1 KO mutants were not able to distinguish the second novel mouse from the first now-familiar mouse." However, an alternative explanation is that Fmr1 KO mice do remember the first stranger but prefer to interact with it due to autistic-like tendencies. Data in Table 5 show that Fmr1 KO mice spent more time interacting with the first stranger in the 3-chamber social recognition test, which support this possibility. Similarly, in the five-trial social test, Fmr1 KO mice's preference for familiar mice might explain the reduced interaction with the second stranger.
 
 Thank you for this interesting interpretation of the social behavior experiments. We used the common interpretations for both the three-chamber test and the 5-trial social interaction test, but have now modified the text leaving space for alternative interpretations, have soften the language, and mentioned decreased sociability in the Fmr1 KO mice. “The reduced time spent exploring the novel-mouse chamber suggest that the mutants were, perhaps, unable to distinguish the second novel mouse from the first, now familiar, mouse, along with decreased sociability.”
 
 In Figure 6C (five-trial social test results), only the fifth trial results are shown. Data for trials 1-4 should be provided and compared with the fifth trial. The behavioral features of mice in the 5-trial test can then be shown completely. In addition, the total interaction times for trials 1-4 (154 {plus minus} 15.3 for WT and 150 {plus minus} 20.9 for Fmr1 KO) suggest normal sociability in Fmr1 KO mice (it is different from the results of 3-chamber). Thus, individual data for trials 1-4 are required to draw reliable conclusions.
 
 We have added a suppl figure showing the individual trial results for both WT and Fmr1 KO mice as requested (Suppl. Fig. 2).
 
 In Table 6 and Figure 6G-6J, the authors claim that "Sleep duration (Figures 6G, H) and fragmentation (Figures 6I, J) exhibited a moderate-strong correlation with both social recognition and grooming." However, Figure 6I shows a p-value of 0.077, which is not significant. Moreover, Table 6 shows no significant correlation between SNPI of the three-chamber social test and any sleep parameters. These data do not support the authors' conclusions.
 
 Thanks for pointing out the error with statement about Fig. 6I.
 
 “…. Sleep duration (Fig. 6G, H; Table 6) exhibited a moderate to strong correlation with both social recognition and grooming time, while sleep fragmentation (measured by sleep bouts number) only correlated with the latter (Fig. 6J); the length of sleep bouts (Table 6) showed moderate correlation with both social recognition and repetitive behavior. In addition, a moderate correlation was seen between grooming time and the circadian parameters, rhythmic power and activity onset variability (Table 6). In short, our work suggests that even when tested during their circadian active phase, the Fmr1 KO mice exhibit robust repetitive and social behavioral deficits. Moreover, the shorter and more fragmented the daytime sleep, the more severe the behavioral impairment in the mutants.”
 
 (5) Figure 7 demonstrates the effect of scheduled feeding on circadian activity and sleep behaviors, representing another critical set of results in the manuscript. Notably, the WT+ALF and Fmr1 KO+ALF groups in Figure 7 underwent the same handling as the WT and Fmr1 KO groups in Figures 1 and 2, as no special treatments were applied to these mice. However, the daily patterns observed in Figures 7A, 7B, 7F, and 7G differ substantially from those shown in Figures 2B and 1A, respectively. Additionally, it is unclear why the WT+ALF and Fmr1 KO+ALF groups did not exhibit differences in Figures 7I and 7J, especially considering that Fmr1 KO mice displayed more sleep bouts but shorter bout lengths in Figures 1C and 1D.
 
 We appreciate the reviewer’s attention to the subtle details of the behavioral measurement of sleep and believe the reviewer to be referring to differences in the behavioral measurements of sleep with data shown in Table 1 and Table 7. The first set of experiments described in this study was carried out between 2016 and 2017 and involves the comparison between WT and Fmr1 KO mice. The WT and mutants were obtained from JAX. In this initial set of experiments (Table 1), the total amount of sleep in 24 hrs was reduced in the KO, albeit not significantly, and these also exhibited sleep bouts of significantly reduced duration. The pandemic forced us to greatly slow down the research and reduce our mouse colonies. Post-pandemic, we used new cohorts of Fmr1 KO ordered again from JAX for the TRF experiment presented in this study. In these cohorts, the KO mice exhibited a significant reduction in total sleep (Table 7) and the sleep bouts were still shorter but not significantly. We have added to our text to explain that the description of the mutants and TRF interventions were carried out at different times (2017 vs 2022). We would like to emphasize that we always run contemporaneously controls and experimental groups to be used for the statistical analyses. We believe that the data are remarkably consistent over these years, even with different students doing the measurements.
 
 Furthermore, it is not specified whether the results in Figure 7 were collected after two weeks of scheduled feeding (for how many days?) or if they represent the average data from the two-week treatment period.
 
 This is another good point raised by the reviewer. The activity measurements are collected during the 2 weeks (14 days) then the TRF was extended for a 3 more days to allow the behavioral sleep measurements.
 
 We have added a supplementary figure (Supp Fig 3) depicting the different experimental designs.
 
 The rationale behind analyzing "ZT 0-3 activity" in Figure 7D instead of the parameters shown in Figures 2C and 2D is also unclear.
 
 We have added to our explanation. In prior work, we found that the TRF protocol has a big impact on the beginning of the sleep time, hence, we specifically targeted this 3-hours interval in the analysis.
 
 In Figure 7F, some data points appear to be incorrectly plotted. For instance, the dark blue circle at ZT13 connects to the light blue circle at ZT14 and the dark blue circle at ZT17. This is inconsistent, as the dark blue circle at ZT13 should link to the dark blue circle at ZT14. Similarly, it is perplexing that the dark blue circle at ZT16 connects to both the light blue and dark blue circles at ZT17. Such errors undermine confidence in the data. The authors need to provide a clear explanation of how these data were processed.
 
 Thank you for bringing this to our attention. The data were plotted correctly, however, those data points completely overlapped with those behind, masking them. We have now offset a bit them for clarity.
 
 Lastly, in the Figure 7 legend, Table 6 is cited; however, this appears to be incorrect. It seems the authors intended to refer to Table 7.
 
 We have corrected this error, thank you.
 
 (6) Similar to the issue in Figure 7F, the data for day 12 in Supplemental Figure 2 includes two yellow triangles but lacks a green triangle. It is unclear how the authors constructed this chart, and clarification is needed.
 
 We have corrected this error. As the reviewer pointed out, we filled the triangle on day 12 with yellow instead of green.
 
 (7) In Figure 8, a 5-trial test was used to assess the effect of scheduled feeding on social behaviors. It is essential to present the results for all trials (1 to 4). Additionally, it is unclear whether the results for familial mice in Figure 8A correspond to trials 1, 2, 3, or 4.
 
 The legend for Figure 8 also appears to be incorrect: "The left panels show the time spent in social interactions when the second novel stranger mouse was introduced to the testing mouse in the 5-trial social interaction test. The significant differences were analyzed by two-way ANOVA followed by Holm-Sidak's multiple comparisons test with feeding treatment and genotype as factors." This description does not align with the content of the left panels. Moreover, two-way ANOVA is not the appropriate statistical analysis for Figure 8A. The authors need to provide accurate details about the analysis and revise the figure legend accordingly.
 
 We apologies for the confusing Figure legend which has been revised:
 
 “Fig. 8: TRF improved social memory and stereotypic grooming behavior in the Fmr1 KO mice. (A) Social memory was evaluated with the 5-trial social interaction test as described above. The social memory recognition was significantly augmented in the Fmr1 KO by the intervention, suggesting that the treated mutants were able to distinguish the novel mouse from the familiar mouse. The time spent in social interactions with the novel mouse in the 5th-trial was increased to WT-like levels in the mutants on TRF. Paired t-tests were used to evaluate significant differences in the time spent interacting with the test mouse in the 4th (familiar mouse) and 5th (novel mouse) trials. *P < 0.05 indicates the significant time spent with the novel mouse compared to the familiar mouse. (B) Grooming was assessed in a novel arena in mice of each genotype (WT, Fmr1 KO) under each feeding condition and the resulting data analyzed by two-way ANOVA followed by the Holm-Sidak’s multiple comparisons test with feeding regimen and genotype as factors. *P < 0.05 indicates the significant difference within genotype - between diet regimens , and #P < 0.05 those between genotypes - same feeding regimen. (C) TRF did not alter the overall locomotion in the treated mice. See Table 8.”
 
 To assess social recognition memory, mice underwent a five-trial social interaction paradigm in a neutral open-field arena. Each trial lasted 5 minutes and was separated by a 1-minute inter-trial interval. During trials 1–4, the test mouse was exposed to the same conspecific (Stimulus A) enclosed within a wire cup to permit olfactory and limited tactile interaction. In trial 5, a novel conspecific (Stimulus B) was introduced. Time spent investigating the stimulus B mouse (defined as sniffing or directing the nose toward the enclosure within close proximity) was scored using AnyMaze software. A progressive decrease in investigation time across trials 1–4 reflects habituation, while a significant increase in trial 5 indicates dishabituation and intact social recognition memory. In our data, there was not a lot of habituation in both genotypes, but clear differences can be appreciated between trial 4 with the now familiar mouse and trial 5 with novel mouse. Fig. 8A plots the results from individual animals in Trial 4 with a familiar mouse and in Trial 5 with a novel mouse, we have well specified this in the legends. As such, these data were analyzed with a pair t-test.
 
 We used Tow-Way ANOVA to analyse the data reported in Panel 8B and as well as the results in Table 8. This has been clarified in the legend.
 
 (8) The circadian activity and sleep behaviors of Fmr1 KO mice have been reported previously, with some findings consistent with the current manuscript, while others contradict it. Although the authors acknowledge this discrepancy, it seems insufficiently thorough to simply state that the reasons for the conflicts are unknown. Did the studies use the same equipment for behavior recording? Were the same parameters used to define locomotor activity and sleep behaviors? The authors are encouraged to investigate these details further, as doing so may uncover something interesting or significant.
 
 We agree with the reviewers, and believe that the main differences were likely in the experimental design and possibly interpretation.
 
 (9) Some subtitles in the Results section and the figure legends do not align well with the presented data. For example, in the section titled "Reduced rhythmic strength and nocturnality in the Fmr1 KOs," it is unclear how the authors justify the claim of altered nocturnality in Fmr1 KO mice. How do the authors define changes in nocturnality? Additionally, the tense used in the subtitles and figure legends is incorrect. The authors are encouraged to carefully review all subtitles and figure legends to correct these errors and enhance readability.
 
 Nocturnality is defined as the % of total activity within a 24-h cycle that occurred in the night, since this can be confusing and we agree that it was not well explained we have removed it from the subtitle/figure legends.
 
 We have adjusted the subtitles as recommended; however, the tense of the verbs might be a matter of writing style.
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In the present study, the authors, using a mouse model of Fragile X syndrome, explore the very interesting hypothesis that restricting food access over a daily schedule will improve sleep patterns and, subsequently, behavioral capacities. By restricting food access from 12h to 6h over the nocturnal period (active period for mice), they show, in these KO mice, an improvement of the sleep pattern accompanied by reduced systemic levels of inflammatory markers and improved behavior. Using a classical mouse model of neurodevelopmental disorder (NDD), these data suggest that eating patterns might improve sleep quality, reduce inflammation and improve cognitive/behavioral capacities in children with NDD.
 
 Strengths:
 
 Overall, the paper is very well-written and easy to follow. The rationale of the study is generally well-introduced. The data are globally sound. The provided data support the interpretation overall.
 
 Thank you for the positive comments.
 
 Weaknesses:
 
 (1) The introduction part is quite long in the Abstract, leaving limited space for the data provided by the present study.
 
 We have revised the Abstract to better focus on the most impactful findings as suggested.
 
 (2) A couple of points are not totally clear for a non-expert reader: - The Fmr1/Fxr2 double KO mice are not well described. What is the rationale for performing both LD and DD measures?
 
 We did not use the Fmr1/Fxr2 double KO mice in this study.
 
 While measurement of day/night differences in activity rhythms are standardly done in a light/dark (LD) cycle, the organisms must be under constant conditions (DD) to measure their endogenous circadian rhythms (free running activity); this is often needed to uncover a compromised clock as entrainment to the LD cycle can mask deficits in the endogenous circadian rhythms.
 
 (3) The data on cytokines and chemokines are interesting. However, the rationale for the selection of these molecules is not given. In addition, these measures have been performed in the systemic blood. Measures in the brain could be very informative.
 
 The panel that we used had 16 cytokines/chemokines which are reported in Table 9. The experiment included WT and mutants held under 2 different feeding conditions with an n=8 per group. If we are able to obtain more resources, we would like to also carry out a comprehensive investigation of immunomediator levels as well as RNA-seq or Nanostring in selected brain regions associated with ASD aberrant behavioural phenotypes, for instance the prefrontal cortex.
 
 (4) An important question is the potential impact of fasting vs the impact of the food availability restriction. Indeed, fasting has several effects on brain functioning including cognitive functions.
 
 We did not address this issue in the present study. Briefly, the distinction between caloric restriction (CR) and TRF, in which no calories are restricted, has important mechanistic implications in mouse models. While both interventions can impact metabolism, circadian rhythms, and aging, they operate via overlapping but distinct molecular pathways. These have been the topic of recent reviews and investigations. Importantly, the fast-feed cycle can also act as a circadian entrainer (Zeitgeber)
 
 Ribas-Latre A, Fernández-Veledo S, Vendrell J. Time-restricted eating, the clock ticking behind the scenes. Front Pharmacol. 2024 Aug 8;15:1428601. doi: 10.3389/fphar.2024.1428601. PMID: 39175542; PMCID: PMC11338815.
 
 Wang R, Liao Y, Deng Y, Shuang R. Unraveling the Health Benefits and Mechanisms of Time-Restricted Feeding: Beyond Caloric Restriction. Nutr Rev. 2025 Mar 1;83(3):e1209-e1224. doi: 10.1093/nutrit/nuae074.
 
 (5) How do the authors envision the potential translation of the present study to human patients? How to translate the 12 to 6 hours of food access in mice to children with Fragile X syndrome?
 
 Time-restricted feeding (TRF) is a type of intermittent fasting that limits food intake to a specific window of time each day (usually 8–12 hours in humans), is being actively studied in adults for benefits on metabolic health, sleep, and circadian rhythms. However, applying TRF to children is not currently recommended as a general intervention, and there are important developmental, medical, and ethical considerations to take into account.
 
 On the other hand, we believe that the Fmr1 KO mouse is a good preclinical model for FXS because it closely recapitulates key molecular, cellular, and behavioral phenotypes observed in humans with the disorder. A number of the behavioral phenotypes seen in the mouse mirror those seen in patients including increased anxiety-like behavior, sensory hypersensitivity, social interaction deficits and repetitive behaviors so there is strong face validity.
 
 As we show in this study, Fmr1 KO mice present with disrupted sleep/wake cycles and reduced amplitude of circadian rhythms, consistent with findings in individuals with FXS. This makes the Fmr1 KO an excellent model to test out circadian based interventions such as scheduled feeding.
 
 We believe that pre-clinical research in Fmr1 KO mice bridges the gap between basic discovery and human clinical application. It provides a controlled, cost-effective, and biologically relevant platform for understanding disease mechanisms and testing interventions. These types of experiments need to be done before jumping to humans to ensure that the human trials are scientifically justified and ethically sound.
 
 Reviewer #1 (Recommendations for the authors):
 
 The authors should:
 
 (1) Revise the Methods section for clarity and completeness.
 
 We have re-worked the methods for clarity and completeness.
 
 (2) Provide consistent and precise definitions for all parameters and terms.
 
 We believe that we have provided definitions for all terms.
 
 (3) Clarify the rationale for experimental designs, such as the feeding schedule.
 
 We have added to the rationale for the feeding schedule. This feeding schedule has been used in a number of prior studies including our own. All this work is cited in the manuscript.
 
 (4) Reanalyze and transparently present data, including individual trial results.
 
 We have added to the figure showing the individual trail results for the 5-trial tests as requested (Supplementary Fig. 2).
 
 (5) Conduct appropriate statistical tests and correct figure legends.
 
 We believe that we have carried out appropriate statistical tests and have carefully rechecked the figure legends.
 
 (6) Investigate discrepancies with prior studies to enhance the discussion.
 
 We have added to our discussion of prior work.
 
 (7) Improve language quality and ensure consistency in terminology and grammar.
 
 We have edited the manuscript to improve language quality.
 
 Reviewer #2 (Recommendations for the authors):
 
 (1) The Abstract should be rewritten to provide more room for the obtained data.
 
 We have re-written the Abstract to focus on the most impactful findings.
 
 (2) An additional sentence describing the double KO mice should be added.
 
 We did not use double KO mice in this study.
 
 (3) The rationale for studying LD and DD should be provided.
 
 Measurement of day/night differences are standardly done in a light/dark cycle. To measure the endogenous circadian rhythms, the organisms must be under constant conditions (Dark/Dark).
 
 (4) The data on cytokines/chemokines should be strengthened by performing a larger panel of measures both in blood and the brain.
 
 The panel that we used had 16 cytokines/chemokines which we report in Table 9. This was a large experiment with 2 genotypes being held under 2 feeding conditions with n=8 mice per group. If we are able to obtain more resources, we would like to also carry out RNA-seq in different brain regions.
 
 (5) The authors should discuss in more detail the potential role of fastening vs restriction of food access.
 
 We did not address this issue in the present study. Briefly, the distinction between caloric restriction (CR) and TRF when no calories are restricted has important mechanistic implications in mouse models. While both interventions can impact metabolism, circadian rhythms, and aging, they operate via overlapping but distinct molecular pathways.
 
 (6) The authors should also provide some insight into their view on the potential translation of their experimental studies.
 
 We believe that the Fmr1 KO mouse is considered a good preclinical model for FXS because it closely recapitulates key molecular, cellular, and behavioral phenotypes observed in humans with the disorder. A number of the behavioral phenotypes seen in the mouse mirror those seen in patients including increased anxiety-like behavior, sensory hypersensitivity, social interaction deficits and repetitive behaviors so there is strong face validity. As we demonstrate in this study, Fmr1 KO mice exibit disrupted sleep/wake cycles and reduced amplitude of circadian rhythms, consistent with findings in individuals with FXS. This makes the Fmr1 KO an excellent model to test out circadian based interventions such as scheduled feeding.
 
 Still we are mindful that the translation of therapeutic findings from mouse to human has proven challenging e.g., mGluR5 antagonists failed in clinical trials despite strong preclinical data (Berry-Kravis et al., 2016). Therefore, we are cautious in overreaching in our translational interpretations.
 
 Berry-Kravis, E., Des Portes, V., Hagerman, R., Jacquemont, S., Charles, P., Visootsak, J., Brinkman, M., Rerat, K., Koumaras, B., Zhu, L., Barth, G. M., Jaecklin, T., Apostol, G., & von Raison, F. (2016). Mavoglurant in fragile X syndrome: Results of two randomized, double-blind, placebo-controlled trials. Science translational medicine, 8(321), 321ra5. https://doi.org/10.1126/scitranslmed.aab4109).
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.09.16.613343v4
www.biorxiv.org www.biorxiv.org

Introduction of cytosine-5 DNA methylation sensitizes cells to oxidative damage

5
1. Public_Reviews 28 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This important work advances our understanding of DNA methylation and its consequences for susceptibility to DNA damage. This work presents evidence that DNA methylation can accentuate the genomic damage propagated by DNA damaging agents as well as potentially being an independent source of such damage. The experimental results reported are sound but the evidence presented to support the conclusions drawn is incomplete and other interpretations are possible. The work will be of broad interest to biochemists, cell and genome biologists.
 
 Summary
2. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The manuscript titled "Introduction of cytosine-5 DNA methylation sensitizes cells to oxidative damage" proposes that 5mC modifications to DNA, despite being ancient and wide-spread throughout life, represent a vulnerability, making cells more susceptible to both chemical alkylation and, of more general importance, reactive oxygen species. Sarkies et al take the innovative approach of introducing enzymatic genome-wide cytosine methylation system (DNA methyltransferases, DNMTs) into E. coli, which normally lacks such a system. They provide compelling evidence that the introduction of DNMTs increases the sensitivity of E. coli to chemical alkylation damage. Surprisingly they also show DNMTs increase the sensitivity to reactive oxygen species and propose that the DNMT generated 5mC presents a target for the reactive oxygen species that is especially damaging to cells. Evidence is presented that DNMT activity directly or indirectly produces reactive oxygen species in vivo, which is an important discovery if correct, though the mechanism for this remains obscure.
 
 I am satisfied that the points #2, #3 and #4 relating to non-addativity, transcriptional changes and ROS generation have been appropriately addressed in this revised manuscript. The most important point (previously #1) has not been addressed beyond the acknowledgement in the results section that: "Alternatively, 3mC induction by DNMT may lead to increased levels of ssDNA, particularly in alkB mutants, which could increase the risk of further DNA damage by MMS exposure and heighten sensitivity." This slightly miss-represents the original point that 5mC the main enzymatic product of DNMTs rather or in addition to 3mC is likely to lead to transient damage susceptible ssDNA, especially in an alkB deficient background. And more centrally to the main claims of this manuscript, the authors have not resolved whether methylated cytosine introduced into bacteria is deleterious in the context of genotoxic stress because of the oxidative modification to 5mC and 3mC, or because of oxidative/chemical attack to ssDNA that is transiently exposed in the repair processing of 5mC and 3mC, especially in an alkB deficient background. This is a crucial distinction because chemical vulnerability of 5mC would likely be a universal property of cytosine methylation across life, but the wide-spread exposure of ssDNA is expected to be peculiarity of introducing cytosine methylation into a system not evolved with that modification as a standard component of its genome.
 
 These two models make different predictions about the predominant mutation types generated, in the authors system using M.SssI that targets C in a CG context - if oxidative damage to 5mC dominates then mutations are expected to be predominantly in a CG context, if ssDNA exposure effects dominate then the mutations are expected to be more widely distributed - sequencing post exposure clones could resolve this.
 
 Strengths:
 
 This work is based on an interesting initial premise, it is well motivated in the introduction and the manuscript is clearly written. The results themselves are compelling.
 
 Weaknesses:
 
 I am not currently convinced by the principal interpretations and think that other explanations based on known phenomena could account for key results. Specifically the authors have not resolved whether oxidative modification to 5mC and 3mC, or chemical attack to ssDNA that is transiently exposed in the repair processing of 5mC and 3mC is the principal source of the observed genotoxicity.
 
 (1) Original query which still stands: As noted in the manuscript, AlkB repairs alkylation damage by direct reversal (DNA strands are not cut). In the absence of AlkB, repair of alklylation damage/modification is likely through BER or other processes involving strand excision and resulting in single stranded DNA. It has previously been shown that 3mC modification from MMS exposure is highly specific to single stranded DNA (PMID:20663718) occurring at ~20,000 times the rate as double stranded DNA. Consequently the introduction of DNMTs is expected to introduce many methylation adducts genome-wide that will generate single stranded DNA tracts when repaired in an AlkB deficient background (but not in an AlkB WT background), which are then hyper-susceptible to attack by MMS. Such ssDNA tracts are also vulnerable to generating double strand breaks, especially when they contain DNA polymerase stalling adducts such as 3mC. The generation of ssDNA during repair is similarly expected follow the H2O2 or TET based conversion of 5mC to 5hmC or 5fC neither of which can be directly repaired and depend on single strand excision for their removal. The potential importance of ssDNA generation in the experiments has not been [adequately] considered.
 
 Review 1
3. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 5-methylcytosine (5mC) is a key epigenetic mark in DNA and plays a crucial role in regulating gene expression in many eukaryotes including humans. The DNA methyltransferases (DNMTs) that establish and maintain 5mC, are conserved in many species across eukaryotes, including animals, plants, and fungi, mainly in a CpG context. Interestingly, 5mC levels and distributions are quite variable across phylogenies with some species even appearing to have no such DNA methylation.
 
 This interesting and well-written paper discusses continuation of some of the authors' work published several years ago. In that previous paper, the laboratory demonstrated that DNA methylation pathways coevolved with DNA repair mechanisms, specifically with the alkylation repair system. Specifically, they discovered that DNMTs can introduce alkylation damage into DNA, specifically in the form of 3-methylcytosine (3mC). (This appears to be an error in the DNMT enzymatic mechanism where the generation 3mC as opposed to its preferred product 5-methylcytosine (5mC), is caused by the flipped target cytosine binding to the active site pocket of the DNMT in an inverted orientation.) The presence of 3mC is potentially toxic and can cause replication stress, which this paper suggests may explain the loss of DNA methylation in different species. They further showed that the ALKB2 enzyme plays a crucial role in repairing this alkylation damage, further emphasizing the link between DNA methylation and DNA repair.
 
 The co-evolution of DNMTs with DNA repair mechanisms suggest there can be distinct advantages and disadvantages of DNA methylation to different species which might depend on their environmental niche. In environments that expose species to high levels of DNA damage, high levels of 5mC in their genome may be disadvantageous. This present paper sets out to examine the sensitivity of an organism to genotoxic stresses such as alkylation and oxidation agents as the consequence of DNMT activity. Since such a study in eukaryotes would be complicated by DNA methylation controlling gene regulation, these authors cleverly utilize Escherichia coli (E.coli) and incorporate into it the DNMTs from other bacteria that methylate the cytosines of DNA in a CpG context like that observed in eukaryotes; the active sites of these enzymes are very similar to eukaryotic DNMTs and basically utilize the same catalytic mechanism (also this strain of E.coli does not specifically degrade this methylated DNA) .
 
 The experiments in this paper more than adequately show that E. coli expression of these DNMTs (comparing to the same strain without the DNMTS) do indeed show increased sensitivity to alkylating agents and this sensitivity was even greater than expected when a DNA repair mechanism was inactivated. Moreover, they show that this E. coli expressing this DNMT is more sensitive to oxidizing agents such as H2O2 and has exacerbated sensitivity when a DNA repair glycosylase is inactivated. Both propensities suggest that DNMT activity itself may generate additional genotoxic stress. Intrigued that DNMT expression itself might induce sensitivity to oxidative stress, the experimenters used a fluorescent sensor to show that H2O2 induced reactive oxygen species (ROS) are markedly enhanced with DNMT expression. Importantly, they show that DNMT expression alone gave rise to increased ROS amounts and both H2O2 addition and DNMT expression has greater effect that the linear combination of the two separately. They also carefully checked that the increased sensitivity to H2O2 was not potentially caused by some effect on gene expression of detoxification genes by DNMT expression and activity. Finally, by using mass spectroscopy, they show that DNMT expression led to production of the 5mC oxidation derivatives 5-hydroxymethylcytosine (5hmC) and 5-formylcytosine (5fC) in DNA. 5fC is a substrate for base excision repair while 5hmC is not; more 5fC was observed. Introduction of non-bacterial enzymes that produce 5hmC and 5fC into the DNMT expressing bacteria again showed a greater sensitivity than expected. Remarkedly, in their assay with addition of H2O2, bacteria showed no growth with this dual expression of DNMT and these enzymes.
 
 Overall, the authors conduct well thought-out and simple experiments to show that a disadvantageous consequence of DNMT expression leading to 5mC in DNA is increased sensitivity to oxidative stress as well as alkylating agents.
 
 Again, the paper is well-written and organized. The hypotheses are well-examined by simple experiments. The results are interesting and can impact many scientific areas such as our understanding of evolutionary pressures on an organism by environment to impacting our understanding about how environment of a malignant cell in the human body may lead to cancer.
 
 In a new revised version of the paper, the authors have adequately addressed issues put forth by other reviewers. The result is even a better manuscript. Additions to the Results and Discussion sections and a new Supplemental Figure 2 give further credence to their conclusions.
 
 Review 2
4. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 Krwawicz et al., present evidence that expression of DNMTs in E. coli results in (1) introduction of alkylation damage that is repaired by AlkB; (2) confers hypersensitivity to alkylating agents such as MMS (and exacerbated by loss of AlkB); (3) confers hypersensitivity to oxidative stress (H2O2 exposure); (4) results in a modest increase in ROS in the absence of exogenous H2O2 exposure; and (5) results in the production of oxidation products of 5mC, namely 5hmC and 5fC, leading to cellular toxicity. The findings reported here have interesting implications for the concept that such genotoxic and potentially mutagenic consequences of DNMT expression (resulting in 5mC) could be selectively disadvantageous for certain organisms. The other aspect of this work which is important for understanding the biological endpoints of genotoxic stress is the notion that DNA damage per se somehow induces elevated levels of ROS.
 
 Strengths:
 
 The manuscript is well-written, and the experiments have been carefully executed providing data that support the authors' proposed model presented in Fig. 7 (Discussion, sources of DNA damage due to DNMT expression).
 
 Weaknesses:
 
 (1) The authors have established an informative system relying on expression of DNMTs to gauge the effects of such expression and subsequent induction of 3mC and 5mC on cell survival and sensitivity to an alkylating agent (MMS) and exogenous oxidative stress (H2O2 exposure). The authors state (p4) that Fig. 2 shows that "Cells expressing either M.SssI or M.MpeI showed increased sensitivity to MMS treatment compared to WT C2523, supporting the conclusion that the expression of DNMTs increased the levels of alkylation damage." This is a confusing statement and requires revision as Fig. 2 does ALL cells shown in Fig. 2 are expressing DNMTs and have been treated with MMS. It is the absence of AlkB and the expression of DNMTs that that causes the MMS sensitivity.
 
 (2) It would be important to know whether the increased sensitivity (toxicity) to DNMT expression and MMS is also accompanied by substantial increases in mutagenicity. The authors should explain in the text why mutation frequencies were not also measured in these experiments.
 
 (3) Materials and Methods. ROS production monitoring. The "Total Reactive Oxygen Species (ROS) Assay Kit" has not been adequately described. Who is the Vendor? What is the nature of the ROS probes employed in this assay? Which specific ROS correspond to "total ROS"?
 
 (4) The demonstration (Fig. 4) that DNMT expression results in elevated ROS and its further synergistic increase when cells are also exposed to H2O2 is the basis for the authors' discussion of DNA damage-induced increases in cellular ROS. S. cerevisiae does not possess DNMTs/5mC, yet exposure to MMS also results in substantial increases in intracellular ROS (Rowe et al, (2008) Free Rad. Biol. Med. 45:1167-1177. PMC2643028). The authors should be aware of previous studies that have linked DNA damage to intracellular increases in ROS in other organisms and should comment on this in the text.
 
 Review 3
5. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The manuscript proposes that 5mC modifications to DNA, despite being ancient and widespread throughout life, represent a vulnerability, making cells more susceptible to both chemical alkylation and, of more general importance, reactive oxygen species. Sarkies et al take the innovative approach of introducing enzymatic genome-wide cytosine methylation system (DNA methyltransferases, DNMTs) into E. coli, which normally lacks such a system. They provide compelling evidence that the introduction of DNMTs increases the sensitivity of E. coli to chemical alkylation damage. Surprisingly they also show DNMTs increase the sensitivity to reactive oxygen species and propose that the DNMT generated 5mC presents a target for the reactive oxygen species that is especially damaging to cells. Evidence is presented that DNMT activity directly or indirectly produces reactive oxygen species in vivo, which is an important discovery if correct, though the mechanism for this remains obscure.
 
 Strengths:
 
 This work is based on an interesting initial premise, it is well-motivated in the introduction and the manuscript is clearly written. The results themselves are compelling.
 
 We thank the reviewer for their positive response to our study. We also really appreciate the thoughtful comments raised. We have addressed the comments raised as detailed below.
 
 Weaknesses:
 
 I am not currently convinced by the principal interpretations and think that other explanations based on known phenomena could account for key results. Specific points below.
 
 (1) As noted in the manuscript, AlkB repairs alkylation damage by direct reversal (DNA strands are not cut). In the absence of AlkB, repair of alklylation damage/modification is likely through BER or other processes involving strand excision and resulting in single stranded DNA. It has previously been shown that 3mC modification from MMS exposure is highly specific to single stranded DNA (PMID:20663718) occurring at ~20,000 times the rate as double stranded DNA. Consequently, the introduction of DNMTs is expected to introduce many methylation adducts genome-wide that will generate single stranded DNA tracts when repaired in an AlkB deficient background (but not in an AlkB WT background), which are then hyper-susceptible to attack by MMS. Such ssDNA tracts are also vulnerable to generating double strand breaks, especially when they contain DNA polymerase stalling adducts such as 3mC. The generation of ssDNA during repair is similarly expected follow the H2O2 or TET based conversion of 5mC to 5hmC or 5fC neither of which can be directly repaired and depend on single strand excision for their removal. The potential importance of ssDNA generation in the experiments has not been considered.
 
 We thank the reviewer for this interesting and insightful suggestion. Our interpretation of our findings is that a subset of MMS-induced DNA damage, specifically 3mC, overlaps with the damage introduced by DNMTs and this accounts for increased sensitivity to MMS when DNMTs are expressed. However, the idea that the introduction of 3mC by DNMT actually makes the DNA more liable to damage by MMS, potentially through increasing the level of ssDNA, is also a potential explanation, which could operate in addition to the mechanism that we propose.
 
 (2) The authors emphasise the non-additivity of the MMS + DNMT + alkB experiment but the interpretation of the result is essentially an additive one: that both MMS and DNMT are introducing similar/same damage and AlkB acts to remove it. The non-additivity noted would seem to be more consistent with the ssDNA model proposed in #1. More generally non-additivity would also be seen if the survival to DNA methylation rate is non-linear over the range of the experiment, for example if there is a threshold effect where some repair process is overwhelmed. The linearity of MMS (and H2O2) exposure to survival could be directly tested with a dilution series of MMS (H2O2).
 
 We thank the reviewer for this point. As in the response to point #1, the reviewer’s hypothesis of increased potency of MMS, potentially through increased ssDNA, downstream of 3mC induction by DNMT, is a good one. We have added a dose-response curve for DNMT-expressing cells to MMS to the revised version of the manuscript. This shows that there is a non-linear response to MMS in the WT background. Sensitivity is exacerbated by expression of DNMT and alkB mutation individually but there is also a strong non-additive effect that is particularly marked at low MMS concentrations where sensitivity is much higher in the double mutant than predicted from the two single mutants. This is consistent with induction of DNA damage by DNMT that is repaired by alkB because alkB can be ‘overwhelmed’ even in WT backgrounds as the reviewer suggests. However, it is also perfectly possible that the effect is due to increased levels of DNA damage induction in DNMT-expressing cells. Both these results are compatible with our central hypothesis, namely that DNMT expression induces 3mC. We have included these results along with discussion of them in the revised text in the results section:
 
 In order to investigate the non-additivity between DNMT expression and alkB mutation further, we investigated the effect of MMS over a range of concentrations for the different strains (Supplemental Figure 1A). We quantified the non-additivity by comparing between the survival of alkB expressing DNMT to the predicted combined effect of either alkB mutation alone or DNMT expression alone(Supplemental Figure 1B). Significantly reduced survival than expected was observed, most notably at low concentrations of MMS, which could be due to the saturation of the effect at high concentrations of MMS for alkB mutants expressing DNMT, where extremely high levels of sensitivity were observed. The non-linear shape of the graph observed for WT cells expressing DNMTs further suggests that the ability of AlkB to repair the DNA is overwhelmed at high MMS concentrations even in the WT background. These results are consistent with the idea that AlkB repairs a form of DNA damage from MMS that is more prevalent when DNMT is expressed. This could be because DNMT induces 3mC, repaired by AlkB, and further 3mC is induced by MMS leading to much higher 3mC levels in the absence of AlkB activity. Alternatively, 3mC induction by DNMT may lead to increased levels of ssDNA, particularly in alkB mutants, which could increase the risk of further DNA damage by MMS exposure and heighten sensitivity. Either of these mechanisms are consistent with induction of 3mC by DNMT, and indicate that the induction of DNA damage by DNMT expression has a fitness cost for cells when exposed to genotoxic stress in their environment.
 
 (3) The substantial transcriptional changes induced by DNMT expression (Supplemental Figure 4) are a cause for concern and highlight that the ectopic introduction of methylation into a complex system is potentially more confounded than it may at first seem. Though the expression analysis shows bulk transcription properties, my concern is that the disruptive influence of methylation in a system not evolved with it adds not just consistent transcriptional changes but transcriptional heterogeneity between cells which could influence net survival in a stressed environment. In practice I don't think this can be controlled for, possibly quantified by single-cell RNA-seq but that is beyond the reasonable scope of this paper.
 
 We fully agree with the reviewer and, indeed, we are very interested in what is driving the transcriptional changes that we observed. Work is currently underway in the lab to investigate this further but, as the reviewer suggests, is beyond the scope of this paper. Importantly, we have used the transcriptional data to determine that the effect of DNMTs on ROS is unlikely to be due to failure of ROS-induced detoxification mechanisms by investigating the expression of oxyR regulated genes. Nevertheless we have explicitly mentioned the concern raised by the reviewer in the revised manuscript as follows:
 
 “The substantial transcriptional responses could potentially affect how individual cells respond to genotoxic stress and thus could be contributing to some of the excess sensitivity to MMS and H2O2 in cells expressing DNMTs. However, the induction of oxyR regulated genes such as catalase was unaffected by 5mC (Supplementary Figure 4B). Thus, the increased sensitivity to H2O2 is unlikely to be caused by failure of detoxification gene induction by DNMT expression.”
 
 (4) Figure 4 represents a striking result. From its current presentation it could be inferred that DNMTs are actively promoting ROS generation from H2O2 and also to a lesser extent in the absence of exogenous H2O2. That would be very surprising and a major finding with far-reaching implications. It would need to be further validated, for example by in vitro reconstitution of the reaction and monitoring ROS production. Rather, I think the authors are proposing that some currently undefined, indirect consequence of DNMT activity promotes ROS generation, especially when exogenous H2O2 is available. It would help if this were clarified.
 
 We thank the reviewer for picking this up. In the discussion, we raise two possible explanations for why DNMT (even without H2O2) increases the ROS levels. One idea is direct activity of DNMT, and one is through the product of DNMT activity (5mC) acting as a platform to generate more ROS from endogenous or exogenous sources. Whilst we attempted to measure ROS from mSSSI activity in vitro, this experiment gave inconsistent results and therefore we cannot distinguish between these two possibilities. However, we argued that direct activity is less likely, exactly as the reviewer points out. We have clarified our discussion in the revised version, rewriting the entire section titled
 
 Oxidative stress as a new source of DNA damage induction by DNMT expression to more clearly set out these possibilities.
 
 Reviewer #2 (Public review):
 
 5-methylcytosine (5mC) is a key epigenetic mark in DNA and plays a crucial role in regulating gene expression in many eukaryotes including humans. The DNA methyltransferases (DNMTs) that establish and maintain 5mC, are conserved in many species across eukaryotes, including animals, plants, and fungi, mainly in a CpG context. Interestingly, 5mC levels and distributions are quite variable across phylogenies with some species even appearing to have no such DNA methylation.
 
 This interesting and well-written paper discusses the continuation of some of the authors' work published several years ago. In that previous paper, the laboratory demonstrated that DNA methylation pathways coevolved with DNA repair mechanisms, specifically with the alkylation repair system. Specifically, they discovered that DNMTs can introduce alkylation damage into DNA, specifically in the form of 3-methylcytosine (3mC). (This appears to be an error in the DNMT enzymatic mechanism where the generation 3mC as opposed to its preferred product 5-methylcytosine (5mC), is caused by the flipped target cytosine binding to the active site pocket of the DNMT in an inverted orientation.) The presence of 3mC is potentially toxic and can cause replication stress, which this paper suggests may explain the loss of DNA methylation in different species. They further showed that the ALKB2 enzyme plays a crucial role in repairing this alkylation damage, further emphasizing the link between DNA methylation and DNA repair.
 
 The co-evolution of DNMTs with DNA repair mechanisms suggests there can be distinct advantages and disadvantages of DNA methylation to different species which might depend on their environmental niche. In environments that expose species to high levels of DNA damage, high levels of 5mC in their genome may be disadvantageous. This present paper sets out to examine the sensitivity of an organism to genotoxic stresses such as alkylation and oxidation agents as the consequence of DNMT activity. Since such a study in eukaryotes would be complicated by DNA methylation controlling gene regulation, these authors cleverly utilize Escherichia coli (E.coli) and incorporate into it the DNMTs from other bacteria that methylate the cytosines of DNA in a CpG context like that observed in eukaryotes; the active sites of these enzymes are very similar to eukaryotic DNMTs and basically utilize the same catalytic mechanism (also this strain of E.coli does not specifically degrade this methylated DNA) .
 
 The experiments in this paper more than adequately show that E. coli expression of these DNMTs (comparing to the same strain without the DNMTS) do indeed show increased sensitivity to alkylating agents and this sensitivity was even greater than expected when a DNA repair mechanism was inactivated. Moreover, they show that this E. coli expressing this DNMT is more sensitive to oxidizing agents such as H2O2 and has exacerbated sensitivity when a DNA repair glycosylase is inactivated. Both propensities suggest that DNMT activity itself may generate additional genotoxic stress. Intrigued that DNMT expression itself might induce sensitivity to oxidative stress, the experimenters used a fluorescent sensor to show that H2O2 induced reactive oxygen species (ROS) are markedly enhanced with DNMT expression. Importantly, they show that DNMT expression alone gave rise to increased ROS amounts and both H2O2 addition and DNMT expression has greater effect that the linear combination of the two separately. They also carefully checked that the increased sensitivity to H2O2 was not potentially caused by some effect on gene expression of detoxification genes by DNMT expression and activity. Finally, by using mass spectroscopy, they show that DNMT expression led to production of the 5mC oxidation derivatives 5-hydroxymethylcytosine (5hmC) and 5-formylcytosine (5fC) in DNA. 5fC is a substrate for base excision repair while 5hmC is not; more 5fC was observed. Introduction of non-bacterial enzymes that produce 5hmC and 5fC into the DNMT expressing bacteria again showed a greater sensitivity than expected. Remarkedly, in their assay with addition of H2O2, bacteria showed no growth with this dual expression of DNMT and these enzymes.
 
 Overall, the authors conduct well thought-out and simple experiments to show that a disadvantageous consequence of DNMT expression leading to 5mC in DNA is increased sensitivity to oxidative stress as well as alkylating agents.
 
 Again, the paper is well-written and organized. The hypotheses are well-examined by simple experiments. The results are interesting and can impact many scientific areas such as our understanding of evolutionary pressures on an organism by environment to impacting our understanding about how environment of a malignant cell in the human body may lead to cancer.
 
 We thank the reviewer for their response to our study, and value the time taken to produce a public review that will aid readers in understanding the key results of our study.
 
 Reviewer #3 (Public review):
 
 Summary:
 
 Krwawicz et al., present evidence that expression of DNMTs in E. coli results in (1) introduction of alkylation damage that is repaired by AlkB; (2) confers hypersensitivity to alkylating agents such as MMS (and exacerbated by loss of AlkB); (3) confers hypersensitivity to oxidative stress (H2O2 exposure); (4) results in a modest increase in ROS in the absence of exogenous H2O2 exposure; and (5) results in the production of oxidation products of 5mC, namely 5hmC and 5fC, leading to cellular toxicity. The findings reported here have interesting implications for the concept that such genotoxic and potentially mutagenic consequences of DNMT expression (resulting in 5mC) could be selectively disadvantageous for certain organisms. The other aspect of this work which is important for understanding the biological endpoints of genotoxic stress is the notion that DNA damage per se somehow induces elevated levels of ROS.
 
 Strengths:
 
 The manuscript is well-written, and the experiments have been carefully executed providing data that support the authors' proposed model presented in Fig. 7 (Discussion, sources of DNA damage due to DNMT expression).
 
 Weaknesses:
 
 (1) The authors have established an informative system relying on expression of DNMTs to gauge the effects of such expression and subsequent induction of 3mC and 5mC on cell survival and sensitivity to an alkylating agent (MMS) and exogenous oxidative stress (H2O2 exposure). The authors state (p4) that Fig. 2 shows that "Cells expressing either M.SssI or M.MpeI showed increased sensitivity to MMS treatment compared to WT C2523, supporting the conclusion that the expression of DNMTs increased the levels of alkylation damage." This is a confusing statement and requires revision as Fig. 2 does ALL cells shown in Fig. 2 are expressing DNMTs and have been treated with MMS. It is the absence of AlkB and the expression of DNMTs that that causes the MMS sensitivity.
 
 We thank the reviewer for this and agree that this needs to be clarified with regards to the figure presented and will do so in the revised manuscript. The key comparison is between the active and inactive mSSSI which shows increased sensitivity when active methyltransferases are expressed. We have clarified this in the revised version of the manuscript as follows:
 
 “Cells expressing either M.SssI or M.MpeI showed increased sensitivity to MMS treatment compared to cells expressing inactive M.SssI”
 
 (2) It would be important to know whether the increased sensitivity (toxicity) to DNMT expression and MMS is also accompanied by substantial increases in mutagenicity. The authors should explain in the text why mutation frequencies were not also measured in these experiments.
 
 This is an important point because it is not immediately obvious that increased sensitivity would be associated with increased mutagenicity (if, for example, 3mC was never a cause of innacurate DNA repair even in the absence of AlkB). We have now added a Rif resistance assay which demonstrates increased mutagenesis in the presence of DNMT, and that this is exacerbated by loss of AlkB. This is now added as supplemental figure 2 and described in the manuscript as follows:
 
 “One potential consequence of DNMT activity in inducing DNA damage might be increased mutagenesis. To test this we performed a rifampicin resistance mutagenesis assay, in the absence of MMS, to test whether DNMT induced damage was sufficient to lead to mutation rate increase. Mutation rate was increased by DNMT expression (p=1.6e-12; two way anova; Supplemental Figure 2) and alkB mutation (two way anova) separately (p<1e-16). Moreover, there was a significant interaction such that combined alkB mutation and DNMT expression led to a further increased mutation rate compared to the expectation from alkB mutation and DNMT expression separately (p = 7.9e-10; Supplemental Figure 2). Importantly, DNMT induction alone would be expected to lead to increased mutations due to cytosine deamination(Sarkies, 2022a); however, there is a synergistic effect on mutations when this is combined with loss of AlkB function in alkB mutants. This is consistent with 3mC induction by DNMTs which is repaired by AlkB in WT cells but leads to mutations in alkB mutant cells.
 
 (3) Materials and Methods. ROS production monitoring. The "Total Reactive Oxygen Species (ROS) Assay Kit" has not been adequately described. Who is the Vendor? What is the nature of the ROS probes employed in this assay? Which specific ROS correspond to "total ROS"?
 
 The ROS measurement was with a kit from ThermoFisher: https://www.thermofisher.com/order/catalog/product/88-5930-74. The probe is DCFH-DA. This is a general ROS sensor that is oxidised by a large number of cellular reactive oxygen species hence we cannot attribute the signal to a single species. Use of a technique with the potential to more precisely identify the species involved is something we plan to do in future, but is beyond what we can do as part of this study. We have added a comment as to the specificity of the ROS sensor in the revised version as follows:
 
 “The ROS detection reagent in this system is DCFH-DA, a generalised ROS sensor that is not specific to any particular ROS molecule.”
 
 (4) The demonstration (Fig. 4) that DNMT expression results in elevated ROS and its further synergistic increase when cells are also exposed to H2O2 is the basis for the authors' discussion of DNA damage-induced increases in cellular ROS. S. cerevisiae does not possess DNMTs/5mC, yet exposure to MMS also results in substantial increases in intracellular ROS (Rowe et al, (2008) Free Rad. Biol. Med. 45:1167-1177. PMC2643028). The authors should be aware of previous studies that have linked DNA damage to intracellular increases in ROS in other organisms and should comment on this in the text.
 
 We thank the reviewer for this point. We note that the increased ROS that we observed occur in the presence of DNMTs alone and in the presence of H2O2, not in the presence of MMS; however, the point that DNA damage in general can promote increased ROS in some circumstances is well taken. We have included a comment on this in the revised version as follows:
 
 “We believe this is a plausible mechanism to explain both increased ROS and increased sensitivity to oxidative stress when DNMT is expressed. However, other explanations are possible, and it is notable that DNA damaging agents such as MMS can lead to ROS generation(Rowe et al., 2008). A more detailed chemical and kinetic study of the ROS formation in DNMT-expressing cells would be needed to resolve these questions.”
 
 AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.09.13.612259v2
www.biorxiv.org www.biorxiv.org

Working memory shapes neural geometry in human EEG over learning

5
1. Public_Reviews 28 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 The findings are valuable, given that they highlight the flexible and future-oriented nature of working memory. However, the evidence for the claims about context/color generalization, behavioural relevance of context decoding, dimensionality reduction, neural geometry, the XOR representation, and the specific contribution of working memory is incomplete. The work could be reframed in terms of prospective remapping.
 
 Summary
2. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Wojcik et al. conducted a working memory (WM) experiment in which participants had to press the right or left button after being presented with a square (upright) or diamond stimulus. The response mapping ('context') depended on a colour cue presented at the start of each trial. This results in an XOR task, requiring participants to integrate colour and shape information. Importantly, multiple colours could map onto the same context, allowing the authors to disentangle the (neural) representations of context from those of colour.
 
 The authors report that participants learn the appropriate context mappings quickly over the course of the experiment. Neural context representation is evident in the WM delay and emerges later in the experiment, unlike colour representation, which is present only during colour presentation and does not evolve over experimental time. There are furthermore results on neural geometry (averaged cross-generalized decoding) and neural dimensionality (averaged decoding after shattering all task dimensions), which are somewhat harder to interpret.
 
 Overall, the findings are likely Important, as they highlight the flexible and future-oriented nature of WM. The strength of support at the moment is incomplete: there are some loose ends on the context/colour generalization, and the evidence for the XOR neural representation is not (yet) well-established.
 
 I have one (major) concern and several suggestions for improvement.
 
 (1a) As the authors also acknowledge in several places, the XOR dimension is strongly correlated with motor responses, in any case toward the end of the task (and by definition for all correct trials). This should be dealt with properly. Right now, e.g. Figures 2g/i, 2h/j, 3e/g, 3f/h are highly similar, respectively, because of this strong collinearity. I would remove the semi-duplicate graphs and/or deal with this explicitly through some partial regression, trial selection, or similar (and report these correlations).
 
 (1b) Most worrisome in this respect is that one of the key results presented is that XOR decoding increases with learning. But also task accuracy increases, meaning that the proportion of correct trials increases with learning, meaning that the XOR and motor regressors become more similar over experimental time. This means that any classifier picking up on motor signals will be better able to do so later on in the task than earlier on. (In other words, the XOR regressor may be a noisy version of the motor regressor early on, and a more precise version of the motor regressor later on.) Therefore, the increase in XOR decoding over experimental time may be (entirely) due to an increase in similarity between the XOR and motor dimensions. The authors should either rule out this explanation, and/or remove/tone down the conclusions regarding the XOR coding increase. (Note that the takeaway regarding colour/context generalization does not depend on this analysis, fortunately.) The absence of a change in motor decoding with learning (as reported on page 11) does not affect this potential confound; in fact it is made more likely with it.
 
 (2) Bayes factors would be valuable in several places, especially with null results (p. 5) or cases with borderline-significant p-values.
 
 (3) The authors' interpretation of the key results implies that the abstract coding learned over the task should be relevant for behaviour. The current results do not show a particularly strong behavioural relevance of coding, to put it mildly. It might be worth exploring whether neural coding expresses itself in reaction times, rather than (in)correct responses, and reflecting on the (lack of) behavioural relevance in the Discussion.
 
 (4) All data and experiment/analysis code should be made available, in public repositories (i.e., not "upon request").
 
 Review 1
3. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 This manuscript describes an experiment in which subjects learned to apply an XOR rule in a task in which an initial color cue conditioned the instruction ("press left" or "press right") conveyed by a subsequent shape.
 
 This manuscript gives the impression of being written to address a sophisticated computational framework, but the experiment was not designed to test this framework. Stated differently, the memory-as-resource-for-computations framework may not be needed to account for the results presented here. Variants of this task have been used for decades, often in the context of prospective processing, and although the authors emphasize a dimensionality reduction operation, the task may actually only require the recoding of retrospectively relevant sensory information into the prospectively relevant rule that is needed to guide the response on that trial. Consequently, many of the claims are only partially supported.
 
 The framework invoked by the authors is summarized in the second paragraph of the manuscript:
 
 "Insights from machine learning and computational neuroscience further highlight the idea that memory processes can be viewed as a resource for computations rather than a passive mechanism for storage (Dasgupta & Gershman, 2021; Ehrlich & Murray, 2022). In this light, working memory adapts computations to the current task demands (Dasgupta & Gershman, 2021); pre-computed information can be stored in working memory, and thus reduce the computation time at the moment of the decision (Braver, 2012; Hunt et al., 2021). This perspective is further supported by computational modelling of neural circuits that contends that working memory will change neural geometry in a way that supports the temporal decomposition of computations (Ehrlich & Murray, 2022). This work suggests that the computational load at the moment of action can be thus alleviated by decomposing complex operations into several simple problems solved sequentially in time."
 
 However, the relevance, certainly the necessity, of this framework leads to mischaracterizations of some elements of the task (including about a hypothesis), the emphasis of constructs that don't actually exist in the task, some logical inconsistencies, and the repeated invocation of operations like "dimensionality reduction" despite the fact that the authors find no evidence for them.
 
 Beginning with the final point, the task presented here is a variant of a Badre-style hierarchical control task, one requiring solution at the second order of abstraction (i.e., the color conditions the interpretation of the shape [2nd order], which then determines the correct response [1st order]. These operations can be accomplished without dimensionality reduction by simply carrying out the remapping instructed by each element. For example, on a trial beginning with a blue color cue, the subject can use a lookup table to translate this into the rule "square = left; diamond = right". When the shape is subsequently presented, the subject responds according to this rule. This is really no different from any of the several studies that have shown prospective recoding of information in working memory, including the work from the 1990s in nonhuman primates, and several subsequent studies using fMRI in humans beginning in the 2000s. Importantly, this account does not involve dimensionality reduction in any overt way. If it were the case that the more recent computational work indicates that this operation of "prospective recoding" does, in fact, entail dimensionality reduction on this type of task, that would be interesting. However, I don't see evidence that this is the case. Although the authors carry out several analyses of shattering dimensionality, I do not find any that track this measure across epochs within the trial, an approach that would presumably capture epoch-to-epoch dimensionality reduction, if it occurred.
 
 With regard to mischaracterization of a hypothesis, the authors state: "We hypothesised that working memory processes control the dimensionality of neural representations by selecting features for maintenance. We tested this prediction by exploring the learning dynamics of the colour representation." However, what is described here is not a test of a prediction about dimensionality reduction. Rather, it's a test of a prediction that color decoding would not persist after color offset. To describe this as "dimensionality reduction" misrepresents/mischaracterizes what's happening, which is the translation of color (on any trial, a low-dimensional variable) into the rule that was cued by that color. It is a translation of what kind of information is being represented, as opposed to a dimensionality reduction applied to a representation.
 
 With regard to constructs that don't actually exist, it is unclear what the reality is in the study of a "color pair"? I.e., because colors are never presented together, nor associated in some way, this would seem to be a device that's helpful to the authors for thinking about how their task might be solved, rather than a fundamental aspect of the task that the reader needs to understand. Furthermore, the example given here wasn't helpful for this reader. (What WAS helpful was the description of the two possible strategies and accompanying references to Mayr & Kleigel and to Vandierendonck.)
 
 With regard to logical inconsistencies, one is the notion that color is irrelevant. This is not true, in a literal sense, because if every color cue were rendered as the same monochromatic patch, one wouldn't be able to solve the task. What the authors could do to make their point is perhaps refer to Strategy 1, which corresponds to a less efficient way to solve the task.
 
 Also inconsistent is the relation of the present work to a previous study carried out by this group in nonhuman primates. That task did not include a working memory delay, and so this is difficult to reconcile the comparison that the authors draw with this task with the many suggestions that they make that it's something about WM, per se, that allows for the efficient performance of this task.
 
 "Crucially, the irrelevant feature was only discarded during the delay after it entered working memory." This statement is in direct contradiction with the authors' own reporting of the results: "Decoding analyses demonstrated that colour information peaked in the early colour locked period of the trial and then rapidly declined over time to reach chance levels before the delay-locked period, 𝑐𝑙𝑢𝑠𝑡𝑒𝑟 1: 0.082 − 0.484 𝑚𝑠, 𝑝 = 0.006 (Fig. 2c)."
 
 Other areas where I had difficulties include:
 
 (1) "These results suggest that participants rapidly discarded irrelevant colour information. Only information relevant for performance (context) entered working memory and was maintained." Although this may be the case, each of the four colors also instructed a rule, and so what's being documented in this study is the translation of a cue into a rule, not the transformation of a "meaningless color" into a "meaningful context." It is very possible that if the authors only used two colors, one for each rule (i.e., one for each "context"), they'd get the same decoding results.
 
 (2) "A defining characteristic of low-dimensional task representations is that they can be easily cross-generalised to different sensory instances of the same task." This result is difficult to reconcile with the loss of color decoding with color offset. Must it not mean that the rule is being represented differently when cued, e.g., by blue vs. by pink, or by green vs. by khaki? If this is true, then this would also argue against the idea of dimensionality reduction during the delay period, because subjects will, in effect, have swapped needing to represent one of four colors with needing to represent one of four rules.
 
 (3) The authors assert that "cross-colour generalisation of context in the delay period is already implied by the significant context decoding combined with the absence of irrelevant colour coding." This is contradicted, however, by the failure of the direct test of cross-color decoding!
 
 (4) "Taken together, these findings imply that participants constructed abstract representations of task features but that the mechanism responsible for this transformation relied heavily on discarding colour information early in trial time."
 
 This statement does not follow from the data because no mechanism is being directly measured. Rather, it's simply the case that after translating the color to a rule, the color is no longer needed and so is no longer kept in an active state. There is certainly no evidence for "heavy reliance".
 
 Review 2
4. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 Wójcik and colleagues investigated how the maintenance of task information in working memory influences the dimensionality of task representations. The task required an exclusive-or (XOR) mapping as the output by combining stimulus features separated by a delay period. The authors found that context information invariant to input features (i.e., color) is maintained and enhanced over the course of learning the task.
 
 The significance of this study lies in its demonstration of how learning selectively changes the geometry of task representations. The clear-cut results emphasize that learning promotes the abstraction of task representations for context-dependent computations. It is also important to investigate how working memory mechanisms contribute to the geometry and optimization of task representations, as such studies in humans are scarce.
 
 Strengths:
 
 (1) The task design and analyses are clear.
 
 (2) The theoretical motivation to study low-dimensional representations and temporal decomposition is strong. Understanding how learning changes these qualities is a novel and important question.
 
 Weaknesses:
 
 (1) The specific contribution of working memory maintenance to the dimensionality and abstraction of representations is unclear. While the task likely recruits working memory, there are no direct assessments linking the observed results to particular qualities or mechanisms of working memory. In other words, neural representations observed during the delay period are interpreted as working memory.
 
 (2) The dissociation between XOR and motor representations is ambiguous, as they only become distinguishable during error trials. Additionally, they show similar time courses and learning-related changes.
 
 Review 3
5. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Author Response:
 
 Reviewer #1( Public review):
 
 The reviewer raised two main concerns: the potential confound between XOR and motor coding, and the relationship between neural coding and behaviour.
 
 First, we appreciate the consideration of the collinearity between the XOR and motor dimensions. We fully agree that this confound may have contributed to the observed increase in XOR decoding over the course of learning. In response, we will merge the XOR and motor features in the main figures, tone down our interpretation of the XOR learning effect, and clarify how motor signals may obscure or mimic XOR-related changes. As the reviewer noted, this confound does not affect the colour/context cross-generalisation analyses, which remain central to our conclusions regarding flexible and prospective working memory coding.
 
 We also thank the reviewer for the suggestion to examine the behavioural relevance of the neural representations more directly. We agree entirely, and will incorporate new analyses relating coding strength to reaction times, as well as reflect on the implications of these results in the revised Discussion.
 
 Reviewer #2 (Public Review):
 
 The reviewer rightly noted that our manuscript overlooks the established concept of retrospective/prospective coding in working memory, giving the impression that we attempted to reframe it using newer machine learning terminology. We thank the reviewer for catching this important omission. Our intention was not to override this well-established conceptual framework with a newer machine learning term, but rather to build upon it. In fact, prospective coding and the idea of working memory as a resource for computation are closely related—one helps define the functions (prospective and retrospective coding) and the other explains the computational rationale behind applying them. For example, prospective codes specify what is being stored (future-relevant information), while the “memory-as-computation” view addresses why such representation is useful: to enable temporal decomposition of complex tasks and reduce computational load at decision time. We will revise the relevant paragraphs to explicitly reference this cognitive framework and clarify how it relates to — and is complemented by — the newer computational perspective we introduce. Thank you again for highlighting this.
 
 Reviewer 2 also argues that the evidence presented does not support dimensionality reduction, noting that participants likely transition from processing the sensory cue (e.g., blue) to a rule-based representation (e.g., context 1 vs context 2) later in the trial, and that this remapping does not inherently require dimensionality reduction. We agree that our results are consistent with such a transformation into an abstract rule representation during the delay period, as supported by the observed cross- colour context generalisation (Figure 3b) and that this process does not require dimensionality reduction per se. However, we would like to clarify that a shared decision boundary between two colour pairs (e.g., context 1 vs context 2) can manifest in two types of neural geometries. In one case — observed in our data — the irrelevant colour dimension is not maintained after the presentation period, such that blue and pink are maintained as context 1 but variance along the blues vs pink dimension is not represented in neural activity. In the other case, it is possible for the same abstract rule (context 1) to be constructed while maintaining the sensory representation of colour (e.g., “blue” or “pink”), resulting in a change in representational geometry without a reduction in dimensionality. Our data do not support the latter scenario: irrelevant colour information is not maintained in the delay period, suggesting that the abstraction is accompanied by a loss of variance along irrelevant sensory dimensions—i.e., a form of dimensionality reduction. We will clarify this point in the revised manuscript and include a new analysis that explicitly tests whether shattering dimensionality changes as a function of trial time.
 
 The reviewer also raised concerns about inconsistencies in our terminology, particularly the use of “colour pair” and “irrelevant colour.” We agree with the reviewer that the term “colour pair” was a conceptual device rather than a literal aspect of the task, and we will revise the text to make this clear. We recognise that our wording around “irrelevant colour” might have caused confusion. We did not mean “colour” in the broad sense of all colour processing, but rather referred to specific colour dimensions that are not relevant for task performance—for example, when context 1 is cued by both pink and blue, the dimension carrying variance between blue and pink can be considered irrelevant. We will clarify this point in the revised manuscript, using the reviewer’s suggestion to incorporate the description we had already provided in the Methods section.
 
 While we respectfully disagree with the reviewer’s interpretation of our findings—particularly regarding the absence of dimensionality reduction, which they associate with the failure of the direct test of cross-colour context decoding (see Fig. 3b, which shows a significant effect)—we appreciate the opportunity to clarify our position and will revise the manuscript to ensure our reasoning is as transparent and rigorous as possible.
 
 Reviewer #3 (PublIc review):
 
 The reviewer values the study’s demonstration that learning promotes abstraction in task representations, but raises concerns about the lack of direct evidence linking delay-period activity to specific working memory mechanisms and the ambiguous dissociation between XOR and motor representations. We thank the reviewer for their careful reading of the manuscript and will address both concerns in the revised version. As mentioned in our response to Reviewer #1, we will merge the motor and XOR analyses, tone down our interpretations, and clarify why these signals are entangled. Additionally, we will link delay-period neural activity to behavioural performance to establish a more direct connection to working memory processes. Notably, in Figure 4f, we show that early in learning, participants who exhibit stronger cross-generalisation of context during the delay are also more likely to exhibit decreased shattering dimensionality at decision time — providing an early link between the preparation of a contextual signal and the subsequent reduction in computational complexity at decision time. We will include additional analyses to further strengthen this link in the revised manuscript.
 
 AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.01.21.634110v1
www.biorxiv.org www.biorxiv.org

Hypothalamic deiodinase type-3 establishes the period of circannual interval timing in mammals

4
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study provides potentially important findings on the understanding of circannual timing in mammals, for which iodothyronine deiodinases (DIOs) have been suggested to be of critical importance, yet functional genetic evidence has been missing. The authors aim to implicate dio3, the major inactivator of the biologically active thyroid hormone T3, in circannual timing in Djungarian hamsters, using a combination of correlative and gene knock-out experiments. Currently, several questions have been raised concerning either the methodological description and/or the design of the experiments, and so the experimental evidence is considered incomplete.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Circannual timing is a phylogenetically widespread phenomenon in long-lived organisms and is central to the seasonal regulation of reproduction, hibernation, migration, fur color changes, body weight, and fat deposition in response to photoperiodic changes. Photoperiodic control of thyroid hormone T3 levels in the hypothalamus dictates this timing. However, the mechanisms that regulate these changes are not fully understood. The study by Stewart et al. reports that hypothalamic iodothyronine deiodinase 3 (Dio3), the major inactivator of the biologically active thyroid hormone T3, plays a critical role in circannual timing in the Djungarian hamster. Overall, the study yields important results for the field and is well-conducted, with the exception of the CRISPR/Cas9 manipulation.
  
  Figure 1 lays the foundation for examining circannual timing by establishing the timing of induction, maintenance, and recovery phases of the circannual timer upon exposure of hamsters to short photoperiod (SP) by monitoring morphological and physiological markers. Measures of pelage color, torpor, body mass, plasma glucose, etc, established that the initiation phase occurred by weeks 4-8 in SP, the maintenance by weeks 12-20, and the recovery after week 20, where all morphological and physiological changes started to reverse back to long photoperiod phenotypes. The statistical analyses look fine, and the results are unambiguous. Their representation could, however, be improved. In Figures 1d and 1e, two different measures are plotted on each graph and differentiated by dots and upward or downward arrowheads. The plots are so small, though, that distinguishing between the direction of the arrows is difficult. Some color coding would make it more reader-friendly. The same comment applies to Figure S4. The authors went on to profile the transcriptome of the mediobasal and dorsomedial hypothalamus, paraventricular nucleus, and pituitary gland (all known to be involved in seasonal timing) every 4 weeks over the different phases of the circannual interval timer. A number of transcripts displaying seasonal rhythms in expression levels in each of the investigated structures were identified, including transcripts whose expression peaks during each phase. This included two genes of particular interest due to their known modulation of expression in response to photoperiod, Dio3 and Sst, found among the transcripts upregulated during the induction and maintenance phases, respectively. The experiments are technically sound and properly analyzed, revealing interesting candidates. Again, my main issues lie with the representation in the figure. In particular, the authors should clarify what the heatmaps on the right of Figures 1f and 1g represent. I suspect they are simply heatmaps of averaged expression of all genes within a defined category, but a description is missing in the legend, as well as a scale for color coding near the figure.
  
  Figure 2 reveals that SP-programmed body mass loss is correlated to increased Dio3-dependent somatostatin (Sst) expression. First, to distinguish whether the body mass loss was controlled by rheostatic mechanisms and not just acute homeostatic changes in energy balance, experiments from hamsters fed ad lib or experiencing an acute food restriction in both LP and SP were tested. Unlike plasma insulin, food restriction had no additional effect on SP-driven epididymal fat mass loss (Figure S7). This clearly establishes a rheostatic control of body mass loss across weeks in SP conditions. Importantly, Sst expression in the mediobasal hypothalamus increased in both ad lib fed or restriction fed SP hamsters and this increase in expression could be reduced by a single subcutaneous injection of active T3, clearly suggesting that increase in Sst expression in SP is due to a decrease of active T3 likely via Dio3 increase in expression in the hypothalamus. The results are unambiguous.
  
  Figure 3 provides a functional test of Dio3's role in the circannual timer. Mediobasal hypothalamic injections of CRISPR-Cas9 lentiviral vectors expressing two guide RNAs targeting the hamster Dio3 led to a significant reduction in the interval between induction and recovery phases seen in SP as measured by body mass, and diminished the extent of pelage color change by weeks 15-20. In addition, hamsters that failed to respond to SP exposure by decreasing their body mass also had undetectable Dio3 expression in the mediobasal hypothalamus. Together, these data provide strong evidence that Dio3 functions in the circannual timer. I noted, however, a few problems in the way the CRISPR modification of Dio3 in the mediobasal hypothalamus was reported in Figure S8. One is in Figure S8b, where the PAM sites are reported to be 9bp and 11bp downstream of sgRNA1 and sgRNA2, respectively. Is this really the case? If so, I would have expected the experiment to fail to show any effect as PAM sites need to immediately follow the target genomic sequence recognized by the sgRNA for Cas9 to induce a DNA double-stranded break. It seems that each guide contains a 3' NGG sequence that is currently underlined as part of sgRNAs in both Fig S8b and in the method section. If this is not a mistake in reporting the experimental design, I believe that the design is less than optimal and the efficiencies of sgRNAs are rather low, if at all functional. The authors report efficiencies around 60% (line 325), but how these were obtained is not specified. Another unclear point is the degree to which the mediobasal hypothalamus was actually mutated. Only one mutated (truncated) sequence in Figure S8c is reported, but I would have expected a range of mutations in different cells of the tissue of interest. Although the authors clearly find a phenotypic effect with their CRISPR manipulation, I suspect that they may have uncovered greater effects with better sgRNA design. These points need some clarification. I would also argue that repeating this experiment with properly designed sgRNAs would provide much stronger support for causally linking Dio3 in circannual timing.
  
  A proposed schematic model for mechanisms of circannual interval timing is presented in Figure S9. I think this represents a nice summary of the findings put in a broader context and should be presented as a main figure in the manuscript itself rather than being relayed in supplementary materials.
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Several animals and plants adjust their physiology and behavior to seasons. These changes are timed to precede the seasonal transitions, maximizing chances of survival and reproduction. The molecular mechanisms used for this process are still unclear. Studies in mammals and birds have shown that the expression of deiodinase type-1, 2, and 3 (Dio1, 2, 3) in the hypothalamus spikes right before the transition to winter phenotypes. Yet, whether this change is required or an unrelated product of the seasonal changes has not been shown, particularly because of the genetic intractability of the animal models used to study seasonality. Here, the authors show for the first time a direct link between Dio3 expression and the modulation of circannual rhythms.
  
  Strengths:
  
  The work is concise and presents the data in a clear manner. The data is, for the most part, solid and supports the author's main claims. The use of CRISPR is a clear advancement in the field. This is, to my knowledge, the first study showing a clear (i.e., causal) role of Dio3 in the circannual rhythms in mammals. Having established a clear component of the circannual timing and a clean approach to address causality, this study could serve as a blueprint to decipher other components of the timing mechanism. It could also help to enlighten the elusive nature of the upstream regulators, in particular, on how the integration of day length takes place, maybe within the components in the Pars tuberalis, and the regulation of tanycytes.
  
  Weaknesses:
  
  Due to the nature of the CRISPR manipulation, the low N number is a clear weakness. This is compensated by the fact that the phenotypes shown here are strong enough. Also, this is the only causal evidence of Dio3's role; thus, additional evidence would have significantly strengthened the author's claims. The use of the non-responsive population of hamsters also helps, but it falls within the realm of correlations. Additionally, the consequences of the mutations generated by CRISPR are not detailed; it is not clear if the mutations affect the expression of Dio3 or generate a truncation or deletion, resulting in a shorter protein.
  
  Review 2
4. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  The authors investigated SP-induced physiological and molecular changes in Djungarian hamsters and the endogenous recovery from it after circa half a year. The study aimed to elucidate the intrinsic mechanism and included nice experiments to distinguish between rheostatic effects on energy state and homeostatic cues driven by an interval timer. It also aimed to elucidate the role of Dio3 by introducing a targeted mutation in the MBH by ICV. The experiments and analyses are sound, and the amount of work is impressive. The impact of this study on the field of seasonal chronobiology is probably high.
  
  Even though the general conclusions are well-founded, I have fundamental criticism concerning 3 points, which I recommend revising:
  
  (1) The authors talk about a circannual interval timer, but this is no circannual timer. This is a circa-semiannual timer. It is important that the authors use precise wording throughout the manuscript.
  
  (2) The authors put their results in the context of clocks. For example, line 180/181 seasonal clock. But they have described and investigated an interval timer. A clock must be able to complete a full cycle endogenously (and ideally repeatedly) and not only half of it. In contrast, a timer steers a duration. Thus, it is well possible that a circannual clock mechanism and this circa-semiannual timer of photoperiodic species are 2 completely different mechanisms. The argumentation should be changed accordingly.
  
  (3) The authors chose as animal model the Djungarian hamster, which is a predominantly photoperiodic species and not a circannual species. A photoperiodic species has no circannual clock. That is another reason why it is difficult to draw conclusions from the experiment for circannual clocks. However, the Djungarian hamster is kind of "indifferent" concerning its seasonal timing, since a small fraction of them are indeed able to cycle (Anchordoquy HC, Lynch GR (2000), Evidence of an annual rhythm in a small proportion of Siberian hamsters exposed to chronic short days. J Biol Rhythms 15:122-125.). Nevertheless, the proportion is too small to suggest that the findings in the current study might reflect part of the circannual timing.
  
  Therefore, the authors should make a clear distinction between timers and clocks, as well as between circa-annual and circa-semiannual durations/periods.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.04.22.650143v2
www.biorxiv.org www.biorxiv.org

The C. elegans gustatory receptor homolog LITE-1 is a chemoreceptor required for diacetyl avoidance

4
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  Avoidance of UV and blue light by the nematode C. elegans is mediated by the unusual transmembrane protein LITE-1, a non-canonical photoreceptor. In this valuable work, the authors provide convincing evidence that LITE-1 function is also required for avoidance of very high concentrations of the food-associated cue diacetyl, suggesting that it may also function as a diacetyl chemoreceptor. While the evidence for this idea is incomplete, these intriguing findings suggest an unexpected complexity in the function of this unusual photoreceptor.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This paper describes an interesting phenotype of C. elegans lite-1 mutants. Previous work showed that lite-1 mutants lose a violet/blue light avoidance response. The authors show here that lite-1 mutants also show a defect in negative diacetyl chemotaxis. While wild-type worms avoid diacetyl at high concentrations, lite-1 mutants are instead *attracted* to it. The authors go on to perform Ca2+ imaging in sensory neurons and find that ADL and ASK neurons show altered Ca2+ responses to diacetyl in lite-1 mutants, suggesting LITE-1 is required for these responses. As unc-13 mutants with defective synaptic transmission show similar diacetyl Ca2+ responses as wild-type, this suggests these neurons respond cell autonomously to diacetyl. However, whether lite-1 also acts cell-autonomously is not discussed. Indeed, because unc-13 and lite-1 mutants show different ADL and ASK Ca2+ responses, it seems the diacetyl response regulated by LITE-1 is likely acting outside of those cells. An interesting result that is not commented on is the switching of the valence of the ASK Ca2+ response in lite-1 mutants. ASK neurons still respond to diacetyl, but instead of a strong increase in Ca2+, diacetyl appears to drive it strongly lower. This may be consistent with the switch in valence in the diacetyl chemotaxis assay. It also argues against the idea that LITE-1 is a low-affinity diacetyl receptor that drives avoidance or the Ca2+ responses in ASK, since it is still present in lite-1 mutants. The authors then use a strain that expresses LITE-1 in the body wall muscles and show this expression is sufficient to engender them with sensitivity to diacetyl, as measured through altered swimming and hypercontractility. The authors interpret this result as LITE-1 may act as a diacetyl receptor. The authors test whether a structurally similar molecule, 2,3-pentanedione, shows similar effects, and they find it does. Alpha-fold modeling and molecular docking analysis show where diacetyl might bind to the LITE-1 protein. They then test whether lite-1 mutants show chemotaxis defects to other molecules, as seen with diacetyl. Generally, they find that the observed diacetyl responses are unique, although lite-1 mutants do lose their avoidance response to 2,3-pentanedione. However, unlike the acquisition of diacetyl attraction in lite-1 mutants, 2,3 pentanedione avoidance is *lost*; it is not switched to attraction. Overall, I felt the description of the results and their implications could have been more in-depth. Further, the evidence that LITE-1 is a chemoreceptor itself, rather than acting in some way to shape chemoreceptor responses (via light or otherwise), remains unclear, as conceded by the authors.
  
  Strengths:
  
  Overall, the study follows up on an interesting and useful result. The experiments as presented are generally well-conceived and performed. The authors use a variety of behavioral and imaging approaches to test how LITE-1 mediates diacetyl avoidance.
  
  Weaknesses:
  
  The study is missing experiments needed to resolve whether LITE-1 is doing what they propose. The evidence that LITE-1 is a diacetyl receptor is lacking support since lite-1 mutants have their avoidance and calcium responses flipped, which would not be expected if it were acting solely as an avoidance receptor. Presumably, the authors are concluding that the attractive response that is left in the lite-1 mutant is mediated by ODR-10, but that experiment is not shown. Similarly, the authors concede that "the use of lite-1 point mutants that affect specific LITE-1 function, such as light sensing, channel gating, or binding pocket, could further elucidate LITE-1 mechanisms." This reviewer agrees, and such experiments designed to localize diacetyl binding site(s) would be necessary to conclude definitively that LITE-1 is a diacetyl receptor. The body wall muscle assay used or some other heterologous experimental system could work for such a structure-function analysis. A concern is whether the extensive number of LITE-1 point mutants described in the literature affect cell surface expression vs. receptor function, which might complicate the interpretation of a result showing loss of diacetyl responses.
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Koh and colleagues investigate the broader sensory role of LITE-1, a gustatory receptor previously linked to UV light detection in C. elegans. Their study explores whether LITE-1 also mediates avoidance of specific chemical stimuli-namely, high concentrations of diacetyl and 2,3-pentanedione. They show that LITE-1 is required in the ADL and ASK neurons for calcium responses to diacetyl, and that its expression in body-wall muscles is sufficient to trigger hypercontraction upon odorant exposure. Molecular docking suggests both odorants may directly bind to LITE-1 with micromolar affinity. These findings suggest LITE-1 may act as a multimodal receptor for both light and chemical stimuli.
  
  Strengths:
  
  (1) Methodological Precision: The study is technically strong, with well-executed calcium imaging and quantitative behavioral assays that clearly show neural and muscular responses to chemical stimuli.
  
  (2) Novelty and Scope: The work presents a compelling case for LITE-1 functioning as a multimodal sensor, which is an intriguing expansion of its known role.
  
  (3) Potential Impact: If validated, the findings could significantly advance the understanding of sensory integration in C. elegans, and the tools developed may be broadly useful to the research community.
  
  (4) Relevance to the Field: The study adds to evidence that C. elegans uses non-canonical sensory pathways and may inspire further exploration of multimodal receptor functions in other systems.
  
  Weaknesses:
  
  (1) Lack of Rescue Experiments: The absence of rescue experiments makes it difficult to definitively link the observed phenotypes to loss of lite-1.
  
  (2) Single Loss-of-Function Approach: The reliance on a single genetic mutant limits interpretability. Additional strategies such as RNAi (e.g., neuron-specific knockdown) would provide stronger evidence.
  
  (3) Unclear Neuronal Contribution: While calcium responses in ADL and ASK are reduced, it's unclear which neuron(s) are necessary for behavioral avoidance. Cell-specific rescue or knockdown experiments are needed.
  
  (4) Unvalidated Docking Data: The molecular docking predictions lack experimental validation. Site-directed mutagenesis would be needed to support claims of direct interaction.
  
  (5) Limited Odorant Specificity Testing: Docking analysis does not include non-binding odorants, making it difficult to assess binding specificity.
  
  (6) Incomplete Quantification: Some calcium imaging results (e.g., in AWA neurons of unc-13 mutants) lack statistical comparisons, which limits their interpretive value.
  
  Review 2
4. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  In this work, Brown and colleagues report that the photosensor protein LITE-1 of the nematode C. elegans may also be a chemosensor that can be activated by high concentrations of the compound diacetyl. LITE-1 was described as a putative ion channel of the gustatory receptor family, which is mainly constituted by insect odorant receptors. These form tetrameric ion channels that can be activated by odorants. Specificity is achieved by forming heteromeric channels from three copies of the odorant receptor co-receptor (ORCO) and another subunit that resembles ORCO in the pore-forming C-terminus, but brings in a binding site for the respective odorant. LITE-1 has a very similar structure, according to Alphafold3 predictions, and also carries a binding pocket. In LITE-1, this was proposed to be occupied by a light-absorbing molecule that activates the channel when a photon is absorbed. Alternatively, compounds generated by absorption of high-energy photons may be formed in vivo and bound by the LITE-1 binding pocket. Koh et al. now demonstrate that another, non-light-activated compound, diacetyl, at high concentrations, can activate cells expressing LITE-1. Such (chemosensory) cells are also responsible for the avoidance of high concentrations of diacetyl. LITE-1 activation in excitable cells, i.e, muscles, causes strong body contraction and paralysis, and the authors show that this is also the case when diacetyl is presented. The authors further present molecular docking studies showing that diacetyl could occupy the binding pocket of LITE-1. Last, they show that another compound chemically resembling diacetyl, i.e., 2,3-pentanedione, can also induce avoidance in a LITE-1 dependent manner, though not as potently.
  
  The data are intriguing, and the demonstration of LITE-1 being a diacetyl chemosensor is interesting. Yet, there are a few questions arising that the authors should address.
  
  The authors identified mutants lacking diacetyl responses. In their chemotaxis assay (Figures 1A, B), they show that lite-1 mutants do not avoid high concentrations of diacetyl. However, the animals actually showed attraction, as the chemotaxis index was positive. If the lite-1 animals were insensitive, they should be indifferent, and the chemotaxis index should be close to zero. This means, other neurons contribute to the diacetyl response, and the result of these neurons being activated means/remains attraction? If so, the authors need to rule out any effects of these neurons on the effects they attribute to LITE-1 in the other assays.
  
  The effect of diacetyl on muscle cells (Figure 3C) is pretty rapid, i.e., already during 1 minute after application, the animals are almost maximally contracted. How fast is it really? Can the authors provide a time course with more time points during the first minute? This is a relevant question, as the compound would have to either pass the worm cuticle or enter through the gut and diffuse through the body to reach the muscle cells. Can one expect this to occur within (less than) a minute?
  
  In this context, the authors need to rule out that other mechanisms may be at play. E.g., diacetyl may be immediately sensed by ciliated chemosensory neurons that might release a signaling molecule that leads to activation of LITE-1 in muscles, or that sensitizes it somehow, responding to light used for filming animals. The authors should repeat this assay in a lite-1 mutant background. Furthermore, the authors tested unc-13 mutants to rule out indirect effects on the neurons recorded. Likewise, they should eliminate neuropeptide signaling via unc-31 mutants (a recent paper cited by the authors showed involvement of neuropeptide signaling in LITE-1-mediated light avoidance behavior). Last, to demonstrate that effects are not indirect in response to chemosensory neurons, the authors should repeat the contraction or swimming assay in a tax-4 mutant, which largely lacks chemosensation. This also applies to the chemotaxis assay. Animals should exhibit a chemotaxis index to diacetyl of zero, then.
  
  Does diacetyl activate other neurons expressing LITE-1? A number of cells express LITE-1 at high levels, which the authors have not tested (they restricted their analyses to chemosensory neurons). This is important to address because it leaves the possibility that LITE-1 requires a specific partner only present in these chemosensory neurons to detect diacetyl. This partner would have to be present also in muscles, where diacetyl could activate ectopically expressed LITE-1. According to CeNGEN scRNAseq data, cells expressing LITE-1 can be identified. The ADL and ASH neurons actually come up only at the lowest threshold, so some of the other cells showing much higher levels of LITE-1 mRNAs, i.e., AVG, ALM, PLM, ASG, PHA, PHB, AVM, RIF, or some pharyngeal neurons, should be tested. ASG was among the cells the authors recorded from, but this neuron did not show a response.
  
  The authors need to show that diacetyl responses of ADL and/or ASK can be rescued by expressing LITE-1 specifically in these neurons in a lite-1 mutant background.
  
  Molecular docking studies are not described in detail. How was this done? Diacetyl is a very small molecule. How well can docking algorithms assess this at all? Did the authors preselect the binding pocket, or did the algorithm sample the entire molecular surface of the LITE-1 model and end up with the binding pocket? The latter would be very convincing. The authors should provide control docking experiments with other molecules that caused avoidance in their hands (i.e. benzaldehyde, 2,4,5,trimethlythiazole, isoamyl alcohol, nonanone, octanone), but did not activate LITE-1. Also, they should try docking molecules related to diacetyl, and if there are some that do not dock under the same conditions, such molecules should be used in a behavioral experiment. Ideally, they should also not activate LITE-1. Examples could be, e.g., diacetyl monoxime or 2,4-pentanedione.
  
  Last, the authors should provide a PDB file with the docked diacetyl to allow readers to assess the binding for themselves. Since a large number of mutations of LITE-1 have been reported, it may be that amino acids shown to be essential for LITE-1 function are also required for diacetyl binding. If so, this could be backed up with an experiment.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.04.20.649642v1
www.biorxiv.org www.biorxiv.org

Development of the axonal βII-spectrin periodic skeleton requires active cytoskeletal remodelling

4
1. Public_Reviews 28 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study examines how the neuronal cytoskeleton contributes to the formation of the axonal membrane-associated periodic skeleton (MPS) in embryonic dorsal root ganglia (DRG) neurons, using STED imaging. Conclusions are supported by convincing methods, data, and analyses. This useful work confirms previous data and improves our understanding of the roles of microtubules and actin dynamics in the chronological recruitment of MPS components.
 
 Summary
2. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 The axonal membrane periodic skeleton (MPS) comprises axially aligned tetramers of α and β spectrins that are attached to evenly distributed radial F-actin rings, which maintain a typical spacing of 180 - 190 nm. The exact molecular mechanisms underlying the early organization have been unclear. The focus of this study is on those mechanisms.
 
 This is a comprehensive and professionally carried out study. It brings convincing evidence that intact actin and microtubules are required for normal development of MPS and that the actin-binding and lipid-interacting domains of βII-spectrin are critical for its subplasmalemmal confinement and, subsequently, MPS maturation. However, whilst the study does bring new insights, we are still missing the overall understanding of how everything comes together.
 
 The study describes, using spectrin mutations, that the membrane and actin binding of spectrin are required for the proper organization of MPS. However, it is unclear how everything could come together mechanistically.
 
 The authors follow how the MPS is organized by looking at spectrin. Latrunculin affects actin polymerization, as well as CK666 and formin inhibition, but it remains unclear which actin structures are affected. The same is true for microtubules; while they are affected, we don't know how they are affected.
 
 Review 1
3. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In their manuscript, Bodas et al present a chronological analysis of the development of the axonal MPS in embryonic DRG neurons, using a series of biochemical assays coupled with STED nanoscopy. Several interesting conclusions, well supported by the data presented, are drawn that further our understanding of bII-spectrin axonal recruitment and on the role of microtubules and actin dynamics during the early MPS formation and at the latter stages of neuronal maturation.
 
 Strengths:
 
 The assays presented are well-designed, and the results obtained clearly support the main conclusions drawn by the authors. Their findings highlight important aspects of cytoskeleton regulation and dynamics required for MPS formation/maintenance, i.e, during different stages of neuronal development, that remained undocumented.
 
 Weaknesses:
 
 The study is mostly limited to biochemical assays followed by STED microscopy to analyse MPS periodicity and (in certain cases) axonal diameter. Functional implications of the manipulations done are lacking, as well as analyses of axonal integrity/degeneration. This is a relevant aspect, as some of the effects observed may be a secondary effect of decreased neuronal/axonal viability.
 
 Review 2
4. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 In this study, Shivani Bodas et al. investigate the role of actin, actin-binding proteins, and microtubules in regulating the membrane-associated periodic skeleton (MPS) in neuronal axons. The MPS, first reported by Ke Xu et al. in 2013 (Science), has since been implicated in various neuronal functions, including mechanical support, axonal diameter control, axonal degeneration regulation, and spatial organization of signaling molecules. Given its biological importance, further elucidation of MPS assembly mechanisms is of considerable interest. However, I have concerns regarding the novelty and strength of the conclusions presented in this work. Many of the findings largely reiterate previously published observations, and the most novel conclusions are not fully substantiated by the data.
 
 Strengths:
 
 (1) The MPS represents a structurally and functionally important cytoskeletal system in neurons. Studies aimed at understanding its developmental mechanisms are biologically meaningful and potentially impactful.
 
 (2) The authors attempt to dissect MPS assembly during early neuronal development, a process that could offer mechanistic insight into how the MPS is established and maintained.
 
 Weaknesses:
 
 (1) Limited Novelty Across Results Sections:
 
 Of the seven Results sections, only one (Figure 6) and part of another (Figure 9) present data leading to relatively novel interpretations, specifically, the authors' claim that βII-spectrin is recruited to the axonal cortex via F-actin interactions as early as DIV1, followed by rearrangement into a periodic structure by DIV4. However, this conclusion is not fully supported (see below). The remaining results (Figures 1-5, 7, and 8) largely recapitulate findings reported in earlier studies and thus add limited new knowledge.
 
 (2) Insufficient Evidence for Early Recruitment and Rearrangement of βII-spectrin:
 
 The claim that βII-spectrin is recruited to the axonal cortex via F-actin interactions as early as at DIV 1 and subsequently reorganized into a periodic structure during DIV1-4 is central to the manuscript but lacks robust experimental support.
 
 On Page 17, Line 526, the authors the authors state that " To exclude cytoplasmic spectrin resulting from overexpression, only axons with low expression of βII spectrin-GFP were selected for the analysis". However, selecting for low expression alone does not guarantee the absence of cytoplasmic signal. Without volumetric imaging (e.g., 3D super-resolution imaging to see the cross section of axons), it is difficult to definitively conclude that the FRAP data (Figures 6 and 9) reflect cortical rather than cytoplasmic localization.
 
 Prior FRAP studies (Zhong et al., eLife 2014) observed minimal fluorescence recovery over 1800 seconds in axons expressing βII-spectrin-GFP at low levels, with faster recovery (~200-300 seconds) only evident under high expression conditions. The fast recovery kinetics (tens of seconds) reported in this manuscript could plausibly result from free diffusion of cytoplasmic βII-spectrin-GFP rather than cortical turnover.
 
 Furthermore, on Page 10, Line 310, the authors assert that endogenous βII-spectrin "is recruited early to the axonal cortex, followed by progressive establishment of periodic order". However, the STED images shown in Figure 1 do not convincingly distinguish between cortical and cytoplasmic pools.
 
 As such, the observed disordered βII-spectrin molecules, whether overexpressed or endogenous, could still represent a diffuse cytoplasmic population. An alternative and perhaps more parsimonious interpretation is that βII-spectrin is initially cytoplasmic and only later recruited and arranged into periodic structures at the cortex.
 
 (3) Use of Pharmacological Perturbations:
 
 Like many earlier studies, this manuscript relies heavily on pharmacological perturbation (e.g., cytoskeletal drugs) to assess the roles of actin, actin-binding proteins, and microtubules in MPS assembly. While this approach is widely used, it is important to acknowledge that such agents may have off-target effects. The manuscript would benefit from greater caution in interpreting these results, or better yet, the inclusion of genetic or optogenetic approaches to independently validate these findings.
 
 Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.02.19.639207v2
www.biorxiv.org www.biorxiv.org

Paraventricular Thalamus Hyperactivity Mediates Stress-Induced Sensitization of Unlearned Fear but Not Stress-Enhanced Fear Learning (SEFL)

4
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  These findings are among some of the first to identify a behavioral and neurobiological substrate that disentangles nonassociative from associative fear responses following stress, providing a fundamental push forward in the field. The evidence supporting this is convincing and uses a variety of conceptual and technological approaches. This investigation will be of interest to neuroscientists and behaviourists broadly, as well as clinicians for its relevance to post-traumatic stress disorder.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This study delineates a highly specific role for the pPVT in unconditioned defensive responses. The authors use a novel, combined SEFL and SEFR paradigm to test both conditioned and unconditioned responses in the same animal. Next, a c-fos mapping experiment showed enhanced PVT activity in the stress group when exposed to the novel tone. No other regions showed differences. Fiber photometry measurements in pPVT showed enhancement in response to the novel tone in the stressed but not non-stressed groups. Importantly, there were also no effects when calcium measurements were taken during conditioning. Using DREADDS to bidirectionally manipulate global pPVT activity, inhibition of the PVT reduced tone freezing in stressed mice while stimulation increased tone freezing in non-stressed mice.
  
  Strengths:
  
  A major strength of this research is the use of a multi-dimensional behavioral assay that delineates behavior related to both learned and non-learned defensive responses. The research also incorporates high-resolution approaches to measure neuronal activity and provide causal evidence for a role for PVT in a very narrow band of defensive behavior. The data are compelling, and the manuscript is well-written overall.
  
  Weaknesses:
  
  Figure 1 shows a small, but looks to be, statistically significant, increase in freezing in response to the novel tone in the no-stress group relative to baseline freezing. This observation was also noticed in Figures 2 and 7. The tone presented is relatively high frequency (9 kHz) and high dB (90), making it a high-intensity stimulus. Is it possible that this stimulus is acting as an unconditioned stimulus? In addition, in the final experiment, the tone intensity was increased to 115 dB, and the freezing % in the non-stressed group was nearly identical (~20%) to the non-stressed groups in Figures 1-2 and Figure 7. It seems this manipulation was meant as a startle assay (Pantoni et al., 2020). Because the auditory perception of mice is better at high frequencies (best at ~16 kHz), would the effect seen be evident at a lower dB (50-55) at 9 kHz? If the tone was indeed perceived as "neutral," there should be no freezing in response to the tone. This complicates the interpretation of the results somewhat because while the authors do admit the stimulus is loud, would a less loud stimulus result in the same effect? Could the interaction observed in this set of studies require not a novel tone, but rather a high-intensity tone that elicits an unconditioned response? Along these same lines, it appears there may be an elevation in c-fos in the PVT in the non-stress tone test group versus the no-stress home cage control, and overall it appears that tone increases c-fos relative to homecage. Could PVT be sensitive to the tone outside of stress? Would there be the same results with a less intense stimulus? I would also be curious to know what mice in the non-stressed group were doing upon presentation of the tone besides freezing. Were any startle or orienting responses noticed?
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Nishimura and colleagues present findings of a behavioral and neurobiological dissociation of associative and nonassociative components of Stress Enhanced Fear Responding (SEFR).
  
  Strengths:
  
  This is a strong paper that identifies the PVT as a critical brain region for SEFR responses using a variety of approaches, including immunohistochemistry, fiber photometry, and bidirectional chemogenetics. In addition, there is a great deal of conceptual innovation. The authors identify a dissociable behavior to distinguish the effects of PVT function (among other brain regions).
  
  Weaknesses:
  
  (1) The authors find a lack of difference between the Stress and No Stress groups in pPVT activity during SEFL conditioning with fiber photometry but an increase in freezing with Gq DREADD stimulation. How do authors reconcile this difference in activity vs function?
  
  (2) Because the PVT plays a role in defensive behaviors, it would be beneficial to show fiber photometry data during freezing bouts vs exclusively presented during tone a shock cue presentations.
  
  (3) Similar to the above point, were other defensive behaviors expressed as a result of footshock stress or PVT manipulations?
  
  (4) Tone attenuation in Figure 8 seems to be largely a result of minimal freezing to a 115-dB tone. While not a major point of the paper, a more robust fear response would be convincing.
  
  (5) In the open field test, the authors measure total distance. It would be beneficial to also show defensive behavioral (escape, freezing, etc) bouts expressed.
  
  (6) The authors, along with others, show a behavioral and neural dissociation of footshock stress on nonassociative vs associative components of stress; however, the nonassociative components as a direct consequence of the stress seem to be necessary for enhancement of associative aspects of fear. Can authors elaborate on how these systems converge to enhance or potentiate fear?
  
  (7) In the discussion, authors should elaborate on/clarify the cell population heterogeneity of the PVT since authors later describe PVT neurons as exclusively glutamatergic.
  
  Review 2
4. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The manuscript by Nishimura et al. examines the behavioural and neural mechanisms of stress-enhanced fear responding (SEFR) and stress-enhanced fear learning (SEFL). Groups of stressed (4 x shock exposure in a context) vs non-stressed (context exposure only) animals are compared for their fear of an unconditioned tone, and context, as well as their learning of new context fear associations. Shock of higher intensity led to higher levels of unlearned stress-enhanced fear expression. Immediate early gene analysis uncovered the PVT as a critical neural locus, and this was confirmed using fiber photometry, with stressed animals showing an elevated neural signal to an unconditioned tone. Using a gain and loss of function DREADDs methodology, the authors provide convincing evidence for a causal role of the PVT in SEFR.
  
  Strengths:
  
  (1) The manuscript uses critical behavioural controls (no stress vs stress) and behavioural parameters (0.25mA, 0.5mA, 1mA shock). Findings are replicated across experiments.
  
  (2) Dissociating the SEFR and SEFL is a critical distinction that has not been made previously. Moreover, this dissociation is essential in understanding the behavioural (and neural) processes that can go awry in fear.
  
  (3) Neural methods use a multifaceted approach to convincingly link the PVT to SEFR: from Fos, fiber photometry, gain and loss of function using DREADDs.
  
  Weaknesses:
  
  No weaknesses were identified by this reviewer; however, I have the following comments:
  
  A closer examination of the Test data across time would help determine if differences may be present early or later in the session that could otherwise be washed out when the data are averaged across time. If none are seen, then it may be worth noting this in the manuscript.
  
  Given the sex/gender differences in PTSD in the human population, having the male and female data points distinguished in the figures would be helpful. I assume sex was run as a variable in the statistics, and nothing came as significant. Noting this would also be of value to other readers who may wonder about the presence of sex differences in the data.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.30.657116v1
www.biorxiv.org www.biorxiv.org

Economic and Social Modulations of Innate Decision-Making in Mice Exposed to Visual Threats

4
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  The authors show that innate defensive behavior in mice is shaped by threat intensity, reward value, and social hierarchy, highlighting how value and social context influence instinctive decisions. The authors provide useful behavioural findings supported by strong data, yet the evidence is incomplete due to ambiguities about methodology and the computational model that remains largely descriptive.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This study investigates how mice make defensive decisions when exposed to visual threats and how those decisions are influenced by reward value and social hierarchy. Using a naturalistic foraging setup and looming stimuli, the authors show that higher threat leads to faster escape, while lower threat allows mice to weigh reward value. Dominant mice behave more cautiously, showing higher vigilance. The behavioral findings are further supported by a computational model aimed at capturing how different factors shape decisions.
  
  Strengths:
  
  (1) The behavioral paradigm is well-designed and ethologically relevant, capturing instinctive responses in a controlled setting.
  
  (2) The paper addresses an important question: how defensive behaviors are influenced by social and value-based factors.
  
  (3) The classification of behavioral responses using machine learning is a solid methodological choice that improves reproducibility.
  
  Weaknesses:
  
  (1) Key parts of the methods are hard to follow, especially how trials are selected and whether learning across trials is fully controlled for. For example, it is unclear whether animals are in the nest during the looming stimulus presentations. The main text and methods should clarify whether multiple mice are in the nest simultaneously and whether only one mouse is in the arena during looming exposure. From the description, it seems that all mice may be freely exploring during some phases, but only one is allowed in the arena at a time during stimulus presentation. This point is important for understanding the social context and potential interactions, and should be clearly explained in both the main text and methods.
  
  (2) It is often unclear whether the data shown (especially in the main summary figures) come from the first trial or are averages across several exposures. When is the cut-off for trials of each animal? How do we know how many trial presentations were considered, and how learning at different rates between individuals is taken into account when plotting all animals together? This is important because the looming stimulus is learned to be harmless very quickly, so the trial number strongly affects interpretation.
  
  (3) The reward-related effects are difficult to interpret without a clearer separation of learning vs first responses.
  
  (4) The model reproduces observed patterns but adds limited explanatory or predictive power. It does not integrate major findings like social hierarchy. Its impact would be greatly improved if the authors used it to predict outcomes under novel or intermediate conditions.
  
  (5) Some conclusions (e.g., about vigilance increasing with reward) are counterintuitive and need stronger support or alternative explanations. Regarding the interpretation of social differences in area coverage, it's also possible that the observed behavioral differences reflect access to the nesting space. Dominant mice may control the nest, forcing subordinates to remain in the open arena even during or after looming stimuli. In this case, subordinates may be choosing between the threat of the dominant mouse and the external visual threat. The current data do not distinguish between these possibilities, and the authors do not provide evidence to support one interpretation over the other. Including this alternative explanation or providing data that addresses it would strengthen the conclusions.
  
  (6) While potential neural circuits are mentioned in the discussion, an earlier introduction of candidate brain regions and their relevance to threat and value processing would help ground the study in existing systems neuroscience.
  
  (7) Some figures are difficult to interpret without clearer trial/mouse labeling, and a few claims in the text are stronger than what the data fully support. Figure 3H is done for low contrast, but the interesting findings will be to do this experiment with high contrast. Figure 4H - I don't understand this part. If the amount of time in the center after the loom changes for subordinate mice, how does this lead to the conclusion that they spend most of their time in the reward zone?. Figure 3A - The example shown does not seem representative of the claim that high contrast stimuli are more likely to trigger escape. In particular, the 10% sucrose condition appears to show more arena visits under low contrast than high contrast, which seems to contradict that interpretation. Also, the plot currently uses trials on the Y-axis, but it would be more informative to show one line per animal, using only the first trial for each. This would help separate initial threat responses from learning effects and clarify individual variability.
  
  (8) The analysis does not explore individual variability in behavior, which could be an important source of structure in the data. Without this, it is difficult to know whether social hierarchy alone explains behavioral differences or if other stable traits (e.g., anxiety level, prior experiences) also contribute.
  
  (9) The study shows robust looming responses in group-housed animals, which contrasts with other studies that often require single housing to elicit reliable defensive responses. It would be valuable for the authors to discuss why their results differ in this regard and whether housing conditions might interact with social rank or habituation.
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Zhe Li and colleagues investigate how mice exposed to visual threats and rewards balance their decisions in favour of consuming rewards or engaging in defensive actions. By varying threat intensity and reward value, they first confirm previous findings showing that defensive responses increase with threat intensity and that there is habituation to the threat stimulus. They then find that water-deprived mice have a reduced probability of escaping from low contrast visual looming stimuli when water or sucrose are offered in the environment, but that when the stimulus contrast is high, the presence of sucrose or water increases the probability of escape. By analysing behaviour metrics such as the latency to flee from the threat stimulus, they suggest that this increase in threat sensitivity is due to increased vigilance. Analysis of this behaviour as a function of social hierarchy shows that dominant mice have higher threat sensitivity, which is also interpreted as being due to increased vigilance. These results are captured by a drift diffusion model variant that incorporates threat intensity and reward value.
  
  The main contribution of this work is to quantify how the presence of water or sucrose in water-deprived mice affects escape behaviour. The differential effects of reward between the low and high contrast conditions are intriguing, but I find the interpretation that vigilance plays a major role in this process is not supported by the data. The idea that reward value exerts some form of graded modulation of the escape response is also not supported by the data. In addition, there is very limited methodological information, which makes assessing the quality of some of the analyses difficult, and there is no quantification of the quality of the model fits.
  
  (1) The main measure of vigilance in this work is reaction time. While reaction time can indeed be affected by vigilance, reaction times can vary as a function of many variables, and be different for the same level of vigilance. For example, a primate performing the random dot motion task exhibits differences in reaction times that can be explained entirely by the stimulus strength. Reaction time is therefore not a sound measure of vigilance, and if a goal of this work is to investigate this parameter, then it should be measured. There is some attempt at doing this for a subset of the data in Figure 3H, by looking at differences in the action of monitoring the visual field (presumably a rearing motion, though this is not described) between the first and second trials in the presence of sucrose. I find this an extremely contrived measure. What is the rationale for analysing only the difference between the first and second trials? Also, the results are only statistically significant because the first trial in the sucrose condition happens to have zero up action bouts, in contrast to all other conditions. I am afraid that the statistics are not solid here. When analysing the effects of dominance, a vigilance metric is the time spent in the reward zone. Why is this a measure of vigilance? More generally, measuring vigilance of threats in mice requires monitoring the position of the eyes, which previous work has shown is biased to the upper visual field, consistent with the threat ecology of rodents.
  
  (2) In both low and high contrast conditions, there are differences in escape behaviour between no reward and water or sucrose presence, but no statistically significant differences between water and sucrose (eg, Figure 3B). I therefore find that statements about reward value are not supported by the data, which only show differences between the presence or absence of reward. Furthermore, there is a confound in these experiments, because according to the methods, mice in the no-reward condition were not water deprived. It is thus possible that the differences in behaviour arise from differences in the underlying state.
  
  (3) There is very little methodological information on behavioural quantification. For example, what is hiding latency? Is this the same are reaction time? Time to reach the safe zone? What exactly is distance fled? I don't understand how this can vary between 20 and 100cm. Presumably, the 20cm flights don't reach the safe place, since the threat is roughly at the same location for each trial? How is the end of a flight determined? How is duration measured in reward zone measures, e.g., from when to when? How is fleeing onset determined?
  
  (4) There is little methodological information on how the model was fit (for example, it is surprising that in the no reward condition, the r parameter is exactly 0. What this constrained in any way), and none of the fit parameters have uncertainty measures so it is not possible to assess whether there are actually any differences in parameters that are statistically significant.
  
  Review 2
4. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Male mice were tested in a classic behavioral "flee the looming stimulus" paradigm. This is a purely behavioral study; no neural analyses were done. Mice were housed socially, but faced the looming stimulus individually. Drift-diffusion modeling found that reward-level interacted with threat level such that at low-threat levels, reward contrasted with threat as classically expected (high reward overwhelms low threat, low threat overwhelms low reward), but that reward aligned with threat at higher threat levels.
  
  Note that they define threat level by the darkness of the looming stimulus. I am not sure that darker stimuli are more threatening to mice. But maybe. Figure 3 shows that mice react more quickly to high contrast looming stimuli, but can the authors distinguish between the ability to detect the visual signal from considering it a more dangerous threat? (The fact that vigilance makes a difference in the high contrast condition, not the low contrast condition, actually supports the author's hypotheses here.)
  
  The drift-diffusion model (DDM) is fine. I note that the authors included a "leakage rate", which is not a standard DDM parameter (although I like including it). I would have liked to see more about the parameters. What were the distributions? What did the parameters correlate with behaviorally? I would have liked to see distributions of the parameters under the different conditions and different animals. Figure 2C shows the progression of learning. How do the fit parameters change over time as mice shift from choice to choice? How do the parameters change over mice? How do the parameters change over distance to the threat/distance to safety (as per Fanselow and Lester 1988)? They did a supplemental experiment where the threat arrived halfway along the corridor - we could get a lot more detail about that experiment - how did it change the modeling?
  
  Overall, this is a reasonable study showing mostly unsurprising results. I think the authors could do more to connect the vigilance question to their results (which seems somewhat new to me).
  
  Although the data appear generally fine and the modeling reasonable, the authors do not do the necessary work to set themselves within the extensive literature on decision-making in mice retreating from threats.
  
  First of all, this is not a new paradigm; variants of this paradigm have been used since at least the 1980s. There is an *extensive* literature on this, including extensive theoretical work on the relation of fear and other motivational factors. I recommend starting with the classic Fanselow and Lester 1988 paper (which they cite, but only in passing), and the reviews by Dean Mobbs and Jeansok Kim, and by Denis Paré and Greg Quirk, which have explicit theoretical proposals that the authors can compare their results to. I would also recommend that the authors look into the "active avoidance" literature. Moreover, to talk about a mouse running from a looming stimulus without addressing the other "flee the predator" tasks is to miss a huge space for understanding their results. Again, I would start with the reviews above, but also strongly urge the authors to look at the Robogator task (work by June-Seek Choi and Jeansok Kim, work by Denis Paré, and others).
  
  Similarly, in their anatomical review, they do not mention the amygdala. Given the extensive literature on the role of the amygdala in retreating from danger, both in terms of active avoidance and in terms of encoding the danger itself, it would surprise me greatly if this behavior does not involve amygdala processing. (If there is evidence that the amygdala does not play a role here, but that the superior colliculus does, then that would be a *very* important result that needs to be folded into our understanding of decision-making systems and neural computational processing.)
  
  Second, there is an extensive economic literature on non-human animals in general and on rodents in particular. Again, the authors seem unaware of this work, which would provide them with important data and theories to broaden the impact of their results (by placing them within the literature). First, there are explicit economic literatures in terms of positively-valenced conflicts (e.g., neuroeconomics within the primate literature, sequential foraging and delay-discounting tasks within the rodent literature), but also there is a long history within the rodent conditioning world, such as the classic work by Len Green and Peter Shizgal. I would strongly urge the authors to explore the motivational conflict literature by people like Gavin McNally, Greg Quirk, and Mark Andermann. Again, putting their results into this literature will increase the impact of their experiment and modeling.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.12.653401v1
www.biorxiv.org www.biorxiv.org

Center-surround inhibition by expectation: a neuro-computational account

3
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This is a methodologically rich manuscript that is important for elucidating the neural mechanisms of expectation in perception. The analyses are convincing in extending analogous findings in attention and working memory. With further clarification, the findings will be of broad interest to vision researchers.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors tested two competing mechanisms of expectation: (1) a sharpening model that suppresses unexpected information via center-surround inhibition; (2) a cancelation model that predicts a monotonic gradient response profile. Using two psychophysical experiments manipulating feature space distance between expected and unexpected stimuli, the results consistently supported the sharpening model. Computational modeling further showed that expectation effects were explained by either sharpened tuning curves or tuning shifts. Finally, convolutional neural network simulations revealed that feedback connections critically mediate the observed center-surround inhibition.
  
  Strengths:
  
  The manuscript provides compelling and convergent evidence from both psychophysical experiments and computational modeling to robustly support the sharpening model of expectation, demonstrating clear center-surround inhibition of unexpected information.
  
  Weaknesses:
  
  The manuscript could directly validate the experimental manipulations and address how these results reconcile with existing literature on expectation effects.
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  This is a compelling and methodologically rich manuscript. The authors used a variety of methods, including psychophysics, computational modeling, and artificial neural networks, to reveal a non-monotonic, center-surround "Mexican-hat" profile of expectation in orientation space. Their data convincingly extend analogous findings in attention and working memory, and the modeling nicely teases apart sharpening vs. shift mechanisms.
  
  Strengths:
  
  The findings are novel and important in elucidating the potential neural mechanisms by which expectation shapes perception. The authors conducted a series of well-designed psychophysical experiments to careful examination of the profile of expectation's modulation. Computational modeling also provides further insights, linking the neural mechanisms of expectation to behavioral results.
  
  Weaknesses:
  
  There are several aspects that could be strengthened or clarified.
  
  (1) The sharpening model of expectation can predict surround suppression. The authors could further clarify how the cancellation model predicts a monotonic profile of expectation (Figure 1C) with the highest response at the expected orientation, while the cancellation model suggests a suppression of neurons tuned toward the expected stimulus.
  
  (2) I'm a bit concerned about whether the profile solely arises from modulation of expectation. The two auditory cues are each associated with a fixed orientation, which may be confounded by other cognitive processes like visual working memory or attention (which I think the authors also discussed). Although the authors tried to use SFD task to render orientation task-irrelevant, luminance edges (i.e., orientation) and spatial frequency in gratings are highly intertwined and orientation of the gratings may help recall the first grating's SF (fixed at 0.9 c/{degree sign}), especially given the first and second grating's orientations are not very different (4.8{degree sign}).
  
  (3) For each of the expected orientations (20{degree sign} or 70{degree sign}), the unexpected ones are linearly separable (i.e., all unexpected ones lie on one side of the expected angle). This might further encourage people to shift their attended or expected orientation, according to the optimal tuning hypothesis. Would this provide an alternative explanation to the tuning shift that the authors found?
  
  (4) It is great that the authors conducted computational modeling to elucidate the potential neuronal mechanisms of expectation. But I think the sharpening hypothesis (e.g., reviewed in de Lange, Heilbron & Kok, 2018) focuses on the neural population level, i.e., narrowing of population tuning profile, while the authors conducted the sharpening at the neuronal tuning level. However, the sharpening of population does not necessarily rely on the sharpening of individual neuronal tuning. For example, neuronal gain modulation can also account for such population sharpening. I think similar logic applies to the orientation adjustment experiment. The behavioral level shift does not necessarily suggest a similar shift at the neuronal level. I would recommend that the authors comment on this.
  
  (5) If the orientation adjustment experiment suggests that both sharpening and shifting are present at the same time, have the authors tried combining both in their computational model?
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.08.26.609781v1
www.biorxiv.org www.biorxiv.org

Center-surround inhibition by expectation: a neuro-computational account

3
1. Public_Reviews 28 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This valuable study presents a theoretical framework for building continuous attractor networks that integrate with a wide range of topologies, which are of increasing relevance to neuroscientists. While the work offers solid evidence for most claims, the evidence supporting biological plausibility and key claims - such as the existence of a continuum of stable states and robustness across geometries - is currently incomplete and would benefit from further analysis or discussion. The study will be of interest to computational and systems neuroscientists working on neural dynamics and network models of cognition.
 
 Summary
2. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This is a theoretical study addressing the problem of constructing integrator networks for which the activity state and integrated variables display non-trivial topologies. Historically, researchers in theoretical neuroscience have focused on models with simple underlying geometries (e.g., circle, torus), for which analytical models could be more easily constructed. How these models can be generalised to complex scenarios is, however, a non-trivial question. This is furthermore a time-sensitive issue, as population recordings from the brain in complex tasks and environments increasingly require the ability to construct such models.
 
 I believe the authors do a good job of explaining the challenges related to this problem. They also propose a class of models that, although not fully general, overcome many of these difficulties while appearing solid and well-functioning. This requires some non-trivial mathematics, which is nevertheless conveyed in a reasonably accessible form. The manuscript is well written, and both the methodology and the code are well documented.
 
 That said, I believe the manuscript has two major limitations, which could be addressed in a revision. First, some of the assumptions underlying this class of models are somewhat restrictive but are not sufficiently discussed. Second, although the stated goal of the manuscript is to provide practical recipes for constructing integrator networks, the methods section is not very explicit about the specific steps required for different geometries. I elaborate on these limitations below. 
 
 (1) The authors repeatedly describe MADE as a technique for constructing integrators of specified "topologies and geometries." What do they mean by "geometries"? Intuitively, I would associate geometry with properties beyond topology, such as embedding dimensionality or curvature. However, it is unclear to me to what extent these aspects are explicitly specified or controlled in MADE. It seems that geometry is only indirectly defined via the connectivity kernel, which itself obeys certain constraints (e.g., limited spatial scale; see below). I believe it is important for the authors to clarify what they mean by "geometry." They should also specify which aspects are under their control, and whether, in fact, all geometries can be realized. 
 
 (2) The authors make two key assumptions: that connectivity is purely inhibitory and that the connectivity kernel has a small spatial scale. They state that under these conditions, the homogeneous fixed point becomes unstable, leading to a non-periodic state. However, it seems to me that they do not demonstrate that this emergent state is necessarily a bump localized in all manifold dimensions -- although this is assumed throughout the manuscript. Are other solutions possible or observed? For example, might the network converge to states that are localized in one dimension but extended in another, yielding e.g., stripe-like activity in the plane rather than bumps? In other words, does the proposed recipe guarantee convergence to bumps? This is a critical point and should be clarified. 
 
 (3) Related to the question above: What are the failure modes when these two assumptions are violated? Does the network always exhibit runaway activity (as suggested in the text), or can other types of solutions emerge? It would be useful if the authors could briefly discuss this. 
 
 (4) Again, related to the question above: can this formalism be extended to activity profiles beyond bumps? For example, periodic fields as seen in grid cells, or irregular fields as observed in many biological datasets -- particularly in naturalistic environments? These activity profiles are of key importance to neuroscientists, so I believe this is an important point that should at least be addressed in the Discussion. Can MADE be naturally extended to these scenarios? What are the challenges involved? 
 
 (5) Line 119: "Since σ is the only spatial scale being introduced in the dynamics, we qualitatively expect that a localized bump state within the ball will have a spatial scale of O(σ)." Is this statement always true? I understand that the spatial scale of the synaptic inputs exchanged via recurrent interactions (i.e., the argument of the function f in Equation 1) is characterised by the spatial scale σ. But the non-linear function f could modify that spatial scale -- for example, by "cutting" the bump close to its tip. Where am I wrong? Could the authors clarify? 
 
 (6) The authors provide beautiful intuition about the problem of constructing integrators on non-trivial topologies and propose a mathematically grounded solution using Killing vectors. Of course, solutions based on Killing vectors are more complex than those with constant offsets, which raises the question: Is the brain capable of learning and handling such complex structures? Perhaps the authors could speculate in the Discussion about the biological plausibility of these mechanisms. 
 
 (7) A great merit of this paper is that it provides mathematical tools for neuroscience researchers to build integrators on non-trivial geometries. I found that, although all the necessary information is present in the Methods, the authors could improve the presentation by schematizing the steps required to build each type of model. It would be extremely useful if, for each considered geometry, the authors provided a short list of required components: the manifold P, the choice of distance, and the connectivity offsets defined by the Killing vectors. Currently, this information is presented, but scattered (not grouped by geometry).
 
 Review 1
3. Public_Reviews 28 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The work by Claudi et al. presents a framework for constructing continuous attractor neural networks (CANs) with user-defined topologies and integration capabilities. The framework unifies and generalizes classical attractor models and includes simulations across a range of topologies, including ring, torus, sphere, Möbius band, and Klein bottle. A key contribution of the paper is the introduction of Killing vectors to enable integration on non-parallelizable manifolds. However, the need for Killing vectors currently appears hypothetical, as biologically discovered manifolds-such as rings and tori-do not require them.
 
 Moreover, throughout the manuscript, the authors claim to be addressing "biologically plausible" attractor networks, yet the constraints required by their construction - such as exact symmetry, fine-tuning of weights, and idealized geometry-seem incompatible with biological variability. It appears that "biologically plausible" is effectively used to mean "capable of integration." While these issues do not diminish the contributions of the work, they should be acknowledged and addressed more explicitly in the text. I applaud the authors for their interesting work. Below are my major and minor concerns.
 
 Strengths:
 
 (1) Theoretical framework for integrating CANs The paper introduces a systematic method for constructing continuous attractor networks (CANs) with arbitrary topologies. This goes beyond classical models and includes novel topologies such as the Möbius band, sphere, and Klein bottle. The approach generalizes well-known ring and torus attractor models and provides a unified view of their construction, dynamics, and integration capabilities.
 
 (2) Novel use of killing vector fields A key theoretical innovation is the introduction of Killing vectors to support velocity integration on non-parallelizable manifolds. This is mathematically elegant and extends the domain of tractable attractor models.
 
 (3) Insightful simulations across manifolds The paper includes detailed simulations demonstrating bump attractor dynamics across a range of topologies.
 
 Weaknesses:
 
 (1) Biological plausibility is overstated Despite frequent use of the term "biologically plausible," the models rely on assumptions (e.g., symmetric connectivity, perfect geometries, fine-tuning) that are not consistent with known biological networks, and the authors do not incorporate heterogeneity, noise, or constraints like Dale's law.
 
 (2) Continuum of states not directly demonstrated The authors claim to generate a continuum of stable states but do not provide direct evidence (e.g., Jacobian analysis with zero eigenvalues along the manifold). This weakens the central claim about the nature of the attractor.
 
 (3) Lack of clarity around assumptions Several assumptions and analyses (e.g., symmetry breaking, linearity, stability conditions) are introduced without justification or overstated. The analytical rigor in discussing alternative solutions and bifurcation behavior is limited.
 
 (4) Scalability to high dimensions The authors claim their method scales better than learning-based approaches. This should be better discussed.
 
 Major Concerns
 
 (1) Biological plausibility
 
 The claim that the proposed framework is "biologically plausible" is misleading, as it is unclear what the authors mean by this term. Biological plausibility could include features such as heterogeneity in synaptic weights, randomness in tuning curves, irregular geometries, or connectivity constraints consistent with known biological architectures (e.g., Dale's law, multiple cell types). None of these elements is implemented in the current framework. Furthermore, it is not clear whether the framework can be extended to include such features-for example, CANs with heterogeneous connections or tuning curves. The connectivity matrix is symmetric to allow an energy-based description and analytical tractability, which is fine, but not a biologically realistic constraint. I recommend removing or significantly qualifying the use of the term "biologically plausible."
 
 (2) Continuum of stable states While the authors claim their model generates a continuum of stable states, this is not demonstrated directly in their simulations or in a stability analysis (though there are some indirect hints). One way to provide evidence would be to compute the Jacobian at various points along the manifold and show that it possesses (approximately) zero eigenvalues in the tangent/on-manifold directions at each point (e.g., see Ságodi et al. 2024 and others). It would be especially valuable to provide such analysis for the more complex topologies illustrated in the paper.
 
 (3) Assumptions, limitations, and analytical rigor Some assumptions and derivations lack justification or are presented without sufficient detail. Examples include:
 
 • Line 126: "If the homogeneous state (all neurons equally active) were unstable, there must exist some other stable state, with broken symmetry." Is this guaranteed? In the ring model with ReLU activation, there could also be unbounded solutions-not just bump solutions-and, in principle, there could also be oscillatory or other solutions. In general, multiple states can co-exist, with differing stability. It appears the authors only analyze the homogeneous case and do not study the stability or bifurcations of other solutions, limiting their theoretical work.
 
 • Line 122: "The conditions for the formation..." What are these conditions, precisely? A citation or elaboration would be helpful. Why is the assumption σ≪L necessary, and how does it impact the construction or conclusions?
 
 • The theory relies heavily on exact symmetries and fine-tuned parameters. Indeed, in line 106, the authors write: "We seek interaction weights consistent with the formation, through symmetry breaking." Is this symmetry-breaking necessary for all CANs? Or is it a limitation specific to hand-crafted models (see also below)? There is insufficient discussion of such limitations. For example, it is difficult to envision how the authors' framework might form attractor manifolds with different geometries or heterogeneous tuning curves.
 
 (4) Comparison with models of learned attractors While the connectivity patterns of learned attractors often resemble classical hand-crafted models (e.g., see also Vafidis et al. 2022), this is not always the case. If initial conditions include randomness or if the geometry of the attractor deviates from standard forms, the solutions can diverge significantly from hand-designed architectures. Such biologically realistic conditions highlight the limitations the hand-crafted CANs like those proposed here. I suggest updating the discussion accordingly.
 
 (5) High-Dimensional Manifolds The authors argue that their method scales better than training-based approaches in high dimensions and that it is straightforward to extend their framework to generate high-dimensional CANs. It would be useful for the authors to elaborate further. First, it is unclear what k refers to in the expression k^M used in the introduction. Second, trained neural networks seem to exhibit inductive bias (e.g., Cantar et al. 2021; Bordelon & Pehlevan 2022; Darshan & Rivkind 2022), which may mitigate such scaling issues. To support their claim, the authors could also provide an example of a high-dimensional manifold and show that their framework efficiently supports a (semi-)continuum of stable states.
 
 Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.07.652608v1
www.biorxiv.org www.biorxiv.org

A neural mechanism for compositional generalization of structure in humans

4
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study provides valuable insights into humans' ability to generalize knowledge of learned graph structures to new experiences that share the same structure but are built from different stimuli. However, the evidence for the authors' claims is incomplete, with the main claims of structural generalization and compositionality only partially supported by MEG and behavioral data. This study will be of interest to cognitive neuroscientists studying structure learning and generalization.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary of the paper:
  
  The paper presents an elegant task designed to investigate humans' ability to generalize knowledge of learned graph structures to new experiences that share the same structure but are built from different stimuli. Using behavior and MEG recordings, the authors test evidence for neural representation and application of structural knowledge.
  
  Review overview:
  
  While the task design is elegant, it isn't clear to me that the data support all the claims made in the paper. I have detailed my concerns below.
  
  Major concerns
  
  (1) The authors claim that their findings reveal "striking learning and generalization abilities based on factorization of complex experiences into underlying structural elements, parsing these into distinct subprocesses derived from past experience, and forming a representation of the dynamical roles these features play within distinct subprocesses." And "neural dynamics that support compositional generalisation, consistent with a structural scaffolding mechanism that facilitates efficient adaption within new contexts".
  
  a. First, terms used in these example quotes (but also throughout the paper) do not seem to be well supported by data or the task design. For example, terms such as 'compositional generalisation' and 'building blocks' have important relevance in other papers by (some of) the same authors (e.g., Schwartenbeck et al., 2023), but in the context of this experiment, what is 'composition'? Can the authors demonstrate clear behavioural or neural evidence for compositional use of multiple graph structures, or alternatively remove reference to these terms? In the current paper, it seems to me that the authors are investigating abstract knowledge for singular graph structures (together with the influence of prior learning), as opposed to knowledge for the compound, more complex graph formed from the product of two simpler graphs.
  
  b. While I would like to be convinced that this data provides evidence for the transfer of abstract, structural knowledge, I think the authors either need to provide more convincing evidence or tone down their claims.
  
  Specifically:
  
  (i) Can the increase in neural similarity between stimuli mapping to the same abstract structural sub-process not be explained by temporal proximity in experiencing the transitions (e.g., Cai et al., 2016)? Indeed, behavior seems to be dominated by direct experience of the structure as opposed to applying abstract knowledge of equivalent structures (and, as a result, there is little difference in behavioural performance between experience and inference probes).
  
  (ii) The strongest evidence for neural representation of abstract task structures seems to be the increase in similarity by transition type. But this common code for 'transition type' is only observed for 6-bridge graphs and only for experienced transitions. There was no significant effect in inference probes. Therefore, there doesn't seem to be evidence for the application of a knowledge scaffold to facilitate transfer learning. Instead, the data reflects learning from direct experience and not generalisation.
  
  (iii) The authors frequently suggest that they are providing insight into temporal dynamics, but there is no mention of particular oscillations or particular temporal sequences of neural representation that support task performance.
  
  (2) Regardless of point (b), can the authors provide more convincing evidence for a graph structure being represented per se (regardless of whether this representation is directly experienced or inferred)? From Figure 3C, it seems that the model RDM doesn't account for relative distance within the graph. Do they see evidence for distance coding? Can they reconstruct the graph from representational patterns using MDS?
  
  (3) In general, the figures are not very clear, and the outcome from statistical tests is not graphically shown. The paper would be easier to digest if, for example, Figures 1-2 were made clearer and statistical significance relative to chance were indicated throughout. To give two examples: (i) Figure 1 should clearly indicate what is meant by observed and held-out transitions and whether it is just the transition or also the compound that is new to the participant. (ii) Figure 2D-E could be shown with relevant comparisons and simpler statistical comparisons. Currently, it is hard to follow without carefully reading the legend.
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The authors aimed to investigate the temporal dynamics of how prior experiences shape learning in new complex environments by examining whether the brain reuses abstract structural components from those experiences. They employed a sequence learning task based on graph factorization and recorded neural activity using magnetoencephalography (MEG) to investigate how the underlying graph factors are reused to support learning and inference in a new graph. MEG data was derived from passive stimulus presentation trials, and behavior was assessed through a small number of probe trials testing either experienced or inferred successions in the graph. Representational similarity analysis of the MEG data was performed at a quite aggregated level (the principal components explaining 80% of the variance). The authors report (1) enhanced neural similarity among stimuli that belong to the same graph-factor as well as (2) a correlation between abstract role representations, corresponding to particular positions in the graph, and performance in experience-probes but not in inference-probes.
  
  Strengths & Weaknesses:
  
  (1) The first finding is considered evidence for representational alignment of the graph factors. However, alignment seems to be just one possible arrangement underlying the increased similarity between stimuli of the same vs different graph factors. For instance, a simple categorical grouping of stimuli belonging to the same graph, rather than their structural alignment, could also underlie the reported effect. The wording should be adjusted to avoid overinterpretation.
  
  (2) The second finding of abstract role representations is indeed expected for structural generalisation. While the data presents an interesting indication, its interpretability is constrained by a lack of testing for generalization of the effect to other graph structures (e.g., to rule out graph-specific strategies) as well as the absence of a link to transfer performance in inference-probes. The authors argue that the experienced transitions the classifier was trained on might be more similar in process to the experience-probes than the inference-probes. However, as inference-probes are the key measure of transfer, one could argue that if abstract role representations truly underlie transfer learning, they should be evident in the common neural signal.
  
  (3) The authors write, "we observed a qualitative pattern indicative of increased neural similarity between stimuli that adhered to the same underlying subprocess across task phases. (...) There was a statistically significant interaction effect of condition x graph factor spanning approximately 300 - 680 ms post-stimulus onset". I conclude there was no significant main effect of graph factor, but the relevant statistics are not reported. The authors should report and discuss the complete statistics.
  
  (4) The RSA is performed on highly aggregated data (the PCs that explained 80% of the variance). Could the authors include their rationale for this choice (e.g. over-analysis of sensor-level data)? In case sensor-level analyses have been conducted as well, maybe there are comparisons or implications of the chosen approach that are useful to mention in the discussion. The authors should provide the average and distribution of the number of PCs underlying their analyses.
  
  (5) While the paper is well-written overall, it would benefit from more explicitly identifying the concrete research question and advancing through the results. The authors state their aim as understanding the "temporal dynamics of compositional generalisation", revealing "at which moment during neural information processing are they assembled". They conclude with "providing evidence for temporally resolved neural dynamics that support compositional generalization" and "we show the neural dynamics (...) presented across different task phases...". It remains somewhat vague what specific insight about the process is provided through the temporal resolution (e.g., is the time window itself meaningful, if so, it should be contextualized; is the temporal resolution critical to dissociate subprocesses). The different task phases -initial learning and transfer- are the necessary conditions to investigate transfer learning, but do not by themselves offer a particularly resolved depiction of the process.
  
  Overall, the findings are congruent with prior research on neural correlates of structural abstraction. They offer an elegant, well-suited task design to study compositional representations, replicating the authors' earlier finding and providing temporal information on structural generalisation in a sequence learning task.
  
  Review 2
4. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary
  
  This study investigates how task components can be learned and transferred across different task contexts. The authors designed two consecutive sequence learning tasks, in which complex image sequences were generated from the combination of two graph-based structural "building blocks". One of these components was shared between the prior and transfer task environments, allowing the authors to test compositional transfer. Behavioral analyses using generalized linear models (GLMs) assessed participants' sensitivity to the underlying structure. MEG data were recorded and analyzed using classifications and feature representational similarity analysis (RSA) to examine whether neural similarity increased for stimuli sharing the same relational structure. The paper aims to uncover the neural dynamics that support compositional transfer during learning.
  
  Strengths and weaknesses
  
  I found the methods and task design of this paper difficult to follow, particularly the way stimuli were constructed and how the experimental sequences were generated from the graph structures. These aspects would be hard to replicate without some clarification. I appreciate the integration of behavioral and neuroimaging data. The overall approach, especially the use of compositional graph structures in sequence learning, is interesting and could be used and revised in further studies in compositionality and transfer learning. I appreciated the authors' careful interpretation of their findings in the discussion. However, I would have liked a similar level of caution in the abstract, which currently overstates some claims.
  
  Major Comments:
  
  (1) While the introduction mentions brain areas implicated in the low-dimensional representation of task knowledge, the current study uses M/EEG and does not include source reconstruction. As a result, the focus is primarily on the temporal dynamics of the signal rather than its spatial origins. Although I am not suggesting that the authors should perform source reconstruction in this study, it would strengthen the paper to introduce the broader M/EEG literature on task-relevant representations and transfer. The same applies to behavioral studies looking at structural similarities and transfer learning. I encourage the authors to integrate relevant literature to better contextualize their results.
  
  Duan, Y., Zhan, J., Gross, J., Ince, R. A. & Schyns, P. G. Pre-frontal cortex guides dimension-reducing transformations in the occipito-ventral pathway for categorization behaviors. Current Biology 34, 3392-3404 (2024).
  
  Luyckx, F., Nili, H., Spitzer, B. & Summerfield, C. Neural structure mapping in human probabilistic reward learning. eLife 8, e42816 (2019). (This is in the references but not in the text).
  
  Zhang, M. & Yu, Q. The representation of abstract goals in working memory is supported by task-congruent neural geometry. PLoS biology 22, e3002461 (2024).
  
  L. Teichmann, T. Grootswagers, T. Carlson, A.N. Rich Decoding digits and dice with magnetoencephalography: evidence for a shared representation of magnitude Journal of cognitive neuroscience, 30 (7) (2018), pp. 999-1010
  
  Garner, K., Lynch, C. R. & Dux, P. E. Transfer of training benefits requires rules we cannot see (or hear). Journal of Experimental Psychology: Human Perception and Performance 42, 1148 (2016).
  
  Holton, E., Braun, L., Thompson, J., Grohn, J. & Summerfield, C. Humans and neural networks show similar patterns of transfer and interference during continual learning (2025).
  
  (2) I found it interesting that the authors chose to perform PCA for dimensionality reduction prior to conducting RSA; however, I haven't seen such an approach in the literature before. It would be helpful to either cite prior studies that have employed a similar method or to include a comparison with more standard approaches, such as sensor-level RSA or sensor-searchlight analysis.
  
  (3) Connected to the previous point, the choice to use absolute distance as a dissimilarity measure is not justified. How does it compare to standard metrics such as correlation distance or Mahalanobis distance? The same applies to the use of Kendall's tau.
  
  (4) The analysis described in the "Abstract representation of dynamical roles in subprocesses" does not appear to convincingly test the stated prediction of a structural scaffolding account. The authors hypothesize that if structure and dynamics from prior experiences are repurposed, then stimuli occupying the same "dynamical roles" across different sequences should exhibit enhanced neural similarity. However, the analysis seems to focus on decoding transitions rather than directly assessing representational similarity. Rather, this approach may reflect shared temporal representation in the sequences without necessarily indicating that the neural system generalizes the abstract function or position of a stimulus within the graph. To truly demonstrate that the brain captures the dynamical role across different stimuli, it would be more appropriate to directly assess whether neural patterns evoked by stimuli, in the same temporal part of the sequence, with shared roles (but different visual identities) are more similar to each other than to those from different roles.
  
  (5) In the following section, the authors correlate decoding accuracy with participants' behavioral performance across different conditions. However, out of the four reported correlations and the additional comparison of differences between conditions, only one correlation and one correlation difference reach significance, and only marginally so. The interpretation of this finding should therefore be more cautious, especially if it is used to support a link between neural representations and behavior. Additionally, it is possible that correlation with a more clearly defined or targeted neural signature, more directly tied to the hypothesized representational content, could yield stronger or more interpretable correlations.
  
  Minor Comments:
  
  During preprocessing, sensors were excluded based on an identified noise level. However, the authors do not specify the threshold used to define this noise level, nor do they report how many sensors were excluded per participant. It would be helpful to have these details. Additionally, it is unclear why the authors opted to exclude sensors rather than removing noise with MaxFiltering or interpolating bad sensors. Finally, the authors should report how many trials were discarded on average (and standard deviation) per participant.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.09.20.614119v2
www.biorxiv.org www.biorxiv.org

Compensation of Hyperexcitability with Simulation-Based Inference

2
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study introduces a valuable simulation-based inference (SBI) framework to identify degenerate compensatory mechanisms that stabilize network activity despite neuronal hyperexcitability, a feature common to many brain disorders. By estimating posterior distributions of network parameters, the authors highlight factors such as threshold potential and interneuron-to-principal cell connectivity as key compensators for increased intrinsic excitability and interneuron loss. While the approach is promising and could become a key tool for probing network degeneracy, the study is currently incomplete. To fully realize its potential, the framework requires improved scalability and more rigorous cross-validation.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Joint Public Review:
  
  Summary:
  
  This manuscript couples a 32-parameter model with simulation-based inference (SBI) to identify parameter changes that can compensate for three canonical hyperexcitability perturbations (interneuron loss, recurrent-excitatory sprouting, and intrinsic depolarisation). The study demonstrates a careful implementation of SBI and offers a practical ranking of "compensatory levers" that could, in principle, guide therapeutic strategies for epilepsy and related network disorders.
  
  Strengths:
  
  (1) By analysing three mechanistically distinct hyper-excitable regimes within the same modelling and inference framework, the work reveals how different perturbations require different compensatory interventions.
  
  (2) The authors adopt posterior estimation to systematically rank the efficiency of different mechanisms in balancing hyperexcitability.
  
  (3) Code and data are available.
  
  Weaknesses:
  
  (1) A highly dense presentation of the simulated models and undefined symbols makes it hard for readers outside the modelling community to follow the biological message. An illustration of the models, accompanied by some explanations and references to the main equations and parameters discussed in this paper, would make the first section much more straightforward.
  
  (2) This methodology appears to be a brute-force approach, requiring millions of simulations to tune 32 parameters in a network of 500-700 cells. It isn't scalable. Moreover, the authors did not use cross-validation, which, with a relatively low increase in computational cost, would provide a quantitative measure as to how well it generalizes; this combination raises doubts about both scalability and reliability.
  
  (3) Several parameters remain so broadly distributed after fitting that the model cannot say with confidence which specific changes matter. Therefore, presenting them as "compensatory levers" is somewhat questionable.
  
  (4) Every conclusion is drawn from simulated data; without testing the predictions on recordings, we have no evidence that the proposed interventions would work in real neural tissue. Because today we cannot diagnose which of the three modelled pathological regimes is actually present in vivo, the paper's recommendations cannot yet be used to guide therapy.
  
  Review 1
Visit annotations in context

Tags

Summary

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.01.07.631838v1
osf.io osf.io

A tight relationship between BOLD fMRI activation/deactivation and increase/decrease in single neuron responses in human association cortex

4
1. Public_Reviews 28 Jul 2025
 
 in eLife (unscoped)
 
 eLife Assessment
 
 This valuable short paper is an ingenious use of clinical patient data to address an issue in imaging neuroscience. The authors clarify the role of face-selectivity in human fusiform gyrus by measuring both BOLD fMRI and depth electrode recordings in the same individuals; furthermore, by comparing responses in different brain regions in the two patients, they suggested that the suppression of blood oxygenation is associated with a decrease in local neural activity. The methods are solid and provide a rare dataset of potentially general importance.
 
 Summary
2. Public_Reviews 28 Jul 2025
 
 in eLife (unscoped)
 
 Reviewer #1 (Public review):
 
 Summary:
 
 Measurement of BOLD MR imaging has regularly found regions of the brain that show reliable suppression of BOLD responses during specific experimental testing conditions. These observations are to some degree unexplained, in comparison with more usual association between activation of the BOLD response and excitatory activation of the neurons (most tightly linked to synaptic activity) in the same brain location. This paper finds two patients whose brains were tested with both non-invasive functional MRI and with invasive insertion of electrodes, which allowed the direct recording of neuronal activity. The electrode insertions were made within the fusiform gyrus, which is known to process information abouit faces, in a clinical search for the sites of intractable epilepsy in each patient. The simple observation is that the electrode location in one patient showed activation of the BOLD response and activation of neuronal firing in response to face stimuli. This is the classical association. The other patient showed an informative and different pattern of responses. In this person, the electrode location showed a suppression of the BOLD response to face stimuli and, most interestingly, an associated suppression of neuronal activity at the electrode site.
 
 Strengths:
 
 Whilst these results are not by themselves definitive, they add an important piece of evidence to a long-standing discussion about the origins of the BOLD response. The observation of decreased neuronal activation associated with negative BOLD is interesting because, at various times, exactly the opposite association has been predicted. It has been previously argued that if synaptic mechanisms of neuronal inhibition are responsible for the suppression of neuronal firing, then it would be reasonable
 
 Weaknesses:
 
 The chief weakness of the paper is that the results may be unique in a slightly awkward way. The observation of positive BOLD and neuronal activation is made at one brain site in one patient, while the complementary observation of negative BOLD and neuronal suppression actually derives from the other patient. Showing both effects in both patients would make a much stronger paper.
 
 Comments on revisions:
 
 The material on lines 165-175 should not be left hidden away in the Methods section. This should be highlighted in the Discussion as a limitation of the current study and an issue that could be improved upon in future studies.
 
 Review 1
3. Public_Reviews 28 Jul 2025
 
 in eLife (unscoped)
 
 Reviewer #2 (Public review):
 
 Summary:
 
 This is a short and straightforward paper describing BOLD fMRI and depth electrode measurements from two regions of the fusiform gyrus that show either higher or lower BOLD responses to faces vs. objects (which I will call face-positive and face-negative regions). In these regions, which were studied separately in two patients undergoing epilepsy surgery, spiking activity increased for faces relative to objects in the face-positive region and decreased for faces relative to objects in the face-negative region. Interestingly, about 30% of neurons in the face-negative region did not respond to objects and decreased their responses below baseline in response to faces (absolute suppression).
 
 Strengths:
 
 These patient data are valuable, with many recording sessions and neurons from human face-selective regions, and the methods used for comparing face and object responses in both fMRI and electrode recordings were robust and well-established. The finding of absolute suppression could clarify the nature of face selectivity in human fusiform gyrus, since previous fMRI studies of the face-negative region could not distinguish whether face < object responses came from absolute suppression, or just relatively lower but still positive responses to faces vs. objects.
 
 Weaknesses:
 
 The authors claim that the results tell us about both 1) face-selectivity in the fusiform gyrus, and 2) the physiological basis of the BOLD signal. However, I would like to see more of the data that supports the first claim included in the paper.
 
 The authors report that ~30% of neurons showed absolute suppression, but those data are not shown separately from the neurons that only show relative reductions. It is difficult to evaluate the absolute suppression claim from the short assertion in the text alone (lines 105-106), although this is a critical claim in the paper.
 
 Comments on revisions:
 
 The authors have provided a figure showing one example neuron that shows absolute suppression in their response to reviewers; I would recommend including a similar panel in one of the paper figures showing data averaged across all neurons classified as showing absolute suppression.
 
 Review 2
4. Public_Reviews 28 Jul 2025
 
 in eLife (unscoped)
 
 Reviewer #3 (Public review):
 
 Summary:
 
 In this paper the authors conduct two experiments an fMRI experiment and intracranial recordings of neurons in two patients P1 and P2. In both experiments, they employ a SSVEP paradigm in which they show images at a fast rate (e.g. 6Hz) and then they show face images at a slower rate (e.g. 1.2Hz), where the rest of the images are a variety of object images. In the first patient, they record from neurons over a region in the mid fusiform gyrus that is face-selective and in the second patient, they record neurons from a region more medially that is not face selective (it responds more strongly to objects than faces). Results find similar selectivity between the electrophysiology data and the fMRI data in that the location which shows higher fMRI to faces also finds face-selective neurons and the location which finds preference to non faces also shows non face preferring neurons.
 
 Strengths:
 
 The data is important in that it shows that there is a relationship between category selectivity measured from electrophysiology data and category-selective from fMRI. The data is unique as it contains a lot of single and multiunit recordings (245 units) from the human fusiform gyrus - which the authors point out - is a humanoid specific gyrus.
 
 Weaknesses:
 
 My major concerns are two-fold: (i) There is a paucity of data; Thus, more information (results and methods) is warranted; and in particular there is no comparison between the fMRI data and the SEEG data.
 
 (ii) One main claim of the paper is that there is evidence for suppressed responses to faces in the non-face selective region. That is, the reduction in activation to faces in the non-face selective region is interpreted as a suppression in the neural response and consequently the reduction in fMRI signal is interpreted as suppression. However, the SSVEP paradigm has no baseline (it alternates between faces and objects) and therefore it cannot distinguish between lower firing rate to faces vs suppression of response to faces.
 
 (1) Additional data: the paper has 2 figures: figure 1 which shows the experimental design and figure 2 which presents data, the latter shows one example neuron raster plot from each patient and group average neural data from each patient. In this reader's opinion this is insufficient data to support the conclusions of the paper. The paper will be more impactful if the researchers would report the data more comprehensively.
 
 (a) There is no direct comparison between the fMRI data and the SEEG data, except for a comparison of the location of the electrodes relative to the statistical parametric map generated from a contrast (Fig 2a,d). It will be helpful to build a model linking between the neural responses to the voxel response in the same location - i.e., estimate from the electrophysiology data the fMRI data (e.g. Logothetis & Wandell, 2004)
 
 (b) More comprehensive analyses of the SSVEP neural data: It will be helpful to show the results of the frequency analyses of the SSVEP data for all neurons to show that there are significant visual responses and significant face responses. It will be also useful to compare and quantify the magnitude of the face responses compared to the visual responses.
 
 (c) The neuron shown in E shows cyclical responses tied to the onset of the stimuli, is this the visual response? If so, why is there an increase in the firing rate of the neuron before the face stimulus is shown in time 0? The neuron's data seems different than the average response across neurons; This raises a concern about interpreting the average response across neurons in panel F which seems different than the single neuron responses
 
 (d) Related to (c) it would be useful to show raster plots of all neurons and quantify if the neural responses within a region are homogeneous or heterogeneous. This would add data relating the single neuron response to the population responses measured from fMRI. See also Nir 2009.
 
 (e) When reporting group average data (e.g., Fig 2C,F) it is necessary to show standard deviation of the response across neurons.
 
 (f) Is it possible to estimate the latency of the neural responses to face and object images from the phase data? If so, this will add important information on the timing of neural responses in the human fusiform gyrus to face and object images.
 
 (g) Related to (e) In total the authors recorded data from 245 units (some single units and some multiunits) and they found that both in the face and nonface selective most of the recoded neurons exhibited face -selectivity, which this reader found confusing: They write " Among all visually responsive neurons, we 87 found a very high proportion of face-selective neurons (p < 0.05) in both activated 88 and deactivated MidFG regions (P1: 98.1%; N = 51/52; P2: 86.6%; N = 110/127)'. Is the face selectivity in P1 an increase in response to faces and P2 a reduction in response to faces or in both it's an increase in response to faces
 
 (1) Additional methods (a) it is unclear if the SSVEP analyses of neural responses were done on the spikes or the raw electrical signal. If the former, how is the SSVEP frequency analysis done on discrete data like action potentials? (b) it is unclear why the onset time was shifted by 33ms; one can measure the phase of the response relative to the cycle onset and use that to estimate the delay between the onset of a stimulus and the onset of the response. Adding phase information will be useful.
 
 (2) Interpretation of suppression:
 
 The SSVEP paradigm alternates between 2 conditions: faces and objects and has no baseline; In other words, responses to faces are measured relative to the baseline response to objects so that any region that contains neurons that have a lower firing rate to faces than objects is bound to show a lower response in the SSVEP signal. Therefore, because the experiment does not have a true baseline (e.g. blank screen, with no visual stimulation) this experimental design cannot distinguish between lower firing rate to faces vs suppression of response to faces. The strongest evidence put forward for suppression is the response of non-visual neurons that was also reduced when patients looked at faces, but since these are non-visual neurons, it is unclear how to interpret the responses to faces.
 
 Comments on revisions:
 
 In the revision, the authors added information and answered several of the main questions. Several points remain unanswered because the authors would like to publish a short format paper here, and suggest that answering these questions is outside the scope of the paper. The authors would like to leave some of the more detailed analyses for a subsequent longer paper.
 
 Review 3
Visit annotations in context

Tags

Review 1

Review 2

Review 3

Summary

Annotators

Public_Reviews

URL

osf.io/preprints/osf/4yh36_v3
www.biorxiv.org www.biorxiv.org

High frequency spike inference with particle Gibbs sampling

3
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  In their study, Diana et al. introduce a novel method for spike inference from calcium imaging data using a Monte Carlo-based approach, emphasizing the quantification of uncertainties in spike time estimates through a Bayesian framework. This method employs particle Gibbs sampling for estimating model parameter probabilities, offering accuracy comparable to existing methods with the added benefit of directly assessing uncertainties. The presentation of the underlying methods and its characterization is convincing and it presents a valuable advancement for neuroscientists interested in new approaches for parameter estimation from calcium imaging data.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In this study, Diana et al. present a Monte Carlo-based method to perform spike inference from calcium imaging data. A particular strength of their approach is that they can estimate not only averages but also uncertainties of the modeled process. The authors focus on the quantification of spike time uncertainties in simulated data and in data recorded with high sampling rate in cebellar slices with GCaMP8f, and they demonstrate the high temporal precision that can be achieved with their method to estimate spike timing.
  
  Strengths:
  
  - The author provide a solid ground work for sequential Monte Carlo-based spike inference, which extends previous work of Pnevmatikakis et al., Greenberg et al. and others.
  
  - The integration of two states (silence vs. burst firing) seems to improve the performance of the model.
  
  - The acquisition of a GCaMP8f dataset in cerebellum is useful and helps make the point that high spike time inference precision is possible under certain conditions.
  
  Weaknesses:
  
  - Although the algorithm is compared (in the revised manuscript) to other models to infer individual spikes (e.g., MLSpike), these comparisons could be more comprehensive. Future work that benchmarks this and other algorithms under varying conditions (e.g., noise levels, temporal resolution, calcium indicators) would help assess and confirm robustness and useability of this algorithm.
  
  - The mathematical complexity underlying the method may pose challenges for experimentalist who may want to use the methods for their analyses. While this is not a weakness of the approach itself, this highlights the need for further validation and benchmarking in future work, to build user confidence.
  
  Review 1
3. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  Methods to infer action potentials from fluorescence-based measurements of intracellular calcium dynamics are important for optical measurements of activity across large populations of neurons. The variety of existing methods can be separated into two broad classes: a) model-independent approaches that are trained on ground truth datasets (e.g., deep networks), and b) approaches based on a model of the processes that link action potentials to calcium signals. Models usually contains parameters describing biophysical variables, such as rate constants of the calcium dynamics and features of the calcium indicator. The method presented here, PGBAR, is model-based and uses a Bayesian approach. A novelty of PGBAR is that static parameters and state variables are jointly estimated using particle Gibbs sampling, a sequential Monte Carlo technique that can efficiently sample the latent embedding space.
  
  Strengths:
  
  A main strength of PGBAR is that it provides probability distributions rather than point estimates of spike times. This is different from most other methods and may be an important feature in cases when estimates of uncertainty are desired. Another important feature of PGBAR is that it estimates not only the state variable representing spiking activity, but also other variables such as baseline fluctuations and stationary model variables, in a joint process. PGBAR can therefore provide more information than various other methods. The information in the github repository is well-organized.
  
  Weaknesses:
  
  On the other hand, the accuracy of spike train reconstructions is not higher than that of other model-based approaches, and clearly lower than the accuracy of a model-independent approach based on a deep network. The authors demonstrate convincingly that PGBAR can resolve inter-spike intervals in the range of 5 ms using fluorescence data obtained with a very fast genetically encoded calcium indicator at very high sampling rates (line scans at >= 1 kHz).
  
  Review 2
Visit annotations in context

Tags

Summary

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.04.05.487201v3
www.biorxiv.org www.biorxiv.org

Interplay of YEATS2 and GCDH regulates histone crotonylation and drives EMT in head and neck cancer

2
1. Public_Reviews 28 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  These useful findings assigned a novel functional implication of histone acylation, crotonylation. Mechanistic insights have been provided in great detail regarding the role of the YEATS2-GCDH axis in modulating epithelial-to-mesenchymal transition (EMT) in head and neck cancer, and overall the strength of evidence is solid.
  
  Summary
2. Public_Reviews 28 Jul 2025
  
  in eLife
  
  Joint Public Review:
  
  This manuscript investigates a mechanism between the histone reader protein YEATS2 and the metabolic enzyme GCDH, particularly in regulating epithelial-to-mesenchymal transition (EMT) in head and neck cancer (HNC).
  
  The authors addressed most of the concerns of the reviewers. They have:
  
  (1) Increased the patient cohort size from 10 to 23 for evaluating the levels of YEATS2 and H3K27cr.
  
  (2) Checked the expression of major genes involved in the YEATS2-mediated histone crotonylation axis (YEATS2, GCDH, ECHS1, Twist1, along with H3K27cr levels) in head and neck cancer tissues using immunohistochemistry.
  
  (3) Analyzed publicly available head and neck cancer patient datasets, which revealed a significant positive correlation between YEATS2 expression and increasing tumor grade.
  
  (4) Performed GSEA on TCGA HNC patient samples stratified by high versus low YEATS2 expression. This analysis robustly demonstrated a positive enrichment of metastasis-related gene sets in the high YEATS2 expression group, compared to the low YEATS2 group.
  
  (5) Performed extensive experiments to look into the role of p300 in assisting YEATS2 in regulating promoter histone crotonylation. The p300 was knocked down in BICR10 cells, followed by immunoblotting to assess SPARC protein levels.
  
  (6) Performed co-immunoprecipitation assays to check for an interaction between endogenous YEATS2 and p300. The results clearly demonstrate the presence of YEATS2 in the p300-immunoprecipitate sample, indicating that YEATS2 and p300 physically interact and likely function together as a complex to drive the expression of target genes like SPARC.
  
  (7) Performed RNA Polymerase II ChIP-qPCR on the SPARC promoter in YEATS2 knockdown cells.
  
  (8) To confirm p300's specific role in crotonylation at this locus, they performed H3K27cr ChIP-qPCR after p300 knockdown.
  
  (9) Performed SP1 knockdown (which reduces YEATS2 expression) followed by ectopic YEATS2 overexpression, and then assessed p300 occupancy and H3K27cr levels on the SPARC promoter.
  
  Review 1
Visit annotations in context

Tags

Summary

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.09.24.614679v3
www.biorxiv.org www.biorxiv.org

Superoxide Dismutases maintain niche homeostasis in stem cell populations

4
1. Public_Reviews 25 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 In this work, the authors intend to assess the existence of a redox potential across germline stem cells and neighboring somatic stem cells in the Drosophila testis. Some aspects of the manuscript are solid, like the clear effect of SOD KD on cyst cell differentiation state. Other conclusions of the work, such as the non-autonomous effect of this KD in germ cells are not sufficiently supported by the data. The work is potentially useful if the critiques of the reviewers are fully addressed; the strength of the evidence of the manuscript as it stands is incomplete.
 
 Summary
2. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Mitochondrial staining difference is convincing, but the status of the mitos, fused vs fragmented, elongated vs spherical, does not seem convincing. Given the density of mito staining in CySC, it is difficult to tell what is an elongated or fused mito vs the overlap of several smaller mitos.
 
 I'm afraid the quantification and conclusions about the gstD1 staining in CySC vs. GSCs is just not convincing-I cannot see how they were able to distinguish the relevant signals to quantify once cell type vs the other.
 
 The overall increase in gstD1 staining with the CySC SOD KD looks nice, but again I can't distinguish different cel types. This experiment would have been more convincing if the SOD KD was mosaic, so that individual samples would show changes in only some of the cells. Still, it seems that KD of SOD in the CySC does have an effect on the germline, which is interesting.
 
 The effect of SOD KD on the number of less differentiated somatic cells seems clear. However, the effect on the germline is less clear and is somewhat confusing. Normally, a tumor of CySC or less differentiated Cyst cells, such as with activated JAK/STAT, also leads to a large increase in undifferentiated germ cells, not a decrease in germline as they conclude they observe here. The images do not appear to show reduced number of GSCs, but if they counted GSCs at the niche, then that is the correct way to do it, but its odd that they chose images that do not show the phenotype. In addition, lower number of GSCs could also be caused by "too many CySCs" which can kick out GSCs from the niche, rather than any affect on GSC redox state. Further, their conclusion of reduced germline overall, e.g. by vasa staining, does not appear to be true in the images they present and their indication that lower vasa equals fewer GSCs is invalid since all the early germline expresses Vasa.
 
 The effect of somatic SOD KD is perhaps most striking in the observation of Eya+ cyst cells closer to the niche. The combination of increased Zfh1+ cells with many also being Eya+ demonstrates a strong effect on cyst cell differentiation, but one that is also confusing because they observe increases in both early cyst cells (Zfh1+) as well as late cyst cells (Eya+) or perhaps just an increase in the Zfh1/Eya double-positive state that is not normally common. The effects on the RTK and Hh pathways may also reflect this disturbed state of the Cyst cells.
 
 However, the effect on germline differentiation is less clear-the images shown do not really demonstrate any change in BAM expression that I can tell, which is even more confusing given the clear effect on cyst cell differentiation.
 
 For the last figure, any effect of SOD OE in the germline on the germline itself is apparently very subtle and is within the range observed between different "wt" genetic backgrounds.
 
 Review 1
3. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 The authors want to prove that there is a redox potential between germline stem cells and somatic cyst stem cells in the Drosophila testis, with ROS being higher in the former compared to the latter. They also want to prove that ROS travels from CySCs to GSCs. Finally, they begin to characterize the phenotypes cause by loss of SOD (The function of SOD is to lower ROS levels, and depletion of SOD increases ROS levels) in the tj-Gal4 lineage and how this impacts the germline.
 
 The authors fall short of accomplished their goals in the revised manuscript. There are issues with the concept of the paper (ROS gradient between cells that causes a transfer of ROS across membranes for homeostasis) as this is not supported by the data. In Fig. 1N (tj-SODi), one can see that all of gst-GFP resides within the differentiating somatic cells and none is in the germ cells. Furthermore, the information provided in the materials and methods about quantification of gst-GFP is not sufficient. Focusing on Dlg staining is not sufficient. They need to quantify the overlap of Vasa (a cytoplasmic protein in GSCs) with GFP. I interpret their results as the following: (1) depletion of SOD from somatic support cells leads to autonomous increases in ROS activity; (2) the increase somatic ROS is not transferred to the germline. Instead increase somatic ROS perturbs homeostasis of the somatic linage. As such, the entire premise of the paper is greatly weakened. Additionally, since tj-gal4 is active in hub cells, it is not clear whether the effects of SOD depletion also arise from perturbation of niche cells. These weaknesses negatively impact the conclusions put forward by the authors. As I wrote in my first critique, their data is not compelling: there is no evidence provide by the authors that ROS diffuses from CySCs to GSCs as most of the claims about stem cells is founded on data about differentiating germ and somatic cells.
 
 There are still many issues about the paper apart from the weak premise. First, the authors are studying a developmental affect, rather than an adult phenotype. Second, the characterization of the somatic lineage is incomplete. It appears that high ROS in the somatic lineage autonomously decreases MAP kinase signaling and increases Hh signaling. They assume that the MAPK signaling is due to changes in Egfr activity but there are other tyrosine kinases active in CySCs, including PVR/VEGFR (PMID: 36400422), that impinge on MAPK. In any event, their results are puzzling because lower Egfr should reduce CySC self-renewal and CySC number (Amoyel, 2016) and the ability of cyst cells to encapsulate gonialblasts (Lenhart Dev Cell 2015). The increased Hh should increase CySC number and the ability of CySCs to outcompete GSCs. The fact that the average total number of GSCs declines in tj>SODi testes suggests that high ROS CySCs are indeed outcompeting GSCs. However, as I wrote in my first critique, the characterization of the high ROS soma is incomplete. And the role of high ROS in the hub cells is acknowledged but not investigated.
 
 (1) Concept: The authors still do not describe why would it be important to have a redox gradient across adjacent cells. The paragraph in the introduction (lines 62-76) mentions autonomous ROS levels in stem cells, not the transfer of ROS from one cell to another. And this paragraph is confusing because it starts with the (inaccurate) statement all stem cells have low ROS and then they discuss ISCs, which have high ROS.
 
 (2) Issues with scholarship of the testis. While there has been an improvement in the scholarship of the testis, there are still places where the correct paper is not cited.
 
 a. Lines 80-82 - cite Roach and Lenhart Dev 2024.
 
 b. Lines 86-88. They is no real evidence for concerted division of GSCs and CySCs. In fact, the Dinardo has shown that these stem cells do not divide synchronously (Lenhart and Dinardo, Dev Cell 2015).
 
 (3) Issues with the text;
 
 a. Lines 194-196 - The authors need to cite Tan 2017 (PMID: 28669604) who have already published a paper about what excess ROS does to the GSC lineage.
 
 b. Lines 210-211 - STAT drives expression of ECad. Socs36E and Ptp61F do not drive Ecad. Please correct.
 
 c. Line 225 "uncontrolled proliferation" is an overstatement and should be toned down.
 
 d. Line 237 - Hh-RNAi does not reduce gene dosage (as the authors have written) but it presumably depletes hh mRNAs levels in hub cells and CySCs.
 
 e. Line 147 - C587-Gal4 on its own should not cause a reduction in GSCs. This sentence should be corrected.
 
 f. Lines 177 - why would the authors predict that increasing ROS in GSCs using nos-Gal4 would non-autonomously affect CySCs? The logic is not clear. Please explain.
 
 g. Line 291-293 - this sentence make no sense. Please revise.
 
 Review 3
4. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 In Figure 1, it is very difficult to identify where CySCs end and GSCs begin without using a cell surface marker for these different cell types. In addition, the methods for quantifying the mitochondrial distribution in GSCs vs. CySCs are very much unclear and appear to rely on colocalization with molecular markers that are not in the same cellular compartment (Tj-nuclear vs. Vasa-perinuclear and cytoplasmic) the reader has no way to determine the validity of the mitochondrial distribution. Similarly, the labelling with gstD1-GFP is also very much unclear - I see little to no GFP signal in either GSCs or CySCs in panels 1GK. Lastly, while the expression o SOD in CySCs does increase the gstD1-GFP signal in CySCs, the effects on GSCs claimed by the authors are not apparent.
 
 We appreciate the reviewer’s detailed feedback on Figure 1 and the concerns raised regarding identifying CySCs and GSCs, as well as the methods used for quantifying mitochondrial distribution and gstD1-GFP labeling. Below, we address each point and describe the revisions made to improve clarity and rigor
 
 Distinguishing CySCs and GSCs and Mitochondrial Distribution in GSCs vs. CySCs in Figure1
 
 We acknowledge the difficulty in distinguishing CySCs from GSCs without the use of additional cell surface markers. To improve clarity, we have now included a membrane marker discslarge (Dlg) in our revised Figure 1 and S1 to delineate cell boundaries more clearly. Additionally, we provide higher-magnification images to indicate the mitochondria in CySCs and GSCs. We also agree that ing on mitochondrial distribution might be far-fetched. In the revised manuscript, we have limited our analysis to mitochondrial shape, which was found to be different in GSC and CySC (Fig. 1, D, F, G, and S1B). We have clarified our quantification methods in the revised Methods section, providing details on the image processing and analysis pipeline used to assess mitochondrial distribution.
 
 Clarity of gstD1-GFP Labelling:
 
 We recognize the reviewer’s concern regarding the weak GFP signal in these panels. To improve visualization, we have included fresh set of images by optimizing the contrast and presenting additional monochrome images with higher exposure settings to better illustrate gstD1-GFP expression (Figure 1L,1Q, and S1C’’’-D’’’). Additionally, we have demarcated the cell boundaries using Dlg along with individual labelling of Vasa+ and Tj+ cells. Due to technical difficulty associated with acquisition of images, we could not co-stain Vasa, Tj and Dlg together. Therefore, quantified the gstD-GFP intensity separately for GSCs and CySCs under similar acquisition conditions (Figure 1R).
 
 Effects of SOD depletion on GSCs:
 
 While our initial analysis suggested changes in gstD1-GFP expression in GSCs upon Sod1 depletion in CySCs, we acknowledge that the effects may not be as apparent in the provided images. In response, we have expanded our quantification, included a statistical analysis of gstD1-GFP intensity specifically in GSCs and CySCs (Figure 1S), and added more representative images in the revised figure panels (Figure S1C-D’’’) to support our claims.
 
 In Figure 2, while the cell composition of the niche region does appear to be different from controls when SOD1 is knocked down in the CySCs, at least in the example images shown in Figures 2A and B, how cell type is quantified in figures 2E-G is very much unclear in the figure and methods. Are these counts of cells contacting the niche? If so, how was that defined? Or were additional regions away from the niche also counted and, if so, how were these regions defined?
 
 Thank you for your regarding the quantification of cell types in Figures 2E-G. We counted all cells that were Tj-positive and Zfh1-positive in individual testis, while for GSCs, only those in direct contact with the hub were included. This clarification has been incorporated into the revised figure legend and methods (line no.400-407). We have now provided a clearer description in the text to improve transparency in our analysis.
 
 In Figure 3, it is quite interesting that there is an increase in Eya+, differentiating cyst cells in SOD1 knockdown animals, and that these Eya+ cells appear closer to the niche than in controls. However, this seems at odds with the proliferation data presented in Figure 2, since Eya+ somatic cells do not normally divide at all. Are they suggesting that now differentiating cyst cells are proliferative? In addition, it is important for them to show example images of the changes in Socs36E and ptp61F expression.
 
 Thank you for your insightful observations. We acknowledge the apparent contradiction and appreciate the opportunity to clarify our interpretation.
 
 Regarding the increase in Eya+ differentiating cyst cells in Sod1RNAi individuals and their proximity to the niche, we do not suggest that these differentiating cells are proliferative. Instead, we propose that the knockdown of Sod1 may alter the timing or regulation of cyst cell differentiation, leading to an accumulation of Eya+ cells near the niche. To clarify this point, we have revised the manuscript (line no. 186-189) to emphasize that our proliferation data specifically refers to early-stage somatic cells, not Eya+ differentiating cyst cells.
 
 We also appreciate the reviewer's request for example images illustrating the changes in Socs36E and Ptp61F expression. We could not access the antibodies specific to Socs36E and Ptp61F. Hence, we had to rely on the measurements were obtained using real-time PCR from the tip region of testis. We have clarified the same in the figure legends (line 700).
 
 Overall, the various changes in signaling are quite puzzling-while Jak/Stat signaling from the niche is reduced, hh signaling appears to be increased. Similarly, while the authors conclude that premature differentiation occurs close to the niche, EGF signaling, which occurs from germ cells to cyst cells during differentiation, is decreased. Many times these, changes are contradictory, and the authors do not provide a suitable explanation to resolve these contradictions.
 
 We appreciate the reviewer’s thoughtful feedback on the signaling changes described in our study. We acknowledge that the observed alterations in Jak/Stat, Hedgehog (Hh), and EGF signaling may appear contradictory at first glance. However, our data suggest that these changes reflect a complex interplay between different signaling pathways that regulate cyst cell behavior in response to specific genetic perturbation.
 
 Regarding Jak/Stat and Hh signaling, while Jak/Stat activity is reduced in the niche, the increase in Hh signaling may reflect a compensatory mechanism or a context-dependent response of cyst cells to reduced Jak/Stat input. Prior studies have suggested that Hh signaling can function in parallel and independently of Jak/Stat signaling (PMID: 23175633) and our findings align with this possibility.
 
 The reduction in EGFR signaling in this context appears contradictory to existing literature. One possible explanation is that, the altered GSC -CySC balance and loss of contact in Tj>Sod1i testes, leads to insufficient ligand response, thereby failing to activate EGFR signaling. (line no.222-224, 313-318).
 
 Reviewer #2 (Public review):
 
 We sincerely appreciate the reviewer’s detailed feedback, which has helped refine our manuscript. In this study we have focussed on the role of ROS generated due to manipulation of Sod1 in the interplay between GSC and CySCs. In this regard, we have conducted additional experiments and incorporated quantitative data into the revised manuscript. Additionally, we have refined the text and provided further context to enhance the clarity. Key revisions include:
 
 (1) Clarification of Quantification Methods – We have refined intensity measurements by incorporating a membrane marker (Dlg) to better delineate cell boundaries and have normalized Ptc and Ci expression per cell to improve clarity.
 
 (2) Cell-Specific ROS Measurement – We separately measured ROS in germ cells and cyst cells and performed independent Sod1 depletion in GSCs to determine its direct effects.
 
 (3) Mitochondrial Analysis – We revised our approach, focusing on mitochondrial shape rather than asymmetric distribution, and removed overreaching claims.
 
 (4) Proliferation Analysis – We reanalyzed FUCCI data by normalizing to total cell count, supporting the conclusion that increased proliferation, rather than differentiation delay, underlies the observed phenotype.
 
 (5) E-Cad Quantification – We specifically analyzed E-Cad levels at the GSC-hub interface to strengthen conclusions on GSC attachment.
 
 (6) JAK/STAT Signaling – While we could not obtain a STAT92E antibody, we clarified the spatial limitations of our current analysis and revised the text accordingly.
 
 (7) Rescue Experiments and Gal4 Titration Control – We performed additional control experiments to confirm that observed effects are not due to Gal4 dilution.
 
 (8) Image Quality and Terminology Corrections – We enhanced figure resolution, corrected terminology (e.g., "cystic" to "cyst"), and revised ambiguous phrasing for clarity and accuracy.
 
 As suggested, we have also changed the manuscript title to better align with our results:
 
 Previous Manuscript Title: Non-autonomous cell redox-pairs dictate niche homeostasis in multi-lineage stem populations
 
 Updated Manuscript Title: Superoxide Dismutases maintain niche homeostasis in stem cell populations
 
 Specific responses to the reviewer’s:
 
 While the decrease in pERK in CySCs is clear from the image and matched in the quantification, the increase in cyst cells is not apparent from the fire LUT used. The change in fluorescence intensity therefore may be that more cells have active ERK, rather than an increase per cell (similar arguments apply to the quantifications for p4E-BP or Ptc). Therefore, it is hard to know whether Sod1 knockdownresults in increased or decreased signaling in individual cells.
 
 Thank you for your insightful . To clarify, in the Fire LUT images, only pERK intensity is shown, not the cyst cell number. In our context, while there are more cells, the overall pERK intensity is lower, eliminating any ambiguity about whether the change is occurring per cell or due to an increased number of circulating cells. Moreover, for Ptc and Ci levels, we have normalized Ptc and Ci expression intensity per cell to enhance clarity and ensure an accurate interpretation of signaling changes.
 
 There are several places in which the authors could strengthen their manuscript by explaining the methods more clearly. For example, it is unclear how the intensity graphs in Figure 1Q are obtained. The curves appear smoothed and therefore unlikely to be from individual samples, but this is not clearly explained. However, this quantification method is clearly not helpful, as it shows the overlap between somatic and germline markers, suggesting it cannot accurately distinguish between the two cell types. Additionally, using a nuclear marker (Tj) for the cyst cells and cytoplasmic marker (Vasa) for the germ cells risks being misleading, as one would not expect much overlap between cytoplasmic gstD1-GFP and nuclear Tj. Also related to the methods, it is unclear how Vasa+ cells at the hub were counted. The methods suggest this was from a single plane, but this runs the risk of being arbitrary since GSCs can be distributed around the hub in 3D. (As a note, the label on the graph "Vasa+ cells" is misleading, as there are many more cells that are Vasa-positive than the ones counted.)
 
 We appreciate the reviewer’s careful evaluation of our manuscript and their insightful suggestions for improving the clarity of our methods. Below, we address each concern raised and describe the revisions made accordingly.
 
 Clarification of Intensity Graphs in Figure 1Q
 
 We have removed this graph, as we recognize that the markers previously used were not appropriate for distinguishing the different cell types. To address this concern, we have revised the text and now included a membrane marker discs-large (Dlg) in our revised Figure 1 and S1 to more clearly delineate cell boundaries. Due to technical difficulty associated with acquisition of images, we could not co-stain Vasa, Tj and Dlg together. Therefore, quantified the gstD-GFP intensity separately for GSCs and CySCs under similar acquisition conditions (Figure 1R).
 
 Counting of Vasa+ Cells at the Hub
 
 We appreciate the reviewer’s concern regarding our method for counting Vasa+ cells. In our original analysis, we included GSCs as the Vasa-positive cells that were in direct contact with the hub. To account for the three-dimensional arrangement of GSCs, we used the Cell counter plugin of Fiji and performed counting across different focal planes to ensure all hub-associated cells were considered. For better clarity on cell distribution around the hub, we have presented a single focal place image sliced through mid of the hub zone. To enhance transparency, we have now provided a more detailed explanation of our counting approach in the Methods section (line no 400- 403).
 
 We agree that the label "Vasa+ cells" may be misleading, as many cells express Vasa beyond the specific subset being counted. To address this, we have changed the label to " GSCs" to reflect the subset analyzed more accurately.
 
 The crucial experiment for this manuscript is presented in Figures 1 G-S, arguing that Sod1 knockdown with Tj-Gal4 increases gstD1-GFP expression in germ cells. This needs strengthening as the current quantifications are not convincing and appear to show an overlap between Tj (a nuclear cyst cell marker) and Vasa (a cytoplasmic germ cell marker). Labeling cell outlines would help, or alternatively, labeling different cell types genetically can be used to determine whether the expression is increased specifically within that cell type. Similarly, the measurement of ROS shown in the supplemental data should be conducted in a cell-specific manner. To clearly make the case that Sod1 knockdown in cyst cells is impacting ROS in the germline, it would be important to manipulate germ cell ROS independently. Without this, it will be difficult to prove that any effects observed are a result of increased ROS in the germline rather than indirect effects on the germline of altered cyst cell behaviour.
 
 We appreciate the reviewer’s insightful feedback regarding the specificity of Sod1 knockdown effects in germ cells and the need for clearer quantification in Figures 1G–S. Below, we address each concern and outline the modifications made:
 
 Clarification of Cell Type-Specific Expression:
 
 We acknowledge the overlap observed between Tj (nuclear cyst cell marker) and Vasa (cytoplasmic germ cell marker) in the presented images. To strengthen our claim that gstD1GFP expression increases specifically in germ cells upon Sod1 knockdown, we have now labelled cell outlines using membrane marker discs-large (Dlg) to better distinguish cell boundaries, along with individual labelling of Vasa+ and Tj+ cells. Due to technical difficulty associated with acquisition of images, we could not co-stain Vasa, Tj and Dlg together.
 
 Cell-Specific Measurement of ROS:
 
 We agree that a cell-type-specific ROS measurement is critical to establishing a direct effect on germ cells. To address this, we have now performed ROS measurements separately in germ cells and cyst cells under similar acquisition conditions. These data are now included in the revised (Figure 1R). Similarly, upon CySC-specific Sod1 depletion, we performed measurement of gstD1-GFP intensity which was found to be enhanced in GSCs, along with expected increase in CySCs (Fig 1S). We have independently manipulated ROS levels in GSCs (Nos Gal4> Sod1i) and observed that elevated ROS negatively impacts GSCs, leading to a reduction in their number, while having an insignificant effect on adjacent CySCs.(Fig S2 E, F).
 
 Quantifications of mitochondrial localization in Figure 1 should include some adequate statistical method to evaluate whether the distribution is random or oriented towards the GSC/CySC interface. From the image provided (Figure 1B), it would appear that there are two clusters of mitochondria, on either side of a CySC nucleus, one cluster towards a GSC and one cluster away. Therefore evaluating bias would be important. Additional experiments will be necessary to support the statement that "Redox state of GSC is maintained by asymmetric distribution of CySC mitochondria". This would require manipulating mitochondrial distribution in CySCs.
 
 We appreciate the reviewer’s suggestion regarding the quantification of mitochondrial localization. We agree that ing on mitochondrial distribution might be far-fetched. In revised manuscript, we have demarcated the cell boundary and limited our analysis to mitochondrial shape which was found to be different in GSC and CySC (Fig. 1, D, F, G and S1B). Mitochondrial shape was quantified based on the mitochondrial area and circularity (Figure 1F and G). To prevent any misinterpretation, we have removed the statement, "Redox state of GSC is maintained by asymmetric distribution of CySC mitochondria."
 
 One point raised by the authors is that the increase of somatic cell numbers is driven by accelerated proliferation, based on an increased number of cells in various stages of the cell cycle as assessed by the FUCCI reporter. However, there are more somatic cells in this genetic background, so it could be argued that the observed increase in different phases of the cell cycle is due to an increased number of cells. In order to argue for an increased proliferation rate, the number of cells in each phase should be divided by the total number of cells, expecting to see an increase in S and G2/M phases along with a decrease in G1. Otherwise, the simplest explanation is a block or delay in differentiation, meaning that more cells remain in the cell cycle.
 
 We appreciate the regarding the interpretation of our FUCCI reporter data. We acknowledge that the observed increase in the number of cells in various phases of the cell cycle could be influenced by the overall higher number of somatic cells in this genetic background.
 
 To address this concern, we have now re-analyzed our FUCCI data by normalizing the number of cells in each phase to the total number of cells and we did not observe a significant shift in the proportion of cells in S and G2/M phases relative to G1. This suggests presence of more proliferative cells, that is less cells in Go phase, rather than alterations in the timing of cell cycle progression stages. We are not sure about a block in differentiation because we see an enhanced accumulation of Eya+ cells near the niche. We have also supported our FUCCI data with pH3 staining where we have found more pH3+ spots under SOD1 depleted background. We have revised our manuscript accordingly (Figure 2I, K and S2U) to reflect this interpretation and appreciate the constructive feedback.
 
 In Figure 3, the authors claim that knockdown of Sod1 in the soma decreases the attachment of GSCs to the hub-based on lower E-Cad levels compared to controls. Previous work has shown that in GSCs, E-Cad localizes to the Hub-GSC interface (PMID: 20622868). Therefore, the authors should quantify E-Cad staining at the interphase between the germ cells and the niche.
 
 We appreciate the reviewer’s . As suggested, we have now quantified ECad staining specifically at the interface between the germ cells and the niche. Our analysis confirms that E-Cad levels are significantly reduced at this interphase upon Sod1 knockdown in the soma compared to controls, supporting our conclusion that Sod1 depletion affects GSC attachment to the hub as well as the whole niche. The revised Figure 3M now includes these quantifications, and we have updated the figure legend and results section accordingly.
 
 The authors show decreased expression of the JAK/STAT targets socs36E and ptp61F, arguing that this could be a reason for decreased GSC adhesion to the hub. However, these data were obtained from whole testes and lacked spatial resolution, whereas a STAT92E staining in control and tj>Sod1 RNAi testes could easily prove this point. Indeed, previous work has shown that socs36E is expressed in the CySCs, not GSCs (PMID: 19797664), suggesting that any decrease in JAK/STAT may be autonomous to the CySCs.
 
 We appreciate the reviewer’s observation regarding the spatial resolution of our JAK/STAT target expression analysis. To improve accuracy, we have attempted to collect only the tip of the testes while excluding the rest; however, we acknowledge that this approach may still obscure cell-specific changes. We had attempted to procure the STAT92E antibody but, despite multiple inquiries, we did not receive a positive response. While we agree that STAT92E staining would have strengthen our findings, we are currently unable to perform this experiment. Nevertheless, our observations align with prior work indicating that socs36E is predominantly expressed in CySCs (PMID: 19797664). We have revised the manuscript text accordingly to clarify this limitation.
 
 Additional considerations should be taken regarding the rescue experiments where PI3KDN and Hh RNAi are expressed in a Tj>Sod1 RNAi background. To rule out that any rescue can be attributed to titration of the Gal4 protein when an additional UAS sequence is present, a titration control would be useful. These pathways are not described accurately since Insulin signaling is necessary for the differentiation of somatic cells (not maintenance as written in the text), and its inhibition has been shown to increase the number of undifferentiated somatic cells (PMID:27633989). As far as Hh is concerned, the expression of this molecule is restricted to the niche. It would be important to establish whether the expression is altered in this case, especially as the authors rescue the Sod1 knockdown by also knocking down Hh. One possibility that the authors need to rule out is that some of the effects they observe are due to the knockdown of Sod1 (and/or Hh) in the hub as Tj-Gal4 is expressed in the hub as well as the CySCs (PMID:27546574).
 
 We appreciate the reviewer’s insightful s and suggestions. Below, we address each concern and describe the steps we have taken to incorporate the necessary modifications in our revised manuscript.
 
 Titration Control for Rescue Experiments
 
 We acknowledge the reviewer’s concern regarding potential Gal4 titration effects when introducing additional UAS constructs. To address this, we conducted a control experiment quantifying SOD1 levels in control, Tj > Sod1 RNAi, and Tj > Sod1 RNAi, UAS hhRNAi backgrounds using real-time PCR (Figure S4 M). The Sod1 levels in single and double UAS copy conditions were comparable, indicating that Gal4 titration does not significantly affect the results.
 
 Clarification of Insulin Signaling Role
 
 We appreciate the reviewer’s insight regarding the involvement of insulin signaling in this context. Initially, we included data on PI3K/TOR as we found it intriguing. However, as the data didn’t add much to the overall observations, we have removed them to ensure clarity and prevent any potential confusion.
 
 Hh Expression and Niche Consideration
 
 We recognize the importance of evaluating whether Hedgehog (Hh) expression is altered in the Sod1 RNAi background. We have already quantified hh in qRT-PCR (Figure S4C).
 
 Potential Effects of Sod1 and Hh Knockdown in the Hub
 
 We acknowledge the concern that Tj-Gal4 is expressed in both the hub and CySCs, potentially affecting hub function upon Sod1 and Hh knockdown. To address this, we have included additional data using the CySC-specific driver C-587 Gal4 to distinguish CySC-intrinsic effects from potential hub contributions. Our results show that while the phenotypic changes are consistent across both drivers, the effects are significantly stronger with Tj-Gal4, suggesting a role of the hub in this process. These findings have been incorporated into the revised manuscript (Fig S1G-H, M-N).
 
 In general, the GSCs (and other aspects) are difficult to see in the images; enlargements or higher-resolution images should be provided. Additionally, the manuscript contains several mistakes or inaccuracies (examples include referring to ROS having "evolved" in the abstract when it is cells that have evolved to use ROS, or the references to "cystic" cells when they are usually referred to as "cyst" cells, or that "CySCs also repress GSC differentiation by suppressing transcription of bag-of-marbles" when CySCs produce BMPs that lead to suppression of bam expression in the germline). These would need editing for both clarity and accuracy.
 
 We appreciate the reviewer’s insightful feedback and have made the necessary revisions to address the concerns raised.
 
 Image Clarity and Resolution:
 
 We have provided higher-resolution images in some of the revised images for better understanding. The revised figures now offer better clarity for key observations.
 
 Clarification of Terminology and Accuracy:
 
 The phrase regarding ROS in the abstract has been revised to reflect that cells have evolved to utilize ROS, rather than ROS itself evolving (line no. 27).
 
 References to "cystic" cells have been corrected to "cyst" cells for consistency with standard terminology.
 
 The statement about CySCs repressing GSC differentiation has been revised for accuracy, clarifying that CySCs produce BMPs, which lead to the suppression of bam expression in the germline (line no. 84).
 
 We have carefully reviewed the manuscript for any additional inaccuracies or ambiguities to ensure clarity and precision. We appreciate the reviewer’s constructive s, which have helped improve the manuscript.
 
 Reviewer #3 (Public review):
 
 In response to Reviewer 3’s comments, we would like to highlight the point that in the present study we have focussed on the interplay between CySC and GSC and have accordingly conducted our experiments. We did observe some changes in the hub and do not rule out the effect of hub cells in exacerbating some of our phenotypes. We have included additional controls to highlight the effect of CySC ROS. These points have been appropriately discussed in the manuscript. Key revisions include:
 
 (1) Data Clarity & Visualization: To improve mitochondrial lineage association, we incorporated a membrane marker (Dlg) in Figure 1, enhancing the distinction between CySCs and GSCs. Additionally, we refined gstD-GFP quantifications in individual cell types and provided high-resolution images.
 
 (2) ROS Transfer & Measurement: We revised our discussion to acknowledge indirect ROS transfer mechanisms and added separate ROS quantifications in GSCs and CySCs, confirming higher ROS levels in CySCs (Figure 1R).
 
 (3) Tj-Gal4 Specificity & Niche Characterization: Recognizing Tj-Gal4 expression in hub cells, we included C587-Gal4 as a CySC-specific driver, demonstrating that hub cells contribute partially to the phenotype (Figure S1G,H,M,N).
 
 (4) Signaling Pathway Validation: We optimized dpERK staining, included controls (Tj>EGFRi), and clarified limitations regarding MAPK signaling. Due to lethality, we could not perform an EGFR gain-of-function rescue. We also validated increased Hh signaling via qPCR and a Tj>UAS Ci control (Figure S4).
 
 (5) Conceptual & Terminological Refinements: We revised our discussion of BMP signaling, ROS gradients, and testis-specific terminology. All figures and labels now accurately represent GSC scoring (single Vasa⁺ cells in contact with the niche).
 
 (6) Figure & Methods Improvements: We enhanced image resolution, provided grayscale versions where needed,and expanded Materials & Methods to clarify experimental conditions.
 
 These revisions strengthen our conclusions and address the reviewer’s concerns, ensuring a more precise and transparent presentation of our findings. To align with the reviewer’s s we have changed the title of the manuscript to “Superoxide Dismutases maintain niche homeostasis in stem cell populations”.
 
 Specific responses to the reviewer’s comments:
 
 (1) Data
 
 a. Problems proving which mitochondria are associated with which lineage.
 
 We acknowledge the challenge of distinguishing CySCs from GSCs without additional cell surface markers. To enhance clarity, we have incorporated the membrane marker Discs-large (Dlg) in our revised Figure 1 to better delineate cell boundaries, providing a clearer depiction of mitochondrial distribution in GSCs and CySCs.
 
 b.There is no evidence that ROS diffuses from CySCs into GSCs.
 
 We acknowledge the reviewer’s concern. There are reports which talks about diffusion of ROS across cells on which we have included a few lines in the discussion (line no. 274-276). We do understand that our previous quantifications showed ROS diffusion from CySC to GSC rather indirectly. Therefore, in revised manuscript we have measured ROS separately in the two cell populations. We found that the CySCs show higher ROS profile than GSCs (Fig 1R).
 
 c.The changes in GST-GFP (redox readout) are possibly seen in differentiating germ cells (i.e., spermatogonia) but not in GSCs. This weakens their model that ROS in CySC is transferred to GSCs.
 
 Thank you for your observation. We acknowledge that the changes in gstD-GFP (redox readout) are more prominent in differentiating germ cells. It is known that differentiating cells show higher ROS profile than the stem cells. Hence, expectedly the intensity of gstDGFP was lesser in stem cell zone compared to the differentiating zone. In our manuscript we are focussed on the redox state among stem cell populations. Therefore, we have included better quality images and measured the gstD1-GFP intensity individually in GSCs and CySCs (Figure 1R) by demarcating the cell boundaries (Figure 1M, S1C-D’’’). We found that CySCs show higher ROS profile than GSCs and enhancement of ROS in CySC by Sod1 depletion resulted in a consequent increase in ROS in GSCs. We believe this revision strengthens our model by addressing the potential discrepancy and providing a more comprehensive understanding of ROS dynamics within the GSC niche.
 
 d.Most of the paper examines the effect of SOD depletion (which should increase ROS) on the CySC lineage and GSC lineage. One big caveat is that Tj-Gal4 is expressed in hub cells (Fairchild, 2016), so the loss of SOD from hub cells may also contribute to the phenotype. In fact, the niche in Figure 2D looks larger than the niche in the control in Figure 2C, arguing that the expression of Tj in niche cells may be contributing to the phenotype. The authors need to better characterize the niche in tj>SOD-RNAi testes.
 
 We appreciate the reviewer’s insightful regarding the potential contribution of hub cell to the observed phenotype. We acknowledge that Tj-Gal4 is expressed in hub cells and this could influence the niche size and overall phenotype.
 
 To address this concern, we have included an additional control using C587-Gal4, a CySC specific driver, to distinguish CySC-specific effects from potential hub contributions. All the effects on cell number observed in Tj>Sod1i was replicated in C587>Sod1i testis, except that the observed phenotypes were comparatively weaker. These indicate partial contribution of hub cells to the observed phenotype, exacerbating its severity. However, the effect of Sod1 depletion in CySC on GSC lineages remains significant. These findings have been incorporated into Figure S1- G,H,M and N) and incorporated in the discussion (line no.308311).
 
 e. The Tj>SOD1-RNAi phenotype is an expansion of the Zfh1<sup+ CySC pool, expansion of the Tj+ Zfh1- cyst cells (both due to increased somatic proliferation) and a non-autonomous disruption of the germline.
 
 We appreciate the reviewer’s observation. Our data confirm that Tj>SOD-RNAi leads to an expansion of both Zfh1<sup+ CySCs and Tj+ Zfh1- cyst cells, which we attribute to increased somatic proliferation. Additionally, we observe a non-autonomous disruption of the germline, likely due to dysregulated signaling from the altered somatic niche.
 
 f. I am not convinced that MAPK signaling is decreased in tj>SOD-i testes. Not only is this antibody finicky, but the authors don't have any follow-up experiments to see if they can restore SOD-depleted CySCs by expressing an EGFR gain of function. Additionally, reduced EGFR activity causes fewer somatic cells (not more) (Amoyel, 2016) and also inhibits abscission between GSCs and gonial blasts (Lenhart 2015), which causes interconnected cysts of 8- to 16 germ cells with one GSC emanating from the hub.
 
 We acknowledge that the dpERK antibody can be challenging. We took necessary precautions, including optimizing staining conditions and using positive control (Tj>EGFRi) (Figure: S4B). Our results consistently showed a decrease in dpERK levels in Tj>Sod1i testes, supporting our conclusion.
 
 We agree that inclusion of an experiment using EGFR gain-of-function to rescue the effects of CySC-Sod1 depletion would have strengthened our findings. We had attempted this experiment; however, the progenies constitutively expressing EGFR under Sod1RNAi background were lethal, preventing us from completing the analysis.
 
 We agree that our observations do not align with the reported effects of EGFR signaling on somatic cell numbers and abscission and we appreciate the references provided. Based on our observations, we feel that modulation of MAPK signaling in the niche probably, happens in a context-dependent manner. One possible explanation is that, the altered GSC -CySC balance and loss of contact in Tj>Sod1i testes, leads to insufficient ligand response, thereby failing to activate EGFR signaling. While it is well established that ROS can enhance EGFR signaling to promote cellular proliferation and early differentiation, our results indicate a more nuanced regulation in this context. However, further detailed analysis is required to completely understand the regulatory controls. We have clarified this point in the manuscript (line no.
 
 313-320).
 
 g. The increase in Hh signaling in SOD-depleted CySCs would increase their competitiveness against GSCs and GSCs would be lost (Amoyel 2014). The authors need to validate that Hh protein expression is indeed increased in SOD-depleted CySCs/cyst cells and which cells are producing this Hh. Normally, only hub cells produce Hh (Michel,2012; Amoyel 2013) to promote self-renewal in CySCs.
 
 We appreciate the reviewer’s suggestion regarding the validation of Hh protein expression and its source. Since Tj-Gal4 is expressed in the hub, it is likely activating the Hh pathway and promoting CySC proliferation. Unfortunately, we could not procure Hh antibody to directly assess its protein levels. However, to address this, we performed real-time PCR from RNA derived from the tip region and found a significant increase in hh mRNA levels in SOD-depleted cyst cells. These findings support our hypothesis that elevated Hh signaling enhances CySC competitiveness, leading to GSC loss. To support this idea, we have included a Tj>Ci positive control which caused abnormal proliferation of Tj+ cells resulted in ablation of GSCs. We have incorporated these results in the revised manuscript (Results section, Figure S-4).
 
 h.The increase in p4E-BP is an indication that Tor signaling is increased, but an increase in Tor in the CySC lineage does not significantly affect the number of CySCs or cyst cells (Chen, 2021). So again I am not sure how increased Tor factors into their phenotype.
 
 We acknowledge the reviewer’s concern regarding the role of increased Tor signaling in our phenotype. The observed increase in Tor could indeed be a downstream effect of elevated ROS levels. However, establishing a direct causal relationship between Sod1 and Tor would require additional experiments, which we feel might be a good study in its own merit. To maintain clarity and focus in the revised manuscript, we have opted not to include this preliminary data at this stage.
 
 I.The over-expression of SOD in CySCs part is incomplete. The authors would need to monitor ROS in these testes. They would also need to examine with tj>SOD affects the size of the hub.
 
 We value the reviewer's . To address this, we have now monitored ROS levels in the testes upon SOD overexpression in CySCs using DHE (Figure S5 I). Our results indicate a significant reduction in ROS levels compared to controls.
 
 Additionally, we examined hub size upon Sod1 overexpression and observed a slight, but statistically insignificant, reduction. As our study primarily focuses on ROS-mediated GSCCySC interactions, we did not include a detailed investigation on hub size regulation.
 
 (2) Concept
 
 Why would it be important to have a redox gradient across adjacent cells? The authors mention that ROS can be passed between cells, but it would be helpful for them to provide more details about where this has been documented to occur and what biological functions ROS transfer regulates.
 
 We thank the reviewer for this insightful . We acknowledge that the concept of a redox gradient was not adequately conveyed, as the cell boundary was not clearly defined. To address this, we have revised our interpretation to propose that high ROS levels in one cell may influence the ROS levels in an adjacent cell through either direct transfer or as a secondary effect of altered niche maintenance signaling, rather than through the establishment of a gradient.
 
 Regarding ROS transfer between cells, it has been documented in several biological contexts. For instance, hydrogen peroxide (H2O2) can diffuse through aquaporins, influencing signaling pathways in neighbouring cells (PMID: 17105724). We have incorporated these details and relevant references into the revised manuscript to enhance the conceptual understanding of ROS transfer.
 
 (3) Issues with the scholarship of the testis
 
 a. Line 82 - There is no mention of BMPs, which are the only GSC-self-renewal signal. Upd/Jak/STAT is required for the adhesion of GSCs to the niche but not self-renewal (Leatherman and Dinardo, 2008, 2010). The author should read a review about the testis. I suggest Greenspan et al 2015. The scholarship of the testis should be improved.
 
 We appreciate the reviewer’s feedback regarding the role of BMPs in GSC selfrenewal, we have added this in the revised manuscript (line no. 83) We have now incorporated a discussion on BMP signaling as the primary self-renewal signal for GSCs, distinguishing it from the role of Upd/JAK/STAT in niche adhesion, as highlighted in Leatherman and Dinardo (2010). Additionally, we have cited and reviewed the work by Greenspan et al. (2015) and ensure a more comprehensive discussion of GSC regulation. These revisions can be found in the line no. 285-289 of the revised manuscript.
 
 b. Line 82-84 - BMPs are produced by both hub cells and CySCs. BMP signaling in GSCs represses bam. So it is not technically correct to say the CySCs repress bam expression in GSCs.
 
 We acknowledge the reviewer’s clarification regarding BMP signaling and its role in repressing bam expression in GSCs. We have revised the relevant section (line no.83-85).
 
 c.Throughout the figures the authors score Vasa+ cells for GSCs. This is technically not correct. What they are counting is single, Vasa+ cells in contact with the niche. All graphs should be updated with the label "GSCs" on the Y-axis.
 
 We appreciate the reviewer’s careful assessment of our methodology. We acknowledge that scoring Vasa⁺ cells alone does not definitively identify GSCs. Our quantification specifically considers single Vasa⁺ cells in direct contact with the niche. To ensure clarity and accuracy, we have updated all figure legends and Y-axis labels in the relevant graphs to explicitly state "GSCs" instead of "Vasa⁺ cells."
 
 (4) Issues with the text
 
 a. Line 1: multi-lineage is not correct. Multi-lineage refers to stem cells that produce multiple types of daughter cells. GSCs produce only one type of offspring and CySCs produce only one type of offspring. So both are uni-lineage. Please change accordingly.
 
 We acknowledge the incorrect usage of "multi-lineage" and agree that both GSCs and CySCs are uni-lineage, as they each produce only one type of offspring. We have revised Line 1 accordingly and also updated the title.
 
 b. Lines 62-75 - Intestinal stem cells have constitutively high ROS (Jaspar lab paper), so low ROS in stem cell cells is not an absolute.
 
 We appreciate the clarification. We have revised Lines 62–75 to acknowledge that low ROS is not universal in stem cells, citing the Jaspar lab study on intestinal stem cells (Line 70). Thank you for the valuable insight.
 
 c. Line 79: The term cystic is not used in the Drosophila testis. There are cyst stem cells (CySCs) that produce cyst cells. Please revise.
 
 We have revised the text to replace "cystic" with the correct terminology, referring to cyst stem cells (CySCs) in the manuscript.
 
 d. Line 90 - perfectly balanced is an overstatement and should be toned down.
 
 Thank you for the suggestion. We have revised it to “balanced” instead of "perfectly balanced."
 
 e. Line 98 - division of labour is not supported by the data and should be rephrased.
 
 Thank you for the feedback. We have rephrased it (line no. 98-101) to avoid the term "division of labor".
 
 f. Line 200 - the authors provide no data on BMPs - the GSC self-renewal cue - so they should avoid discussing an absence of self-renewal cues.
 
 We appreciate the reviewer’s point. We have revised it to avoid discussing the absence of self-renewal cues, given that we do not present data on BMP signaling. This ensures that our conclusions remain within the scope of the provided data.
 
 (5) Issues with the figures
 
 a The images are too small to appreciate the location of mitochondria in GSCs and CySCs.
 
 b. Figure 1
 
 c. cell membranes are not marked, reducing the precision of assigning mitochondria to GSC or CySCs. It would be very helpful if the authors depleted ATP5A from GSCs and showed that the puncta are reduced in these cells, and did a similar set of experiments for the Tj-Gal4 lineage. It would also be very helpful if the authors expressed membrane markers (like myrGFP) in the GSC and then in the CySC lineage and then stained with ATP5A. This would pinpoint in which cells ATP5A immunoreactivity is occurring.
 
 d. The presumed changes in gst-GFP (redox readout) are possibly seen in differentiating germ cells (i.e.,spermatogonia) but not in GSC. iii. Panels F, Q, and S are not explained and currently are irrelevant.
 
 e. Figure 3K - The evidence to support less Ecad in GSCs in tj>SOD-i testes is not compelling as the figure is too small and the insets show changes in Ecad in somatic cells, not GSC. d. Figure 4:
 
 f. Panel A, B The apparent decline (not quantified) may not contribute to the phenotype.
 
 ii.dpERK is a finicky antibody and the authors are showing a single example of each genotype. This is an important experiment because the authors are going to use it to conclude that MAPK is decreased in the tj>SOD-i samples. However, the authors don't have any positive (dominantactive EGFR) or negative (tj>mapk-i). As is standing, the data is not compelling. The graph in F does not convey any useful information.
 
 g. Figure S1D - cannot discern green on black. It is critical for the authors to show monochromes (grayscale) for thereabouts that they want to emphasize. I cannot see the green on black in Figure S1D.
 
 h. Figure S4 - there is no quantification of the number of Tj cells in K-N.
 
 We appreciate your detailed feedback regarding the figures in our manuscript. Below, we address each concern and outline the revisions we have made.
 
 (a) Image Size and Mitochondrial Localization in GSCs and CySCs
 
 We acknowledge the need for larger images to better visualize mitochondrial localization. We have now increased the resolution and size of the images in Figure 1. Additionally, we have included high-magnification insets to enhance clarity (Figure 1 B#)
 
 (b) Figure 1 B,B#,C
 
 (i) We have now marked cell membranes using Dlg to improve the precision of mitochondrial assignment to GSCs and CySCs and then stained for ATP5A, which clearly demarcates ATP5A immunoreactivity in specific cell types.
 
 (ii) We have revisited the gstD-GFP (redox readout) data and now provide revised images (Figure S1C-D’’’) and quantification (Figure 1 R,S) to better illustrate changes in the redox state. It is indeed intense in differentiating germ cells as expected but also present in the stem cell zone.
 
 (iii) Panels F, Q, and S have now been removed in the revised figure legend.
 
 (C) Figure 3K: We have digitally magnified the figure size and improved contrast to better visualize E-cadherin levels. The insets have been revised to ensure they focus specifically on GSCs rather than somatic cells. Earlier, we quantified the E-cadherin intensity changes in the GSC-hub interface and provided statistical analysis to support our findings (Figure 3M).
 
 (d) Figure 4: (i) Panels A and B have now been quantified, and we provide statistical comparisons to support our observations. (ii) We acknowledge the variability of dpERK staining. To strengthen our conclusions, we have provided negative (Tj>MAPK-i) controls (Figure S4 B). Additionally, we have removed panel F (MAPK area cover) to avoid confusion.
 
 (e) We appreciate the suggestion regarding grayscale images and have provided the monochrome images for mitochondria and gstD-GFP image representation. We have now removed Figure S1D as it was no longer required.
 
 (f) Figure S4: The quantification of the number of Tj-positive cells was actually included in the main figure along with statistical analysis.
 
 (g) We sincerely appreciate the reviewer’s insightful s, which have significantly improved the quality and clarity of our manuscript. We hope that our revisions adequately address the concerns raised.
 
 (6) Issues with Methods
 
 a. Materials and Methods are not described in sufficient depth - please revise.
 
 b. Note that Tj-Gal4 has real-time expression in hub cells and this is not considered by the authors. The ideal genotype for targeting CySCs is Tj-Gal4, Gal80TS, hh-Gal80. Additionally, the authors do not mention whether they are depleting throughout development into adulthood or only in adults. If the latter, then they must have used a temperature shift, growing the flies at 18C and then upshifting to 25C or 29C during adult stages.
 
 c. The authors need to show data points in all of the graphs. Some graphs do this but others do not.
 
 d. The authors state that all data points are from three biological replicates. This is not sufficient for GSC and CySC counts. Most labs count GSCs and CySCs from at least 10 testes of the correct genotype.
 
 We appreciate the reviewer’s valuable feedback and have made the necessary revisions to improve the clarity and rigor of our study. Below, we address each concern in detail:
 
 Materials and Methods
 
 We have revised the Materials and Methods section to provide a more detailed description of the experimental procedures, including genotypes, sample preparation, and quantification methods.
 
 Tj-Gal4 Expression and Experimental Design
 
 We acknowledge the reviewer’s point regarding Tj-Gal4 expression in hub cells. While Tj-Gal4 is active in hub cells, our focus was on CySCs, and we have now included a discussion of this caveat in the revised manuscript (line no. 308-311)
 
 Thank you for your suggestion on the ideal genotype for targeting CySCs. While we attempted to procure hh-Gal80, we couldn’t manage to get it, so we opted for another well-established Gal4 driver, C-587 Gal4, to target CySCs. Our results indicate that although the phenotypic changes are consistent across both drivers, the effects are significantly stronger with Tj-Gal4, highlighting the role of CySCs in this process with partial contributions from the hub. These findings have been incorporated into the revised manuscript (lines 309–311).
 
 We now clarify whether gene depletion was conducted throughout development or restricted to adulthood. For adult-specific depletion using the UAS-Gal4 system, crosses were set up at 25°C, and after two days, progenies were shifted to 29°C and aged for 3–5 days at 29°C. This process is now explicitly detailed in the revised Methods section (line no. 345-348).
 
 Data Presentation in Graphs
 
 We have updated all graphs to ensure that individual data points are shown consistently across all figures.
 
 Sample Size for GSC and CySC Counts
 
 We acknowledge the reviewer’s concern regarding biological replicates. Our initial study was based on 10 biological replicates, each set consisting of at least 7-8 testes per genotype, in line with standard practice in the field. This change is reflected in the revised Results and Methods sections.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.12.06.570086v3
www.biorxiv.org www.biorxiv.org

Sex differences in bile acid homeostasis and excretion underlie the disparity in liver cancer incidence between males and females

3
1. Public_Reviews 25 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study provides valuable insights into the influence of sex on bile acid metabolism and the risk of hepatocellular carcinoma (HCC). The data to support that there are inter-relationships between sex, bile acids, and HCC in mice are convincing, although this is a largely descriptive study. Future studies are needed to understand the interaction of sex hormones, bile acids, and chronic liver diseases and cancer at a mechanistic level. Also, there is not enough evidence to determine the clinical significance of the findings given the differences in bile acid composition between mice and men.
  
  Summary
2. Public_Reviews 25 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Liver cancer shows a high incidence in males than females with incompletely understood causes. This study utilized a mouse model that lacks the bile acid feedback mechanisms (FXR/SHP DKO mice) to study how dysregulation of bile acid homeostasis and a high circulating bile acid may underlie the gender-dependent prevalence and prognosis of HCC. By transcriptomics analysis comparing male and female mice, unique sets of gene signatures were identified and correlated with HCC outcomes in human patients. The study showed that ovariectomy procedure increased HCC incidence in female FXR/SHP DKO mice that were otherwise resistant to age-dependent HCC development, and that removing bile acids by blocking intestine bile acid absorption reduced HCC progression in FXR/SHP DKO mice. Based on these findings, the authors suggest that gender-dependent bile acid metabolism may play a role in the male-dominant HCC incidence, and that reducing bile acid level and signaling may be beneficial in HCC treatment. This study include many strengths: 1. Chronic liver diseases often proceed the development of liver and bile duct cancer. Advanced chronic liver diseases are often associated with dysregulation of bile acid homeostasis and cholestasis. This study takes advantage of a unique FXR/SHP DKO model that develop high organ bile acid exposure and spontaneous age-dependent HCC development in males but not females to identify unique HCC-associated gene signatures. The study showed that the unique gene signature in female DKO mice that had lower HCC incidence also correlated with lower grade HCC and better survival in human HCC patients. 2. The study also suggests that differentially regulated bile acid signaling or gender-dependent response to altered bile acids may contribute to gender-dependent susceptibility to HCC development and/or progression. 3. The sex-dependent differences in bile acid-mediated pathology clearly exist but are still not fully understood at the mechanistic level. Female mice have been shown to be more sensitive to bile acid toxicity in a few cholestasis models, while this study showed a male dominance of bile acid promotion of HCC. This study used ovariectomy to demonstrate that female hormones are possible underlying factors. Future studies are needed to understand the interaction of sex hormones, bile acids, and chronic liver diseases and cancer.
  
  Review 1
3. Public_Reviews 25 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the original reviews.
  
  Reviewer #1 (Public review):
  
  Comments:
  
  (1) HCC shows heterogeneity, and it is unclear what tissues (tumor or normal) were used from the DKO mice and human HCC gene expression dataset to obtain the gene signature, and how the authors reconcile these gene signatures with HCC prognosis.
  
  Mice studies: Aged DKO mice develop aggressive tumors (major and minor nodules, See Figure 1), and the entire liver is burdened with multiple tumor nodules. It is technically challenging to demarcate the tumor boundaries as most of the surrounding tissues do not display normal tissue architecture. Therefore, livers from age- and sex-matched wild-type C57/BL6 mice were used as control tissue. All the mice were inbred in our facility. Spatial transcriptomics and longitudinal studies are ongoing to collect tumors at earlier time points wherein we can differentiate tumor and non-tumor tissue.
  
  Human Studies: We mined five separate clinical data sets. The human HCC gene expression comprised of samples from the (i) National Cancer Institute (NCI) cohort (GEO accession numbers, GSE1898 and GSE4024) and (ii) Korea, (iii) Samsung, (iv) Modena, and (v) Fudan cohorts as previously described (GEO accession numbers, GSE14520, GSE16757, GSE43619, GSE36376, and GSE54236). We have added a new supplemental table 4, giving details of these datasets. Depending on the cohort, they are primarily HCC samples- surgical resections of HCC, control samples, with some tumors and paired non-tumor tissues.
  
  (2) The authors identified a unique set of gene expression signatures that are linked to HCC patient outcomes, but analysis of these gene sets to understand the causes of cancer promotion is still lacking. The studies of urea cycle metabolism and estrogen signaling were preliminary and inconclusive. These mechanistic aspects may be followed up in revision or future studies.
  
  We agree. Experiments to elicit HCC causality and promotion are complex, given the heterogeneous nature of liver cancer. Moreover, the length of time (12 months) needed to spontaneously develop cancer in this DKO mouse model makes it challenging. As mentioned by the reviewer, mechanistic studies are ongoing, and longitudinal time course experiments are actively being pursued to delineate causality. Having said that, we mined the TCGA LIHC (The Cancer Genome Atlas Liver Hepatocellular Carcinoma) database to examine the expression of the individual urea cycle genes and found them suppressed in liver tumorigenesis (new Supplementary Figure 4). We also evaluated if estrogen receptor a (Era) targets altered in DKO females (DKO_Estrogen) correlate with overall survival in HCC (new Supplementary Figure 6). We note that Era expression per se is reduced in males and females upon liver tumorigenesis. Also, DKO_Estrogen signature positively corroborated with better overall survival (new Supplementary Figure 6). These findings further bolster the relevance of urea cycle metabolism and estrogen signaling during HCC.
  
  (3) While high levels of bile acids are convincingly shown to promote HCC progression, their role in HCC initiation is not established. The DKO model may be limited to conditions of extremely high levels of organ bile acid exposure. The DKO mice do not model the human population of HCC patients with various etiology and shared liver pathology (i.e. cirrhosis). Therefore, high circulating bile acids may not fully explain the male prevalence of HCC incidence.
  
  We agree with this comment that our studies do not show bile acids can initiate HCC and may act as one of the many factors that contribute to the high male prevalence of HCC. This is exactly the reason why throughout the manuscript we do not write about HCC initiation. To clarify further, in the revised discussion of the manuscript, we have added a sentence to highlight this aspect, “while this study demonstrates bile acids promote HCC progression it does not investigate or provide evidence if excess bile acids are sufficient for HCC initiation.”
  
  (4) The authors showed lower circulating bile acids and increased fecal bile acid excretion in female mice and hypothesized that this may be a mechanism underlying the lower bile acid exposure that contributed to lower HCC incidence in female DKO mice. Additional analysis of organ bile acids within the enterohepatic circulation may be performed because a more accurate interpretation of the circulating bile acids and fecal bile acids can be made in reference to organ bile acids and total bile acid pool changes in these mice.
  
  As shown in this manuscript- we provide BA compositional analyses from the liver, serum, urine, and feces (Figures 5 and 6, new Supplementary Figure 8, Supplementary Tables 4 and 5). Unfortunately, we did not collect the intestinal tissue or gallbladders for BA analysis in this study. Separate cohorts of mice are being aged for future BA analyses from different organs within the enterohepatic loop. We thank you for this suggestion. Nevertheless, we have previously measured and reported BA values to be elevated in the intestines and the gall bladder of young DKO mice (PMC3007143).
  
  Reviewer #2 (Public review)
  
  Weaknesses:
  
  (1) The translational value to human HCC is not so strong yet. Authors show that there is a correlation between the female-selective gene signature and low-grade tumors and better survival in HCC patients overall. However, these data do not show whether this signature is more highly correlated with female tumor burden and survival. In other words, whether the mechanisms of female protection may be similar between humans and mice. In that respect, it would also be good to elaborate on whether women have higher fecal BA excretion and lower serum BA concentration.
  
  The reviewer poses an interesting question to test if the DKO female-specific signatures are altered differently in male vs. female HCC samples. As we found the urea cycle and estrogen signaling to be protective and enriched in our mouse model, we tested their expression pattern using the TCGA-LIHC RNA-seq data. We found urea cycle genes and Era transcripts broadly reduced in tumor samples irrespective of the sex (new Supplementary Figure 4 and Supplementary Figure 6), indicating that these pathways are compromised upon tumorigenesis even in the female livers.
  
  While prior studies have shown (i) a smaller BA pool w synthesis in men than women (PMID: 22003820), we did not find a study that systematically investigated BA excretion between the sexes in HCC context. The reviewer is spot on in suggesting BA analysis from HCC and unaffected human fecal samples from both sexes. Designing and performing such studies in the future will provide concrete proof of whether BA excretion protects female livers from developing liver cancer. We thank you for these suggestions.
  
  (2) The authors should perform a thorough spelling and grammar check.
  
  We apologize for the typos, which have been fixed, and as suggested by the reviewer, we have performed a grammar check.
  
  (3) There are quite some errors and inaccuracies in the result section, figures, and legends. The authors should correct this.
  
  We apologize for the inadvertent errors in the manuscript, and we have clarified these inaccuracies in the revised version. Thank you.
  
  AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2020.06.25.172635v3
www.biorxiv.org www.biorxiv.org

Fast evolutionary turnover and overlapping variances of sex-biased gene expression patterns defy a simple binary sex-classification of somatic tissues

4
1. Public_Reviews 25 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents data on sex differences in gene expression across organs of four mice taxa. The authors have generated a unique and convincing dataset that fills a gap left by previous studies. They claim that sex-biased expression in the soma can overlap between genetic males and females, and that the relevant patterns both turn over quickly over short evolutionary times and do so faster in somatic than gonadal tissues. These conclusions could largely have been predicted by extrapolating from previous findings in the field, but nevertheless demonstrating them directly is a fundamental advance.
 
 [Editorial note: The work was originally assessed by colleagues who are active in the field of evolution of sex differences or in areas adjacent to this field (see initial assessment at https://doi.org/10.7554/eLife.99602.2). The appeals process involved consultation with experts working in other areas of evolutionary biology. The above assessment synthesises the opinions of both sets of reviewers.]
 
 Summary
2. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #4 (Public review):
 
 The paper by Xie et al. investigates the micro-evolutionary dynamics of sex-biased gene expression across somatic and gonadal tissues in four mouse taxa, with comparative analyses in humans. The study introduces a new metric, the Sex-Bias Index (SBI), to quantify individual-level variation in sex-biased gene expression, and explores the evolutionary turnover, variance, and adaptive evolution of these genes.
 
 These strengths of the paper are not in dispute:
 
 Novelty: The study is among the first to systematically analyze sex-biased gene expression at a micro-evolutionary scale in outbred animals, using closely related mouse taxa. This contrasts with most previous work, which focused on macro-evolutionary comparisons between distant species.
 
 Controlled Sampling: The use of age-matched, outbred individuals raised under standardized conditions minimizes environmental confounders, allowing for robust within- and between-taxon comparisons.
 
 Somatic vs. Gonadal Focus: Unlike many earlier studies that emphasized gonadal tissues, this work provides a detailed analysis of somatic organs, revealing rapid evolutionary turnover and mosaicism in sex-biased gene expression.
 
 Sex-Bias Index (SBI): The SBI offers a cumulative, individual-level measure of sex-biased gene expression, facilitating visualization of variance and overlap between sexes within tissues. While one can argue about whether a new metric is necessary (as the authors argue), the combination of fold-change cutoffs, non-parametric Wilcoxon tests, and FDR correction reduces false positives, addressing concerns raised in the field about inflated detection of sex-biased genes.
 
 Evolutionary implications: The study demonstrates that sex-biased gene expression in somatic tissues evolves more rapidly than in gonads, and that this turnover is often accompanied by signatures of adaptive protein evolution. The lack of correlation in SBI across tissues within individuals supports a mosaic model of sex-biased gene expression, challenging binary models of sexual differentiation.
 
 The weaknesses are already listed by previous rounds of review but I will add one more: in an attempt to be comprehensive, the writing is quite dry and the main conclusions sort of get hidden within the less important observations.
 
 Since the debate is mostly about what words to use to describe the importance and the strength of evidence, I thought it would be useful to directly compare this study to other studies that address the same topic:
 
 Naqvi et al. Science 2019 (David Page lab): Conservation, acquisition, and functional impact of sex-biased gene expression in mammals
 
 Oliva et al. Science 2020 (Stranger lab): The impact of sex on gene expression across human tissues
 
 Rodríguez-Montes et al. Science 2023 (Kaessman, Cardoso-Moreira labs)
 
 Let's start with the fact that all three peer studies have had a major impact. Second, although Naqvi et al. (2019) and Oliva et al. (2020) provided foundational cross-species and cross-tissue analyses of sex-biased gene expression, but did not address micro-evolutionary turnover or individual-level variance. Third, Rodríguez-Montes et al. (2023) focused on developmental and evolutionary patterns of sex-biased expression, but at a broader phylogenetic scale and without the individual-level or module-based analyses presented here. None of the peer studies addressed the possibility of mosaicism within individuals, none of them addressed the relations between expression bias and adaptive evolution. So the comparison is really a bit of an apples to oranges comparison: the peer studies are about patterns in deep phylogeny, whereas the present study is an amazing (to me) analysis of inter-individual mosaicism, which is at the heart of this kind of variation, which would totally be missed or worse misinterpreted in deep phylogenetic analyses. Having said that, in my subjective opinion, all three related papers are better written than the present one, but to me there is no question this belongs in the same pedestal as all of them.
 
 Review 1
3. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #5 (Public review):
 
 Xie et al. present a data set of impressive size to study changes in sex-biased gene expression. A clear strength that sets the study apart from previous work is the use of age-matched outbred individuals raised in the same environment, which minimizes non-genetic variance, and the comparison of closely related taxa. Also in contrast to many previous studies, while gonads, which have often been the focus of sex-biased gene expression studies, are not ignored, multiple gonadal tissues are being compared to an array of somatic tissues. The study design therefore can offer a particularly rich and nuanced view of how sex differences change across tissues and over short evolutionary times.
 
 I liked the idea of summarizing over the mean expression of gene sets, instead of just using numbers of DEGs for comparisons, even though the introduction of the term "Sex-Biased Index (SBI)" seems somewhat of an overkill. The summary analyses are definitely useful to visualize variability in sex-biased gene expression programs. The authors find that the expression patterns of sex-biased genes change faster than those of non-sex-biased genes - but only in somatic tissues. They also provide some evidence that this correlates with higher rates of potentially adaptive coding sequence changes in the taxa where expression is sex-biased, with the proviso that a stronger modeling framework would have made these inferences more robust.
 
 I was most surprised by the finding that the fast change in expression patterns is linked to different gene expression modules becoming sex-biased in the different taxa studied. This is in my eyes a remarkable observation that could not have been predicted from previous knowledge.
 
 The use of human GTEx and patient scRNA-seq data is a nice addition, although there are known confounding issues with these resources, given that these are not random samples and environmental conditions are uncontrolled. Nevertheless, as the human data echo the trends seen with the much more rigorous mouse data set, I do not have principal objections to this addition. Furthermore, the human data do allow the authors to conclude that only very few genes with sex-biased expression are shared in the soma of mice and humans.
 
 In summary, I believe that this contribution has the potential to fundamentally change how we see sex-biased gene expression differences in vertebrates, given that the author's conclusions are grounded in a data set of compelling quality and size.
 
 Review 2
4. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the previous reviews
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The manuscript by Xie and colleagues presents transcriptomic experiments that measure gene expression in eight different tissues taken from adult female and male mice from four species. These data are used to make inferences regarding the evolution of sex-biased gene expression across these taxa.
 
 Strengths:
 
 The experimental methods and data analysis appear appropriate. The authors promote their study as unprecedented in its size and technical precision.
 
 We do not understand the statement "the authors promote" as if there was a doubt about this. If there is a doubt, we welcome to see it specified.
 
 Weaknesses:
 
 The manuscript does not present a clear set of novel evolutionary conclusions. The major findings recapitulate many previous comparative transcriptomics studies - gene expression variation is prevalent between individuals, sexes, and species; and genes with sex-biased expression evolve more rapidly than genes with unbiased expression - but it is not clear how the study extends our understanding of gene expression or its evolution.
 
 There have been no "previous comparative transcriptomics studies" at a micro- evolutionary scale in animals, hence, we do not "replicate" these. And our contrast between somatic and gonadal patterns reveals insights that have not been recognized before, namely that gonadal sex-specific expression turnover is actually not faster that the corresponding non-sex-specific truover. We have now further clarified this distinction throughout the text and have also adapted the title of the paper accordingly.
 
 We agree with the overall statement that "gene expression variation is prevalent between individuals, sexes, and species" but the aspect of "sex-biased gene expression between individuals" has not been systematically analysed before in such a context.
 
 Concerning the statement that "genes with sex-biased expression evolve more rapidly than genes with unbiased expression", we note that this is mostly derived from gonadal data and that there is no study that has quantified this so far at a population level and between subspecies in comparison to somatic data.
 
 Our results show further that previous assumptions of a substantial set of genes with sex- biased expression conserved between mice and humans are due to underestimating the convergence issues when there is an extremly fast turnover of sex-biased gene expression. This has a major implication for using mice as a model for gender-speficic medicine questions in humans.
 
 Many gene expression differences between individual animals are selectively neutral, because these differences in mRNA concentration are buffered at the level of translation, or differences in protein abundance have no effect on cellular or organismal function. The hypothesis that sex-biased genes are enriched for selectively neutral expression differences is supported by the excess of inter-individual expression variance and inter-specific expression differences in sex-biased genes.
 
 This statement repeats a statement from the first round of reviews. We had added new data and extensive discussion on this topic. We do not understand why this has not been taken into account. In fact, a major strength of our paper is that it shows that most sex- biased gene expression differences are not neutral!
 
 There are two major issues here: to identify sex-biased gene expression in the first place, we (and all other papers in the field) use the neutral model as null-hypothesis. Genes that are not compatible with this null-hypothesis are considered sex-biased. In contrast to most previous papers, we have the possibility to take into account the variances between individuals to add an additional significance test. Hence, we can apply a much more rigorous two-step process: first a ratio-cutoff plus a Wilcoxon rank sum test with correction for multiple testing to identify significant deviations from the null-hypothesis. We have added some additional statements in the Results and Discussion sections to emphasize this.Second, by focusing on the genes that are not following a neutral model, the variance and divergences data support the action of selection, rather than neutral drift.
 
 A higher rate of adaptive coding evolution is inferred among sex-biased genes as a group, but it is not clear whether this signal is driven by many sex-biased genes experiencing a little positive selection, or a few sex-biased genes experiencing a lot of positive selection, so the relationship between expression and protein-coding evolution remains unclear.
 
 Again, there are two major issues here. First, the distribution of alpha-values shown in Figure 3B are rather homogeneous, i.e. there is not support for a scenario that the average is driven by only a few genes.
 
 Second, it seems that the referee wants to see an analysis where dn/ds ratios are broken down for every single gene. This has been done in previous papers, but it is now understood that this procedure is fraught with error because of the demographic contingencies inherent to natural populations that can yield wrong results for individual loci. We have added some statements to the text to clarify this further.
 
 It is likely that only a subset of the gene expression differences detected here will have phenotypic effects relevant for fitness or medicine, but without some idea of how many or which genes comprise this subset, it is difficult to interpret the results in this context.
 
 It is the basic underlying assumption for the whole research field that significantly sex- biased genes are phenotypically relevant for fitness, since they would otherwise not be sex- biased in the first place.
 
 Throughout the paper the concepts of sexual selection and sexually antagonistic selection are conflated; while both modes of selection can drive the evolution of sexually dimorphic gene expression, the conditions promoting and consequence of both kinds of selection are different, and the manuscript is not clear about the significance of the results for either mode of selection.
 
 We had explained in our previous response that our data collection was not designed to distinguish between these two processes. But given that the issue is being brought up again, we have now added some discussion on this issue.
 
 The manuscript's conclusion that "most of the genetic underpinnings of sex-differences show no long-term evolutionary stability" is not supported by the data, which measured gene expression phenotypes but did not investigate the underlying genetic variation causing these differences between individuals, sexes, or species.
 
 We agree that - under a strict definition - our use of the term "genetic underpinning" in this conclusion sentence can be criticized. The most correct term would be "transcriptional underpinnings", but of course, given that it is the current practice of the whole field to assume that "transcriptional" is part of the overall genetics, we do not consider our initial statement as incorrect. Still, we have changed the term accordingly.
 
 Furthermore, most of the gene expression differences are observed between sex-specific organs such as testes and ovaries, which are downstream of the sex-determination pathway that is conserved in these four mouse species, so these conclusions are limited to gene expression phenotypes in somatic organs shared by the sexes.
 
 Yes - correct. But the whole focus of the paper is on somatic expression, i.e. organs that share the same cell compositions. Of course, the comparison between gonadal organs is conflated by being composed of different cell types. We have extended the discussion of this point.
 
 The differences between sex-biased expression in mice and humans are attributed to differences in the two species effective population sizes; but the human samples have significantly more environmental variation than the mouse samples taken from age-matched animals reared in controlled conditions, which could also explain the observed pattern.
 
 These are indeed the two alternative explanations that we had discussed (last paragraph of the discussion section, now the penultimate paragraph).
 
 The smoothed density plots in Figure 5 are confusing and misleading. Examining the individual SBI values in Table S9 reveals that all of the female and male SBI values for each species and organ are non-overlapping, with the exception of the heart in domesticus and mammary gland in musculus, where one male and one female individual fall within the range of the other sex. The smoothed plots therefore exaggerate the overlap between the sexes;
 
 Smoothing across discrete values is an entirely standard procedure for continuous variables. It allows to visualize the inherent data trends that cannot easily be glanced from simple inspection of the actual values. This is a mathematical procedure, not an "exaggeration". We used the same smoothening procedure for all the comparisons, and it is clear that the distributions between females and males of the sex organs and a few somatic organs are well separated (non-overlapping), which serves as a control.
 
 in particular, the extreme variation shown in the SBI in the mammary glands in spretus females and spicilegus males is hard to understand given the normalized values in Table S3. The R code used to generate the smoothed plots is not included in the Github repository, so it is not possible to independently recreate those plots from the underlying data.
 
 We apologize that there was indeed an error in the Figure - the columns for SPR and SPI were accidentally interchanged. We have corrected this figure. Generally, the smoothened patterns we show are easily verified by looking up the respective primary values. We apologize that the code lines for the plots were accidentally omitted. We have used a standard function from ggplot2: geom_density, with "adjust=3, alpha=0.5" for all plots and included this description in the Methods. We have now added this to the R code in the GitHub repository.
 
 The correlations provided in Table S9 are confusing - most of the reported correlations are 1.0, which are not recovered when using the SBI values in Table S9, and which does not support the manuscript's assertion that sex-biased gene expression can vary between organs within an individual. Indeed, using the SBI values in Table S9, many correlations across organs are negative, which is expected given the description of the result in the text.
 
 There is a misunderstanding here. The tables do not report correlations, but only p-values for correlations, the raw ones and the ones after corrections for multiple testing. P = 1.0 means no significant correlation. We have adjusted the caption of this table to clarify this further.
 
 Reviewer #3 (Public review):
 
 This manuscript reports interesting data on sex differences in expression across several somatic and reproductive tissues among 4 mice species or subspecies. The focus is on sex- biased expression in the somatic tissues, where the authors report high rates of turnover such that the majority of sex-biased genes are only sex-biased in one or two taxa. The authors show sex-biased genes have higher expression variance than unbiased genes but also provide some evidence that sex-bias is likely to evolve from genes with higher expression variance. The authors find that sex-biased genes (both female- and male-biased) experience more adaptive evolution (i.e., higher alpha values) than unbiased genes. The authors develop a summary statistic (Sex-Bias Index, SBI) of each individual's degree of sex- bias for a given tissue. They show that the distribution of SBI values often overlap considerably for somatic (but not reproductive) tissues and that SBI values are not correlated across tissues, which they interpret as indicating an individual can be relatively "male-like" in one tissue and relatively "female-like" in another tissue.
 
 This is a good summary of the data, but we are puzzled that it does not include the completely new module analysis and the finding of extremely fast evolution of sex-biased somatic gene expression compared to the gonadal one.
 
 Though the data are interesting, there are some disappointing aspects to how the authors have chosen to present the work. For example, their criteria for sex-bias requires an expression ratio of one sex to the other of 1.25. A reasonably large fraction of the "sex- biased genes" have ratios just beyond this cut-off (Fig. S1). A gene which has a ratio of 1.27 in taxa 1 can be declared as "sex-biased" but which has a ratio of 1.23 in taxa 2 will not be declared as "sex-biased". It is impossible to know from how the data are presented in the main text the extent to which the supposed very high turnover represents substantial changes in dimorphic expression. A simple plot of the expression sex ratio of taxa 1 vs taxa 2 would be illuminating but the authors declined this suggestion.
 
 Choosing a cutoff is the standard practice when dealing with continuously distributed data. As we have pointed out, we looked at various cutoff options and decided to use the present one, based on the observed data distributions. Note that some studies have used even lower ones (e.g. 1.1). To visualize the data distribution, we had provided the overall distribution of ratios, because one would have to look at many more plots otherwise. But we have now also added individual plots as Figure 1, Figure supplement 2, as requested. They confirm what is also evident from the overall plots, namely that most ratio changes are larger than the incremental values suggested by the reviewer. Note that the original data are of course also available for inspection.
 
 I was particularly intrigued by the authors' inference of the proportion of adaptive substitutions ("alpha") in different gene sets. The show alpha is higher for sex-biased than unbiased genes and nicely shows that the genes that are unbiased in focal taxa but sex- biased in the sister taxa also have low alpha. It would be even stronger that sex-bias is associated with adaptive evolution to estimate alpha for only those genes that are sex- biased in the focal taxa but not in the sister taxa (the current version estimates alpha on all sex-biased genes within the focal taxa, both those that are sex-biased and those that are unbiased in the sister taxa).
 
 We have added the respective values in the results section, but since fewer genes are involved, they are less comparable to the other sets of genes. Still, the tendencies remain.
 
 The author's Sex Bias Index is measured in an individual sample as: SBI = median(TPM of female-biased genes) - median(TPM of male-biased genes). This index has some strange properties when one works through some toy examples (though any summary statistic will have limitations). The authors do little to jointly discuss the merits and limitations of this metric. It would have been interesting to examine their two key points (degree of overlapping distributions between sexes and correlation across tissues) using other individual measures of sex-bias.
 
 We had responded to this comment before (including the explanation that it has no strange properties when one applies the normalization that is now implemented) and we have added a whole section devoted to the discussion of the merits of the SBI. We do not know which other "individual measures of sex-bias" this should be compared to. Still, we have now added a paragraph in the discussion about using PCA as an alternative to show that this would result in similar conclusions, but is technically less suitable for this purpose.
 
 Figure 5 shows symmetric gaussian-looking distributions of SBI but it makes me wonder to what extent this is the magic of model fitting software as there are only 9 data points underlying each distribution. Whereas Figure 5 shows many broadly overlapping distributions for SBI, Figure 6 seems to suggest the sexes are quite well separated for SBI (e.g., brain in MUS, heart in DOM).
 
 We use a standard fitting function in R (see above), which tries to fit a normalized distribution, but this function can also add an additional peak when the data are too heterogeneous (e.g. Mammary in Figure 7).
 
 Fig. S1 should be shown as the log(F/M) ratio so it is easier to see the symmetry, or lack thereof, of female and male-biased genes.
 
 The log will work differently for values <1, compared to values >1 when used in a single plot. We have now generated combined plots with symmetric values to allow a better comparability.
 
 It is important to note that for the variance analysis that IQR/median was calculated for each gene within each sex for each tissue. This is a key piece of information that should be in the methods or legend of the main figure (not buried in Supplemental Table 17).
 
 We have now moved these descriptions into the Methods section.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 2

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.05.22.595301v3
www.biorxiv.org www.biorxiv.org

The insulin / IGF axis is critically important controlling gene transcription in the podocyte

5
1. Public_Reviews 25 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study investigated the role of insulin receptor (IR) and insulin-like growth factor 1 receptor (IGF1R) in the renal glomerular podocytes by characterizing the mice with dual deletion of both receptors in vivo as well as the cultured murine podocytes with induced deletion of both receptors in vitro. The solid data presented in this paper demonstrated the critical requirement of both IR and IGF1R signaling in normal podocyte physiology in mice, albeit a more detailed characterization of the mouse model is desired. Interestingly, long-range sequencing revealed significant retention of introns in mRNAs, due to an altered spliceosome level resulted from the loss of IR and IGF1 signaling in cultured podocytes. This new finding suggests an essential role of IR and IGF1R signaling in regulating RNA metabolism in podocyte, which provides useful information for the understanding of physiology and metabolism of podocytes. However, the underlying molecular mechanism for such a regulation is still unclear and awaits further studies.
 
 [Editors' note: this paper was reviewed by Review Commons.]
 
 Summary
2. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 In this manuscript, the roles of the insulin receptor and the insulin growth factor receptor were investigated in podocytes. Mice in which both receptors were deleted developed glomerular dysfunction and developed proteinuria and glomerulrosclerosis over several months. Because of concerns about incomplete KO, the authors generated podocyte cell lines where both receptors were deleted. Loss of both receptors was highly deleterious with greater than 50% cell death. To elucidate the mechanism, the authors performed global proteomics and find that spliceosome proteins are down-regulated. They confirm this by using long-range sequencing. These results suggest a novel role for these pathways in podocytes.
 
 This is primarily a descriptive study. The mechanism of how insulin and IGF1 signaling are linked to the spliceosome is not addressed and the phenotype of the mice is only superficially explored. The main issues are that the completeness of the mouse KO is never assessed nor is the completeness of the KO in cell lines. The absence of this data is a significant weakness. The mouse experiments would be improved if the serum creatinines were measured to provide some idea about the severity of the kidney injury. An attempt to rescue the phenotype by overexpression of SF3B4 would also be useful. If this didn't rescue the phenotype, an explanation in the text would suffice. As insulin and IGF are regulators of metabolism, some assessment of metabolic parameters would be an optional add-on. Lastly, in the cell line experiments, the authors should discuss the caveats associated with studying the 50% of the cells that survive vs the ones that died.
 
 Significance:
 
 With the GLP1 agonists providing renal protection, there is great interest in understanding the role of insulin and other incretins in kidney cell biology. It is already known that Insulin and IGFR signaling play important roles in other cells of the kidney, therefore, there is great interest in understanding these pathways in podocytes. The major advance is that these two pathways appear to have a role in RNA metabolism, the major limitations are the lack of information regarding the completeness of the KO's. If, for example, they can determine that in the mice, the KO is complete, that the GFR is relatively normal, then the phenotype they describe is relatively mild.
 
 Comments on revision plan:
 
 I agree with the suggested experiments especially, the experiments to examine whether insulin/IGF1 signaling have effects on splicing proteins. An alternative experiment would be to ask whether rescue of IR or IGF1R would ameliorate the splicing effects.
 
 Review 1
3. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In this manuscript, submitted to Review Commons (journal agnostic), Coward and colleagues report on the role of insulin/IGF axis in podocyte gene transcription. They knocked out both the insulin and IGFR1 mice. Dual KO mice manifested a severe phenotype, with albuminuria, glomerulosclerosis, renal failure and death at 4-24 weeks.
 
 Long read RNA sequencing was used to assess splicing events. Podocyte transcripts manifesting intron retention were identified. Dual knock-out podocytes manifested more transcripts with intron retention (18%) compared wild-type controls (18%), with an overlap between experiments of ~30%.
 
 Transcript productivity was also assessed using FLAIR-mark-intron-retention software. Intron retention w seen in 18% of ciDKO podocyte transcripts compared to 14% of wild-type podocyte transcripts (P=0.004), with an overlap between experiments of ~30% (indicating the variability of results with this method). Interestingly, ciDKO podocytes showed downregulation of proteins involved in spliceosome function and RNA processing, as suggested by LC/MS and confirmed by Western blot.
 
 Pladienolide (a spliceosome inhibitor) was cytotoxic to HeLa cells and to mouse podocytes but no toxicity was seen in murine glomerular endothelial cells.
 
 The manuscript is generally clear and well-written. Mouse work was approved in advance. The four figures are generally well-designed, with bars/superimposed dot-plots.
 
 Methods are generally well described. It would be helpful to say that tissue scoring was performed by an investigator masked to sample identity.
 
 Specific comments:
 
 (1) Data are presented as mean/SEM. In general, mean/SD or median/IQR are preferred to allow the reader to evaluate the spread of the data. There may be exceptions where only SEM is reasonable.
 
 (2) It would be useful to for the reader to be told the number of over-lapping genes (with similar expression between mouse groups) and the results of a statistical test comparing WT and KO mice. The overlap of intron retention events between experimental repeats was about 30% in both knock-out podocytes. This seems low and I am curious to know whether this is typical for typical for this method; a reference could be helpful.
 
 (3) Please explain "adjusted p value of 0.01." It is not clear how was it adjusted. The number of differentially-expressed proteins between the two cell types was 4842.
 
 Comments on revision plan:
 
 The authors suggest additional experiments that should address my concerns and probably the other reviewers' concerns.
 
 I encourage the authors to proceed with their proposed experiments and revisions.
 
 Review 2
4. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 These investigators have previously shown important roles for either insulin receptor (IR) or insulin-like growth factor receptor (IGF1R) in glomerular podocyte function. They now have studied mice with deletion of both receptors and find significant podocyte dysfunction. They then made a podocyte cell line with inducible deletion of both receptors and find abnormalities in transcriptional efficiency with decreased expression of spliceosome proteins and increased transcripts with impaired splicing or premature termination.
 
 The studies appear to be performed well and the manuscript is clearly written.
 
 There are a number of potential issues and questions with these studies.
 
 (1) For the in vivo studies, the only information given is for mice at 24 weeks of age. There needs to be a full time course of when the albuminuria was first seen and the rate of development. Also, GFR was not measured. Since the podocin-Cre utilized was not inducible, there should be a determination of whether there was a developmental defect in glomeruli or podocytes. Were there any differences in wither prenatal post natal development or number of glomeruli?
 
 (2) Although the in vitro studies are of interest, there are no studies to determine if this is the underlying mechanism for the in vivo abnormalities seen in the mice. Cultured podocytes may not necessarily reflect what is occurring in podocytes in vivo.
 
 (3) Given that both receptors are deleted in the podocyte cell line, it is not clear if the spliceosome defect requires deletion of both receptors or if there is redundancy in the effect. The studies need to be repeated in podocyte cell lines with either IR or IGFR single deletions.
 
 (4) There are no studies investigating signaling mechanisms mediating the spliceosome abnormalities.
 
 Comments on revision plan:
 
 I do not have any changes from my prior review. I applaud the authors for developing a plan to address the questions and concerns raised in my prior review.
 
 Review 3
5. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Author response:
 
 Evidence reducibility and clarity
 
 Reviewer 1:
 
 In this manuscript, the role of the insulin receptor and the insulin growth factor receptor was investigated in podocytes. Mice, were both receptors were deleted, developed glomerular dysfunction and developed proteinuria and glomerulosclerosis over several months. Because of concerns about incomplete KO, the authors generated podocyte cell lines where both receptors were deleted. Loss of both receptors was highly deleterious with greater than 50% cell death. To elucidate the mechanism, the authors performed global proteomics and find that spliceosome proteins are downregulated. They confirm this by using long-range sequencing. These results suggest a novel role for these pathways in podocytes.
 
 Thank you
 
 This is primarily a descriptive study and no technical concerns are raised. The mechanism of how insulin and IGF1 signaling are linked to the spiceosome is not addresed.
 
 We do not think the paper is descriptive as we used non-biased phospho and total proteomics in the DKO cells to uncover the alterations in the spliceosome (that have not been previously described) that were detrimental. However, we are happy to look further into the underlying mechanism.
 
 We would propose:
 
 (1) Stimulating/inhibiting insulin/IGF signalling pathways in the Wild-type and DKO knockout cells and check expression levels and/or phosphorylation status of splice factors (including those in Figure 3E) and those revealed by phospho-proteomic data; a variety of inhibitors of insulin/IGF1 pathways could also be used along the pathways that are shown in Fig 2.
 
 (2) Looking at the RNaseq data bioinformatically in more detail – the introns/exons that move up or down are targets of the splice factors involved; most splice factors binding sequences are known, so it should be possible to ask bioinformatically – from the sequences around the splice sites of the exons and introns that move in the DKO, which splice factors binding sites are seen most frequently? To uncover splice factors/RNA-binding proteins (RBPs) that are involved in the insulin signaling we will use a software named MATT which was specifically designed to look for RNA-binding motifs (PMID 30010778). In brief, using the long-sequencing data, we will test 250 nt sequences flanking the splice sites of all regulated splicing events (intronic and exonic) against all RNA- binding proteins in the CISBP-RNA database (PMID 23846655) using MATT. This will result in a list of RBPs potentially involved in the insulin signaling. We will validate these by activating insulin signaling (similar to Figures 2 B,C) and probe whether the RBPs are activated (e.g. phosphorylated or change in expression) or we will manipulate expression of the candidate RBPs and measure how they affect the insulin signaling.
 
 (3) Examining the phospho and total proteomic data for IGF1R and Insulin receptor knockout alone podocytes (which we have already generated) and analysing these in more detail and include this data set to elucidate the relative importance of both receptors to spliceosome function.
 
 The phenotype of the mouse is only superficially addressed. The main issues are that the completeness of the mouse KO is never assessed nor is the completeness of the KO in cell lines. The absence of this data is a significant weakness.
 
 We apologise for not making clear but we did assess the level of receptor knockdown in the animal and cell models. The in vivo model showed variable and non-complete levels of insulin receptor and IGF1 receptor podocyte knock down (shown in supplementary figure 1B). This is why we made the in vitro floxed podocyte cell lines in which we could robustly knockdown both the insulin receptor and IGF1 receptor (shown in Figure 2A)
 
 The mouse experiments would be improved if the serum creatinines were measured to provide some idea how severe the kidney injury is.
 
 We can address this:
 
 We have further urinary Albumin:creatinine ratio (uACR) data at 12, 16 and 20 weeks. We also have more blood tests of renal function that can be added. There is variability in creatinine levels which is not uncommon in transgenic mouse models (probably partly due to variability in receptor knock down with cre-lox system). This is part of rationale of developing the robust double receptor knockout cell models where we knocked out both receptors by >80%.
 
 An attempt to rescue the phenotype by overexpression of SF3B4 would also be useful. If this didn't work, an explanation in the text would suffice.
 
 We would consider over express SF3BF4 in the Wild type and DKO cells and assess the effects on spliceosome if deemed necessary. However, we think it is unlikely to rescue the phenotype as so many other spliceosome components are downregulated in the DKO cells.
 
 As insulin and IGF are regulators of metabolism, some assessment of metabolic parameters would be an optional add-on.
 
 We have some detail on this and can add to the manuscript. However it is not extensive as not a major driver of this work.
 
 Lastly, the authors should caveat the cell experiments by discussing the ramifications of studying the 50% of the cells that survive vs the ones that died.
 
 Thank you, we appreciate this and this was the rationale behind cells being studied after 2 days differentiation before significant cell loss in order to avoid the issue of studying the 50% of cells that survive.
 
 Reviewer 2:
 
 In this manuscript, submitted to Review Commons (journal agnostic), Coward and colleagues report on the role of insulin/IGF axis in podocyte gene transcription. They knocked out both the insulin and IGFR1 mice. Dual KO mice manifested a severe phenotype, with albuminuria, glomerulosclerosis, renal failure and death at 4-24 weeks.
 
 Long read RNA sequencing was used to assess splicing events. Podocyte transcripts manifesting intron retention were identified. Dual knock-out podocytes manifested more transcripts with intron retention (18%) compared wild-type controls (18%), with an overlap between experiments of ~30%.
 
 Transcript productivity was also assessed using FLAIR-mark-intron-retention software. Intron retention w seen in 18% of ciDKO podocyte transcripts compared to 14% of wild-type podocyte transcripts (P=0.004), with an overlap between experiments of ~30% (indicating the variability of results with this method). Interestingly, ciDKO podocytes showed downregulation of proteins involved in spliceosome function and RNA processing, as suggested by LC/MS and confirmed by Western blot.
 
 Pladienolide (a spliceosome inhibitor) was cytotoxic to HeLa cells and to mouse podocytes but no toxicity was seen in murine glomerular endothelial cells. Specific comments.
 
 The manuscript is generally clear and well-written. Mouse work was approved in advance. The six figures are generally well-designed, bars/superimposed dot-plots.
 
 Thank you
 
 Evaluation.
 
 Methods are generally well described. It would be helpful to say that tissue scoring was performed by an investigator masked to sample identity.
 
 We did this and will add this information to the methods/figure legend.
 
 Specific comments.
 
 (1) Data are presented as mean/SEM. In general, mean/SD or median/IQR are preferred to allow the reader to evaluate the spread of the data. There may be exceptions where only SEM is reasonable.
 
 Graphs can be changed to SD rather than SEM.
 
 (2) It would be useful to for the reader to be told the number of over-lapping genes (with similar expression between mouse groups) and the results of a statistical test comparing WT and KO mice. The overlap of intron retention events between experimental repeats was about 30% in both knock-out podocytes. This seems low and I am curious to know whether this is typical for typical for this method; a reference could be helpful.
 
 This is an excellent question. We had 30% overlap as the parameters used for analysis were very stringent. We suspect we could get more than 30% by being less stringent, which still be considered as similar events if requested. Our methods were based on FLAIR analysis (PMID: 32188845)
 
 (3) Please explain "adjusted p value of 0.01." It is not clear how was it adjusted. The number of differentially-expressed proteins between the two cell types was 4842.
 
 We used the Benjamini-Hochberg method to adjust our data. We think the reviewer is referring to the transcriptomic data and not the proteomic data.
 
 Minor comments
 
 Page numbers in the text would help the reviewer communicate more effectively with the author.
 
 We will do this
 
 Reviewer 3:
 
 These investigators have previously shown important roles for either insulin receptor (IR) or insulin-like growth factor receptor (IGF1R) in glomerular podocyte function. They now have studied mice with deletion of both receptors and find significant podocyte dysfunction. They then made a podocyte cell line with inducible deletion of both receptors and find abnormalities in transcriptional efficiency with decreased expression of spliceosome proteins and increased transcripts with impaired splicing or premature termination.
 
 The studies appear to be performed well and the manuscript is clearly written.
 
 Thank you
 
 Referees cross-commenting
 
 I am in agreement with Reviewer 1 that the studies are overly descriptive and do not provide sufficient mechanism and the lack of more investigation of the in vivo model is a significant weakness.
 
 Please see our responses to reviewer 1 above.
 
 Significance
 
 Reviewer 1:
 
 With the GLP1 agonists providing renal protection, there is great interest in understanding the role of insulin and other incretins in kidney cell biology. It is already known that Insulin and IGFR signaling play important roles in other cells of the kidney. So, there is great interest in understanding these pathways in podocytes. The major advance is that these two pathways appear to have a role in RNA metabolism, the major limitations are the lack of information regarding the completeness of the KO's. If, for example, they can determine that in the mice, the KO is complete, that the GFR is relatively normal, then the phenotype they describe is relatively mild.
 
 Thank you. The receptor KO in the mice is unlikely to be complete (Please see comments above and Supplementary Figure 1b). There are many examples of KO models targeting other tissues showing that complete KO of these receptors seems difficult to achieve , particularly in reference to the IGF1 receptor. In the brain (which is also terminally differentiated cells PMID:28595357 (barely 50% iof IGF1R knockdown was achieved in the target cells). Ovarian granulosa cells PMID:28407051 -several tissue specific drivers tried but couldn't achieve any better than 80%. The paper states that 10% of IGF1R is sufficient for function in these cells so they conclude that their knockdown animals are probably still responding to IGF1. Finally, in our recent IGF1R podocyte knockdown model we found Cre levels were important for excision of a single floxed gene (PMID: 38706850) hence we were not surprised that trying to excise two floxed genes (insulin receptor and IGF1 receptor) was challenging. This is the rationale for making the double receptor knockout cell lines to understand process / biology in more detail.
 
 Reviewer 2:
 
 The manuscript is generally clear and well-written. Mouse work was approved in advance. The figures are generally well-designed, bars/superimposed dot-plots.
 
 Evaluation.
 
 Methods are generally well described. It would be helpful to say that tissue scoring was performed by an investigator masked to sample identity.
 
 Thank you we will do this.
 
 Reviewer 3:
 
 There are a number of potential issues and questions with these studies.
 
 (1) For the in vivo studies, the only information given is for mice at 24 weeks of age. There needs to be a full time course of when the albuminuria was first seen and the rate of development. Also, GFR was not measured. Since the podocin-Cre utilized was not inducible, there should be a determination of whether there was a developmental defect in glomeruli or podocytes. Were there any differences in wither prenatal post natal development or number of glomeruli?
 
 Thank you we will add in further phenotyping data. We do not think there was a major developmental phenotype as albuminuria did not become significantly different until several months of age. We could have used a doxycycline inducible model but we know the excision efficiency is much less than the podocin-cre driven model SUPP FIGURE 1. This would likely give a very mild (if any) phenotype and not reveal the biology adequately.
 
 (2) Although the in vitro studies are of interest, there are no studies to determine if this is the underlying mechanism for the in vivo abnormalities seen in the mice. Cultured podocytes may not necessarily reflect what is occurring in podocytes in vivo.
 
 Thank you for this we are happy to employ Immunohistochemistry (IHC) and immunofluorescence (IF) using spliceosome antibodies on tissue sections from DKO and control mice to examine spliceosome changes. However, as the DKO results in podocyte loss, there may not be that many DKO podocytes still present in the tissue sections. This will be taken into consideration.
 
 (3) Given that both receptors are deleted in the podocyte cell line, it is not clear if the spliceosome defect requires deletion of both receptors or if there is redundancy in the effect. The studies need to be repeated in podocyte cell lines with either IR or IGFR single deletions.
 
 Thank you. We have full total and phospho-proteomic data sets from single insulin receptor and IGF1 receptor knockout cell lines that we will investigate for this point.
 
 (4) There are not studies investigating signaling mechanisms mediating the spliceosome abnormalities.
 
 Thank you as outlined as above to reviewer 1 point 1 we are very happy to investigate insulin / IGF signalling pathways in more detail.
 
 AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.05.20.594973v1
www.biorxiv.org www.biorxiv.org

A biochemical mechanism for Stu2/XMAP215-family microtubule polymerases

4
1. Public_Reviews 25 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 In their important manuscript, Gangadharan, Kober and Rice focus on how Stu2/XMAP215-family microtubule polymerases use their TOG domains to catalytically promote microtubule growth, testing whether their mechanism follows an enzyme-like kinetic model similar to that of actin polymerases. The authors integrate measurements including microtubule polymerization rates and TOG-tubulin binding kinetics to convincingly show that Stu2 follows an enzyme-like model where tight tubulin binding enables efficient polymerization, revealing a shared mechanism with actin polymerases despite their evolutionary divergence. This work will be of general interest to the cell biology and biophysics communities.
 
 Summary
2. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 This study by Gangadharan and colleagues provides significant progress towards a quantitative biochemical mechanism for Stu2 polymerase activity. A key conceptual advance is the novel application of an enzyme-like model, initially developed for the actin polymerase Ena/VASP, to Stu2.
 
 New refined affinity measurements for a Stu2 TOG domain using Bio-layer interferometry show more than an order of magnitude higher affinity of TOG domains to tubulin compared to previously published reports.
 
 The findings reinforce the "concentrating reactants" or, more specifically, for TOG-domain proteins, the "tubulin-shuttling antenna" model, compared to the "polarized unfurling" model, a more speculative structural hypothesis.
 
 The manuscript builds upon a series of previous manuscripts that showcase the profound intellectual engagement with microtubule polymerization mechanisms by TOG-domain proteins from the Rice lab, a thought leader in microtubule polymerization for over a decade.
 
 Minor remarks:
 
 (1) A major new experimental finding of this paper is the affinity of TOG domains, which is more than an order of magnitude lower (10 nM) than previous measurements from the same lab (~200 nM). The authors attribute this change to ionic strength differences between buffer conditions, citing the lab's previous work (Ayaz et al., 2014). This argument left me contemplating what the buffer conditions are in both experiments, and I wonder if other readers would feel the same. After going down the rabbit hole, I believe the difference in ionic strength is ~2.3 fold, and at least on the back of my envelope, this works out beautifully with the measured differences in affinities. A short version of this argument may strengthen the manuscript.
 
 (2) I am wondering if there may be an alternative explanation to tubulin binding by TOG being the kinetically rate-limiting step for polymerase function:
 
 TOG + Tubulin ⇌ TOG:Tubulin (fast binding rate, high-affinity binding) TOG:Tubulin + MT_end → TOG:MT (tubulin is incorporated into MT, fast transfer rate) The binding rate is 3/s, and the transfer rate is 5/s.
 
 I was wondering if the following step should be considered, which involves a conformational change of tubulin (e.g., straightening) TOG:MT → TOG + MT (rate-limiting straightening and unbinding of TOG from the lattice).
 
 Presumably, the affinity of TOGs for straight tubulin is practically zero for the purpose of this discussion, as there is no lattice binding, which means unbinding is likely very rapid; however, straightening may be the rate-limiting factor here.
 
 In theory, straightening should also be rapid; however, we lack measurements of how fast or slow this step occurs within the context of a TOG domain, which presumably skews the process towards curved tubulin.
 
 A hypothetical Stu2, when bound to the microtubule end and with the TOG domain not disengaged from tubulin, would not permit the processivity of that molecule or the binding of a new molecule. To emphasize the importance of unbinding, when it is not efficient, as reported for the T238 mutant that results in Stu2 lattice binding (Geyer et al., 2018), the polymerase becomes inefficient.
 
 Review 1
3. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The manuscript from the Rice lab by Gangadharan et al. investigates the polymerization mechanism of the yeast microtubule polymerase Stu2. The lab has published a number of articles demonstrating the structural basis by which the two TOG domains of Stu2 each bind free tubulin heterodimers, and has developed a tethered polymerization model by which the TOG domains drive polymerization by shuttling those tubulin subunits onto the microtubule plus end. A second model was proposed by Nithianantham et al. (eLife, 2018) based on a closed-to-open transitional state in which Stu2 unfurls and loads two longitudinally associated tubulin heterodimers onto the microtubule plus end. While the second model is not directly tested, the current work aims to further characterize/model the tethered polymerization model using a kinetic framework developed by Breitsprecher et al. for Ena/VASP actin polymerization activity, using a model that is enzymatic (EMBO J., 2011). The general architecture and function of Ena/VASP on actin polymerization versus Stu2 on microtubule polymerization is a reasonable relation and hits upon, as the authors note, potential convergent mechanistic evolution across distinct cytoskeletal networks. The model effectively treats tubulin as the substrate, and the polymerized microtubule plus end as the product. If Stu2 is "enzymatic" in this framework, the model predicts it would behave with Michaelis-Menten kinetics, that there would a Vmax, and polymerase activity would either be "affinity limited" by TOG:tubulin affinity (KD) and/or "kinetically limited" by TOG:tubulin association (Kon) and transfer of tubulin to the microtubule plus end (Kt). The authors find that the Brietsprecher model works well for Stu2 activity, and that Stu2 best aligns with a "kinetically limited" model. The work is interesting and adds to the growing elucidation of the Stu2 microtubule polymerase model. While yeast microtubule polymerases are somewhat distinct in their architecture, there is significant overlap that findings from the manuscript can be utilized to inform the mechanisms of larger, more complex microtubule polymerases such as human ch-TOG.
 
 Strengths:
 
 The manuscript invokes the enzymatic model of Breitsprecher et al. used for Ena/VASP and conducts an elegant series of (mostly established) experiments to determine whether Stu2 microtubule polymerase activity aligns with the model, which they conclude does align, supported by the data/results obtained.
 
 Weaknesses:
 
 The authors used biolayer interferometry to measure TOG:tubulin affinity. The affinities obtained were significantly higher than the lab obtained in an earlier publication using analytical ultracentrifugation. While differences in buffer and salt conditions may underlie these differences, additional runs using comparable buffer systems, or the use of a third independent assay to measure affinities, would have added rigor.
 
 The discussion could be expanded to better compare and contrast the results with both existing polymerase models introduced in the introduction, as well as expanded to look at reversible enzymatic activity (microtubule depolymerization at low to zero tubulin concentrations) and microtubule plus versus minus end activity.
 
 Review 2
4. Public_Reviews 25 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 This study by Gangadharan and colleagues seeks to establish a quantitative biochemical model for the microtubule polymerase activity of Stu2. Stu2 is the budding yeast member of the XMAP215 protein family, which is broadly conserved across eukaryotes. XMAP215 proteins play a wide variety of important roles in cells, and these are attributed to effects on microtubule dynamics. Many studies over the last ~20 years have shown that XMA215 proteins selectively associate with microtubule ends, where they increase rates of microtubule assembly and disassembly. More recently, structural biology and biochemical studies by the authors and other groups have shown that the multiple TOG domains on XMAP215 proteins are tubulin-binding domains that selectively bind to curved tubulin, which is present in solution and at microtubule ends, but not to straight tubulin which is present in the walls of the microtubule lattice. This has led to the general model that XMAP215 proteins promote polymerization by delivering soluble tubulin to the growing plus end, and two distinct models have been proposed to explain the mechanism. The 'concentrating reactants' model proposed previously by the authors suggests that TOG domains grab hold of tubulin in solution and concentrate at the microtubule end. The 'polarized unfurling' model proposed by the Al Bassam lab suggests that XMAP215 delivers multiple tubulins to the end, using a step-wise mechanism involving different roles for each TOG domain. The current study seeks to improve our understanding of the mechanism by developing a quantitative model to explain the binding and release of tubulins, the number of Stu2 molecules at the end, and the overall rate of tubulin addition. The authors accomplish this goal using new experimental data. The final model fills in new details of the mechanism. The authors draw a comparison between Stu2 and the actin polymerase, which bears similarity to Ena/VASP, and suggest a convergent strategy for cytoskeletal polymerases.
 
 Strengths:
 
 This is a focused and clearly written study that incorporates prior knowledge of XMAP215 and draws inspiration from the actin field. The data are clear and convincing, and the study accomplishes its goal of generating a new, quantitative model for Stu2. The model will be important for microtubule researchers to predict and test key points for altering XMAP215 activity across different organisms and potentially for different tubulin substrates. The comparison to Ena/VASP may also inspire similar comparisons across other microtubule and actin regulators, which could lead to new insights across the cytoskeletal fields.
 
 Weaknesses:
 
 The study is without major weaknesses, but there are several minor weaknesses worth noting. One is that the final model provides new details regarding the Stu2 mechanism, but does not provide a major new advance in our understanding of how the polymerase works. For example, the discussion does not clearly argue for whether the new results and model rule out either of the prior models. This appears consistent with the 'concentrating reactants' model, but does it clearly rule out the 'polarized unfurling' model? A second minor weakness is that the comparison to Ena/VASP is not developed at a deep level based on the final model. I found these ideas exciting and want more critical consideration here, but perhaps it is better suited for a commentary piece to follow.
 
 Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.06.09.658552v1
www.biorxiv.org www.biorxiv.org

Conduction pathway for potassium through the E. coli pump KdpFABC

4
1. Public_Reviews 25 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This valuable study provides new insights into the movement of ions through the bacterial pump KdpFABC, which regulates intracellular potassium concentration, by solving a 2.1 Å cryo-EM structure of the nanodisc-embedded active wild-type protein, and carrying out mutagenesis and activity assays. Although the structural data and analysis are solid, additional information about other structural classes identified in the EM data, as well as a discussion of relevant work done by others, would further strengthen these findings. The description of the activity assays is currently incomplete because more information is required to rigorously assess these experiments. This work will be of interest to the membrane transporter and channel communities and to microbiologists interested in osmoregulation and potassium homeostasis.
  
  Summary
2. Public_Reviews 25 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This study on potassium ion transport by the protein complex KdpFABC from E. coli reveals a 2.1 Å cryo-EM structure of the nanodisc-embedded transporter under turnover conditions. The results confirm that K+ ions pass through a previously identified tunnel that connects the channel-like subunit with the P-type ATPase-type subunit.
  
  Strengths:
  
  The excellent resolution of the structure and the thorough analysis of mutants using ATPase and ion transport measurements help to strengthen new and previous interpretations. The evidence supporting the conclusions is solid, including biochemical assays and analysis of mutants. The work will be of interest to the membrane transporter and channel communities and to microbiologists interested in osmoregulation and potassium homeostasis.
  
  Weaknesses:
  
  There is insufficient credit and citation of previous work.
  
  Review 1
3. Public_Reviews 25 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The paper describes the high-resolution structure of KdpFABC, a bacterial pump regulating intracellular potassium concentrations. The pump consists of a subunit with an overall structure similar to that of a canonical potassium channel and a subunit with a structure similar to a canonical ATP-driven ion pump. The ions enter through the channel subunit and then traverse the subunit interface via a long channel that lies parallel to the membrane to enter the pump, followed by their release into the cytoplasm.
  
  Strengths:
  
  The work builds on the previous structural and mechanistic studies from the authors' and other labs. While the overall architecture and mechanism have already been established, a detailed understanding was lacking. The study provides a 2.1 Å resolution structure of the E1-P state of the transport cycle, which precedes the transition to the E2 state, assumed to be the rate-limiting step. It clearly shows a single K+ ion in the selectivity filter of the channel and in the canonical ion binding site in the pump, resolving how ions bind to these key regions of the transporter. It also resolves the details of water molecules filling the tunnel that connects the subunits, suggesting that K+ ions move through the tunnel transiently without occupying well-defined binding sites. The authors further propose how the ions are released into the cytoplasm in the E2 state. The authors support the structural findings through mutagenesis and measurements of ATPase activity and ion transport by surface-supported membrane (SSM) electrophysiology.
  
  Weaknesses:
  
  While the results are overall compelling, several aspects of the work raised questions. First, the authors determined the structure of the pump in nanodiscs under turnover conditions and observed several structural classes, including E1-P, which is detailed in the paper. Two other structural classes were identified, including one corresponding to E2. It is unclear why they are not described in the paper. Notably, the paper considers in some detail what might occur during the E1-P to E2 state transition, but does not describe the 3.1 Å resolution map for the E2 state that has already been obtained. Does the map support the proposed structural changes?
  
  The paper relies on the quantitative activity comparisons between mutants measured using SSM electrophysiology. Such comparisons are notoriously tricky due to variability between SSM chips and reconstitution efficiencies. The authors should include raw traces for all experiments in the supplementary materials, explain how the replicates were performed, and describe the reproducibility of the results. Related to this point above, size exclusion chromatography profiles and reconstitution efficiencies for mutants should be shown to facilitate comparison between measured activities. For example, could it be that the inactive V496R mutant is misfolded and unstable?
  
  Similarly, are the reduced activities of V496W and V496H (and many other mutants) due to changes in the tunnel or poor biochemical properties of these variants? Without these data, the validity of the ion transport measurements is difficult to assess.
  
  The authors propose that the tunnel connecting the subunits is filled with water and lacks potassium ions. This is an important mechanistic point that has been debated in the field. It would be interesting to calculate the volume of the tunnel and estimate the number of ions that might be expected in it, given their concentration in bulk. It may also be helpful to provide additional discussion on whether some of the observed densities correspond to bound ions with low occupancy.
  
  Review 2
4. Public_Reviews 25 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  By expressing protein in a strain that is unable to phosphorylate KdpFABC, the authors achieve structures of the active wild-type protein, capturing a new intermediate state, in which the terminal phosphoryl group of ATP has been transferred to a nearby Asp, and ADP remains covalently bound. The manuscript examines the coupling of potassium transport and ATP hydrolysis by a comprehensive set of mutants. The most interesting proposal revolves around the proposed binding site for K+ as it exits the channel near T75. Nearby mutations to charged residues cause interesting phenotypes, such as constitutive uncoupled ATPase activity, leading to a model in which lysine residues can occupy/compete with K+ for binding sites along the transport pathway.
  
  Strengths:
  
  Although this structure is not so different from previous structures, its high resolution (2.1 Å) is impressive and allows the resolution of many new densities in the potassium transport pathway. The authors are judicious about assigning these as potassium ions or water molecules, and explain their structural interpretations clearly. In addition to the nice structural work, the mechanistic work is thorough. A series of thoughtful experiments involving ATP hydrolysis/transport coupling under various pH and potassium concentrations bolsters the structural interpretations and lends convincing support to the mechanistic proposal.
  
  Weaknesses:
  
  The structures are supported by solid membrane electrophysiology. These data exhibit some weaknesses, including a lack of information to assess the rigor and reproducibility (i.e., the number of replicates, the number of sensors used, controls to assess proteoliposome reconstitution efficiency, and the stability of proteoliposome absorption to the sensor).
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.05.652293v1
www.biorxiv.org www.biorxiv.org

The promise and peril of comparing fluorescence lifetime in biology revealed by simulations

4
1. Public_Reviews 24 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This study presents an important computational framework, FLiSimBA (Fluorescence Lifetime Simulation for Biological Applications), for modeling experimental limitations in Fluorescence Lifetime Imaging Microscopy (FLIM). FLiSimBA is readily available in MATLAB and Python, enables users to simulate effects of noise and varying sensor expression levels, and provides practical guidance for both lifetime imaging experiments and biosensor development. The analyses are robust, and the evidence supporting the tool's utility in distinguishing between multiple lifetime signals is compelling, indicating strong potential for multiplexed dynamic imaging. However, users should also consider that the tool's effectiveness depends on the suitability of a two-component discrete exponential model.
 
 Summary
2. Public_Reviews 24 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 In this study, Ma et al. aimed to determine previously uncharacterized contributions of tissue autofluorescence, detector afterpulse, and background noise on fluorescence lifetime measurement interpretations. They introduce a computational framework they named "Fluorescence Lifetime Simulation for Biological Applications (FLiSimBA)" to model experimental limitations in Fluorescence Lifetime Imaging Microscopy (FLIM) and determine parameters for achieving multiplexed imaging of dynamic biosensors using lifetime and intensity. By quantitatively defining sensor photon effects on signal to noise in either fitting or averaging methods of determining lifetime, the authors contradict any claims of FLIM sensor expression insensitivity to fluorescence lifetime and highlight how these artifacts occur differently depending on analysis method. Finally, the authors quantify how statistically meaningful experiments using multiplexed imaging could be achieved.
 
 A major strength of the study is the effort to present results in a clear and understandable way given that most researcher do not think about these factors on a day-to-day basis. Additionally, the model code is readily available in Matlab and Python, which should allow for open access to a larger community.
 
 Overall, the authors' achieved their aims of demonstrating how common factors (autofluorescence, background, and sensor expression) will affect lifetime measurements and they present a clear strategy for understanding how sensor expression may confound results if not properly considered. This work should bring to awareness an issue that new users of lifetime biosensors may not be aware of and that experts, while aware, have not quantitatively determine the conditions where these issues arise. This work will also point to future directions for improving experiments using fluorescence lifetime biosensors and the development of new sensors with more favorable properties.
 
 Review 1
3. Public_Reviews 24 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 This study presents a useful computational tool, termed FLiSimBA. The MATLAB-based FLiSimBA simulations allow users to examine the effects of various noise factors (such as autofluorescence, afterpulse of the photomultiplier tube detector, and other background signals) and varying sensor expression levels. Under the conditions explored, the simulations unveiled how these factors affect the observed lifetime measurements, thereby providing useful guidelines for experimental designs. Further simulations with two distinct fluorophores uncovered conditions in which two different lifetime signals could be distinguished, indicating multiplexed dynamic imaging may be possible.
 
 Strengths:
 
 The simulations and their analyses were done systematically and rigorously. FliSimba can be useful for guiding and validating fluorescence lifetime imaging studies. The simulations could define useful parameters such as the minimum number of photons required to detect a specific lifetime, how sensor protein expression level may affect the lifetime data, the conditions under which the lifetime would be insensitive to the sensor expression levels, and whether certain multiplexing could be feasible.
 
 Weaknesses:
 
 The analyses have relied on a key premise that the fluorescence lifetime in the system can be described as a two-component discrete exponential decay. This means that the experimenter should ensure that this is the right model for their fluorophores a priori.
 
 Review 3
4. Public_Reviews 24 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 In this study, Ma et al. aimed to determine previously uncharacterized contributions of tissue autofluorescence, detector afterpulse, and background noise on fluorescence lifetime measurement interpretations. They introduce a computational framework they named "Fluorescence Lifetime Simulation for Biological Applications (FLiSimBA)" to model experimental limitations in Fluorescence Lifetime Imaging Microscopy (FLIM) and determine parameters for achieving multiplexed imaging of dynamic biosensors using lifetime and intensity. By quantitatively defining sensor photon effects on signal-to-noise in either fitting or averaging methods of determining lifetime, the authors contradict any claims of FLIM sensor expression insensitivity to fluorescence lifetime and highlight how these artifacts occur differently depending on the analysis method. Finally, the authors quantify how statistically meaningful experiments using multiplexed imaging could be achieved.
 
 A major strength of the study is the effort to present results in a clear and understandable way given that most researchers do not think about these factors on a day-to-day basis. The model code is available and written in Matlab, which should make it readily accessible, although a version in other common languages such as Python might help with dissemination in the community. One potential weakness is that the model uses parameters that are determined in a
 
 specific way by the authors, and it is not clear how vastly other biological tissue and microscope setups may differ from the values used by the authors.
 
 Overall, the authors achieved their aims of demonstrating how common factors
 
 (autofluorescence, background, and sensor expression) will affect lifetime measurements and they present a clear strategy for understanding how sensor expression may confound results if not properly considered. This work should bring to awareness an issue that new users of lifetime biosensors may not be aware of and that experts, while aware, have not quantitatively determined the conditions where these issues arise. This work will also point to future directions for improving experiments using fluorescence lifetime biosensors and the development of new sensors with more favorable properties.
 
 We appreciate the comments and helpful suggestions. We now also include FLiSimBA simulation code in Python in addition to Matlab to make it more accessible to the community.
 
 One advantage of FLiSimBA is that the simulation package is flexible and adaptable, allowing users to input parameters based on the specific sensors, hardware, and autofluorescence measurements for their biological and optical systems. We used parameters based on a FRETbased sensor, measured autofluorescence from mouse tissue, and measured dark count/after pulse of our specific GaAsP PMT in this manuscript as examples. In Discussion and Materials and methods, we now emphasize this advantage and further clarify how these parameters can be adapted to diverse tissues, imaging systems, and sensors based on individual experiments. We further explain that these input parameters will not affect the conclusions of our study, but the specific input parameters would alter the quantitative thresholds.
 
 Reviewer #2 (Public review):
 
 Summary:
 
 By using simulations of common signal artefacts introduced by acquisition hardware and the sample itself, the authors are able to demonstrate methods to estimate their influence on the estimated lifetime, and lifetime proportions, when using signal fitting for fluorescence lifetime imaging.
 
 Strengths:
 
 They consider a range of effects such as after-pulsing and background signal, and present a range of situations that are relevant to many experimental situations.
 
 Weaknesses:
 
 A weakness is that they do not present enough detail on the fitting method that they used to estimate lifetimes and proportions. The method used will influence the results significantly. They seem to only use the "empirical lifetime" which is not a state of the art algorithm. The method used to deconvolve two multiplexed exponential signals is not given.
 
 We appreciate the comments and constructive feedback. Our revision based on the reviewer’s suggestions has made our manuscript clearer and more user friendly. We originally described the detail of the fitting methods in Materials and methods. Given the importance of these methodological details for evaluating the conclusions of this study, we have moved the description of the fitting method from Materials and methods to Results. In addition, we provide further clarification and more details of the rationale of using these different methods of lifetime estimates in Discussion to aid users in choosing the best metric for evaluating fluorescence lifetime data.
 
 More specifically, we modified our writing to highlight the following.
 
 (1) In Results, we describe that lifetime histograms were fitted to Equation 3 with the GaussNewton nonlinear least-square fitting algorithm and the fitted P<sub1 was used as lifetime estimation.
 
 (2) In Results, we clarify that our simulation of multiplexed imaging was modeled with two sensors, each displaying a single exponential decay, but the two sensors have different decay constants. We also describe that Equation 3 with the Gauss-Newton nonlinear least-square fitting algorithm was used to deconvolve the two multiplexed exponential signals (Fig. 8)
 
 Reviewer #3 (Public review):
 
 Summary:
 
 This study presents a useful computational tool, termed FLiSimBA. The MATLAB-based FLiSimBA simulations allow users to examine the effects of various noise factors (such as autofluorescence, afterpulse of the photomultiplier tube detector, and other background signals) and varying sensor expression levels. Under the conditions explored, the simulations unveiled how these factors affect the observed lifetime measurements, thereby providing useful guidelines for experimental designs. Further simulations with two distinct fluorophores uncovered conditions in which two different lifetime signals could be distinguished, indicating multiplexed dynamic imaging may be possible.
 
 Strengths:
 
 The simulations and their analyses were done systematically and rigorously. FliSimba can be useful for guiding and validating fluorescence lifetime imaging studies. The simulations could define useful parameters such as the minimum number of photons required to detect a specific lifetime, how sensor protein expression level may affect the lifetime data, the conditions under which the lifetime would be insensitive to the sensor expression levels, and whether certain multiplexing could be feasible.
 
 Weaknesses:
 
 The analyses have relied on a key premise that the fluorescence lifetime in the system can be described as two-component discrete exponential decay. This means that the experimenter should ensure that this is the right model for their fluorophores a priori and should keep in mind that the fluorescence lifetime of the fluorophores may not be perfectly described by a twocomponent discrete exponential (for which alternative algorithms have been implemented: e.g., Steinbach, P. J. Anal. Biochem. 427, 102-105, (2012)). In this regard, I also couldn't find how good the fits were for each simulation and experimental data to the given fitting equation (Equation 2, for example, for Figure 2C data).
 
 We thank the reviewer for the constructive feedback. We agree that the FLiSimBA users should ensure that the right decay equations are used to describe the fluorescent sensors. In this study, we used a FRET-based PKA sensor FLIM-AKAR to provide proof-of-principle demonstration of the capability of FLiSimBA. The donor fluorophore of FLIM-AKAR, truncated monomeric enhanced GFP, displays a single exponential decay. FLIM-AKAR, a FRET-based sensor, displays a double exponential decay. The time constants of the two exponential components were determined and reported previously (Chen, et al, Neuron (2017)). Thus, a double exponential decay equation with known τ1 and τ2 was used for both simulation and fitting. The goodness of fit is now provided in Supplementary Fig. 1 for both simulated and experimental data. In addition to referencing our prior study characterizing the double exponential decay model of FLIM-AKAR in Materials and methods, we have emphasized in Discussion the versality of FLiSimBA to adapt to different sensors, tissues, and analysis methods, and the importance of using the right mathematical models to describe the fluorescence decay of specific sensors.
 
 Also, in Figure 2C, the 'sensor only' simulation without accounting for autofluorescence (as seen in Sensor + autoF) or afterpulse and background fluorescence (as seen in Final simulated data) seems to recapitulate the experimental data reasonably well. So, at least in this particular case where experimental data is limited by its broad spread with limited data points, being able to incorporate the additional noise factors into the simulation tool didn't seem to matter too much.
 
 In the original Fig 2C, the sensor fluorescence was much higher than the contributions from autofluorescence, afterpulse, and background signals, resulting in minimal effects of these other factors, as the reviewer noted. This original figure was based on photon counts from single neurons expressing FLIM-AKAR. For the rest of the manuscript, photon counts were based on whole fields of view (FOV). Since the FOV includes cells that do not express fluorescent sensors, the influence of autofluorescence, dark currents, and background is much more pronounced, as shown in Fig. 2B.
 
 Both approaches – using photon counts from the whole FOV or from individual neurons – have their justifications. Photon counts from the whole FOV simulate data from fluorescence lifetime photometry (FLiP), whereas photon counts from individual neurons simulate data from fluorescence lifetime imaging microscopy (FLIM). However, the choice of approach does not affect the conclusions of the manuscript, as a range of photon count values are simulated. To maintain consistency throughout the manuscript, we have revised the photon counts in this figure (now Supplementary Fig. 1C) to match those from the whole FOV.
 
 Additionally, we have made some modifications in our analyses of Supplementary Fig. 1C and Fig. 2B, detailed in the “FLIM analysis” section of Materials and methods. For instance, to minimize system artifact interference at the histogram edges, we now use a narrower time range (1.8 to 11.5 ns) for fitting and empirical lifetime calculation.
 
 Reviewer #1 (Recommendations for the authors):
 
 (1) The authors report how autofluorescence was measured from "imaged brain slices from mice at postnatal 15 to 19 days of age without sensor expression." However, it remains unclear how many acute slices and animals were used (for example, were all 15um x 15um FOV from a single slice) and if mouse age affects autofluorescence quantification. Furthermore, would in vivo measurements have different autofluorescence conditions given that blood flow would be active? It would help if the authors more clearly explained how reliable their autofluorescence measurement is by clarifying how they obtained it, whether this would vary across brain areas, and whether in vitro vs in vivo conditions would affect autofluorescence.
 
 We have added description in Materials and methods that for autofluorescence ‘Fluorescence decay histograms from 19 images of two brain slices from a single mouse were averaged.’ We have added in Discussion that users should carefully ‘measure autofluorescence that matches the age, brain region, and data collection conditions (e.g., ex vivo or in vivo) of their tissue…’, and emphasize that FLiSimBA offers customization of inputs, and it is important for users to adapt the inputs such as autofluorescence to their experimental conditions. We also clarify in Discussion that the change of input parameters such as autofluorescence across age and brain region would not affect the general insights from this study, but will affect quantitative values.
 
 (2) Does sensor expression level issues arise more with in-utero electroporation compared to AAV-based delivery of biosensors? A brief comment on this in the discussion may help as most users in the field today may be using AAV strategies to deliver biosensors.
 
 In our experience, in-utero electroporation results in higher sensor expression than AAV-based delivery, and so pose less concern for expression-level dependence. However, both delivery methods can result in expression level dependence, especially with a sensor that is not bright. We have added in Discussion ‘For a sensor with medium brightness delivered via in utero electroporation, adeno-associated virus, or as a knock-in gene, the brightness may not always fall within the expression level-independent regime.’
 
 (3) Figure 1. Should the x-axis on the top figures be "Time (ns)" instead of "Lifetime (ns)"?
 
 Similarly in Figure 8A&B, wouldn't it make more sense to have the x-axis be Time not Lifetime?
 
 The x-axis labels in Fig. 1 and Fig. 8A-8B have been changed to ‘Time (ns)’.
 
 (4) Figure 2b: why is the empirical lifetime close to 3.5ns? Shouldn't it be somewhere between
 
 2.14 and 0.69?
 
 In our empirical lifetime calculation, we did not set the peak channel to have a time of 0.0488 ns (i.e. the laser cycle 12.5 ns divided by 256 time channels). Rather, we set the first time channel within a defined calculation range (i.e. 1.8 ns in Supplementary Fig. 1B) to have a time of 0.0488 ns (i.e.). Thus, the empirical lifetime exceeds 2.14 ns and depends on the time range of the histogram used for calculation.
 
 For Fig. 2B and Supplementary Fig. 1C, we have now adjusted the range to 1.8-11.5 ns to eliminate FLIM artifacts at the histogram edges in our experimental data, resulting in an empirical lifetime around 2.255 ns. In contrast, the range for calculating the empirical lifetime of simulated data in the rest of the study (e.g. Fig. 4D) is 0.489-11.5 ns, yielding a larger lifetime of ~3.35 ns.
 
 We have clarified these details and our rationale in Materials and methods.
 
 (5) Figure 2b: how come the afterpulse+background contributes more to the empirical lifetime than the autofluorescence (shorter lifetime). This was unclear in the results text why autofluorescence photons did not alter empirical lifetime as much as did the afterpulse/background.
 
 With a histogram range from 1.8 ns to 11.5 ns used in Fig. 2B, the empirical lifetime for FLIM-AKAR sensor fluorescence, autofluorescence, and background/afterpulse are: 2-2.3 ns, around 1.69 ns, and around 4.90 ns. The larger difference of background/afterpulse from FLIM-AKAR sensor fluorescence leads to larger influence of afterpulse+background than autofluorescence. We have added an explanation of this in Results.
 
 (6) One overall suggestion for an improvement that could help active users of lifetime biosensors understand the consequences would be to show either a real or simulated example of a "typical experiment" conducted using FLIM-AKAR and how an incorrect interpretation could be drawn as a consequence of these artifacts. For example, do these confounds affect experiments involving comparisons across animals more than within-subject experiments such as washing a drug onto the brain slice, and the baseline period is used to normalize the change in signal? I think this type of direct discussion will help biosensor users more deeply grasp how these factors play out in common experiments being conducted.
 
 We have added the following in Discussion, ‘…While this issue is less problematic when the same sample is compared over short periods (e.g. minutes), It can lead to misinterpretation when fluorescence lifetime is compared across prolonged periods or between samples when comparison is made across chronic time periods or between samples with different sensor expression levels. For example, apparent changes in fluorescence lifetime observed over days, across cell types, or subcellular compartments may actually reflect variations in sensor expression levels rather than true differences in biological signals (Fig. 6), Therefore, considering biologically realistic factors in FLiSimBA is essential, as it qualitatively impacts the conclusions.’
 
 Reviewer #2 (Recommendations for the authors):
 
 The paper would be improved with more detail on the fitting methods, and the use of state-of-theart methods. Consult for example the introduction of this paper where many methods are listed: https://www.mdpi.com/1424-8220/22/19/7293
 
 We have moved the description of the Gauss-Newton nonlinear least-square fitting algorithm from Materials and methods to Results to enhance clarity. We appreciate the reviewer’s suggestion to combine FLiSimBA with various analysis methods. However, the primary focus of our manuscript is to call for attention of how specific contributing factors in biological experiments influence FLIM data, and to provide a tool that rigorously considers these factors to simulate FLIM data, which can then be used for fitting. Therefore, we did not expand the scope of our manuscript. Instead, we have added in the Discussion that ‘‘FLiSimBA can be used to test multiple fitting methods and lifetime metrics as an exciting future direction for identifying the best analysis method for specific experimental conditions’, citing relevant references.
 
 I would also improve the content of the GitHub repository as it is very hard to identify to source code used for simulation and fitting.
 
 We have reorganized and relabeled our GitHub repository and now have three folders labeled as ‘Simulation_inMatlab’, ‘DataAnalysis_inMatlab’, and ‘SimulationAnalysis_inPython’. We also updated the clarification of the contents of each folder in the README file.
 
 Reviewer #3 (Recommendations for the authors):
 
 (1) P. 10 "For example, to detect a P1 change of 0.006 or a lifetime change of 5 ps with one sample measurement in each comparison group, approximately 300,000 photons are needed." If I am reading the graphs in Figures 3B and C, this sentence is talking about the red line. However, the intersection of 0.006 in the MDD of P1 in 3B and red is not 3E5 photons. And the intersection of 0.005 ns and red in 3C is not 3E5 photons either. Are you sure you are talking about n=1? Maybe the values are correct for the blue curve with n=5.
 
 Thank you for catching our error. We have corrected the text to ‘with five sample measurements’.
 
 (2) Figure 2 (B) legend: It would be helpful to specify what is being compared in the legend. For example, consider revising "* p < 0.05 vs sensor only; n.s. not significant vs sensor + autoF; # p < 0.05 vs sensor + autoF. Two-way ANOVA with Šídák's multiple comparisons test" to "* p <0.05 for sensor + auto F (cyan) vs sensor only; n.s. not significant for final simulated data (purple) vs sensor + autoF; # p < 0.05 for final simulated data (purple) vs sensor + autoF. Twoway ANOVA with Šídák's multiple comparisons test".
 
 We’ve made the change and thanks for the suggestion to make it clearer.
 
 (3) Figure 2 (c) Can you please show the same Two-way ANOVA test values for Experimental vs. Sensor only and for Experimental vs. Sensor + autoF? Currently, the value (n.s.) is marked only for Experimental vs. Final simulation. Given that the experimental data are sparse (compared to the simulations), it seems likely that there may be no significant difference among the 3 different simulations regarding how well they match the experimental data. Also, can you specify the P1 and P2 of the experimental data used to generate the simulated data on this panel? Also, what is the reason why P1=0.5 was used for panels A and B, instead of the value matching the experimental value?
 
 As the reviewer suggested, we have included statistical tests in the figure (now Supplementary Fig. 1C). Please see our response to the Public Review of Reviewer 3’s comments as well as our changes in Materials and Methods on other changes and their rationale for this figure. We have now specified the P1 value of the experimental data used to generate the simulated data on this panel both in Figure Legends and Materials and Methods. Based on the suggestion, we have now used the same P1 value in Fig. 2B.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2023.12.20.572686v4
www.biorxiv.org www.biorxiv.org

Strip cropping shows promising increases in ground beetle community diversity compared to monocultures

3
1. Public_Reviews 24 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study presents important findings on increased ground beetle diversity in strip cropping compared with crop monocultures. Solid methods are used to analyze data from multiple sites with heterogeneous systems of mixed crops, allowing broad conclusions, albeit at the expense of lacking taxonomic specificity. The work will be of interest to all those applying plant diversity treatments to improve the diversity of associated animals in agricultural fields.
  
  Summary
2. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Joint Public Review:
  
  Summary:
  
  In this paper the authors examined the effects of strip cropping, a relatively new agricultural technique of alternating crops in small strips of several meters wide, on ground beetle diversity. The results show an increase in species diversity (i.e. abundance and species richness) of the ground beetle communities compared to monocultures.
  
  Strengths:
  
  The article is well written; it has an easily readable tone of voice without too much jargon or overly complicated sentence structure. Moreover, as far as reviewing the models in depth without raw data and R scripts allows, the statistical work done by the authors looks good. They have well thought out how to handle heterogenous, unbalanced and taxonomically unspecific yet spatially and temporarily correlated field data. The models applied and the model checks performed are appropriate for the data at hand. Combining RDA and PCA axes together is a nice touch. Moreover, after the first round of reviews, the authors have done a great job at rewriting the paper to make it less overstated, more relevant to the data at hand and more solid in the findings. Many of the weaknesses noted in the first review have been dealt with. The overall structure of the paper is good, with a clear introduction, hypotheses, results section and discussion.
  
  Review 1
3. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the previous reviews
  
  Reviewer #3 (Public Review)
  
  Summary:
  
  In this paper the authors examined the effects of strip cropping, a relatively new agricultural technique of alternating crops in small strips of several meters wide, on ground beetle diversity. The results show an increase in species diversity (i.e. abundance and species richness) of the ground beetle communities compared to monocultures.
  
  Strengths:
  
  The article is well written; it has an easily readable tone of voice without too much jargon or overly complicated sentence structure. Moreover, as far as reviewing the models in depth without raw data and R scripts allows, the statistical work done by the authors looks good. They have well thought out how to handle heterogenous, unbalanced and taxonomically unspecific yet spatially and temporarily correlated field data. The models applied and the model checks performed are appropriate for the data at hand. Combining RDA and PCA axes together is a nice touch. Moreover, after the first round of reviews, the authors have done a great job at rewriting the paper to make it less overstated, more relevant to the data at hand and more solid in the findings. Many of the weaknesses noted in the first review have been dealt with. The overall structure of the paper is good, with a clear introduction, hypotheses, results section and discussion.
  
  We are grateful for this positive feedback. We are glad that our extensive revision after extensive review from three reviewers has paid off in addressing earlier weakness of our manuscript.
  
  Weaknesses:
  
  The weaknesses that remain are mainly due to a difficult dataset and choices that could have stressed certain aspects more, like the relationship between strip cropping and intercropping. The mechanistic understanding of strip cropping is what is at stake here. Does strip cropping behave similar to intercropping, a technique which has been proven to be beneficial to biodiversity because of added effects due to increased resource efficiency and greater plant species richness.
  
  Unfortunately, the authors do not go into this in the introduction or otherwise and simply state that they consider strip cropping a form of intercropping.
  
  We agree with the reviewer that a mechanistic understanding on how intercropping and strip cropping differ would be very interesting. However, we also feel that this topic is somewhat beyond the scope of the current manuscript. We are already planning work to elucidate mechanisms that may explain the pest and suppressive effects of strip cropping.
  
  I also do not like the exclusive focus on percentages, as these are dimensionless. I think more could have been done to show underlying structure in the data, even after rarefaction.
  
  While we generally agree with this point raised by the reviewer, for our heterogeneous dataset it was difficult to come up with meaningful units with dimensions. Therefore, we believe that percentages are the most suitable approach to present readers a fair comparison of the treatments.
  
  A further weakness is a limited embedding into the larger scientific discourses other than providing references. But this may be a matter of style and/or taste
  
  We believe our manuscript to be well-embedded within the relevant scientific discourse, but as indicated by reviewer 3 this might indeed be a matter of style/taste. Without exact examples it is difficult for us to judge this point.
  
  Reviewer #3 (Recommendations for the authors):
  
  Suggestion for title: "Strip cropping shows promising preliminary increases in ground beetle community diversity compared to monocultures"
  
  We agree that the title could indeed be nuanced. We incorporated the suggested title, except for the word “preliminary”, as we felt that this is slightly misplaced for a 4-year study conducted at 4 locations.
  
  line 26: the word previous may be confusing to readers, as it suggests previous research on beetles or insects. I think it would be better to use for instance "related" or "productivity focused research"
  
  We agree that this wording might be confusing, and changed it to “other studies showed”.
  
  Line 84-85: this is vague. can you make explicit what you are trying to answer here?
  
  We made “biodiversity metric changes” more explicit, and changed the sentence accordingly.
  
  Line 88-89: I think this would fit better with the first question in line 83-84, so I suggest placing it upwards. Also, I think you mean abundant instead of common. Common suggests commonness in the entire population. Abundant suggests found often in this study. While these definitions may very much overlap, they are distinctly different.
  
  We have moved this sentence up and changed “common” to “abundant”. To make the result section more in line with this section, we also moved the section on the relationship between crop configuration and abundant genera up.
  
  Line 146: defining rareness of species should be in the methods section. Also "following" would be better than "according"
  
  We now added a sentence on how we examine habitat preferences and rarity in the methods section (line 316-317). We also changed “according to” to “following”.
  
  Line 291: it is called being "flush" with the soil surface. This expression is not much used by non-native speakers, but is regularly encountered in studies on pitfalls, so the authors could decide to change the sentence using the proper English vernacular.
  
  Suggestion incorporated.
  
  Line 322-327, this method could do with a reference
  
  This method is a relatively standard calculation to calculate relative changes and to center variation around zero. Nevertheless, we added a reference to a paper that used the same method.
  
  Line: 333-335. I would still like to see a reference for this method.
  
  This methodology has not been described in literature to the best of our knowledge. As we compared two crops within strip cropping with their respective monoculture references, we compare one strip cropping field with two monocultural fields. Here we took a conservative approach by comparing the strip crop field with the monoculture with the highest richness and activity density, to see if strip cropped fields outperformed monocultures with diverse ground beetle communities.
  
  Line 364-366. references?
  
  We have added references for these R packages.
  
  AuthorResponse
Visit annotations in context

Tags

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.11.02.621655v3
www.biorxiv.org www.biorxiv.org

Dimerization and dynamics of angiotensin-I converting enzyme revealed by cryo-EM and MD simulations

5
1. Public_Reviews 24 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This study shows, for the first time, the structure and snapshots of the dynamics of the full-length soluble Angiotensin-I converting enzyme dimer. The combination of structural and computational analyses provides compelling evidence that reveals the conformational dynamics of the complex and key regions mediating the conformational change. This fundamental work illustrates how conformational heterogeneity can be used to gain insights into protein function.
  
  Summary
2. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors report four cryoEM structures (2.99 to 3.65 Å resolution) of the 180 kDa, full-length, glycosylated, soluble Angiotensin-I converting enzyme (sACE) dimer, with two homologous catalytic domains at the N- and C-terminal ends (ACE-N and ACE-C). ACE is a protease capable of effectively degrading Aβ. The four structures are C2 pseudo-symmetric homodimers and provide insight into sACE dimerization. These structures were obtained using discrete classification in cryoSPARC and show different combinations of open, intermediate, and closed states of the catalytic domains, resulting in varying degrees of solvent accessibility to the active sites.
  
  To deepen the understanding of the gradient of heterogeneity (from closed to open states) observed with discrete classification, the authors performed all-atom MD simulations and continuous conformational analysis of cryo-EM data using cryoSPARC 3DVA, cryoDRGN, and RECOVAR. cryoDRGN and cryoSPARC 3DVA revealed coordinated open-closed transitions across four catalytic domains, whereas RECOVAR revealed independent motion of two ACE-N domains, also observed with cryoSPARC focused classification. The authors suggest that the discrepancy in the results of the different methods for continuous conformational analysis in cryo-EM could results from different approaches used for dimensionality reduction and trajectory generation in these methods.
  
  Strengths:
  
  This is an important study that shows, for the first time, the structure and the snapshots of the dynamics of the full-length sACE dimer. Moreover, the study highlights the importance of combining insights from different cryo-EM methods that address questions difficult or impossible to tackle experimentally, while lacking ground truth for validation.
  
  Weaknesses (from the last round of review):
  
  The open, closed, and intermediate states of ACE-N and ACE-C in the four cryo-EM structures from discrete classification were designated quantitatively (based on measured atomic distances on the models fitted into cryo-EM maps). Unfortunately, atomic models were not fitted into cryo-EM maps obtained with cryoSPARC 3DVA, cryoDRGN, and RECOVAR, and the open/closed states in these cases were designated based on a qualitative analysis.
  
  Review 1
3. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  The manuscript presents a valuable contribution to the field of ACE structural biology and dynamics by providing the first complete full-length dimeric ACE structure in four distinct states. The study integrates cryo-EM and molecular dynamics simulations to offer important insights into ACE dynamics. The depth of analysis is commendable, and the combination of structural and computational approaches enhances our understanding of the protein's conformational landscape.
  
  Review 2
4. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  Mancl et al. report four Cryo-EM structures of glycosylated and soluble Angiotensin-I converting enzyme (sACE) dimer. This moves forward the structural understanding of ACE, as previous analysis yielded partially denatured or individual ACE domains. By performing a heterogeneity analysis, the authors identify three structural conformations (open, intermediate open, and closed) that define the openness of the catalytic chamber and structural features governing the dimerization interface. They show that the dimer interface of soluble ACE consists of an N-terminal glycan and protein-protein interaction regions, as well as C-terminal protein-protein interactions. Further heterogeneity mining and all-atom molecular dynamic simulations show structural rearrangements that lead to the opening and closing of the catalytic pocket, which could explain how ACE binds its substrate. These studies could contribute to future drug design targeting the active site or dimerization interface of ACE.
  
  Strengths:
  
  The authors make significant efforts to address ACE denaturation on cryo-EM grids, testing various buffers and grid preparation techniques. These strategies successfully reduce denaturation and greatly enhance the quality of the structural analysis. The integration of cryoDRGN, 3DVA, RECOVAR, and all-atom simulations for heterogeneity analysis proves to be a powerful approach, further strengthening the overall experimental methodology.
  
  Weaknesses:
  
  No weaknesses noted.
  
  Review 3
5. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the previous reviews
  
  We would like to thank you and your chosen reviewers for the diligent work and insightful comments. Following the latest round of feedback, we have made the following changes to the manuscript:
  
  (1) We have added details regarding the specific versions of Cryosparc and cryoDRGN used in our analysis.
  
  (2) We have addressed Reviewer 2’s comment concerning the negative RMSF values in Figure S12. The negative values occur because this display shows the difference in RMSF values from the MD simulations of glycosylated versus non-glycosylated ACE. To avoid similar confusion, we have split Figure S12 into three panels. Panels A and B show the RMSF values for each residue in the glycosylated and non-glycosylated sACE MD simulations, respectively, and all values here are positive. Panel C (the original Figure S12) now includes expanded labeling to clarify that it depicts the difference in RMSF values between the presence and absence of glycans. In this panel, a negative value indicates that the residues exhibit higher RMSF in simulations where glycans are present. The figure legend has been revised to accurately describe the updated figure.
  
  AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.01.09.632263v6
www.biorxiv.org www.biorxiv.org

Mitochondrial protein FgDML1 regulates DON toxin biosynthesis and cyazofamid sensitivity in Fusarium graminearum by affecting mitochondrial homeostasis

4
1. Public_Reviews 24 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This important study provides a potential framework for understanding the regulatory mechanisms of DON toxin biosynthesis in F. graminearum and identifies potential molecular targets for Fusarium head blight control. While FgDML1 remains under-explored with an unclear role in the biology of filamentous fungi, the supporting evidence in this study is incomplete. Providing details on methods and adding controls will strengthen the work.
  
  Summary
2. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  In their study, the authors investigated the F. graminearum homologue of the Drosophila Misato-Like Protein DML1 for a function in secondary metabolism and sensitivity to fungicides.
  
  Strengths:
  
  Generally, the topic of the study is interesting and timely, and the manuscript is well written, albeit in some cases, details on methods or controls are missing.
  
  Weaknesses:
  
  However, a major problem I see is with the core result of the study, the decrease in the DON content associated with the deletion of FgDML1. Although some growth data are shown in Figure 6, indicating a severe growth defect, the DON production presented in Figure 3 is not related to biomass. Also, the method and conditions for measuring DON are not described. Consequently, it could well be concluded that the decreased amount of DON detected is simply due to decreased growth, and the specific DON production of the mutant remains more or less the same.
  
  To alleviate this concern, it is crucial to show the details on the DON measurement and growth conditions and to relate the biomass formation under the same conditions to the DON amount detected. Only then can a conclusion as to an altered production in the mutant strains be drawn.
  
  Review 1
3. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The manuscript entitled "Mitochondrial Protein FgDML1 Regulates DON Toxin Biosynthesis and Cyazofamid Sensitivity in Fusarium graminearum by affecting mitochondrial homeostasis" identified the regulatory effect of FgDML1 in DON toxin biosynthesis and sensitivity of Fusarium graminearum to cyazofamid. The manuscript provides a theoretical framework for understanding the regulatory mechanisms of DON toxin biosynthesis in F. graminearum and identifies potential molecular targets for Fusarium head blight control. The paper is innovative, but there are issues in the writing that need to be addressed and corrected.
  
  Weaknesses:
  
  (1) The authors speculate that cyazofamid treatment caused upregulation of the assembly factors, leading to a change in the conformation of the Qi protein, thus restoring the enzyme activity of complex III. But no speculation was given in the discussion as to why this would lead to the upregulation of assembly factors, and how the upregulation of assembly factors would change the protein conformation, and is there any literature reporting a similar phenomenon? I would suggest adding this to the discussion.
  
  (2) Would increased sensitivity of the mutant to cell wall stress be responsible for the excessive curvature of the mycelium?
  
  (3) The vertical coordinates of Figure 7B need to be modified with positive inhibition rates for the mutants.
  
  Review 2
4. Public_Reviews 24 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  Summary:
  
  The manuscript "Mitochondrial 1 protein FgDML1 regulates DON toxin biosynthesis and cyazofamid sensitivity in Fusarium graminearum by affecting mitochondrial homeostasis" describes the construction of a null mutant for the FgDML1 gene in F. graminearum and assays characterising the effects of this mutation on the pathogen's infection process and lifecycle. While FgDML1 remains underexplored with an unclear role in the biology of filamentous fungi, and although the authors performed several experiments, there are fundamental issues with the experimental design and execution, and interpretation of the results.
  
  Strengths:
  
  FgDML1 is an interesting target, and there are novel aspects in this manuscript. Studies in other organisms have shown that this protein plays important roles in mitochondrial DNA (mtDNA) inheritance, mitochondrial compartmentalisation, chromosome segregation, mitochondrial distribution, mitochondrial fusion, and overall mitochondrial dynamics. Indeed, in Saccharomyces cerevisiae, the mutation is lethal. The authors have carried out multi-faceted experiments to characterise the mutants.
  
  Weaknesses:
  
  However, I have concerns about how the study was conceived. Given the fundamental importance of mitochondrial function in eukaryotic cells and how the absence of this protein impacts these processes, it is unsurprising that deletion of this gene in F. graminearum profoundly affects fungal biology. Therefore, it is misleading to claim a direct link between FgDML1 and DON toxin biosynthesis (and virulence), as the observed effects are likely indirect consequences of compromised mitochondrial function. In fact, it is reasonable to assume that the production of all secondary metabolites is affected to some extent in the mutant strains and that such a strain would not be competitive at all under non-laboratory conditions. The order in which the authors present the results can be misleading, too. The results on vegetative growth rate appeared much later in the manuscript, which should have come first, as the FgDML1 mutant exhibited significant growth defects, and subsequent results should be discussed in that context. Moreover, the methodologies are not described properly, making the manuscript hard to follow and difficult to replicate.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.05.23.655648v1
www.biorxiv.org www.biorxiv.org

Dual transcranial electromagnetic stimulation of the precuneus-hippocampus network boosts human long-term memory

5
1. Public_Reviews 23 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This work presents potentially important findings suggesting that a combination of transcranial stimulation approaches applied for a short period could improve memory performance. However, the evidence supporting the conclusions is currently incomplete. In particular, the claims relating to the specific neural mechanisms and anatomical sites of action underlying effects were viewed as overstated in the current version. The results potentially have implications for non-invasive enhancement of cognitive functions.
 
 Summary
2. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors make a bold claim that a combination of repetitive transcranial magnetic stimulation (intermittent theta burst-iTBS) and transcranial alternating current stimulation (gamma tACS) causes slight improvements in memory in a face/name/profession task.
 
 Strengths:
 
 The idea of stimulating the human brain non-invasively is very attractive because, if it worked, it could lead to a host of interesting applications. The current study aims to evaluate one such exciting application.
 
 Weaknesses:
 
 (1) The title refers to the "precuneus-hippocampus" network. A clear definition of what is meant by this terminology is lacking. More importantly, mechanistic evidence that the precuneus and the hippocampus are involved in the potential effects of stimulation remains unconvincing.
 
 (2) The question of the extent to which the stimulation approach and the stimulation parameters used in these experiments causes specific and functionally relevant neural effects remains open. Invasive recordings that could address this question remain out of the scope of this non-invasive study. The authors conducted scalp EEG experiments in an attempt to address this question using non-invasive methods. However, the results shown in Fig. 3 are unclear. The results are inconsistently reported in units of microvolts squared in some panels (3A, 3B) and in units of microvolts in other panels (3C). Also, there is insufficient consideration of potential contamination by signal components reflecting eye movements, other muscle artifacts, or another volume-conducted signal reflecting aggregate activity inside the brain.
 
 (3) Figure 3 indicates "Precuneus oscillatory activity ...", but evidence that the activity presented reflects precuneus activity is lacking. The maps shown at the bottom of Figure 3C suggest that the EEG signals recorded with scalp EEG reflect activity generated across a wide spatial range, with a peak encompassing at least tens of centimeters. Thus, evidence that effects specifically reflect precuneus activity, as the paper's title and text throughout the manuscript suggest, is lacking.
 
 (4) The paper as currently presented (e.g., Figure 3) also lacks rigorous evidence of relevant oscillatory activity. Prior to filtering EEG signals in a particular frequency band, clear evidence of oscillations in the frequency band of interest should be shown (e.g., demonstration of a clear peak that emerges naturally in the frequency range of interest when spectral analysis is applied to "raw" signals). The authors claim that gamma oscillations change because of the stimulation, but a clear peak in the gamma range prior to stimulation is not apparent in the data as currently presented. Thus, the extent to which spectral measurements during stimulation reflect physiological gamma oscillations remains unclear.
 
 (5) Concerns remain regarding the rigor of statistical analyses in the revised manuscript (see also point 8 below). Figure 3B shows an undefined statistical test with p<0.05. The statistical test that was used is not explained. Also, a description of how corrections for multiple comparisons were made is missing. Figures 3A and 3C are not accompanied by statistics, making the results difficult to interpret. For Figure 4C, a claim was made based on a significant p-value for one statistical test and a non-significant p-value in another test. This is a common statistical mistake (see Figure 1 and accompanying discussion in Makin and Orban de Xivry (2019) Science Forum: Ten common statistical mistakes to watch out for when writing or reviewing a manuscript. eLife 8:e48175).
 
 (6) In the second question posed in the original review, I highlighted that it was unclear how such stimulation would produce memory enhancement. The authors replied that, in the absence of mechanisms, there are many other studies that suffer from the same problem. This raises the question of placebo effects. The paper does not sufficiently address or discuss the possibility that any potential stimulation effects may reflect placebo effects.
 
 (7) The third major concern in the original review was the lack of evidence for a mechanism that is specific to the precuneus. Evidence for specific involvement of the precuneus remains lacking in the revised manuscript. The authors state: "the non-invasive stimulation protocol was applied to an individually identified precuneus for each participant". However, the meaning of this statement is unclear. Specifically, it is unclear how the authors know that they are specifically targeting the precuneus. Without directly recording from the precuneus and directly demonstrating effects, which is outside of the scope of the study, specific involvement of the precuneus seems speculative. Also, it does not seem as though a figure was included in the paper to show how the stimulation protocol specifically targets the precuneus. In their response to the original reviews, the authors state that posterior medial parietal areas are the only regions that show significant differences following the stimulation, but they did not cite a specific figure, or statistics reported in the text, that show this. In any event, posterior medial parietal areas encompass a wide area of the brain, so this would still not provide evidence for an effect specifically involving the precuneus.
 
 (8) Regarding chance levels, it is unfortunate that the authors cannot quantify what chance levels are in the immediate and delayed recall conditions. This makes interpretation of the results challenging. In the immediate and delayed conditions, the authors state that the chance level is 33%. It would be useful to mark this in the figures. If I understand correctly, chance is 33% in Fig. 2A. If this is the case and if I am interpreting the figure correctly: Gray bars for the sham condition appear to be below chance (~20-25%). Why is this condition associated with an accuracy level that is lower than chance? Cyan bars and red bars do not appear to be significantly different from chance (i.e., 33%), with red slightly higher than cyan. What statistic was performed to obtain the level of significance indicated in the figure? The highest average value for the red condition appears to be around 35%. More details are needed to fully explain this figure and to support the claims associated with this figure.
 
 (9) In the revised version of the paper, the authors did not address concerns associated with the block design (please see question 4d in the original review).
 
 In sum, this study presents an admirable aspirational goal, the notion that a non-invasive stimulation protocol could modulate activity in specific brain regions to enhance memory. However, the evidence presented at the behavioral level and at the mechanistic level (e.g. the putative involvement of specific brain regions) remains unconvincing.
 
 Review 1
3. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The manuscript by Borghi and colleagues provides evidence that the combination of intermittent theta burst TMS stimulation and gamma transcranial alternating current stimulation (γtACS) targeting the precuneus increases long-term associative memory in healthy subjects compared to iTBS alone and sham conditions. Using a rich dataset of TMS-EEG and resting-state functional connectivity (rs-FC) maps and structural MRI data, the authors also provide evidence that dual stimulation increased gamma oscillations and functional connectivity between the precuneus and hippocampus. Enhanced memory performance was linked to increased gamma oscillatory activity and connectivity through white matter tracts.
 
 Strengths:
 
 The combination of personalized repetitive TMS (iTBS) and gamma tACS is a novel approach to targeting the precuneus, and thereby, connected memory-related regions to enhance long-term associative memory. The authors leverage an existing neural mechanism engaged in memory binding, theta-gamma coupling, by applying TMS at theta burst patterns and tACS at gamma frequencies to enhance gamma oscillations. The authors conducted a thorough study that suggests that simultaneous iTBS and gamma tACS could be a powerful approach for enhancing long-term associative memory. The paper was well-written, clear, and concise.
 
 Comments on Revision:
 
 I thank the authors for their thoughtful responses to my first review and their inclusion of more detailed methodological discussion of their rationale for the stimulation protocol conditions and timing. Regarding the apparent difference in connectivity at baseline between conditions, the explanation that this is due to intrinsic dynamics, state, or noise implies the baseline is reflecting transient changes in dynamics rather than a true or stable baseline. Based on this, it looks like iTBS solely is significantly greater than the baseline before the iTBS and γtACS condition but maybe not that much lower than post-stimulation period for iTBS and γtACS. A longer baseline period should be used to ensure transient states are not driving baseline levels such that these endogenous fluctuations would average out. This also raises questions about whether the effect of iTBS and γtACS or iTBS alone are dependent on the intrinsic state at the time when stimulation begins. Their additional clarification of memory scoring is helpful but also reveals that the effect of dual iTBS+γtACS specifically on the association between faces and names is just significant. This modest increase in associative memory should be taken into consideration when interpreting these findings.
 
 Review 2
4. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 Borghi and colleagues present results from 4 experiments aimed at investigating the effects of dual γtACS and iTBS stimulation of the precuneus on behavioral and neural markers of memory formation. In their first experiment (n = 20), they find that a 3-minute offline (i.e., prior to task completion) stimulation that combines both techniques leads to superior memory recall performance in an associative memory task immediately after learning associations between pictures of faces, names, and occupation, as well as after a 15-minute delay, compared to iTBS alone (+ tACS sham) or no stimulation (sham for both iTBS and tACS). Performance in a second task probing short-term memory was unaffected by the stimulation condition. In a second experiment (n = 10), they show that these effects persist over 24 hours and up to a full week after initial stimulation. A third (n = 14) and fourth (n = 16) experiment were conducted to investigate neural effects of the stimulation protocol. The authors report that, once again, only combined iTBS and γtACS increases gamma oscillatory activity and neural excitability (as measured by concurrent TMS-EEG) specific to the stimulated area at the precuneus compared to a control region, as well as precuneus-hippocampus functional connectivity (measured by resting state MRI), which seemed to be associated with structural white matter integrity of the bilateral middle longitudinal fasciculus (measured by DTI).
 
 Strengths:
 
 Combining non-invasive brain stimulation techniques is a novel, potentially very powerful method to maximize the effects of these kinds of interventions that are usually well-tolerated and thus accepted by patients and healthy participants. It is also very impressive that the stimulation-induced improvements in memory performance resulted from a short (3 min) intervention protocol. If the effects reported here turn out to be as clinically meaningful and generalizable across populations as implied, this approach could represent a promising avenue for treatment of impaired memory functions in many conditions.
 
 Methodologically, this study is expertly done! I don't see any serious issues with the technical setup in any of the experiments. It is also very commendable that the authors conceptually replicated the behavioral effects of experiment 1 in experiment 2 and then conducted two additional experiments to probe the neural mechanisms associated with these effects. This certainly increases the value of the study and the confidence in the results considerably.
 
 The authors used a within-subject approach in their experiments, which increases statistical power and allows for stronger inferences about the tested effects. They also used to individualize stimulation locations and intensities, which should further optimize the signal-to-noise ratio.
 
 Weaknesses:
 
 I think one of the major weaknesses of this study is the overall low sample size in all of the experiments (between n = 10 and n = 20). This is, as I mentioned when discussing the strengths of the study, partly mitigated by the within-subject design and individualized stimulation parameters. The authors mention that they performed a power analysis but this analysis seemed to be based on electrophysiological readouts similar to those obtained in experiment 3. It is thus unclear whether the other experiments were sufficiently powered to reliably detect the behavioral effects of interest. In the revised manuscript, the authors provide post-hoc sensitivity analyses that help contextualize the strength of the findings.
 
 While the authors went to great lengths trying to probe the neural changes likely associated with the memory improvement after stimulation, it is impossible from their data to causally relate the findings from experiments 3 and 4 to the behavioral effects in experiments 1 and 2. This is acknowledged by the authors and there are good methodological reasons for why TMS-EEG and fMRI had to be collected in separate experiments, but readers should keep in mind that this limits inferences about how exactly dual iTBS and γtACS of the precuneus modulate learning and memory.
 
 Review 3
5. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 Summary:
 
 The authors claim that they can use a combination of repetitive transcranial magnetic stimulation (intermittent theta burst-iTBS) and transcranial alternating current stimulation (gamma tACS) to cause slight improvements in memory in a face/name/profession task.
 
 Strengths:
 
 The idea of stimulating the human brain non-invasively is very attractive because, if it worked, it could lead to a host of interesting applications. The current study aims to evaluate one such exciting application.
 
 Weaknesses:
 
 (1) It is highly unclear what, if anything, transpires in the brain with non-invasive stimulation. To cite one example of many, a rigorous study in rats and human cadavers, compellingly showed that traditional parameters of transcranial electrical stimulation lead to no change in brain activity due to the attenuation by the soft tissue and skull (Mihály Vöröslakos et al Nature Communications 2018): https://www.nature.com/articles/s41467-018-02928-3. It would be very useful to demonstrate via invasive neurophysiological recordings that the parameters used in the current study do indeed lead to any kind of change in brain activity. Of course, this particular study uses a different non-invasive stimulation protocol.
 
 Thank you for raising the important issue regarding the actual neurophysiological effects of non-invasive brain stimulation. Unfortunately, invasive neurophysiological recordings in humans for this type of study are not feasible due to ethical constraints, while studies on cadavers or rodents would not fully resolve our question. Indeed, the authors of the cited study (Mihály Vöröslakos et al., Nature Communications, 2018) highlight the impossibility of drawing definitive conclusions about the exact voltage required in the in-vivo human brain due to significant differences between rats and humans, as well as the in-vivo human brain and cadavers due to alterations in electrical conductivity that occur in postmortem tissue. Huang and colleagues addressed the difficulties in reaching direct evidence of non-invasive brain stimulation (NIBS) effects in a review published in Clinical Neurophysiology in 2017. They conclude that the use of EEG to assess brain response to TMS has great potential for a less indirect demonstration of plasticity mechanisms induced by NIBS in humans.
 
 To address this challenge, we conducted Experiments 3 and 4, which respectively examined the neurophysiological and connectivity changes induced by the stimulation in a non-invasive manner using TMS-EEG and fMRI. The observed changes in brain oscillatory activity (increased gamma oscillatory activity), cortical excitability (enhanced posteromedial parietal cortex reactivity), and brain connectivity (strengthened connections between the precuneus and hippocampi) provided evidence of the effects of our non-invasive brain stimulation protocol, further supporting the behavioral data.
 
 Additionally, we carefully considered the issue of stimulation distribution and, in response, performed a biophysical modeling analysis and E-field calculation using the parameters employed in our study (see Supplementary Materials).
 
 We acknowledge that further exploration of this aspect would be highly valuable, and we agree that it is worth discussing both as a technical limitation and as a potential direction for future research. We therefore, modify the discussion accordingly (main text, lines 280-289).
 
 “Although we studied TMS and tACS propagation through the E-field modeling and observed an increase in the precuneus gamma oscillatory activity, excitability and connectivity with the hippocampi, we cannot exclude that our results might reflect the consequences of stimulating more superficial parietal regions other than the precuneus nor report direct evidence of microscopic changes in the brain after the stimulation. Invasive neurophysiological recordings in humans for this type of study are not feasible due to ethical constraints. Studies on cadavers or rodents would not fully resolve our question due to significant differences between them (i.e. rodents do not have an anatomical correspondence while cadavers have an alterations in electrical conductivity occurring in postmortem tissue). However, further exploration of this aspect in future studies would help in the understanding of γtACS+iTBS effects.”
 
 (2) If there is any brain activity triggered by the current stimulation parameters, then it is extremely difficult to understand how this activity can lead to enhancing memory. The brain is complex. There are hundreds of neuronal types. Each neuron receives precise input from about 10,000 other neurons with highly tuned synaptic strengths. Let us assume that the current protocol does lead to enhancing (or inhibiting) simultaneously the activity of millions of neurons. It is unclear whether there is any activity at all in the brain triggered by this protocol, it is also unclear whether such activity would be excitatory, or inhibitory. It is also unclear how many neurons, let alone what types of neurons would change their activity. How is it possible that this can lead to memory enhancement? This seems like using a hammer to knock on my laptop and hope that the laptop will output a new Mozart-like sonata.
 
 Thank you for your comment. As you correctly point out, we still do not have precise knowledge of which neurons—and to what extent—are activated during non-invasive brain stimulation in humans. However, this challenge is not limited to brain stimulation but applies to many other therapeutic interventions, including psychiatric medications, without limiting their use.
 
 Nevertheless, a substantial body of research has investigated the mechanisms underlying the efficacy of TMS and tACS in producing behavioral after-effects, primarily through its ability to induce long-term potentiation (Bliss & Collingridge, The Journal of Physiology, 1993a; Ridding & Rothwell, Nature Reviews Neuroscience, 2007; Huang et al., Clinical Neurophysiology, 2017; Koch et al., Neuroimage 2018; Koch et al., Brain 2022; Jannati et al., Neuropsychopharmacology, 2023; Wischnewski et al., Trends in Cognitive Science, 2023; Griffiths et al., Trends in Neuroscience, 2023).
 
 We acknowledge that we took this important aspect for granted. We consequently expanded the introduction accordingly (main text, lines 48-60).
 
 “Repetitive transcranial magnetic stimulation (rTMS) and transcranial alternating current stimulation (tACS) are two forms of NIBS widely used to enhance memory performances (Grover et al., 2022; Koch et al., 2018; Wang et al., 2014). rTMS, based on the principle of Faraday, induces depolarization of cortical neuronal assemblies and leads to after-effects that have been linked to changes in synaptic plasticity involving mechanisms of long-term potentiation (LTP) (Huang et al., 2017; Jannati et al., 2023). On the other hand, tACS causes rhythmic fluctuations in neuronal membrane potentials, which can bias spike timing, leading to an entrainment of the neural activity (Wischnewski et al., 2023). In particular, the induction of gamma oscillatory a has been proposed to play an important role in a type of LTP known as spike timing-dependent plasticity, which depends on a precise temporal delay between the firing of a presynaptic and a postsynaptic neuron (Griffiths and Jensen, 2023). Both LTP and gamma oscillations have a strong link with memory processes such as encoding (Bliss and Collingridge, 1993; Griffiths and Jensen, 2023; Rossi et al., 2001), pointing to rTMS and tACS as good candidates for memory enhancement.”
 
 (3) Even if there is any kind of brain activation, it is unclear why the authors seem to be so sure that the precuneus is responsible. Are there neurophysiological data demonstrating that the current protocol only activates neurons in the precuneus? Of note, the non-invasive measurements shown in Figure 3 are very weak (Figure 3A top and bottom look very similar, and Figure 3C left and right look almost identical). Even if one were to accept the weak alleged differences in Figure 3, there is no indication in this figure that there is anything specific to the precuneus, rather a whole brain pattern. This would be the kind of minimally rigorous type of evidence required to make such claims. In a less convincing fashion, one could look at different positions of the stimulation apparatus. This would not be particularly compelling in terms of making a statement about the precuneus. But at least it would show that the position does matter, and over what range of distances it matters, if it matters.
 
 Thank you for your feedback. Our assumption that the precuneus plays a key role in the observed effects is based on several factors:
 
 (1) The non-invasive stimulation protocol was applied to an individually identified precuneus for each participant. Given existing evidence on TMS propagation, we can reasonably assume that the precuneus was at least a mediator of the observed effects (Ridding & Rothwell, Nature Reviews Neuroscience 2007). For further details about target identification and TMS and tACS propagation, please refer to the MRI data acquisition section in the main text and Biophysical modeling and E-field calculation section in the supplementary materials.
 
 (2) To investigate the effects of the neuromodulation protocol on cortical responses, we conducted a whole-brain analysis using multiple paired t-tests comparing each data point between different experimental conditions. To minimize the type I error rate, data were permuted with the Monte Carlo approach and significant p-values were corrected with the false discovery rate method (see the Methods section for details). The results identified the posterior-medial parietal areas as the only regions showing significant differences across conditions.
 
 (3) To control for potential generalized effects, we included a control condition in which TMS-EEG recordings were performed over the left parietal cortex (adjacent to the precuneus). This condition did not yield any significant results, reinforcing the cortical specificity of the observed effects.
 
 However, as stated in the Discussion, we do not claim that precuneus activity alone accounts for the observed effects. As shown in Experiment 4, stimulation led to connectivity changes between the precuneus and hippocampus, a network widely recognized as a key contributor to long-term memory formation (Bliss & Collingridge, Nature 1993). These connectivity changes suggest that precuneus stimulation triggered a ripple effect extending beyond the stimulation site, engaging the broader precuneus-hippocampus network.
 
 Regarding Figure 3A, it represents the overall expression of oscillatory activity detected by TMS-EEG. Since each frequency band has a different optimal scaling, the figure reflects a graphical compromise. A more detailed representation of the significant results is provided in Figure 3B. The effect sizes for gamma oscillatory activity in the delta T1 and T2 conditions were 0.52 and 0.50, respectively, which correspond to a medium effect based on Cohen’s d interpretation.
 
 We add a paragraph in the discussion to improve the clarity of the manuscript regarding this important aspect (lines 193-198).
 
 “Given the existing evidence on TMS propagation and the computation of the Biophysical model with the Efield, we can reasonably assume that the individually identified PC was a mediator of the observed effects (Ridding and Rothwell, 2007). Moreover, we observed specific cortical changes in the posteromedial parietal areas, as evidenced by the whole-brain analysis conducted on TMS-EEG data and the absence of effect on the lateral posterior parietal cortex used as a control condition.”
 
 (4) In the absence of any neurophysiological documentation of a direct impact on the brain, an argument in this type of study is that the behavioral results show that there must be some kind of effect. I agree with this argument. This is also the argument for placebo effects, which can be extremely powerful and useful even if the mechanism is unrelated to what is studied. Then let us dig into the behavioral results.
 
 Hoping to have already addressed your concern regarding the neurophysiological impact of the stimulation on the brain, we would like to emphasize that the behavioral results were obtained controlling for placebo effects. This was achieved by having participants perform the task under different stimulation conditions, including a sham condition.
 
 4a. There does not seem to be any effect on the STMB task, therefore we can ignore this.
 
 4b. The FNAT task is minimally described in the supplementary material. There are no experimental details to understand what was done. What was the size of the images? How long were the images presented for? Were there any repetitions of the images? For how long did the participants study the images? Presumably, all the names and occupations are different? What were the genders of the faces? What is chance level performance? Presumably, the same participant saw different faces across the different stimulation conditions. If not, then there can be memory effects across different conditions that are even more complex to study. If yes, then it would be useful to show that the difficulty is the same across the different stimuli.
 
 We thank you for signaling the lack in the description of FNAT task. We added the information required in the supplementary information (lines 93-101).
 
 “Each picture's face size was 19x15cm. In the learning phase, faces were shown along with names and occupations for 8 seconds each (totaling approximately 2 minutes). During immediate recall, the faces were displayed alone for 8 seconds. In the delayed recall and recognition phase, pictures were presented until the subject provided answers. We used a different set of stimuli for each stimulation condition, resulting in a total of 3 parallel task forms balanced across conditions and session order. All parallel forms comprised 6 male and 6 female faces; for each sex, there were 2 young adults (around 30 years old), 2 middle-aged adults (around 50 years old), and 2 elderly adults (around 70 years old). Before the experiments, we conducted a pilot study to ensure no differences existed between the parallel forms of the task.”
 
 The chance level in the immediate and delayed recall is not quantifiable since the participants had to freely recall the name and the occupation without a multiple choice. In the recognition, the chance level was around 33% (since the possible answers were 3).
 
 4c. Although not stated clearly, if I understand FNAT correctly, the task is based on just 12 presentations. Each point in Figure 2A represents a different participant. Unfortunately, there is no way of linking the performance of individual participants across the conditions with the information provided. Lines joining performance for each participant would be useful in this regard. Because there are only 12 faces, the results are quantized in multiples of 100/12 % in Figure 3A. While I do not doubt that the authors did their homework in terms of the statistical analyses, it is difficult to get too excited about these 12 measurements. For example, take Figure 3A immediate condition TOTAL, arguably the largest effect in the whole paper. It seems that on average, the participants may remember one more face/name/occupation.
 
 Thank you for the suggestion. We added graphs showing lines linking the performance of individual participants across conditions to improve clarity, please see Fig.2 revised. We apologize for the lack of clarity in the description of the FNAT. As you correctly pointed out, we used the percentage based on the single association between face, name and occupation (12 in total). However, each association consisted of three items, resulting in a total of 36 items to learn and associate – we added a paragraph to make it more explicit in the manuscript (lines 425-430).
 
 “We considered a correct association when a subject was able to recall all the information for each item (i.e. face, name and occupation), resulting in a total of 36 items to learn and associate. To further investigate the effect on FNAT we also computed a partial recall score accounting for those items where subjects correctly matched only names with faces (FNAT NAME) and only occupations with faces (FNAT OCCUPATION). See supplementary information for score details.”
 
 In the example you mentioned, participants were, on average, able to correctly recall and associate three more items compared to the other conditions. While this difference may not seem striking at first glance, it is important to consider that we assessed memory performance after a single, three-minute stimulation session. Similar effects are typically observed only after multiple stimulation sessions (Koch et al., NeuroImage, 2018; Grover et al., Nature Neuroscience, 2022). Moreover, memory performance changes are often measured by a limited set of stimuli due to methodological constraints related to memory capacity. For example, Rey Auditory Verbal learning task, requiring to learn and recall 15 words, is a typical test used to detect memory changes (Koch et al., Neuroimage, 2018; Benussi et al., Brain stimulation 2021; Benussi et al., Annals of Neurology, 2022).
 
 4d. Block effects. If I understand correctly, the experiments were conducted in blocks. This is always problematic. Here is one example study that articulated the big problems in block designs (Li et al TPAMI 2021):https://ieeexplore.ieee.org/document/9264220
 
 Thank you for the interesting reference. According to this paper, in a block design, EEG or fMRI recordings are performed in response to different stimuli of a given class presented in succession. If this is the case, it does not correspond to our experimental design where both TMS-EEG and fMRI were conducted in resting state on different days according to the different stimulation conditions.
 
 4e. Even if we ignore the lack of experimental descriptions, problems with lack of evidence of brain activity, the minimalistic study of 12 faces, problems with the block design, etc. at the end of the day, the results are extremely weak. In FNAT, some results are statistically significant, some are not. The interpretation of all of this is extremely complex. Continuing with Figure 3A, it seems that the author claims that iTBS+gtACS > iTBS+sham-tACS, but iTBS+gtACS ~ sham+sham. I am struggling to interpret such a result. When separating results by name and occupation, the results are even more perplexing. There is only one condition that is statistically significant in Figure 3A NAME and none in the occupation condition.
 
 Thank you again for your feedback. Hoping to have thoroughly addressed your initial concerns in our previous responses, we now move on to your observations regarding the behavioral results, assuming you were referring to Figure 2A. The main finding of this study is the improvement in long-term memory performance, specifically the ability to correctly recall the association between face, name, and occupation (total FNAT), which was significantly enhanced in both Experiments 1 and 2. However, we also aimed to explore the individual contributions of name and occupation separately to gain a deeper understanding of the results. Our analysis revealed that the improvement in total FNAT was primarily driven by an increase in name recall rather than occupation recall. We understand that this may have caused some confusion. We consequently modified the manuscript in the (lines 97-99; 107-111; 425-430) to make it clearer and moved the graph relative to FNAT NAME and OCCUPATION from fig.2 in the main text to fig. S4 in supplementary information.
 
 “Dual iTBS+γtACS increased the performances in recalling the association between face, name and occupation (FNAT accuracy) both for the immediate (F2,38=7.18; p =0.002; η2p=0.274) and the delayed (F2,38=5.86; p =0.006; η2p=0.236) recall performances (Fig. 2, panel A).”
 
 “The in-depth analysis of the FNAT accuracy investigating the specific contribution of face-name and face-occupation recall reveald that dual iTBS+γtACS increased the performances in the association between face and name (FNAT NAME) delayed recall (F2,38 =3.46; p =0.042; η2p =0.154; iTBS+γtACS vs. sham-iTBS+sham-tACS: 42.9±21.5 % vs. 33.8±19 %; p=0.048 Bonferroni corrected) (Fig. S4, supplementary information).”
 
 “We considered a correct association when a subject was able to recall all the information for each item (i.e. face, name and occupation), resulting in a total of 36 items to learn and associate. To further investigate the effect on FNAT we also computed a partial recall score accounting for those items where subjects correctly matched only names with faces (FNAT NAME) and only occupations with faces (FNAT OCCUPATION). See supplementary information for score details.”
 
 Regarding the stimulation conditions, your concerns about the performance pattern (iTBS+gtACS > iTBS+sham-tACS, but iTBS+gtACS ~ sham+sham) are understandable. However, this new protocol was developed precisely in response to the variability observed in behavioral outcomes following non-invasive brain stimulation, particularly when used to modulate memory functions (Corp et al., 2020; Pabst et al., 2022). As discussed in the manuscript, it is intended as a boost to conventional non-invasive brain stimulation protocols, leveraging the mechanisms outlined in the Discussion section.
 
 (5) In sum, it would be amazing to be able to use non-invasive stimulation for any kind of therapeutic purpose as the authors imagine. More work needs to be done to convince ourselves that this kind of approach is viable. The evidence provided in this study is weak.
 
 We hope our response will be carefully considered, fostering a constructive exchange and leading to a reassessment of your evaluation.
 
 Reviewer #2 (Public review):
 
 Summary:
 
 The manuscript "Dual transcranial electromagnetic stimulation of the precuneus-hippocampus network boosts human long-term memory" by Borghi and colleagues provides evidence that the combination of intermittent theta burst TMS stimulation and gamma transcranial alternating current stimulation (γtACS) targeting the precuneus increases long-term associative memory in healthy subjects compared to iTBS alone and sham conditions. Using a rich dataset of TMS-EEG and resting-state functional connectivity (rs-FC) maps and structural MRI data, the authors also provide evidence that dual stimulation increased gamma oscillations and functional connectivity between the precuneus and hippocampus. Enhanced memory performance was linked to increased gamma oscillatory activity and connectivity through white matter tracts.
 
 Strengths:
 
 The combination of personalized repetitive TMS (iTBS) and gamma tACS is a novel approach to targeting the precuneus, and thereby, connected memory-related regions to enhance long-term associative memory. The authors leverage an existing neural mechanism engaged in memory binding, theta-gamma coupling, by applying TMS at theta burst patterns and tACS at gamma frequencies to enhance gamma oscillations. The authors conducted a thorough study that suggests that simultaneous iTBS and gamma tACS could be a powerful approach for enhancing long-term associative memory. The paper was well-written, clear, and concise.
 
 Weaknesses:
 
 (1) The study did not include a condition where γtACS was applied alone. This was likely because a previous work indicated that a single 3-minute γtACS did not produce significant effects, but this limits the ability to isolate the specific contribution of γtACS in the context of this target and memory function
 
 Thank you for your comments. As you pointed out, we did not include a condition where γtACS was applied alone. This decision was based on the findings of Guerra et al. (Brain Stimulation 2018), who investigated the same protocol and reported no aftereffects. Given the substantial burden of the experimental design on patients and our primary goal of demonstrating an enhancement of effects compared to the standalone iTBS protocol, we decided to leave out this condition. However, you raise an important aspect that should be further discussed, we modified the limitation section accordingly (lines 290-297).
 
 “We did not assess the effects of γtACS alone. This decision was based on the findings of Guerra et al. (Guerra et al., 2018), who investigated the same protocol and reported no aftereffects. Given the substantial burden of the experimental design on patients and our primary goal of demonstrating an enhancement of effects compared to the standalone iTBS protocol, we decided to leave out this condition. While examining the effects of γtACS alone could help isolate its specific contribution to this target and memory function, extensive research has shown that achieving a cognitive enhancement aftereffect with tACS alone typically requires around 20–25 minutes of stimulation (Grover et al., 2023).”
 
 (2) The authors applied stimulation for 3 minutes, which seems to be based on prior tACS protocols. It would be helpful to present some rationale for both the duration and timing relative to the learning phase of the memory task. Would you expect additional stimulation prior to recall to benefit long-term associative memory?
 
 Thank you for your comment and for raising this interesting point. As you correctly noted, the protocol we used has a duration of three minutes, a choice based on previous studies demonstrating its greater efficacy with respect to single stimulation from a neurophysiological point of view. Specifically, these studies have shown that the combined stimulation enhanced gamma-band oscillations and increased cortical plasticity (Guerra et al., Brain Stimulation 2018; Maiella et al., Scientific Reports 2022). Given that the precuneus (Brodt et al., Science 2018; Schott et al., Human Brain Mapping 2018), gamma oscillations (Osipova et al., Journal of Neuroscience 2006; Deprés et al., Neurobiology of Aging 2017; Griffiths et al., Trends in Neurosciences 2023), and cortical plasticity (Brodt et al., Science 2018) are all associated with memory formation and encoding processes, we decided to apply the co-stimulation immediately before it to enhance the efficacy. We added this paragraph to the manuscript rationale (lines 48-60).
 
 “Repetitive transcranial magnetic stimulation (rTMS) and transcranial alternating current stimulation (tACS) are two forms of NIBS widely used to enhance memory performances (Grover et al., 2022; Koch et al., 2018; Wang et al., 2014). rTMS, based on the principle of Faraday, induces depolarization of cortical neuronal assemblies and leads to after-effects that have been linked to changes in synaptic plasticity involving mechanisms of long-term potentiation (LTP) (Huang et al., 2017; Jannati et al., 2023). On the other hand, tACS causes rhythmic fluctuations in neuronal membrane potentials, which can bias spike timing, leading to an entrainment of the neural activity (Wischnewski et al., 2023). In particular, the induction of gamma oscillatory a has been proposed to play an important role in a type of LTP known as spike timing-dependent plasticity, which depends on a precise temporal delay between the firing of a presynaptic and a postsynaptic neuron (Griffiths and Jensen, 2023). Both LTP and gamma oscillations have a strong link with memory processes such as encoding (Bliss and Collingridge, 1993; Griffiths and Jensen, 2023; Rossi et al., 2001), pointing to rTMS and tACS as good candidates for memory enhancement.”
 
 Regarding the question of whether stimulation could also benefit recall, the answer is yes. We can speculate that repeating the stimulation before recall might provide an additional boost. This is supported by evidence showing that both the precuneus and gamma oscillations are involved in recall processes (Flanagin et al., Cerebral Cortex 2023; Griffiths et al., Trends in Neurosciences 2023). Furthermore, previous research suggests that reinstating the same brain state as during encoding can enhance recall performance (Javadi et al., The Journal of Neuroscience 2017). We added this consideration to the discussion (lines 305-311).
 
 “Future studies should further investigate the effects of stimulation on distinct memory processes. In particular, stimulation could be applied before retrieval (Rossi et al., 2001), to better elucidate its specific contribution to the observed enhancements in memory performance. Additionally, it would be worth examining whether repeated stimulation - administered both before encoding and before retrieval - could produce a boosting effect. This is especially relevant in light of findings showing that matching the brain state between retrieval and encoding can significantly enhance memory performance (Javadi et al., 2017).”
 
 (3) How was the burst frequency of theta iTBS and gamma frequency of tACS chosen? Were these also personalized to subjects' endogenous theta and gamma oscillations? If not, were increases in gamma oscillations specific to patients' endogenous gamma oscillation frequencies or the tACS frequency?
 
 The stimulation protocol was chosen based on previous studies (Guerra et al., Brain Stimulation 2018; Maiella et al., Scientific Reports 2022). Gamma tACS sinusoid frequency wave was set at 70 Hz while iTBS consisted of ten bursts of three pulses at 50 Hz lasting 2 s, repeated every 10 s with an 8 s pause between consecutive trains, for a total of 600 pulses total lasting 190 s (see iTBS+γtACS neuromodulation protocol section). In particular, the theta iTBS has been inspired by protocols used in animal models to elicit LTP in the hippocampus (Huang et al., Neuron 2005). Consequently, neither Theta iTBS nor the gamma frequency of tACS were personalized. The increase in gamma oscillations was referred to the patient’s baseline and did not correspond to the administrated tACS frequency.
 
 (4) The authors do a thorough job of analyzing the increase in gamma oscillations in the precuneus through TMS-EEG; however, the authors may also analyze whether theta oscillations were also enhanced through this protocol due to the iTBS potentially targeting theta oscillations. This may also be more robust than gamma oscillations increases since gamma oscillations detected on the scalp are very low amplitude and susceptible to noise and may reflect activity from multiple overlapping sources, making precise localization difficult without advanced techniques.
 
 Thank you for the suggestion. We analyzed theta oscillations, finding no changes.
 
 (5) Figure 4: Why are connectivity values pre-stimulation for the iTBS and sham tACS stimulation condition so much higher than the dual stimulation? We would expect baseline values to be more similar.
 
 We acknowledge that the pre-stimulation connectivity values for the iTBS and sham tACS conditions appear higher than those for the dual stimulation condition. However, as noted in our statistical analyses, there were no significant differences at baseline between conditions (p-FDR= 0.3514), suggesting that any apparent discrepancy is due to natural variability rather than systematic bias. One potential explanation for these differences is individual variability in baseline connectivity measures, which can fluctuate due to factors such as intrinsic neural dynamics, participant state, or measurement noise. Despite these variations, our statistical approach ensures that any observed post-stimulation effects are not confounded by pre-existing differences.
 
 (6) Figure 2: How are total association scores significantly different between stimulation conditions, but individual name and occupation associations are not? Further clarification of how the total FNAT score is calculated would be helpful.
 
 We apologize for any lack of clarity. The total FNAT score reflects the ability to correctly recall all the information associated with a person—specifically, the correct pairing of the face, name, and occupation. Participants received one point for each triplet they accurately recalled. The scores were then converted into percentages, as detailed in the Face-Name Associative Task Construction and Scoring section in the supplementary materials.
 
 Total FNAT was the primary outcome measure. However, we also analyzed name and occupation recall separately to better understand their partial contributions. Our analysis revealed that the improvement in total FNAT was primarily driven by an increase in name recall rather than occupation recall.
 
 We acknowledge that this distinction may have caused some confusion. To improve clarity, we revised the manuscript accordingly (lines 97-98; 107-111; 425-430).
 
 “Dual iTBS+γtACS increased the performances in recalling the association between face, name and occupation (FNAT accuracy) both for the immediate (F2,38=7.18 ;p=0.002; η2p=0.274) and the delayed (F2,38=5.86;p=0.006; η2p=0.236) recall performances (Fig. 2, panel A).”
 
 “The in-depth analysis of the FNAT accuracy investigating the specific contribution of face-name and face-occupation recall revealed that dual iTBS+γtACS increased the performances in the association between face and name (FNAT NAME) delayed recall (F2,38 =3.46; p =0.042; η2p =0.154; iTBS+γtACS vs. sham-iTBS+sham-tACS: 42.9±21.5 % vs. 33.8±19 %; p=0.048 Bonferroni corrected) (Fig. S4, supplementary information).”
 
 “We considered a correct association when a subject was able to recall all the information for each item (i.e. face, name and occupation), resulting in a total of 36 items to learn and associate. To further investigate the effect on FNAT we also computed a partial recall score accounting for those items where subjects correctly matched only names with faces (FNAT NAME) and only occupations with faces (FNAT OCCUPATION). See supplementary information for score details.”
 
 We also moved the data regarding the specific contribution of name and occupation recall in the supplementary information (fig.S4) and further specified how we computed the score in the score (lines 102-104).
 
 “The score was computed by deriving an accuracy percentage index dividing by 12 and multiplying by 100 the correct association sum. The partial recall scores were computed in the same way only considering the sum of face-name (NAME) and face-occupation (OCCUPATION) correctly recollected.”
 
 Reviewer #3 (Public review):
 
 Summary:
 
 Borghi and colleagues present results from 4 experiments aimed at investigating the effects of dual γtACS and iTBS stimulation of the precuneus on behavioral and neural markers of memory formation. In their first experiment (n = 20), they found that a 3-minute offline (i.e., prior to task completion) stimulation that combines both techniques leads to superior memory recall performance in an associative memory task immediately after learning associations between pictures of faces, names, and occupation, as well as after a 15-minute delay, compared to iTBS alone (+ tACS sham) or no stimulation (sham for both iTBS and tACS). Performance in a second task probing short-term memory was unaffected by the stimulation condition. In a second experiment (n = 10), they show that these effects persist over 24 hours and up to a full week after initial stimulation. A third (n = 14) and fourth (n = 16) experiment were conducted to investigate the neural effects of the stimulation protocol. The authors report that, once again, only combined iTBS and γtACS increase gamma oscillatory activity and neural excitability (as measured by concurrent TMS-EEG) specific to the stimulated area at the precuneus compared to a control region, as well as precuneus-hippocampus functional connectivity (measured by resting-state MRI), which seemed to be associated with structural white matter integrity of the bilateral middle longitudinal fasciculus (measured by DTI).
 
 Strengths:
 
 Combining non-invasive brain stimulation techniques is a novel, potentially very powerful method to maximize the effects of these kinds of interventions that are usually well-tolerated and thus accepted by patients and healthy participants. It is also very impressive that the stimulation-induced improvements in memory performance resulted from a short (3 min) intervention protocol. If the effects reported here turn out to be as clinically meaningful and generalizable across populations as implied, this approach could represent a promising avenue for the treatment of impaired memory functions in many conditions.
 
 Methodologically, this study is expertly done! I don't see any serious issues with the technical setup in any of the experiments (with the only caveat that I am not an expert in fMRI functional connectivity measures and DTI). It is also very commendable that the authors conceptually replicated the behavioral effects of experiment 1 in experiment 2 and then conducted two additional experiments to probe the neural mechanisms associated with these effects. This certainly increases the value of the study and the confidence in the results considerably.
 
 The authors used a within-subject approach in their experiments, which increases statistical power and allows for stronger inferences about the tested effects. They are also used to individualize stimulation locations and intensities, which should further optimize the signal-to-noise ratio.
 
 Weaknesses:
 
 I want to state clearly that I think the strengths of this study far outweigh the concerns I have. I still list some points that I think should be clarified by the authors or taken into account by readers when interpreting the presented findings.
 
 I think one of the major weaknesses of this study is the overall low sample size in all of the experiments (between n = 10 and n = 20). This is, as I mentioned when discussing the strengths of the study, partly mitigated by the within-subject design and individualized stimulation parameters. The authors mention that they performed a power analysis but this analysis seemed to be based on electrophysiological readouts similar to those obtained in experiment 3. It is thus unclear whether the other experiments were sufficiently powered to reliably detect the behavioral effects of interest. That being said, the authors do report significant effects, so they were per definition powered to find those. However, the effect sizes reported for their main findings are all relatively large and it is known that significant findings from small samples may represent inflated effect sizes, which may hamper the generalizability of the current results. Ideally, the authors would replicate their main findings in a larger sample. Alternatively, I think running a sensitivity analysis to estimate the smallest effect the authors could have detected with a power of 80% could be very informative for readers to contextualize the findings. At the very least, however, I think it would be necessary to address this point as a potential limitation in the discussion of the paper.
 
 Thank you for the observation. As you mentioned, our power analysis was based on our previous study investigating the same neuromodulation protocol with a corresponding experimental design. The relatively small sample could be considered a possible limitation of the study which we will add to the discussion. A fundamental future step will be to replay these results on a larger population, however, to strengthen our results we performed the sensitivity analysis you suggested.
 
 In detail, we performed a sensitivity analysis for repeated-measures ANOVA with α=0.05 and power(1-β)=0.80 with no sphericity correction. For experiment 1, a sensitivity analysis with 1 group and 3 measurements showed a minimal detectable effect size of f=0.524 with 20 participants. In our paper, the ANOVA on total FNAT immediate performance revealed an effect size of η2=0.274 corresponding to f=0.614; the ANOVA on FNAT delayed performance revealed an effect size of η2=0.236 corresponding to f=0.556. For experiment 2, a sensitivity analysis for total FNAT immediate performance (1 group and 3 measurements) showed a minimal detectable effect size of f=0.797 with 10 participants. In our paper, the ANOVA on total FNAT immediate performance revealed an effect size of η2=0.448 corresponding to f=0.901. The sensitivity analysis for total FNAT delayed performance (1 group and 6 measurements) showed a minimal detectable effect size of f=0.378 with 10 participants. In our paper, the ANOVA on total FNAT delayed performance revealed an effect size of η2=0.484 corresponding to f=0.968. Thus, the sensitivity analysis showed that both experiments were powered enough to detect the minimum effect size computed in the power analysis. We have now added this information to the manuscript and we thank the reviewer for her/his suggestion in the statistical analysis and results section (lines 99-100; 127-128; 130-131; 543-545).
 
 “The sensitivity analysis showed a minimal detectable effect size of η2=0.215 with 20 participants.”
 
 “The sensitivity analysis showed a minimal detectable effect size of η2=0.388 with 10 participants.”
 
 “The sensitivity analysis showed a minimal detectable effect size of η2=0.125 with 10 participants.”
 
 “Since we do not have an a priori effect size for experiment 1 and 2, we performed a sensitivity power analysis to ensure that these experiments were able to detect the minimum effect size with 80% power and alpha level of 0.05.”
 
 It seems that the statistical analysis approach differed slightly between studies. In experiment 1, the authors followed up significant effects of their ANOVAs by Bonferroni-adjusted post-hoc tests whereas it seems that in experiment 2, those post-hoc tests where "exploratory", which may suggest those were uncorrected. In experiment 3, the authors use one-tailed t-tests to follow up their ANOVAs. Given some of the reported p-values, these choices suggest that some of the comparisons might have failed to reach significance if properly corrected. This is not a critical issue per se, as the important test in all these cases is the initial ANOVA but non-significant (corrected) post-hoc tests might be another indicator of an underpowered experiment. My assumptions here might be wrong, but even then, I would ask the authors to be more transparent about the reasons for their choices or provide additional justification. Finally, the authors sometimes report exact p-values whereas other times they simply say p < .05. I would ask them to be consistent and recommend using exact p-values for every result where p >= .001.
 
 Thank you again for the suggestions. Your observations are correct, we used a slightly different statistical depending on our hypothesis. Here are the details:
 
 In experiment 1, we used a repeated-measure ANOVA with one factor “stimulation condition” (iTBS+γtACS; iTBS+sham-tACS; sham-iTBS+sham-tACS). Following the significant effect of this factor we performed post-hoc analysis with Bonferroni correction.
 
 In experiment 2, we used a repeated-measures with two factors “stimulation condition” and “time”. As expected, we observed a significant effect of condition, confirming the result of experiment 1, but not of time. Thus, this means that the neuromodulatory effect was present regardless of the time point. However, to explore whether the effects of stimulation condition were present in each time point we performed some explorative t-tests with no correction for multiple comparisons since this was just an explorative analysis.
 
 In experiment 3, we used the same approach as experiment 1. However, since we had a specific hypothesis on the direction of the effect already observed in our previous study, i.e. increase in spectral power (Maiella et al., Scientific Report 2022), our tests were 1-tailed.
 
 For the p-values, we corrected the manuscript reporting the exact values for every result.
 
 While the authors went to great lengths trying to probe the neural changes likely associated with the memory improvement after stimulation, it is impossible from their data to causally relate the findings from experiments 3 and 4 to the behavioral effects in experiments 1 and 2. This is acknowledged by the authors and there are good methodological reasons for why TMS-EEG and fMRI had to be collected in sperate experiments, but it is still worth pointing out to readers that this limits inferences about how exactly dual iTBS and γtACS of the precuneus modulate learning and memory.
 
 Thank you for your comment. We fully agree with your observation, which is why this aspect has been considered in the study's limitations. To address your concern, we add this sentence to the limitation discussion (lines 299-301).
 
 “Consequently, these findings do not allow precise inferences regarding the specific mechanisms by which dual iTBS and γtACS of the precuneus modulate learning and memory.”
 
 There were no stimulation-related performance differences in the short-term memory task used in experiments 1 and 2. The authors argue that this demonstrates that the intervention specifically targeted long-term associative memory formation. While this is certainly possible, the STM task was a spatial memory task, whereas the LTM task relied (primarily) on verbal material. It is thus also possible that the stimulation effects were specific to a stimulus domain instead of memory type. In other words, could it be possible that the stimulation might have affected STM performance if the task taxed verbal STM instead? This is of course impossible to know without an additional experiment, but the authors could mention this possibility when discussing their findings regarding the lack of change in the STM task.
 
 Thank you for your interesting observation. We argue that the intervention primarily targeted long-term associative memory formation, as our findings demonstrated effects only on FNAT. However, as you correctly pointed out, we cannot exclude the possibility that the stimulation may also influence short-term verbal associative memory. We add this aspect when discussing the absence of significant findings in the STM task (lines 205-210).
 
 “Visual short-term associative memory, measured by STBM performance, was not modulated by any experimental condition. Even if we cannot exclude the possibility that the stimulation could have influenced short-term verbal associative memory, we expected this result since short-term associative memory is known to rely on a distinct frontoparietal network while FNAT, used to investigate long-term associative memory, has already been associated with the neural activity of the PC and the hippocampus (Parra et al., 2014; Rentz et al., 2011).”
 
 While the authors discuss the potential neural mechanisms by which the combined stimulation conditions might have helped memory formation, the psychological processes are somewhat neglected. For example, do the authors think the stimulation primarily improves the encoding of new information or does it also improve consolidation processes? Interestingly, the beneficial effect of dual iTBS and γtACS on recall performance was very stable across all time points tested in experiments 1 and 2, as was the performance in the other conditions. Do the authors have any explanation as to why there seems to be no further forgetting of information over time in either condition when even at immediate recall, accuracy is below 50%? Further, participants started learning the associations of the FNAT immediately after the stimulation protocol was administered. What would happen if learning started with a delay? In other words, do the authors think there is an ideal time window post-stimulation in which memory formation is enhanced? If so, this might limit the usability of this procedure in real-life applications.
 
 Thank you for your comment and for raising these important points.
 
 We hypothesized that co-stimulation would enhance encoding processes. Previous studies have shown that co-stimulation can enhance gamma-band oscillations and increase cortical plasticity (Guerra et al., Brain Stimulation 2018; Maiella et al., Scientific Reports 2022). Given that the precuneus (Brodt et al., Science 2018; Schott et al., Human Brain Mapping 2018), gamma oscillations (Osipova et al., Journal of Neuroscience 2006; Deprés et al., Neurobiology of Aging 2017; Griffiths et al., Trends in Neurosciences 2023), and cortical plasticity (Brodt et al., Science 2018) have all been associated with encoding processes, we decided to apply co-stimulation before the encoding phase, to boost it. We enlarged the introduction to specify the link between neural mechanisms and the psychological process of the encoding (lines 55-60).
 
 “In particular, the induction of gamma oscillatory activity has been proposed to play an important role in a type of LTP known as spike timing-dependent plasticity, which depends on a precise temporal delay between the firing of a presynaptic and a postsynaptic neuron (Griffiths and Jensen, 2023). Both LTP and gamma oscillations have a strong link with memory processes such as encoding (Bliss and Collingridge, 1993; Griffiths and Jensen, 2023; Rossi et al., 2001), pointing to rTMS and tACS as good candidates for memory enhancement.”
 
 We applied the co-stimulation immediately before the learning phase to maximize its potential effects. While we observed a significant increase in gamma oscillatory activity lasting up to 20 minutes, we cannot determine whether the behavioral effects we observed would have been the same with a co-stimulation applied 20 minutes before learning. Based on existing literature, a reduction in the efficacy of co-stimulation over time could be expected (Huang et al., Neuron 2005; Thut et al., Brain Topography 2009). However, we hypothesize that multiple stimulation sessions might provide an additional boost, helping to sustain the effects over time (Thut et al., Brain Topography 2009; Koch et al., Neuroimage 2018; Koch et al., Brain 2022).
 
 Regarding the absence of further forgetting in both stimulation conditions, we think that the clinical and demographical characteristics of the sample (i.e. young and healthy subjects) explain the almost absence of forgetting after one week.
 
 Reviewer #1 (Recommendations for the authors):
 
 To address the concerns, the authors should:
 
 (1) Include invasive neuronal recordings (e.g., in rats or monkeys if not possible in humans) demonstrating that the current stimulation protocol leads to direct changes in brain activity.
 
 We understand the interest of the first reviewer in the understanding of neurophysiological correlates of the stimulation protocol, however, we are skeptical about this request as we think it goes beyond the aims of the study. As already mentioned in the response to the reviewer, invasive neurophysiological recordings in humans for this type of study are not feasible due to ethical constraints. At the same time, studies on cadavers or rodents would not fully resolve the question. Indeed, the authors of the study cited by the reviewer (Mihály Vöröslakos et al., Nature Communications, 2018) highlight the impossibility of drawing definitive conclusions about the exact voltage required in the in-vivo human brain due to significant differences between rats and humans, as well as the in-vivo human cadavers due to alterations in electrical conductivity that occur in postmortem tissue. Huang and colleagues addressed the difficulties in reaching direct evidence of non-invasive brain stimulation (NIBS) effects in a review published in Clinical Neurophysiology in 2017. They conclude that the use of EEG to assess brain response to TMS has a great potential for a less indirect demonstration of plasticity mechanisms induced by NIBS in humans.
 
 It is exactly to meet the need to investigate the changes in brain activity after the stimulation protocol that we conducted Experiments 3 and 4. These experiments respectively examined the neurophysiological and connectivity changes induced by the stimulation in a non-invasive manner using TMS-EEG and fMRI. The observed changes in brain oscillatory activity (increased gamma oscillatory activity), cortical excitability (enhanced posteromedial parietal cortex reactivity), and brain connectivity (strengthened connections between the precuneus and hippocampi) provided evidence of the effects of our non-invasive brain stimulation protocol, further supporting the behavioral data.
 
 Additionally, we carefully considered the issue of stimulation distribution and, in response, performed a biophysical modeling analysis and E-field calculation using the parameters employed in our study (see Supplementary Materials).
 
 Acknowledging the reviewer's point of view, we modified the manuscript accordingly, discussing this aspect both as a technical limitation and as a potential direction for future research (main text, lines 280-289).
 
 “Although we studied TMS and tACS propagation through the E-field modeling and observed an increase in the precuneus gamma oscillatory activity, excitability and connectivity with the hippocampi, we cannot exclude that our results might reflect the consequences of stimulating more superficial parietal regions other than the precuneus nor report direct evidence of microscopic changes in the brain after the stimulation. Invasive neurophysiological recordings in humans for this type of study are not feasible due to ethical constraints. Studies on cadavers or rodents would not fully resolve our question due to significant differences between them (i.e. rodents do not have an anatomical correspondence while cadavers have an alterations in electrical conductivity occurring in postmortem tissue). However, further exploration of this aspect in future studies would help in the understanding of γtACS+iTBS effects.”
 
 (2) Address all the technical questions about the experimental design.
 
 We addressed all the technical questions about the experimental design.
 
 (3) Repeat the experiments with randomized trial order and without a block design.
 
 The experiments were conducted with randomized trial order and we did not use a block design.
 
 (4) Add many more faces to the study. It is extremely difficult to draw any conclusion from merely 12 faces. Ideally, there would be lots of other relevant memory experiments where the authors show compelling positive results.
 
 We understand your perplexity about drawing conclusions from 12 faces, however, this is not the case. As we explained in the response reviewer, the task we implemented did not rely on the recall of merely 12 faces. Instead, participants had to correctly learn, associate and recall 12 faces, 12 names and 12 occupations for a total of 36 items. To improve the clarity of the manuscript, we added a paragraph to make this aspect more explicit (lines 425-430).
 
 “We considered a correct association when a subject was able to recall all the information for each item (i.e. face, name and occupation), resulting in a total of 36 items to learn and associate. To further investigate the effect on FNAT we also computed a partial recall score accounting for those items where subjects correctly matched only names with faces (FNAT NAME) and only occupations with faces (FNAT OCCUPATION). See supplementary information for score details.”
 
 The behavioral changes we observed are similar to those who are typically observed after multiple stimulation sessions (Koch et al., NeuroImage, 2018; Grover et al., Nature Neuroscience, 2022, Benussi et al., Annals of Neurology, 2022). Moreover, memory performance changes are often measured by a limited set of stimuli due to methodological constraints related to memory capacity. For example, Rey Auditory Verbal learning task, requiring to learn and recall 15 words, is a typical test used to detect memory changes (Koch et al., Neuroimage, 2018; Benussi et al., Brain stimulation 2021; Benussi et al., Annals of Neurology, 2022).
 
 (5) Provide a clear explanation of the apparent randomness of which results are statistically significant or not in Figure 3. But perhaps with many more experiments, a lot more memory evaluations, many more stimuli, and addressing all the other technical concerns, either the results will disappear or there will be a more interpretable pattern of results.
 
 We provided explanations for all the concerns shown by the reviewer.
 
 Reviewer #2 (Recommendations for the authors):
 
 Minor comments:
 
 (1) Figure 4: Why are connectivity values pre-stimulation for the iTBS and sham tACS stimulation condition so much higher than the dual stimulation? We would expect baseline values to be more similar.
 
 We acknowledge that the pre-stimulation connectivity values for the iTBS and sham tACS conditions appear higher than those for the dual stimulation condition. However, as noted in our statistical analyses, there were no significant differences at baseline between conditions (p-FDR= 0.3514), suggesting that any apparent discrepancy is due to natural variability rather than systematic bias. One potential explanation for these differences is individual variability in baseline connectivity measures, which can fluctuate due to factors such as intrinsic neural dynamics, participant state, or measurement noise. Despite these variations, our statistical approach ensures that any observed post-stimulation effects are not confounded by pre-existing differences.
 
 (2) Figure 2: How are total association scores significantly different between stimulation conditions, but individual name and occupation associations are not? Further clarification of how the total FNAT score is calculated would be helpful.
 
 We apologize for any lack of clarity. The total FNAT score reflects the ability to correctly recall all the information associated with a person—specifically, the correct pairing of the face, name, and occupation. Participants received one point for each triplet they accurately recalled. The scores were then converted into percentages, as detailed in the Face-Name Associative Task Construction and Scoring section in the supplementary materials.
 
 Total FNAT was the primary outcome measure. However, we also analyzed name and occupation recall separately to better understand their partial contributions. Our analysis revealed that the improvement in total FNAT was primarily driven by an increase in name recall rather than occupation recall.
 
 We acknowledge that this distinction may have caused some confusion. To improve clarity, we revised the manuscript accordingly (lines 97-98; 107-111; 425-430).
 
 “Dual iTBS+γtACS increased the performances in recalling the association between face, name and occupation (FNAT accuracy) both for the immediate (F2,38=7.18; p=0.002; η2p=0.274) and the delayed (F2,38=5.86; p =0.006; η2p=0.236) recall performances (Fig. 2, panel A).”
 
 “The in-depth analysis of the FNAT accuracy investigating the specific contribution of face-name and face-occupation recall revealed that dual iTBS+γtACS increased the performances in the association between face and name (FNAT NAME) delayed recall (F2,38 =3.46; p =0.042; η2p =0.154; iTBS+γtACS vs. sham-iTBS+sham-tACS: 42.9±21.5 % vs. 33.8±19 %; p=0.048 Bonferroni corrected) (Fig. S4, supplementary information).”
 
 “We considered a correct association when a subject was able to recall all the information for each item (i.e. face, name and occupation), resulting in a total of 36 items to learn and associate. To further investigate the effect on FNAT we also computed a partial recall score accounting for those items where subjects correctly matched only names with faces (FNAT NAME) and only occupations with faces (FNAT OCCUPATION). See supplementary information for score details.”
 
 We also moved the data regarding the specific contribution of name and occupation recall in the supplementary information (fig.S4) and further specified how we computed the score in the score (lines 102-104).
 
 “The score was computed by deriving an accuracy percentage index dividing by 12 and multiplying by 100 the correct association sum. The partial recall scores were computed in the same way only considering the sum of face-name (NAME) and face-occupation (OCCUPATION) correctly recollected.”
 
 Reviewer #3 (Recommendations for the authors):
 
 A very small detail, in the caption for Figure 2A, OCCUPATION is described as being shown on the 'left' but it should be 'right'.
 
 We corrected this error.
 
 AuthorResponse
Visit annotations in context

Tags

Review 2

Review 3

Summary

AuthorResponse

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.10.25.620008v2
www.biorxiv.org www.biorxiv.org

Premature vision drives aberrant development of response properties in primary visual cortex

1
1. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public Review):
 
 (1) Figure 1: It might be simpler to streamline acronyms for different test cases, e.g, E01contra, E01 ipsi (rather than EO1IPS), E02, and control. Thus, it would be possible to label each of the three schematic panels as E01, E02, control.
 
 Please describe what the dots in the brain mean and move the V1 label so it does not occlude dots.
 
 Please make clear that the "track reconstructions" are the bright spheres in the micrographs (there are track-like elements in some micrographs which may be tears or?)
 
 Thank you. We relabeled the groups as control, EO1contra, EO1ipsi, and EO2. These were changed in all figures and in the document at several places.
 
 We indicated in the new caption that “Dots schematize ocular dominance columns”.
 
 We indicated that electrode track penetrations were the “(bright spots at right/posterior)”.
 
 (2) Figure 2: Should "horizontal" be vertical (line 556) of the caption? When describing the scale bar for firing rate, please explain the meaning of italicized vs regular font.
 
 Please make the purple lines in Figures I and J easier to see (invisible in my PDF).
 
 Not quite clear what is significantly different from what when viewing the figure at a glance. Would it be possible to clarify using standard methods?
 
 Yes, it should say vertical, thank you. We explained the italics (they denote the standard scale bar size if no number is provided.)
 
 We changed the purple lines to yellow in all figures.
 
 We added comparison bars that help indicate significance.
 
 (3) Figures 3-5. Please make corrections like those noted above.
 
 Yes, we applied the previous changes to Figures 3 - 5.
 
 (4) Minor. Sometimes the authors spell out temporal frequency and sometimes abbreviate it. Perhaps adopt a consistent style.
 
 Fixed, thanks.
 
 Reviewer #2 (Public Review):
 
 (1) The assessment of the tuning properties is based on fits to the data. Presumably, neurons for which the fits were poor were excluded? It would be useful to know what the criteria were, how many neurons were excluded, and whether there was a significant difference between the groups in the numbers of neurons excluded (which could further point to differences between the groups).
 
 Yes, this is an important omission, thank you for catching it. We now write in methods (line 213): “ Inclusion/exclusion: For each stimulus type, we examined the set of all responses to visual stimuli and blanks with an ANOVA test to evaluate the null hypothesis that the mean response to all of these stimuli were the same; cells with a p<0.05 to this visual responsiveness test were included in fits and analyses, and cells with p>0.05 were excluded. ”
 
 (2) For the temporal frequency data, low- and high-frequency cut-offs are defined, but then only used for the computation of the bandwidth. Given that the responses to low temporal frequencies change profoundly with premature eye opening, it would be useful to directly compare the low- and high-frequency cut-offs between groups, in addition to the index that is currently used.
 
 We now provide this data in Figure 3 - figure supplement 1 .
 
 (3) In addition to the tuning functions and firing rates that have been analyzed so far, are there any differences in the temporal profiles of neural responses between the groups (sustained versus transient responses, rates of adaptation, latency)? If the temporal dynamics of the responses are altered significantly, that could be part of an explanation for the altered temporal tuning.
 
 This is a great topic for future studies. Unfortunately, with drifting gratings, it is difficult to establish these properties, which could be better assessed with standing or square-wave-modulated gratings or other stimuli. We did not run standing gratings in our battery of stimuli for this initial study.
 
 (4) It would be beneficial for the general interpretation of the results to extend the discussion. First, it would be useful to provide a more detailed discussion of what type of visual information might make it through the closed eyelids (the natural state), in contrast to the structured information available through open eyes. Second, it would be useful to highlight more clearly that these data were collected in peripheral V1 by discussing what might be expected in binocular, more central V1 regions. Third, it would be interesting to discuss the observed changes in firing rates in the context of the development of inhibitory neurons in V1 (which still undergo significant changes through the time period of premature visual experience chosen here).
 
 Thank you, good ideas. Let’s take these three suggestions in turn.
 
 First, in the discussion, we added a subsection “ Biology of early development in mustelids ” that focuses on the developmental conditions of wild and laboratory animals:
 
 In the wild, mustelids raise their young in nests in the ground, in cavities such as holes in trees or caves, or in areas of dense vegetation (Ruggiero et al. 1994). They may move the young from one nest to another as they grow, but otherwise the young are primarily in the relatively dark nest. It is highly likely that some light penetrates and that information about the 24-hour cycle is available, but the light is likely to be dim and unlikely to provide a basis for high luminance, high contrast stimulation through the closed lids. The animals begin to spend substantial time outside the nest after eye opening.
 
 The ferret is a domesticated strain of the European polecat. In laboratory settings, ferret jills give birth and keep their kits in a nest box. A laboratory typically maintains a 24-hour cycle with 12 or 14 hours of light, and the light reaching the closed lids must first pass through the cage, the nest box, and the nesting material. Therefore, developing ferrets have an obvious circadian light signal but the light available for image formation is likely dim and of low contrast.
 
 Although the light that reaches the close lids in developing ferrets is likely to be relatively dim, and any image-forming signal passing through the closed lids would be highly filtered in luminance, spatial frequency, and contrast, it is important to remember that visual input before natural eye opening (through the closed lids) can drive activity in retina, LGN, and cortex (Huttenlocher 1967, Chapman and Stryker 1993, Krug et al., 2001, Akerman et al., 2002,Akerman et al., 2004). Further, orientation selectivity can be observed through the closed lids (Krug et al., 2001), indicating that some coarse image-forming information does make it through the closed lids.
 
 Second, we added text speculating about binocular cortex (lines 492 - 500): … our recordings were performed in monocular cortex so that we could be sure of the developmental condition of the eye that drove the classic responses. It is interesting to speculate about what might occur more centrally in binocular visual cortex. Ocular dominance shifts are not induced when one eye is opened prematurely (Issa et al 1999), indicating that ocular dominance plasticity is not engaged at this early stage, but one might imagine that the impacts on temporal frequency and spontaneous firing rates would still be present.
 
 Third, on inhibition, we added a paragraph (lines 502 - 509):
 
 We introduced premature patterned vision at a time when cortical inhibition is undergoing substantial changes. GABAergic signaling has already undergone its switch (Ben-Ari, 2002) from providing primarily depolarizing input to hyperpolarizing input by P21-23 (Mulholland et al., 2021). In the days prior to eye opening, inhibitory cells exhibit activity that is closely associated with the emerging functional modules that will reflect orientation columns (Mulholland et al., 2021), but do not yet exhibit selectivity to orientation, in contrast to excitatory neurons, which do exhibit selectivity to orientation at that time (Chang and Fitzpatrick, 2022).
 
 (5) In the methods section, the statement 'actively kept in nesting box' is unclear. Presumably this means that the jill prevents the kits from leaving the nesting box? It also would be worth at least mentioning in this context that there obviously are still visual events in the nesting box too.
 
 Thanks. We improved this description (lines 118 - 121): Ferret kits in laboratory housing receive limited visual stimulation through their closed lids, as the mother actively keeps the kits in their relatively dark nest . In order to ensure that animals with early-opened eyes actually had patterned visual experience (and animals with closed lids had the same stimulation filtered through the lids) , animals were brought to the lab for 2 hours a day for 4 consecutive days beginning at P25.
 
 (6) The stimulus presentation could be more clearly described. Is every stimulus presented in an individual trial (surrounded by periods with a blank screen), or are all stimuli shown as a continuous sequence? The description of the parameter screening is also potentially confusing ('orientation was co-varied with stimuli consisting of drifting gratings at different spatial frequencies' sounds as if there are separate stimuli for orientation; might be better to say something like 'in the first set, orientation, spatial frequency, ... were covaried...')
 
 Yes, thank you, we fixed this (lines 184 - 201). We deleted the text indicated and added a sentence “Each individual grating stimulus was full screen and had a single set of parameters (direction, spatial frequency, temporal frequency), and was separated from the other stimuli by a gray screen interstimulus interval.”. We also deleted a repetition of 100% contrast in the description of the second set.
 
 (7) Description of low-pass index is unclear. What is the 'largest temporal frequency response observed'? The maximum response or the response to the largest temporal frequency tested?
 
 Thanks. We added a paragraph at line 236:
 
 We defined a low pass index as the response to the lowest temporal frequency tested (in this case 0.5 Hz) to the maximum response obtained to the set of temporal frequencies shown. LPI = R(TF=0.5 Hz)/max(R(TF=0.5Hz), R(TF=1Hz), … R(TF=32Hz)). If a cell exhibited the highest firing for a temporal frequency of 0.5 Hz, then it would have an low pass index of 1. If it exhibited a similar firing rate in response to a temporal frequency of 0.5 Hz even if the preferred temporal frequency were higher, then the low pass index would still be near 1. If the cell responded poorly at a temporal frequency of 0.5 Hz, then it would have a low pass index near 0.
 
 (8) The discussion should also cite the results of strobe-reared cats by Pasternak et al (1981 and 1985).
 
 Thank you for pointing out the omission. We now write (lines 430-435): Cats raised in a strobe-light environment (mostly after eye opening) exhibited strong changes in subsequent direction selectivity (Kennedy and Orban 1983; Humphrey and Saul 1998) and behavioral sensitivity to motion (Pasternak et al., 1981; Pasternak et al., 1985) that partially recovers with motion detection training . However, temporal frequency tuning of these animals has not been reported in detail. Pasternak et al (1981) reported that strobe-reared ferrets exhibited greater difficulty in distinguishing slow moving stimuli from static stimuli compared to controls, an ability that slightly improved with practice, suggesting possible temporal frequency deficits.
 
 (9) Finally, it would be useful to include a mention of the early development of MT in marmosets in the discussion of impacts of prematurity on motion vision (Bourne & Rosa 2006).
 
 Yes, thank you. We cited Bourne & Rosa and also Lempel and Nielsen (for ferret PSS). (Lines 492-501):
 
 Several other basic mechanistic questions remain unanswered. It is unclear where in the visual circuit cascade these deficits first arise. Does the lateral geniculate nucleus or retina exhibit altered temporal frequency tuning? Is the influence of the patterned visual stimulation instructive, so that if one provided premature stimulation with only certain temporal frequencies, one would see selectivity for those temporal frequencies, or would tuning always be broad? Other questions remain concerning the top-down influence on V1 from “higher” motion areas such as MT (monkeys) or PSS (ferret); MT exhibits mature neural markers earlier than V1 (Bourne and Rosa, 2006), and suppression of PSS impacts motion selectivity in V1 (Lempel and Nielsen, 2021). Future studies will be needed to address these questions.
 
 AuthorResponse
Visit annotations in context

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.13.643139v2
www.biorxiv.org www.biorxiv.org

Action mechanism of a novel agrichemical quinofumelin against Fusarium graminearum

4
1. Public_Reviews 23 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 In this valuable study, the authors show the physiological response and molecular pathway mediating the effect of quinofumelin, a developed fungicide with an unknown mechanism. The authors present convincing data suggesting the involvement of the uridine/uracil biosynthesis pathway, by combining in vivo microbiology characterization as well as in vitro biochemical binding results.
 
 Summary
2. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In the current study, the authors aim to identify the mode of action/molecular mechanism of characterized a fungicide, quinofumelin, and its biological impact on transcriptomics and metabolomics in Fusarium graminearum and other Fusarium species. Two sets of data were generated between quinofumelin and no treatment group, and differentially abundant transcripts and metabolites were identified, suggesting a potential role of pyrimidine biosynthesis. Upon studying the genetic mutants of the uridine/uracil biosynthesis pathway with quinofumelin treatment and metabolite supplementation, combining in vitro biochemical assay of quinofumelin and F.graminearum dihydroorotate dehydrogenase protein, the authors identified that quinofumelin inhibits the dihydroorotate dehydrogenase and blocks downstream metabolite biosynthesis, limiting fungal metabolism and growth.
 
 Strengths:
 
 Omics datasets were leveraged to understand the physiological impact of quinofumelin, showing the intracellular impact of the fungicide. The characterization of FgDHODHII deletion strains with supplemented metabolites clearly showed the impact of the enzyme on fungal growth. Corroborating in vitro and in vivo data revealed the direct interaction of quinofumelin with Fusarium protein target.
 
 Potential Impact:
 
 Understanding this new mechanism could facilitate rational design or screen for molecules targeting the same pathway, or improve binding affinity and inhibitor potency. Confirming the target of quinofumelin may also help understand its resistance mechanism, and further development of other inhibitory molecules against the target.
 
 Review 2
3. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Reviewer #3 (Public review):
 
 Summary:
 
 The manuscript shows the mechanism of action of quinofumelin, a novel fungicide, against the fungus Fusarium graminearum. Through omics analysis, phenotypic analysis and in silico approaches, the role of quinofumelin in targeting DHODH is uncovered.
 
 Strengths:
 
 The phenotypic analysis and mutant generation are nice data and add to the role of metabolites in bypassing pyrimidine biosynthesis.
 
 Weaknesses:
 
 The role of DHODH in this class of fungicides has been known and this data does not add any further significance to the field.
 
 Review 3
4. Public_Reviews 23 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public review):
 
 Summary:
 
 Phytophathogens including fungal pathogens such as F. graminearum remain a major threat to agriculture and food security. Several agriculturally relevant fungicides including the potent Quinofumelin have been discovered to date, yet the mechanisms of their action and specific targets within the cell remain unclear. This paper sets out to contribute to addressing these outstanding questions.
 
 We appreciate the reviewer's accurate summary of our manuscript.
 
 Strengths:
 
 The paper is generally well-written and provides convincing data to support their claims for the impact of Quinofumelin on fungal growth, the target of the drug, and the potential mechanism. Critically the authors identify an important pyrimidine pathway dihydroorotate dehydrogenase (DHODH) gene FgDHODHII in the pathway or mechanism of the drug from the prominent plant pathogen F. graminearum, confirming it as the target for Quinofumelin. The evidence is supported by transcriptomic, metabolomic as well as MST, SPR, molecular docking/structural biology analyses.
 
 We appreciate the reviewer's recognition of the strengths of our manuscript.
 
 Weaknesses:
 
 Whilst the study adds to our knowledge about this drug, it is, however, worth stating that previous reports (although in different organisms) by Higashimura et al., 2022 https://pmc.ncbi.nlm.nih.gov/articles/PMC9716045/ had already identified DHODH as the target for Quinofumelin and hence this knowledge is not new and hence the authors may want to tone down the claim that they discovered this mechanism and also give sufficient credit to the previous authors work at the start of the write-up in the introduction section rather than in passing as they did with reference 25? other specific recommendations to improve the text are provided in the recommendations for authors section below.
 
 We appreciate the reviewer's suggestion. In the revised manuscript, we have incorporated the reference in the introduction section and expanded the discussion of previous work on quinofumelin by Higashimura et al., 2022 in the discussion section to more effectively contextualize their contributions. Moreover, we have made revisions and provided responses in accordance with the recommendations.
 
 Reviewer #2 (Public review):
 
 Summary:
 
 In the current study, the authors aim to identify the mode of action/molecular mechanism of characterized a fungicide, quinofumelin, and its biological impact on transcriptomics and metabolomics in Fusarium graminearum and other Fusarium species. Two sets of data were generated between quinofumelin and no treatment group, and differentially abundant transcripts and metabolites were identified. The authors further focused on uridine/uracil biosynthesis pathway, considering the significant up- and down-regulation observed in final metabolites and some of the genes in the pathways. Using a deletion mutant of one of the genes and in vitro biochemical assays, the authors concluded that quinofumelin binds to the dihydroorotate dehydrogenase.
 
 We appreciate the reviewer's accurate summary of our manuscript.
 
 Strengths:
 
 Omics datasets were leveraged to understand the physiological impact of quinofumelin, showing the intracellular impact of the fungicide. The characterization of FgDHODHII deletion strains with supplemented metabolites clearly showed the impact of the enzyme on fungal growth.
 
 We appreciate the reviewer's recognition of the strengths of our manuscript.
 
 Weaknesses:
 
 Some interpretation of results is not accurate and some experiments lack controls. The comparison between quinofumelin-treated deletion strains, in the presence of different metabolites didn't suggest the fungicide is FgDHODHII specific. A wild type is required in this experiment.
 
 Potential Impact: Confirming the target of quinofumelin may help understand its resistance mehchanism, and further development of other inhibitory molecules against the target.
 
 The manuscript would benefit more in explaining the study rationale if more background on previous characterization of this fungicide on Fusarium is given.
 
 We appreciate the reviewer's suggestion. Under no treatment with quinofumelin, mycelial growth remains normal and does not require restoration. In the presence of quinofumelin treatment, the supplementation of downstream metabolites in the de novo pyrimidine biosynthesis pathway can restore mycelial growth that is inhibited by quinofumelin. The wild-type control group is illustrated in Figure 4. Figure 5b depicts the phenotypes of the deletion mutants. With respect to the relationship among quinofumelin, FgDHODHII, and other metabolites, quinofumelin specifically targets the key enzyme FgDHODHII in the de novo pyrimidine biosynthesis pathway, disrupting the conversion of dihydroorotate to orotate, which consequently inhibits the synthesis downstream metabolites including uracil. In our previous study, quinofumelin not only exhibited excellent antifungal activity against the mycelial growth and spore germination of F. graminearum, but also inhibited the biosynthesis of deoxynivalenol (DON). We have added this part to the introduction section.
 
 Reviewer #3 (Public review):
 
 Summary:
 
 The manuscript shows the mechanism of action of quinofumelin, a novel fungicide, against the fungus Fusarium graminearum. Through omics analysis, phenotypic analysis, and in silico approaches, the role of quinofumelin in targeting DHODH is uncovered.
 
 We appreciate the reviewer's accurate summary of our manuscript.
 
 Strengths:
 
 The phenotypic analysis and mutant generation are nice data and add to the role of metabolites in bypassing pyrimidine biosynthesis.
 
 We appreciate the reviewer's recognition of the strengths of our manuscript.
 
 Weaknesses:
 
 The role of DHODH in this class of fungicides has been known and this data does not add any further significance to the field. The work of Higashimura et al is not appreciated well enough as they already showed the role of quinofumelin upon DHODH II.
 
 There is no mention of the other fungicide within this class ipflufenoquin, as there is ample data on this molecule.
 
 We appreciate the reviewer's suggestion. We sincerely appreciate the reviewer's insightful comment regarding the work of Higashimura et al. We agree that their investigation into the role of quinofumelin in DHODH II inhibition provides critical foundational insights for this field. In the revised manuscript, we have incorporated the reference in the introduction section and expanded the discussion of their work in the discussion section to more effectively contextualize their contributions. The information regarding action mechanism of ipflufenoquin against filamentous fungi was added in discussion section.
 
 Reviewer #1 (Recommendations for the authors):
 
 (1) Given that the DHODH gene had been identified as a target earlier, could the authors perform blast experiments with this gene instead and let us know the percentage similarity between the FgDHODHII gene and the Pyricularia oryzae class II DHODH gene in the report by Higashimura et al., 2022.
 
 BLAST experiment revealed that the percentage similarity between the FgDHODHII gene and the class II DHODH gene of P. oryzae was 55.41%. We have added the description ‘Additionally, the amino acid sequence of the FgDHODHII exhibits 55.41% similarity to that of DHODHII from Pyricularia oryzae, as previously reported (Higashimura et al., 2022)’ in section Results.
 
 (2) Abstract:
 
 The authors started abbreviating new terms e.g. DEG, DMP, etc but then all of a sudden stopped and introduced UMP with no full meaning of the abbreviation. Please give the full meaning of all abbreviations in the text, UMP, STC, RM, etc.
 
 We have provided the full meaning for all abbreviations as requested.
 
 (3) Introduction section:
 
 The introduction talks very little about the work of other groups on quinofumelin. Perhaps add this information in and reference them including the work of Higashimura et al., 2022 which has done quite significant work on this topic but is not even mentioned in the background
 
 We have added the work of other groups on quinofumelin in section introduction.
 
 (4) General statements:
 
 Please show a model of the pyrimidine pathway that quinofumelin attacks to make it easier for the reader to understand the context. They could just copy this from KEGG
 
 We have added the model (Fig. 7).
 
 (5) Line 186:
 
 The authors did a great job of demonstrating interactions with the Quinofumelin and went to lengths to perform MST, SPR, molecular docking, and structural biology analyses yet in the end provide no details about the specific amino acid residues involved in the interaction. I would suggest that site-directed mutagenesis studies be performed on FgDHODHII to identify specific amino acid residues that interact with Quinofumelin and show that their disruption weakens Quinofumelin interaction with FgDHODHII.
 
 Thank you for this insightful suggestion. We fully agree with the importance of elucidating the interaction mechanism. At present, we are conducting site-directed mutagenesis studies based on interaction sites from docking results and the mutation sites of FgDHODHII from the resistant mutants; however, due to the limitations in the accuracy of existing predictive models, this work remains ongoing. Additionally, we are undertaking co-crystallization experiments of FgDHODHII with quinofumelin to directly and precisely reveal their interaction pattern
 
 (6) Line 76:
 
 What is the reference or evidence for the statement 'In addition, quinofumelin exhibits no cross-resistance to currently extensively used fungicides, indicating its unique action target against phytopathogenic fungi.
 
 If two fungicides share the same mechanism of action, they will exhibit cross resistance. Previous studies have demonstrated that quinofumelin retains effective antifungal activity against fungal strains resistant to commercial fungicides, indicating that quinofumelin does not exhibit cross-resistance with other commercially available fungicides and possesses a novel mechanism of action. Additionally, we have added the relevant inference.
 
 (7) Line 80-82:
 
 Again, considering the work of previous authors, this target is not newly discovered. Please consider toning down this statement 'This newly discovered selective target for antimicrobial agents provides a valuable resource for the design and development of targeted pesticides.'
 
 We have rewritten the description of this sentence.
 
 (8) Line 138: If the authors have identified DHODH in experimental groups (I assume in F. graminearum), what was the exact locus tag or gene name in F. graminearum, and why not just continue with this gene you identified or what is the point of doing a blast again to find the gene if the DHODH gene if it already came up in your transcriptomic or metabolic studies? This unfortunately doesn't make sense but could be explained better.
 
 The information of FgDHODHII (gene ID: FGSG_09678) has been added. We have revised this part.
 
 Reviewer #2 (Recommendations for the authors):
 
 (1) Line 40:
 
 Please add a reference.
 
 We have added the reference
 
 (2) Line 47:
 
 Please add a reference.
 
 We have added the reference.
 
 (3) Line 50:
 
 The lack of target diversity in existing fungicides doesn't necessarily serve as a reason for discovering new targets being more challenging than identifying new fungicides within existing categories, please consider adjusting the argument here. Instead, the authors can consider reasons for the lack of new targets in the field.
 
 We have revised the description.
 
 (4) Line 63:
 
 Please cite your source with the new technology.
 
 We have added the reference.
 
 (5) Line 68:
 
 What are you referring to for "targeted medicine", do you have a reference?
 
 We have revised the description and the reference.
 
 (6) Line 74:
 
 One of the papers referred to "quinoxyfen", what are the similarities and differences between the two? Please elaborate for the readership.
 
 Quinoxyfen, similar to quinofumelin, contains a quinoline ring structure. It inhibits mycelial growth by disrupting the MAP kinase signaling pathway in fungi (https://www.frac.info). In addition, quinoxyfen still exhibits excellent antifungal activity against the quinofumelin-resistant mutants (the findings from our group), indicating that action mechanism for quinofumelin and quinoxyfen differ.
 
 (7) Line 84:
 
 Please introduce why RNA-Seq was designed in the study first. What were the groups compared? How was the experiment set up? Without this background, it is hard to know why and how you did the experiment.
 
 According to your suggestions, we have added the description in Section Results. In addition, the experimental process was described in Section Materials and methods as follows: A total of 20 mL of YEPD medium containing 1 mL of conidia suspension (1×105 conidia/mL) was incubated with shaking (175 rpm/min) at 25°C. After 24 h, the medium was added with quinofumelin at a concentration of 1 μg/mL, while an equal amount of dimethyl sulfoxide was added as the control (CK). The incubation continued for another 48 h, followed by ﬁltration and collection of hyphae. Carry out quantitative expression of genes, and then analyze the differences between groups based on the results of DESeq2 for quantitative expression.
 
 (8) Figures:
 
 The figure labeling is missing (Figures 1,2,3 etc). Please re-order your figure to match the text
 
 The figures have been inserted.
 
 (9) Line. 97:
 
 "Volcano plot" is a common plot to visualize DEGs, you can directly refer to the name.
 
 We have revised the description.
 
 (10) Figure 1d, 1e:
 
 Can you separate down- and up-regulated genes here? Does the count refer to gene number?
 
 The expression information for down- and up-regulated genes is presented in Figure 1a and 1b. However, these bubble plots do not distinguish down- and up-regulated genes. Instead, they only display the significant enrichment of differentially expressed genes in specific metabolic pathways. To more clearly represent the data, we have added the detailed counts of down- and up-regulated genes for each metabolic pathway in Supplementary Table S1 and S2. Here, the term "count" refers to differentially expressed genes that fall within a certain pathway.
 
 (11) Line 111:
 
 Again, no reasoning or description of why and how the experiment was done here.
 
 Based on the results of KEGG enrichment analysis, DEMs are associated with pathways such as thiamine metabolism, tryptophan metabolism, nitrogen metabolism, amino acid sugar and nucleotide sugar metabolism, pantothenic acid and CoA biosynthesis, and nucleotide sugar production compounds synthesis. To specifically investigate the metabolic pathways involved action mechanism of quinofumelin, we performed further metabolomic experiments. Therefore, we have added this description according the reviewer’s suggestions.
 
 (12) Figure 2a:
 
 It seems many more metabolites were reduced than increased. Is this expected? Due to the antifungal activity of this compound, how sick is the fungus upon treatment? A physiological study on F. graminearum (in a dose-dependent manner) should be done prior to the omics study. Why do you think there's a stark difference between positive and negative modes in terms of number of metabolites down- and up-regulated?
 
 Quinofumelin demonstrates exceptional antifungal activity against Fusarium graminearum. The results indicate that the number of reduced metabolites significantly exceeds the number of increased metabolites upon quinofumelin treatment. Mycelial growth is markedly inhibited under quinofumelin exposure. Prior to conducting omics studies, we performed a series of physiological and biochemical experiments (refer to Qian Xiu's dissertation https://paper.njau.edu.cn/openfile?dbid=72&objid=50_49_57_56_49_49&flag=free). Upon quinofumelin treatment, the number of down-regulated metabolites notably surpasses that of up-regulated metabolites compared to the control group. Based on the findings from the down-regulated metabolites, we conducted experiments by exogenously supplementing these metabolites under quinofumelin treatment to investigate whether mycelial growth could be restored. The results revealed that only the exogenous addition of uracil can restore mycelial growth impaired by quinofumelin.
 
 Quinofumelin exhibits an excellent antifungal activity against F. graminearum. At a concentration of 1 μg/mL, quinofumelin inhibits mycelial growth by up to 90%. This inhibitory effect indicates that life activities of F. graminearum are significantly disrupted by quinofumelin. Consequently, there is a marked difference in down- and up-regulated metabolites between quinofumelin-treated group and untreated control group. The detailed results were presented in Figures 1 and 2.
 
 (13) Figure 2e:
 
 This is a good analysis. To help represent the data more clearly, the authors can consider representing the expression using fold change with a p-value for each gene.
 
 To more clearly represent the data, we have incorporated the information on significant differences in metabolites in the de novo pyrimidine biosynthesis pathway, as affected by quinofumelin, in accordance with the reviewer’s suggestions.
 
 (14) Line 142:
 
 Please indicate fold change and p-value for statistical significance. Did you validate this by RT-qPCR?
 
 We validated the expression level of the DHODH gene under quinofumelin treatment using RT-qPCR. The results indicated that, upon treatment with the EC50 and EC90 concentrations of quinofumelin, the expression of the DHODH gene was significantly reduced by 11.91% and 33.77%, respectively (P<0.05). The corresponding results have been shown in Figure S4.
 
 (15) Line 145:
 
 It looks like uracil is the only metabolite differentially abundant in the samples - how did you conclude this whole pathway was impacted by the treatment?
 
 The experiments involving the exogenous supplementation of uracil revealed that the addition of uracil could restore mycelial growth inhibited by quinofumelin. Consequently, we infer that quinofumelin disrupts the de novo pyrimidine biosynthesis pathway. In addition, as uracil is the end product of the de novo pyrimidine biosynthesis pathway, the disruption of this pathway results in a reduction in uracil levels.
 
 (16) Figure 3:
 
 What sequence was used as the root of the tree? Why were the species chosen? Since the BLAST query was Homo sapiens sequence, would it be good to use that as the root?
 
 FgDHODHII sequence was used as the root of the tree. These selected fungal species represent significant plant-pathogenic fungi in agriculture production. According to your suggestion, we have removed the BLAST query of Homo sapiens in Figure 3.
 
 (17) Figure 4:
 
 How were the concentrations used to test chosen?
 
 Prior to this experiment, we carried out concentration-dependent exogenous supplementation experiments. The results indicated that 50 μg/mL of uracil can fully restore mycelial growth inhibited by quinofumelin. Consequently, we chose 50 μg/mL as the testing concentration.
 
 (18) Line 164:
 
 Why do you hypothesize supplementing dihydroorotate would restore resistance? The metabolite seemed accumulated in the treatment condition, whereas downstream metabolites were comparable or even depleted. The DHODH gene expression was suppressed. Would accumulation of dihydroorotate be associated with growth inhibition by quinofumelin? Please include the hypothesis and rationale for the experimental setup.
 
 DHODH regulates the conversion of dihydroorotate to orotate in the de novo pyrimidine biosynthesis pathway. The inhibition of DHODH by quinofumelin results in the accumulation of dihydroorotate and the depletion of the downstream metabolites, including UMP, uridine and uracil. Consequently, downstream metabolites were considered as positive controls, while upstream metabolite dihydroorotate served as a negative control. This design further demonstrates DHODH as action target of quinofumelin against F. graminearum. In addition, the accumulation of dihydroorotate is not associated with growth inhibition by quinofumelin; however, but the depletion of downstream metabolites in the de novo pyrimidine biosynthesis pathway is closely associated with growth inhibition by quinofumelin.
 
 (19) Line 168:
 
 I'm not sure if this conclusion is valid from your results in Figure 4 showing which metabolites restore growth.
 
 o minimize the potential influence of strain-specific effects, five strains were tested in the experiments shown in Figure 4. For each strain, the first row (first column) corresponds to control condition, while second row (first column) represents treatment with 1 μg/mL of quinofumelin, which completely inhibits mycelial growth. The second row (second column) for each strain represents the supplementation with 50 μg/mL of dihydroorotate fails to restore mycelial growth inhibited by quinofumelin. In contrast, the second row (third column, fourth column, fifth colomns) for each strain demonstrated that the supplementation of 50 μg/mL of UMP, uridine and uracil, respectively, can effectively restore mycelial growth inhibited by quinofumelin.
 
 (20) Figure 5a:
 
 The fact you saw growth of the deletion mutant means it's not lethal. However, the growth was severely inhibited.
 
 Our experimental results indicate that the growth of the deletion mutant is lethal. The mycelial growth observed originates from mycelial plugs that were not exposed to quinofumelin, rather than from the plates amended with quinofumelin.
 
 (21) Figure 5b:
 
 Would you expect different restoration of growth in the presence of quinofumelin vs. no treatment? The wild type control is missing here. Any conclusions about the relationship between quinofumelin, FgDHODHII, and other metabolites in the pathway?
 
 Under no treatment with quinofumelin, mycelial growth remains normal and does not require restoration. In the presence of quinofumelin treatment, the supplementation of downstream metabolites in the de novo pyrimidine biosynthesis pathway can restore mycelial growth that is inhibited by quinofumelin. The wild-type control group is illustrated in Figure 4. Figure 5b depicts the phenotypes of the deletion mutants. With respect to the relationship among quinofumelin, FgDHODHII, and other metabolites, quinofumelin specifically targets the key enzyme FgDHODHII in the de novo pyrimidine biosynthesis pathway, disrupting the conversion of dihydroorotate to orotate, which consequently inhibits the synthesis downstream metabolites including uracil.
 
 (22) Figure 6b:
 
 Lacking positive and negative controls (known binder and non-binder). What does the Kd (in comparison to other interactions) indicate in terms of binding strength?
 
 We tested the antifungal activities of publicly reported DHODH inhibitors (such as leflunomide and teriflunomide) against F. graminearum. The results showed that these inhibitors exhibited no significant inhibitory effects against the strain PH-1. Therefore, we lacked an effective chemical for use as a positive control in subsequent experiments. Biacore experiments offers detailed insights into molecular interactions between quinofumelin and DHODHII. As shown in Figure 6b, the left panel illustrates the time-dependent kinetic curve of quinofumelin binding to DHODHII. Within the first 60 s after quinofumelin was introduced onto the DHODHII surface, it bound to the immobilized DHODHII on the chip surface, with the response value increasing proportionally to the quinofumelin concentration. Following cessation of the injection at 60 s, quinofumelin spontaneously dissociated from the DHODHII surface, leading to a corresponding decrease in the response value. The data fitting curve presented on the right panel indicates that the affinity constant KD of quinofumelin for DHODHII is 6.606×10-6 M, which falls within the typical range of KD values (10-3 ~ 10-6 M) for protein-small molecule interaction patterns. A lower KD value indicates a stronger affinity; thus, quinofumelin exhibits strong binding affinity towards DHODHII.
 
 Reviewer #3 (Recommendations for the authors):
 
 The authors should add information about the other molecule within this class, ipflufenoquin, and what is known about it. There are already published data on its mode of action on DHODH and the role of pyrimidine biosynthesis.
 
 We have added the information regarding action mechanism of ipflufenoquin against filamentous fungi in discussion section.
 
 The work of Higashimura et al is not appreciated well enough as they already showed the role of quinofumelin upon DHODH II.
 
 We sincerely appreciate the reviewer's insightful comment regarding the work of Higashimura et al. We agree that their investigation into the role of quinofumelin in DHODH II inhibition provides critical foundational insights for this field. In the revised manuscript, we have incorporated the reference in the introduction section and expanded the discussion of their work in the discussion section to more effectively contextualize their contributions.
 
 It is unclear how the protein model was established and this should be included. What species is the molecule from and how was it obtained? How are they different from Fusarium?
 
 The three-dimensional structural model of F. graminearum DHODHII protein, as predicted by AlphaFold, was obtained from the UniProt database. Additionally, a detailed description along with appropriate citations has been incorporated in the ‘Manuscript’ file.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

Review 2

Review 3

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.01.13.632717v2
www.biorxiv.org www.biorxiv.org

Online reinforcement learning of state representation in recurrent network: the power of random feedback and biological constraints

1
1. Public_Reviews 22 Jul 2025
  
  in eLife
  
  Author response:
  
  The following is the authors’ response to the previous reviews.
  
  Reviewer #1 (Public Review):
  
  We thank the reviewer for the positive feedback on the work. The reviewer has raised two weaknesses and in the following we discuss how those can be addressed.
  
  Weaknesses:
  
  The impact of the article is limited by using a network with discrete time- steps, and only a small number of time steps from stimulus to reward. They assume that each time step is on the order of hundreds of ms. They justify this by pointing to some slow intrinsic mechanisms, but they do not implement these slow mechanisms is a network with short time steps, instead they assume without demonstration that these could work as suggested. This is a reasonable first approximation, but its validity should be explicitly tested.
  
  Our goal here was to give a proof of concept that online random feedback is sufficient to train an RNN to estimate value. Indeed, it is important to show that the idea works in a model where the slow mechanisms are explicitly implemented. However, this is a non-trivial task and desired to be addressed in future works.
  
  As the delay between cue and reward increases the performance decreases. This is not surprising given the proposed mechanism, but is still a limitation, especially given that we do not really know what a is the reasonable value of a single time step.
  
  In reply to this comment and the other reviewer's related comment, we have conducted two sets of additional simulations, one for examining incorporation of eligibility traces, and the other for considering (though not mechanistically implementing) behavioral time-scale synaptic plasticity (BTSP). We have added their results to the revised manuscript as Appendix. We think that the results addressed this point to some extent while how longer cue-reward delay can be learnt by elaboration of the model remains as a future issue.
  
  Reviewer #2 (Public Review):
  
  We thank the reviewer for the positive feedback on the work. The reviewer gave comments on our revisions, and here we discuss how those can be addressed.
  
  Comments on revisions: I would still want to see how well the network learns tasks with longer time delays (on the order of 100 or even 1000 timesteps). Previous work has shown that random feedback struggles to encode longer timescales (see Murray 2019, Figure 2), so I would be interested to see how that translates to the RL context in your model.
  
  We would like to note that in Murray et al 2019 the random feedback per se appeared not to be primarily responsible for the difficulty in encoding longer timesclaes. In the Figure 2d (Murray 2019), the author compared his RFLO (random feedback local online) and BPTT with two intermediate algorithms, which incorporated either one of the two approximations made in RFLO: i) random feedback instead of symmetric feedback, and ii) omittance of non-local effect (i.e., dependence of the derivative of the loss with respect to a given weight on the other weights). The performance difference between RFLO and BPTT was actually mostly explained by ii), as the author mentioned "The results show that the local approximation is essentially fully responsible for the performance difference between RFLO and BPTT, while there is no significant loss in performance due to the random feedback alone. (Line 6-8, page 7 of Murray, 2019, eLife)".
  
  Meanwhile, regarding the difference in the performance of the model with random feedback vs the model with symmetric feedback in our settings, actually it appeared (already) in the case with 6 time-steps or less (the biologically constrained model with random feedback performed worse: Fig. 6J, left).
  
  In practice, our model, either with random or symmetric feedback, would not be able to learn the cases with very long delays. This is indeed a limitation of our model. However, our model is critically different from the model of Murray 2019 in that we use RL rather than supervised learning and we use a scalar bootstrapped (TD) reward-prediction-error rather than the true output error. We would think that these differences may be major reasons for the limited learning ability of our model.
  
  Regarding the feasibility of the model when tasks involve longer time delays: Indeed this is a problem and the other reviewers have also raised the same point. Our model can be extended by incorporating either a kind of eligibility trace (similar one to those contained in RFLO and e-prop) or behavioral time-scale synaptic plasticity (BTSP), and we have added the results of simulations incorporating each to the revised manuscript as Appendix. But how longer cue-reward delay can be learnt by elaboration of the model remains as a future issue.
  
  Reviewer #3 (Public Review):
  
  Comments on revisions: Thank you for addressing all my comments in your reply.
  
  We are happy to learn that all concerns raised by the reviewer in the previous round were addressed adequately. We agree with the reviewer that there are several ways the work can be improved.
  
  The various points raised by the reviewers at weaknesses are desired to be taken up in future works.
  
  AuthorResponse
Visit annotations in context

Tags

AuthorResponse

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2024.08.22.609100v4
www.medrxiv.org www.medrxiv.org

Heterozygous variants in PLCG1 affect hearing, vision, cardiac, and immune function

3
1. Public_Reviews 22 Jul 2025
 
 in eLife
 
 eLife Assessment
 
 This important study reveals how Drosophila may be used to investigate the role of missense variants in the PLCG1 phospholipase gene in human diseases. The experimental evidence is compelling and brings together rigorous analysis of clinical and model organism phenotypes with a structural analysis of the PLCG1 protein.
 
 Summary
2. Public_Reviews 22 Jul 2025
 
 in eLife
 
 Reviewer #2 (Public review):
 
 The manuscript by Ma et al. reports the identification of three unrelated people who are heterozygous for de novo missense variants in PLCG1, which encodes phospholipase C-gamma 1, a key signaling protein. These individuals present with partially overlapping phenotypes, including hearing loss, ocular pathology, cardiac defects, abnormal brain imaging results, and immune defects. None of the patients present with all of the above phenotypes. PLCG1 has also been implicated as a possible driver for cell proliferation in cancer.
 
 The three missense variants found in the patients result in the following amino acid substitutions: His380Arg, Asp1019Gly, and Asp1165Gly. PLCG1 (and the closely related PLCG2) have a single Drosophila ortholog called small wing (sl). sl-null flies are viable but have small wings with ectopic wing veins and supernumerary photoreceptors in the eye. As all three amino acids affected in the patients are conserved in the fly protein, in this work Ma et al. tested whether they are pathogenic by expressing either reference or patient variant fly or human genes in Drosophila and determining the phenotypes produced by doing so.
 
 Expression in Drosophila of the variant forms of PLCG1 found in these three patients is toxic; highly so for Asp1019Gly and Asp1165Gly, much more modestly for His380Arg. Another variant, Asp1165His which was identified in lymphoma samples and shown by others to be hyperactive, was also found to be toxic in the Drosophila assays. However, a final variant, Ser1021Phe, identified by others in an individual with severe immune dysregulation, produced no phenotype upon expression in flies.
 
 Based on these results, the authors conclude that the PLCG1 variants found in patients are pathogenic, producing gain-of-function phenotypes through hyperactivity. In my view, the data supporting this conclusion are robust, despite the lack of a detectable phenotype with Ser1021Phe, and I have no concerns about the core experiments that comprise the paper.
 
 Fig. 6, the last in the paper, provides information about PLCG1 structure and how the different variants would affect it. It shows that His380, Asp1019 and Asp1165 all lie within catalytic domains or intramolecular interfaces, and that variants in the latter two affect residues essential for autoinhibition. It also shows that Ser1021 falls outside the key interface occupied by Asp1019, but more could have been said about the potential effects of Ser1021Phe.
 
 Overall, I believe the authors fully achieved the aims of their study. The work will have a substantial impact because it reports the identification of novel disease-linked genes, and because it further demonstrates the high value of the Drosophila model for finding and understanding gene-disease linkages.
 
 Comments on revisions:
 
 The single recommendation I made on the original version, which was to further examine H380 mutants, has been satisfactorily addressed in the revised version.
 
 Review 2
3. Public_Reviews 22 Jul 2025
 
 in eLife
 
 Author response:
 
 The following is the authors’ response to the original reviews.
 
 Reviewer #1 (Public Review):
 
 Summary:
 
 This manuscript provides an initial characterization of three new missense variants of the PLCG1 gene associated with diverse disease phenotypes, utilizing a Drosophila model to investigate their molecular effects in vivo. Through the meticulous creation of genetic tools, the study assesses the small wing (sl) phenotype - the fly's ortholog of PLCG1 - across an array of phenotypes from longevity to behavior in both sl null mutants and variants. The findings indicate that the Drosophila PLCG1 ortholog displays aberrant functions. Notably, it is demonstrated that overexpression of both human and Drosophila PLCG1 variants in fly tissue leads to toxicity, underscoring their pathogenic potential in vivo.
 
 Strengths:
 
 The research effectively highlights the physiological significance of sl in Drosophila. In addition, the study establishes the in vivo toxicity of disease-associated variants of both human PLCG1 and Drosophila sl.
 
 Weaknesses:
 
 The study's limitations include the human PLCG1 transgene's inability to compensate for the Drosophila sl null mutant phenotype, suggesting potential functional divergence between the species. This discrepancy signals the need for additional exploration into the mechanistic nuances of PLCG1 variant pathogenesis, especially regarding their gain-of-function effects in vivo.
 
 Overall:
 
 The study offers compelling evidence for the pathogenicity of newly discovered disease-related PLCG1 variants, manifesting as toxicity in a Drosophila in vivo model, which substantiates the main claim by the authors. Nevertheless, a deeper inquiry into the specific in vivo mechanisms driving the toxicity caused by these variants in Drosophila could significantly enhance the study's impact.
 
 Reviewer #2 (Public Review):
 
 The manuscript by Ma et al. reports the identification of three unrelated people who are heterozygous for de novo missense variants in PLCG1, which encodes phospholipase C-gamma 1, a key signaling protein. These individuals present with partially overlapping phenotypes including hearing loss, ocular pathology, cardiac defects, abnormal brain imaging results, and immune defects. None of the patients present with all of the above phenotypes. PLCG1 has also been implicated as a possible driver for cell proliferation in cancer.
 
 The three missense variants found in the patients result in the following amino acid substitutions: His380Arg, Asp1019Gly, and Asp1165Gly. PLCG1 (and the closely related PLCG2) have a single Drosophila ortholog called small wing (sl). sl-null flies are viable but have small wings with ectopic wing veins and supernumerary photoreceptors in the eye. As all three amino acids affected in the patients are conserved in the fly protein, in this work Ma et al. tested whether they are pathogenic by expressing either reference or patient variant fly or human genes in Drosophila and determining the phenotypes produced by doing so.
 
 Expression in Drosophila of the variant forms of PLCG1 found in these three patients is toxic; highly so for Asp1019Gly and Asp1165Gly, much more modestly for His380Arg. Another variant, Asp1165His which was identified in lymphoma samples and shown by others to be hyperactive, was also found to be toxic in the Drosophila assays. However, a final variant, Ser1021Phe, identified by others in an individual with severe immune dysregulation, produced no phenotype upon expression in flies.
 
 Based on these results, the authors conclude that the PLCG1 variants found in patients are pathogenic, producing gain-of-function phenotypes through hyperactivity. In my view, the data supporting this conclusion are robust, despite the lack of a detectable phenotype with Ser1021Phe, and I have no concerns about the core experiments that comprise the paper.
 
 Figure 6, the last in the paper, provides information about PLCG1 structure and how the different variants would affect it. It shows that the His380, Asp1019, and Asp1165 all lie within catalytic domains or intramolecular interfaces and that variants in the latter two affect residues essential for autoinhibition. It also shows that Ser1021 falls outside the key interface occupied by Asp1019, but more could have been said about the potential effects of Ser1021Phe.
 
 Overall, I believe the authors fully achieved the aims of their study. The work will have a substantial impact because it reports the identification of novel disease-linked genes, and because it further demonstrates the high value of the Drosophila model for finding and understanding gene-disease linkages.
 
 Reviewer #3 (Public Review):
 
 Summary:
 
 The paper attempts to model the functional significance of variants of PLCG2 in a set of patients with variable clinical manifestations.
 
 Strengths:
 
 A study attempting to use the Drosophila system to test the function of variants reported from human patients.
 
 Weaknesses:
 
 Additional experiments are needed to shore up the claims in the paper. These are listed below.
 
 Major Comments:
 
 (1) Does the pLI/ missense constraint Z score prediction algorithm take into consideration whether the gene exhibits monoallelic or biallelic expression?
 
 To our knowledge, pLI and missense Z don't consider monoallelic or biallelic expression. Instead, they reflect sequence constraint and are calculated based on the observed versus expected variant frequencies in population databases.
 
 (2) Figure 1B: Include human PLCG2 in the alignment that displays the species-wide conserved variant residues.
 
 We have updated Figure 1B and incorporated the alignment of PLCG2.
 
 (3) Figure 4A:
 
 Given that
 
 (i) sl is predicted to be the fly ortholog for both mammalian PLCγ isozymes: PLCG1 and PLCG2 [Line 62]
 
 (ii) they are shown to have non-redundant roles in mammals [Line 71]
 
 (iii) reconstituting PLCG1 is highly toxic in flies, leading to increased lethality.
 
 This raises questions about whether sl mutant phenotypes are specifically caused by the absence of PLCG1 or PLCG2 functions in flies. Can hPLCG2 reconstitution in sl mutants be used as a negative control to rule out the possibility of the same?
 
 The studies about the non-redundant roles of PLCG1 and PLCG2 mainly concern the immune system.
 
 We have assessed the phenotypes in the slT2A/Y; UAS-hPLCG2 flies. Expression of human PLCG2 in flies is also toxic and leads to severely reduced eclosion rate.
 
 We have updated the manuscript with these results, and included the eclosion rate of slT2A/Y; UAS-hPLCG2 flies in the new Figure 4B.
 
 (4) Do slT2A/Y; UAS-PLCG1Reference flies survive when grown at 22{degree sign}C? Since transgenic fly expressing PLCG1 cDNA when driven under ubiquitous gal4s, Tubulin and Da, can result in viable progeny at 22{degree sign}C, the survival of slT2A/Y; UAS-PLCG1Reference should be possible.
 
 The eclosion rate of slT2A/Y >PLCG1Reference flies at 22°C is slightly higher than at 25°C, but remains severely reduced compared to the UAS-Empty control. We have presented these results in the updated Figure S3.
 
 and similarly
 
 Does slT2A flies exhibit the phenotypes of (i) reduced eclosion rate (ii) reduced wing size and ectopic wing veins and (iii) extra R7 photoreceptor in the fly eye at 22{degree sign}C?
 
 The mutant phenotypes are still observed at 22 °C.
 
 If so, will it be possible to get a complete rescue of the slT2A mutant phenotypes with the hPLCG1 cDNA at 22{degree sign}C? This dataset is essential to establish Drosophila as an ideal model to study the PLCG1 de novo variants.
 
 Thank you for the suggestion. It is difficult to directly assess the rescue ability of the PLCG1 cDNAs due to the toxicity. However, our ectopic expression assays show that the variants are more toxic than the reference with variable severities, suggesting that the variants are deleterious.
 
 The ectopic expression strategy has been used to evaluate the consequence of genetic variants and has significantly contributed to the interpretation of their pathogenicity in many cases (reviewed in Her et al., Genome, 2024, PMID: 38412472).
 
 (5) Localisation and western blot assays to check if the introduction of the de novo mutations can have an impact on the sub-cellular targeting of the protein or protein stability respectively.
 
 Thank you for the suggestion.
 
 We expressed PLCG1 cDNAs in the larval salivary glands and performed antibody staining (rabbit anti-Human PLCG1; 1:100, Cell Signaling Technology, #5690). The larval salivary gland are composed of large columnar epithelia cells that are ideal for analyzing subcellular localization of proteins. The PLCG1 proteins are cytoplasmic and localize near the cell surface, with some enrichment in the plasma membrane region. The variant proteins are detected, and did not show significant difference in expression level or subcellular distribution compared to the reference. We did not include this data.
 
 (6) Analysing the nature of the reported gain of function (experimental proof for the same is missing in the manuscript) variants:
 
 Instead of directly showing the effect of introducing the de novo variant transgenes in the Drosophila model especially when the full-length PLCG1 is not able to completely rescue the slT2A phenotype;
 
 (i) Show that the gain-of-function variants can have an impact on the protein function or signalling via one of the three signalling outputs in the mammalian cell culture system: (i) inositol-1,4,5-trisphosphate production, (ii) intracellular Ca2+ release or (iii) increased phosphorylation of extracellular signal-related kinase, p65, and p38.
 
 We appreciate the reviewer’s suggestion. We utilized the CaLexA (calcium-dependent nuclear import of LexA) system (Masuyama et al., J Neurogenet, 2012, PMID: 22236090) to assess the intracellular Ca2+ change associated with the expression of PLCG1 cDNAs in fly wing discs. The results show that, compared to the reference, expression of the D1019G or D1165G variants leads to elevated intracellular Ca2+ levels, similar to the hyperactive S1021F and D1165H variants. However, the H380R or L597F variants did not show a detectable phenotype in this assay. These results suggest that D1019G and D1165G are hyperactive variants, whereas H380R and L597F variant are not, or their effect is too mild to be detected in this assay. We have updated the related sections in the manuscript and Figures 5A and S5.
 
 OR
 
 (ii) Run a molecular simulation to demonstrate how the protein's auto-inhibited state can be disrupted and basal lipase activity increased by introducing D1019G and D1165G, which destabilise the association between the C2 and cSH2 domains. The H380R variant may also exhibit characteristics similar to the previously documented H335A mutation which leaves the protein catalytically inactive as the residue is important to coordinate the incoming water molecule required for PIP2 hydrolysis.
 
 We utilized the DDMut platform, which predicts changes in the Gibbs Free Energy (ΔΔG) upon single and multiple point mutations (Zhou et al., Nucleic Acid Res, 2023, PMID: 37283042), to gain insight into the molecular dynamics changes of variants. The results are now presented in Figure S7.
 
 Additionally, we performed Molecular dynamics (MD) simulations. The results show that, similar to the hyperactive D1165H variant, the D1019G and D11656G variants exhibit increased disorganization, with a higher root mean square deviations (RMSD) compared to the reference PLCG1.The data are also presented in the updated Figure S7.
 
 (7) Clarify the reason for carrying out the wing-specific and eye-specific experiments using nub-gal4 and eyless-gal4 at 29˚C despite the high gal4 toxicity at this temperature.
 
 We used high temperature and high expression level to see if the mild H380R and L597F variants could show phenotypes in this condition.
 
 The toxicity of the two strong variants (D1019G and D1165G) has been consistently confirmed in multiple assays at different temperatures.
 
 (8) For the sake of completeness the authors should also report other variants identified in the genomes of these patients that could also contribute to the clinical features.
 
 Thank you!
 
 The additional variants and their potential contributions to the clinical features are listed and discussed in Table 1 and its legend.
 
 Reviewer #1 (Recommendations For The Authors):
 
 The manuscript's significant contribution is tempered by a lack of comprehensive analysis using the generated genetic reagents in Drosophila. To enhance our understanding of the PLCG1 orthologs, I suggest the following:
 
 (1) A more detailed molecular analysis to distinguish the actions of sl variants from the wild-type could be very informative. For example, utilizing the HA-epitope tag within the current UAS-transgenes could reveal more about the cellular dynamics and abundance of these variants, potentially elucidating mechanisms beyond gain-of-function.
 
 We appreciate the reviewer’s suggestion. The UAS-sl cDNA constructs contain stop codon and do not express an HA-epitope tag. Alternatively, we utilized commercially available antibodies against human PLCG1 antibodies to assess the subcellular localization and protein stability by expressing the reference and variant PLCG1 cDNAs in Drosophila larval salivary glands. The reference proteins are cytoplasmic with some enrichment along the plasma membrane. However, we did not observe significant differences between the reference and variant proteins in this assay. We did not include this data.
 
 (2) I suggest further investigating the relative contributions of developmental processes and acute (Adult) effects on the sl-variant phenotypes observed. For example, employing systems that allow for precise temporal control of gene expression, such as the temperature-sensitive Gal80, could differentiate between these effects, shedding light on the mechanisms that affect longevity and locomotion. This knowledge would be vital for a deeper understanding of the corresponding human disorders and for developing therapeutic interventions.
 
 We appreciate the reviewer’s suggestion. We utilized Tub-GAL4, Tub-GAL80ts to drive the expression of sl wild-type or variant cDNAs, and performed temperature shifts after eclosion to induce expression of the cDNAs only in adult flies. The slD1184G variant (corresponding to PLCG1D1165G) caused severely reduced lifespan and the flies mostly die within 10 days. The slD1041G variant (corresponding to PLCG1D1019G) led to reduced longevity and locomotion. The slH384R variant (corresponding to PLCG1H380R) showed only a mild effect on longevity and no significant effect on climbing ability. These results suggest that the two strong variants (slD1041G and slD1184G) contribute to both developmental and acute effects while the H384R variant mainly contributes to developmental stages.
 
 I also suggest a more refined analysis of overexpression toxicity. Rather than solely focusing on ubiquitous transgene expression, overexpressing transgene in endogenous pattern using sl-t2a-Gal4 may yield a more nuanced understanding of the pathogenic mechanisms of gain-of-function mutations, particularly in the pathogenesis associated with these variants exclusively located in the coding regions.
 
 We appreciate the reviewer’s suggestion. We therefore performed the experiments using slT2A to drive overexpression ofPLCG1cDNAs in heterozygous female progeny with one copy of wild-type sl+ (slT2A/ yw > UAS-cDNAs). In this context, expression of PLCG1Reference, PLCG1H380RorPLCG1L597F is viable whereas expression of PLCG1D1019G or PLCG1D1165G is lethal, suggesting that the PLCG1D1019G and PLCG1D1165G variants exert a strong dominant toxic effect while the PLCG1H380Rand PLCG1L597F are comparatively milder. Similar patterns have been consistently observed in other ectopic expression assays with varying degrees of severity. These results are updated in the manuscript and figures.
 
 Reviewer #2 (Recommendations For The Authors):
 
 The work in the paper could be usefully extended by determining the effects of expressing His380Phe and His380Ala in flies. These variants suppress PLCG1 activity, so their phenotype, if any, would be predicted not to be the same as His380Arg. Determining this would add further strength to the conclusions of the paper.
 
 We thank the reviewer for the constructive suggestions! We have tested the enzymatic-dead H380A variant, which still exhibits toxicity when expressed in slT2A/Y hemizygous flies, but it is not toxic in heterozygous females suggesting that the reduced eclosion rate is likely not directly associated with enzymatic activity. We have updated the manuscript and figures accordingly.
 
 AuthorResponse
Visit annotations in context

Tags

Summary

Review 2

AuthorResponse

Annotators

Public_Reviews

URL

medrxiv.org/content/10.1101/2024.01.08.23300523v2
www.biorxiv.org www.biorxiv.org

Formation of Task Representations and Replay in Mouse Medial Prefrontal Cortex

4
1. Public_Reviews 22 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This useful study characterizes the evolution of medial prefrontal cortex activity during the learning of an odor-based choice task. While the evidence for an increase in task-informative cells with learning, the emergence of population sequences, and the presence of replay events is intriguing, it remains incomplete; notably, the study does not adequately consider the extensive literature on the role of olfactory and hippocampal networks in similar odor-guided tasks. Furthermore, the experimental design appears insufficient to support strong conclusions regarding pre-existing representations or the functional relevance of neural sequences. The study will be of interest to neuroscientists investigating learning and decision-making processes.
  
  Summary
2. Public_Reviews 22 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  The authors use longitudinal in vivo 1-photon calcium recordings in mouse prefrontal cortex throughout the learning of an odor-guided spatial memory task, with the goal of examining the development of task-related prefrontal representations over the course of learning in different task stages and during sleep sessions. They report replication of their previous results, Muysers et al. 2025, that task and representations in prefrontal cortex arise de novo after learning, comprising of goal selective cells that fire selectively for left or right goals during the spatial working memory component of the task, and generalized task phase selective cells that fire equivalently in the same place irrespective of goal, together comprising task-informative cells. The number of task-informative cells increases over learning, and covariance structure changes resulting in increased sequential activation in the learned condition, but with limited functional relevance to task representation. Finally, the authors report that similar to hippocampal trajectory replay, prefrontal sequences are replayed at reward locations.
  
  Strengths:
  
  The major strength of the study is the use of longitudinal recordings, allowing identification of task-related activity in the prefrontal cortex that emerges de novo after learning, and identification of sub-second sequences at reward wells.
  
  Weaknesses:
  
  (1) The study mainly replicates the authors' previously reported results about generalized and trajectory-specific coding of task structure by prefrontal neurons, and stable and changing representations over learning (Muysers et al., 2024, PMID: 38459033; Muysers et al., 2025, PMID: 40057953), although there are useful results about changes in goal-selective and task-phase selective cells over learning. There are basic shortcomings in the scientific premise of two new points in this manuscript, that of the contribution of pre-existing spatial representations, and the role of replay sequences in the prefrontal cortex, both of which cannot be adequately tested in this experimental design.
  
  (2) The study denotes neurons that show precise spatial firing equivalently irrespective of goal, as generalized task representations, and uses this as a means to testing whether pre-existing spatial representations can contribute to task coding and learning. A previous study using this data has already shown that these neurons preferentially emerge during task learning (Muysers et al., 2025, PMID: 40057953). Furthermore, in order to establish generalization for abstract task rules or cognitively flexibility, as motivated in the manuscript, there is a need to show that these neurons "generalize" not just to firing in the same position during learning of a given task, but that they can generalize across similar tasks, e.g., different mazes with similar rules, different rules with similar mazes, new odor-space associations, etc. For an adequate test of pre-existing spatial structure, either a comparison task, as in the examples above, is needed, or at least a control task in which animals can run similar trajectories without the task contingencies. An unambiguous conclusion about pre-existing spatial structure is not possible without these controls.
  
  (3) The scientific premise for the test of replay sequences is motivated using hippocampal activity in internally guided spatial working memory rule tasks (Fernandez-Ruiz et al., 2019, PMID: 31197012; Kay et al., PMID: 32004462; Tang et al., 2021, PMID: 33683201), and applied here to prefrontal activity in a sensory-cue guided spatial memory task (Muysers et al., 2024, PMID: 38459033; Symanski et al., PMID: 36480255; Taxidis et al, 2020, PMID: 32949502). There are several issues with the conclusion in the manuscript that prefrontal replay sequences are involved in evaluating behavioral outcomes rather than planning future outcomes.
  
  (4) First, odor sampling in odor-guided memory tasks is an active sensory processing state that leads to beta and other oscillations in olfactory regions, hippocampus, prefrontal cortex, and many other downstream networks, as documented in a vast literature of studies (Martin et al., 2007, PMID: 17699692; Kay, 2014, PMID: 24767485; Martin et al., 2014; Ramirez-Gordillo, 2022, PMID: 36127136; Symanski et al., 2022, PMID: 36480255). This is an active sensory state, not conducive to internal replay sequences, unlike references used in this manuscript to motivate this analysis, which are hippocampal spatial memory studies with internally guided rather than sensory-cue guided decisions, where internal replay is seen during immobility at reward wells. These two states cannot be compared with the expectation of finding similar replay sequences, so it is trivially expected that internal replay sequences will not be seen during odor sampling.
  
  (5) Second, sequence replay is not the only signature of reactivation. Many studies have quantified prefrontal replay using template matching and reactivation strength metrics that do not involve sequences (Peyrache et al., 2009, PMID: 19483687; Sun et al., 2024, PMID: 38872470). Third, previous studies have explicitly shown that prefrontal activity can be decoded during odor sampling to predict future spatial choices - this uses sensory-driven ensemble activity in prefrontal cortex and not replay, as odor sampling leads to sensory driven processing and recall rather than a reactivation state (Symanski et al., 2022, PMID: 36480255). It is possible that 1-photon recordings do not have the temporal resolution and information about oscillatory activity to enable these kinds of analyses. Therefore, an unambiguous conclusion about the existence and role of prefrontal reactivation is not possible in this experimental and analytical design.
  
  Review 1
3. Public_Reviews 22 Jul 2025
  
  in eLife
  
  Reviewer #2 (Public review):
  
  Summary:
  
  The first part of the manuscript quantifies the proportion of goal-arm specific and task-phase specific cells during the learning and learned conditions, and similar to their previously published Muysers et al. (2025) paper, find that the task-phase coding cells (Muysers et al. call them path equivalent cells) increase in the learned condition. However, compared to the Muysers et al. 2025 paper, this work quantifies the proportion of cells that change coding type across learning and learned conditions. The second part of the paper reports firing sequences using a sequence similarity clustering-based method that the group developed previously and applied to hippocampal data in the past.
  
  Strengths:
  
  Identifying sequences by a clustering method in which sequence patterns of individual events are compared is an interesting idea.
  
  Weaknesses:
  
  Further controls are needed to validate the results.
  
  Review 2
4. Public_Reviews 22 Jul 2025
  
  in eLife
  
  Reviewer #3 (Public review):
  
  In the study, the authors performed longitudinal 1P calcium imaging of mouse mPFC across 8 weeks during learning of an olfactory-guided task, including habituation, training, and sleep periods. The task had 3 arms. Odor was sampled at the end of the middle arm (named the "Sample" period). The animal then needed to run to one of the two other arms (R or L) based on the odor. The whole period until they reached the end of one of the choice arms was the "Outward" period. The time at the reward end was the "Reward" period. They noted several changes from the learning condition to the learned condition (there are some questions for the authors interspersed):
  
  (1) They classified cells in a few ways. First, each cell was classified as SI (spatially informative) if it had significantly more spatial information than shuffled activity, and ~50% of cells ended up being SI cells. Then, among the SI cells, they classified a cell as a TC (task cell) if it had statistically similar activity maps for R versus L arms, and a GC (goal arm cell) otherwise. Note that there are 4 kinds of these cells: outer arm TCs and GCs, and middle arm TCs and GCs (with middle arm GCs essentially being like "splitter cells" since they are not similarly active in the middle arm for R versus L trials). There was an increase in TCs from the learning to the learned condition sessions.
  
  (2) They analyze activity sequences across cells. They extracted 500 ms duration bursts (defined as periods of activity > 0.5 standard deviations over what I assume is the mean - if so, the authors can add "over the mean" to the burst definition in the methods). They first noted that the resulting "Burst rates were significantly larger during behavioral epochs than during sleep and during periods of habituation to the arena", and "Moreover, burst rates during correct trials were significantly lower than during error trials". For the sequence analysis, they only considered bursts consisting of at least 5 active cells. A cell's activity within the burst was set to the center of mass of calcium activity. Then they took all the sequences from all learned and learning sessions together and hierarchically clustered them based on Spearman's rank correlation between the order of activity in each pair of sequences (among the cells active in both). The iterative hierarchical clustering process produces groups (clusters) of sequences such that there are multiple repeats of sequences within a cluster. Different sequences are expressed across all the longitudinally recorded sessions. They noted "large differences of sequence activation between learning and learned condition, both in the spatial patterns (example animal in Figure 3D) and the distribution of the sequences (Figures 3D, E). Rastermap plots (Figure 3D) also reveal little similarity of sequence expression between task and habituation or sleep condition." They also note that the difference in the sequences between learning and learned conditions was larger than the difference between correct and error trials within each condition. They conclude that during task learning, new representations are established, as measured by the burst sequence content. They do additional analyses of the sequence clusters by assessing the spatial informativeness (SI) of each sequence cluster. Over learning, they find an increase in clusters that are spatially informative (clusters that tend to occur in specific locations). Finally, they analyzed the SI clusters in a similar manner to SI cells and classified them as task phase selective sequences (TSs) and goal arm selective sequences (GSs), and did some further analysis. However, they themselves conclude that the frequency of TSs and GSs is limited (I believe because most sequence clusters were non-SI - the authors can verify this and write it in the text?). In the discussion, they say, "In addition to GSs and TSs, we found that most of the recurring sequences are not related to behavior".
  
  (3) As an alternative to analyzing individual cells and sequences of individual cells, they then look for trajectory replay using Bayesian population decoding of location during bursts. They analyze TS bursts, GS bursts, and non-SI bursts. They say "we found correlations of decoded position with time bin (within a 500 ms burst) strongly exceeding chance level only during outward and reward phase, for both GSs and TSs (Fig 4H)." Figure 4H shows distributions indicating statistically significant bias in the forward direction (using correlations of decoded location versus time bin across 10 bins of 50 ms each within each 500-ms burst). They find that the Outward trajectories appear to reflect the actual trajectory during running itself, so they are likely not replay. But the sequences at the Reward are replay as they do not reflect the current location. Furthermore, replay at the Reward is in the forward direction (unlike the reverse replay at Reward seen in the hippocampus), and this replay is only seen in the learned and not the learning condition. At the same time, they find that replay is not seen during odor Sampling, from which they conclude there is no evidence of replay used for planning. Instead, they say the replay at the Reward could possibly be for evaluation during the Reward phase, though this would only be for the learned condition. They conclude "Together with our finding of strong changes in sequence expression after learning (Figure 3E) these findings suggest that a representation of task develops during learning, however, it does not reflect previous network structure." I am not sure what is meant here by the second part of this sentence (after "however ..."). Is it the idea that the replay represents network structure, and the lack of Reward replay in the learning condition means that the network structure must have been changed to get to the learned condition? Please clarify.
  
  This study provides valuable new information about the evolution of mPFC activity during the learning of an odor-based 2AFC T-maze-like task. They show convincing evidence of changes in single-cell tuning, population sequences, and replay events. They also find novel forward replay at the Reward, and find that this is present only after the animal has learned the task. In the discussion, the authors note "To our knowledge, this study identified for the first time fast recurring neural sequence activity from 1-p calcium data, based on correlation analysis."
  
  (1) There are some statements that are not clear, such as at the end of the introduction, where the authors write, "Both findings suggest that the mPFC task code is locally established during learning." What is the reasoning behind the "locally established" statement? Couldn't the learning be happening in other areas and be inherited by the mPFC? Or are the authors assuming that newly appearing sequences within a 500-ms burst period must be due to local plasticity? I have also pointed out a question about the statement "however, it does not reflect previous network structure" in (3) above.
  
  (2) The threshold for extracting burst events (0.5 standard deviations, presumably above the mean, but the authors should verify this) seems lower than what one usually sees as a threshold for population burst detection. What fraction of all data is covered by 500 ms periods around each such burst? However, it is potentially a strength of this work that their results are found by using this more permissive threshold.
  
  Review 3
Visit annotations in context

Tags

Summary

Review 2

Review 3

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2025.03.07.642001v2
www.biorxiv.org www.biorxiv.org

JAX Animal Behavior System (JABS): A genetics informed, end-to-end advanced behavioral phenotyping platform for the laboratory mouse

2
1. Public_Reviews 22 Jul 2025
  
  in eLife
  
  eLife Assessment
  
  This important study presents JABS, an open-source platform that integrates hardware and user-friendly software for standardized mouse behavioral phenotyping. The work has practical implications for improving reproducibility and accessibility in behavioral neuroscience, especially for linking behavior to genetics across diverse mouse strains. The strength of evidence is convincing, with validation of key platform components, although incomplete methodological details and limited documentation, particularly around pose estimation and classifier generalizability, currently limit its interpretability and broader adoption.
  
  Summary
2. Public_Reviews 22 Jul 2025
  
  in eLife
  
  Reviewer #1 (Public review):
  
  Summary:
  
  This manuscript provides an open-source tool including hardware and software, and a dataset to facilitate and standardize behavioral classification in laboratory mice. The hardware for behavioral phenotyping was extensively tested for safety. The software is GUI-based, facilitating the usage of this tool across the community of investigators who do not have a programming background. The behavioral classification tool is highly accurate, and the authors deposited a large dataset of annotations and pose tracking for many strains of mice. This tool has great potential for behavioral scientists who use mice across many fields; however, there are many missing details that currently limit the impact of this tool and publication.
  
  Strengths:
  
  (1) There is software-hardware integration for facilitating cross-lab adaptation of the tool and minimizing the need to annotate new data for behavioral classification.
  
  (2) Data from many strains of mice were included in the classification and genetic analyses in this manuscript.
  
  (3) A large dataset was annotated and deposited for the use of the community.
  
  (4) The GUI-based software tool decreases barriers to usage across users with limited coding experience.
  
  Weaknesses:
  
  (1) The authors only report the quality of the classification considering the number of videos used for training, but not considering the number of mice represented or the mouse strain. Therefore, it is unclear if the classification model works equally well in data from all the mouse strains tested, and how many mice are represented in the classifier dataset and validation.
  
  (2) The GUI requires pose tracking for classification, but the software provided in JABS does not do pose tracking, so users must do pose tracking using a separate tool. Currently, there is no guidance on the pose tracking recommendations and requirements for usage in JABS. The pose tracking quality directly impacts the classification quality, given that it is used for the feature calculation; therefore, this aspect of the data processing should be more carefully considered and described.
  
  (3) Many statistical and methodological details are not described in the manuscript, limiting the interpretability of the data presented in Figures 4,7-8. There is no clear methods section describing many of the methods used and equations for the metrics used. As an example, there are no details of the CNN used to benchmark the JABS classifier in Figure 4, and no details of the methods used for the metrics reported in Figure 8.
  
  Review 1
Visit annotations in context

Tags

Summary

Review 1

Annotators

Public_Reviews

URL

biorxiv.org/content/10.1101/2022.01.13.476229v3

Public_Reviews

Annotations: 10,000

Joined: March 17, 2021

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators