20 Matching Annotations
  1. Last 7 days
    1. 320 neurons to 10,420 latent features

      What was the motivation for the value 10,420? Is the ability to extract features pretty consistent (not the actual weights, obviously, but the behavior of being able to extract meaningful features) as long as the space is large enough?

    2. visualizations primarily focus on features from the fourth layer

      Was the fourth layer chosen arbitrarily or did it have any particularly nice properties for visualization?

    1. assuring that the discriminator can well capture the sequence-function relationship

      Is the discriminator learning something much more complicated than, say, how to identify MDH domains?

    2. convolutional neural network (CNN)-based protein discriminator

      This seems to work well based on the results; were there previous works/concepts that led to this choice of architecture for the discriminator?

  2. Dec 2024
    1. Hyperparameter optimization was performed using a hyperband tune

      How computationally intensive was it to solve for all of the hyperparams for this model?

  3. Nov 2024
    1. masking ratio 0.15

      Do different values for percent of tokens masked have much of an impact on model performance?

    2. average cluster size is just 2.2

      Was there much variation around this?

    1. Guider1 and Guider2, were designed to improve the network's ability to distinguish between different sequence types. Guider1 consists of a multi-head self-attention mechanism with 8 heads and two fully connected layers, while Guider2 is a Gated Recurrent Unit (GRU) with 256 neurons

      Sorry if I missed this, but what was the motivation for choosing these particular discriminator models? They seem very reasonable given the results, but I'm curious how these two types of models were chosen based on the structure of the initial problem?
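
      Purely for my own notes, a minimal PyTorch sketch of how I'm reading these two guiders — the 8 attention heads, two fully connected layers, and 256 GRU units come from the quoted text, but the embedding size, pooling, and scalar output head are my assumptions:

      ```python
      import torch
      import torch.nn as nn

      EMBED_DIM = 128  # assumed token embedding size; not stated in the quote

      class Guider1(nn.Module):
          """Multi-head self-attention (8 heads) followed by two fully connected layers."""
          def __init__(self, embed_dim: int = EMBED_DIM):
              super().__init__()
              self.attn = nn.MultiheadAttention(embed_dim, num_heads=8, batch_first=True)
              self.fc1 = nn.Linear(embed_dim, embed_dim)
              self.fc2 = nn.Linear(embed_dim, 1)

          def forward(self, x):                 # x: (batch, seq_len, embed_dim)
              h, _ = self.attn(x, x, x)         # self-attention over the sequence
              h = h.mean(dim=1)                 # mean pooling is my assumption
              return self.fc2(torch.relu(self.fc1(h)))

      class Guider2(nn.Module):
          """Gated Recurrent Unit with 256 hidden units."""
          def __init__(self, embed_dim: int = EMBED_DIM):
              super().__init__()
              self.gru = nn.GRU(embed_dim, hidden_size=256, batch_first=True)
              self.out = nn.Linear(256, 1)

          def forward(self, x):                 # x: (batch, seq_len, embed_dim)
              _, h_n = self.gru(x)              # h_n: (1, batch, 256), last hidden state
              return self.out(h_n[-1])
      ```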

    1. recombination, or a haplotype switch, occurs between two consecutive vertices \(a_i.u\) and \(a_{i+1}.u\) in \(P\) if \(a_i.h \neq a_{i+1}.h\)

      Would an "ideal" path be one where a single haplotype path crosses every \(a_{i}.u\) vertex? Is that something that would generally exist for a given sample? And if present, would that be the best path?
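
      To check that I'm parsing the definition correctly, a tiny sketch where each step \(a_i\) of the path is modeled as a (u, h) tuple — the tuple representation is my assumption, not the paper's data structure:

      ```python
      # Each step a_i on the path P is modeled here as a (u, h) tuple:
      # u = the vertex, h = the haplotype covering it at that step.
      def count_switches(path):
          """Number of positions where consecutive steps use different haplotypes."""
          return sum(1 for a, b in zip(path, path[1:]) if a[1] != b[1])

      # A path covered end-to-end by one haplotype has zero switches, which is
      # what I'm reading as the "ideal" case in my question above.
      P = [("v1", "hap1"), ("v2", "hap1"), ("v3", "hap2"), ("v4", "hap2")]
      print(count_switches(P))  # -> 1 (switch between v2 and v3)
      ```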

    2. \(a_i.u\)

      Just to verify I'm understanding this notation, \(a_{i}.u\) is being used like an accessor for components of the tuple \(a_{i}\)?

  4. Oct 2024
    1. \(PED(X, Y) = \frac{1}{2}\,\mathbb{E}\left[\inf_{\pi}\left(\lVert d(X, X') - d(Y, X')_{\pi} \rVert_p\right)\right] + \frac{1}{2}\,\mathbb{E}\left[\inf_{\pi}\left(\lVert d(Y, Y') - d(X, Y')_{\pi} \rVert_p\right)\right]\)

      Sorry if this is clear, but I'm a little unclear on the notation. Is X the input data (so empirical results from a scRNA-seq experiment) and Y the generated dist? If so then are X' and Y' subsets of the respective distributions?

  5. Jul 2024
    1. \(Precision_s\)

      In this case does the s denote a tuning of how results are classified when calling true and false positives and negatives? Or a weight term in the F-measure score itself?

    2. \(x_{-i,j}(t)\), which is the expression of gene j (the expression of all other genes is masked)

      Is this a vector with only a value at position j? So a vector of size N with only position j having a value set, hence being different from \(x_j(t)\)?
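
      A toy example of what I think this masking means (my reading, not the paper's code): a full-length expression vector where only gene j keeps its value and every other gene is zeroed out:

      ```python
      import numpy as np

      x_t = np.array([0.7, 2.1, 0.0, 3.4, 1.2])  # toy expression vector at time t (N = 5 genes)
      j = 3                                       # the one gene left unmasked

      x_masked = np.zeros_like(x_t)
      x_masked[j] = x_t[j]
      print(x_masked)  # -> [0.  0.  0.  3.4 0. ], a length-N vector rather than the scalar x_j(t)
      ```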

  6. May 2024
    1. The black arrow highlights the longest barcode.

      It might be easier to see if the longest barcode was in a different color or had a dashed line overlaid on top of it.

    2. The green and light blue clusters are on one side, and the other colors (especially the dark blue and magenta) are on the other side of the hole.

      It's hard to tell exactly which part of the structure is being referenced (at least for me). It might be helpful to add an annotation like a circle to show which area is being discussed.

    1. \(x_{i,j} - \tilde{x}\)

      The indexing of \(x_{i,j}\) doesn't seem to refer to a single element but to a pair \((u_{i,j}, s_{i,j})\). Is this loss function calculating the difference between the spliced and unspliced differences combined?

  7. Feb 2024
    1. if \(d_i\) is equal to 1984-01-01, then U encompasses all papers published after \(d_i\) until 1989-01-01

      This part is assuming that t has been set to 5, correct?
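
      A quick sanity check of the window arithmetic as I read it, assuming t is a number of years and the window runs from \(d_i\) to \(d_i\) plus t years:

      ```python
      from datetime import date

      d_i = date(1984, 1, 1)
      t = 5  # years; my assumption, inferred from the 1989-01-01 end date in the quote

      window_end = d_i.replace(year=d_i.year + t)
      print(window_end)  # -> 1989-01-01, matching the example in the text
      ```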

    1. scGPT v1 outperformed the scGPT model overall, raising the issue of the need for increasing the size of pre-training datasets for this task

      Wasn't scGPT v1, which outperformed scGPT, trained on a smaller pre-training dataset?

    1. The fact that this narrative captured so much attention despite a complete lack of supporting evidence prompts us to reflect on how our biases shape our interpretation of data, and how extreme differences in believing people based on where they work can lead to incorrect and harmful conclusions. Here, we are reflecting on our experiences, and we invite readers to do the same.

      Really interesting article!

      Given the impact this had, do you feel there are changes or criticisms needed around the review and publication process of the Bloom results? I'm also curious if you have any thoughts on how pre-print and open science can do a better job with contentious results and discussions around them.

  8. Oct 2023
    1. We also found associations to p53, telomere maintenance, and cell fate within 1 Mbp of our top 25 loci of interest. Our top 25 loci also have links to cancer and height or body size, though these prevalent diseases and biomarkers are of course heavily studied and consequently commonly annotated, and so we cannot know whether their appearance is simply due to their frequency

      Is this 1 Mbp in either direction of a locus of interest? Just binning the human genome by 25 points gives about 1.7% of the genome within 1 Mbp of these uniform bins. Depending on what percent of genes are associated with the traits of interest, that could be very rare or fairly common. Is there a way of viewing how impactful this result is in comparison to the size of the genome annotated as relevant to these traits?
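
      For reference, the back-of-the-envelope calculation behind the ~1.7% figure above, assuming a roughly 3 Gbp genome, a window of 1 Mbp on each side of a locus, and no overlap between windows:

      ```python
      genome_bp = 3_000_000_000    # approximate haploid human genome size (assumption)
      window_bp = 2 * 1_000_000    # 1 Mbp on each side of each locus
      n_loci = 25

      covered_bp = n_loci * window_bp          # ignores any overlap between windows
      print(f"{covered_bp / genome_bp:.1%}")   # -> 1.7%
      ```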