2,339 Matching Annotations
  1. Aug 2025
    1. Doesn't delete files from the destination. If you want to also delete files from destination, to make it match source, use the sync command instead.

      rclone copy
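
      A minimal illustration of the difference (paths hypothetical):

      rclone copy /data/photos remote:photos   # adds/updates files at the destination, never deletes there
      rclone sync /data/photos remote:photos   # makes the destination match the source, deleting extra files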

    1. researchers don’t consider AI today to be especially useful in guiding the scientific process,

      I see full sentences copied verbatim from this reference into the current article.

      Are these also being written by AI?

    2. have access to a vast corpus of high-quality open-access papers

      This must be a mistake - are there really that many open-access papers?

    1. if all you had done was divert money from another charity about which another equally motivated fundraiser was equally passionate.

      Interesting. What other aspects would make it positive-sum? What if non-charity money were brought in? Even if only a tiny amount of outside money came in, this would be a positive-sum gain for charity/humanity (assuming charity is a good use of money).

    2. lawyers

      Are lawyers not adding to human welfare? On a basic level they do, right?

      I guess apart from the basic functions of keeping the machinery of law running, sophisticated lawyering only serves a zero-sum role, pitting one client's interest against another's.

    1. If the only way to make gains in the stock market was for someone else to take a loss, then the stock market wouldn't be able to go up.

      What does it mean though for a "stock market" to go up?

    2. Over time, trading gains outweigh trading losses for investors as a group.

      If this were zero-sum, though, could the value be accumulating at the loss of some other entity that's not in the stock market, possibly customers of the actual products?

      But one could argue that customers paid for whatever value they got from the product, so that is a positive-sum transaction too.

    1. feel more intuitive, powerful, and cohesive thanks to its heavily customized take on GNOME, adding custom tools and extensions. It's also one of the first distros to offer NVIDIA drivers out of the box, making it a hassle-free option for NVIDIA GPU owners—such as myself.

      Pop!_OS

  2. Jul 2025
    1. While these genes would be classified as paralogs within individual genomes [32], at the population level they may be more accurately described as metaparalogs

      Doesn't ortholog already describe this in different species/sub-types?

    1. Due to higher relative abundances of viruses on skin in some cohorts of IEIs (Tirosh et al., Nat. Med. 2018 [16]; Blaustein et al., Cell Rep. Med. 2023 [17]), we also included targeted 16S rRNA gene amplicon sequencing (n = 534 samples) to analyze bacterial communities due to the risk of recurrent bacterial infections

      Was 16S more usable then?

    1. in a series that are expected to have similar communities (i.e. longitudinal time series or cross-sectional studies where a significant portion of the strains are shared across samples

      What kinds of cross-sectional studies would fit here? Does this one qualify?

      A cross-sectional study of the gut metagenomes of 10 dairy farm workers and 6 community controls

    1. In our case, we didn't explicitly mark anything else as an output, so there's nothing else there.

      Where do you mark things as output?
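
      For reference, a minimal DSL2 sketch (process and file names hypothetical, in the style of the training material): outputs are declared in the process's output: block, and only declared values/paths are emitted.

      process sayHello {
          input:
          val greeting

          output:
          path 'output.txt'   // only what is declared here becomes a process output

          script:
          """
          echo '$greeting' > output.txt
          """
      }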

    1. functional dark matter

      Read this good rant against the use of “dark matter” in biology by Murat Eren:

      The dark matter in physics is a place holder for a not-yet-characterized form of matter that is distinct from ordinary matter

      On the other hand, the dark matter in microbiology describes microbes that are right in front of us. We can lyse them, we can see them under our microscopes, we can get pieces of their genomes sequenced, and we can occasionally cultivate them.

      ... stick a swab in your mouth and reconstruct a genome or cultivate a member from the “dark matter” of the human oral cavity.

    1. On the other hand, the dark matter in microbiology describes microbes that are right in front of us. We can lyse them, we can see them under our microscopes, we can get pieces of their genomes sequenced, and we can occasionally cultivate them.

      I think the core argument is that it is far too easy to bring something to light from the "dark matter" of biology with just a few sequencing runs and some clever analysis.

      So the analogy to dark matter in physics isn't justified.

    1. distant proteins with essentially no sequence homology may share the same catalytic residues

      If the definition of catalytic residues is generous enough to include even 2-3 residues, is there a likelihood that random, evolutionarily unrelated proteins might share such residues by chance?

      This evolutionary argument rests on very thin evidence and will be hard to prove; you would need to show that the same connection does not occur in random proteins, or in a curated set of proteins/domains/motifs that are definitely not related evolutionarily.

    2. This approach can also be reversed to identify novel protein functions based on unusual catalytic residues.

      This application is definitely more defensible since there is a tangible way to test the hypothesis when looking for proteins of related functions

    3. Nice article, but some of the arguments are somewhat vague. It would benefit from an enhanced discussion of how the evolution would occur, or a concrete example of how these distantly related protein fragments are connected through evolution.

    4. hinting at a potential evolutionary trajectory

      Is the tower height an easily evolvable feature of these proteins, though? (Quick thoughts, didn't read the reference here.) I bet looking at other/unrelated proteins for arbitrarily defined features such as this tower height could give you a lot of confounding hypotheses.

    1. (Cormode, 2011).

      Read this for an example of what sketching means.

      This blog would benefit from a simple example illustration for each of these paradigms! A quick attempt for sketching follows below.
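
      In plain Groovy (staying close to the Nextflow/Groovy used elsewhere in these notes), a toy count-min sketch as a concrete sketching example; sizes and hash seeds are arbitrary choices, not from Cormode (2011). It answers approximate frequency queries in fixed memory and only ever overestimates:

      class CountMinSketch {
          int width, depth
          long[][] counts
          int[] seeds

          CountMinSketch(int w, int d) {
              width = w; depth = d
              counts = new long[depth][width]
              seeds = (1..depth).collect { (int) (it * 2654435761L) } as int[]
          }

          private int bucket(Object x, int row) {
              Math.abs((x.hashCode() ^ seeds[row]) % width)   // one cheap hash per row
          }

          void add(Object x)      { (0..<depth).each { counts[it][bucket(x, it)]++ } }
          long estimate(Object x) { (0..<depth).collect { counts[it][bucket(x, it)] }.min() }
      }

      def cms = new CountMinSketch(1024, 4)
      ['a', 'b', 'a', 'c', 'a'].each { cms.add(it) }
      assert cms.estimate('a') >= 3   // may overestimate due to collisions, never underestimates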

    2. algorithms are allowed a single or small number of passes over the data; an excellent tutorial-style overview is by Muthukrishnan (2003) .

      Streaming

    1. consider an alignment-based approach like Hostile.

      I thought that alignment-based approaches are computationally expensive compared to mapping-based ones? How does Hostile with minimap2 do this better than Kraken2?


    1. might delay plasmid evolution

      Hmm, does the segregation really affect the diversity pool as much if we consider all copies of the plasmid present in the whole community as a single pool?

  3. wwood.github.io
    1. It is currently aimed at the analysis of metagenomes sequenced using Illumina short read technology.

      Does this work with long-reads as well?

      (Todd) likely not, due to indels in long reads and other issues we had to fight with for SeqScreen-Nano etc.

    1. ‘SingleM’, which estimates community composition using conserved regions within universal marker genes.

      By "conserved regions" do you mean regions that are not identical but have some variation that you can use for taxonomic profiling?

    1. meta, fastq ->

      Something seems wrong: this map can't take two closure parameters here. Getting this Groovy error:

      ERROR ~ Invalid method invocation `call` with arguments: /home/pbk1/practice-nextflow/training/hello-nf-core/greetings.csv (sun.nio.fs.UnixPath) on _closure3 type
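
      A minimal sketch of why (channel contents hypothetical): a two-parameter closure only works when each channel element is a tuple.

      // works: each element is a [meta, fastq] pair, so two closure parameters destructure it
      Channel.of( [ [id: 'sample1'], file('sample1.fastq') ] )
             .map { meta, fastq -> meta.id }

      // the failing case: Channel.fromPath emits bare paths, so a two-parameter
      // closure cannot be invoked with a single UnixPath (the error above)

      // fix: match the closure arity to the element shape
      Channel.fromPath('greetings.csv')
             .map { f -> f.getName() }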

    1. When a process is invoked in a workflow, it must be provided a channel for each channel in the process input section

      Are the inputs optional?
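
      The quote suggests not. A sketch, reusing the hypothetical sayHello process from the earlier example:

      workflow {
          greetings_ch = Channel.of('Hello', 'Bonjour')
          sayHello(greetings_ch)   // one argument per declared input channel; none may be omitted
      }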

    1. SYNBIO AUCTION HANDBOOK (22.4.19). Made with ❤ for D.A.V. Public School, Velachery. iGEM 2019 SASTRA Team - Human Practices #1

      (from Slack chats: CR) What is everyone’s favorite resource to show to undergraduates new to synthetic biology? Bonus points for printable/written/non-video stuff.

      (GP) Not directly a resource, as this assumes some level of background has been given, but I was introduced to the different parts of a genetic construct via this game during my undergrad:

    1. Physics constrained ML models are the best of both worlds of flexible fitting (include higher order interactions that won’t be included in first principles models) and interpretability.

      • Such models take the form of ODEs with coefficient-matrix multiplications of known parameters, where the matrix is provided by a trained neural network.

      • This mixes the best of both worlds: known interactions/dependencies are enforced while the unknown coefficients are fit using black-box models. The authors did a great job finding some nice experimental data and fitting it to the models.

      Read more to figure out exactly how these models differ, and read the discussion about take-aways.

      • But how do we make sure the structure is not so rigid that it disallows higher-order interactions, etc.?
    2. NSM is more accurate than mechanistic or machine learning components on experimental datasets

      Interesting, is the constrained ML model more accurate than the unconstrained one?

    1. During the next 25 years, hundreds of scholarly articles cited the letter—in many cases overgeneralizing or omitting key details, potentially helping drive overprescription of opioids in the 1990s and contributing to the ensuing wave of overdose deaths, a 2017 analysis found.

      This is a wonderful example and some great detective work tracking the citation count of this paper.

      But is it really possible that the citations were that significant a driver of the opioid crisis? Citations could influence policy makers, who might have a larger effect. Drug makers could also ride on the scientific sentiment created by the misleading citations.

    1. I will not be arrested and deported for writing this essay. In that respect, the legal value of my citizenship remains secure. But what that citizenship is worth, on a deeper level, feels imperiled.

      Interesting meta-comment. Might be relevant when someone dismisses criticism by pointing to the more limited freedom to voice it in other countries such as India.

    1. Nice post arguing that you shouldn't reach for an AI agent as the first thing. Start with simpler LLM and RAG systems that are easier to debug.

      When people say "agent," they mean that last step: the LLM output controls the workflow. Most people skip straight to letting the LLM control the workflow without realizing that simpler patterns often work better. Using an agent means handing control to the LLM. But unless your task is so dynamic that its flow can’t be defined upfront, that kind of freedom usually hurts more than it helps. Most of the time, simpler workflows with humans in charge still outperform full-blown agents.

    1. nextflow pull nf-core/demo Nextflow will pull the pipeline code, meaning it will download the full repository to your local drive.

      This is similar to git pull; Nextflow downloads the full pipeline repository for you.

      Note this works because nf-core/demo is a pipeline hosted in the nf-core organization. For individual modules from nf-core, you instead need to use nf-core modules install ..
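
      Roughly (module name picked as an arbitrary example):

      nextflow pull nf-core/demo       # clones the whole pipeline repo into $NXF_HOME/assets
      nf-core modules install fastqc   # copies a single module into modules/nf-core/ of your pipeline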

    1. Many binning methods have been developed

      Overview of binning methods in this paragraph. The argument for using deep learning is not very persuasive to me:

      • methods use composition (k-mers) or abundance (coverage), or the two combined
      • hybrid models are "superior"
      • integrating such different features efficiently is not easy, so enter the black-box deep learning stuff

    1. “PacBio CCS” datapoint was taken from a 2019 publication that does not represent today’s performance. For example, Karst et al. (2021) described HiFi sequencing of full-length (4.4 kb) rDNA amplicons of the same sample at 99.9993% (Q51.5) accuracy, almost two orders of magnitude greater than the previous study.

      This is a neat way to show improving technologies: overlay them onto famous benchmarking figures from previous publications, and follow the same methodology as the published papers so nobody can accuse you of using biased comparisons to show your method in a better light.

    1. We would like to thank E. Simon and J. Silberg for their helpful discussions of this work.

      Very curious to see if this is Joff Silberg, Rice University?

    2. Interesting paper using multiple AI agents for multi-step reasoning using interdisciplinary tools to develop binders to new COVID variants

    3. requires reasoning across diverse fields from biology to computer science

      I doubt there is much computer science expertise involved in using already-existing software?

    4. Virtual Lab consists of an LLM principal investigator agent guiding a team of LLM agents with different scientific backgrounds

      I wonder how such a multi-agent framework compares to a single large agent?

    1. shows that the internal organization of concepts in the AI model corresponds, at least somewhat, to our human notions of similarity.

      This could be confirmation bias, since you are likely only flagging/commenting on things that already match the human notion of similarity.

    2. measure a kind of "distance" between features based on which neurons appeared in their activation patterns

      Is this a measure of shared neuron subsets within features?

    1. All non-mandatory command-line tool non-file arguments MUST be provided as a string via the $task.ext.args variable.

      Explanation from Seqera AI

      Why this convention exists:

      Flexibility: Users can easily customize module behavior without modifying the module code itself
      Separation of concerns: Module logic stays separate from parameter configuration
      Reusability: The same module can be used with different parameters across different pipelines
      Consistency: All nf-core modules follow the same pattern for optional arguments
      

      What goes in ext.args vs. input channels:

      Input channels: Mandatory non-file arguments that are essential for the tool to function (e.g., required modes, essential parameters)
      ext.args: Optional flags, parameters with defaults, or any non-essential command-line options
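
      A minimal sketch of the pattern (tool name and flags hypothetical): the module reads task.ext.args, and the pipeline sets it in conf/modules.config without touching the module code.

      // inside the module's script block
      script:
      def args = task.ext.args ?: ''
      """
      mytool $args --input $reads --output out.txt
      """

      // conf/modules.config on the pipeline side
      process {
          withName: 'MYTOOL' {
              ext.args = '--min-quality 20 --verbose'
          }
      }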
      
    1. (reading in progress) Tool for bias correction (what kinds?) in microbiome datasets.

      • Older tools for batch correction use the outcome variable (means?), hence risk overfitting, and are non-interpretable.

      Questions

      • How is this ML model interpretable?

      • How do you have enough data to learn factors for each microbe and each batch

      • How does this fit in with Amy Willis’ framework for unobserved taxa etc.?

    1. Web application that comprehensively determines the composition of a shotgun sequence data set. It should take about 10 seconds to process an uploaded sample.

      Composition => a very high-level overview of what % of reads belong to these categories (human, 8 animals, microbes, plants), and approximate depth.

      • How is it so fast? Do you do pre-processing/cleanup and QC of the reads before using sourmash-like tools for profiling?
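
      On the speed question, one guess at the sketch-based profiling step (standard sourmash CLI; the database name is hypothetical):

      sourmash sketch dna -p k=31,scaled=1000 reads.fastq.gz -o reads.sig   # compress reads into a tiny signature
      sourmash gather reads.sig refs.sbt.zip                                # match the signature against a reference database
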
    1. anytime a justice writes separately, the question is always why: Who is it for, and what does the justice hope to accomplish by voluntarily doing extra work?

      Each justice has a different gallery to play to? That seems to be the summary.

    1. A central bank able to control domestic interest rates is a sufficient condition to allow a government to freely pursue countercyclical fiscal policy with no danger of a runaway increase in the debt ratio.

      Interesting. Since the central bank is supposed to be independent from the executive/government, are you relying on fiscal and monetary policy decisions not bleeding into each other?

      This could only work if the fiscal policy decisions proposed by MMT that would increase the debt:GDP ratio don't affect prices or other market features, unless the state actively intervenes in the monetary policy of the central/reserve bank. I think the mandate of the central bank is to keep inflation under control and keep a check on unemployment?

    1. Thankfully, the development of AI technologies, especially Large Language Models (LLMs) [8, 9, 10] with strong reasoning, adequate knowledge reserve and excellent coding capabilities [11], is reshaping the paradigms and precepts of how people leverage bioinformatics data.

      introduction

    1. Provided the manuscript, FuncFetch extracted data such as species information, enzyme names, sequence identifiers, substrates, and products, which were subjected to extensive quality analyses
    2. identified multiple extraction errors including incorrect associations, nontarget enzymes, and hallucinations, which highlight the need for further manual curation
    1. Performing these tasks requires the installation, integration, and tuning of multiple software packages, which is not trivial even for groups with extensive bioinformatics expertise. As a result, most studies rely on ad hoc pipelines based on custom scripts and intensive manual analyses, making it difficult to reproduce or extend analysis results and hampering collaboration.

      useful text

    2. that the choice of assembler has a strong influence on the final assembly results and choosing the ideal assembler requires taking into account both contiguity and correctness

      This is something the Omi workflow could help with if there are clearly defined rules in the field. Where do we get such rules from? Maybe review papers? This might not be fully automatable, but clearly defined (expert-curated) rules could be implemented in Python and interact with user prompts via an LLM.

    3. level of improvement provided by MetAMOS over other assembly tools is highly dependent on the specific characteristics of the dataset being assembled
      • library-size re-estimation within MetAMOS (it was incorrect in the tongue dorsum dataset, so re-estimation helped a lot)
      • Number of regions of genomic variation (helps real datasets that have a high number of these since scaffold building pipeline is good)
    4. aggressive assembly approaches sometimes result in more contiguous assemblies, but often introduce errors of the most severe kind (chimeras)

      There are trade-offs. Could we ask the user whether the tool should err toward more contiguous assemblies or fewer chimeras, with a set percentage? Or summarize results from both and ask the user to choose?

    5. allowing scientists to focus their attention on individual components without having to re-implement all the components of a metagenomic pipeline
    6. most studies rely on ad hoc pipelines based on custom scripts and intensive manual analyses, making it difficult to reproduce or extend analysis results and hampering collaboration
    7. MetAMOS represents an important step towards fully automated metagenomic analysis, starting with next-generation sequencing reads and producing genomic scaffolds, open-reading frames and taxonomic or functional annotations
    1. the advantages of computational pipelines over ad hoc scripts, even for simple tasks, are all more apparent with increasingly complex datasets and the use of parallel processing.

      Why pipelines vs ad hoc scripts:

      • dependency tracking (statically inferred DAGs)
      • rules reused for many files (parallelization)
      • data tracking (rapid development on subsets of the pipeline, e.g. changing parameters; avoids duplicate work when resuming workflows)

    2. Automatic data tracking in pipelines allows only the out-of-date parts of the analyses to be rescheduled and recalculated, with minimal redundancy
    1. In contrast to Pwrake and GXP Make, Snakemake does not rely on any password-less SSH setup or custom server processes running on the cluster nodes
    1. Despite the short history of bioinformatics, several software efforts have evolved into a pipeline model

      history of toolchaining for bioinformatics

    2. running analysis in a serial rule-dependent fashion (workflow) and (2) the ability to run these tasks in parallel where possible (high-throughput).
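
      To make the rule-dependent + high-throughput point concrete, a minimal Snakemake-style rule (file names and tools hypothetical): the DAG is inferred from the input/output patterns, and independent samples run in parallel.

      rule align:
          input: "reads/{sample}.fastq"
          output: "aligned/{sample}.bam"
          shell: "minimap2 -a ref.fa {input} | samtools sort -o {output}"
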
    1. A clear statement of the nature of the intrusive judicial inquiry a parent company could be subjected to in such cases was provided by Lord Bingham when the litigation reached the House of Lords as follows:

      Intrusive Judicial Inquiry into Parent Company Liability

      Lord Bingham’s statement in the House of Lords highlights the level of scrutiny that a parent company may face in transnational tort claims. Courts assess whether the parent company played an active role in controlling the subsidiary’s operations, particularly in matters of health, safety, and environmental standards. This includes an inquiry into:

      • Corporate Oversight – the extent to which the parent company exercised control over subsidiaries.
      • Knowledge and Responsibility – what the parent company’s directors and employees knew or ought to have known about the subsidiary’s activities.
      • Decision-Making and Action – whether the parent company took positive steps to ensure compliance or failed to act, leading to harm.
      • Documentary Evidence – courts examine internal company records, including board meeting minutes, reports from directors and employees, and correspondence related to oversight of the subsidiary.

      Jurisdiction and Access to Justice

      The House of Lords upheld jurisdiction in the UK by applying the Connelly principle, which states that English courts should hear cases if there is a real risk that justice would not be accessible in the foreign jurisdiction. This was based on:

      • the complexity of the litigation, making it difficult to fund and pursue in South Africa;
      • the need for extensive corporate records, which were primarily located in the UK parent company’s offices.

      Precedents in Parent Company Liability

      By 2001, English courts had ruled on three key cases affirming parent company liability, establishing that the legal principle was not controversial and that UK courts should retain jurisdiction on forum non conveniens grounds when justice could not be obtained abroad.

      Impact on Transnational Litigation

      This judicial approach set an important precedent, paving the way for future cases like Chandler v Cape (2012) and Okpabi v Shell (2021), reinforcing the principle that parent companies may owe a duty of care to individuals harmed by the actions of their foreign subsidiaries.

  4. academic.oup.com
    1. “When I began my work, jazz was a stunt,” was Duke Ellington’s later critique of some of this music [11], but the slick professionalism of the Harlem stride style also served to expand the audience for African American music in the face of discrimination from cultural elites, both within and without the black community, and despite a severe economic downturn.

      for final

    1. within the internal perspective. They are first-order claims about what is right or wrong in specific counterfactual conditions, and can thus be glossed in expressivist terms. This is underpinned by the fact that our moral attitudes respond to natural features of the world. We judge that kicking dogs is wrong because of the pain they suffer when kicked, not because we happen to disapprove of such behaviour. Quasi-realists can therefore hold that kicking dogs remains wrong in worlds at which our counterparts approve of it, for o


    1. Moreover, just like cognitive disinhibition, schizotypy is correlated with creativity, verbal and visual, with one caveat: Desiring isolation, being introvertive, lacking a capacity for pleasure—these do not predict creativity. They don’t make you more creative, according to the studies.

      ??

    2. with no regard for the truth of the assertion. To others and to himself, he willfully defied reality. He’d reverse himself, too. If particular lines of argument failed to persuade, he’d advocate others. He’d throw people off by adopting their position as his own. He’d say an idea was crazy, then a week later call it great.

      this sounds horrible

    1. Run a generative AI chatbot on Jetson Orin Nano Super Developer Kit. This chatbot features Ollama with Open WebUI, a widely used, open-source, chatbot server interface that connects to locally running LLMs.

      Deploying Omi: Open WebUI could be used to run a local LLM through API calls on the T8 server?
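
      A rough sketch of the wiring, assuming Docker and Ollama are installed on the server (ports and the host-gateway detail may vary by setup):

      ollama serve &                                       # serves local models on :11434
      docker run -d -p 3000:8080 \
          -e OLLAMA_BASE_URL=http://host.docker.internal:11434 \
          ghcr.io/open-webui/open-webui:main               # Open WebUI pointed at Ollama's API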

    1. npm is a couple of things. First and foremost, it is an online repository for the publishing of open-source Node.js projects. Second, it is a CLI tool that helps you install those packages and manage their versions and dependencies.


    1. many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of domain objectives, and the diversity of the constraints
    2. we present a comprehensive survey on domain specification techniques for large language models, an emerging direction critical for large language model applications
    3. LLMs, significantly outperforming smaller models in understanding and generating human-like text,have emerged as a promising AI research trend
    4. domain specialization of Large Language Models (LLMs) is defined as the process of customizing general-purpose LLMs according to specific domain contextual data, augmented by domain-specific knowledge, optimized by the domain’s objective, and regulated by domain-specific constraints

      for introduction

    1. sketching, a popular data compression technique, can serve as an efficient adaptation strategy for LLMs while avoiding low-rank assumptions
    1. predefined workflows and rigid models, SpatialAgent employs adaptive reasoning and dynamic tool integration, allowing it to adjust to new datasets, tissue types, and biological questions
    2. Key modules. The action module (left) executes tasks such as retrieving reference datasets, converting gene names, verifying ligand–receptor interactions using existing databases, processing data with established software packages (e.g., numpy) or generating and executing custom code, while reasoning over and aggregating information from multiple sources
    1. open family of heterogeneous reasoning models that deliver exceptional reasoning capabilities, inference efficiency, and an open license for enterprise use.
    2. performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 while offering superior inference throughput and memory efficiency
    3. using neural architecture search from Llama 3 models for accelerated inference, knowledge distillation, and continued pretraining, followed by a reasoning-focused post-training stage
    1. anvi’o [75] was used to profile and visualize the different Turicibacter strain DNA sequences to locate putative bile salt hydrolase and 7α-HSDH homologs in contig groups, generate variability profiles, and measure gene coverage and detection statistics.
    1. Yet, taxonomic insights offer limited utility to understand functional drivers of biological systems, a pinnacle desire that brings together many corners of microbiology
    1. introduce Lyra, a subquadratic architecture for sequence modeling, grounded in the biological framework of epistasis for understanding sequence-to-function relationships
    1. we propose the use of a pangenome graph, built from assembly graphs produced by assembling short reads of the same sample with different assemblers
    2. highlights similarities between contigs from different assemblies while retaining information on contigs that appear only in one of the input assemblies
    1. present the first assembly-free and mapping-free approach for augmenting an existing pangenome graph using unassembled long reads from an individual not already present in the pangenome.
    1. agentic technology uses tool calling on the backend to obtain up-to-date information, optimize workflows and create subtasks autonomously to achieve complex goals.
    2. ability to store past interactions in memory and plan future actions encourages a personalized experience and comprehensive responses