Bioinformatics is a disorganised disaster and I am too. So I made a shell.
This is a shell-wrapper to minimize the effort needed to collect command line provenance.
This post covers a lot of ground to describe practical and human requirements.
Bioinformatics is a disorganised disaster and I am too. So I made a shell.
This is a shell-wrapper to minimize the effort needed to collect command line provenance.
This post covers a lot of ground to describe practical and human requirements.
A Containerized Data Lake
A data container + workflow spec
Thorough review of available (cloud, VM, live) image building tools for Debian. Additional ones (e.g. packer, openstack-debian-images, diskimage-builder) are mentioned in the follow up emails
TACC Scientific Apps -- Docker containers to be used on HPC
The Dat Project
A peer-to-peer data distribution service
ReproZip
a python framework for packaging experiments for publication
rr
record and replay application
FOSS service binding together Jupyter and Kubernetes (to manage docker images) to provide executable Jupyter notebooks.
An alternative to docker that Chris says is better for cluster and is designed for reproducibility