Nene Barry authored
Typical flow of data
Source data
- Experimental results
- Large data sets
- Manually collected data
- External
Intermediate
- Derived data
- Tidy data
- Curated sets
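
As a small illustration of the step from derived data to tidy data, the sketch below reshapes a wide table (one column per measurement) into long form with one observation per row. The sample records and the names `to_tidy`, `sample`, `variable` and `value` are hypothetical, chosen only for this example.

```python
def to_tidy(rows, id_key):
    """Reshape wide records into long (tidy) form: one observation per row."""
    tidy = []
    for row in rows:
        for key, value in row.items():
            if key == id_key:
                continue  # the identifier is repeated on every output row
            tidy.append({id_key: row[id_key], "variable": key, "value": value})
    return tidy


# Hypothetical wide table: one row per sample, one column per measurement.
wide = [
    {"sample": "s1", "height": 1.2, "weight": 3.4},
    {"sample": "s2", "height": 1.5, "weight": 3.1},
]

for record in to_tidy(wide, "sample"):
    print(record)
```

In practice a data-frame library would usually handle this reshaping; the plain-Python version only shows the idea.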
Analyses
- Exploratory
- Model building
- Hypothesis testing
Dissemination
- Manuscript, report, presentation, ...




Preserve
- Version data sets
- Backup
- Protect
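
One possible way to support versioning and protecting data sets is to record a checksum for every file, so that a later change or corruption can be detected. The sketch below assumes the data live in a single directory; the function names `sha256_of`, `build_manifest` and `changed_files` are hypothetical helpers, not part of any particular tool.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(data_dir):
    """Map each file's path (relative to data_dir) to its checksum."""
    root = Path(data_dir)
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_files(old, new):
    """Return names added, removed, or modified between two manifests."""
    names = old.keys() | new.keys()
    return sorted(n for n in names if old.get(n) != new.get(n))
```

Storing the manifest next to a backup makes it possible to check later that the restored files match the originals.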
Reproduce
- Automate your builds
- Use workflow tools (e.g. Snakemake)
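
The core idea behind automated builds can be sketched in a few lines: a target is rebuilt only when it is missing or older than one of its sources. Workflow tools such as Snakemake or Make apply this kind of rule across a whole dependency graph. The functions `is_stale` and `build` below are hypothetical names for illustration, not the API of any workflow tool.

```python
import os


def is_stale(target, sources):
    """True if the target is missing or older than any of its sources."""
    if not os.path.exists(target):
        return True
    target_mtime = os.path.getmtime(target)
    return any(os.path.getmtime(s) > target_mtime for s in sources)


def build(target, sources, action):
    """Run action(sources, target) only when the target is out of date.

    Returns True if the action ran, False if the target was up to date.
    """
    if is_stale(target, sources):
        action(sources, target)
        return True
    return False
```

Here `action` would be whatever step derives the target, for example a function that reads the source data and writes a tidy table. A real workflow tool adds what this sketch leaves out: chaining of steps, parallel execution and logging.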
Trace
- Multiple iterations
- Code versioning (Git)
Track
- Changes across multiple versions