Forked from R3 / school / courses

Typical flow of data

Source data
  • Experimental results
  • Large data sets
  • Manually collected data
  • External
Intermediate
  • Derived data
  • Tidy data
  • Curated sets
Analyses
  • Exploratory
  • Model building
  • Hypothesis testing
Dissemination
  • Manuscript, report, presentation, ...
Preserve
  • Version data sets
  • Backup
  • Protect
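
One way to make "version" and "protect" concrete is to record a checksum for every file in a data set, so that later changes or corruption can be detected. The following is a minimal sketch using only the Python standard library; the function names (`sha256_of`, `build_manifest`, `changed_files`) are illustrative assumptions, not part of the course material.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(data_dir: Path) -> Dict[str, str]:
    """Map each file under data_dir (as a relative POSIX path) to its checksum."""
    return {
        p.relative_to(data_dir).as_posix(): sha256_of(p)
        for p in sorted(data_dir.rglob("*"))
        if p.is_file()
    }


def changed_files(old: Dict[str, str], new: Dict[str, str]) -> List[str]:
    """List files that were added, removed, or modified between two manifests."""
    names = old.keys() | new.keys()
    return sorted(name for name in names if old.get(name) != new.get(name))
```

A manifest built when the data set is frozen can be stored next to the data and compared against a fresh one before each analysis or after restoring from backup.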
Reproduce
  • Automate your builds
  • Use workflow tools (e.g. Snakemake)
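
The core idea behind an automated build can be illustrated in a few lines: a derived file is rebuilt only when it is missing or older than the files it depends on. The sketch below is plain Python, not Snakemake, and the helper names (`is_stale`, `build`) and the `recipe` callback are hypothetical; real workflow tools add dependency graphs, parallel execution, and logging on top of this principle.

```python
from pathlib import Path
from typing import Callable, Iterable, List


def is_stale(target: Path, sources: Iterable[Path]) -> bool:
    """A target needs rebuilding if it is missing or older than any source."""
    if not target.exists():
        return True
    target_mtime = target.stat().st_mtime
    return any(src.stat().st_mtime > target_mtime for src in sources)


def build(
    target: Path,
    sources: Iterable[Path],
    recipe: Callable[[List[Path], Path], None],
) -> bool:
    """Run recipe(sources, target) only when the target is stale.

    Returns True if the recipe was executed, False if the target was up to date.
    """
    source_list = list(sources)
    if not is_stale(target, source_list):
        return False
    recipe(source_list, target)
    return True
```

Modification times are only a heuristic (copying or checking out files can change them), which is one reason to prefer an established workflow tool for real projects.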
Trace
  • Multiple iterations
  • Code versioning (Git)
Track
  • Through multiple versions
The flow of data is mostly downstream, but in practice you go back and forth between stages. This applies to all kinds of data (e.g. a financial report or a lab safety assessment).