Provenance of datasets
- Short Title: Provenance of datasets
- Source Document: Principles and best practices in data versioning for all datasets big and small
- Source Document Link: https://doi.org/10.15497/RDA00042
- Publishing Organisation: RDA Data Versioning WG
- Date of Publication: 2020-01-16
- Topic: Quality control/curation
- Keywords: provenance
- Addressed Stakeholders: data stewards
- Full Text: Defining revisions and releases to record that a dataset has been derived from a precursor helps to describe its lineage, or provenance. Semantic versioning and related versioning schemes encode information about a dataset and its precursors in their release numbers. Provenance, however, can be more complex than a linear path. The information accompanying a dataset release should therefore include a description of the dataset's provenance.
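
The point that provenance need not be linear can be illustrated with a minimal sketch. It assumes a hypothetical release record that lists its direct precursors explicitly, since a version number alone cannot show that a release was derived from more than one dataset. The class `Release`, the field `derived_from`, the function `lineage`, and all dataset identifiers are illustrative assumptions and are not taken from the source document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Release:
    """Hypothetical release record; field names are illustrative only."""
    identifier: str
    version: str
    derived_from: tuple = ()  # identifiers of the direct precursors


def lineage(identifier, catalogue):
    """Collect every direct and indirect precursor of a release."""
    seen = []
    stack = list(catalogue[identifier].derived_from)
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # a precursor can be reachable along several paths
        seen.append(current)
        stack.extend(catalogue[current].derived_from)
    return seen


# Made-up catalogue: "climate" merges two precursors, so its history branches.
catalogue = {
    release.identifier: release
    for release in [
        Release("rainfall", "1.0.0"),
        Release("temperature", "2.1.0"),
        Release("climate", "1.0.0", derived_from=("rainfall", "temperature")),
        Release("climate-v2", "2.0.0", derived_from=("climate",)),
    ]
}

ancestors = lineage("climate-v2", catalogue)
```

In this sketch the version string "2.0.0" of `climate-v2` says nothing about the two source datasets behind it; that information is only available because each release carries its own precursor list.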