Compilation of Recommendations
Details
Short Title
Provenance of datasets
Source Document
Principles and best practices in data versioning for all datasets big and small
Source Document Link
https://doi.org/10.15497/RDA00042
Publishing Organisation
RDA Data Versioning WG
Date of Publication
2020-01-16
Topic
Quality control / curation
Addressed Stakeholders
data stewards
Keywords
provenance
Text
Defining revisions and releases to record that a dataset was derived from a precursor helps to describe its lineage, or provenance. Semantic versioning and related versioning schemes encode information about a dataset and its precursors in their release numbers. Provenance, however, can be more complex than a single linear path. The information accompanying a dataset release should therefore include a description of the dataset's provenance.
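The point that provenance need not be linear can be illustrated with a minimal sketch. It is not taken from the RDA recommendation: the `Release` record, its `derived_from` field, the `lineage` helper, and the dataset names are all hypothetical, chosen only to show that a release may have several precursors that a single version number cannot express.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Release:
    """Hypothetical release record; field names are illustrative only."""
    identifier: str                      # dataset name plus version number
    derived_from: tuple[str, ...] = ()   # identifiers of direct precursors


def lineage(identifier: str, releases: dict[str, Release]) -> list[str]:
    """Return every precursor of a release, nearest first (breadth-first)."""
    seen: list[str] = []
    queue = list(releases[identifier].derived_from)
    while queue:
        current = queue.pop(0)
        if current in seen:
            continue  # a precursor reachable by two paths is listed once
        seen.append(current)
        queue.extend(releases[current].derived_from)
    return seen


# Invented example data: one merged product built from two independent sources.
releases = {
    r.identifier: r
    for r in [
        Release("station-obs 1.0.0"),
        Release("station-obs 1.1.0", ("station-obs 1.0.0",)),
        Release("satellite-obs 3.2.0"),
        Release(
            "merged-product 1.0.0",
            ("station-obs 1.1.0", "satellite-obs 3.2.0"),
        ),
    ]
}

precursors = lineage("merged-product 1.0.0", releases)
```

Here the version number `1.0.0` of the merged product says nothing about its two source datasets, whereas the explicit `derived_from` links recover the full lineage. In practice such links would more likely be recorded in release metadata than in code.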