HMC Home -> HMC Hub Earth & Evironment -> Catalogue of Resources
Go to a collection of other useful resources collected by the hub
Compilation of Recommendations
Details
Short Title
Identifying manifestations of datasets
Source Documnent
Principles and best practices in data versioning for all datasets big and small
Source Document Link
https://doi.org/10.15497/RDA00042
Publishing Organisation
RDA Data Versioning WG
Date of Publication
2020-01-16
Topic
Quality control/ curation
Addressed Stakeholders
data stewards
Keywords
identification, manifestations, PID
Text
The same dataset may be expressed in different file formats or character encodings without differences in content. While these datasets will have different checksums, the work expressed in these datasets does not differ, they are manifestations of thesame work (Hourclé, 2009). From the perspective of content it might be sufficient to identify only the work, and not its manifestations, but there might be technical considerations such as machine actionability that merit a machine actionable identification of different manifestations of a work and and their instances as items through persistent identifiers (Razum et al., 2009).