Identifying manifestations of datasets
- Short Title: Identifying manifestations of datasets
- Source Document: Principles and best practices in data versioning for all datasets big and small
- Source Document Link: https://doi.org/10.15497/RDA00042
- Publishing Organisation: RDA Data Versioning WG
- Date of Publication: 2020-01-16
- Topic: Quality control/curation
- Keywords: identification, manifestations, PID
- Addressed Stakeholders: data stewards
- Full Text: The same dataset may be expressed in different file formats or character encodings without any difference in content. Although these files have different checksums, the work they express does not differ: they are manifestations of the same work (Hourclé, 2009). From the perspective of content, it might be sufficient to identify only the work and not its manifestations. However, technical considerations such as machine actionability may merit identifying the different manifestations of a work, and their instances as items, through persistent identifiers (Razum et al., 2009).
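
The distinction between a work and its manifestations can be illustrated with a minimal Python sketch. It is not taken from the source document: the sample content, the choice of SHA-256, and the two encodings are assumptions made purely for illustration. It shows that two encodings of the same content yield different checksums, while the decoded content is identical.

```python
import hashlib


def checksum(data: bytes) -> str:
    """Return the SHA-256 hex digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


# Hypothetical dataset content (the "work"); invented for this example.
content = "station,temperature\nZürich,12.5\n"

# Two manifestations of the same work: identical content, different encodings.
utf8_bytes = content.encode("utf-8")
utf16_bytes = content.encode("utf-16-le")

# Checksums are computed over bytes, so they differ per manifestation.
manifestation_checksums = {
    "utf-8": checksum(utf8_bytes),
    "utf-16-le": checksum(utf16_bytes),
}

# Decoding both manifestations recovers the same content, i.e. the same work.
same_work = utf8_bytes.decode("utf-8") == utf16_bytes.decode("utf-16-le")

for encoding, digest in manifestation_checksums.items():
    print(encoding, digest)
print("same work:", same_work)
```

Because a checksum identifies only one byte-level manifestation, it cannot by itself show that two files express the same work. That relationship has to be recorded separately, for example by linking the identifiers of the manifestations to the identifier of the work.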