Recommendation S1.0
Status: Under development, Date: 2025/07/08 10:18, Version: 001
As data can only be embedded in semantic frameworks when it is described with rich metadata, the first step toward a standardized approach for implementing semantic resources is for data producers to enrich data with metadata - even if this may seem self-evident. Only the standardized use of metadata enables the annotation with identifiable terms from recognized controlled vocabularies, which allows machines to interpret and connect data across disciplinary and institutional boundaries. Since the needs for these metadata can vary greatly between research communities, data infrastructures, and use cases, we recommend using existing metadata schemas commonly used in each of these. In addition, there are generally applicable schemas that can be recommended.
All data producers in Helmholtz Earth & Environment should enrich their datasets with rich, standardized metadata at the time of dataset creation, submission to repositories or publication. Repositories, sensor registries and other data infrastructures should ensure that metadata is requested and archived in a standardized and structured manner. This should be done following metadata categories specified in established general or discipline-specific metadata schemas or workflows (see also I2.0 Define exchange format).
mandatory | conditional | optional | |
---|---|---|---|
Helmholtz FAIR Principle | x |
Parent: S0
Dependent: S3.0
Other: related to I2.0
A metadata schema defines the structure, content, and semantics of metadata elements used to describe a dataset. It specifies what metadata should be captured, how it should be named, and in which format it should be stored. Schemas often include controlled vocabularies and formal structures, allowing metadata to be understood both by humans and machines. Widely used schemas include DataCite Metadata Schema for citation metadata, ISO 19115 for geospatial data (hat Platz für Details, viele Abhängigkeiten, in Ausformulierungen überlegen), DCAT for XXX (behörden, flach) and Dublin Core for general resource description (bibliothekarische Metadaten/Attribute?(manche dieser Attribute Pflichtfelder bei Datacite und ISO=, aus den 70er Jahren).
Across Helmholtz Earth and Environmental sciences, metadata standards are applied in diverse repositories and infrastructures, to name just a few of them:
Providing sufficient enrichment of data with metadata forms the basis for the implementation of standardized semantic concepts.
(quality of content, limitations, interoperability, sustainability: expected future dissemination / technical availability / funding)
Bibliographic Metadata/(Administrative metadata?)
It is recommended that for the accurate and consistent identification of a resource for citation and retrieval purposes, each published dataset should be provided with the core metadata elements defined in the most up-to-date DataCite Metadata Schema (see https://schema.datacite.org/).
Community and Repository Alignment
When selecting metadata schemas, data producers should always consider the intended purpose of the metadata: