Recommendation M1.3
[Status: Under development, Date: 2023/09/19 22:33, Version: 003]
The Helmholtz Association is determined to make their data available according to the FAIR principles, thus making it findable, accessible, interoperable and reusable. In order to achieve interoperability of datasets among various data infrastructures (DIS) within the Helmholtz Association, a common and agreed procedure to refer to people within and across the DIS is needed.
In order to be able to uniquely and sustainably identify both researchers and employees in data infrastructures and repositories in the Helmholtz Association, the respective person should always be referenced with a persistent identifier (PID) (see recommendation M0).
For the Helmholtz Association we recommend to use ORCID to refer to people and contributors to resources in data infrastructures and repositories of the Helmholtz Association wherever possible (see recommendation M1.0).
To be able to implement this measure, several activities need to be conducted by different stakeholder groups. This recommendation M1.3. calls for activity of the data stewards, repository maintainers and developers.
It is recommended that data infrastructures (data repositories, data bases) should:
mandatory | conditional | optional | |
---|---|---|---|
Helmholtz FAIR Principle | if ORCID is available |
[Precondition 1]: The ORCID Registry is available for all researchers, maintained and further developed.
Parent: M1.0
Dependent: none
Other: M1.1, M1.2
The Helmholtz Association is determined to make their data available according to the FAIR principles, thus making it findable, accessible, interoperable and reusable. In order to achieve interoperability of datasets among various data infrastructures (DIS) within the Helmholtz Association, a common and agreed procedure to refer to people within and across the DIS is needed.
In order to be able to uniquely and sustainably identify both researchers and employees in data infrastructures and repositories in the Helmholtz Association, the respective person should always be referenced with a persistent identifier (PID) (see recommendation M0).
For the Helmholtz Association we recommend to use ORCID to refer to people and contributors to resources in data infrastructures and repositories of the Helmholtz Association wherever possible (see recommendation M1.0).
To be able to implement this measure, several activities need to be conducted by different stakeholder groups. This recommendation M1.3. calls for activity of the data stewards, repository maintainers and developers.
Implementing ORCID requires updating metadata schemas and workflows to support consistent recording and validation of ORCID iDs. Synchronizing metadata with ORCID records can be technically demanding, especially when dealing with discrepancies or outdated profiles. Infrastructures must also handle user consent, as not all researchers have ORCID iDs or keep them current. Integrating ORCID into identity and access management (IAM/AAI) systems adds complexity around authentication flows, attribute handling, and privacy policies. Finally, relying on an external service like ORCID demands ongoing technical maintenance, monitoring of API changes, and alignment with broader PID infrastructure.
When an ORCID is not available, data infrastructures should still record the person’s full name and leave the ORCID field empty or marked as pending, allowing for future updates. To support post-hoc linking, systems should enable users to add ORCIDs later, for example through a “claim your record” function. As a fallback, institutional identifiers can be used to reference the person within identity and access systems. Finally, infrastructures should implement regular quality control processes to identify missing ORCIDs and enrich metadata over time, either manually or through automated tools such as the ORCID Public API.
It is recommended that data infrastructures (data repositories, data bases) should:
Note on 2. + 3.: treating ORCID metadata as the primary source of truth is a conceptual decision on the issue, who is ultimately responsible for personal data. In this model we assume, that each person / contributor, identified by an ORCID is solely responsible for their own data and should only made aware of incorrect information, not forced to update it. Technically, institutions could maintain their own records of these personal data. This, however, leads to several problems: 1. the maintenance of this data is very difficult and keeping it current almost impossible; 2. by keeping personal data of people, data privacy becomes an issue, as people cannot control, what data about them is shared with others. This can be avoided by delegating the responsibility for personal information to the persons themselves, at the cost of not being in full control of that data.
Note on 4.: Identity and access management systems (IAM), authentication and authorization infrastructures (AAI) or upcoming community attribute services like defined in the AARC blueprint architecture are suitable environments for enriching user information, for example with an ORCID attribute. The mechanisms to control forwarding of private and personal user information allows the users to decide if they agree with that. As an attribute of the user objects the usage of ORCID within scientific software applications would be simplified and encouraged.
[1] ORCID Terms of use: https://info.orcid.org/terms-of-use/