OpenAIRE Guidelines
Promoting Repositories Interoperability and Supporting Open Access Funder Mandates
July 2015
guidelines@openaire.eu

[Figure: Data Providers, OpenAIRE Platform, Services]

Content acquisition policy
• Publications in OpenAIRE
• Data in OpenAIRE

OpenAIRE Guidelines for data sources

Where we started…
• ‘oai_dc’: 15 elements based on the DCMES schema
• Established as the ‘lowest common denominator’ OAI format
• Used in numerous interpretations -> interoperability issue
• Requires agreements on syntax (use of OAI-PMH & oai_dc) and semantics (controlled vocabularies)
• DRIVER Guidelines v2 introduced in 2008
• info:eu-repo Application Profile for encoding schemes
• Mainly used to describe textual resources
• Continued in OpenAIRE Guidelines since 2009

In the meantime…
• OpenAIRE scope & requirements have expanded since then
• By complementary types of data sources
• For research data and research information
• Non-textual content
• Publication and data linking
• Project contextualization
• New global “authoritative” identifier-based infrastructures
• For authors / contributors
• For funders / projects
• For research outputs

Guidelines Timeline
Timeline figure labels: OpenAIRE: EC + OA (2013); OpenAIRE: EC-FP7 (2010, 2012); DRIVER: OA (2008); Text Repositories; Text Repos.; Aggregators; Text Rep.; Aggr.; oai_dc; Data-Rep.
oai_datacite; CRIS; CERIF

3 Guidelines for Data Providers
• Literature Repositories (and journal platforms): Dublin Core (DRIVER)
• Data Repositories (and archives/data centres): DataCite
• CRIS systems: CERIF-XML

Guidelines for Literature based Data Sources
• Continues the DRIVER v2 Guidelines
• Transfer protocol: primarily, but not limited to, OAI-PMH
• Metadata format (so far): oai_dc
• Application Profile (so far): info:eu-repo
• Extended by properties to describe
  • Funder / Project information
  • Embargo Period / Embargo End Date
  • Open Access Status / Access Rights
  • Alternative Identifier
  • Referenced Dataset(s)
  • Referenced Publication(s)

OpenAIRE OAI-Set
• To group metadata relevant for OpenAIRE
• See https://www.openaire.eu/content-acquisition-policy/content-acquisition-policy/content-acquisition-policy
• Metadata about Open Access Publications
• Metadata about Publications funded in EC-FP7 / H2020
• Metadata about Publications funded by other funders
• OpenAIRE provides information about supported funding information
The OpenAIRE set: setName = OpenAIRE, setSpec* = openaire

projectID
Element name: projectID
DCMI definition: dc:relation
Usage: Mandatory (if applicable)
Usage instruction: A vocabulary of projects is exposed by the OpenAIRE API (http://api.openaire.eu/#cha_projects_http) and is available to all repository managers. Values include funder, project name and projectID.
The projectID equals the Grant Agreement number, and is defined by the namespace:
info:eu-repo/grantAgreement/Funder/FundingProgram/ProjectNumber/Jurisdiction/ProjectName/ProjectAcronym/
Examples:
info:eu-repo/grantAgreement/EC/FP7/123456
info:eu-repo/grantAgreement/EC/FP7/12345/EU//Acronym

accessRights
Element name: accessRights
DCMI definition: dc:rights
Usage: Mandatory
Usage instruction: Use values from the Access Rights vocabulary at http://purl.org/eu-repo/semantics/#info-eu-repo-AccessRights
• info:eu-repo/semantics/closedAccess
• info:eu-repo/semantics/embargoedAccess
• info:eu-repo/semantics/restrictedAccess
• info:eu-repo/semantics/openAccess
Example: info:eu-repo/semantics/openAccess

embargoEndDate
Element name: embargoEndDate
DCMI definition: dc:date
Usage: Mandatory (if applicable)
Usage instruction: Recommended when accessRights = info:eu-repo/semantics/embargoedAccess. The date type is controlled by the namespace info:eu-repo/date/embargoEnd/, see http://wiki.surffoundation.nl/display/standards/info-eu-repo/#info-eu-repo-DateTypesandvalue. The date should be encoded in the form YYYY-MM-DD (conforming to ISO 8601).
Example: info:eu-repo/date/embargoEnd/2011-05-12

Alternative Identifier
Element name: Alternative Identifier
DCMI definition: dc:relation
Usage: Recommended
Usage instruction: List alternative identifiers for this publication that are not the primary identifier (repository splash page), e.g., the DOI of the publisher’s version, the PubMed/arXiv ID. The term is defined by info:eu-repo/semantics/altIdentifier:
info:eu-repo/semantics/altIdentifier/<scheme>/<identifier>
where <scheme> must be one of the following: ark, arxiv, doi, hdl, isbn, purl…
Example: info:eu-repo/semantics/altIdentifier/doi/10.1234/789.1

Referenced Dataset
Element name: Referenced Dataset
DCMI definition: dc:relation
Usage: Recommended
Usage instruction: Encodes links to research datasets connected with this publication.
The syntax of info:eu-repo/semantics/dataset is:
info:eu-repo/semantics/dataset/<scheme>/<identifier>
where <scheme> must be one of the following: ark, arxiv, doi, hdl, isbn, purl…
Example: info:eu-repo/semantics/dataset/doi/10.1234/789.1

Referenced Publication
Element name: Referenced Publication
DCMI definition: dc:relation
Usage: Recommended
Usage instruction: Encodes links to publications referenced by this publication. The syntax of info:eu-repo/semantics/reference is:
info:eu-repo/semantics/reference/<scheme>/<identifier>
where <scheme> must be one of the following: ark, arxiv, doi, hdl, isbn…
Example: info:eu-repo/semantics/reference/doi/10.1234/789.1

Meet H2020 OA Guidelines
• https://www.iprhelpdesk.eu/sites/default/files/newsdocuments/Open_Access_in_H2020.pdf
Property | DC Field | Value
EU funding acknowledgment | dc:contributor | “controlled” terms: ["European Union (EU)" and "Horizon 2020"] ["Euratom" and "Euratom research and training programme 2014-2018"]
Peer reviewed | dc:type | info:eu-repo/semantics/publishedVersion
Embargo period | dc:date, dc:rights | info:eu-repo/date/embargoEnd/ (as publication date); info:eu-repo/semantics/embargoedAccess
Project information | dc:relation | info:eu-repo/grantAgreement/EC/H2020/[ProjectID]/[Jurisdiction]/[ProjectName]/[ProjectAcronym]/
Persistent identifier | dc:identifier or dc:relation |
License | dc:rights | URL of license condition
Persistent IDs for authors and contributors | dc:creator, dc:contributor |
Reference to related research outcome | dc:relation | info:eu-repo/semantics/dataset/<scheme>/<identifier>

Guidelines for Research Data based Data Sources
• Based on the DataCite metadata scheme
• Proven standard for heterogeneous data sources and cross-disciplinary archives
• Maintained by a trusted and sustainable organization
• Support of data citation
• Metadata Format: oai_datacite
• OAI-Set: openaire_data
• Zenodo as reference implementation for OpenAIRE Data Repository Guidelines

Guidelines for Research Data based Data Sources
• Adaptations for OpenAIRE:
  • Incl.
additional PID schemes to DOI, e.g. URN, ARK …
  • Recommends links to related publications & datasets
  • Recommends the Contributor property to relate a dataset to funding information
  • Date and Description properties are mandatory
  • Enforces the (info:eu-repo) encoding scheme on the Rights property

Guidelines for CRIS Managers based on CERIF-XML
• CERIF as de facto standard for the CRIS data model
• Entities for organization, person, project, output, data …
• CERIF-XML as data exchange standard (in the CRIS world)
• Maintained by euroCRIS as a sustainable organization

Relevant CERIF Entities for OpenAIRE

Guidelines for CRIS Managers based on CERIF-XML
• Adaptations for OpenAIRE:
  • Subset of CERIF entities relevant for OpenAIRE
  • Semantic Interoperability
  • Reuse of CERIF Semantics vocabularies for classification and definition of relationships between entities
  • Consistency with the other OpenAIRE Guidelines
  • E.g. resource types, access rights

Guidelines for CRIS Managers based on CERIF-XML
• Syntactic Interoperability
• Distinct namespace for the OpenAIRE CERIF schema
• Nesting only of multi-lingual attributes, federated identifiers, linked entities
• System Interoperability:
  • Transfer Protocol: OAI-PMH
  • OAI-PMH sets for
    • Each type of entity
    • The entire CERIF graph

OpenAIRE Compatibility Status: Levels and OAI Sets
• OpenAIRE basic: only Open Access content, via the driver OAI set
• OpenAIRE 2.0: EC funded content, via the ec_fundedresources OAI set
• OpenAIRE 2.0 +: Open Access and EC funded content, via the driver and ec_fundedresources OAI sets
• OpenAIRE 3.0: Open Access and/or EC funded and/or National/other funded content, via the openaire OAI set

Future Directions

Next steps

Innovative Scholarly Services – Need of Improved Metadata Quality
• Features
• Disambiguation by identifiers for authors / contributors, funders / project grants, research outputs
• Explicit links to landing pages and fulltext files
• Explicit links between publications and other research outputs
• Detailed bibliographic citation information
•
Explicit and machine-readable license information
• Future proof
• Compatible with linked data principles
• Extensibility towards Open Science properties
• Backward compatibility and low-barrier upgrade
• Following DCMI Guide on Publishing Metadata (dcterms)
• COAR IG Controlled Vocabularies

Innovative Scholarly Services – Need of Network Alignment
• Alignment & collaboration needed between major repository initiatives
• LA Referencia, SHARE, RIOXX, OpenAIRE, …
• COAR-CASRAI OA Interoperability WG
• Alignment on metadata elements and vocabularies
• COAR IG Controlled Vocabularies
• Community driven effort to supersede info:eu-repo application profile
• Working on an up to date technological standard
• Editorial Board to ensure expertise and long-term support

Find and discuss at: https://guidelines.openaire.eu
Contact us: guidelines@openaire.eu

July 2015
Pedro Principe, University of Minho
Jochen Schirrwagen, University of Bielefeld
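
Appendix: sketch of the info:eu-repo encodings
The encodings described under the Guidelines for Literature based Data Sources (projectID, accessRights, embargoEndDate, and the altIdentifier / dataset / reference relations) are plain strings, so a repository export routine could assemble them with a few helpers. The Python below is only an illustrative sketch, not part of the guidelines: the function names are hypothetical, the repository base URL is a placeholder, and only the short projectID form (funder / programme / number) is shown. The last helper builds an OAI-PMH ListRecords request for the `openaire` set in `oai_dc`.

```python
from datetime import date
from urllib.parse import urlencode

# Access Rights vocabulary terms listed in the accessRights slide.
ACCESS_RIGHTS = {"closedAccess", "embargoedAccess", "restrictedAccess", "openAccess"}

# Relation kinds described in the slides (all carried in dc:relation).
RELATION_KINDS = {"altIdentifier", "dataset", "reference"}


def project_id(funder: str, programme: str, number: str) -> str:
    """Short projectID form, e.g. info:eu-repo/grantAgreement/EC/FP7/123456."""
    return f"info:eu-repo/grantAgreement/{funder}/{programme}/{number}"


def access_rights(term: str) -> str:
    """dc:rights value; rejects terms outside the Access Rights vocabulary."""
    if term not in ACCESS_RIGHTS:
        raise ValueError(f"not an Access Rights term: {term!r}")
    return f"info:eu-repo/semantics/{term}"


def embargo_end(day: date) -> str:
    """dc:date value with the embargo end date encoded as YYYY-MM-DD."""
    return f"info:eu-repo/date/embargoEnd/{day.isoformat()}"


def typed_relation(kind: str, scheme: str, identifier: str) -> str:
    """dc:relation value for an alternative identifier, dataset or reference.

    The scheme is not validated here because the list of schemes in the
    slides is open-ended (it ends with an ellipsis).
    """
    if kind not in RELATION_KINDS:
        raise ValueError(f"unknown relation kind: {kind!r}")
    return f"info:eu-repo/semantics/{kind}/{scheme}/{identifier}"


def list_records_url(base_url: str) -> str:
    """OAI-PMH ListRecords request for the 'openaire' set in oai_dc.

    base_url is whatever OAI-PMH endpoint the repository exposes
    (the one used below is a placeholder, not a real service).
    """
    query = urlencode(
        {"verb": "ListRecords", "metadataPrefix": "oai_dc", "set": "openaire"}
    )
    return f"{base_url}?{query}"


if __name__ == "__main__":
    print(project_id("EC", "FP7", "123456"))
    print(access_rights("embargoedAccess"))
    print(embargo_end(date(2011, 5, 12)))
    print(typed_relation("dataset", "doi", "10.1234/789.1"))
    print(list_records_url("https://repository.example.org/oai"))
```

Checking the access term against the vocabulary at export time is one way to catch free-text rights statements before they reach the aggregator; the longer projectID form with jurisdiction, project name and acronym would need additional arguments.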