Facilitate Open Science Training for European Research Facilitate Open Science Training for European Research Repository managers: guidelines to meet H2020 requirements Ricardo Saraiva - University of Minho rsaraiva@sdum.uminho.pt Overview 1. The context: OpenAIRE infrastructure 2. OpenAIRE guidelines for data sources 3. Adoption and mapping with other repository networks 4. Future Directions 5. Tools for repositories compatibility The context: OpenAIRE infrastructure 1/5 OPEN ACCESS INFRASTRUCTURE OpenAIRE for research in Europe www.openaire.eu Project & Infrastructure Guided by OA funder policies (European Commission) Participatory approach (human & e-infrastructure) Integrated Scientific Information System Data Providers OpenAIRE Platform Services Content acquisition policy Publications in OpenAIRE Data in OpenAIRE OpenAIRE Guidelines for data sources 2/5 • ‘oai_dc’ 15 elements based on DCMES schema • As established and ‘lowest common denominator’ OAI format • Used in numerous interpretations -> interoperability issue • Requires agreements on syntax (use of OAI-PMH & oai_dc) and semantics (controlled vocabularies) • DRIVER Guidelines v2 introduced in 2008 • info:eu-repo Application Profile for encoding schemes • Mainly used to describe textual resources • Continued in OpenAIRE Guidelines since 2009 Where we started… In the meantime… • OpenAIRE scope & requirements have expanded since then • By complementary types of data sources • For research data and research information • Non-textual content • Publication and data linking • Project contextualization • New global “authoritative” identifier based infrastructures • For authors/contributors • funders / projects • research outputs Guidelines Timeline OpenAIRE: EC + OA (2013) OpenAIRE: EC-FP7 (2010, 2012) DRIVER: OA (2008) Text Repositories Text Repos. Aggregators Text Rep. Aggr. Oai_DC Data-Rep. OAI_DATACITE CRIS CERIF 1 2 3Literature Repositories (and journal platforms) Dublin Core (DRIVER) Data Repositories (and archives/data centres) Datacite CRIS systems CERIF-XML 3 Guidelines for Data Providers How do they work? • Identification of Open Access and funded research results by OAI-Sets: • ‘ec-fundedresources’ • ‘openaire’ • ‘openaire_data’ • Latest schema guarantees backwards-compatibility with previous versions. • Enhancing research results by metadata enrichment in OpenAIRE. “Repository managers will be able to enable authors to fulfill the EC Commission Open Access requirements” 16 www.openaire.eu 1 2 3Literature Repositories (and journal platforms) Dublin Core (DRIVER) Data Repositories (and archives/data centres) Datacite CRIS systems CERIF-XML 3 Guidelines for Data Providers Guidelines for Literature based Data Sources • Continues the DRIVER v2 Guidelines • Transfer protocol primarily but not limited: OAI-PMH • Metadata format (so far): oai_dc • Application Profile (so far): info:eu-repo • Extended by properties to describe • Funder / Project information • Embargo Period / Embargo End Date • Open Access Status / Access Rights • Alternative Identifier • Referenced Dataset(s) • Referenced Publication(s) OpenAIRE OAI-Set • To group metadata relevant for OpenAIRE • See https://www.openaire.eu/content-acquisition-policy/content- acquisition-policy/content-acquisition-policy • Metadata about Open Access Publications • Metadata about Publications funded in EC-FP7 / H2020 • Metadata about Publications funded by other funders • OpenAIRE provides information about supported funding information  setName setSpec* The OpenAIRE set OpenAIRE openaire OpenAIRE elements projectID access_rights embargo_end_date projectID Element name projectID DCMI definition dc:relation Usage Mandatory (if applicable) Usage instruction A vocabulary of projects is exposed by the OpenAIRE API:http://api.openaire.eu/#cha_projects_http , and available for all repository managers. Values include funder, project name and projectID. The projectID equals the Grant Agreement number, and is defined by the namespace: info:eu-repo/grantAgreement/Funder/ FundingProgram/ProjectNumber/ Jurisdiction/ProjectName/ProjectAcronym/ Example:info:eu-repo/grantAgreement/EC/FP7/123456info:eu-repo/grantAgreement/EC/FP7/12345/EU//Acronym accessRights Element name accessRights DCMI definition dc:rights Usage Mandatory Usage instruction Use values from vocabulary Access Rights at http://purl.org/eu-repo/semantics/#info-eu-repo-AccessRights • info:eu-repo/semantics/closedAccess • info:eu-repo/semantics/embargoedAccess • info:eu-repo/semantics/restrictedAccess • info:eu-repo/semantics/openAccess Examples: info:eu-repo/semantics/openAccess embargoEndDate Element name embargoEndDate DCMI definition dc:date Usage Mandatory (if applicable) Usage instruction Recommended when accessRights = info:eu-repo/semantics/embargoedAccessThe date type is controlled by the name space info:eu- repo/date/embargoEnd/, see http://wiki.surffoundation.nl/display/standards/info-eu- repo/#info-eu-repo-DateTypesandvalue. Encoding of this date should be in the form YYYY-MM-DD (conform ISO 8601). Examples: info:eu-repo/date/embargoEnd/2011-05-12 Alternative Identifier Element name Alternative Identifier DCMI definition dc:relation Usage Recommended Usage instruction List alternative identifiers for this publication that are not the primary identifier (repository splash page), e.g., the DOI of publisher’s version, the PubMed/arXiv ID. The term is defined by info:eu-repo/semantics/altIdentifier info:eu-repo/semantics/altIdentifier// where must be one of the following: ark,arxiv, doi, hdl, isbn, purl… Example info:eu-repo/semantics/altIdentifier/doi/10.1234/789.1 Referenced Dataset Element name Referenced Dataset DCMI definition dc:relation Usage Recommended Usage instruction Encodes links to research datasets connected with this publication. The syntax of info:eu-repo/semantics/dataset is: info:eu-repo/semantics/dataset// where must be one of the following: ark,arxiv, doi, hdl, isbn, purl… Example info:eu-repo/semantics/dataset/doi/10.1234/789.1 Referenced Publication Element name Referenced Publication DCMI definition dc:relation Usage Recommended Usage instruction Encode links to publications referenced by this publication. The syntax of info:eu-repo/semantics/reference is: info:eu-repo/semantics/reference// where must be one of the following: ark, arxiv, doi, hdl, isbn… Examples: info:eu-repo/semantics/reference/doi/10.1234/789.1 Meet H2020 OA Guidelines https://www.iprhelpdesk.eu/sites/default/files/newsdocume nts/Open_Access_in_H2020.pdf Property DC Field Value EU funding acknowledgment dc:contributor “controlled” terms : ["European Union (EU)" and "Horizon 2020"]["Euratom" and "Euratom research and training programme 2014-2018"] Peer reviewed dc:type info:eu-repo/semantics/publishedVersion Embargo period dc:datedc:rights • info:eu-repo/date/embargoEnd/ (as publication date) • info:eu-repo/semantics/embargoedAccess Project information dc:relation info:eu-repo/grantAgreement/EC/H2020/[ProjectID]/[Jurisdiction]/[ProjectName]/[ProjectAcronym]/ Persistent identifier dc:identifier or dc:relation License dc:rights URL of license condition Persistent IDs for authors and contributors dc:creatordc:contributor Reference to related research outcome dc:relation info:eu-repo/semantics/dataset// 1 2 3Literature Repositories (and journal platforms) Dublin Core (DRIVER) Data Repositories (and archives/data centres) Datacite CRIS systems CERIF-XML 3 Guidelines for Data Providers Guidelines for Research Data based Data Sources •Based on DataCite metadata scheme •Proven standard for heterogeneous data sources and cross-disciplinary archives •Maintained by a trusted and sustainable organization •Support of data citation •Metadata Format: oai_datacite •OAI-Set: openaire_data •Zenodo as reference implementation for OpenAIRE Data Repository Guidelines Guidelines for Research Data based Data Sources •Adaptations for OpenAIRE:• Incl. additional PID schemes to DOI, e.g.URN, ARC …•Recommends links to related publications & datasets•Recommends Contributor prop. to relate dataset to funding information•Date and Description prop. are mandatory•Enforces (info:eu-repo) encoding scheme on Rights property 1 2 3Literature Repositories (and journal platforms) Dublin Core (DRIVER) Data Repositories (and archives/data centres) Datacite CRIS systems CERIF-XML 3 Guidelines for Data Providers Guidelines for CRIS Managers based on CERIF-XML •CERIF as de facto standard for CRIS data model•Entities for organization, person, project, output, data ..•CERIF-XML as data exchange standard (in the CRIS world)•Maintained by euroCRIS as sustainable organization Relevant CERIF Entities for OpenAIRE Guidelines for CRIS Managers based on CERIF-XML •Adaptations for OpenAIRE: •Subset of CERIF entities relevant for OpenAIRE •Semantic Interoperability •Reuse of CERIF Semantics vocabularies for classification and definition of relationships between entities •Consistency with the other OpenAIRE Guidelines • E.g. resource types, access rights OpenAIRE Compatibility Status: Levels and OAI Sets OpenAIRE basic Only Open Access content via driver oai set OpenAIRE 2.0 EC funded content via ec_fundedres ources oai set OpenAIRE 2.0 + Open Access and EC funded content via driver and ec_fundedreso urces oai set OpenAIRE 3.0 Open Access and/or EC funded and/or National/other funded content via openaire oai set Adoption and Mapping with other repository networks 3/5 LA Referencia aligned with OpenAIRE Guidelines •Latin America Open Access Repositories Network •Central metadata elements and use of vocabularies •Access rights: openAccess; embargoedAccess •Use of Creative Commons licenses recommended •Considers regional aspects compared to OpenAIRE regarding •Mandatory , mandatory if applicable and recommended use of metadata fields •Collaboration between LA Referencia and OpenAIRE ensures future compatibility of the guidelines OpenAIRE Guidelines, RIOXX and the case in UK •RIOXX to support UK repositories for compliancy with funder OA policies (RCUK, HEFCE) •Introduction of element / attributes to encode project / funder information •Based on oai_dc + rioxx terms and rioxx attributes •Supports NISO-ALI •Support of EC-H2020 OA mandate is (currently) done via agreed mapping between RIOXX and OpenAIRE Literature Guidelines Future Directions Next steps 4/5 Innovative Scholarly Services – Need of Improved Metadata Quality •Features •Disambiguation by identifiers for authors / contributors, funders / project grants, research outputs •Explicit links to landing pages and fulltext files •Explicit links between publications and other research outputs •Detailed bibliographic citation information •Explicit and machine readable license information •Future proof •Compatible with linked data principles •Extensibility towards Open Science properties •Backward compatibility and low-barrier upgrade •Following DCMI Guide on Publishing Metadata (dcterms) •COAR IG Controlled Vocabularies Innovative Scholarly Services – Need of Network Alignment •Alignment & collaboration needed between major repository initiatives •LA Referencia, SHARE, RIOXX, OpenAIRE, … •COAR-CASRAI OA Interoperability WG • Alignment on metadata elements and vocabularies •COAR IG Controlled Vocabularies • Community driven effort to supersede info:eu-repo application profile •Working on an up to date technological standard • Editorial Board to ensure expertise and long-term support Tools for Repositories Compatibility Validator, API, software add-ons 5/5 Data Provider Registration Via validator tool www.openaire.eu/validator Test the OpenAIRE Compliance The OpenAIRE provides its validator service: www.openaire.eu/validator Users can: •Verify if the repository is truly compatible with the guidelines. •Run a compatibility test (enter the OAI-PMH base URL). Run Compatibility Test: View/Select Rules for Usage Use of OAI-DC (M) Use of OAI-PMH: 'openaire' or 'ec_fundedresources' or 'driver' set must exist (M) Use of OAI-PMH: 'openaire' set must exist (M) driver set must exist (R) ec_fundedresources set must exist (openaire2.0) for Content Field Access Level (M) Field Alternative Identifier (R) Field Audience (R) Field Contributor (R) Field Coverage (R) Field Creator (M) Field Date of Publication (M) Field Description (MA) Field Embargo End Date (MA) Field Format (R) Field Language (R) Field License Condition (R) Field Project Identifier for ecfundedresources (M) Field Project Identifier for openaire(MA) Field Publication Type (M) Field Publication Version (R) Field Publisher (MA) Field Referenced Dataset (R) Field Referenced Publication (R) Field Resource Identifier (M) Field Source (M) Field Subject (MA) Field Title (M) http://api.openaire.eu/ http://api.openaire.eu/ Need to integrate project and funding information into your institutional repository based on DSpace or ePrints? •Go for the DSpace/ePrints endpoints. Do you prefer a TSV with the list of projects by funding? •TSV endpoint is meant for Addons, patchs or plugins for Repositories & Journals software Dspace add-ons and versions compliance https://wiki.duraspace.org/pages/viewpage.action?pageId=33238567 OpenAIRE Plug-in (OpenAIRE 2.0) http://files.eprints.org/649/ EPrints - OpenAIRE compliance example (3.0) https://gist.github.com/alenkovich/9596992 OAI_DC_OpenAIRE implementation for Zenodo https://github.com/zenodo/zenodo/blob/master/zenodo/base/format_templates/OAI_DC_OpenAIRE.xsl OJS Plugin: OpenAIRE http://pkp.sfu.ca/support/forum/viewtopic.php?f=28&t=7085 + OJS DRIVER-Plug-In https://www.openaire.eu/rep-managers/rep-man-toolkit/repository-managers OpenAIRE compatibility for different DSpace versions 51 Via OAIextended add-on and also via XOAI add-onDspace 1.8.2 • Compatibility with OpenAIRE 2.0 guidelines OAI 2.0 is a standard part of DSpace 3.0DSpace 3 • Compatibility with OpenAIRE 2.0 guidelines via XOAI. • (fix ‘info:eu-repo/grantAgreement/EC/FP’ on the filter ‘openairerelationFilter’) OAI 2.0 is a standard part of DSpace 4.0DSpace 4 • OpenAIRE 2.0 compatibility via XOAI OAI 2.0 is a standard part of DSpace 5.0DSpace 5 • OpenAIRE 2.0 compatibility via XOAI and will be partially compatible with OpenAIRE 3.0 guidelines via patch (to be presented) . OAI 2.0 What is OAI 2.0? OAI 2.0 is a Java implementation of an OAI-PMH data provider interface that uses XOAI, an OAI-PMH Java Library. Why OAI 2.0? Projects like OpenAIRE, DRIVER have specific metadata requirements. As the OAI-PMH protocol doesn't establish any frame to these specifics, OAI 2.0 can have more than one instance of an OAI interface (feature provided by the XOAI core library) so one could define an interface for each project. OAI 2.0 is a standard part of DSpace 3.0, 4.0 and 5.0 OAI 2.0 • With a Filter is possible to select information from the data source. • A Transformer allows to make some changes in the metadata before showing it in the OAI interface • OAI 2.0 also adds a new concept to the OAI-PMH basic specification, the concept of context.  http://www.example.com/xoai/driver  http://www.example.com/xoai/openaire  http://www.example.com/xoai/request Concepts: Filter, Transformer and Context OAI 2.0 Contexts could be seen as virtual distinct OAI interfaces  http://repositorium.sdum.uminho.pt/oai/driver  http://repositorium.sdum.uminho.pt/oai/openaire  http://repositorium.sdum.uminho.pt/oai/request DRIVER OPENAIRE OAI 2.0 (record in openaire set) https://repositorium.sdum.uminho.pt/oai/openaire? verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:repositorium.sdum.uminho.pt:1822/34546 Dspace Add-ons for project ids •OpenAIRE Authority Control •Dspace 3.2 • http://goo.gl/cEPTZN (updated March 2014) •Dspace 1.8.2 • http://projeto.rcaap.pt/index.php/lang-en/consultar-recursos-de-apoio/re mository?func=fileinfo&id=354 •OpenAIRE funders projects list addon (NEW) • In use on the RCAAP Project (PT repositories) • https://gitlab.fccn.pt/dev-rcaap/addon-openaire/tree/OpenAIRE5.X • https://gitlab.fccn.pt/dev-rcaap/addon-openaire/tree/OpenAIRE3.X Allows users to search and include EC (+ WT + FCT) projects ID in the metadata of the records disposed in accordance with OpenAIRE Using the projects list provided by the OpenAIRE API OpenAIRE Authority Control Add-on •Allows users to search and include FP7 project IDs in the metadata of the records disposed in accordance with the guidelines * Needs the OAIextended Add-on to create the set (ec_fundedresources) OpenAIRE Funders Projects List Add-on Searching by the name, acronym or the project id number Select the project and accept… the necessary namespace will be filled Example: http://hdl.handle.net/1822/21091 Metadata edition Unlocking dc.relation and searching by the name, acronym or the project id number Select the project and accept… the necessary namespace will be filled Some results in OpenAIRE… Useful Links: Find and discuss the guidelines at: https://guidelines.openaire.eu Test and register the repository at: http://validator.openaire.eu Find support at: https://www.openaire.eu/support/helpdesk Access the OpenAIRE API documentation: http://api.openaire.eu About COAR Repository Interoperability: http://coar-repositories.org/activities/repository-interoperability Find the guide for Repository Managers at: https://www.openaire.eu/rep-man-toolkit/repository-managers Search for OpenAIRE data providers at: https://www.openaire.eu/search/openaire-data-providers See the OpenAIRE Content Acquisition policy at: https://www.openaire.eu/openaire-s-content-acquisition-policy/download Thank You! Find and discuss at: https://guidelines.openaire.eu Contact us: guidelines@openaire.eu Ricardo Saraiva, rsaraiva@sdum.uminho.pt