Facilitate Open Science Training for European Research
WORKSHOP 1: Open Research Data in Social Sciences and Humanities (ADP)
June 18th 2014, Social Science Data Archives, Faculty of Social Sciences, University of Ljubljana

Introduction to RDM and emerging good RDM practice in UK universities
Joy Davidson, Digital Curation Centre
Funded by:

The DCC Mission
“Helping to build capacity, capability and skills in data management and curation across the UK’s higher education research community”
Phase 3 Business Plan
www.dcc.ac.uk

What is research data management and curation?
• Active management, adding value and maintaining access to research data over the course of its scholarly lifecycle.
• Part of good research practice

UK Context

What has been driving this agenda in the UK?
• Several factors making RDM a key issue for many UK HEIs
• These factors are international and are affecting research practice in many countries (US, Canada, EU)

“Data sets are becoming the new instruments of science”
Dan Atkins, University of Michigan

Digital data as the new special collections?
Sayeed Choudhury, Johns Hopkins

Research data: institutional crown jewels?
http://www.flickr.com/photos/lifes__too_short__to__drink__cheap__wine/4754234186/

What do UK funders expect?
Ultimately funders expect:
• timely release of data – once patents are filed or on (acceptance for) publication
• open data sharing – minimal or no restrictions if possible
• preservation of data – typically 5-10+ years if of long-term value
See the RCUK Common Principles on Data Policy: www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies

Working with UK Universities

Institutional Engagement programme
• Pilot phase ran from Spring 2011 to Spring 2013
• Funded by HEFCE under Universities Modernisation Fund
• Provided up to 60 days of resource to 21 institutions
• Aimed to:
  – Increase capacity and capability in Research Data Management
  – Drive efficiencies in the HE sector by sharing models and lessons

The engagement model
• Efficient information sharing across HE sector
• Redesign of DCC support

Who we worked with

DCC tailored support
www.dcc.ac.uk/tailored-support

Roles of main participants

Examples of support: CARDIO at Queen Mary
• Advised on the mini quiz and full CARDIO
• Ran a workshop to reach consensus on results
• Wrote up findings to feed into strategy
• ‘heatmap’ approach being used to guide roadmap development
DCC support fits into the curation strand of the IT Transformations Project at QMUL

Fostering skills development
Guidance webpages
www.gla.ac.uk/datamanagement
www.bath.ac.uk/research/data
Online training for PhD students
http://datalib.edina.ac.uk/mantra
http://www.fosteropenscience.eu/
Jorum
Vitae RDF

Examples of support: Capacity building at UEL
• Co-designed a series of modules for training RDM support staff
• Developed online learning resource, presentations and exercises
• Delivered training with UEL colleagues
• Helped to build awareness and confidence in dealing with RDM
SupportDM: http://www.uel.ac.uk/trad/outputs/resources/
Review of the course: http://datamanagementuel.wordpress.com/2013/06/19/are-you-rdm-ready-from-zeroes-to-heroes/

Supporting data management planning

What is a DMP?
A short plan that outlines:
• what data will be created and how
• how it will be managed (storage, back-up, access…)
• plans for data sharing and preservation

Some other funders that require DMPs or equivalent

Five common themes
– Description of data to be collected / created (i.e. content, type, format, volume...)
– Standards / methodologies for data collection & management
– Ethics and Intellectual Property (highlight any restrictions on data sharing e.g. embargoes, confidentiality)
– Plans for data sharing and access (i.e. how, when, to whom)
– Strategy for long-term preservation

DMPonline
https://dmponline.dcc.ac.uk

Main features of DMPonline
• Templates for different requirements (funder or institution)
• Tailored guidance (funder, institutional, discipline-specific etc)
• Ability to provide examples and boilerplate text
• Supports multiple phases (e.g. pre- / during / post-project)
• Granular read / write / share permissions
• Customised exports to a variety of formats
• API for systems interoperability
• Shibboleth authentication

Institutions can customise DMPonline
• Select / write desired questions
• Add your logo, URL etc
• Profile local support via custom guidance and boilerplate text
www.dcc.ac.uk/news/customising-dmponline

Facilitating research data discovery

Archiving – external data centres
• Research funders’ data centres…
• Structured databases
• Disciplinary & community initiatives
• Registries of international data centres

Institutional data repositories
• Not intended to replace national, subject or other established data repositories
• Acknowledge hybrid environment
http://datashare.is.ed.ac.uk
www.dspace.cam.ac.uk
https://databank.ora.ox.ac.uk
Research Data at Essex and DataPool at Southampton

Data catalogues
• Oxford is developing its DataFinder tool http://blogs.it.ox.ac.uk/damaro
• Research Data @ Essex has developed a profile based on DataCite, Inspire and DDI standards www.data-archive.ac.uk/media/395364/rde_march2013_repositoryoutputs.pdf
• C4D is developing a research data extension to the CERIF standard - http://cerif4datasets.wordpress.com
• CKAN is being explored by various projects http://ckan.org

Research Data Registry pilot
• aim is to improve data discovery and provide a national level service
• aggregate metadata relating to data collections or datasets held in UK HEIs and subject data centres
• have mapped metadata schema to develop crosswalks
• trialling ANDS Research Data Australia

Jisc RDRDS Pilot Partners

RDRDS Metadata Work

Metadata catalogues: defining levels

RDRDS ongoing investigations

Assessing RDM costs and benefits
http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies

Collaboration to Clarify the Costs of Curation (4C)
http://4cproject.eu/

Lessons we can share

Components of RDM services

UK research data policies
[Diagram: “Statement of commitment” → Infrastructure → policy]
• “10 commandments”
• mutual promises
• aspirational
• Baseline of RCUK Code + procedures & support
• legal tone / language
• a section in uni DM policy
• useful guide as appendix
• Based on Edin. with a few additions

Sample roadmaps: University of Bath
• Based on Monash University RDM strategy
• Identifies the current position and proposes activity
• Defines roles and responsibilities and timeframes
http://www.bath.ac.uk/rdso/University-of-Bath-Roadmap-for-EPSRC.pdf

Architecture models
Diagram courtesy of Sally Rumsey, University of Oxford

Thanks – any questions?
DCC guidance, tools and case studies: www.dcc.ac.uk/resources
Follow us on twitter: @digitalcuration and #ukdcc