www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 EUDAT The European.

Presentation transcript:

EUDAT: The European Collaborative Data Infrastructure (CDI)
GRIDKA School, 11 September 2015
Damien Lecarpentier, CSC-IT Center for Science
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065

Outline
- What is the EUDAT CDI?
- Data Management & Open Access in the CDI
- Moving forward

Research Infrastructures – Where are they going?
Research Infrastructure trends:
- Internationalisation
- Diversification
- Increasing reliance on ICT
- The data deluge is a common challenge
European RIs:
- Around 500
- € 100 billion investment
[Figure: timeline spanning the Middle Ages and the 19th, 20th and 21st centuries]
EUDAT Final Review, 21st May 2015, Brussels

A pan-European e-Infrastructure solution for pan-European RI data challenges
All Research Infrastructures are facing data challenges:
- Where to store the growing amount of data?
- How to find it?
- How to make the most of it?
Many research communities are developing their own solutions. This is good, but we also need to make sure that the solutions remain interoperable.
EUDAT's mission is to fill this gap:
- Providing a set of services to help RIs manage their growing amount of data
- Providing these services across communities to ensure the maximum level of interoperability
- Closer integration of data and computing (HPC centres are core partners)

Promoting synergies
EVERY RI NEEDS TO DEAL WITH DATA MANAGEMENT
The worst-case scenario: 500 RIs with 500 incompatible self-made ICT and data management solutions.
What can we do to promote collaboration and re-use of e-Infrastructures?

e-Science Data Factory EUDAT Partners

B2 SERVICE SUITE

B2DROP is a secure and trusted data exchange service for researchers and scientists to keep their research data synchronized and up-to-date and to exchange with other researchers.
An ideal solution to:
- Store and exchange data with colleagues and team
- Synchronize multiple versions of data
- Ensure automatic desktop synchronization of large files
b2drop.eudat.eu

EUDAT 3rd Year EC Review – 21st of May, Brussels

- Based on ownCloud, open source (GNU AGPLv3)
- Access and manage permissions to files from any device and any location, via browser, desktop, mobile apps and WebDAV
- Up to 20 GB of storage space for research data
- Simple to use and open to all researchers and scientists (e.g. self-registration)
- Synchronize and exchange data with one or multiple users
- Users decide with whom to exchange data, for how long and how
EUDAT2020:
- Further integration with EUDAT CDI (e.g. B2SHARE)
- Integration with EUDAT Federated AAI, access via eduGAIN
- Federation with community, institutional, regional and national instances
- Assess B2DROP as workspace area to computing facilities
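
The WebDAV access mentioned above can be sketched in a few lines of Python. This is only an illustration: the `remote.php/webdav/` path is the usual ownCloud convention and is an assumption here, the helper name `build_upload_request` is hypothetical, and authentication is left out. The request is prepared but not sent.

```python
from urllib.parse import quote
from urllib.request import Request

# Assumption: ownCloud-style WebDAV endpoint; confirm the real path in your B2DROP settings.
WEBDAV_ROOT = "https://b2drop.eudat.eu/remote.php/webdav/"


def build_upload_request(remote_path: str, payload: bytes, root: str = WEBDAV_ROOT) -> Request:
    """Prepare (but do not send) a WebDAV PUT request for a single file."""
    # Percent-encode spaces and other special characters, keeping "/" as a path separator.
    url = root + quote(remote_path.lstrip("/"))
    return Request(
        url,
        data=payload,
        method="PUT",
        headers={"Content-Type": "application/octet-stream"},
    )


# Hypothetical file name and content, for illustration only.
req = build_upload_request("results/run 01.csv", b"t,value\n0,1.5\n")
print(req.get_method(), req.full_url)
```

Sending the request would additionally need credentials (for example HTTP basic authentication with an application password), which are deliberately omitted from this sketch.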

B2SHARE is a user-friendly, reliable and trustworthy way for researchers, scientific communities and citizen scientists to store and share small-scale research data from diverse contexts.
A winning solution to:
- Store: facilitates research data storage
- Preserve: guarantees long-term persistence of data
- Share: allows data, results or ideas to be shared worldwide
b2share.eudat.eu

- Based on Invenio, open source (GPL v3)
- Supports 8 community metadata templates
- Data is assigned a persistent identifier and a checksum
- Access via an HTTP REST API
- Openly accessible, user self-registration
- Data owner defines the access policy
- Open access licence selection feature
- Discipline selection feature
- Openly harvestable metadata, harvested by B2FIND
EUDAT2020:
- Further integration with EUDAT CDI (e.g. B2SAFE)
- Integration with EUDAT Federated AAI, focus on authorization
- Access via eduGAIN
- Data versioning
- Extended HTTP RESTful API
- Easily installable software package
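
To illustrate what "a checksum" means for a deposited file, here is a minimal sketch using only the Python standard library. The slide does not say which algorithm B2SHARE uses, so SHA-256 is an assumption, and the function name `checksum` is hypothetical.

```python
import hashlib
import io


def checksum(stream, algorithm: str = "sha256", chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a binary stream, reading it in chunks.

    Chunked reading keeps memory use flat even for large data files.
    The default algorithm (SHA-256) is an assumption, not B2SHARE's documented choice.
    """
    digest = hashlib.new(algorithm)
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


# In-memory stand-in for a file opened with open(path, "rb").
fingerprint = checksum(io.BytesIO(b"example research data"))
print(fingerprint)
```

Anyone who later downloads the data can recompute the digest and compare it with the stored value to confirm the file has not changed.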

B2SAFE is a robust, safe and highly available service which allows community and departmental repositories to implement data management policies on their research data across multiple administrative domains in a trustworthy manner.
A solution to:
- Provide an abstraction layer which virtualizes large-scale data resources
- Guard against data loss in long-term archiving and preservation
- Optimize access for users from different regions
- Bring data closer to powerful computers for compute-intensive analysis
eudat.eu/b2safe

B2SAFE core
- Based on iRODS, open source (BSD license)
- Able to aggregate data from different disciplines into a storage system of trustworthy and capable data service providers
- Provides an abstraction layer which virtualizes large-scale data resources
- Supports data replication and data integrity policies
EUDAT2020:
- Support iRODS v4
- Support metadata
- Optimize and extend policies to support data curation and provenance
- Further integration with EUDAT Federated AAI
- Support authorization on the basis of community access rules
- Assess B2SAFE as workspace area to computing facilities
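
The idea behind a data integrity policy for replicas can be sketched as follows. This is plain Python, not an iRODS rule and not B2SAFE code: the function name, the site names and the use of SHA-256 are all assumptions made for illustration.

```python
import hashlib
from typing import Dict, List


def find_corrupt_replicas(reference: bytes, replicas: Dict[str, bytes]) -> List[str]:
    """Return the (sorted) names of sites whose replica differs from the reference copy.

    Each replica is compared by checksum, which is how an integrity check
    would typically work when the copies live at different data centres.
    """
    expected = hashlib.sha256(reference).hexdigest()
    return sorted(
        site
        for site, content in replicas.items()
        if hashlib.sha256(content).hexdigest() != expected
    )


# Hypothetical sites holding copies of the same object.
master = b"community data set v1"
copies = {"site-a": master, "site-b": master, "site-c": b"community data set v1 (damaged)"}
print(find_corrupt_replicas(master, copies))
```

In a real deployment the checksums would be stored alongside the persistent identifier at ingest time, so replicas can be verified without transferring the data again.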

B2STAGE is a reliable, efficient, light-weight and easy-to-use service to transfer research data sets between EUDAT storage resources and high-performance computing (HPC) workspaces.
The service allows users to:
- Transfer large data collections from EUDAT storage facilities to external HPC facilities for processing
- In conjunction with B2SAFE, replicate community data sets, ingesting them onto EUDAT storage resources for long-term preservation
- Ingest computation results into the EUDAT infrastructure
eudat.eu/b2stage

- Extension of the B2SAFE and B2FIND services, which allow users to store, preserve and find data
- Provides access via GridFTP and basic HTTP
- Data-staging script facilitates staging, ingestion and retrieval of persistent identifier (PID) information of transferred data
- Start development of EUDAT client API library and command line tools
- Integrated with EUDAT Federated AAI on the basis of X.509 certificates
EUDAT2020:
- Further develop HTTP to a mature interface and extend functionality to metadata
- Native support for PIDs within GridFTP transfers
- Extend EUDAT client API library to other B2 services (e.g. B2SHARE, B2FIND, PID)
- Further integration with EUDAT Federated AAI

B2FIND is a simple, user-friendly metadata catalogue of research data collections stored in EUDAT data centres and other repositories.
A service which allows users to:
- Find collections of scientific data quickly and easily, irrespective of their origin, discipline or community
- Get quick overviews of available data
- Browse through collections using standardized facets
b2find.eudat.eu


- Based on CKAN, open source (GNU AGPL v3)
- Faceted search (e.g. 10 facets) and full text search, recently added timeline search
- Focus on community-recommended metadata
- 13 community repositories harvested, more lined up
- Openly accessible, no registration needed
- Openly harvestable metadata
- Searchable via Web-based GUI and HTTP RESTful API
- Harvesting of metadata stored in B2SAFE
- Community customizations
- Further assess RDF and Linked Data
- Further assess scalability and performance
EUDAT
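
Because B2FIND is based on CKAN, a search through the HTTP RESTful API might look like the sketch below. `package_search` is CKAN's standard search action; whether B2FIND exposes it at exactly this path is an assumption, and the helper name `package_search_url` is hypothetical. The code only builds the URL and makes no network call.

```python
from urllib.parse import urlencode

# Hostname taken from the slides; the API path below follows CKAN's convention (assumption).
B2FIND_BASE = "https://b2find.eudat.eu"


def package_search_url(query: str, rows: int = 10, base: str = B2FIND_BASE) -> str:
    """Build a CKAN-style full text search URL for the given query string."""
    params = urlencode({"q": query, "rows": rows})
    return f"{base}/api/3/action/package_search?{params}"


# Hypothetical search term, for illustration only.
print(package_search_url("sea surface temperature", rows=5))
```

A CKAN action endpoint answers with a JSON document, so the resulting URL can be fetched with any HTTP client and parsed with the `json` module.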

Integrated Suite

Working with research communities
EUDAT interacts with or serves 32 scientific communities. The target is 50!

EUDAT Call for Data Pilots 15 May – 30 Sept

Using or Joining: You Choose!
[Figure labels: Generic data centres, Community data sites]
- Using EUDAT services: finding and accessing data, for instance, or storing smaller data sets by interacting with one of the CDI public front-end services
vs
- Joining the CDI: implies tighter integration with at least one of the EUDAT centres → partnership between legal entities relying on OLAs and SLAs

DATA MANAGEMENT & OPEN ACCESS IN THE CDI

Data Management Planning (DMP)
DMPs are increasingly required as part of grant applications (e.g. European Commission H2020 grants) but are useful whenever you are creating data. A DMP is a brief plan written at the start of your project to define:
- How your data will be created
- How it will be documented
- Who will access it
- Where it will be stored
- Who will back it up
- Whether (and how) it will be shared and preserved

Research Data Lifecycle
“Research data management concerns the organisation of data, from its entry to the research cycle through to the dissemination and archiving of valuable results. It aims to ensure reliable verification of results, and permits new and innovative research built on existing information.”
Whyte, A., Tedds, J. (2011). ‘Making the Case for Research Data Management’. DCC Briefing Papers.
[Figure: DataONE data lifecycle]

EUDAT & the DLC

Why open access and open data?
“The European Commission’s vision is that information already paid for by the public purse should not be paid for again each time it is accessed or used, and that it should benefit European companies and citizens to the full.”
ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-pilot-guide_en.pdf

EUDAT & Open Access
Open Access?
- Funders: “Yes, absolutely!”
- Researchers: “Yes, but…”
  - Some data is “sensitive”
  - What about credit and merit?
  - How to find one’s way in the legal minefield?
What role for e-Infrastructure and service providers?
- Providing tools and services to handle sensitive data
- Licensing guidance, PIDs and usage statistics
- Training, training, training

EUDAT & Open Access
EUDAT Policy on OA (an attempt):
1. All data in the CDI should, in time, become fully open access. Open access is the norm for CDI data;
2. Embargo periods for original producers are fully supported, on condition that such data become openly accessible when the embargo period expires.
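
The two policy points can be read as a simple rule, sketched below in Python. This is an interpretation, not an EUDAT implementation: the function name is hypothetical, and it assumes a dataset becomes open on the embargo expiry date itself.

```python
from datetime import date
from typing import Optional


def is_openly_accessible(embargo_end: Optional[date], today: date) -> bool:
    """Apply the draft policy: open by default, closed only while an embargo runs.

    Assumption: the dataset opens on the expiry date itself (today >= embargo_end).
    """
    if embargo_end is None:
        # Point 1: open access is the norm when no embargo was requested.
        return True
    # Point 2: embargoed data become openly accessible once the embargo expires.
    return today >= embargo_end


# Hypothetical dates, for illustration only.
print(is_openly_accessible(None, date(2015, 9, 11)))
print(is_openly_accessible(date(2016, 1, 1), date(2015, 9, 11)))
```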

B2 Services & Open Access
- B2DROP – No (definitely not!)
  - But enables sharing and reuse
- B2SHARE – Yes (ideally)
  - Open is the default but users can still restrict access
  - Licence wizard
- B2SAFE – Yes (arguably)
  - At least metadata
  - Fine-grained authorisation
- B2FIND – Yes (metadata)

MOVING FORWARD

Make it easier for the researcher!
- e-Infrastructure integration: Network, Data, Computing
- Global solutions: Research Data Alliance (RDA)
- Cross-country collaboration

Thank you! Questions?