Www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 EUDAT Services.

Slides:



Advertisements
Similar presentations
EUDAT Towards a pan-European Collaborative Data Infrastructure Ari Lukkarinen CSC-IT Center for Science, Finland APA Conference, November 6th, 2012.
Advertisements

EUDAT Data Services for Research “The Story” Per Öster Director, Research Infrastructures CSC – IT Center for Science Ltd.
January, 23, 2006 Ilkay Altintas
DCC Conference, Glasgow November, Digital Archive Policies and Trusted Digital Repositories MacKenzie Smith, MIT Libraries Reagan Moore, San Diego.
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
CLARIN Infrastructure Vision (and some real needs) Daan Broeder CLARIN EU/NL Max-Planck Institute for Psycholinguistics.
Schets van het landschap Deel C Presentatie EUDAT.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
PRACE-2IP WP10 - iRODS workshop iRODS CINES Gerard GIL (CINES) – (Linkoping September 2012)
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No The pan-European.
Sync and Exchange Research Data b2drop.eudat.eu This work is licensed under the Creative Commons CC-BY 4.0 licence B2DROP EUDAT’s Personal.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT The European.
EUDAT: Data sharing and management in a collaborative data infrastructure Rob Baxter, EPCC, University of Edinburgh.
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No B 2 DROP User.
Replicate Research Data Safely eudat.eu/b2safe B2SAFE How to replicate your data using EUDAT’s B2SAFE Version 3 November 2015 This work is.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No B2SHARE How to.
Store and Share Research Data b2share.eudat.eu B2SHARE How to share and store research data using EUDAT’s B2SHARE This work is licensed under.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No B2ACCESS LSDMA.
b2access.eudat.eu B2ACCESS The simple and secure authorisation and authentication platform of EUDAT This work is licensed under the Creative.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Working.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Data Preservation.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT EGI interoperability.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No The Data Type.
CLARIN EUDAT2020 uptake plan Dieter Van Uytvanck CLARIN ERIC EUDAT User Forum, Rome.
AAI needs of the Distributed Computing Infrastructures - CLARIN Dieter Van Uytvanck Max Planck Institute for Psycholinguistics
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EPOS and EUDAT.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Aalto Data.
Get Data to Computation eudat.eu/b2stage B2STAGE How to shift large amounts of data Version 4 February 2016 This work is licensed under the.
B2access.eudat.eu B2ACCESS User Training How to register with B2ACCESS Version 1 February 2016 This work is licensed under the Creative Commons.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No The use of the.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No West-Life.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Public access.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Collaboration.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Support to scientific.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Herbadrop.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Enriching Europeana.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Aalto Data Repository.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No LTER- Europe &
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
PIDs in EUDAT Webinar, 15 Februari 2013
This work is licensed under the Creative Commons CC-BY 4.0 licence.
The EUDAT Services Suite
Tokamak data mirror for JET and MAST Moving towards an open data repository for European nuclear fusion research.
EUDAT: collaborative pan-European infrastructure providing research data services, training and consultancy This work is licensed.
EUDAT’s engagement with the Earth Sciences
AAI for a Collaborative Data Infrastructure
Engaging with Users Daan Broeder Meertens Institute & CLARIN ERIC
Data Services at CSC ©2016 OKM ATT initiative Licensed under Creative Commons BY 4.0.
Xiaogang Ma, John Erickson, Patrick West, Stephan Zednik, Peter Fox,
Mark van de Sanden Giovanni Morelli
Data Access and Re-use Carl Johan Håkansson EUDAT Service Area Manager
EUDAT Collaborative Data Infrastructure
Workshop Data curation and the EUDAT Collaborative Data Infrastructure
DATA SPHINX & EUDAT Collaboration
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
NFFA Europe.
An EUDAT-based FAIR Data Approach for Data Interoperability
European Research Data Services, Expertise & Technology Solutions
EUDAT Site and Service Registry
DATATURB Direct simulation data of turbulent flows
Joining the EOSC Ecosystem
EOSC-hub Contribution to the EOSC WGs
Check-in Identity and Access Management solution that makes it easy to secure access to services and resources.
Presentation transcript:

EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Services Mark van de Sanden, EUDAT Service Manager EUDAT User Meeting, June 2016 Barcelona

History of the EUDAT CDI Common Services for heterogeneous communities Science data rates are exploding and will likely become continue to do so Building bespoke services for new communities is not cost effective Initial Set of Services developed as result of community needs Beyond the original ‘core’ communities New services and specific community issues highlighted

If there are hundreds of Research Infrastructures, how many different data management systems can be sustained? 3

Where Does EUDAT Fit In? Community repositories Institute repositories Scientists personal data Homeless scientists Citizen scientists

Where Does EUDAT Fit In? Trust Data Curation Common Data Services Users User functionalities, data capture & transfer, virtual research environments Persistent storage, identification, authenticity, workflow execution, mining Data Generators Community Support Services Data discovery & navigation, workflow generation, annotation, interpretability

EUDAT Data Domain EUDAT Data Domain modeled on the ANDS 1 Data Curation Continiuum 1. Australian National Data Service organization –

7 Community Repositories (thematic data centres) EUDAT generic data service provider storage, workflows, processing, archive deposit access EUDAT Collaborative Data Infrastructure deposit

Who can use EUDAT service 8 Upload and download Upload, add metadata, share Periodic transfers, quality checks … Single researcher Team Community Different strategies for different usage scenarios

B2 Service Suite B2ACCESS B2Handle

EUDAT2020 Further integration with EUDAT CDI (e.g. B2SHARE) Integration with B2ACCESS to enable access by many different Identity Providers Cloud Storage Federation, collaboration with GEANT in OpenCloudMesh Assess B2DROP as workspace area to computing facilities Who Citizens Scientists and small teams What Store and exchange data Synchronize multiple versions Ensure automatic desktop synchronization Why Ease of Use Trusted European Service 10

11

EUDAT2020 Further integration with EUDAT CDI (e.g. B2DROP, B2SAFE) Integration with B2ACCESS (incl eduGAIN), focus on authorization Embargo period Editing of metadata Data versioning and annotation Extended HTTP Restful API interface Easy installable software package Who Small to Medium Teams What Store data (incl. software) and add domain meta data Share registered research data worldwide Preserve (small-scale) research data for long- term Why Register Data for Publications Make known to wider community 12

13 Collection of official RDA documents

Service Integration Bidirectional Integration

EUDAT2020 Support iRODS v4 Support metadata Optimize and extend policies to support data curation and provenance Further integration with B2ACCESS Support authorization on basis of community access rules Assess B2SAFE as workspace area to computing facilities Who Community Data Managers ‘Sophisticated’ Organisations What Provide an abstraction layer which virtualizes large-scale data resources Guard against data loss in long-term archiving and preservation Optimize access for users from different regions Bring data closer to powerful computers Why Performance Replication between trusted sites Data Preservation 15

Data Policy Manager Data policies are centrally managed Policy rules are implemented and enforced by site-local rule engines Policies describe in an abstract language Community data managers must authenticate to provide trust Support policies for data replication and integrity checking Central logging for auditable data policies to monitor execution Active collaboration with the RDA Practical Policy WG EUDAT2020 Handover to operations Extend number of policies supported Focus on data curation and provenance policies Integrate with B2ACCESS 16

Further develop HTTP to a mature interface and extend functionality to metadata Native support PIDs within GridFTP transfers Extend EUDAT client API library to other B2 services (e.g. B2SHARE, B2FIND, PID) Further integration with B2ACCESS EUDAT2020 Who Users and Communities with Significant Computational Needs What Transfer large data collections from EUDAT storages to external HPC facilities for processing Copy large data sets, ingesting them onto EUDAT storage resources Why Integration/Collaboration with PRACE Simplify Data Transfer 17

Harvesting of metadata stored in B2SAFE Community customizations Annotation of datasets Further assess RDF and Linked Data Further assess scalability and performance EUDAT2020 Who Anyone What Find collections of scientific data quickly and easily, irrespective of their origin, discipline or community Get quick overviews of available data Browse through collections using standardized facets Why Unique collection Ease of Searching 18

EUDAT 3rd Year EC Review – 21st of May Brussel 19

Develop the policies for the B2HANDLE service (e.g. PID namespace mngmt) Migrate service from Handle v7 to v8 Harmonizing PID record structure Integrate with Data Type Registry service B2HANDLE API library Consolidation with EUDAT API library Development plan Who Groups or Communities who want to make their data citable What Follows policies to register data and make it long term refer- and citable Reliability through mutual PID mirroring Provides abstraction layer between a globally unique persistent identifier and physical location of data objects Machine readable via HTTP RESTful API Why Simple integration Technology Agnostic 20

EUDAT2020 Integration with operational and B2 services B2SHARE B2DROP B2STAGE B2SAFE DPM CREG HTTP API GRIDFTP Integration with community IdP domains and portal environments Enabling access via eduGAIN Social IdPs ORCID Focus on authorization Who Anyone wanting to use the B2 Services What Complies with community ownerships and access rights, basis of trust Credential conversion approach (e.g. SAML, OpenID, X.509, Username/password) Identity provider for citizen scientists Why Use your own ID in federated environment 21

New Services in Development 23

Creation RDF triples Harvests information from ontology repositories Supports semi-automatic annotation using text mining Supports manual data annotation Easy to use user interface Integrates with the different B2 services UI for manual annotation, initial focus on annotation of metadata Setup and test Triple store Develop harvesting chain for ontologies Integration with B2SHARE and B2FIND Assess the use of Graph technologies 24 Features Development plan

Service Integration

Registration of data type and metadata definitions Provide persistent references Human and machine interpretable Integratable within community infrastructure and services Integratable within the EUDAT CDI Easy to use HTTP API interface Uptake of RDA output Assessment of the CNRI Cordra technology Provide test instance to evaluate usage with communities Define EUDAT PID and metadata structures according to DTR Integration with B2 services (e.g. B2SHARE, B2FIND, PID) 27 Requirements Development plan

29 Comes from ELIXIR and Euro/Argo, solution must be generic Automatic (re-)distribution of updatable data to data storage providers Data storage providers are inside and outside the EUDAT CDI domain Data owner must be able to mark data as subscribeable Data storage providers and individual users must be able to subscribe to data Data transfers and notifications are triggered by metadata updates Evaluate FTSv3 service for data distributions Assessing subscription policy within B2SAFE DPM service Integration of subscription mechanism within metadata repository Assess technologies for subscription processing Use Case Development plan

Questions…