EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

Slides:



Advertisements
Similar presentations
The Discovery Landscape in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK – eBank UK project A centre.
Advertisements

Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
1 Adaptive Management Portal April
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Implementing Digital Object Identifiers at the GESIS Data Archive for the Social Sciences Workshop “Persistent Identifiers for the Social Sciences” Bonn,
Architecting an Extensible Digital Repository Anoop Kumar, Ranjani Saigal,Rob Chavez, Nikolai Schwertner Tufts University, Medford, MA.
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
Using IESR Ann Apps MIMAS, The University of Manchester, UK.
DASISH Metadata Catalogue Binyam Gebrekidan Gebre, Stephanie Roth, Olof Olsson, Catharina Wasner, Matej Durco, Bartholemeus Worcslav, Przemyslaw Lenkiewicz,
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
1 Interoperability of Spatial Data Sets and Services Data quality and Metadata: what is needed, what is feasible, next steps Interoperability of Spatial.
CRISP WP17 2/2 Data Continuum Achievements & Perspectives 18th March 2013Jean-François Perrin - Institut Laue Langevin - CRISP 2nd Annual Meeting1.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
Adoption of RDA-DFT Terminology and Data Model to the Description and Structuring of Atmospheric Data Aaron Addison, Rudolf Husar, Cynthia Hudson-Vitale.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
M. Stockhause 1, G. Levavasseur 2, K. Berger 1 1 Deutsches Klimarechenzentrum (DKRZ) 2 Institute Pierre Simon Laplace (IPSL) ESGF-QCWT Quality Control.
Find Research Data b2find.eudat.eu B2FIND User Training How to find data objects and collections using EUDAT’s B2FIND This work is licensed.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Metadata-based Discovery: Experience in Crystallography UKOLN is supported by: Monica Duke UKOLN, University of Bath, UK A centre of.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Developing Metadata Frameworks for Earth System Education NSDL 2003 Annual Meeting October 14, 2003 Katy Ginger and Karon Kelly DLESE Program Center.
B2find.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Discovery and Metadata March 9, 2004 John Weatherley
EUDAT’s engagement with the Earth Sciences
Usage scenarios, User Interface & tools
An Overview of Data-PASS Shared Catalog
Data Ingestion in ENES and collaboration with RDA
Integrating Data for Archaeology
Flanders Marine Institute (VLIZ)
ACS 2016 Moving research forward with persistent identifiers
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Fitness for use: Users of the U. S
knowledge organization for a food secure world
Accessing a national digital library: an architecture for the UK DNER
Heinrich Widmann EUDAT & CKAN Heinrich Widmann
Toward FAIR Semantic Resources
Maggie, Carlo, Peter, Rebecca (GEDE discussions)
Doron Goldfarb & Yann LE FRANC
Data Access and Re-use Carl Johan Håkansson EUDAT Service Area Manager
A step-by-step guide to DOI registration
Introducing da|raSearchNet
EOSC & e-Science: enabling the digital transformation of Science
B2FIND Integration and Usage
Disclosing Freedom of Information Releases
NFFA Europe.
Indicator structure and common elements for information flow
Enabling direct data access to social science research data
An ecosystem of contributions
NSDL Data Repository (NDR)
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Tech introduction.
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Research Data Management
Semantic Annotation service
Session 2: Metadata and Catalogues
Disseminating Service Registry Records
JISC Information Environment Service Registry (IESR)
A Case Study for Synergistically Implementing the Management of Open Data Robert R. Downs NASA Socioeconomic Data and Applications.
Bird of Feather Session
A Research Data Catalogue supporting Blue Growth: the BlueBRIDGE case
EOSC-hub Contribution to the EOSC WGs
Metadata supported full-text search in a web archive
WISE and INSPIRE By Albrecht Wirthmann, GISCO, Eurostat
Presentation transcript:

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal Heinrich Widmann, DKRZ DI4R 2016, Krakow, 28 September 2015

Outline EUDAT and the B2 Service Suite Guidelines and Concepts B2FIND – EUDAT’s Discovery Service MD Ingestion and the B2FIND Schema Disciplines, Communities and the MD catalogue Data Access Identifiers Discovery Portal Outlook and Summary

EUDAT and the B2 Service Suite

EUDAT The project European Data Infrastructure (EUDAT) funded by the EU Horizon2020 program started in 2011, now in 2nd phase 'EUDAT2020', will end 2018 >= 2018 : agreement of cooperation Motivation : Manage the rising tide of research data Improve Interoperability in a wide cross-disciplinary scope Objective : Build up a Collaborate Data Infrastructure, based on common data services driven by requirements of the research communities

B2 Service Suite  http://www.eudat.eu/services

Guidelines and Concepts

The FAIR principles B2FIND approach Findability := “Ease with which information can be found” Powerful and easy-to-use search features and functionalities Accessibility := “Ability to access [ ] data stored within repositories” Unique and persistent identification and resolvability of data objects Interoperability : “Ability of multiple systems with different [] structures to exchange data with minimal loss of content []" (NISO) Comprehensive cross-disciplinary MD catalogue based on common standards and by minimising loss of information Reuseability := “Ability to re-use data created by others” Cross-discipline approach and catalogue covering multiple sources

Levels of Interoperability Heterogeneity Homogeneity Research Communities (Data Provider) Data Repositories (e.g. B2SHARE/B2SAVE or Agreggator as DataCite) Service Provider ( e.g. EUDAT-B2FIND ) Info Loss Schema A Information Loss MD generation ! Collect and extract MD 010101010101010 Schema B2S B2FIND harvest and mapping Schema B Schema B2FIND MD generation 010101010101010 Schema C MD generation 010101010101010 010101010101010 ! 010101010101010 EUDAT B2FIND DI4R2016 28 September 2016

B2FIND MD Ingestion and Common Schema

B2FIND Ingestion Workflow Mapping specification : XPATH rules Community specific MD schemas and … Harvest specification : OAI-URL OAI subsets MD formats For joining B2FIND only a few preconditons has to be fulfilled Harvesting endpoint Spec. of MD format Gurantee data synchronisation by frequent and incremental data harvesting Data provider (Community) MD Generation and Specification User (Scientist or Researcher) MD Provider A MD Harvesting MD Provider MD Provider Mapping and Validation Uploading and Indexer Search and Data Access EUDAT-B2FIND

B2FIND MD Schema (extract) Metadata Type B2FIND Field name Allowed values Semantic definition Level of Obligation Occurence General information Title Free text (unicode) A name or title a resource is known Mandatory 1 Description Free text Additional info Recommended 0-1 Data Access Source Valid URL or URN Unique link to data resource Mandatory (1) 1-3 PID Persistent Identifier + persistent and resolvable DOI Digital Object Identifier + citable Provenance data Creator ‘;’-sep. list of names Main researchers involved in data prod. 0-n Discipline List of values from CV Field of research (Controlled Vocab) Publication Year YYYY The year data are published Formal data Temporal Coverage Interval of 2 DTimes [ Begin, End ] The temporal limits of a date-time Optional 1-n Spatial Coverage Spatial box or point [[minlat,minlon…]] The spatial limits of a place. + more facets ???

B2FIND Disciplines, Communities and MD Catalogue

The Facet ‚Discipline‘ Controlled Vocabulary “Fields of Knowledge” / Humanities Social sciences Natural sciences Professionals Archaeo- logy Earth Sciences Arts History Linguistics Biology Physics Engineering …. Material science Elementary Particle Physics taken from “List of Academic disciplines”  http://en.wikipedia.org/wiki/List_of_academic_disciplines_and_sub-disciplines and „The Fields of Knowledge“  http://www.thingsmadethinkable.com/item/fields_of_knowledge.php?focus=natural_sciences Crystallography

Coverage of Disciplines in B2FIND

B2FIND MD Catalogue Ingestion status Humanities B2FIND MD Catalogue Ingestion status Social Sciences Natural Sciences Cross Discipline 17 communities > 450000 MD records

B2FIND Data Access

Data Access Identifiers Resolvability and ‚Levels of aggregation‘ B2FIND Resource </> <//dc:identifier value> XML Data Collection 010101010101010 Resolution and Access Handle Server Type Unique Persistent Resolvable Citable DOI PID x URL (Source) ? Stricter Policies 010101010101010 Source PID DOI B2FIND Metadta Landing Page DOI Resolver PID_1 010101010101010 PID_2 PID_3 EUDAT B2FIND DI4R2016 28 September 2016 20

Coverage of Data Access Identifiers

B2FIND Discovery Portal

B2FIND Discovery Portal Faceted Search and Data Access B2FIND provides ‘faceted’ search for Free text Geo spatial Temporal coverage Publication year Textual facets as Tags Creator Discipline etc. Dataset view provides display of metadata : Spatial extent Table of field-value pairs Links to data resources

Outlook and Summary

Outlook Handle scalability and granularity issues ‘Levels of aggregation’ Metrics for Key Indicators and Metadata Quality Establish content-related quality assurance Add further search and distribution channels, e.g. Use linked data : Potential for semantic enrichment ‘Annotation’ functionality : Users link datasets to external reference materials (vocabularies, ontologies, etc.) Query-based Taxonomies : Enabling hierarchical search, e.g. in trees of ‘Disciplines’

Summary EUDAT-B2FIND established an operative service based on agreed standards and guidelines as the FAIR principles, provides a discovery portal with powerful search functionalities and is based on a unique catalogue of research data , combining many heterogeneous and cross-discipline sources Improved interoperability is achieved by homogenisation to a common metadata schema Further efforts are made to address the demands of the communities and data projects, to adapt the system for future challenges

Thank you for your attention ! Links : info : http://eudat.eu/b2find portal : http://b2find.eudat.eu Contact www.eudat.eu/support-request widmann@dkrz.de