CERIF: Common European Research Information Format An international standard relational data model for storage and interoperability of research information.

Slides:



Advertisements
Similar presentations
©euroCRIS/Keith G JefferyOA Workshop May 2010 CNR Roma The euroCRIS view of the Rome OA Workshop Keith G Jeffery President, euroCRIS
Advertisements

ERA-Nets as Information Sources Gerry Lawson, UK Natural Environment Research Council.
© Keith G Jeffery, Anne G S Asserson GL6: New York: December 2004: IP & Corporate Context Relating Intellectual Property Products to the Corporate.
© Keith G Jeffery, Anne G S Asserson GL 11 Washington Keith G Jeffery Director, IT & International Strategy, STFC
Data Models There are 3 parts to a GIS: GUI Tools
Helping people find content … preparing content to be found Enabling the Semantic Web Joseph Busch.
Andrei Lopatenko. CERIF Task Group CERIF TG Report 2001-Marth 2002 results Plans for 2002.
CERIF-CRIS Overview Keith G Jeffery
Lecture 7 Access Control
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
Databases & Data Warehouses Chapter 3 Database Processing.
CERIF-CRIS Overview Keith G Jeffery
New Task Group CRIS Architecture & Development Maximilian Stempfhuber RWTH Aachen University Library
Russian academy of sciences Siberian branch Institute of computational technologies Virtual museum of of Siberian branch of the Russian academy of sciences.
© Keith G Jeffery & Anne AssersonCERIF Course: Data Model CERIF COURSE Session3: DataModel 1 Keith G Jeffery, Director, IT CLRC
CERIF: Past, Present and Future 1 An Overview Anne Asserson, UiB (NO) Keith G Jeffery, CLRC (UK) Andrei Lopatenko, MU (UK) Kassel, 29 – 31 August 2002.
COMPUTING FOR BUSINESS AND ECONOMICS-III. Lecture no.6 COURSE INSTRUCTOR- Ms. Tehseen SEMESTER- Summer 2010.
Concluding Remarks Ed Simons, President of euroCRIS MEMBERS MEETING, BONN, 14 MAY 2013.
Is a Current Research Information System (CRIS) a critical corporate system for HEIs? A Case Study from the University of St Andrews Anna Clements, Assistant.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
U.S. Department of the Interior U.S. Geological Survey Data Integration Progress and Guiding Principles Disciplines, generalization, and open-access. David.
CRISs, CERIF, CASRAI and Snowball Metrics (Why) are these key to University Libraries? Anna Clements, Assistant Director (Digital Research), University.
An Overview of the Research Information Metadata Ecosystem Prof Keith G Jeffery ©Keith G JefferyAn Overview.
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
Integrating aspects of interoperability, policy and governance FRIS: a research information infrastructure in Flanders euroCRIS Strategic Members meeting.
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
T HE SK CRIS - INFORMATION SYSTEM ON RESEARCH, DEVELOPMENT AND INNOVATION Zendulkova Danica.
Donatella Castelli CNR-ISTI
CERIF Common European Research Information Format Jan Dvořák, Jiří Souček, Tomáš Chudlarský  Institute of Information Studies & Librarianship Faculty.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
CERIF for Datasets: Background and Key Findings Workshop, London 26 th July 2013 CERIF slides reproduced from presentations by euroCRIS members : Keith.
Controlled Vocabulary Working Group Virtual Water Cooler Session April 6-7, 2009 Moderator: John Porter rm.action?confKey=jhp7e.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
November 2005Anne Asserson Research Administration Department,UiB How can you assess your organisation through a CRIS? The Norwegian national CRIS, FRIDA,
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Data modeling using the entity-relationship model Chapter 3 Objectives How entities, tuples, attributes and relationships among entities are represented.
Anne AssersonCRIS seminar, Brussels, September CERIF CRIS seminar, Brussels September 2003 Anne Asserson, University of Bergen.
Database Design. The process of developing database structures from user requirements for data a structured methodology Structured Methodology - a number.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
The Geographic Information System of the European Commission (GISCO) By Albrecht Wirthmann, GISCO, Eurostat ESPON.
Auditing Grey in a CRIS Environment
Data sources of the EuroGroups Register Presentation by Eurostat
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
Title of the presentation | Date |1 Nikos Houssos National Documentation Centre (EKT/NHRF) CRIS for research information management.
1 Metadata Elements and Domain Groups - Keith G Jeffery.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Analysis of Use Cases (and to some extent, standards) - Keith G Jeffery, Rebecca Koskela.
Catherine Tabone 04 June 2014 European Legislation Identifier (ELI)
GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number.
1 The Metadata Groups - Keith G Jeffery. 2 Positioning  Raise profile of metadata  Data first  Also software, resources, users  Achieve outputs/outcomes.
LEuroCRIS 31st October 2001 A list of CERIF 2000 problems Walter Niedermayer
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge R&D Statistics Information System: An Interoperability Tail between CERIF and SDMX Dimitris Karaiskos Dimitrios.
Data and classification governance, key to semantic interoperability of CRIS systems in inter-organizational contexts Sadia Vancauwenbergh
CERIF for research evaluation including REF Keith G Jeffery STFC / euroCRIS 20/04/20111Ready 4 Ref End of Project Workshop
Metadata Issues in Long-term Management of Data and Metadata
European Publication Information Service - a Pilot Project
Integrating Data Management Plan functionality in the CRIS
CORDIS datasets on the EU Open Data Portal
CRIS seminar, Brussels September 2003
Metadata for research outputs management Part 2
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
The European Statistical System
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Welcome!.
SDMX Information Model: An Introduction
Open Science: the crucial importance of metadata
Presentation transcript:

CERIF: Common European Research Information Format An international standard relational data model for storage and interoperability of research information. Official EU Recommendation to Member States. Reference model for the development of Research Information Systems (CRIS) Standard exchange format (CERIF-XML) for interoperability between systems. Strong points of CERIF: Broad coverage: includes all aspects of RI (projects, persons, organisations, funding, publications, datasets, patents, products, bibliometrics, impact indicators, equipment, etc…). Fine-grained structure and flexible architecture, allowing: In- and output of virtually any (meta)dataformat used in the RI Domain The expression (“translation”) of virtually any formalized use case. The ingestion of an unlimited number of controlled vocabularies (taxonomies, thesauri,...) – “semantic layer”. CERIF = comprehensive “legobox” of research information metadata.

From a report by a working group of the European Parliament STOA-Report: Measuring Scientific Performance for Improved Policy Making, p. 14.

Key feature of CERIF: Linking Entities Basic principle of CERIF: most of the characteristics (attributes) of an object (entity) are not stored with the entity (in the entity table) but expressed through “linking entities” (in database terms: linking tables), allowing multiple roles/characteristics to be expressed for the same aspect. Only the absolute unique characteristics of an entity are stored in the entity table. E.g. for the entity “Person” only a unique identifier, the gender and the birth date are stored with the entity itself. PERSON Identifier Gender Birth date

Key feature of CERIF: Linking Entities Linking entities are used to: Express the type and meaning of a relation between two objects in the field of research information, e.g.: the role of a researcher in a project, or in a publication. Classify objects according to a given classification scheme (controlled vocabulary, thesaurus), e.g.: classify a project or a publication according to a keyword list used in a discipline. Map various classification schemes to each other, e.g. map the Biochemistry keyword list used by the Medical discipline to the one used by Biologists; or another example: mapping researcher classifications used in one country to the one used in another country, etc…. Linking entities have a start- and enddate, so that the exact time frame of each relation or classification is always known. An in principle endless number of linking entities (roles, typologies...) can exist between objects, e.g: a person can be a researcher but at the same time manager of a project.

Example 1: a researcher with various roles in a project PROJECT PUBLICATION DATASET PROJECT PERSON_PROJECT. PERSON Person Id Project Id Role: researcher Start: End: PERSON_PROJECT Person Id Projec Id Role: manager Start: End: The person is researcher in the project from 2010 on and from March 2011 on he is also the manager of the project. To express this in CERIF there are two records in the “Person-Project” linking entities table, one for each role, with start- and enddate.

Example 2: Classifying a given publication according to the subject field or area it belongs to. PERSONPROJECT PERSON_PROJECT Person Id Projec Id Meaning (e.g.: researcher, manager) PERSONPUBLICATION PERSON_PUBL. Person Id Publ. Id Meaning (e.g.: 1st author, editor...) PUBLICATION DATASET PUBL_DATASE Publ. Id Dataset Id Meaning (e.g.: based on) PUBLICATION CLASSIF-SCHEME PUBL_CLASS Publ_Id Class_term Id Class_Scheme Id Start: End: E.g. Subject Areas Linking entity connecting the publication to the correct subject area. Notice: also here more linking entities (records) are possible since a publication may be relevant for various disciplines or subject areas.

Example 3: Map various classification schemes (e.g. keyword lists) PERSONPROJECT PERSON_PROJECT Person Id Projec Id Meaning (e.g.: researcher, manager) PERSONPUBLICATION PERSON_PUBL. Person Id Publ. Id Meaning (e.g.: 1st author, editor...) PUBLICATION DATASET PUBL_DATASE Publ. Id Dataset Id Meaning (e.g.: based on) CLASSIF-SCHEME CLASS_CLASS Class_term Id A Class_term Id B Relation (e.g. Is equal to, is subtype of, etc...) E.g. Biochemstry keywords as used by Biologists CLASSIF-SCHEME E.g. Biochemstry keywords as used by Medical disciplines Classification Scheme A Classification Scheme B Linking table mapping a term from Scheme A to a term from Scheme B, including expression of the nature or meaning of the relation between the two terms.

All the classifications and typologies are stored in a separate part of the CERIF model, the Semantic layer. PERSONPROJECT PERSON_PROJECT Person Id Projec Id Meaning (e.g.: researcher, manager) PERSONPUBLICATION PERSON_PUBL. Person Id Publ. Id Meaning (e.g.: 1st author, editor...) PUBLICATION DATASET PUBL_DATASE Publ. Id Dataset Id Meaning (e.g.: based on) OBJECT A OBJECT B OBJECTA_OBJECTB ObjectA Id ObjectB Id Meaning: Start: End: Class. Scheme 1 E.g. Roles of a Person in a Project. Class. Scheme 2 E.g. Roles of a Person in a Publication. Class. Scheme 3 E.g. Types of Research Results Class. Scheme N E.g. Keywords used to classify research in a given discipline Semantic layer holding the Controlled Vocabularies to express the Meaning of the relation.

Implemented in a CRIS these are all Database Tables. Class. Scheme 1 E.g. Roles of a Person in a Project. Class. Scheme 2 E.g. Roles of a Person in a Publication. Class. Scheme 3 E.g. Types of Research Results Class. Scheme N E.g. Keywords used to classify research in a given discipline TABLESTABLES TABLESTABLES PERSONPROJECT PERSON_PROJECT Person Id Projec Id Meaning (e.g.: researcher, manager) PERSONPUBLICATION PERSON_PUBL. Person Id Publ. Id Meaning (e.g.: 1st author, editor...) PUBLICATION DATASET PUBL_DATASE Publ. Id Dataset Id Meaning (e.g.: based on) OBJECT A OBJECT B OBJECTA_OBJECTB ObjectA Id ObjectB Id Meaning: Start: End: Semantic layer holding the Controlled Vocabularies to express the Meaning of the relation.

Real life example of (part of) CERIF with linking entities (pink) filled in.

Real life example: applying multiple keyword classifications to a publication

Real life example: Mapping Classification Schemes in CERIF

Example of a real life use case

CERIF: summarising CERIF: Comprehensive “legobox”covering all aspects of research. Can express virtually any combination of data from/into virtually any use case. Widely recognized within Europe as a standard storage model for research information metadata and a standard for interoperability of research information (CERIF-XML). Reference for development of CRIS, e.g.