Presentation is loading. Please wait.

Presentation is loading. Please wait.

© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language.

Similar presentations


Presentation on theme: "© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language."— Presentation transcript:

1 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language Technology Lab, German Research Center for Artificial Intelligence (DFKI) German Research Center for Artificial Intelligence (DFKI) Saarbrücken, Germany

2 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 2 Tutorial: CERIF 2008 Release Outline  Active People  What is CERIF?  Explanations (Metadata, data-centric, Model)  The CERIF Model (Entities, Relationships, Structure)  The CERIF Semantic Layer in some Detail  The CERIF XML Interchange Format  Related Activities  The CERIF Evolution  The CERIF Aim and Current Activities

3 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 3 Tutorial: CERIF 2008 Release Active People Active participation in current release (2008):  Brigitte Jörg, (German Res Center for AI) TG Leader  Keith G. Jeffery (UK Science and Techn Facilities Council)  Geert van Grootel (Flemish Ministry)  Anne Asserson (University Bergen)  Henrik Rasmussen (atira A/S)  Adrian Price (University Copenhagen)  Thomas Vestam (atira A/S) Active participation in past release (2006):  Ojars Krast (uniCRIS AG)  Edward Grabczewski (UK Science and Techn Facil Council)

4 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 4 Tutorial: CERIF 2008 Release What is CERIF ? Common European Research Information Format (1) data model (data-centric focus) (2) allows for a (metadata) representation of –research entities –their activities / interconnections (research) –their output (results) (3) enables quality maintenance, archiving, access and interchange of research information (4) supports knowledge transfer to researchers / research managers / research strategists / publication editors / media / brokers / the general public

5 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 5 Tutorial: CERIF 2008 Release Metadata ? Book: Title: The Hitchhiker‘s Guide to the Galaxy Date of Publication: 1979 Game Cover Image: The Hitchhiker‘s Guide to the Galaxy Source: http://egotron.com/http://egotron.com/ Retrieved: May 30, 2008 Radio Series: Title: The Hitchhiker‘s Guide to the Galaxy Description: is a science fiction comedy series created by Douglas Adams. Originally a radio comedy broadcast on BBC Radio 4 in 1978, […] Source: Wikipedia Date of Query: May 30, 2008 Series of five Books: Title: The Hitchhiker‘s Guide to the Galaxy. Between: 1979 - 1982 TV Series: Title: The Hitchhiker‘s Guide to the Galaxy Screened: 1981 Computer Game: Title: The Hitchhiker‘s Guide to the Galaxy Released: 1984 Comic Book Adaptions: Title: The Hitchhiker‘s Guide to the Galaxy Between: 1993 – 1996 Links: http://www.bbc.co.uk/cult/hitchhikers/ HTML-Title: Cult – The Hitchhiker‘s Guide to the Galaxy http://en.wikipedia.org/wiki/The_Hitchhiker' s_Guide_to_the_Galaxy HTML-Title: The Hitchhiker's Guide to the Galaxy Data about Data Structure: Type of Resource Title Description Source Date Author, Creator, …

6 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 6 Tutorial: CERIF 2008 Release What is Metadata ? „Metadata is structured data which describes the characteristics of a resource.” An Introduction to Metadata, by Chris Taylor, University of Queensland An Introduction to Metadata “Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation.” Metadata in a Nutshell, by Michael Day, UKOLN Metadata in a Nutshell

7 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 7 Tutorial: CERIF 2008 Release What is Metadata for ? „Metadata is structured data which describes the characteristics of a resource.” An Introduction to Metadata, by Chris Taylor, University of Queensland An Introduction to Metadata “Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation.” Metadata in a Nutshell, by Michael Day, UKOLN Metadata in a Nutshell Support a Wide Range of Operations

8 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 8 Tutorial: CERIF 2008 Release What is data-centric ? CitationTypes Type: Description: Publication URI: Type: Title: PartOf: PublDate: Article Requests 2007 Journal X = 4 Journal Y = 0 Journal Z = 15 Ends in 2010 Journals: Y, Z Organisation URI: Name: Abbreviation: Publications: Academic Staff: Journal Publications 2007 Institute A = 4 Institute B = 10 Institute C = 9 Organisation URI: Name: hasAccess: EndOfAccess ContactPerson: Journal Subscriptions Journal X = 1990 - 2000 Journal Y = 2005 - 2010 Journal Z = 2001 - 2010 PhD Students 2008 Computer Science = 200 Physics = 50 Social Sciences = 9 First Author / No of Papers Person H = 10/35 Person I = 4/12 Person J = 1/10 Citations in 2007 Paper M (publish 2007) = 20 Paper N (publish 2004) = 100 Paper O (publish 2001) = 0 Data Metadata

9 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 9 Tutorial: CERIF 2008 Release What is data-centric ? –Data / Metadata in the center – Data Maintenance, Curation, Preservation and Quality a major interest – Enabling added-value Services based on quality data – Enabling requested views for various stakeholders based on quality data

10 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 10 Tutorial: CERIF 2008 Release What is a model ?  A model is a simplified view to describe a particular area of interest  It allows for a better communication between interested parties  It supports mutual understanding  It supports (re-)design decisions  It supports documentation  It can be exchanged, re-used, iterated, extended A B informs

11 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 11 Tutorial: CERIF 2008 Release C ommon E uropean R esearch I nformation F ormat

12 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 12 Tutorial: CERIF 2008 Release C ommon E uropean R esearch I nformation F ormat A model to manage Research Information Research Entities Project, Person, Organisation, Publication Funding Programme, Service, Equipment, Patent, Product, … Activities / Interconnections in the Research Context Relationships Semantics / Roles / Types -> to exchange research information -> to enable interoperability -> to build CRISs

13 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 13 Tutorial: CERIF 2008 Release CERIF Structure  Core Entities  2nd Level Entities  Link Entities  Language-related Entities  Classification Entities (Semantic Layer)

14 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 14 Tutorial: CERIF 2008 Release Core Entities

15 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 15 Tutorial: CERIF 2008 Release Core Entities Publication ID URI Title Subtitle Abstract Keywords Bibl. Note PublicationDate TotalPages StartPage EndPage Classifications Person ID URI Sex FirstNames OtherNames FamilyNames NameVariants ResearchInterest Keywords Classifications Project ID URI Acronym StartDate EndDate Title Abstract Keywords Classifications Organisation ID URI Acronym Name HeadCount CurrencyCode Turnover ResearchActivity Keywords Classifications

16 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 16 Tutorial: CERIF 2008 Release 2nd Level Entities

17 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 17 Tutorial: CERIF 2008 Release 2nd Level Entities Facility ID URI Name Description Keywords Classifications FundingProgramme ID URI Name CurrencyCode Budget StartDate EndDate Description Keywords Classifications Event ID URI Name FeeOrFree StartDate EndDate CityTown CountryCode Description Keywords Classifications ResultPatent ID URI PatentNumber Title CountryCode RegistrationDate ApprovalDate Description Keywords Classifications Service ID URI Name Description Keywords Classifications

18 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 18 Tutorial: CERIF 2008 Release Link Entities

19 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 19 Tutorial: CERIF 2008 Release Link Entities Person_Publication persID publID Classifications StartDate; EndDate Project_Person projID perslID Classifications StartDate; EndDate Organisation_Publication orgID publID Classifications StartDate; EndDate Project_Publication persID publID Classifications StartDate; EndDate Project_Organisation projID orgID Classifications StartDate; EndDate Person_Organisation persID orgID Classifications StartDate; EndDate Project_Publication projID publID Classifications StartDate; EndDate

20 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 20 Tutorial: CERIF 2008 Release Language-related Entities

21 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 21 Tutorial: CERIF 2008 Release Language-related Entities Publication Title [language] Abstract [languange] Keywords [language] Organisation Name [language] ResearchActivity [languange] Keywords [language] Project Title [language] Abstract [languange] Keywords [language] Person ResearchInterest [language] Keywords [language] Facility Name [language] Description [languange] Keywords [language] Service Name [language] Description [languange] Keywords [language] Patent Name [language] Description [languange] Keywords [language] Product Name [language] Description [languange] Keywords [language]

22 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 22 Tutorial: CERIF 2008 Release Classification Entities (Semantic Layer)

23 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 23 Tutorial: CERIF 2008 Release Classification Entities (Semantic Layer) Classification ClassID ClassSchemeID Term [language] Description [language] StartDate, EndDate URI ClassificationScheme ClassSchemeID Description [language] URI Classification_Classification ClassID1 (Term1) ClassID2 (Term2) ClassSchemeID1 (Schema1) ClassSchemeID2 (Schema2) ClassId (Role) ClassSchemeID (RoleSchema) StartDate, EndDate ClassScheme_ClassScheme ClassSchemeID1 ClassSchemeID2 ClassID (Role) ClassSchemeID (RoleSchema) StartDate, EndDate

24 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 24 Tutorial: CERIF 2008 Release Classification Entities (Semantic Layer) Publication_Classification PublicationType=Journal Publication_Classification ReviewType=peer-reviewed Publication_Classification Person Classification Publication_Classification AccessType=openAccess Publication_Classification Publication Publication_Classification ImpactFactorType=diametric Publication_Classification Category=commissioned Publication_Classification

25 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 25 Tutorial: CERIF 2008 Release Classification Entities (Semantic Layer) personID Lastname OtherNames FirstName Sex publicationID Title Abstract Keywords Date Book Article Technical Report Thesis …… isAuthor isEditor isReviewer … Relationship personID publicationID Relationship personID publicationID Person Classification Person Classification Publication Person Classification

26 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 26 Tutorial: CERIF 2008 Release Semantic Layer Some CERIF Types

27 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 27 Tutorial: CERIF 2008 Release Semantic Layer Some CERIF Relationship Roles

28 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 28 Tutorial: CERIF 2008 Release Semantic Layer Many Schemas (publicly) available For Publication Records: –Dublin Core –Marc Code –Digital Item Declaration Language (DIDL) –Metadata Object Description Schema (MODS) –… For Audio/Video Files: –Metadata Encoding and Transmission Standard (METS) –… For Subject Headings: –Ortelius Thesaurus –…

29 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 29 Tutorial: CERIF 2008 Release Classification Entities (Semantic Layer)  Allows to capture any Schema or Structure Flat Lists Taxonomies Ontologies  Open / Extensible in all directions New Schemas New Concepts / Terms New Relationships  Enables to manage Roles / Types Semantics Subject Headings Archiving (Time component)  Allows for simple Mappings between Schemas  Allows for a efficient (independent) Maintenance

30 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 30 Tutorial: CERIF 2008 Release XML Interchange Format  According to W3C Standards  Refers to XML Schemas for Validation  XML files corresponding to Entities / Separation of Relationships 1 Anne Asserson http://www.linkedin.com1 female 2 Keith Jeffery G. http://www.linkedin.com2 male --- 1 Grey in the R&D Process 2006 http://www.epubs.org/ID1 2 What‘s new in Grey Literature … 2005 http://www.greynet.org/thegrey journal.html?ID2 ---

31 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 31 Tutorial: CERIF 2008 Release CERIF 2008 Release  Model Introduction and Specification Document  Full Data Model, SQL Database Scripts  XML Data Exchange Specification Document  XML Example Files  XML Schemas for XML Validation  CERIF Types / Roles / Semantics as XML  http://www.eurocris.org/http://www.eurocris.org/ June / July 2008

32 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 32 Tutorial: CERIF 2008 Release What is going on ? Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/

33 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 33 Tutorial: CERIF 2008 Release What is going on ? JISC Report from April 2008 “Metadata for digital libraries: state of the art and future directions” by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf  Many available Schemas (DC, METS, MODS, …)  Each schema was singularly developed and not designed as an overal architecture to cover integrated object entities  JISC recommends therefore to overcome the problem by best practise guidelines and pragmatic application  Issues of duplicate information (overlap in sections of metadata) need rules and are currently being addressed by the library community in good practise guidelines

34 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 34 Tutorial: CERIF 2008 Release What is going on ? JISC Report from April 2008 “Metadata for digital libraries: state of the art and future directions” by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf –Descriptive Metadata (intellectual contents) –Administrative Metadata (technical metadata [file formats], rights management, provenance [info on creation, subsequent treatment, responsibility, …]) –Structural Metadata (internal structure of items: e.g.: page order, …) METS DIDL …

35 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 35 Tutorial: CERIF 2008 Release What is going on ? JISC Report from April 2008 “Metadata for digital libraries: state of the art and future directions” by Richard Gartner http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf  XML is of great importance to embed and make use of namespaces  Combining Metadata standards, even a limited such as described above, will always be messier than utilising a single standard that combines their taxonomic powers and resolves any potential clashes or duplications between them.  Integration by itself would of course be of little consequence if the standards themselves failed to address the metadata needs of the digital library community. In this respect, the provenance of each standard is of some importance. All have been constructed by authoritative standard setters within their communities.  Most of the mentioned standards have proved their ability to meet the requirements of major and highly complex digital collections.

36 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 36 Tutorial: CERIF 2008 Release What CERIF aims for Source: http://maps.repository66.org/; Reported on: http://www.sparceurope.org/

37 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 37 Tutorial: CERIF 2008 Release What CERIF aims for Enabling the ERA eInfrastructure Standardization / Integration / Interchange Added-Value Services Middle (Interoperability)-Layer for EU Research Information

38 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 38 Tutorial: CERIF 2008 Release What CERIF aims for The ultimate answer to Life, the Universe, and Everything. from “The Hitchhiker’s Guide to the Galaxy” by Douglas Adams

39 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 39 Tutorial: CERIF 2008 Release Example: Generating Publication Reference Records BibTex Endnote... @article{615182, author = {Veda C. Storey}, title = {Understanding semantic relationships}, journal = {The VLDB Journal}, volume = {2}, number = {4}, year = {1993}, issn = {1066-8888}, pages = {455--488}, doi = {http://dx.doi.org/10.1007/BF01263048}, publisher = {Springer-Verlag New York, Inc.}, address = {Secaucus, NJ, USA}, } Best Practice Guide

40 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 40 Tutorial: CERIF 2008 Release The CERIF Evolution EU Working Group on Research Databases Workshop 1987 1991 CERIF 91 PROJECT Similar Ideas UN/UNESCO OECD CODATA Acronym: ERGO Participant: Keith Jeffery, Anne Asser son, many more Organisations: Rutherford Appleton, Uni- versity of Bergen, … Acronym: ERGO Participant: Keith Jeffery, Anne Asser son, many more Organisations: Rutherford Appleton, Uni- versity of Bergen, … 2000 CLASSIFICATION RESULTSEQUIPMENT PROJECT OrgUnitPERSON EXPERTISE Roles CERIF 2000 Model - Networking of DBs - Exchange of Records - Recommendation to Member States - Data Model (RDBMS, OO, IR) - Multilinguality - Controlled Vocabulary - Roles / Types - User-driven - EC Recommendation to Member States 2ndLevel CORE Language Semantics Link CERIF 2006 / 2008 Model - Data Model (RDBMS, OO, IR) - Model Normalization - Robust Structure - Extensible Structure - Consistent Structure - Semantic Layer - XML Exchange Specification - Connectivity to Repositories (Elaboration on Publication) 20062008

41 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 41 Tutorial: CERIF 2008 Release CERIF 91 –published in a first release –recommended to Member States to harmonise databases on research projects ease exchange of comparable information guidelines for building research databases –only dealt with research project records –demonstrated in the ERGO pilot project access to more than 80.000 project records from more than 20 national information services –demonstrated the feasability of exchange –identified the need for more detailed guidelines –confirmed the need to revise CERIF and extend it to other types of research information, not only projects –revision activities started in 1997 co-ordinated by the EC –led to CERIF 2000

42 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 42 Tutorial: CERIF 2008 Release CERIF 2000 –a full CRIS data model with flexibility to accomodate many database structures –a base framework for data exchange –multilingual subject indexing (Ortelius Thesaurus) –recommendations for controlled attribute values –reflection on user groups and requirements –types of research information –metadata environment as a uniform summary view –extensions to Organisations Persons Results: Products, Patent, Publication Expertise Equipment and Facilities

43 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 43 Tutorial: CERIF 2008 Release Major Current Activities UK: Research Councils specified to use CERIF as the format for IT processes and MM information UK: STFC (Corporate Data Repository) BE: Flanders – CERIF as Standard Interchange Format DK: Danish Universities PURE -> CERIF EUROPE ESF: CERIF for IS under discussion CORDIS, EC R&D Service: Asked for CERIF presentation EuroHORCS: Recommendation for CERIF; join as a euroCRIS member, from its taskgroup

44 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 44 Tutorial: CERIF 2008 Release Major Current Activities Videolectures.net (Teaching Videos) ICERIS (IS): Access to Information on Icelandic Research Projects & R&D Results AURIS-MM (AT): Provides access to Austrian University Research extended with multimedia SICRIS (SI): Access to University Research in Slovenia HUNCRIS (HU): Access to R&D in Hungary SRIS (UK): Scottish Research Information Systems, public research in Scotland CRIS-MER (EC): Research information on Migration and ethnic Relations (planned) STFC (UK): Corporate Data Repository METIS (NL): currently used by Dutch Universities Fdok (NO): University of Bergen, results FRIDA (NO): Joint university CRIS IWETO (BE): Integrating Flemish Research Information BioDiversa ERANET (project) IST World SSA (project)

45 © Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 45 Tutorial: CERIF 2008 Release CERIF TG Activity  Regular CERIF TG meetings and Discussions  Tests and major bugfixes before Releases  Strong Relation to ongoing implementation activities (Geert van Grootel, EWI, Flanders; atira A/S, Aalborg, Denmark)  Exchange with Best Practice (Ales Bosniak, IZUM, Slovenia)  Collaborate with new TG Institutional Repositories (IR-CERIF) (Anna Clements, University of St. Andrews, UK)  Next Steps:  Extension of Semantic Layer with Content  Check Tools for Managing the Semantics  Mappings of major Schemas (Standards)  Check OAI Wrapping  CERIF Ontology


Download ppt "© Brigitte Jörg June 4th, 2008 in Maribor, Slovenia 1 Tutorial: CERIF 2008 Release CERIF 2008 Tutorial Brigitte Jörg, M.A. (Information Science) Language."

Similar presentations


Ads by Google