© Keith G Jeffery & Anne AssersonCERIF Course: Data Model 1 20021024 1 CERIF COURSE Session3: DataModel 1 Keith G Jeffery, Director, IT CLRC

Slides:



Advertisements
Similar presentations
©euroCRIS/Keith G JefferyOA Workshop May 2010 CNR Roma The euroCRIS view of the Rome OA Workshop Keith G Jeffery President, euroCRIS
Advertisements

© Keith G Jeffery, Anne G S Asserson GL6: New York: December 2004: IP & Corporate Context Relating Intellectual Property Products to the Corporate.
© Keith G Jeffery, Anne G S Asserson GL7: Nancy: December Grey in the R&D Process Keith G Jeffery Director, IT CCLRC
© Keith G Jeffery, Anne G S Asserson GL 11 Washington Keith G Jeffery Director, IT & International Strategy, STFC
© Keith G Jeffery, Anne G S Asserson GL 10 Amsterdam Keith G Jeffery Director, IT & International Strategy, STFC
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
CERIF: Common European Research Information Format An international standard relational data model for storage and interoperability of research information.
The Relational Database Model
CERIF-CRIS Overview Keith G Jeffery
The Relational Database Model. 2 Objectives How relational database model takes a logical view of data Understand how the relational model’s basic components.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 3 The Relational Database Model.
3 1 Chapter 3 The Relational Database Model Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
©euroCRIS/Keith G JefferyA Brief History of CRISs A Brief History of CRISs Keith G Jeffery President, euroCRIS
CERIF-CRIS Overview Keith G Jeffery
The Relational Database Model
3 The Relational Model MIS 304 Winter Class Objectives That the relational database model takes a logical view of data That the relational model’s.
Entity Relationship Modeling
CERIF: Past, Present and Future 1 An Overview Anne Asserson, UiB (NO) Keith G Jeffery, CLRC (UK) Andrei Lopatenko, MU (UK) Kassel, 29 – 31 August 2002.
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Chapter 4 The Relational Model Pearson Education © 2014.
Chapter 4 The Relational Model.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 9.1.
Chapter 3 The Relational Model Transparencies Last Updated: Pebruari 2011 By M. Arief
DATABASE MANAGEMENT SYSTEMS BASIC CONCEPTS 1. What is a database? A database is a collection of data which can be used: alone, or alone, or combined /
DATABASE MANAGEMENT SYSTEMS BASIC CONCEPTS 1. What is a database? A database is a collection of data which can be used: alone, or alone, or combined /
© euroCRIS/Keith G Jeffery 1 Mutual enhancement of CERIF and project management systems Mutual Enhancement of CERIF and Project Management Systems Keith.
© Keith G Jeffery & Anne AssersonCERIF Course: Implementation CERIF COURSE Session5: Implementation Keith G Jeffery, Director, IT CLRC
© euroCRIS/Keith G Jeffery 1 Achievements & Experience of euroCRIS Achievements and Experience of the euroCRIS Organisation Keith G Jeffery President,
Concepts and Terminology Introduction to Database.
© Keith G Jeffery & Anne AssersonCERIF Course: Evolution CERIF COURSE Session 6: Evolution Keith G Jeffery, Director, IT CLRC
Database Management COP4540, SCS, FIU Relational Model Chapter 7.
Chapter 3 The Relational Model. 2 Chapter 3 - Objectives u Terminology of relational model. u How tables are used to represent data. u Connection between.
CERIF Common European Research Information Format Jan Dvořák, Jiří Souček, Tomáš Chudlarský  Institute of Information Studies & Librarianship Faculty.
©STFC/Keith G Jeffery The value of recording each step of the research process The Value of Recording each Step of the Research Process Keith.
1 The Relational Database Model. 2 Learning Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical.
©Keith G Jeffery/ Anne AssersonSupporting the Research Process with a CRIS CRIS Supporting the Research Process with a CRIS Keith G Jeffery Director.
Information Systems & Databases 2.2) Organisation methods.
1 Relational Databases and SQL. Learning Objectives Understand techniques to model complex accounting phenomena in an E-R diagram Develop E-R diagrams.
November 2005Anne Asserson Research Administration Department,UiB How can you assess your organisation through a CRIS? The Norwegian national CRIS, FRIDA,
Relational Database. Database Management System (DBMS)
© Keith G Jeffery & Anne AssersonCERIF Course: Data Model CERIF COURSE Session4: DataModel 2 Keith G Jeffery, Director, IT CLRC
Lecture 4 Conceptual Data Modeling. Objectives Define terms related to entity relationship modeling, including entity, entity instance, attribute, relationship,
3 & 4 1 Chapters 3 and 4 Drawing ERDs October 16, 2006 Week 3.
1 A Demo of Logical Database Design. 2 Aim of the demo To develop an understanding of the logical view of data and the importance of the relational model.
Anne AssersonCRIS seminar, Brussels, September CERIF CRIS seminar, Brussels September 2003 Anne Asserson, University of Bergen.
© euroCRIS/Keith G Jeffery 1 euroCRIS and e-Infrastructure Keith G Jeffery President, euroCRIS Premium Members.
+ Information Systems and Databases 2.2 Organisation.
3 1 Chapter 3 The Relational Database Model Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 3 The Relational Database Model.
Keith G Jeffery Director, IT What does a CRIS add to Open Access Publications ?
Analyzing Systems Using Data Dictionaries Systems Analysis and Design, 8e Kendall & Kendall 8.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
* Database is a group of related objects * Objects can be Tables, Forms, Queries or Reports * All data reside in Tables * A Row in a Table is a record.
DBMS ER model-2 Week 6-7.
Analysis of Use Cases (and to some extent, standards) - Keith G Jeffery, Rebecca Koskela.
Entity Relationship Diagram (ERD). Objectives Define terms related to entity relationship modeling, including entity, entity instance, attribute, relationship.
3 1 Chapter 3 The Relational Database Model Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel.
Chapter 3 The Relational Database Model. Database Systems, 10th Edition 2 * Relational model * View data logically rather than physically * Table * Structural.
©Keith G Jeffery/ Anne AssersonCRIS: Central Relating Information System CRIS CRIS: Central Relating Information System Keith G Jeffery Director.
ENTITY RELATIONSHIP DIAGRAM. Objectives Define terms related to entity relationship modeling, including entity, entity instances, attribute, relationship.
Geert Van Grootel The need for structured data euroCRIS seminar september 2006 Brussels.
CERIF for research evaluation including REF Keith G Jeffery STFC / euroCRIS 20/04/20111Ready 4 Ref End of Project Workshop
The Relational Database Model
Chapter 4 Relational Model Characteristics
Lecture 2 The Relational Model
Chapter 3 The Relational Database Model
CRIS seminar, Brussels September 2003
The Relational Database Model
Database Systems: Design, Implementation, and Management
Presentation transcript:

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model CERIF COURSE Session3: DataModel 1 Keith G Jeffery, Director, IT CLRC Anne Asserson, University of Bergen

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Structure of Session Full, exchange and metadata models Full model – overview (nutshell) The concept of binary relations, linking relations and recursion The concept of character / language variants The concept of enumerated lists – dictionaries, thesauri, ontologies

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Structure of Session Full, exchange and metadata models Full model – overview (nutshell) The concept of binary relations, linking relations and recursion The concept of character / language variants The concept of enumerated lists – dictionaries, thesauri, ontologies

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Full, exchange and metadata models Metadata Model is a subset of Exchange Model is a subset of Full Model Full Model is intersection of existing CRISs excluding uncommon variants

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Structure of Session Full, exchange and metadata models Full model – overview (nutshell) The concept of binary relations, linking relations and recursion The concept of character / language variants The concept of enumerated lists – dictionaries, thesauri, ontologies

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model CERIF2000 Data model –Extended relational model –Linking relations with attributes (roles and time stamp) –3 base entities Person, Organisation, Project –12 secondary base entities (linked to base entities) –36 Look up tables (to ensure data quality) –39 Link tables (flexibility) –all text fields have multiple language fields –maximum representativity with minimum complexity

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model CERIF2000 in a Nutshell

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNIT SkillsCV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme Event Classification Prize/Award PERSON

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Three Primary Entities Concepts: (1) entities that reflect main ‘views of entry’ into CRISs (2) entities with no direct functional dependency on each other (3) entities that can refer to themselves (recursion) (4) entities linked in pairs by ‘linking relations’ (5) ‘linking relations’ represent temporally-bound roles (6) ‘linking relations’ have primary key of each entity, role, date/time start, date/time end and any other constraints

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Linking Relations As an Example: PERSON-ORGUNIT Concepts: (1) May have many instances of the relationship for each instance of PERSON and ORGUNIT due to role and temporal bounding (2) Role: the purpose of the relationship e.g. employee | head | …. (3) Temporal: the use of and defines the duration of this relationship Analagous for PROJECT_ORGUNIT and PERSON_PROJECT Person-Orgunit

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Primary Base Entity: ORGUNIT Concepts: (1) ORGUNIT may have an organisationally subordinate relationship to another ORGUNIT e.g. a Group within a Department (2) ORGUNIT may have a symbiotic relationship to another ORGUNIT e.g. two Groups that have a cooperation agreement (3) ORGUNIT may have a financial relationship to another ORGUNIT e.g. customer - contractor Orgunit-Orgunit

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Primary Base Entity: PROJECT Concepts: (1) PROJECT may have an organisationally subordinate relationship to another PROJECT e.g. a sub-Project (2) PROJECT may have a symbiotic relationship to another PROJECT e.g. two Projects that cooperate by agreement (3) PROJECT may have a temporal relationship to another PROJECT e.g. one project follows on from another

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Primary Base Entity: PERSON Concepts: (1) PERSON may have a socially subordinate relationship to another PERSON e.g. a child of a parent (2) PERSON may have a symbiotic relationship to another PERSON e.g. two researchers that cooperate by agreement (3) PERSON may have a temporal relationship to PERSON e.g. a lecturer (dates) becomes a reader (dates)

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Funding Programme Concepts: (1) Funding Programme is related to (a) ORGUNIT and / or (b) PROJECT (2) A Person is only funded via (a) ORGUNIT and / or (b) PROJECT (3) any other entities are only funded via (a) ORGUNIT and / or (b) PROJECT FUNDING PROGRAMME Secondary Base Entities:

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Contact Secondary Base Entities: example: CONTACT Concepts: (1) all contacts in one place - no replication, no update problems (2) >1 contact dependent on role e.g. private address|work address (3) the PROJECT contact is usually the project leader: a PERSON (4) the ORGUNIT contact is usually the head: a PERSON (5) but may have a generic address e.g. project URI | Orgunit Analagous for Publication, Product, Patent, Event, Prize/Award....

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON Result_Publication Secondary Base Entities: example: RESULT_PUBLICATION Concepts: (1) temporally-bound role linking relations (2) >1 linking relation : Result_Publication and other entities (3) PERSON role may be author, co-author, editor, reviewer…. (4) ORGUNIT role may be publisher, IPR or copyright owner.. (5) PROJECT role may be the source of the idea

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECT ORGUNIT PERSON Result_Publication Can Express: (where DT-date/time) Person A (DT1 - DT2) (is author of) Publication X Orgunit O (DT1 - DT2) (is owner of IPR in) Publication X Person A (DT1 - DT2) (is employee of ) Orgunit O Person A (DT1 - DT2) (is project leader of) Project P Person A (DT1-DT2) (is member of) Orgunit M Person A (DT1-DT2) (is member of) Orgunit N Orgunit M (DT1-DT2) (is part of) Orgunit O Orgunit N (DT1-DT2) (is part of) Orgunit O Secondary Base Entities: example: RESULT_PUBLICATION

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON SkillsCV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme Event PERSON Links Prize/Award

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON SkillsCV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme EventPrize/Award PROJECT Links

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON SkillsCV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme Event ORGUNIT Links Prize/Award

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON SkillsCV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme Event Classification Prize/Award

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model PROJECTORGUNITPERSON SkillsCV General Facility Particular Equipment Contact Results Publication Results Patent Results Product Service Funding Programme Event Classification The Whole Thing Prize/Award

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model End of CERIF2000 in a Nutshell

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Structure of Session Full, exchange and metadata models Full model – overview (nutshell) The concept of binary relations, linking relations and recursion The concept of character / language variants The concept of enumerated lists – dictionaries, thesauri, ontologies

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations The Problem Wish to link flexibly –An instance in an entity to a related instance in another entity (relationship) –An instance in an entity to another instance in the same entity (recursion) Examples –Person Project e.g. x is leader of z –Person Person e.g. x is boss of y

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Relationship Usual Relation ProjectProject PersonPerson PersonPerson PROJECT PERSON PK FK

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Relationship Problem Supports only 1 (Project) to n (Persons) i.e. the persons on any 1 project, with all their attributes (dependencies) In many cases need to indicate that –The same person works on several projects –In different roles (e.g. leader, programmer) –At different (or the same) time periods i.e. 1 (Person) to n (Projects)

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Relationship Binary Relation ProjectProject PersonPerson PROJECT PERSON ProjectProject PersonPerson nm

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Relationship Binary Relation ProjectProject PersonPerson PROJECT PERSON ProjectProject PersonPerson RoleRole S t a rt D a t e EndDateEndDate In practice usually have more attributes than Project / Person

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Recursion Usual Relation PK &FK PersonPerson PERSON PersonPerson Actually works like this PERSON

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Recursion Binary Relation PersonPerson PERSON PersonPerson PersonPerson

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Recursion Binary Relation PersonPerson PERSON PersonPerson PersonPerson How the tuples from Person are represented in the binary relation

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Recursion Binary Relation PersonPerson PERSON PersonPerson RoleRole S t a rt D a t e EndDateEndDate PersonPerson In practice usually have more attributes than Person / Person

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Binary Relations Binary Relation Flexible Allows n : m With added attributes e.g. role, date/time Thus permitting –Conditional relationships –Temporal relationships –i.e. rich semantics

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Structure of Session Full, exchange and metadata models Full model – overview (nutshell) The concept of binary relations, linking relations and recursion The concept of character / language variants The concept of enumerated lists – dictionaries, thesauri, ontologies

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Character / Language Variants Character sets Character sets –Not only ‘Latin-1’ (need also to handle Greek, Arabic, Chinese…) –Can use escape codes technique but only works in linear data streams –Better to use a rich code that can handle any character from any language (including mathematics, financial currencies) as an atomic item - Unicode –But it requires more storage

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Character / Language Variants Language Language CERIF has many text fields Each field may exist in multiple languages For retrieval or update need to know the language (for text- matching) So have within the logical record multiple sub-records differentiated by language for each text field Example: Project.Abstract will usually exist in (US) English and original language and maybe language of country/region where stored

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Structure of Session Full, exchange and metadata models Full model – overview (nutshell) The concept of binary relations, linking relations and recursion The concept of character / language variants The concept of enumerated lists – dictionaries, thesauri, ontologies

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Purpose –Higher quality data: data validation –More accurate retrieval: query keywords limited and stored words (for any attribute) limited –Classification – allowing grouping and ranking by value of attribute

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Enumerated List Example: Country Code There is an ISO standard list of valid 2- character and 3-character country codes On input can validate country code is from this list (commonly with a pull-down) If changes in countries, update the list in one place and whole system reconfigured

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Dictionaries Example: meaning of a word (term) –Used in ensuring correct use of a value in an attribute –For explanation of result output Example: multilingual –Used in multilingual query (query in language 1 and retrieve from records stored in languages 2….n) –Used in result output – translate (crudely) to single language as required

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Thesauri Provide the structural relationships of words (terms) –Synonym (different word same meaning) –Homonym (same word different meaning) –Antonym (word with opposite meaning) –Super-term (a word whose meaning includes the word being used e.g. person includes {student|worker | ….} –Sub-term (a word whose meaning is included in a Super-term)

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Ontologies Ontology: philosophical study of existence and nature of reality In practice a resource of terms, their definitions and their logical inter-relationships E.g. For a publication to exist it is necessary to have a title, at least 1 author Publication  [  title AND  >=1 author]

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Ontologies Domain Ontology: Ontology covering a domain (subject area of interest) Example Publication Publication  [  title] AND  author] Collection  [  title +  >1 author +  editor] If Publication has title, > 1 author and editor it is a collection Publication is_part_of Collection Collection is_a_kind_of Publication

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Ontologies Domain Ontologies in IT A representation in first order logic allowing –Facts to be expressed –Relationships to be expressed –Constraints to be expressed –New facts and relationships to be deduced or induced

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Enumerated Lists, Dictionaries, Thesauri, Ontologies Ontologies Used –Data validation on input –Clarification and improvement of a query –Resolving heterogeneity of terms to homogeneity –Expanding super-terms to subterms and vice- versa conditionally –Deducing or inducing new facts and relationships from stored facts and relationships

© Keith G Jeffery & Anne AssersonCERIF Course: Data Model Conclusion CERIF is a data model with ‘levels’ –Primary base entities e.g. Person –Secondary base entities e.g. Result_Publication –Language-base entities e.g. Abstract –Lookup Tables e.g. Role of Person –Linking Relations e.g. Project Person