CRIS&OAR for Research Information Management I.Filozova JINR LIT, University “DUBNA” Dubna, Russia SCHOOL ON JINR/CERN GRID AND ADVANCED INFORMATION SYSTEMS.


1 CRIS&OAR for Research Information Management I.Filozova JINR LIT, University “DUBNA” Dubna, Russia SCHOOL ON JINR/CERN GRID AND ADVANCED INFORMATION SYSTEMS Dubna NOVEMBER 2-6, 2015

2 Acronyms CRIS&OAR CRIS — Current Research Information System OAR — Open Access Repository [http://jds-test3.jinr.ru]

3 Mission of a scientific organization: achieving scientific results and satisfying the needs of the scientific community. Scientific Activity: Search for Available Information; Data Processing & Data Generation; Knowledge Generation; New Knowledge Generation

4 Scientific Activity: Search for Available Information; Data Processing & Data Generation; Knowledge Generation. Publications: printed articles, digital archives, repositories. Tables, Plots, Data Bases, etc. Knowledge is recorded in images and signs of natural and artificial languages.

5 Journal Crisis, end of the '90s: the cost of subscriptions to scientific journals grew 2-3 times faster than the budgets of academic libraries and inflation. Price policy: 1 year cost ≥ 500 $; the average cost of an annual subscription to a chemistry journal ≥ 3000 $; some journals ≥ 10 000 $

6 Journal | Publisher | Year | Price $
Journal of Comp. and Applied Mathematics | Elsevier | 2008 | 4727
Applied Mathematics and Mechanics (6 issues) | Springer | 2016 | 5 606
Applied Physics A | Springer | 2008 | 4989
Journal of Fluid Mechanics | Cambridge Univ. Press | 2008 | 3200
Annals of Physics | Elsevier | 2016 | 3 928
Biochimica & Biophysica Acta | Elsevier | 2012 | 20 930

7 Materials Science & Engineering A, B, C, & R 2015 Volkswagen Golf 1.6 AT new 20 385 $ 3 850 $ Machu Picchu 2008: 17,986 $ 2016: 23 345 $

8 Open Access (OA) to Research
What about copyright? OA does not cancel copyright and does not contradict it.
How is OA realized? Public scientific archives and repositories (Green road); publication in open access journals (Gold road).
Where does the OA idea come from?
1. Budapest Open Access Initiative declaration (http://www.budapestopenaccessinitiative.org/);
2. Berlin Declaration on Open Access to Knowledge in the Sciences and Humanities (http://openaccess.mpg.de/Berlin-Declaration).

9 Open Access Benefits
Scientists and Researchers: expanded readership and accessibility; more citations of publications; scientific impact; growing author recognition and securing of scientific priority.
Organization: management of its digital resources; increased scientific prestige of the organization.
Society: return on investment in research; removal of barriers to information sharing; creation of additional information services for different categories of users.

10 OAI-Protocol for Metadata Harvesting: HTTP (basis), OAI-PMH (superstructure). 2 types of requests: 1. SELECT ALL RECORDS; 2. SELECT RECORDS WHERE. 6 commands: GetRecord, Identify, ListIdentifiers, ListMetadataFormats, ListRecords, ListSets
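The two request types above can be sketched as plain HTTP GET URLs. This is a minimal illustration, not taken from the slides: the endpoint `https://repository.example.org/oai`, the set name `articles`, and the helper `build_oai_request` are all hypothetical; only the six verb names and the `metadataPrefix`, `from`, and `set` arguments come from the OAI-PMH protocol.

```python
from urllib.parse import urlencode

# The six OAI-PMH verbs defined by the protocol.
OAI_VERBS = {
    "GetRecord", "Identify", "ListIdentifiers",
    "ListMetadataFormats", "ListRecords", "ListSets",
}

def build_oai_request(base_url, verb, **arguments):
    """Return the GET URL for one OAI-PMH request (hypothetical helper)."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    params = {"verb": verb}
    for name, value in arguments.items():
        # "from" is a Python keyword, so callers pass it as "from_".
        params[name.rstrip("_")] = value
    return f"{base_url}?{urlencode(params)}"

# Hypothetical endpoint, used only for illustration.
BASE_URL = "https://repository.example.org/oai"

# 1. Select all records (here in the Dublin Core format).
all_records = build_oai_request(BASE_URL, "ListRecords", metadataPrefix="oai_dc")

# 2. Select records where a condition holds (a start date and a set).
selected = build_oai_request(
    BASE_URL, "ListRecords",
    metadataPrefix="oai_dc", from_="2015-01-01", set="articles",
)

print(all_records)
print(selected)
```

The selective form only narrows by datestamp and set membership; any richer filtering would have to happen on the harvester's side after the records are fetched.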

11 Information Model OAI-PMH RESOURCE ↔ ELEMENT {ID_RECORD; RECORDS} RESOURCE IDENTIFIER METADATA SETS Dublin Core User Metadata Set MARC21 RECORDS... MARCXML

12 OAI Repositories over the World (country: number of archives)
USA 693; UK 231; Germany 199; Japan 156; Spain 156; Brazil 136; India 102; China 90; France 87; Canada 81; Ukraine 73; Australia 75
Italy 77; Taiwan 69; Russia 53; Portugal 48; Colombia 47; Sweden 45; S. Africa 40; Malaysia 36; Netherlands 35; Belgium 28; Greece 21
Number of Repositories: 4053. Number of Records: ~39,000,000, according to the Registry of Open Access Repositories (ROAR), http://roar.eprints.org

13 Repository type Open Access Statistics

14 Software to create and manage OARs (software: number of repositories)
DSpace 1579; EPrints 567; Bepress 366; OPUS 72; Invenio 19; Greenstone 22; Fedora 57

15 OAR Example 1

16 OAR Example 2 JINR Document Server ̶ http://jds.jinr.ru/

17 Research Information: Data/Metadata or Information about: Scientists; Project Managers; Ongoing and Completed Projects; Research Departments; Funding Organisations and Programmes; Research Results; Publications; Equipment; their timely Relationships (Semantics)

18 Who needs Research Information?

19 What is a CRIS? Current Research Information System = CRIS … information about People + Organisations + Projects + Funding Programmes + Research Results + … … that means Timeliness, Vitality … driven by a Concept, a Model … incorporated as an Implementation (ICT). An integrated approach towards managing research information

20 CERIF Model: Common European Research Information Format

21 Instance Diagram Person A Publication X OrgUnit O OrgUnit M OrgUnit N Project P member employee Part of owns IPR author Project leader Repository HR System webpages Project Management Finance

22 CERIF Features
(1) data model (data-centric)
(2) allows for a (metadata) representation of research entities, their activities / interconnections (research), and their output (results)
(3) allows for high flexibility with formal (semantic) relationships
(4) enables quality maintenance, archiving, access and interchange of research information
(5) supports knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors, the general public
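The idea of entities connected by formal, typed relationships can be sketched in a few lines. This is only an illustration of the modelling style, not the CERIF schema itself: the class names `Entity` and `Link`, the start date, and the particular pairing of roles with entities are assumptions made for the example (the entity and role labels are borrowed from the instance diagram on slide 21).

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

@dataclass(frozen=True)
class Entity:
    kind: str   # e.g. "Person", "OrgUnit", "Project", "Publication"
    name: str

@dataclass(frozen=True)
class Link:
    source: Entity
    target: Entity
    role: str                      # semantic role of the relationship
    start: Optional[date] = None   # validity period of the relationship
    end: Optional[date] = None

person = Entity("Person", "Person A")
org = Entity("OrgUnit", "OrgUnit O")
project = Entity("Project", "Project P")
publication = Entity("Publication", "Publication X")

# Illustrative links; which role connects which pair is an assumption.
links = [
    Link(person, org, "employee", start=date(2010, 1, 1)),
    Link(person, project, "project leader"),
    Link(person, publication, "author"),
]

def related(entity, kind):
    """Return (role, target name) pairs for links from entity to a given kind."""
    return [(link.role, link.target.name) for link in links
            if link.source == entity and link.target.kind == kind]

print(related(person, "Publication"))
```

Keeping the role and validity dates on the link rather than on the entities is what lets the same person hold several roles over time without duplicating the person record.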

23 CRIS Example 1

24 CRIS Example 2 ИСТИНА ( https://istina.msu.ru/)

25 CRIS Example 3 Personal INformation System JINR

26 PIN

27 CRIS&OAR Challenge Collaboration of researchers, administration and librarians CRIS and OARs should join forces to deliver the best possible services

28 Current Research Information Systems (CRIS) & Open Access Repositories (OAR)
Layers: Strategic Layer; Operational Layer
CRIS: Record the R&D (Research and Development) activity. Cover projects, people (expertise), organizational structure, R&D outputs, events, facilities and equipment. Characteristics: administrative, comprehensive, integrative, person-centric, analytics.
OAR: Collect and preserve the R&D outputs. Set of services for the collaboration members to manage and distribute digital resources. Characteristics: public, file-centric, rights, preservation, distributed paradigm.
Commonalities: Bibliographic Information; Affiliation; Project Information
Management (CRIS): Financial information; Staff information; R&D organisation
Management (OAR): Bibliographic Data; Full-Text Documents; Authoritative Data Resources
Aggregative Approach: integrating with institutional HRM, project and other systems; sharing and re-using resources

29 Need: Curation Processes & Human Responsibilities (Curation View)
Projects: research project manager
People: staff manager
Bibliographic Information: bibliography specialist, librarian, content manager, identity manager
Materials & Equipment: facility manager
Finance: financial officer

30 Normalize as much as possible: Authority Records*
+ Higher-quality, more consistent data
+ Less data input required from end-users
Authority Control: identify objects and concepts uniquely
Authorities Variety: People, Institutes, Grants, Experiments, Projects, Journals, …
Identifiers Variety: DOI, ORCID, ...
Linkages Variety: n:m relations, vertical linkages, horizontal linkages
History Tracking: Predecessors/Successors
*search elements of bibliographic records

31 Authority Control Tool
Source Data: CRIS & OAR systems; bibliographic databases; vocabularies, ontologies, ORCID/AuthorClaim and other author identification systems
Authority Control: 1. Accounting for all name variants; 2. Disambiguation of authoritative data in information search and submission
Result: relevant information about R&D activity; lists of publications; scientific reporting; bibliometrics & scientometrics
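Accounting for name variants can be pictured as a lookup from every known spelling to one authority record. The sketch below is an assumption-laden illustration, not the tool described on the slide: the class `AuthorityFile`, the identifier `AUTHOR-1`, and the author name with its variants are all invented for the example.

```python
import unicodedata

def normalize(name):
    """Fold case, accents, punctuation and extra spaces for matching."""
    text = unicodedata.normalize("NFKD", name)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = "".join(ch if ch.isalnum() else " " for ch in text)
    return " ".join(text.lower().split())

class AuthorityFile:
    """Maps every known name variant to a single authority record."""

    def __init__(self):
        self._index = {}   # normalized variant -> authority id
        self._names = {}   # authority id -> authorized form of the name

    def add(self, authority_id, authorized_name, variants=()):
        self._names[authority_id] = authorized_name
        for form in (authorized_name, *variants):
            self._index[normalize(form)] = authority_id

    def resolve(self, name):
        """Return (authority id, authorized form), or None if unknown."""
        authority_id = self._index.get(normalize(name))
        if authority_id is None:
            return None
        return authority_id, self._names[authority_id]

# Hypothetical author and variants, for illustration only.
authors = AuthorityFile()
authors.add("AUTHOR-1", "Ivanov, Ivan",
            variants=["Ivanov, I.", "I. Ivanov", "Ivanov I"])

print(authors.resolve("IVANOV, I"))
print(authors.resolve("Petrov, P."))
```

A lookup like this only covers the first task, collecting variants. Telling apart two different people who share a spelling needs an external identifier such as ORCID, which is why the slide lists those systems as source data.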

32 JINR CRIS & OAR Systems (by viewpoint)
From file: JDS (JINR Document Server). Open Access Repository of materials concerning the R&D activity. Invenio, ©CERN
From person: PIN (Personal Information System). Staff information: employment profiles, bibliographic archive, projects' information. ©JINR
From event: IDC (Integrated Digital Conferencing System). Scientific activities management: entire lifecycle for conferences, meetings, lectures. Indico, ©CERN

33 JINR Document Server (JDS) JDS was created and is being developed as an institutional repository with the following content: 1. Research and science-related documents: publications co-authored by JINR researchers; archive documents that describe all the essential stages of JINR research activity. 2. Documents providing informational support for scientific and technological research performed at JINR.

34 JDS: Information Services Search and navigation, Creation of the user’s groups, Saving search results, Individual and group bookshelves, Manuscripts deposition, Discussions on the publications, Sending out alerts and messages.

35 Invenio software stack: Unix-like OS (GNU/Linux distributions: Debian, Gentoo, Scientific Linux (RHEL-based), Ubuntu); HTML, CSS, JS; Python 2.7.5+; MySQL; Redis

36 Architecture

37 Trees Collections Subcollections http://jds.jinr.ru

38 Collection Books

39 Information Card of Resource

40 Attachment to Collection

41 Authority Control Realization
Solved by: MARC21 Authorities + Invenio v1.2.1 API
MARC21 authorities: repeatable linking fields (fields 4xx, 5xx); horizontal linking (subfield $w: $w a - predecessor, $w b - successor); vertical linking (subfield $w: $w t - parent); repeatable System Control Number (field 035); repeatable Standard Technical Report Number (field 027)
Module BibAuthority: enriching of bibliographic data with data from authority records; re-indexing of bibliographic records containing links to recently updated authority records; cross-referencing between MARC records ($0 subfields)
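The horizontal and vertical linking described above can be sketched by reading the `$w` and `$0` subfields of 5xx fields. This is a plain-Python illustration and does not use the Invenio or BibAuthority API: the record layout (a dict of tag to fields), the control numbers such as `INSTITUTE-LIT`, and the placeholder predecessor name are assumptions made for the example.

```python
# Meaning of the $w control subfield values listed on the slide.
LINK_TYPES = {"a": "predecessor", "b": "successor", "t": "parent"}

# A hypothetical authority record for an institute: tag -> list of fields,
# where each field is a list of (subfield code, value) pairs.
lit_record = {
    "035": [[("a", "INSTITUTE-LIT")]],
    "110": [[("a", "Laboratory of Information Technologies")]],
    "510": [
        [("w", "t"), ("a", "Joint Institute for Nuclear Research"),
         ("0", "INSTITUTE-JINR")],
        [("w", "a"), ("a", "Earlier laboratory name (placeholder)"),
         ("0", "INSTITUTE-OLD")],
    ],
}

def linked_records(record):
    """Collect (link type, heading, control number) from the 5xx fields."""
    links = []
    for tag, fields in record.items():
        if not tag.startswith("5"):
            continue
        for field in fields:
            subfields = dict(field)
            # Only the first character of $w carries the relationship code.
            link_type = LINK_TYPES.get(subfields.get("w", "")[:1])
            if link_type:
                links.append((link_type, subfields.get("a"), subfields.get("0")))
    return links

for link in linked_records(lit_record):
    print(link)
```

Following the `$0` control number of a "parent" link repeatedly would walk up the organizational tree, while "predecessor" and "successor" links give the history of a unit across renamings.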

42 Collection Authorities http://jds-test3.jinr.ru

43 Collection Institutes. Record JINR

44 Record LIT. Detailed Information

45 Institute →Publication

46 Collection People. Author → Publication

47 Detailed Information about Author Code Collection - MARC tag 980 defines which documents belong to the given collection

48 Experiment → Publication

49 Grant → Author → Publication

50

51 Thesaurus

52 Repository: a place for storing and maintaining any data.
Archive: a collection of information resources plus a classification system (catalog).
Knowledge: the form in which the results of human cognitive activity exist and are systematized.
Knowledge (of a subject): a confident understanding of a subject, with the ability to deal with it and use it to achieve goals.
Missing knowledge: knowledge known to humanity but unknown to a given person at the current moment (for example, a student facing a new subject in the educational program).
Knowledge in the wide sense: a subjective image of reality in the form of concepts and ideas.

53 Knowledge in the narrow sense: the possession of verified information (answers to questions) that makes it possible to solve a problem.
Knowledge in the theory of artificial intelligence (AI) and expert systems: information and inference rules about the world, the properties of objects, and the patterns of processes and phenomena, as well as the rules for using them in decision-making.
New knowledge: information about the existence of objects or their properties, or of real processes and phenomena, that was previously unknown to science and not included in the existing system of human ideas about the world.

54 Open Access (OA) to Research: a mode of scientific communication in which authors exercise their right to publish a work so that anyone can access it from any place and at any time of their choosing.
Open Archives Initiative (OAI): an organization that develops and applies technical interoperability standards for archives to share catalog information (metadata).
Self-archiving: depositing digital documents (metadata + full text) in an OAI-compliant archive.
"Proxy" self-archiving: depositing on behalf of authors who feel personally unable (too busy or technically incapable) to self-archive.
Harvesting: automatic gathering of metadata between repositories.
OAI-PMH: Open Archives Initiative Protocol for Metadata Harvesting.
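On the harvester's side, gathering metadata comes down to parsing the XML that a repository returns. The sketch below works on an embedded sample instead of a live repository, so it makes no network calls: the response content (identifier, title, creator) and the function name `harvest` are invented for illustration, while the three namespace URIs are the ones used by OAI-PMH 2.0 and unqualified Dublin Core.

```python
import xml.etree.ElementTree as ET

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# A made-up ListRecords response with a single record.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:1</identifier>
        <datestamp>2015-11-02</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample title</dc:title>
          <dc:creator>Sample, Author</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

def harvest(xml_text):
    """Extract identifier, titles and creators from a ListRecords response."""
    root = ET.fromstring(xml_text)
    harvested = []
    for record in root.iterfind(".//oai:record", NS):
        harvested.append({
            "identifier": record.findtext("oai:header/oai:identifier",
                                          namespaces=NS),
            "titles": [e.text for e in record.iterfind(".//dc:title", NS)],
            "creators": [e.text for e in record.iterfind(".//dc:creator", NS)],
        })
    return harvested

records = harvest(SAMPLE_RESPONSE)
print(records)
```

A real harvester would also follow the `resumptionToken` that large repositories return to page through long result lists; that step is left out here.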

55 Metadata: structured data which describes the characteristics of a resource ("An Introduction to Metadata", by Chris Taylor, University of Queensland). Metadata is data about data.
Example (Book): Title: Pushkin's Fairy Tales; Date of Publication: 2012; Author: Alexander Pushkin; Editor: Williams Paul; Translator: Elton Oliver, Krup Jacob; Publisher: Bright City
Structure: Type of Resource, Title, Description, Source, Date, Author, Creator, …

56 MARC21 (MAchine-Readable Cataloguing): an international standard for bibliographic data. A MARC bibliographic record consists of three main components: the Leader, the Directory, and the variable fields (http://www.loc.gov/marc/bibliographic/).
00X: Control Fields
01X-09X: Numbers and Code Fields
1XX: Main Entry Fields
20X-24X: Title and Title-Related Fields
25X-28X: Edition, Imprint, Etc. Fields
3XX: Physical Description, Etc. Fields
4XX: Series Statement Fields
5XX: Note Fields
6XX: Subject Access Fields
70X-75X: Added Entry Fields
76X-78X: Linking Entry Fields
80X-83X: Series Added Entry Fields
841-88X: Holdings, Location, Alternate Graphics, Etc. Fields
Example MARC record (labeled: Fields, SubFields, Values):
035 - System Control Number (Repeatable)
100 - Personal Name (Not Repeatable)
245 - Title Statement (Not Repeatable)
700 - Added Entry - Personal Name (Repeatable)
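The tag ranges above translate directly into a lookup table. The following is a small sketch that classifies a three-digit MARC tag by the group it falls in; the function name `field_group` is hypothetical, and the numeric bounds are simply the ranges from the list written out as integers.

```python
# (first tag, last tag, group name), following the ranges listed above.
FIELD_GROUPS = [
    (1, 9, "Control Fields"),
    (10, 99, "Numbers and Code Fields"),
    (100, 199, "Main Entry Fields"),
    (200, 249, "Title and Title-Related Fields"),
    (250, 289, "Edition, Imprint, Etc. Fields"),
    (300, 399, "Physical Description, Etc. Fields"),
    (400, 499, "Series Statement Fields"),
    (500, 599, "Note Fields"),
    (600, 699, "Subject Access Fields"),
    (700, 759, "Added Entry Fields"),
    (760, 789, "Linking Entry Fields"),
    (800, 839, "Series Added Entry Fields"),
    (841, 889, "Holdings, Location, Alternate Graphics, Etc. Fields"),
]

def field_group(tag):
    """Return the group name for a MARC tag such as "245", or None."""
    number = int(tag)
    for first, last, name in FIELD_GROUPS:
        if first <= number <= last:
            return name
    return None

for tag in ("035", "100", "245", "700"):
    print(tag, "->", field_group(tag))
```

Tags outside the listed ranges, such as locally defined 9XX fields like the 980 collection tag used in JDS, return `None` in this sketch.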

57 XML: EXtensible Markup Language, a metalanguage (a language for describing other languages) and a universal format for structured documents and data, derived from SGML (Standard Generalized Markup Language). http://www.w3.org/XML/ Example: a document with a prolog and a root element holding two products (Product #1, 10.00; Product #2, 20.00), annotated with the labels Prolog, Root Element, Opening Tag, Closing Tag, Element Content.
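A document of the kind described on this slide might look like the one embedded below. The element names `catalog`, `product`, `name`, and `price` are assumptions chosen for the illustration; only the two product names and prices come from the slide.

```python
import xml.etree.ElementTree as ET

# The first line is the prolog; <catalog> is the root element.
DOCUMENT = """<?xml version="1.0"?>
<catalog>
  <product><name>Product #1</name><price>10.00</price></product>
  <product><name>Product #2</name><price>20.00</price></product>
</catalog>"""

root = ET.fromstring(DOCUMENT)

# Each <name>...</name> pair is an opening tag, element content, closing tag.
products = [(p.findtext("name"), float(p.findtext("price")))
            for p in root.findall("product")]

print(root.tag)
print(products)
```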

58 MARCXML: a framework for working with MARC data in an XML environment (http://www.loc.gov/standards/marcxml/). Example MARCXML record: tag datafield = MARC field; tag subfield = MARC subfield; element content = MARC subfield values
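The mapping of fields to `datafield` elements and subfields to `subfield` elements can be shown by building a small record. This is a hedged sketch: the helper `datafield`, the `marc` prefix, and the collection value `BOOK` are assumptions, and the title and author are reused from the book example on slide 55; the namespace URI is the MARC21 slim schema namespace.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"
ET.register_namespace("marc", MARC_NS)

def datafield(parent, tag, subfields, ind1=" ", ind2=" "):
    """Append one MARC field with its subfields to a record element."""
    field = ET.SubElement(parent, f"{{{MARC_NS}}}datafield",
                          {"tag": tag, "ind1": ind1, "ind2": ind2})
    for code, value in subfields:
        sub = ET.SubElement(field, f"{{{MARC_NS}}}subfield", {"code": code})
        sub.text = value   # element content = MARC subfield value
    return field

record = ET.Element(f"{{{MARC_NS}}}record")
datafield(record, "100", [("a", "Pushkin, Alexander")], ind1="1")
datafield(record, "245", [("a", "Pushkin's Fairy Tales")], ind1="1", ind2="0")
datafield(record, "980", [("a", "BOOK")])   # hypothetical collection value

xml_text = ET.tostring(record, encoding="unicode")

# Read the title back: field 245, subfield a.
title_path = (f"{{{MARC_NS}}}datafield[@tag='245']/"
              f"{{{MARC_NS}}}subfield[@code='a']")
titles = [sf.text for sf in record.iterfind(title_path)]

print(xml_text)
print(titles)
```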

59

60 Open Access Idea Digital Libraries Tools Scientific and Educational Activity Institutional Repositories in the form of Open Access I. Digital Collection. Collection and preservation of intellectual output of organization. II. Set of services for the collaboration members in order to manage and distribute digital resources. Institutional Repository

61 CERIF: Common European Research Information Format 1) CERIF is an EU Recommendation to Member States (http://cordis.europa.eu/cerif/) 2) The European Commission (EC) has authorised euroCRIS to maintain and develop CERIF and its usage (http://www.eurocris.org/cerif/cerif-releases/)

62

