The DRIVER initiative for networking repositories Wolfram Horstmann Universität Bielefeld.

Slides:



Advertisements
Similar presentations
DRIV(ER)ing Research Infrastructures Yannis Ioannidis University of Athens, Hellas 1st DRIVER Summit: Towards a Confederation of Digital Repositories,
Advertisements

The REPOX system Nuno Freire -
1 L U N D U N I V E R S I T Y Integrating Open Access Journals in Library Services & Assisting Authors in choosing publishing channels 4th EBIB Conference.
WORKSHOP ON CRIS, CERIF AND INSTITUTIONAL REPOSITORIES, Rome, 10-11/5/2010 Interoperability Challenges and Approaches.
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
The SDMX Registry Model April 2, 2009 Arofan Gregory Open Data Foundation.
PUMA & MetaPub Open Access to Italian CNR Repositories in the Perspective of the European Digital Repository Infrastructure GL9 - NINTH INTERNATIONAL CONFERENCE.
The New Improved OpenDOAR Directory of OA Repositories Peter Millington SHERPA Technical Development Officer University of Nottingham, England.
Reshaping Preserv 2 from a Life(cycle) perspective Steve Hitchcock and Dave Tarrant Preserv 2 Project School of Electronics and Computer Science (ECS),
IST Humboldt University Berlin, Germany – Computer and Media Service – Electronic Publishing Group Birgit Matthaei, 4th Sept. 2003, Bath,
OA-Forum 1 st Workshop: Summing up & way forward Leona Carpenter (UKOLN) with Donatella Castelli (IEI-CNR) & Susanne Dobratz (HUB) Open Archives Forum.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
The DRIVER Infrastructure (Digital Repository Infrastructure Vision for European Research) Paolo Manghi ISTI - National Research Council, Italy.
LIBER Annual Conference, 2008, Istanbul 1 LIBER 37th Annual Conference, Istanbul, 3 July 2008 DRIVER: Building a sustainable infrastructure of (European)
The DRIVER initiative for networking repositories Wolfram Horstmann Universität Bielefeld.
1 This work is licensed under a Creative Commons License Attribution Non-commercial ShareAlike 2.0Creative Commons License DRIVER and COAR: from infrastructure.
The IR on the International Stage Mary Robinson SHERPA, University of Nottingham Embedding Repositories event, University of Lincoln,
BELIEF-EELA e-Infrastructures Conference Rio De Janeiro, Brazil June 25-28, 2007 The DRIVER Project D igital R epositories I nfrastructure V ision for.
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
DRIVER Institutional repositories and CRIS systems – the role of DRIVERs infrastructure, concepts and organisation 1 Nordbib Workshop 2008 Dale Peters,
DRIVER Digital Repository Infrastructure Vision of European Research Guidelines for Content Providers Presented by Martin Feijen, SURF [NL]
The DRIVER Project: Building a European Repository Network Library Science Talks series Geneva/Bern, 7/8 December 2008 Rosemary Russell UKOLN, University.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
Curating Research: problems and policy Dale Peters Scientific Technical Manager DRIVER II.
1 Building scientific Virtual Research Environments in D4Science Paul Polydoras University of Athens, Greece.
DRIVER Step One towards a Pan-European Digital Repository Infrastructure Norbert Lossau Bielefeld University, Germany Scientific coordinator of the Project.
Introduction to the Cooperation between CAS and DRIVER National Science Library,CAS Jianxia Ma Xiaolin Zhang Zhongming Zhu DRIVER Confederation.
9 th International Bielefeld Conference, 3-5 February 2009 The impact of DRIVER on the repository community Sophia Jones.
The DRIVER Project D igital R epositories I nfrastructure V ision for E uropean R esearch
1 NECOBELAC Project WORK PACKAGE 3 Cross-national advocacy infrastructure.
The DART-Europe E-theses Portal Martin Moyle Digital Curation Manager UCL Library Services, UK ETD 2009, University of Pittsburgh, June.
The metadata challenge for libraries: a view from Europe Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Pure Silver Reusing and Repurposing Bibliographic Data in a Current Research Information System and Institutional Repository 15 September.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
Linking Repositories Scoping Study Key Perspectives Ltd University of Hull SHERPA University of Southampton.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
D-Net Technology Paolo Manghi Istituto di Scienza e Tecnologia dellInformazione (ISTI) Italian National Research Council (CNR)
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
Access to Knowledge; New roles for universities and libraries Leo Waaijers Disciple of Eve eIFL Seminar OPEN ACCESS: NEW MODELS FOR SCHOLARLY COMMUNICATION.
Access to Knowledge; New roles for universities and libraries Leo Waaijers Disciple of Eve eIFL Seminar OPEN ACCESS: EXPLORING SCHOLARLY COMMUNICATION.
Collaborative Open Access Projects: Collaborative promotion of research outputs Iryna Kuchma, eIFL Open Access program manager, eIFL.net Presented at Open.
DRIVER Providing value-added services on top of Open Access institutional repositories Dr Dale Peters Scientific Technical Manager : DRIVER SUB Goettingen.
31242/32549 Advanced Internet Programming Advanced Java Programming
ICT 2010: "Global Information Structures for Science & Cultural heritage: The Interoperability Challenge" Networking Session Coordination Action on Digital.
11th euroCRIS Strategic Seminar Brussel, Sep 9 – Discovery Metadata Friedrich Summann COAR / Bielefeld University Library.
The DRIVER initiative for networking repositories Wolfram Horstmann Universität Bielefeld.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
1 Digital Libraries and Evidence in the Developing World Context Dr. Jon Ferguson Senior Health Database Scientist IMMPACT Project University of Aberdeen.
Building Repository Networks with DRIVER Wolfram Horstmann Universität Bielefeld.
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Supporting further and higher education The UK FAIR Programme: OAI in context Chris Awre OAI3, CERN, February 2004.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
Themes Architecture Content Metadata Interoperability Standards Knowledge Organisation Systems Use and Users Legal and Economic Issues The Future.
Building a Network of European Scientific Repositories Wolfram Horstmann Universität Bielefeld.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
OWL Representing Information Using the Web Ontology Language.
Outline Pursue Interoperability: Digital Libraries
SCALABLE OPEN ACCESS Hussein Suleman
DIGITAL LIBRARY.
DRIVER Digital Repository Infrastructure Vision for European Research
NSDL Data Repository (NDR)
Session 2: Metadata and Catalogues
Presentation transcript:

The DRIVER initiative for networking repositories Wolfram Horstmann Universität Bielefeld

DRIVER motivation Scholarly communication changes towards distributed provision of text, data and services Repositories are thought as a saviour in this development building such a distributed system An infrastructure supporting distributed repositories and services is needed (and reactions)

Question today Is an overarching infrastructure bridging between distributed text-data and primary/secondary data possible? DRIVER has adressed many problems and found many answers in the domain of distributed text repositories But we dont know yet, whether or not these are transferable to the data domain

Some observations on data Data landscape very diverse Formats differ widely – unlike text publications Descriptions are often highly subject-specific Some have special provenance (e.g. vendor software) Some require special rendering, education, caution … Data require disciplinary support Better managed by researchers than service providers Still, data interoperability acknowledged Double effort: many data are lost to re-use/remix Good practice in research, also WRT publications Transparency, Falsifiability, testability …

Some observations on repositories They represent a shift towards … open internet-exposure as opposed to closed database (graveyards) content orientation as opposed to mere technical orientation (web-servers) distributed systems centralized structures not immediateley required nowadays

Everybody can be a publisher Common description standards e.g. Dublin Core Metadata Initiative Many subject-specific standards Common transfer protocols e.g. OAI-PMH, but also FTP, XML-RPC, WS, etc. Searchability is possible! Still: many data are lost to re-use/remix Closed: too sensible, weakly described, unimportant (???) Missing service frameworks / infrastructures Problems: Data and service interoperability Solution: Infrastructure Repositories can solve access problem

What infrastructures are: DRIVER terms Not an infrastructure Single repository Single application for search and retrieval (e.g. BASE) Only local operation Backwards causation on repositories is missing Maybe an infrastructure Distributed repository landscape as a whole As a capacity for emergent properties, e.g. quality and quantity incentive for data population Nurturing development of service providers Definitely an infrastructure Many service providers in one organisational and technical context (e.g. run-time environment) Enabling re-use and remix of data and services

DRIVER Objectives Organisational structure for repositories e.g. the Confederation Improving quality and standards in local rep. e.g. validation procedures Building a distributed runtime system e.g. service and data sharing Target Groups Repository Managers Service Providers Information System Executives

The DRIVER approach is incremental Start with publication metadata Existing distributed system, somehow connected Considerable homogeneity and formats: OAI-PMH Extend geographical coverage From 5 countries, to 10, to 27, to ??? Extend towards other contents From publication metadata to enhanced publications, i.e. representations of texts + data Learn about subject specificity Data bring in disciplinary requirements

10 The DRIVER Initiative DRIVER-I 6/2006 – 11/2007 Organisational Models and Technical Test-Bed DRIVER-II 12/2007 – 11/2009 Running Organisation and Production Infrastructure DRIVER-Confederation 2010ff Operations Office and Technical Deployment NB: DRIVER is not an authoritative body, it is a liberal bottom-up initiative of stakeholders

DRIVER partners and related projects Networking, Support, Policy, Studies Göttingen, Nottingham, SURF, Genth, Ljubiljana, Minho, Copenhagen Technical development and deployment Athens, Bielefeld, Pisa, Warsaw Partners make links to many other things OA-services: Sherpa-ROMEO, OpenDOAR, BASE… Projects: Europeana, PEER, DELOS, DL.org, D4Science, PARSE-Insight, NESTOR… Orgs: DINI, JISC, LIBER, SPARC, KE … Platforms: DSPACE/FEDORA/OPUS/ePrints

Some Results: Studies

Some Results: A Portal

Some Results: A Search

Some Results: Repository Registration

Some Results: Guidelines Build on knowledge from past & current IR projects (EU) 26 actively involved contributors (experts and repository managers) from 8 countries. Practical answers on how to: Improve full-text access Standardize metadata quality Create a reliable infrastructure for permanent identification, resolution, traceability and storage Resolve semantic and classification issues

Some Results: Support structures

Some Results: Repositories 185+ harvested repositories 21 countries 856,264+ documents

Some Results: Service-Oriented-Arch. 9 hosting nodes 25+ Functionality typologies (services) 36 service Instances 3 applications: DRIVER Main, Belgium, Spain-Recolecta

20 Some Results: Runtime-System & Hosting Enabling Layer Data Layer EU Open Access Repositories Functionality Layer Administrators End users Advanced User Interfaces National portals Project Applications

Some Results: A software Meant for large service providers only!

22 Current Work: DRIVER-II Networking Confederation with who-is-who advisory board Outreach: LIBER, SPARC, US, JAPAN etc… Consolidation DRIVER-I Services packaged and performing in production quality Enhancement DRIVER-I Services Improved indexing and data aggregation functionalities DRIVER-II Services Enhanced publication management and functionality

Outlook: Enhanced Publications

Based on OAI-ORE

Lessons learnt Distributed data infrastructure requires links between organisational and technical concepts Data specialists, computer scientists, service providers Guidelines / content policies as a glue In distributed data provision, quality and access measures are the most expensive tasks Distributed service operation (not data provision) can be solved but asks novel questions (SLAs) Infrastructure is a very tough concept to get across and eventually forms a complex system Simplification makes it weaker, e.g. re-use is restricted

Summary DRIVER tackles the data infrastructure challenge from the text-repository side (mostly OAI-PMH) DRIVER handshakes with primary & secondary data through enhanced publications DRIVER isnt only a project but a forum for information specialists Products include: Studies, Infrastructure run-time- system in production, software, support … DRIVER has adressed many problems for data and service interoperability and found solutions What are the required steps to support data?

Thanks