The DRIVER Project Paolo Manghi ISTI - National Research Council, Italy.

Slides:



Advertisements
Similar presentations
DRIV(ER)ing Research Infrastructures Yannis Ioannidis University of Athens, Hellas 1st DRIVER Summit: Towards a Confederation of Digital Repositories,
Advertisements

PUMA & MetaPub Open Access to Italian CNR Repositories in the Perspective of the European Digital Repository Infrastructure GL9 - NINTH INTERNATIONAL CONFERENCE.
The DRIVER Infrastructure (Digital Repository Infrastructure Vision for European Research) Paolo Manghi ISTI - National Research Council, Italy.
BELIEF-EELA e-Infrastructures Conference Rio De Janeiro, Brazil June 25-28, 2007 The DRIVER Project D igital R epositories I nfrastructure V ision for.
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
DRIVER Step One towards a Pan-European Digital Repository Infrastructure Norbert Lossau Bielefeld University, Germany Scientific coordinator of the Project.
9 th International Bielefeld Conference, 3-5 February 2009 The impact of DRIVER on the repository community Sophia Jones.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
D-Net Technology Paolo Manghi Istituto di Scienza e Tecnologia dellInformazione (ISTI) Italian National Research Council (CNR)
DRIVER Providing value-added services on top of Open Access institutional repositories Dr Dale Peters Scientific Technical Manager : DRIVER SUB Goettingen.
Near East Plant Protection Network for Regional Cooperation & Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview on.
Alliance Meeting The Hague (NL), 30/31-MAY-2007 Digital Repository Infrastructure Vision for European Research ‘Alliance’ meeting.
Networking European Digital Repositories. What to Network?
DRIVER Step #1 towards a Pan-European Digital Repository Infrastructure Yannis Ioannidis University of Athens, Hellas IST 2007 Networking Session: Future.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
1 The Australian Partnership for Sustainable Repositories Margaret Henty Digital Futures Industry Briefing November 8, 2006.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Natalia Manola Department of Informatics & Telecommunications University of Athens, Hellas 4th UNICA Scholarly Communication Seminar May 2008, Charles.
Building Repository Networks with DRIVER Wolfram Horstmann Universität Bielefeld.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Agricultural Biotechnology Network for Regional Collaboration and Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
WP5 Digital Business Ecosystem Alessandra Benvenuti, INSIEL SpA (Friuli Venezia Giulia Region) ADC Final Conference Venice, March 13 th 2012.
SYNAT - the Polish National Research Content Infrastructure Wojtek Sylwestrzak, ICM Tomasz Rosiek, ICM Tomasz Krassowski, ICM Tartu, Estonia June 27, 2012.
Architecture domain DL.org Autumn School – Athens, 3-8 October 2010 Leonardo Candela 6 th October 2010.
The OpenAIRE Project Open Access Infrastructure for Research in Europe Stefania Biagioni, Donatella Castelli, Paolo Manghi CNR - ISTI GL11 - Library of.
Open Access to Grey Literature on e-Infrastructures: The BELIEF-II Project Digital Library Stefania Biagioni, Donatella Castelli, Franco Zoppi CNR-ISTI.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
Donatella Castelli CNR-ISTI
Building a Network of European Scientific Repositories Wolfram Horstmann Universität Bielefeld.
© 2005 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice The China Digital Museum Project.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
A centre of expertise in digital information management RDN, e-Prints UK and NOF- Digitise: a (very) small sample of UK OAI activity Andy.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CYCLADES IST CYCLADES: A Personalised Collaborative Digital Library Environment Umberto Straccia I.S.T.I. - C.N.R. Pisa (ITALY)
Current Research Information Systems in Greece Dr Nikos Houssos National Documentation Centre (EKT) / National Hellenic Research Foundation (NHRF)‏ Dr.
ON-line SERVICES based on DIGITAL DOCUMENTS Prof. Doina Banciu ROCS Bucharest, 2008.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
System/SDWG Update Management Council Face-to-Face Flagstaff, AZ August 22-23, 2011 Sean Hardman.
Harokopio University of Athens – Department of Informatics and Telematics HAROKOPIOUNIVERSITY A Distributed Architecture for Building Federated Digital.
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
Brian Matthews, euroCRIS, 18/09/03 CRIS architecture to support an ERA Brian Matthews.
Pasquale Pagano CNR - ISTI The OpenAIRE Infrastructure EC Policy on Open Access and the OpenAIRE Initiative EGI Scientific Publications Repository Workshop.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
Open Science and Research – Services for Research Data Management © 2014 OKM ATT 2014–2017 initiative Licenced under.
CLARIN EUDAT2020 uptake plan Dieter Van Uytvanck CLARIN ERIC EUDAT User Forum, Rome.
Joint Information Systems Committee Repositories Support Project Summer School 2008 Amber Thomas, JISC.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Pasquale Pagano CNR-ISTI The OpenAIRE Infrastructure: on measuring research impact Evolving EGI Workshop – 29 January 2013 Paolo Manghi CNR - ISTI.
Joseph JaJa, Mike Smorul, and Sangchul Song
GSAF Grid Storage Access Framework
VI-SEEM Data Repository
DRIVER Digital Repository Infrastructure Vision for European Research
Malte Dreyer – Matthias Razum
Presentation transcript:

The DRIVER Project Paolo Manghi ISTI - National Research Council, Italy

EC: the “Knowledge Infrastructure” Vision Build and maintain environments where content and functionality resources can be openly shared to serve the needs of different user communities e-Infrastructures is the term commonly used to refer to such environments, including both people and technology

DRIVER Project: the Mission Focus: EC Open Access mandates for research output Networking: developing standards and organisational and business models for a sustainable network of European strategic partners in the Open Access community Technical: building an infrastructure for aggregating Open Access research publications from Digital repositories in Europe and making it available for elaboration to any interested community

4 DRIVER Project: the Objectives Develop an environment for integrating existing national, regional, or thematic repositories Create a production-quality European digital repository infrastructure Prepare the future expansion and upgrade of the digital repository infrastructure across Europe Identify and promote the use of a relevant set of standards Raise Open Access awareness among user communities Slide from Prof. Yannis Ioannids, NKUA, Greece

D-NET Technology The DRIVER Project solution 5

Digital Repository Internet OAI-PMH metadata Repository Dspace Greenstone ePrints OpenDLib …. UI Query Ingest

DRIVER’s technical goals Construction and maintenance of the European Information Space for Open Access research publications Arbitrary number of applications consuming the Information Space User portals Consuming applications Key-feature: evolving requirements

Aggregative Digital Library Systems (ADLS) ADLSs Aggregation system: maintaining and populating an Information Space by aggregating content from a collection of OAI-PMH Repositories Custom application: providing community-specific functionalities via Web User Interfaces Well known examples BASE (Germany) DAREnet (Netherlands) OAIster (USA) Others…

Typical ADLS architecture One institution, one community BASE (Germany) DAREnet (Netherlands), OAIster (USA) Others… OAI-PMH … Aggregator Index UI … Store … Search Portal Aggregation system Info Space

The DRIVER project goals Many institutions, many communities Construction of the European Information Space for Open Access research publications Arbitrary number of repositories (data providers) Data curation Arbitrary number of applications consuming the Information Space Community portals Community Information Spaces (comm. metadata format) Etc. Highly evolving requirements!

Typical ADLSs Vs DRIVER requirements OAI-PMH … Aggregator Index UI … Applications Aggregation system Store … Search OAI-PMH New Institution Site Manual maintenance cost New Fun Info Space New UI Index Store New Info Space Aggregator OAI-PMH

Typical ADLSs Vs DRIVER requirements Drawbacks Limited customizability E.g. pre-defined input and target metadata formats High-cost software extensibility E.g, new functionality, new Information Spaces “Manual” repository management Registration, harvesting, curation (XSLT), etc… “Manual” administration for robustness and scalability E.g., store and index replicas Issue: sustainability in the long term 12

DRIVER’s D-Net An Infrastructural approach Service-Oriented Architecture Web Service, service registration, subscription&notification, etc. Distributed running environment Administered by one responsible organization (RO) Used for by participating organizations (POs) to collaboratively build ADLSs ADLSs as service-oriented applications

D-NET’s Service Kit: ADLS components Repositories FS, FTP, NFS Data Sources Web UI Service Web UI Service Recomm. Service Recomm. Service Community Service Community Service User Profile Service User Profile Service Search Service Search Service Data Management Data Management OAI-PMH Service OAI-PMH Service Index Service Index Service Browse Service Browse Service Store Service Store Service OAI-PMH Harvester Service OAI-PMH Harvester Service Information Service Information Service Manager Service Manager Service Authz&Authn Service Authz&Authn Service Collection Service Collection Service Validator Service Validator Service Feature Extraction Service Feature Extraction Service Similarity Service Similarity Service Transformation Service Transformation Service Compound Object Service Compound Object Service Citation Service Citation Service XML Import Service XML Import Service Object Packaging Service Object Packaging Service Repository Man Service Repository Man Service ResultSet Service ResultSet Service End User Functionality Enabling Personalization Service Personalization Service

DRIVER’s D-Net An Infrastructural approach ADLS construction Service compositionality (“LEGO approach”) functionality in “isolation”; e.g. index, storage, aggregation Service customizability e.g., functionality independent from metadata formats Service distribution ADLS components can be distributed over the network Service sharing Hardware and services Application “autonomicity” Services can be orchestrated automatically to accomplish certain tasks 15

ADLSs in D-Net OAI-PMH Repository Index Search Index UI … OAI-PMH … … Enabling Layer Middleware UI Search Index Store Harvester User Profiling … Others Harvester Service Kits Trasformator Store Content Resources Dynamic, distributed Run-time Infrastructure RO PO Repository

Compositionality and customizability Harvester OAI-PMH IndexUISearch IndexUISearchStore Index Transformer Store Index UI Search Metadata Formats IndexUISearch

Sharing of functionality Trasformator Index Search Index UI … Enabling Layer Middleware UI Search Index Trasformator User Profiling … Others Harvester UI Store Dynamic, distributed Run-time Infrastructure Service Kits RO PO OAI-PMH Repository OAI-PMH … … Content Resources Repository

Sharing of content Index Search Index UI … Enabling Layer Middleware UI Search Index Aggregator User Profiling … Others UI Search Store Dynamic, distributed Run-time Infrastructure Service Kits RO PO TrasformatorHarvester OAI-PMH Repository OAI-PMH … … Content Resources Repository

Sharing of content Index Search Index UI … Enabling Layer Middleware UI Search Index Aggregator User Profiling … Others UI Search Index Store Service Kits Dynamic, distributed Run-time Infrastructure RO PO TrasformatorHarvester OAI-PMH Repository OAI-PMH … … Content Resources Repository

D-Net’s benefits Towards software sustainability... Customizability Parametric services and compositionality Openness Designed to integrate new functionality Scalability and Robustness By distribution and replicas Sharing Cost optimization (services and hardware) Autonomic behaviour Reduced maintenance and administration cost Repository management tools GUIs for harvesting and aggregation

D-Net Software Toolkit Software packages Open Source Apache License Release v1.0 (production) and v1.2 (beta) Release 2.0 (beta): Enhanced Publication

D-NET and standards Service Resources are implemented as Web Services and accessed through the corresponding Web Service Interface Parameters calls are enveloped into SOAP messages The Enabling Services are also compatible with REST XML is the lingua-franca for the whole system Resource internal status, i.e. Resource profiles, are represented as XML files conforming to a given schema Profiles are kept into the Information Service, whose underlying engine is an Exist XML engine

D-NET and standards Subscription and Notification Service Any Service can subscribe to events regarding any DRIVER Resource: creation, deletion, and specific action accomplished by a resource The Subscription and Notification mechanism is compliant with the OASIS Standards WS Base Notification 1.3 and WS Topics 1.3 Authorization and Authentication Service offers security contexts to all Resources according to the Access Control Markup Language standard (XACML)

DR IVE R revi ew Wa rsa w Ma y 15t h D-NET standardization Information Service system mediation All relevant resources register their profile into the IS; e.g. Services, collections, indexes, users, etc. Services can access system relevant information through the IS, in a common standard way, with no need to statically know the locations of other Services ResultSet mechanism Standard interfaces and tools for data exchange By reference or value Paging modes, transformation, caching

Example: DRIVER production infrastructure - D-Net’s release v1.2 Enabling Layer Data Layer Functionality Layer Administrators End users Advanced User Interfaces National portals EU Open Access Repositories

DRIVER production infrastructure Current status v1.2 Content About 250+ harvested repositories (more than twice to come) over 33 countries from Europe and beyond 2,200,000+ open access publication metadata records Services Production release: 36 service running instances over 9 nodes located at CNR, UNIBI, ICM and NKUA

D-Net’s uptake European Film Gateway EC project OpenAIRE EC pilot Experimentation of deployment of new infrastructure instances China, India, Portugal, Belgium, Spain, Slovenia Upcoming: Greece and Bulgaria

Technical Team ISTI: Istituto di Scienze e Tecnologie Informatiche, Centro Nazionale delle Ricerche, Pisa, Italy ICM: Interdisciplinary Centre for Mathematical and Computational Modeling, Uniwesytet Warszawski, Poland NKUA: Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Greece UNIBI: Universität Bielefeld, Germany

D-NET’s interoperability 30

Data interoperability Integrating heterogeneous data sources Content (structure and semantics) Access Protocols Tools Mediation Accessing heterogeneous data sources (standard protocols) Normalization of data, using the same exchange formats (XML) Transformation Transforming data of different data models (structure and semantics) into data of other data models Examples: cleaning, enriching 31

Service interoperability Adoption of Standards Web service, XML, Security, etc Internal policies ResultSets, Service Profiles, Discovery mechanisms, etc Best practices Design of “customizable” services Objectives Minimizing D-NET software learning curve Maximizing reusability of D-NET components Facilitaning integration of software 32

Credits Paolo Manghi ISTI - CNR Speaker’s Contact Details DRIVER II Project Supported by European Commission