IWIR-CRIS '06: Data retrieval in PURE. Data retrieval in the 4-year-old PURE CRIS project at 9 universities.



atira, Niels Jernes Vej 10, DK-9220 Aalborg

2 Agenda
■ Overview
■ Retrieval
    - Validated manual data gathering
    - Dynamic integration to local back-end systems
    - Aggregation, enrichment and import of historic data
    - Experiments with automated imports of historic data
■ Exposure
    - Two web services
    - OAI
    - Z39.50
    - Reports
    - Portal framework
■ Archiving
■ Near future

3 Overview
■ Brief overview
■ … in order to discuss ingestion, integration, conversion and import in a specific context

4 Overview
■ Brief overview
■ History
    - Development began in 2002
■ Users
    - 9 universities (DK+SE), several hospitals + other research institutions
■ Platform and architecture
    - J2EE enterprise application
    - Release management: all users run instances of the same release version, same code-base
■ Business model
    - Commercial software licenses, powerful user group, shared budgets
■ Modular
    - Basic module, Reporting module, Student thesis module, External publications module, Bibliometrics module, Press module

5 Overview

6 Retrieval
■ Manual data gathering
■ User roles/rights + workflow:
    - = de-centralized data gathering
    - = validated data gathering
    - = continuous data gathering
■ GUI example
■ Management focus is necessary
    - Reports and statistics, KPI-management, etc.
■ Adding value to researchers is necessary
    - Instantly in Google indexes, instantly updated personal websites, instantly updated CV, increased citations (source in paper), etc.

7 Retrieval
■ Dynamic integration
■ Dynamic integration to local back-end systems:
    - Personnel systems, payroll systems (for data retrieval)
    - LDAPs, Active Directories (for data retrieval + authentication)
    - Single sign-on systems (for authentication)
    - … to automatically create object types such as “person” or “organization”
■ … and yes, PURE hosts data, too
    - We need complete objects according to the meta-data model
■ Plug-in architecture in PURE:
    - Pro = individually adapted integration
    - Con = individually programmed plug-in necessary
    - Future = GUI, standardized plug-ins
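
The plug-in idea on this slide can be sketched as one small adapter per back-end system. This is only an illustration, not PURE's plug-in API: PURE is a J2EE application and Python is used here for brevity. The names `SourcePlugin`, `PersonnelSystemPlugin`, `Person`, `synchronize` and the row fields (`emp_no`, `first`, `last`, `dept`) are all hypothetical.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Person:
    """Hypothetical 'person' object; the fields are illustrative only."""
    employee_id: str
    name: str
    organization: str


class SourcePlugin(ABC):
    """Hypothetical plug-in contract: one adapter per local back-end system."""

    @abstractmethod
    def fetch_persons(self) -> Iterable[Person]:
        ...


class PersonnelSystemPlugin(SourcePlugin):
    """Adapter for an imagined personnel system that delivers rows as dicts."""

    def __init__(self, rows: list[dict]):
        self._rows = rows

    def fetch_persons(self) -> Iterable[Person]:
        for row in self._rows:
            yield Person(
                employee_id=str(row["emp_no"]),
                name=f'{row["first"]} {row["last"]}',
                organization=row["dept"],
            )


def synchronize(plugins: Iterable[SourcePlugin]) -> dict[str, Person]:
    """Collect persons from all plug-ins, keyed by employee id (last one wins)."""
    persons: dict[str, Person] = {}
    for plugin in plugins:
        for person in plugin.fetch_persons():
            persons[person.employee_id] = person
    return persons
```

The "Con" on the slide shows up directly in the sketch: every new back-end system needs its own adapter class, which is what standardized plug-ins would avoid.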

8 Retrieval
■ Import
■ Historic data
■ Many sources
    - More or less useful data
    - More or less consistent use of formats :-)
■ The PXA format
    - PURE XML Archive format, .zip-based
    - Meta-data, relations between entities, binary files
■ Aggregation > enrichment > conversion > import
    - The process is external to PURE
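
A .zip-based archive that carries meta-data, relations and binary files together could look roughly like the sketch below. The slides do not describe the internal layout of PXA, so the file name `metadata.xml`, the `files/` directory, the element names and the function `build_archive` are assumptions made only for illustration.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def build_archive(records: list[dict], binaries: dict[str, bytes]) -> bytes:
    """Pack meta-data, relations and binary files into one .zip (layout is assumed)."""
    root = ET.Element("archive")
    for rec in records:
        obj = ET.SubElement(root, "object", id=rec["id"], type=rec["type"])
        ET.SubElement(obj, "title").text = rec["title"]
        # Relations between entities are stored as references to other object ids.
        for target in rec.get("related_to", []):
            ET.SubElement(obj, "relation", target=target)
        if rec.get("file"):
            ET.SubElement(obj, "file", path=f'files/{rec["file"]}')

    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("metadata.xml", ET.tostring(root, encoding="unicode"))
        for name, payload in binaries.items():
            zf.writestr(f"files/{name}", payload)
    return buffer.getvalue()
```

Because the conversion step runs outside PURE, a converter of this kind only has to produce the archive; the import step then reads it as a single unit.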

9 Retrieval
■ Experiments
■ Experiments with automated imports of historic data from specific, identified sources
■ [source format] > PXA conversion > import > enrichment/validation
■ Very poor data quality demands the concept of “draft objects” in PURE
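
The “draft object” concept can be sketched as an import step that never rejects a record but flags incomplete ones for later enrichment and validation. The required fields and the names `ImportedObject` and `import_record` are hypothetical and do not reflect PURE's actual meta-data model.

```python
from dataclasses import dataclass, field

# Assumed minimum for a valid object; not taken from PURE's meta-data model.
REQUIRED_FIELDS = ("title", "year", "authors")


@dataclass
class ImportedObject:
    data: dict
    status: str = "valid"  # either "valid" or "draft"
    problems: list[str] = field(default_factory=list)


def import_record(record: dict) -> ImportedObject:
    """Accept every record, but mark incomplete ones as drafts for later validation."""
    problems = [f"missing {name}" for name in REQUIRED_FIELDS if not record.get(name)]
    status = "draft" if problems else "valid"
    return ImportedObject(data=record, status=status, problems=problems)
```

A record with an empty author list, for example, is stored with status `"draft"` and a problem list that a person can work through afterwards.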

10 Exposure
■ Web services
■ RPC/encoded + document/literal
■ Rich libraries of methods
■ Including format-specific methods: APA, MLA, HARVARD, VANCOUVER and CBE
■ Free and near-instant adding of methods
■ WS code example (if time)

11 Exposure
■ OAI support
■ OAI-PMH data provider
■ OAI-PMH formats
    - DC
    - DDF-MXD (Danish national format)
    - SVEP (Swedish national format)
    - … more to come
■ Also used to harvest other PURE-repositories for “external publications”
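
Harvesting from an OAI-PMH data provider, as mentioned on this slide, comes down to issuing requests such as `ListRecords` and reading the returned XML. The sketch below builds such a request and extracts Dublin Core titles from a response. The base URL is a placeholder, and the helper names `list_records_url` and `titles` are made up for this example. Only the standard `oai_dc` prefix is shown, since the slides do not give the prefixes used for DDF-MXD or SVEP.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url: str, metadata_prefix: str = "oai_dc") -> str:
    """Build an OAI-PMH ListRecords request URL for the given metadata format."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return f"{base_url}?{query}"


def titles(response_xml: str) -> list[str]:
    """Extract all dc:title values from a ListRecords response document."""
    root = ET.fromstring(response_xml)
    return [el.text for el in root.findall(".//oai:record//dc:title", NS)]
```

A full harvester would also follow `resumptionToken` elements to page through large result sets; that part is left out here.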

12 Exposure
■ Z39.50
■ Enabling of searches in PURE from library systems
■ SRW/SRU
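
An SRU search from a library system is an ordinary HTTP GET with a CQL query. The sketch below builds a `searchRetrieve` request using SRU 1.1 parameter names. The endpoint URL and the index name `dc.title` are assumptions, since the slides do not say which indexes PURE exposes, and `sru_search_url` is a helper name invented for this example.

```python
from urllib.parse import urlencode


def sru_search_url(base_url: str, cql_query: str, maximum_records: int = 10) -> str:
    """Build an SRU 1.1 searchRetrieve URL for a CQL query."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": cql_query,
        "maximumRecords": maximum_records,
    }
    return f"{base_url}?{urlencode(params)}"


# Example call against a placeholder endpoint.
url = sru_search_url("https://example.org/sru", 'dc.title = "research"', 5)
```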

13 Exposure
■ Reports
■ PURE reporting module
■ GUI example

14 Exposure
■ Reference manager
■ Export of data to local Reference Manager installation
■ Using RM-formatted export file
■ Promotes registering to the repository rather than in RM
■ GUI example
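
The slide only says “RM-formatted export file”. The sketch below assumes this means the tagged RIS format that Reference Manager imports, and shows how one record could be serialized. The input dictionary layout and the function name `to_ris` are hypothetical, and only a handful of RIS tags are covered.

```python
def to_ris(record: dict) -> str:
    """Serialize one publication record as a minimal RIS entry (assumed export format)."""
    lines = [f"TY  - {record.get('type', 'JOUR')}"]  # reference type comes first
    lines += [f"AU  - {author}" for author in record.get("authors", [])]
    if record.get("title"):
        lines.append(f"TI  - {record['title']}")
    if record.get("year"):
        lines.append(f"PY  - {record['year']}")
    lines.append("ER  - ")  # end-of-record marker
    return "\r\n".join(lines) + "\r\n"
```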

15 Exposure
■ Portal framework
■ PUREportal – free PURE-specific framework for custom development of research exhibition portals
■ Online example
■ Typical cost scenario € 20,000
■ Typical delivery time 1 month
■ Little need for requirements specification
■ Automatic PURE-API maintenance

16 Archiving
■ Data archiving – 2 levels
■ SQL environment
■ Meta-data and relations
■ Binary files just stored in server file system
■ FEDORA via connector (not PURE-specific, Open Source)
■ Facilitates:
    - Higher quality archival of binary files
    - Long term preservation in general
    - Adoption of PURE in institutions’ general FEDORA strategies

17 Near future
■ The near future regarding data retrieval
■ More automated imports using increasingly advanced converters
■ Automated data delivery (push and harvest) to:
    - Industry-specific search services (e.g. PubMed, Nordicom)
    - Documentary data collections (such as clinicaltrials.org) and national collections (such as DDF (DK), ForskDok (NO), etc.)
■ Temporary import objects
    - When imported data are not of sufficient quality to create valid objects
    - When data cannot be properly related to other objects upon import