3rd July 2007Open Repository, Open Source 1 Where we were, What we've learnt, And what it means to you.

Slides:



Advertisements
Similar presentations
automated single login access to Novell storage resources
Advertisements

Implementing Tableau Server in an Enterprise Environment
IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew.
About store The Project - Phase I StORe StORe StORe stands for: Source-to-Output Repositories.
Comparison of EPrints 3.0 and DSpace digital library systems Kuzma Kudim, Galina Proskudina.
1 The IIPC Web Curator Tool: Steve Knight The National Library of New Zealand Philip Beresford and Arun Persad The British Library An Open Source Solution.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
PowerCenter 8.6 SE Installation and Operational Guidelines.
PaperCut NG Chris Dance. Copyright © PaperCut Software Pty. Ltd. 2 Overview Overview of PaperCut NG Why we offer a Mac Version The story of our.
Confidential ODBC May 7, Features What is ODBC? Why Create an ODBC Driver for Rochade? How do we Expose Rochade as Relational Transformation.
Installing DSpace on Window Bharat M. Chaudhari School of Petroleum Management, PANDIT DEENDAYAL PETROLEUM UNIVERSIRY, GANDHINAGAR
1 Chapter Overview Monitoring Server Performance Monitoring Shared Resources Microsoft Windows 2000 Auditing.
The DSpace Course Module – DSpace Installation. Module objectives  By the end of this module you will:  Understand the platforms DSpace can be hosted.
Introduction to Active Directory December 10th, pm Daniels 407.
Chapter 11: Creating and Managing Shared Folders BAI617.
Linux Operations and Administration
® IBM Software Group © 2009 IBM Corporation Rational Publishing Engine RQM Multi Level Report Tutorial David Rennie, IBM Rational Services A/NZ
ProCol~ A jEdit Plugin for Remote Project Collaboration Justin Dieters Spring 2004 CS470 Final Presentation.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
Window NT File System JianJing Cao (#98284).
The DSpace Course Module – Upgrading from 1.4 to 1.5.
Plug-in Development Environment. Session Outline Tools Installation Configuration New Project Basic Debugging Remote Debugging.
Chad La Joie Shibboleth’s Future.
MCTS Guide to Microsoft Windows Vista Chapter 4 Managing Disks.
Eric Roux – AB/CO/AP Going to a “Linux only” operational world… The consequences for AB/CO/AP European Organisation of Nuclear Research Geneva - Switzerland.
1 Chapter Overview Performing Configuration Tasks Setting Up Additional Features Performing Maintenance Tasks.
WaveMaker Visual AJAX Studio 4.0 Training Basics: Building Your First Application Binding Basics.
Samba – Good Just Keeps Getting Better The new and not so new features available in Samba, and how they benefit your organization. Copyright 2002 © Dustin.
IUScholarWorks Technical Overview Randall Floyd Digital Library Program Programmer/Database Administrator.
ETD Software Options Hussein Suleman University of Cape Town October 2003.
Richard Jones, July 2005 Integrating Local Developments to DSpace.
Solutions using Microsoft Content Management Server 2002 Connector for SharePoint Technologies Sue Corke Mark Harrison Microsoft UK.
Experiment Management System CSE 423 Aaron Kloc Jordan Harstad Robert Sorensen Robert Trevino Nicolas Tjioe Status Report Presentation Industry Mentor:
HP0-J10 HP Implementing HP StorageWorks XP Disk Array Solutions Thousands of IT Professionals before you have already passed their HP0-J10 certification.
Nortel Nortel Wireless LAN 2300 Rls.5.0 Solutions Thousands of IT Professionals before you have already passed their certification exams.
WEB SERVER SOFTWARE FEATURE SETS
DSpace System Architecture 11 July 2002 DSpace System Architecture.
TOPIC 7.0 LINUX SERVICES AND CONFIGURATION. ROOT USER Root user is called “super user” because it has power far beyond those of mortal user. As root,
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
An Enterprise Clinical Data Search Solution. is Designed for: Informatics professionals, clinicians, statisticians, data managers and process/quality.
8 th Semester, Batch 2009 Department Of Computer Science SSUET.
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
Laserfiche Plus AA103 Eric Hu, Software QA Engineer Raymond Cruz, Software Support Engineer.
How to Ensure a Stress-Free Migration!
Symantec Implementation of HA Solutions for Windows using VCS 4.1 Thousands of IT Professionals before you have already passed their certification.
Monitoring Dynamic IOC Installations Using the alive Record Dohn Arms Beamline Controls & Data Acquisition Group Advanced Photon Source.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
Introduction to tds_fdw
Backing Up Your System With rsnapshot
Agenda:- DevOps Tools Chef Jenkins Puppet Apache Ant Apache Maven Logstash Docker New Relic Gradle Git.
High Availability 24 hours a day, 7 days a week, 365 days a year…
Shared Services with Spotfire
CARA 3.10 Major New Features
NGS Oracle Service.
Introduction, Features & Technology
JDBC.
Download 2018 Exact Oracle 1z0-134 Exam Study Guide - Oracle 1z0-134 Exam Dumps
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
SharePoint 2019 Changes Point of View.
Oracle Architecture Overview
Migrating your SQL Server Instance
Cloud computing mechanisms
JENKINS TIPS Ideas for making your life with Jenkins easier
NAVIGATING THE MINEFIELD
Rational Publishing Engine RQM Multi Level Report Tutorial
Oracle SQL Developer Data Modeler
Review Test 3 CS 101 Spring 2019.
SBS 2008 – One year on David Overton
Presentation transcript:

3rd July 2007Open Repository, Open Source 1 Where we were, What we've learnt, And what it means to you

3rd July 2007Open Repository, Open Source 2 The Hardware ● 2x DL360 G2 ● 1xMSA 1500cs ● 1xMSA 1000 ● 1xMSA 20 – 2 TBs capacity

3rd July 2007Open Repository, Open Source 3 The Software ● Windows Advanced Server 2000 ● Microsoft Cluster Server ● Oracle 10 (on Sun/Solaris) ● Sun JDK 5 ● Tomcat 5.5 ● DSpace customisations

3rd July 2007Open Repository, Open Source 4 Problem #1: Handle Servers ● Ran as console applications ● Would not start automatically at boot ● Required a user login to start the servers ● Servers would stop if remote login disconnected ● Keeping open remote logins for extended periods cause maintenance problems (limited connections)

3rd July 2007Open Repository, Open Source 5 Solution #1: Services ● Use open source Java Service Wrapper to create Windows services – – Uses single, simple configuration file ● Services started automatically on boot

3rd July 2007Open Repository, Open Source 6 Problem #2: Lucene ● Could not run filter-media, etc. whilst Tomcat was running and had an open searcher ● Existing solution was to stop Tomcat for entire duration of indexing, across all repositories ● Sometimes Tomcat would not be restarted correctly after indexing

3rd July 2007Open Repository, Open Source 7 Solution #2: DSIndexer Patch ● Added configuration option to index in an 'offline' directory ● Indexer attempts to swap in the new index on completion ● If it can't, sets a flag in the filesystem, and the DSQuery will swap it in on next search ● SourceForge patch # Avoid index lock with filter-media

3rd July 2007Open Repository, Open Source 8 Problem #3: Oracle ● Database had been installed as ISO-LATIN ● To store Unicode characters, key columns used the national character set (NCLOB / NVARCHAR) ● N... columns can not be identified correctly with java.sql types

3rd July 2007Open Repository, Open Source 9 Solution #3: Separate Instance ● New instance configured as UTF-8 ● All workarounds for N... column handling could be removed from the code ● All Oracle fixes that were still required (ie. CLOB support) could be contributed back – Patch # Oracle schema changes for CLOB columns – Patch # Workaround Numeric/Integer handling in Oracle

3rd July 2007Open Repository, Open Source 10 And on the 7 th Day Began building for the future Until...

3rd July 2007Open Repository, Open Source 11 Problem #4: PDFBox ● PDFBox likes to create temporary files ● Lots of temporary files (well, one per PDF) ● Doesn't do too well at cleaning them up either (Windows thing?) ● And whilst not exactly the fault of PDFBox: on restarting the servers, the Cluster FS no longer worked

3rd July 2007Open Repository, Open Source 12 Opportunity: Linux Migration ● Debian 4 – all required hardware and software support in Debian repositories ● Oracle Cluster Filesystem (OCFS2) – already used for Oracle installation – easy to configure

3rd July 2007Open Repository, Open Source 13 Handle Servers (Again) ● How do you: – Run multiple handle servers on a single machine? – Retain ability to stop / start them individually? ● Use Java Service Wrapper! – Same simple configuration file – some paths, etc. changed for Linux – Provided script stores PID in file, so each instance can be managed separately

3rd July 2007Open Repository, Open Source 14 The Future: DSpace 1.4.x ● New OR code based on DSpace with – bug fixes (from 1.4.2) – researcher pages (Nathan Sarr / Tim Donohue) – configurable item submission (Tim Donohue) – configurable browse (Richard Jones) / browse ordering ● added Oracle support ● submitted and integrated ordering code

3rd July 2007Open Repository, Open Source 15 Browse: Richard's Marketing Blurb ● Switching browse order on configured sort fields ● Configurable (by end user) results per page ● Long author list truncation (end user configurable) ● Significant performance improvements ● Paging on all result sets in all contexts ● Configurable cross linking to other browse contexts (e.g. authors in author list to items by that author) ● Access to browse tables via DAOs (easily allows different databases to be supported – ie. Postgres, Oracle)

3rd July 2007Open Repository, Open Source 16 Configurable Browse ● Can be configured to use any metadata fields ● Configurable sort fields ● No hard coded limits for – number of browse lists – number of sort fields

3rd July 2007Open Repository, Open Source 17 Existing Browse Ordering ● Relies on database ordering of sort columns ● Doesn't deal with Unicode characters well

3rd July 2007Open Repository, Open Source 18 Configurable Ordering ● Order 'type' defined by browse datatype ● Each type can be configured to have a delegate ● Simple delegates defined as combinations of TextFilters

3rd July 2007Open Repository, Open Source 19 Unicode Ordering ● IBM's ICU4J used to decompose Unicode characters ● ICU4J is the basis of the JDK unicode handling ● Decomposition is not a public API until JDK 6

3rd July 2007Open Repository, Open Source 20 Advanced Ordering ● Sort strings do not necessarily have to be human readable ● Example filter for doing Locale correct ordering (based on a fixed configured Locale, not per-visitor)

3rd July 2007Open Repository, Open Source 21 OR Contributions ● Oracle support ● Code cleanup ● Browse ordering ● Code review – concurrency issues – configurable browse ● Preliminary MySQL support

3rd July 2007Open Repository, Open Source 22 Conclusion ● Open source and community are good things ● Being more closely aligned with the DSpace core is allowing OR to – collaborate with other developers – support other institutions running DSpace – contribute code