Digital Libraries with Greenstone: an open source solution Tod Olson - University of Chicago Fred Miller - Illinois Wesleyan University Curtis Kelch -

Slides:



Advertisements
Similar presentations
OAI from 50,000 Feet OAI develops and promotes interoperability solutions that aim to facilitate the efficient dissemination of content. Begun in 1999.
Advertisements

Possibility in Digital Collection Management Introduction to CONTENTdm TM Hitoshi Kamada University of Arizona Presentation for OCLC-CJK Users Group Annual.
Open Scholarship 2006 Bielefeld Academic Search Engine a Scientific Search Service for Institutional Repositories Open Scholarship 2006 New Challenges.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
OAI and Publishers metadata Using the static repositories approach to disclose small journals.
Capacity Building Passing on the Experience Dr. Noha Adly World Digital Library Arab Peninsula Regional Group meeting.
What Does the Net Generation Expect From Us? SAC August 8, 2005 SAC August 8, 2005 Copyright © 2005, Joel L. Hartman. This work is the intellectual property.
November 6, 2003 Copyright Robert J. Beck This work is the intellectual property of the author. Permission is granted for this material to be shared.
Digital Collections: Storage and Access Jon Dunn Assistant Director for Technology IU Digital Library Program
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
MacKenzie Smith Associate Director for Technology MIT Libraries.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
Finding a Software System to Support ETDs Susan Gibbons Digital Initiatives Librarian University of Rochester.
A Web-based Bibliography Management Initiative: Collaborating for Classroom and Library Technology Integration Brian Nielsen, Academic Technologies Denise.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
1. The Digital Library Challenge The Hybrid Library Today’s information resources collections are “hybrid” Combinations of - paper and digital format.
Andrea Eastman-Mullins Information & Technology Coordinator University of North Carolina, Office of the President Teaching and Learning with Technology.
Joachim Bauer Senior System Engineer, CCS
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
Building Chopin Early Editions Tod A. Olson Graduate School of Library and Information Science University of Illinois at Urbana Champaign University of.
Building Collections Using Greenstone Tod A. Olson Sr. Programmer/Analyst Digital Library Development Center University of Chicago Library
UCLA Digital Library UC Digital Library Forum August 5, 2002 UCLA Digital Library Presenter: Curtis Fornadley Senior Programmer/Analyst.
Introducing Symposia : “ The digital repository that thinks like a librarian”
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
UCLA Digital Library Technical Architecture June 13, 2002 UCLA Digital Library Presenter: Curtis Fornadley, Senior Programmer/Analyst.
Greenstone Digital Library Usage and Implementation By: Paul Raymond A. Afroilan Network Applications Team Preginet, ASTI-DOST.
 Copyright Curtis D. Edmonds,  This work is the intellectual property of the author. Permission is granted for this material to be shared for non-commercial,
Training the University Community for Self-Archiving in Institutional Repositories Melanie Feltner-Reichert The University of Tennessee Thura R. Mack The.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
Katherine Don, Michael Dewsnip, Chi-Yu Huang Workshop on the Greenstone Digital Library Software Open source system.
Digital Library Architecture and Technology
New Partnerships for Smarter Data Discovery, eBooks and Digital Asset Management Thailand IUG 2012 – Mahidol University.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Copyright 2006, The Ohio State University Mary Manning Eric Schnell Using Greenstone Open-Source Digital Library Software at a Cultural Heritage Institution.
“Old Style” Libraries, Digital Libraries: Convergences, Divergences, And the Troubles in Between.
Multimedia Digital Library Marcia Johnson. Collection 25 text documents 25 text documents In HTML, PDF, TXT formats (source: Project Gutenberg) In HTML,
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
ALCME: OAI at OCLC Jeffrey A. Young OCLC Online Computer Library Center, Inc.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Document management (aka ‘digital libraries’) The Greenstone Group: Professor Ian Witten (leader); David Bainbridge, Dave Nichols, S.J. Cunningham, Steve.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Introduction to metadata
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
A Multi-Tiered Architecture for Distributed Data Collection and Centralized Data Delivery Stacy Kowalczyk and James Halliday April 28, 2008.
Greenstone Internals How to Build a Digital Library Ian H. Witten and David Bainbridge.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Peking University Digital Library Programs Overview
DSpace - Digital Library Software
Collection Management Systems
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Virtual Collections VIRTUAL COLLECTIONS LDI Architecture Meeting, Tuesday, July 19.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
1 ABCD as a digital library tool An introduction on the concept and implementation by Egbert de Smet Univ. of Antwerp.
Bielefeld Academic Search Engine
Defining an IT Workflow, from Request to Support
VI-SEEM Data Repository
Introduction to DSpace
Library Technology Conference: Building Exhibits
Project for OnLine Instructional Support (POLIS)
myIS.neu.edu – presentation screen shots accompany:
Presentation transcript:

Digital Libraries with Greenstone: an open source solution Tod Olson - University of Chicago Fred Miller - Illinois Wesleyan University Curtis Kelch - Illinois Wesleyan University Copyright Tod Olson, Fred Miller, and Curtis Kelch This work is the intellectual property of the authors. Permission is granted for this material to be shared for non-commercial, educational purposes, provided that this copyright statement appears on the reproduced materials and notice is given that the copying is by permission of the author. To disseminate otherwise or to republish requires written permission from the author.

Digital Libraries with Greenstone Introduction About digital libraries Greenstone overview Examples Future Live demos Q & A

The World of Digital Libraries Access to Digital Collections –Text, images, audio, video –Searching and metadata Digital libraries versus repositories –Access and preservation Digital Preservation Tutorial

Sorting Out the Ingredients Raw materials User interface Elements of organization Building the collection

Greenstone New Zealand Digital Library Project at the University of Waikato with UNESCO, Human Info NGO International, every continent Examples: Academic –Digitization projects –Classes on digital libraries Non-academic –UNESCO humanitarian documentation

Greenstone features Works with existing documents –Imports several formats Searching: full text and metadata –Dublin Core, custom metadata Browse Structured documents –Indexing, access Extensible & customizable OpenSource software (GPL)

User Interface overview Finding documents –Search full text and metadata indexes –Classifiers: browse lists for navigating collections Navigating documents –Navigate hierarchical documents by logical structure –Simple page turning (not shown) –Single page for simple documents (not shown)

Greenstone Architecture Receptionist Collection Server DB & Indexes Redrawn from Witten & Bainbridge, How to Build a Digital Library, p. 356 Protocol Collection Import DB & Indexes Collection Import DB & Indexes Collection Import Receptionist

Greenstone Architecture Receptionist Provides user interface Accept user input Send to appropriate collection server Accept results Dynamic page generation Collection Server Handle collection content Search and filter information Return results multiple collections

DB & Indexes HTML PDF ImportBuild GSAF ??? Building Collections

Building collections Create a collection framework –or work with an old collection Select documents Import documents –Converts to internal XML format (GSAF) Build collection –creates search indexes and browse listings

GSAF: internal XML format Section: Description –Metadata fields Content –Text,internal markup, images Section –No limit in number or depth Hierarchical documents Sections nest, tree structure

[Text, images, links, etc.] … GSAF: internal XML format

Config file: collect.cfg Collection-specific configuration file, collect.cfg, specifies: file types to import Indexes and browse lists –Document or section level –paragraph (text index only) display of results and browse listings document displays

Chopin Early Editions Over 400 early edition Chopin scores 1830’s to 1880’s Target audience: music scholars & musicians. On web, page-turnable JPEG images. Online in March 2003 Currently 374 scores in online collection Usage: Nearly100 hits per day, > 30% of use is international.

Catalog records Scanned Images Structural metadata METS XSLT Greenstone Archive Format Greenstone Dig. Library Software Human processing XML-based automated processing Build overview

METS to GSAF dmdSec MODS: Title, … fileSec page1.jpg page2.jpg structMap div: Score div: Page 1 div: Page 2 Section Description Metadata: Title, … Content: Title, … Section Content: Page 1 page1.jpg Section Content: Page 2 page2.jpg

Greenstone benefits for Chopin Robust, mature system Recovered time in project –Fast to bring up –UI out of the box –Dynamic page generation –Incremental customization XML compliant –Natural mapping from METS to GSAF

The Argus Digital Collection Illinois Wesleyan Student Newspaper –1894 to 2000 Preservation and Access Image PDF versus full text Web interface for building metadata Customized searches

Argus Metadata Maintenance

Argus Search

Argus Issue “front door”

Ongoing work: Greenstone Greenstone Librarian Interface (GLI) Greenstone 3

Greenstone Librarian Interface (GLI) Collection management –Informed by work at GS sites –Assist collection designer –Support all phases of collection build process –Do not specify workflow Java-based GUI tool –Formerly called the “Gatherer” 2 yrs in development –Beta sites: Bangalore and elsewhere Training sessions –UNESCO sessions in Asia, Africa –JCDL 2004 tutorial

GLI functions Establish new collection (or work on old) Select files to include in collection Enrich files with metadata Select indexes, classifiers Build collection Customize appearance Preview collection

Greenstone 3 GS2 mature, 5+ yrs., wide deployment –Constraints: support legacy systems –Other technologies have matured: Java, XML GS3: rewrite in Java, XML, XSLT Distributed architecture, SOAP METS as internal format –Group assembled for Greenstone METS profile(s) OAI support planned 1 year in dev; alpha testing in lab

Links & Further Information Greenstone: Chopin Early Editions: Argus Digital Collection: Argus Greenstone Documentation: Witten & Bainbridge. How to Build a Digital Library. Morgan Kaufman, 2003.

More about Greenstone…