
1 Digital Libraries with Greenstone: an open source solution Tod Olson - University of Chicago Fred Miller - Illinois Wesleyan University Curtis Kelch - Illinois Wesleyan University Copyright Tod Olson, Fred Miller, and Curtis Kelch 2004. This work is the intellectual property of the authors. Permission is granted for this material to be shared for non-commercial, educational purposes, provided that this copyright statement appears on the reproduced materials and notice is given that the copying is by permission of the author. To disseminate otherwise or to republish requires written permission from the author.

2 Digital Libraries with Greenstone
Introduction
About digital libraries
Greenstone overview
Examples
Future
Live demos
Q & A

3 The World of Digital Libraries
Access to Digital Collections
  –Text, images, audio, video
  –Searching and metadata
Digital libraries versus repositories
  –Access and preservation
Digital Preservation Tutorial: http://www.library.cornell.edu/iris/tutorial/dpm/

4 Sorting Out the Ingredients
Raw materials
User interface
Elements of organization
Building the collection

5 Greenstone
New Zealand Digital Library Project at the University of Waikato
  –with UNESCO, Human Info NGO
International, every continent
Examples:
Academic
  –Digitization projects
  –Classes on digital libraries
Non-academic
  –UNESCO humanitarian documentation

6 Greenstone features
Works with existing documents
  –Imports several formats
Searching: full text and metadata
  –Dublin Core, custom metadata
Browse
Structured documents
  –Indexing, access
Extensible & customizable
Open source software (GPL)

7 User Interface overview
Finding documents
  –Search full text and metadata indexes
  –Classifiers: browse lists for navigating collections
Navigating documents
  –Navigate hierarchical documents by logical structure
  –Simple page turning (not shown)
  –Single page for simple documents (not shown)


19 Greenstone Architecture
[Diagram labels: Receptionist (two shown), Protocol, Collection Server, and three Collections, each with Import and DB & Indexes]
Redrawn from Witten & Bainbridge, How to Build a Digital Library, p. 356

20 Greenstone Architecture
Receptionist
  –Provides user interface
  –Accept user input
  –Send to appropriate collection server
  –Accept results
  –Dynamic page generation
Collection Server
  –Handle collection content
  –Search and filter information
  –Return results
  –Multiple collections
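The division of labor on this slide can be sketched in a few lines of Python. This is only an illustration of the receptionist / collection server split, not Greenstone's code: the class names, the `handle` and `search` methods, the substring search, and the sample documents are all assumptions made for the example, and each server here holds a single collection for simplicity.

```python
from dataclasses import dataclass, field


@dataclass
class CollectionServer:
    """Hypothetical stand-in: holds one collection and answers searches."""
    name: str
    documents: dict = field(default_factory=dict)  # doc id -> text

    def search(self, term):
        # "Search and filter information": ids of documents containing the term.
        term = term.lower()
        return sorted(doc_id for doc_id, text in self.documents.items()
                      if term in text.lower())


class Receptionist:
    """Hypothetical stand-in: takes user input, routes it, renders a page."""

    def __init__(self, servers):
        self.servers = {server.name: server for server in servers}

    def handle(self, collection, term):
        server = self.servers[collection]      # send to appropriate server
        hits = server.search(term)             # accept results
        lines = [f"Results for '{term}' in {collection}:"]
        if hits:
            lines.extend(f"  {doc_id}" for doc_id in hits)
        else:
            lines.append("  (no matches)")
        return "\n".join(lines)                # dynamic page generation


chopin = CollectionServer("chopin", {"op10": "Etudes, Op. 10",
                                     "op28": "Preludes, Op. 28"})
argus = CollectionServer("argus", {"1894-10": "The Argus, first issue"})
front_desk = Receptionist([chopin, argus])
print(front_desk.handle("chopin", "preludes"))
```

The receptionist never touches document content directly; it only forwards requests and formats what comes back, which is the separation the slide describes.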

21 Building Collections
[Diagram labels: HTML, PDF, ???; Import; GSAF; Build; DB & Indexes]

22 Building collections
Create a collection framework
  –or work with an old collection
Select documents
Import documents
  –Converts to internal XML format (GSAF)
Build collection
  –creates search indexes and browse listings
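The import and build steps can be pictured with a toy two-stage pipeline. The sketch below is a simplification under stated assumptions: the function names, the dictionary record format (standing in for GSAF), and the sample documents are invented for illustration and are not Greenstone's import or build tools.

```python
import re
from collections import defaultdict


def import_documents(raw_docs):
    """Import: convert raw (title, text) pairs into uniform internal records."""
    archive = []
    for number, (title, text) in enumerate(raw_docs, start=1):
        archive.append({"id": f"D{number}", "title": title.strip(), "text": text})
    return archive


def build_collection(archive):
    """Build: derive a full-text search index and a title browse listing."""
    index = defaultdict(set)
    for record in archive:
        for word in re.findall(r"[a-z0-9]+", record["text"].lower()):
            index[word].add(record["id"])
    browse = sorted((record["title"], record["id"]) for record in archive)
    return dict(index), browse


raw = [("Nocturnes", "Three nocturnes for piano"),
       ("Ballade", "Ballade for piano in G minor")]
archive = import_documents(raw)
index, browse = build_collection(archive)
print(sorted(index["piano"]))  # ids of documents that contain "piano"
print(browse)                  # titles in browse order
```

Keeping import separate from build means the indexes can be regenerated from the internal records without converting the source documents again.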

23 GSAF: internal XML format
Section:
Description
  –Metadata fields
Content
  –Text, internal markup, images
Section
  –No limit in number or depth
Hierarchical documents
Sections nest, tree structure

24 GSAF: internal XML format
[XML example; element markup not preserved. Surviving text: "[Text, images, links, etc.] …"]
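Since the XML on this slide did not survive, the following Python sketch shows roughly what a nested Section tree of the kind described on the previous slide could look like. The element names Section, Description, Metadata, and Content come from that slide; the `name` attribute on Metadata, the helper `make_section`, and the sample titles and file names are assumptions for illustration, not a copy of the original example.

```python
import xml.etree.ElementTree as ET


def make_section(metadata, text="", children=()):
    """Build one Section with a Description, a Content, and nested Sections."""
    section = ET.Element("Section")
    description = ET.SubElement(section, "Description")
    for name, value in metadata.items():
        # Assumed convention: one Metadata element per field, keyed by "name".
        entry = ET.SubElement(description, "Metadata", {"name": name})
        entry.text = value
    content = ET.SubElement(section, "Content")
    content.text = text
    for child in children:
        section.append(child)  # sections nest to any depth
    return section


page1 = make_section({"Title": "Page 1"}, "page1.jpg")
page2 = make_section({"Title": "Page 2"}, "page2.jpg")
score = make_section({"Title": "Example score"}, children=[page1, page2])
print(ET.tostring(score, encoding="unicode"))
```

Because `make_section` accepts already-built sections as children, the same call builds a flat single-page document or a deep hierarchy.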

25 Config file: collect.cfg
Collection-specific configuration file, collect.cfg, specifies:
file types to import
Indexes and browse lists
  –Document or section level
  –paragraph (text index only)
display of results and browse listings
document displays
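To make the role of such a file concrete, here is a small Python sketch that reads a line-oriented configuration in the general style of collect.cfg. The sample directives and their arguments are illustrative assumptions and may not match any particular Greenstone release; the parser is a hypothetical helper, not part of Greenstone.

```python
from collections import defaultdict

# Hypothetical settings in a collect.cfg-like style (directive, then arguments).
SAMPLE = """\
# which indexes to build, and at what level
indexes   document:text section:Title
# which file types to import
plugin    HTMLPlug
plugin    PDFPlug
# which browse lists to offer
classify  AZList -metadata Title
"""


def parse_config(text):
    """Group argument lists by directive, skipping blanks and comments."""
    settings = defaultdict(list)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        directive, _, rest = line.partition(" ")
        settings[directive].append(rest.split())
    return dict(settings)


config = parse_config(SAMPLE)
print(config["plugin"])
print(config["indexes"])
```

A directive that appears more than once, such as `plugin` here, keeps every occurrence in order, which matters when several file types are imported.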


31 Chopin Early Editions
Over 400 early edition Chopin scores, 1830s to 1880s
Target audience: music scholars & musicians
On web, page-turnable JPEG images
Online in March 2003
Currently 374 scores in online collection
Usage: nearly 100 hits per day; > 30% of use is international


39 Build overview
[Diagram labels: Catalog records, Scanned Images, Structural metadata; METS; XSLT; Greenstone Archive Format; Greenstone Dig. Library Software]
[Legend: Human processing; XML-based automated processing]

40 METS to GSAF
METS:
  dmdSec
    MODS: Title, …
  fileSec
    page1.jpg
    page2.jpg
  structMap
    div: Score
      div: Page 1
      div: Page 2
GSAF:
  Section
    Description
      Metadata: Title, …
    Content: Title, …
    Section
      Content: Page 1, page1.jpg
    Section
      Content: Page 2, page2.jpg
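The project performed this mapping with XSLT. As a rough illustration of the same idea in Python instead, the sketch below walks a METS structMap and emits one nested Section per div. The sample METS document, the use of the div LABEL as the Title, the `name` attribute on Metadata, and both function names are assumptions made for the example; the dmdSec/MODS descriptive metadata shown on the slide is left out to keep it short.

```python
import xml.etree.ElementTree as ET

NS = {"m": "http://www.loc.gov/METS/"}
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"

# Hypothetical two-page score, reduced to fileSec and structMap.
METS_SAMPLE = """\
<mets xmlns="http://www.loc.gov/METS/" xmlns:xlink="http://www.w3.org/1999/xlink">
  <fileSec><fileGrp>
    <file ID="F1"><FLocat LOCTYPE="URL" xlink:href="page1.jpg"/></file>
    <file ID="F2"><FLocat LOCTYPE="URL" xlink:href="page2.jpg"/></file>
  </fileGrp></fileSec>
  <structMap>
    <div LABEL="Score">
      <div LABEL="Page 1"><fptr FILEID="F1"/></div>
      <div LABEL="Page 2"><fptr FILEID="F2"/></div>
    </div>
  </structMap>
</mets>
"""


def div_to_section(div, files):
    """Turn one structMap div (and its child divs) into a nested Section."""
    section = ET.Element("Section")
    description = ET.SubElement(section, "Description")
    title = ET.SubElement(description, "Metadata", {"name": "Title"})
    title.text = div.get("LABEL", "")
    content = ET.SubElement(section, "Content")
    # Content names the image files this div points to directly.
    content.text = " ".join(files[ptr.get("FILEID")]
                            for ptr in div.findall("m:fptr", NS))
    for child in div.findall("m:div", NS):
        section.append(div_to_section(child, files))
    return section


def mets_to_sections(mets_xml):
    root = ET.fromstring(mets_xml)
    # Resolve file IDs from fileSec to the file names they locate.
    files = {f.get("ID"): f.find("m:FLocat", NS).get(XLINK_HREF)
             for f in root.iterfind(".//m:file", NS)}
    return div_to_section(root.find("m:structMap/m:div", NS), files)


score = mets_to_sections(METS_SAMPLE)
print([s.find("Content").text for s in score.findall("Section")])
```

The recursion mirrors the structure of the mapping: every div becomes a Section, so a deeper structMap yields a deeper Section tree without further changes.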

41 Greenstone benefits for Chopin
Robust, mature system
Recovered time in project
  –Fast to bring up
  –UI out of the box
  –Dynamic page generation
  –Incremental customization
XML compliant
  –Natural mapping from METS to GSAF

42 The Argus Digital Collection
Illinois Wesleyan Student Newspaper
  –1894 to 2000
Preservation and Access
Image PDF versus full text
Web interface for building metadata
Customized searches

43 Argus Metadata Maintenance

44 Argus Search

45 Argus Issue “front door”

46 Ongoing work: Greenstone
Greenstone Librarian Interface (GLI)
Greenstone 3

47 Greenstone Librarian Interface (GLI)
Collection management
  –Informed by work at GS sites
  –Assist collection designer
  –Support all phases of collection build process
  –Do not specify workflow
Java-based GUI tool
  –Formerly called the “Gatherer”
2 yrs in development
  –Beta sites: Bangalore and elsewhere
Training sessions
  –UNESCO sessions in Asia, Africa
  –JCDL 2004 tutorial

48 GLI functions
Establish new collection (or work on old)
Select files to include in collection
Enrich files with metadata
Select indexes, classifiers
Build collection
Customize appearance
Preview collection

49 Greenstone 3
GS2 mature, 5+ yrs., wide deployment
  –Constraints: support legacy systems
  –Other technologies have matured: Java, XML
GS3: rewrite in Java, XML, XSLT
Distributed architecture, SOAP
METS as internal format
  –Group assembled for Greenstone METS profile(s)
OAI support planned
1 year in dev; alpha testing in lab

50 Links & Further Information
Greenstone: http://www.greenstone.org/
Chopin Early Editions: http://chopin.lib.uchicago.edu/
Argus Digital Collection: http://www.iwu.edu/library/services/argus1.htm
Argus Greenstone Documentation: http://www.iwu.edu/~ckelch/ArgusProjectDoc12.pdf
Witten & Bainbridge. How to Build a Digital Library. Morgan Kaufmann, 2003.

51 More about Greenstone…

