Download presentation
Presentation is loading. Please wait.
Published byMavis Gregory Modified over 8 years ago
1
Dr. M.G. Sreekumar UNESCO Coordinator, Greenstone Support, South Asia Librarian & Head, CDDL, IIM Kozhikode OPEN SOURCE TECHNOLOGIES FOR LIBRARIES
2
Agenda The Current Information Landscape The Current Information Landscape Open Source Overview Open Source Overview The OS Treasure Trove The OS Treasure Trove Categories of Open Source Software for Libraries Categories of Open Source Software for Libraries Open Source Digital Library Systems Open Source Digital Library Systems Greenstone Greenstone DSpace DSpace Open Source Suite from PKP, SFU Open Source Suite from PKP, SFU Open Source XML Tools and Systems Open Source XML Tools and Systems
3
Foreword Demand for improved information and knowledge management solutions - universities, enterprises and institutions Demand for improved information and knowledge management solutions - universities, enterprises and institutions Digital Libraries gaining increasing social attention, academic and research interest Digital Libraries gaining increasing social attention, academic and research interest Need for Integrated access to disparate information resources Need for Integrated access to disparate information resources Key challenge - how to create online information environments facilitating internal content publishing and single point access to internal/external information sources Key challenge - how to create online information environments facilitating internal content publishing and single point access to internal/external information sources Latest DL technologies Vs Traditional libraries and knowledge management Latest DL technologies Vs Traditional libraries and knowledge management Options before us – Proprietary Vs Open Standards / Open Source Software Options before us – Proprietary Vs Open Standards / Open Source Software Fortunately we have a plethora of Open Source Solutions available for Library applications Fortunately we have a plethora of Open Source Solutions available for Library applications
4
The Current Environment Fascinating times in the history of libraries, information systems and electronic publishing Fascinating times in the history of libraries, information systems and electronic publishing Possibilities of building large-scale services Possibilities of building large-scale services Collections are in digital formats and Collections are in digital formats and Retrieved over networks Retrieved over networks Materials are stored on computers Materials are stored on computers Network connects the computers to personal computers on the users' desks Network connects the computers to personal computers on the users' desks In a complete digital library, nothing need ever reach paper In a complete digital library, nothing need ever reach paper
5
Feel of the Hour
7
Need of the Hour
8
Future Libraries? What is a library and what should it be in 2012, 2020 and beyond… What does the academic library of the future look like? What does the academic library of the future look like? Where do its walls begin and end? Where do its walls begin and end? On campus? On campus? On our desktop? On our desktop? At home? At home? Does it still have a function as a separate and distinct space? Does it still have a function as a separate and distinct space? Or has it become the first step to an all-virtual future? Or has it become the first step to an all-virtual future? Libraries have never been more interesting, difficult and challenging… Libraries have never been more interesting, difficult and challenging…
9
Challenges of the Day Relevance of Libraries in the Google Era Retention of Users, especially the New Gen Proliferation of Content Diverse Datastreams - Content Categories, Publication Types Multimedia, Polymedia, Multiformats Collection Building – Acquisition, Subscriptions, Licensing… Copyright, Intellectual Property, Fair Use… Technology Complexities, Infrastructure Issues Publishers’ Stringent Policies / Monopolies Integration of legacy systems and the new genre
10
Information Strategy Tips Context = Scenarios, Paradigms Constant = Change Technology = Facilitate, and NO intimidate Information = The Big Picture - Landscape Content = Aggregate, Integrate Service = Markup, Market Capital = Human, Tacit, Values, and Users
11
Factors of Change Enterprise IKM Electronic Publishing Internet & Web Intranets Digital Libraries Knowledge Management E-Information: Usage and User Behavior Experiences? Lessons? Impact? Implications?
12
IM: Key Goals Develop and manage a dynamic, unified information resource base (content repository) that gathers and organizes relevant internal and global information resources, based on a taxonomy of information needs of the enterprise, and make these available for learning and informed decision making.
13
IM : Key Goals… Support different manifestations of information sources – implicit/explicit, print/digital, local/remote, free/commercial, etc. Support for delivering personalized information services to staff, both on demand and in anticipation.
14
User Generated Internet Content: Blogs YouTube MySpace And the same is TRUE with Scholarly Communication too!!
15
Top Tech Trends in IT / LIS Web 2.0 / Library 2.0 Web 2.0 / Library 2.0 Blogs / RSS Feeds / Wikis / Podcasts / Webcasts Blogs / RSS Feeds / Wikis / Podcasts / Webcasts Open Source Software, Open Standards, Open URL Open Source Software, Open Standards, Open URL User Tagging, Automated Tagging User Tagging, Automated Tagging Web OPACs, and Interface Design Web OPACs, and Interface Design Seamless Integration / Aggregation Seamless Integration / Aggregation OA -> OAP + OAA OA -> OAP + OAA Open Resource Discovery Tools - Google Scholar Open Resource Discovery Tools - Google Scholar E-Books, E-Journals, E-Resources E-Books, E-Journals, E-Resources Harvesting, Federation, Metasearching Harvesting, Federation, Metasearching Digital Rights Management Digital Rights Management
17
Multimedia Library Info System Multimedia Library Info System Internet / Intranet Gateway-out Data capture USER @ anywhere (access to information from anywhere)
18
Penetration of E-Content in Libraries PUBLICATION TYPES E-Books, E-Journals… Aggregated Scholarly E- Journal Databases Databases, CBT/ WBT Portals, Vortals… Value added services Preprints, Eprints, E- Documents…. DOCUMENT FORMATS ASCII, RTF, HTML, SGML, Postscript, PDF, Proprietary, Native Application Formats Images, Graphics Audio Video XHTML, ASP, PHP, XML...
19
Internally Generated Internally Generated Externally Sourced Externally Sourced Lib 2.0 & Patron 2.0 Lib 2.0 & Patron 2.0 Open Access Information Landscape [print/digital] Information Landscape [print/digital] Processes Procedures Data/Info. Manuals Reports… Processes Procedures Data/Info. Manuals Reports… Books, eBooks, Journals, eJournals, Databases, Patents, Reports, Online Resources… Books, eBooks, Journals, eJournals, Databases, Patents, Reports, Online Resources… Social Computing & Social Software Social Computing & Social Software OA Journals, OA Archives, Scholarly Articles, ePrint Archives, ETDs, eCoursewares OA Journals, OA Archives, Scholarly Articles, ePrint Archives, ETDs, eCoursewares
20
Shift in Approaches Traditional Automated Dig. Library AACR2 ISO 2709 CCF MARC Thesauri AACR2 CCC CC / LCCS DDC / UDC Thesauri/LCSH Metadata DCMI -- W3C EAD, TEI, DTD METS,MODS, Z39.50 MARC21 OAI-PMH Limited/ Rigid Efficient/ Flexible Improved
21
What Distinguishes a DL? Site Neutrality (3 in 1 Access-Anytime, Anywhere by Anyone Access) Open Access Greater variety and granularity of information Sharing of information ‘Sharium’ Up-to-date ness Always available (365*7*24) New forms of rendering (New Genre)
22
Digital Libraries: An Overview Digital Libraries ComputingNetworkingContentCollectionsServicesCommunity
23
What is open source software? In the phrase open source, source refers to source code, the human-readable computer code which is the origin, or source, of the computer application. In the phrase open source, source refers to source code, the human-readable computer code which is the origin, or source, of the computer application. Open refers to the terms of access to that computer source code. Open refers to the terms of access to that computer source code. So open source software is software for which the source code is freely available. So open source software is software for which the source code is freely available.
26
Advantages and Disadvantages Advantages Mostly issued under an internationally accepted License Mostly issued under an internationally accepted License Access to source code and ability and right to modify it Access to source code and ability and right to modify it Right to redistribute modifications to benefit wider community Right to redistribute modifications to benefit wider community Free Free Excellent support networks Excellent support networks Large and enthusiastic user base Large and enthusiastic user baseDisadvantages Limited or no accountability Limited or no accountability Informal and unaccountable support channels Informal and unaccountable support channels
27
Different Open Source Licenses GNU GPL ("General Public License") GNU GPL ("General Public License") GNU Lesser GPL GNU Lesser GPL BSD License BSD License Mozilla Public License Mozilla Public License IU Open Source License IU Open Source License And more... And more... And more... And more...
32
A Good Starting Point oss4lib: Open Source Systems for Libraries oss4lib: Open Source Systems for Libraries www.oss4lib.org www.oss4lib.org www.oss4lib.org
38
Open Source Resources Open Source Initiative Open Source Initiative Open Source Initiative Open Source Initiative GNU GNU GNU SourceForge SourceForge SourceForge
39
Categories of Open Source Software Operating Systems Operating Systems Linux, Free / Open BSD, Open Solaris… Linux, Free / Open BSD, Open Solaris… Programming Languages Programming Languages Perl, PHP, Python Perl, PHP, Python Applications Applications Apache, Tomcat, emacs, grep, MySQL, sendmail, ssh Apache, Tomcat, emacs, grep, MySQL, sendmail, ssh
40
Open Source Software for DLs Linux, Apache, Tomcat, PHP, Perl, DLXS, ImageMagick, Unreal Media Server, Greenstone, DSpace, ePrints, FEDORA, CDSWare, MySQL, Darwin Streaming Server, emacs, CVS, Webalizer, LibXML, LibXSLT, Saxon, and more! Linux, Apache, Tomcat, PHP, Perl, DLXS, ImageMagick, Unreal Media Server, Greenstone, DSpace, ePrints, FEDORA, CDSWare, MySQL, Darwin Streaming Server, emacs, CVS, Webalizer, LibXML, LibXSLT, Saxon, and more!
41
Some categories of open source library software Library-oriented search engines Library-oriented search engines Cheshire, Pears, dbWiz… Cheshire, Pears, dbWiz… Z39.50 toolkits Z39.50 toolkits ZetaPerl (Perl), JAFER (Java), YAZ (C/C++), Mercury Z39.50 Client… ZetaPerl (Perl), JAFER (Java), YAZ (C/C++), Mercury Z39.50 Client… MARC parsers MARC parsers MARC.pm (Perl), MARC4J (Java) MARC.pm (Perl), MARC4J (Java) MarcEdit http://oregonstate.edu/~reeset/marcedit/html/inde x.php MarcEdit http://oregonstate.edu/~reeset/marcedit/html/inde x.php Image processing Image processing ImageMagick, tiffinfo/tiffdump ImageMagick, tiffinfo/tiffdump
42
Some categories of Open Source Library software / Open Standards Portals Portals MyLibrary MyLibrary OAI service providers and data providers OAI service providers and data providers PHP OAI Data Provider PHP OAI Data Provider Lots! See www.openarchives.org Lots! See www.openarchives.orgwww.openarchives.org METS tools METS tools Page turners, toolkits, more: see www.loc.gov/mets/ Page turners, toolkits, more: see www.loc.gov/mets/www.loc.gov/mets/ Dublin Core Dublin Core http://dublincore.org http://dublincore.org
43
Web Server Apache Apache Lots in Java! see at… Lots in Java! see at… http://java-source.net/open-source/web-servers http://java-source.net/open-source/web-servers
44
Database Management Systems (DBMS) MySQL MySQL PostgreSQL PostgreSQL mSQL mSQL CDS/ISIS, Win/ISIS, GenISIS etc. CDS/ISIS, Win/ISIS, GenISIS etc.
45
Web Server-Side Scripting PHP PHP Architecture Architecture Linux, Apache, MySQL, PHP (LAMP) Linux, Apache, MySQL, PHP (LAMP) Windows, Apache, MySQL, PHP (WAMP) Windows, Apache, MySQL, PHP (WAMP)
46
Web Services Apache Tomcat Web Container/Service Apache Tomcat Web Container/Service Apache Cocoon Content Framework/Service Apache Cocoon Content Framework/Service Apache Ant Build Tool Apache Ant Build Tool
47
Integrated Library Management System (ILMS) Managing legacy systems Managing legacy systems KOHA KOHA http://www.koha.org/ http://www.koha.org/ Evergreen Evergreen http://wiki.code4lib.org/index.php/Evergreen http://wiki.code4lib.org/index.php/Evergreen Emilda Emilda http://wiki.code4lib.org/index.php/Emilda http://wiki.code4lib.org/index.php/Emilda OpenBiblio OpenBiblio http://wiki.code4lib.org/index.php/OpenBiblio http://wiki.code4lib.org/index.php/OpenBiblio phpMyLibrary phpMyLibrary http://wiki.code4lib.org/index.php/PhpMyLibrary http://wiki.code4lib.org/index.php/PhpMyLibrary NewGenLib NewGenLib http://www.verussolutions.biz/ http://www.verussolutions.biz/
48
Server Log Analysis Webalizer Webalizer http://www.webalizer.org/ http://www.webalizer.org/
49
Z39.50 Protocol for online/remote Search & Retrieval ( http://www.loc.gov/z3950/ ) Interoperability standard (ANSI/NISO Standard) and a software which facilitates cross- database/archives search Interoperability standard (ANSI/NISO Standard) and a software which facilitates cross- database/archives search A client-server protocol for searching and retrieving information from remote computer databases A client-server protocol for searching and retrieving information from remote computer databases YAZ Z39.50 Client - http://indexdata.com/yaz/ YAZ Z39.50 Client - http://indexdata.com/yaz/ 'Mercury' Z39.50 Client - http://www.basedowinfosys.com/projects/mzc 'Mercury' Z39.50 Client - http://www.basedowinfosys.com/projects/mzc
51
Serials Manager CUFTS CUFTS http://researcher.sfu.ca/cufts http://researcher.sfu.ca/cufts
52
Citation Manager Citation Manager (from PKP, Simon Fraser University, Canada ) Citation Manager (from PKP, Simon Fraser University, Canada ) Bibliographic Management (http://researcher.sfu.ca/cm) Bibliographic Management (http://researcher.sfu.ca/cm)
53
Link Resolving GODOT - Electronic (Online) Resources Management GODOT - Electronic (Online) Resources Management http://researcher.sfu.ca/godot http://researcher.sfu.ca/godot
54
Open Journal Publishing OJS OJS http://pkp.sfu.ca/ojs http://pkp.sfu.ca/ojs
55
Open Conference Systems OCS : Conference workflow automation OCS : Conference workflow automation http://pkp.sfu.ca/ocs http://pkp.sfu.ca/ocs
56
Open URL Systems Open URL 1.0 Open URL 1.0 http://www.oclc.org/research/ software/openurl/default.htm http://www.oclc.org/research/ software/openurl/default.htm
57
Open Digital Libraries Greenstone Greenstone DSpace DSpace Eprints Eprints FEDORA etc. FEDORA etc.
58
Open Access Archives / IRs DSpace DSpace Eprints Eprints FEDORA FEDORA CDSWare CDSWare Greenstone etc. Greenstone etc.
59
Learning Management Systems (LMS) E-learning Systems E-learning Systems Moodle Moodle Manhattan etc. Manhattan etc.
60
Content Management Systems (CMS) Joomala Joomala Drupal Drupal MediaWiki MediaWiki
61
Open Archives Harvester Harvester Harvester http://pkp.sfu.ca/harvester http://pkp.sfu.ca/harvester
63
Federated Searching dbWiz : PKP Project dbWiz : PKP Project http://researcher.sfu.ca/dbwiz http://researcher.sfu.ca/dbwiz Google Custom Search Google Custom Search http://www.google.com/coop/cse/ http://www.google.com/coop/cse/
66
Social Computing/Software Blogs Blogs Tags Tags Wikis Wikis RSS RSS Feed Aggregation etc. Feed Aggregation etc.
67
Open Courses WWW.W3SCHOOLS.COM WWW.W3SCHOOLS.COM Open Courseware http://ocw.mit.edu/ Open Courseware http://ocw.mit.edu/
70
What are digital libraries for? Knowledge/content management Manage and access internal information assets Scholarly communication, education, research E-journals, e-prints, e-books, data sets, e-learning Access to cultural collections Cultural, heritage, historical & special collections, museums, biodiversity E-governance Improved access to government policies, plans, procedures, rules and regulations Archiving and preservation Many more …
71
DL Software: Alternatives What are your expectations? Develop local web-based application? Commercial DL solution? Adopt open source software? Greenstone Eprints DSpace Fedora…
72
Digital Library Technologies Interoperability Unified interface for heterogeneous libraries Metadata mapping across different libraries OAI-compliant data and service providers Multilingual digital libraries Scalable digital library architectures Publication tools Searching tools
73
DLs: Workflows and Processes Content selection Content acquisition Content publishing Metadata preparation Content loading Content indexing & storage Content access & delivery Preservation Access management Usage monitoring and evaluation Networking and interoperation Maintenance
74
DL Software: Key requirements Document types (book, journal article, lecture …) Document formats (text, PDF, Word, PS, …) Content acquisition (online and offline) –Metadata description, content tagging –Content uploading Indexing and retrieval –Structured/ full text indexing –Automatic metadata extraction Storage –Data compression –Efficient storage for metadata –Efficient location of metadata and documents Access and delivery –Structured search, browse, hierarchical browsing –CD-ROM distribution
75
DL Software: More requirements Scaling up – for large collections Multilingual support Access management and security Usage monitoring and reporting Standards compliance –XML, Dublin Core, Unicode Interoperation –OAI, Z39.50 compliance, MARC21…
77
Complete DL Systems Greenstone Greenstone DSpace DSpace Eprints Eprints
78
Greenstone: Open source Software for Building Digital Library Collections
80
What is the Greenstone software? Software suite for building, maintaining, and distributing digital library collections Comprehensive, open-source Developed by New Zealand Digital Library Project at the University of Waikato Distribution and promotion partners: UNESCO Human Info NGO, Belgium NCSI, Bangalore; UCT, Cape Town; Dakar, Senegal; Almaty, Kazakhstan; … You!
81
Greenstone Features Supports creation and management of collections by administrator(s) Supports creation and management of collections by administrator(s) Web interface for search and retrieval Web interface for search and retrieval Customizable metadata Customizable metadata Supports full text search of content Supports full text search of content Extensive document filters Extensive document filters Word, Excel, PowerPoint, PDF,... Word, Excel, PowerPoint, PDF,... Can extract metadata from documents Can extract metadata from documents Many ways to build a collection, including: Many ways to build a collection, including: Local files Local files Retrieve web sites Retrieve web sites Retrieve objects via OAI-PMH Retrieve objects via OAI-PMH
82
Greenstone Features… Open Source Philosophy Interfacing & Content Delivery via Web Multi S/W Platform Multi Lingual Support Multi Formats Structured Metadata in XML using DC Metadata Extraction Searching & Browsing Plug-ins for Documents Full-text mirroring Text Level Penetration Data Compression Password protection Administrative Functions Concurrent & Dynamic Content Development Uniform Presentation Publishing on CDROMs International Presence
83
Greenstone Features contd... Easy Installation Easy Maintenance Content Development (3 alternate ways) Predominantly GLI now - since (V. 2.41) Hierarchy Structure Interface Customization –Front Page Design, Header for the Digital Library, Collection Icon, Cover Images Collection Configuration (Collect.cfg) File Scalability, Flexibility Interoperability (Crosswalk), OAI Compliance Lifeline : Listserv / E-Group / Archives
84
Ghostscript Kea pdftohtml rtftohtml TextCat wvWare Xlhtml XML::Parser Interpreter for Adobe Postscript documents (Postscript plugin) Keyphrase extraction program (to generate metadata) Converter for PDF documents (PDF plugin) Converter for RTF documents (RTF plugin) Detects languages and document encodings Converter for Word documents (Word plugin) Converter for Excel/Powerpoint documents (plugins) Parses XML documents, used to read and write Greenstone’s internal XML document format The power of open source: Greenstone uses …
85
MG GDBM wget YAZ Stemmer GCC CVS Perl Apache OAI-PMH Creates compressed full-text indexes and performs searches Database used for metadata etc Downloading pages from the Web when creating collections Client and server implementation of Z39.50 English language stemmer C/C++ compiler Version control system Used for plugins etc Web server used by many Greenstone installations OAI Performance and …
86
Example Greenstone collections Rapid growth in use International – Many Countries…China, Germany, India, UK, USA, Russia, Malaysia, Singapore... – Almost all countries/Continents Increasing activity on Greenstone mailing list Promotion by UNESCO – “deployment of DL’s for sharing public domain information” Wide variety of DL collections have been developed in several languages –historical, educational, cultural, and research
87
Greenstone Technology Runs on Windows (back to 3.1), Linux, Mac OS X, Unix Runs on Windows (back to 3.1), Linux, Mac OS X, Unix Written in C++, Perl, and Java Written in C++, Perl, and Java Uses MG/MG++ search engine Uses MG/MG++ search engine Several different Web and Java/Swing user interfaces for various functions Several different Web and Java/Swing user interfaces for various functions Web interface for user access Web interface for user access
88
Greenstone Demonstration Examples at www.greenstone.org Examples at www.greenstone.orgwww.greenstone.org
89
DSpace
90
DSpace “DSpace is a groundbreaking digital institutional repository that captures, stores, indexes, preserves, and redistributes the intellectual output of a university’s research faculty in digital formats.” “DSpace is a groundbreaking digital institutional repository that captures, stores, indexes, preserves, and redistributes the intellectual output of a university’s research faculty in digital formats.” Developed jointly by MIT Libraries and Hewlett- Packard Developed jointly by MIT Libraries and Hewlett- Packard Licensed under BSD distribution license Licensed under BSD distribution license www.dspace.org www.dspace.org www.dspace.org
91
DSpace Supports submission of, management of, and access to digital content Supports submission of, management of, and access to digital content Formats: text, images, audio, video Formats: text, images, audio, video Organized based on organizational needs of a large university Organized based on organizational needs of a large university Communities and collections Communities and collections
93
DSpace Data Model
94
DSpace Features Digital preservation Digital preservation Persistent IDs, support levels for different file formats Persistent IDs, support levels for different file formats Access control Access control Versioning Versioning Search and retrieval Search and retrieval Based on qualified Dublin Core metadata Based on qualified Dublin Core metadata OAI-PMH data provider OAI-PMH data provider To support metadata harvesters To support metadata harvesters
95
DSpace Technology OS: Unix or Linux OS: Unix or Linux Written in Java Written in Java PostgreSQL relational database PostgreSQL relational database Provides complete Web user interface, but Java APIs available Provides complete Web user interface, but Java APIs available
96
DSpace Architecture
97
DSpace Software / Utilities 1. Java SDK 1.4.2 2. Apache Maven 2.0.54 3. Tomcat 5.0.28 4. Apache Ant 1.6.5 5. PostgreSQL 8.0.2 6. DSpace 1.5x / 2.x
98
DSpace Demonstration MIT DSpace MIT DSpace dspace.mit.edu dspace.mit.edu dspace.mit.edu
99
EPrints “Open Source software which creates online archives” “Open Source software which creates online archives” Developed by University of Southampton, UK Developed by University of Southampton, UK Supports self-archiving of e-prints Supports self-archiving of e-prints Can be configured as institutional repository or otherwise, e.g. repository focused on particular research area or discipline Can be configured as institutional repository or otherwise, e.g. repository focused on particular research area or discipline Licensed under GNU General Public License Licensed under GNU General Public License software.eprints.org software.eprints.org software.eprints.org
100
EPrints Supports submission, management of, and access to digital content Supports submission, management of, and access to digital content Can support multiple archives on one server Can support multiple archives on one server Moderated or unmoderated archives Moderated or unmoderated archives Search and retrieval Search and retrieval Based on metadata Based on metadata Metadata can be customized for different archives and document types Metadata can be customized for different archives and document types No access control No access control OAI-PMH data provider OAI-PMH data provider
101
EPrints Technology OS: Unix or Linux OS: Unix or Linux Written in Perl Written in Perl Requirements: Requirements: Apache web server Apache web server MySQL relational database MySQL relational database
102
EPrints Demonstration Digital Library of the Commons Digital Library of the Commons dlc.dlib.indiana.edu dlc.dlib.indiana.edu dlc.dlib.indiana.edu
103
Open Source XML Tools and Systems Utilities Utilities Xalan, Xerces, libxml, libxslt, saxon Xalan, Xerces, libxml, libxslt, saxon Editors Editors emacs / nxml-mode emacs / nxml-mode Database / Search Engines Database / Search Engines Apache Xindice Apache Xindice Berkeley DB XML Berkeley DB XML eXist eXist Publishing/WebApplication Frameworks Publishing/WebApplication Frameworks AxKit AxKit Cocoon Cocoon
104
XML Databases & Search Engines Apache Xindice Apache Xindice Apache Xindice Apache Xindice Berkeley DB XML Berkeley DB XML Berkeley DB XML Berkeley DB XML eXist eXist eXist
105
Greenstone Windows Installation Version 2.81rc
106
Opening Greenstone on Browser Digital Library Server Greenstone Digital Library
107
Opening Greenstone on Browser Greenstone Digital Library Collections
108
GLI
109
GLI Functions Establish new collection (or work on old) Select files to include in collection (Gather) Enrich files with metadata (Enrich) Select Plugins, Indexes, Classifiers (Design) Build Collection (Create) Format and Control Display (Format) Customize Appearance Preview Collection
110
Collection Building… Greenstone used to have three modes of collection building, viz., Command Line, Web Interface and the GLI (Greenstone Librarian Interface) Greenstone used to have three modes of collection building, viz., Command Line, Web Interface and the GLI (Greenstone Librarian Interface) Progressing with version 2.4x., the GLI got strengthened as well as popularized Progressing with version 2.4x., the GLI got strengthened as well as popularized Web Interface mode has been withdrawn temporarily. Web Interface mode has been withdrawn temporarily. The GLI based collection building is quite easy and simple a method. The GLI based collection building is quite easy and simple a method. Collection developers can activate the GLI software and use the ‘Gather’, ‘Enrich’, ‘Design’, ‘Format’ and ‘Create’ panel for making collection Collection developers can activate the GLI software and use the ‘Gather’, ‘Enrich’, ‘Design’, ‘Format’ and ‘Create’ panel for making collection
111
Customization Greenstone is specifically designed to be highly extensible and customizable. New document and metadata formats are accommodated by writing "plugins" (in Perl). Analogously, new metadata browsing structures can be implemented by writing "classifiers." The user interface look-and-feel can be altered using "macros" written in a simple macro language. A Corba protocol allows agents (e.g. in Java) to use all the facilities associated with document collections. Finally, the source code, in C++ and Perl, is available and accessible for modification
112
Customizing with macros Let you customize presentation Let you customize presentation Present pages in different languages Present pages in different languages Print variables into the page text (e.g. number of search hits) Print variables into the page text (e.g. number of search hits) Macro files Macro files stored in greenstone2/macros folder stored in greenstone2/macros folder each file defines one or more “packages” each file defines one or more “packages” (A “package” is a group of macros) loaded on startup (note difference between Local and Web Library) loaded on startup (note difference between Local and Web Library) listed in etc/main.cfg listed in etc/main.cfg Collection-specific macros Collection-specific macros Stored in greenstone2/collect/mycol/macros/extra.dm Stored in greenstone2/collect/mycol/macros/extra.dm Or include argument [c=collectionname] for each macro Or include argument [c=collectionname] for each macro
113
Hierarchy Structure
114
Collection configuration Collection configuration file determines content conversion, extraction and building of indexes and browsing structures –indexes, classifiers, plugins Presentation of search/browse results and collection interface is determined by “format” strings and “macros”
116
DL - Hardships Copyright Issues Copyright Issues Technology Complexities Technology Complexities Infrastructure Issues Infrastructure Issues Publications/Formats – Diverse Datastreams Publications/Formats – Diverse Datastreams Digital Objects/Formats - Multiple Digital Objects/Formats - Multiple Publishers’ Policies – Stringent, Inconsistent Publishers’ Policies – Stringent, Inconsistent
117
Major Tasks Content identification (internal / external) Content identification (internal / external) Content Creation Content Creation Content Collation/Signposts Content Collation/Signposts Organisation Organisation Updation Updation Retrieval / Dissemination Retrieval / Dissemination User Training User Training Archiving Archiving
118
Data/ Objects METS/MODS EAD TEI DCMI OS Z39.50 /OAI-PMH Network DL Software DIGITAL LIBRARY ARCHITECTURE
119
http://greenstonesupport.iimk.ac.in
120
Acknowledgement Team Greenstone, New Zealand Team Greenstone, New Zealand Greenstone Support South Asia Greenstone Support South Asia IIM Kozhikode, India IIM Kozhikode, India UNESCO UNESCO Indiana University Digital Library Program Indiana University Digital Library Program
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.