Presentation is loading. Please wait.

Presentation is loading. Please wait.

Dr. M.G. Sreekumar UNESCO Coordinator, Greenstone Support, South Asia Librarian & Head, CDDL, IIM Kozhikode OPEN SOURCE TECHNOLOGIES FOR LIBRARIES.

Similar presentations


Presentation on theme: "Dr. M.G. Sreekumar UNESCO Coordinator, Greenstone Support, South Asia Librarian & Head, CDDL, IIM Kozhikode OPEN SOURCE TECHNOLOGIES FOR LIBRARIES."— Presentation transcript:

1 Dr. M.G. Sreekumar UNESCO Coordinator, Greenstone Support, South Asia Librarian & Head, CDDL, IIM Kozhikode OPEN SOURCE TECHNOLOGIES FOR LIBRARIES

2 Agenda The Current Information Landscape The Current Information Landscape Open Source Overview Open Source Overview The OS Treasure Trove The OS Treasure Trove Categories of Open Source Software for Libraries Categories of Open Source Software for Libraries Open Source Digital Library Systems Open Source Digital Library Systems Greenstone Greenstone DSpace DSpace Open Source Suite from PKP, SFU Open Source Suite from PKP, SFU Open Source XML Tools and Systems Open Source XML Tools and Systems

3 Foreword Demand for improved information and knowledge management solutions - universities, enterprises and institutions Demand for improved information and knowledge management solutions - universities, enterprises and institutions Digital Libraries gaining increasing social attention, academic and research interest Digital Libraries gaining increasing social attention, academic and research interest Need for Integrated access to disparate information resources Need for Integrated access to disparate information resources Key challenge - how to create online information environments facilitating internal content publishing and single point access to internal/external information sources Key challenge - how to create online information environments facilitating internal content publishing and single point access to internal/external information sources Latest DL technologies Vs Traditional libraries and knowledge management Latest DL technologies Vs Traditional libraries and knowledge management Options before us – Proprietary Vs Open Standards / Open Source Software Options before us – Proprietary Vs Open Standards / Open Source Software Fortunately we have a plethora of Open Source Solutions available for Library applications Fortunately we have a plethora of Open Source Solutions available for Library applications

4 The Current Environment Fascinating times in the history of libraries, information systems and electronic publishing Fascinating times in the history of libraries, information systems and electronic publishing Possibilities of building large-scale services Possibilities of building large-scale services Collections are in digital formats and Collections are in digital formats and Retrieved over networks Retrieved over networks Materials are stored on computers Materials are stored on computers Network connects the computers to personal computers on the users' desks Network connects the computers to personal computers on the users' desks In a complete digital library, nothing need ever reach paper In a complete digital library, nothing need ever reach paper

5 Feel of the Hour

6

7 Need of the Hour

8 Future Libraries? What is a library and what should it be in 2012, 2020 and beyond… What does the academic library of the future look like? What does the academic library of the future look like? Where do its walls begin and end? Where do its walls begin and end? On campus? On campus? On our desktop? On our desktop? At home? At home? Does it still have a function as a separate and distinct space? Does it still have a function as a separate and distinct space? Or has it become the first step to an all-virtual future? Or has it become the first step to an all-virtual future? Libraries have never been more interesting, difficult and challenging… Libraries have never been more interesting, difficult and challenging…

9 Challenges of the Day Relevance of Libraries in the Google Era Retention of Users, especially the New Gen Proliferation of Content Diverse Datastreams - Content Categories, Publication Types Multimedia, Polymedia, Multiformats Collection Building – Acquisition, Subscriptions, Licensing… Copyright, Intellectual Property, Fair Use… Technology Complexities, Infrastructure Issues Publishers’ Stringent Policies / Monopolies Integration of legacy systems and the new genre

10 Information Strategy Tips Context = Scenarios, Paradigms Constant = Change Technology = Facilitate, and NO intimidate Information = The Big Picture - Landscape Content = Aggregate, Integrate Service = Markup, Market Capital = Human, Tacit, Values, and Users

11 Factors of Change Enterprise IKM Electronic Publishing Internet & Web Intranets Digital Libraries Knowledge Management E-Information: Usage and User Behavior Experiences? Lessons? Impact? Implications?

12 IM: Key Goals  Develop and manage a dynamic, unified information resource base (content repository) that gathers and organizes relevant internal and global information resources, based on a taxonomy of information needs of the enterprise, and make these available for learning and informed decision making.

13 IM : Key Goals…  Support different manifestations of information sources – implicit/explicit, print/digital, local/remote, free/commercial, etc.  Support for delivering personalized information services to staff, both on demand and in anticipation.

14 User Generated Internet Content: Blogs YouTube MySpace And the same is TRUE with Scholarly Communication too!!

15 Top Tech Trends in IT / LIS Web 2.0 / Library 2.0 Web 2.0 / Library 2.0 Blogs / RSS Feeds / Wikis / Podcasts / Webcasts Blogs / RSS Feeds / Wikis / Podcasts / Webcasts Open Source Software, Open Standards, Open URL Open Source Software, Open Standards, Open URL User Tagging, Automated Tagging User Tagging, Automated Tagging Web OPACs, and Interface Design Web OPACs, and Interface Design Seamless Integration / Aggregation Seamless Integration / Aggregation OA -> OAP + OAA OA -> OAP + OAA Open Resource Discovery Tools - Google Scholar Open Resource Discovery Tools - Google Scholar E-Books, E-Journals, E-Resources E-Books, E-Journals, E-Resources Harvesting, Federation, Metasearching Harvesting, Federation, Metasearching Digital Rights Management Digital Rights Management

16

17 Multimedia Library Info System Multimedia Library Info System Internet / Intranet Gateway-out Data capture USER @ anywhere (access to information from anywhere)

18 Penetration of E-Content in Libraries PUBLICATION TYPES E-Books, E-Journals… Aggregated Scholarly E- Journal Databases Databases, CBT/ WBT Portals, Vortals… Value added services Preprints, Eprints, E- Documents…. DOCUMENT FORMATS ASCII, RTF, HTML, SGML, Postscript, PDF, Proprietary, Native Application Formats Images, Graphics Audio Video XHTML, ASP, PHP, XML...

19 Internally Generated Internally Generated Externally Sourced Externally Sourced Lib 2.0 & Patron 2.0 Lib 2.0 & Patron 2.0 Open Access Information Landscape [print/digital] Information Landscape [print/digital] Processes Procedures Data/Info. Manuals Reports… Processes Procedures Data/Info. Manuals Reports… Books, eBooks, Journals, eJournals, Databases, Patents, Reports, Online Resources… Books, eBooks, Journals, eJournals, Databases, Patents, Reports, Online Resources… Social Computing & Social Software Social Computing & Social Software OA Journals, OA Archives, Scholarly Articles, ePrint Archives, ETDs, eCoursewares OA Journals, OA Archives, Scholarly Articles, ePrint Archives, ETDs, eCoursewares

20 Shift in Approaches Traditional Automated Dig. Library AACR2 ISO 2709 CCF MARC Thesauri AACR2 CCC CC / LCCS DDC / UDC Thesauri/LCSH Metadata DCMI -- W3C EAD, TEI, DTD METS,MODS, Z39.50 MARC21 OAI-PMH Limited/ Rigid Efficient/ Flexible Improved

21 What Distinguishes a DL?  Site Neutrality (3 in 1 Access-Anytime,  Anywhere by Anyone Access)  Open Access  Greater variety and granularity of information  Sharing of information ‘Sharium’  Up-to-date ness  Always available (365*7*24)  New forms of rendering (New Genre)

22 Digital Libraries: An Overview Digital Libraries ComputingNetworkingContentCollectionsServicesCommunity

23 What is open source software? In the phrase open source, source refers to source code, the human-readable computer code which is the origin, or source, of the computer application. In the phrase open source, source refers to source code, the human-readable computer code which is the origin, or source, of the computer application. Open refers to the terms of access to that computer source code. Open refers to the terms of access to that computer source code. So open source software is software for which the source code is freely available. So open source software is software for which the source code is freely available.

24

25

26 Advantages and Disadvantages Advantages Mostly issued under an internationally accepted License Mostly issued under an internationally accepted License Access to source code and ability and right to modify it Access to source code and ability and right to modify it Right to redistribute modifications to benefit wider community Right to redistribute modifications to benefit wider community Free Free Excellent support networks Excellent support networks Large and enthusiastic user base Large and enthusiastic user baseDisadvantages Limited or no accountability Limited or no accountability Informal and unaccountable support channels Informal and unaccountable support channels

27 Different Open Source Licenses GNU GPL ("General Public License") GNU GPL ("General Public License") GNU Lesser GPL GNU Lesser GPL BSD License BSD License Mozilla Public License Mozilla Public License IU Open Source License IU Open Source License And more... And more... And more... And more...

28

29

30

31

32 A Good Starting Point oss4lib: Open Source Systems for Libraries oss4lib: Open Source Systems for Libraries www.oss4lib.org www.oss4lib.org www.oss4lib.org

33

34

35

36

37

38 Open Source Resources Open Source Initiative Open Source Initiative Open Source Initiative Open Source Initiative GNU GNU GNU SourceForge SourceForge SourceForge

39 Categories of Open Source Software Operating Systems Operating Systems Linux, Free / Open BSD, Open Solaris… Linux, Free / Open BSD, Open Solaris… Programming Languages Programming Languages Perl, PHP, Python Perl, PHP, Python Applications Applications Apache, Tomcat, emacs, grep, MySQL, sendmail, ssh Apache, Tomcat, emacs, grep, MySQL, sendmail, ssh

40 Open Source Software for DLs Linux, Apache, Tomcat, PHP, Perl, DLXS, ImageMagick, Unreal Media Server, Greenstone, DSpace, ePrints, FEDORA, CDSWare, MySQL, Darwin Streaming Server, emacs, CVS, Webalizer, LibXML, LibXSLT, Saxon, and more! Linux, Apache, Tomcat, PHP, Perl, DLXS, ImageMagick, Unreal Media Server, Greenstone, DSpace, ePrints, FEDORA, CDSWare, MySQL, Darwin Streaming Server, emacs, CVS, Webalizer, LibXML, LibXSLT, Saxon, and more!

41 Some categories of open source library software Library-oriented search engines Library-oriented search engines Cheshire, Pears, dbWiz… Cheshire, Pears, dbWiz… Z39.50 toolkits Z39.50 toolkits ZetaPerl (Perl), JAFER (Java), YAZ (C/C++), Mercury Z39.50 Client… ZetaPerl (Perl), JAFER (Java), YAZ (C/C++), Mercury Z39.50 Client… MARC parsers MARC parsers MARC.pm (Perl), MARC4J (Java) MARC.pm (Perl), MARC4J (Java) MarcEdit http://oregonstate.edu/~reeset/marcedit/html/inde x.php MarcEdit http://oregonstate.edu/~reeset/marcedit/html/inde x.php Image processing Image processing ImageMagick, tiffinfo/tiffdump ImageMagick, tiffinfo/tiffdump

42 Some categories of Open Source Library software / Open Standards Portals Portals MyLibrary MyLibrary OAI service providers and data providers OAI service providers and data providers PHP OAI Data Provider PHP OAI Data Provider Lots! See www.openarchives.org Lots! See www.openarchives.orgwww.openarchives.org METS tools METS tools Page turners, toolkits, more: see www.loc.gov/mets/ Page turners, toolkits, more: see www.loc.gov/mets/www.loc.gov/mets/ Dublin Core Dublin Core http://dublincore.org http://dublincore.org

43 Web Server Apache Apache Lots in Java! see at… Lots in Java! see at… http://java-source.net/open-source/web-servers http://java-source.net/open-source/web-servers

44 Database Management Systems (DBMS) MySQL MySQL PostgreSQL PostgreSQL mSQL mSQL CDS/ISIS, Win/ISIS, GenISIS etc. CDS/ISIS, Win/ISIS, GenISIS etc.

45 Web Server-Side Scripting PHP PHP Architecture Architecture Linux, Apache, MySQL, PHP (LAMP) Linux, Apache, MySQL, PHP (LAMP) Windows, Apache, MySQL, PHP (WAMP) Windows, Apache, MySQL, PHP (WAMP)

46 Web Services Apache Tomcat Web Container/Service Apache Tomcat Web Container/Service Apache Cocoon Content Framework/Service Apache Cocoon Content Framework/Service Apache Ant Build Tool Apache Ant Build Tool

47 Integrated Library Management System (ILMS) Managing legacy systems Managing legacy systems KOHA KOHA http://www.koha.org/ http://www.koha.org/ Evergreen Evergreen http://wiki.code4lib.org/index.php/Evergreen http://wiki.code4lib.org/index.php/Evergreen Emilda Emilda http://wiki.code4lib.org/index.php/Emilda http://wiki.code4lib.org/index.php/Emilda OpenBiblio OpenBiblio http://wiki.code4lib.org/index.php/OpenBiblio http://wiki.code4lib.org/index.php/OpenBiblio phpMyLibrary phpMyLibrary http://wiki.code4lib.org/index.php/PhpMyLibrary http://wiki.code4lib.org/index.php/PhpMyLibrary NewGenLib NewGenLib http://www.verussolutions.biz/ http://www.verussolutions.biz/

48 Server Log Analysis Webalizer Webalizer http://www.webalizer.org/ http://www.webalizer.org/

49 Z39.50 Protocol for online/remote Search & Retrieval ( http://www.loc.gov/z3950/ ) Interoperability standard (ANSI/NISO Standard) and a software which facilitates cross- database/archives search Interoperability standard (ANSI/NISO Standard) and a software which facilitates cross- database/archives search A client-server protocol for searching and retrieving information from remote computer databases A client-server protocol for searching and retrieving information from remote computer databases YAZ Z39.50 Client - http://indexdata.com/yaz/ YAZ Z39.50 Client - http://indexdata.com/yaz/ 'Mercury' Z39.50 Client - http://www.basedowinfosys.com/projects/mzc 'Mercury' Z39.50 Client - http://www.basedowinfosys.com/projects/mzc

50

51 Serials Manager CUFTS CUFTS http://researcher.sfu.ca/cufts http://researcher.sfu.ca/cufts

52 Citation Manager Citation Manager (from PKP, Simon Fraser University, Canada ) Citation Manager (from PKP, Simon Fraser University, Canada ) Bibliographic Management (http://researcher.sfu.ca/cm) Bibliographic Management (http://researcher.sfu.ca/cm)

53 Link Resolving GODOT - Electronic (Online) Resources Management GODOT - Electronic (Online) Resources Management http://researcher.sfu.ca/godot http://researcher.sfu.ca/godot

54 Open Journal Publishing OJS OJS http://pkp.sfu.ca/ojs http://pkp.sfu.ca/ojs

55 Open Conference Systems OCS : Conference workflow automation OCS : Conference workflow automation http://pkp.sfu.ca/ocs http://pkp.sfu.ca/ocs

56 Open URL Systems Open URL 1.0 Open URL 1.0 http://www.oclc.org/research/ software/openurl/default.htm http://www.oclc.org/research/ software/openurl/default.htm

57 Open Digital Libraries Greenstone Greenstone DSpace DSpace Eprints Eprints FEDORA etc. FEDORA etc.

58 Open Access Archives / IRs DSpace DSpace Eprints Eprints FEDORA FEDORA CDSWare CDSWare Greenstone etc. Greenstone etc.

59 Learning Management Systems (LMS) E-learning Systems E-learning Systems Moodle Moodle Manhattan etc. Manhattan etc.

60 Content Management Systems (CMS) Joomala Joomala Drupal Drupal MediaWiki MediaWiki

61 Open Archives Harvester Harvester Harvester http://pkp.sfu.ca/harvester http://pkp.sfu.ca/harvester

62

63 Federated Searching dbWiz : PKP Project dbWiz : PKP Project http://researcher.sfu.ca/dbwiz http://researcher.sfu.ca/dbwiz Google Custom Search Google Custom Search http://www.google.com/coop/cse/ http://www.google.com/coop/cse/

64

65

66 Social Computing/Software Blogs Blogs Tags Tags Wikis Wikis RSS RSS Feed Aggregation etc. Feed Aggregation etc.

67 Open Courses WWW.W3SCHOOLS.COM WWW.W3SCHOOLS.COM Open Courseware http://ocw.mit.edu/ Open Courseware http://ocw.mit.edu/

68

69

70 What are digital libraries for?  Knowledge/content management  Manage and access internal information assets  Scholarly communication, education, research  E-journals, e-prints, e-books, data sets, e-learning  Access to cultural collections  Cultural, heritage, historical & special collections, museums, biodiversity  E-governance  Improved access to government policies, plans, procedures, rules and regulations  Archiving and preservation  Many more …

71 DL Software: Alternatives  What are your expectations?  Develop local web-based application?  Commercial DL solution?  Adopt open source software?  Greenstone  Eprints  DSpace  Fedora…

72 Digital Library Technologies  Interoperability  Unified interface for heterogeneous libraries  Metadata mapping across different libraries  OAI-compliant data and service providers  Multilingual digital libraries  Scalable digital library architectures  Publication tools  Searching tools

73 DLs: Workflows and Processes  Content selection  Content acquisition  Content publishing  Metadata preparation  Content loading  Content indexing & storage  Content access & delivery  Preservation  Access management  Usage monitoring and evaluation  Networking and interoperation  Maintenance

74 DL Software: Key requirements Document types (book, journal article, lecture …) Document formats (text, PDF, Word, PS, …) Content acquisition (online and offline) –Metadata description, content tagging –Content uploading Indexing and retrieval –Structured/ full text indexing –Automatic metadata extraction Storage –Data compression –Efficient storage for metadata –Efficient location of metadata and documents Access and delivery –Structured search, browse, hierarchical browsing –CD-ROM distribution

75 DL Software: More requirements Scaling up – for large collections Multilingual support Access management and security Usage monitoring and reporting Standards compliance –XML, Dublin Core, Unicode Interoperation –OAI, Z39.50 compliance, MARC21…

76

77 Complete DL Systems Greenstone Greenstone DSpace DSpace Eprints Eprints

78 Greenstone: Open source Software for Building Digital Library Collections

79

80 What is the Greenstone software?  Software suite for building, maintaining, and distributing digital library collections  Comprehensive, open-source  Developed by New Zealand Digital Library Project at the University of Waikato  Distribution and promotion partners:  UNESCO  Human Info NGO, Belgium  NCSI, Bangalore; UCT, Cape Town; Dakar, Senegal; Almaty, Kazakhstan; …  You!

81 Greenstone Features Supports creation and management of collections by administrator(s) Supports creation and management of collections by administrator(s) Web interface for search and retrieval Web interface for search and retrieval Customizable metadata Customizable metadata Supports full text search of content Supports full text search of content Extensive document filters Extensive document filters Word, Excel, PowerPoint, PDF,... Word, Excel, PowerPoint, PDF,... Can extract metadata from documents Can extract metadata from documents Many ways to build a collection, including: Many ways to build a collection, including: Local files Local files Retrieve web sites Retrieve web sites Retrieve objects via OAI-PMH Retrieve objects via OAI-PMH

82 Greenstone Features… Open Source Philosophy Interfacing & Content Delivery via Web Multi S/W Platform Multi Lingual Support Multi Formats Structured Metadata in XML using DC Metadata Extraction Searching & Browsing Plug-ins for Documents Full-text mirroring Text Level Penetration Data Compression Password protection Administrative Functions Concurrent & Dynamic Content Development Uniform Presentation Publishing on CDROMs International Presence

83 Greenstone Features contd... Easy Installation Easy Maintenance Content Development (3 alternate ways) Predominantly GLI now - since (V. 2.41) Hierarchy Structure Interface Customization –Front Page Design, Header for the Digital Library, Collection Icon, Cover Images Collection Configuration (Collect.cfg) File Scalability, Flexibility Interoperability (Crosswalk), OAI Compliance Lifeline : Listserv / E-Group / Archives

84  Ghostscript  Kea  pdftohtml  rtftohtml  TextCat  wvWare  Xlhtml  XML::Parser Interpreter for Adobe Postscript documents (Postscript plugin) Keyphrase extraction program (to generate metadata) Converter for PDF documents (PDF plugin) Converter for RTF documents (RTF plugin) Detects languages and document encodings Converter for Word documents (Word plugin) Converter for Excel/Powerpoint documents (plugins) Parses XML documents, used to read and write Greenstone’s internal XML document format The power of open source: Greenstone uses …

85  MG  GDBM  wget  YAZ  Stemmer  GCC  CVS  Perl  Apache  OAI-PMH Creates compressed full-text indexes and performs searches Database used for metadata etc Downloading pages from the Web when creating collections Client and server implementation of Z39.50 English language stemmer C/C++ compiler Version control system Used for plugins etc Web server used by many Greenstone installations OAI Performance and …

86 Example Greenstone collections Rapid growth in use International – Many Countries…China, Germany, India, UK, USA, Russia, Malaysia, Singapore... – Almost all countries/Continents Increasing activity on Greenstone mailing list Promotion by UNESCO – “deployment of DL’s for sharing public domain information” Wide variety of DL collections have been developed in several languages –historical, educational, cultural, and research

87 Greenstone Technology Runs on Windows (back to 3.1), Linux, Mac OS X, Unix Runs on Windows (back to 3.1), Linux, Mac OS X, Unix Written in C++, Perl, and Java Written in C++, Perl, and Java Uses MG/MG++ search engine Uses MG/MG++ search engine Several different Web and Java/Swing user interfaces for various functions Several different Web and Java/Swing user interfaces for various functions Web interface for user access Web interface for user access

88 Greenstone Demonstration Examples at www.greenstone.org Examples at www.greenstone.orgwww.greenstone.org

89 DSpace

90 DSpace “DSpace is a groundbreaking digital institutional repository that captures, stores, indexes, preserves, and redistributes the intellectual output of a university’s research faculty in digital formats.” “DSpace is a groundbreaking digital institutional repository that captures, stores, indexes, preserves, and redistributes the intellectual output of a university’s research faculty in digital formats.” Developed jointly by MIT Libraries and Hewlett- Packard Developed jointly by MIT Libraries and Hewlett- Packard Licensed under BSD distribution license Licensed under BSD distribution license www.dspace.org www.dspace.org www.dspace.org

91 DSpace Supports submission of, management of, and access to digital content Supports submission of, management of, and access to digital content Formats: text, images, audio, video Formats: text, images, audio, video Organized based on organizational needs of a large university Organized based on organizational needs of a large university Communities and collections Communities and collections

92

93 DSpace Data Model

94 DSpace Features Digital preservation Digital preservation Persistent IDs, support levels for different file formats Persistent IDs, support levels for different file formats Access control Access control Versioning Versioning Search and retrieval Search and retrieval Based on qualified Dublin Core metadata Based on qualified Dublin Core metadata OAI-PMH data provider OAI-PMH data provider To support metadata harvesters To support metadata harvesters

95 DSpace Technology OS: Unix or Linux OS: Unix or Linux Written in Java Written in Java PostgreSQL relational database PostgreSQL relational database Provides complete Web user interface, but Java APIs available Provides complete Web user interface, but Java APIs available

96 DSpace Architecture

97 DSpace Software / Utilities 1. Java SDK 1.4.2 2. Apache Maven 2.0.54 3. Tomcat 5.0.28 4. Apache Ant 1.6.5 5. PostgreSQL 8.0.2 6. DSpace 1.5x / 2.x

98 DSpace Demonstration MIT DSpace MIT DSpace dspace.mit.edu dspace.mit.edu dspace.mit.edu

99 EPrints “Open Source software which creates online archives” “Open Source software which creates online archives” Developed by University of Southampton, UK Developed by University of Southampton, UK Supports self-archiving of e-prints Supports self-archiving of e-prints Can be configured as institutional repository or otherwise, e.g. repository focused on particular research area or discipline Can be configured as institutional repository or otherwise, e.g. repository focused on particular research area or discipline Licensed under GNU General Public License Licensed under GNU General Public License software.eprints.org software.eprints.org software.eprints.org

100 EPrints Supports submission, management of, and access to digital content Supports submission, management of, and access to digital content Can support multiple archives on one server Can support multiple archives on one server Moderated or unmoderated archives Moderated or unmoderated archives Search and retrieval Search and retrieval Based on metadata Based on metadata Metadata can be customized for different archives and document types Metadata can be customized for different archives and document types No access control No access control OAI-PMH data provider OAI-PMH data provider

101 EPrints Technology OS: Unix or Linux OS: Unix or Linux Written in Perl Written in Perl Requirements: Requirements: Apache web server Apache web server MySQL relational database MySQL relational database

102 EPrints Demonstration Digital Library of the Commons Digital Library of the Commons dlc.dlib.indiana.edu dlc.dlib.indiana.edu dlc.dlib.indiana.edu

103 Open Source XML Tools and Systems Utilities Utilities Xalan, Xerces, libxml, libxslt, saxon Xalan, Xerces, libxml, libxslt, saxon Editors Editors emacs / nxml-mode emacs / nxml-mode Database / Search Engines Database / Search Engines Apache Xindice Apache Xindice Berkeley DB XML Berkeley DB XML eXist eXist Publishing/WebApplication Frameworks Publishing/WebApplication Frameworks AxKit AxKit Cocoon Cocoon

104 XML Databases & Search Engines Apache Xindice Apache Xindice Apache Xindice Apache Xindice Berkeley DB XML Berkeley DB XML Berkeley DB XML Berkeley DB XML eXist eXist eXist

105 Greenstone Windows Installation Version 2.81rc

106 Opening Greenstone on Browser Digital Library Server Greenstone Digital Library

107 Opening Greenstone on Browser Greenstone Digital Library Collections

108 GLI

109 GLI Functions Establish new collection (or work on old) Select files to include in collection (Gather) Enrich files with metadata (Enrich) Select Plugins, Indexes, Classifiers (Design) Build Collection (Create) Format and Control Display (Format) Customize Appearance Preview Collection

110 Collection Building… Greenstone used to have three modes of collection building, viz., Command Line, Web Interface and the GLI (Greenstone Librarian Interface) Greenstone used to have three modes of collection building, viz., Command Line, Web Interface and the GLI (Greenstone Librarian Interface) Progressing with version 2.4x., the GLI got strengthened as well as popularized Progressing with version 2.4x., the GLI got strengthened as well as popularized Web Interface mode has been withdrawn temporarily. Web Interface mode has been withdrawn temporarily. The GLI based collection building is quite easy and simple a method. The GLI based collection building is quite easy and simple a method. Collection developers can activate the GLI software and use the ‘Gather’, ‘Enrich’, ‘Design’, ‘Format’ and ‘Create’ panel for making collection Collection developers can activate the GLI software and use the ‘Gather’, ‘Enrich’, ‘Design’, ‘Format’ and ‘Create’ panel for making collection

111 Customization  Greenstone is specifically designed to be highly extensible and customizable.  New document and metadata formats are accommodated by writing "plugins" (in Perl).  Analogously, new metadata browsing structures can be implemented by writing "classifiers."  The user interface look-and-feel can be altered using "macros" written in a simple macro language.  A Corba protocol allows agents (e.g. in Java) to use all the facilities associated with document collections.  Finally, the source code, in C++ and Perl, is available and accessible for modification

112 Customizing with macros Let you customize presentation Let you customize presentation Present pages in different languages Present pages in different languages Print variables into the page text (e.g. number of search hits) Print variables into the page text (e.g. number of search hits) Macro files Macro files stored in greenstone2/macros folder stored in greenstone2/macros folder each file defines one or more “packages” each file defines one or more “packages” (A “package” is a group of macros) loaded on startup (note difference between Local and Web Library) loaded on startup (note difference between Local and Web Library) listed in etc/main.cfg listed in etc/main.cfg Collection-specific macros Collection-specific macros Stored in greenstone2/collect/mycol/macros/extra.dm Stored in greenstone2/collect/mycol/macros/extra.dm Or include argument [c=collectionname] for each macro Or include argument [c=collectionname] for each macro

113 Hierarchy Structure

114 Collection configuration Collection configuration file determines content conversion, extraction and building of indexes and browsing structures –indexes, classifiers, plugins Presentation of search/browse results and collection interface is determined by “format” strings and “macros”

115

116 DL - Hardships Copyright Issues Copyright Issues Technology Complexities Technology Complexities Infrastructure Issues Infrastructure Issues Publications/Formats – Diverse Datastreams Publications/Formats – Diverse Datastreams Digital Objects/Formats - Multiple Digital Objects/Formats - Multiple Publishers’ Policies – Stringent, Inconsistent Publishers’ Policies – Stringent, Inconsistent

117 Major Tasks Content identification (internal / external) Content identification (internal / external) Content Creation Content Creation Content Collation/Signposts Content Collation/Signposts Organisation Organisation Updation Updation Retrieval / Dissemination Retrieval / Dissemination User Training User Training Archiving Archiving

118 Data/ Objects METS/MODS EAD TEI DCMI OS Z39.50 /OAI-PMH Network DL Software DIGITAL LIBRARY ARCHITECTURE

119 http://greenstonesupport.iimk.ac.in

120 Acknowledgement Team Greenstone, New Zealand Team Greenstone, New Zealand Greenstone Support South Asia Greenstone Support South Asia IIM Kozhikode, India IIM Kozhikode, India UNESCO UNESCO Indiana University Digital Library Program Indiana University Digital Library Program

121


Download ppt "Dr. M.G. Sreekumar UNESCO Coordinator, Greenstone Support, South Asia Librarian & Head, CDDL, IIM Kozhikode OPEN SOURCE TECHNOLOGIES FOR LIBRARIES."

Similar presentations


Ads by Google