Presentation is loading. Please wait.

Presentation is loading. Please wait.

Digital Libraries: Study into the features of the DSpace Suite Devika P. Madalli Documentation Research and Training Centre Indian Statistical Institute.

Similar presentations


Presentation on theme: "Digital Libraries: Study into the features of the DSpace Suite Devika P. Madalli Documentation Research and Training Centre Indian Statistical Institute."— Presentation transcript:

1 Digital Libraries: Study into the features of the DSpace Suite Devika P. Madalli Documentation Research and Training Centre Indian Statistical Institute Bangalore 560059

2 2 Introduction Digital libraries encompass a whole range of information services related work such as –Organization of digital information –Information retrieval –User interface –Archiving and preservation –Services and social issues –Evaluation and applications to particular areas

3 3 Desirable Features of DL Software Structures Accessible Searchable Extensible Massive Heterogeneous Persistent

4 4 DL’s operation should be examined under… Architectural design – Modular and Open Backend Database – scalable, robust, data formats Network capabilities – web-based and seamless operations, persistent Ids, security and authentication Metadata and Interoperability – compatible with world standards such as Dublin Core and OAI-PMH

5 5 Technical Issues Open source software Vs Commercial OS Hardware and peripheral requirements Network Components Standards – data formats, metadata, network, access, interoperability, encoding

6 6 Approaches to Building DL Digitization – retro-conversion of non-digital resources to digital Digitally born resources – involves inter- conversion to standard formats and storage

7 7 Why DSpace Digital Library An open source technology platform which can be customized and its capabilities can be extended A service model for open access and/or digital archiving for perpetual access A platform to build an Institutional Repository and the collections are searchable and retrievable by/on the Web To make available institution-based scholarly material in digital formats. The collection will be open and interoperable. DSpace is

8 8 Architecture and System Requirement The DSpace system is organized into three layers –The Storage Layer: responsible for physical storage of metadata and content –The Business Layer: deals with managing the content of the archive, users of the archive (e-people), authorization, and workflow –The Application Layer: containing components that communicate with the networked world outside of the individual DSpace installation, for example the Web user interface and the modules for metadata harvesting service

9 Features of a near ideal DL Low cost, including all hardware and software components Technically simple to install and manage Robust Scalable Open and inter-operable Modular User Friendly Multi-user (including both searching and maintenance) Multimedia digital object enabled Platform independent (including both client and server components) interoperable

10 DSpace is a joint project of MIT Libraries and Hewlett-Packard Labs

11 What is DSpace? Digital Object management system Create, search and retrieve digital objects Facilitate preservation of digital objects An open source software Allows open access and digital archiving Allows building Institutional Repositories

12 H/W and S/W requirements UNIX recommended (Java-based program should run on anything)‏ Open source, built on Apache web server and Tomcat Servlet engine Uses postgreSQL or Oracle relational database

13 What DSpace can do? Captures –Digital content in any formats directly from creators (e.g. researcher, authors)‏ Describes –Descriptive, technical, rights metadata –Persistent identifiers OAI-PMH version 2.0 compliant –Allow metadata creation

14 Possible types of Content Preprints, articles Postprints Technical Reports Conference Papers Theses/Dissertations Datasets –e.g. statistical, geospatial, scientific

15 Images –visual, scientific, etc. Audio files Video files Digitized library collections Formats of Content

16 File Formats Supported: Repository administrator can inform the submitters which file formats will be supported in the future by his organization Known: recognizes the format, but cannot guarantee full support Unsupported: cannot recognize a format; these will be listed as "application/octet- stream", -- Unknown

17 Information Model Communities –Departments, Labs, Research Centers, Schools… Collections Items Files (bitstreams)‏ –Multiple formats - same content –Complex objects – multiple files

18 Intellectual Property Click-through license during submission Grants DSpace non-exclusive right to acquire, manage, preserve, distribute the item Does not grant DSpace copyright Copy of license stored with item

19 Goodies Modular architecture, well-defined APIs 100% open source –Programmed in java –RDBMS and SQL for metadata CNRI “handles” for persistent identifiers OpenURL linking OAI-PMH for exposing metadata

20 Backend Technology Apache, Tomcat, OpenSSL/mod_ssl Java PostgreSQL/Oracle CNRI Handle System 5 (persistent ids)‏ Lucene Search Engine

21 Standards Dublin Core only –Descriptive metadata only OAI-PMH v 2.0 (Open Archive’s Initiative Protocol for metadata harvesting)‏ UNICODE Compliant

22 Capabilities Exports in XML format Supports crosswalks through OAI-PMH –DC (Dublin Core)‏ –Qualified DC –METS (Metadata Encoding and Transmission Standard –MODS (Metadata Object Description Schema – sibling of MARCXML)‏ Can be extended to any Metadata Schema

23 Customization Screens (Manakin)‏ E-mails Any language interface Metadata Input-forms Display of results Fields to be Indexed Access restrictions License (in addition to Creative Commons)‏

24 Advanced Feature Grid Compliant (Storage)‏ LDAP authentication Usage statistics generation SFX Server integration RSS (Really Simple Syndication)‏ Item Recommendation to a friend Use of Thesaurus (though not OWL/SKOS/RDF)‏ Full-text indexing of PDF, MS-WORD files

25

26

27

28

29

30

31 Important Sites http://www.dspace.org http://www.sourceforge.net/projects/dspace http://wiki.dspace.org http://mailman.mit.edu/mailman/listinfo/dspace- general http://lists.sourceforge.net/lists/listinfo/dspace-tech http://lists.sourceforge.net/lists/listinfo/dspace- devel

32 DRTC Sites https://drtc.isibang.ac.in (Librarians' Digital Library)‏ http://drtc.isibang.ac.in/dlrg (Discussion Forum)‏ http://drtc.isibang.ac.in/sdl (Harvester in LIS)‏ http://drtc.isibang.ac.in http://drtc.isibang.ac.in/blog

33 Questions?

34 Thank You devika@drtc.isibang.ac.in


Download ppt "Digital Libraries: Study into the features of the DSpace Suite Devika P. Madalli Documentation Research and Training Centre Indian Statistical Institute."

Similar presentations


Ads by Google