Presentation is loading. Please wait.

Presentation is loading. Please wait.

Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.

Similar presentations


Presentation on theme: "Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software."— Presentation transcript:

1 Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA maly@cs.odu.edu Digital Libraries, OAI and Free Software for Education and Science 5 th National Conference Computer Application Federation of China Instrument & Control Society Yinchuan, Ningxia Province,PRC September 22-24, 2003

2 Sept 24, 20035th National CACIS Conference2 Outline Digital Libraries The Open Archives Initiative Free Software Systems Arc DP9 Kepler RVOT Conclusions Important URLs

3 Sept 24, 20035th National CACIS Conference3 Digital Libraries DL = library whose content is stored digitally and can be accessed over the Internet Key difference between DLs and the general Web is that the content is structured and has metadata associated with it allowing for more precise results to queries

4 Sept 24, 20035th National CACIS Conference4 Digital Libraries Development of software to support DLs has proceeded along proprietary software lines It is extremely difficult for the average user to find information that is in different DLs Need for interoperability between DLs

5 Sept 24, 20035th National CACIS Conference5 Digital Libraries DL interoperability can be achieved at three levels technical:protocol, format, etc. should be consistent so that messages can be exchanged content: agreements cover the data and metadata, agreements on the interpretation of messages organizational: includes rules for access, for changing collections and services, payment, and authentication Need to federate, filter and provide value- added services on remote content

6 Sept 24, 20035th National CACIS Conference6 Open Archives Initiative address technical interoperability among distributed archives facilitate the discovery of content in distributed archives The OAI framework defines two functional roles: data providers (archives) and service providers

7 Sept 24, 20035th National CACIS Conference7 Open Archives Initiative Data providers: expose the metadata of their objects for harvesting Service providers: extract metadata from data providers via the OAI metadata harvesting protocol Service provider develop value-added services that are based on the metadata collected from data providers such as: cross-archive search engines, linking systems, and peer-review systems

8 Sept 24, 20035th National CACIS Conference8 herbert van de sompel The Open Archives Iinitiative has been set up to create a forum to discuss and solve matters of interoperability between preprint solutions, as a way to promote their global acceptance. Paul Ginsparg, Rick Luce & Herbert Van de Sompel OAI origin herbert van de sompel

9 Sept 24, 20035th National CACIS Conference9 Core concepts of Santa Fe convention herbert van de sompel low-barrier interoperability data-provider & service-provider model metadata harvesting model shared metadata format and parallel, community- specific metadata formats acceptable use Dienst subset OAMS XML reply HTTP based Gentelmen’s agreement

10 Sept 24, 20035th National CACIS Conference10 core concepts in OAI 1.0 herbert van de sompel low-barrier interoperability data-provider & service-provider model metadata harvesting model shared metadata format and parallel, community- specific metadata formats acceptable use flexibility OAI 1.0 protocol Dublin Core HTTP based Community specific Reply XML Schema Self contained

11 Sept 24, 20035th National CACIS Conference11 The Open Archives Initiative develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content. new OAI mission statement herbert van de sompel

12 Sept 24, 20035th National CACIS Conference12 The Open Archives Initiative has its roots in an effort to enhance access to e-print archives as a means of increasing the availability of scholarly communication. Continued support of this work remains a cornerstone of the Open Archives program. new OAI mission statement herbert van de sompel

13 Sept 24, 20035th National CACIS Conference13 The fundamental technological framework and standards that are developing to support this work are, however, independent of the both the type of content offered and the economic mechanisms surrounding that content, and promise to have much broader relevance in opening up access to a range of digital materials. [...] new OAI mission statement herbert van de sompel

14 Sept 24, 20035th National CACIS Conference14 Free software - Arc Arc harvests metadata currently from about 150 OAI compliant archives normalizes them, and stores them in a search service based on a relational database (MySQL or Oracle) over 6 Million metadata records from various subject domains Arc also provides OAI layer, thus making hierarchical harvesting possible

15 Sept 24, 20035th National CACIS Conference15

16 Sept 24, 20035th National CACIS Conference16

17 Sept 24, 20035th National CACIS Conference17 Free Software – DP9 “deep web" or "invisible web" a vast repository of content, such as documents in online databases, that general-purpose web crawlers cannot reach 500 times that of the surface web Internet search engines can not index OAI collections, as they are not aware of the OAI protocol

18 Sept 24, 20035th National CACIS Conference18 Free Software – DP9 A Web crawler indexes a Web site by starting with a base HTML page and following the links on this page to go deeper to retrieve other pages on the Web site DP9 computes and presents an HTML page presented to a Web crawler as a result of an OAI request, and the links on the Web page leads to other OAI requests

19 Sept 24, 20035th National CACIS Conference19 Free Software – DP9 DP9 provides an entry page and if a web crawler finds this entry page, it may follow the links on this page and send requests to DP9. DP9 will then forward the request to corresponding OAI Data Providers and process the returned XML records Depending on the depth a crawler follows, it can index all records in an OAI Data Provider

20 Sept 24, 20035th National CACIS Conference20 Free Software – DP9

21 Sept 24, 20035th National CACIS Conference21

22 Sept 24, 20035th National CACIS Conference22 Free Software - Kepler The objective of the Kepler framework is to satisfy the need for the average researchers at an average university to publish results and disseminate them to a wide audience quickly and conveniently The Kepler framework is based on OAI to support what is called "personal data providers" or "archivelets"

23 Sept 24, 20035th National CACIS Conference23 Free Software - Kepler Kepler framework - a digital library of many ‘little’ publishers. an easy-to-use archivelet that is downloadable and self-installing an automated registration service to support tens of thousands of publishers a simple service provider to harvest metadata from archivelets.

24 Sept 24, 20035th National CACIS Conference24

25 Sept 24, 20035th National CACIS Conference25

26 Sept 24, 20035th National CACIS Conference26

27 Sept 24, 20035th National CACIS Conference27 Free Software - RVOT Rapid Visual OAI Tool (RVOT) is a tool that can help small organizations in making their collections OAI-PMH compliant construct an OAI-PMH repository from a collection of files metadata translation tool records in the original collection can be in any of the supported formats including RFC1807, MARC subset, and COSATI formats lightweight HTTP server including an OAI-PMH request handler

28 Sept 24, 20035th National CACIS Conference28 Free Software - RVOT

29 Sept 24, 20035th National CACIS Conference29 Free Software – RVOT

30 Sept 24, 20035th National CACIS Conference30

31 Sept 24, 20035th National CACIS Conference31

32 Sept 24, 20035th National CACIS Conference32

33 Sept 24, 20035th National CACIS Conference33 Conclusions OAI makes the many digital libraries available today interoperate in such a way that users can discover information across a wide variety of domains without having to be aware of the many different user interfaces of the individual libraries OAI was founded by researchers who were interested not only in free distribution of information but also in free distribution of software

34 Sept 24, 20035th National CACIS Conference34 Conclusions All the software systems described in this paper are freely available either in OpenSource or directly from the research group that created it one caveat: free software does not necessarily mean no cost running of services. One still has to account for the need for technical support and hardware to set up services

35 Sept 24, 20035th National CACIS Conference35 Important URLs http://dlib.cs.odu.edu - ODU digital library research group http://dlib.cs.odu.edu http://www.openarchives.org http://arc.cs.odu.edu http://sourceforge.net/projects/oaiarc/ http://dlib.cs.odu.edu/dp9 http://kepler.cs.odu.edu


Download ppt "Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software."

Similar presentations


Ads by Google