Presentation is loading. Please wait.

Presentation is loading. Please wait.

Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.

Similar presentations


Presentation on theme: "Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment."— Presentation transcript:

1 Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment Muriel Foulonneau (mfoulonn@uiuc.edu)mfoulonn@uiuc.edu Grainger Engineering Library University of Illinois at Urbana-Champaign UIUC June 2006

2 2 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Outlines Improving resource discoverability Hidden Web, portals and distributed digital libraries Interoperability Metadata and protocols The Open Archives Protocol for Metadata Harvesting The protocol, examples of services and repositories Issues for digital libraries of distributed objects

3 3 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Improving resource discoverability

4 4 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Sharing content New services, new representations of the content, new audiences Bring your content to attention of new users outside your immediate community 37% of visits to images of the State Library of New South Wales came from the PictureAustralia portal in 2002/3

5 5 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Integrated Access to CIC Metadata http://cicharvest.grainger.uiuc.edu/

6 6 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Thematic access to resources

7 7 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Russian Publics collection at UIUC

8 8 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC On the CIC metadata portal

9 9 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Search on Google

10 10 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Multiple services use different features Full text Metadata Collection descript. Metadata AND resources Metadata Metadata AND resources

11 11 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Interoperability

12 12 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Content and services Building services => New services need content with similar features Collection service

13 13 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC What is interoperability Interoperability is the capacity for different systems to talk to each other I need A standard language An interpreter 01-04-04 -“01-04-04” - this is a month - 01=“Jan”

14 14 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Various types of interoperability Technical Protocols, hardware, … Mac/PC, Netscape/IE … Organizational Who is in charge? Competence? Politics? Update? Rules Content – related = metadata What do you talk about? The “item” = Granularity and nature of the object Semantic : date…. Created? Published? Syntactical : 04 January 2004 Linguistic : 04 Enero 2004

15 15 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Metadata Are used to Manage Provide information Retrieve Preserve Define rights and conditions of use Describe structure  Descriptive  Administrative  Structural

16 16 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC A metadata format Is a set of elements or information, mandatory or not, to apply together in order to reach one of the above mentioned objectives Standard As a text As a DTD in SGML As a Xschema in XML => MARC, EAD, MODS, Dublin Core, LOM, MPEG7, MyHomeCookedSchema …

17 17 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC The Dublin Core Metadata Element Set 15 elements ContentIntellectual property Instantiation Coverage Description Relation Type Source Title Subject Rights Contributor Publisher Creator Language Identifier Format Date

18 18 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Where metadata lay “Internal” Webpage Embedded TEI, EAD External Catalogs XML records … Includes a link to the resource => Third party metadata Library of Congress home page The Library of Congress

19 19 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Sharing metadata : Federated search My user wants “mills”…. Whatever that comes from Federated search Mill? My resource 04 Eg. Z39.50, SRU/SRW, WAIS

20 20 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Sharing metadata : Data agregation The portal gathers metadata (and resources?) Mill? My resource 04 Eg. Search engines, union catalogs, OAI

21 21 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC OAI divides the world between data providers and service providers

22 22 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC The OAI framework Service provider Harvester Repository Data provider Repository Data provider Repository Aggregator

23 23 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC OAI repositories can be organized in sets

24 24 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Honoré Daumier Lithograph (Brandeis University) MARC Record In XML Dublin Core Record In XML Qualified Dublin Core RecordQualified Dublin Core Record MODS record Multiple representations of an object

25 25 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC OAI is based on standards HTTP protocol XML XML Schemas Dublin Core

26 26 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC OAI supports 6 verbs Identify http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=Identify ListSets http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=ListSets ListRecords http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=ListRecords&metadataPrefix=oai_ dc http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=ListRecords&metadataPrefix=oai_ dc ListMetadataFormats http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=ListMetadataFormats ListIdentifiers http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=ListIdentifiers&metadataPrefix=o ai_dc GetRecord http://aerialphotos.grainger.uiuc.edu/oai.asp?verb=GetRecord&identifier=oai:aerialp hotos.grainger.uiuc.edu:AP-1A-1-1940&metadataPrefix=oai_dc

27 27 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC An OAI response - oai:images.library.uiuc.edu:emblems/324 2003-10-22 emblems - Müller, Johann Heinrich Traugott, 1631-1675 http://images.library.uiuc.edu:8081/u?/emblems,324

28 28 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Examples of repositories Library of Congress http://memory.loc.gov/cgi-bin/oai2_0 ContentDM at UIUC http://images.library.uiuc.edu:8081/cgi- bin/oai.exe Ohio State Knowledge Bank https://kb.osu.edu/dspace-oai/request

29 29 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Examples of services http://oaister.umdl.umich.edu http://nsdl.org/ http://www.americansouth.org/ http://cicharvest.grainger.uiuc.edu/ http://imlsdcc.grainger.uiuc.edu/ http://www.language-archives.org/ http://www.pictureaustralia.org/

30 30 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Turn key systems and modules CWIS : http://scout.wisc.edu/Projects/CWIS/http://scout.wisc.edu/Projects/CWIS/ ContentDM : http://contentdm.com/http://contentdm.com/ Digitool : http://www.exlibrisgroup.com/digitool.htmhttp://www.exlibrisgroup.com/digitool.htm DSpace : http://www.dspace.org/http://www.dspace.org/ EPrints : http://software.eprints.org/http://software.eprints.org/ DLXS: http://www.dlxs.org/http://www.dlxs.org/ OAICat: http://www.oclc.org/research/software/oai/cat.htmhttp://www.oclc.org/research/software/oai/cat.htm XMLFile: http://www.dlib.vt.edu/projects/OAi/software/xmlfile/xmlfile.html http://www.dlib.vt.edu/projects/OAi/software/xmlfile/xmlfile.html DLESE OAI software: http://dlese.org/oai/index.jsp

31 31 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Useful tools UIUC OAI registry http://gita.grainger.uiuc.edu/registry/ OAI repository explorer http://re.cs.uct.ac.za/ Errol http://errol.oclc.org/

32 32 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Digital libraries of distributed objects

33 33 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Metadata shareability issues Granularity Loss of context Completeness DLF-NSDL Best practices on shareable metadata http://oai-best.comm.nsdl.org/cgi-bin/wiki.pl?TableOfContents

34 34 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC What is behind URLs

35 35 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Conveying actionable URLs http://rama.grainger.uiuc.edu/assetactions/ ViewResizeSelect Annotate Share

36 36 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC Conclusions Interoperability: technical, content-related and organizational, well OAI is the easy part Works even better for particular communities with similar organizational structures and metadata formats Extensions of the protocol for: Objects Actionable URLs

37 37 June 15th, 2006 mfoulonn@uiuc.edu University of Illinois at UC References and useful material The Open Archives Website http://www.openarchives.org/OAI/2.0/guidelines.htm DLF/NSDL best practices for OAI and shareable metadata http://oai-best.comm.nsdl.org/cgi-bin/wiki.pl?TableOfContents OAForum Tutorial http://www.oaforum.org/tutorial/ Getting a Leg Up on OAI http://nsdl.comm.nsdl.org/meeting/session_docs/2004/2620_National_ Science_Digital_Library_Conference.doc


Download ppt "Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment."

Similar presentations


Ads by Google