Presentation on theme: "OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)"— Presentation transcript:
OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Different versions of this presentation were given at the following meetings: OCLC Australian Advisory Council Melbourne, February 1, 2004 National Library of Australia, Canberra, February 6, 2004 OCLC Members Council Dublin, Ohio, February 9, 2004
Research possibilities ….. are endless! Becoming more complex as more activities enter a network space. Focus … –on maximizing impact of a limited resource. –on where can make an internal and external impact. –on making valuable work more visible –on engaging external partners in useful collaboration.
Overview Collection and user analysis Interoperability System & service architecture Knowledge organization Content management
Collection and user analysis Change creates demand for better data. Growing interest in knowing more about: –Characteristics –Gaps and overlaps –Use Tuning collections based on data. Focus collection spending where creates most value. The idea of the balancedbut unreadcollection is disappearing. Librarians cannot change user behavior so they need to meet the user.
OR objectives Support better management decisions by –Making data work –Exploring user behaviors.
Some projects Characteristics of collections –WorldCat –CIC Compare ILL, circulation and holdings data. Last copy: what is irreplaceable? ARL Global Resources. –Exploring coverage of overseas titles in ARL libraries. Large scale user behavior study –IMLS project with OSU and OCLC
Content management Digital asset management a growing concern –Cultural heritage, special collections, … –Learning objects –Institutional repositories Issues –Repository selection and interoperability –Securing long term access to digital assets
Content management Digital preservation –Economics of digital preservation –Consensus making – OCLC/RLG working groups –Preservation metadata (PREMIS) Repository architectures –Contributions to Dspace codebase to support its interoperability OAI SRW –Reference models IMS repository interoperability
System and service architecture The library systems environment is getting more complex –ILS –Digital asset management –Resolution –Portal –Resource sharing –License management –Auth* Build, buy, opensource? Integration –Integrated workflow Portal Cataloging …
Unplug and play In this model business functionality is unplugged from large integrated centralized workflow applications so that it can be more readily integrated inside local applications and workflow. In this way, functionality potentially reaches a wider audience, provides more value to existing audiences, and extends the life of legacy applications.
OR objectives Investigate new ways of structuring and viewing WorldCat and associated knowledge structures Exploit emerging technologies, open standards and protocols to prototype new services
Some projects Unplug and play –Metadata schema transformation –E-prints UK –Terminology services –Name authority services –XISBN Text searching –Fast searching on Beowulf clusters Harvesting –NDLTD Union Catalog
Metadata schema transformation Metadata schema translator Web services layer Crosswalk repository client Record translation client A transformed record A record A metadata crosswalk
xISBN An experimental web service –Give it an ISBN, it returns all related ISBNs –Based on WorldCat –Designed for machine-to-machine data exchange Examples: –Check user ILL requests against all editions/versions in OPAC –Find librarys editions when user finds any edition/version of item on Amazon –Check OPAC for all editions during selection/acquisitions/gift book processing
Searching for the book on Amazon Searching for the book on Amazon
LibraryLookup bookmarklet LibraryLookup http://www.amazon.co.uk/exec/obidos/A SIN/1860464955/qid=1075134526/sr=1- 1/ref=sr_1_10_1/202-6426661-8213436 Is the book at my library? Single ISBN
xISBN bookmarklet http://www.amazon.co.uk/exec/obidos/A SIN/1860464955/qid=1075134526/sr=1- 1/ref=sr_1_10_1/202-6426661-8213436 xISBN server LibraryLookup xISBN Multiple ISBNs ADDED Is the book at my library?
Knowledge organization and semantic web "The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation." -- Tim Berners-Lee, James Hendler, Ora Lassila, The Semantic Web, Scientific American, May 2001 The Semantic Web
OR objectives To release the value of the historical library investment in controlled vocabularies and knowledge structures –Redeploy tools for accessing or assigning names, subjects, and classification numbers –Make knowledge organization services more accessible.
Projects FAST Terminology services FRBR Automatic classification VIAF – Virtual International Authority File –Library of Congress, Die Deutsche Bibliothek
FAST Geographic Search by Area Avalon Lake Bellaire, Lake Charlevoix, Lake Fletcher Pond Munro Lake Ocqueoc Lake Bar 1 Bay 5 Bridge 1 Channel 2 Civil 23 Forest 4 Island 4 Lake 6 Park 10 Ppl 92 Stream 10
Knowledge org systems Plethora of vocabularies Incompatible approaches to encoding Few connections –Education GEM Subjects, ERIC Thesaurus, LCSH, CIP (Classification of instructional programs) –Cultural Heritage AAT, Thesaurus for Graphic Materials (TGM) Subjects & Genre Terms Not built for the web –Link to concepts
Terminology services: Webulating knowledge organization The goal of this project is to offer accessible, modular, web-based terminology services. Make vocabularies more available for –Metadata creation –Searching –… Refine and extend mappings Represent vocabularies in major encoding standards, e.g., MARC, Zthes, TIF Prototype custom web services as appropriate
2.6+ million fiction records from Worldcat, clustered by OCLCs FRBR algorithm Make greater use of data (genres, settings, imaginary characters, etc)
Interoperability Extract maximum value from investment in –Metadata –Content –Services By ensuring that they are –Sharable –Reusable –Recombinable
OR objectives Provide leadership in Internet and information standardization Help to raise the visibility of the values and value of librarianship
Some examples Dublin Core –Central to library, cultural heritage and related communities. –Harvested data: OAI –8 Governments –Corporations and NGOs Protocols –Z39.50, SRW/U, OAI, Zthes Identifiers –INFO URI, PURL Registries –DCMI, OpenURL, Info URI Everywhere …! Cliff Lynch on Info URI: … it represents an important new step in collaboration ACROSS standards organizations, and … I think the work is of real importance to the CNI community.