The NSDL: A Case Study in Interoperability. William Y. Arms, Cornell University.

Presentation transcript:

1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University

2 Acknowledgement and Disclaimer: The NSDL is a program of the National Science Foundation's Directorate for Education and Human Resources, Division of Undergraduate Education. The NSDL Core Integration is a collaboration between the University Corporation for Atmospheric Research (Dave Fulker), Columbia University (Kate Wittenberg) and Cornell University (Bill Arms). The ideas discussed in this talk do not represent the official views of the NSF or the Core Integration team.

3 Research Funding: Europe and USA. Europe: Grant is awarded to carry out the research plan specified in the proposal. USA: Grant is awarded to carry out research in the area described in the proposal, but is not expected to follow the precise plan.

4 New Initiatives during a Grant
Program | Activity | University
Gigabit testbed | Mosaic | Illinois
CSTR | Lycos | Carnegie Mellon
DLI-1 | Google PageRank | Stanford
DLI-2 | Open Archives Initiative | Cornell
Examples of significant partial funding that was not envisaged in the proposal.

5 NSF-funded Research Programs [Diagram labels: NSF, Solicitation, Proposals, Research, New ideas]

6 The NSDL Program. NSF's objective: Build a comprehensive digital library for all aspects of science education. NSF's approach: Solicitation encouraged a wide diversity of proposals, divided into general categories; the best 60+ proposals were funded, with more to follow; grants allow projects flexibility. Result: A splendid set of projects. A challenge in interoperability!

7 NSDL Collections Funded by the NSF (a) Focused collections

11 NSDL Collections Funded by the NSF (b) Aggregates and federations

15 NSDL Service Projects Funded by the NSF

19 NSDL Core Integration Team Funded by the NSF

20 Responsibility without Authority. Core Integration: Budget: $4-6 million. Staff. Management: Diffuse. How can a small team, without direct management control, create a very large-scale digital library?

21 How Big might the NSDL be? All branches of science, all levels of education, very broadly defined. Five year targets: 1,000,000 different users; 10,000,000 digital objects; 10,000 to 100,000 independent sites.

22 Collections: The NSDL program funds only a fraction of the relevant collections.

23 Every Collection is Different

24 The Core Integration Task... to provide a coherent set of services across great diversity.

25 A Spectrum of Interoperability

26 Approaches to interoperability
The conventional approach:
- Wise people develop standards: protocols, formats, etc.
- Everybody implements the standards.
- This creates an integrated, distributed system.
Unfortunately...
- Standards are expensive to adopt.
- Concepts are continually changing.
- Systems are continually changing.
- Different people have different ideas.

27 Interoperability is about agreements. Technical agreements cover formats, protocols, security systems, etc., so that messages can be exchanged. Content agreements cover the data and metadata, and include semantic agreements on the interpretation of the messages. Organizational agreements cover the ground rules for access, for changing collections and services, payment, authentication, etc. The challenge is to create incentives for independent digital libraries to adopt agreements.

28 Function versus cost of acceptance [Chart: axes "Function" and "Cost of acceptance"; labels "Many adopters" and "Few adopters"]

29 Example: textual mark-up [Chart: axes "Function" and "Cost of acceptance"; points plotted: SGML, ASCII, HTML, XML]

30 Example: security [Chart: axes "Function" and "Cost of acceptance"; points plotted: Public key infrastructure, IP address, Login ID and password]

31 Levels of interoperability
Level | Agreements | Example
Federation | Strict use of standards (syntax, semantic, and business) | AACR, MARC, Z39.50
Harvesting | Digital libraries expose metadata; simple protocol and registry | Open Archives metadata harvesting
Gathering | Digital libraries do not cooperate; services must seek out information | Web crawlers and search engines
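
To illustrate how little a harvesting-level agreement asks of a collection, the following Python sketch builds an Open Archives metadata harvesting request. The verb `ListRecords` and the arguments `metadataPrefix`, `from`, and `set` come from the OAI-PMH specification; the helper name `build_harvest_url` and the base URL are hypothetical and are not part of the talk.

```python
from urllib.parse import urlencode


def build_harvest_url(base_url, metadata_prefix="oai_dc", from_date=None, set_spec=None):
    """Return an OAI-PMH ListRecords request URL for one data provider.

    base_url is the provider's harvesting endpoint (hypothetical here).
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date is not None:
        params["from"] = from_date  # harvest only records changed since this date
    if set_spec is not None:
        params["set"] = set_spec    # optional subset defined by the provider
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint, used only for illustration.
url = build_harvest_url("http://repository.example.org/oai", from_date="2002-01-01")
print(url)
```

The whole request is one HTTP GET with a handful of arguments, which is one way to read the low cost of acceptance of harvesting compared with federation.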

32 Metadata Strategy. Metadata is expensive. The NSDL cannot afford to create it manually.

33 Metadata Strategy
- Support eight standard formats
- Collect all existing metadata in these formats
- Provide crosswalks to Dublin Core
- Expose records in the metadata repository for others to harvest
- Concentrate on collection-level metadata
- Use automatic generation to augment item-level metadata
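
A crosswalk to Dublin Core can be pictured as a field-by-field mapping. The Python sketch below is only an illustration under stated assumptions: the native field names, the sample record, and the function `to_dublin_core` are invented, and only the target element names (`title`, `creator`, `description`, `subject`, `identifier`) are real Dublin Core elements.

```python
# Hypothetical crosswalk: native field name -> unqualified Dublin Core element.
CROSSWALK = {
    "resource_title": "title",
    "author": "creator",
    "summary": "description",
    "keywords": "subject",
    "web_address": "identifier",
}


def to_dublin_core(native_record, crosswalk=CROSSWALK):
    """Map a native record (a dict) to Dublin Core.

    Dublin Core elements are repeatable, so every value is stored as a list.
    Fields with no entry in the crosswalk are dropped.
    """
    dc_record = {}
    for field, value in native_record.items():
        element = crosswalk.get(field)
        if element is None or value in (None, "", []):
            continue  # no Dublin Core equivalent, or nothing to carry over
        values = value if isinstance(value, list) else [value]
        dc_record.setdefault(element, []).extend(values)
    return dc_record


# Invented sample record from an imaginary collection.
native = {
    "resource_title": "Plate Tectonics Animation",
    "author": ["A. Teacher", "B. Geologist"],
    "grade_level": "9-12",  # dropped: not in the crosswalk
    "web_address": "http://collection.example.org/item/42",
}
print(to_dublin_core(native))
```

In this sketch, information with no Dublin Core equivalent (here `grade_level`) is lost in the mapping. A repository that also stores the original record would avoid that loss; that is a design assumption, not something the slide states.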

34 The Metadata Repository [Diagram labels: Users, Services, Metadata repository, Collections] The metadata repository is a resource for service providers. It holds information about every collection and item known to the NSDL.

35 Services Strategy

36 The Metadata Repository as a Resource. Records are exposed through the Open Archives Initiative harvesting protocol. The Core Integration team will provide some services based on the metadata repository. The architecture encourages others to build services.
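
A service built on harvested records might read them roughly as follows. This is a sketch under assumptions, not the Core Integration team's code: the sample response is invented, and only the XML namespaces and element names follow the OAI-PMH 2.0 and `oai_dc` schemas. The function name `parse_records` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespaces defined by OAI-PMH 2.0 and unqualified Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Invented response, trimmed to the parts this sketch reads.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:item-1</identifier>
        <datestamp>2002-10-01</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Introduction to Photosynthesis</dc:title>
          <dc:subject>Biology</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def parse_records(xml_text):
    """Extract identifier, titles, and subjects from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.findall("oai:ListRecords/oai:record", NS):
        dc_path = "oai:metadata/oai_dc:dc/"
        records.append({
            "identifier": record.findtext("oai:header/oai:identifier", namespaces=NS),
            "titles": [e.text for e in record.findall(dc_path + "dc:title", NS)],
            "subjects": [e.text for e in record.findall(dc_path + "dc:subject", NS)],
        })
    return records


for rec in parse_records(SAMPLE_RESPONSE):
    print(rec["identifier"], rec["titles"], rec["subjects"])
```

A search or browsing service could index the resulting dictionaries without knowing anything about the collection that supplied them.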

37 Example: Search Service [Diagram labels: Portal, Search and Discovery Services, Collections, Metadata repository; connections labeled SDLIP, OAI, http] James Allan, Bruce Croft (University of Massachusetts, Amherst)

38 Research Challenges: Extending the Architecture to Support
Greater Riches
- Federations with rich sets of agreements (e.g., MARC, Z39.50)
- Rich object models (e.g., interactive, dynamic, continuous time)
- Language tools (e.g., thesaurus, gazetteer)
... and Lesser Riches
- Web crawling
- Automated quality control