ISP 433/533 Week 8 IR in libraries

Goal
Universal Access to Information
Vannevar Bush, 1945 article ("As We May Think")
Memex
– A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility. It is an enlarged intimate supplement to his memory.

History
1970s: commercial retrieval systems
– Search remote databases to provide reference services
1980s: online public access catalog (OPAC), full text files
– Provide online access to end users
1990s: digital library programs, WWW
2000s: ?

Bibliographic Databases
Chemical Abstracts (CA), Engineering Index, MEDLINE, PsycINFO, etc.
Manually selected, indexed, abstracted and entered into system
Record format depends on field
– Controlled vocabulary

Database Vendors
DIALOG, LEXIS-NEXIS, OCLC, Wilson, etc.
Provide a common search interface
Search on multiple bibliographic databases
– Cross-database search
Mostly Boolean retrieval
Cater to professional search intermediaries, e.g. reference librarians
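To make "mostly Boolean retrieval" and "cross-database search" concrete, here is a minimal sketch in Python. It is not how DIALOG or any other vendor actually implements search: the toy databases, record ids, and function names are all hypothetical, and the index is a plain in-memory inverted index.

```python
from collections import defaultdict

def build_index(records):
    """Map each lowercase term to the set of record ids containing it."""
    index = defaultdict(set)
    for rec_id, text in records.items():
        for term in text.lower().split():
            index[term].add(rec_id)
    return index

def boolean_and(index, *terms):
    """Records containing every term."""
    sets = [index.get(t.lower(), set()) for t in terms]
    return set.intersection(*sets) if sets else set()

def boolean_or(index, *terms):
    """Records containing at least one term."""
    return set().union(*(index.get(t.lower(), set()) for t in terms))

def boolean_not(index, all_ids, term):
    """Records that do not contain the term."""
    return set(all_ids) - index.get(term.lower(), set())

def cross_database_search(databases, query):
    """Run one query against every database through the same interface."""
    results = {}
    for name, records in databases.items():
        index = build_index(records)
        results[name] = sorted(query(index, records.keys()))
    return results

# Hypothetical toy databases standing in for bibliographic databases.
DATABASES = {
    "medline_toy": {
        "m1": "aspirin therapy heart disease",
        "m2": "heart surgery outcomes",
    },
    "psycinfo_toy": {
        "p1": "stress and heart disease",
        "p2": "memory and aging",
    },
}

def heart_and_disease_not_aspirin(index, all_ids):
    # (heart AND disease) AND NOT aspirin
    return boolean_and(index, "heart", "disease") & boolean_not(index, all_ids, "aspirin")

hits = cross_database_search(DATABASES, heart_and_disease_not_aspirin)
print(hits)
```

Because every database is queried through the same function, the searcher writes the Boolean expression once, which is the point of a common search interface.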

OPACs
Provide patrons access to library holdings
Search by author, title, call number, subject heading, keywords
MAchine-Readable Cataloging (MARC) record format
Boolean search
Web interface to legacy systems
OPACs at Albany

Digital Libraries
A DL is a collection of information that is both digitized and organized (Lesk)

How do DLs differ?
Vs. traditional bibliographic databases and OPACs
– Extension and superset
– Provide both metadata and data
– New technology
Vs. WWW
– Organization
– Tightly controlled, with a targeted customer set

Vs. Traditional Library
Physical objects
– You have it, I can't have it
– Travel to access
– Expensive to maintain
– Anything else?
TL doesn't collect "Grey Literature"
– technical reports, government reports, unedited proceedings, etc.

Converting to Digital Format
Scanning
– basically "photographing" a page
Optical Character Recognition (OCR)
– generally, when scanning, additional software deduces semantic content from the photographed page ("guesses the words")
Keying
– retyping it all back in...
All too time-consuming and $$$!
Best to avoid conversion altogether if possible

Better Way
Publishing with a DL in mind
Publishing in electronic form
What format?

                Archival    Original          Intermediate  Presentation
Past            TIFF        -                 -             GIF, JPEG
Present/Future  XML, RTF    Word, TeX/LaTeX   RTF           PS, PDF, HTML

DL Architecture
A Framework for Distributed Digital Object Services
– Kahn/Wilensky Framework (KWF)
digital objects (DOs)
– a unit of exchange for the DL with a particular data structure and characteristics
repository
– the place where DOs live
handles
– a unique, persistent name for a DO

Kahn/Wilensky Framework

Digital Objects
Typed data:
– E.g., type: computer-science-tech-report, bit-sequence…
– with metadata: author, institution, series, etc.
Composite DOs:
– a DO with data of type digital-object
– composite DOs can be used to collect similar works together, e.g. a composite DO that contains a DO for each work of Shakespeare...
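The typed-data and composite ideas above can be sketched as a small Python data structure. This is an illustration only, not the Kahn/Wilensky specification: the class name, field names, and the example handles (`example.lib/...`) are hypothetical, and only the type names `bit-sequence` and `digital-object` come from the slide.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List, Union

@dataclass
class DigitalObject:
    handle: str                                   # unique, persistent name
    data_type: str                                # e.g. "bit-sequence" or "digital-object"
    data: Union[bytes, List["DigitalObject"]]     # raw bits, or member DOs if composite
    metadata: Dict[str, str] = field(default_factory=dict)

    def is_composite(self) -> bool:
        # A composite DO is a DO whose data is itself of type digital-object.
        return self.data_type == "digital-object"

    def leaves(self) -> Iterator["DigitalObject"]:
        """Yield every non-composite DO reachable from this one."""
        if self.is_composite():
            for child in self.data:
                yield from child.leaves()
        else:
            yield self

# Hypothetical example: a composite DO collecting two works by one author.
hamlet = DigitalObject("example.lib/hamlet", "bit-sequence", b"...",
                       {"author": "Shakespeare", "title": "Hamlet"})
macbeth = DigitalObject("example.lib/macbeth", "bit-sequence", b"...",
                        {"author": "Shakespeare", "title": "Macbeth"})
collected = DigitalObject("example.lib/shakespeare", "digital-object",
                          [hamlet, macbeth], {"author": "Shakespeare"})

print([do.metadata["title"] for do in collected.leaves()])
```

Since a composite holds ordinary DOs, composites can nest, and `leaves()` walks any depth.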

Handles
Handles can be thought of as a Uniform Resource Name (URN) implementation
… contains info about the handle system
– persistence
– location independence
Handles are of the general form:
GlobalAuthority.LocalAuthority/LocallyUniqueString
or, for example:
NASA.LaRC/tm
Possible project: evaluate various URN implementations (e.g. Handle, PURL, DOI)
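The general form above can be split into its three parts with a few lines of Python. This is a minimal sketch that assumes only the syntax given on the slide; it is not the Handle System's own resolver or API, and the lookup table and `example.org` URL are hypothetical, included to illustrate location independence.

```python
from typing import NamedTuple

class Handle(NamedTuple):
    global_authority: str
    local_authority: str
    local_name: str

def parse_handle(text: str) -> Handle:
    """Split 'GlobalAuthority.LocalAuthority/LocallyUniqueString' into parts."""
    authority, slash, local_name = text.partition("/")
    if not slash or not local_name:
        raise ValueError(f"missing '/LocallyUniqueString' in {text!r}")
    global_authority, dot, local_authority = authority.partition(".")
    if not dot or not global_authority or not local_authority:
        raise ValueError(f"missing 'Global.Local' authority in {text!r}")
    return Handle(global_authority, local_authority, local_name)

# Hypothetical lookup table: the handle stays fixed while the location may change.
LOCATIONS = {
    "NASA.LaRC/tm": "https://example.org/current/location/of/object",
}

def resolve(text: str) -> str:
    parse_handle(text)          # reject malformed handles before lookup
    return LOCATIONS[text]

print(parse_handle("NASA.LaRC/tm"))
```

Moving the object only means updating the table entry; anything that cites the handle keeps working, which is what persistence and location independence mean here.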

Repository Access Protocol (RAP)
"Protocol" may be misleading; it's really just the skeleton for a protocol
RAP is designed to be simple
– repositories themselves should be simple
KWF defines 3 basic operation classes:
– ACCESS_DO
– DEPOSIT_DO
– ACCESS_REF: returns a reference to the repository server; this is the catch-all operation for all meta-services...
– More operations were defined in implementations
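As a rough illustration of how small the RAP skeleton is, the three operation classes can be modelled as three methods on an in-memory repository. Only the operation names come from the slide. The method signatures, return values, and storage layout are assumptions made for this sketch, not the framework's actual definition.

```python
class Repository:
    """Toy in-memory repository exposing the three KWF operation classes."""

    def __init__(self, name):
        self.name = name
        self._objects = {}          # handle -> stored digital object

    def deposit_do(self, handle, data, metadata=None):
        """DEPOSIT_DO: store a digital object under its handle."""
        if handle in self._objects:
            raise KeyError(f"handle already in use: {handle}")
        self._objects[handle] = {"data": data, "metadata": dict(metadata or {})}
        return handle

    def access_do(self, handle):
        """ACCESS_DO: return the digital object named by the handle."""
        return self._objects[handle]

    def access_ref(self):
        """ACCESS_REF: return a reference describing the repository itself.

        Assumption: meta-services are represented here as a plain dict.
        """
        return {
            "repository": self.name,
            "operations": ["ACCESS_DO", "DEPOSIT_DO", "ACCESS_REF"],
            "object_count": len(self._objects),
        }

repo = Repository("toy-repository")
repo.deposit_do("NASA.LaRC/tm", b"report bytes", {"series": "tech-memo"})
print(repo.access_do("NASA.LaRC/tm")["metadata"])
print(repo.access_ref())
```

Everything beyond storing and returning objects goes through `access_ref`, which keeps the repository itself simple.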

DL Points
The underlying architecture should be separate from the content stored in the library
Names and identifiers are the basic building block for the digital library
Digital library objects are more than collections of bits
Users want intellectual works, not digital objects

5S Model
Streams
Structures
Spaces
Scenarios
Societies

Many, many research projects
Multilingual
Multimedia
Structured documents
Distributed collections/federated search
User interface
Institution: creation, access, and use

Commercial DL
Journal Storage Project (JSTOR)
– started as a University of Michigan project funded by the Andrew W. Mellon Foundation, now a commercial organization
Roughly 100 journals
– mostly humanities, social science, math, economics
Only WWW access
– keeps a list of "allowed" IP names / addresses
Provides only images for the pages
OCR is done, but the results are used only for searching and are not displayed to the user
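The last three points describe a pattern that is easy to sketch: check the client against an allow-list, search the hidden OCR text, and return only page images. The Python below illustrates that pattern under stated assumptions. It is not JSTOR's implementation, and the network range (a reserved documentation range), file names, and page text are hypothetical.

```python
import ipaddress

# Hypothetical allow-list; 192.0.2.0/24 is reserved for documentation.
ALLOWED_NETWORKS = [ipaddress.ip_network("192.0.2.0/24")]

def is_allowed(client_ip: str) -> bool:
    """True if the client address falls inside any allowed network."""
    addr = ipaddress.ip_address(client_ip)
    return any(addr in net for net in ALLOWED_NETWORKS)

# Each page keeps a scanned image plus OCR text that is never shown to users.
PAGES = [
    {"image": "vol1-page-001.gif", "ocr_text": "Supply and demand in rural markets"},
    {"image": "vol1-page-002.gif", "ocr_text": "A note on prime numbers"},
]

def search_pages(client_ip: str, term: str):
    """Search the OCR text, but return only the matching page images."""
    if not is_allowed(client_ip):
        raise PermissionError(f"{client_ip} is not on the allowed list")
    needle = term.lower()
    return [p["image"] for p in PAGES if needle in p["ocr_text"].lower()]

print(search_pages("192.0.2.17", "prime"))
```

Showing the image instead of the OCR text means recognition errors can cost a search hit but never put wrong words in front of the reader.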