Resource Discovery (metadata and searching) Working Group Report.


Issues discussed
What kinds of resources should EMELD provide search services for?
What should the design be for an EMELD search interface?
How can EMELD get good metadata into its search database?
What level of metadata should be exposed?

What resources?
Anything that might be of value to linguists working on endangered languages.
– Language data
– Tools
– Advice (including reviews)
– People
– "Gateway" websites

What resources?
But there's no reason to rely on this working group for the "what".
Alternative: a questionnaire distributed via Linguist

What resources?
Two kinds of best practice resources
Resources with best practice metadata
– These resources can be discovered
– Non-digital resources encouraged
– Digital resources discouraged, but allowed

What resources?
Best practice digital resources
All digital resources encouraged to be of this type
Benefits
– Enhanced search features (due to document interoperability)
– Special "BP globe of approval" √

What resources?
Side Note
– The Best Practice "approval" system should be tied into a larger system through which digital resources could be listed as "publications"
– A topic for another working group? (Perhaps OLAC?)

What resources?
Issues which need to be addressed
Metadata for resources that interest linguists but are not linguistic data
Needed: Best practice metadata standards for
– Tools
– Advice
– People
– ...
Test: EMELD could see how it would classify everything in BPU.

How to search?
Assumption: Metadata and data are distributed
Query Language
– Metadata: OLAC standard
– Data from interoperable documents: A new standard

How to search?
Resource Query Language
Ideal
– A generalized query protocol used across the linguistics community
– A series of "methods", still to be defined, that can be called on these resources to retrieve structured linguistic data matching query parameters

How to search?
Problems implementing ideal
– No clear sense as to what "methods" are needed.
– One solution: Examine results from questionnaire

How to search?
Problems implementing ideal
– Very few repositories allow their data to be accessed in a generalized way
– First step: Encourage documentation of repository data access systems and develop a metadata standard for this

How to search?
Long-term implementation issues
– An OLAC Query Language Protocol
    A well-defined linguistic query language
    A system for "packaging" queries
– Linguistic data search registry
    Linguistic sites register that they are data access sites
    They also register their implemented search methods
– EMELD will archive best-practice documents for data access for data creators not capable of implementing the query protocol
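
The search registry idea above can be pictured with a small sketch. This is only an illustration in Python under stated assumptions: the class names (`DataAccessSite`, `SearchRegistry`), the method names (`find_lexeme`, `find_gloss`), and the example endpoints are all hypothetical, since the slides leave the actual "methods" undefined.

```python
from dataclasses import dataclass, field


@dataclass
class DataAccessSite:
    """One registered site; all field names are illustrative assumptions."""
    name: str
    endpoint: str
    methods: set = field(default_factory=set)


class SearchRegistry:
    """Hypothetical registry: sites announce themselves and their search methods."""

    def __init__(self):
        self._sites = {}

    def register(self, name, endpoint, methods):
        # A site registers as a data access site and lists the methods it implements.
        self._sites[name] = DataAccessSite(name, endpoint, set(methods))

    def sites_supporting(self, method):
        # A search client asks which sites can answer a given kind of query.
        return sorted(s.name for s in self._sites.values() if method in s.methods)


registry = SearchRegistry()
registry.register("archive-a", "https://archive-a.example/query",
                  ["find_lexeme", "find_gloss"])
registry.register("archive-b", "https://archive-b.example/query",
                  ["find_gloss"])

print(registry.sites_supporting("find_gloss"))   # ['archive-a', 'archive-b']
print(registry.sites_supporting("find_lexeme"))  # ['archive-a']
```

A client would then send its "packaged" query only to the sites returned for the method it needs.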

How to search?
Pilot project
– Take some small subset of resources
    Data input via FIELD
    Nijmegen? SIL? AIATSIS? AILLA?
– Take FIELD search out of FIELD
– Search over that small set of resources
– Ideally, keep both resources in separate databases to begin to develop a query interchange protocol

How to search?
Another project: Grammatical thesaurus
– Develop a grammatical thesaurus that gives common synonyms for a given grammatical term (e.g., oral stop, plosive)
– This could then be used to allow a user's search to be expanded to include synonyms for a given term.
– In all likelihood, there are other applications of this.
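
The thesaurus-based search expansion described above might look roughly like this. It is a minimal Python sketch, not an EMELD design: `SYNONYM_SETS`, `build_index`, and `expand_query` are hypothetical names, and only the "oral stop" / "plosive" pair comes from the slides.

```python
# Hypothetical grammatical thesaurus: each set groups terms treated as synonyms.
# Only "oral stop" / "plosive" comes from the slides; the second set is an
# illustrative assumption.
SYNONYM_SETS = [
    {"oral stop", "plosive"},
    {"fricative", "spirant"},
]


def build_index(synonym_sets):
    """Map every term (lowercased) to the full set of its synonyms."""
    index = {}
    for group in synonym_sets:
        normalized = {term.lower() for term in group}
        for term in normalized:
            index[term] = normalized
    return index


def expand_query(term, index):
    """Return the term plus any known synonyms, sorted for stable output."""
    key = term.strip().lower()
    return sorted(index.get(key, {key}))


INDEX = build_index(SYNONYM_SETS)
print(expand_query("Plosive", INDEX))  # ['oral stop', 'plosive']
print(expand_query("vowel", INDEX))    # ['vowel']
```

A search service could run the user's query once per expanded term and merge the results, so that a search for "plosive" also finds resources described with "oral stop".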

How to search?
Search interface
– EMELD should implement a VISER-like service for access to its database
– There are two distinct kinds of searches
    Resource location
    Resource data search

How to search?
Search interface
– The details of the search interface implemented by EMELD are hard to conceive of until more resources can be accessed through it
– A questionnaire can help with this area too.
    EMELD could ask people to try the search and evaluate it
    Starting with the people in this room

Getting the data
Sticks
– EMELD Ambassadors
– Assisted by Linguist Spider

Getting the data
Carrots
– Support harvesting metadata in document headers for submitted URLs.
– Resources with best practice metadata can be cited using a standard EMELD URI
– These resources could be posted and advertised on Linguist (but consult Baden first)
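
Harvesting metadata from document headers could work along these lines. The slides do not say which header format is meant, so this Python sketch assumes Dublin Core elements embedded in HTML `<meta>` tags with a `DC.` name prefix; `MetaHarvester`, `harvest`, and the sample page are hypothetical. Fetching the submitted URL is left out, and the sketch only parses text that has already been retrieved.

```python
from html.parser import HTMLParser


class MetaHarvester(HTMLParser):
    """Collect <meta name="DC.*" content="..."> pairs from an HTML document."""

    def __init__(self):
        super().__init__()
        self.records = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = attr.get("name") or ""
        content = attr.get("content")
        # Assumption: only names starting with "DC." count as metadata.
        if name.lower().startswith("dc.") and content:
            element = name[3:].lower()
            self.records.setdefault(element, []).append(content)


def harvest(html_text):
    """Return a dict mapping each Dublin Core element to its list of values."""
    parser = MetaHarvester()
    parser.feed(html_text)
    return parser.records


SAMPLE = """<html><head>
<meta name="DC.title" content="A sketch grammar">
<meta name="DC.language" content="en">
<meta name="DC.subject" content="phonology">
<meta name="DC.subject" content="syntax">
<meta name="viewport" content="width=device-width">
</head><body></body></html>"""

print(harvest(SAMPLE))
```

Repeated elements such as `DC.subject` are kept as lists, and unrelated tags such as `viewport` are ignored.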

Getting the data
Juiciest Carrots (Best Practice resources only)
– "Preferred" EMELD URIs
– Marked as such in a search
– Could undergo "advanced" search techniques
– Be peer-reviewed and vetted by LDRA (Linguistic Digital Resource Association)*
* This organization does not exist, as far as I know.

Granularity
Right now there are no recommendations for the granularity of exposed metadata records
– Large archives, for example, have hierarchical structure, one level of which must be isolated (the IMDI session, for example)
– Cutting-edge archives don't work well with the resource=object model. Their resources are "created" based on the user's needs
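
To make the granularity question concrete, here is a small Python sketch of a hierarchical archive from which records are exposed at one chosen level. The nested-dictionary layout, the `records_at_depth` function, and the sample titles are illustrative assumptions, not an IMDI or OLAC data model.

```python
def records_at_depth(node, depth):
    """Collect the titles of nodes found exactly `depth` levels below `node`."""
    if depth == 0:
        return [node["title"]]
    found = []
    for child in node.get("children", []):
        found.extend(records_at_depth(child, depth - 1))
    return found


# Hypothetical archive: corpus > session > file.
ARCHIVE = {
    "title": "Example corpus",
    "children": [
        {"title": "Session 1",
         "children": [{"title": "audio-1.wav"}, {"title": "notes-1.txt"}]},
        {"title": "Session 2",
         "children": [{"title": "audio-2.wav"}]},
    ],
}

print(records_at_depth(ARCHIVE, 1))  # ['Session 1', 'Session 2']
print(records_at_depth(ARCHIVE, 2))  # ['audio-1.wav', 'notes-1.txt', 'audio-2.wav']
```

Exposing depth 1 yields two session-level records, while depth 2 yields three file-level records, so the chosen level changes what a search can return.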

Granularity
The lack of recommendations on this issue inhibits metadata creation
Granularity makes a big difference as to what content is searchable
Two different audiences in need of advice
– "Real" archives (a.k.a. trusted repositories)
– Individuals

Granularity
Recommendation: EMELD should encourage IMDI and OLAC to devise best-practice recommendations for granularity

The questionnaire
Two broad kinds of questions:
– What kinds of things would you like?
– What kinds of things would you hate? (Dafydd's Corollary)

The questionnaire
Part One: Search capabilities
– How do you want to conduct your search (Google-style, directory-style, pull-down menus...)?
– What kinds of searches are you doing already on other sites?
– Search within results? (We wanted this.)
– Thesaurus-based search

The questionnaire
Part Two: Search content
– Free entry (like Google)
– Feature-based entry
– Statistical questions
– Phonetic characters
– Geographical search
– Time search
– ...

The questionnaire
Part Three: Results
– Google-like results
– Journal abstract search-like results
– Restricted results (only return web sites, .pdf documents, ...)
– ...

The questionnaire
Format
– Online submission
– Combination multiple choice (for the uncreative) and free form (for the creative)
– Encourage people to envision the search of the year 2503