Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.

Slides:



Advertisements
Similar presentations
Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.
Advertisements

Heinrich Stamerjohanns Institute for Science Networking Distributed Open Archives Dr. Heinrich Stamerjohanns Institute for Science Networking at the University.
Possibility in Digital Collection Management Introduction to CONTENTdm TM Hitoshi Kamada University of Arizona Presentation for OCLC-CJK Users Group Annual.
Manage Scientific Metadata Using XML Yang, R., M. Kafatos and X. Wang, Managing Scientific Metadata Using XML, IEEE Internet Computing, Volume: 6, Issue:
Collection Service. 19 February 2001CYCLADES Kick-off meeting Collection A set of documents A set of services on the documents A set of polices that regulate.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
Ockham Library Network OAI, other “light-weight” protocols, and scholarly communication.
Multi-Model Digital Video Library Professor: Michael Lyu Member: Jacky Ma Joan Chung Multi-Model Digital Video Library LYU9904 Multi-Model Digital Video.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
June 22-23, 2005 Technology Infusion Team Committee1 High Performance Parallel Lucene search (for an OAI federation) K. Maly, and M. Zubair Department.
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
31 January 2007Craig E. Ward1 Large-Scale Simulation Experimentation and Analysis Database Programming Using Java.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
The Exchange of Retrieval Knowledge about Services between Agents Mirjam Minor Mike Wernicke.
Chemical Toxicity and Safety Information System Shuanghui Luo Ying Li Jin Xu.
Java-Based Middleware IT 490 Stan Senesy IT Program NJIT.
ALCME: OAI at OCLC Jeffrey A. Young OCLC Online Computer Library Center, Inc.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
MTA SZTAKI Department of Distributed Systems The problems of persistent identifiers in the context of the National Digital Data Archives of Hungary András.
WDC-MARE – World Data Center for Marine Environmental Sciences Data portal based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler,
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Alexandria Digital Earth ProtoType DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project.
Jian Gui WANG New Implementation of Agriculture Models APAN19---Jan New Implementations of Agriculture Models Using Mediate Architecture.
Kurt Maly Department of Computer Science Old Dominion University Norfolk, Virginia 23529, USA Digital Libraries, OAI and Free Software.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
ABSTRACT The JDBC (Java Database Connectivity) API is the industry standard for database- independent connectivity between the Java programming language.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
Domain-Expert Repository Management for Adaptive Hypermedia Learning System By Norazah Yusof & Paridah Samsuri Members of SPAtH Group Faculty of Comp.
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
Managing Learning Objects in Large Scale Courseware Authoring Studio Ivo Marinchev, Ivo Hristov Institute of Information Technologies Bulgarian Academy.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Model Design using Hierarchical Web-Based Libraries F. Bernardi Pr. J.F. Santucci {bernardi, University of Corsica SPE Laboratory.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Object storage and object interoperability
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
June 3-6, 2003E-Society Lisbon Automatic Metadata Discovery from Non-cooperative Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Building Search Systems for Digital Library Collections
Panagiotis G. Ipeirotis Tom Barry Luis Gravano
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Database Management System (DBMS)
Ahmet Fatih Mustacoglu
Institutional Repositories
Introduction to World Wide Web
Presentation transcript:

Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science Yuan-Ze University ICADL /12/11

Taiwan - ICADL Outline Introduction Related Technologies System Architecture An Experimental Prototype Conclusions Future work

Taiwan - ICADL Introduction Metadata management is not an easy task: – It requires specific domain knowledge for appropriate data categorization. –It needs to deal with the complicated relationships between the metadata items. –A good management tool for easing metadata construction and manipulation is necessary.

Taiwan - ICADL Introduction Metalogy –Metalogy is a management system developed by ROSS project group in Taiwan. –It can be used to manipulate various digitized items and export/import XML records. –It is mainly designed for metadata management of each digital library.

Taiwan - ICADL Introduction Search across digital libraries: –Metalogy does not consider how to search information across digital libraries. –As digital libraries are widely deployed, searching information across several digital libraries becomes important. –We design a search engine to help users find resources without connecting to digital libraries and inputting the same query terms.

Taiwan - ICADL Introduction We design this search engine based on the XML data exported from Metalogy for some reasons: –XML/Metalogy provides comprehensive metadata descriptions and DTD information for metadata search. –The quality of the distributed service highly depends on the quality of the data resource.

Taiwan - ICADL Related Technologies Z39.50 –It was proposed to search and retrieve information from heterogeneous databases over networks. –Provide abstract search capability. –It is difficult to be implemented because of its strengthened functionality. OAI – Arc –Arc is developed for cross-archive searching. –It adopts the OAI protocol to harvest digital archives.

Taiwan - ICADL Related Technologies Harp –Harp provides a uniform query interface across legacy public libraries through HarpSQL. –A HarpSQL server acts as a query agent for storing and handling the intermediate query results not as a search engine to collect and store all metadata. METALICA –It adopts a meta-search engine like MetaCrawler to provide a uniform user interface for supporting cross- archive search.

Taiwan - ICADL System Architecture XML XML Parser (Java Application) Index Database Search Engine (Java Servlet) DTD Manager (Java Servlet) User Interface Manager Interface Query Request Metadata DTD Digital Library 1 DTD Browser ‧ ‧ Digital Library n Digital Library 2

Taiwan - ICADL System Architecture The search engine is constructed with three modules: –Search engine module Provide an integrated user interface Adopt Java servlets to provide search services –Index database module Provide metadata repository for digital library sources. Adopt simple Dublin Core set as default metadata. Store DTD mapping relationships.

Taiwan - ICADL System Architecture –Metadata/DTD manager Provide an administration interface to manage XML/DTD mapping relationships. Parse and translate the XML/DTD documents provided by remote digital libraries. Gather information from remote digital libraries and update the index database repeatedly.

Taiwan - ICADL An Experimental Prototype Development tool: –Implement this search engine with Java to reach platform-independence. –Parse XML information with JAXP (Java API for XML parsing) package. –The database is constructed with a public domain database MySQL.

Taiwan - ICADL An Experimental Prototype XML/DTD manager Manage functionality

Taiwan - ICADL An Experimental Prototype A mapping example Mapping information

Taiwan - ICADL An Experimental Prototype An search example A famous calligrapher His-Chih Wang ( AD)

Taiwan - ICADL An Experimental Prototype Search results Matched metadata Link to the resource file

Taiwan - ICADL Conclusions Present the design of a search engine for searching information across digital libraries based on metadata/XML. The design of the search engine has three advantages: –First, the system architecture is simple and the cost is low.

Taiwan - ICADL Conclusions –Second, the system extensibility is high for newly required services. –Third, users need not to know how and where to search information by using this uniform user interface.

Taiwan - ICADL Future Work The quality control on the metadata provided by the original digital library source. The mapping scheme to support more heterogeneous digital archives should be further discussed.

Taiwan - ICADL Future Work The performance issue should be further addressed when the environment is in a large scale. How to effectively update information from the remote digital libraries is another important work to do.