We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!
Presentation is loading. Please wait.
Published byLeslie Crawford
Modified over 5 years ago
DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen firstname.lastname@example.org http://www.siderean.com
Copyright © 2003 Siderean Software LLC. All rights reserved. Overview Our perspective is that of a Semantic Web application vendor Our belief is that faceted search will be the first killer application of the Semantic Web Our goal is to show how this is possible and what the benefits are But first, some general statements…
Copyright © 2003 Siderean Software LLC. All rights reserved. Tools that leverage Dublin Core Do supportable tools exist that take advantage of Dublin Core and other metadata standards to enhance search results? Yes, our work is a case in point Also relevant: Weblog CMS RSS aggregators Other RDF applications
Copyright © 2003 Siderean Software LLC. All rights reserved. What's missing? What do people need to be able to do to actually use metadata effectively on their intranets? Start using whats out there Data in relational tables CMS-generated metadata A lot of metadata is lying around unexploited
Copyright © 2003 Siderean Software LLC. All rights reserved. Are Dublin Core guidelines sufficient? What additional specifications are needed? None: DC is an excellent minimal vocabulary that has achieved broad acceptance What we need are best practices, e.g.: Encouraging resource values over literal values for DC attributes as good style dc:subject using controlled vocabularies dc:creator using authority records dc:date using temporal hierarchies Implementing DCMI validation services
Copyright © 2003 Siderean Software LLC. All rights reserved. Is XML the primary coding language? Is it being used for Dublin Core and other metadata applications? Yes, for all the right reasons Open standards Leverage of existing tools What other encoding methods are being used? RDF/N3 for some RDF-based applications
Copyright © 2003 Siderean Software LLC. All rights reserved. Our application: Seamark A navigation engine built on three key ideas Metadata represented in Resource Description Framework (RDF) is aggregated from existing enterprise content and data Faceted metadata retrieval turns the RDF into a navigation web service Web services make navigation applications easy to install and integrate with existing Web applications
Copyright © 2003 Siderean Software LLC. All rights reserved. Faceted search and RDF: why? Enabling more effective retrieval is a major goal for the Semantic Web RDF is a superb foundation for faceted search RDF as an open standard for metadata exchange RDF Schema as a framework for defining facets The Semantic Web will enable faceted search to become pervasive Widespread sharing and reuse of ontologies, vocabularies and DC instance data becomes possible The blogosphere as an existence proof View Source for the Semantic Web
Copyright © 2003 Siderean Software LLC. All rights reserved. Seamark, Dublin Core, and CVs Enables Dublin Core Using RDF encodings of DC Handles controlled vocabularies Using emerging RDF-based standards like TIF(S) Supports building and maintaining controlled vocabularies Concepts and terms represented as resources and encoded in RDF in the same way as other content Therefore the same tools apply
Copyright © 2003 Siderean Software LLC. All rights reserved. Seamarks search interface Use of flat or hierarchical controlled vocabularies Transparency and customizability of results ranking Parametric search with customizable pull-down menus
Copyright © 2003 Siderean Software LLC. All rights reserved. Lookups into large CVs in Seamark Use of standard vocabularies represented in RDF (e.g. LCs Thesaurus of Graphical Materials Faceted search over controlled vocabulary terms Syndication of CVs, instance data and ontologies for sharing
Copyright © 2003 Siderean Software LLC. All rights reserved. Query processing in Seamark Based on XML for Retrieval By Reformulation (XRBR) A query language that Provides support for query reformulation and refinement while minimizing roundtrips Supports a stateless protocol for faceted metadata retrieval with SOAP as a transport mechanism Handles very large result sets gracefully Think of XRBR as an application profile in the digital library sense Specifies a view over heterogeneous metadata schemas with hints as to its interpretation and display
Copyright © 2003 Siderean Software LLC. All rights reserved. Query processing in Seamark Disambiguation Suggestions provide this implicitly Query expansion and concept mapping RDF models plus XRBR structure queries provide a general mechanism for this Entity extraction XSLT extensions at import augments raw metadata with additional extracted attributes Natural language processing Direct manipulation now; QA to come
Copyright © 2003 Siderean Software LLC. All rights reserved. Searching across collections Metadata aggregation using RDF provides a general platform for federated search We can directly leverage emerging SW approaches to: Thesaurus mapping tif:concept-equivalence Schema mapping rdfs:subPropertyOf
Copyright © 2003 Siderean Software LLC. All rights reserved. Setup and maintenance Installation and configuration for Windows, Linux and Mac OS X Administration Simple web-based administration interface for aggregating feeds and specifying initial queries Training 135 page tutorial Extensive on-line API documentation Courses One-day on-site introduction
Copyright © 2003 Siderean Software LLC. All rights reserved. Setup and maintenance Shelley Powers, Practical RDF, O'Reilly & Associates, 2003:... the application is easily installed and configured, and comes with considerable documentation What I was most impressed with about the product, though, was how quickly and easily it integrated my RDF/XML data … into a sophisticated query engine with little or no effort.
Copyright © 2003 Siderean Software LLC. All rights reserved. Seamarks administration interface Users can specify URLs serving RDF to load into a given model … then load them manually or on a schedule basis Alternatively, queries can be executed against an SQL database XSLT stylesheets transform XML documents and SQL result sets into RDF Aggregated models can be dumped to RDF
Copyright © 2003 Siderean Software LLC. All rights reserved. Sites using Seamark
Copyright © 2003 Siderean Software LLC. All rights reserved.
Can I Use It, and If so, How? Christian Lieske SAP AG – MultiLingual Technology Discussion of Consortium Proposal for OLIF2 File Header.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Developing a Generic Toolkit: Architecture and technology issues ALLC/ACH Conference.
A centre of expertise in digital information management Approaches To The Validation Of Dublin Core Metadata Embedded In (X)HTML Documents Background The.
Native XML Database or RDBMS. Data or Document orientation If you are primarily storing documents, then a Native XML Database may be the best option.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
The Semantic Web and Digital Libraries Eric Miller, W3C DC 2004 / SILF 2004 Shanghai Library, Shanghai, China
Z39.50 and the Web ZIG July 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
Copyright 2005 Digital Enterprise Research Institute. All rights reserved. The Web Services Modeling Toolkit Mick Kerrigan.
The CERIF-2000 Implementation. Andrei S. Lopatenko CERIF Implementation Guidelines Andrei Lopatenko Vienna University of Technology
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
A Methodology for Developing a Taxonomy – A Subject Oriented Approach
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
UPortal: A framework for the Personalization of Library Services John Fereira: Programmer/Analyst Cornell University Mann Library.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
© 2019 SlidePlayer.com Inc. All rights reserved.