Compass Semantic search www.ovitas.no.

Basics: Knowledge-model-based information retrieval. Full-text search enhanced with Topic Maps = semantic search. Search-driven navigation. 12.10.2006 TMRA '06

Search technologies (chart). Axes: level of precision ("Intelligence") and data volume (domain size). Plotted: full-text search, conceptual search, semantic search, Compass.
To put this in context with the different search methods: Full-text: large domain (the Internet), fast and simple implementation. Conceptual: analysis of combinations of words. Semantic: analysis of the search string and the data based on a knowledge model. Problems with "traditional search": based on statistics, too many hits, imprecise answers, inflexible with respect to phrasing, misspellings, etc. Example: "Skiferie i Nordnorge?" (ski holiday in Northern Norway?)

Given... a web site with a lot of text that is unstructured (no markup, no tags), a controlled domain (we know what the discourse domain is), and an inadequate search engine...

We would like to... get relevant hits within a meaningful context, spare ourselves the work of structuring our data, and add semantics to the content by defining a knowledge model.

Compass-bowl: Take a full-text search engine. Take a Topic Maps engine. Add a hint of semantics. Define the correct processes for orchestrating the components. Mix them thoroughly. Serve to the public!

Full-text search engine: Apache Lucene (open source). Can index most file formats: html, asp, php, jsp, pdf, rtf, txt, doc, ppt, xls, pst… The index is independent of the model, so there is no need to re-index when changes are made to the model. Small index size: typically less than 10% of the size of the data. Fast index lookup: less than 20 ms for index size >20000.

The knowledge model: Based on the ISO International Standard for Topic Maps. Semantic model of the discourse domain. Concept words = topic names/synonyms. Semantic relationships are expressed through associations. The Compass Weight (CW) defines “closeness” between topics and is a property on association types.

Example (diagram): Ovitas hasProduct Compass; Ovitas hasEmployee Christopher. CW=0.8 is shown as a property of an association type.
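To make the knowledge model concrete, here is a minimal Python sketch of topics, synonyms, and weighted association types, populated with the Ovitas example. It is an illustration only, not the TMCore data model: the class and method names are hypothetical, the second CW value (0.5) and the "Compass search" synonym are made up, and assigning 0.8 to hasProduct is an assumption, since the slide does not say which association type carries it.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List, Optional, Tuple


@dataclass(frozen=True)
class AssociationType:
    name: str
    compass_weight: float  # CW: "closeness" between the associated topics


@dataclass
class Topic:
    topic_id: str
    names: List[str]  # concept words: the topic name plus its synonyms


@dataclass(frozen=True)
class Association:
    type: AssociationType
    source: str  # topic_id of one end
    target: str  # topic_id of the other end


@dataclass
class KnowledgeModel:
    topics: Dict[str, Topic] = field(default_factory=dict)
    associations: List[Association] = field(default_factory=list)

    def add_topic(self, topic_id: str, *names: str) -> None:
        self.topics[topic_id] = Topic(topic_id, list(names))

    def associate(self, assoc_type: AssociationType, source: str, target: str) -> None:
        self.associations.append(Association(assoc_type, source, target))

    def find_topic(self, term: str) -> Optional[Topic]:
        """Match a search term against every topic name and synonym."""
        wanted = term.casefold()
        for topic in self.topics.values():
            if any(name.casefold() == wanted for name in topic.names):
                return topic
        return None

    def related(self, topic_id: str) -> Iterator[Tuple[str, float]]:
        """Yield (neighbour_id, CW) for each association touching topic_id."""
        for assoc in self.associations:
            if assoc.source == topic_id:
                yield assoc.target, assoc.type.compass_weight
            elif assoc.target == topic_id:
                yield assoc.source, assoc.type.compass_weight


# Illustrative CW values; which type carries 0.8 is an assumption.
has_product = AssociationType("hasProduct", 0.8)
has_employee = AssociationType("hasEmployee", 0.5)

model = KnowledgeModel()
model.add_topic("ovitas", "Ovitas")
model.add_topic("compass", "Compass", "Compass search")  # made-up synonym
model.add_topic("christopher", "Christopher")
model.associate(has_product, "ovitas", "compass")
model.associate(has_employee, "ovitas", "christopher")
```

Because the weight lives on the association type rather than on each association, changing one CW value re-tunes every association of that type.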

Compass orchestrator: Guides the processes of the search. (1) Search for the term in the topic map. (2) Expand the map for relevant/related topics. (3) Send all these terms off to a full-text search. (4) Calculate relevance (based on the combination of CW and Lucene weights) and prepare the result list as an XML instance. (5) Render the XML as desired.
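The orchestration steps can be sketched in a few lines of Python. This is a toy under stated assumptions, not the Compass implementation: the topic map and documents are hard-coded, a simple occurrence count stands in for the Lucene score, relevance is assumed to be the product of the accumulated CW and that count (the slides only say the two are combined), and the XML element names are invented.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical topic map: topic id -> names, plus weighted associations.
TOPIC_NAMES = {
    "ovitas": ["Ovitas"],
    "compass": ["Compass"],
    "christopher": ["Christopher"],
}
ASSOCIATIONS = [  # (topic, topic, CW); the weights are illustrative
    ("ovitas", "compass", 0.8),
    ("ovitas", "christopher", 0.5),
]
DOCUMENTS = {  # stand-in for the full-text index
    "http://example.org/a": "Compass is a semantic search product. Compass uses Topic Maps.",
    "http://example.org/b": "Christopher presented the product at the conference.",
}


def find_topic(term):
    """Step 1: look the search term up among the topic names."""
    for topic_id, names in TOPIC_NAMES.items():
        if term.casefold() in (name.casefold() for name in names):
            return topic_id
    return None


def expand(start, hops, threshold):
    """Step 2: follow associations for `hops` steps.

    Returns {topic_id: weight}; a weight is the product of the CWs along
    the best path found, and paths below `threshold` are dropped.
    """
    weights = {start: 1.0}
    frontier = [start]
    for _ in range(hops):
        next_frontier = []
        for topic_id in frontier:
            for a, b, cw in ASSOCIATIONS:
                if topic_id not in (a, b):
                    continue
                other = b if a == topic_id else a
                weight = weights[topic_id] * cw
                if weight >= threshold and weight > weights.get(other, 0.0):
                    weights[other] = weight
                    next_frontier.append(other)
        frontier = next_frontier
    return weights


def text_score(text, name):
    """Step 3 stand-in: count whole-word occurrences (not a Lucene score)."""
    pattern = r"\b" + re.escape(name.casefold()) + r"\b"
    return len(re.findall(pattern, text.casefold()))


def search(term, hops=1, threshold=0.3):
    """Steps 1-4: return the result list as an XML element."""
    root = ET.Element("results", term=term)
    start = find_topic(term)
    if start is None:
        return root
    weights = expand(start, hops, threshold)
    for topic_id, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
        topic_el = ET.SubElement(root, "topic", id=topic_id, weight=f"{weight:.2f}")
        hits = []
        for url, text in DOCUMENTS.items():
            score = max(text_score(text, name) for name in TOPIC_NAMES[topic_id])
            if score:
                hits.append((weight * score, url))  # assumed combination rule
        for relevance, url in sorted(hits, reverse=True):
            ET.SubElement(topic_el, "hit", url=url, relevance=f"{relevance:.2f}")
    return root


if __name__ == "__main__":
    print(ET.tostring(search("Ovitas"), encoding="unicode"))
```

In this toy data the term "Ovitas" appears in no document, yet the search still returns hits through the related topics, which is the case of a search term found in the topic map but not in the text.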

Hits in the full text grouped by the related topics (labels: search term, Topic Map expansion, relevant documents ranked by the weighting result)

Search term in the topic map, but not in the text: relevant information about “Chris Searle”

Synonym search

Creating/maintaining the model: An MS Excel plug-in serves as the topic map editor. The model can be put under version control. Importing the model into the topic map engine takes one click only. For complex topic maps, a custom user interface can be used to enter instance data.

Navigation: Navigation through the associations between topics. Navigation by search.

User configurations: What pages to index. What topic map to use. The number of hops to perform. The threshold for relevance.
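The four settings could be captured in a small configuration object. The sketch below is hypothetical: the field names, the example values, and the validation rules are assumptions made for illustration, and the slides do not describe Compass's actual configuration format.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class CompassConfig:
    pages_to_index: List[str]    # what pages to index (addressed by URL)
    topic_map: str               # what topic map to use
    max_hops: int                # the number of hops to perform
    relevance_threshold: float   # the threshold for relevance

    def __post_init__(self):
        # Assumed sanity checks; the real system may validate differently.
        if not self.pages_to_index:
            raise ValueError("at least one page must be listed for indexing")
        if self.max_hops < 0:
            raise ValueError("max_hops must be zero or greater")
        if self.relevance_threshold < 0:
            raise ValueError("relevance_threshold must be zero or greater")


# Example values are invented.
config = CompassConfig(
    pages_to_index=["http://example.org/"],
    topic_map="example-topic-map",
    max_hops=2,
    relevance_threshold=0.3,
)
```

Setting `max_hops` to 0 would restrict the search to the matched topic itself, with no expansion to related topics.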

Content lifecycle management: Easy to integrate with content repositories. A content management or publishing system can send a request to the indexer to re-index a particular resource. Incremental indexing: add, update or delete documents. HTTP is used as the basic mechanism to address content.
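Incremental indexing means a single resource can be added, updated, or deleted without rebuilding the whole index. The following Python sketch shows the idea with an in-memory inverted index keyed by URL. It stands in for the real Lucene index, the class and method names are hypothetical, and fetching the resource over HTTP is left out so that the example stays self-contained.

```python
import re
from collections import defaultdict


class IncrementalIndex:
    """Toy inverted index whose documents are addressed by URL."""

    def __init__(self):
        self._postings = defaultdict(set)  # word -> URLs containing it
        self._doc_words = {}               # URL -> words indexed for it

    def delete(self, url):
        """Remove one document; unknown URLs are ignored."""
        for word in self._doc_words.pop(url, set()):
            self._postings[word].discard(url)
            if not self._postings[word]:
                del self._postings[word]

    def add_or_update(self, url, text):
        """Index one document, replacing any earlier version of it."""
        self.delete(url)
        words = set(re.findall(r"\w+", text.casefold()))
        self._doc_words[url] = words
        for word in words:
            self._postings[word].add(url)

    def search(self, word):
        """Return the URLs of the documents that contain the word."""
        return sorted(self._postings.get(word.casefold(), set()))


index = IncrementalIndex()
index.add_or_update("http://example.org/a", "Compass semantic search")
index.add_or_update("http://example.org/b", "Topic Maps search")
index.add_or_update("http://example.org/a", "Compass navigation")  # re-index one page
index.delete("http://example.org/b")                               # page was removed
```

A publishing system would call `add_or_update` or `delete` for the one resource that changed, which matches the re-index request described on the slide.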

Architecture: SOA (service-oriented architecture), no dependency on platform or components. Web service interface (HTTP/REST). .NET platform. Integrated components: TMCore Topic Maps engine by NetworkedPlanet; Apache Lucene full-text engine.

Architecture diagram. Components shown: Compass Service, TM Core, TM Nav, Full Text, Excel Editor, Publishing System Services. Actors: TM editor (person), User.

Compass - Summary: Semantic search based on Topic Maps. Search in any document format. Organize information in a topic-oriented manner. Link to relevant information without touching the data content. Conceptual navigation by Topic Maps. Tools for maintaining/evolving the classification. Fast and easy implementation.