LogicSQL-based Enterprise Archive and Search System: How to organize the information and make it accessible and useful? Li-Yan Yuan, Oct 30, 2006.

Presentation transcript:

1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System
How to organize the information and make it accessible and useful?
Li-Yan Yuan

2 Projects
- How to develop an enterprise search engine based on a database management system
  - Challenge: implementation of the inverted index
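One way to picture an inverted index living inside a database is as an ordinary table of (term, document, frequency) rows. The sketch below uses Python's built-in sqlite3 module purely as a stand-in; it is not LogicSQL, and the table names `docs` and `postings`, the tokenizer, and the sample documents are all assumptions made for illustration.

```python
import re
import sqlite3


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(conn, documents):
    """Store documents plus an inverted index (term -> doc_id, freq) in tables."""
    conn.execute("CREATE TABLE docs (doc_id INTEGER PRIMARY KEY, body TEXT)")
    conn.execute(
        "CREATE TABLE postings ("
        "term TEXT, doc_id INTEGER, freq INTEGER, "
        "PRIMARY KEY (term, doc_id))"
    )
    for doc_id, body in documents.items():
        conn.execute("INSERT INTO docs VALUES (?, ?)", (doc_id, body))
        # Count how often each term occurs in this document.
        counts = {}
        for term in tokenize(body):
            counts[term] = counts.get(term, 0) + 1
        conn.executemany(
            "INSERT INTO postings VALUES (?, ?, ?)",
            [(term, doc_id, freq) for term, freq in counts.items()],
        )


def lookup(conn, term):
    """Return (doc_id, freq) pairs for a term, most frequent first."""
    rows = conn.execute(
        "SELECT doc_id, freq FROM postings WHERE term = ? "
        "ORDER BY freq DESC, doc_id",
        (term.lower(),),
    )
    return rows.fetchall()


# Hypothetical sample data.
conn = sqlite3.connect(":memory:")
build_index(conn, {
    1: "archive search archive",
    2: "enterprise search",
    3: "database index",
})
print(lookup(conn, "archive"))
```

Keeping the postings in a table lets the database's own storage, transactions, and ordering machinery do the work, which is roughly the appeal of building the search engine on a DBMS; a native inverted-index object would presumably avoid the per-row overhead of this naive layout.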

3 Projects
- How to implement the top-K query
  - Ranking formula
  - Inverted indexes are created with respect to frequencies
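Ordering each posting list by frequency is what makes early termination possible in a top-K query. The following sketch shows one well-known approach, a threshold-style scan in the spirit of Fagin's threshold algorithm; the slides do not say which algorithm or ranking formula the system uses, so the scoring rule (sum of term frequencies) and all names here are assumptions.

```python
import heapq


def build_postings(doc_term_freqs):
    """Build term -> [(doc_id, freq), ...] lists sorted by descending frequency."""
    postings = {}
    for doc_id, freqs in doc_term_freqs.items():
        for term, freq in freqs.items():
            postings.setdefault(term, []).append((doc_id, freq))
    for plist in postings.values():
        plist.sort(key=lambda p: (-p[1], p[0]))
    return postings


def top_k(postings, doc_term_freqs, query_terms, k):
    """Return the k best (doc_id, score) pairs; score = sum of term frequencies."""
    if k <= 0:
        return []
    lists = [postings.get(term, []) for term in query_terms]
    max_depth = max((len(plist) for plist in lists), default=0)
    seen = set()
    best = []  # min-heap of (score, -doc_id); the weakest kept result is best[0]
    depth = 0
    while depth < max_depth:
        # Upper bound on the score of any document not yet seen.
        threshold = 0
        for plist in lists:
            if depth >= len(plist):
                continue  # list exhausted: unseen documents contribute 0 here
            doc_id, freq = plist[depth]
            threshold += freq
            if doc_id in seen:
                continue
            seen.add(doc_id)
            # Random access: compute the document's full score right away.
            score = sum(doc_term_freqs[doc_id].get(t, 0) for t in query_terms)
            entry = (score, -doc_id)
            if len(best) < k:
                heapq.heappush(best, entry)
            elif entry > best[0]:
                heapq.heapreplace(best, entry)
        depth += 1
        # Stop once no unseen document can beat the current k-th best score.
        if len(best) >= k and best[0][0] >= threshold:
            break
    ranked = sorted(best, reverse=True)
    return [(-neg_id, score) for score, neg_id in ranked]


# Hypothetical per-document term frequencies.
docs = {
    1: {"archive": 5, "search": 1},
    2: {"archive": 1, "search": 4},
    3: {"archive": 3, "search": 3},
    4: {"search": 1},
}
postings = build_postings(docs)
print(top_k(postings, docs, ["archive", "search"], 2))
```

With this sample data the scan stops after reading two entries from each list, because by then the second-best score already matches the bound on anything unseen. Ties are broken in favor of the smaller document ID, which is an arbitrary choice.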

4 Internet search
- Search for relevant web pages
  - Good answers:
    - Relevant
    - Popular
- Public domain knowledge
- Search engines are critical to Internet use
  - Internal workings are secret
  - Tremendous political, economic, and cultural power

5 Enterprise search
- Search the enterprise information systems for the right information
- Enterprise information
  - Internal web pages
  - Internal documentation systems
  - File systems
  - Databases
  - Email servers
- The Internet and enterprise domains differ fundamentally
  - Contents
  - User behavior
  - Economic motivations

6 Top-K Query
- Objective
  - How to determine the top K objects that are most likely (approximately) related to the given query
- Applications
  - Information retrieval
  - Internet and enterprise searches
  - Multimedia similarity search
  - Scheduling large-scale on-demand data broadcast
  - …

9 Development of Enterprise Search Systems

10 LogicSQL Enterprise Information Archive and Search System
- LogicSQL
  - An object-relational database management system
    - New concurrency control algorithm
    - Staged database architecture
  - Developed at the University of Alberta
  - Commercialized by Shanghai Shifang Software Co.

11 Enterprise Archive and Search System
- To archive all the enterprise information contents
  - File systems
  - Web pages
  - Emails
  - Internal documents
  - Database records?
- To provide a web-style search engine
- To support user-specified ranking algorithms
  - Focus on the platform for archive and search
  - Easy implementation and testing of various ranking algorithms

12 Enterprise Archive and Search System
- Extend the database functionalities
  - Security model
    - Users, roles + security handle
    - Security primary key
  - New database objects
    - Inverted indexes
      - CREATE INVERTED INDEX
      - DROP INVERTED INDEX
      - Automatic population, similar to that of an index
      - ORDER BY clause
    - User-specified aggregate functions
      - CREATE AGGREGATE FUNCTION
  - Top-K query evaluation
- Specified crawlers
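The combination of a user-specified aggregate function with an ORDER BY clause can be approximated in an off-the-shelf engine. The sketch below uses SQLite through Python's sqlite3 module, whose `create_aggregate` call registers a Python class as an SQL aggregate. This only illustrates the idea and is not LogicSQL syntax; the `weighted_score` function, the `postings` and `weights` tables, and the weights themselves are hypothetical.

```python
import sqlite3


class WeightedScore:
    """Hypothetical ranking aggregate: sums freq * weight over a document's postings."""

    def __init__(self):
        self.total = 0.0

    def step(self, freq, weight):
        # Called once per matching posting row in the group.
        self.total += freq * weight

    def finalize(self):
        # Called once per group to produce the document's score.
        return self.total


conn = sqlite3.connect(":memory:")
conn.create_aggregate("weighted_score", 2, WeightedScore)

conn.execute("CREATE TABLE postings (term TEXT, doc_id INTEGER, freq INTEGER)")
conn.execute("CREATE TABLE weights (term TEXT PRIMARY KEY, weight REAL)")
conn.executemany(
    "INSERT INTO postings VALUES (?, ?, ?)",
    [("archive", 1, 2), ("search", 1, 1), ("search", 2, 3), ("archive", 3, 1)],
)
# The weights table doubles as the query: only listed terms contribute.
conn.executemany(
    "INSERT INTO weights VALUES (?, ?)",
    [("archive", 2.0), ("search", 1.0)],
)


def top_k(conn, k):
    """Rank documents with the user-defined aggregate and keep the best k."""
    rows = conn.execute(
        "SELECT p.doc_id, weighted_score(p.freq, w.weight) AS score "
        "FROM postings p JOIN weights w ON w.term = p.term "
        "GROUP BY p.doc_id "
        "ORDER BY score DESC, p.doc_id "
        "LIMIT ?",
        (k,),
    )
    return rows.fetchall()


print(top_k(conn, 2))
```

Swapping in a different ranking algorithm then amounts to registering another aggregate class, which matches the stated goal of making ranking algorithms easy to implement and test. Note that this query scores every matching document before sorting; a dedicated top-K evaluation would aim to avoid that full scan.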

13 Enterprise Archive and Search System
- User configuration
  - Set up crawlers
  - Create a list of inverted indexes
  - Create one aggregate function for object ranking
- Extend the query languages
  - Implement the top-K query algorithm
- Web-based query pages