Aardvark Anatomy of a Large-Scale Social Search Engine.

Slides:



Advertisements
Similar presentations
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Advertisements

Q&A over Social Networks KSE 801 Uichin Lee. Aardvark: The Anatomy of a Large- Scale Social Search Engine Damon Horowitz, Aardvark Sepandar D. Kamvar,
Web Search - Summer Term 2006 III. Web Search - Introduction (Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
Information Retrieval in Practice
Search Engines and Information Retrieval
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 The Architecture of a Large-Scale Web Search and Query Engine.
CS 345A Data Mining Lecture 1 Introduction to Web Mining.
Web Search – Summer Term 2006 III. Web Search - Introduction (Cont.) - Jeff Dean, Google's Systems Lab:
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Information Retrieval in Practice
ISP 433/633 Week 7 Web IR. Web is a unique collection Largest repository of data Unedited Can be anything –Information type –Sources Changing –Growing.
Search engines. The number of Internet hosts exceeded in in in in in
Information Retrieval
Overview of Search Engines
A Social Help Engine for Online Social Network Mobile Users Tam Vu, Akash Baid WINLAB, Rutgers University May 21,
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
SUU CSIS 1000 Created by: Dave Barney, Semantic Discovery.
ENSC 105: PROCESS, FORM, AND CONVENTION IN PROFESSIONAL GENRES Spring 2012 Jenna Walsh Engineering and Computing Science Librarian, Surrey Campus
Search Engines and Information Retrieval Chapter 1.
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
The Internet as a Search Tool Prepared by Ms. Emery, Summer 2006 With billions of web pages online, you could spend a lifetime surfing the Web, following.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Creating Collaborative Partnerships
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
What to Know: 9 Essential Things to Know About Web Searching Janet Eke Graduate School of Library and Information Science University of Illinois at Champaign-Urbana.
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Natural Search User Interfaces Prof. Marti Hearst UC Berkeley March/April, 2012 Book full text freely available at:
The Internet Do you really know what is out there?
Search Engine Architecture
Search Engines Reyhaneh Salkhi Outline What is a search engine? How do search engines work? Which search engines are most useful and efficient? How can.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
The Savvy Cyber Teacher ® Using the Internet Effectively in the K-12 Classroom 1 Copyright © 2003 Stevens Institute of Technology, CIESE, All Rights Reserved.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
OCLC Online Computer Library Center 1 Social Media and Advocacy.
SEO Who knew 3 letters could mean so much?. What is SEO? Search Engine Optimization (SEO) is the practice of improving and promoting a web site in order.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Using Search Engines. What is a search engine? A program that searches documents for specific keywords After its search, it gives you a list of the documents.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Image Classification for Automatic Annotation
Digital Literacy Concepts and basic vocabulary. Digital Literacy Knowledge, skills, and behaviors used in digital devices (computers, tablets, smartphones)
Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
Cs Future Direction : Collaborative Filtering Motivating Observations:  Relevance Feedback is useful, but expensive a)Humans don’t often have time.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Internet Searching the World Wide Web. The Internet and the World Wide Web The Internet is a worldwide collection of networks that allows people to communicate.
Damon Horowitz, Sepandar D. Kamvar. WWW 2010 Barak Weiss67877 SDBI 2011 HUJI Does the title sound familiar?
Wading Through the Web Conducting Quality Research on the Internet.
CPS 49S Google: The Computer Science Within and its Impact on Society Shivnath Babu Spring 2007.
Searching the Web for academic information Ruth Stubbings.
Proposal for Term Project
Search Engine Architecture
Information Retrieval
Data Mining Chapter 6 Search Engines
Introduction to Information Retrieval
Search Engine Architecture
Information Retrieval and Web Design
Information Retrieval and Web Design
Presentation transcript:

Aardvark Anatomy of a Large-Scale Social Search Engine

The Library vs. The Village The Library Keywords are used to search Knowledge base comes from a small number of publishers Content is created before the question is asked Trust is based on Authority Traditional search engines (e.g. Google) The Village Questions are phrased in natural language Answers are generated in real time Anyone in the community might answer Trust is based on intimacy

Purpose Harness the power of the village to answer questions not easily answered with traditional search engines. Works best for subjective questions “What is a good Italian restaurant in the north end of Boston with live entertainment on Fridays? Utilize the power of mobile devices to provide quick answers from knowledgeable users.

Aardvark Components Crawler / Indexer Finds and labels resources Users, not documents Query Analyzer Classifies queries Filters out non-questions, trivial questions, inappropriate questions Determines if the query is specific to a location Determines the topic Uses natural language processing to find salient phrases and determine what is semantically significant. Uses a taxonomy of popular topics Ranking Function Ranks resources (users) to determine which provide the best information on a topic UI Web, IM, Mobile devices

Aardvark Architecture

User Experience Approached directly, via IM or Mobile Application to answer a specific question they should know something about. Answerers are found within the users social network Interaction is real-time One on One conversation

Ranking Similar to traditional ranking in concept, a statistical probability that the user can answer a question on a topic is computed. Also takes into account the “connectedness” of the users. Asks for feedback as to the quality of the question after it is answered.

Does it work? 87.7% of questions received an answer 57.2% answered in less than 10 minutes 70.4% of answers were ranked “good” In my experience, works great for: Idea generator Getting pointed in the right direction Getting opinions on subjective topics

References “Anatomy of a Large-Scale Social Search Engine” by Damon Horowitz and Sepandar Kumar Try it yourself –