BIG: A Resource-Bounded Information Gathering Agent Victor Lesser, Bryan Horling, Frank Klassner, Anita Raja, Tom Wagner, Shelley Zhang Multi-Agent Systems.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
T.Sharon-A.Frank 1 Internet Resources Discovery (IRD) Shopping Agents.
Bringing Order to the Web: Automatically Categorizing Search Results Hao Chen SIMS, UC Berkeley Susan Dumais Adaptive Systems & Interactions Microsoft.
Meta-Level Control in Multi-Agent Systems Anita Raja and Victor Lesser Department of Computer Science University of Massachusetts Amherst, MA
Natural Language Processing WEB SEARCH ENGINES August, 2002.
SRTA: The Soft-Real Time Agent Control Architecture Bryan Horling, Victor Lesser, Regis Vincent, Thomas Wagner presented by Anita Raja.
Data warehouse example
Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara
Information Retrieval in Practice
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Mgt 240 Lecture Decision Support Systems March 3, 2005.
CHAPTER 6 SECONDARY DATA SOURCES. Important Topics of This Chapter Success of secondary data. To understand how to create an internal database. To distinguish.
T.Sharon-A.Frank 1 Internet Resources Discovery (IRD) Concrete Learning Agents.
8 Systems Analysis and Design in a Changing World, Fifth Edition.
Building Knowledge-Driven DSS and Mining Data
Yimam & Kobsa July 13, 2000TWIST 2000 Centralization vs. Decentralization Issues in Internet-based KMS: Experiences from Expertise Recommender Systems.
Connecting Diverse Web Search Facilities Udi Manber, Peter Bigot Department of Computer Science University of Arizona Aida Gikouria - M471 University of.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Systems Analysis and Design: The Big Picture
Collaboration and Content Customer solution case study The Yaroslavl region Government creates knowledge base of public authorities of the Yaroslavl region.
Jane Hsu 『資訊檢索技術的新驅勢』研討會 智慧型代理人 Intelligent Agents 許永真 臺灣大學資訊工程研究所 October 22, 1998.
MSF Requirements Envisioning Phase Planning Phase.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Web Categorization Crawler Mohammed Agabaria Adam Shobash Supervisor: Victor Kulikov Winter 2009/10 Design & Architecture Dec
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
1999 Asian Women's Network Training Workshop Tools for Searching Information on the Web  Search Engines  Meta-searchers  Information Gateways  Subject.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
Internet Information Retrieval Sun Wu. Course Goal To learn the basic concepts and techniques of internet search engines –How to use and evaluate search.
SharePoint 2010 Search Architecture The Connector Framework Enhancing the Search User Interface Creating Custom Ranking Models.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
Distributed Information Retrieval Using a Multi-Agent System and The Role of Logic Programming.
Data Mining By Dave Maung.
Internet Research Tips Daniel Fack. Internet Research Tips The internet is a self publishing medium. It must be be analyzed for appropriateness of research.
Topical Categorization of Large Collections of Electronic Theses and Dissertations Venkat Srinivasan & Edward A. Fox Virginia Tech, Blacksburg, VA, USA.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Search Tools and Search Engines Searching for Information and common found internet file types.
1 Context-Aware Internet Sharma Chakravarthy UT Arlington December 19, 2008.
Searching the World Wide Web: Meta Crawlers vs. Single Search Engines By: Voris Tejada.
Medical Information Retrieval: eEvidence System By Zhao Jin Mar
Multiagent System Katia P. Sycara 일반대학원 GE 랩 성연식.
By R. O. Nanthini and R. Jayakumar.  tools used on the web to find the required information  Akeredolu officially described the Web as “a wide- area.
CP3024 Lecture 12 Search Engines. What is the main WWW problem?  With an estimated 800 million web pages finding the one you want is difficult!
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
General Architecture of Retrieval Systems 1Adrienn Skrop.
Seminar on seminar on Presented By L.Nageswara Rao 09MA1A0546. Under the guidance of Ms.Y.Sushma(M.Tech) asst.prof.
WEB BASED DSS Aaron Atuhe. KEY CONCEPTS When software vendors propose implementing a Web-Based Decision Support System, they are referring to a computerized.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Information Retrieval in Practice
Systems Analysis and Design in a Changing World, Fifth Edition
Simple and intuitive fare conditions
TÆMS-based Execution Architectures
Discovering User Access Patterns on the World-Wide Web
Systems Analysis – ITEC 3155 Evaluating Alternatives for Requirements, Environment, and Implementation.
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Chattrakul Sombattheera
Chapter 1 (pages 4-9); Overview of SDLC
Data Warehousing and Data Mining
I don’t need a title slide for a lecture
Web Mining Research: A Survey
Decision Support Systems
Context-Aware Internet
Presentation transcript:

BIG: A Resource-Bounded Information Gathering Agent Victor Lesser, Bryan Horling, Frank Klassner, Anita Raja, Tom Wagner, Shelley Zhang Multi-Agent Systems Laboratory University of Massachusetts, Amherst

Multi-Agent Systems Lab, University of Massachusetts Talk Outline  Information Gathering problem.(Motivation)  The BIG Agent. l Interpretation. l Architecture & Components. l Sample Trace.  Performance Evaluation.  Integration Lessons & Future Work.

Multi-Agent Systems Lab, University of Massachusetts 3 Motivation  Rapid growth of WWW.  Growth has outstripped technology.  Information Retrieval technology a start. l Efficient, fast, general. l Access to enormous amount of data. (Alta Vista has indexed 125 million documents). l Browsing & processing documents manually non-trivial.

Multi-Agent Systems Lab, University of Massachusetts 4 The BIG Agent  BIG (resource Bounded Information Gathering) l Takes role of human in support of decision process. l Integration of Planning, scheduling, text processing and interpretation style reasoning. l Helps pick software packages.

Multi-Agent Systems Lab, University of Massachusetts 5 Sample Query  Input l Word processing package for a Mac. l $200 price limit. l Search process should take 10 min. & cost less than $5.

Multi-Agent Systems Lab, University of Massachusetts 6 The BIG Agent  Salient Features l Active search and discovery. l Resource Bounded Reasoning. l Goal-driven and Opportunistic control. l Information extraction and fusion..

Multi-Agent Systems Lab, University of Massachusetts 7 Sample Trace, Cont.  BIG recommends Corel WP3.5

Multi-Agent Systems Lab, University of Massachusetts 8 Information Gathering as Interpretation  Constructing high-level models from low-level data.  Information Gathering is an instance of this class. l Constructive problem solving. l Information fusion. l Sources of Uncertainty.  Tension between opportunism and planned action.

Multi-Agent Systems Lab, University of Massachusetts 9 BIG Agent Architecture

Multi-Agent Systems Lab, University of Massachusetts 10 BIG Components  Task Assessor l Forms initial plan, but not main planner. l Manages balance between opportunism & end-to-end.  Object & Server Database l Stores software product models l Models WWW sites. l Learns through persistence.  Document Classifiers l Distraction phenomenon caused by vendors.  Information Extractors l Builds/extracts structured data from unstructured text. l Extractors have varying tradeoffs and costs.

Multi-Agent Systems Lab, University of Massachusetts 11 TAEMS Task structure

Multi-Agent Systems Lab, University of Massachusetts 12 BIG Components, Cont.  TAEMS Modeling Framework l Domain-independent medium of exchange. l Hierarchical, statistically characterizes actions and alternatives.  Design-to-Criteria Scheduler Tradeoffs of different possible solution paths. l Builds custom schedules to meet a particular solution.  RESUN Planner l Blackboard interpretation planner. l Resolves sources of uncertainty. l Opportunistic problem solving.

Multi-Agent Systems Lab, University of Massachusetts 13 Sample Query  Input l Word processing package for a Mac. l $200 price limit. l Search process should take 10 min. & cost less than $5. l Product Quality attributes like usefulness, stability, ease of use, power features, etc.

Multi-Agent Systems Lab, University of Massachusetts 14 A Sample Trace Decision Maker Results & Supporting Data PlannerSchedulerExecutor Updates User RetrievesAssimilatesProcesses/Extracts Replans & Reschedules User Interface 132 Replans & Reschedules  Step 1: Task assessor forms skeletal plan.  Step 2: Plan scheduled by DTC scheduler.  Step 3: RESUN begins execution.

Multi-Agent Systems Lab, University of Massachusetts 15 Sample Trace, Cont.  Step 4: Queries issued l Parallel requests to MacZone (53) and Cyberian Outpost (61). l URLs returned used to build document-description info.  Step 5: 3 documents retrieved l Document length, recency, and site quality as criteria.  Step 6: Documents classified l Rejected children’s educational package for improving writing skills. l Rejected drawing/wp package by Corel. (dubious?)

Multi-Agent Systems Lab, University of Massachusetts 16 Sample Trace, Cont.  Step 7: 3 Text extractors execute l Produce Nisus Writer object. Product Name:Nisus Writer 5.1 Company Name:Nisus Price: $54.95 Processor:Mac Platform:Macintosh Processing Accuracy(Degree of Belief) range( ) GENRES = 0PRODUCTID=0.8COMPANYID=1.0 PRICE=1.0PROCESSOR=0.8DISKSPACE=0 PLATFORM=0.7

Multi-Agent Systems Lab, University of Massachusetts 17 Sample Trace, Cont.  Step 8-11: Gather more information. l Of remaining 111 document candidates, 4 are selected and retrieved, and classified. 2 are rejected. Extraction is highly uncertain & no new objects are produced.  Step 12-14: Processing new information. l 7 more document candidates are selected, retrieved, classified, and processed, producing 2 more objects:

Multi-Agent Systems Lab, University of Massachusetts 18 Sample Trace, Cont. Product Name:Corel WordPerfect 3.5 ACADEMIC Company name:Corel Price: $29.95 Platform: Mac/PwrMac Processing Accuracy(Degree of Belief): GENRES=0 PRODUCTID=0.8 COMPANYID=1.0 PRICE=1.0 PROCESSOR=0.8 DISKSPACE=0 PLATFORM=0.8 Product Name:Nisus Writer 5.1 Upgrade from 5.0 Company Name:Nisus Price:$29.95 Platform:Macintosh Processing Accuracy(Degree of Belief): GENRES=0 PRODUCTID=0.8 COMPANYID=1.0 PRICE=1.0 PROCESSOR=0.6 DISKSPACE=0 PLATFORM=0.8

Multi-Agent Systems Lab, University of Massachusetts 19 Sample Trace, Cont.  Step 15: Review gathering phase l Reviews retrieved, processed and extraction fills slots for Overall quality, usefulness, future usefulness. Ease of use, power features. Stability, enjoyability and value. l For each review, a pair is associated with the object.

Multi-Agent Systems Lab, University of Massachusetts 20 Sample Trace, Cont. Results & Supporting Data PlannerSchedulerExecutor Updates User RetrievesAssimilates Replans & Reschedules User Interface 12 4,57 Processes/Extracts Decision Maker  Step 16: Decision phase l Prune incomplete objects, discrepancy resolution. l Model includes number of products, coverage, quality, accuracy, and confidence of information.

Multi-Agent Systems Lab, University of Massachusetts 21 Sample Trace, Cont.  Step 17: BIG recommends Corel WP3.5

Multi-Agent Systems Lab, University of Massachusetts 22 Performance Evaluation

Multi-Agent Systems Lab, University of Massachusetts 23 Integration Lessons  Integration of the different AI problem solvers.  Backend processing for Information Extractor.  Integrated document classifier.  Modeling problems with the TAEMs.  Balance of goal driven and opportunisitic view.  Information fusion and Reasoning.  Learning component.

Multi-Agent Systems Lab, University of Massachusetts 24 Limitations and Future Work  Limitations: l Extraction is hard. l New domains require more training for extraction.  Future Work l More opportunism. l Decision confidence. l Multi-agent approach.

Multi-Agent Systems Lab, University of Massachusetts 25 Advantages of Document Classification

Multi-Agent Systems Lab, University of Massachusetts 26 Sample TAEMS task structure

Multi-Agent Systems Lab, University of Massachusetts 27 Related Work  “Moving Up the Food Chain”(Etzioni,AAAI 1996)  Meta Search Engines l Parallel queries, fast, coverage.  Personal Information Agents l Simple text processing, returns relevant list of URLs.  Shopping Agents l Specialized, price comparisons. World Wide Web Indices & Directories Agents & Softbots AltaVista, Yahoo Meta Crawler, Bargain Finder

Multi-Agent Systems Lab, University of Massachusetts 28 Strengths,Limitations and Future Work  Strengths: l Information extraction and fusion. l Incorporation of discovered information into process. l Representing and planning to resolve sources of uncertainty. l Ability to address deadlines and resource constraints. l Learning through experience.

Multi-Agent Systems Lab, University of Massachusetts 29 BIG Components, Cont.  Web Retrieval Interface l Gather URLs and interact with forms.  Document Classifiers l Distraction phenomenon caused by vendors.  Information Extractors l Builds/extracts structured data from unstructured text. l Extractors have varying tradeoffs and costs.  Decision Maker l Model of human decision process. l Considers preferences and confidence in information.