Information Retrieval – Introduction and Survey Norbert Fuhr University of Duisburg-Essen Germany

Slides:



Advertisements
Similar presentations
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Information Society Technologies Third Call for Proposals Norbert Brinkhoff-Button DG Information Society European Commission Key action III: Multmedia.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Towards Adaptive Web-Based Learning Systems Katerina Georgouli, MSc, PhD Associate Professor T.E.I. of Athens Dept. of Informatics Tempus.
ENTERPRISE SEARCH AND ITS VALUE TO THE ENTERPRISE Lee Atkinson or why search and retrieval of ‘relevant’ information is only the start in meeting the business.
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Florence Workshop, 16th June 2003 The MESMUSES portal of La Cité des Sciences et de l’Industrie MESMUSES : Involvement of the Mediathèque at the “Cité.
Search Engines and Information Retrieval
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Web Mining Research: A Survey
Spring 2008MMDB - Introduction1 Introduction: Multimedia Databases.
Web Mining Research: A Survey
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
Web Mining Research: A Survey
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Information Retrieval – Introduction and Survey Norbert Fuhr University of Duisburg-Essen Germany
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
Search Engines and Information Retrieval Chapter 1.
1 The BT Digital Library A case study in intelligent content management Paul Warren
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
DL.org All WGs Meetings, Rome, May 2010 Quality Interoperability Approaches, case studies and open issues DL.org Quality Working Group Rome, 28 th.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2006.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
D AFFODIL Strategic Support Evaluated Claus-Peter Klas Norbert Fuhr Andre Schaefer University of Duisburg-Essen.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Software Acquisition and Project Management Lesson I: Introduction.
MIND: An architecture for multimedia information retrieval in federated digital libraries Henrik Nottelmann University of Dortmund, Germany.
Andreas Abecker Knowledge Management Research Group From Hypermedia Information Retrieval to Knowledge Management in Enterprises Andreas Abecker, Michael.
Research Topics/Areas. Adapting search to Users Advertising and ad targeting Aggregation of Results Community and Context Aware Search Community-based.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Conceptual structures in modern information retrieval Claudio Carpineto Fondazione Ugo Bordoni
Contact : Bernadette Bouchon-Meunier, Patrick Gallinari, Jean-Gabriel Ganascia LIP6, UPMC, 8 rue du Capitaine Scott, Paris, France
Information Retrieval
L&I SCI 110: Information science and information theory Instructor: Xiangming(Simon) Mu Sept. 9, 2004.
Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.
Contextual Text Cube Model and Aggregation Operator for Text OLAP
1 DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen, Germany.
User Errors in Formulating Queries and IR Techniques to Overcome Them Birger Larsen Information Interaction and Information Architecture Royal School of.
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Digital Video Library - Jacky Ma.
Information Day on “Search Engines for Audio-Visual Content”
Proposal for Term Project
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Data Warehousing and Data Mining
DBMS with probabilistic model
Exploratory search: New name for an old hat?
Interactive Information Access to Federations of DL Services
CSE 635 Multimedia Information Retrieval
Web Mining Department of Computer Science and Engg.
Introduction to Information Retrieval
Search Engine Architecture
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

Information Retrieval – Introduction and Survey Norbert Fuhr University of Duisburg-Essen Germany

What is Information Retrieval? “Information Retrieval deals with uncertainty and vagueness in information systems” (IR Specialist Group of German Informatics Society, 1991) Uncertain representations of the semantics of objects (text, images,…) Vague specifications of information needs (iterative querying) 1. Area definition 2. Global information access 3. Contextual retrieval

What to Retrieve? “Retrieve that amount of knowledge which a user needs in a specific situation for solving his/her current problem” (Kuhlen 1991) Consider specific user, situation and problem  contextual retrieval How to get this information  global information access Workshop “Challenges in Information Retrieval and Language Modeling”, Area definition 2. Global information access 3. Contextual retrieval

Global information access “Satisfy human information needs through natural, efficient interaction with an automated system that leverages world- wide structured and unstructured data in any language.” 1. Area definition 2. Global information access 3. Contextual retrieval

Information access Information properties media structure heterogeneity Access methods 1. Area definition 2. Global information access 3. Contextual retrieval

Information Media Text Facts 2D: graphics, images Speech Video 3D 1. Area definition 2. Global information access 3. Contextual retrieval

Information structure Unstructured Semi-structured (XML) Fully structured Hyperlinked (Web) 1. Area definition 2. Global information access 3. Contextual retrieval

Heterogeneity Language: multilingual Media: multimedia Heterogeneous structures Heterogeneous services 1. Area definition 2. Global information access 3. Contextual retrieval

Information Access Methods Ad-hoc retrieval One time queries (e.g. Web search) Filtering/Routing Constant search profile (e.g. Spam filtering)

Information Access (2): BDCEBDCDAAEAB Topic Detection and Tracking: Cluster news in stream Categorization/Clustering: Group documents into predefined classes/ adaptive clusters

Information Access(3): Summarization for browsing / survey on retrieval results

Info. Access(4): Inform. Extracti on A.C Nielsen Co. said George Garrick, 40 years old, president of Information Resources Inc.'s London-based European Information Services operation, will become president and chief operating officer of Nielsen Marketing Research USA, a unit of Dun & Bradstreet Corp. He succeeds John Costello, who resigned in March

Inform. Access(5): Question answering Find text passage answering fact query

Current IR Research Access methods MediaStructureHeterogeneity … focuses on models, methods and systems for information properties and access methods: 1. Area definition 2. Global information access 3. Contextual retrieval

Contextual retrieval “Combine search technologies and knowledge about query and user context into a single framework in order to provide the most appropriate answer for a user’s information needs.” 1. Area definition 2. Global information access 3. Contextual retrieval

Considering Context social context work context time 1. Area definition 2. Global information access 3. Contextual retrieval

Time-dependence Batch retrieval Constant information needs (Filtering  adaptation) Interactive retrieval Personalization: Preferences Seen items Evolving interests 1. Area definition 2. Global information access 3. Contextual retrieval

Interactive retrieval: Levels of search activities 1. Move: Low-level search function (e.g. type in search term, view retrieved document) 2. Tactic: several moves to further a search (e.g. broaden/narrow a query) 3. Stratagem: set of actions on a single domain (e.g. citation database, tables of contents of journals) 4. Strategy: complete plan for satisfying an information need (e.g. subject search, browse relevant journals, find referenced articles) 1. Area definition 2. Global information access 3. Contextual retrieval

Interactive Retrieval: Current Research Evaluation results: quality differences between methods in batch retrieval vanish in interactive retrieval Empirical studies: information seeking as a sequence of interconnected but diverse searches Specific methods for interactive retrieval required: information seeking: ‘berrypicking’ tactics & stratagems 1. Area definition 2. Global information access 3. Contextual retrieval

Work context Context-free Task-specific searches Workflow (application-specific) 1. Area definition 2. Global information access 3. Contextual retrieval

Workflow: Generic problem solving scheme 1. Problem understanding (Hypermedia system with introductory/survey articles) 2. Identification of possible solutions (Hierarchical hypermedia system) 3. Selection of optimum solution (Information retrieval system)  integrated systems required 1. Area definition 2. Global information access 3. Contextual retrieval

Workflow example: Digital Library Life Cycle Discover Retrieve CollateInterpret Re - Present Metalibray IR/Hypertext system Personal/group libraryAnnotations, discussion threads Authoring system 1. Area definition 2. Global information access 3. Contextual retrieval

Social context Single user (Fixed) user groups Collaborative information access (Open) communities 1. Area definition 2. Global information access 3. Contextual retrieval

Context dimensions social work time batch retrieval personalization interactive retrieval application workflow ad-hoc retrieval single users teams communities generic problem solving 1. Area definition 2. Global information access 3. Contextual retrieval

Conclusion Global information access Focus of current research Contextual retrieval Promises significant quality improvements More research necessary

Organisation Vorlesung: für Kommedia-Studenten (alte PO) nur bis Mitte des Semesters Übungen freiwillig für Kommedia verpflichtend für DAI „Sage es mir, und ich vergesse es; zeige es mir, und ich erinnere mich; lass' es mich tun, und ich behalte es.“ (Konfuzius)

Organisation(2) Prüfung/Leistungsnachweis: Kommedia: zusammen mit 2. Informatik- Fach DAI: Leistungskontrolle: mündlich, im September