Rutgers Components Phase 2 Principal investigators –Paul Kantor, PI; Design, modelling and analysis –Kwong Bor Ng, Co-PI - Fusion; Experimental design.

Slides:

Advertisements

Similar presentations

IB Portfolio Tasks 20% of final grade

Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Developing Science Skills. Preparing for Tasks Level DLevel ELevel F individually or in small groups will identify two or three questions to investigate.

Developing and Evaluating a Query Recommendation Feature to Assist Users with Online Information Seeking & Retrieval With graduate students: Karl Gyllstrom,

UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.

Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.

Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov

Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara

A Robust Process Model for Calculating Security ROI Ghazy Mahjub DePaul University M.S Software Engineering.

Information Retrieval in Practice

Search Engines and Information Retrieval

© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.

© Tefko Saracevic, Rutgers University1 Interaction in information retrieval There is MUCH more to searching than knowing computers, networks & commands,

Modern Information Retrieval

Model Personalization (1) : Data Fusion Improve frame and answer (of persistent query) generation through Data Fusion (local fusion on personal and topical.

The Data Mining Visual Environment Motivation Major problems with existing DM systems They are based on non-extensible frameworks. They provide a non-uniform.

1 CS 430 / INFO 430 Information Retrieval Lecture 24 Usability 2.

Law Enforcement Resource Allocation (LERA) Visualization System Michael Welsman-Dinelle April Webster.

An investigation of query expansion terms Gheorghe Muresan Rutgers University, School of Communication, Information and Library Science 4 Huntington St.,

Overview of Search Engines

Customer Focus Module Preview

Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.

GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.

AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.

CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.

Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.

ASSESSING READING AND THE ROLE OF APP PGCE (FT) - Week 4.

Chapter 2 The process Process, Methods, and Tools

1 CSE 2102 CSE 2102 CSE 2102: Introduction to Software Engineering Ch9: Software Engineering Tools and Environments.

Search Engines and Information Retrieval Chapter 1.

Evaluation Experiments and Experience from the Perspective of Interactive Information Retrieval Ross Wilkinson Mingfang Wu ICT Centre CSIRO, Australia.

Tomek Strzalkowski & Sharon G. Small ILS Institute, SUNY Albany LAANCOR May 22, 2010 (Tacitly) Collaborative Question Answering Utilizing Web Trails 5/22/10.

Quality Control Project Management Unit Credit Value : 4 Essential

1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.

Unit 1 University of Sunderland COMM80 Risk Assessment of Systems Change Risk Aspects and Context Covered in the Module COMM80: Risk Assessment of Systems.

Implementation and process evaluation: developing our approach Ann Lendrum University of Manchester Neil Humphrey University of Manchester Gemma Moss Institute.

Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.

CISC Machine Learning for Solving Systems Problems Presented by: Alparslan SARI Dept of Computer & Information Sciences University of Delaware

Illustrations and Answers for TDT4252 exam, June

MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.

Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.

Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI

Search Result Interface Hongning Wang Abstraction of search engine architecture User Ranker Indexer Doc Analyzer Index results Crawler Doc Representation.

+ Evidence Based Practice University of Utah Evidence-Based Treatment and Practice: New Opportunities to Bridge Clinical Research and Practice, Enhance.

Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.

Advanced Software Engineering Lecture 4: Process & Project Metrics.

User Interface Design Patterns: Part 1 Kirsten McCane.

Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.

Relevance Feedback Hongning Wang

Unclassified//For Official Use Only 1 RAPID: Representation and Analysis of Probabilistic Intelligence Data Carnegie Mellon University PI : Prof. Jaime.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Software Engineering (CSI 321) Software Process: A Generic View 1.

Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.

Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.

哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.

C ONTEXT AWARE SMART PHONE YOGITHA N. & PREETHI G.D. 6 th SEM, B.E.(C.S.E) SIDDAGANGA INSTITUTE OF TECHNOLOGY TUMKUR

Introduction to Machine Learning, its potential usage in network area,

WP4 Models and Contents Quality Assessment

WP4 - INERTIA Aggregators Monitoring, Management and Control Hub

A Visualization Tool for fMRI Data Mining

Tracking parameter optimization

MURI Annual Review Meeting Randy Moses November 3, 2008

CSE 635 Multimedia Information Retrieval

MONITORING MESSAGE STREAMS: RETROSPECTIVE AND PROSPECTIVE EVENT DETECTION Rutgers/DIMACS improve on existing methods for monitoring huge streams of textualized.

MONITORING MESSAGE STREAMS: RETROSPECTIVE AND PROSPECTIVE EVENT DETECTION Rutgers/DIMACS improve on existing methods for monitoring huge streams of textualized.

MONITORING MESSAGE STREAMS: RETROSPECTIVE AND PROSPECTIVE EVENT DETECTION Rutgers/DIMACS improve on existing methods for monitoring huge streams of textualized.

Presentation transcript:

Rutgers Components Phase 2 Principal investigators –Paul Kantor, PI; Design, modelling and analysis –Kwong Bor Ng, Co-PI - Fusion; Experimental design –Nina Wacholder, Co-PI; linguistic foundations for modelling

Key Components Adaptive personalization to analyst, task and context Improve effectiveness of information access for question answering -- data fusion of IR methods Improve effectiveness of characterizing document qualities, tuned to specific analyst’s persepctives

Model Personalization (1) : Robust Information Access & Data Fusion For a persistent query, improve frame and answer generation through Data Fusion (local fusion with person, task, topic feedback) and Interactive Relevance Feedback. In stage 1, we have demonstrated effective data fusion into HITIQA to optimize the rate of useful paragraph extraction. In stage 2, the emphasis will be on exploiting user judgments over time to adjust fusion parameters chronologically, with a time-sensitive weighting scheme, to fit the evolving perspective of the analyst on the task, topic an context.

Model Personalization (2) : Document Quality Aspects Personalization of the automatic document quality aspect assessment algorithm, through advanced statistical analysis and machine learning, to identify (1) global quality aspect predictors, (2) a general formal model of quality aspect assessment, and (3) personal parameters settings for individual preference. At stage1, we have established a effective models for estimation of some document qualities, based on textual features and linguistic patterns in a document. While global models do “better than chance”, for high acuracy models must be personalized. In stage 2, we will expand identification of good predictive variables for quality aspects, with emphasis on a local level: to encapsulate the personal mental model of an analyst.

Model Personalization (3): Integration through Experiment We will integrate the personalization and other mechanisms into a single interface, by converting related functionalities into position and iconic information in the user display. At stage 2, focusing on the analyst with a persistent query, we will investigate the impacts of interface options on analyst satisfaction and task effectiveness, to identify the best combination strategy, and to establish effectiveness measures on a personal level.

Sophisticated Statistical Techniques Sophisticated statistical methods (Design of experiment, ANOVA, multiple comparisons by Scheffe and Tukey’s method, and orthogonal arrays) will reduce the number of experimental configurations to be studied. Instead of a case-by-case attention to “failure analysis” the design will focus on how to neutralize negative effects to obtain more accurate evaluations and design selection with fewer experiments

Language Features for Quality Aspects. Expand a scheme, now being developed, for characterizing “aspects” or “facets” of topics. These will be different for e.g. WMD or Biography. Aspects are signalled by the presence of adjective classes. These classes are being defined now, and will be expanded in the proposed work.

Using Language Features With a more refined model of the relation of adjectives to aspects, the system will be better able to “understand” classes that the analyst defines, and to flag further occurences in an incoming message stream.

A note on retrieval fusion Retrieval fusion will be made interactive with a small Java display, now under development, that tracks the contribution of each retrieval scheme to providing useful information. An interactive feature permits the analyst to highlight a region in the “fusion space” for further investigation.

Mock-up Fusion Interface System 2 System 1 1. HITIQA’s Initial retrieval uses both systems. [The occupied region here represents the LOGICAL OR rule. Each document is represented by a small circle. As a passage is marked relevant by the users, the document it came from is flagged (here shown in yellow). 2. The analyst perceives that many of the useful passages came from documents that are clustered near the inner corner, and using the interface tool, draws an extended retrieval region (shown here by the dotted orange box) which HITIQA now explores. =not relevant = relevant 2.5 inches