CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.

Slides:

Advertisements

Similar presentations

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Advertisements

CONTRIBUTIONS Ground-truth dataset Simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser) Implicit.

AVATAR: Advanced Telematic Search of Audivisual Contents by Semantic Reasoning Yolanda Blanco Fernández Department of Telematic Engineering University.

IVITA Workshop Summary Session 1: interactive text analytics (Session chair: Professor Huamin Qu) a) HARVEST: An Intelligent Visual Analytic Tool for the.

Report on Intrusion Detection and Data Fusion By Ganesh Godavari.

Intelligent User Interfaces Research Group Directed by: Frank Shipman.

Recognizing User Interest and Document Value from Reading and Organizing Activities in Document Triage Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos.

1 Today  Tools (Yves)  Efficient Web Browsing on Hand Held Devices (Shrenik)  Web Page Summarization using Click- through Data (Kathy)  On the Summarization.

Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.

The Visual Knowledge Builder: A Second Generation Spatial Hypertext Frank M. Shipman III Haowei Hsieh Preetam Maloor J. Michael Moore.

Managing Software Projects in Spatial Hypertext : Experiences in Dogfooding Frank Shipman Department of Computer Science & Center for the Study of Digital.

Overview of Search Engines

Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.

Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.

Smart Learning Services Based on Smart Cloud Computing

Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.

 A set of objectives or student learning outcomes for a course or a set of courses.  Specifies the set of concepts and skills that the student must.

The 2nd International Conference of e-Learning and Distance Education, 21 to 23 February 2011, Riyadh, Saudi Arabia Prof. Dr. Torky Sultan Faculty of Computers.

CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.

Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.

MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.

RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.

Custom driven scientific information extraction from digital libraries using integrated text mining services Betim Çiço, Adrian Besimi, Visar Shehu 14th.

CONCLUSION & FUTURE WORK Normally, users perform triage tasks using multiple applications in concert: a search engine interface presents lists of potentially.

A Framework for Examning Topical Locality in Object- Oriented Software 2012 IEEE International Conference on Computer Software and Applications p

Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.

INF 141 COURSE SUMMARY Crista Lopes. Lecture Objective Know what you know.

PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.

Michael Cafarella Alon HalevyNodira Khoussainova University of Washington Google, incUniversity of Washington Data Integration for Relational Web.

UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.

WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.

CONCLUSION & FUTURE WORK Given a new user with an information gathering task consisting of document IDs and respective term vectors, this can be compared.

Chapter 6: Information Retrieval and Web Search

Chapter 12: Web Usage Mining - An introduction Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher, M.

Toward A Session-Based Search Engine Smitha Sriram, Xuehua Shen, ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.

You Are What You Tag Yi-Ching Huang and Chia-Chuan Hung and Jane Yung-jen Hsu Department of Computer Science and Information Engineering Graduate Institute.

Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.

Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.

BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.

Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI

Sharad Oberoi Carnegie Mellon University DesignWebs: Learning in Engineering Project Teams.

Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.

CONCLUSIONS & CONTRIBUTIONS Ground-truth dataset, simulated search tasks environment Multiple everyday applications (MS Word, MS PowerPoint, Mozilla Browser)

Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.

Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar

Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.

Conceptual structures in modern information retrieval Claudio Carpineto Fondazione Ugo Bordoni

Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.

Computational Tools for Population Biology Tanya Berger-Wolf, Computer Science, UIC; Daniel Rubenstein, Ecology and Evolutionary Biology, Princeton; Jared.

Towards a Reference Quality Model for Digital Libraries Maristella Agosti Nicola Ferro Edward A. Fox Marcos André Gonçalves Bárbara Lagoeiro Moreira.

Exploring in the Weblog Space by Detecting Informative and Affective Articles Xiaochuan Ni, Gui-Rong Xue, Xiao Ling, Yong Yu Shanghai Jiao-Tong University.

A Knowledge-Based Search Engine Powered by Wikipedia David Milne, Ian H. Witten, David M. Nichols (CIKM 2007)

Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq

Research Methodology Class.   Your report must contains,  Abstract  Chapter 1 - Introduction  Chapter 2 - Literature Review  Chapter 3 - System.

Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.

DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.

Web Search Personalization with Ontological User Profile Advisor: Dr. Jai-Ling Koh Speaker: Shun-hong Sie.

The Development of a search engine & Comparison according to algorithms Sung-soo Kim The final report.

How to Write an Abstract Gwendolyn MacNairn Computer Science Librarian.

Text Information Management ChengXiang Zhai, Tao Tao, Xuehua Shen, Hui Fang, Azadeh Shakery, Jing Jiang.

Unified Relevance Feedback for Multi-Application User Interest Modeling Sampath Jayarathna PhD Candidate Computer Science & Engineering.

Acknowledgements : This research is supported by NSF grant INTRODUCTION MULTI LAYER PERCEPTRONS (MLP) DATA SET FOR TRAINING Learning weights using.

Recognizing Document Value from Reading and Organizing Activities in Document Triage Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos Meintanis,

CS791 - Technologies of Google Spring A Webbased Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.

CONCLUSIONS & CONTRIBUTIONS Ground-truth dataset, simulated search tasks environment Implicit feedback, semi-explicit feedback (annotations), explicit.

Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,

Visualizing User Activity History

Web Mining Department of Computer Science and Engg.

Michal Rosen-Zvi University of California, Irvine

Latent Semantic Analysis

Presentation transcript:

CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially relevant documents; a document reader displays their contents; and a third tool—a text editor or personal information management application—is used to record notes and assessments (MS Word and MS PowerPoint). An Interest Profile Manager infers users' interests from their interactions with the multi-applications, coupled with the characteristics of the multi-source interest modeling techniques. The resulting interest profile is used to generate visualizations that direct users' attention to documents or parts of documents that match their inferred interests. Statistical methods used in the work (tf-idf, LDA an LSA) infer the interest based on the content similarity and the Ontology-based user model infers the long-term user interest dynamically using spreading activation module. Acknowledgements : This research is supported by NSF grant Hybrid (Statistical and Knowledge-based) Multi-Application User Interest Modeling Sampath Jayarathna and Frank Shipman Computer Science & Engineering, Texas A&M University – College Station Hybrid (Statistical and Knowledge-based) Multi-Application User Interest Modeling Sampath Jayarathna and Frank Shipman Computer Science & Engineering, Texas A&M University – College Station ABSTRACT We are interested about open-ended information gathering tasks— search tasks in particular—in which people collect Web documents for interpretation and synthesis. User interests are usually distributed in different systems during search tasks. Traditional user interest modeling methods are not designed for integrating and analyzing interests from multiple sources, hence, they are not very effective for obtaining comparatively complete description of user interests in a multi- application environment. We propose an approach of user interest modeling based on multi- source interest fusion using statistical/algebraic models (tf-idf, LSA, and LDA) and knowledge-based models (Ontology). Figure 1. Multi-Application Interest Modeling and Fusion Even with the best search engine and the most effective query formulation, “search tasks” require people to work through long lists of documents to synthesize the information they need; there is usually no single document containing one right answer. In fact, as people skim early documents, they may determine additional information needs that suggest further queries and results in even more documents to process. A system can support document search tasks by recommending the documents that best match a user’s interests, thereby ensuring that the user’s time is spent efficiently on the most relevant documents. In the work we present, recommendations based on demonstrated user interest; in other words, the user’s previous interactions with the document collection, along with the characteristics of the documents, are used to infer the user’s interests. USER INTEREST MODELING WebAnnotate - During information task, useful documents may be long, and cover multiple subtopics; users may read some segments and ignore others. In order to record which portion(s) of the document pique the user’s interests, an explicit interest expressions (e.g. annotations using WebAnnotate) capturing tool is used. Latent Semantic Analysis (LSA) - The SVD based LSA can take a large matrix of term document association data and construct a semantic space where terms and documents that are closely associated can be detected with Cosine Similarity. Latent Dirichlet Allocation (LDA) - Our strategy in using LDA is to describe users as a mixture of topics and to assume that each of their actions is motivated by choosing a topic of interest and subsequently a word to describe that action from the catalog of words consistent with that particular interest. We represent each user as a bag of words extracted from those actions and we use the search task to denote generating a word from the bag. Figure 2. Personalized Search & Recommendations Ontology-based User Model - An ontological approach to user profiling has proven to be successful in addressing the cold-start problem in recommender systems where no initial information is available early on upon which to base recommendations. We model the user interests using ontological profiles by assigning implicitly derived interest scores to existing concepts in domain ontology. A spreading activation algorithm is used to maintain the interest scores based on the user’s ongoing behavior. Ontology = Long-term user interest modeling Statistical Methods = Short-term user interest modeling REFERENCES SEARCH AND RECOMMENDATIONS Our major contributions in this work can be summarized as: (1)A novel personalized search & Recommendations based on evidence coming from multiple applications and multi-source interest modeling using a hybrid of statistical and knowledge- based methods. (2)In the future we plan on creating a weighting schema to identify the importance of evidence coming from multiple applications like VKB, Web Browser, Word and PowerPoint. 1.Tolomei, G., Orlando, S. and Silvestri, F., Towards a task-based search and recommender systems. in In Proceedings of ICDE Workshops, (2010), Bae, S., Hsieh, H., Kim, D., Marshall, C.C., Meintanis, K., Moore, J.M., Zacchi, A. and Shipman, F.M. Supporting document triage via annotation-based visualizations. American Society for Information Science and Technology, 45 (1) Landauer, T., Foltz, P.W. and Laham, D. An Introduction to Latent Semantic Analysis. Discourse Processes Sieg A, Mobasher B, and Burke R., “Web search personalization with ontological user profiles,” in ACM Sixteenth Conference on Information and Knowledge Management, CIKM 2007, Lisbon, Portugal, November 2007.