Topic: Semantic Text Mining

Slides:



Advertisements
Similar presentations
Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Chapter 5: Introduction to Information Retrieval
Modelling Relevance and User Behaviour in Sponsored Search using Click-Data Adarsh Prasad, IIT Delhi Advisors: Dinesh Govindaraj SVN Vishwanathan* Group:
Improved TF-IDF Ranker
Query Languages. Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
1 Latent Semantic Mapping: Dimensionality Reduction via Globally Optimal Continuous Parameter Modeling Jerome R. Bellegarda.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Information Retrieval Ling573 NLP Systems and Applications April 26, 2011.
Video retrieval using inference network A.Graves, M. Lalmas In Sig IR 02.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Investigation of Web Query Refinement via Topic Analysis and Learning with Personalization Department of Systems Engineering & Engineering Management The.
Presented by Zeehasham Rasheed
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Longbiao Kang, Baotian Hu, Xiangping Wu, Qingcai Chen, and Yan He Intelligent Computing Research Center, School of Computer Science and Technology, Harbin.
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
C OLLECTIVE ANNOTATION OF WIKIPEDIA ENTITIES IN WEB TEXT - Presented by Avinash S Bharadwaj ( )
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Chapter 6: Information Retrieval and Web Search
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Deep Learning Powered In- Session Contextual Ranking using Clickthrough Data Xiujun Li 1, Chenlei Guo 2, Wei Chu 2, Ye-Yi Wang 2, Jude Shavlik 1 1 University.
Lecture 1: Overview of IR Maya Ramanath. Who hasn’t used Google? Why did Google return these results first ? Can we improve on it? Is this a good result.
1 Opinion Retrieval from Blogs Wei Zhang, Clement Yu, and Weiyi Meng (2007 CIKM)
Structure of IR Systems INST 734 Module 1 Doug Oard.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Retrieval of Highly Related Biomedical References by Key Passages of Citations Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan.
Authors: Marius Pasca and Benjamin Van Durme Presented by Bonan Min Weakly-Supervised Acquisition of Open- Domain Classes and Class Attributes from Web.
Conceptual structures in modern information retrieval Claudio Carpineto Fondazione Ugo Bordoni
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
1 A Fuzzy Logic Framework for Web Page Filtering Authors : Vrettos, S. and Stafylopatis, A. Source : Neural Network Applications in Electrical Engineering,
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
University Of Seoul Ubiquitous Sensor Network Lab Query Dependent Pseudo-Relevance Feedback based on Wikipedia 전자전기컴퓨터공학 부 USN 연구실 G
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Sentence Modeling Representation of sentences is the heart of Natural Language Processing A sentence model is a representation and analysis of semantic.
Context-Specific Intention Awareness through Web Query
Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment Xinyang Jiang, Fei Wu, Xi Li, Zhou Zhao, Weiming Lu, Siliang Tang, Yueting.
Intelligent Information System Lab
Associative Query Answering via Query Feature Similarity
Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning Shizhu He, Cao liu, Kang Liu and Jun Zhao.
Text & Web Mining 9/22/2018.
Web IR: Recent Trends; Future of Web Search
Table Cell Search for Question Answering Huan Sun
Introduction to Information Retrieval
Resource Recommendation for AAN
Using Multilingual Neural Re-ranking Models for Low Resource Target Languages in Cross-lingual Document Detection Using Multilingual Neural Re-ranking.
Relevance and Reinforcement in Interactive Browsing
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Extracting Why Text Segment from Web Based on Grammar-gram
Presentation transcript:

Topic: Semantic Text Mining Bin Li 1 1

Outline Paper 1 : Table Cell Search for Question Answering Background Table cell search framework Evaluation 2 2

Background complexer question unstructured knowledge resource limit of query for traditional knowledge bases 3 3

Background complexer question unstructured knowledge resource limit of query for traditional knowledge bases Precisely retrieve table cells from web to answer a user question Measure 4 4

Table cell search framework formulate question chain and relational chain find the best answer (using deep neural networks) extract the corresponding answer

what languages do people in france speak? natural language question

Question chain what languages do people in france speak? Topic entity natural language question Question chain Topic entity Question pattern

Question chain what languages do people in france speak? Topic entity natural language question Question chain Topic entity Question pattern

Question chain what languages do people in <e> speak? natural language question Question chain Topic entity Question pattern

Question chain relevant table what languages do people in <e> speak? France ?X

Question chain Relevant table what languages do people in <e> speak? France ?X Row graph

people in <e> speak? France ?X Question chain Relevant table what languages do people in <e> speak? France ?X Row graph column name Topic cell Ending cell Relation chain

people in <e> speak? France ?X Question chain Relevant table what languages do people in <e> speak? France ?X matching imply inward and outward relations Row graph column name Topic cell Ending cell Relation chain

Candidate Chains Question chain Relevant table what languages do people in <e> speak? France ?X Candidate Chains matching imply inward and out ward relations Row graph column name Topic cell Ending cell Relation chain

Get a large set of candidate chains via string matching. Evaluate the relevance of a candidate chain to the input question. Get more accurate candidate chains. Explore the information of candidate chains. Use deep neural networks to evaluate the matching degree.

Chain inference Semantic representation non-linear feed-forward neural network fixed-lenth global feature vector extract most salient local feature llocal contextual feature vector convolution concatenate {#-s-p,s-p-e,p-e-a,e-a-k,a-k-#} what languages do people in <e> speak?

Chain inference Semantic representation non-linear feed-forward neural network fixed-lenth global feature vector extract most salient local feature llocal contextual feature vector convolution concatenate {#-s-p,s-p-e,p-e-a,e-a-k,a-k-#} what languages do people in <e> speak? As input: question pattern, word sequence, answer type, peseudo-predicate,entity pairs

Features Shallow features Deep features word-level matching degree Deep features answer type pseudo-predicate entity pairs Calculate the cosine similarity between the question and the candidate chain.

Evaluation

Evaluation Measures Precision Recall measure (Harmonic mean of P and R)

Paper 2 : Dynamic Collective Entity Representations for Entity Ranking Outline Background Dynamic collective entity representations(DCER) Evaluation 21 21

Background Mismatch Manualy query Entity's description in knowledge base Manualy query Context dependency Time dependency

Dynamic collective entity representation Collective intelligence Manualy query Knowledge base entity description Fielded documents represent entities continuously update ranking model incorporate new descriptions Retrain model adjust weights associated to entity's fields Dynamic collective entity representation

Dynamic collective entity representations(DCER) Problem: Query: q Knowledge base: KB (consist of entities e E) Aim: find the best match between e and q Approach: Expand entity representations(field document) Reduce the vocabulary gap between queties and entities Train classification-based ranker Combine content from each field

Description sources Knowledge base External description sources (static: web achives, web anchors... dynamic: tweets, query logs...) Adaptive entity ranking Supervised entity ranker learns to weight the fields but two challenges by constructing DCER: -Heterogeneity(volume, quaity, quantity, type...) within entities between entites -Dynamicness cannot capture the evolving and continually changing so “Adaptive entity ranking”(continuously undated)

Model Entity representaton: : Field term vector, represent the content of e Updating fields: Estimate e's relevance to a query q: Supervises single-field weighting model

Three features express the importance of field and entity Field similarity Query-Field similarity score Field importance status of the field at a point in time Entity importance resently updated entities

Ranker Macine learning Query (Employ a supervised ranker to learn the optimal feature weights for retrieval) Weight vector: ( : weights of each of the field features and the entity importance feature) Top K-retrieval Query Candidate entities user interaction(i.e.,clicks) feature vectore(x) Ranker Optimal classification's condidence score

Evaluation Q1: Does entity ranking effectiveness increase using DCER? Compare and Q2: Does entity ranking effectiveness increase when employing field and entity features? Compare ,KBER(incorperate entity and field importance feature) and DCER Q3: Does entity ranking effectiveness increase when we continuosly learn the optimal entity representation? Compare DCER and (non-adaptive)

Thank you for your attention!