A Probabilistic Model for Fine-Grained Expert Search Shenghua Bao, Huizhong Duan, Qi Zhou, Miao Xiong, Yunbo Cao, Yong Yu June , 2008, Columbus Ohio
Schedule Introduction 1 Fine-grained Expert Search 2 Conclusion 4 Experimental Results 3
3 Introduction Expert Search “who is an expert on X?” UserQuery Search Engine Experts Who are experts on Semantic Web Search Engine ?
Introduction Pioneering Expert Search Systems Log data in software development Kautz et al., 1996; Mockus and Herbsleb, 2002; McDonald and Ackerman, 1998; etc. communications Campbell et al., 2003; Dom et al. 2003; Sihn and Heeren, 2001; etc. General documents Yimam, 1996; Davenport and Prusak, 1998; Steer and Lochbaum, 1988; Mattox et al., 1999; Hertzum and Pejtersen, 2000; Craswell et al., 2001; etc.
Introduction Expert Search at TREC A new task at TREC 2005, 2006, 2007 Craswell et al., 2005; Soboroff et al., 2006; Bailey et al., 2007; Many approaches have been proposed Two generative models, Balog et al Prior distribution, relevance feedback, Fang et al Hierarchical language model, Petkova and Croft 2006 Voting and data fusion, Macdonald and Ounis 2006 …
Introduction Coarse-grained approach. Expert search is carried out under a grain of document. Further improvements are hard to achieve Different blocks of electronic documents Different functions and qualities Different impacts for expert search
Windowed Section Relation irrelevant Window relevant queried topic 7 Examples
Title-Author Relation Title Author Query: Timed Text 8 Examples
Reference Section Relation 9 Examples
Query: W3C Management Team 10 Examples Section Title-Body Relation
Schedule Introduction 1 Fine-grained Expert Search 2 Conclusion 4 Experimental Results 3
12 Fine-grained Evidence Who are experts on Semantic Web Search Engine? Fine-grained Expert Search --Evidence Extraction Document-001: “…a high-level plan of the architecture of the semantic web by Tim Berners- Lee… ” “…later, Berners-Lee describes a semantic web search engine experience…” E1: E2: Tim Berners-Lee
Fine-grained Expert Search –Search Model (t,p,r,d) Expert Candidate (c) Query (q) Expert Matching Model Evidence Matching Model
Fine-grained Expert Search -- Expert Matching MaskSample Full NameRitu Raj Tiwari Combined NameTiwari, Ritu R; Abbr. NameRitu Raj ; Ritu Short NameRRT Alias, new ( for short)
Fine-grained Expert Search -- Evidence Matching TypeSample Query Semantic Web Search Engine Phrase “Semantic Web Search Engine” Bi-gram “Semantic Web” “Search Engine” Proximity “Semantic … Web Search Engine” Fuzzy “Samentic Web Saerch Engine” Stemmed “Semantic Web Search Engin” Relation Type Same Section Windowed Section Reference Section Title-Author Section Title-Body Quality Type Dynamic Quality Static Qualify
Schedule Introduction 1 Fine-grained Expert Search 2 Conclusion 4 Experimental Results 3
Experimental Result W3C Corpus 331,307 web pages 10 training topics of TREC 2005 50 test topics of TREC 2005 49 test topics of TREC 2006 Evaluation Metrics Mean average precision (MAP) R-precision (R-P) Top N precision
Experimental Result Query Matching TREC 2005TREC 2006 Baseline Bi-gram Proximity Fuzzy, Stemmed Improv %17.09%9.80%22.07%11.49%5.30% T-test
Experimental Result Person Matching TREC 2005TREC 2006 Baseline Combined Name Abbr. Name Short Name Alias, new Improv. 6.21%3.60%1.19%14.00%1.96%1.68% T-test
Experimental Result Multiple Relations TREC 2005TREC 2006 Baseline Windowed Section Reference Section Title-Author Section Title-Body Improv %19.91%10.00%8.35%8.77%8.50% T-test
Experimental Result Evidence Quality TREC 2005TREC 2006 Baseline Static quality Dynamic quality Improv. 6.13%4.67%3.74%2.86%3.67%8.61% T-test Rank
Schedule Introduction 1 Fine-grained Expert Search 2 Conclusion 4 Experimental Results 3
Conclusion Fine-grained expert search Probabilistic model and its implementation Evaluation on the TREC data set