INEX: Evaluating content-oriented XML retrieval Mounia Lalmas Queen Mary University of London http://qmir.dcs.qmul.ac.uk

Outline
- Content-oriented XML retrieval
- Evaluating XML retrieval: INEX

XML Retrieval
- Traditional IR is about finding documents relevant to a user's information need, e.g. an entire book.
- XML retrieval allows users to retrieve document components that are more focused on their information needs, e.g. a chapter of a book instead of the entire book.
- The structure of documents is exploited to identify which document components to retrieve.

Structured Documents
- Linear order of words, sentences, paragraphs …
- Hierarchy or logical structure of a book's chapters, sections …
- Links (hyperlinks), cross-references, citations …
- Temporal and spatial relationships in multimedia documents
[Figure: a book with its chapters, sections and paragraphs; the World Wide Web; a sample page of text]

Structured Documents
Explicit structure formalised through document representation standards (mark-up languages):
- Layout: LaTeX (publishing), HTML (Web publishing)
- Structure: SGML, XML (Web publishing, engineering), MPEG-7 (broadcasting)
- Content/Semantic: RDF, DAML + OIL, OWL (semantic web)
[Figure: the World Wide Web; a sample page of text; a marked-up document labelled SDR]

XML: eXtensible Mark-up Language
- Meta-language (user-defined tags) currently being adopted as the document format language by W3C
- Used to describe content and structure (and not layout)
- Grammar described in a DTD (used for validation)
[Example XML document, tags lost in extraction: Structured Document Retrieval; Smith John; Introduction into XML retrieval; …]

XML: eXtensible Mark-up Language
Use of XPath notation to refer to the XML structure:
- chapter/title: title is a direct sub-component of chapter
- //title: any title
- chapter//title: title is a direct or indirect sub-component of chapter
- chapter/paragraph[2]: the second paragraph that is a direct sub-component of a chapter
- chapter/*: all direct sub-components of a chapter
[Example XML document, tags lost in extraction: Structured Document Retrieval; Smith John; Introduction into SDR; …]
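The path expressions on this slide can be tried out with Python's standard xml.etree.ElementTree module, which supports a limited subset of XPath. This is a minimal sketch: the sample document and the helper `texts` are invented for illustration and are not the example from the slide. ElementTree expects a leading `.` for the "anywhere" form, so `//title` is written `.//title`.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document (not the one shown on the slide).
BOOK = """
<book>
  <title>Structured Document Retrieval</title>
  <chapter>
    <title>Introduction</title>
    <paragraph>First paragraph.</paragraph>
    <paragraph>Second paragraph.</paragraph>
    <section>
      <title>Background</title>
      <paragraph>Nested paragraph.</paragraph>
    </section>
  </chapter>
</book>
"""

root = ET.fromstring(BOOK)


def texts(path):
    """Return the text of every element matched by path, relative to <book>."""
    return [el.text for el in root.findall(path)]


direct_titles = texts("chapter/title")            # title directly under chapter
all_titles = texts(".//title")                    # any title below the root
nested_titles = texts("chapter//title")           # direct or indirect under chapter
second_paragraph = texts("chapter/paragraph[2]")  # second direct paragraph
chapter_children = [el.tag for el in root.findall("chapter/*")]
```

Note that the paragraph inside the section is not matched by chapter/paragraph[2], because it is not a direct sub-component of the chapter.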

Querying XML documents
- Content-only (CO) queries:
  'open standards for digital video in distance learning'
- Content-and-structure (CAS) queries:
  //article[about(., 'formal methods verify correctness aviation systems')]/body//section[about(., 'case study application model checking theorem proving')]
- Structure-only (SA) queries:
  /article//*section/paragraph[2]
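A CAS query combines a structural path with about() content conditions. The sketch below separates the two with regular expressions. It is only an illustration under simple assumptions (single-quoted keyword strings, no nested brackets) and is not the official INEX query parser; the function name `split_cas_query` is hypothetical.

```python
import re

# Matches about(<relative path>, '<keywords>') and captures both parts.
ABOUT = re.compile(r"about\(\s*([^,]+?)\s*,\s*'([^']*)'\s*\)")
# Matches any bracketed predicate so it can be stripped from the path.
PREDICATE = re.compile(r"\[[^\]]*\]")


def split_cas_query(query):
    """Return (structural path without predicates, list of about() clauses)."""
    about_clauses = ABOUT.findall(query)
    target_path = PREDICATE.sub("", query)
    return target_path, about_clauses


query = (
    "//article[about(., 'formal methods verify correctness aviation systems')]"
    "/body//section[about(.,'case study application model checking theorem proving')]"
)
path, clauses = split_cas_query(query)
```

Here `path` holds the structure the result elements must match, and `clauses` holds the content conditions attached to it.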

Content-oriented XML retrieval
Return document components of varying granularity (e.g. a book, a chapter, a section, a paragraph, a table, a figure, etc.) that are relevant to the user's information need with regard to both content and structure.

Content-oriented XML retrieval
Retrieve the best components according to content and structure criteria:
- INEX: most specific component that satisfies the query, while being exhaustive to the query
- Shakespeare study: best entry points, which are components from which many relevant components can be reached through browsing
- ???

Challenges
- no fixed retrieval unit + nested elements + element types
- how to obtain document and collection statistics?
- which component is a good retrieval unit?
- which components contribute best to content of Article?
- how to estimate?
- how to aggregate?
[Figure: an Article with a Title, Section 1 and Section 2; query terms ?XML, ?retrieval, ?authoring; term weights 0.9 XML, 0.5 XML, 0.2 XML, 0.4 retrieval, 0.7 authoring; further values 0.4, 0.5, 0.2, 0.6, 0.4, 0.2]

Approaches …
vector space model, probabilistic model, Bayesian network, language model, extending DB model, Boolean model, natural language processing, cognitive model, ontology, parameter estimation, tuning, smoothing, fusion, phrase, term statistics, collection statistics, component statistics, proximity search, logistic regression, belief model, relevance feedback

Vector space model (IBM Haifa, INEX 2003)
- Separate indexes: article index, abstract index, section index, sub-section index, paragraph index
- Each index returns an RSV (retrieval status value), which is then normalised
- The normalised RSVs are merged into a single ranking
- tf and idf as for fixed and non-nested retrieval units
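The pipeline on this slide (score per index, normalise, merge) can be sketched as follows. The slide does not say how the RSVs are normalised, so dividing by the top score of each index is an assumption made purely for illustration; the function names and the sample scores are hypothetical.

```python
def normalise(run):
    """Scale the scores of one index into [0, 1] by its top score (assumed scheme)."""
    top = max(run.values(), default=0.0)
    return {element: (score / top if top > 0 else 0.0) for element, score in run.items()}


def merge(runs_by_index):
    """Merge the normalised runs of all indexes into one ranked list."""
    merged = []
    for index_name, run in runs_by_index.items():
        for element, score in normalise(run).items():
            merged.append((score, index_name, element))
    # Highest score first; ties broken by index name, then element id.
    merged.sort(key=lambda item: (-item[0], item[1], item[2]))
    return merged


# Hypothetical raw RSVs from two of the per-granularity indexes.
runs = {
    "article": {"a1": 4.0, "a2": 2.0},
    "section": {"a1/s1": 0.5, "a2/s3": 0.25},
}
ranking = merge(runs)
```

Without the normalisation step the article scores would dominate the merged list simply because they come from a different index with a different score range.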

Language model (University of Amsterdam, INEX 2003)
- Element language model, collection language model, smoothing parameter λ
- Element score, element size, article score; elements are ranked
- Query expansion with blind feedback
- Ignore elements with … 20 terms
- A high value of λ leads to an increase in the size of retrieved elements
- Results with λ = 0.9, 0.5 and 0.2 are similar
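The slide names an element language model, a collection language model and a smoothing parameter. A common way to combine these is linear interpolation, sketched below. This is a generic illustration and not necessarily the exact formula used by the Amsterdam group: which model the parameter `lam` weights, and the omission of element size and article score, are assumptions of this sketch.

```python
import math
from collections import Counter


def lm_score(query_terms, element_terms, collection_counts, collection_size, lam=0.5):
    """Log-probability of the query under a smoothed element language model.

    Assumed convention: lam weights the element model, (1 - lam) the collection model.
    """
    counts = Counter(element_terms)
    size = len(element_terms)
    score = 0.0
    for term in query_terms:
        p_element = counts[term] / size if size else 0.0
        p_collection = collection_counts.get(term, 0) / collection_size
        p = lam * p_element + (1 - lam) * p_collection
        if p == 0.0:
            return float("-inf")  # term unseen in both element and collection
        score += math.log(p)
    return score


# Hypothetical two-element collection.
element_a = ["xml", "retrieval", "xml"]
element_b = ["authoring", "tools"]
collection_counts = Counter(element_a + element_b)
collection_size = len(element_a) + len(element_b)

query = ["xml", "retrieval"]
score_a = lm_score(query, element_a, collection_counts, collection_size)
score_b = lm_score(query, element_b, collection_counts, collection_size)
```

Thanks to the collection model, element_b still receives a finite score even though it contains none of the query terms.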

Evaluation of XML retrieval: INEX
- Evaluating the effectiveness of content-oriented XML retrieval approaches
- Collaborative effort: participants contribute to the development of the collection
  - queries
  - relevance assessments
- Similar methodology as for TREC, but adapted to XML retrieval
- 40+ participants worldwide
- Workshop in Schloss Dagstuhl in December (20+ institutions)

INEX Test Collection
- Documents (~500MB): 12,107 articles in XML format from the IEEE Computer Society; 8 million elements
- INEX 2002
  - 30 CO and 30 CAS queries
  - inex2002 metric
- INEX 2003
  - 36 CO and 30 CAS queries
  - CAS queries are defined according to an enhanced subset of XPath
  - inex2002 and inex2003 metrics
- INEX 2004 is just starting

Tasks
- CO: the aim is to decrease user effort by pointing the user to the most specific relevant portions of documents.
- SCAS (strict CAS): retrieve relevant nodes that match the structure specified in the query.
- VCAS (vague CAS): retrieve relevant nodes that may not be the same as the target elements, but are structurally similar.

Relevance in XML
- An element is relevant if it has significant and demonstrable bearing on the matter at hand
- Common assumptions in IR
  - Objectivity
  - Topicality
  - Binary nature
  - Independence
[Figure: an article containing sections 1 and 2 and paragraphs 1, 2 and 3]

Relevance in INEX
- Exhaustivity: how exhaustively a document component discusses the query: 0, 1, 2, 3
- Specificity: how focused the component is on the query: 0, 1, 2, 3
- Relevance: (3,3), (2,3), (1,1), (0,0), …
Example (an article and its sections):
- all sections relevant: article very relevant
- all sections relevant: article better than sections
- one section relevant: article less relevant
- one section relevant: section better than article
- …

Relevance assessment task
- Completeness
  - Element → parent element, child elements
- Consistency
  - The parent of a relevant element must also be relevant, although to a different extent
  - Exhaustivity increases going up the tree
  - Specificity decreases going up the tree
- Use of an online interface
- Assessing a query takes a week!
- On average, 2 topics per participant
[Figure: an article containing sections 1 and 2 and paragraphs 1, 2 and 3]
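The consistency requirement can be checked mechanically. The sketch below assumes that assessments are stored as a mapping from element path to an (exhaustivity, specificity) pair and tests two rules: a relevant element needs a relevant parent, and a parent should be at least as exhaustive as its child. The second rule is one reading of "exhaustivity increases" towards the root. The data layout and function names are hypothetical and do not describe the actual INEX assessment tool.

```python
def parent_of(path):
    """Return the parent path, or None for the root element."""
    head, _, _ = path.rpartition("/")
    return head or None


def inconsistencies(assessments):
    """List (path, reason) pairs for assessments that break the two rules."""
    problems = []
    for path, (exh, _spec) in sorted(assessments.items()):
        parent = parent_of(path)
        if exh == 0 or parent is None or parent not in assessments:
            continue  # nothing to check for this element
        parent_exh = assessments[parent][0]
        if parent_exh == 0:
            problems.append((path, "parent not relevant"))
        elif parent_exh < exh:
            problems.append((path, "parent less exhaustive than child"))
    return problems


# Hypothetical assessments: path -> (exhaustivity, specificity).
assessments = {
    "/article[1]": (2, 1),
    "/article[1]/sec[1]": (3, 3),
    "/article[1]/sec[2]": (1, 2),
    "/article[1]/sec[2]/p[1]": (1, 3),
}
found = inconsistencies(assessments)
```

In this example the first section is judged more exhaustive than the article that contains it, which is flagged.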

Interface
[Screenshot of the online assessment interface; labelled areas: current assessments, navigation, groups]

Assessments
- With respect to the elements to assess
  - 26 % of assessments on elements in the pool (66 % in INEX 2002)
  - 68 % highly specific elements not in the pool
- 7 % of elements automatically assessed
- INEX 2002
  - 23 inconsistent assessments per query for one rule

Metrics
Need to consider:
- Two dimensions of relevance
- Independence assumption does not hold
- No predefined retrieval unit
- Overlap
- Linear vs. clustered ranking
[Figure: a section nested in an article]

INEX 2002 metric
Quantization: strict, generalized

Precision as defined by Raghavan89 (based on ESL, the expected search length), where n is estimated
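The formula itself did not survive in the transcript. The expected-search-length precision is usually stated as below; this is given as a reminder of the general form and should be checked against the original definition. The symbols are assumptions of this note: x is the recall level, n the total number of relevant components, and esl the expected number of non-relevant components a user sees before reaching x·n relevant ones.

```latex
\[
  P(\mathrm{rel} \mid \mathrm{retr})(x) \;=\; \frac{x \cdot n}{x \cdot n + \mathrm{esl}_{x \cdot n}}
\]
```

Read this way, precision at recall x drops as more non-relevant components have to be inspected to collect the required number of relevant ones.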

Overlap problem

INEX 2003 metric
Ideal concept space (Wong & Yao 95)
[Figure: c and t in the concept space]

INEX 2003 metric
Quantization: strict, generalised
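A quantization function maps an (exhaustivity, specificity) assessment to a single relevance value. The sketch below shows the idea. The strict function counts only (3,3) as relevant. The values in the generalised table follow the mapping commonly reported for INEX 2003, but they are not preserved on the slide and should be checked against the official definition. The function names are hypothetical.

```python
def quantise_strict(exh, spec):
    """Only highly exhaustive and highly specific components count."""
    return 1.0 if (exh, spec) == (3, 3) else 0.0


# Assumed generalised mapping (to be verified against the INEX 2003 guidelines).
GENERALISED = {
    (3, 3): 1.0,
    (2, 3): 0.75, (3, 2): 0.75, (3, 1): 0.75,
    (1, 3): 0.5, (2, 2): 0.5, (2, 1): 0.5,
    (1, 2): 0.25, (1, 1): 0.25,
    (0, 0): 0.0,
}


def quantise_generalised(exh, spec):
    """Graded relevance value; raises KeyError for pairs missing from the table."""
    return GENERALISED[(exh, spec)]
```

With the strict function an assessment of (2,3) scores 0, while the generalised function still gives it partial credit.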

Ignoring overlap:

INEX 2003 metric Considering overlap:

INEX 2003 metric
- Penalises overlap by only scoring novel information in overlapping results
- Assumes a uniform distribution of relevant information
- Issue of stability
- Size considered directly in precision (is it intuitive that large is good or not?)
- Recall defined using exhaustivity only
- Precision defined using specificity only
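The idea of scoring only novel information can be illustrated by measuring how much of each returned element has not already been seen at an earlier rank. In this sketch, elements are modelled as half-open character ranges within one document. The representation and the function name are assumptions for illustration, and the sketch does not reproduce the official metric.

```python
def novel_sizes(ranked_spans):
    """For each (start, end) span in rank order, return (novel size, full size)."""
    seen = set()
    sizes = []
    for start, end in ranked_spans:
        positions = set(range(start, end))
        novel = positions - seen  # text not covered by any earlier result
        sizes.append((len(novel), len(positions)))
        seen |= positions
    return sizes


# Hypothetical run: a section, a paragraph inside it, and a partly overlapping element.
run = [(0, 100), (20, 50), (90, 120)]
sizes = novel_sizes(run)
```

Under the uniform-distribution assumption named on the slide, the relevant content of an element would be credited in proportion to its novel share, so the nested paragraph at rank 2 earns nothing.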

Alternative metrics
- User-effort oriented measures
  - Expected Relevant Ratio
  - Tolerance to Irrelevance
- Discounted Cumulated Gain
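Discounted cumulated gain sums the gain of each result while discounting gains that appear late in the ranking. The sketch below shows the standard document-level form with a logarithmic discount; how gains would be derived for XML elements (for example from quantised assessments) is left open, and the sample gain values are hypothetical.

```python
import math


def dcg(gains, base=2):
    """Return the discounted cumulated gain at every rank.

    Gains at ranks below the log base are not discounted; from rank `base`
    onwards each gain is divided by log_base(rank).
    """
    total = 0.0
    cumulated = []
    for rank, gain in enumerate(gains, start=1):
        discount = 1.0 if rank < base else math.log(rank, base)
        total += gain / discount
        cumulated.append(total)
    return cumulated


# Hypothetical graded gains of the first five results.
values = dcg([3, 2, 3, 0, 1])
```

A larger base models a more patient user, since the discount then starts later and grows more slowly.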

Lessons learnt
- Good definition of relevance
- Expressing CAS queries was not easy
- Relevance assessment process must be improved
- Further development on metrics needed
- User studies required

Conclusion
- XML retrieval is not just about the effective retrieval of XML documents, but also about how to evaluate effectiveness
- INEX 2004 tracks
  - Relevance feedback
  - Interactive
  - Heterogeneous collection
  - Natural language query
http://inex.is.informatik.uni-duisburg.de:2004/
