Using Semantic Relations to Improve Passage Retrieval for Question Answering Tom Morton.


1 Using Semantic Relations to Improve Passage Retrieval for Question Answering Tom Morton

2 Introduction
Paragraph retrieval for natural-language questions.
– Correctness of answers to natural-language questions can be accurately determined automatically.
– Standard precursor to the TREC question answering task.
What NLP technologies might help this task, and are they robust enough?

3 NLP Technologies
Question Analysis:
– Questions tend to specify the semantic type of their answer. This component tries to identify this type.
Named-Entity Detection:
– Named-entity detection determines the semantic type of proper nouns and numeric amounts in text.

4 How do these technologies help?
Question Analysis:
– The predicted category is appended to the question.
Named-Entity Detection:
– The NE categories found in text are included as new terms.
This approach requires additional question terms to be in the paragraph.
Example question: What party is John Major in? (ORGANIZATION)
Example paragraph: It probably won't be clear for some time whether the Conservative Party has chosen in John Major a truly worthy successor to Margaret Thatcher, who has been a giant on the world stage. +ORGANIZATION +PERSON
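The term augmentation on this slide can be sketched roughly as follows. This is only an illustration, not the system's actual code: the function names, the tokenizer, and the overlap score are assumptions made for the example, and the paragraph is shortened from the slide.

```python
import re


def tokenize(text):
    """Lowercase word tokens; a stand-in for whatever tokenizer the real system uses."""
    return [t.lower() for t in re.findall(r"[A-Za-z0-9]+", text)]


def expand_question(question, answer_type):
    """Append the predicted answer category to the question terms."""
    return set(tokenize(question)) | {"+" + answer_type}


def expand_paragraph(paragraph, ne_categories):
    """Add the NE categories found in the paragraph as extra terms."""
    return set(tokenize(paragraph)) | {"+" + c for c in ne_categories}


def overlap_score(question_terms, paragraph_terms):
    """Hypothetical scoring: count shared terms (not the ranking used in the talk)."""
    return len(question_terms & paragraph_terms)


question = "What party is John Major in?"
paragraph = ("the Conservative Party has chosen in John Major "
             "a truly worthy successor to Margaret Thatcher")

question_terms = expand_question(question, "ORGANIZATION")
paragraph_terms = expand_paragraph(paragraph, ["ORGANIZATION", "PERSON"])

plain_score = overlap_score(set(tokenize(question)), set(tokenize(paragraph)))
typed_score = overlap_score(question_terms, paragraph_terms)
```

Because the category terms keep their "+" prefix and upper case, they cannot collide with ordinary lowercased words, so a match on +ORGANIZATION only rewards paragraphs that contain an entity of the expected type.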

5 NLP Technologies
Coreference Relations:
– Interpretation of a paragraph may depend on the context in which it occurs.
Syntactically-based Categorical Relation Extraction:
– Appositive and predicate nominative constructions provide descriptive terms about entities.

6 How do these technologies help?
Coreference:
– Use coreference relationships to introduce new terms referred to but not present in the paragraph's text.
Example question: How long was Margaret Thatcher the prime minister? (DURATION)
Example paragraph: The truth, which has been added to over each of her 11 1/2 years in power, is that they don't make many like her anymore. +MARGARET +THATCHER +PRIME +MINISTER +DURATION
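A minimal sketch of the coreference-based expansion, assuming the coreference component has already linked the pronoun "her" to an entity described elsewhere as "Margaret Thatcher" and "prime minister". The function name and the input format are hypothetical.

```python
def expand_with_coreference(paragraph_terms, resolved_mentions):
    """Add terms describing each resolved entity to the paragraph's term set.

    resolved_mentions maps a mention in the paragraph (e.g. a pronoun) to the
    descriptions of its entity found elsewhere in the document.
    """
    expanded = set(paragraph_terms)
    for descriptions in resolved_mentions.values():
        for description in descriptions:
            expanded.update("+" + word.upper() for word in description.split())
    return expanded


# Terms already present for the paragraph, including the +DURATION category
# contributed by named-entity detection ("11 1/2 years").
paragraph_terms = {"truth", "years", "power", "+DURATION"}

# Assumed output of the coreference component for this paragraph.
resolved_mentions = {"her": ["Margaret Thatcher", "prime minister"]}

terms = expand_with_coreference(paragraph_terms, resolved_mentions)
```

With the added terms, the paragraph can match the question about Margaret Thatcher even though her name never appears in its text.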

7 How do these technologies help?
Categorical Relation Extraction:
– Identifies the DESCRIPTION category.
– Allows descriptive terms to be used in term expansion.
Example text:
Famed architect Frank Lloyd Wright… +DESCRIPTION
Buildings he designed include the Guggenheim Museum in New York and Robie House in Chicago. +FRANK +LLOYD +WRIGHT +FAMED +ARCHITECT
Example questions:
Who is Frank Lloyd Wright? (DESCRIPTION)
What architect designed Robie House? (PERSON)

8 How does it work?
Coreference:
– Use the approach described in ACL (Morton 2000).
– Divide referring expressions into three classes and create a separate resolution approach for each:
  Singular third-person pronouns: statistical
  Proper nouns: rule-based
  Definite noun phrases: rule-based
– Apply the resolution approaches to text in an interleaved fashion.

9 Coreference
Candidate entities:
1. John Major, a truly worthy…
2. Margaret Thatcher, her, …
3. The Conservative Party
4. the undoubted exception
5. Winston Churchill
6. …
Pronoun to resolve: she ?
[Figure: percentages shown beside the candidates: 20%, 70%, 10%, 5%, 10%]
The pronoun is resolved to an entity rather than to the most recent extent.
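The idea of resolving to an entity rather than to the most recent extent can be illustrated as below. The scores are made-up placeholders, not the figures from the slide, and the function is a hypothetical stand-in for the statistical pronoun model.

```python
def resolve_pronoun(pronoun, entities, scores):
    """Attach the pronoun to the highest-scoring entity and return its index.

    entities is a list of mention lists (one list per entity); scores holds one
    model score per entity, in the same order.
    """
    best = max(range(len(entities)), key=lambda i: scores[i])
    entities[best].append(pronoun)
    return best


# Each entity groups all of its mentions seen so far.
entities = [
    ["John Major"],
    ["Margaret Thatcher", "her"],
    ["The Conservative Party"],
    ["Winston Churchill"],
]

# Hypothetical per-entity scores for the pronoun "she".
scores = [0.1, 0.7, 0.1, 0.1]

chosen = resolve_pronoun("she", entities, scores)
```

Here the most recent extent is "Winston Churchill", but the pronoun joins the Margaret Thatcher entity because the decision is made over entities as wholes.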

10 Paragraph Retrieval Results

11 Conclusion
Developed and evaluated new techniques in:
– Coreference Resolution.
– Categorical Relation Extraction.
– Question Analysis.
Integrated these techniques with existing NLP components:
– NE detection, POS tagging, sentence detection, etc.
Demonstrated that these techniques can be used to improve performance in an information retrieval task:
– Paragraph retrieval for natural-language questions.

12 Porting this approach to ACE
A rapidly developed IE system
– Built using the same approach
Pipelined Architecture
– Easy to construct from existing components
– Easy to plug in new components
Statistical Components
– Maximum Entropy
– Require less hand-tuning
– Easy to improve with new training data or better machine learning algorithms

13 Input File → Tokenizing/Preprocessing → NE Tagging → Parsing → Nominal Tagging → Coreference → Relation Extraction → Output File
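A pipelined architecture like the one on this slide might be wired up as in the sketch below. The stages are empty placeholders that only record their names; the real components, their interfaces, and the document representation are not described in the slides and are assumptions here.

```python
from typing import Any, Callable, Dict, List

Document = Dict[str, Any]
Component = Callable[[Document], Document]


def make_stage(name: str) -> Component:
    """Build a placeholder component that records that it ran."""
    def stage(doc: Document) -> Document:
        doc.setdefault("stages", []).append(name)
        return doc
    return stage


def run_pipeline(doc: Document, components: List[Component]) -> Document:
    """Pass the document through each component in order."""
    for component in components:
        doc = component(doc)
    return doc


# Stage order taken from the slide.
STAGE_NAMES = [
    "tokenizing/preprocessing",
    "ne_tagging",
    "parsing",
    "nominal_tagging",
    "coreference",
    "relation_extraction",
]

pipeline = [make_stage(name) for name in STAGE_NAMES]
result = run_pipeline({"text": "John Major leads the Conservative Party."}, pipeline)
```

Since every component has the same signature, swapping one tagger for another amounts to replacing a single entry in the list.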

14 Integrating CRF: Results
The CRF tagger significantly improves NE detection, giving a higher entity score.
Better NE detection allows the system to find more relations, giving a higher relation score.
[Charts: Entity Scores and Relation Scores; bar labels include Maxent, CRF, and Maxent + BBN]

