Presentation is loading. Please wait.

Presentation is loading. Please wait.

Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni.

Similar presentations


Presentation on theme: "Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni."— Presentation transcript:

1 Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni Qatar Computing Research Institute Maya Ramanath Dept. of CSE, IIT-Delhi, India Volker Tresp Siemens AG, Corporate Technology, Munich, Germany EMNLP 2012

2 Introduction  SPARQL ?x hasGender female ?x isa actor ?x actedIn Casablanca (film) ?x marriedTo ?w ?w isa writer ?w bornIn Rome  Natural language question qNL “Which female actor played in Casablanca and is married to a writer who was born in Rome?”. Problem : SPARQL is way too difficlut. Target : Convert SPARQL to qNL.

3 Knowledge Base: Yago2  Yago2 is a huge semantic knowledge base, derived from Wikipedia, WordNet and GeoNames.

4 Framework  DEANNA (DEep Answers for maNy Naturally Asked questions)

5 Framework Phrase Detection Phrase Mapping Q-Unit Generation Disambiguation of Phrase Mappings Query Generation

6 Phrase Detection  A detected phrase p is a pair where Toks is a phrase and l is a label, l ∈ {concept, relation}. Such as qNL : “Which female actor played in Casablanca and is married to a writer who was born in Rome?”. Concept phrase : Relation phrase : Framework

7 Phrase Detection Framework Concept detection  Using the Yago2 knowledge base.

8 Phrase Detection Framework Relation detection  Using the ReVerb (Fader et al., 2011) which is a relation detector. qNL : “Which female actor played in Casablanca and is married to a writer who was born in Rome?”.

9 Phrase Detection Framework

10 Phrase Mapping Framework  The mapping of concept phrases also relies on the phrase-concept dictionary. : Using Yago2 knowledge base.  The mapping of relation phrases relies on a corpus of textual patterns to relation mappings of the form.

11 Phrase Mapping Framework

12 Dependency Parsing & Q-Unit Generation Framework  Dependency parsing identifies triples of tokens, or triploids,, where trel, targ1, targ2 ∈ qNL are seeds for phrases. Dependency Parsing

13 Dependency Parsing & Q-Unit Generation Framework  qNL : “Which female actor played in Casablanca and is married to a writer who was born in Rome?”. actor played / played in Casablanca  Triploid :

14 Dependency Parsing & Q-Unit Generation Framework  A q-unit is a triple of sets of phrases,, where trel ∈ prel and similarly for arg1 and arg2. Q-Unit Generation

15 Dependency Parsing & Q-Unit Generation Framework

16 Dependency Parsing & Q-Unit Generation Framework

17 Disambiguation of Phrase Mappings Framework Disambiguation Graph  Esim ⊆ Vp × Vs  Ecoh ⊆ Vs × Vs  Eq ⊆ Vq×Vp×d, where d ∈ {rel, arg1, arg2} is a q-edge.

18 Disambiguation of Phrase Mappings Framework Disambiguation Graph(Cohsem)  For Yago2, the characterize an entity e by its inlinks InLinks(e): the set of Yago2 entities whose corresponding Wikipedia pages link to the entity. InLinks(Taipei_zoo):

19 Disambiguation of Phrase Mappings Framework Disambiguation Graph(Cohsem)  For class c with entities e, its inlinks are defined as follows: InLinks(Taiwan):

20 Disambiguation of Phrase Mappings Framework Disambiguation Graph(Cohsem)  For class r with entities e, its inlinks are defined as follows:

21 Disambiguation of Phrase Mappings Framework Disambiguation Graph(Simsem)  For entities How often a phrase refers to a certain entity in Wikipedia.  For classes Normalized prior the reflects the Number of members in a class  For relations The maximum n-gram similarity between the phrase and any of the relation’s surface forms

22 Disambiguation of Phrase Mappings Framework  Objective function is :

23 Disambiguation of Phrase Mappings Framework Definitions:

24 Disambiguation of Phrase Mappings Framework Definitions:

25 Disambiguation of Phrase Mappings Framework Constraints:

26 Disambiguation of Phrase Mappings Framework Constraints:

27 Disambiguation of Phrase Mappings Framework Constraints:

28 Query Generation Framework

29 Evaluation  Experiments are based on two collections of questions: QALD-1 (27 questions out of 50) NAGA (44 questions out of 87)  Using 19 questions from the QALD-1 Test set for tuning hyperparameters (α, β, γ) in the ILP objective function

30 Evaluation  Evaluating the output of DEANNA at three stages in the processing pipeline: a) Disambiguation b) Query Generation c) Question Answering  At each of the three stages, the output was shown to two human assessors. If the two were in disagreement, then a third person resolved the judgment.

31 Evaluation  Define coverage and precision as follows:

32 Evaluation a) Disambiguation

33 Evaluation b) Query Generation

34 Evaluation c) Question Answering

35 Evaluation

36 Conclusions A method for translating natural-language questions into structured queries.


Download ppt "Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni."

Similar presentations


Ads by Google