TextInfer 2011 – Bar Ilan University 1 Towards a probabilistic Model for Lexical Entailment Eyal Shnarch, Jacob Goldberger, Ido Dagan.

Slides:



Advertisements
Similar presentations
Keynote at ICTIR 2011, Sept. 13, 2011, Bertinoro, Italy Axiomatic Analysis and Optimization of Information Retrieval Models ChengXiang (Cheng) Zhai Department.
Advertisements

Simplifications of Context-Free Grammars
KATIE BUCKLEY, HARVARD UNIVERSITY SCOTT MARION, CENTER FOR ASSESSMENT NATIONAL CONFERENCE ON STUDENT ASSESSMENT (NCSA) NATIONAL HARBOR, MD JUNE 22, 2013.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 3.1 Chapter 3.
STATISTICS HYPOTHESES TEST (I)
Estimating the detector coverage in a negative selection algorithm Zhou Ji St. Jude Childrens Research Hospital Dipankar Dasgupta The University of Memphis.
Combining Like Terms. Only combine terms that are exactly the same!! Whats the same mean? –If numbers have a variable, then you can combine only ones.
Title I Data Presentation All served Title I students School-wide and Targeted Combined.
NTDB ® Annual Report 2009 © American College of Surgeons All Rights Reserved Worldwide Percent of Hospitals Submitting Data to NTDB by State and.
Regional Policy EUROPEAN COMMISSION 1 EGTC regulation EGTC regulation ESF and EGTC regulations Regulation of the European Parliament and of the Council.
TENNESSEE REVENUE ESTIMATES FISCAL YEARS AND Presentation to the State Funding Board Fiscal Review Committee Staff March 24, 2010.
Tennessee Higher Education Commission Higher Education Recommendations & Finance Overview November 15, 2012.
A presentation to the Board of Education
Continuous Improvement Update June 11, (Your School/Departments Mission Statement Here) (Your School/Departments SMART Goals Here)
1/23 Learning from positive examples Main ideas and the particular case of CProgol4.2 Daniel Fredouille, CIG talk,11/2005.
Dependability analysis and evolutionary design optimisation with HiP-HOPS Dr Yiannis Papadopoulos Department of Computer Science University of Hull, U.K.
How can I find the number of times a work has been cited by other authors?
Spectral Clustering Eyal David Image Processing seminar May 2008.
New EU Forest Strategy and Update on Forest Information and Monitoring
Bridging through 10 Learning objectives:
The Grade 9 Cohort of Fall 2000: Post-secondary Pathways Preliminary Analysis Presentation to HEQCO - June 15, 2009 Dr. Robert S. Brown Organizational.
Friday 2/11 HW: NONE 1.Get out last nights HW 2.While I am checking, log onto the computer. 3.While waiting to log in, complete 6-2 B #1-8 in your packet!
Feedback From bandwidths onwards. First, the guidance hypothesis Remember the guidance hypothesis? Usually examined with simple button tasks Some others.
Turing Machines.
1. 2 Anyone can suggest an idea for a law. Only Members of Congress can introduce a proposed law to the House or Senate. 3.
Galit Haim, Ya'akov Gal, Sarit Kraus and Michele J. Gelfand A Cultural Sensitive Agent for Human-Computer Negotiation 1.
EU Market Situation for Eggs and Poultry Management Committee 21 June 2012.
1 Probabilistic Uncertainty Bounding in Output Error Models with Unmodelled Dynamics 2006 American Control Conference, June 2006, Minneapolis, Minnesota.
Market Research on Car Restriction for First-Year Students: Impact on Undergraduate Recruitment Final Report Overview – May 12, 2010.
1 Highlights of a Systematic Review of Research on Peer-Delivered Services Boston University Center for Psychiatric Rehabilitation March 2010.
CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 ACM Principles and Practice of Parallel Programming, PPoPP, 2006 Panel Presentations Parallel Processing is.
5.9 + = 10 a)3.6 b)4.1 c)5.3 Question 1: Good Answer!! Well Done!! = 10 Question 1:
Copyright © 2013, 2009, 2006 Pearson Education, Inc. 1 Section 5.5 Dividing Polynomials Copyright © 2013, 2009, 2006 Pearson Education, Inc. 1.
10/5/2013Multiplication Rule 11  Multiplication Rule 1: If a > b and c > 0 then a c > bc Examples If 7 > 3 and 5 > 0 then 7(5) > 3(5) If 2x + 6 > 8 then.
* 07/16/96 *.
Minnesota Assessment Conference Mathematics MCA-III Online Testing Jennifer Dugan, Sandy Gimpl, and Rosemary Heinitz.
 Albert Lea Area Schools.  Albert Lea Area Schools will be above the state average at grade level in reading, math, and science according to MCA results.
Using the P-Value Section P-Value (Observed Significance Level)  It’s the measure of the inconsistency between the hypothesized value for a population.
Desired Results Developmental Profile - school readiness© A Project of the California department of education, child development division.
Putting Statistics to Work
1 Let’s Recapitulate. 2 Regular Languages DFAs NFAs Regular Expressions Regular Grammars.
Ye shanshan Friday, May 11, /20.  Problem  Literature Review  Mathematical Model  Conclusion  My idea 参考文献: Gupta, S., and Dutta, K., Modeling.
Document Summarization using Conditional Random Fields Dou Shen, Jian-Tao Sun, Hua Li, Qiang Yang, Zheng Chen IJCAI 2007 Hao-Chin Chang Department of Computer.
Highlights From the Survey on the Use of Funds Under Title II, Part A
Urban Transportation Council Green Guide for Roads Task Force TAC 2009 Annual Conference and Exhibition Vancouver.
PUSD Teacher Evaluation SY 13/14 Governing Board Presentation May 9, 2013 Dr. Heather Cruz, Deputy Superintendent.
What we know (and don’t know) about MC supplies. Steve Gesuale PSI PEPFAR MC Partners Meeting May 2009.
UNIT-III By Mr. M. V. Nikum (B.E.I.T). Programming Language Lexical and Syntactic features of a programming Language are specified by its grammar Language:-
ISCOL 2011 – Bar Ilan University /151 A Probabilistic Model for Lexical Entailment Eyal Shnarch, Jacob Goldberger, Ido Dagan Bar Ilan University.
Obama becomes President. By: Lulwa Al Zaini 6E. Who is Barack Obama? Barack Obama is the President of America. He was born on August He also went.
1 Proximity-Based Opinion Retrieval Mark CarmanFabio CrestaniShima Gerani.
UNED at PASCAL RTE-2 Challenge IR&NLP Group at UNED nlp.uned.es Jesús Herrera Anselmo Peñas Álvaro Rodrigo Felisa Verdejo.
1 Hidden Markov Model Instructor : Saeed Shiry  CHAPTER 13 ETHEM ALPAYDIN © The MIT Press, 2004.
Seven Lectures on Statistical Parsing Christopher Manning LSA Linguistic Institute 2007 LSA 354 Lecture 7.
A Confidence Model for Syntactically-Motivated Entailment Proofs Asher Stern & Ido Dagan ISCOL June 2011, Israel 1.
Probabilistic Lexical Models for Textual Inference Eyal Shnarch, Ido Dagan, Jacob Goldberger Probabilistic Lexical Models for Textual Inference Eyal Shnarch,
FBK August 2012 PLIS : a Probabilistic Lexical Inference System Eyal Shnarch, Ido Dagan, Jacob Goldberger PLIS : a Probabilistic Lexical Inference System.
Overview of the Fourth Recognising Textual Entailment Challenge NIST-Nov. 17, 2008TAC Danilo Giampiccolo (coordinator, CELCT) Hoa Trang Dan (NIST)
Knowledge and Tree-Edits in Learnable Entailment Proofs Asher Stern, Amnon Lotan, Shachar Mirkin, Eyal Shnarch, Lili Kotlerman, Jonathan Berant and Ido.
Vectors Readings: Chapter 3. Vectors Vectors are the objects which are characterized by two parameters: magnitude (length) direction These vectors are.
Crowdsourcing Inference-Rule Evaluation Naomi Zeichner, Jonathan Berant, Ido Dagan Crowdsourcing Inference-Rule Evaluation Naomi Zeichner, Jonathan Berant,
2010/2/4Yi-Ting Huang Pennacchiotti, M., & Zanzotto, F. M. Learning Shallow Semantic Rules for Textual Entailment. Recent Advances in Natural Language.
“Harrison Bergeron” Comparative Paragraph Thursday, April 24, 2014.
DESIGNING AN ARTICLE Effective Writing 3. Objectives Raising awareness of the format, requirements and features of scientific articles Sharing information.
The statistics of pairwise alignment BMI/CS 576 Colin Dewey Fall 2015.
Text Summarization using Lexical Chains. Summarization using Lexical Chains Summarization? What is Summarization? Advantages… Challenges…
Copyright © American Speech-Language-Hearing Association
Recognizing Partial Textual Entailment
Comparing Two Proportions
Presentation transcript:

TextInfer 2011 – Bar Ilan University 1 Towards a probabilistic Model for Lexical Entailment Eyal Shnarch, Jacob Goldberger, Ido Dagan

TextInfer 2011 – Bar Ilan University 2 Entailment at the lexical level Obama gave a speech last night in the Israeli lobby conference In his speech at the American Israel Public Affairs Committee yesterday, the president challenged … Barack Obamas AIPAC address... AIPAC Israeli lobby American Israel Public Affairs Committee address speech Barack Obama the president Obama

TextInfer 2011 – Bar Ilan University 3 Lexical-level systems are very handy Important component within a full inference system Pose hard-to-beat baselines –(Mirkin et. al 2009, Majumdar and Bhattacharyya 2010) Can be used in cases where there are no deep analysis tools for target language –e.g. no parser

TextInfer 2011 – Bar Ilan University 4 The presidents car got stuck in Ireland, surrounded by many people Obamas Cadillac got stuck in Dublin in a large Irish crowd social group Modeling entailment at the lexical level

TextInfer 2011 – Bar Ilan University 5 Mostly heuristic: Percent covered/un-covered –(Majumdar and Bhattacharyya, 2010, Clark and Harrison, 2010) Similarity estimation –(Corley and Mihalcea, 2005; Zanzotto and Moschitti,2006) Vector space –(MacKinlay and Baldwin, 2009) Lexical entailment scores

TextInfer 2011 – Bar Ilan University 6 The presidents car got stuck in Ireland, surrounded by many people Obamas Cadillac got stuck in Dublin in a large Irish crowd social group Terminology rule lexical resource chain rule 2 rule 1

TextInfer 2011 – Bar Ilan University 7 The presidents car got stuck in Ireland, surrounded by many people Obamas Cadillac got stuck in Dublin in a large Irish crowd social group Goal – a probabilistic model 1.Distinguish resources reliability levels 2.Consider transitive chains length 3.Consider multiple evidence Addressing:

TextInfer 2011 – Bar Ilan University 8 Entailment validation process t1t1 tmtm titi h1h1 hnhn hjhj t chain … … …… A hypothesis is entailed if all its terms are entailed A single term is entailed if at least one of its evidence is a valid entailment chain A chain is valid if all its rule steps are valid The validity of a rule depends on the reliability of the resource which provided it

TextInfer 2011 – Bar Ilan University 9 Probabilistic model for Lexical Entailment t1t1 tmtm titi h1h1 hnhn hjhj t AND OR chain … … …… validity prob. of a rule step r is the reliability of the resource R(r) which suggested it EM to estimate parameter set if entailment holds

TextInfer 2011 – Bar Ilan University 10 Lets try a concrete example The presidents car got stuck in Ireland, surrounded by many people Obamas Cadillac got stuck in Dublin in a large Irish crowd social group * numbers in blue are parameter values found by our model

TextInfer 2011 – Bar Ilan University 11 Results on RTE are nice, but… F 1 % Model RTE 6RTE Avg. of all systems Base Prob Best lexical system Best full system F1F1

TextInfer 2011 – Bar Ilan University 12 Extension 1: relaxing with noisy-AND noisy- final AND gate demands the entailment of all hypothesis terms sentence level entailment is possible even if not all terms are entailed this strict demand is especially unfair for longer hypotheses

TextInfer 2011 – Bar Ilan University 13 Better results with extension 1 F 1 % Model RTE 6RTE Avg. of all systems Base Prob Base Prob. + noisy-AND Best lexical system Best full system * * * significant improvement over base prob. according to Mc-Nemars test with p<0.01 F1F1

TextInfer 2011 – Bar Ilan University 14 Extension 2: terms independence assumption uncovered term covered term As T covers more terms of H – our belief in each rule application increases

TextInfer 2011 – Bar Ilan University 15 Same (better) results with extension 2 F 1 % Model RTE 6RTE Avg. of all systems Base Prob Base Prob. + noisy-AND Base Prob. + coverage normalization Best lexical system Best full system * * F1F1

TextInfer 2011 – Bar Ilan University 16 Putting it all together is best Negative result: F1 usually decreases when allowing chains * * F1F1

TextInfer 2011 – Bar Ilan University 17 Summary Learns for each lexical resource an individual reliability value Considers multiple evidence and chain length Two extensions which brings us to… Performance is in line with best entailment systems A probabilistic model: noisy-

TextInfer 2011 – Bar Ilan University 18 Future work Better model for transitivity noisy-AND for chains too Verify rule application in a specific context next talk by Shachar Mirkin Test with other application data sets passage retrieval for QA Integrate into a full entailment system Thank you!