1 Matthieu Hermet, Stan Szpakowicz Automated Analysis of Students Free-text Answers for Computer- Assisted Assessment University of Ottawa, Canada.

Slides:



Advertisements
Similar presentations
Ernst Oberortner Vienna University of Technology.
Advertisements

Parsing V: Bottom-up Parsing
Software Requirements
Jeremy S. Bradbury, James R. Cordy, Juergen Dingel, Michel Wermelinger
Artificial Intelligence: Natural Language and Prolog
Client Logo 1© The Delos Partnership 2006 January 2006 LEAN ENTERPRISE Implementation Workshop.
Chapter 6 Writing a Program
Building a Knowledge Management System as a Life Cycle
OLIF V2 Gr. Thurmair April OLIF April 2000 OLIF: Overview Rationale Principles Entries Descriptions Header Examples Status.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Introduction to Product Family Engineering. 11 Oct 2002 Ver 2.0 ©Copyright 2002 Vortex System Concepts 2 Product Family Engineering Overview Project Engineering.
1 Welcome Safety Regulatory Function Handbook April 2006.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
1 PHARE Operational Scenarios J-P. Nicolaon, Operational Task Force Chairman EUROCONTROL Experimental Centre.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 2 Q 3 Q 4 Q 5 Q 6Q 16Q 11Q 21 Q 7Q 12Q 17Q 22 Q 8Q 13Q 18 Q 23 Q 9 Q 14Q 19Q 24 Q 10Q 15Q 20Q 25 Final Jeopardy Writing Terms.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Query optimisation.
Chapter 3 Critically reviewing the literature
Unité 3 Leçon oui! Yes! 2. mais oui! Sure! 3. Bien sûr! Of course! 4. Non! No 5. Mais non! Of course not! 6. Peut-être Maybe 7. Pierre est….. Pierre.
Turn your students into first time authors! You want your students to take risks, increase their fluency, perfect their written skills and still remain.
1. 2 Chairing a meeting… 3 OPENING Lets get started Lets start Shall we start Lets get down to work.
Questions d’ information
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 14 Slide 1 Object-oriented Design 1.
Knowledge Extraction from Technical Documents Knowledge Extraction from Technical Documents *With first class-support for Feature Modeling Rehan Rauf,
Chapter 2-2 A Simple One-Pass Compiler
Lesson 6 CDT301 – Compiler Theory, Spring 2011 Teacher: Linus Källberg.
Configuration management
Software change management
ABC Technology Project
TU e technische universiteit eindhoven / department of mathematics and computer science 1 Empirical Evaluation of Learning Styles Adaptation Language Natalia.
Although, but, however All of these words join clauses in sentences, but they are different parts of speech. This presentation explains the impact of the.
1 Evaluations in information retrieval. 2 Evaluations in information retrieval: summary The following gives an overview of approaches that are applied.
Software Requirements
Machine Translation II How MT works Modes of use.
Lecture 1: Software Engineering: Introduction
Machine Learning: Intro and Supervised Classification
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 4 Slide 1 Software processes 2.
Chapter 11 Software Evolution
What is Word Study? PD Presentation: Union 61 Revised ELA guide Supplement (and beyond)
Addition 1’s to 20.
25 seconds left…...
Test B, 100 Subtraction Facts
©Ian Sommerville 1995/2000 (Modified by Spiros Mancoridis 1999) Software Engineering, 6th edition. Chapters 1,3 Slide 1 Software Engineering Software Engineering.
Week 1.
1 Week 9 Questions / Concerns Hand back Test#2 What’s due: Final Project due next Thursday June 5. Final Project check-off on Friday June 6 in class. Next.
We will resume in: 25 Minutes.
14-1 © Prentice Hall, 2004 Chapter 14: OOSAD Implementation and Operation (Adapted) Object-Oriented Systems Analysis and Design Joey F. George, Dinesh.
1 Programming Languages (CS 550) Mini Language Interpreter Jeremy R. Johnson.
Introduction.  “a technique that enables the computer to encode complex grammatical knowledge such as humans use to assemble sentences, recognize errors.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Automatically Extracting and Verifying Design Patterns in Java Code James Norris Ruchika Agrawal Computer Science Department Stanford University {jcn,
1.3 Executing Programs. How is Computer Code Transformed into an Executable? Interpreters Compilers Hybrid systems.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Chapter 1 Introduction Dr. Frank Lee. 1.1 Why Study Compiler? To write more efficient code in a high-level language To provide solid foundation in parsing.
Parser-Driven Games Tool programming © Allan C. Milne Abertay University v
Unit-1 Introduction Prepared by: Prof. Harish I Rathod
1 Compiler Design (40-414)  Main Text Book: Compilers: Principles, Techniques & Tools, 2 nd ed., Aho, Lam, Sethi, and Ullman, 2007  Evaluation:  Midterm.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
© University of Manchester Creative Commons Attribution-NonCommercial 3.0 unported 3.0 license Quality Assurance, Ontology Engineering, and Semantic Interoperability.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
WP4 Models and Contents Quality Assessment
Compiler Design (40-414) Main Text Book:
SOFTWARE DESIGN AND ARCHITECTURE
Presentation transcript:

1 Matthieu Hermet, Stan Szpakowicz Automated Analysis of Students Free-text Answers for Computer- Assisted Assessment University of Ottawa, Canada

2 CAA for CALL To address the specificity of CALL… where student material contains syntactic and orthographic errors …with minimal pre-encoded material : –Content validation : simple –Form validation : difficult A good case for automating based on Natural Language Processing

3 Text comprehension The uOttawa project: CALL solutions for helping French-as-a-Second-Language students to enhance autonomous reading comprehension master the structure of text in order to understand the authors discursive intention guess the meaning of unknown words develop reformulation and synthesis capabilities

4 DidaLect …is a FSL tool aimed at teaching autonomous reading skill (designed for intermediate- and advanced-level students) Is an instance of eLearning Intelligent Tutoring System: –adaptation to individual students skills and agenda –access to external resources (dictionaries) –built to reflect the cognitive concerns such as matching feedback to the students behaviour

5 Intelligence in DidaLect DidaLect begins its operation with a placement test to determine a students initial level: –varying order of questions to pick up the best of a students skill –the implementation includes fuzzy logic methods A separate element of DidaLect is the processing of free-text answers: –need of a robust CAA component –a trade-off between symbolic processing and Machine Learning techniques

6 Free-text answer assessment The problem is to know in advance what material to expect in student answers. Usually implemented as a classification problem: a student answer must match reference answer(s). Size and form of reference material affects the process Here, a reference answer is the text itself A case for trying symbolic processing using techniques of Computational Linguistics

7 Expected limitations No possibility of modifying the size and form of the reference material, except by automatic processing to control reformulation. Therefore, this only works for limited forms of questions. –Strong need to ground selection of questions in a firm didactic theory. –Questions on texts for Text Comprehension (didactics offers a classification of question types).

8 Question types 1 Text-Explicit: based on a single sentence « d'habitude, l'hermaphrodisme frappe surtout les mâles, qu'elle dote de simulacres d'appareils génitaux féminins. » Q : Quarrive-t-il aux ours mâles lorsquils sont frappés dhermaphrodisme ? Ex R : « Ils ont les génitaux féminins » Text-Implicit : based on two co-referenced sentences « le détecteur de décélération situé à l'avant du véhicule génère instantanément un courant électrique, qui déclenche une amorce, qui elle-même enflamme un mélange allumeur. Ce dernier met finalement le feu à l'agent propulseur responsable du gonflement du coussin. » Q : Quelle est la réaction en chaîne qui se produit lorsque survient un impact ?

9 Question types 2 Identification, cause-effect, goal, comparison, definition, instrumental… These categories express linguistically through lexical connectors Goal : for, so that, in order to… Cause–Effect : because, therefore… So, the control of reformulation can be automated

10 Processing 1.Find lexical differences between the students answer S and reference R 2.Parse S and R, produce dependency relations 3.Process different words (using a dictionary) to detect synonyms 4.Control of syntax in S 5.Control of reformulation in S wrt R

11 Tools A robust parser that enables partial recovery from errors in students answer A dictionary of synonyms A derivational dictionary Locally derived resources: –State and action verbs –Ensemble of typical errors, set of syntactic and reformulation structures

12 Semantics and synonyms Examine word set differences and commonalities in search for: –Common words –Reformulated words –Different words Detect synonyms accross parts of speech: 1.Derive forms for a word lemma 2.Search synonyms for each form and look for a match in Word Sets

13 Syntax and reformulation Correct syntactical structures to verify syntax of students answer Lexicalized reformulation structures to verify discourse conformity Ex : pollution has increased with the rise of transportation Q : Why has pollution increased ? Ans : With the rise of transportationis partially wrong Because of the rise of transportation OR it has increased due to the rise of transportation ETC.

14 Parsing and tree-building S : Le cardio-vasculaire d'un rat s'approche à une personne humain. SUBJ(, ) OBJ(, ) VMOD_POSIT1(, ) NMOD_POSIT1(, ) PREPOBJ(, ) DETERM(, ) The above are incrementally recomposed, based on lexical selection that maximizes promise of discovering material which diverges from R. That material is processed in parallel, in a similar fashion: SUBJ(,, )>)>,, )>, )>)

15 Main types of heuristics To address syntactic correctness and/or equivalence between S and R: the same sense but different structures bank of typical errors and correct structures To address discursive variations, detected as supplementary material bank of state and action verbs: action verbs must be present, possibly reformulated To address, partially, errors in S word replacement to relaunch parsing when stopped due to lexical mistakes

16 Reformulation rules Examples : Abstraction : incidence sur le temps de gestation incidence sur la possibilité davoir une gestation écourtée (words like fact, chance, etc.) Cause-Effect : Le plasma augmente et dilue les paramètres chimiques Laugmentation du plasma dilue les paramètres chimiques Is-A : Le rat est un animal qui + S Le rat + S Attribute : Le rat possède un système cardio- vasculaire Le système C-V du rat

17 Assessment Must give student feedback on: –Agreement and orthography –Syntax: signal errors and provides correction via display of a correct structure –Semantics: signals error and provides admissible words –Completeness of content with respect to R

18 Example R: Et puisque le rat est un animal qui possède un système cardio- vasculaire très semblable à celui de lhumain, il est donc permis de tirer les mêmes conclusions pour lhumain. Q: Pourquoi peut-on tirer les mêmes conclusions pour l'humain et pour le rat ? S: Le cardio-vasculaire dun rat sapproche à une personne humain. Start by creating wordlists : 1.Words of S absent in R sapprocher, personne 2.Words of R absent in S animal, posséder, système, semblable 3.Common words rat, cardio-vasculaire, humain

19 Parse (partial output) S: SUBJ(, ) OBJ(, ) VMOD_POSIT1(, ) NMOD_POSIT1(, ) PREPOBJ(, ) DETERM(, ) R: SUBJ(, ) OBJ_SPRED(, ) OBJ(, ) COREF_REL(, ) NMOD_POSIT1(, ) ADJMOD(, ) PREPOBJ(, ) DETERM(, ) DETERM_DEF(, ) CONNECT_REL(, )

20 Comparison (partial) approche^approcherpersonne^personne cardio-vasculaire^cardio- vasculairerat^rat SUBJ(, ) >,, )>) cardio-vasculaire^cardio- vasculairesemblable^semblable humain^humain rat^rat SUBJ(,,,, ) >,,, ) >)>)>) >, ) >) >)>, ) Consider structures to assess for syntactic correctness Heuristics to put some structures into equivalence : Here, «Le SCV du rat» and «Le rat est un animal qui possède un SCV» are equivalent, but expressed in different syntactic structures

21 Synonyms - semblable D C C0065 adj. - qui ressemble à; comparable, similaire. - ….. - approchant F E0074 To retrieve a synonymy relation : 1.Produce derivations for all words in List 1 2.Find matches in a synonymy basis under entries for words of List 2 3.The search process can be repeated at most once, using lexemes semblable = approchant OR ressembler à = sapprocher de 4.In this way we can catch both synonyms and attached prepositions.

22 Conclusions and future work Automation is possible, with 2 main restrictions : –« bad faith » answers –Lexical errors based on homonymy as long as S contains elements of answer, S can be evaluated Future Work : to assemble the parts through software engineering !