Discourse Mode Identification in Essays


Discourse Mode Identification in Essays
Wei Song, Capital Normal University
In cooperation with Dong Wang, Ruiji Fu, Lizhen Liu, Ting Liu, and Guoping Hu
IFLYTEK Research and Harbin Institute of Technology

Outline
- Discourse Modes
- Data Annotation
- Discourse Mode Identification
- Essay Scoring with Discourse Modes
- Conclusion

Discourse Modes
- Discourse modes, also known as rhetorical modes, describe the purpose and conventions of the main kinds of language-based communication.
- Several taxonomies of discourse modes have been proposed in the literature.

Taxonomies of Discourse Modes
- Discourse modes by C. Smith, who studies discourse passages from a linguistic point of view:
  - Narration
  - Description
  - Argument
  - Information
  - Report

Taxonomies of Discourse Modes
- Discourse modes in rhetoric:
  - Narration
  - Description
  - Argumentation
  - Exposition

Taxonomies of Discourse Modes
- Discourse modes in Chinese composition:
  - Narration
  - Description
  - Argument
  - Exposition
  - Emotion Expressing

Functions of Discourse Modes in a Text
- Different discourse modes work together to give a text unity.
  - They reflect the organization and progression of a text.
  - They indicate the intention behind a passage.
- Discourse modes have rhetorical significance.
  - Different modes prefer different expressive styles.
  - Multiple discourse modes can be combined flexibly.

Research Questions
- Discourse mode identification is a fundamental but little-studied problem in NLP.
- Can we annotate a corpus with acceptable agreement?
- Can discourse modes be identified automatically?
- Can discourse mode identification help downstream NLP tasks?

Discourse Modes in This Work
- We follow the Chinese convention.
- Narration introduces an event or a series of events.
- Exposition explains, instructs, or provides background information in a narrative context.
- Description re-creates, invents, or vividly shows what things are like.
- Argument states a point of view on a topic and argues for its validity.
- Emotion Expressing presents the writer's emotions, usually in a subjective, personal, and lyrical way.

Data
- We collected 415 narrative essays written by high school students in their native language, Chinese.
- Essays contain 32 sentences and 670 words on average.
- Two annotators were asked to label the discourse modes of each sentence.
- A sentence can have more than one discourse mode, but a dominant mode must be indicated.

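Inter-Annotator Agreement on the Dominant Mode
- 50 essays were annotated independently by two annotators.
- Agreement is measured by precision, recall, and F1 (PRF) and by Kappa.
- Example: “父亲的爱是灯塔,引导我一生前进的路!” ("A father's love is a lighthouse that guides the road ahead throughout my life!")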
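To make the Kappa measurement concrete, here is a minimal sketch of Cohen's kappa for two annotators' dominant-mode labels. The slide only says "Kappa", so the choice of Cohen's variant is an assumption, and the toy labels are made up for illustration rather than taken from the annotated corpus.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same sentences."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: fraction of sentences with identical labels.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement computed from each annotator's label marginals.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


# Toy dominant-mode labels (hypothetical, not from the corpus).
ann_1 = ["narration", "narration", "description", "emotion", "argument", "narration"]
ann_2 = ["narration", "description", "description", "emotion", "argument", "narration"]
kappa = cohen_kappa(ann_1, ann_2)
print(round(kappa, 3))
```

Kappa corrects raw agreement for the agreement expected by chance, which matters here because the mode distribution is skewed toward a few frequent labels.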

Distribution of Discourse Modes
- The distribution of discourse modes is imbalanced.

Co-Occurrence
- 22% of sentences have more than one discourse mode.
- Description tends to co-occur with narration and emotion:
  - providing details of events
  - evoking emotions
- Emotion co-occurs with argument:
  - proper emotional appeals can strengthen an argument
- 海上生明月,天涯共此时。 ("The bright moon rises over the sea; from the ends of the earth, we share this moment.")

Transitions
- Most modes tend to transition to themselves, so adjacent sentences often share a mode.
- Contextual information should therefore be helpful.
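The transition pattern can be estimated by counting adjacent mode pairs within each essay. The sketch below assumes one dominant mode per sentence; the function name and the toy sequences are hypothetical and only illustrate the counting.

```python
from collections import Counter, defaultdict


def transition_probs(essays):
    """Estimate P(next mode | current mode) from per-essay mode sequences."""
    counts = defaultdict(Counter)
    for modes in essays:
        # Pair each sentence's mode with the mode of the following sentence.
        for cur, nxt in zip(modes, modes[1:]):
            counts[cur][nxt] += 1
    return {
        cur: {nxt: c / sum(nxts.values()) for nxt, c in nxts.items()}
        for cur, nxts in counts.items()
    }


# Toy sequences of dominant modes (hypothetical, not from the corpus).
toy_essays = [
    ["narration", "narration", "description", "narration", "emotion"],
    ["narration", "narration", "narration", "argument", "argument"],
]
probs = transition_probs(toy_essays)
print(probs["narration"])
```

A large self-transition probability on the diagonal is what motivates using neighboring sentences as context when labeling a sentence.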

Summary
- Annotators can achieve acceptable agreement after training.
- About 22% of sentences have more than one discourse mode.
- The distribution of discourse modes is imbalanced.
- Discourse modes have local transition patterns.

Discourse Mode Identification
- We view the task as a multi-label sequence labeling problem.
- Pre-trained embeddings

Discourse Mode Identification
- Dealing with multi-label outputs
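The slide's figure is not preserved in this transcript, so the exact output layer is unknown. One common way to handle multi-label outputs, shown here purely as an assumption and not as the authors' method, is an independent sigmoid per mode with a threshold, falling back to the highest-scoring mode so that every sentence keeps a dominant label. The `MODES` order, the threshold, and the logits are hypothetical.

```python
import math

# Hypothetical label order; the five modes themselves come from the slides.
MODES = ["narration", "exposition", "description", "argument", "emotion"]


def decode_modes(logits, threshold=0.5):
    """Turn per-mode logits into (dominant mode, list of all predicted modes)."""
    probs = [1.0 / (1.0 + math.exp(-z)) for z in logits]
    # Every mode whose probability clears the threshold is predicted.
    chosen = [m for m, p in zip(MODES, probs) if p >= threshold]
    # The dominant mode is the single highest-probability mode.
    dominant = MODES[max(range(len(MODES)), key=probs.__getitem__)]
    if not chosen:
        chosen = [dominant]  # guarantee at least one label per sentence
    return dominant, chosen


dominant, modes = decode_modes([2.0, -3.0, 0.4, -1.5, -0.2])
print(dominant, modes)
```

Independent sigmoids let a sentence carry narration and description at the same time, which a single softmax over modes would not allow.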

Discourse Mode Identification
- Considering paragraph boundaries

Evaluation
- Comparisons:
  - SVM with unigram and bigram features
  - CNN (Kim 2014)
  - GRU
  - GRU-GRU (GG): our hierarchical model
  - GRU-GRU-SEG (GG-SEG): considers paragraph boundaries on top of GG

Evaluation
- F1-score is reported.
- Neural models outperform the bag-of-words method.
- The RNN is slightly better than the CNN.
- Sequence information is useful.
- Minority modes are more sensitive to positions.
- Overall average F1 is 0.7.
- Average F1 on the three main modes is above 0.76.
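As a reminder of how a per-mode F1 can be computed for multi-label predictions, here is a small sketch. The slides do not say how the overall average is formed, so the unweighted (macro) average below is an assumption, and the gold and predicted label sets are made up.

```python
def per_mode_f1(gold, pred, modes):
    """F1 for each mode, treating every mode as its own binary task."""
    scores = {}
    for mode in modes:
        tp = sum(mode in g and mode in p for g, p in zip(gold, pred))
        fp = sum(mode not in g and mode in p for g, p in zip(gold, pred))
        fn = sum(mode in g and mode not in p for g, p in zip(gold, pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the mode never occurs.
        scores[mode] = 2 * tp / denom if denom else 0.0
    return scores


# Toy per-sentence label sets (hypothetical).
gold = [{"narration"}, {"narration", "description"}, {"emotion"}, {"narration"}]
pred = [{"narration"}, {"narration"}, {"emotion", "description"}, {"description"}]
f1 = per_mode_f1(gold, pred, ["narration", "description", "emotion"])
macro_f1 = sum(f1.values()) / len(f1)
print(f1, round(macro_f1, 3))
```

Reporting F1 per mode, rather than accuracy alone, keeps minority modes visible when the label distribution is imbalanced.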

Automatic Essay Scoring (AES)
- AES is the task of building a computer-aided scoring system in order to reduce the involvement of human raters.
- AES as a regression problem:
  - Support Vector Regression
  - Bayesian linear ridge regression (BLRR)

Feature Sets
- Basic features (Phandi et al. 2015):
  - Length features
  - Prompt features
  - Content features
  - Selected unigrams and bigrams
  - The number of Chinese idioms
  - The number of words in the Chinese Proficiency Test 6 dictionary
- Discourse mode features:
  - Discourse mode ratio: #sentences with the discourse mode / #sentences
  - Unigrams and bigrams of discourse mode sequences
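The discourse mode features above can be sketched as a small feature extractor. This version assumes each sentence is represented by its dominant mode only; the feature-name prefixes (`ratio:`, `uni:`, `bi:`) and the toy sequence are hypothetical.

```python
from collections import Counter


def discourse_mode_features(mode_seq, modes):
    """Mode ratios plus unigram and bigram counts over a mode sequence."""
    n = len(mode_seq)
    feats = {}
    # Ratio: #sentences with the mode / #sentences.
    for m in modes:
        feats[f"ratio:{m}"] = mode_seq.count(m) / n if n else 0.0
    # Unigrams of the mode sequence.
    for m, c in Counter(mode_seq).items():
        feats[f"uni:{m}"] = c
    # Bigrams of adjacent modes capture local transitions.
    for (a, b), c in Counter(zip(mode_seq, mode_seq[1:])).items():
        feats[f"bi:{a}>{b}"] = c
    return feats


ALL_MODES = ["narration", "exposition", "description", "argument", "emotion"]
toy_seq = ["narration", "narration", "description", "emotion"]
feats = discourse_mode_features(toy_seq, ALL_MODES)
print(feats)
```

The resulting dictionary can be vectorized and appended to the basic features before fitting the regression model.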

Data and Settings
- Three prompts
- Narrative essays written by junior school students in local tests
- 5-fold cross-validation
- Evaluated with Quadratic Weighted Kappa (QWK)
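For readers unfamiliar with QWK, the following is a minimal sketch of the standard formulation for integer scores. The score range and the toy human and system scores are assumptions for illustration, not the evaluation data.

```python
def quadratic_weighted_kappa(human, system, min_score, max_score):
    """Quadratic Weighted Kappa between two lists of integer scores."""
    if max_score <= min_score:
        raise ValueError("score range must contain at least two values")
    k = max_score - min_score + 1
    n = len(human)
    # Observed confusion matrix between human and system scores.
    observed = [[0] * k for _ in range(k)]
    for h, s in zip(human, system):
        observed[h - min_score][s - min_score] += 1
    hist_h = [sum(row) for row in observed]
    hist_s = [sum(observed[i][j] for i in range(k)) for j in range(k)]
    num = den = 0.0
    for i in range(k):
        for j in range(k):
            weight = (i - j) ** 2 / (k - 1) ** 2  # quadratic penalty
            expected = hist_h[i] * hist_s[j] / n  # chance-level count
            num += weight * observed[i][j]
            den += weight * expected
    return 1.0 - num / den if den else 1.0


# Toy scores on a 1-4 scale (hypothetical).
human_scores = [1, 2, 3, 4, 4]
system_scores = [1, 2, 3, 4, 3]
qwk = quadratic_weighted_kappa(human_scores, system_scores, 1, 4)
print(round(qwk, 3))
```

The quadratic weights penalize a system score that is far from the human score more heavily than a near miss, which suits ordinal essay scores.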

Evaluation
- Overall performance:
  - BLRR (Bayesian linear ridge regression) performs better.
  - Discourse mode features are useful.

Evaluation
- Pearson correlation coefficient between discourse mode ratio and scores:
  - Narration has a negative correlation.
  - Description is the most relevant.
  - Emotion expressing has a weak correlation.
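The correlation analysis can be reproduced in outline with a plain Pearson coefficient between one mode's ratio and the essay scores. The ratios and scores below are invented toy values, so the resulting number says nothing about the actual data.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Toy description ratios and essay scores (hypothetical).
description_ratio = [0.1, 0.2, 0.3, 0.4]
essay_scores = [2, 3, 3, 5]
r = pearson(description_ratio, essay_scores)
print(round(r, 3))
```

Running this once per mode gives one coefficient per discourse mode, which is the kind of comparison the slide summarizes.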

Evaluation
- Performance on essays of different lengths:
  - When the effect of length becomes weaker, AES becomes harder.
  - In hard cases, discourse mode features become more important.

Conclusion
- We have studied a fundamental but little-studied problem in NLP.
- Both manual and automatic discourse mode identification are feasible.
- Discourse mode features are shown to be useful for automatic essay scoring.
- Discourse mode identification can potentially support other downstream NLP applications.

Thank you

Main References
- Carlota S. Smith. 2003. Modes of Discourse: The Local Structure of Texts, volume 103. Cambridge University Press.
- Cleanth Brooks and Robert Penn Warren. 1958. Modern Rhetoric. Harcourt, Brace.
- Yoon Kim. 2014. Convolutional neural networks for sentence classification. In Proceedings of EMNLP 2014, pages 1746–1751.
- Peter Phandi, Kian Ming A. Chai, and Hwee Tou Ng. 2015. Flexible domain adaptation for automated essay scoring using correlated linear regression. In Proceedings of EMNLP 2015, pages 431–439.