Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.

Slides:



Advertisements
Similar presentations
Qualitative methods - conversation analysis
Advertisements

Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Critical Thinking Course Introduction and Lesson 1
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Learning Targets Helping Students Aim for Understanding in Every Lesson! Part II.
HCC class lecture 6 comments John Canny 2/7/05. Administrivia.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
Math 105: Problem Solving in Mathematics. Course Description This course introduces students to the true nature mathematics, what mathematicians really.
LANGUAGE LEARNING STRATEGIES
Unit 1 – Understanding Non-Fiction and Media Texts
Introduction to linguistics II
OSSLT – March 27 th, 2014 You must pass the Literacy Test in order to graduate The passing mark is 300 = 75% You have already had lessons on the main writing.
Language and Technology How does technology augment, constrain and simulate person to person communication?
Literature Review and Parts of Proposal
VCE Learning. To unpack the challenge of enhancing the quality of VCE learning What does the student need to know about how to interpret the task ? Ho.
Cultural Practices of Reading II. Cultural Practices of Reading Goal: To teach rhetorical reading strategies of complex, culturally situated texts.
ELA Common Core Shifts. Shift 1 Balancing Informational & Literary Text.
Breaking Down the Prompt
The Why, What, When, How, and How- to of Book Clubs
Day 3. Standards Reading: 1.0 Word Analysis, Fluency, and Systematic Vocabulary Development- Students apply their knowledge of word origins to determine.
A useful guide from the Chief Examiner. June Paper Q1a) Describe the ways in which your production work was informed by research into real media.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Literacy I can recall main info, know where to look for it, make inferences linked to evidence, show awareness of characters’ intentions, adapt speech.
Good Morning! Let’s start with some science!. What’s our goal here? Provide an example of a science lesson with literacy and dialogue strategies integrated.
RPDP Secondary Literacy     Southern Nevada Regional Professional Development Program RPDP.net.
Discourse and Genre. What is Genre? Genre – is an activity that people engage in through the use of language. Two types of genre 1. Spoken genres – academic.
Chapter Thirteen Rhetorical and Critical Analyses: Understanding Text And Image In Words.
1 Technical Communication A Reader-Centred Approach First Canadian Edition Paul V. Anderson Kerry Surman
Standards Of Textuality And Speech Acts.
Techniques for Highly Effective Communication Professional Year Program - Unit 5: Workplace media and communication channels.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
ACE TESOL Diploma Program – London Language Institute OBJECTIVES You will understand: 1. The terminology and concepts of semantics, pragmatics and discourse.
The Main Non-linguistic Obstacles in Interpretation Mashael Al-Mutairi College of Languages & Translation Princess Nora Bint Abdurrahman University.
OSSLT Test Preparation
Second Language Acquisition
©2011 1www.id-book.com Data Gathering Chapter 7. ©2011 Data Gathering What is data gathering? –The act of gathering data through a study The data can.
Carolyn Penstein Rosé Language Technologies Institute and Human-Computer Interaction Institute.
Thursday October 3, 2013  Write in your agenda:  Turn in Response Logs  Read “Milton Hershey” levels of questioning  Homework: Read for AR at least.
Machine Learning in Practice Lecture 5 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
 explain expected stages and patterns of language development as related to first and second language acquisition (critical period hypothesis– Proficiency.
 Presented by:  Dr.Cherry Steffen  Kennesaw State University.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Objectives: Determine the meaning of words and phrases as they are used in the text. RL.4.4 Identify key ideas and details in a story. RL.4.2 Unit: 2 Lesson:
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
How to structure good history writing Always put an introduction which explains what you are going to talk about. Always put a conclusion which summarises.
Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Machine Learning in Practice Lecture 6 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Discourse Analysis Week 10 Riggenbach (1999) Chapter 1 - Quotes.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
 Write about something interesting you saw on TV or the Internet, or something you heard on the radio in the last week.
Machine Learning in Practice Lecture 21 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
EXAMINERS’ COMMENTS RAPHAEL’S LONG TURN GRAMMAR Accurate use of simple grammatical structures and also of some complex sentences: ‘they could also be preparing.
Aristotel‘s concept to language studies was to study true or false sentences - propositions; Thomas Reid described utterances of promising, warning, forgiving.
PHI 208 Course Extraordinary Success tutorialrank.com
Qualitative Data Analysis
Computational Models of Discourse Analysis
PRAGMATICS 3.
Your Standards TODAY’S FLOW MORNING: Standards & 1st Unit Curriculum
Computational Models of Discourse Analysis
Core Course Knowledge Lesson 6
They Say, I Say Chapter 1 and 12
Core Course Knowledge Lesson 6
Computational Models of Discourse Analysis
Information Retrieval
The Invisible Process to help with analysis:
9th Literature EOC Review
Practical Team Sports. UNIT OUTCOMES 1. Understand the skills, techniques and tactics required in contrasting team sports. 2. Understand the rules and.
Presentation transcript:

Computational Models of Discourse Analysis Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute

Warm-Up Can you explain speech acts from Gee’s perspective (using his terms)?  i.e., what is a speech act? Can you explain speech acts from Martin and Rose’s perspective (using their terms)?

Note that we can use work like Levenson’s Differently We talked about Gee’s 42 questions as being like feature extractors Martin & Rose’s systems give us a map, showing us where to look for evidence (almost the same?) Levinson’s work raises questions – shows us distinctions that need to be accounted for, potential holes in our approach

Levinson helps us think in detail on a technical level about how particular words are functioning… According to the performative hypothesis, “frankly” should be functioning the same way in all of these utterances, but it’s not. In 50 and 52 it modifies the way the message is told, in 51 it’s a warning that something negative is coming. What does this tell us about using words as evidence in Pragmatic oriented interpretation?

Which accomplishes a bet and how do you know? * What does that tell you about how to model speech acts computationally?

Plan for Upcoming Unit For next time we will pass out examples from AMI and coding manual By Wed I will link in the readings for Unit 2 Next Monday I will hand out the annotated corpus and documentation for SIDE plugins

Student Comment I'm not sure we can analyze a text using our current knowledge of speech acts. We can only determine if there are any speech acts and where specifically they occur. Speech act theory as we've discussed it so far doesn't include any analytical feature beyond sentence deconstruction, unlike Gee's and SFL methodologies.

Can you pass the salt? It’s hot in here. How do you know what to do?

Digging in to the roots… Gee: Anthropology Martin & Rose: Rhetoric and Literary Theory Levinson: Philosophy and Logic

What’s the connection? Gee: Anthropology  Speech acts are kinds of discourse practices, there are conventional ways of doing things with words as indicated through form-function correspondences  Figured worlds set up conditionally relevant speech acts (adjacency pairs) Martin & Rose: Rhetoric and Literary Theory  The Negotiation framework is a very simplified system of speech acts

Student Questions I think that the main difference between speech act analysis and the other types of analyses that we have covered thus far is that it explicitly asks the analyst to consider a certain text’s many voices—its heteroglossia why would someone want to talk if they are not trying to achieve something (consciously, or unconsciously)?

Please clarify… Throughout the conversation, Jim uses illocutionary force to direct the flow of conversation. From "go ahead" (2) to the series of probing questions (12, 15, 23, 33, 39, 57, 64, 82), his statements range from clearly pushing the conversation ahead to something that is closer to bordering on factual yes/no questions. Even as he gets more into informational questions, near the end of the dialogue, these are still compelling Bonnie to continue her explanations.

What can we do with speech acts? Recognize what actions users are doing in a natural language based interface Sometimes mixed with domain level frames to detect what someone wanted in a task based dialogue system (scheduling a meeting, registering for a conference, making an airplane reservation) If clusters of speech acts are associated with roles, you can use them to identify roles within an interaction  Current work on meeting summarization Some work on “social positioning” in the mixed- initiative dialogue community in the 90s

What questions can we answer with them? You could think of them as moves in a game (like chess)  Each move is part of a strategy  Moves work together to accomplish intentions  But each speaker has their own set of intentions – in some sense they are competing  You can explain what strategies were effective or not for accomplishing any of these intentions  From this analysis, you can conclude things about power, positioning, influence, etc.  Why might someone be insulted when you politely explain something to them? You can talk about different social languages used to enact speech acts (e.g., direct versus indirect)

What do we gain from this diversity? Gee: Anthropology Martin & Rose: Rhetoric and Literary Theory Levinson: Philosophy and Logic * In some ways our goals are different from all of these. But all of them have insights into how language works. Anthropologists know how it functions in societies. Rhetoriticians know how it functions in interpersonal relationships. And logicians have thought about how humans are able to interpret language from the available evidence.

Why all the focus on truth conditions? Formal semantics is based on logic Evaluating meaning of a sentence is: translating the sentence into logic, evaluating it within a model Models define what is given and what inferences can be made Evaluating meaning in a model means evaluating whether the model makes that sentence true or not

What is Pragmatics? Pragmatics is what is beyond truth conditional meaning Part of the chapter focuses on whether there is anything beyond truth conditional meaning If you accept that there is something beyond that, one thing there is is what language does, apart from what it means Formal pragmatics is about building models that allow us to compute what language does

Speech act inventories

Early Computational Approaches

AMI Annotation Scheme

Later Dialogue Act Tagging Approaches Text classification based approaches Things you should know:  Using Naïve Bayes, SVM, HMMs, CRFs Note that HMMs and CRFs are statistical techniques that pick up on sequencing Conditional probability P(X|Y): If you know Y is true already, what is the probability that X is true? If probabilities are independent, you can combine them by multiplying  Features: Unigrams, bigrams, POS bigrams

What to think about as you read? What distinguishes the set of speech acts that are being focused on? What evidence would you as a human use to make your choice? What evidence is this computational approach taking into account? What is it missing? Can you think of examples that you think it would miss?

Corpora for experimentation Unit 2: Maptask data (Negotiation coding)  Possibly other chat corpora with same coding as well Unit 3: Product Reviews (Sentiment) Unit 4: Blog corpus (Age and Gender) Unit 5: AMI meeting corpus (Dialogue Acts) Other corpora  discussion list (Social Support coding)

Assignment 1 (not due til Feb2) Transcribe a scene from a favorite move, play, or TV show As a shortcut, you can find a script online Excerpt should be no more than one page of text Select one of the methodologies we are discussing in Unit 1 (e.g., from Gee, Martin & Rose, or Levinson) Do a qualitative analysis of the data and write it up Use readings from Unit 1 as a collection of models to chose from Due on Week 4 lecture 2 Turn in data, raw analysis (can be annotations added to the data), and write up (your interpretation of the analysis) Not required now!! Prepare a powerpoint presentation for class (no more than 5 minutes of material) Other Ideas: Twitter data, Google Groups, transcribe a real conversation (if your conversational partners agree…)

Questions?