Presentation is loading. Please wait.

Presentation is loading. Please wait.

TagHelper and InfoMagnets Technologies for Exploring the effect of Language Interactions in Learning Carolyn Penstein Rosé, Jaime Arguello, Yue Cui, Rohit.

Similar presentations


Presentation on theme: "TagHelper and InfoMagnets Technologies for Exploring the effect of Language Interactions in Learning Carolyn Penstein Rosé, Jaime Arguello, Yue Cui, Rohit."— Presentation transcript:

1 TagHelper and InfoMagnets Technologies for Exploring the effect of Language Interactions in Learning Carolyn Penstein Rosé, Jaime Arguello, Yue Cui, Rohit Kumar, Emil Albright, Hao-Chuan Wang, Pinar Donmez, Cammie Williams, William Cohen Language Technologies Institute/ Human-Computer Interaction Institute/ Machine Learning Department Carnegie Mellon University

2 Tools for DataMining for Corpus Data TagHelper –Supervised text classification technology –Analysts define categories and tag example texts –Algorithms learn to generalize from training examples –Trained models assign categories to untagged data –Supports categorical corpus analysis InfoMagnets –Automatic initial segmentation and topic analysis –Interactive reclustering –Supports exploratory corpus analysis / sense making

3 Outline for this Demo Session Conceptual overview of data mining from corpus data (15 min) –What is it and what can you do with it? Demo of TagHelper (10 minutes) TagHelper activity (10 minutes) –Goal: Learn to train simple TagHelper classifiers to use in TuTalk dialogues Demo of InfoMagnets (10 minutes) InfoMagnets activity (15 minutes) –Goal: Gain experience with basic topic analysis technology

4 What is DataMining from Corpus Data? Goal: data reduction - identify meaningful patterns in freeform corpus data –Test recall versus recognition –Verify the effect of a manipulation on interaction or thought processes –Correlational analyses used for hypothesis formation Relevant Contexts: tutorial dialogue, collaborative learning, self-explanation, think aloud protocols

5 What is DataMining from Corpus Data? Basic Approach: transform freeform corpus data into a formal structure that can be analyzed using quantitative methods –Rating scales: e.g., depth of an explanation –Categorical Coding: e.g., self-explanation versus summarization Caveat: corpus analysis can be subjective –Reliability standards mitigate the risk of subjectivity in judgments

6 Motivation for TagHelper Project Social scientists, psychologists, and educational scientists code by hand large quantities of corpus data Tools currently used by behavioral researchers do not support decision making of human coders –e.g., MacSHAPA, NVivo, HyperResearch, etc. Text classification technology can support automatic prediction of codes for supporting data analysis tasks Automatic classification technology can also trigger on-line interventions in real time or process freeform student input

7 Example Research Context: Learning in On-Line Discussions Knowledge Media Research Center, Tuebingen Germany

8 Example Coding Scheme for Analyzing Collaborative Learning Interactions Original German: "Es ist seine Faulheit, aber mangelndes Talent würde auch passen" Translation: "It is his laziness, but lack of talent would also fit" Social Modes Code: Integration- Oriented Consensus Building Original German: "Es ist seine Faulheit, mangelndes Talent würde weniger passen" Translation: "It is his laziness, lack of talent would fit less well" Social Modes Code: Conflict- Oriented Consensus Building DimensionNumber of Classes Epistemic (EPI)35 Micro (ATOL)4 Macro (ALEI)6 Social Modes (SOC) 21 Reaction (REA)3 Appropriateness (PRO)4 Quoted (QUO)2 Knowledge Language Relational Style * Training + Coding by hand requires 25% of project resources!

9 Using Automatic Coding in an On-Line Intervention

10 Other TagHelper/InfoMagnets Applications Data Analysis –InfoMagnets style topic analyses reveals more and less effective student strategies (Kumar et al., 2006) –Topicality metrics predict on-line community behavior (Arguello et al., 2006) On-Line interventions –Triggers feedback to qualitative physics explanations offered as justifications for multiple choice answers (Ogilvie group, English data) –Trigger feedback for group and individual brainstorming (Wang et al., submitted, Chinese data)

11 Corpus Analysis Offerings (Tuesday) Basic DataMining from Corpus Data –Introduction to coding scheme design and protocol analysis –In depth walk through of basic TagHelper functionality Advanced Conversational DataMining with TagHelper –Feature space design –Tuning machine learning algorithms Exploratory Corpus Analysis with InfoMagnets –Conceptual discussion of topic segmentation and topic clustering technology –Using a topic analysis as part of Learning Science research

12 Contact Info: Carolyn Penstein Rosé cprose@cs.cmu.edu http://www.cs.cmu.edu/~cprose


Download ppt "TagHelper and InfoMagnets Technologies for Exploring the effect of Language Interactions in Learning Carolyn Penstein Rosé, Jaime Arguello, Yue Cui, Rohit."

Similar presentations


Ads by Google