Toward a Theory Relating Text Complexity, Reader Ability and Comprehension AERA New Orleans April 10, 2011 Jackson Stenner Chairman & CEO, MetaMetrics.

Slides:

Advertisements

Similar presentations

WHAT IS THE NATURE OF SCIENCE?

Advertisements

Individual Centered Growth PCRC San Diego, California February, 2012 A. Jackson Stenner Chairman & CEO, MetaMetrics Research Professor University of North.

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.

11 Simple Linear Regression and Correlation CHAPTER OUTLINE

1 Artifact Corrected Correlations between theoretical text complexity and empirical text complexity PCRC – San Diego February 7-10, 2013 A.Jackson Stenner.

The Scientific Method n See the problem n Look for the relevant variables n Construct a hypothesis, if possible n Create a research design n Collect data.

1 The Assumptions. 2 Fundamental Concepts of Statistics Measurement - any result from any procedure that assigns a value to an observable phenomenon.

Introduction to Communication Research

Total Quality Management BUS 3 – 142 Statistics for Variables Week of Mar 14, 2011.

Chapter 9 Flashcards. measurement method that uses uniform procedures to collect, score, interpret, and report numerical results; usually has norms and.

Regression Analysis. Regression analysis Definition: Regression analysis is a statistical method for fitting an equation to a data set. It is used to.

Hypothesis Testing in Linear Regression Analysis

1 Causal Rasch Models IOMW April 11-12, 2012 Vancouver, Canada A.Jackson Stenner Donald S. Burdick Mark H. Stone.

Chapter 3 An Overview of Quantitative Research

Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.

Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.

P Values Robin Beaumont 10/10/2011 With much help from Professor Chris Wilds material University of Auckland.

© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.

URBDP 591 I Lecture 3: Research Process Objectives What are the major steps in the research process? What is an operational definition of variables? What.

6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)

WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.

1 Causal Rasch Models and Individual Growth Trajectories National Center for the Improvement of Educational Assessment January 18, 2011 A.Jackson Stenner.

Perspectives on Text Complexity in 2011 February 23, 2011 Presented by: A. Jackson Stenner.

Question paper 1997.

1 How to Model and Test for the Mechanisms that make Measurement Systems Tick IMEKO Jena, Germany Wednesday, August 31, 2011 A.Jackson Stenner Chairman.

Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.

MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 22.

P Values Robin Beaumont 8/2/2012 With much help from Professor Chris Wilds material University of Auckland.

Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.

URBDP 591 I Lecture 4: Research Question Objectives How do we define a research question? What is a testable hypothesis? How do we test an hypothesis?

Latent regression models. Where does the probability come from? Why isn’t the model deterministic. Each item tests something unique – We are interested.

Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.

1 Collecting and Interpreting Quantitative Data Deborah K. van Alphen and Robert W. Lingard California State University, Northridge.

Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.

The Scientific Method. Scientifically Solving a Problem Observe Define a Problem Review the Literature Observe some More Develop a Theoretical Framework.

Research Methods Systematic procedures for planning research, gathering and interpreting data, and reporting research findings.

Research Design

Statistica /Statistics Statistics is a discipline that has as its goal the study of quantity and quality of a particular phenomenon in conditions of.

Stats Methods at IC Lecture 3: Regression.

Statistics & Evidence-Based Practice

Descriptive and Causal

Intro to Research Methods

Logic of Hypothesis Testing

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.

Ch. 2: The Simple Regression Model

Statistical Process Control

Statistics for Managers using Microsoft Excel 3rd Edition

Design (3): quasi-experimental and non-experimental designs

Evaluation of measuring tools: validity

Understanding Results

Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.

Overview Understanding What Science is, and What it isn’t

WHAT IS THE NATURE OF SCIENCE?

پرسشنامه کارگاه.

The Scientific Method in Psychology

Reliability and Validity of Measurement

Analyzing Reliability and Validity in Outcomes Assessment Part 1

Regression Analysis Week 4.

Statistical Methods For Engineers

9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE

Developing and Evaluating Theories of Behavior

Quantitative Methods in HPELS HPELS 6210

Frameworks for Considering Item Response Demands

Econometrics Analysis

Two Halves to Statistics

Analyzing Reliability and Validity in Outcomes Assessment

Data and Data Collection

A Causal Model for Relating Text Complexity, Reader Ability and Comprehension Pacific Coast Reading Conference February 3-6, 2011 Jackson Stenner Chairman.

Collecting and Interpreting Quantitative Data

Presentation transcript:

Toward a Theory Relating Text Complexity, Reader Ability and Comprehension AERA New Orleans April 10, 2011 Jackson Stenner Chairman & CEO, MetaMetrics jstenner@lexile.com

Three well researched constructs Reader ability Text Complexity Comprehension 3 well researched variables Let’s first put them together in a definition of reading, then put them in an equation. Our focus today will be on text complexity but it is one of three variables that cooperate in a causal model.

Reading is a process in which information from the text and the knowledge possessed by the reader act together to produce meaning. 50 years later we revisit one of Rasch’s favorite constructs Let’s put it all together – text, reader and comprehension Ternary Structure (temperature, pressure, volume) (force, mass, acceleration) Making meaning is comprehension Let’s build a foundation for the science underlying The Lexile Framework. Let’s start with a beautiful definition Text and Reader act together to make meaning How do we operationalize “act together” The Lexile Framework uses an equation that specifies how reader measure and text measures cooperate or “act together” to produce comprehension We use this equation to make text measures and reader measures. Anderson, R.C., Hiebert, E.H., Scott, J.A., & Wilkinson, I.A.G. (1985) Becoming a nation of readers: The report of the Commission on Reading Urbana, IL: University of Illinois

An Equation e Conceptual - = = 1 + e Text Reader Complexity Ability Comprehension Statistical e (RA – TC ) Raw Score i = 1 + e (RA – TC i ) i Operationalizes the definition *Imagining a text to be a test. Key innovation. Three inter-related but distinct constructs The familiar Rasch model with one important addition. Item calibrations come from theory. TCi’ s are either empirical or theoretical item calibrations (when making reader measures) or virtual item calibrations (when making text measures). Same equation is used to make reader measures and text measures!!! Major criticism is that this formulation is “just too simple.” We have amassed a staggering array of successful predictions using this “too simple” model. RA = Reading Ability TC = Text Calibrations 4

Eight Features of the Causal Model Relating Text Complexity, Reader Ability and Comprehension The model is individual centered. The focus is on explaining variation within persons over time. In this framework the measurement mechanism is well specified and can be manipulated to produce predictable changes in measurement outcomes (e.g. percent correct). Item parameters are supplied by substantive theory and, thus, person parameter estimates are generated without reference to or use of any data on other persons or populations. Therefore, effects of the examinee population have been completely eliminated from consideration in the estimation of person parameters for reader ability. Don’t need metaphysical baggage – I mean X causes Y if intervention on X produces predicable changes in Y. The trade-off Revisit equation slide. Explain intervention on RA-TC and nextemp

Eight Features of the Causal Model cont’d. The quantitivity hypothesis can be experimentally tested by evaluating the trade-off property for the individual case. A change in the person parameter can be off-set or traded-off for a compensating change in text complexity to hold comprehension constant. The trade-off is not just about the algebra. When uncertainty in item difficulties is too large to ignore, individual item difficulties may be a poor choice to use as calibration parameters in causal models. As an alternative we recommend, when feasible, averaging over individual item difficulties to produce “ensemble” means. For example empirical text complexities can be excellent dependent variables for testing causal theories.

Eight Features of the Causal Model cont’d. Causal Rasch models are individual centered and are explanatory at both within-subject and between-subject levels. The attribute on which I differ from myself a decade ago is the same attribute on which I differ from my brother today. When data fit a Rasch model, differences between person measures are objective. When data fit a causal Rasch model absolute person measures (reader abilities) are objective (i.e. independent of instrument). Causal Rasch models make possible the construction of generally objective growth trajectories. Each trajectory can be completely separated from the instruments used in its construction and from the performance of any other persons, whatsoever.

Text Demands for College and Career Student 1528 6th Grade Male Hispanic Paid Lunch May 2007 – Dec. 2009 284 Encounters 117,484 Words 2,894 Items 848 Minutes Text Demands for College and Career 1200 1000 1400 1600 May 2016 (12th Grade) 8

Mythology Text Complexity Theoretical: 1300L Empirical: 1357L Adapted from Oasis Article courtesy of EBSCO Publishing The study and interpretation of myth and the body of myths of a particular culture. Myth is a complex cultural phenomenon that can be approached from a number of viewpoints. As generally understood, a myth is a story or narrative that is traditional in a certain culture, having been passed down from early times and regarded as true. It may be said to 1 symbolically the origin of the basic elements and assumptions of a culture. Mythic narratives frequently revolve around the doings of gods or heroes, and may relate, for example, how the world began, how humans and animals came into being, or how certain customs, gestures, or forms of human activities 2. Almost all cultures possess or at one time possessed and lived in terms of myths. 1 immerse belittle portray contradict 2 originated adorned handicapped entwined 9999

Plot of Theoretical Text Complexity versus Empirical Text Complexity for 446 passages

What could account for the 10% unexplained variance? Missing Variables or Theory misspecification Better Criterion Variable Improved Proxies/Operationalizations Expanded Error Model – Treat Item Type as Random Rounding Error Imperfections in Theory Implementation Sarah Kershaw’s (FCRR) dissertation will check the most promising text variables not in the LF equation. Very large text corpora (Google 500 billion) will permit very accurate frequency measures for 10,000s of words and word families. Also empirical Lexile word difficulties will be available in the next five years. The task continuum promises new machine generated task types for measuring text complexity and reader ability. MM is bringing on-line a new analyzer that does not round text measures. Theory implementation compromises. Front to back vs. back to front text measurement gives us a way to estimate effects of theory implementation.

Closing No matter how it is sliced and diced, analyses of joint and conditional probability distributions yield no more than patterns of association. Nothing in the response data nor Rasch analyses of these data exposes the processes (features of the object of measurement) or mechanisms (features of the instrument) that are hypothesized to be conjointly causal on the measurement outcomes. In my view the agenda for a sixth decade of Rasch measurement practice should have at the top a focus on Causal Rasch Models.

Contact Info: A. Jackson Stenner CEO, MetaMetrics jstenner@Lexile.com