Chapter 2 Data.

Slides:



Advertisements
Similar presentations
AP Statistics Chapter 2. The Five Ws *Who -are the cases *What -are the variables ( gives information about each of the cases) *Why -helps us to decide.
Advertisements

SECTION 1 CHAPTER 1. DATA What is Statistics? The science of collecting, organizing, and interpreting numerical facts, which we call data Data (def.)
The W’s of Data. Data  Does have to be numbers?  It can be doesn’t have to be.  Without context, it’s useless!  Consider 17, 21, 44, and 76  Are.
Economic Reasoning Using Statistics Econ 138 Dr. Adrienne Ohler.
1.2: The Nature of Data Objective: To understand the different types of data CHS Statistics.
  Data can be numbers, record names, or other labels.  Not all data represented by numbers are numerical data (e.g., 1 = male, 2 = female or zip codes.
Chapter 1 A First Look at Statistics and Data Collection.
In this chapter, we introduce the ideas of what statistics and data are all about.
Chapter 1 The Where, Why, and How of Data Collection
Chapter 2: Data by Alyssa Webb. ● Data can come in many different types but it useless without it’s context. ● Not all data represented by numbers is.
AP STATISTICS CHAPTER 2 Amanda Hua Sophie Ming. UNDERSTANDING AND EXPLORING DATA.
Copyright © 2010 Pearson Education, Inc. Chapter 2 Data.
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
Copyright © 2012 Pearson Education. All rights reserved. Chapter 2 Data.
Chapter 2 Data. Slide 2- 2 What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers are numerical data (e.g.,
Introduction to Statistics What is Statistics? : Statistics is the sciences of conducting studies to collect, organize, summarize, analyze, and draw conclusions.
Chapter 2: Data CHS Statistics
CHAPTER 1 STATISTICS Statistics is a way of reasoning, along with a collection of tools and methods, designed to help us understand the world.
PADM 582 Quantitative and Qualitative Research Methods Basic Concepts of Statistics Soomi Lee, Ph.D.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 1, Slide 1 Chapter 01 Stats Starts Here.
Probability & Statistics – Bell Ringer  Make a list of all the possible places where you encounter probability or statistics in your everyday life. 1.
Introduction Biostatistics Analysis: Lecture 1 Definitions and Data Collection.
 Share one observation on the board  Turn in paragraph of your observations  Reading Quiz ( You are ALWAYS allowed to take notes and use on reading.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Data.
Chapter 1 Exploring Data 1.1 Displaying Distributions with Graphs.
Chapter 1: Exploring Data
AP Statistics Tuesday, 25 August 2015 OBJECTIVE TSW learn (1) the reasons for studying statistics, and (2) vocabulary. FORM DUE (only if it is signed)
Statistics and Data. inference Sample (Data) statistics probability sampling Population Ch 2, Ch 4, Ch 6 Ch 5, Ch 9, Ch 21 Ch 7, Ch 8 Ch 3, Ch 23 Ch
Slide 2-1 Copyright © 2004 Pearson Education, Inc. What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers.
Unit 2 Descriptive Statistics Objective: To correctly identify and display sets of data.
Chapter 2: Levels of Measurement. Researchers classify variables according to the extent to which the values of the variable measure the intended characteristics.
1. Is data always recorded as numbers? 2. What do you organize data into for it to be easily read? 3. What is a categorical variable? 4. What is a quantitative.
Chapter 2: Data Abdullah Syed Fengkai Liao 1 st Period A.P. Statistics.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 2- 1.
Chapter 2 The 6 W’s & H. The “W’s”  To provide context we need the W’s  Who - REQUIRED  What (and in what units) - REQUIRED  When  Where  Why –
Chapter 2 Data. Statistics – the science of collecting, analyzing, and drawing conclusions from data.
Chapter 2 Data. Learning Objectives 1.Define Data. 2.Identify populations and samples. 3.Identify the cases and variables in any data set. 4.Know the.
Copyright © 2009 Pearson Education, Inc. Chapter 2 Data.
1 Copyright © 2014, 2012, 2009 Pearson Education, Inc. Chapter 1 Stats Starts Here.
Statistics 2 Data. What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers are numerical data (e.g., 1.
Modular 1. Introduction of the Course Structure and MyLabsPlus.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapters 1 & 2 An Overview of Statistics Classifying Data Critical Thinking 1 Larson/Farber 4th ed.
Probability & Statistics
August 25, 2015 Please turn in any forms or assignments from yesterday. Take out a sheet of paper and something to write with for notes.
Math 125 Stats Starts Here Copyright © 2009 Pearson Education, Inc.
Chapters 1 and 2 What Is (Are?) Statistics?
Intro to Statistics and Data
Chapter 1,2 Stats Starts Here.
Math 153 Stats Starts Here.
Introduction to Statistics
Warm Up.
Honors Statistics Chapter 2
Chapter 01 Stats Starts Here.
Objectives (IPS chapter 1.1)
The Nature of Probability and Statistics
Chapter 01 Stats Starts Here.
Chapter 2 Data Copyright © 2010 Pearson Education, Inc.
Math 153 Stats Starts Here.
Stats Starts Here Copyright © 2009 Pearson Education, Inc.
Chapter 2 The 6 W’s & H.
Chapter 2 Data Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Chapter 1 Stats Starts Here.
Chapter 01 Stats Starts Here.
Lecture One Data Copyright © 2012 Pearson Education. All rights reserved.
Chapter 1 Stats Starts Here.
Chapter 2 Data Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Presentation transcript:

Chapter 2 Data

Objectives: Data Individuals Population Sample Variables Categorical (or qualitative) Quantitative

Data Definition Data: (latin for fact) Characteristics or numbers that are collected by observation. Data are numbers with context. What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers are numerical data (e.g., 1 = male, 2 = female). Data are useless without their context…

The “W’s” To provide context we need the W’s Who What (and in what units) When Where Why (if possible) and How of the data. Note: the answers to “who” and “what” are essential.

Data Tables The following data table clearly shows the context of the data presented: Notice that this data table tells us the What (column) and Who (row) for these data.

Who – Who are the individuals? The first step in understanding data is to answer the W’s Who, What, When, Where, Why, and How Who – Who are the individuals? Individuals: (people, objects, etc.) that we are trying to gain information about. In order to make decisions, we need to know what our population of interest is and whether our data are representative of that population.

Who The Who of the data tells us the individual cases for which (or whom) we have collected data. Individuals who answer a survey are called respondents. People on whom we experiment are called subjects or participants. Animals, plants, and inanimate subjects are called experimental units. Sometimes people just refer to data values as observations and are not clear about the Who. But we need to know the Who of the data so we can learn what the data say.

Definitions Population Sample A complete set of individuals being observed or of interest. This Class This School Broward County USA A subset of the population selected according to some scheme to represent the population. Population – this class Sample – 5 students selected from the class

What and Why What – What variables were recorded about each of the individuals? Variables are characteristics recorded about each individual. The variables should have a name that identify What has been measured. To understand variables, you must Think about what you want to know. EX: Data – Student data base (includes data on each student enrolled). Individuals – students Variables – DOB, Gender, GPA, etc.

What and Why (cont.) Some variables have units that tell how each value has been measured and tell the scale of the measurement.

What and Why (cont.) Two Types of Variables A categorical (or qualitative) variable names categories and answers questions about how cases fall into those categories. Categorical examples: sex, race, ethnicity A quantitative variable is a measured variable (with units) that answers questions about the quantity of what is being measured. Quantitative examples: income ($), height (inches), weight (pounds)

Types of Variables Categorical (qualitative) Quantitative (numerical) Values that fall into separate, nonover-lapping groups such as marital status or hair color. Data that can be counted and put in a specific order. Numerical values are categorical when it makes no sense to find an average for them – zip codes, jersey numbers, etc. Values for which arithmetic operations such as adding and averaging make sense. Data that can be measured. Values that have measurement units such as dollars, degrees, inches, etc.

What and Why (cont.) The questions we ask a variable (the Why of our analysis) shape what we think about and how we treat the variable.

Def: Distribution The pattern of variation of a variable. What values a variable takes and how often it takes these values.

What and Why (cont.) Example: In a student evaluation of instruction at a large university, one question asks students to evaluate the statement “The instructor was generally interested in teaching” on the following scale: 1 = Disagree Strongly; 2 = Disagree; 3 = Neutral; 4 = Agree; 5 = Agree Strongly. Question: Is interest in teaching categorical or quantitative?

What and Why (cont.) Question: Is interest in teaching categorical or quantitative? We sense an order to these ratings, but there are no natural units for the variable interest in teaching. Variables like interest in teaching are often called ordinal variables. With an ordinal variable, look at the Why of the study to decide whether to treat it as categorical or quantitative.

Counts Count When we count the cases in each category of a categorical variable, the counts are not the data, but something we summarize about the data. The category labels are the What, and the individuals counted are the Who.

Counts Count (cont.) When we focus on the amount of something, we use counts differently. For example, Amazon might track the growth in the number of teenage customers each month to forecast CD sales (the Why). The What is teens, the Who is months, and the units are number of teenage customers.

Identifying Identifiers Identifier variables are categorical variables with exactly one individual in each category. Examples: Social Security Number, ISBN, FedEx Tracking Number Don’t be tempted to analyze identifier variables. Be careful not to consider all variables with one case per category, like year, as identifier variables. The Why will help you decide how to treat identifier variables.

Where, When, and How We need the Who, What, and Why to analyze data. But, the more we know, the more we understand. When and Where give us some nice information about the context. Example: Values recorded at a large public university may mean something different than similar values recorded at a small private college.

Where, When, and How (cont.) How the data are collected can make the difference between insight and nonsense. Example: results from Internet surveys are often useless The first step of any data analysis should be to examine the W’s—this is a key part of the Think step of any analysis. And, make sure that you know the Why, Who, and What before you proceed with your analysis.

What Can Go Wrong? Don’t label a variable as categorical or quantitative without thinking about the question you want it to answer. Just because your variable’s values are numbers, don’t assume that it’s quantitative. Always be skeptical—don’t take data for granted.

What have we learned? Data are information in a context. The W’s help with context. We must know the Who (cases), What (variables), and Why to be able to say anything useful about the data.

What have we learned? (cont.) We treat variables as categorical or quantitative. Categorical variables identify a category for each case. Quantitative variables record measurements or amounts of something and must have units. Some variables can be treated as categorical or quantitative depending on what we want to learn from them.

Example #1 A January 2007 Gallup Poll question asked, “In general, do you think things have gotten better or gotten worse in this country in the last 5 years?” Possible answers were “Better”, “Worse”, “No Change”, “Don’t Know”, and “No Response”. What kind of variable is the response? Solution: Mood – Categorical variable.

Your Turn: A medical researcher measures the increase in heart rate of patients under a stress test. What kind of variable is the researcher studying? Solution: Stress – Quantitative variable.

Example #2 For the following description of data, identify the W’s, name the variable, specify for each variable whether its use indicates that it should be treated as categorical or quantitative, and, for any quantitative variable, identify the units in which it was measured. The State Education Department requires local school districts to keep these records on all students: age, race, days absent, current grade level, standardized test scores in reading and math, and any disabilities. Solution: Who – Students What – Age (probably in years), Race, Number of absences, Grade Level, Reading score, Math score, Disabilities. When – Must be kept current. Where – Not specified. Why – State required. How – Information collected and stored as part of school records. Categorical Variables: Race, grade level, disablities Quantitative Variables: Number of absences, age, reading score, math score.

Your Turn: For the following description of data, identify the W’s, name the variable, specify for each variable whether its use indicates that it should be treated as categorical or quantitative, and, for any quantitative variable, identify the units in which it was measured. The Gallup Poll conducted a representative telephone survey of 1180 American voters during the first quarter of 2007. Among the reported results were the voters region (Northeast, South, etc.), age, party affiliation, and whether or not the person had voted in the 2006 midterm congressional election. Solution: Who – 1180 Americans. What – Region, age (in years), political affiliation, and whether or not the person voted. When – First quarter 2007. Where – United States Why – Gallup public opinion poll. How – Telephone survey. Categorical Variables: Region, political affiliation, whether or not the person voted. Quantitative Variable: Age.

Finial Thought on Data

Assignment Chapter 2, pg. 16 – 18; #1, 3, 7 - 17 odd Read Chapter 3, pg. 20-37