Slide 2- 1. Chapter 1-2 Data – A Case Study in Discrimination – 5 W’s Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

AP STUDY SESSION 2.
1
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
David Burdett May 11, 2004 Package Binding for WS CDL.
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
Create an Application Title 1Y - Youth Chapter 5.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
Chapter 7 Sampling and Sampling Distributions
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
Break Time Remaining 10:00.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
Bright Futures Guidelines Priorities and Screening Tables
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Bellwork Do the following problem on a ½ sheet of paper and turn in.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
Copyright © 2013, 2009, 2006 Pearson Education, Inc. 1 Section 5.5 Dividing Polynomials Copyright © 2013, 2009, 2006 Pearson Education, Inc. 1.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
Adding Up In Chunks.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Analyzing Genes and Genomes
Speak Up for Safety Dr. Susan Strauss Harassment & Bullying Consultant November 9, 2012.
Essential Cell Biology
Chapter 8 Estimation Understandable Statistics Ninth Edition
Clock will move after 1 minute
PSSA Preparation.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.
The W’s of Data. Data  Does have to be numbers?  It can be doesn’t have to be.  Without context, it’s useless!  Consider 17, 21, 44, and 76  Are.
Economic Reasoning Using Statistics Econ 138 Dr. Adrienne Ohler.
  Data can be numbers, record names, or other labels.  Not all data represented by numbers are numerical data (e.g., 1 = male, 2 = female or zip codes.
Copyright © 2010 Pearson Education, Inc. Chapter 2 Data.
Chapter 2 Data. Slide 2- 2 What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers are numerical data (e.g.,
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 1, Slide 1 Chapter 01 Stats Starts Here.
Chapter 2 Data.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Data.
AP Statistics Tuesday, 25 August 2015 OBJECTIVE TSW learn (1) the reasons for studying statistics, and (2) vocabulary. FORM DUE (only if it is signed)
Slide 2-1 Copyright © 2004 Pearson Education, Inc. What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers.
Unit 2 Descriptive Statistics Objective: To correctly identify and display sets of data.
1. Is data always recorded as numbers? 2. What do you organize data into for it to be easily read? 3. What is a categorical variable? 4. What is a quantitative.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 2- 1.
Chapter 2 The 6 W’s & H. The “W’s”  To provide context we need the W’s  Who - REQUIRED  What (and in what units) - REQUIRED  When  Where  Why –
Copyright © 2009 Pearson Education, Inc. Chapter 2 Data.
Statistics 2 Data. What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers are numerical data (e.g., 1.
Probability & Statistics
August 25, 2015 Please turn in any forms or assignments from yesterday. Take out a sheet of paper and something to write with for notes.
Chapters 1 and 2 What Is (Are?) Statistics?
Chapter 1,2 Stats Starts Here.
Chapter 01 Stats Starts Here.
Chapter 01 Stats Starts Here.
Chapter 2 Data Copyright © 2010 Pearson Education, Inc.
Chapter 2 Data Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Chapter 1 Stats Starts Here.
Chapter 01 Stats Starts Here.
Chapter 1 Stats Starts Here.
Chapter 2 Data Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Presentation transcript:

Slide 2- 1

Chapter 1-2 Data – A Case Study in Discrimination – 5 W’s Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

What is Statistics? Statistics is the collection, organization, presentation, summarization, and interpretation of data. ▫We have two branches of Statistics  Descriptive – the stuff you describe like mean, median, and mode…graphs too  Inferential – the stuff you don’t know about yet…performing a test of a hypothesis Statistics is not all about calculation and formulas! You need to write about data… Slide 2- 3

What Are Data? Data can be numbers, record names, or other labels. Not all data represented by numbers are numerical data (e.g., 1=male, 2=female). Data are useless without their context… Slide 2- 4

The “W’s” To provide context we need the W’s ▫Who ▫What (and in what units) ▫When ▫Where ▫Why (if possible) ▫and How of the data. Note: the answers to “who” and “what” are essential. Slide 2- 5

Data Tables The Westvaco data table clearly shows the context of the data presented: ▫Ordinal ▫Nominal ▫Interval ▫Rational Notice that this data table tells us the What (column) and Who (row) for these data. Slide 2- 6 RowJob TitlePayBirth MonthBirth Year Hire MonthHire YearRIFAgeJob Status 1Engineering ClerkH Engineering Tech IIH Engineering Tech IIH Secretary to Engin ManagH Engineering Tech IIH Engineering Tech IIH Engineering Tech IIH Parts Crib AttendantH Engineering Tech IIH Engineering Tech IIH Technical SecretaryH Engineering Tech IIH Engineering Tech IIH Engineering Tech IIH Customer Serv EngineerS Customer Serv Engr AssocS Design EngineerS Design EngineerS Design EngineerS Design EngineerS Engineering AssistantS Engineering AssociateS Engineering ManagerS Machine DesignerS Packaging EngineerS Prod Spec PrintingS Proj Eng ElecS Project EngineerS Project EngineerS Project EngineerS Supv Engineering ServS Supv Machine ShopS ChemistS Design EngineerS Engineering AssociateS Machine DesignerS Machine Parts Cont SupvS Prod SpecialistS Project EngineerS ChemistS Design EngineerS Electrical EngineerS Machine DesignerS Machine Parts Cont CoorS VH Prod SpecialistS Printing CoordinatorS Prod Dev EngineerS Prod SpecialistS VH Prod SpecialistS Engineering AssociateS

Who The Who of the data tells us the individual cases for which (or whom) we have collected data. ▫Individuals who answer a survey are called respondents. ▫People on whom we experiment are called subjects or participants. ▫Animals, plants, and inanimate subjects are called experimental units. Slide 2- 7

Who (cont.) Sometimes people just refer to data values as observations and are not clear about the Who. ▫But we need to know the Who of the data so we can learn what the data say. Slide 2- 8

What and Why Variables are characteristics recorded about each individual. The variables should have a name that identify What has been measured. To understand variables, you must Think about what you want to know. Slide 2- 9

What and Why (cont.) Some variables have units that tell how each value has been measured and tell the scale of the measurement. Others are presented with the data or in a key. Slide Pay: H = hourly, S = salaried. RIF (Reduction in Force): 0 = not chosen for layoff, 1 = chosen for layoff in Round 1, 2 = chosen for layoff in Round 2, and so on. Job Status: 0 = not chosen for layoff, 1 = laid off

What and Why (cont.) A categorical (or qualitative) variable names categories and answers questions about how cases fall into those categories. ▫Categorical examples: sex, race, ethnicity A quantitative variable is a measured variable (with units) that answers questions about the quantity of what is being measured. ▫Quantitative examples: income ($), height (inches), weight (pounds) Slide 2- 11

What and Why (cont.) The questions we ask a variable (the Why of our analysis) shape what we think about and how we treat the variable. Each variable has a level of measurement: ▫Nominal – name or label only ▫Ordinal – number or letter ranking in order ▫Interval – value that can be scaled (no true 0) ▫Rational – value that can be scaled without limit (true 0) Slide 2- 12

What and Why (cont.) Example: In a student evaluation of instruction at a large university, one question asks students to evaluate the statement “The instructor was generally interested in teaching” on the following scale: 1 = Disagree Strongly; 2 = Disagree; 3 = Neutral; 4 = Agree; 5 = Agree Strongly. Question: Is interest in teaching categorical or quantitative? What is the level of measurement? Slide 2- 13

What and Why (cont.) Question: Is interest in teaching categorical or quantitative? We sense an order to these ratings, but there are no natural units for the variable interest in teaching. Variables like interest in teaching are often called ordinal variables. ▫With an ordinal variable, look at the Why of the study to decide whether to treat it as categorical or quantitative. Slide 2- 14

Counts Count When we count the cases in each category of a categorical variable, the counts are not the data, but something we summarize about the data. ▫The category labels are the What, and ▫the individuals counted are the Who. Slide 2- 15

Counts Count (cont.) When we focus on the amount of something, we use counts differently. For example, Amazon might track the growth in the number of teenage customers each month to forecast song downloads (the Why). ▫The What is teens, the Who is months, and the units are number of teenage customers. Slide 2- 16

Identifying Identifiers Identifier variables are categorical variables with exactly one individual in each category. ▫Examples: Social Security Number, ISBN, FedEx Tracking Number Don’t be tempted to analyze identifier variables. They are really nominal measurements. Be careful not to consider all variables with one case per category, like year, as identifier variables. ▫The Why will help you decide how to treat identifier variables. Slide 2- 17

Where, When, and How We need the Who, What, and Why to analyze data. But, the more we know, the more we understand. When and Where give us some nice information about the context. ▫Example: Values recorded at a large public university may mean something different than similar values recorded at a small private college. Slide 2- 18

Where, When, and How (cont.) How the data are collected can make the difference between insight and nonsense. ▫Example: results from Internet surveys are often useless as are results from a call in radio talk show…(why?) The first step of any data analysis should be to examine the W’s—this is a key part of the Think step of any analysis. And, make sure that you know the Why, Who, and What before you proceed with your analysis. Slide 2- 19

Wait…What? Don’t label a variable as categorical or quantitative without thinking about the question you want it to answer. Just because your variable’s values are numbers, don’t assume that it’s quantitative. Always be skeptical—don’t take data for granted. Slide 2- 20

You should know that… Data are information in a context. ▫The W’s help with context. ▫We must know the Who (cases), What (variables), and Why to be able to say anything useful about the data. Before you make any general statements about your data set, think about the context and use that in a statement of evidence. Slide 2- 21

You should know that… We treat variables as categorical or quantitative. ▫Categorical variables identify a category for each case. ▫Quantitative variables record measurements or amounts of something and must have units. ▫Some variables can be treated as categorical or quantitative depending on what we want to learn from them. Slide 2- 22