1.2: Statistical Thinking

Slides:



Advertisements
Similar presentations
Section 1-2 Data Classification
Advertisements

Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Math 2260 Introduction to Probability and Statistics
1.2: The Nature of Data Objective: To understand the different types of data CHS Statistics.
Basic Ideas of Statistics Unit 1.1 Basic Ideas of Statistics Corresponds to Chapter 1 in Triola.
Elementary Statistics MOREHEAD STATE UNIVERSITY
Chapter 1 A First Look at Statistics and Data Collection.
Introduction to Statistics and Research
East Los Angeles College Math 227 – Statistics Fall 2008
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Elementary Statistics Picturing the World
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
Quantitative vs. Categorical Data
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Probability & Statistics
Chapter 1 Introduction and Data Collection
Essentials of Statistics 3rd edition
What is Statistics? Chapter GOALS 1. Understand why we study statistics. 2. Explain what is meant by descriptive statistics and inferential statistics.
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
Introduction to Statistics What is Statistics? : Statistics is the sciences of conducting studies to collect, organize, summarize, analyze, and draw conclusions.
Chapter 2: Data CHS Statistics
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Elementary Statistics M A R I O F. T R I O L A Copyright © 1998, Triola, Elementary.
Introduction to Probability and Statistics Consultation time: Ms. Chong.
1.What is this graph trying to tell you? 2.Do you see anything misleading, unclear, etc.? 3.What is done well?
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin What is Statistics? Chapter 1.
Eng.Mosab I. Tabash Applied Statistics. Eng.Mosab I. Tabash Session 1 : Lesson 1 IntroductiontoStatisticsIntroductiontoStatistics.
Probability & Statistics – Bell Ringer  Make a list of all the possible places where you encounter probability or statistics in your everyday life. 1.
The Nature of Probability and Statistics
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin What is Statistics Chapter 1.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
1 Chapter 1. Section 1-1 and 1-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Section 1.1 Statistics Statistics :
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Basic Business Statistics
Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of Experiments.
MATH Elementary Statistics. Salary – Company A.
What is Statistics Chapter 1.
Types of data. Parameter vs. Statistic Parameter: Measured characteristic of a population Statistic: Measured characteristic of a sample Examples: Which.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Overview and Types of Data
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 2: Levels of Measurement. Researchers classify variables according to the extent to which the values of the variable measure the intended characteristics.
Introduction To Statistics
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Biostatistics Introduction Article for Review.
Do Now  47 TCNJ students were asked to complete a survey on campus clubs and activities. 87% of the students surveyed participate in campus clubs and.
2 NURS/HSCI 597 NURSING RESEARCH & DATA ANALYSIS GEORGE MASON UNIVERSITY.
Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of Experiments.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Introductions Syllabus Introductions B-day Problem.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Introductions Syllabus Introductions B-day Problem.
What is Statistics Chapter 1.
Introduction to Statistics
Elementary Statistics
Week one Introduction to Statistics Chs 221 Dr. wajed Hatamleh
What is Statistics Chapter 1.
Chapter(1) The Nature of Probability and Statistics
Elementary Statistics
statistics Specific number
Statistical Techniques in Business & Economics
Statistics Chapter 1 Sections
statistics Specific number
The Nature of Probability and Statistics
What is Statistics Chapter 1.
Chapter 1 Introduction to Statistics
Statistical Techniques in Business & Economics
Statistical Techniques in Business & Economics
Business Statistics For Contemporary Decision Making 9th Edition
Presentation transcript:

1.2: Statistical Thinking Basic Practice of Statistics - 3rd Edition Math 1231 - Fall 2013 Math 1231: Spring 2015 Chapter 1, Part One Course Introduction 1.2: Statistical Thinking 1.3: Types of Data Chapter 1 Chapter 1, Part 1 1

Recent Real-World Statistics Math 1231 - Fall 2013 Recent Real-World Statistics 74% of “online adults” use a social networking site of some kind. Among 18-29 year olds, this rises to 89% [Pew Internet Project, Jan. 2014]. U.S. Adults living with at least one child under age 6 spent an average of 2 hrs/day providing primary care and 5.4 hrs/day on secondary care. [Bureau of Labor Statistics, June 2014]. In 2014, 43% of American adults self-identified as political independents [Gallup, Jan. 2015]. Chapter 1, Part 1

Links for Previous Examples Math 1231 - Fall 2013 Links for Previous Examples BLS American Time Use Survey: http://www.bls.gov/news.release/atus.nr0.htm Pew Social Media User Demographics: http://www.pewinternet.org/data-trend/social-media/social-media-user-demographics/ Gallup Political Identification: http://www.gallup.com/poll/180440/new-record-political-independents.aspx Chapter 1, Part 1

Some Interesting Questions… Math 1231 - Fall 2013 Some Interesting Questions… Gallup’s political poll was based on phone interviews conducted during 2014. Were you contacted during this time? Gallup interviewed “only” 16,479 American adults, out of more than 230 million total! Question: Is Gallup justified in making such a claim about all American adults, using information from such a small percentage? Chapter 1, Part 1

Math 1231 - Fall 2013 A Typical Problem The previous scenarios illustrate a central theme in real-world statistics: We have a VERY LARGE set of individuals (all American adults, for example). It is EFFECTIVELY IMPOSSIBLE to gather information from every single individual. Using information from a relatively small number of individuals, we can draw some conclusions about the VERY LARGE set. Chapter 1, Part 1

Statistical Thinking Section 1.2 Math 1231 - Fall 2013 Chapter 1, Part 1

*** Basic Terminology *** Math 1231 - Fall 2013 *** Basic Terminology *** Population: The complete set of individuals that we intend to study (Gallup Poll: All American adults—more than 230 million individuals!). Sample: The set of individuals for which we have obtained actual data. The Sample is a subset of the Population. (Gallup: 16,479 adults, contacted by telephone during 2014). Data: Specific information about each individual (Gallup: Each individual’s political affiliation). Assume we have data only from those in the sample. Chapter 1, Part 1

Population vs. Sample (not to scale!!) Math 1231 - Fall 2013 Population vs. Sample (not to scale!!) Chapter 1, Part 1

Descriptive / Summary Statistics Math 1231 - Fall 2013 Descriptive / Summary Statistics Goal: Take a set of data and organize or summarize it in a useful way. Example: Your current GPA summarizes your academic performance, without needing to see your entire transcript. Is this a fair/accurate summary? It doesn’t matter, many people will use it anyway. We will look at several different graphical and numerical summaries over the next few classes. Chapter 1, Part 1

Statistical Inference Math 1231 - Fall 2013 Statistical Inference Goal: Use (known) data from a sample in order to draw conclusions about the entire population. Since we don’t have data for the entire population, these conclusions ALWAYS have some degree of uncertainty. More carefully stated, Gallup concluded this: Based on sample data from 16,479 individuals, we claim that the percentage of American adults who self-identified as “independent” in 2014 was between 42% and 44%. There is a 95% chance this claim is correct. Chapter 1, Part 1

Statistical Inference Math 1231 - Fall 2013 Statistical Inference Gallup’s conclusion is based on data from a very small percentage, less than 0.01%, of the intended population. It may surprise you that such a small amount of data can be used to make a conclusion about a large population. We’ll discuss the underlying methods (and how to properly interpret these results) later in the course. For now: It is MORE IMPORTANT that the sample data are collected in an “appropriate” way, otherwise our methods will give potentially inaccurate results. Chapter 1, Part 1

How to think about statistical data and results Math 1231 - Fall 2013 How to think about statistical data and results Chapter 1, Part 1

Data: Important Considerations Math 1231 - Fall 2013 Data: Important Considerations Context: What do the data represent? The same numbers can have completely different meanings/interpretations: GRADE A B C D F W No. of Students 3 7 6 5 2 4 Which context is more interesting/relevant?? Day M T W R F S Pieces of Mail 3 7 6 5 2 4 Chapter 1, Part 1

Data: Important Considerations Math 1231 - Fall 2013 Data: Important Considerations Source: Where do the data come from? Who gathered the data? Who summarized or analyzed the data? Who sponsored or funded the research? Are those responsible for collecting/analyzing the data reliable? Is there any incentive to distort results and/or favor a particular type of result? Ask for examples where these might affect the results. Chapter 1, Part 1

Data: Important Considerations Math 1231 - Fall 2013 Data: Important Considerations Sampling Method: What process was used to choose the sample and collect data? Was sample selection limited to individuals who volunteered to provide data? Was sample selection limited to individuals who were convenient? Was data collection based on subjective judgment or ambiguous terminology? Example: Do you spend a lot of time studying? How/why might these affect the results? Chapter 1, Part 1

Important Considerations Math 1231 - Fall 2013 Important Considerations Conclusions: What are the results of the statistical analysis/inference? What is the intended population? Are the results valid for the entire population? Can you restate results in a way that can be understood by someone with no little or no knowledge of statistical terminology? Is there a cause-and-effect relationship, or merely a statistical relationship (“Correlation does not imply causality”—see Chapter 10). Most of these issues will be discussed later in the course. Chapter 1, Part 1

Some Other Considerations Math 1231 - Fall 2013 Some Other Considerations Practical Implications: Are the conclusions useful or relevant in a real-world context? A “Statistically Significant” claim comes from analyzing the data using numerical methods, without any context (see the next slide). “Practically Significant” means useful or relevant to the real world. These are not necessarily the same thing! Chapter 1, Part 1

*** Statistical Significance *** Math 1231 - Fall 2013 *** Statistical Significance *** When doing statistical inference, there is always some degree of randomness in how we gather the sample data. If we wind up with results that are unlikely to occur by random chance, we say the results are statistically significant. Simple Example: How likely is it that a fair coin would come up heads in 95/100 flips? Chapter 1, Part 1

Example: Class Attendance Math 1231 - Fall 2013 Example: Class Attendance In analyzing grade data among Math 1231 students from previous semesters, the average course grade for students with “many” absences was 15 points (out of 100) less than the average for students with “few or no” absences. My Claim: Students with “many” absences tend to have lower course grades than students with “few or no” absences. Is this statistically and/or practically significant? Is frequent absence the cause of lower grades? Chapter 1, Part 1

Math 1231 - Fall 2013 Types of Data Section 1.3 Chapter 1, Part 1

*** Parameters vs. Statistics *** Math 1231 - Fall 2013 *** Parameters vs. Statistics *** The goal of statistical inference is to use sample data to draw conclusions about some VERY LARGE population. A parameter is a numerical value describing some aspect of the population. A statistic is a numerical value describing some aspect of a sample. The value of a statistic (computed from sample data) can be used to estimate the value of a parameter (almost always unknown). Chapter 1, Part 1

Parameters vs. Statistics Math 1231 - Fall 2013 Parameters vs. Statistics Example: I want to estimate the average height of all students currently in class. I choose four students “at random” and compute the average height for those four. Average class height: This is a parameter, its value is unknown to us (the population is the entire class). Average height for the group of four: This is a statistic (the sample consists of these four students). Question for later: Is it reasonable to claim that the sample average is “close” to the population average, based on our sample? Chapter 1, Part 1

Quantitative vs. Categorical Math 1231 - Fall 2013 Quantitative vs. Categorical Quantitative data consist of number that represent counts or measurements. All quantitative data is numerical, but not all numerical data is quantitative. Data with a unit of measurement (seconds, feet, pounds, dollars, etc.) is quantitative. Numerical data used as a label or range of values (Student ID Number, 20-25 years) is not quantitative. Chapter 1, Part 1

Examples: Quantitative Data Math 1231 - Fall 2013 Examples: Quantitative Data The University keeps the following quantitative data about each student. Grade Point Average Number of Credit Hours Completed Age Amount of money owed for tuition Other examples? Chapter 1, Part 1

Math 1231 - Fall 2013 Categorical Data Data that are not quantitative are called categorical. Non-numerical data must be categorical. Numerical data that serves to label or identify individuals are categorical (Example: Social Security Number). A useful guide: Would it make sense to consider an average value? If not, treat the data as categorical. Chapter 1, Part 1

Examples: Categorical Data Math 1231 - Fall 2013 Examples: Categorical Data The University keeps the following categorical data about each student: Name Laker ID Number Date of Birth Gender Residency (“in-state” or “out-of-state”) Other? Chapter 1, Part 1

Discrete vs. Continuous Math 1231 - Fall 2013 Discrete vs. Continuous Quantitative (number) data can be classified as: Discrete: Finitely many possible values, or infinitely many values with clearly-defined “next” and “previous” values. Discrete values can be put into a list. Continuous: Infinitely many values anywhere in a given range/interval, with no holes or gaps. A useful guide: Is it theoretically possible to make your measurements more accurate/precise? If so, then you probably have continuous data. Chapter 1, Part 1

Examples: Discrete or Continuous? Math 1231 - Fall 2013 Examples: Discrete or Continuous? Number of siblings. Amount of time it takes to run one mile. Resting pulse rate (beats per minute). Distance you live from this building. Grade point average. Credit card balance (in dollars/cents). Note: The answers may depend on how the data are measured and/or used. Chapter 1, Part 1

Math 1231 - Fall 2013 Levels of Measurement An alternate way to classify data, based on what can be done to summarize/analyze it. There is some debate on how many levels are needed; these four are commonly used: Nominal (qualitative) Ordinal (ordering is meaningful) Interval (differences are meaningful) Ratio (ratios are meaningful) Chapter 1, Part 1

Nominal Level Consists of names, labels, or well-defined categories. There is no meaningful way to order values (alphabetical is often used). Colors (Red, Green, Yellow, etc.) Gender (Female, Male) Party Affiliation (Democrat, Republican, Other) State of Residence Nominal data is always categorical.

Ordinal Level Data can be arranged in some meaningful order, but differences between values cannot be computed or are useless. Course Grades (A, B, C, D, F) Competitive Rankings (Gold > Silver > Bronze, but “Gold minus Silver” is useless, even if we represent these as numbers 1, 2, 3). Ordinal data is often categorical (notable exceptions are IQ and Body Mass Index).

Interval Level Numerical values that can be put in order, and the difference between two values has some useful meaning. However, there is no “natural zero” level and ratios do not have any practical meaning. Examples: Temperature (Fahrenheit or Celsius): 15 is colder than 30, but zero degrees does not mean an absence of temperature (unless you use Kelvin). Calendar Data: Aug. 7th < Aug. 21st, with a difference of 14 days (but the 21st is not “three times” the 7th). Interval data is the least common of the four levels.

Math 1231 - Fall 2013 Ratio Level Numerical values that can be put in order, the difference between two values has meaning, and there is a natural, non-arbitary “zero level.” Ratio data measures “amount of stuff.” The zero level means that “no stuff” is present. Distance, amount of time, mass/weight, many other physical quantities. Price, Checking account balance, many other monetary quantities. If “twice as much” or “half as much” make sense, then you have ratio data. Ratio data is always quantitative. Chapter 1, Part 1

Examples Classify the following data (about students): Age Math 1231 - Fall 2013 Examples Classify the following data (about students): Age Year of birth Academic major Weight Transfer student? (yes/no) Currently seated in which row? SAT score Chapter 1, Part 1

Examples Answers to the previous slide: Math 1231 - Fall 2013 Examples Answers to the previous slide: Age: Quantitative, Discrete(?), Ratio Year of birth: Quantitative(?), Discrete, Interval Academic major: Categorical, Nominal Weight: Quantitative, Continuous, Ratio Transfer student?: Categorical, Nominal Current row?: Categorical, Ordinal SAT score: Quantitative, Discrete, Ordinal(?) 08/15/11 Chapter 1, Part 1