The Role of Statistics and the Data Analysis Process

Slides:



Advertisements
Similar presentations
Introduction to Statistics
Advertisements

Chapter 3 Graphic Methods for Describing Data. 2 Basic Terms  A frequency distribution for categorical data is a table that displays the possible categories.
AP Statistics Tuesday, 26 August 2014 OBJECTIVE TSW learn (1) the reasons for studying statistics, and (2) vocabulary. FORM DUE (only if it is signed)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Chapter 1 A First Look at Statistics and Data Collection.
The Role of statistics and the data analysis process
Chapter 1 & 3.
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Organization and description of data
© 2002 Thomson / South-Western Slide 1-1 Chapter 1 Introduction to Statistics with Excel.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 The Role of Statistics & The Data Analysis Process.
Objective To understand measures of central tendency and use them to analyze data.
Chapter 4 Displaying Quantitative Data. Graphs for Quantitative Data.
Section 1.1, Slide 1 Copyright © 2014, 2010, 2007 Pearson Education, Inc. Section 14.1, Slide 1 14 Descriptive Statistics What a Data Set Tells Us.
Quantitative Data Essential Statistics. Quantitative Data O Review O Quantitative data is any data that produces a measurement or amount of something.
Chapter 1 The Role of Statistics. Three Reasons to Study Statistics 1.Being an informed “Information Consumer” Extract information from charts and graphs.
Chapter 1.4. Variable: any characteristic whose value may change from one individual to another Data: observations on single variable or simultaneously.
Analyze the following graph!. Cummulative Relative Frequency Plot 4 A Frequency is the number of times a given datum occurs in a data set 4 A Relative.
Chapters 1 and 2 Week 1, Monday. Chapter 1: Stats Starts Here What is Statistics? “Statistics is a way of reasoning, along with a collection of tools.
The Role of Statistics Sexual Discrimination Problem A large company had to downsize and fire 10 employees. Of these 10 employees, 5 were women. However,
Section 2. Descriptive statistics  the methods of organizing & summarizing data Create a graph If the sample of high school GPAs contained 10,000 numbers,
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 3 Graphical Methods for Describing Data.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Section 1.2: The Nature and Role of Variability. Definition Statistics – The science of collecting, analyzing, and drawing conclusions from data.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 The Role of Statistics This is officially the most boring PowerPoint presentation.
Dr. Fowler AFM Unit 8-1 Organizing & Visualizing Data Organize data in a frequency table. Visualizing data in a bar chart, and stem and leaf display.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 1 Section 1 – Slide 1 of 20 Chapter 1 Section 1 Introduction to the Practice of Statistics.
Graphical Methods for Describing Data
Statistics the science of collecting, analyzing, and drawing conclusions from data.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data.
Chapter 1 Statistics by Mohamed ELhusseiny
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
Basics of Statistics. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
AP Statistics Wednesday, 09 September 2015 OBJECTIVE TSW review for Friday’s test over Chapter 1 – 3.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 1 – Slide 1 of 20 Chapter 1 Section 1 Introduction to the Practice.
Chapters 1 & 3 Graphical Methods for Describing Data.
Section 1.1, Slide 1 Copyright © 2014, 2010, 2007 Pearson Education, Inc. Section 14.1, Slide 1 14 Descriptive Statistics What a Data Set Tells Us.
Organizing and Visualizing Data © 2010 Pearson Education, Inc. All rights reserved.Section 15.1, Slide
Chapter 1: Getting Started Section 1: Essential question: What is statistics?
Unit 1: Statistics and Statistical Thinking Statistics is the science of data Statistics involves collecting, classifying, summarizing, organizing, analyzing.
Chapter 0: Why Study Statistics? Chapter 1: An Introduction to Statistics and Statistical Inference 1
What is Statistics?. Statistics 4 Working with data 4 Collecting, analyzing, drawing conclusions.
Unit 1 - Graphs and Distributions. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
The rise of statistics Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain understanding from.
Basics of Statistics.
Section 1.4: Types of Data and Some Simple Graphical Displays
Chapter 2: Methods for Describing Data Sets
The Role of Statistics and the Data Analysis Process
The Role of Statistics & The Data Analysis Process
Basics of Statistics.
Chapter 1 & 3.
The Role of Statistics and the Data Analysis Process
Distributions and Graphical Representations
Unit 1 - Graphs and Distributions
Basics of Statistics.
Common Core Math I Unit 6 One-Variable Statistics Introduction
Common Core Math I Unit 6 One-Variable Statistics Introduction
Statistics the science of collecting, analyzing, and drawing conclusions from data.
Objectives (IPS chapter 1.1)
Common Core Math I Unit 6 One-Variable Statistics Introduction
Overview of Statistics
Warm-Up: Believe It or Not?
CHAPTER 1 Exploring Data
The Role of Statistics and the Data Analysis Process
The Role of Statistics and the Data Analysis Process
The Role of Statistics and the Data Analysis Process
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Presentation transcript:

The Role of Statistics and the Data Analysis Process Chapter 1 The Role of Statistics and the Data Analysis Process

What is statistics? the science of collecting, analyzing, and drawing conclusions from data

Why should one study statistics? Can dogs help patients with heart failure by reducing stress and anxiety? To be informed . . . Extract information from tables, charts and graphs Follow numerical arguments Understand the basics of how data should be gathered, summarized, and analyzed to draw statistical conclusions Examples come from page 2 & 3. When people take a vacation do they really leave work behind?

Why should one study statistics? (continued) Many companies now require drug screening as a condition of employment. With these screening tests there is a risk of a false-positive reading. Is the risk of a false result acceptable? To make informed judgments To evaluate decisions that affect your life If you choose a particular major, what are your chances of finding a job when you graduate? Examples come from page 2 & 3.

What is variability? Suppose you went into a convenience store to purchase a soft drink. Does every can on the shelf contain exactly 12 ounces? NO – there may be a little more or less in the various cans due to the variability that is inherent in the filling process. In fact, variability is almost universal! It is variability that makes life interesting!! Discuss the fact that variability exist in almost everything – provide several examples.

If the Shoe Fits ... The two histograms to the right display the distribution of heights of gymnasts and the distribution of heights of female basketball players. Which is which? Why? Heights – Figure A See Example 1.1 for more explanation. Heights – Figure B

If the Shoe Fits ... Suppose you found a pair of size 6 shoes left outside the locker room. Which team would you go to first to find the owner of the shoes? Why? Suppose a tall woman (5 ft 11 in) tells you see is looking for her sister who is practicing with a gym. To which team would you send her? Why? Center & spread

The Data Analysis Process Understand the nature of the problem Decide what to measure and how to measure it Collect data Summarize data and perform preliminary analysis Perform formal analysis Interpret results It is important to have a clear direction before gathering data. It is important to select and apply the appropriate inferential statistical methods It is important to carefully define the variables to be studied and to develop appropriate methods for determining their values. This step often leads to the formulation of new research questions. It is important to understand how data is collected because the type of analysis that is appropriate depends on how the data was collected! This initial analysis provides insight into important characteristics of the data.

What term would be used to describe “all high school graduates”? Suppose we wanted to know the average GPA of high school graduates in the nation this year. We could collect data from all high schools in the nation. population What term would be used to describe “all high school graduates”?

What do you call it when you collect data about the entire population? The entire collection of individuals or objects about which information is desired A census is performed to gather about the entire population What do you call it when you collect data about the entire population?

We could collect data from all high schools in the nation. GPA Continued: Suppose we wanted to know the average GPA of high school graduates in the nation this year. We could collect data from all high schools in the nation. Why might we not want to use a census here? Discuss some problems associated with performing a census: Takes a lot of time Inaccurate data Missing data costly If we didn’t perform a census, what would we do?

Sample A subset of the population, selected for study in some prescribed manner What would a sample of all high school graduates across the nation look like? High school graduates from each state (region), ethnicity, gender, etc.

Once we have collected the data, what would we do with it? GPA Continued: Suppose we wanted to know the average GPA of high school graduates in the nation this year. We could collect data from a sample of high schools in the nation. Once we have collected the data, what would we do with it? Organize it – graph & make some calculations etc.

Descriptive statistics the methods of organizing & summarizing data If the sample of high school GPAs contained 1,000 numbers, how could the data be organized or summarized? Create a graph State the range of GPAs Calculate the average GPA

Could we use the data from our sample to answer this question? GPA Continued: Suppose we wanted to know the average GPA of high school graduates in the nation this year. We could collect data from a sample of high schools in the nation. Organize it – graph & make some calculations etc. Could we use the data from our sample to answer this question?

Inferential statistics involves making generalizations from a sample to a population Based on the sample, if the average GPA for high school graduates was 3.0, what generalization could be made? The average national GPA for this year’s high school graduate is approximately 3.0. Could someone claim that the average GPA for graduates in your local school district is 3.0? Be sure to sample from the population of interest!! No. Generalizations based on the results of a sample can only be made back to the population from which the sample came from.

The number of wrecks per week at the intersection outside school? Variable any characteristic whose value may change from one individual to another Suppose we wanted to know the average GPA of high school graduates in the nation this year. Define the variable of interest. Is this a variable . . . The number of wrecks per week at the intersection outside school? Give several examples of different variables The variable of interest is the GPA of high school graduates YES

Data The values for a variable from individual observations 0, 1, 2, … For this variable . . . The number of wrecks per week at the intersection outside . . . What could observations be? 0, 1, 2, …

Two types of variables categorical numerical discrete continuous

Categorical variables Qualitative Identifies basic differentiating characteristics of the population Can you name any categorical variables?

Can you name any numerical variables? quantitative observations or measurements take on numerical values makes sense to average these values two types - discrete & continuous Can you name any numerical variables?

Discrete (numerical) Isolated points along a number line usually counts of items

Continuous (numerical) Variable that can be any value in a given interval usually measurements of something

Identify the following variables: the color of cars in the teacher’s lot the number of calculators owned by students at your school the zip code of an individual the amount of time it takes students to drive to school the appraised value of homes in your city Categorical Discrete numerical Categorical Is money a measurement or a count? Continuous numerical discrete numerical

Classifying variables by the number of variables in a data set Suppose that the PE coach records the height of each student in his class. Univariate - data that describes a single characteristic of the population This is an example of a univariate data

Classifying variables by the number of variables in a data set Suppose that the PE coach records the height and weight of each student in his class. Bivariate - data that describes two characteristics of the population This is an example of a bivariate data

Classifying variables by the number of variables in a data set Suppose that the PE coach records the height, weight, number of sit-ups, and number of push-ups for each student in his class. Multivariate - data that describes more than two characteristics (beyond the scope of this course) This is an example of a multivariate data

Bar Chart When to Use Categorical data How to construct Draw a horizontal line; write the categories or labels below the line at regularly spaced intervals Draw a vertical line; label the scale using frequency or relative frequency Place equal-width rectangular bars above each category label with a height determined by its frequency or relative frequency

Bar Chart (continued) What to Look For Frequently or infrequently occurring categories Collect the following data and then display the data in a bar chart: What is your favorite ice cream flavor? Vanilla, chocolate, strawberry, or other

Dotplot How to construct When to Use Small numerical data sets Draw a horizontal line and mark it with an appropriate numerical scale Locate each value in the data set along the scale and represent it by a dot. If there are two are more observations with the same value, stack the dots vertically

Dotplot (continued) What to Look For The representative or typical value The extent to which the data values spread out The nature of the distribution along the number line The presence of unusual values Collect the following data and then display the data in a dotplot: How many body piercings do you have?