 # Chapter 1: The Nature of Statistics

## Presentation on theme: "Chapter 1: The Nature of Statistics"— Presentation transcript:

Chapter 1: The Nature of Statistics
Basic Practice of Statistics - 3rd Edition Discovering Statistics 2nd Edition Daniel T. Larose Chapter 1: The Nature of Statistics Lecture PowerPoint Slides Chapter 5

Chapter 1 Overview 1.1 Data Stories: The People Behind the Numbers
1.2 An Introduction to Statistics 1.3 Gathering Data

The Big Picture Where we are coming from and where we are headed… Chapter 1 introduces the basic ideas of the field of statistics and the methods for gathering data. In Chapter 2 we will learn to summarize the data we have gathered using graphs and tables.

1.1: Data Stories: The People Behind the Numbers
Objectives: Realize that behind each data set lies a story about real people undergoing real-life experiences.

1.1: Data Stories: The People Behind the Numbers
Behind every data set lies a story about the lives of real people.

1.2: An Introduction to Statistics
Objectives: Describe what the field of statistics is. State the meaning of descriptive statistics. Understand that inferential statistics refers to learning about a population by studying a sample from that population.

What is Statistics? Consider this survey from USA Today.
Statistics are numbers that describe a group of people or things. Consider this survey from USA Today. How did the pollsters arrive at these figures? Are the figures accurate? Could they be inaccurate? Is the difference between men and women meaningful, or could it be due to random chance? The Field of Statistics The field of statistics is the art and science of: collecting, analyzing, presenting, and interpreting data

Case Study Does Friday the 13th change human behavior? The following graph displays traffic through a motorway junction on Friday the 6th (green) and on the following Friday, the 13th (yellow). Discuss the collection, analysis, presentation, and interpretation of these data.

Descriptive Statistics
Descriptive statistics refers to the methods for summarizing and organizing the information in a data set. In descriptive statistics, we use numbers, graphs, and tables to describe the data set as a first step in data analysis. Elements, Variables, and Observations An element is a specific entity about which information is collected. A variable is a characteristic of an element that can assume different values for different elements. An observation is the set of values of the variables for a given element.

Descriptive Statistics
Variables Variable―any characteristic of an element Qualitative Variable A variable that may be classified into categories. These variables are also called “categorical variables.” Quantitative Variable A variable that takes on numerical values upon which arithmetic operations may be meaningfully performed. Continuous Variable Can take infinitely many values, forming an interval on the number line. Discrete Variable Can take either a finite or a countable number of values.

Descriptive Statistics
Levels of Measurement Data may be classified according to the following four levels of measurement. Nominal data consists of names, labels, or categories. Ordinal data can be arranged in a particular order. Interval data are similar to ordinal data, with the property that subtraction may be carried out on interval data. Ratio data are similar to interval data, with the property that division may be carried out on ratio data.

Inferential Statistics
Statistical inference consists of methods for estimating and drawing conclusions about population characteristics based on information contained in a subset (sample) of that population. Populations, Parameters, Samples, and Statistics A population is the collection of all elements of interest in a particular study. A parameter is a characteristic of a population. A sample is a subset of the population from which information is collected. A statistic is a characteristic of a sample.

Inferential Statistics
Population Sample Collect data from a Sample. Make an Inference about the Population. Perform Data Analysis.

1.3: Gathering Data Objectives:
Explain what a random sample is, and why we need one. Identify systematic sampling, stratified sampling, cluster sampling, and convenience sampling. Explain selection bias and good questionnaire design. Understand the differences between an observational study and an experiment.

Random Sampling We can use the information gathered from a sample to generalize about the population. However, if we get a “bad” sample, the information gleaned from it will be misleading. The statistician’s remedy is to allow impersonal chance to choose the sample. A sample chosen by chance rules out both favoritism by the sampler and self-selection by respondents. Random Sample A random sample (also known as a simple random sample) is a sample for which every element has an equal chance of being selected.

More Sampling Methods Systematic Sampling
Each element of the population is numbered. The sample is obtained by selecting every kth element where k is some whole number. Stratified Sampling Divide the population into subgroups, or strata, according to some characteristic, such as race or gender. A random sample is then taken from each stratum. Cluster Sampling Divide the population into clusters. Select several clusters at random. All elements within the chosen clusters are selected for the sample. Convenience Sampling Subjects are chosen based on what is convenient for the survey personnel. This method usually does not result in a representative sample.

Selection Bias Some common pitfalls in the design and implementation of a survey include selection bias and the wording of a questionnaire. Target Population, Potential Population, and Selection Bias The target population is the complete collection of all elements that we are interested in studying. The potential population is the collection of elements from the target population that had a chance of being sampled. Selection bias occurs when the population from which the actual sample is drawn is not representative of the target population, due to an inappropriate sampling method.

Good Questionnaire Design
The wording of questions can greatly affect the responses. Here are several factors to consider when designing a questionnaire. Five Factors for Good Questionnaire Design Remember: simplicity and clarity When reporting results, include the actual questions asked. Avoid leading questions. Avoid asking two questions in one. Avoid vague terminology.

Experimental Studies Researchers can gather data by using survey or sampling methods. However, these methods may not always obtain the information required. In this case, an experimental study may be in order. Experimental Studies In an experimental study, researchers investigate how varying the predictor variable affects the response variable. A predictor variable (also called an explanatory variable) is a characteristic intended to explain differences in the response variable. A predictor variable that takes the form of a purposeful intervention is called a treatment. A response variable is an outcome, a characteristic of the subjects of the experiment presumably brought about by differences in the predictor variable or treatment. The subjects in a statistical study represent the elements from which the data are drawn.

Experimental Studies There are three main factors that should be considered when designing an experimental study. Control A control group is necessary to compare against the treatment group. In some experiments, members of the control group receive a placebo, a nonfunctioning simulated treatment. Randomization Many biases can be introduced into an experiment. To eliminate biases, it is necessary to place subjects into the treatment and control groups randomly. Replication Larger samples are usually better, because they allow for more precise inference. The treatment and control groups each must contain a large enough number of subjects to allow detection of meaningful differences.

Observational Studies
There are circumstances where it is either impossible, impractical, or unethical for the researcher to place subjects into treatment and control groups. In an observational study, the researcher observes whether the subjects’ differences in the predictor variable are associated with differences in the response variable. No attempt to manipulate the variables is made. A sample survey is an example of an observational study. Random Sample A random sample (also known as a simple random sample) is a sample for which every element has an equal chance of being selected.

Chapter 1 Overview 1.1 Data Stories: The People Behind the Numbers
1.2 An Introduction to Statistics 1.3 Gathering Data

Download ppt "Chapter 1: The Nature of Statistics"

Similar presentations