Definitions Population: A collection, or set, of individuals, objects, or events whose properties are to be analyzed. Sample: A subset of the population.

Slides:



Advertisements
Similar presentations
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Advertisements

Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Data Analysis Statistics. Inferential statistics.
1 Economics 240A Power One. 2 Outline w Course Organization w Course Overview w Resources for Studying.
Chapter Goals After completing this chapter, you should be able to:
Independent t-Test CJ 526 Statistical Analysis in Criminal Justice.
What is Statistics Section 1.1, Page 4.
Introduction to Educational Statistics
Estimates and sample sizes Chapter 6 Prof. Felix Apfaltrer Office:N763 Phone: Office hours: Tue, Thu 10am-11:30.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Chapter 2 Simple Comparative Experiments
Inferences About Process Quality
Data Analysis Statistics. Inferential statistics.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Chi-Square and Analysis of Variance (ANOVA)
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 24 Statistical Inference: Conclusion.
Chapter 1: Introduction to Statistics
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Single-Sample T-Test Quantitative Methods in HPELS 440:210.
Lesson Comparing Two Means.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Introduction to Statistical Inferences Inference means making a statement about a population based on an analysis of a random sample taken from the population.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
1 Objective Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Problems Objective 1.1, Page 181. Problems Objective 1.1, Page 192.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Multinomial Experiments Goodness of Fit Tests We have just seen an example of comparing two proportions. For that analysis, we used the normal distribution.
Probability and Statistics Dr. Saeid Moloudzadeh Fundamental Concepts 1 Contents Descriptive Statistics Axioms of Probability Combinatorial.
Inference about Mean (σ Unknown) When σ is known, the sampling distribution for a sample mean is normal if conditions are satisfied. For many years, it.
Copyright © 2004 Pearson Education, Inc.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
What is Statistics 1Section 1.1, Page 5. Definition: Statistics Statistics: The science of collecting, describing and interpreting data. Why Study Statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Inference about Mean (σ Unknown) When σ is known, the sampling distribution for a sample mean is normal if conditions are satisfied. For many years, it.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
Dependent and Independent Samples 1Section 10.1, Page 208.
Independent t-Test CJ 526 Statistical Analysis in Criminal Justice.
CHI SQUARE TESTS.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
11.2 Tests Using Contingency Tables When data can be tabulated in table form in terms of frequencies, several types of hypotheses can be tested by using.
© Copyright McGraw-Hill 2004
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Inferences Concerning Variances
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
Statistics Vocabulary. 1. STATISTICS Definition The study of collecting, organizing, and interpreting data Example Statistics are used to determine car.
Appendix I A Refresher on some Statistical Terms and Tests.
The F-Distribution Illustrative Problem
Lecture Slides Elementary Statistics Twelfth Edition
statistics Specific number
Lesson Comparing Two Means.
statistics Specific number
The Nature of Probability and Statistics
What are their purposes? What kinds?
Parametric versus Nonparametric (Chi-square)
Presentation transcript:

Definitions Population: A collection, or set, of individuals, objects, or events whose properties are to be analyzed. Sample: A subset of the population. We desire knowledge about an entire population but is most often the case that it is prohibitively expensive, so we select representative sample from the population and study the individual items in the sample. Descriptive Statistics: The collection, presentation, and description of of the sample data. Inferential Statistics: The technique of of interpreting the values resulting from the descriptive techniques and making decisions and drawing conclusions about the population. 1Section 1.1, Page 4

Definitions Parameter: A numerical value summarizing all the data of a population. For example, the average high school grade point of all Shoreline Students is We often use Greek letters to identify parameters, μ = Statistic: A numerical value summarizing the sample data. For example, the average grade point of a sample of Shoreline Students is We would use the symbol, The statistic corresponds to the parameter. We usually don’t know the value of the parameter, so we take a sample and estimate it with the corresponding statistic. Sampling Variation: While the parameter of a population is considered a fixed number, the corresponding statistic will vary from sample to sample. Also, different populations give rise to more or less sampling variability. Considering the variable age, samples of 60 students from a Community college would have less variability than samples of a Seattle neighborhood. 2Section 1.1, Page 4

Variables Variable: A characteristic of interest about each element of a population. Data: The set of values collected for the variable from each of the elements that belong to the sample. Variability: The extent to which data values for a particular variable differ from each other. Numerical or Quantitative Variable: A variable that quantifies an element of the population. The HS grade point of a student is a numerical variable. Numerical variables are numbers for which math operations make sense. The average grade point of a sample makes sense. Continuous Numerical Variable: The variable can take on take on an uncountable number of values between to points on the number line. An example is the weight of people. Discrete Numerical Variable: The variable can take on a countable number of values between two points on a number line. An example is the price of statistics text books. 3Section 1.1, Page 8

Variables (2) Section 1.1, Page 84 Categorical or Qualitative Variable: A variable that describes or categorizes an element of a population. The gender of a person would be a categorical variable. The categories are male and female. Nominal Categorical Variable: A categorical variable that uses a number to describe or name an element of a population. An example is a telephone area code. It is a number, but not a numerical variable used on math operations. The average area code does not make sense. Ordinal Categorical Variable: A categorical variable that incorporates an ordered position or ranking. An example would be a survey response that ranks “very satisfied” ahead of “satisfied” ahead of “somewhat satisfied.” Limited math operations may be done with ordinal variables.

Problems Problems, Page 195

Problems Section 1.3, Page 206

Observational Studies and Experiments Section 1.3, Page 127 Observational Study: Researchers collect data without modifying the environment or controlling the process being observed. Surveys and polls are observational studies. Observational studies cannot establish causality. Example: For a randomly selected high school researchers collect data on each student, grade point and whether the student has music training, to see if there is a relationship between the two variables. Experiments: Researchers collect data in a controlled environment. The investigator controls or modifies the environment and observes the effect of a variable under study. Experiments can establish causality. Example: Randomly divide a sample of people with migraine headaches into a control and treatment groups. Give the treatment group a experimental medication and the control group a placebo, and then measure and compare the reduction of frequency and severity of headaches for both groups.

Single-Stage Sampling Methods Section 1.3, Page 138 Single-stage sampling: A sample design in which the elements of the sampling frame treated equally and there is no subdividing or partitioning of the frame. Simple Random Sample: Sample selected in such a way that every element of the population has an equal probability of being selected and all samples of size n have an equal probability of being selected. Example: Select a simple random sample of 6 students from from a class of Number the students from 1 to 30 on the roster. 2.Get 6 non-recurring random numbers between 1 and The six students who match the six random numbers are the sample.

Multistage Sampling Designs Section 1.3, Page 159 Multistage Sampling: A sample design in which the elements of the sampling frame are subdivided and the sample is chosen in more than one stage. Stratified Random Sampling: A sample is selected by stratifying the population, or sampling frame, and then selecting a number of items from each of the strata by means of a simple random sampling technique. The strata are usually subgroups of the sampling frame that are homogeneous but different from each other. Example: Select a sample of six students from a class of 30 so that the sample contains an equal number of males and females. 1.List the males and females separately 2.Take a simple random sample of 3 students from each group. 3.The six students selected are the sample.

Multi-Stage Sampling Designs Section 1.3, Page 1610 Cluster Sample: A sample obtained stratifying the population, or sampling frame, and then selecting some or all of the items from some, but not all of the strata. The strata are usually easily identified subgroups of the sampling frame that are similar to each other. This is often the most economical way to sample a large population. Example: Take a sample of 300 Catholics in the Seattle Area. 1. Get a list of the Catholic Parishes in the Seattle area. 2. Take a random sample of 3 parishes. 3. In each parish, select a simple random sample of 100 parishioners.

Problem a.Find the mean, variance, and standard deviation. b.Find the 5-number summary. c.Make a box and whisker display and label the numbers. d.Calculate the Interquartile range and the range e.Describe the shape of the distribution 11Problems, Page 50

Summary of Probability Formulas Equally Likely Outcomes: P(A) = n(A)/n Complement: P(A) = 1- P(not A); P(not A) =1- P(A) General Addition Rule: P(A or B) = P(A) + P(B) – P(A and B) If A and B are disjoint, P(A and B) = 0 Then the Special Addition Rule: Then P(A or B) = P(A) + P(B) General Multiplication Rule: P(A and B) = P(A)×P(B|A) If A and B are independent, P(B|A) = P(B) Then the Special Multiplication Rule: P(A and B) = P(A)×P(B) Odds If the odds for A are a:b, then the odds against A are b:a. The probability of A is a/(a+b). The probability of not A is b/(b+a) 12Chapter 4

Problems 13Problems, Page 95

Problem 14Problems, Page 95

Problems 15Problems, Page 97

Problems 16Problems, Page 99

Z Score Problems 17Problems, Page 52

Problems 18Problems, Page 132

Problems 19Problems, Page IQ scores are normally distributed with a mean of 100 and a standard deviation of 16. Find the following: a.The 66 th percentile. b.The 80 th percentile. c.The minimum score required to be in the top 10%. d.The minimum score to be in the top 25% Find the two z-scores that bound the middle 30% of the standard normal distribution.

Problems 20Problems, Page 149

Problems 21Problems, Page 151

Problems 22Problems, Page 50

Problems 23Problems, Page 179 the standard deviation is 5 seconds.

Problems Test the claim that the BMI of the cardiovascular technologists is different than the BMI of the general population. Use α =.05. Assume the population of the BMI of the cardiovascular technologists is normal. a.State the necessary hypotheses. b.Is the sampling distribution normal. Why? c.Find the p-value. d.State your conclusion. e.If you made an error, what type of error did you make? 24Problems Page 181

Problems 25Problems, Page 179

Problems a.Find the 98% confidence interval. b.Find the critical value c.Find the margin of error. d.Find the standard error. e.What assumption must we make about the the population to have a t-sampling distribution. f.What are the proper words to describe the confidence interval? g.If you wanted to have a margin of error of one minute and the 98% confidence interval for this data, how large must the sample be? 26Problems, Page 205

Problems a.Find the p-value. b.State your conclusion. c.What is the name of the probability model used for the sampling distribution d.What is the mean of the sampling distribution? e.What is the value of the standard error? f.If your conclusion is in error, what type of error is it? 27Problems, Page 205

Problems 28Problems, Page 208

Problems a.Check the conditions for a normal sampling distribution. b.State the hypotheses. c.Find the p-value. d.State your conclusion e.If you make an error in your conclusion, what type is it? f.Find the mean of the sampling distribution. g.Find the standard error of the sampling distribution. 29Problems, Page 207

Dependent and Independent Samples 30Section 10.1, Page 208

Problems a.Test the hypotheses that the people increased their knowledge. Use α=.05 and assume normality. State the appropriate hypotheses. b.Find the p-value and state your conclusion. c.Find the 90% confidence interval for the mean estimate of the increase in test scores. 31Problems, Page 231

Problems a.State the hypothesis (Assume Normality) b.Find the p-value, and state you conclusion. c.Find the 95% confidence interval for the difference of the means; Gouda-Brie. d.Find the mean and standard error of the sampling distribution 32Problems, Page 232

Problems a.State the appropriate hypotheses. b.Find the p-value and state your conclusion. c.What model is used for the sampling distribution and what is the mean of the sampling distribution and its standard error? d.Find the 98% confidence interval for the difference in proportions, men – women. 33Problems, Page 234

Summary of Chi-Square Applications Goodness of Fit Test Given one categorical variable with a fixed set of proportions for the categories. Ha: The observed data does not fit the proportions. Calculate expected values (Ho true proportion * total observations) Observed and Expected data in List Editor PRGM: GOODFIT Test for Independence Given two categorical variables measured on the same population. Ha: The variables are not independent (They are related) Observed data in Matrix Editor Stat-Tests-χ2 Test Test for Homogeneity Given one categorical variable and two or more populations. Ha: The proportions for the categories are not the same for for all populations. Observed data in Matrix Editor Stat-Tests-χ2 Test 34Chapter 12, Summary

Chi-Square Distribution Fair Die Example Now we need a sampling distribution for the Χ 2 statistic = 2.2, so we can calculate the probability of getting a Χ 2 ≥ 2.2 when the true proportions are all equal to 1/6. Χ 2 Distribution for 5 df This is a distribution of all possible Χ 2 statistics calculated from all possible samples of 60 observations when there are 6 proportions or cells. Note that the degree of freedom equals the number of proportions – 1. Finding the p-value on the TI-83, Given Χ 2 Stat, df PRGM – CHI2DIST LOWER BOUND: 2.2 UPPER BOUND: 2 ND E99 df: 5 Output: P-VALUE = The null hypothesis cannot be rejected. 35Section 11.2, Page 240

Problems a.Perform a hypotheses test to see if the preferences are not all the same. State the hypotheses. b.Find the p-value and state your conclusion c.What is the name of the model used for the sampling distribution? 36Problems, Page 252

Problems a.Perform a hypotheses test to see if the preferences are not all the same. State the hypotheses. b.Find the p-value and state your conclusion c.What is the name of the model used for the sampling distribution? 37Problems, Page 252

Problems a.Test the hypotheses that the size of community reared in is independent of the size of community residing in. State the appropriate hypotheses. b.Find the p-value and state your conclusion c.What is the name of the sampling distribution? d.What are the necessary conditions, and are they satisfied? What is the value of the smallest expected cell? 38Section 11.3, Page 254

The F-Distribution 39Sec 10.5, Page 226 Each sample must be from a normal distribution 4.

Problem 40Problems, Page 234 Set up the problem so that the the F-Stat >1. a.State the necessary hypotheses. b.Find the p-value and state your conclusion. c.What is the name of the model used for the sampling distribution?

Problems 41Sec 12.1, Page 268 a.State the necessary hypotheses. b.Sketch the side-by-side box plots. Does it appear that the means are all the same? c.Find the p-value and state your conclusion. d.What is the name of the model used for the sampling distribution?

Problems Problems, Page Sample Size Sample Mean Sample St. Dev. Atlanta Boston Dallas Philadelphia Seattle St. Louis a.Test the hypotheses that not all the mean commute times are all the same. State the appropriate hypothesis. b.Find the p-value and state your conclusion. c.What is the name of the sampling distribution? d.What is the F-Statistic, the df numerator and df denominator?

Problems Problems, Page Sample Size Sample Mean Sample St. Dev. Atlanta Boston Dallas Philadelphia Seattle St. Louis a.Test the hypotheses that not all the mean commute times are all the same. State the appropriate hypothesis. b.Find the p-value and state your conclusion. c.What is the name of the sampling distribution? d.What is the F-Statistic, the df numerator and df denominator?