1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.

Slides:



Advertisements
Similar presentations
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
Advertisements

Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 1 An Introduction to Business Statistics.
Introduction to Statistics
Chapter 1 Getting Started Understandable Statistics Ninth Edition
Elementary Statistics MOREHEAD STATE UNIVERSITY
Determining the Sample Plan
The Where, Why, and How of Data Collection
QBM117 Business Statistics Statistical Inference Sampling 1.
Chapter 1 The Where, Why, and How of Data Collection
Chapter 1 The Where, Why, and How of Data Collection
AP Statistics Chapter 5 Notes.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Who and How And How to Mess It up
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 1-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 1 The Where, Why, and How of Data Collection
Chapter 1 The Where, Why, and How of Data Collection
2007 會計資訊系統計學 ( 一 ) 上課投影片 5-1 Data Collection and Sampling Chapter 5.
213Sampling.pdf When one is attempting to study the variable of a population, whether the variable is qualitative or quantitative, there are two methods.
Sampling Methods.
Variables and Measurement (2.1) Variable - Characteristic that takes on varying levels among subjects –Qualitative - Levels are unordered categories (referred.
Sampling Moazzam Ali.
PowerPoint Presentation Package to Accompany:
SESSION 5 & 6 Last Update 23 rd February 2011 Introduction to Statistics.
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 1-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 1 Getting Started
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Chapter 1: Introduction to Statistics
C M Clarke-Hill1 Collecting Quantitative Data Samples Surveys Pitfalls etc... Research Methods.
Data Collection.
Chapter 1 Introduction and Data Collection
© The McGraw-Hill Companies, Inc., by Marc M. Triola & Mario F. Triola SLIDES PREPARED BY LLOYD R. JAISINGH MOREHEAD STATE UNIVERSITY MOREHEAD.
© Copyright McGraw-Hill CHAPTER 1 The Nature of Probability and Statistics.
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Elementary Statistics M A R I O F. T R I O L A Copyright © 1998, Triola, Elementary.
Chapter 1 DATA AND PROBLEM SOLVING. Section 1.1 GETTING STARTED.
Statistics, Data, and Statistical Thinking
Chapter 1: The Nature of Statistics
Surveys and Questionnaires Government agencies, news organizations, and marketing companies often conduct surveys. The results can be factual or subjective.
Data Collection and Sampling
AN INTRODUCTION DATA COLLECTION AND TERMS POSTGRADUATE METHODOLOGY COURSE.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Introduction Biostatistics Analysis: Lecture 1 Definitions and Data Collection.
Ch.1 INTRODUCTION TO STATISTICS Prepared by: M.S Nurzaman, MIDEc. ( deden )‏ (021) /
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 1-1 Statistics for Managers Using Microsoft ® Excel 4 th Edition Chapter.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
Chapter Five Data Collection and Sampling Sir Naseer Shahzada.
Unit 1 – Intro to Statistics Terminology Sampling and Bias Experimental versus Observational Studies Experimental Design.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Statistics Lecture Notes Dr. Halil İbrahim CEBECİ Chapter 04 Data Collection and Sampling.
A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. Chap 1-1 A Course In Business Statistics 4 th Edition Chapter 1 The Where, Why, and How.
1 Data Collection and Sampling Chapter Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results.
Chap 1-1 Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population.
1-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Statistics Terminology. What is statistics? The science of conducting studies to collect, organize, summarize, analyze, and draw conclusions from data.
AC 1.2 present the survey methodology and sampling frame used
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Keller: Stats for Mgmt & Econ, 7th Ed
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
Data Collection and Sampling
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
Keller: Stats for Mgmt & Econ, 7th Ed Data Collection and Sampling
Presentation transcript:

1 Introduction to Statistics

2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information

Important Terminology * You can find these terms in your text (p. 91 – 100) 3

4 Population – is a set of all items of interest and can be infinitely large. E.g. Every person who watches television.

5 Sample – is a set of data drawn from a population. It must be representative of the population of interest and be randomly drawn. E.g. The small number of actual viewers on which Nielsen ratings are based.

6 Data are the observed values of a variable and can be measurements or observations. Examples: Student marks {67, 74, 71, 83, 93, 55, 48} Popular car colours {silver, black, white, red} Datum – singular form of the word data.

7 Variable The quantity being measured (ie. marks, car colours) Population Sample Variable

8 Types of Data 1. Qualitative data – are categorical observations that can be measured on the nominal or ordinal scale.  Nominal scale data – generated results are names or categories with no implied order. (e.g. yes or no)  Ordinal scale data – data that falls into distinct categories that can be ranked in some order. (e.g. good, fair, poor)

9 2. Quantitative (Interval) Data – are numerical observations measured on the interval scale (real numbers). Many types of calculations can be performed on such data. This data can be:  Discrete – numerical responses arise from a counting process.  Continuous – numerical responses arise from a measuring process.

10 E.g. Representing Student Grades… Categorical? Data Interval Data e.g. {0..100} Nominal Data e.g. {Pass | Fail} Ordinal Data e.g. {F, D, C, B, A} N Ordered? Y Y N Categorical Data Rank order to data NO rank order to data

Day 1 Ends Here 11

12 Data Collection and Sampling

13 Sources of Data Published data, which can be either primary data (collected and published by the same organization, such as Statistics Canada), or secondary data (published by an organization that did not collect it). Observational studies – obtained by direct observation. Experimental studies – controls are established and participants are randomly selected.

14 Surveys… A survey solicits information from people; e.g. Gallup polls; pre-election polls; marketing surveys. The Response Rate (i.e. the proportion of all people selected who complete the survey) is a key survey parameter. Surveys may be administered in a variety of ways, e.g.  Personal Interview,  Telephone Interview, and  Self Administered Questionnaire.

15 Questionnaire Design… Over the years, a lot of thought has been put into the science of the design of survey questions. Key design principles: 1. Keep the questionnaire as short as possible. 2. Ask short, simple, and clearly worded questions. 3. Start with demographic questions to help respondents get started comfortably. 4. Use dichotomous (yes|no) and multiple choice questions. 5. Use open-ended questions cautiously. 6. Avoid using leading-questions. 7. Pretest a questionnaire on a small number of people. 8. Think about the way you intend to use the collected data when preparing the questionnaire.

16 Sampling It is a way of gathering information. The target population, for which inferences are to be made, must be the same as the sampled population. Samples must be made in a pattern similar to the characteristic make-up of the target population.

17 Sampling Plans A sampling plan is a method or procedure for specifying how a sample will be taken from a population. Three methods:  Simple Random Sampling,  Stratified Random Sampling, and  Cluster Sampling.

18 Simple Random Sampling… A simple random sample is a sample selected in such a way that every possible sample of the same size is equally likely to be chosen. Drawing three names from a hat containing all the names of the students in the class is an example of a simple random sample: any group of three names is as equally likely as picking any other group of three names.

19 Stratified Random Sampling… A stratified random sample is obtained by separating the population into mutually exclusive sets, or strata, and then drawing simple random samples from each stratum. Strata 1 : Gender Male Female Strata 2 : Age < > 60 Strata 3 : Occupation professional clerical blue collar other We can acquire information about the total population, make inferences within a stratum or make comparisons across strata.

20 Stratified Random Sampling… After the population has been stratified, we can use simple random sampling to generate the complete sample: If we only have sufficient resources to sample 400 people total, we would draw 100 of them from the low income group… …if we are sampling 1000 people, we’d draw 50 of them from the high income group.

21 Cluster Sampling… A cluster sample is a simple random sample of groups or clusters of elements (vs. a simple random sample of individual objects). All elements of the cluster are then included in the sample. This method is useful when it is difficult or costly to develop a complete list of the population members or when the population elements are widely dispersed geographically. Cluster sampling may increase sampling error due to similarities among cluster members.

Day 2 Ends Here 22

23 Sample Size… Numerical techniques for determining sample sizes will be described later, but suffice it to say that the larger the sample size is, the more accurate we can expect the sample estimates to be.

24 Errors in Sampling Techniques 1. Sampling error refers to differences between the sample and the population that exist only because of the observations that happened to be selected for the sample. It is expected to occur. E.g. Two samples of size 10 of 1,000 households. If we happened to get the highest income level data points in our first sample and all the lowest income levels in the second, this delta is due to sampling error. Note: Increasing the sample size will reduce this type of error.

25 Errors in Data Acquisition These errors arise from the recording of incorrect responses, due to: ● incorrect measurements being taken because of faulty equipment, ● mistakes made during transcription from primary sources, ● inaccurate recording of data due to misinterpretation of terms, or ● inaccurate responses to questions concerning sensitive issues.

26 Non-Response Bias Refers to error (or bias) introduced when responses are not obtained from some members of the sample, i.e. the sample observations that are collected may not be representative of the target population. Recall that the Response Rate (i.e. the proportion of all people selected who complete the survey) is a key survey parameter and helps in the understanding the validity of the survey and sources of nonresponse error.

27 Sampling Bias Occurs when the sampling frame is such that some members of the target population cannot possibly be selected for inclusion in the sample. E.g. Randomly selecting names from a phone book would exclude individuals without a phone or whose number is not listed.

Measurement Bias Occurs when the data-collection method consistently either under- or over-estimates a characteristic of the population Results from a data-collection process that affects the variable it is measuring (ie. You can tell by the way the question is worded what most respondents will answer. There are often leading questions or loaded questions.) 28

Response Bias Occurs when participants in a survey deliberately give false or misleading answers to influence results unduly or they may be afraid or embarrassed to answer sensitive questions honestly 29