Presentation is loading. Please wait.

Presentation is loading. Please wait.

Elementary Statistics Professor K. Leppel. Introduction and Data Collection.

Similar presentations


Presentation on theme: "Elementary Statistics Professor K. Leppel. Introduction and Data Collection."— Presentation transcript:

1 Elementary Statistics Professor K. Leppel

2 Introduction and Data Collection

3 Definitions Population: All observations of interest in a given context Sample: A subset of a population

4 Example 1 Suppose you are the president of Widener University. Population: All Widener students. Sample: All Widener students taking classes in the School of Business Administration.

5 Example 2 Suppose you are the head of the Economics Dept. Population : All Widener students taking Economics classes. Sample : All Widener students taking Professor Leppel’s classes.

6 More Definitions Population parameter or parameter: numerical characteristics of a population Sample statistics: numerical characteristics of a sample

7 Deductive vs. Inductive Reasoning Deductive: population sample general specific Probability Example: Suppose you have a bowl with 2 red marbles & 3 green ones. If you pick one, what is the probability that the marble is green?

8 Deductive vs. Inductive Reasoning Inductive: sample population specific general Statistics Example: If you take a poll & note the voting preferences of this sample, we will be able to draw some conclusions about the votes of the population.

9 Sampling with & without Replacement Sampling without replacement: once an element of a population has been selected as part of a sample, it cannot be selected again. Sampling with replacement: an element of a population that has been selected as part of a sample can be selected again.

10 Random sampling vs. non-random sampling Random sampling or probability sampling: sampling in which the probability of inclusion of each element in the population is known. Non-random sampling or judgment sampling: sampling in which judgment is exercised in deciding which elements of population to include in the sample.

11 Simple Random Sample A sample of n elements is a simple random sample if sampling is performed such that every combination of n elements has an equal chance of being the sample selected.

12 Two Types of Studies 1. Observational or comparative studies The analyst examines historical relationships among variables of interest. Problem: Deriving cause & effect relationships from historical data is difficult because important environmental factors are generally not controlled & not stable.

13 2. Direct experimentation or controlled studies The investigator directly manipulates factors that affect a variable of interest.

14 Control Group To understand the effect of a “treatment,” we need to compare a group that received a treatment with a group that received no treatment. The “no-treatment” group is the control group.

15 Two types of errors 1. Systematic errors or bias: These errors cause measurement to be incorrect in some systematic way. They are caused by inaccuracies or deficiencies in the measuring instrument. Systematic errors persist even when the sample size is increased.

16 These errors arise from a large number of uncontrolled factors - chance. Random errors decrease on average as the sample size is increased. 2. Random error or sampling error:

17 Some of the variables with which you will work are qualitative and some are quantitative. Qualitative variables are categorical and can be subdivided into nominal and ordinal measures. Quantitative variables are numerical and can be subdivided into interval and ratio measures.

18 Qualitative (categorical) variables that are nominal have no order to them. Example 1: U.S. citizenship (yes, no) Example 2: On what continent were you born? (N. America, S. America, Africa, Antarctica, Asia, Australia, Europe) Sex (male, female) is sometimes considered as a nominal variable. However, if you take into consideration intersex individuals, who can have any of a variety of anatomical conditions that don’t fit the typical definitions of female or male, you no longer have a simple nominal measure.

19 Qualitative (categorical) variables that are ordinal have an implied ranking of a characteristic. Example 1: student class (freshman, sophomore, junior, senior) Example 2: customer service satisfaction (very dissatisfied, somewhat dissatisfied, neither satisfied nor dissatisfied, somewhat satisfied, very satisfied)

20 Example 1: Intelligence A person with an IQ of 150 is much more intelligent than a person with an IQ of 100, while a person with an IQ of 140 is somewhat more intelligent than a person with an IQ of 125. However, what would an IQ of 0 mean? And a person with an IQ of 200 is not twice as smart as one with an IQ of 100. Example 2: SAT scores They are measured on an ordered scale in which the difference between measurements is meaningful. However, there is no true zero point where there is none of a specific characteristic. Also, if the measure is twice as large, that does not imply that there is twice as much of the characteristic. Switching to quantitative (numerical) variables, interval variables are a bit tricky.

21 Example 1: Income A person with zero income has no earnings or other source of money. And someone who has income of $100,000 has twice as much money coming in as someone who has income of $50,000. Example 2: Age Quantitative variables (numerical) that are ratio variables have true zero points and ratios work in the expected way.


Download ppt "Elementary Statistics Professor K. Leppel. Introduction and Data Collection."

Similar presentations


Ads by Google