Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 - 1 Statistics An Introduction. 1 - 2 Learning Objectives 1.Define Statistics 2.Describe the Uses of Statistics 3.Distinguish Descriptive & Inferential.

Similar presentations


Presentation on theme: "1 - 1 Statistics An Introduction. 1 - 2 Learning Objectives 1.Define Statistics 2.Describe the Uses of Statistics 3.Distinguish Descriptive & Inferential."— Presentation transcript:

1 1 - 1 Statistics An Introduction

2 1 - 2 Learning Objectives 1.Define Statistics 2.Describe the Uses of Statistics 3.Distinguish Descriptive & Inferential Statistics 4. Define Population, Sample, Parameter, & Statistic 5. Identify data types

3 1 - 3 What is Statistics? The practice (science?) of data analysis Summarizing data and drawing inferences about the larger population from which it was drawn

4 1 - 4 Statistical Methods Statistical Methods Descriptive Statistics Inferential Statistics

5 1 - 5 Descriptive Statistics 1.Involves Collecting Data Collecting Data Presenting Data Presenting Data Characterizing Data Characterizing Data 2.Purpose Describe Data Describe Data  X = 30.5 S 2 = 113 0 25 50 Q1Q2Q3Q4 $

6 1 - 6 Inferential Statistics 1.Involves Estimation Estimation Hypothesis Testing Hypothesis Testing 2.Purpose Make Decisions About Population Based on Sample Characteristics Make Decisions About Population Based on Sample Characteristics Population?

7 1 - 7 Key Terms 1.Population (Universe) All Items of Interest All Items of Interest 2.Sample Portion of Population Portion of Population 3.Parameter Summary Measure about Population Summary Measure about Population 4.Statistic Summary Measure about Sample Summary Measure about Sample P in Population & ParameterP in Population & Parameter S in Sample & StatisticS in Sample & Statistic

8 1 - 8 Data Types Quantitative Discrete Discrete Continuous ContinuousQualitative Nominal (categorical) Nominal (categorical) Ordinal (rank ordered categories) Ordinal (rank ordered categories)

9 1 - 9 Sampling Representative sample Same characteristics as the population Same characteristics as the population Random sample Every subset of the population has an equal chance of being selected Every subset of the population has an equal chance of being selected

10 1 - 10 Review Descriptive vs. Inferential Statistics Vocabulary Population Population (Random, representative) sample (Random, representative) sample Parameter Parameter Statistic Statistic Data types

11 1 - 11 Methods for Describing Data

12 1 - 12 Learning Objectives 1.Describe Qualitative Data Graphically 2.Describe Numerical Data Graphically 3.Create & Interpret Graphical Displays 4.Explain Numerical Data Properties 5.Describe Summary Measures 6.Analyze Numerical Data Using Summary Measures

13 1 - 13 Data Presentation

14 1 - 14 Presenting Qualitative Data

15 1 - 15 Data Presentation

16 1 - 16 Student Specializations Specialization | Freq. Percent Cum. ---------------+---------------------------------- HCI | 9 39.13 39.13 HCI | 9 39.13 39.13 IEMP | 9 39.13 78.26 IEMP | 9 39.13 78.26 LIS | 3 13.04 91.30 LIS | 3 13.04 91.30 Undecided | 2 8.70 100.00 Undecided | 2 8.70 100.00---------------+---------------------------------- Total | 23 100.00 Total | 23 100.00

17 1 - 17 Student Specializations

18 1 - 18 Undergrad Majors UG major | Freq. Percent Cum. UG major | Freq. Percent Cum.--------------------------+----------------------------------- American Studies | 1 4.76 4.76 American Studies | 1 4.76 4.76 Cog Sci | 1 4.76 9.52 Cog Sci | 1 4.76 9.52 Comp Sci | 3 14.29 23.81 Comp Sci | 3 14.29 23.81 Economics | 3 14.29 38.10 Economics | 3 14.29 38.10 English | 5 23.81 61.90 English | 5 23.81 61.90 Environmental Engineering | 1 4.76 66.67 Graphic Design | 1 4.76 71.43 Graphic Design | 1 4.76 71.43 Math | 2 9.52 80.95 Math | 2 9.52 80.95 Mechanical Engineering | 1 4.76 85.71 Mechanical Engineering | 1 4.76 85.71 Nutrition | 1 4.76 90.48 Nutrition | 1 4.76 90.48 Sci and Tech Policy | 1 4.76 95.24 Sci and Tech Policy | 1 4.76 95.24 Telecommunications | 1 4.76 100.00 Telecommunications | 1 4.76 100.00--------------------------+----------------------------------- Total | 21 100.00 Total | 21 100.00

19 1 - 19 Favorite Colors color | Freq. Percent Cum. color | Freq. Percent Cum.------------+----------------------------------- black | 2 8.70 8.70 black | 2 8.70 8.70 blue | 12 52.17 60.87 blue | 12 52.17 60.87 green | 1 4.35 65.22 green | 1 4.35 65.22 orange | 1 4.35 69.57 orange | 1 4.35 69.57 purple | 1 4.35 73.91 purple | 1 4.35 73.91 red | 5 21.74 95.65 red | 5 21.74 95.65 white | 1 4.35 100.00 white | 1 4.35 100.00------------+----------------------------------- Total | 23 100.00 Total | 23 100.00

20 1 - 20 Calculus Knowledge integrals | Freq. Percent Cum. integrals | Freq. Percent Cum.------------+----------------------------------- 1 | 3 13.04 13.04 1 | 3 13.04 13.04 2 | 1 4.35 17.39 2 | 1 4.35 17.39 3 | 11 47.83 65.22 3 | 11 47.83 65.22 4 | 6 26.09 91.30 4 | 6 26.09 91.30 5 | 2 8.70 100.00 5 | 2 8.70 100.00------------+----------------------------------- Total | 23 100.00 Total | 23 100.00

21 1 - 21 Presenting Numerical Data

22 1 - 22 Data Presentation

23 1 - 23 Student Age (Reported) Data Stem-and-leaf plot for age 2* | 22233444555777899 2* | 22233444555777899 3* | 01257 3* | 01257 4* | 4* | 5* | 5* | 6* | 6* | 7* | 6 7* | 6

24 1 - 24 Histogram

25 1 - 25 Starting Salaries (in $K) 3* | 8 3* | 8 4* | 000025 4* | 000025 5* | 0000 5* | 0000 6* | 0000005 6* | 0000005 7* | 5 7* | 5 8* | 0 8* | 0

26 1 - 26 Numerical Data Properties

27 1 - 27 Thinking Challenge... employees cite low pay -- most workers earn only $20,000.... President claims average pay is $70,000! $400,000 $70,000 $50,000 $30,000 $20,000

28 1 - 28 Standard Notation MeasureSamplePopulation Mean  x  Stand. Dev. s  Variance s 2  2 SizenN

29 1 - 29 Numerical Data Properties Central Tendency (Location) Variation (Dispersion) Shape

30 1 - 30 Numerical Data Properties & Measures Numerical Data Properties Mean Median Mode Central Tendency Range Variance Standard Deviation Variation Skew Shape Interquartile Range

31 1 - 31 Central Tendency

32 1 - 32 Numerical Data Properties & Measures Numerical Data Properties Mean Median Mode Central Tendency Range Variance Standard Deviation Variation Skew Shape Interquartile Range

33 1 - 33 What’s wrong with this? Measurements 1 4 2 9 8 Middle measurement is 2, so that’s the median X X n XXX n i i n     1 12 

34 1 - 34 Ages Mean = 29 Median = 27 2* | 22233444555777899 2* | 22233444555777899 3* | 01257 3* | 01257 4* | 4* | 5* | 5* | 6* | 6* | 7* | 6 7* | 6

35 1 - 35 Summary of Central Tendency Measures MeasureEquationDescription Mean  X i /n Balance Point Median(n+1) Position Position 2 Middle Value When Ordered Modenone Most Frequent

36 1 - 36 Shape

37 1 - 37 Numerical Data Properties & Measures Numerical Data Properties Mean Median Mode Central Tendency Range Interquartile Range Variance Standard Deviation Variation Skew Shape

38 1 - 38 Shape 1.Describes How Data Are Distributed 2.Measures of Shape Skew = Symmetry Skew = Symmetry Right-SkewedLeft-SkewedSymmetric Mean =Median =Mode Mean Median Mode Mode Median Mean

39 1 - 39 Variation

40 1 - 40 Numerical Data Properties & Measures Numerical Data Properties Mean Median Mode Central Tendency Range Variance Standard Deviation Variation Skew Shape Interquartile Range

41 1 - 41 Quartiles 1.Measure of Noncentral Tendency 2.Split Ordered Data into 4 Quarters 3.Position of i-th Quartile 25%25%25%25% Q1Q1Q1Q1 Q2Q2Q2Q2 Q3Q3Q3Q3 Positionin g Point of Q i(n i  1) 4

42 1 - 42 Ages RangeQuartiles 2* | 22233444555777899 2* | 22233444555777899 3* | 01257 3* | 01257 4* | 4* | 5* | 5* | 6* | 6* | 7* | 6 7* | 6

43 1 - 43 Box Plots - Age and Salary Quartiles: 24, 27, 30 Inner fences: (15,39) Outer fences: (6, 48) Quartiles: 41K, 50K, 60K Inner fences: ?? Outer fences: ??

44 1 - 44 Variance & Standard Deviation 1.Measures of Dispersion 2.Most Common Measures 3.Consider How Data Are Distributed 4.Show Variation About Mean (  X or  ) 4681012 X = 8.3 = 8.3

45 1 - 45 Sample Variance Formula n - 1 in denominator! (Use N if Population Variance) S (X X) n (XX)(XX)(XX) n i i n n 2 2 1 1 2 2 22 1 1        ...

46 1 - 46 Equivalent Formula

47 1 - 47 Another Equivalent Formula

48 1 - 48 Empirical Rule If x has a “symmetric, mound-shaped” distribution Justification: Known properties of the “normal” distribution, to be studied later in the course

49 1 - 49 Preview of Statistical Inference You observe one data point Make hypothesis about mean and standard deviation from which it was drawn Empirical Rule tells you how (un)likely the data point is If very unlikely, you are suspicious of the hypothesis about mean and standard deviation, and reject it If very unlikely, you are suspicious of the hypothesis about mean and standard deviation, and reject it

50 1 - 50 Summary of Variation Measures MeasureEquationDescription Range X largest -X smallest Total Spread Interquartile Range Q 3 -Q 1 Spread of Middle 50% Standard Deviation (Sample) XX n i    21 Dispersion about Sample Mean Standard Deviation (Population) X N iX    2 Dispersion about Population Mean Variance (Sample)  (X i -  X) 2 n - 1 - 1 Squared Dispersion about Sample Mean

51 1 - 51 Z-scores Number of standard deviations from the mean

52 1 - 52 Conclusion 1.Described Qualitative Data Graphically 2.Described Numerical Data Graphically 3.Created & Interpreted Graphical Displays 4.Explained Numerical Data Properties 5.Described Summary Measures 6.Analyzed Numerical Data Using Summary Measures


Download ppt "1 - 1 Statistics An Introduction. 1 - 2 Learning Objectives 1.Define Statistics 2.Describe the Uses of Statistics 3.Distinguish Descriptive & Inferential."

Similar presentations


Ads by Google