Presentation is loading. Please wait.

Presentation is loading. Please wait.

Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.

Similar presentations


Presentation on theme: "Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill."— Presentation transcript:

1

2 Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays & Fridays. http://www.youtube.com/watch?v=oSQJP40PcGI

3 Schedule of readings Before next exam (February 12 th ) Please read chapters 1 - 5 in OpenStax textbook Please read Appendix D, E & F online On syllabus this is referred to as online readings 1, 2 & 3 Please read Chapters 1, 5, 6 and 13 in Plous Chapter 1: Selective Perception Chapter 5: Plasticity Chapter 6: Effects of Question Wording and Framing Chapter 13: Anchoring and Adjustment

4 Remember bring your writing assignment forms notebook and clickers to each lecture Register your clicker by February 1 st and receive extra credit! student.turningtechnologies.com (Please note there is no “www”)

5 Everyone will want to be enrolled in one of the lab sessions Labs continue next week

6 Project 1 - Likert Scale - Correlations - Comparing two means (bar graph) Questions?

7

8 By the end of lecture today 1/29/16 Use this as your study guide Frequency distributions and Frequency tables Guidelines for constructing frequency distributions 1. Classes should be mutually exclusive 2. Set of classes should be exhaustive 3. All classes should have equal intervals 4. Selecting number of classes is subjective (5 -15 will often work) 5. Class width should be round (easy) numbers 6. Try to avoid open ended classes Cumulative Frequency Relative Frequency and percentages Predicting frequency of larger sample based on relative frequency Pie Charts Relative Cumulative Frequency

9 Homework Assignment 5 Frequency Tables and Graphing with Excel Please print out and complete this homework worksheet And hand it in during class on Monday Due: Monday, February 1 st

10

11 Homework review You are looking to see if “class standing” affects the “level of sales”. Independent variable (IV):______________ Number of levels of IV: ________________ (how many means?) Quasi or True experiment:______________ Dependent variable: __________________ Between or within participant design: ______________ In this study, what is the operational definition of “class standing”? In this study, what is the operational definition of “level of sales”? Class standing Level of sales 4 Quasi Between Classification based on units earned Number of bags of peanuts sold

12 Homework review You are looking to see whether “type of program” has an effect on “body transformation”. Please identify the following variables: Independent variable (IV):______________ Number of levels of IV: _______________ (how many means?) Quasi or True experiment:______________ Dependent variable: __________________ Between or within participant design: ______________ What is the operational definition of “type of program”? What is the operational definition of “body transformation”? Type of program Body transformation 2 True Between Type of program = type of diet (regular versus programmatic diet) Body transformation = number of pounds lost

13 Homework review You are looking to see which driving choice is most efficient. So you ask each driver to drive each of the three routes and time themselves on how long it takes. Please identify the following variables: Independent variable (IV):______________ (how many means) Number of levels of IV: ________________ Dependent variable: __________________ Between or within participant design: ______________ What is the operational definition of “driving efficiency”? What is the operational definition of “driving choice”? Type of route driving efficiency 3 Within Driving efficiency = travel time (measured in minutes) Driving choice = route taken

14 Homework review

15 Notice that the operational definition of each construct matters

16 Homework review gender 2 quasi salary between nominal ratio

17 Name of City Quasi- experiment 3 Between Temperature Nominal Interval

18 Homework review city 3 quasi temperature between nominal interval Must be complete and must be stapled Hand in your homework

19 You’ve gathered your data…what’s the best way to display it??

20 141720252129 162527181613 112119242011 202816131714 14168171711 11141719248 16122592017 1114161822 1418231215 1013151111 Describing Data Visually 81114171924 81214172025 91215172025 101315172025 111316172027 111316172128 111416182129 1114161822 1114161823 1114161924 Lists of numbers too hard to see patterns Organizing numbers helps Graphical representation even more clear This is a dot plot

21 Describing Data Visually 81214171924 81214172025 91315172025 101315172025 111316172027 111316172128 111416182129 1114161822 1114161823 1114161924 Measuring the “frequency of occurrence” Then figure “frequency of occurrence” for the bins We’ve got to put these data into groups (“bins”)

22 Frequency distributions Frequency distributions an organized list of observations and their frequency of occurrence How many kids are in your family? What is the most common family size?

23 Another example: How many kids in your family? 3 4 8 2 2 1 4 1 14 2 Number of kids in family 1313 1414 2424 2828 214

24 Frequency distributions Crucial guidelines for constructing frequency distributions: 1. Classes should be mutually exclusive: Each observation should be represented only once (no overlap between classes) 2. Set of classes should be exhaustive: Should include all possible data values (no data points should fall outside range) Wrong 0 - 5 5 - 10 10 - 15 Correct 0 - 4 5 - 9 10 - 14 Correct 0 - under 5 5 - under 10 10 - under 15 How many kids are in your family? What is the most common family size? Number of kids in family 13 14 24 28 214 Wrong 0 - 4 8 - 11 12 - 15 Correct 0 - 3 4 - 7 8 - 11 12 - 15 No place for our families of 4, 5, 6 or 7

25 Frequency distributions Crucial guidelines for constructing frequency distributions: 3. All classes should have equal intervals (even if the frequency for that class is zero) Wrong 0 - 1 2 - 12 14 - 15 Correct 0 - 4 5 - 9 10 - 14 Correct 0 - under 5 5 - under 10 10 - under 15 How many kids are in your family? What is the most common family size? Number of kids in family 13 14 24 28 214

26 4. Selecting number of classes is subjective Generally 5 -15 will often work 8 12 14 17 19 24 8 12 14 17 20 25 9 13 15 17 20 25 10 13 15 17 20 25 11 13 16 17 20 27 11 13 16 17 21 28 11 14 16 18 21 29 11 14 16 18 22 11 14 16 18 23 11 14 16 19 24 How about 6 classes? (“bins”) How about 8 classes? (“bins”) How about 16 classes? (“bins”)

27 5. Class width should be round (easy) numbers 6. Try to avoid open ended classes For example 10 and above Greater than 100 Less than 50 Clear & Easy 8 - 11 12 - 15 16 - 19 20 - 23 24 - 27 28 - 31 8 12 14 17 19 24 8 12 14 17 20 25 9 13 15 17 20 25 10 13 15 17 20 25 11 13 16 17 20 27 11 13 16 17 21 28 11 14 16 18 21 29 11 14 16 18 22 11 14 16 18 23 11 14 16 19 24 Round numbers: 5, 10, 15, 20 etc or 3, 6, 9, 12 etc Lower boundary can be multiple of interval size Remember: This is all about helping readers understand quickly and clearly.

28 Let’s do one Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 If less than 10 groups, “ungrouped” is fine If more than 10 groups, “grouped” might be better How to figure how many values 99 - 53 + 1 = 47 Step 1: List scores 53 58 60 61 64 69 70 72 73 75 76 78 80 82 84 87 88 89 91 93 94 95 99 Step 2: List scores in order Step 3: Decide whether grouped or ungrouped Step 4: Generate number and size of intervals (or size of bins) Largest number - smallest number + 1 Sample size (n) 10 – 16 17 – 32 33 – 64 65 – 128 129 - 255 256 – 511 512 – 1,024 Number of classes 5 6 7 8 9 10 11 If we have 6 bins – we’d have intervals of 8 Whaddya think? Would intervals of 5 be easier to read? Let’s just try it and see which we prefer…

29 Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 53 58 60 61 64 69 70 72 73 75 76 78 80 82 84 87 88 89 91 93 94 95 99 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Scores on an exam Score Frequency 93 - 100 4 85 - 92 6 77- 84 6 69 - 76 7 61- 68 2 53 - 60 3 10 bins Interval of 5 6 bins Interval of 8 Let’s just try it and see which we prefer… Remember: This is all about helping readers understand quickly and clearly. Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1

30 Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Let’s make a frequency histogram using 10 bins and bin width of 5!!

31 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Step 6: Complete the Frequency Table Scores on an exam 82 58 64 80 75 72 87 73 88 94 84 78 93 69 70 60 53 84 76 87 84 61 89 95 87 91 75 99 Cumulative Frequency 28 26 23 18 13 9 6 5 2 1 Relative Frequency.0715.1071.1786.1429.1071.0357.1071.0357 Relative Cumulative Frequency 1.0000.9285.8214.6428.4642.3213.2142.1785.0714.0357 6 bins Interval of 8 Just adding up the frequency data from the smallest to largest numbers Just dividing each frequency by total number to get a ratio (like a percent) Please note: 1 /28 =.0357 3/ 28 =.1071 4/28 =.1429 Just adding up the relative frequency data from the smallest to largest numbers Please note: Also just dividing cumulative frequency by total number 1/28 =.0357 2/28 =.0714 5/28 =.1786

32 Data based on Gallup poll on 8/24/11 Who is your favorite candidate Candidate Frequency Hillary Clinton45 Bernie Sanders23 Joe Biden17 Jim Webb 1 Other/Undecided 14 Simple Frequency Table – Qualitative Data We asked 100 Democrats “Who is your favorite candidate?” Relative Frequency.4500.2300.1700.0100.1400 Just divide each frequency by total number Please note: 45 /100 =.4500 23 /100 =.2300 17 /100 =.1700 1 /100 =.0100 Percent 45% 23% 17% 1% 14% If 22 million Democrats voted today how many would vote for each candidate? Number expected to vote 9,900,000 5,060,000 3,740,000 220,000 3,080,000 Just multiply each relative frequency by 100 Please note:.4500 x 100 = 45%.2300 x 100 = 23%.1700 x 100 = 17%.0100 x 100 = 1% Just multiply each relative frequency by 22 million Please note:.4500 x 22m = 9,900k.2300 x 22m = 35,060k.1700 x 22m = 23,740k.0100 x 22m= 220k

33

34

35

36 Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 53 58 60 61 64 69 70 72 73 75 76 78 80 82 84 87 88 89 91 93 94 95 99 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Remember Dot Plots Score on exam 80 - 84 75 - 79 70 - 74 65 - 69 60 - 64 55 - 59 50 - 54 90 - 94 95 - 99 85 - 89 6 5 4 3 2 1 Step 4: Decide 10 for # bins (classes) 5 for bin width (interval size) Step 1: List scores Step 2: List scores in order Step 3: Decide grouped Step 5: Generate frequency histogram

37 Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 53 58 60 61 64 69 70 72 73 75 76 78 80 82 84 87 88 89 91 93 94 95 99 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Score on exam 80 - 84 75 - 79 70 - 74 65 - 69 60 - 64 55 - 59 50 - 54 90 - 94 95 - 99 85 - 89 6 5 4 3 2 1 Remember Dot Plots Step 4: Decide 10 for # bins (classes) 5 for bin width (interval size) Step 1: List scores Step 2: List scores in order Step 3: Decide grouped Step 5: Generate frequency histogram

38 Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 53 58 60 61 64 69 70 72 73 75 76 78 80 82 84 87 88 89 91 93 94 95 99 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Score on exam 80 - 84 75 - 79 70 - 74 65 - 69 60 - 64 55 - 59 50 - 54 90 - 94 95 - 99 85 - 89 6 5 4 3 2 1 Remember Dot Plots Step 4: Decide 10 for # bins (classes) 5 for bin width (interval size) Step 1: List scores Step 2: List scores in order Step 3: Decide grouped Step 5: Generate frequency histogram

39 Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 53 58 60 61 64 69 70 72 73 75 76 78 80 82 84 87 88 89 91 93 94 95 99 Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Score on exam 80 - 84 75 - 79 70 - 74 65 - 69 60 - 64 55 - 59 50 - 54 90 - 94 95 - 99 85 - 89 6 5 4 3 2 1 Remember Dot Plots Step 4: Decide 10 for # bins (classes) 5 for bin width (interval size) Step 1: List scores Step 2: List scores in order Step 3: Decide grouped Step 5: Generate frequency histogram

40 Step 4: Decide 10 for # bins (classes) 5 for bin width (interval size) Scores on an exam 82586480 75728773 88948478 93697060 53847687 84618995 87917599 Step 1: List scores Step 2: List scores in order Step 3: Decide grouped Scores on an exam Score Frequency 95 - 992 90 - 94 3 85 - 89 5 80 – 845 75 - 79 4 70 - 74 3 65 - 69 1 60 - 64 3 55 - 59 1 50 - 54 1 Step 5: Generate frequency histogram Score on exam 80 - 84 75 - 79 70 - 74 65 - 69 60 - 64 55 - 59 50 - 54 90 - 94 95 - 99 85 - 89 6 5 4 3 2 1

41


Download ppt "Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill."

Similar presentations


Ads by Google