ADULT MALE % BODY FAT. Background  This data was taken to see if there are any variables that impact the % Body Fat in males  Height (inches)  Waist.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Describing Quantitative Variables
Residuals.
The Normal Distribution
Chapter 3 Bivariate Data
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
1.2: Describing Distributions
Linear Regression.
STAT 250 Dr. Kari Lock Morgan
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Residuals Target Goal: I can construct and interpret residual plots to assess if a linear model is appropriate. 3.2c Hw: pg 192: 48, 50, 54, 56, 58 -
Section 2.2, Part 1 Standard Normal Calculations AP Statistics Berkley High School/CASA.
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Review Measures of central tendency
Density Curves Section 2.1. Strategy to explore data on a single variable Plot the data (histogram or stemplot) CUSS Calculate numerical summary to describe.
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
The Standard Deviation as a Ruler and the Normal Model
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Find out where you can find rand and randInt in your calculator. Write down the keystrokes.
3.2 - Least- Squares Regression. Where else have we seen “residuals?” Sx = data point - mean (observed - predicted) z-scores = observed - expected * note.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
AP Statistics Semester One Review Part 1 Chapters 1-3 Semester One Review Part 1 Chapters 1-3.
Describing Data: Two Variables
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Unit 3 Correlation. Homework Assignment For the A: 1, 5, 7,11, 13, , 21, , 35, 37, 39, 41, 43, 45, 47 – 51, 55, 58, 59, 61, 63, 65, 69,
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Chapter 13 Lesson 13.2a Simple Linear Regression and Correlation: Inferential Methods 13.2: Inferences About the Slope of the Population Regression Line.
AP Review Exploring Data. Describing a Distribution Discuss center, shape, and spread in context. Center: Mean or Median Shape: Roughly Symmetrical, Right.
Unit 4 LSRL.
Sections Review.
Chapter 3: Describing Relationships
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Chapter 4 Correlation.
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Linear transformations
EQ: How well does the line fit the data?
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Summary (Week 1) Categorical vs. Quantitative Variables
Summary (Week 1) Categorical vs. Quantitative Variables
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Summarizing Bivariate Data
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Honors Statistics Review Chapters 7 & 8
Chapter 3: Describing Relationships
Presentation transcript:

ADULT MALE % BODY FAT

Background  This data was taken to see if there are any variables that impact the % Body Fat in males  Height (inches)  Waist Size (inches)  Chest Size (inches)  Percent Body Fat (%bf)  Each variable was then compared to the calculated % Body Fat to see if there was a correlation between the two

% Body Fat

Summary Statistics (% Body Fat)  Min: 3%  Q1: 12.4%  Med: 19.15%  Q3: 24.8%  Max: 38.1%  Mean: %  Std. Dev.: 7.921% Outlier Test = (1.5) = 18.6 UF: = 43.4% LF: = -6.2% There are no outliers present

Graph Description  The data are unimodal and roughly symmetric  The graph’s center is its mean of % Body Fat  The spread of the data is represented by the std. deviation of 7.921% and a (3%, 38.1%) range  There are no outliers

Percent of Males Overweight  P(x>19%) = 78/150 =.52 or 52% Overweight  P(x<19.1%) = 72/150 =.48 or 48 th percentile (first overweight male)  P(x>25%) = 37/150 =.247 or 24.7% Obese  P(x<25.2%) = 113/150 =.753 or 75 th percentile (first obese male) =.786 Z-Score of first obese male

Compared to Normal Model  Between one standard deviation in either direction from % Body Fat (mean) is 64% of the data  P(11.053<x<26.895) = 96/150 =.64 or 64%  This is lower than the expected 68% for normal data  Between two standard deviations in either direction from the mean is 98%  P(3.132<x<34.816) = 147/150 =.98 or 98%  This is higher than the expected 95% for normal data  There is no data that reaches three standard deviations from the mean  The final 2% falls between two and three standard deviations

Shortest Men  Three Shortest Men at 64in. Tall  Man #1 – 13.5% Body Fat  Man #2 – 13.5% Body Fat  Man #3 – 13.5% Body Fat  All three men have % body fat that Is higher than the minimum 3%  Shows that height is independent of % body fat

Z Scores = Z-Score of shortest men’s height = Z-Score of shortest men’s % Body Fat The shortest men have the smallest Z Score for height. The score would normally be around a 3, but because the standard deviation is 2.663, the data does not go to 3 standard deviations from the mean. The Z Score for % Body Fat is not the lowest as predicted. It is below 1 z score from the mean, or within 64% of the data. This proves that height is independent of % Body Fat as the shortest men had the lowest Z score for height, but not for % Body Fat

Tallest Men  Two tallest men at 77.5in tall  Man #1 – 10.3% Body Fat  Man #2 – 10.3% Body Fat  Both men have % body fats that are lower than the maximum of 38.1% body fat  Re-confirms that height is independent of % body fat

Z Scores = Z-Score of tallest men’s height = Z-Score of tallest men’s % body fat The tallest men have the largest Z Score for height. As with the shortest men, the score would normally be around 3, but the data does not reach as far as 3 standard deviations The Z Score for % Body Fat is not the highest as predicted. It is just above 1 standard deviation away from the mean. This proves that the tallest men will not have the highest % Body Fat, as the score for height is the largest, but the score for % Body Fat is not.

Height: Scatterplot

Description  Form: Scattered  Direction: No Association  Strength: Scattered  There is essentially no correlation between the two variables so there are no unusual points (outliers, high leverage, influential)  Pct_BF= (Height) LSRL Equation

Scatterplot w/ LSRL  r= 0.040

Interpreting Slope  For every additional inch in height, there is a increase in 0.119% body fat from a base body fat of % on average

Interpreting r 2  r 2 =  0.16% of the change in percent body fat is a result of the change in height

Residual Plot

Description  The residual points are scattered and have no set pattern between the positive and negative residuals

Is the model a good fit?  I do not think that a linear model is a good fit for the data  The original plot is scattered with no indication of where a linear model could possibly represent the entirety of the data  However, the residual plot has scattered points and would appear to show a linear model being fine  Nevertheless, the correlation of further proves that the data has almost no correlation and that a linear model is not a good fit

Waist size: Scatterplot

Description  Form: Linear  Direction: Positive  Strength: Moderately Strong  No unusual points  Pct_BF= (Waist) LSRL Equation

Scatterplot w/ LSRL  r= 0.789

Interpreting Slope  For every additional inch in waist size, there is 1.734% of body fat added on from a base percent of % on average

Interpreting r 2  r 2 =  62.3% of the change in percent body fat is affected by the change in waist size

Residual Plot

Description  The residuals are scattered throughout and there is no pattern shown between the positive and negative residuals

Is the Model a Good Fit?  Yes the linear model is a good fit for the data  The original scatterplot had its points close to one another and in a distinct linear form making the model pass through or be close to all the points  The residual plot had scattered points and no set pattern  The correlation of further proves the linear model is a good fit by showing that the scatterplot is strong

Chest size: Scatterplot

Description  Form: Linear  Direction: Positive  Strength: Moderate  No unusual points  Pct_BF= (Chest) LSRL Equation

Scatterplot w/ LSRL  r= 0.651

Interpreting Slope  For every additional inch in chest size, there is an increase of 0.666% in percent body fat from a base percent of % on average

Interpreting r 2  r 2 =  42.4% of the change in percent body fat is affected by the change in chest size

Residual Plot

Description  The residuals are scattered and there is no pattern shown between the positive and negative residuals

Is the Model a Good Fit?  Yes the linear model is a good fit for the data  The original scatterplot has points that are relatively close to one another and shows a distinct linear form, so that a linear model go through or be close to a majority of the points  The residual plot has scattered points and there is no pattern shown for the residuals  The correlation of further reinforces that fact that a linear model is a good fit because it shows that the scatterplot is strong

The Best Model  Waist size is the best model to use for predicting percent body fat  Height is definitely not because there is essentially no correlation between it and percent body fat  Chest size does have good correlation with percent body fat, but waist size has a stronger correlation and therefore a more linear model with percent body fat

Percent Body Fat Predictions  Low: Pct_BF= (30.16)= 8.185%  Middle: Pct_BF= (36.26)= %  High: Pct_BF= (42.72)= %

Residuals  Low: 12.5% %= 4.315%  Middle: 27.2% %= 8.437%  High: 31.9% %= 1.936%

Possible Lurking Variables  Time (hours) spent exercising per week  Grams of saturated fat eaten per day

Conclusion  There is no discernable correlation between %BF and height  Scatterplot did not have a pattern  There is a clear association between %BF and waist size  It appears to be that as waist size increases, %BF usually increases as well According to the graph, 62% of the change in %BF is due to the change in waist size  There is probably an association between %BF and chest size  As chest size increases, %BF usually increases According to the graph, 42% of the change in %BF is due to the change in chest size

Conclusion (cont.)  This data is not conclusive of what exactly causes %BF in Adult Males  Factors such as exercise and diet were not considered, and possibly could have an impact on %BF  The sample size was also limited to 150 individuals, so this group could be limited to sampling error