Formal Inference Multivariate Internal. Introduction This report compares if Auckland or Wellington citizens are more likely to borrow more money. The.

Slides:



Advertisements
Similar presentations
5 - 1 © 1997 Prentice-Hall, Inc. Importance of Normal Distribution n Describes many random processes or continuous phenomena n Can be used to approximate.
Advertisements

Introduction Data sets can be compared and interpreted in the context of the problem. Data values that are much greater than or much less than the rest.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #15.
Sampling Distributions
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Hotel accommodation Task 2. Write a statistical question or hypothesis, which involves comparing the two sets of data. Your question or hypothesis or.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Welcome to the Unit 8 Seminar Dr. Ami Gates
© 2003 Prentice-Hall, Inc.Chap 6-1 Business Statistics: A First Course (3 rd Edition) Chapter 6 Sampling Distributions and Confidence Interval Estimation.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
CHAPTER 18: Inference about a Population Mean
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2.
90288 – Select a Sample and Make Inferences from Data The Mayor’s Claim.
How much do you smoke?. I Notice... That the median for the males is 13.5 cigarettes per day and the median for females is 10 cigarettes per day. This.
90288 – Select a Sample and Make Inferences from Data The Mayor’s Claim.
Chapter 1: DESCRIPTIVE STATISTICS – PART I2  Statistics is the science of learning from data exhibiting random fluctuation.  Descriptive statistics:
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Numerical Measures of Variability
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Stage 1 Statistics students from Auckland university Using a sample to make a point estimate.
Chapter 3, Part B Descriptive Statistics: Numerical Measures n Measures of Distribution Shape, Relative Location, and Detecting Outliers n Exploratory.
Confidence Interval Estimation For statistical inference in decision making:
Inference Bootstrapping for comparisons. Outcomes Understand the bootstrapping process for construction of a formal confidence interval for a comparison.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
PPDAC Cycle.
COMPUTATIONAL FORMULAS AND IQR’S. Compare the following heights in inches: BoysGirls
Chapter 0: Why Study Statistics? Chapter 1: An Introduction to Statistics and Statistical Inference 1
SECTION 7.2 Estimating a Population Proportion. Where Have We Been?  In Chapters 2 and 3 we used “descriptive statistics”.  We summarized data using.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Multi-variate data internal 4 Credits. achieved The student: Poses an appropriate comparison question, with or without guidance from the teacher,
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Estimating the Value of a Parameter Using Confidence Intervals
Chapter 4: The Normal Distribution
Confidence Interval Estimation
Chapter 3 Describing Data Using Numerical Measures
Inference.
Elementary Statistics
Descriptive Statistics
Reaction Times for Males Vs Females
Writing the executive summary section of your report
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
LESSON 4: MEASURES OF VARIABILITY AND PROPORTION
Chapter 8: Estimating with Confidence
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Chapter 8: Estimating with Confidence
Sampling Distribution Models
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Quantitative Data Who? Cans of cola. What? Weight (g) of contents.
CHAPTER 18: Inference about a Population Mean
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
Chapter 8: Estimating with Confidence
CHAPTER 18: Inference about a Population Mean
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Advanced Algebra Unit 1 Vocabulary
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Presentation transcript:

Formal Inference Multivariate Internal

Introduction This report compares if Auckland or Wellington citizens are more likely to borrow more money. The median house price in Auckland is $ (Apr. 2015)compared with Wellington’s median house price of $ (Apr 2015). The population in Auckland is 1.4million as compared to 400 thousand people in Wellington. The data has been sourced from …… It has been rounded to ….. It is believed that the sample is a random selection of Auckland and Wellington citizens. The size of the sample is 100 citizens from each city.

Analysis

Summary stats are missing

Analysis  The Wellington sample is skewed to the left. There is a group of mortgages in Wellington that fall above $ Auckland’s mortgages are more bell shaped and uniformly spread out.  The range for Auckland and Wellington is similar. (Quote the values). However the interquartile range for Wellington is almost twice that of Auckland's. (Quote values) Which means the debt of Aucklanders are more predictable. (A comment about the standard deviation here would be good for M/E students)

Analysis  It is interesting to note that there are no outliers in this data. It would be expected that one or two people from the sample would have a million dollar plus mortgage.  The median for Wellington mortgages is above Auckland’s upper quartile for mortgages. 50% of the Wellington’s mortgages are above Auckland’s upper quartile.

Analysis

CI Statement It is very likely that the median mortgage for Wellington residence is greater than the median mortgage for Auckland residence. The Bootstrap analysis shows a difference between the medians of between $56000 and $ This interval only contains positive values, therefore it is very likely the median mortgages tend to be larger in Wellington, compared with Auckland.

Interpretation of the CI This report claims that the median mortgage in Wellington is very likely to be between $56000 and $ greater than that of Auckland mortgages.

Sampling Variability Discussion From the original sample the difference between the medians is $ If another sample were collected in the same way from the same population it is highly likely the sample medians will not be $ apart. By re-sampling with replacement from the original sample 1000 times (Boot Strapping) and plotting the difference between each ‘new’ sample a distribution of possibly differences is constructed. This distribution of possibly differences will be a Normal distribution from which a C.I. can be taken to make a formal inference about the population.

Randomisation

Randomisation Analysis In Randomisation analysis I am checking to see if the difference between debt in Wellington and Auckland in my sample could have occurred by chance alone. iNZight randomly re-assigns the categorical data (Auckland and Wellington) and then re-considers the difference between the means. We expect this difference to be approximately zero. We repeat this process 1000 times then look at the distribution. Like when a coin is thrown 1000 times, 500 heads and 500 tails is expected heads to 600 tails can occur and many other combinations make up a normal distribution. The distribution in the randomisation test gives us an idea of the range of possible outcomes by chance alone. When we compare this distribution with the actual difference between the sample means hopefully this value extends outside the chance along range showing that the difference could not have occurred by chance alone.

Other models / Reflections  Bias  Data collection method  Improvements  Translatability  Could you have arrived at your conclusion by chance alone?

Conclusion  Answer your comparison problem giving the correct evidence for your claim.  Briefly highlight the key findings from your investigation