Statistical Hydrology 1 Dr. Muhammad Ajmal Lecturer, Agri. Engg. Dept. UET Peshawar DATA TRANSFORMATION.

Slides:



Advertisements
Similar presentations
Descriptive Statistics-II
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Transformations Getting normal or using the linear model.
Chapter 10 Re-Expressing data: Get it Straight
Get it Straight!! Chapter 10
Assumption of normality
Slide 3- 1 Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
1 Assessing Normality and Data Transformations Many statistical methods require that the numeric variables we are working with have an approximate normal.
Assessing Normality and Data Transformations. Role of Normality Many statistical methods require that the numeric variables we are working with have an.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Re-expressing data CH. 10.
Re-expressing the Data: Get It Straight!
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Lecture 4 Normal distribution? Sample data at :
Measures of Dispersion
8/7/2015Slide 1 Simple linear regression is an appropriate model of the relationship between two quantitative variables provided: the data satisfies the.
Understanding and Comparing Distributions
SW388R7 Data Analysis & Computers II Slide 1 Assumption of normality Transformations Assumption of normality script Practice problems.
Descriptive Statistics for Numeric Variables Types of Measures: measures of location measures of spread measures of shape measures of relative standing.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
“STANDARD DEVIATION” Standard Deviation: Std deviation is the best and scientific method of dispersion. It is widely used method used in statistical.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Skewness & Kurtosis: Reference
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chapter 10 Re-expressing the data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 10 Re-expressing Data: Get It Straight!. Slide Straight to the Point We cannot use a linear model unless the relationship between the two.
Lecture 6 Re-expressing Data: It’s Easier Than You Think.
To be given to you next time: Short Project, What do students drive? AP Problems.
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
If the scatter is curved, we can straighten it Then use a linear model Types of transformations for x, y, or both: 1.Square 2.Square root 3.Log 4.Negative.
Chapter 5 Lesson 5.4 Summarizing Bivariate Data 5.4: Nonlinear Relationships and Transformations.
Re-Expressing Data. Scatter Plot of: Weight of Vehicle vs. Fuel Efficiency Residual Plot of: Weight of Vehicle vs. Fuel Efficiency.
Copyright © 2010 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Math 4030 – 7b Normality Issues (Sec. 5.12) Properties of Normal? Is the sample data from a normal population (normality)? Transformation to make it Normal?
Understanding & Comparing Distributions Chapter 5.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Chapter 10 Notes AP Statistics. Re-expressing Data We cannot use a linear model unless the relationship between the two variables is linear. If the relationship.
Statistics 10 Re-Expressing Data Get it Straight.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 10 Re-expressing Data: Get it Straight!
Data transformation. The Aim By the end of this lecture, the students will be aware of data transformation methods to make the appropriate statistical.
BAE 6520 Applied Environmental Statistics
BAE 5333 Applied Water Resources Statistics
Let’s Get It Straight! Re-expressing Data Curvilinear Regression
Introductory Statistics: Exploring the World through Data, 1e
Re-expressing the Data: Get It Straight!
Data description and transformation
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
(12) students were asked their SAT Math scores:
Re-expressing Data: Get it Straight!
Re-expressing the Data: Get It Straight!
Re-expressing the Data: Get It Straight!
Theme 5 Standard Deviations and Distributions
So how do we know what type of re-expression to use?
Assessing Normality and Data Transformations
Honors Statistics Day 4 Objective: Students will be able to understand and calculate variances and standard deviations. As well as determine how to describe.
AP Statistics Day 5 Objective: Students will be able to understand and calculate variances and standard deviations.
Lecture 6 Re-expressing Data: It’s Easier Than You Think
Describing Distributions with Numbers
Describing Data Coordinate Algebra.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Numerical Descriptive Measures
Re-expressing Data: Get it Straight!
Presentation transcript:

Statistical Hydrology 1 Dr. Muhammad Ajmal Lecturer, Agri. Engg. Dept. UET Peshawar DATA TRANSFORMATION

What is a data transformation?  It is a mathematical function that is applied to all the observations of a given variable Y represents the original variable, Y* is the transformed variable, and f is a mathematical function that is applied to the data Transformation of a variable can change its distribution from a skewed distribution to a normal distribution (bell-shaped, symmetric about its center 04. Data Transformation

3 ▣ Data Transformation Transformation are used for three purposes ① To make data more symmetric ② To make data more linear, and ③ To make data more constant in variance In order to make an asymmetric distribution become more symmetric, the data can be transformed or re-expressed into new units. The new nits alter the distances between observations on a line plot. The effect is to either expand or contract the distances to extreme observations on one side of the median making it look more like the other side.

04. Data Transformation 4 ▣ Data Transformation The most commonly used transformation in water resources is the logarithm. Logs of water resources data for example, stream discharge, hydraulic conductivity and sediment concentration are often taken before statistical analyses are performed.

UP Bigger Impact Bigger Impact.. Middle rung: No transformation ( = 1) Middle rung: No transformation ( = 1) DOWN Here V represents our variable of interest. We are going to consider this variable raised to a power, i.e. V We go up the ladder to remove left skewness and down the ladder to remove right skewness. Right skewed Left skewed Tukey’s Ladder of Powers for Transformation

To remove right skewness we typically take the square root, cube root, logarithm, or reciprocal of a the variable etc., i.e. V.5, V.333, log 10 (V) (think of V 0 ), V -1, etc. To remove left skewness we raise the variable to a power greater than 1, such as squaring or cubing the values, i.e. V 2, V 3, etc. Tukey’s Ladder of Powers for Transformation

Tukey’s Ladder of Power

To remove right skewness we typically take the square root, cube root, logarithm, or reciprocal of a the variable etc., i.e. V.5, V.333, log 10 (V) (think of V 0 ), V -1, etc. To remove left skewness we raise the variable to a power greater than 1, such as squaring or cubing the values, i.e. V 2, V 3, etc. Tukey’s Ladder of Power

How can we determine if observations are normally distributed? Graphical examination Histogram or boxplot Normal quantile-quantile plot (QQ-plot) Goodness of fit tests Kolmogorov-Smirnov test Shapiro-Wilks test Anderson Darling test Transformations to Achieve Normality

How can we determine if observations are normally distributed? Transformations to Achieve Normality

Questions? 11