Data Transformations. 4 For some data sets, it may be necessary to transform variables –e.g. change units (lb to kg, ˚ C to ˚ F, etc.) This is simply.

Slides:



Advertisements
Similar presentations
TABLES and FIGURES BIOL 4001.
Advertisements

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Charts & Graphs.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
© SSER Ltd. How Science Works Types of Graph. This presentation looks at the following types of graph: 1.Bar Chart 3.Line Graph4.Pie Chart 5.Scatter Graph.
Chapter 1 – Exploring Data YMS Displaying Distributions with Graphs xii-7.
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
1.1 Displaying Distributions with Graphs
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
Figure 4.6 (page 119) Typical ways of presenting frequency graphs and descriptive statistics.
Topics Covered Discrete probability distributions –The Uniform Distribution –The Binomial Distribution –The Poisson Distribution Each is appropriately.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
The Scientific Method Honors Biology Laboratory Skills.
Graphs An Introduction. What is a graph?  A graph is a visual representation of a relationship between, but not restricted to, two variables.  A graph.
Wednesday, May 13, 2015 Report at 11:30 to Prairieview.
Dot Plots and Histograms Lesson After completing this lesson, you will be able to say: I can create a dot plot and histogram to display a set of.
Graphing Data: Introduction to Basic Graphs Grade 8 M.Cacciotti.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
Chapter 11 Data Descriptions and Probability Distributions Section 1 Graphing Data.
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
Numerical descriptions of distributions
Chapter 4 Displaying Quantitative Data. Quantitative variables Quantitative variables- record measurements or amounts of something. Must have units or.
Chapter 2 Frequency Distributions PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh Edition by Frederick J Gravetter.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics, A First Course 4 th.
14.6 Descriptive Statistics (Graphical). 2 Objectives ► Data in Categories ► Histograms and the Distribution of Data ► The Normal Distribution.
STATS DAY First a few review questions. Which of the following correlation coefficients would a statistician know, at first glance, is a mistake? A. 0.0.
Descriptive Statistics: Tabular and Graphical Methods
Chapter 2: Modeling Distributions of Data
Chapter 1: Exploring Data
Aim: How can we display data in an experiment?
STATS DAY First a few review questions.
Chapter 1 Data Analysis Section 1.2
Describing Distributions of Data
Undergraduated Econometrics
Figure 4.6 In frequency distribution graphs, we identify the position of the mean by drawing a vertical line and labeling it with m or M. Because the.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
1.1 Cont’d.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Statistics for Managers Using Microsoft® Excel 5th Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Warmup Find the marginal distribution for age group.
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
The Normal Distribution
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Data Transformations

4 For some data sets, it may be necessary to transform variables –e.g. change units (lb to kg, ˚ C to ˚ F, etc.) This is simply a change in the scale, and such transformations are called ‘Linear’. Linear transformations consist of (1) multiplying all the observations by a constant, (2) adding a constant to all observations, or (3) both.

Data Transformations 4 Multiplicative transformation example –Y = weight in kg –Y’ = weight in lb –Y’ = 2.2Y 4 Additive transformation example –Measurements of nitrate (mg/l) → Y Y = 0.3, 0.35, 0.5, 0.42, 0.38, 0.56… –Add 1 to each number → Y’ Y’ = 1.3, 1.35, 1.5, 1.42, 1.38, 1.56…

Data Transformations 4 Additive and Multiplicative example –Body temperature measurements in ˚ C (Y) were taken for 47 women; if we convert to ˚ F (Y’): Y’ = 1.8Y Multiplicative transformations affect S in the same way that they affect the mean: –e.g., if mean Y = 22, and mean Y’ = 2.2Y –then S Y’ = 2.2S Y

Data Transformations 4 Additive transformations, however, don’t affect S Original observations DeviationsTransformed observations Deviations Mean

Data Transformations 4 Additive transformations thus effectively move probability distributions to the left or the right – but the shape of the histogram is unchanged. 4 Multiplicative transformations shrink or stretch the probability distribution

Nonlinear Transformations 4 These sorts of transformations affect data in more complex ways. 4 Examples:

Nonlinear Transformations 4 These transformations do change the essential shape of frequency distributions 4 They are thus used to try and make distributions more symmetric – i.e., are tools to achieve normality.

Transformations to achieve normality 4 If the distribution is skewed to the right (the most common problem) then each of the following transformations will help produce a more symmetric distribution. 4 The transformations are listed in order of how much they will pull in a right- skewed distribution.

Transformations to achieve normality 4 Percentage or proportion data is a special case – it often appears binomially distributed –e.g., 0-100%, Here the appropriate transformation is:

Results 4 Tables and figures - must have a purpose

Results: Tables 4 When to use: –Present numerical values –Large amounts of information 4 Rules –Numbered consecutively –Must be able to stand alone –Vertical arrangement –Title goes above the table –Definitions/’explanations’ go below the table

“Bad Table” Table 6. Growth rate of cell cultures and activity of ornithine decarboxylase (ODC) and succinate dehydrogenase (SDH) in Pseudomonas aeruginosa in response to various carbon sources

“Good Table” Table 7. Growth rate of cell cultures and activity of ornithine decarboxylase (ODC) and succinate dehydrogenase (SDH) in Pseudomonas aeruginosa in response to various carbon sources

Table 4. Response of male fighting fish (Betta splendens) to their image in a mirror a a Prior to the experiment, fish had been visually isolated from one another for 2 wk. Observation period for each fish was 30 s.

Results: Figures 4 Use to illustrate important points –summarize your data 4 Number graphs consecutively –separately from tables 4 Must be able to stand alone 4 Titles go below figure or on separate “Figure Legends” page 4 Know when to use specific types of graphs –Bar graph vs histogram –Scatter plot vs line graph

Bar graph (refer to page 57) Problems?

Bar graph (refer to page 57) Cleared quadrat Control quadrat

Results: Graphs 4 Do not forget to include error bars –Is your data significant? –Are there differences 4 Complete figure legend

Figure 2. Production of flowers by three species of plants in the absence of interspecific competition and under natural conditions Cleared quadrat Control quadrat

Figure 2. Production of flowers by three species of plants in the absence of interspecific competition (cleared quadrats) and under natural conditions (control quadrats). The plants were Campanula rapunculoides, Epilobium angustifolium, and Hieracium aurantiacum. Plotted are means for eight randomly chosen quadrats. Each 1 x 1 m 2. Cleared quadrat Control quadrat

4 Text –Data summary –Do not discuss or draw conclusions 4 Statistics –Incorporate statistics into the verbal text –Be careful when using the word “significant” –Refer to appropriate tables and figures When do you use “Figure” and when do you use “Fig.”?

As shown in Figure 1, the shoreline of Hicks Pond was generally predominated by grasses and sedges. Observed frequencies of turtles obtaining food differed significantly from expected frequencies (x 2 =58.19, df=8, P<0.001; Fig. 2).