Data Visualization.

Slides:



Advertisements
Similar presentations
Graphing Data on the Coordinate Plane
Advertisements

ENV Envisioning Information Lecture 3 – Multivariate Data Exploration Scatter plots and parallel coordinates Ken Brodlie.
Lecture 06: Design II February 5, 2013 COMP Visualization.
Multi-Dimensional Data Visualization
Lecture 1: Beautiful graphics in R
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Sep 23, 2013 IAT Data ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Chapter 2 Summarizing and Graphing Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Graphical Examination of Data Jaakko Leppänen
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
Multidimensional data processing. Multivariate data consist of several variables for each observation. Actually, serious data is always multivariate.
Unit 0, Pre-Course Math Review Session 0.3, Graphing J. Jackson Barnette Professor of Biostatistics.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 3.1 Chapter Three Art and Science of Graphical Presentations.
Learn to locate and graph points on the coordinate plane.
11-3 The Coordinate Plane Warm Up Problem of the Day
Warm Up Problem of the Day Lesson Presentation Lesson Quizzes.
TEKS 8.6 (A,B) & 8.7 (A,D) This slide is meant to be a title page for the whole presentation and not an actual slide. 8.6 (A) Generate similar shapes using.
LISA Short Course Series Multivariate Analysis in R Liang (Sally) Shan March 3, 2015 LISA: Multivariate Analysis in RMar. 3, 2015.
Shapes and the Coordinate System TEKS 8.6 (A,B) & 8.7 (A,D)
Graphical Data Displays and Interpretation 2009 October 9.
Graphical Data Displays and Interpretation Wednesday, October 9.
Scientific Communication and Technological Failure presentation for ILTM, July 9, 1998 Dan Little.
Visualization and Data Mining. 2 Outline  Graphical excellence and lie factor  Representing data in 1,2, and 3-D  Representing data in 4+ dimensions.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 3.1 Chapter Three Art and Science of Graphical Presentations.
TEKS 8.6 (A,B) & 8.7 (A,D) This slide is meant to be a title page for the whole presentation and not an actual slide. 8.6 (A) Generate similar shapes using.
1 How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics
Information Visualization in Data Mining S.T. Balke Department of Chemical Engineering and Applied Chemistry University of Toronto.
Lecture 3: Bivariate Data & Linear Regression 1.Introduction 2.Bivariate Data 3.Linear Analysis of Data a)Freehand Linear Fit b)Least Squares Fit c)Interpolation/Extrapolation.
Visualizing quantitative information martin krzywinski.
Charts and Graphs V
11-3 The Coordinate Plane Warm Up Warm Up Lesson Presentation Lesson Presentation Problem of the Day Problem of the Day Lesson Quizzes Lesson Quizzes.
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
Graphs in Science You Can Do It!!!.
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
CS 478 – Tools for Machine Learning and Data Mining Data Understanding.
Graphs An Introduction. What is a graph?  A graph is a visual representation of a relationship between, but not restricted to, two variables.  A graph.
Let’s play “Name That Car!”. What car is this? And this?
Tan,Steinbach, Kumar: Exploratory Data Analysis (with modifications by Ch. Eick) Data Mining: “New” Teaching Road Map 1. Introduction to Data Mining and.
Points on a Graph Objectives After reviewing this unit you will be able to: Identify the x and y axes. Identify the origin on a graph. Identify x and y.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Elementary Statistics Eleventh Edition Chapter 2.
Thinking about Graphics Scales in Stata. Level of measurement Categorical versus continuous Categorical data may be represented as Position along a categorical.
Chapter The Cartesian Plane Ms. Robin. You will learn: To label the axes and origin of a Cartesian plane Identify and plot points on a Cartesian.
Worth 1,000 Words How to use information graphics to make data meaningful National Association for Career and Technical Education Information May 17, 2012.
W.A.L.T: Reading and Writing Co-ordinates. Our Aim To locate things using co-ordinate grids
In Stat-I, we described data by three different ways. Qualitative vs Quantitative Discrete vs Continuous Measurement Scales Describing Data Types.
16.1 Vis_2002 Data Visualization Lecture 14 Information Visualization Part 1.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
Scatter Plots Scatter plots are a graphic representation of collated biviariate data via a mathematical diagram using Cartesian coordinates. The data.
Data Mining: Exploring Data
Chapter 3 Exploring Data.
Multivariate Visualization. Projection Distortion.
BY: RONICA NORVELL.  Identify the x and y axis.  Identify the origin on your graph.  Identify x and y coordinates of a point.  Lastly plot points.
Strategies for Success GOOD LUCK!! Strategy 1 Can I plug it in? Can I plug it in?
3/13/2016 Data Mining 1 Lecture 2-1 Data Exploration: Understanding Data Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB)
By Christy Quattrone Click to View Types of Graphs Data Analysis, Grade 5.
Graphical Presentation Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Graphs Another good way to organize this data is with a Graph. Graph – a diagram that shows a relationship between two sets of numbers. So do we have two.
Exploring Data: Summary Statistics and Visualizations
Principles of Graphical Excellence
IAT 355 Data + Multivariate Visualization
Lab 04: Visualizing Multiple Variables
Graphical Data Displays and Interpretation
Statistical power is….
Objective - To graph ordered pairs.
Statistical power is….
Data exploration and visualization
Comp 15 - Usability & Human Factors
Presentation transcript:

Data Visualization

Napoleon Invasion of Russia, 1812

© www. odt. org , from http://www. odt. org/Pictures/minard © www.odt.org , from http://www.odt.org/Pictures/minard.jpg, used by permission

Snow’s Cholera Map, 1855

Far East Asia at Night North Korea Notice how dark it is! Seoul South Korea

Role of Visualization Support interactive exploration Help in result presentation Disadvantage: requires human eyes Can be misleading

Bad Visualization Y-Axis scale gives WRONG impression of big change Year Sales 1999 2110 2000 2105 2001 2120 2002 2121 2003 2124 Y-Axis scale gives WRONG impression of big change

Better Visualization Axis from 0 to 2000 scale gives Year Sales 1999 2110 2000 2105 2001 2120 2002 2121 2003 2124 Axis from 0 to 2000 scale gives CORRECT impression of small change

Another Bad Visualization Lie Factor=14.8 (E.R. Tufte, “The Visual Display of Quantitative Information”, 2nd edition)

Lie Factor The degree to which the graphical representation of a particular effect matches the reality For the fuel economy graph Tufte’s requirement: 0.95<Lie Factor<1.05 (E.R. Tufte, “The Visual Display of Quantitative Information”, 2nd edition)

Tufte’s Principles of Graphical Excellence Give the viewer the greatest number of ideas in the shortest time with the least ink in the smallest space. Tell the truth about the data! (E.R. Tufte, “The Visual Display of Quantitative Information”, 2nd edition)

Visualization Methods Visualizing in 1-D, 2-D and 3-D Well-known visualization methods E.g., histogram, box plot, scatter plot Visualizing more dimensions Scatter Plot Matrix Parallel Coordinates Chernoff Faces Stick Figures

1-D (Univariate) Data Representations Histogram Tukey box plot 20 7 5 3 1 Tukey box plot Middle 50% low high Mean 20 Histogram

2-D (Bivariate) Data Scatter plot, … price mileage

3-D Data (Projection) price

3-D Image Requires 3-D blue and red glasses Taken by Mars Rover Spirit, Jan 2004 Requires 3-D blue and red glasses

4+ dimensions: Multiple Views Give each variable its own display 1 A B C D E 1 4 1 8 3 5 2 6 3 4 2 1 3 5 7 2 4 3 4 2 6 3 1 5 2 3 4 A B C D E Problem: does not show correlations

Scatter Plot Matrix Represent each possible pair of variables in their own 2-D scatterplot (car data) Q: Useful for what? A: linear correlations (e.g. horsepower & weight) Q: Misses what? A: multivariate effects

Parallel Coordinates Encode variables along a horizontal line Vertical line specifies values Same dataset in parallel coordinates Dataset in a Cartesian coordinates Invented by Alfred Inselberg while at IBM, 1985

Example: Visualizing Iris Data Iris versicolor Iris setosa Iris virginica

Example: Iris (2) Sepal Length Petal length Petal Width Sepal Width 3.5 5.1 0.2 1.4

Example: Iris (3) 3.5 5.1 1.4 0.2 Each data point is a line. Similar points correspond to similar lines. Lines crossing over correspond to negatively correlated attributes. Problems: order of axes up to ~20 dimensions

Chernoff Faces Encode different variables’ values in characteristics of human face Fun applets: http://www.cs.uchicago.edu/~wiseman/chernoff/ http://hesketh.com/schampeo/projects/Faces/chernoff.html

Stick Figures Two variables mapped to X, Y axes Other variables mapped to limb lengths and angles

Illustration of Stick Figure Use Census data showing age, income, sex, education, etc. Closed figures correspond to women and we can see more of them on the left. Note also a young woman with high income

Summary Many methods Visualization is possible in more than 3-D Aim for graphical excellence (and truth!) Free and open-source software Ggobi Xmdv Others (see www.kdnuggets.com/software/visualization.html)