Graphs & Charts: The Art of Data Visualisation Alasdair Rutherford SSPC9C6University of StirlingSpring 2016.

Slides:



Advertisements
Similar presentations
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
Advertisements

Unit 2: Presenting of data
Chapter 3 Graphic Methods for Describing Data. 2 Basic Terms  A frequency distribution for categorical data is a table that displays the possible categories.
Statistics 100 Lecture Set 6. Re-cap Last day, looked at a variety of plots For categorical variables, most useful plots were bar charts and pie charts.
Visual Displays of Data Chapter 3. Uses of Graphs >Positive and negative uses Can accurately and succinctly present information Can reveal/conceal complicated.
Statistics Unit 6.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Descriptive Statistics Summarizing data using graphs.
1.1 Displaying and Describing Categorical & Quantitative Data.
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
Business 90: Business Statistics
Statistics Lecture 2. Last class began Chapter 1 (Section 1.1) Introduced main types of data: Quantitative and Qualitative (or Categorical) Discussed.
Introductory Statistics: Exploring the World through Data, 1e
Presenting information
Charts and Graphs V
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Objectives (BPS chapter 1)
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
3. Data Presentation Graphs & Charts.
Data Presentation.
Welcome to Math 6 Statistics: Use Graphs to Show Data Histograms.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
The introduction to SPSS Ⅱ.Tables and Graphs for one variable ---Descriptive Statistics & Graphs.
Statistical Reasoning for everyday life
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Describing Data.
Are You Smarter Than a 5 th Grader?. 1,000,000 5th Grade Topic 15th Grade Topic 24th Grade Topic 34th Grade Topic 43rd Grade Topic 53rd Grade Topic 62nd.
StatisticsStatistics Graphic distributions. What is Statistics? Statistics is a collection of methods for planning experiments, obtaining data, and then.
AP STATISTIC LESSON 1-1 EXPLORING DATA DISPLAYING DISTRIBUTION WITH GRAPHS.
1 Chapter 3 Looking at Data: Distributions Introduction 3.1 Displaying Distributions with Graphs Chapter Three Looking At Data: Distributions.
CHAPTER 20 Representing Quantitative Data. Why ‘re’present your numbers? Few people can extract meaning from arrays of numbers. Summarising them – whether.
Visual Displays of Data Chapter 3. Uses of Graphs Positive and negative uses – Can accurately and succinctly present information – Can reveal/conceal.
Descriptive Statistics Summarizing data using graphs.
Unit 4 Statistical Analysis Data Representations.
Dr. Serhat Eren Other Uses for Bar Charts Bar charts are used to display data for different categories where the data are some kind of quantitative.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Describing Data: Graphical Methods ● So far we have been concerned with moving from asking a research question to collecting good quality empirical data.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
Unit 42 : Spreadsheet Modelling
Presenting Data in Charts, Graphs and Tables #1-8-1.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Histograms, Frequency Polygons, and Ogives
Statistics with TI-Nspire™ Technology Module E Lesson 1: Elementary concepts.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Displaying Data  Data: Categorical and Numerical  Dot Plots  Stem and Leaf Plots  Back-to-Back Stem and Leaf Plots  Grouped Frequency Tables  Histograms.
Developing the Skills of Academic numeracy and IT.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Charts Overview PowerPoint Prepared by Alfred P.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Graphs with SPSS Aravinda Guntupalli. Bar charts  Bar Charts are used for graphical representation of Nominal and Ordinal data  Height of the bar is.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Chapter 1.1 Displaying Distributions with graphs.
Descriptive Statistics
Chapter 2: Methods for Describing Data Sets
Unit 4 Statistical Analysis Data Representations
CHAPTER 1 Exploring Data
How could data be used in an EPQ?
Statistical Reasoning
Welcome!.
Unit 4: Describing Data After 10 long weeks, we have finally finished Unit 3: Linear & Exponential Functions. Now on to Unit 4 which will last 5 weeks.
Welcome to AP Statistics
Ten things about Descriptive Statistics
Experimental Design Experiments Observational Studies
Presentation transcript:

Graphs & Charts: The Art of Data Visualisation Alasdair Rutherford SSPC9C6University of StirlingSpring 2016

SPSPC9C6 University of Stirling Spring 2016 Introduction What is data visualisation, and why do we need it? Graphs and Charts: Some common forms of presenting data Developments in Data Visualisation Choosing the right visualisation “The Chart Never Lies”: How visualisation can mislead

SPSPC9C6 University of Stirling Spring 2016 Presenting Data Data visualisation is not just about pretty pictures; it is about selecting the best way to present complex data. It can be used to summarise data, to identify potential patterns, to present your data, or to tell a story.

SPSPC9C6 University of Stirling Spring 2016 Florence Nightingale & Data Data visualisation has a long history – one of the pioneers was Florence Nightingale. Watch: X0OR1_Vfc

SPSPC9C6 University of Stirling Spring 2016 The right chart at the right time Commonly used graphs and charts: – Bar charts & histograms – Line graphs – Pie charts – Scatterplots Generated using: – Spread sheet, such as Microsoft Excel – Statistical software, such as SPSS or STATA – Online tools

SPSPC9C6 University of Stirling Spring 2016 Bar Charts Bar charts can be used to present categorical data. Categories are shown along the x-axis, while data such as averages, percentages or frequencies are represented on the y-axis. Best when you want to show proportional as well as absolute differences.

SPSPC9C6 University of Stirling Spring 2015 Example: Bar Chart of Twitter Activity Days of week categories on the x-axis Percentage of activity on the y-axis SOURCE: Sysomos (2009)

SPSPC9C6 University of Stirling Spring 2015 SOURCE: Sysomos (2009) Example: Bar Chart showing percentages Age categories on the x-axis Height of the bars shows percentage Categories are ordered

SPSPC9C6 University of Stirling Spring 2016 Line Graphs Line graphs are often used to represent continuous data. They can also be used for ordered categorical data that approximates a continuous variable. Best when you want to show changes across the values of the X variable.

SPSPC9C6 University of Stirling Spring 2015 Example: Line Graph Continuous data on the x- axis The shape of the line gives a sense of the trend SOURCE: Sysomos (2009)

SPSPC9C6 University of Stirling Spring 2015 Example: Line graph More complex information captured Line shows the distribution SOURCE: Sysomos (2009)

SPSPC9C6 University of Stirling Spring 2016 Pie Charts Pie charts are an accessible way to represent percentages and proportions. They require categorical data, which may or may not be ordered. They also require that categories are mutually exclusive i.e. the percentages sum to 100%.

SPSPC9C6 University of Stirling Spring 2015 Example: Pie Chart of User Ages The area of the segment represents the percentage Easier to see group combinations e.g. two thirds under 25 SOURCE: Sysomos (2009)

SPSPC9C6 University of Stirling Spring 2015 BUT, Pie Charts can also be misleading … No. Westminster seats in the 2010 General Election by Political Party Would Labour + Lib Dem > Conservatives?

SPSPC9C6 University of Stirling Spring 2015 BUT, Pie Charts can also be misleading …

SPSPC9C6 University of Stirling Spring 2016 Scatterplots Scatterplots allow the comparison of two metric (interval or ratio) variables. They are particularly useful for identifying patterns of association ('correlations') between the two variables, and for spotting 'outliers' (cases with unusually high or low values) within data sets.

SPSPC9C6 University of Stirling Spring 2015 HUSBAND WIFE Example: Scatter plot of Spouse Age Each dot represents one observation e.g. couple Outliers can be identified The correlation seems quite clear SOURCE: BHPS (2008)

SPSPC9C6 University of Stirling Spring 2016 Box (and whisker) plots A box plot is a graphical display, based on quartiles, which help us to picture the range of values in a variable (the distribution of scores). You can use them to explore the distribution of one continuous variable or alternatively you can ask for scores to be broken down for different groups (e.g. age groups).

SPSPC9C6 University of Stirling Spring 2015 Example: Box Plot Contains a lot of information in one graph Helps to understand the centre, range and shape of data The statistics here will be introduced later in this course

SPSPC9C6 University of Stirling Spring 2015 Example: Box plot by Country Variable of interest Plots by category e.g. country Here the data for the USA has a much bigger spread than the data for France

SPSPC9C6 University of Stirling Spring 2016 Developments in visualisation Improvement in technology and design, alongside developments in data accessibility, have lead to the emergence of a number of tools to make presenting data more accessible. This combines statistical analysis with good design principles, using data to tell a story.

SPSPC9C6 University of Stirling Spring 2015 Example: Area plot of CO 2 Emissions Areas are easily compared Numerical data also included Reference point Alternative to a bar chart SOURCE:

SPSPC9C6 University of Stirling Spring 2015 Example: Population Proportions Alternative to a pie chart Also conveys information on gender balance Each ‘person’ represents 1% SOURCE:

SPSPC9C6 University of Stirling Spring 2016 How do you choose a visualisation? What sort of data are you using? What characteristics of the data do you need to communicate? Who is the audience?

SPSPC9C6 University of Stirling Spring 2016 Lies, Damn Lies and Data Visualisation Like the numbers themselves, data visualisation can be used to mislead, either accidentally or intentionally. You need to be particularly careful when using percentages, and also pay attention to scales.

SPSPC9C6 University of Stirling Spring 2016 Top 3 tips for BAD data visualisation i.e. what NOT to do! 1)Cram everything you can into the chart – readability is overrated 2)Choose the scale to hide the inconvenient truth 3)Emphasise the trivial and ignore the important

SPSPC9C6 University of Stirling Spring 2015 Example: Difficult to Read Line Graph SOURCE: Wainer (1984) “How to Display Data Badly” The American Statistician, Vol. 38, No. 2

SPSPC9C6 University of Stirling Spring 2016 Top 3 tips for BAD data visualisation i.e. what NOT to do! 1)Cram everything you can into the chart – readability is overrated 2)Choose the scale to hide the inconvenient truth 3)Emphasise the trivial and ignore the important

SPSPC9C6 University of Stirling Spring 2015 Example: Misleading Bar Chart SOURCE:

SPSPC9C6 University of Stirling Spring 2016 Top 3 tips for BAD data visualisation i.e. what NOT to do! 1)Cram everything you can into the chart – readability is overrated 2)Choose the scale to hide the inconvenient truth 3)Emphasise the trivial and ignore the important

SPSPC9C6 University of Stirling Spring 2015 Example: Concealing the data SOURCE: Wainer (1984) “How to Display Data Badly” The American Statistician, Vol. 38, No. 2

SPSPC9C6 University of Stirling Spring 2016 Further Developments: Global health Increasingly data visualisation is being used in multimedia, animating data in order to capture more complexity and represent dynamics. This video shows changing global health and wealth over time. Watch:

SPSPC9C6 University of Stirling Spring 2015 Summary There are a wide range of data visualisations available Selecting the right graph or chart can help you describe your data Care must be taken in choosing the appropriate graph or chart depending on your data and audience Beware of misleading with your visualisations, either intentionally or accidentally