1 SIMS 247: Information Visualization and Presentation Marti Hearst Feb 18, 2004.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

Multidimensional Detective Alfred Inselberg Presented By Rajiv Gandhi and Girish Kumar.
Parallel Coordinates Representation of multi-dimensional data Discovery Process xmdv Visualization Tool Ganesh K. Panchanathan Christa M. Chewar.
6 Developing Content and Layout Section 6.1 Generate and organize content ideas Write and organize Web text Section 6.2 Identify page dimension guidelines.
Unit 27 Spreadsheet Modelling
Basic Seven Tools of Quality
Seven Quality Tools The Seven Tools
Rolling the Dice: Multidimensional Visual Exploration using Scatterplot Matrix Navigation 1 Niklas Elmqvist | Purdue University Pierre Dragicevic | INRIA.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
Microsoft Excel Working with Excel Lists, Subtotals and Pivot Tables.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Sep 26, 2005.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Sep 28, 2005.
Statistics for Decision Making Descriptive Statistics QM Fall 2003 Instructor: John Seydel, Ph.D.
ENV 2006 CS3.1 Envisioning Information: Case Study 3 Data Exploration with Parallel Coordinates.
1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing.
Visualization of Multidimensional Multivariate Large Dataset Presented by: Zhijian Pan University of Maryland.
Demonstration with Inxight Eureka Chaonan Geng ICS280 Spring 2001.
Ch. 2: The Art of Presenting Data Data in raw form are usually not easy to use for decision making. Some type of organization is needed Table and Graph.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Oct 24, 2005.
1 i247: Information Visualization and Presentation Marti Hearst Interactive Multidimensional Visualization.
Multidimensional Detective Alfred Inselberg Presented By Cassie Thomas.
Chapter 2 Describing Data Sets
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
Table Lens Introduction to the Table Lens concept Table Lens Implementation Projected Usage Scenarios Usage Comparison with Splus Critical Analysis.
ISHIKAWA’S BASIC SEVEN TOOLS OF QUALITY
TABLES, CHARTS, AND GRAPHS. TABLES  A Table is simply a set of numbers from which you could draw a graph or chart.  A table should provide a clear summary.
Sometimes, Tables can be confusing
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
The Stats Unit.
131 Agenda Overview Review Roles Lists Libraries Columns.
Charts and Graphs V
Information Design and Visualization
XP New Perspectives on Microsoft Access 2002 Tutorial 51 Microsoft Access 2002 Tutorial 5 – Enhancing a Table’s Design, and Creating Advanced Queries and.
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
Integrating Graphics, Charts, Tables Into your technical writing documents.
MIS 301 Information Systems in Organizations Dave Salisbury ( )
1 Multidimensional Detective Alfred Inselberg, Multidimensional Graphs Ltd Tel Aviv University, Israel Presented by Yimeng Dou
DIMENSIONAL MODELLING. Overview Clearly understand how the requirements definition determines data design Introduce dimensional modeling and contrast.
Scientific Inquiry and the Scientific Method Understanding the World Around Us.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
Opinion to ponder… “ Since we are a visual species (especially the American culture), because of our educational system. Many of the tools currently used.
An Internet of Things: People, Processes, and Products in the Spotfire Cloud Library Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist.
ICS 105 Research Team INFOZOOM How to Use Infozoom ICS 105 Research Team Winter 2002.
Seven Quality Tools The Seven Tools –Histograms, Pareto Charts, Cause and Effect Diagrams, Run Charts, Scatter Diagrams, Flow Charts, Control Charts.
Common Fractions © Math As A Second Language All Rights Reserved next #6 Taking the Fear out of Math 1 3 ×1 3 Multiplying.
McGraw-Hill/Irwin © 2009 The McGraw-Hill Companies, All Rights Reserved Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus+Context Visualization for Tabular Information Ramana Rao and Stuart.
GrowingKnowing.com © Frequency distribution Given a 1000 rows of data, most people cannot see any useful information, just rows and rows of data.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
How to use Infozoom ICS 105 Research Team Fall 2000.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. ACCESS 2007 M I C R O S O F T ® THE PROFESSIONAL APPROACH S E R I E S Lesson 8 – Adding and.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.
Microsoft® Access Generate forms quickly 1 Modify controls in Layout View 2 Work with form sections 3 Modify controls in Design View 4 Add calculated.
Microsoft Project 2010 ® Tutorial 1: Planning a Project.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Data Warehousing and Decision Support Chapter 25.
Mulidimensional Detective “Multidimensional” : multivariate, many parameters “Detective” : focus is on the “discovery process”, finding patterns and trends.
Visualization Design Principles cs5984: Information Visualization Chris North.
Scientific Inquiry and the Scientific Method
IAT 355 Data + Multivariate Visualization
Sec. 1.1 HW Review Pg. 19 Titanic Data Exploration (Excel File)
Chapter 2 Presenting Data in Tables and Charts
Information Design and Visualization
Chapter 1: Exploring Data
Type your project title here Your name Your teacher’s name Your school
Information Visualization (Part 1)
Good Morning AP Stat! Day #2
Information Visualization
Scatter Diagrams Slide 1 of 4
Presentation transcript:

1 SIMS 247: Information Visualization and Presentation Marti Hearst Feb 18, 2004

2 Today Multidimensional Visualization –Table Lens –Parallel Coordinates Intro paper Example of usage –Attribute Explorer –Comparative Evaluation of Three Systems Design Problem

3 Table Lens Super Spreadsheets –Combines overview + details in an integrated view –Focus + Context allows for compressed representation –Sorting multiple columns allows patterns to emerge –Represents nominal data in a way that allows patterns to appear Demos:

4 Multidimensional Detective A. Inselberg, Multidimensional Detective, Proceedings of IEEE Symposium on Information Visualization (InfoVis '97), 1997.

5 Inselberg’s Principles A. Inselberg, Multidimensional Detective, Proceedings of IEEE Symposium on Information Visualization (InfoVis '97), Do not let the picture scare you 2.Understand your objectives –Use them to obtain visual cues 3.Carefully scrutinize the picture 4.Test your assumptions, especially the “I am really sure of’s” 5.You can’t be unlucky all the time!

6 A Detective Story A. Inselberg, Multidimensional Detective, Proceedings of IEEE Symposium on Information Visualization (InfoVis '97), 1997 The Dataset: –Production data for 473 batches of a VLSI chip –16 process parameters: –X1: The yield: % of produced chips that are useful –X2: The quality of the produced chips (speed) –X3 … X12: 10 types of defects (zero defects shown at top) –X13 … X16: 4 physical parameters The Objective: –Raise the yield (X1) and maintain high quality (X2)

7 Do Not Let the Picture Scare You!!

8 Multidimensional Detective Each line represents the values for one batch of chips This figure shows what happens when only those batches with both high X1 and high X2 are chosen Notice the separation in values at X15 Also, some batches with few X3 defects are not in this high-yield/high-quality group.

9 Multidimensional Detective Now look for batches which have nearly zero defects. –For 9 out of 10 defect categories Most of these have low yields This is surprising because we know from the first diagram that some defects are ok.

10 Go back to first diagram, looking at defect categories. Notice that X6 behaves differently than the rest. Allow two defects, where one defect in X6. This results in the very best batch appearing.

11 Multidimensional Detective Fig 5 and 6 show that high yield batches don’t have non-zero values for defects of type X3 and X6 –Don’t believe your assumptions … Looking now at X15 we see the separation is important –Lower values of this property end up in the better yield batches

12 Automated Analysis A. Inselberg, Automated Knowledge Discovery using Parallel Coordinates, INFOVIS ‘99

13 Integrating Viz into a UI Vizcraft: VizCraft: A Problem-Solving Environment for Aircraft Configuration DesignVizCraft: A Problem-Solving Environment for Aircraft Configuration Design, Goe, Baker, Shaffer, Grossman, Mason, Watson, Haftka, IEEE Computing, pp , 2001 Solving an Analysis Problem –Optimizing design of aircraft Uses of Viz: –Brushing and linking –Color –Multiple views –Parallel Coordinates

14 Good Use of Color in Vizcraft Incorrect Not Sure

15 Doing Analysis in VizCraft Colored according to value in first attribute Shows that 2 nd and N-6 th are correlated with 1 st

16 Doing Analysis in VizCraft Colored according to value in fifth attribute Shows that 5 th and 7 th attributes are correlated

17 Doing Analysis in VizCraft Select only low values of 1 st variable (normalized after the fact) The idea is to learn about the acceptable ranges for the values of the other variables

18 Doing Analysis in VizCraft Color according to one constraint Confusing – using the constraint colors in two ways simultaneously.

19 Comparing 3 Commercial Systems Alfred Kobsa, An Empirical Comparison of Three Commercial Information Visualization Systems, INFOVIS'01.

20 Eureka (Table Lens)

21 Spotfire (IVEE)

22 InfoZoom

23 Infozoom Presents data in three different views: –Overview mode has all attributes in ascending or descending order and independent of each other. Best for data exploration –Wide view shows data set in a table format A column represents a data item Like a conventional spreadsheet –Compressed view packs the data set horizontally to fit the window width. A column represents a data item Zoomed-out view like Table Lens

24 Slide by Alfred Kobsa InfoZoom Overview View

25 InfoZoom Overview View

26 Slide by Alfred Kobsa InfoZoom Overview View (with hierarchy)

27 InfoZoom Wide Table View (columns are meaningful)

28 InfoZoom Wide Table View

29 InfoZoom Compressed Table View

30 Slide by Kunal Garach Multidimensional data: three databases were used Anonymized data from a web based dating service (60 records, 27 variables) Technical data of cars sold in 1970 – 82 (406 records, 10 variables) Data on the concentration of heavy metals in Sweden (2298 records, 14 variables) Datasets for Study

31 Sample Questions Dating database –Do more women than men want their partners to have a higher education? –What proportion of the men live in California? –Do all people who think the bar is a good place to meet a mate also believe in love at first site? Car database –Do heavier cars have more horsepower? –Which manufacturer produced the most cars in 1980? –Is there a relationship between the displacement and acceleration of a vehicle?

32 Slide by Kunal Garach Experiment Design The experimenters generated 26 tasks from all three data sets. 83 participants. Between-subjects design. Each was given one visualization system and all three data sets. Type of visualization system was the independent variable between them. 30 mins were given to solve the tasks of each data set i.e 26 tasks in 90 mins.

33 Slide by Kunal Garach Overall Results Mean task completion times: Infozoom users: 80 secs Spotfire users: 107 secs Eureka users: 110 secs Answer correctness: Infozoom users: 68% Spotfire users: 75% Eureka users: 71% Not a time-error tradeoff Spotfire more accurate on only 6 questions

34 Slide by Kunal Garach Eureka - problems Hidden labels: Labels are vertically aligned, max 20 dimensions Problems with queries involving 3 or more attributes Correlation problems: Some participants had trouble answering questions correctly that involved correlations between two attributes.

35 Slide by Kunal Garach Spotfire - problems Cognitive setup costs: Takes participants considerable time to decide on the right representation and to correctly set the coordinates and parameters. Biased by scatterplot default: Though powerful, many problems cannot be solved (well) with it.

36 Slide by Kunal Garach Infozoom - problems Erroneous Correlations People forget/don’t realize that overview mode has all attributes sorted independent of each other Narrow row height in compressed view Participants did not use row expansion and scatterplot charting function which shows correlations more accurately

37 Geographic Questions Spotfire should have done better on these Which part of the country has the most copper Is there a relationship between the concentration of vanadin and that of zinc? Is there a low-level chrome area that is high in vanadim? Spotfire was only better only for the last question (out of 6 geographic ones)

38 Discussion Many studies of this kind use relatively simple tasks that mirror the strengths of the system Find the one object with the maximum value for a property Count how many of certain attributes there are This study looked at more complex, realistic, and varied questions.

39 Discussion Success of a visualization system depends on many factors: Properties supplied Spotfire doesn’t visualize as many dimensions simultaneously Operations Zooming easy in InfoZoom; allows for drill-down as well Zooming in Eureka causes context to be lost Column view in Eureka makes labels hard to see

40 Assignment 3 Due March 3 Work in pairs (encouraged, not required) Exploratory Data Analysis

41 Design Exercise: How to Visualize EASPD Pure serial periodic data –A single continuous dimension in which each period has equal duration –Example: days of the week Event-anchored serial periodic data –Data has periods with different durations

42 Design Exercise: How to Visualize EASPD Event-anchored serial periodic data –Data has periods with different durations Examples: –Multi-day races (Tour de France) –May want to discern Is a racer improving starts and finishes as the race progresses? Does a racer peak more rapidly after long stages than short ones? –Project-based time tracking How is a worker’s efficiency effected by the pattern and number of projects?

43 Design Exercise: How to Visualize EASPD Event-anchored serial periodic data –Data has periods with different durations Examples: –Eating habits of a foraging animal Eats different foods, different amounts, in different seasons Start/end of season varies based on when the rains begin

44 Next Time: Problem Analysis Example (Carlis & Konstan) Focus + Context Zooming –Standard and Semantic Distortion-based Views