1 i247: Information Visualization and Presentation Marti Hearst Interactive Multidimensional Visualization.

Slides:



Advertisements
Similar presentations
Multi-Dimensional Data Visualization
Advertisements

Multidimensional Detective Alfred Inselberg Presented By Rajiv Gandhi and Girish Kumar.
Parallel Coordinates Representation of multi-dimensional data Discovery Process xmdv Visualization Tool Ganesh K. Panchanathan Christa M. Chewar.
Inpainting Assigment – Tips and Hints Outline how to design a good test plan selection of dimensions to test along selection of values for each dimension.
Sep 23, 2013 IAT Data ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Feb 18, 2004.
Locating in fingerprint space: wireless indoor localization with little human intervention. Proceedings of the 18th annual international conference on.
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
More MR Fingerprinting
Mapping Nominal Values to Numbers for Effective Visualization Presented by Matthew O. Ward Geraldine Rosario, Elke Rundensteiner, David Brown, Matthew.
Visual Analytics Research at WPI Dr. Matthew Ward and Dr. Elke Rundensteiner Computer Science Department.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Sep 26, 2005.
PaperLens Understanding Research Trends in Conferences using PaperLens Work by Bongshin Lee, Mary Czerwinski, George Robertson, and Benjamin Bederson Presented.
HCI 530 : Seminar (HCI) Damian Schofield. HCI 530: Seminar (HCI) Transforms –Two Dimensional –Three Dimensional The Graphics Pipeline.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
SIMS 247 Information Visualization and Presentation Prof. Marti Hearst September 14, 2000.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Sep 28, 2005.
1 SIMS 247: Information Visualization and Presentation Marti Hearst Sept 21, 2005.
ENV 2006 CS3.1 Envisioning Information: Case Study 3 Data Exploration with Parallel Coordinates.
1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing.
Visualization of Multidimensional Multivariate Large Dataset Presented by: Zhijian Pan University of Maryland.
Marti Hearst SIMS 247 SIMS 247 Lecture 5 Brushing and Linking February 3, 1998.
Multidimensional Detective Alfred Inselberg Presented By Cassie Thomas.
Infovis and data george, laura, tjerk.
Evaluating the Quality of Image Synthesis and Analysis Techniques Matthew O. Ward Computer Science Department Worcester Polytechnic Institute.
User interface design Designing effective interfaces for software systems Objectives To suggest some general design principles for user interface design.
Cliff Rhyne and Jerry Fu June 5, 2007 Parallel Image Segmenter CSE 262 Spring 2007 Project Final Presentation.
Selective Dynamic Manipulation of Visualizations Chuah, Roth, Mattis, Kolojejchick.
December 2, 2014Computer Vision Lecture 21: Image Understanding 1 Today’s topic is.. Image Understanding.
Mutlidimensional Detective Alfred Inselberg Streeable, Progressive, Mutlidimensional Scaling Matt Williams, Tamara Munzner Rylan Cottrell.
Multidimensional Data Analysis IS 247 Information Visualization and Presentation 22 February 2002 James Reffell Moryma Aydelott Jean-Anne Fitzpatrick.
Circle Drawing algo..
Basic Concepts The Unified Modeling Language (UML) SYSC System Analysis and Design.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Chapter 3 Data Exploration and Dimension Reduction 1.
Dynamic Queries –presented by Bhaskar Chatterjee Visual Alternative to SQL for Querying databases Depending on data types and the values decides the input.
` Tangible Interaction with the R Software Environment Using the Meuse Dataset Rachel Bradford, Landon Rogge, Dr. Brygg Ullmer, Dr. Christopher White `
Basic concepts in ordination
Sample size vs. Error A tutorial By Bill Thomas, Colby-Sawyer College.
Space & Order (1) Jing Li The Visual Design and Control of Trellis Display R. A. Becker, W. S. Cleveland, and M. J. Shyu (1996). Source:
Visualization Blaz Zupan Faculty of Computer & Info Science University of Ljubljana, Slovenia.
1 Multidimensional Detective Alfred Inselberg, Multidimensional Graphs Ltd Tel Aviv University, Israel Presented by Yimeng Dou
Opinion to ponder… “ Since we are a visual species (especially the American culture), because of our educational system. Many of the tools currently used.
Geographic Information Science
The Semivariogram in Remote Sensing: An Introduction P. J. Curran, Remote Sensing of Environment 24: (1988). Presented by Dahl Winters Geog 577,
Software Architecture
1 SY DE 542 Navigation and Organization Prototyping Basics Feb 28, 2005 R. Chow
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
‘Externalizing Abstract Mathematical Models’ Lisa Tweedie,Robert Spence, Huw Dawkes and Hua Su Department of Electrical Engineering, Imperial College Of.
VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Daniel A. Keim, Hans-Peter Kriegel Institute for Computer Science, University of Munich 3/23/ VisDB: Database exploration using Multidimensional.
Geo479/579: Geostatistics Ch7. Spatial Continuity.
Multiplication of Common Fractions © Math As A Second Language All Rights Reserved next #6 Taking the Fear out of Math 1 3 ×1 3 Applying.
Digital Image Processing Image Enhancement in Spatial Domain
Multi-Dimensional Data Visualization cs5984: Information Visualization Chris North.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A Nonlinear Mapping for Data Structure Analysis John W.
VisDB and Pixel Bar Charts Daniel A. Keim et al. ICS 280 Information Visualization Presented by Jeff Ridenour 4/16/02.
IAT Scent, or Interaction & Navigation ______________________________________________________________________________________ SCHOOL OF INTERACTIVE.
Mulidimensional Detective “Multidimensional” : multivariate, many parameters “Detective” : focus is on the “discovery process”, finding patterns and trends.
Visualization Design Principles cs5984: Information Visualization Chris North.
Singular Value Decomposition and its applications
Spreadsheet Engineering
Advanced Computer Systems
IAT 355 Data + Multivariate Visualization
Multidimensional Data Analysis
What Is Good Clustering?
Getting started – Example 1
Presentation transcript:

1 i247: Information Visualization and Presentation Marti Hearst Interactive Multidimensional Visualization

2 Interactive Techniques Ask what-if questions spontaneously while working through a problem Control the exploration of subsets of data from different viewpoints

3 Adapted from Refell, Aydelott, & Fitzpatrick Problem Statement How to effectively present more than 3 dimensions of information in a visual display with 2 (to 3) dimensions? How to effectively visualize “inherently abstract” data? How to effectively visualize very large, often complex data sets? How to effectively display results – when you don’t know what those results will be?

4 Adapted from Refell, Aydelott, & Fitzpatrick Another Statement of Goals Visualization of multidimensional data Without loss of information With: –Minimal complexity –Any number of dimensions –Variables treated uniformly –Objects remain recognizable across transformations –Easy / intuitive conveyance of information –Mathematically / algorithmically rigorous (Adapted from Inselberg)

5 Adapted from Refell, Aydelott, & Fitzpatrick Characteristics “Data-dense displays” (large number of dimensions and/or values) –Often combine color with position / proximity representing relevance “distance” –Often provide multiple views Build on concepts from previous weeks: –Retinal properties of marks –Gestalt concepts, e.g., grouping –Direct manipulation / interactive queries –Incremental construction of queries –Dynamic feedback Some require specialized input devices or unique gesture vocabulary

6 Adapted from Refell, Aydelott, & Fitzpatrick Examples Warning: These visualizations are not easy to grasp at “first glance”! DON’T PANIC

7 Alternative Network Viz (Legal cases) Network Visualization by Semantic Substrates, Shneiderman & Aris, IEEE TVCG

8 PaperLens Understanding research trends in conferences using PaperLens Lee et al., CHI'05 extended abstracts

9 Highlighting and Brushing: Parallel Coordinates by Inselberg Visual Data Detective –Free implementation: Parvis by Ledermen

10 Multidimensional Detective A. Inselberg, Multidimensional Detective, Proceedings of IEEE Symposium on Information Visualization (InfoVis '97), Do Not Let the Picture Scare You!!

11 Inselberg’s Principles A. Inselberg, Multidimensional Detective, Proceedings of IEEE Symposium on Information Visualization (InfoVis '97), Do not let the picture scare you 2.Understand your objectives –Use them to obtain visual cues 3.Carefully scrutinize the picture 4.Test your assumptions, especially the “I am really sure of’s” 5.You can’t be unlucky all the time!

12 A Detective Story A. Inselberg, Multidimensional Detective, Proceedings of IEEE Symposium on Information Visualization (InfoVis '97), 1997 The Dataset: –Production data for 473 batches of a VLSI chip –16 process parameters: –X1: The yield: % of produced chips that are useful –X2: The quality of the produced chips (speed) –X3 … X12: 10 types of defects (zero defects shown at top) –X13 … X16: 4 physical parameters The Objective: –Raise the yield (X1) and maintain high quality (X2)

13 Multidimensional Detective Each line represents the values for one batch of chips This figure shows what happens when only those batches with both high X1 and high X2 are chosen Notice the separation in values at X15 Also, some batches with few X3 defects are not in this high-yield/high-quality group.

14 Multidimensional Detective Now look for batches which have nearly zero defects. –For 9 out of 10 defect categories Most of these have low yields This is surprising because we know from the first diagram that some defects are ok.

15 Go back to first diagram, looking at defect categories. Notice that X6 behaves differently than the rest. Allow two defects, where one defect in X6. This results in the very best batch appearing.

16 Multidimensional Detective Fig 5 and 6 show that high yield batches don’t have non-zero values for defects of type X3 and X6 –Don’t believe your assumptions … Looking now at X15 we see the separation is important –Lower values of this property end up in the better yield batches

17 Automated Analysis A. Inselberg, Automated Knowledge Discovery using Parallel Coordinates, INFOVIS ‘99

18 Influence Explorer / Prosection Matrix (Tweedie et. al.) Abstract one-way mathematical models: multiple parameters, multiple variables. Data for visualization comes from sampling Visualization of non-obvious underlying structures in models Color coding, attention to near misses

19 Influence Explorer / Prosection Matrix (Tweedie et. al.) Use the sliders to set performance limits. Color coding gives immediate feedback as to effects of changes—both for ‘perfect’ scores and for near-misses. Can also highlight individual values across histograms, show parallel coordinates. Interactive querying!

20 Influence Explorer / Prosection Matrix (Tweedie et. al.) In this view we can shift parameter ranges in addition to performance limits. Red is still a perfect score—blacks miss one parameter limit, blues one or two performance limits. Does this color scheme make sense? Would another work better?

21 Influence Explorer / Prosection Matrix (Tweedie et. al.) Prosection matrix (on right) = scatter plots for pairs of parameters. Color coding matches histograms. Fitting tolerance region (yellow box) to acceptability (red region) gives high yield for minimum cost Or: Make the red bit as big as possible! This aspect closely tuned to task at hand: manufacturing and similar.

22 Adapted from Refell, Aydelott, & Fitzpatrick VisDB (Keim & Kriegel) Mapping entries from relational database to pixels on the screen Include “approximate” answers, with placement and color-coding based on relevance Data points laid out in: –Rectangular spiral –Or, with axes representing positive/negative values for two selected dimensions –Or, group dimensions together (easier to interpret than very large number of dimensions)

23 Adapted from Refell, Aydelott, & Fitzpatrick from

24 Adapted from Refell, Aydelott, & Fitzpatrick VisDB - Relevance Relevance calculation based on “distance” of each variable from query specification Distance calculation depends on data type –Numeric: mathematical –String: character/substring matching, lexical, phonetic?, syntactic? –Nominal: predefined distance matrix –Possibly other “domain-specific” distance metrics

25 Adapted from Refell, Aydelott, & Fitzpatrick VisDB – Screen Resolution Stated screen resolution seems reasonable by today’s standards: 19 inch display, 1024x1280 pixels = 1.3 million data points However, controls take up a lot of space!

26 Adapted from Refell, Aydelott, & Fitzpatrick from

27 Adapted from Refell, Aydelott, & Fitzpatrick Limitations and Issues Complexity Abstract data –These visualizations are oriented toward abstract data –For “naturally” two or three-dimensional data (things that vary over time or space, e.g., geographic data) visualizations which exploit those properties may exist and be more effective