VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.

Slides:



Advertisements
Similar presentations
ENV Envisioning Information Lecture 6 – Document Visualization Ken Brodlie
Advertisements

Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.
Module: Cartography and Geovisualization Lesson: Map symbols
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
Graphics Pipeline.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Polaris: A System for Query, Analysis and Visualization of Multi-dimensional Relational Databases Presented by Darren Gates for ICS 280.
Using Trees to Depict a Forest Bin Liu, H. V. Jagadish EECS, University of Michigan, Ann Arbor Presented by Sergey Shepshelvich 1.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
1 i247: Information Visualization and Presentation Marti Hearst Multidimensional Graphing.
An efficient and effective region-based image retrieval framework Reporter: Francis 2005/5/12.
Chapter 5: Query Operations Baeza-Yates, 1999 Modern Information Retrieval.
1 i247: Information Visualization and Presentation Marti Hearst Interactive Multidimensional Visualization.
The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus + Context Visualization for Tabular Information R. Rao and S. K.
Infovis and data george, laura, tjerk.
Novelty Detection and Profile Tracking from Massive Data Jaime Carbonell Eugene Fink Santosh Ananthraman.
Table Lens From papers 1 and 2 By Tichomir Tenev, Ramana Rao, and Stuart K. Card.
WMES3103: INFORMATION RETRIEVAL WEEK 10 : USER INTERFACES AND VISUALIZATION.
Number and Operations Standard Instructional programs from prekindergarten through grade 12 should enable all students to— Understand numbers Understand.
Information Visualization Chapter 1 - Continued. Reference Model Visualization: Mapping from data to visual form Raw DataData Tables Visual Structures.
Overview of Search Engines
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Pixel Visualization of keyword search results in large databases. Jay Koven Fall 2013.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
By LaBRI – INRIA Information Visualization Team. Tulip 2010 – version Tulip is an information visualization framework dedicated to the analysis.
Information Design and Visualization
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
Dynamic Queries –presented by Bhaskar Chatterjee Visual Alternative to SQL for Querying databases Depending on data types and the values decides the input.
Perception-Based Classification (PBC) System Salvador Ledezma April 25, 2002.
WEKA - Explorer (sumber: WEKA Explorer user Guide for Version 3-5-5)
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Space & Order (1) Jing Li The Visual Design and Control of Trellis Display R. A. Becker, W. S. Cleveland, and M. J. Shyu (1996). Source:
June 6, 2014 IAT Interaction ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS +
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
A Focus+Context Technique Based on Hyperbolic Geometry for Visualizing Large Hierarchies. John Lamping, Ramana Rao, and Peter Pirolli Xerox Palo Alto Research.
S TACKING -B ASED V ISUALIZATION OF T RAJECTORY A TTRIBUTE D ATA IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, DECEMBER 2012 Authors: 1.Christian.
Querying Structured Text in an XML Database By Xuemei Luo.
Copyright © 2005, Pearson Education, Inc. Slides from resources for: Designing the User Interface 4th Edition by Ben Shneiderman & Catherine Plaisant Slides.
CHAPTER 8 Color and Texture Mapping © 2008 Cengage Learning EMEA.
Non-Overlapping Aggregated Multivariate Glyphs for Moving Objects Roeland Scheepens, Huub van de Wetering, Jarke J. van Wijk Presented by: David Sheets.
Innovative UI Ideas Marti Hearst SIMS 213, UI Design & Development April 20, 1999.
CS3041 – Final week Today: Searching and Visualization Friday: Software tools –Study guide distributed (in class only) Monday: Social Imps –Study guide.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Daniel A. Keim, Hans-Peter Kriegel Institute for Computer Science, University of Munich 3/23/ VisDB: Database exploration using Multidimensional.
Polaris: A System for Query, Analysis and Visualization of Multi- dimensional Relational Database by Chris Stolte & Pat Hanrahan presenter Andrew Trieu.
© Tan,Steinbach, Kumar Introduction to Data Mining 8/05/ Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan,
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Visualization Four groups Design pattern for information visualization
CLUSTERING HIGH-DIMENSIONAL DATA Elsayed Hemayed Data Mining Course.
LineUp: Visual Analysis of Multi- Attribute Rankings Samuel Gratzl, Alexander Lex, Nils Gehlenborg, Hanspeter Pfister, and Marc Streit.
3/13/2016 Data Mining 1 Lecture 2-1 Data Exploration: Understanding Data Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB)
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A Nonlinear Mapping for Data Structure Analysis John W.
VisDB and Pixel Bar Charts Daniel A. Keim et al. ICS 280 Information Visualization Presented by Jeff Ridenour 4/16/02.
Visualization Design Principles cs5984: Information Visualization Chris North.
1 INTRODUCTION TO COMPUTER GRAPHICS. Computer Graphics The computer is an information processing machine. It is a tool for storing, manipulating and correlating.
Lecture 2-2 Data Exploration: Understanding Data
Information Design and Visualization
cs5984: Information Visualization Chris North
Star Coordinates A Multi-dimensional Visualization Technique with Uniform Treatment of Dimensions.
Information Visualization (Part 1)
Group 9 – Data Mining: Data
CHAPTER 14: Information Visualization
Approximation of Percolation Thresholds
Data Pre-processing Lecture Notes for Chapter 2
Presentation transcript:

VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001

VisDB Scientific and Geographic databases tend to have large amounts of data. Some of the challenges in dealing with these databases are: Mining these databases for useful information is a difficult task due to the sheer volume of data

VisDB Users do not know what they are looking for exactly. With traditional query specification languages, it is not possible to specify vague queries and thus not possible to get approximate results. There is no feedback. Result set may contain too few or too many points.

VisDB Requirements for a good Visualization System to explore large databases: Flexible Query Specification Good Query Feedback Interactive system

VisDB Also, the users should be able to view as many data points as possible to see the patterns and clusters. Display size and resolution are limiting factors Also necessary to display the interdependencies between data attributes, Hotspots(anomalies).

VisDB The goal of the VisDB system is to address the tasks of visualization of the results, and that of incrementally refining the query to provide an effective way to find interesting data properties.

VisDB The approach. Use each pixel of the screen to visualize the results. Provide data items not only fulfilling the result exactly, but also those that match approximately.

VisDB Approximate results are determined by a relevance factor. The relevance factor of a data item is obtained by calculating distances for each selection predicate and combining them. The more the combined distance, the less the relevance of the data point.

Calculating the Relevance Factor Calculate the distance. Simple for Quantitative data. Nominal and Ordinal? Combining distances. User and Query dependent. Weighting Factor for each attribute. Normalizing. Arithmetic Mean for AND and Geometric mean for OR for combining different condition parts. Relevance Factor is the inverse of the Combined distance.

VisDB Basic Visualization Technique Sort the data points according to their relevance, with respect to the query. Assign colors depending on the relevance. Plot the sorted, colored points starting from the center of the screen moving outwards in a rectangular spiral fashion.

Overall Result Plotting

VisDB To relate the visualization of the overall result to the visualization of different selection predicates, separate windows for each selected predicate of the query are created and shown along with the result window. The position of the data items in all the other windows is determined by their position in the overall result window.

Arrangement of Windows for 5D Data

VisDB Mapping two dimensions to the axes It is possible for the user to assign two attributes to the axes and the system will arrange the relevance factors according to the directions of the distance of the data point from the selection predicate. With this method it is possible to provide better feedback to the user. However, we may not be able to use the display efficiently in some cases (I.e. some quadrants may not be used fully, while others are saturated)

2D Representation

VisDB Grouping the dimensions The pixels corresponding to the different dimensions of one data item are placed in one area instead of distributing them in different windows. Will require more pixels per dimension per data item. May provide more useful visualizations for data sets with larger dimensionality.

Grouping multi dimensional data

VisDB Interactive data exploration Users initially specify their queries, using some query language. Inside the VisDB interactive query and visualization interface, it is possible to view the visualizations and perform query modifications. System provides sliders for modifying selection predicates, weight factors and other options.

VisDB

VisDB Conclusion Useful for identifying and isolating clusters, correlations and hotspots in large databases. Good Query specification system. No Zoom or pan for the visualizations.