SOFTVIS 2005: Saint Louis, Missouri, USA Michael Burch, Stephan Diehl, Peter Weißgerber: Visual data mining in software archives Martin Pinzger, Harald.

Slides:



Advertisements
Similar presentations
COSC2007 Data Structures II Chapter 10 Trees I. 2 Topics Terminology.
Advertisements

Multi-Dimensional Data Visualization
Visualizing Relations in Hierarchical Data Presented by Soha Makady 1.
Chapter 3 – Data Visualization © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
22C:19 Discrete Structures Trees Spring 2014 Sukumar Ghosh.
TREES Chapter 6. Trees - Introduction  All previous data organizations we've studied are linear—each element can have only one predecessor and successor.
22C:19 Discrete Math Trees Fall 2011 Sukumar Ghosh.
JOURNAL OF HUMAN–COMPUTER INTERACTION 2010 Ji Soo Yi, Niklas Elmqvist, and Seungyoon Lee.
Planning and Statistics ADVIZOR TRAINING PLANNING AND STATISTICS V.1 1 ADVIZOR Tips.
Software engineering for real-time systems
Binary Decision Diagrams1 BINARY DECISION DIAGRAMS.
Classifier Decision Tree A decision tree classifies data by predicting the label for each record. The first element of the tree is the root node, representing.
Thank you for coming here!. Purpose of Experiment Compare two visualization systems. You will play with one of them.
Content-Based Image Retrieval (CBIR) Student: Mihaela David Professor: Michael Eckmann Most of the database images in this presentation are from the Annotated.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Statistical Analysis of Transaction Dataset Data Visualization Homework 2 Hongli Li.
Let's zoom in on one corner of the coordinate plane
Data Handling.
Derived from:  Line graphs are used to track changes over short and long periods of time.  Line graphs can also be used to compare.
Types of Numbers Whole numbers 0, 1, 2, 3, 4, 5, …
Chapter 3 Data Models.
By LaBRI – INRIA Information Visualization Team. Tulip 2010 – version Tulip is an information visualization framework dedicated to the analysis.
Exploring Personal CoreSpace For DataSpace Management Li Yukun and Xiaofeng Meng WAMDM Lab Renmin University of China.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
REU WEEK IV Malcolm Collins-Sibley Mentor: Shervin Ardeshir.
A hierarchical approach to building contig scaffolds Mihai Pop Dan Kosack Steven L. Salzberg Genome Research 14(1), pp , 2004.
Trees Chapter 15 Data Structures and Problem Solving with C++: Walls and Mirrors, Carrano and Henry, © 2013.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Lesson 2-3 Example Graph the ordered pairs C(2, 5) and D(0, 5). Then connect the points. What do you notice? Step 1 Graph point C. Start at the origin,
Computer Vision Lecture 5. Clustering: Why and How.
VAST 2011 Sebastian Bremm, Tatiana von Landesberger, Martin Heß, Tobias Schreck, Philipp Weil, and Kay Hamacher Interactive-Graphics Systems TU Darmstadt,
Introduction Of Tree. Introduction A tree is a non-linear data structure in which items are arranged in sequence. It is used to represent hierarchical.
MyAnimeList Visualizer By Binh Tran. JSON Data Format {"anime":[{ "status":"finished airing", "status":"finished airing", "type":"TV", "type":"TV", "score":0,
Paper Presentation CST 294 – Topic: Interactive Graphics & Visual Analytics Experiences using Visualization Techniques to Present Requirements, Risks to.
Chapter 12: Design Phase n 12.1 Design and Abstraction n 12.2 Action-Oriented Design n 12.3 Data Flow Analysis n Data Flow Analysis Example n
SEMINAR WEI GUO. Software Visualization in the Large.
Graphs, Charts and Tables Describing Your Data. Frequency Distributions.
Presented By: ROLL No IMTIAZ HUSSAIN048 M.EHSAN ULLAH012 MUHAMMAD IDREES027 HAFIZ ABU BAKKAR096(06)
Analyzing Data with Advanced Visualizations
Project Management Methodology Quality Control Diagrams Seven basic tools of quality.
SOFTWARE DESIGN. INTRODUCTION There are 3 distinct types of activities in design 1.External design 2.Architectural design 3.Detailed design Architectural.
We are learning to write expressions using variables. (8-1)
Data Mining Techniques Clustering. Purpose In clustering analysis, there is no pre-classified data Instead, clustering analysis is a process where a set.
Visualizing Massive Multi-Digraphs James Abello Jeffrey Korn Information Visualization Research Shannon Laboratories, AT&T Labs-Research All the graphs.
CHAPTER 2 SECTION 2-1 PATTERNS AND ITERATIONS SEQUENCE An arrangement of numbers in a particular order. The numbers are called terms and the pattern.
Visualizing Large Dynamic Digraphs Michael Burch.
Chapter 11 Information Visualization
(X,Y) Coordinate System ASAUCE. Objective: Plot points in the 4 quadrants to create lines by making a t-table of certain function. Purpose: Needed skills.
Graphs A graphs is an abstract representation of a set of objects, called vertices or nodes, where some pairs of the objects are connected by links, called.
Communicating with Graphs  A visual display of information of data.
GRAPHING RELATIONSHIPS For each graph, determine the graphing relationship and record it on a white board.
Visualization of Washing Powder Formulation ———seeking the best ingredients of washing powder.
USING GRAPHING SKILLS. Axis While drawing graphs, we have two axis. X-axis: for consistent variables Y-axis: for other variable.
Introduction to UML and Rational Rose UML - Unified Modeling Language Rational Rose 98 - a GUI tool to systematically develop software through the following.
Graphs Another good way to organize this data is with a Graph. Graph – a diagram that shows a relationship between two sets of numbers. So do we have two.
Data Coaching Services Chart Interpretation 1. o Bar o Stacked Bar o Pie o Line o Scatter plot 2.
Systems Analysis and Design in a Changing World, Fourth Edition
1 Berger Jean-Baptiste
Lecture 1 (UNIT -4) TREE SUNIL KUMAR CIT-UPES.
CZ5211 Topics in Computational Biology Lecture 3: Clustering Analysis for Microarray Data I Prof. Chen Yu Zong Tel:
Gedas Adomavicius Jesse Bockstedt
Gephi Gephi is a tool for exploring and understanding graphs. Like Photoshop (but for graphs), the user interacts with the representation, manipulate the.
Data Handling.
Multiple Alignment and Phylogenetic Trees
Remember: Answer questions so it’s clear what you are answering
Data Handling.
CHAPTER 7: Information Visualization
1.7 Addition Rule - Tree Diagrams (1/3)
PolyAnalyst Web Report Training
Presentation transcript:

SOFTVIS 2005: Saint Louis, Missouri, USA Michael Burch, Stephan Diehl, Peter Weißgerber: Visual data mining in software archives Martin Pinzger, Harald Gall, Michael Fischer, Michele Lanza: Visualizing multiple evolution metrics

Data Mining Terminology Association rules: Item changed at the same time (related item) Sequence rules: order of these changes Binary Association Rules: how often 2 items changed together Support: Number of transaction containing the item Confidence: Number of Changes for pair item over single item Outliers: unbalance datasets or abnormal distance

Introduction What is visualize - Binary association rules - n-ary association rules - Sequence rules - distribution, support and confidence –histogram Tool EPOSee: Integrates different view Purpose: detect clusters, inspect rules, zoom and filters

EPOSee Interface Pixelmap Support Graph 3D Bar Chart filter Search keywordColors

Parallel Coordinates View Decision Tree 3D branch view

Rule matrix Item list Rule detailSupport & confidence n-ary association rules

3D bar charts Strong dependecies: High Support & confidence Use color and heights

Visualize binary association rule only Pixelmap File ordering: hierarchical

Stronger related Pixelmap Example File coupling at different directory level

Edges: related items Outliers: blue Clusters: sets of items Support Graph Nodes: Items Red:high

Association Rule Matrix y-axis: Items x-axis: Rules Red, blue & white pixels Support: length Confidence color

Parallel Coordinates View

Visualize Sequence Rules Parallel Coordinates View Nodes Color: Support Values Edges Color: Confidences Cluster on same subdirectory

Parallel Coordinates View Green edges: high confidence But, no edges with high confidence is coming into these 2 nodes

Pinzger, Gall, Fischer, Lanza: Visualizing multiple evolution metrics

Objective: Communicate the evolution of metrics of source code entities and their relationships Kiviat Diagram M1, M2..,M6 = 6 metrics increasing decreasing

Metrics

Logical Coupling Edge: Coupling relationship

A module from Mozilla