1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager.

Slides:



Advertisements
Similar presentations
1 TreeJuxtaposer side by side comparison of evolutionary trees.
Advertisements

1 TreeJuxtaposer: Scalable Tree Comparison using Focus+Context with Guaranteed Visibility Tamara Munzner Univ. British Columbia François Guimbretière Univ.
SequenceJuxtaposer: Fluid Navigation For Large-Scale Sequence Comparison in Context James Slack *, Kristian Hildebrand *†, Tamara Munzner * and Katherine.
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
H3: Laying Out Large Directed Graphs in 3D Hyperbolic Space Tamara Munzner, Stanford University.
Scalable Visualization with Accordion Drawing Tamara Munzner University of British Columbia Department of Computer Science joint work with James Slack,
Scalability with many lights II (row-column sampling, visibity clustering) Miloš Hašan.
1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager.
1 Information Visualization at UBC Tamara Munzner University of British Columbia.
MDSteer: Steerable and Progressive Multidimensional Scaling Matt Williams and Tamara Munzner University of British Columbia Imager Lab.
University of British Columbia Department of Computer Science Tamara Munzner Interactive Visualization of Evolutionary Trees and Gene Sequences February.
1 Information Visualization with Accordion Drawing Tamara Munzner University of British Columbia.
University of British Columbia Department of Computer Science Tamara Munzner Visualization: From Pixels to Insight March 3, 2007 UBC CS TechTrek.
1 Visualization Process and Collaboration Tamara Munzner Department of Computer Science University of British Columbia
1 Presented by Jean-Daniel Fekete. 2  Motivation  Mélange [Elmqvist 2008] Multiple Focus Regions.
Presented by Marlene Shehadeh Advanced Topics in Computer Vision ( ) Winter
Composite Rectilinear Deformation for Stretch and Squish Navigation James Slack, Tamara Munzner University of British Columbia November 1, 2006 Imager.
Aki Hecht Seminar in Databases (236826) January 2009
“Occlusion” Prepared by: Shreya Rawal 1. Extending Distortion Viewing from 2D to 3D S. Carpendale, D. J. Cowperthwaite and F. David Fracchia (1997) 2.
1 Model View Controller. 2 Outline Review Definitions of MVC Why do we need it? Administiriva Changing the display Event flow Dragging at interactive.
Live Re-orderable Accordion Drawing (LiveRAC) Peter McLachlan, Tamara Munzner Eleftherios Koutsofios, Stephen North AT&T Research Symposium August, 2007.
Hidden Surface Elimination Wen-Chieh (Steve) Lin Institute of Multimedia Engineering I-Chen Lin’ CG Slides, Rich Riesenfeld’s CG Slides, Shirley, Fundamentals.
Large Scale Radial Graph Drawing Cody Robson. Graph Exploration Radial Layout: Constrains graph to rings User can shift focus to avoid panning.
1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager.
Information Retrieval: Human-Computer Interfaces and Information Access Process.
DEPARTMENT OF COMPUTER SCIENCE SOFTWARE ENGINEERING, GRAPHICS, AND VISUALIZATION RESEARCH GROUP 15th International Conference on Information Visualisation.
1 Biology Design Studies comparative functional genomics –Pathline comparative genomics (synteny) –MizBee gene expression + interaction network –Cerebral.
SpaceTree: Supporting Exploration in Large Node Link Tree, Design Evolution and Empirical Evaluation Catherine Plaisant, Jesse Grosjean, Benjamin B.Bederson.
Partitioned Rendering Infrastructure with Stable Accordion Navigation James Slack MSc Thesis Presentation April 4, 2005.
1 PRISAD: A Partitioned Rendering Infrastructure for Scalable Accordion Drawing James Slack, Kristian Hildebrand, Tamara Munzner University of British.
10/11/2001CS 638, Fall 2001 Today Kd-trees BSP Trees.
Ensemble Learning (2), Tree and Forest
Evaluation of Viewport Size and Curvature of Large, High-Resolution Displays Lauren Shupp, Robert Ball, John Booker, Beth Yost, Chris North Virginia Polytechnic.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
Spatial Data Structures Jason Goffeney, 4/26/2006 from Real Time Rendering.
Interaction Design – Part II by A.Surasit Samaisut Copyrights : All Rights Reserved.
VAST 2011 Sebastian Bremm, Tatiana von Landesberger, Martin Heß, Tobias Schreck, Philipp Weil, and Kay Hamacher Interactive-Graphics Systems TU Darmstadt,
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
IAT 814 Trees Chapter 3.2 of Spence ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS +
Real-Time Rendering SPEEDING UP RENDERING Lecture 04 Marina Gavrilova.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Collaborative Visual Computing Lab Department of Computer Science University of Cape Town Graphics Topics in VR By Shaun Nirenstein.
3D Viewers Two main uses: –Detector/event exploration – interactivity priority (15fps min). –Generate presentation material (still/movie renders) – quality.
- Laboratoire d'InfoRmatique en Image et Systèmes d'information
112/5/ :54 Graphics II Image Based Rendering Session 11.
Daniel A. Keim, Hans-Peter Kriegel Institute for Computer Science, University of Munich 3/23/ VisDB: Database exploration using Multidimensional.
Pad++: A Zooming Graphical Interface for Exploring Alternate Interface Physics Presented By: Daniel Loewus-Deitch.
Pad++: A Zoomable Graphical User Interface. Motivations View information at multiple scales –Semantic zooming Tap into natural spatial ways of thinking.
PMR: Point to Mesh Rendering, A Feature-Based Approach Tamal K. Dey and James Hudson
Coherent Hierarchical Culling: Hardware Occlusion Queries Made Useful Jiri Bittner 1, Michael Wimmer 1, Harald Piringer 2, Werner Purgathofer 1 1 Vienna.
Visualization Four groups Design pattern for information visualization
Photo-realistic Rendering and Global Illumination in Computer Graphics Spring 2012 Hybrid Algorithms K. H. Ko School of Mechatronics Gwangju Institute.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
3D Object Representations 2011, Fall. Introduction What is CG?  Imaging : Representing 2D images  Modeling : Representing 3D objects  Rendering : Constructing.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
CHC ++: Coherent Hierarchical Culling Revisited Oliver Mattausch, Jiří Bittner, Michael Wimmer Institute of Computer Graphics and Algorithms Vienna University.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
3D Object Representations 2009, Fall. Introduction What is CG?  Imaging : Representing 2D images  Modeling : Representing 3D objects  Rendering : Constructing.
ITree: Exploring Time-Varying Data using Indexable Tree Yi Gu and Chaoli Wang Michigan Technological University Presented at IEEE Pacific Visualization.
Clustering [Idea only, Chapter 10.1, 10.2, 10.4].
An Evaluation of Pan & Zoom and Rubber Sheet Navigation with and without an Overview Dmitry Nekrasovski, Adam Bodnar, Joanna McGrenere, François Guimbretière,
Visualizing data from Galaxy
Real-Time Soft Shadows with Adaptive Light Source Sampling
STBVH: A Spatial-Temporal BVH for Efficient Multi-Segment Motion Blur
3D Object Representations
Hierarchies (Trees) Definition Examples
Orthogonal Range Searching and Kd-Trees
Scalable Visual Comparison of Biological Trees and Sequences
Parallel Feature Identification and Elimination from a CFD Dataset
Presentation transcript:

1 Scalable Visual Comparison of Biological Trees and Sequences Tamara Munzner University of British Columbia Department of Computer Science Imager

2 Outline Accordion Drawing –information visualization technique TreeJuxtaposer –tree comparison SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework

3 Accordion Drawing rubber-sheet navigation –stretch out part of surface, the rest squishes –borders nailed down –Focus+Context technique integrated overview, details –old idea [Sarkar et al 93], [Robertson et al 91] guaranteed visibility –marks always visible –important for scalability –new idea [Munzner et al 03]

4 4 Guaranteed Visibility marks are always visible easy with small datasets

5 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible

6 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible –outside the window AD solution: constrained navigation

7 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible –outside the window AD solution: constrained navigation –underneath other marks AD solution: avoid 3D

8 Guaranteed Visibility Challenges hard with larger datasets reasons a mark could be invisible –outside the window AD solution: constrained navigation –underneath other marks AD solution: avoid 3D –smaller than a pixel AD solution: smart culling

9 Guaranteed Visibility: Small Items Naïve culling may not draw all marked items GVno GV Guaranteed visibility of marks No guaranteed visibility

10 Guaranteed Visibility: Small Items Naïve culling may not draw all marked items GVno GV Guaranteed visibility of marks No guaranteed visibility

11 Outline Accordion Drawing –information visualization technique TreeJuxtaposer –tree comparison SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework

12 Phylogenetic/Evolutionary Tree M Meegaskumbura et al., Science 298:379 (2002)

13 Common Dataset Size Today M Meegaskumbura et al., Science 298:379 (2002)

14 Future Goal: 10M node Tree of Life David Hillis, Science 300:1687 (2003) Plants Protists Fungi Animals You are here

15 Paper Comparison: Multiple Trees focus context

16 TreeJuxtaposer side by side comparison of evolutionary trees [video] –video/software downloadable from

17 TJ Contributions first interactive tree comparison system –automatic structural difference computation –guaranteed visibility of marked areas scalable to large datasets –250,000 to 500,000 total nodes –all preprocessing subquadratic –all realtime rendering sublinear scalable to large displays (4000 x 2000) introduced –guaranteed visibility, accordion drawing

18 Structural Comparison rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile

19 Matching Leaf Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile

20 Matching Leaf Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile

21 Matching Leaf Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile

22 Matching Interior Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile

23 Matching Interior Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile

24 Matching Interior Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish mammal lungfish salamander frog bird turtle snake lizard crocodile

25 Matching Interior Nodes rayfinned fish lungfish salamander frog mammal turtle bird crocodile lizard snake rayfinned fish bird lungfish salamander frog mammal turtle snake lizard crocodile ?

26 Previous Work tree comparison –RF distance [Robinson and Foulds 81] –perfect node matching [Day 85] –creation/deletion [Chi and Card 99] –leaves only [Graham and Kennedy 01]

27 Similarity Score: S( m,n ) T1T1 T2T2 A B C D E F A C B D F E m n

28 Best Corresponding Node –computable in O(n log 2 n) –linked highlighting T1T1 T2T2 A B C D E F A C B D F E m BCN(m) = n 1/3 2/3 2/ /2

29 –Matches intuition Marking Structural Differences T1T1 T2T2 A B C D E F A C B D F E m n

30 Outline Accordion Drawing –information visualization technique TreeJuxtaposer –tree comparison SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework

31 Genomic Sequences multiple aligned sequences of DNA now commonly browsed with web apps –zoom and pan with abrupt jumps –previous work Ensembl [Hubbard 02], UCSC Genome Browser [Kent 02], NCBI [Wheeler 02] investigate benefits of accordion drawing –showing focus areas in context –smooth transitions between states –guaranteed visibility for globally visible landmarks

32 SequenceJuxtaposer comparing multiple aligned gene sequences provides searching, difference calculation [video] –video/software downloadable from

33 Searching search for motifs –protein/codon search –regular expressions supported results marked with guaranteed visibility

34 Differences explore differences between aligned pairs –slider controls difference threshold in realtime results marked with guaranteed visibility

35 SJ Contributions fluid tree comparison system –showing multiple focus areas in context –guaranteed visibility of marked areas thresholded differences, search results scalable to large datasets –2M nucleotides –all realtime rendering sublinear

36 Outline Accordion Drawing –information visualization technique TreeJuxtaposer –tree comparison SequenceJuxtaposer –sequence comparison PRISAD –generic accordion drawing framework

37 Goals of PRISAD generic AD infrastructure –tree and sequence applications PRITree is TreeJuxtaposer using PRISAD PRISeq is SequenceJuxtaposer using PRISAD efficiency –faster rendering: minimize overdrawing –smaller memory footprint correctness –rendering with no gaps: eliminate overculling

38 PRISAD Navigation generic navigation infrastructure –application independent –uses deformable grid –split lines Grid lines define object boundaries –horizontal and vertical separate Independently movable

39 Split line hierarchy data structure supports navigation, picking, drawing two interpretations –linear ordering –hierarchical subdivision ABCDEF

40 PRISAD Architecture world-space discretization preprocessing initializing data structures placing geometry screen-space rendering frame updating analyzing navigation state drawing geometry

41 World-space Discretization interplay between infrastructure and application

42 application-specific layout of dataset –non-overlapping objects initialize PRISAD split line hierarchies –objects aligned by split lines Laying Out & Initializing AACC ATTT

application-specific layout of dataset –non-overlapping objects initialize PRISAD split line hierarchies –objects aligned by split lines Laying Out & Initializing AACC ATTT

44 Gridding each geometric object assigned its four encompassing split line boundaries AACC ATTT

45 Mapping PRITree mapping initializes leaf references –bidirectional O(1) reference between leaves and split lines Map Split line Leaf index

46 Screen-space Rendering control flow to draw each frame

47 Partitioning partition object set into bite-sized ranges –using current split line screen-space positions required for every frame –subdivision stops if region smaller than 1 pixel or if range contains only 1 object [1,2] [3,4] [5] { [1,2], [3,4], [5] } Queue of ranges

48 Seeding reordering range queue result from partition –marked regions get priority in queue drawn first to provide landmarks [1,2] [3,4] [5] { [1,2], [3,4], [5] } { [3,4], [5], [1,2] } ordered queue

49 Drawing Single Range each enqueued object range drawn according to application geometry –selection for trees –aggregation for sequences

50 PRITree Range Drawing select suitable leaf in each range draw path from leaf to the root –ascent-based tree drawing –efficiency: minimize overdrawing only draw one path per range [3,4] { [3,4], [5], [1,2] }

51 Rendering Dense Regions –correctness: eliminate overculling bad leaf choices would result in misleading gaps –efficiency: maximize partition size to reduce rendering too much reduction would result in gaps Intended renderingPartition size too big

52 Rendering Dense Regions –correctness: eliminate overculling bad leaf choices would result in misleading gaps –efficiency: maximize partition size to reduce rendering too much reduction would result in gaps Intended renderingPartition size too big

53 PRITree Skeleton guaranteed visibility of marked subtrees during progressive rendering first frame: one path per marked group full scene: entire marked subtrees

54 PRISeq Range Drawing: Aggregation aggregate range to select box color for each sequence –random select to break ties AACC ATTT [1,4] A T TTT C T

55 PRISeq Range Drawing collect identical nucleotides in column –form single box to represent identical objects attach to split line hierarchy cache lazy evaluation draw vertical column A T T T A { A:[1,1], T:[2,3] } 1 2 3

56 PRISAD Performance PRITree vs. TreeJuxtaposer (TJ) synthetic and real datasets –complete binary trees lowest branching factor regular structure –star trees highest possible branching factor

57 InfoVis Contest Benchmarks two 190K node trees directly compare TJ and PT

58 OpenDirectory benchmarks two 480K node trees too large for TJ, PT results only

59 PRITree Rendering Time Performance TreeJuxtaposer renders all nodes for star trees branching factor k leads to O(k) performance

60 PRITree Rendering Time Performance TreeJuxtaposer renders all nodes for star trees branching factor k leads to O(k) performance

61 PRITree Rendering Time Performance InfoVis 2003 Contest dataset 5x rendering speedup

62 PRITree Rendering Time Performance a closer look at the fastest rendering times

63 PRITree Rendering Time Performance

64 PRITree handles 4 million nodes in under 0.4 seconds TreeJuxtaposer takes twice as long to render 1 million nodes Detailed Rendering Time Performance

65 Detailed Rendering Time Performance TreeJuxtaposer valley from overculling

66 Memory Performance linear memory usage for both applications 4-5x more efficient for synthetic datasets

67 Memory Performance 1GB difference for InfoVis contest comparison marked range storage changes improve scalability

68 Performance Comparison PRITree vs. TreeJuxtaposer –detailed benchmarks against identical TJ functionality 5x faster, 8x smaller footprint handles over 4M node trees PRISeq vs. SequenceJuxtaposer –15x faster rendering, 20x smaller memory size –44 species * 17K nucleotides = 770K items –6400 species * 6400 nucleotides = 40M items

69 Future Work future work –editing and annotating datasets –PRISAD support for application specific actions logging, replay, undo, other user actions –develop process or template for building applications

70 PRISAD Contributions infrastructure for efficient, correct, and generic accordion drawing efficient and correct rendering –screen-space partitioning tightly bounds overdrawing and eliminates overculling first generic AD infrastructure –PRITree renders 5x faster than TJ –PRISeq renders 20x larger datasets than SJ

71 Joint Work TreeJuxtaposer –François Guimbretière, Serdar Taşiran, Li Zhang, Yunhong Zhou SIGGRAPH 2003 SequenceJuxtaposer –James Slack, Kristian Hildebrand, Katherine St.John German Conference on Bioinformatics 2004 TJC/TJC-Q –Dale Beermann, Greg Humphreys EuroVis 2005 PRISAD –James Slack, Kristian Hildebrand IEEE InfoVis Symposium 2005 Information Visualization journal, to appear

72 Open Source software freely available from –SequenceJuxtaposer olduvai.sf.net/sj –TreeJuxtaposer olduvai.sf.net/tj –requires Java and OpenGL JOGL bindings for TJ, GL4Java for SJ (JOGL coming soon) papers, talks, videos also from

73 Other Projects Focus+Context evaluation –high-level user studies of systems –low-level visual search and memory graph drawing dimensionality reduction