Presentation is loading. Please wait.

Presentation is loading. Please wait.

TECHNIQUES FOR VISUALIZING MASSIVE DATA SETS Leilani Battle, Mike Stonebraker.

Similar presentations


Presentation on theme: "TECHNIQUES FOR VISUALIZING MASSIVE DATA SETS Leilani Battle, Mike Stonebraker."— Presentation transcript:

1 TECHNIQUES FOR VISUALIZING MASSIVE DATA SETS Leilani Battle, Mike Stonebraker

2 Context Visualization System Database query result

3 Problem Performance Vis systems dont scale well for big data Or are turning into databases Over-plotting Makes visualizations unreadable Waste of time/resources

4 Solution: Resolution Reduction Visualization System Database Resolution Reduction Layer query queryplan query queryplan result modified query reduced result

5 ScalaR Scalable vis system for data exploration Web front-end Uses SciDB (www.scidb.org)www.scidb.org Visualizes query results Performs Resolution Reduction

6 Demo of ScalaR

7 Array Browser Collaboration with: Brown: Justin DeBrabant, Stan Zdonik, Ugur Cetintemel Stanford: Zhicheng Liu, Jeff Heer Google Maps-style exploration experience Fetches subsets of the data (aka data tiles)

8 Array Browser Example

9 Array Browser Architecture

10 Demo of Array Browser

11 Future Work: Prefetching Goal: Reduce user-wait time by prefetching tiles Cache tiles in the tile buffer Need algorithms to decide what to pre-fetch

12 User Behavior Predictor (Seer) P P P P Learn common query sequences from user traces

13 Statistical Analysis Predictor P P P P P P Look for statistical similarities in tiles Try to guess whats important based on patterns

14 Using Multiple Predictors Run multiple predictors (or experts) in parallel Compare predictions to users actual behavior Use predictions from best performing expert May change over time based on users goals

15 Other Challenges Lots if interesting problems left to address Best eviction policy for the tile buffer? How to share data between multiple users? More predictors?

16 Questions?

17

18 Gemini Sagittarius DogsCats

19

20 Prefetching Experts User behavior predictor (Seer) Learn common query sequences from user traces Stats analysis predictor Look for statistical similarities in tiles Try to guess whats important based on patterns


Download ppt "TECHNIQUES FOR VISUALIZING MASSIVE DATA SETS Leilani Battle, Mike Stonebraker."

Similar presentations


Ads by Google