One Billion Rows Per Second: Analytics for the Digital Media Markets XLDB October 19, 2011 MICHAEL DRISCOLL CO-FOUNDER &


1 One Billion Rows Per Second: Analytics for the Digital Media Markets XLDB October 19, 2011 MICHAEL DRISCOLL CO-FOUNDER & CTO @medriscoll

2 Taming the Inferno of the Online Ad Markets: billions of microtransactions per day; dozens of publisher, advertiser, & audience attributes

3 Goal: Fast Analytics Over 100s of Terabytes

4 Goal: Fast Analytics Over 100s of Terabytes. Diagram labels: ingestion, database, dashboard; data crunched in minutes, queries in seconds.

5 Solution 1: MPP Database. Diagram labels: ingestion, database, dashboard; Hadoop, MPP Database; data crunched in minutes, queries in minutes.

6 Solution 2: HBase. Diagram labels: ingestion, database, dashboard; Hadoop; data crunched in hours, queries in seconds.

7 Solution 3: Do It Ourselves: Druid. Diagram labels: ingestion, database, dashboard; Hadoop, Druid; data crunched in minutes, queries in seconds.

8 Four Principles of Druid’s Performance at Scale: SUMMARIZE, DISTRIBUTE, PARALLELIZE, STORE IN-MEMORY. 100x smaller vs raw data, 100x throughput vs a single node (with 100 cores), 100x faster vs disk = 10^6 factor speed-up. Druid can filter and aggregate over 1 billion rows per second on a 50-core cluster, or 20m rows per core per second.
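
The figures on slide 8 can be checked with a few lines of Python, and the same block gives a toy picture of what "summarize, then filter and aggregate in parallel over in-memory data" might look like. This is a rough sketch under stated assumptions, not Druid's implementation: the event tuples, the function names (`summarize`, `partition`, `filter_and_sum`, `parallel_total`), and the choice to multiply the three 100x factors are all illustrative. The multiplication is inferred from the slide's "= 10^6". Threads are used only to show the partition-and-merge shape; in CPython they do not give real CPU parallelism.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Back-of-the-envelope check of the slide's numbers.
SUMMARIZE_FACTOR = 100   # "100x smaller vs raw data"
THROUGHPUT_FACTOR = 100  # "100x throughput vs a single node (with 100 cores)"
MEMORY_FACTOR = 100      # "100x faster vs disk"
combined_speedup = SUMMARIZE_FACTOR * THROUGHPUT_FACTOR * MEMORY_FACTOR

ROWS_PER_SECOND = 1_000_000_000  # "1 billion rows per second"
CORES = 50                       # "50-core cluster"
rows_per_core_per_second = ROWS_PER_SECOND // CORES

# Hypothetical raw events: (hour, publisher, impressions).
raw_events = [
    ("2011-10-19T00", "pub_a", 1),
    ("2011-10-19T00", "pub_a", 1),
    ("2011-10-19T00", "pub_b", 1),
    ("2011-10-19T01", "pub_a", 1),
    ("2011-10-19T01", "pub_a", 1),
    ("2011-10-19T01", "pub_b", 1),
]


def summarize(events):
    """Roll raw events up into one in-memory row per (hour, publisher)."""
    rollup = Counter()
    for hour, publisher, impressions in events:
        rollup[(hour, publisher)] += impressions
    return rollup


def partition(rows, n):
    """Split rows into n round-robin partitions, standing in for nodes or cores."""
    rows = list(rows)
    return [rows[i::n] for i in range(n)]


def filter_and_sum(rows, publisher):
    """Filter one partition by publisher and sum its impression counts."""
    return sum(count for (_hour, pub), count in rows if pub == publisher)


def parallel_total(rollup, publisher, workers=2):
    """Aggregate each partition separately, then merge the partial sums."""
    parts = partition(rollup.items(), workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = pool.map(lambda part: filter_and_sum(part, publisher), parts)
        return sum(partials)


if __name__ == "__main__":
    print(combined_speedup, rows_per_core_per_second)
    print(parallel_total(summarize(raw_events), "pub_a"))
```

In this toy data the rollup shrinks six raw events to four summary rows; the slide's 100x reduction would depend on how many raw events share the same attribute values in real data.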

9 Consequences of Druid: Faster Queries photo credit: tonylanciabeta http://www.flickr.com/photos/tonysphotos/3305157904/sizes/o/in/photostream/

10 Consequences of Druid: Fresher Data photo credit: Lars P. http://www.flickr.com/photos/lars_p/4911238308/sizes/o/in/photostream/

11 Consequences of Druid: Scalable in the Cloud photo credit: MonkeyAt Large http://www.flickr.com/photos/monkeyatlarge/16645379/sizes/l/in/photostream/

12 One Billion Rows Per Second: Analytics for the Digital Media Markets QUESTIONS? CONTACT ME AT MIKE@METAMARKETSGROUP.COM MICHAEL DRISCOLL CO-FOUNDER & CTO @medriscoll

