Apache Hadoop and the Emergence of the Enterprise Data Hub. Eli Collins, Chief Technologist. ©2014 Cloudera, Inc. All rights reserved.

Presentation transcript:

1 Apache Hadoop and the Emergence of the Enterprise Data Hub. Eli Collins, Chief Technologist.

2 The Enterprise Data Warehouse. Diagram labels: Flat Files, Operational Store, Data Sources, Staging, Reporting, Analysis, Mining, Operational Store, Metadata, Summary, Facts & Dimensions, EDW, Archive, Data marts.

3 The Enterprise Data Hub. Diagram labels: images, logs, binary, DB dumps.
1. Inexpensive storage
2. Flexible storage
3. Co-located compute
4. Multiple compute engines: MR, Pig/Hive, SQL, Spark, SAS, R, Search, Graph, ...

4 So it’s Like a Data Warehouse?

5 An Analogy

6 What changed? The need? Convenience? Cost?

7 Take and share good photos

8 Data Warehouse vs. Data Hub. Column headings: Enterprise Data Warehouse, Enterprise Data Hub.

9 An Operating System. Diagram labels: APP SCHEDULER FILE SYSTEM MGT SERVICES APP LIB APP 3rd PARTY APP

10 An Enterprise Data Hub. Diagram labels: BATCH PROCESSING; ANALYTIC SQL; SEARCH ENGINE; MACHINE LEARNING; STREAM PROCESSING; 3RD PARTY APPS; WORKLOAD MANAGEMENT; STORAGE FOR ANY TYPE OF DATA: UNIFIED, ELASTIC, RESILIENT, SECURE; DATA MANAGEMENT; SYSTEM MANAGEMENT; Filesystem; Online NoSQL.

11 Data Warehousing with an EDH. Diagram labels: Flat Files, Operational Store, Data Sources, EDH, Reporting, Analysis, Mining, Operational Store, EDW.
1. Stage, transform, archive
2. Reporting, Mining, Analysis
3. Exploratory, Discovery, Search, ML, ...

14 Data Warehousing in Cloudera’s EDH
