Download presentation
Presentation is loading. Please wait.
Published byChester Jefferson Modified over 8 years ago
3
3
7
Hadoop? Cloud data warehousing? Machine learning? NoSQL?
8
Ecosystems around open source projects are very active Basis in commodity hardware Scale out, and cloud Change in economics of computing power Change in economics of storage
18
mapper Input reducer Input Output Input K1K1 K2K2 K3K3 Output
26
Impala + Kafka
32
Store raw data, centrally in HDFS Use different processing engines for different analyses Data Lake
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.