Hadoop Ali Sharza Khan High Performance Computing 1.

1 Hadoop – Ali Sharza Khan – High Performance Computing

2 Table of Contents – Hadoop – Where did Hadoop come from? – What problems can Hadoop solve? – Where does Hadoop apply? – How is Hadoop architected? – Two main parts of Hadoop – Conclusion

3 Hadoop: What is Hadoop? – Open source project – Processes large data sets in parallel

4 Where did Hadoop come from? – Google (Hadoop grew out of the ideas in Google's published MapReduce and Google File System papers) – Yahoo, Facebook, Twitter and LinkedIn are actively contributing to Hadoop

5 What problems can Hadoop solve? – Problems where you have a lot of data – Running analytics that are deep and computationally intensive

6 Where does Hadoop apply? – Search engines – Finance – Online retail – Government – Media and entertainment – Research institutions and other markets

7 How is Hadoop architected? – Every server has 2, 4 or 8 CPUs. – Each server operates on its own small piece of the data. – Hadoop clusters at Yahoo span 25,000 servers and store 25 petabytes of application data. – The largest single cluster has 3,500 servers.
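
The idea on this slide, that each server works only on its own piece of the data and the partial results are combined afterwards, can be sketched in a few lines of Python. This is a minimal illustration, not Hadoop's API: the names `split_into_pieces`, `process_piece` and `run_cluster` are hypothetical, and threads stand in for the servers of a real cluster.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_pieces(records, num_servers):
    """Deal the records round-robin into one piece per simulated server."""
    return [records[i::num_servers] for i in range(num_servers)]


def process_piece(piece):
    """Work done locally by one server: here, it sums only its own records."""
    return sum(piece)


def run_cluster(records, num_servers=4):
    """Process every piece independently, then combine the partial results."""
    pieces = split_into_pieces(records, num_servers)
    # Assumption: threads play the role of servers. A real cluster would run
    # each piece on a separate machine that stores that piece locally.
    with ThreadPoolExecutor(max_workers=num_servers) as pool:
        partial_results = list(pool.map(process_piece, pieces))
    return sum(partial_results)


if __name__ == "__main__":
    print(run_cluster(list(range(1, 101))))  # prints 5050
```

Because no piece depends on any other, adding servers shrinks the amount of data each one has to handle. The Yahoo figures above work out to roughly 1 terabyte per server on average (25 petabytes across 25,000 servers).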

8 Cloudera CEO Interview: http://www.youtube.com/watch?v=qNP4_ICDeqE

9 Two main parts of Hadoop – HDFS (Hadoop Distributed File System) – MapReduce framework: – Map phase – Reduce phase – JobTracker (the master) – TaskTracker (the slave)

10 MapReduce Framework
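
To make the map and reduce phases concrete, here is a small single-process sketch of the classic word-count job in plain Python. It is an assumption-laden illustration rather than Hadoop code: `map_phase`, `shuffle`, `reduce_phase` and `word_count` are hypothetical names, and the grouping step that Hadoop performs between the two phases is simulated with a dictionary. In a real cluster the JobTracker would hand these map and reduce tasks to TaskTrackers on many machines.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: collapse all the counts for one word into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over the input lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["Hadoop stores data", "Hadoop processes data"]))
```

Each call to `map_phase` looks at one line only, and each call to `reduce_phase` looks at one word only, which is what lets a framework spread both phases across many servers.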

11 Conclusion – Why is Hadoop able to deal with lots of data? – Why is Hadoop able to answer complicated computational questions?

