HDFS MapReduce Hadoop
Hadoop Distributed File System (HDFS) An open-source implementation of GFS has many similarities with distributed file systems. However, comes differences with it. HDFS is highly fault-tolerant and is designed to be deployed on low-cost hardware. HDFS provides high throughput access to application data and is suitable for applications that have large data sets.
HDFS MapReduce Hadoop NameNode Sec NDTaskTrackerJobTracker DataNode
How?