Hadoop Concepts

What is Hadoop: Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity hardware. Essentially, it accomplishes two tasks: massive data storage and faster processing.

Distributed Reliable File System

  •  Apache Hadoop Distributed File System (HDFS)
  •  Inspired by Google File System
  •  Single Logical View of distributed Linux File Systems

Data is typically replicated 3 times

  •  Fault Tolerant
  •  Better I/O
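
The fault-tolerance point above can be illustrated with a small sketch. It assumes a simple round-robin placement, which is a hypothetical simplification and not the placement policy HDFS actually uses; the node names, block IDs, and both function names are made up for illustration. It only shows why three copies on distinct nodes let a block survive the loss of two nodes.

```python
def place_replicas(block_ids, nodes, replication=3):
    """Assign each block to `replication` distinct nodes (hypothetical round-robin policy)."""
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    placement = {}
    for start, block in enumerate(block_ids):
        # Consecutive positions modulo len(nodes) are distinct because
        # replication <= len(nodes).
        placement[block] = [nodes[(start + i) % len(nodes)] for i in range(replication)]
    return placement


def readable_blocks(placement, failed_nodes):
    """Return the blocks that still have at least one replica on a live node."""
    return [
        block
        for block, holders in placement.items()
        if any(node not in failed_nodes for node in holders)
    ]


# Illustrative cluster of four nodes holding three blocks.
nodes = ["node1", "node2", "node3", "node4"]
placement = place_replicas(["blk_0", "blk_1", "blk_2"], nodes)

# Simulate losing two nodes at once; every block keeps a live copy.
survivors = readable_blocks(placement, failed_nodes={"node1", "node2"})
print(placement)
print(survivors)
```

Having several copies also means a read can be served from whichever replica is closest or least busy, which is one way to interpret the "Better I/O" bullet.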

Distributed Compute Framework & Resource Manager

  •  Apache MapReduce and YARN
  •  Inspired by Google MapReduce
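
To make the MapReduce programming model more concrete, here is a minimal single-process sketch of a word count in plain Python. It does not use the Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative assumptions, and a real job would run the map and reduce steps in parallel across the cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on an in-memory input."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["big data big clusters", "data on clusters"])
print(counts)
```

The three functions mirror the phases of a MapReduce job: each input record is mapped independently, intermediate pairs are grouped by key, and each group is reduced on its own.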

HDFS Blocks

Files are broken into chunks
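
As a rough illustration of how a file is cut into blocks, the sketch below computes block boundaries for a file of a given size. The 128 MB block size is an assumption (a common default in recent Hadoop versions; the text above does not state a size), and `split_into_blocks` is a hypothetical helper, not an HDFS function.

```python
MB = 1024 * 1024


def split_into_blocks(file_size, block_size=128 * MB):
    """Return a list of (offset, length) pairs covering a file of `file_size` bytes."""
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size must be > 0")
    blocks = []
    offset = 0
    while offset < file_size:
        # The last block only holds the bytes that remain.
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


# A 300 MB file under the assumed 128 MB block size.
blocks = split_into_blocks(300 * MB)
for index, (offset, length) in enumerate(blocks):
    print(f"block {index}: offset={offset // MB} MB, length={length // MB} MB")
```

In this sketch the final block is smaller than the others because it only covers the remainder of the file.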

