i

Hadoop Tutorial

Hadoop Architecture

Hadoop offers a flexible, scalable, and reliable distributed processing big data framework for a cluster of systems that has storage capacity and local computing capability by leveraging commodity hardware. Hadoop's topology is master-slave. We have one master node and several slave nodes in this topology. The role of the master node is to assign and handle funds to different slave nodes a job. The real computing is done by the slave nodes. Slave nodes store the actual information while we have metadata on the master. This means that data is stored on the data.

Hadoop Architecture comprises three primary layers They are:-

  • HDFS (Hadoop Distributed File System)

  • Yarn

  • MapReduce