i

Hadoop Tutorial

Running The MapReduce Program & Storing The Data File To HDFS

In this section, we will learn the process of MapReduce with python example with step by step.

Storing the data file to HDFS

Step 1: We can get a similar data set from the storage http://media.sundog-soft.com/hadoop/ml-100k/u.data or can create a file with the same data format.  

 Step2: Start the Hadoop instances with start-all.sh then copy the file from local to HDFS.

Step 3:  The default port number to access Hadoop is 9870. Use the following URL (Localhost: 9870) to get Hadoop services on the browser. Check the data file (u.data) from Browser.