$25
Setting up Hadoop and running the WordCount example
This assignment aims at letting you setup Hadoop on your KVM on Qubit. After the installation of Hadoop, you need to run the WordCount example on your KVM.
Source Code and Datasets:
The java source code is given in the file “WordCount.java”. You need to run it on two datasets:
1) test.txt
2) peterpan.txt
Report:
Please write a report to explain the key steps. You may take screenshots. You may also explain the commands. You may include the following key steps.
1) Setup Hadoop
2) Setup HDFS and upload the datasets “test.txt” and “peterpan.txt” into HDFS.
3) Create a project in VS Code, import the “WordCount.java”, configure the project, and export the “WordCount.jar” file.
4) Open a terminal, and run the “WordCount.jar” file on “test.txt” and “peterpan.txt” respectively.