We're Open
support@instaresearch.co.uk
+44 7340 9595 39
+44 20 3239 6980

Big Data Final project

Information



  • Post Date 2020-05-11T05:10:53+00:00
  • Post Category Assignment Queries

No Plagiarism Guarantee - 100% Custom Written

Order Details

Big Data Final project

In this final project, you are required to build the Hadoop cluster with more than one instance on any linux flavored platform, filter the input data by letter which has assigned to you while reading the input source, filter the input data by letter which has assigned to you while reading the input source to gain experience in building applications using Hadoop Open Source Platform, Map reduce programming framework, Hive Query Language, Pig – Latin script.

Final Project

Analysis on Million songs

Analysis on Million Songs Data set:

Summer 2016 – Big Data Final project

The goal of this project assignment is to gain experience in building applications using

  • Hadoop Open Source Platform
  • Map reduce programming framework
  • Hive Query Language
  • Pig– Latin script

Please build the Hadoop cluster with more than one instance on any linux flavored platform.

–          Download the million songs meta data from the below repository and load the same into HDFS.

One good source for download ur data set:

  • https://drive.google.com/open?id=0B4qvMVe-iB-eWGI1X29FNDYwVXc

Please filter the input data by letter which has assigned to you while reading the input source and use the corresponding data set for the project work.

Eg: If the letter ‘K’ has assigned to you then consider the input data where 2nd column value starts with letter ‘k’.

In this assignment my letter is ‘k’.

The first row has corresponding column names in the spreadsheet.

Submit JPS Output, ifconfig output, Cluster Details & Total number of files count in HDFS.

Once the data is loaded successfully into HDFS, please submit the below analytical metrics usingHive, Map reduce or Pig latin.

  1. Analyze the Duration of Songs for each year.

Submit the calculated results data and also corresponding bar graph / pie chart.

  1. Analyze on no of songs which ending with same last digit of their digital ID.

Submit the calculated results data and also corresponding bar graph / pie chart.

  1. Analyze on number of artists by the first letter of their name OR

Analyze the familiarity of song for each year.

Submit the calculated results data and also corresponding bar graph / pie chart.

  1. Analyze on range of tempo or loudness for each year.

Submit the calculated results data and also corresponding bar graph / pie chart.

  1. Analyze on songs with same key value.

Submit the calculated results data.

 


Price: £ 99

100% Plagiarism Free & Custom Written, Tailored to your instructions