WHAT IS BIG DATA?

Introduction:

 

This assignment provides a summative assessment of your understanding of Big Data Systems and related technologies. The three mini-tasks to be completed have the following aims:

 

  • Introduce Big Data in the context of a given organisation (See Task 1)
  • Understand the problems of working with Big Data and describe technologies that specialise in catering for Big Data (See Task 2)
  • Use a software package that is designed for Big Data Systems to perform a simple analytical task (See Task 3)

 

Task 1 – Introduce Big Data

 

In the context of Amazon: Amazon is an online book retailer that has expanded its retail offering far beyond books over the last decade (www.amazon.com).

 

  • Define Big Data in terms of the four V’s. Describe how each V could apply to Amazon. (E.g. ‘Volume’ is one of the V’s. What data would Amazon likely be capturing to qualify?)
  • Give an example from Amazon to illustrate your points for each of the four V’s discussed above. (12 Marks)

 

Task 2 – Big Data Technologies

 

Hadoop is a technological framework that enables processing of large datasets at the scale of Big Data. Your task is to research and understand Hadoop. Your description should include:

 

  • What is Hadoop?
  • What are the technological challenges of working with Big Data?
  • How does the Hadoop framework overcome the above-mentioned challenges? (10 Marks)
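
As a starting point for the research, the MapReduce programming model that Hadoop implements can be sketched in plain Python. This is only a single-process illustration of the map, shuffle and reduce steps, not Hadoop's own API; the function names and the sample order lines are hypothetical and chosen purely for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    """Emit a (word, 1) pair for each word in one input record."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle_phase(pairs):
    """Group values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle and reduce in sequence on an in-memory list."""
    mapped = chain.from_iterable(map_phase(r) for r in records)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


if __name__ == "__main__":
    # Made-up order lines, used only to exercise the sketch.
    orders = ["book kindle book", "kindle charger"]
    print(run_job(orders))
```

In a real cluster the map and reduce steps would run in parallel on many machines over data split into blocks, which is the part this sketch leaves out.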

 

Task 3 – Big Data Analytics with Orange Software Package

 

The dataset we will be using is contained in the file Titanic.tab, which is available on CloudDeakin under Resources->Assignment 3->Titanic.tab

 

This is by no means a Big Data set. To simplify the analytical task (as promised in lectures), we will settle for a smaller and simpler dataset. Your task is to:

 

  • Analyse the full dataset using Orange and try to gain insight from it.
  • Take a random sample of 200 records and perform the same analysis. State your findings. Are your conclusions similar to what you have found previously? Explain why or why not.
  • Under what circumstances would it be permissible to use a random sample from a full dataset for analysis? Under what circumstances would it raise red flags? (11.3 Marks)
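
The task itself is to be carried out in Orange, but the comparison between the full dataset and a 200-record random sample can also be sketched in plain Python to show what is being compared. The column names `status` and `survived`, the `"yes"` outcome value and the records below are assumptions made up for illustration; they are not taken from Titanic.tab.

```python
import random
from collections import Counter


def survival_rate_by_group(records, group_key="status", outcome_key="survived"):
    """Return {group: share of records in that group whose outcome is 'yes'}."""
    totals = Counter()
    survivors = Counter()
    for row in records:
        totals[row[group_key]] += 1
        if row[outcome_key] == "yes":
            survivors[row[group_key]] += 1
    return {group: survivors[group] / totals[group] for group in totals}


def compare_full_and_sample(records, sample_size=200, seed=0):
    """Compute the same statistic on all records and on a random sample."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = rng.sample(records, sample_size)  # sampling without replacement
    return survival_rate_by_group(records), survival_rate_by_group(sample)


if __name__ == "__main__":
    # Synthetic records (hypothetical column names and values).
    records = (
        [{"status": "first", "survived": "yes"}] * 300
        + [{"status": "first", "survived": "no"}] * 200
        + [{"status": "third", "survived": "yes"}] * 100
        + [{"status": "third", "survived": "no"}] * 400
    )
    full_rates, sample_rates = compare_full_and_sample(records)
    print("full:  ", full_rates)
    print("sample:", sample_rates)
```

Changing `seed` or `sample_size` shows how much the sample estimates move around the full-data values, which is one way to think about the last question above.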

Details

  • Title: WHAT IS BIG DATA?
  • Post Date: 2024-08-28T18:23:38+00:00
  • Category: Assignment