Assignment 1 - 10 marks. Starts on 19th February. Ends on 11th March, 2024 11.55 PM.

Added on: 2024-11-14 06:00:04
You can submit after 11th March too, but for every day of delay, 1 mark will be deducted from the total marks scored. Every student should submit the assignment individually.

Set up a Hadoop cluster of your choice. Pseudo-distributed mode will work too, but a real cluster with more than one data node will be even more thrilling for you. For a real cluster, you may choose any cloud solution or the machines that you have at your disposal.

Execute the attached script (mysql-import.sql) in your favourite RDBMS. The script is written for MySQL, but similar commands should work for other RDBMSs. It creates a database called purchases and two tables called metal_purchases and metal_shops. The tables are self-explanatory:

Table 'metal_purchases' has

the shopid from where a purchase has been made

the metal that was purchased

the short form of the metal that was purchased

the weight in grams purchased

Table 'metal_shops' has

the shopid

the name of the shop

Import the tables into HDFS. Use the right tool.
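The brief does not name the import tool. One common choice for copying RDBMS tables into HDFS is Apache Sqoop, and the sketch below assumes it is installed. It only assembles a `sqoop import` command line for each table so the command can be reviewed before it is run. The JDBC URL, user name, password-file path, and HDFS target directory are hypothetical placeholders, not values from the assignment.

```python
import shlex

# Hypothetical connection details; replace with those of your own setup.
JDBC_URL = "jdbc:mysql://localhost:3306/purchases"
DB_USER = "hadoop_user"
PASSWORD_FILE = "/user/hadoop/.mysql-password"  # file holding the DB password
TABLES = ["metal_purchases", "metal_shops"]


def sqoop_import_args(table, target_root="/user/hadoop/purchases"):
    """Build the argument list for importing one table into HDFS."""
    return [
        "sqoop", "import",
        "--connect", JDBC_URL,
        "--username", DB_USER,
        "--password-file", PASSWORD_FILE,
        "--table", table,
        "--target-dir", f"{target_root}/{table}",
        "--fields-terminated-by", ",",
        "--num-mappers", "1",  # a single mapper needs no split column
    ]


if __name__ == "__main__":
    for table in TABLES:
        # Print the command instead of executing it.
        print(shlex.join(sqoop_import_args(table)))
```

Printing the command rather than executing it keeps the sketch safe to run on a machine without Hadoop; the printed lines can then be pasted into a shell on the cluster.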

Write a MapReduce program to print the total weight of all metals purchased in each shop, in descending order of total weight.

Output should be like:

Lalitha Jewellery 568g

PC Jewellers 326g

Obviously, this involves joins.
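One way to approach the join is a reduce-side join: the map phase keys every record by shopid and tags it with its source table, and the reduce phase pairs the shop name with the sum of its purchase weights. The sketch below simulates that flow locally in plain Python rather than on Hadoop, so it illustrates the logic only. It assumes comma-separated rows laid out as `shopid,name` and `shopid,metal,shortform,weight` with integer weights; the actual column order in mysql-import.sql may differ, and the sample rows are made up.

```python
from collections import defaultdict


def map_record(table, line):
    """Map phase: key each record by shopid and tag it with its source table."""
    fields = line.strip().split(",")
    if table == "metal_shops":  # assumed layout: shopid,name
        return fields[0], ("S", fields[1])
    # assumed layout: shopid,metal,shortform,weight (grams, integer)
    return fields[0], ("P", int(fields[3]))


def shuffle(pairs):
    """Stand-in for Hadoop's shuffle: group all values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_shop(values):
    """Reduce phase: join the shop name with its summed purchase weights."""
    name, weights = None, []
    for tag, payload in values:
        if tag == "S":
            name = payload
        else:
            weights.append(payload)
    if name is None or not weights:
        return None  # inner join: skip unmatched shops or purchases
    return name, sum(weights)


def total_weight_per_shop(shop_lines, purchase_lines):
    pairs = [map_record("metal_shops", line) for line in shop_lines]
    pairs += [map_record("metal_purchases", line) for line in purchase_lines]
    reduced = (reduce_shop(values) for values in shuffle(pairs).values())
    joined = [row for row in reduced if row is not None]
    # Final ordering; on a cluster this could be a second job or a single reducer.
    return sorted(joined, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    # Made-up sample rows chosen to match the totals in the example output.
    shops = ["1,Lalitha Jewellery", "2,PC Jewellers"]
    purchases = ["1,Gold,Au,500", "2,Silver,Ag,326", "1,Platinum,Pt,68"]
    for name, grams in total_weight_per_shop(shops, purchases):
        print(f"{name} {grams}g")
```

Because metal_shops is small, a map-side join that loads the shop table into memory in each mapper would be a reasonable alternative; the reduce-side version is shown here because it does not depend on either table fitting in memory.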

Also import the data from the RDBMS into Hive or Pig. Write a query to print the total weight of all metals purchased in each shop, in descending order of total weight.
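The query for this part has the shape join, group, sum, sort. The sketch below runs such a query against an in-memory SQLite database from Python as a local stand-in, so it can be tried without a cluster. The SELECT uses only JOIN, GROUP BY, SUM, and ORDER BY, so the same shape should carry over to HiveQL, though that is not verified here. The column names (`shopid`, `name`, `metal`, `shortform`, `weight`) and the sample rows are assumptions; check them against mysql-import.sql.

```python
import sqlite3

# Assumed column names; adjust to match the tables created by mysql-import.sql.
QUERY = """
SELECT s.name AS shop_name, SUM(p.weight) AS total_weight
FROM metal_purchases p
JOIN metal_shops s ON p.shopid = s.shopid
GROUP BY s.name
ORDER BY total_weight DESC
"""


def run_demo():
    """Create made-up sample tables in memory and run the aggregation query."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE metal_shops (shopid INTEGER, name TEXT);
        CREATE TABLE metal_purchases (
            shopid INTEGER, metal TEXT, shortform TEXT, weight INTEGER
        );
        INSERT INTO metal_shops VALUES
            (1, 'Lalitha Jewellery'), (2, 'PC Jewellers');
        INSERT INTO metal_purchases VALUES
            (1, 'Gold', 'Au', 500),
            (2, 'Silver', 'Ag', 326),
            (1, 'Platinum', 'Pt', 68);
    """)
    rows = conn.execute(QUERY).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for shop_name, total_weight in run_demo():
        print(f"{shop_name} {total_weight}g")
```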

Output:

A compressed file containing the following:

A 5-minute video showing various command executions and the details of your cluster.

Screenshots in the slides/document, as you feel appropriate.

Place the media in a Google Drive folder and grant me permission to view/comment/download.

Record your submission on Taxila with the link to your submission.

Important:

Please submit YOUR output. There is just one submission, so submit the best of your outputs.

Any reproduction/usage of someone else's work leads to disqualification for both parties.

Submit abundant proof to show the work is original

Do not delete the setup until evaluation is complete as you may have to give additional materials during evaluation

Five-minute video only. Hard stop.

Ensure the video recording is of high enough quality to be legible for evaluation and future reference.

Place the zip file at this location. Give permissions to vignesh.karthik@wilp.bits-pilani.ac.in only

  • Uploaded By : Pooja Dhaka
  • Posted on : November 14th, 2024