Assignment 1 - 10 marks. Starts on 19th February. Ends on 11th March, 2024 11.55 PM.
You can submit after 11th March too, but 1 mark will be deducted from the total marks scored for every day of delay. Every student should submit the assignment individually.
Set up a Hadoop cluster of your choice. Pseudo-distributed mode will work too, but a real cluster with more than 1 data node will be even more thrilling for you. For a real cluster, you may choose any cloud solution or any machines that you have at your disposal.
Execute the attached script (mysql-import.sql) in your favourite RDBMS. This script is written for MySQL. Similar commands should work for other RDBMSs. This creates a database called purchases and two tables called metal_purchases and metal_shops. The tables are self-explanatory:
Table metal_purchases has
the shopid from where a purchase has been made
the metal that was purchased
the short form of the metal which was purchased
the weight in grams purchased
Table metal_shops has
the shopid
the name of the shop
Import the tables into HDFS. Use the right tool.
Write a MapReduce program to print the total weight of all metals purchased in each shop, in descending order of total weight.
Output should be like:
Lalitha Jewellery 568g
PC Jewellers 326g
Obviously, this involves a join between the two tables.
Also import the data from the RDBMS into Hive or Pig. Write a query to print the total weight of all metals purchased in each shop, in descending order of total weight.
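For orientation only, here is a minimal in-memory sketch in Python of how a reduce-side join on shopid could be structured. It is not a Hadoop job and not the expected solution. It assumes the task means one total per shop, sorted by that total. The sample rows, the column order, the "S"/"P" tags and all function names are hypothetical; the real schema and data come from mysql-import.sql.

```python
from collections import defaultdict

# Hypothetical sample rows, invented for illustration only.
# metal_shops: (shopid, shop name)
SHOPS = [
    (1, "Lalitha Jewellery"),
    (2, "PC Jewellers"),
]
# metal_purchases: (shopid, metal, short form, weight in grams)
PURCHASES = [
    (1, "Gold", "Au", 300),
    (2, "Silver", "Ag", 326),
    (1, "Platinum", "Pt", 268),
]


def map_shop(row):
    """Map step for metal_shops: key by shopid, tag the value as a shop name."""
    shopid, name = row
    yield shopid, ("S", name)


def map_purchase(row):
    """Map step for metal_purchases: key by shopid, tag the value as a weight."""
    shopid, _metal, _short, grams = row
    yield shopid, ("P", grams)


def shuffle(pairs):
    """Group values by key, standing in for Hadoop's shuffle and sort."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_join(shopid, values):
    """Reduce step: attach the shop name to the summed weights of one shopid."""
    name = None
    total = 0
    for tag, payload in values:
        if tag == "S":
            name = payload
        else:
            total += payload
    # Purchases whose shopid has no matching shop row are dropped;
    # a shop with no purchases is emitted with a total of 0.
    if name is not None:
        yield name, total


def total_weight_per_shop(shops, purchases):
    """Run map, shuffle and reduce in memory, then sort by total weight."""
    pairs = []
    for row in shops:
        pairs.extend(map_shop(row))
    for row in purchases:
        pairs.extend(map_purchase(row))
    joined = []
    for shopid, values in shuffle(pairs).items():
        joined.extend(reduce_join(shopid, values))
    # Descending order by total weight.
    return sorted(joined, key=lambda item: item[1], reverse=True)


def format_lines(results):
    """Render each (name, grams) pair in the 'Name 123g' style shown above."""
    return [f"{name} {grams}g" for name, grams in results]


if __name__ == "__main__":
    for line in format_lines(total_weight_per_shop(SHOPS, PURCHASES)):
        print(line)
```

On a real cluster, the final sort across all shops usually needs either a single reducer or a second job, because each reducer only sees its own subset of keys.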
Output:
A compressed file containing the following:
A 5-minute video showing the commands you executed and the details of your cluster.
Screenshots in the slides/document, as you feel appropriate.
Place the media in a Google Drive folder and grant me permissions to view/comment/download.
Record your submission on Taxila with the link to your submission.
Important:
Please submit YOUR output. There is just one submission, so submit the best of your outputs.
Any reproduction/usage of someone else's work leads to disqualification of both parties.
Submit abundant proof to show that the work is original.
Do not delete the setup until evaluation is complete, as you may have to provide additional materials during evaluation.
Five-minute video only. Hard stop.
Ensure the video recording is of high enough quality to be legible for evaluation and future reference.
Place the zip file at this location. Give permissions to vignesh.karthik@wilp.bits-pilani.ac.in only