Assessment 2 - Case Study - Spotify Top 2000
Assessment 2 - Case Study - Spotify Top 2000
Length: 2000 Words
Due: Friday 10th May, 11:59PM
Submission: Online (Canvas)
Weight: 35%
Instructions: Data for this assessment can be found in the Excel file Dutch Top 2000 Spotify Songs.xlsx
This assessment covers the topics up to and including Hypothesis Testing.
You will be expected to manipulate the data using pivot tables, to create appropriate visuals (tables and graphs) to demonstrate outcomes and to conduct a range of statistical analyses (particularly hypothesis testing) to answer questions.
Should you decide to reference additional material there is no referencing standard specifically required so long as you are consistent.
Complete all calculations in the Excel file and save it as <Your_Name> Assessment 2.xlsx. This will be the file you upload to Canvas.
Separate worksheets have been identified for each question in the Excel File. Please complete each question on the nominated worksheet. It is recommended that you keep an unadjusted, original data set on the first worksheet Dutch Top 2000 which you can refer back to. Some questions may require you to adjust the data set, which will not be needed in later questions, so keep a clean copy on the first worksheet
In addition to the Excel file, you will be required to create a solution document where you will copy your final excel output for each question (Table, graph, formula or excel calculation solution) as well as the written explanation and conclusion for each question. You may use the original Assessment 2 document incorporating your solutions under each question, or you may create a new document, however ensure that each question is clearly identified. This document should be created in Word, but once you have completed the assessment please save it as a PDF using the title <Your_Name> Assessment 2.pdf
For those answers which require written responses these should be short and succinct, with reference to relevant data and provided in non technical language
When you copy your tables, graphs etc across to the solution document please include them in the section relevant to answering a particular question. Do not put them in appendices. Only copy over relevant output. Note that we need to see this in your solution so that we do not have to go searching for it in the Excel output. Any good report always includes the relevant information up front so it can be referred to immediately without the reading having to look elsewhere.
Please ensure you format and label all graphs, tables and output.
This assessment is worth 35% but you will be given a mark out of 70. Marks for the relevant parts are shown below.
Remember to upload both the Excel and PDF file as solutions to Canvas.
Have fun. Learn something about Dutch music preferences. Agree or disagree with the choices made. Leave the assessment with something you could say to your friends along the lines of Did you know..?.
Background
Spotify is the largest music streaming service globally used by over 30% of streamers. Their ability to offer a wide range of services, including curating personal lists and making recommendations, relies on their massive music data base and the ability to analyse a vast range of data on both the consumer and the music itself.
You have been provided with a data set containing information on 2000 (actually 1994) of the top Spotify Songs according to Dutch subscribers released from 1956 to 2019. Being a Dutch data set there are a large number of tracks by Dutch artists.
Information provided in the spreadsheet includes data on the following variables:
Index a simple identifier
Title name of the track
Artist name of the artist
Genre the first listed genre of the track
Year Year of most recent release
Tempo The Beats Per Minute (BPM) of the song
Energy Spotify rating from 0 to 100 with values closer to 100 indicating a more energetic song
Danceability Spotify rating from 0 to 100 with higher values indicating it is more danceable.
Loudness The overall loudness in decibels (db) with higher values indicating louder song
Valence score from 0 to 100 with the higher the score the more positive the mood of the song
Length Duration of the song in seconds
Acousticness score from 0 to 100 with the higher the score the more acoustic it is
Speechiness score from 0 to 100 with the higher the score the more spoken words there are
Popularity Score from 0 to 100 based on an algorithm that includes the number of times it is played. Songs with a higher score are more popular.
Answer the following questions:
General Popularity
What are the 5 most popular songs in this list? (2 Marks)
Who are the 5 most popular artists note the artist must have at least 5 songs in the top 2000 to qualify? Show the top 20 most popular artists using a graph with the most popular on the left and with decreasing popularity. (4 Marks)
What are the 5 most popular genres among the Dutch top 2000? (3 Marks)
Genre Preference
Is there any evidence at the 5% level of significance that there is a preference for Metal over Classic Rock? You should conduct this test in two ways:
Is there a difference in the average popularity of Metal and Classic Rock Songs? (8 Marks)
Is there a difference in the proportion of Metal and Classic Rock songs which have made it to the top 2000? (7 Marks)
Song Duration
Have you seen the movie Bohemian Rhapsody (if you havent you should)? In the movie, famed music producer Ray Foster tells Queen the song Bohemian Rhapsody was too long to be a single release in 1975. Conduct a test at the 5% level of significance to see if the average length of songs in 1975 was less than the length of Bohemian Rhapsody at 354 seconds. Was Ray Foster right, or was 1975 the year that changed the way popular songs were viewed? (5 Marks)
Musical Features impacting Popularity
Construct a correlation matrix including the variables: Popularity; Year; Beats per minute; Energy; Danceability; Loudness; Valence; Length; Acousticness; Speechiness. (2 Mark)
What does the correlation between Popularity and Year imply? (2 Mark)
Other than Year, what are the three factors which seem to have the biggest impact on popularity? Create appropriate graphs to demonstrate the relationship and briefly explain how the graph supports the correlation measurement. (5 Marks)
Dutch v Global Artists
It is unsurprising that since this is the Dutch top 2000 that there are a number of Dutch Artists and Genres in the data set.
Split the data set in to Dutch and Non Dutch songs (you can assume the Dutch Genres represent Dutch Artists). Using a full range of statistics and appropriate graphical methods, what do you conclude about the relative popularity of Dutch vs Non Dutch artists? (10 Marks)
To see if your conclusion is supported, construct a 95% confidence interval for the average popularity of Dutch songs and another of non Dutch songs. Do these results support your conclusion in the last question? Why? (4 Marks)
Your Own Question
From all of the data provided, propose one question of your own that you are interested in. Construct a table, create a graph or conduct a test which answers that question. (4 Marks)
Dashboard
Build a dashboard to show the Dutch Top 200. In the dashboard include the following:
Top 5 most popular songs (Hint: Consider using the =Rank function) (3 Marks)
Graph showing average popularity by genre (3 Marks)
Table showing the top 10 artists by the number of songs (3 Marks)
A Table showing the most popular song and artist by year. NOTE this last table is difficult and is for those who love playing around with, and learning about, Excel. As such it does not count towards your marks on this assessment, HOWEVER, if you can do it you can get a bonus mark on the assessment. (2 bonus marks)
Do not include this information in your spreadsheet solution, however you may use it to test your Dashboard. The markers will use it to make sure the dashboard is live that is the whole dashboard updates when you add the single piece of information and hit refresh, all tables and graphs update automatically.
Index Title Artist Genre Year Length Popularity
1995 Flame Trees Cold Chisel Oz Rock 1984 260 99
Putting it all together summarising your results
In no more than 250 words, synthesise the information provided in all questions above. When you do this dont just repeat the results instead paint a picture of what youve learned about Dutch song tastes. (5 Marks)