BSB131 Applied Data Analytics
BSB131 Applied Data Analytics
51435389255Assessment Instructions
This assessment concentrates on your ability to calculate common numerical statistical measures and to use those in combination with common visualisations to make decisions.
Data for this assessment can be found on the Excel Spreadsheet PST2.xlsx.
For this assessment please do the working in Excel, but copy all of your solutions to an answer document in Word.
Once completed save your solutions as a PDF file using the title <your_name>PST2.pdf e.g. Andrew_paltridge pst2.pdf.
Submit only the PDF and not the Excel worksheet. We will request that if required.
00Assessment Instructions
This assessment concentrates on your ability to calculate common numerical statistical measures and to use those in combination with common visualisations to make decisions.
Data for this assessment can be found on the Excel Spreadsheet PST2.xlsx.
For this assessment please do the working in Excel, but copy all of your solutions to an answer document in Word.
Once completed save your solutions as a PDF file using the title <your_name>PST2.pdf e.g. Andrew_paltridge pst2.pdf.
Submit only the PDF and not the Excel worksheet. We will request that if required.
Problem Solving Task 2
Background
This assessment uses the same data as PST1. Consumer debt has been a major topic of discussion in Australia over the last few years. From the Banking Royal Commission in 2017 through to the current Cost of Living crisis questions are being posed regarding issues surrounding consumer indebtedness and their ability to pay.
You have been given information on 9 variables for a series of credit card holders. In addition to their customer number you have been provided:
Balance the average monthly unpaid credit card balance ($)
Income the average monthly household income from all sources ($)
Spend average amount spent using the credit card per month ($)
Loans the average amount paid servicing other loans per month ($)
Card level of card issued (Blue for lowest level, Gold for mid tier and Platinum highest)
Gender gender of the card holder restricted to Male or Female
Status household income arrangements defined as couple or single income earner
Children Number of children in home
Education highest level of education of card owner (High School, Diploma, Bachelor or Postgraduate)
Your Task
For the Balance variable calculate the full range of statistics including Mean, Median, Mode, First and Third Quartiles, Standard Deviation, Range, Interquartile Range and Skewness Coefficient. Copy all of these measures to your answer document
Recreate the Histogram from last assessment and also do a Box and Whisker Plot for the Balance Variable. Make sure all diagrams are labelled and copy to your answer document.
In less than 100 words describe the variable Balance and indicate if you think that overall unpaid credit card debt is a problem.
Split the Balance Data in to two groups For those customers who have other loan repayments and those who do not. Calculate the full range of statistics for both groups including a coefficient of variation. Do Box-Plots for both groups.
In less than 100 words what can you conclude about credit card debt when comparing the two groups?
Recreate the Debt Stress Index Variable from PST1
Using just the four variables: DSI, Balance, Loans and Spend, construct a correlation matrix showing the relationship between them. This can be done using the Data / Data Analysis / Correlation function in Excel. Copy the results to your answer document.
In less than 100 words what can you say about the relationship between Debt Stress and the three variables Balance, Loans and Spend. Dont just define the values, tell us what you have learned.
Assessment is due Friday 29th March, 11:59pm.