BUS8375 Research and Data Analysis
BUS8375 Research and Data Analysis
Assignment 4 (In-Class) Quantitative Data Analysis
Instructions:
Review the subset of data that has been extracted from the Sustainable Development Report 2022 (https://dashboards.sdgindex.org/). The data is located in Week 11 on eConestoga as well as in the Evaluations Module under Assignment #4. Using this data, complete the following analysis:
Select a Sample from the Data
Data is available for 163 countries in this subset. For the purposes of this assignment we will use a smaller sample of data for the majority of our analysis. Randomly select 25 countries from the list of countries that will act as your sample dataset for this population.
Create a worksheet within your excel document that contains the data for your randomly selected sample of 25 countries. NOTE: This is the data set that you will use for all questions except 3b, c, and d. (2 pts)
Central Tendency of Coded Data SDG12 Responsible Consumption and Production
The dashboard rating (indicated by a colour) can be considered ordinal data.
For your data sample, code the Goal 12 Dashboard data. (2 pts)
Calculate the Mode, Median, Mean, and Interquartile Range for your coded response. (4 pts)
Measure of Dispersion SDG 3 Good Health and Well Being
For your data sample, manually calculate the standard deviation of the Goal 3 Score in excel (be sure to show your work). (5 pts).
For the entire population (i.e. all countries), calculate the standard deviation of the Goal 3 Score in excel (you may use the standard deviation excel formula or manually calculate). (1 pts)
Based on 3a and 3b, comment on whether or not your sample is representative of the population in two sentences or less. (1 pts)
Assume we had a normal distribution for the data we were looking at in Question 6b. Between what range would you expect to find 68% of the data. (1 pts)
Pie Chart: SDG 1 No Poverty
Draw a pie-chart for your data sample that summarizes how countries are trending in achieving SDG 1 No Poverty. Hint: You will first need to create a frequency distribution table for each category (i.e. on track, moderately increasing, etc.). (5 pts)
Summarize the data in the pie-chart in two or less sentences (i.e. what is the chart telling us). (1 pts)
Histogram: SDG 13 Climate Action
Using the Goal 13 Score for your data sample, create a frequency distribution table using four (4) equal intervals (2 pts)
Draw a histogram to present the data from your table (3 pts)
Summarize the data in the histogram in two or less sentences (i.e. what is the data telling us?). (1 pts).
Scatterplot: SDG of Choice vs. Population
Pick any one of the SDG Scores that you think might have a relationship with the overall 2022 SDG Index Score.
Draw a scatterplot showing the 2022 SDG Index Score on the X axis and the Goal Score on the Y axis for your data sample. (3 pts)
Summarize the data in the scatterplot in two sentences or less (i.e. what is the graph telling us). (1 pts)
From your scatterplot, identify the formula for your regression line. To do this add a Simple Linear Trendline. Be sure to select Display Equation on Chart in the formatting window of the trend line. (2 pts)
Using the formula you found in 6d, what is the predicted value of a countries Goal Score for your chosen SDG if they have a 2022 SDG Index Score of 75? (2 pts)
Students Choice
Select one (or two) variable(s) that you would like to create a visual summary for
Select an appropriate method to display your data that we have not already practiced above (i.e. pareto chart, stem and leaf plot, or box plot).
Calculate the Central Tendency for your variable (based on data type, select the appropriate method) (2 pts).
Generate a chart to present the data for your selected variable (4 pts)
Summarize the data in the chart in two or less sentences (i.e. what is the data telling us?). (1 pts).
Deliverable:
Submit an Excel document with your responses to the above. Please be sure to show detailed calculations where required. It is recommended to use one worksheet per question.
Submit the assignment in the Assignment drop box, before the due date. Submissions more than 5 minutes late will be penalized 10%. Assignments received after one day late will be penalized another 10%. Late submissions will not be accepted after 2 days late.
Grading:
Grading for this assignment will be based upon:
Visual appearance of your graphs title, legend, axes labels
Accuracy of your response
Calculations shown where required.
Explanation of results
The total grade for this assignment is 42 points and the assignment is worth 15% of your final grade.