Name: Assignment 4: Capstone project
Overview
Type: Data Visualisation and Pre-processing report
Due: 11:59 pm AEST Wednesday of Week 7 (21st June, 2023)
Weight: 40% of subject. 120 Points are possible.
Requirements for successful completion of this assessment item:
Completion of Week 2, Week 3, Week 4 and Week 5 Online content.
Completion of the video tutorials for Weeks 1 through 5 in the R Toolkit folder
Chapter 1 of R Programming Fundamentals: Deal with Data Using Various Modelling Techniques
Chapters 2, 3 and 12 of R for Data Science
This assessment involves writing a report that summarises a data science investigation that you have conducted on data that you have collected yourself. The investigation must involve the main topics covered in the subject, most notably data pre-processing (representation, wrangling, tidying) and exploratory data visualisation using R/RStudio.
It is a merger of Exploratory Visualisation and Data Pre-Processing. However, neither the dataset nor the pre-processing and exploratory steps to be carried out will be provided; you have to make independent choices and decisions.
We encourage you to find your own data using good practices. You will need to upload a copy of your data as a separate file when you submit your report, subject to any confidentiality requirements. If there are confidentiality requirements on the data, please contact the subject coordinator and they will make alternative arrangements with you. Your dataset cannot be smaller than 1000 observations of 5 variables, unless the targeted data science problem relates to spatial-temporal data, in which case fewer than 5 dimensions could be allowed. We have also made a number of datasets available in the Capstone Support folder should you not be able to locate your own dataset.
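Before committing to a dataset, it may help to confirm that it meets the minimum size stated above. The following is a minimal sketch in base R, not a required step. The helper name meets_size_requirement, the stand-in data frame example_data and the file name in the comment are all hypothetical and only used for illustration.

```r
# Hypothetical helper (not part of the subject materials): checks a data frame
# against the minimum size of 1000 observations of 5 variables.
meets_size_requirement <- function(df, min_rows = 1000, min_cols = 5) {
  stopifnot(is.data.frame(df))
  nrow(df) >= min_rows && ncol(df) >= min_cols
}

# Stand-in data for illustration only; replace with your own data,
# e.g. example_data <- read.csv("my_data.csv")  (hypothetical file name)
set.seed(1)
example_data <- data.frame(
  id       = seq_len(1200),
  group    = sample(c("a", "b", "c"), 1200, replace = TRUE),
  x        = rnorm(1200),
  y        = runif(1200),
  recorded = as.Date("2023-01-01") + sample(0:180, 1200, replace = TRUE)
)

dim(example_data)                     # number of observations, number of variables
meets_size_requirement(example_data)  # TRUE for this stand-in data
colSums(is.na(example_data))          # missing values per variable
```

If the spatial-temporal exception has been agreed for your data, min_cols can be lowered when calling the helper, for example meets_size_requirement(example_data, min_cols = 3).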
The report should not exceed 10 A4 pages. Of these 10 pages, the main text must fit within 5 A4 pages; the rest can be used for the appendix and code submission. The marking team will not assess any content beyond 5 pages. We do not need you to share your data. Please submit your code in an appendix as long as the complete report is no longer than 10 pages. Otherwise, upload the code separately as an ".R" file. We will not mark your code, but you may refer to particular sections of your code for explanation.
Referencing is an essential part of a capstone-type research report. Please check the file Capstone-Data Sources and support.docx for more details. The same file also contains additional information on open data sources and various external links for help with writing your report.
Data and writing resources for Capstone
You can always bring your own data, provided it conforms to the Assessments Data Specification. You must cite the source of the data. Please do not submit your data to us. Please check that you have appropriate permissions to analyse the data for this subject or for academic purposes. If you seek data from a proprietor (including your workplace), you could mention that you wish to use it for an analytics project that will not be published.
This link provides a useful list of JCU policies on ethics and data management: https://libguides.jcu.edu.au/rdm-toolkit
The following links help you find open source data.
JCU Based Data Source:
https://research.jcu.edu.au/data/default/rdmp/home
Australian:
https://www.abs.gov.au/
https://data.gov.au/
https://www.health.qld.gov.au/research-reports/data
https://www.health.nsw.gov.au/data/Pages/default.aspx
US:
https://data.nasa.gov/browse
https://guides.lib.uw.edu/research/federal/data
https://www.data.gov/
https://www.census.gov/data.html
Organisation for Economic Co-operation and Development (OECD):
https://data.oecd.org/
https://data.oecd.org/united-states.htm
Scientific Data:
https://data.mendeley.com/
https://www.mendeley.com/datasets
World Bank:
https://databank.worldbank.org/home.aspx
Miscellaneous: Not all data are open source.
https://data.world/datasets/atmosphere
Data used in online competitions such as Kaggle is not welcome.
JCU's Data Management Policy
https://www.jcu.edu.au/college-of-arts-society-and-education/postgraduate-study-and-research/research-data-management
Some additional writing resources
Writing objectives and process:
https://www.youtube.com/watch?v=UY7sVKJPTMA
https://www.youtube.com/watch?v=y0vLuxIwrZk
https://writingcenter.unc.edu/tips-and-tools/conference-papers/
Sample papers for format and style
http://www.irce.org/sample.pdf
https://www.ieee-pes.org/images/files/pdf/pg4-sample-conference-paper.pdf
JCU resources on Referencing and a note on plagiarism
Please use the APA referencing style.
https://libguides.jcu.edu.au/referencing