Select an industry
The following are examples of industries:
Healthcare
Retail - clothing
Social Media
Education
Motor vehicles
Fast Foods
Research the industry
Find articles: use these as references
Collect visuals: make good use of these throughout your report, as they help with good storytelling and increase audience engagement
Referencing: this is mandatory (see Academic Success Centre)
The following main sections must exist in your report (1500 words)
(Along with the Cover Page and Contents Page)
1. Industry Background (300 words, 6 marks)
What is the industry?
Are there any market statistics? Visuals?
Who are the main players in the industry?
What has been happening, especially with regard to unstructured data and AI?
Instead of using Industry Background as a main title, you may, for example, use the name of your industry directly:
e.g., The Healthcare Industry
2. Unstructured Data and AI/Machine Learning (300 words, 6 marks)
Identify the unstructured data types
How are the unstructured data used?
How is AI/ML (Machine Learning) applied?
What kinds of algorithms are used? For example, a Convolutional Neural Network (CNN)
Machine Learning Resources:
https://www.ibm.com/au-en/analytics/machine-learning
https://developers.google.com/machine-learning/crash-course/ml-intro
Hint: if you Google search IBM along with the name of the AI, you will see links to IBM resources for that AI
Find diagrams of the AI/ML
Explain the AI/ML
The title of this section could be direct:
e.g., Healthcare: Image Reconstruction and Machine Learning
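When explaining an algorithm such as the CNN mentioned above, a small worked example can help. The following is a minimal sketch of the sliding-window operation at the core of a convolutional layer, written in plain Python. The image, the kernel values, and the function name conv2d are illustrative assumptions, not taken from any particular library or dataset.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1) and
    return the resulting feature map as a list of lists."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            # Multiply each kernel weight by the pixel beneath it and sum.
            total = 0
            for a in range(kh):
                for b in range(kw):
                    total += image[i + a][j + b] * kernel[a][b]
            row.append(total)
        output.append(row)
    return output


# Hypothetical 3x4 greyscale image: dark on the left, bright on the right.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Hypothetical 2x2 kernel that responds to dark-to-bright vertical edges.
vertical_edge = [
    [-1, 1],
    [-1, 1],
]

feature_map = conv2d(image, vertical_edge)
print(feature_map)  # [[0, 2, 0], [0, 2, 0]]
```

The non-zero column in the output marks where the brightness changes. In a real CNN the kernel weights are learned from data rather than written by hand.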
3. Best Practices (800 words, 15 marks)
Think about the Five Vs of Big Data:
Data Velocity: real-time data streams
Data Variety: the immense variety of data
Data Veracity: the reliability, uncertainty, or truthfulness of data
Data Volume: the amount of data
Data Value: processing data leads to business insights, which are of enormous value
Also, remember the importance of data architectures and best practices:
https://www.snowflake.com/resource/best-practices-for-managing-unstructured-data/
https://www.databricks.com/discover/data-lakes/introduction
https://www.marklogic.com/product/data-hub-service/
It is also important to remember how data is being moved around and processed/analysed:
Real-time data streams
Real-time feedback
The following sub-sections must exist:
Data Collection & Access
Are there any ethical considerations?
What about the rules and regulations?
Data Storage
Think about the following data architectures:
Data lakes
Data warehouses
Data lakehouses
Data hubs
Also, think about the following databases:
Relational (SQL) databases: identify these if appropriate
NoSQL databases:
Graph Databases
Wide-Column Stores
Document Stores
Key-Value Stores
Remember, databases live in one or more of the data architectures.
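To make the four NoSQL families listed above more concrete, here is a rough sketch that models one record in each style using plain Python data structures. The patient record, field names, and the neighbours helper are hypothetical, and real database products differ in many details. The sketch only shows how the shape of the data changes between models.

```python
import json

# Key-value store: an opaque value looked up by a single key.
key_value = {
    "patient:42": json.dumps({"name": "A. Example", "scan": "scan_001.png"}),
}

# Document store: one nested, self-describing document per entity.
document = {
    "_id": 42,
    "name": "A. Example",
    "scans": [{"file": "scan_001.png", "type": "MRI"}],
}

# Wide-column store: row key -> column family -> columns.
wide_column = {
    "patient:42": {
        "profile": {"name": "A. Example"},
        "scans": {"scan_001": "MRI"},
    },
}

# Graph database: nodes plus labelled edges between them.
graph = {
    "nodes": {"p42": {"label": "Patient"}, "s1": {"label": "Scan"}},
    "edges": [("p42", "HAS_SCAN", "s1")],
}


def neighbours(graph, node, relation):
    """Return the nodes reachable from `node` via edges labelled `relation`."""
    return [dst for src, rel, dst in graph["edges"]
            if src == node and rel == relation]


print(neighbours(graph, "p42", "HAS_SCAN"))  # ['s1']
```

Which model fits best depends on how the data will be queried: by key, by document content, by column, or by relationship.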
Data Sharing
This will relate to the data architectures and databases used
Data Documentation
Why is this important?
Think data dictionaries
Think about tracking changes made to the data
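As an illustration of the two documentation ideas above, the sketch below pairs a small data dictionary with a change log. All field names, the validate and update helpers, and the sample record are assumptions made for the example. A real project would more likely rely on the documentation and versioning features of its chosen platform.

```python
from datetime import datetime, timezone

# Data dictionary: what each field means and what type it should hold.
data_dictionary = {
    "patient_id": {"type": int, "description": "Unique patient identifier"},
    "scan_file": {"type": str, "description": "Path to the stored image"},
}

# Change log: one entry per modification made to the data.
change_log = []


def validate(record):
    """Check a record against the data dictionary and list any problems."""
    problems = []
    for field, spec in data_dictionary.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], spec["type"]):
            problems.append(f"wrong type for {field}")
    return problems


def update(record, field, new_value, author):
    """Change one field and record who changed what, and when."""
    change_log.append({
        "field": field,
        "old": record.get(field),
        "new": new_value,
        "author": author,
        "at": datetime.now(timezone.utc).isoformat(),
    })
    record[field] = new_value


record = {"patient_id": 42, "scan_file": "scan_001.png"}
update(record, "scan_file", "scan_002.png", author="analyst_1")

print(validate(record))      # []
print(change_log[0]["old"])  # scan_001.png
```

The dictionary tells a new user what the data means, and the log lets anyone trace how a value came to be what it is.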
Data Maintenance
This will relate to the data architectures and databases used
Think cloud platforms
4. Propose a question (100 words, 3 marks)
My question is
Think about what you have learned about the industry and the kinds of data, the data architectures, the databases, and the applications (especially using AI/ML) that have been used.
Do you have an idea of what to do with the data and AI? What is the future of this industry? This will help you write your business question.
What kind of AI would you use?
How would it work?
What kinds of insights would be gained?
Final advice: be creative, use visuals to tell a good story, and have fun.