CP3403 Data Mining Practice and Analysis Assignment

Added on: 2023-02-04 12:08:07
  • Country: Australia

Requirements (Tasks)

The whole task of this assignment consists of the following procedural steps.

Step 1: Set up a scenario (either an imagined but realistic business situation or an actual analysis problem) in which you are given a domain-specific dataset and asked to analyze it. The purpose of the analysis might be to understand (get an overview of, or learn about) the given data, or to solve a specific analytical problem, depending on the scenario you made up.

Step 2: Find and obtain your own domain-specific dataset that fits the scenario you made up. The dataset could be unique or publicly available. Some public datasets are available from the UCI machine learning repository (http://archive.ics.uci.edu/ml/).

Step 3: Choose appropriate data mining techniques (algorithms); see the details for each option in Step 4 below.

** Note: The order of the above three steps can be changed. For example, you may find an interesting dataset first and then set up a specific data-mining scenario that fits the analysis of the chosen dataset. **

Step 4: You can select either of two options for this assignment.

Option (1) Programming-intensive Assignment

  • Once you have your own domain-specific dataset and have chosen a data mining algorithm, you need to design and implement the chosen algorithm in your preferred programming language.
  • A series of preprocessing steps will be required at this stage. The preprocessing procedure should be designed carefully (consider what kind of processing is required, how it will be done, and why) to make your data ready to be fed to your program. Some parts of this preprocessing procedure can be included in your program as part of the pre-data-mining module.
  • Your final program must be a stand-alone data-mining tool designed for your own purpose of data analysis. It is expected that your program includes the following modules (and it may include more sub-modules if needed):
    • Pre-data-mining module: designed for the necessary preprocessing and for getting the data ready to be fed to the next module (the data-mining module). You don't need to include all required preprocessing in this module. It is assumed that some initial preprocessing (e.g. cleaning noisy data) can be done externally using other software tools (e.g. Excel or Weka).
    • Data-mining module: the chosen data mining algorithm is implemented here. You can directly borrow the algorithm from a popular existing data mining method, or you can design your own algorithm (by amending an existing one).
    • Post-mining module: this module presents/reports the output produced by the previous modules. The result can be given as a simple text report or, additionally, as a non-text visualization (e.g. graph, chart or diagram).
  • This programming-intensive assignment still requires an analysis. Try to find all the patterns you can detect with your implemented algorithm. Try to compare and contrast the result obtained with your chosen preprocessing scheme and algorithm against the results obtained with other existing algorithms or other preprocessing methods.
  • Note: to compare the result of your program with that of another existing algorithm, you can use existing data mining tools (e.g. Weka) to obtain the result of the other algorithm.
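
As an illustration only, the three modules described above could be laid out as in the following Python sketch. It is not a required design: the function names (pre_mining, mine, post_mining), the choice of min-max scaling and k-means, the first-k centroid initialization, and the toy (age, annual income) data are all assumptions made for this example.

```python
from math import dist  # available in Python 3.8+


def pre_mining(rows):
    """Pre-data-mining module: min-max scale every column to [0, 1]."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    # Fall back to 1.0 for constant columns to avoid division by zero.
    spans = [(max(c) - min(c)) or 1.0 for c in cols]
    return [[(v - lo) / sp for v, lo, sp in zip(row, lows, spans)]
            for row in rows]


def mine(points, k, max_iter=100):
    """Data-mining module: plain k-means.

    Centroids start at the first k points, a simplification chosen here
    so that the run is deterministic.
    """
    centroids = [list(p) for p in points[:k]]
    labels = []
    for _ in range(max_iter):
        # Assign each point to its nearest centroid.
        new_labels = [min(range(k), key=lambda c: dist(p, centroids[c]))
                      for p in points]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members)
                                for col in zip(*members)]
    return labels, centroids


def post_mining(labels, k):
    """Post-mining module: a simple text report of cluster sizes."""
    return "\n".join(f"cluster {c}: {labels.count(c)} points"
                     for c in range(k))


# Hypothetical toy data: (age, annual income)
data = [(25, 30000), (27, 32000), (24, 29000),
        (52, 88000), (55, 91000), (50, 85000)]

scaled = pre_mining(data)
labels, centroids = mine(scaled, k=2)
report = post_mining(labels, 2)
print(report)
```

Keeping the modules as separate functions makes it easy to swap one preprocessing scheme or algorithm for another when doing the comparison the assignment asks for.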

Option (2) Analysis-intensive Assignment

  • Once you have chosen your own domain-specific dataset, you need to design your own data-mining analysis scheme. This analysis scheme can consist of multiple steps:
    • Set up a strategy for preprocessing your data. A series of preprocessing steps will be required and needs to be designed carefully (consider what kind of processing is required, how it will be done, and why). You may include multiple different preprocessing schemes for the comparison analysis.
    • Set up a strategy for data mining. You need to select one data mining area (clustering, classification, or association rules mining) of your choice and select AT LEAST TWO existing data mining algorithms in your chosen area. For example, if you chose clustering as your data mining area, you can apply two algorithms, DBSCAN and K-means, and compare the two results. Alternatively, you can design a combined algorithm which applies multiple algorithms from the same or different data mining areas in a series. Your strategy can also be designed to apply different parameters to one algorithm. Another strategy is to apply multiple preprocessing (attribute selection) schemes to one algorithm.
  • You can choose one data mining tool (e.g. Weka) to analyze your chosen dataset. Apply the data-mining strategy you have set up to your chosen (preprocessed) data using the data mining tool and try to find all the patterns you can detect.
  • Do various comparison experiments, either by applying different data mining algorithms (or strategies) to the same chosen dataset or by applying the same algorithm to differently preprocessed datasets.
  • Critically analyze the experimental results and discuss/demonstrate why a chosen algorithm (strategy) is superior/inferior to the other algorithm (strategy).
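
A comparison experiment of the kind described above can be organised as a small harness that runs each algorithm on the same dataset and reports a common score. The Python sketch below is only one possible setup, assuming classification as the chosen area: the two classifiers (a majority-class baseline and 1-nearest-neighbour), the leave-one-out accuracy measure, and the toy rows are all choices made for this example, not part of the assignment. With a tool such as Weka the same comparison would be done through the tool itself.

```python
from collections import Counter
from math import dist  # available in Python 3.8+


def majority_baseline(train, _query):
    """Always predict the most common class among the training rows."""
    return Counter(label for _, label in train).most_common(1)[0][0]


def one_nearest_neighbour(train, query):
    """Predict the class of the closest training row (Euclidean distance)."""
    return min(train, key=lambda row: dist(row[0], query))[1]


def leave_one_out_accuracy(classifier, rows):
    """Hold out each row in turn, train on the rest, count correct guesses."""
    hits = 0
    for i, (features, label) in enumerate(rows):
        train = rows[:i] + rows[i + 1:]
        hits += classifier(train, features) == label
    return hits / len(rows)


# Hypothetical toy dataset: (features, class label)
rows = [
    ((1.0, 1.1), "a"), ((1.2, 0.9), "a"), ((0.9, 1.0), "a"),
    ((1.1, 1.2), "a"), ((1.0, 0.8), "a"),
    ((3.0, 3.1), "b"), ((3.2, 2.9), "b"), ((2.9, 3.0), "b"),
]

# Same dataset, two algorithms, one shared evaluation measure.
results = {
    clf.__name__: leave_one_out_accuracy(clf, rows)
    for clf in (majority_baseline, one_nearest_neighbour)
}
for name, accuracy in results.items():
    print(f"{name}: {accuracy:.3f}")
```

The accuracy figures alone are not the analysis; the discussion should explain why one strategy scores higher on this particular data, for example because the baseline ignores the attributes entirely.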

  • Uploaded By : Katthy Wills
  • Posted on : February 04th, 2023