diff_months: 5

Applied Data Science with Python

Flat 50% Off Order New Solution
Added on: 2025-05-23 09:12:14
Order Code: LD523405
Question Task Id: 0
  • Subject Code :

    ML430

Applied Data Science with Python

Course-End Project Problem Statement

Sales Analysis

Project statement:

AAL, established in 2000, is a well-known brand in Australia, particularly recognized for its clothing business. It has opened branches in various states, metropolises, and tier-1 and tier-2 cities across the country.

The brand caters to all age groups, from kids tothe elderly.

Currently experiencing a surge in business, AAL is actively pursuing expansion opportunities. To facilitate informed investment decisions, the CEO has assigned the responsibility to the head of AALs sales and marketing (S&M) department. The specific tasks include:

  • Identify the states that are generating the highest revenues.
  • Develop sales programs for states with lower revenues. The head of sales and marketing has requested your assistance with this task.

Analyze the sales data of the company for the fourth quarter in Australia, examining it on a state-by-state basis. Provide insights to assist the company in making data-driven decisions for the upcoming year.

*Enclosed is the CSV (AusApparalSales4thQrt2020.csv) file that covers the said data.

Perform the following steps:

As a data scientist, you must perform the following steps on the enclosed data:

  1. Data wrangling
  2. Data analysis
  3. Data visualization
  4. Report generation
  5. Data wrangling
  6. Ensure that the data is clean and free from any missing or incorrect entries.
    • Inspect the data manually to identify missing or incorrect information using the functions isna() and notna().
  7. Based on your knowledge of data analytics, include your recommendations for treating missing and incorrect data(dropping the null values or filling them).
  8. Choose a suitable data wrangling techniqueeither data standardization or normalization. Execute the preferred normalization method and present the resulting data.(Normalization is the preferred approach for this problem.)
  9. Share your insights regarding the application of the GroupBy() function for either data chunking or merging, and offer a recommendation based on your analysis.
  10. Data analysis
  11. Perform descriptive statistical analysis on the data in the Sales and Unit columns. Utilize techniques such as mean, median, mode, and standard deviation for this analysis.
  12. Identify the group with the highest sales and the group with the lowest sales based on the data provided.
  13. Identify the group with the highest and lowest sales based on the data provided.
  14. Generate weekly, monthly, and quarterly reports to document and present the results of the analysis conducted.

(Use suitable libraries such as NumPy, Pandas, and SciPy for performing the analysis.)

  1. Data visualization
  2. Use suitable data visualization libraries to construct a dashboard for the head of sales and marketing. The dashboard should encompass key parameters:
    • State-wise sales analysis for different demographic groups (kids, women, men, and seniors).
    • Group-wise sales analysis (Kids, Women, Men, and Seniors) across various states.
    • Time-of-the-day analysis: Identify peak and off-peak sales periods to facilitate strategic planning for S&M teams. This information aids in designing programs likehyper-personalizationandNext Best Offersto enhance sales.
  3. Ensure the visualization is clear and accessible for effective decision-making by the head of sales and marketing (S&M).

The dashboard must contain daily, weekly, monthly, and quarterly charts.

(Any visualization library can be used for this purpose. However, since statistical analysis is being done, Seaborn is preferred.)

  1. Include your recommendation and indicate why you are choosing the recommended visualization package.
  2. Report generation
  3. Use JupyterLab Notebook for generating reports, which includes tasks such as data wrangling, analysis, and visualization. Please note that JupyterLab enables you to integrate code seamlessly with graphs and plots.
  4. Use Markdown in suitable places while presenting your report.
  5. Use suitable graphs, plots, and analysis reports in the report, along with recommendations. Note that various aspects of analysis require different graphs and plots.
    • Use a box plot for descriptive statistics.
    • Use the Seaborn distribution plot for any other statistical plotting.
  • Uploaded By : Nivesh
  • Posted on : May 23rd, 2025
  • Downloads : 0
  • Views : 107

Order New Solution

Can't find what you're looking for?

Whatsapp Tap to ChatGet instant assistance

Choose a Plan

Premium

80 USD
  • All in Gold, plus:
  • 30-minute live one-to-one session with an expert
    • Understanding Marking Rubric
    • Understanding task requirements
    • Structuring & Formatting
    • Referencing & Citing
Most
Popular

Gold

30 50 USD
  • Get the Full Used Solution
    (Solution is already submitted and 100% plagiarised.
    Can only be used for reference purposes)
Save 33%

Silver

20 USD
  • Journals
  • Peer-Reviewed Articles
  • Books
  • Various other Data Sources – ProQuest, Informit, Scopus, Academic Search Complete, EBSCO, Exerpta Medica Database, and more