Food and Nutrition Dataset Assignment

Added on: 2023-07-13 09:33:28
Order Code: clt317616
Question Task Id: 0
  • Country: Australia

Use the dataset “Food Nutrition” and answer the following questions:

  1. Set the working directory for your work
  2. Import the dataset, supplied in .xls(x) format, into the console
  3. View the top 10 rows of the data
  4. View the last 20 rows of the data
  5. Show the summary of the data
  6. Create a vector “test” using the top 10 values of variable Protein_(mg)
  7. Select the top 5 rows of the first 5 variables in matrix format
  8. What is the class of the Sodium_(mg) variable?
  9. Create a new variable “EPW” by dividing Energ_Kcal by Water. What is the dimension of the new dataset?
  10. Create a subset of the dataset where Energ_Kcal is less than 500. What is the dimension of this new dataset?
  11. Plot a bar plot of Energ_Kcal against Water using the new subset
  12. Plot a histogram of Sugar_tot variable using the new subset
  13. Find the top 10 products based on the following criteria:
    1. The higher the Energ_Kcal, the higher the ranking
    2. The lower the water content, the higher the ranking
  14. Create a subset of the data where product_desc contains “CHEESE” and list the summary statistics of the subset
  15. Using the cut function on the Water variable, divide the whole dataset into 6 bins and list the summary statistics of each bin
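
Several of the tasks above (9, 10, 13, 14 and 15) reduce to a few row-wise operations. The assignment itself calls for an R script, so the following is only a minimal Python sketch of the underlying logic, written with the standard library. The five sample rows and their values are made up, the column names follow the task wording rather than the actual file, the ranking in task 13 is assumed to mean energy first with water as the tie-breaker, and the hypothetical helper `bin_index` only approximates R's `cut`, which also pads the data range slightly.

```python
import math
from statistics import mean

# Hypothetical rows standing in for the "Food Nutrition" sheet.
# Values are invented for illustration only.
rows = [
    {"product_desc": "CHEESE,BLUE", "Energ_Kcal": 353, "Water": 42.4},
    {"product_desc": "BUTTER,WITH SALT", "Energ_Kcal": 717, "Water": 15.9},
    {"product_desc": "MILK,WHOLE", "Energ_Kcal": 61, "Water": 88.1},
    {"product_desc": "CHEESE,CHEDDAR", "Energ_Kcal": 403, "Water": 37.0},
    {"product_desc": "OIL,OLIVE", "Energ_Kcal": 884, "Water": 0.0},
]


def with_epw(records):
    """Return copies of the records with EPW = Energ_Kcal / Water added."""
    out = []
    for r in records:
        water = r["Water"]
        # Zero water would raise ZeroDivisionError in Python; use infinity,
        # which is what R reports for a positive number divided by zero.
        epw = r["Energ_Kcal"] / water if water else math.inf
        out.append({**r, "EPW": epw})
    return out


def dimension(records):
    """(rows, columns), the analogue of R's dim()."""
    return (len(records), len(records[0]) if records else 0)


def top_n(records, n=10):
    """Rank by energy descending; ties go to the lower water content."""
    return sorted(records, key=lambda r: (-r["Energ_Kcal"], r["Water"]))[:n]


def bin_index(value, lo, hi, n_bins=6):
    """Equal-width, right-closed bin number in 0..n_bins-1."""
    if hi == lo:
        return 0
    width = (hi - lo) / n_bins
    idx = math.ceil((value - lo) / width) - 1
    return min(max(idx, 0), n_bins - 1)  # clamp the two extreme values


data = with_epw(rows)                                        # task 9
low_energy = [r for r in data if r["Energ_Kcal"] < 500]      # task 10
cheese = [r for r in data if "CHEESE" in r["product_desc"]]  # task 14

# Task 15: group energy values by water bin, then summarise each bin.
waters = [r["Water"] for r in data]
lo, hi = min(waters), max(waters)
bins = {}
for r in data:
    bins.setdefault(bin_index(r["Water"], lo, hi), []).append(r["Energ_Kcal"])

print(dimension(data), dimension(low_energy))
print([r["product_desc"] for r in top_n(data, 3)])           # task 13
print({b: mean(v) for b, v in sorted(bins.items())})
```

Adding EPW increases the column count by one while the row count stays the same, and the subset keeps every column but fewer rows, which is what the two dimension questions are probing.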

The objective of the project is to use the dataset 'Factor-Hair-Revised.csv' to build an optimum regression model to predict satisfaction. You are expected to:

  • Perform exploratory data analysis on the dataset. Showcase some charts and graphs. Check for outliers and missing values (8 marks)
  • Is there evidence of multicollinearity? Showcase your analysis (6 marks)
  • Perform simple linear regression for the dependent variable with every independent variable (6 marks)
  • Perform PCA/Factor analysis by extracting 4 factors. Interpret the output and name the factors (20 marks)
  • Perform multiple linear regression with customer satisfaction as the dependent variable and the four factors as independent variables. Comment on the model output and validity. Your remarks should make the results meaningful to everybody (20 marks)
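
The multicollinearity check and the per-variable simple regressions listed above both rest on the same sums of squares and cross-products. The submission itself must be in R; the following is a minimal standard-library Python sketch of that arithmetic only. The predictor names `x1` to `x3`, all data values, and the 0.8 correlation threshold are assumptions chosen for illustration and do not come from 'Factor-Hair-Revised.csv'.

```python
from itertools import combinations
from math import sqrt


def _centered_sums(x, y):
    """Return (Sxy, Sxx, Syy) about the means of x and y."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy, sxx, syy


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    sxy, sxx, syy = _centered_sums(x, y)
    return sxy / sqrt(sxx * syy)


def simple_ols(x, y):
    """Least-squares fit y = intercept + slope * x; also returns R squared."""
    sxy, sxx, syy = _centered_sums(x, y)
    slope = sxy / sxx
    intercept = sum(y) / len(y) - slope * sum(x) / len(x)
    return slope, intercept, (sxy * sxy) / (sxx * syy)


# Hypothetical predictors: x2 is almost exactly 2 * x1, so the pair is collinear.
predictors = {
    "x1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "x2": [2.1, 3.9, 6.2, 8.0, 9.8],
    "x3": [5.0, 1.0, 4.0, 2.0, 3.0],
}
satisfaction = [3.0, 5.0, 7.0, 9.0, 11.0]  # constructed as 2 * x1 + 1

THRESHOLD = 0.8  # assumed cut-off for flagging a pair as highly correlated

# Pairwise correlations among predictors: the multicollinearity screen.
flagged = [
    (a, b)
    for a, b in combinations(predictors, 2)
    if abs(pearson(predictors[a], predictors[b])) > THRESHOLD
]

# One simple regression of the response on each predictor in turn.
fits = {name: simple_ols(x, satisfaction) for name, x in predictors.items()}

print("highly correlated pairs:", flagged)
for name, (slope, intercept, r2) in fits.items():
    print(f"{name}: slope={slope:.3f} intercept={intercept:.3f} R^2={r2:.3f}")
```

A pairwise screen like this only catches collinearity between two variables at a time; variance inflation factors would be the usual follow-up, and they are not sketched here.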

Please note the following:

  • You have to submit 2 files:
    1. Business report: Submit the answers to all the questions in sequential order. Your answers should include detailed explanations and inferences for every question. The report should not be filled with code. You will be evaluated based on the business report. It should include a detailed explanation of the approach used, insights, inferences, and all code outputs such as graphs and tables.
    2. R code file: This is a must and will be used for reference during evaluation.
  • You must give the sources of the data presented. Do not refer to blogs, Wikipedia, etc.
  • Any assignment found to be copied or plagiarized from other group(s) will not be graded and will be marked as zero.

  • Uploaded By : Katthy Wills
  • Posted on : July 13th, 2023