Data: Basic Data Analysis in RStudio
Background: This course is all about data visualization. However, we must first have some understanding of the dataset that we are using to create the visualizations. A resource is provided to you to learn the simple commands in RStudio. You are also expected to locate appropriate available resources to learn the functionalities of RStudio.
Assignment:
a. Use RStudio to generate a word document with basic data analysis of the following dataset: dataset_price_personal_computers.csv
b. Use the following video from Dr. Francis to learn the simple commands to conduct basic data analysis with RStudio: Basic Data Analysis in RStudio.mp4
Questions/Requests:
a. Create a summary of stats for the dataset. (provide a screenshot)
b. Create a correlation of stats for the dataset. (provide a screenshot) (Hint: Transform may be needed)
c. What is the Min, Max, Median, and Mean of the Price? (provide a screenshot)
d. What is the correlation values between Price, Ram, and Ads? (provide a screenshot)
e. Create a subset of the dataset with only Price, CD, and Premium. (provide a screenshot)
f. Create a subset of the dataset with only Price, HD, and Ram where Price is greater than or equal to $1750. (provide a screenshot)
g. What percentage of Premium computers were sold? (provide a screenshot)(Hint: Categorical analysis)
h. How many Premium computers with CDs were sold? (provide a screenshot)(Hint: Contingency table analysis)
I. How many Premium computers with CDs priced over $2000 were sold? (provide a screenshot)(Hint: Conditional table analysis)
Your document should be an easy-to-read font in MS Word (other word processors are fine to use but save it in MS Word format). Your cover page should contain the following: Title, Student’s name, University’s name, Course name, Course number, Professor’s name, and Date.
Submit your assignment on or before the due date.