Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

Assignment 5 PS 3780 Data Literacy & Visualization, Summer 2022 Due Date: Thursday, June 23, 2021 at 11:59 p.m. Please save your visualizations and answers to these questions as one .pdf �le (use the...

1 answer below »

Assignment 5
PS 3780 Data Literacy & Visualization, Summer 2022
Due Date: Thursday, June 23, 2021 at 11:59 p.m.
Please save your visualizations and answers to these questions as one .pdf
�le (use the �save as� function in most word processors). Be sure to include your name,
your teammate's name if there is anyone, and the assignment number. Submit the �le
to Carmen by the due date. Remember we are looking for professional visualizations so
please include a meaningful title as well as axis labels and a legend.
Plotting in R - MOMA Artwork
Use the Artworks dataset from Carmen and answer the questions below. The data has
een downloaded and partially cleaned from github and includes pieces in the Museum
of Modern Art Collection. A random subset of around 6000 pieces we selected to make
this a reasonably sized dataset. Import the dataset into R. Write down what command
that you use for answering each question.
Summarize and Transform
1. (1 pt) Load in the data and check that it is read properly. What variables are in
the dataset? Summarize Height and Width; what's the mean, median, max, and
min?
2. (1 pt) Using log(), transform and save di�erent versions of both the Height and
Width variables so that outliers are less extreme. For these new variables, what's
the mean, median, max, and min? (Note: log(data$variable) will calculate the log
of each value in variable and return a vector of the same length.)
What's the Size?
1. (2 pt) Make a basic x-y plot comparing the logged width and logged height of pieces
in the collection. Indicate the department that each piece is in by customizing some
part of the graph. Write a paragraph describing the apparent relationship (or lack
thereof) between these three variables.
1
Density Plots
1. (2 pt) Pick three departments that have at least 300 pieces in the dataset. Plot
overlapping density curves for the logged height of objects in these departments.
Make sure each density curve is visible. Write a paragraph describing the di�erences
and similarities between the pieces in the three departments? (Notes: table() can
help you decide which departments to focus on. Watch video lecture 10c for help
with the plot.)
2
Answered Same Day Jun 22, 2022

Solution

Suraj answered on Jun 23 2022
81 Votes
1.
The data set is loaded in the R-Studio environment. The variables in the data set are given as follows:
colnames(df)
[1] "Title" "Artist" "Nationality" "Classification" "Department" "DateAcquired" "Height" "Width"
The summary statistics for the height and weight variables are given as follows:
summary(df$Height)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.00 17.90 27.70 37.53 43.82 1574.80
summary(df$Width)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.00 17.46 25.30 38.19 43.16 9144.00
2.
The height and weight variables transformed to other scale using log transformation. The summary statistics for the transformed variables given as follows:
summary(df$log_height)
Min. 1st Qu. Median Mean 3rd Qu. Max.
-Inf ...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here