Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

Find two bivariate data sets (at least 30 data points, the more, the better), one that you think will have positive correlation and one that you think will have negative correlation. I like to use...

1 answer below »

Find two bivariate data sets (at least 30 data points, the more, the better), one that you think will have positive correlation and one that you think will have negative correlation.


I like to use this website (https://data.world/) to find data sets by searching topics I'm interested in. You can "sign in with google" using your go.shoreline email account. Sometimes finding a usable data set is the hardest part of doing this kind of analysis. Please let me know if you need help turning something you find on this website into a usable spreadsheet. I'm happy to help you with this.


Please note, do not use time as one of your variables. Find a data set where neither x nor y are measured in time.


Report what your data sets are into the WH08 Survey in Canvas. This will help me check in to see that your data sets will work and that you're using data sets unique to you. Save your data sets in Google Sheets, and send me a link by using the "share" button in the upper right corner (or send me a direct link to the data set you'd like to use if you need help converting it to a spreadsheet).


Along with sending me the links, answer these questions for each of your data sets in the survey. Be detailed.

  1. What is your x-variable and what is your y-variable?
  2. Why do you predict that the correlation will be positive or negative?
  3. How strong do you think the correlation will be: weak, medium, or strong? Why do you think so?

Task #2 (Due by Monday, June 8)

Do the following for each of your data sets, once I've approved the sets you'd like to use:

  1. Generate scatter plot of the entire data set andestimatethe line of best fit. Do not use the program to generate it.
  2. Describe the slope and intercept of your estimate for the line of best fit, and construct the equation of this line.
  3. Identify any outliers. Decide whether you will remove them or not, and justify your choice.
  4. Once your outliers are removed, generate a scatter plot of your reduced data set and estimate a new line of best fit.
  5. Compute the slope and intercept of your adjusted data set, and construct the regression line equation. Describe what each of these numbers tells you about your data set. Be specific and use units.
  6. Compute the correlation of your adjusted data set and describe what this number means.
  7. Choose one data point in your data set and compute its residual. Describe what this number means.
  8. Conjecture about the possible relationships between the variables. Is there cause and effect? Reverse Cause and Effect? Lurking Variables? Coincidence? Justify your conjecture.
Answered Same Day Jun 05, 2021

Solution

Aarti answered on Jun 05 2021
154 Votes
Task 1)
Dataset 1 – Hours Worked
Dataset 2 – Demand relationship
a)
In dataset 1, the X variable (Independent variable) is Hours Worked.
the Y variable (Dependent variable) is Yearly Income.
In dataset 2, the X variable (Independent variable) is Price.
the Y variable (Dependent variable) is Quantity Demanded.
)
The co
elation between the 2 variables can be predicted using the scatterplot diagram.
Co
elation tells whether the 2 variables are positively co
elated (if one variable changes, how is the other variable behaving i.e. they should move in same direction) or, negatively co
elation (the variables should move in opposite direction).
c)
The co
elation between Hours Worked per week and Yearly Income (Dataset 1) should be a strong positive relation as if the hours worked per week increases, the yearly income will rise. Or, if the hours worked per week decreases, yearly income will decrease. i.e. they move in same direction showing strong positive co
elation between hours worked and Yearly Income.
The co
elation between quantity demanded and price (Dataset 2) should be a strong negative relation as the price of a commodity increases, individual buys less quantity of that good. i.e. 2 variables move in opposite direction.
Task 2)
a)
The scatterplot for the dataset1 is given below:
The scatterplot for the dataset 2 is given below:
The estimate line of best fit for dataset 1 is calculated as:
The regression slope is calculated as:
    
The intercept is calculated as:
    
The regression slope and intercept are 0.6843 and 36.857 respectively.
The estimated line of best fit is given by: Yearly Income = 36.857 + 0.6843 Hours...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here