Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

Outline The project is to use a multiple regression analysis to analyze a data set that is of interest to you. If you have a strong interest in two group or analysis of variance you can do that with...

1 answer below »
Outline
The project is to use a multiple regression analysis to analyze a data set that is of interest to you. If you have a strong interest in two group or analysis of variance you can do that with my concu
ence.
The final report for the project should be a 5-7 page paper that describes the questions of interest, how you used your data set to analyze these questions with details on the steps you used in your analysis, your findings about your question of interest and the limitations of your study. Specifically, your report should contain the following:

1. Abstract: A one paragraph summary of what you set out to learn, and what you ended up finding. It should summarize the entire report.
1. Introduction: A discussion of what questions you are interested in.
1. Data Set: Describe details about how the data set was collected and the variables in the data set.
1. Analysis: Describe how you used multiple regression to analyze the data set. Specifically, you should discuss how you ca
ied out the steps in analysis discussed in class, i.e., exploration of data to find an initial reasonable model, checking the model and changes to the model based on your checking of the model.
1. Results: Provide inferences about the questions of interest and discussion.
1. Limitations of study and conclusion: Describe any limitations of your study and how they might be overcome in future research and provide
ief conclusions about the results of your study.

1. Abstract
2. Introduction
a. What’s the problem?
. Why is it important
c. How do you plan to solve it?
d. Who cares?
e. Why do they care
f. (industry graph)
g. Lit Review (Background) Industry Review
i. Describe the industry
ii. Scholar.google.com
iii. Academic papers
iv. Articles
v. What have other people done in this research in the past
vi. *cited!!!
2. Data
a. What is your data
. Where did you get it
c. Descriptive statistics table
3. Methodology
a. Describe your methodology
. Include equations
c. Linear regression
d.
4. Results
a. Describe your results
. Are they significant?
c. 10 step process
d. Alpha (.05)
5. Conclusion
a. The model is good
. Why is it important
c. This is important to whom?
d. How will this change the industry
e. Why should I care??
Data Sets
The project will be of most interest to you if you find questions of interest and a data set that are of interest to you. Examples of questions of interest are as follows:
1. What properties of a baseball team best predict its success over the course of a season?
1. What properties of a college are related to its rank in the U.S. News and World Report rankings?
1. Is the unemployment rate related to economic measures such as interest rates, stock returns, and the inflation rate?
1. What properties of a state predict the proportion of the vote that George Bush (John Ke
y) received in it?
You will need a data set to explore your question of interest. I will be happy to help you with suggestions. The data set should ideally contain at least 30-50 observations or more (e.g., companies, people, countries, etc., as the case may be), and at least 4 variables (pieces of information about the observations; e.g., stock price, revenues, profits, salaries, gender, etc.), although if that is not possible, exceptions will be allowed (subject to my approval). Do not be concerned if your dataset is large.
One of the variables should be such that it is a numerical variable that would be of interest to try to model or forecast (e.g., for the examples above, team winning percentage, stock price change, U.S. News and World Report rank, gas mileage, unemployment rate, and proportion of vote received respectively).
I will be happy to discuss ideas with you.
Here are a few potential sources of ideas and data:
1. http:
kaggle.com
1. http:
www.hawkeslearning.com/Statistics/dis/datasets.html
1. https:
www.springboard.com
log/free-public-data-sets-data-science-project
1. https:
www.dataquest.io
log/free-datasets-for-projects
1. http:
lib.stat.cmu.edu/DASL/
Samples
A good sample of what I’m expecting from the projects and reports is contained at the web site http:
pages.stern.nyu.edu/~jsimonof/classes/1305/projdoc/ . Note that these reports are for a class taught at New York University by Jeffrey Simonoff, so some of the methods used in the regression analyses may be unfamiliar to you.
Answered Same Day Nov 27, 2021

Solution

Shakeel answered on Dec 01 2021
148 Votes
7
Abstract
In this paper, it is tried to analyze the factors affect the price of used car. For that purpose, the underlying variables – Years of registration, Power PS and Kilometer are taken. Historical data of past 25 years is taken according to the car model. Descriptive statistic shows that there is a huge variation in price data and no of kilometer a car has run. Data of all four variables are also not normally distributed. Further, the regression analysis shows that there is a significant relationship between price of used cars and the other three variables. Such findings would be useful for better pricing of used car based on significant variables. Industry would grow in guise of better dealing between dealers and customers. Government authority would also be in benefited in deciding motor vehicle tax and insurance premium.
Introduction
The value of running car depends upon the several factors like what is the manufacturing model, how run the car has run; the power of engine, fuel efficiency and so on. Co
ect pricing of old car is important for both dealers as well as customers. People who don’t afford the new car generally pursue to buy the old one and put them on restoration and customization to make them as per choice. Therefore, co
ect pricing is important to have their dream car in budget. Pricing of old car is also important for deciding the premium of motor bike insurance. However, estimation of old car’s price is generally not ca
ied out mathematically and therefore, it always remain in blind alley to find the co
ect price. Hence, we have no sufficient literature and research papers on this field. Therefore, it is interesting to find how the value of old car is inte
elated to the different underlying variables. Such findings would be quit helpful for dealer as well as customers. Government authority may also use such report for their calculation of motor bike tax and insurance purpose. Car making companies may also use the finding to better gauge their production process to yield better price of their car after the usage for certain period.
Data
Four major variables are taken – Price of used car, Year of registration, Power PS and Kilometer of running. According to the model of car, figure of these four variables are found. Data is taken for the last 25 years. All variables are classified as numeric variables because they have certain values and can be measured on the scale.
The descriptive statistics of all four variables is as follows –
    
    Price
    Year Of Registration
    Power PS
    Kilomete
    
    
    
    
    
    Mean
    17,295.44
    2,004.58
    115.55
    125,617.94
    Standard E
o
    5,886.26
    0.15
    0.32
    65.81
    Median
    2,950.00
    2,003.00
    105.00
    150,000.00
    Mode
    0.00
    2,000.00
    0.00
    150,000.00
    Standard Deviation
    3,587,905.48
    92.87
    192.14
    40,113.10
    Sample...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here