Business Statistics 1 Part A (a) Using the sample data attached, calculate the sample mean and...

Question

Business Statistics 1 Part A (a) Using the sample data attached, calculate the sample mean and standard deviation for the variables: - i. Sales (dollars) (2 marks) ii. Age (years) (2 marks) iii....

1 answer below »

Business Statistics 1

Part A

(a) Using the sample data attached, calculate the sample mean and standard deviation for the variables: -
    i.
    Sales (dollars)
    (2 marks)
    ii.
    Age (years)
    (2 marks)
    iii.
    Growth (annual population growth rate)
    (2 marks)
    iv.
    Income (dollars)
    (2 marks)
    v.
    HS
    (2 marks)
(b) Is there any evidence of skewness in these data sets? Which data set(s) displays negative skewness? Which data set(s) display positive skewness? Which data set is least skewed? In answering this question, you should use Pearson’s Coefficient of skewness developed in lectures, and not simply the skewness measure generated in the PhStat2 or Excel output. Nonetheless, you may provide comparison on both methods displaying and comment on any differences between them (if applicable).
    (5 marks)
(c) Check for normality for ALL variables in the data set. Are there any variables that could be considered NOT approximately normally distributed? Use PhStat to determine your answer here. Include your printouts from PhStat supporting your reasoning
    (5 marks)
(d) Using the sample data, calculate the sample proportion and the standard deviation of:
i. stores whose median age of customer base is less than 32 years (3 marks) ii. stores whose customer base have at least 80% High School qualification (3 marks)

(e) Set up and interpret the following confidence intervals -
i. a 95% confidence interval for the true population mean sales     (4 marks)
ii. a 99% confidence interval for the true population mean annual growth rate of the
    customer base over the past 10 years.     (4 marks)
iii. a 90% confidence interval for the true population proportion of all stores whose
    median age of the customer base is less than 32 years.     (4 marks)
iv. A 98% confidence interval to estimate the true population proportion of all stores whose customer base have at least 80% high school qualifications.     (4 marks)
(f) A follow-up study will provide a point estimate of the population proportion of stores whose median age of the customer base is less than 32 years. The study must provide 90% confidence that the point estimate is within 0.10 of the population proportion. If no previous proportion estimate is available (not even that calculated in (d) above), how large a sample would you recommend for this study?
         (5 marks)
(g) Demographic studies claim that on average the growth rate of the customer base of stores is more than 1.3. Do the data provide significant support for this claim? Use a 5% significance level and the critical value approach (classical approach) to test this claim.
         (9 marks)
(h) The Chamber of Commerce has claimed that more than 30% of all stores customer base is less than 32 years old. How much evidence do the data provide to support this claim? Use a 0.05 level of significance and the p-value approach to test this claim.
         (7 marks)
(i) Imagine that your sample data set only included the first fifteen stores listed in your original data set. For this new data set, you are now told that the growth rate of the customer base followed the normal distribution in the past. With this reduced data set, test the claim that on average the growth rate of the customer base of all stores is more than 1.3 at the 1% significance level.
Compare this result to your answer if A(g). Can you suggest any reasons for the variation, if one exists.
(9 marks)
    [Total Marks = 72 marks]

Part B

Using the computer software package Phstat (or any Excel based software) to confirm your answers with the appropriate printout attached for: -

(a) All confidence intervals in Part A(e) above.     (8 marks)
(b) The sample size calculation in Part A(f) above (3 marks) (c) The hypothesis test of the mean in Part A(g) (3 marks)
(d) The p-value approach to the hypothesis test of the proportion in Part A(h)     (3 marks)
(e) The hypothesis test with the reduced data set in Part A(i)     (3 marks)

    [Total Marks = 20 marks]

Part C
Using:
· the complete data set AND
· simple ordinary least squares regression formulae based solutions AND - the regression package on the PhStat software (computer printout).

(a) Develop two linear models to explain the distribution costs as a function of their
(i) Median Family Income (Income)
(ii) Percentage of customer base with a Higher School Certificate (HS)
         (10 marks)
(b) Which model best describes the behaviour of sales?
    Explain your reasons here     (5 marks)

(c) Develop a multiple regression analysis to test the distribution costs as a function of both Sales and the Number of orders.
Distribution Cost = f (Income, HS, Age)
You should
· comment on the significance of each of the independent variables in this new model and whether or not the model is improved by including these variables jointly and
· discuss statistical measures that support your belief. Which variable appears to most influence sales? Explain how you have a
ived at your decision.
· Setup appropriate tests
In this section you are required to use computer printout only to support your argument and refer to tests of hypotheses on each of the coefficients.
          (10 marks)

          [Total Mark= 25 marks]

Part D
At the Lifestyle Furniture Manufacturing Company, an application of the test of the difference between small sample means arises. New employees are expected to attend a three-day seminar to learn about the company. At the end of the seminar, they are tested to measure their knowledge about the company. The traditional training method has been a lecture and a question-and-answer session. Management decided to experiment with a different training procedure, which processes new employees in two days by using DVDs and having no question-and-answer session. If this procedure works, it could save the company thousands of dollars over a period of several years. However, there is some concern about the effectiveness of the two-day method, and company managers would like to know whether there is any difference between the effectiveness of the two training methods.

a) At the 0.05 level of significance, is there a difference in the variability in training methods A and B?
Use the PhStat software to analyse the problem but give the full hypothesis testing steps in your final presentation.
(11 marks)

) To test the difference in the two training methods, the managers randomly select one group of 15 newly hired employees to take the three-day seminar (method A) and a second group of 12 new employees for the two-day DVD method (method B). The test scores of the two groups are shown above.
Using α = 0.05, determine whether there is a significant difference in the mean scores of the two groups. Assume that the scores for this test are normally distributed and that the population variances are approximately equal.
(11 marks)

c) One group of researchers set out to determine whether there is a difference between ‘average citizens’ and those who are ‘phone survey respondents’.
(Note: This is part of a much larger study. The results in this portion of the study are about the same as those of the actual study except that the sample sizes were 500 to 600.)
Their study was based on a well-known personality survey that attempted to assess the personality profile of both average citizens and phone survey respondents. Suppose they sampled nine phone survey respondents and 10 average citizens in this survey and obtained results for one personality factor, conscientiousness, which are displayed below:
Develop a 99% confidence interval for the true difference in population mean personality scores for conscientiousness between phone survey respondents and average citizens.
(11 marks)
NOTE: Attach the appropriately labelled PhStat / Excel computer printouts that are required for section a), b) and c) above.
          [Total Mark= 33 marks]

tae_636625618929835114_118765_1.docx

Answered Same Day May 22, 2020

Solution

Pooja answered on May 22 2020

148 Votes

Table of Contents
Part C)    2
a) (i)    2
a)(ii)    5
)    7
c)    8
(i)    9
(ii)    10
(iii)    10
Part D)    11
a)    11
)    12
c)    13
Part C)
a) (i)
Model...

SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Pooja · Accepted Answer

Table of Contents
Part C)	2
a) (i)	2
a)(ii)	5
b)	7
c)	8
(i)	9
(ii)	10
(iii)	10
Part D)	11
a)	11
b)	12
c)	13
Part C)
a) (i)
Model 1:
	Income
	Sales
	x^2
	y^2
	xy
	26748.51
	1695712.620
	715482787
	2875441289627
	45357785973
	53063.79
	3403862.053
	2815765809
	11586276875853
	180621821169
	36090.14
	2710352.905
	1302498205
	7346012869642
	97817015791
	32058.07
	529215.459
	1027719852
	280069002045
	16965626230
	47843.42
	663686.654
	2288992837
	440479974698
	31753039336
	50180.97
	2546324.335
	2518129750
	6483767619013
	127777025065
	30710.08
	2787046.202
	943109014
	7767626532083
	85590411827
	29141.70
	612696.054
	849238679
	375396454587
	17855004597
	25980.15
	891822.033
	674968194
	795346538544
	23169670191
	18730.88
	1124967.965
	350845866
	1265552922276
	21071639956
	31109.23
	909500.976
	967784191
	827192025345
	28293875048
	35614.12
	2631166.881
	1268365543
	6923039155671
	93706693040
	23038.43
	882972.654
	530769257
	779640707712
	20342303681
	34531.72
	1078573.124
	1192439686
	1163319983815
	37244985117
	30350.36
	844320.194
	921144352
	712876589996
	25625421843
	38964.94
	1849119.029
	1518266549
	3419241183410
	72050812018
	49392.77
	3860007.316
	2439645728
	14899656479574
	190656453558
	25595.69
	826573.880
	655139347
	683224379098
	21156728795
	29622.61
	604682.868
	877499023
	365641370853
	17912284772
	31586.10
	1903611.600
	997681713
	3623737123655
	60127666359
	39674.56
	2356808.391
	1574070711
	5554545791888
	93505335917
	28878.98
	2788571.957
	833995486
	7776133559367
	80531113775
	24287.08
	634878.286
	589862255
	403070438034
	15419339722
	46711.24
	2371627.369
	2181939942
	5624616377390
	110781655224
	33449.81
	2627837.961
	1118889789
	6905532349273
	87900680506
	31694.45
	1868116.330
	1004538161
	3489858622413
	59208919615
	25459.22
	2236796.862
	648171883
	5003260201853
	56947103405
	47047.34
	1318876.234
	2213452201
	1739434520610
	62049618599
	26433.24
	1868097.836
	698716177
	3489789524868
	49379878442
	33396.66
	1695218.566
	1115336899
	2873765986511
	56614638074
	26179.36
	2700194.415
	685358890
	7291049878797
	70689361660
	33454.64
	1156049.774
	1119212938
	1336451079965
	38675229011
	42271.50
	643858.444
	1786879712
	414553695910
	27216862216
	46514.75
	2188687.

Business Statistics 1 Part A (a) Using the sample data attached, calculate the sample mean and standard deviation for the variables: - i. Sales (dollars) (2 marks) ii. Age (years) (2 marks) iii....

Solution

Answer To This Question Is Available To Download

Related Questions & Answers

Submit New Assignment