252x0641 12 15 05 ECO252 QBA2 Final EXAM December 2006 Version 1 Name and Class hour I 18 points Do all the following Note that answers without reasons and citation of appropriate statistical tests receive no credit Most answers require a statistical test that is stating or implying a hypothesis and showing why it is true or false by citing a table value or a p value If you haven t done it lately take a fast look at ECO 252 Things That You Should Never Do on a Statistics Exam or Anywhere Else Regression A seeks to explain the selling price of a home in terms of a group of variables explained on the output sheet Note that regressions 1 and 7 are identical Look at the definitions of the variables carefully and in particular notice which are interaction variables a The homes in this regression are in three different areas There are dummy variables to indicate that the homes are in Area 1 or Area 2 Why isn t there a dummy variable for Area 3 1 b In Regression 1 what coefficients are significant at the 5 level 2 c What independent variables did I remove from the problem to get to Regression 2 from Regression 1 Why 2 d Following the same process I went on to remove one or more variables each time until I got to Regression 5 When I got to Regression 5 I ran the best subsets regression 6 I concluded that it was time to quit removing variables Between the best subsets regression and the characteristics of the coefficients of the results in Regression 5 I felt that I had gone as far as was reasonable in removing independent variables What are the three things that led me to think that regression 5 was the best that I could do 3 e Using Regression 5 and assuming that all homes have two baths Regression 5 effectively becomes 3 regressions relating price to living area Take the coefficient of bath multiply it by two and add it to the constant to get the effective intercept for homes with two baths Using L or any other symbol that you find convenient for living area what are the equations relating living area to price in 3 points Area 1 Area 2 Area 3 11 f Continuing with Regression 5 and assuming that a home has 2 thousand square feet of living area and 2 baths what would it sell for in Area 1 Area 2 Area 3 What is the percent difference between the lowest and highest price 2 g We have not yet dealt with the question of whether the coefficients in Regression 5 are reasonable In order to do this look at two homes in Area 1 that have two baths If one has 2 thousand square feet of 1 252x0641 12 15 05 living area and the other 3 how would there prices differ Does that seem reasonable Try the same for a home in area 3 3 16 h As I warned you I now repeated Regression 1 as Regression 7 without using the VIFs Much to my surprise I ended up dropping the same variables as I did after Regression 1 Why 1 i Continuing in the same way I worked myself to Regression 9 Looking at the things I usually check this looked pretty good Then I tried to check the coefficients in the same way that I did in g Why was I very unhappy What is there in Regression 8 that could explain these results 4 j Regression 11 is a stepwise regression The printout which continues on page 7 presents four different possible regressions in column form Look at in each case a coefficient has a t value under it and a p value for a significance test After the fourth try the computer refused to add any more independent variables The only regression here that I thought was worth looking at was the one with four independent variables What can you tell me about its acceptability 3 24 k Do an F test to compare regressions 2 and 3 and to find out if lot 1 and lot 2 have any explanatory power 3 II Hand in your third computer problem 2 to 7 points 2 252x0641 12 15 05 III Do at least 4 of the following 7 Problems at least 12 each or do sections adding to at least 50 points Anything extra you do helps and grades wrap around You must do parts a and b of problem 1 Show your work State H 0 and H1 where applicable Use a significance level of 5 unless noted otherwise Do not answer questions without citing appropriate statistical tests That is explain your hypotheses and what values from what table were used to test them Clearly label what section of each problem you are doing The entire test has about 151 points but 70 is considered a perfect score Don t waste our time by telling me that two means proportions variances or medians don t look the same to you You need statistical tests There are two blank pages below 1 a If I want to test to see if the mean of x 2 is larger than the mean of x1 my null hypotheses are Note D 1 2 i 1 2 and D 0 v 1 2 and D 0 ii 1 2 and D 0 vi 1 2 and D 0 iii 1 2 and D 0 vii 1 2 and D 0 iv 1 2 and D 0 viii 1 2 and D 0 2 The first two columns below represent times for 25 workers on an industrial task The third column is the difference between them Row x1 x2 d 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 6 11 5 13 6 42 4 65 5 82 4 08 4 01 5 26 5 25 7 66 6 29 5 41 6 17 5 50 4 06 6 19 6 71 4 41 5 25 4 85 6 50 5 24 7 29 4 99 4 26 4 81 4 19 5 17 4 07 4 58 2 97 3 39 4 14 4 31 6 68 5 37 3 95 4 93 4 04 2 40 4 71 5 93 2 93 4 25 4 41 4 68 3 50 6 09 2 87 3 06 1 30 0 94 1 25 0 58 1 24 1 11 0 62 1 12 0 94 0 98 0 92 1 46 1 24 1 46 1 66 1 48 0 78 1 48 1 00 0 44 1 82 1 74 1 20 2 12 1 20 Assume that 05 Minitab gives us the following summary edited Descriptive Statistics x1 x2 d Variable x1 x2 d N 25 25 25 N 0 0 0 Mean 5 50 4 30 1 20 SE Mean 0 200 0 212 StDev 1 00 1 06 Minimum 4 010 2 400 0 4400 …
View Full Document
Unlocking...