DOC PREVIEW
ISU STAT 511 - HW 6 S04

This preview shows page 1 out of 4 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 4 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 4 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

1Stat 511 HW#6 Spring 2004 (Corrected) 1. The book Nonlinear Regression Analysis and its Applications by Bates and Watts contains a small data set taken from an MS thesis of M.A. Treloar “Effects of Puromycin on Galactosyltransferase of Golgi Membranes.” It is reproduced below. y is reaction velocity (in 2counts/min ) for an enzymatic reaction and x is substrate concentration (in ppm) for untreated enzyme and enzyme treated with Puromycin. x 0.02 0.06 0.11 0.22 0.56 1.10 Untreated 67, 51 84, 86 98, 115 131, 124 144, 158 160y Treated 76, 47 97, 107 123, 139 159, 152 191, 201 207, 200 Apparently a standard model here (for either the untreated enzyme or for the treated enzyme) is the “Michaelis-Menten model” 12iiiixyxθεθ=++ (*) Note that in this model, 1) the mean of y is 0 when 0x=, 2) the limiting (large x) mean of y is 1θ, and 3) the mean of y reaches half of its limiting value when 2xθ=. Begin by considering only the “Treated” part of the data set (and an ()2iid N 0, 'siσε version of the model). Of course, use R to help you do all that follows. Begin by reading in 12 1× vectors y and x . a) Plot vs yxand make “eye-estimates” of the parameters based on your plot and the interpretations of the parameters offered above. (Your eye-estimate of 1θ is what looks like a plausible limiting value for y , and your eye-estimate of 2θ is a value of x at which y has achieved half its maximum value.) b) Add the nls package to your R environment. Then issue the command > REACT.fm<-nls(formula=y~theta1*x/(theta2+x),start=c(theta1=#,theta2=##),trace=T) where in place of # and ## you enter your eye-estimates from a). This will fit the nonlinear model (*) via least squares. What are the least squares estimate of the parameter vector and the “deviance” (error sum of squares) ()()()1221OLS 1 212ˆˆˆˆ and /ˆii iiSSE y x xθθθθ===−+∑θ c) Re-plot the original data with a superimposed plot of the fitted equation. To do this, you may type > conc<-seq(0,1.5,.05) > velocity<-coef(REACT.fm)[1]*conc/(coef(REACT.fm)[2]+conc) > plot(c(0,1.5),c(0,250),type="n",xlab="Conc (ppm)",ylab="Vel (counts/sqmin)") > points(x,y)2> lines(conc,velocity) (The first two commands set up vectors of points on the fitted curve. The third creates an empty plot with axes appropriately labeled. The fourth plots the original data and the fifth plots line segments between points on the fitted curve.) d) Get more complete information on the fit by typing > summary(REACT.fm) > vcov(REACT.fm) Verify that the output of this last call is ()1ˆˆMSE−′DD and that the standard errors produced by the first are square roots of diagonal elements of this matrix. Then use the information produced here and make an approximate 95% prediction interval for one additional reaction velocity, for substrate concentration .50 ppm. e) The concentration, say 100x, at which mean reaction velocity is 2100 counts/min is a function of 12 and θθ. Find a sensible point estimate of 100x and a standard error (estimated standard deviation) for your estimate. f) As a means of visualizing what function the R routine nls minimized in order to find the least squares coefficients, do the following. First set up a grid of ()12,θθ pairs as follows. Type > theta<-coef(REACT.fm) > se<-sqrt(diag(vcov(REACT.fm))) > dv<-deviance(REACT.fm) > gsize<-101 > th1<-theta[1]+seq(-4*se[1],4*se[1],length=gsize) > th2<-theta[2]+seq(-4*se[2],4*se[2],length=gsize) > th<-expand.grid(th1,th2) Then create a function to evaluate the sums of squares > ss<-function(t) + { + sum((y-t[1]*x/(t[2]+x))^2) + } As a check to see that you have it programmed correctly, evaluate this function at OLSˆθ for the data in hand, and verify that you get the deviance. Then evaluate the error sum of squares over the grid of parameter vectors θ set up earlier and produce a contour plot using > SumofSquares<-apply(th,1,ss) > SumofSquares<-matrix(SumofSquares,gsize,gsize) > plot(th1,th2,type="n",main="Error Sum of Squares Contours") > contour(th1,th2,SumofSquares,levels=c(seq(1000,4000,200))) What contour on this plot corresponds to an approximately 90% approximate confidence region for the parameter vector θ ?3 g) Now redo the contour plotting, placing only two contours on the plot using the following code. > plot(th1,th2,type="n",main="Error Sum of Squares Contours") > contour(th1,th2,SumofSquares,levels=dv*c((1+.1*qf(.95,1,10)), (1+.2*qf(.95,2,10)))) Identify on this plot an approximately 95% (joint) confidence region for θ and individual 95% confidence regions for 12 and θθ. (By the way, it would have been possible to simply add these contours to first plot, by making the second call to contour() as above, except for setting “add=T” as a parameter of the call.) h) Use the standard errors for the estimates of the coefficients produced by the routine nls() and make 95% t intervals for12 and θθ. How much different are these from your intervals in g)? (Notice that the sample size in this problem is small and reliance on any version of large sample theory to support inferences is tenuous. I would take any of these inferences above as very approximate. We will later discuss the possibility of using "bootstrap" calculations as an alternative method of inference.) i) Make two different approximate 95% confidence intervals forσ. Base one on carrying over the linear model result that 22/~nkSSEσχ−. Base the other on the “profile likelihood” material. j) Use the R function confint() to get 95% intervals for 12and θθ. That is, add the MASS package in order to get access to the function. Then type > confint(REACT.fm, level=.95) How do these intervals compare to the ones you found in part g)? k) Apparently, scientific theory suggests that treated enzyme will have the same value of 2θ as does untreated enzyme, but that 1θ may change with treatment. That is, if 0 if treated (Puromycin is used)1 otherwise iz= a possible model is ()1312iiiizxyxθθεθ+=++ and the parameter 3θ then measures the effect of the treatment. Go back to the data table and now do a fit of the (3 parameter) nonlinear model including a possible Puromycin effect using all 23 data points. Make 2 different approximately 95% confidence intervals for 3θ. Interpret these. (Do they indicate a statistically detectable


View Full Document

ISU STAT 511 - HW 6 S04

Download HW 6 S04
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view HW 6 S04 and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view HW 6 S04 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?