Inference when considering two populationsInference for the difference of two parametersInference for P1 – P2CI for P1 – P2Slide 5Inference for difference of two population means μ1 – μ2Typical study designsMatched pairs vs two samplesInference in μ1 – μ2: matched pairsSlide 10JMP output for odor exampleSlide 12Conclusions from odors exampleInference in μ1 – μ2: two samplesEDA for pygmalion studySample means and SD’sPygmalion confidence intervalConclusions from the pygmalion studyDegrees of FreedomSlide 20Hypothesis tests for difference of two parametersHypothesis test for p1 – p2Slide 23Slide 24Slide 25Slide 26Slide 27Slide 28Hypothesis test for μ1 – μ2: matched pairsConclusions about odorSlide 31Inference in μ1 – μ2: Two independent samplesSlide 33Matched pairs analysisConclusions from previous exampleMatched pairs cont.Determining a sample sizeDetermining sample sizeDetermine sample size for differences in % and averageSlide 40Determining sample sizes for differences in % and avg.Not completely in FPP but good stuff anywayInference when considering two populationsInference for the difference of two parametersOften we are interested in comparing the population average or the population proportion/percentage for two groupsWe can do these types of comparisons using CI’s and hypothesis testsGeneral ideas and equations don’t changeCI: estimate ± multiplier*SETest statistics: (observed– expected)/SEInference for P1 – P2Lets just jump right into an exampleCI for P1 – P2 Estimate ± multiplier*SEMultiplier comes from the z-tableEverything else we know about confidence intervals is the sameInterpretationWhat does 95% confidence mean€ ˆ p 1−ˆ p 2± multiplierˆ p 1(1−ˆ p 1)n1+ˆ p 2(1−ˆ p 2)n2Inference for difference of two population means μ1 – μ2Two possibilities in collecting data on two variables hereDesign 1: Units are matched in pairsUse “matched pairs inference”Design 2: units not matched in pairsUse “two sample inferences”Typical study designsMatched pairsA) two treatments given to each unitB) units paired before treatments are assigned, then treatments are assigned randomly within pairsTwo samplesA) some units assigned to get only treatment a, and other units assigned to get only treatment b. Assignment is completely at randomB) Units in two different groups compared on some survey variableMatched pairs vs two samplesData collected in two independent samples:No matching, so creating values of some “difference” is meaninglessA “matched pairs” analysis is mathematically wrong and gives incorrect CI’s and p-valuesData collected in matched pairs:Matching, when effective, reduces the SE.A two sample analysis artificially inflates the SE, resulting in excessively wide CI’s and unreliable p-valuesAn example towards the end of these slides will demonstrate thisInference in μ1 – μ2: matched pairsGeneral idea with matched pairs design is to compute the difference for pair of observations and treat the differences as the single variableMeasure y1 and y2 on each unit. Then for each unit computed = y1 – y2Then find a confidence interval for the differencedifference estimate ± multiplier*SEaverage of differences ± t-table value * SD of differences/√nInference in μ1 – μ2: matched pairs Do people perform better on tests when smelling flowers versus smelling nothing?Hirsch and Johnston (1996) asked 21 subjects to work a maze while wearing a mask. The mask was either unscented or carried a floral scent. Each subject worked both mazes. The order of the mask was randomized to ensure fair comparison to the two treatments. The response is the difference in completion times for the unscented and scented masks.Example: Person 1 completed the maze in 30.60 seconds while wearing the unscented mask, and in 37.97 seconds while wearing the scented mask. So, this person’s data value is –7.37 (30.60 – 37.97).JMP output for odor exampleThe differences appear to follow the normal curve. There are no outliersSample average difference is 0.96, suggesting people do better with scented mask.01.05.10.25.50.75.90.95.99-2-10123Normal Quantile Plot-30 -20 -10 0 10 20 30MeanStd DevStd Err Meanupper 95% Meanlow er 95% MeanN0.956666712.5478822.73817236.6683939-4.755061 21MomentsHypothesized ValueActual EstimatedfStd Dev 00.95667 2012.5479Test StatisticProb > |t|Prob > tProb < t 0.3494 0.7305 0.3652 0.6348t TestTest Mean=valueDifferenceDistributionsConclusions from odors exampleThe 95% CI ranges from -4.76 to 6.67, which is too wide a range to determine whether floral odors help or hurt performance for these mazes. In other words, the data suggest that any effect of scented masks is small enough that we cannot estimate it with reasonable accuracy using these 21 subjects. We should collect more data to estimate the effect of the odor more precisely.We also note that this study was very specific. The results may not be easily generalized to other populations, other tests, or other treatments.Inference in μ1 – μ2: two samplesPygmalion studyResearchers gave IQ test to elementary school kids.They randomly picked six kids and told teachers the test predicts these kids have high potential for accelerated growth.They randomly picked different six kids and told teachers the test predicts these kids have no potential for growth. At end of school year, they gave IQ test again to all students. They recorded the change in IQ scores of each student.Let’s see what they found…EDA for pygmalion studyIt looks like being labeled “accelerated” leads to larger improvements than being labeled “no growth”Let’s make a 99% CI to confirm thisImprovement05101520accelerated noneGrow th GroupSample means and SD’s Level Number Mean SD SE accelerated 6 15.17 4.708 1.92 none 6 6.17 3.656 1.49Sample difference is 9.00. The SE of this difference:43.22222212121 SESEnSDnSDSEPygmalion confidence interval99% CI for difference in mean scores (accel – none):Estimate ± mulitplier*SEEstimate is mean1 – mean2Multiplier comes from the t-table (we will talk about df in a sec.)SE of difference from the previous slide€ € x 1− x 1±
View Full Document