PACIFIC COMP 155 - Probability and Statistics - D2517671

Home> Schools> University of the Pacific> Computer Science (COMP) > COMP 155> Probability and Statistics

PACIFIC COMP 155 - Probability and Statistics

Pages 13

Download Save

Unformatted text preview:

Lecture 10: Probability and Statistics (part 2)This WeekSamplingSlide 4Sampling: Simulation OutputEstimating Distribution from SamplesSampling DistributionsPoint EstimationConfidence IntervalsConfidence Intervals in SimulationHypothesis TestsErrors in Hypothesis TestingHypothesis Testing in SimulationCOMP155Computer SimulationOctober 1, 2008This WeekReview of probability and statistics needed to understand simulationfollow Appendix C in Arena textOutlineMonday (C.1, C.2):○Probability – basic ideas, terminology○Random variables, joint distributionsToday (C.3-C.5):○Sampling○Statistical inference – point estimation, confidence intervals, hypothesis testingSamplingStatistical analysis: purpose is to estimate or infer something about a large population population is a set of data pointspopulation is too large to look at completely,so we only look at a sample from the populationif the sample is randomly selected from the population, the distribution of the sample should be the same as the distribution of the populationin practice: determine a PMF or PDF for a sampleand assume that distribution holds for the entire populationSamplingRandom sample is a set of independent and identically distributed (IID) observations X1, X2, …, Xn from the populationInput modeling: ○observations come from the real world○Arena’s input analyzer can be used to determine distribution functionOutput analysis: ○observations are the results of multiple runs/replications of the simulation○Arena’s output analyzer can be used to characterize the output population from the observations.Sampling: Simulation OutputRandom sample is a set of independent and identically distributed (IID) observations X1, X2, …, Xn from the populationInput modeling: ○observations come from the real world○Arena’s input analyzer can be used to determine distribution functionOutput analysis: ○observations are the results of multiple runs/replications of the simulation○Arena’s output analyzer can be used to characterize the output population from the observations.Estimating Distribution from SamplesSamples: X1, X2, …, Xn assuming a normal distribution, compute:sample meansample varianceThese statistics have their own sampling distribution, which is generally normalnXXnii11)(122nXXsniiSampling DistributionsIf If underlying distribution of X is normal, then the distribution of is also normal.Point EstimationPoint estimates are estimates of population distribution parameters (2, …)Properties of point estimates Unbiased: E(estimate) = parameterEfficient: Var(estimate) is lowest among competing point estimatorsConsistent: Var(estimate) decreases (usually to 0) as the sample size increasesConfidence IntervalsA confidence interval quantifies the likely imprecision in a point estimatorAn interval that contains (covers) the unknown population parameter some specified probabilityCalled a 100 (1 – )% confidence interval for the parameterExample: 87 <  < 123 with probability 95%The value of  is in (87, 123) with 95% confidenceWe’ll leave the computation of confidence intervals to a statistics course … or to Arena’s output analyzer tool.Confidence Intervals in SimulationRun simulation replications, get resultsView each replication of the simulation as a data pointForm a confidence intervalThe confidence interval tells you how close you are to getting the “true” expected output (what you’d get by averaging an infinite number of replications)Hypothesis TestsA hypothesis test is used to test some assertion about the population or its parametersWith sampling, we don’t get true/false result, only get evidence that points one way or anotherNull hypothesis (H0) – what is to be testedAlternate hypothesis (H1 or HA) – denial of H0H0:  = 6 vs. H1:   6H0:  < 10 vs. H1:   10H0: 1 = 2 vs. H1: 1  2Develop a decision rule to decide on H0 or H1 based on sample dataErrors in Hypothesis Testing H0 is really true H1 is really true Decide H0 (“Accept” H0) No error Probability 1 –  is chosen (controlled) Type II error Probability   is not controlled – affected by  and n Decide H1 (Reject H0) Type I Error Probability  No error Probability 1 –  = power of the test 1-α is the probability of your confidence intervalHypothesis Testing in SimulationInput sideSpecify input distributions to drive the simulationCollect real-world data on corresponding processes“Fit” a probability distribution to the observed real-world dataTest H0: the data are well represented by the selected distributionOutput sideHave two or more “competing” designs modeledTest H0: all designs perform the same on output, or test H0: one design is better than anotherSelection of a “best” model

View Full Document


School:
Email:
New Password:
Confirm Password:

PACIFIC COMP 155 - Probability and Statistics

Sign up for free to view:

Please select your school