PACIFIC COMP 155 - Probability and Statistics

Unformatted text preview:

Lecture 10: Probability and Statistics (part 2)This WeekSamplingSlide 4Sampling: Simulation OutputEstimating Distribution from SamplesSampling DistributionsPoint EstimationConfidence IntervalsConfidence Intervals in SimulationHypothesis TestsErrors in Hypothesis TestingHypothesis Testing in SimulationCOMP155Computer SimulationOctober 1, 2008This WeekReview of probability and statistics needed to understand simulationfollow Appendix C in Arena textOutlineMonday (C.1, C.2):○Probability – basic ideas, terminology○Random variables, joint distributionsToday (C.3-C.5):○Sampling○Statistical inference – point estimation, confidence intervals, hypothesis testingSamplingStatistical analysis: purpose is to estimate or infer something about a large population population is a set of data pointspopulation is too large to look at completely,so we only look at a sample from the populationif the sample is randomly selected from the population, the distribution of the sample should be the same as the distribution of the populationin practice: determine a PMF or PDF for a sampleand assume that distribution holds for the entire populationSamplingRandom sample is a set of independent and identically distributed (IID) observations X1, X2, …, Xn from the populationInput modeling: ○observations come from the real world○Arena’s input analyzer can be used to determine distribution functionOutput analysis: ○observations are the results of multiple runs/replications of the simulation○Arena’s output analyzer can be used to characterize the output population from the observations.Sampling: Simulation OutputRandom sample is a set of independent and identically distributed (IID) observations X1, X2, …, Xn from the populationInput modeling: ○observations come from the real world○Arena’s input analyzer can be used to determine distribution functionOutput analysis: ○observations are the results of multiple runs/replications of the simulation○Arena’s output analyzer can be used to characterize the output population from the observations.Estimating Distribution from SamplesSamples: X1, X2, …, Xn assuming a normal distribution, compute:sample meansample varianceThese statistics have their own sampling distribution, which is generally normalnXXnii11)(122nXXsniiSampling DistributionsIf If underlying distribution of X is normal, then the distribution of is also normal.Point EstimationPoint estimates are estimates of population distribution parameters (2, …)Properties of point estimates Unbiased: E(estimate) = parameterEfficient: Var(estimate) is lowest among competing point estimatorsConsistent: Var(estimate) decreases (usually to 0) as the sample size increasesConfidence IntervalsA confidence interval quantifies the likely imprecision in a point estimatorAn interval that contains (covers) the unknown population parameter some specified probabilityCalled a 100 (1 – )% confidence interval for the parameterExample: 87 <  < 123 with probability 95%The value of  is in (87, 123) with 95% confidenceWe’ll leave the computation of confidence intervals to a statistics course … or to Arena’s output analyzer tool.Confidence Intervals in SimulationRun simulation replications, get resultsView each replication of the simulation as a data pointForm a confidence intervalThe confidence interval tells you how close you are to getting the “true” expected output (what you’d get by averaging an infinite number of replications)Hypothesis TestsA hypothesis test is used to test some assertion about the population or its parametersWith sampling, we don’t get true/false result, only get evidence that points one way or anotherNull hypothesis (H0) – what is to be testedAlternate hypothesis (H1 or HA) – denial of H0H0:  = 6 vs. H1:   6H0:  < 10 vs. H1:   10H0: 1 = 2 vs. H1: 1  2Develop a decision rule to decide on H0 or H1 based on sample dataErrors in Hypothesis Testing H0 is really true H1 is really true Decide H0 (“Accept” H0) No error Probability 1 –  is chosen (controlled) Type II error Probability   is not controlled – affected by  and n Decide H1 (Reject H0) Type I Error Probability  No error Probability 1 –  = power of the test 1-α is the probability of your confidence intervalHypothesis Testing in SimulationInput sideSpecify input distributions to drive the simulationCollect real-world data on corresponding processes“Fit” a probability distribution to the observed real-world dataTest H0: the data are well represented by the selected distributionOutput sideHave two or more “competing” designs modeledTest H0: all designs perform the same on output, or test H0: one design is better than anotherSelection of a “best” model


View Full Document

PACIFIC COMP 155 - Probability and Statistics

Download Probability and Statistics
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Probability and Statistics and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Probability and Statistics 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?