CS 188: Artificial Intelligence
Fall 2008
Lecture 18: Decision Diagrams
10/30/2008
Dan Klein – UC Berkeley

Announcements
- P4 EXTENDED to Tuesday 11/4
- Midterms graded; pick up after lecture
- Midterm course evaluation up on the web soon, please fill it out!
- Final contest instructions out today! Prizes will be good.

Sampling
- Basic idea:
  - Draw N samples from a sampling distribution S
  - Compute an approximate posterior probability
  - Show this converges to the true probability P
- Outline:
  - Sampling from an empty network (prior sampling)
  - Rejection sampling: reject samples disagreeing with evidence
  - Likelihood weighting: use evidence to weight samples

Prior Sampling
[Diagram: samples drawn variable-by-variable from the Cloudy / Sprinkler / Rain / WetGrass network]

Rejection Sampling
- Let's say we want P(C): no point keeping all samples around, just tally counts of C outcomes
- Let's say we want P(C | s): same thing, tally C outcomes, but ignore (reject) samples which don't have S=s
- This is rejection sampling
- It is also consistent for conditional probabilities (i.e., correct in the limit)
[Diagram: list of samples such as (c, s, r, w) over the Cloudy / Sprinkler / Rain / WetGrass network]

Likelihood Weighting
- Problem with rejection sampling:
  - If evidence is unlikely, you reject a lot of samples
  - You don't exploit your evidence as you sample
  - Consider P(B | a) in a Burglary -> Alarm network
- Idea: fix the evidence variables and sample the rest
- Problem: the sample distribution is not consistent!
- Solution: weight each sample by the probability of the evidence given its parents
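The three samplers above can be sketched for the Cloudy / Sprinkler / Rain / WetGrass network. This is a minimal sketch: the CPT values below are illustrative assumptions, not numbers from the slides.

```python
import random

# Illustrative CPTs (assumed values, not from the slides).
P_C = 0.5
P_S = {True: 0.1, False: 0.5}            # P(S=true | C)
P_R = {True: 0.8, False: 0.2}            # P(R=true | C)
P_W = {(True, True): 0.99, (True, False): 0.9,
       (False, True): 0.9, (False, False): 0.0}  # P(W=true | S, R)

def prior_sample():
    """Sample every variable in topological order (prior sampling)."""
    c = random.random() < P_C
    s = random.random() < P_S[c]
    r = random.random() < P_R[c]
    w = random.random() < P_W[(s, r)]
    return c, s, r, w

def rejection_sample(n):
    """Estimate P(C=true | S=true): reject samples that disagree with S=s."""
    kept = [c for c, s, r, w in (prior_sample() for _ in range(n)) if s]
    return sum(kept) / len(kept)

def likelihood_weighted(n):
    """Fix the evidence S=true; weight each sample by P(s | parents)."""
    num = den = 0.0
    for _ in range(n):
        c = random.random() < P_C
        weight = P_S[c]                  # probability of evidence given parent
        r = random.random() < P_R[c]     # downstream variables sampled as usual
        _ = random.random() < P_W[(True, r)]
        num += weight * c
        den += weight
    return num / den
```

Both estimators converge to the exact posterior (here P(C=true | s) = 0.05 / 0.30 = 1/6), but likelihood weighting never wastes a sample, which matters when the evidence is unlikely.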
Likelihood Sampling
[Diagram: weighted samples drawn over the Cloudy / Sprinkler / Rain / WetGrass network]

Likelihood Weighting
- Sampling distribution if z is sampled and e is fixed evidence: S_WS(z, e) = prod_i P(z_i | Parents(Z_i))
- Now, samples have weights: w(z, e) = prod_i P(e_i | Parents(E_i))
- Together, the weighted sampling distribution is consistent: S_WS(z, e) * w(z, e) = P(z, e)

Likelihood Weighting
- Note that likelihood weighting doesn't solve all our problems: rare evidence is taken into account for downstream variables, but not upstream ones
- A better solution is Markov chain Monte Carlo (MCMC), which is more advanced
- We'll return to sampling for robot localization and tracking in dynamic BNs

Pacman Contest

Recap: Inference Example
- Find P(W | F=bad)
- Restrict all factors; no hidden variables to eliminate (this time!)
- Just join and normalize:

  W     P(W)      W     P(F=bad | W)      W     P(W, F=bad)      W     P(W | F=bad)
  sun   0.7       sun   0.2               sun   0.14             sun   0.34
  rain  0.3       rain  0.9               rain  0.27             rain  0.66

  (Forecast CPTs: P(good | sun) = 0.8, P(bad | sun) = 0.2; P(good | rain) = 0.1, P(bad | rain) = 0.9)

Decision Networks
- MEU: choose the action which maximizes the expected utility given the evidence
- Can directly operationalize this with decision diagrams:
  - Bayes nets with nodes for utility and actions
  - Let us calculate the expected utility for each action
- New node types:
  - Chance nodes (just like BNs)
  - Actions (rectangles; must be parents, act as observed evidence)
  - Utilities (depend on action and chance nodes)

Decision Networks
- Action selection:
  - Instantiate all evidence
  - Calculate the posterior over the parents of the utility node
  - Set the action node each possible way
  - Calculate the expected utility for each action
  - Choose the maximizing action

Example: Decision Networks

  W     P(W)      A      W     U(A, W)
  sun   0.7       leave  sun   100
  rain  0.3       leave  rain  0
                  take   sun   20
                  take   rain  70

- EU(Umbrella = leave) = 0.7 * 100 + 0.3 * 0 = 70
- EU(Umbrella = take) = 0.7 * 20 + 0.3 * 70 = 35
- Optimal decision = leave

Example: Decision Networks
- Now with evidence Forecast = bad:

  W     P(W | F=bad)
  sun   0.34
  rain  0.66

- EU(Umbrella = leave) = 0.34 * 100 + 0.66 * 0 = 34
- EU(Umbrella = take) = 0.34 * 20 + 0.66 * 70 = 53
- Optimal decision = take

Value of Information
- Idea: compute the value of acquiring each possible piece of evidence
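The action-selection loop above, and the value-of-information idea it leads into, can be sketched for the umbrella network. The tables are from the slides; the function names are illustrative.

```python
# Umbrella decision network (CPTs and utilities from the slides).
P_W = {"sun": 0.7, "rain": 0.3}
P_F_given_W = {"sun": {"good": 0.8, "bad": 0.2},
               "rain": {"good": 0.1, "bad": 0.9}}
U = {("leave", "sun"): 100, ("leave", "rain"): 0,
     ("take", "sun"): 20, ("take", "rain"): 70}

def posterior_W(forecast=None):
    """P(W | F=forecast): restrict, join, normalize; the prior if no evidence."""
    if forecast is None:
        return dict(P_W)
    joint = {w: P_W[w] * P_F_given_W[w][forecast] for w in P_W}
    z = sum(joint.values())
    return {w: p / z for w, p in joint.items()}

def meu(forecast=None):
    """Set the action each possible way; return (best action, its expected utility)."""
    post = posterior_W(forecast)
    eu = {a: sum(post[w] * U[(a, w)] for w in post) for a in ("leave", "take")}
    best = max(eu, key=eu.get)
    return best, eu[best]

def vpi_forecast():
    """Expected gain in MEU from observing the forecast before acting."""
    p_f = {f: sum(P_W[w] * P_F_given_W[w][f] for w in P_W) for f in ("good", "bad")}
    return sum(p_f[f] * meu(f)[1] for f in p_f) - meu()[1]
```

With these tables, meu() picks leave (EU 70), meu("bad") picks take, and the forecast distribution comes out as P(F=good) = 0.59, P(F=bad) = 0.41, so vpi_forecast() returns 7.7.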
- Can be done directly from the decision network
- Example: buying oil drilling rights
  - Two blocks A and B, exactly one has oil, worth k
  - Prior probabilities 0.5 each, mutually exclusive
  - Current price of each block is k/2
  - MEU = 0 (either action is a maximizer)
- Solution: compute the value of information = the expected gain in MEU from observing the new information
- A probe gives an accurate survey of A. Fair price?
  - Survey may say "oil in a" or "oil in b", probability 0.5 each
  - If we know OilLoc, MEU is k/2 (either way)
  - Gain in MEU? VPI(OilLoc) = k/2
  - Fair price: k/2

  O   P(O)      D   O   U(D, O)
  a   1/2       a   a    k/2
  b   1/2       a   b   -k/2
                b   a   -k/2
                b   b    k/2

Value of Information
- Current evidence E=e; utility depends on S=s
- Potential new evidence E': suppose we knew E' = e'
- BUT E' is a random variable whose value is currently unknown, so we must compute the expected gain over all possible values e':
  VPI(E' | e) = [ sum over e' of P(e' | e) * MEU(e, e') ] - MEU(e)
- (VPI = value of perfect information)

VPI Example

  A      W     U(A, W)      F     P(F)
  leave  sun   100          good  0.59
  leave  rain  0            bad   0.41
  take   sun   20
  take   rain  70

- MEU with no evidence: 70 (leave)
- MEU if forecast is good: about 94.9 (leave)
- MEU if forecast is bad: about 52.9 (take)
- VPI(F) = 0.59 * 94.9 + 0.41 * 52.9 - 70, about 7.7

VPI Properties
- Nonnegative in expectation
- Nonadditive: consider, e.g., obtaining E_j twice
- Order-independent

VPI Scenarios
- Imagine actions 1 and 2, for which U1 > U2. How much will information about E_j be worth?
  - Little: we're sure action 1 is better
  - Little: information is likely to change our action but not our utility
  - A lot: either action could be much better

Reasoning over Time
- Often, we want to reason about a sequence of observations:
  - Speech recognition
  - Robot localization
  - User attention
  - Medical monitoring
- Need to introduce time into our models
- Basic approach: hidden Markov models (HMMs)
- More general: dynamic Bayes' nets

Markov Models
- A Markov model is a chain-structured BN: X1 -> X2 -> X3 -> X4 -> ...
- Each node is identically distributed (stationarity)
- The value of X at a given time is called the state
- Parameters: called transition probabilities or dynamics; they specify how the state evolves over time (also, initial probabilities)
[Demo: Battleship]
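The chain dynamics just described can be sketched with a two-state weather chain; the transition values below are illustrative assumptions, not from the slides. Repeatedly pushing the state distribution one step forward (a mini-forward update) converges to a stationary distribution.

```python
# Two-state Markov chain with assumed transition probabilities P(X_{t+1} | X_t).
T = {"sun": {"sun": 0.9, "rain": 0.1},
     "rain": {"sun": 0.3, "rain": 0.7}}

def forward_step(p):
    """One mini-forward update: P(X_{t+1}=x') = sum_x P(x' | x) P(X_t = x)."""
    return {x2: sum(T[x1][x2] * p[x1] for x1 in p) for x2 in ("sun", "rain")}

def stationary(p, steps=200):
    """Iterate the forward update; the result approaches the stationary distribution."""
    for _ in range(steps):
        p = forward_step(p)
    return p
```

For these numbers the stationary distribution is sun: 0.75, rain: 0.25, regardless of the initial distribution.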
Conditional Independence
- Basic conditional independence:
  - Past and future are independent given the present
  - Each time step only depends on the previous one
- This is called the (first-order) Markov property
- Note that the chain is just a (growing) BN
- We can always use