BN Semantics 3 – Now it’s personal!
Graphical Models – 10708
Carlos Guestrin
Carnegie Mellon University
September 22nd, 2008
Readings: K&F: 3.3, 3.4
10-708 – Carlos Guestrin 2006-2008

Independencies encoded in BN
- We said: all you need is the local Markov assumption: (Xi ⊥ NonDescendants_Xi | Pa_Xi)
- But then we talked about other (in)dependencies, e.g., explaining away
- What are the independencies encoded by a BN?
  - The only assumption is local Markov
  - But many others can be derived using the algebra of conditional independencies!

Understanding independencies in BNs – BNs with 3 nodes
- Local Markov Assumption: a variable X is independent of its non-descendants given its parents, and only its parents
- Indirect causal effect: X → Z → Y
- Indirect evidential effect: X ← Z ← Y
- Common cause: X ← Z → Y
- Common effect: X → Z ← Y

Understanding independencies in BNs – Some examples
(figure: an example BN over variables A, B, C, D, E, F, G, H, I, J, K)

Understanding independencies in BNs – Some more examples
(figure: the same example BN over A–K)

An active trail – Example
(figure: a BN over A, B, C, D, E, F, F′, F′′, G, H)
When are A and H independent?

Active trails formalized
A trail X1 – X2 – ··· – Xk is an active trail when variables O ⊆ {X1, …, Xn} are observed if, for each consecutive triplet in the trail, one of the following holds:
- Xi-1 → Xi → Xi+1, and Xi is not observed (Xi ∉ O)
- Xi-1 ← Xi ← Xi+1, and Xi is not observed (Xi ∉ O)
- Xi-1 ← Xi → Xi+1, and Xi is not observed (Xi ∉ O)
- Xi-1 → Xi ← Xi+1, and Xi is observed (Xi ∈ O), or one of its descendants is

Active trails and independence?
- Theorem: variables Xi and Xj are independent given Z ⊆ {X1, …, Xn} if there is no active trail between Xi and Xj when the variables in Z are observed
(figure: the example BN over A–K again)

More generally: Soundness of d-separation
- Given BN structure G, the set of independence assertions obtained by d-separation is: I(G) = {(X ⊥ Y | Z) : d-sep_G(X; Y | Z)}
- Theorem (soundness of d-separation): if P factorizes over G, then I(G) ⊆ I(P)
- Interpretation: d-separation only captures true independencies
- Proof discussed when we talk about undirected models

Existence of dependency when not d-separated
- Theorem: if X and Y are not d-separated given Z, then X and Y are dependent given Z under some P that factorizes over G
- Proof sketch:
  - Choose an active trail between X and Y given Z
  - Make this trail dependent
  - Make all else uniform (independent) to avoid “canceling out” the influence

More generally: Completeness of d-separation
- Theorem (completeness of d-separation): for “almost all” distributions P that factorize over G, we have I(G) = I(P)
- “Almost all” distributions: except for a set of measure zero of parameterizations of the CPTs (assuming no finite set of parameterizations has positive measure)
- This means that if X and Y are not d-separated given Z, then ¬(X ⊥ Y | Z)
- Proof sketch given in lecture for a very simple case

Interpretation of completeness
- Theorem (completeness of d-separation): for “almost all” distributions P that factorize over G, we have I(G) = I(P)
- The BN graph is usually sufficient to capture all independence properties of the distribution!
- But only for complete independence: P ⊨ (X=x ⊥ Y=y | Z=z), ∀ x ∈ Val(X), y ∈ Val(Y), z ∈ Val(Z)
- Often we have context-specific independence (CSI): ∃ x ∈ Val(X), y ∈ Val(Y), z ∈ Val(Z): P ⊨ (X=x ⊥ Y=y | Z=z)
- Many factors may affect your grade; but if you are a frequentist, all other factors are irrelevant

Algorithm for d-separation
- How do I check if X and Y are d-separated given Z?
- There can be exponentially many trails between X and Y
- A two-pass linear-time algorithm finds all d-separations for X:
  1. Upward pass: mark Z and the ancestors of Z
  2. Breadth-first traversal from X; stop the traversal at a node if the trail is “blocked”
- (Some tricky details apply – see the reading)

What you need to know
- d-separation and independence
- A sound procedure for finding independencies
- Existence of distributions with these independencies
- (Almost) all independencies can be read directly from the graph, without looking at the CPTs

Announcements
- Homework 1:
  - Due next Wednesday – beginning of class!
  - It’s hard – start early, ask questions
- Audit policy: no sitting in, official auditors only – see the course website

Building BNs from independence properties
- From d-separation we learned:
  - Start from the local Markov assumptions and obtain all independence assumptions encoded by the graph
  - For most P’s that factorize over G, I(G) = I(P)
  - All of this discussion was for a given G that is an I-map for P
- Now, give me a P – how can I get a G?
  - i.e., give me the independence assumptions entailed by P
  - Many G’s are “equivalent” – how do I represent this?
- Most of this discussion is not about practical algorithms, but about useful concepts that will be used by practical algorithms
  - Practical algorithms next time

Minimal I-maps
- One option:
  - G is an I-map for P
  - G is as simple as possible
- G is a minimal I-map for P if deleting any edge from G makes it no longer an I-map
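The two-pass test from the “Algorithm for d-separation” slide can be sketched in code. This is a rough illustration of the reachability idea (an upward pass marking Z and its ancestors, then a direction-aware breadth-first traversal from X), not the exact algorithm from the reading; the `parents`-dict graph representation and the function name are my own assumptions.

```python
from collections import deque

def d_separated(parents, X, Y, Z):
    """Return True iff X and Y are d-separated given observed set Z.

    parents: dict mapping each node to the list of its parents (a DAG).
    """
    children = {n: [] for n in parents}
    for n, ps in parents.items():
        for p in ps:
            children[p].append(n)

    # Pass 1 ("upward pass"): mark Z and all ancestors of Z, so the test
    # "Xi or one of its descendants is observed" becomes an O(1) lookup.
    anc_of_Z = set()
    stack = list(Z)
    while stack:
        n = stack.pop()
        if n not in anc_of_Z:
            anc_of_Z.add(n)
            stack.extend(parents[n])

    # Pass 2: BFS over (node, direction) pairs starting from X.
    # 'up' = arrived from a child; 'down' = arrived from a parent.
    visited = set()
    queue = deque([(X, 'up')])
    while queue:
        n, d = queue.popleft()
        if (n, d) in visited:
            continue
        visited.add((n, d))
        if n not in Z and n == Y:
            return False  # an active trail reaches Y
        if d == 'up' and n not in Z:
            # Chain or fork through an unobserved node: go both ways.
            for p in parents[n]:
                queue.append((p, 'up'))
            for c in children[n]:
                queue.append((c, 'down'))
        elif d == 'down':
            if n not in Z:  # chain through an unobserved node
                for c in children[n]:
                    queue.append((c, 'down'))
            if n in anc_of_Z:  # v-structure activated by an observed descendant
                for p in parents[n]:
                    queue.append((p, 'up'))
    return True  # no active trail: d-separated
```

The (node, direction) visited set is what keeps the traversal linear even when exponentially many trails exist between X and Y.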
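The explaining-away behavior behind the common-effect (v-structure) case can also be checked numerically. The toy model below is my own construction, not one from the lecture: A and B are independent coin flips and C is their logical OR, so A ⊥ B marginally, but observing C makes them dependent.

```python
import itertools

# Hypothetical CPTs for the v-structure A -> C <- B:
pA = {0: 0.5, 1: 0.5}
pB = {0: 0.5, 1: 0.5}

def pC_given(a, b, c):
    # C is deterministic: the OR of its parents.
    return 1.0 if c == (a | b) else 0.0

def joint(a, b, c):
    # BN factorization: P(A) P(B) P(C | A, B)
    return pA[a] * pB[b] * pC_given(a, b, c)

def cond(pred, given):
    """P(pred | given), by brute-force summation over the joint."""
    states = list(itertools.product([0, 1], repeat=3))
    num = sum(joint(a, b, c) for a, b, c in states if pred(a, b, c) and given(a, b, c))
    den = sum(joint(a, b, c) for a, b, c in states if given(a, b, c))
    return num / den

# Marginally, B tells us nothing about A:
print(cond(lambda a, b, c: a == 1, lambda a, b, c: True))    # 0.5
print(cond(lambda a, b, c: a == 1, lambda a, b, c: b == 1))  # 0.5
# But once the common effect C is observed, B "explains away" A:
print(cond(lambda a, b, c: a == 1, lambda a, b, c: c == 1))            # 2/3
print(cond(lambda a, b, c: a == 1, lambda a, b, c: c == 1 and b == 1))  # 0.5
```

P(A=1 | C=1) = 2/3 but P(A=1 | C=1, B=1) = 1/2: learning B=1 lowers the probability of A=1, which is exactly the active v-structure (Xi-1 → Xi ← Xi+1 with Xi observed) from the active-trail definition.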