UW-Madison CS 731 - Representation Size - D89157

Home> Schools> University of Wisconsin, Madison> (CS) > CS 731> Representation Size

DOC PREVIEW

UW-Madison CS 731 - Representation Size

School name University of Wisconsin, Madison

Course Cs 731- Advanced Artificial Intelligence (

Pages 33

This preview shows page 1-2-15-16-17-32-33 out of 33 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 33 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

Representation SizeCanonical DistributionsDeterministic NodesDeterministic Nodes: LogicalDeterministic Nodes: NumericalNoisy-ORNoisy-OR Assumptions…Can Add a Leak NodeDefinition of Noisy-ORValue of Noisy-ORThe Basic Inference Task in Bayesian NetworksGenerality of ApproachExample QueryInference By EnumerationGetting a Full Joint Table Entry from a Bayes NetPowerPoint PresentationExample of Full ProcedureExample (Continued)Slide 19Slide 20Slide 21Slide 22Slide 23Slide 24Slide 25Variable Elimination ProcedureSlide 27Slide 28Slide 29Slide 30Slide 31Slide 32Slide 33Representation Size•At first glance it would appear that the space to represent a Bayes Net is necessarily quadratic (possible number of arcs) in the number of random variables.•But recall we also must represent the CPT of each node, which in general will have size exponential in the number of parents of the node.Canonical Distributions•Often CPTs fall into one of several common categories or canonical distributions.•These canonical forms are based on regularities that permit much more compact representations.Deterministic Nodes•A deterministic node has its value specified exactly by the values of its parents, with no uncertainty.•Different types of deterministic nodes, including logical and numerical.Deterministic Nodes: Logical•Value of a child node is determined by a logical expression over the parent nodes.•Might be negation, disjunction, conjunction, or a more complex expression.•Example: The relationship between Boolean variables Canadian, US, and Mexican as parents and the child node North American is the child is a disjunction of the parents.Deterministic Nodes: Numerical•The value of the child node is a numerical function of the parent values.•If the parent nodes are prices of the same model car at different dealers, and the child is the price paid by a careful shopper, the child is the minimum of the parents.•Might also have differences, sums, or more complex numerical functions.Noisy-OR•Analogous to logical-OR, except we are uncertain about the degree to which each parent can cause the child to be true.•Captures the intuition that the causal relationship between parent and child can be inhibited.•The degree of uncertainty or inhibition can vary from one parent to the next.Noisy-OR Assumptions•Parents and child are Boolean variables.•Inhibition of one parent is independent of the inhibitions of any other parents; e.g., whatever prevents Malaria from causing Fever is independent of whatever inhibits Flu from causing Fever.•All possible causes are listed. In practice this constraint is not an issue because...…Can Add a Leak Node•A leak node is an additional parent of a Noisy-OR node.•Covers “miscellaneous cases.”•Easier to see after a definition of Noisy-OR.Definition of Noisy-OR•A child node is false only if its true parents are inhibited.•The probability of such inhibition is the product of the inhibition probabilities for each parent.•So the probability the child node is true is 1 minus the product of the inhibition probabilities for the true parents.Value of Noisy-OR•To represent a Noisy-OR CPT, we need only specify the inhibition probability for each parent node.•If a node has n parents, then this representation requires only n probabilities in constrast with the 2n ordinarily required.The Basic Inference Task in Bayesian Networks•Given some observed event (some assignment of values to a set of evidence variables), compute the posterior probability distribution over a set of query variables.•Variables that are neither evidence variables nor query variables are hidden variables.•Most common query: P(X|e).Generality of Approach•A Bayes Net is flexible enough that any variable can be the query variables, and any variables can be evidence variables.•Algorithms we consider are easily extended to handle queries with multiple query variables.Example Query•P(Burglary | JohnCalls=true, MaryCalls=true) = <0.284,0.716>•How can we compute such answers?•One approach is to compute entire full joint distribution represented by the network, and use our earlier algorithm. But this would defeat the entire purpose of Bayes Nets.Inference By Enumeration• •Recall the ENUMERATEJOINTASK procedure based on the preceding equation, which computed the answers to queries given the full joint represented as a table.•We will modify ENUMERATEJOINTASK to use the Bayes Net instead of the table. Need only get desired table entries using Bayes Net.P e P e yy( | ) = ( , , )X XGetting a Full Joint Table Entry from a Bayes Net•Recall:•A table entry for X1 = x1,…,Xn = xn is simply P(x1,…,xn) which can be calculated based on the Bayes Net semantics above.•Recall example:P ( ) = P ( )x , . . . , x x | P a r e n t s ( X )n i ii =n11P ( ) = P ( ) P ( ) P ( ) P ( ) P ( )a , j , m , b , e j | a m | aa | b , e b e    Example of Full Procedure•Query: P(Burglary | JohnCalls=true, MaryCalls=true)•P(B|j,m) =•Solve without the normalization constant for both B = true and B = false, and then compute the normalization constant and the final probabilities. ( )PaeB , e , a , j , mExample (Continued)P ( ) = P ( ) P ( ) P ( ) P ( ) P ( ) P ( ) P ( ) P ( ) P ( ) P ( ) G o t oC P T s i n B a y e s N e t t o p u l l o u t n u m b e r s ( F i g . 1 6 . 8 ) .R e p e a t w i t h i n s t e a d o f t h r o u g h o u t . R e s u l t i n gn u m b e r s a r e 0 . 0 0 0 5 9 2 a n d 0 . 0 0 1 4 9 4 r e s p e c t i v e l y .b | j , m b e a | b , e j | a m | ab e a | b , e j | a m | ab baeae.Example (Continued)N o r m a l i z i n g : 0 . 0 0 0 5 9 2 + 0 . 0 0 1 4 9 4 = 1 . 0 . = 1 . 0 / 0 . 0 0 2 0 8 6 4 7 9 .( ) = < 0 . 2 8 4 , 0 . 7 1 6 > . PB u r g l a r yVariable Elimination Procedure•The initial potentials are the CPTs in BN.•Repeat until only query variable remains:–Choose another variable to eliminate.–Multiply all potentials that contain the variable.–If no evidence for the variable then sum the variable out and replace original potential by the new result.–Else, remove variable based on evidence.•Normalize remaining potential to get the final distribution over the query variable.This link between V.E. and J.T. due

View Full Document