DOC PREVIEW
Berkeley COMPSCI 188 - Game Theory

This preview shows page 1-2-3-4-5 out of 14 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 14 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 14 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 14 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 14 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 14 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 14 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

CS 188: Artificial Intelligence Spring 2006Game TheoryStrategiesDominance and OptimalityEquilibriaCoordination GamesMixed Strategy Games(Zero-Sum) Minimax StrategiesContinuous MinimaxRepeated GamesPartially Observed GamesThe Ultimatum GameMechanism DesignAuctionsCS 188: Artificial IntelligenceSpring 2006Lecture 26: Game Theory4/25/2006Dan Klein – UC BerkeleyGame TheoryGame theory: study of strategic situations, usually simultaneous actionsA game has:PlayersActionsPayoff matrixExample: prisoner’s dilemmaATestify RefuseB Testify -5,-5 -10,0Refuse 0,-10 -1,-1Prisoner’s DilemmaStrategiesStrategy = policyPure strategyDeterministic policyIn a one-move game, just a moveMixed strategyRandomized policyEver good to use one?Strategy profile: a spec of one strategy per playerOutcome: each strategy profile results in an (expected) number for each playerOOne TwoE One -2,2 3,-3Two 3,-3 -4,4Prisoner’s DilemmaTwo-Finger MorraATestify RefuseB Testify -5,-5 -10,0Refuse 0,-10 -1,-1Dominance and OptimalityStrategy Dominance:A strategy s for A (strictly) dominates s’ if it produces a better outcome for A, for any B strategyOutcome Dominance:An outcome o Pareto dominates o’ if all players prefer o to o’An outcome is Pareto optimal if there is no outcome that all players would preferOOne TwoE One -2,2 3,-3Two 3,-3 -4,4Prisoner’s DilemmaTwo-Finger MorraATestify RefuseB Testify -5,-5 -10,0Refuse 0,-10 -1,-1EquilibriaIn the prisoner’s dilemma:What will A do?What will B do?What’s the dilemma?Both testifying is a (Nash) equilibriumNeither player can benefit from a unilateral change in strategyI.e., it’s a local optimum (not necessarily global)Nash showed that every game has such an equilibriumNote: not every game has a dominant strategy equilibriumWhat do we have to change for the prisoners to refuse?Change the payoffsConsider repeated gamesLimit the computational ability of the agentsHow would we model a “code of thieves”?ATestify RefuseB Testify -5,-5 -10,0Refuse 0,-10 -1,-1Coordination GamesNo dominant strategyBut, two (pure) Nash equilibriaWhat should agents do?Can sometimes choose Pareto optimal Nash equilibriumBut may be ties!Naturally gives rise to communicationAlso: correlated equilibriaADVD HD-DVDB DVD 5,5 -2,-1HD-DVD -2,-1 8,8ALeft RightB Left 1,1 -1,-1Right -1,-1 1,1Technology ChoiceDriving DirectionMixed Strategy GamesWhat’s the Nash equilibrium?No pure strategy equilibriumMust look at mixed strategiesMixed strategies:Distribution over actions per stateIn a one-move game, a single distributionFor Morra, a single number peven specifies the strategyHow to choose the optimal mixed strategy?OOne TwoE One -2,2 3,-3Two 3,-3 -4,4Two-Finger Morra(Zero-Sum) Minimax StrategiesIdea: force one player to chose and declare a strategy firstSay E reveals firstFor each E strategy, O has a minimax responseUtility of the root favors O (why?) and is -3 (from E’s perspective)If O goes first, root is 2 (for E)If these two utilities matched, we would know the utility of the maximum equilibriumMust look at mixed strategiesOOne TwoE One -2,2 3,-3Two 3,-3 -4,4Two-Finger Morra2 -3 -3 41 21 2 1 22 -3 -3 41 21 2 1 2Continuous MinimaxImagine a minimax tree:Instead of the two pure strategies, first player has infinitely many mixed onesNote that second player should always respond with a pure strategy (why?)Here, can calculate the minimax (and maximin) valuesBoth are ½ (from O’s perspective)Correspond to [7/12; 1, 5/12; 2] for both playersOOne TwoE One -2,2 3,-3Two 3,-3 -4,4Two-Finger Morra(2)(p)+(-3)(1-p)[p;1, (1-p);2]1 2(-3)(p)+(4)(1-p)Repeated GamesWhat about repeated games?E.g. repeated prisoner’s dilemmaFuture responses, retaliation becomes an issueStrategy can condition on past experienceRepeated prisoner’s dilemmaFixed numbers of games causes repeated betrayalIf agents unsure of number of future games, other optionsE.g. perpetual punishment: silent until you’re betrayed, then testify thereafterE.g. tit-for-tat: do what was done to you last roundIt’s enough for your opponent to believe you are incapable of remembering the number of games played (doesn’t actually matter whether the limitation really exists)Partially Observed GamesMuch harder to analyzeYou have to work with trees of belief statesProblem: you don’t know your opponent’s belief state!Newer techniques can solve some partially observable gamesMini-poker analysis shows, e.g., that bluffing can be a rational actionRandomization: not just for being unpredictable, also useful for minimizing what opponent can learn from your actionsThe Ultimatum GameGame theory can reveal interesting issues in social psychologyE.g. the ultimatum gameProposer: receives $x, offers split $k / $(x-k)Accepter: eitherAccepts: gets $k, proposer gets $(x-k)Rejects: neither gets anythingNash equilibrium?Any strategy profile where proposer offers $k and accepter will accept $k or greaterBut that’s not the interesting part…Issues:Why do people tend to reject offers which are very unfair (e.g. $20 from $100)?Irrationality?Utility of $20 exceeded by utility of punishing the unfair proposer?What about if x is very very large?Mechanism DesignOne use of game theory: mechanism designDesigning a game which induces desired behavior in rational agentsE.g. avoiding tragedies of the commonsClassic example: farmers share a common pastureEach chooses how many goats to grazeAdding a goat gains utility for that farmerAdding a goat slightly degrades the pastureInevitable that each farmer will keep adding goats until the commons is destroyed (tragedy!)Classic solution: charge for use of the commonsPrices need to be set to produce the right behaviorAuctionsExample: auctionsConsider auction for one itemEach bidder i has value vi and bids bi for itemEnglish auction: increasing bidsHow should bidder i bid?What will the winner pay?Why is this not an optimal result?Sealed single-bid auction, highest pays bidHow should bidder i bid?Why is bidding your value no longer dominant?Why is this auction not optimal?Sealed single-bid second-price auctionHow should bidder i bid?Bid vi –


View Full Document

Berkeley COMPSCI 188 - Game Theory

Documents in this Course
CSP

CSP

42 pages

Metrics

Metrics

4 pages

HMMs II

HMMs II

19 pages

NLP

NLP

23 pages

Midterm

Midterm

9 pages

Agents

Agents

8 pages

Lecture 4

Lecture 4

53 pages

CSPs

CSPs

16 pages

Midterm

Midterm

6 pages

MDPs

MDPs

20 pages

mdps

mdps

2 pages

Games II

Games II

18 pages

Load more
Download Game Theory
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Game Theory and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Game Theory 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?