Prev: L20, Next: L22
Course Links: Canvas, Piazza, TopHat (212925)
Zoom Links: MW 4:00, TR 1:00, TR 2:30.
Tools
📗 You can expand all TopHat Quizzes and Discussions: , and print the notes: , or download all text areas as text file: .
📗 For visibility, you can resize all diagrams on page to have a maximum height that is percent of the screen height: .
📗 Calculator:
📗 Canvas:


Slide:




# Mixed Strategy

📗 A mixed strategy is when a player randomizes between multiple actions: Wikipedia.
📗 A pure strategy is when a player uses only one action with probability 1.
📗 A mixed strategy Nash equilibrium is a Nash equilibrium for the game in which mixed strategies are allowed (called mixed extension of the original game).
➩ If the mixed strategies are mutual best responses, then they form a mixed strategy Nash equilibrium (see Math Note for the mathematical definition).
Math Note
📗 For a two-player general-sum game with rewards \(r\) (both players are maximizing), an action profile \(\left(a_{1}, a_{2}\right)\) is a pure strategy Nash equilibrium if \(r_{1}\left(a_{1}, a_{2}\right) \geq r_{1}\left(a', a_{2}\right)\) for every \(a'\) valid action of player \(1\); and \(r_{2}\left(a_{1}, a_{2}\right) \geq r_{2}\left(a_{1}, a'\right)\) for every \(a'\) valid action of player \(2\). 
📗 An mixed strategy profile \(\left(\pi_{1}, \pi_{2}\right)\), where \(\pi_{i}\left(a\right)\) is the probability that player \(i\) chooses action \(a\), is a mixed strategy Nash equilibrium if \(\mathbb{E}\left[r_{1}\left(\pi_{1}, \pi_{2}\right)\right] \geq \mathbb{E}\left[r_{1}\left(\pi', \pi_{2}\right)\right]\) for every \(\pi'\) mixed strategy of player \(1\); and if \(\mathbb{E}\left[r_{2}\left(\pi_{1}, \pi_{2}\right)\right] \geq \mathbb{E}\left[r_{2}\left(\pi_{1}, \pi'\right)\right]\) for every \(\pi'\) mixed strategy of player \(2\).
➩ Here, the expectation is over the randomness of the mixed strategies, or \(\mathbb{E}\left[r_{i}\left(\pi_{1}, \pi_{2}\right)\right] = \displaystyle\sum_{a_{1}, a_{2}} r_{i}\left(a_{1}, a_{2}\right) \pi_{1}\left(a_{1}\right) \pi_{2}\left(a_{2}\right)\), where the sum is over all valid \(\left(a_{1}, a_{2}\right)\) actions of players \(1\) and \(2\).
➩ This condition is usually difficult to check: another equivalent condition is: \(\left(\pi_{1}, \pi_{2}\right)\) is a mixed strategy Nash equilibrium if \(\mathbb{E}\left[r_{1}\left(\pi_{1}, \pi_{2}\right)\right] \geq \mathbb{E}\left[r_{1}\left(a', \pi_{2}\right)\right]\) for every \(a'\) valid action of player \(1\); and if \(\mathbb{E}\left[r_{2}\left(\pi_{1}, \pi_{2}\right)\right] \geq \mathbb{E}\left[r_{2}\left(\pi_{1}, a'\right)\right]\) for every \(a'\) valid action of player \(2\).
Example
📗 For rock paper scissor game, there is no pure Nash equilibrium, but there is one mixed Nash equilibrium where every player uses each action with probability \(\dfrac{1}{3}\): Link.
📗 The game matrix is:
\(a_{1} \setminus a_{2}\) Rock Paper Scissors
Rock \(0\) \(-1\) \(1\)
Paper \(1\) \(0\) \(-1\)
Scissors \(-1\) \(1\) \(0\)


TopHat Quiz (Past Exam Question) ID:
📗 [4 points] Given the following game payoff table, suppose the row player uses a pure strategy, and column player uses a mixed strategy playing L with probability \(q\). What is the smallest and largest value of \(q\) in a mixed strategy Nash equilibrium?
Row \ Col L R
U
D

Note: the following is a diagram of the best responses (make sure you understand what they are and how to draw them). The red curve is the best response for the column player and the blue curve is the best response for the row player.

📗 Answer (comma separated vector): .

TopHat Quiz (Past Exam Question) ID:
📗 [4 points] Given the following BoS (Battle of Sexes) game, what is the row (Romeo) player's (expected) value (i.e. payoff) in the mixed strategy Nash equilibrium?
Romeo \ Juliet Bach Stravinsky
Bach
Stravinsky

📗 Answer: .




# Nash Theorem

📗 Every finite game has a (possibly mixed) Nash equilibrium: Wikipedia.



📗 Notes and code adapted from the course taught by Professors Jerry Zhu, Yingyu Liang, and Charles Dyer.
📗 Content from note blocks marked "optional" and content from Wikipedia and other demo links are helpful for understanding the materials, but will not be explicitly tested on the exams.
📗 Please use Ctrl+F5 or Shift+F5 or Shift+Command+R or Incognito mode or Private Browsing to refresh the cached JavaScript.
📗 You can expand all TopHat Quizzes and Discussions: , and print the notes: , or download all text areas as text file: .
📗 If there is an issue with TopHat during the lectures, please submit your answers on paper (include your Wisc ID and answers) or this Google form Link at the end of the lecture.
📗 Anonymous feedback can be submitted to: Form.

Prev: L20, Next: L22





Last Updated: November 21, 2024 at 3:16 AM