# Schedule
This is a directed study course in Computer Science. In Spring 2025, the topic is partial observability in multi-agent reinforcement learning. We will implement and compare algorithms used to solve partially observable Markov games: (1) exact methods to solve the belief-state Markov games by (i) discretization of the domain and (ii) iterative elimination of dominated strategies; (2) approximate methods to solve the Markov games where observations are treated as states using deep reinforcement learning techniques.
| Week |
Date |
Topic |
Notes |
| 1 |
- |
Markov Game |
W1 |
| 2 |
- |
Best Response Dynamics |
W2 |
| 3 |
- |
Minimax Q |
W3 |
| 4 |
- |
Partial Observability |
W4 |
| 5 |
- |
Beliefs |
W5 |
| 6 |
- |
Dynamic Programming |
W6 |
| 7 |
- |
Heuristic Search |
W7 |
| 8 |
- |
Project |
W8 |
| 9 |
- |
Project |
W9 |
| 10 |
- |
Project |
W10 |
| 11 |
- |
Project |
W11 |
| 12 |
- |
Project |
W12 |
| 13 |
- |
Project |
W13 |
| 14 |
- |
Project |
W14 |
| 15 |
- |
Project |
W15 |
Notes from previous semesters:
📗 Multi-agent deep learning:
Link
📗 Multi-agent discrete state:
Link
📗 Single-agent deep learning:
Link
Textbooks: (main) Decentralized POMDPs:
Link, Multi-Agent Reinforcement Learning:
Link, Reinforcement Learning:
Link, Game Theory:
Link.