# Schedule
This is a directed study course in Computer Science. In Spring 2025, the topic is partial observability in multi-agent reinforcement learning. We will implement and compare algorithms used to solve partially observable Markov games: (1) exact methods to solve the belief-state Markov games by (i) discretization of the domain and (ii) iterative elimination of dominated strategies; (2) approximate methods to solve the Markov games where observations are treated as states using deep reinforcement learning techniques. 
| Week | 
Date | 
Topic | 
Notes | 
| 1 | 
- | 
Markov Game | 
W1 | 
| 2 | 
- | 
Best Response Dynamics | 
W2 | 
| 3 | 
- | 
Minimax Q | 
W3 | 
| 4 | 
- | 
Partial Observability | 
W4 | 
| 5 | 
- | 
Beliefs | 
W5 | 
| 6 | 
- | 
Dynamic Programming | 
W6 | 
| 7 | 
- | 
Heuristic Search | 
W7 | 
| 8 | 
- | 
Project | 
W8 | 
| 9 | 
- | 
Project | 
W9 | 
| 10 | 
- | 
Project | 
W10 | 
| 11 | 
- | 
Project | 
W11 | 
| 12 | 
- | 
Project | 
W12 | 
| 13 | 
- | 
Project | 
W13 | 
| 14 | 
- | 
Project | 
W14 | 
| 15 | 
- | 
Project | 
W15 | 
 
Notes from previous semesters: 
📗 Multi-agent deep learning: 
Link  
📗 Multi-agent discrete state: 
Link  
📗 Single-agent deep learning: 
Link  
Textbooks: (main) Decentralized POMDPs: 
Link, Multi-Agent Reinforcement Learning: 
Link, Reinforcement Learning: 
Link, Game Theory: 
Link.