# Schedule


This is a directed study course in Computer Science. In Spring 2025, the topic is partial observability in multi-agent reinforcement learning. We will implement and compare algorithms used to solve partially observable Markov games.

Week Date Topic Notes
1 - Markov Game W1
2 - Best Response Dynamics W2
3 - Minimax Q W3
4 - Partial Observability W4
5 - Beliefs W5
6 - Dynamic Programming W6
7 - Heuristic Search W7
8 - Communication W8
9 - Project W9
10 - Project W10
11 - Project W11
12 - Project W12
13 - Project W13
14 - Project W14
15 - Project W15


Notes from previous semesters:
📗 Multi-agent deep learning: Link
📗 Multi-agent discrete state: Link
📗 Single-agent deep learning: Link

Textbooks: (main) Decentralized POMDPs: Link, Multi-Agent Reinforcement Learning: Link, Reinforcement Learning: Link, Game Theory: Link.

# Grading Scheme


There will be weekly meetings to discuss the progress of the project, and a final project submission: a demo on your personal website and a report.





Last Updated: November 11, 2024 at 2:55 AM