Young Wu's Homepage

# Schedule

This is a directed study course in Computer Science. In Spring 2025, the topic is partial observability in multi-agent reinforcement learning. We will implement and compare algorithms used to solve partially observable Markov games: (1) exact methods to solve the belief-state Markov games by (i) discretization of the domain and (ii) iterative elimination of dominated strategies; (2) approximate methods to solve the Markov games where observations are treated as states using deep reinforcement learning techniques.

Week	Date	Topic	Notes
1	-	Markov Game	W1
2	-	Best Response Dynamics	W2
3	-	Minimax Q	W3
4	-	Partial Observability	W4
5	-	Beliefs	W5
6	-	Dynamic Programming	W6
7	-	Heuristic Search	W7
8	-	Project	W8
9	-	Project	W9
10	-	Project	W10
11	-	Project	W11
12	-	Project	W12
13	-	Project	W13
14	-	Project	W14
15	-	Project	W15

Notes from previous semesters:

📗 Multi-agent deep learning: Link

📗 Multi-agent discrete state: Link

📗 Single-agent deep learning: Link

Textbooks: (main) Decentralized POMDPs: Link, Multi-Agent Reinforcement Learning: Link, Reinforcement Learning: Link, Game Theory: Link.

# Grading Scheme

There will be weekly meetings to discuss the progress of the project, and a final project submission: a demo on your personal website and a report.

Last Updated: April 19, 2025 at 11:20 AM