CS839 Fall 2025, Section 004

Schedule (Subject to Change)

Date	Topic	Assigned Reading	Assignments
Thursday, Sept 4	Welcome and Course Overview (Slides)	Chapter 1 of "Reinforcement Learning: An Introduction"
Tuesday, Sept 9	Bandits (Slides)	Chapters 2 and 3 of "Reinforcement Learning: An Introduction". You may skim 2.5 - 2.9.
Thursday, Sept 11	Markov Decision Processes (Slides)
Tuesday, Sept 16	Dynamic Programming I (Slides)	Chapter 4 of "Reinforcement Learning: An Introduction"
Thursday, Sept 18	Dynamic Programming II (Slides)
Tuesday, Sept 23	Monte Carlo Methods I (Slides)	Chapter 5 of "Reinforcement Learning: An Introduction"
Thursday, Sept 25	Monte Carlo Methods II (Slides)
Tuesday, Sept 30	Temporal Difference Learning I (Slides)	Chapter 6 of "Reinforcement Learning: An Introduction"
Thursday, Oct 2	Temporal Difference Learning II (Slides)		Final project proposal due
Tuesday, Oct 7	Models and Planning I (Slides)	Chapter 8 of "Reinforcement Learning: An Introduction"
Thursday, Oct 9	Models and Planning II (Slides)
Tuesday, Oct 14	Function Approximation I (Slides)	Chapter 9 of "Reinforcement Learning: An Introduction" (You may skim 9.5, 9.7, 9.9, and 9.10; 9.7 will be assigned later) Chapter 11 of "Reinforcement Learning: An Introduction" (Only up to (and including) 11.3)
Thursday, Oct 16	Function Approximation II (Slides)		Programming assignment due
Tuesday, Oct 21	Deep RL I (Slides)	Section 9.7 and 16.5 of "Reinforcement Learning: An Introduction"
Thursday, Oct 23	Deep RL II (Slides)
Tuesday, Oct 28	Policy Gradients I (Slides)	Chapter 13 (skip 13.6) of "Reinforcement Learning: An Introduction"
Thursday, Oct 30	Policy Gradients II (Slides)		Literature survey due
Tuesday, Nov 4	Abstraction and Hierarchy (Slides)	Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
Thursday, Nov 6	Midterm Exam in Class
Tuesday, Nov 11	Finish Abstraction / Multi-agent RL I (Slides)	Multi-agent Reinforcement Learning: Foundations and Modern Approaches (Chapter 1 and 5)
Thursday, Nov 13	Multi-agent RL II (Slides)
Tuesday, Nov 18	Learning from Humans: RLHF, Inverse RL, Imitation Learning (Slides)	Deep Reinforcement Learning from Human Preferences
Thursday, Nov 20	Learning Cognition: Learning to Reason and Plan (Slides)
Tuesday, Nov 25	Reproducibility and Evaluation (Slides)	Empirical Design in Reinforcement Learning
Thursday, Nov 27	Happy Thanksgiving! (No class)
Tuesday, Dec 2	Offline RL (Slides)	Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems (Read all of Sections 1, 2, 4, 6, and 7. Skim Section 5) Challenges of Real-World Reinforcement Learning
Thursday, Dec 4	Applications (Slides)
Tuesday, Dec 9	Project Lightning Talks
FINAL PROJECT REPORTS DUE: December 12
Everything below here is tentative and subject to change.