Pavan's Notes & Blog

Pavan's blog, research notes, and tutorials

Emergent Coordination in Multi-Agent Language Models

Exploring how multi-agent LLM systems develop emergent coordination through information-theoretic analysis and Theory of Mind prompting

23 min read · 2025

Reinforcement Learning - Policy Gradient Algorithms

Notes on policy gradient algorithms in reinforcement learning.

3 min read · 2025

Diffusion Models - An Overview

My effort to understand Diffusion models, and related research directions.

4 min read · 2025

Emergent Coordination in Multi-Agent Language Models

Exploring how multi-agent LLM systems develop emergent coordination through information-theoretic analysis and Theory of Mind prompting

23 min read · December 02, 2025

2025 · deep-learning multi-agent-systems information-theory · paper-reading
Reinforcement Learning - Policy Gradient Algorithms

Notes on policy gradient algorithms in reinforcement learning.

3 min read · November 10, 2025

2025 · AI LLM RL · Notes
Diffusion Models - An Overview

My effort to understand Diffusion models, and related research directions.

4 min read · October 24, 2025

2025 · AI ML Computer Vision Generative Models · Notes