Publications

Jump to Year: [2025] [2024] [2023] [2022] [2021] [2020] [2019] [2018] [2017] [2016] [2013] [2012]

2025

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning.
Subhojyoti Mukherjee, Josiah P. Hanna, Qiaomin Xie, Robert Nowak.
Proceedings of the Reinforcement Learning Conference (RLC). August 2025.
Abstract     BibTeX     Paper

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation.
Hongyi Zhou, Josiah P. Hanna, Jin Zhu, Ying Yang, Chengchun Shi.
Proceedings of the International Conference on Machine Learning (ICML). July 2025.
Abstract     BibTeX     Paper

Stable Offline Value Function Learning with Bisimulation-based Representations.
Brahma S. Pavse, Yudong Chen, Qiaomin Xie, Josiah P. Hanna.
Proceedings of the International Conference on Machine Learning (ICML). July 2025.
Abstract     BibTeX     Paper

WeRef: An Open-source and Extensible Dataset for Referee Gesture Recognition in RoboCup.
Zisen Shao, Josiah P. Hanna.
RoboCup-2025: Robot Soccer World Cup XXVIII. July 2025.
Abstract     BibTeX

Thinking Is a Form of Control.
Josiah P. Hanna, Nicholas E. Corrado.
Proceedings of the Finding the Frame Workshop at the Reinforcement Learning Conference (RLC). August 2025.
An extended version of this work is available on arxiv.
Abstract     BibTeX     Paper

Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation.
Adam Labiosa, Josiah P. Hanna.
Proceedings of the IEEE International Conference on Robotics and Automation (ICRA). May 2025.
Abstract     BibTeX     Paper

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer.
Adam Labiosa, Zhihan Wang, Siddhant Agarwal, William Cong, Geethika Hemkumar, Abhinav Narayan Harish, Benjamin Hong, Josh Kelle, Chen Li, Yuhao Li, Zisen Shao, Peter Stone, Josiah P. Hanna.
Proceedings of the IEEE International Conference on Robotics and Automation (ICRA). May 2025.
A short version of this work appeared at the Roboletics 2.0 Workshop at ICRA 2025 and received the Best RoboCup-Themed Paper Award.
Abstract     BibTeX     Paper

2024

Data-Efficient Policy Evaluation Through Behavior Policy Search.
Josiah P. Hanna, Yash Chandak, Martha White, Philip Thomas, Scott Niekum, Peter Stone.
Journal of Machine Learning Research (JMLR). November 2024.
This article contains material that was first published at ICML 2017.
Abstract     BibTeX     Paper

Toward the Confident Deployment of Real-world Reinforcement Learning Agents.
Josiah P. Hanna.
AI Magazine. September 2024.
Abstract     BibTeX     Paper

Conservative Evaluation of Offline Policy Learning.
Hager Radi Abdelwahed, Josiah P. Hanna, Matthew E. Taylor.
Transactions of Machine Learning Research (TMLR). August 2024.
Abstract     BibTeX     Paper

Future Prediction Can Be a Strong Evidence of Good History Representation in Partially Observable Environments.
Jeongyeol Kwon, Liu Yang, Josiah P. Hanna, Robert Nowak.
Arxiv Pre-Print. February 2024.
Abstract     BibTeX     Paper

Adaptive Exploration for Data-Efficient General Value Function Evaluations.
Arushi Jain, Josiah P. Hanna, Doina Precup.
Proceedings of Advances in Neural Information Processing Systems (NeurIPS). December 2024.
Abstract     BibTeX     Paper

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning.
Nicholas Corrado, Yuxiao Qu, John U. Balis, Adam Labiosa, Josiah P. Hanna.
Proceedings of the Reinforcement Learning Conference (RLC). August 2024.
Abstract     BibTeX     Paper

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP.
Subhojyoti Mukherjee, Josiah P. Hanna, Robert Nowak.
Proceedings of the International Conference on Machine Learning (ICML). July 2024.
Abstract     BibTeX     Paper

Learning To Stabilize Online Reinforcement Learning in Unbounded State Spaces.
Brahma Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, Josiah P. Hanna.
Proceedings of the International Conference on Machine Learning (ICML). July 2024.
Abstract     BibTeX     Paper

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits.
Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert Nowak.
Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS). May 2024.
This paper contains material that was previously presented at the 2023 ICML Workshop on the Many Facets of Preference-Based Learning.
Abstract     BibTeX     Paper

Understanding When Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates.
Nicholas Corrado, Josiah P. Hanna.
Proceedings of the International Conference on Learning Representations (ICLR). May 2024.
Abstract     BibTeX     Paper

Reinforcement Learning Via Auxiliary Task Distillation.
Abhinav Narayan Harish, Larry Heck, Josiah P. Hanna, Zsolt Kira, Andrew Szot.
Proceedings of the European Conference on Computer Vision (ECCV). October 2024.
Abstract     BibTeX     Paper

Replacing Implicit Regression with Classification in Policy Gradient Reinforcement Learning.
Josiah P. Hanna, Brahma S. Pavse, Abhinav Harish.
Proceedings of the Finding the Frame Workshop at the Reinforcement Learning Conference (RLC). August 2024.
Abstract     BibTeX     Paper

2023

On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling.
Nicholas Corrado, Josiah P. Hanna.
Arxiv Pre-Print. November 2023.
Abstract     BibTeX     Paper

State-Action Similarity-Based Representations for Off-Policy Evaluation.
Brahma S. Pavse, Josiah P. Hanna.
Proceedings of Advances in Neural Information Processing Systems (NeurIPS). December 2023.
Abstract     BibTeX     Paper

Multi-task Representation Learning for Pure Exploration in Bilinear Bandits.
Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert Nowak.
Proceedings of Advances in Neural Information Processing Systems (NeurIPS). December 2023.
Abstract     BibTeX     Paper

Conditional Mutual Information for Disentangled Representations in Reinforcement Learning.
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht.
Proceedings of Advances in Neural Information Processing Systems (NeurIPS). December 2023.
Abstract     BibTeX     Paper

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning.
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht.
Proceedings of the International Conference on Learning Representations (ICLR). May 2023.
Abstract     BibTeX     Paper

Scaling Marginalized Importance Sampling To High-Dimensional State-Spaces Via State Abstraction.
Brahma S Pavse, Josiah P. Hanna.
Proceedings of the AAAI Conference on Artificial Intelligence. February 2023.
Abstract     BibTeX     Paper

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits.
Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert Nowak.
ICML Workshop on the Many Facets of Preference-Based Learning. July 2023.
Abstract     BibTeX     Paper

2022

Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning.
Rujie Zhong, Duohan Zhang, Lukas Schäfer, Stefano V. Albrecht, Josiah P. Hanna.
Proceedings of Neural and Information Processing Systems (NeurIPS). December 2022.
This paper contains material that was previously presented at the NeurIPS Workshop on Offline Reinforcement Learning (OfflineRL) 2021.
Abstract     BibTeX     Paper

ReVar: Strengthening Policy Evaluation Via Reduced Variance Sampling.
Subhojyoti Mukherjee, Josiah P. Hanna, Robert Nowak.
Proceedings of the 38th International Conference on Uncertainty in Artificial Intelligence (UAI). August 2022.
Abstract     BibTeX     Paper

Simulation-Acquired Latent Action Spaces for Dynamics Generalization.
Nicholas Corrado, Yuxiao Qu, Josiah P. Hanna.
Proceedings of the 1st Conference on Lifelong Learning Agents (CoLLAs). August 2022.
Abstract     BibTeX     Paper

Decoupled Reinforcement Learning To Stabilise Intrinsically-Motivated Exploration.
Lukas Schäfer, Josiah P. Hanna, Filippos Christiano, Stefano V Albrecht.
Proceedings of the International Conference on Autonomous and Multi-agent Systems (AAMAS). May 2022.
Abstract     BibTeX     Paper

Scaling Marginalized Importance Sampling To High-Dimensional State-Spaces Via State Abstraction.
Brahma S Pavse, Josiah P. Hanna.
Proceedings of the Offline Reinforcement Learning Workshop at NeurIPS 2022. December 2022.
Abstract     BibTeX     Paper

Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning..
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht.
Proceedings of the NeurIPS 2022 Workshop on Deep Reinforcement Learning. December 2022.
Abstract     BibTeX     Paper

Multi-agent Databases Via Independent Learning.
Chi Zhang, Olga Papaemmanouil, Josiah P. Hanna, Aditya Akella.
Proceedings of the 4th International Workshop on Applied AI for Database Systems and Applications. September 2022.
Abstract     BibTeX     Paper

2021

Grounded Action Transformation for Sim-to-Real Reinforcement Learning.
Josiah P. Hanna, Sid Desai, Haresh Karnan, Garrett Warnell, Peter Stone.
Machine Learning (MLJ): Special Issue on Reinforcement Learning for Real Life. May 2021.
This article contains material that was previously published in an AAAI 2017 paper and an IROS 2020 paper.
Abstract     BibTeX     Paper

Importance Sampling in Reinforcement Learning with an Estimated Behavior Policy.
Josiah P. Hanna, Scott Niekum, Peter Stone.
Machine Learning. May 2021.
This article contains material that was previously published in an AAMAS 2019 paper and an ICML 2019 paper.
Abstract     BibTeX     Paper

Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles.
Josiah P. Hanna, Arrasy Rahman, Elliot Fosong, Francisco Eiras, Mihai Dobre, John Redford, Subramanian Ramamoorthy, Stefano V. Albrecht.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). October 2021.
Abstract     BibTeX     Paper

A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret.
Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). September 2021.
Abstract     BibTeX     Paper

Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning.
Rujie Zhong, Josiah P. Hanna, Lukas Schäfer, Stefano V. Albrecht.
Proceedings of the NeurIPS Workshop on Offline Reinforcement Learning (OfflineRL). December 2021.
Abstract     BibTeX     Paper

Safe Evaluation for Offline Learning: Are We Ready To Deploy?
Hager Radi, Josiah P. Hanna, Peter Stone, Matthew E. Taylor.
Proceedings of the NeurIPS Workshop on Deployable Decision Making in Embodied Systems (DDM). December 2021.
Abstract     BibTeX     Paper

Behavior Policy Search for Risk Estimators in RL.
Elita Lobo, Yash Chandak, Subramanian Dharmashankar, Josiah P. Hanna, Marek Petrik.
Proceedings of the NeurIPS Workshop on Safe and Robust Control of Uncertain Systems. December 2021.
Abstract     BibTeX     Paper

Towards Quantum-Secure Authentication and Key Agreement Via Abstract Multi-Agent Interaction.
Ibrahim H. Ahmed, Josiah P. Hanna, Elliot Fosong, Albrecht Stefano V.
Proceedings of the International Conference on Practical Applications of Agents and Multi-Agent Systems (PAAMS). October 2021.
Abstract     BibTeX     Paper

Decoupled Reinforcement Learning To Stabilise Intrinsically-Motivated Exploration.
Lukas Schäfer, Josiah P. Hanna, Filippos Christiano, Stefano V Albrecht.
Proceedings of the ICML Workshop on Unsupervised Reinforcement Learning (URL). July 2021.
Abstract     BibTeX     Paper

2020

RIDM: Reinforced Inverse Dynamics Modeling for Learning From a Single Observed Demonstration.
Brahma S. Pavse, Faraz Torabi, Josiah P. Hanna, Garrett Warnell, Peter Stone.
IEEE Robotics and Automation Letters. October 2020.
Abstract     BibTeX     Paper

An Imitation From Observation Approach To Transfer Learning with Dynamics Mismatch.
Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah P. Hanna, Peter Stone.
Proceedings of Advances in Neural Information Processing Systems (NeurIPS). December 2020.
Abstract     BibTeX     Paper

Stochastic Grounded Action Transformation for Robot Learning in Simulation.
Sid Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, Peter Stone.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). October 2020.
Abstract     BibTeX     Paper

Reinforced Grounded Action Transformation for Sim-to-Real Transfer.
Haresh Karnan, Sid Desai, Josiah P. Hanna, Garrett Warnell, Peter Stone.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). October 2020.
Abstract     BibTeX     Paper

Reducing Sampling Error in Batch Temporal Difference Learning.
Brahma Pavse, Ishan Durugkar, Josiah P. Hanna, Peter Stone.
Proceedings of the 37th International Conference on Machine Learning (ICML). July 2020.
Abstract     BibTeX     Paper

Learning an Interpretable Traffic Signal Control Policy.
James Ault, Josiah P. Hanna, Guni Sharon.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS). May 2020.
Abstract     BibTeX     Paper

On Sampling Error in Batch Action-Value Prediction Algorithms.
Brahma Pavse, Ishan Durugkar, Josiah P. Hanna, Peter Stone.
Proceedings of the Offline Reinforcement Learning Workshop at Neural Information Processing Systems (NeurIPS). December 2020.
Abstract     BibTeX     Paper

2019

Importance Sampling Policy Evaluation with an Estimated Behavior Policy.
Josiah P. Hanna, Scott Niekum, Peter Stone.
Proceedings of the 36th International Conference on Machine Learning (ICML). June 2019.
Abstract     BibTeX     Paper

Reducing Sampling Error in the Monte Carlo Policy Gradient Estimator.
Josiah P. Hanna, Peter Stone.
Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS). May 2019.
This paper contains material that was previously presented at the 2018 NeurIPS Deep Reinforcement Learning Workshop.
Abstract     BibTeX     Paper

Selecting Compliant Agents for Opt-in Microtolling.
Josiah P. Hanna, Guni Sharon, Stephen D. Boyles, Peter Stone.
Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI). January 2019.
Abstract     BibTeX     Paper

RIDM: Reinforced Inverse Dynamics Modeling for Learning From a Single Observed Demonstration.
Brahma S. Pavse, Faraz Torabi, Josiah P. Hanna, Garrett Warnell, Peter Stone.
Proceedings of the Imitation, Intent, and Interaction (I3) Workshop at ICML 2019. June 2019.
Abstract     BibTeX     Paper

2018

DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation.
Haipeng Chen, Bo An, Guni Sharon, Josiah P. Hanna, Peter Stone, Chunyan Miao, Yeng Chai Soh.
Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI). February 2018.
Abstract BibTeX Paper

Towards a Data Efficient Off-Policy Policy Gradient.
Josiah P. Hanna, Peter Stone.
AAAI Spring Symposium on Data Efficient Reinforcement Learning. April 2018.
Abstract BibTeX Paper

2017

Network-wide Adaptive Tolling for Connected and Automated Vehicles.
Guni Sharon, Michael W. Levin, Josiah P. Hanna, Tarun Rambha, Stephen D. Boyles, Peter Stone.
Transportation Research Part C. September 2017.
Abstract     BibTeX     Paper

Data-Efficient Policy Evaluation Through Behavior Policy Search.
Josiah P. Hanna, Philip Thomas, Peter Stone, Scott Niekum.
Proceedings of the 34th International Conference on Machine Learning (ICML). August 2017.
Abstract     BibTeX     Paper

Real-time Adaptive Tolling Scheme for Optimized Social Welfare in Traffic Networks.
Guni Sharon, Josiah P. Hanna, Tarun Rambha, Michael W. Levin, Michael Albert, Stephen D. Boyles, Peter Stone.
Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-2017). May 2017.
Abstract     BibTeX     Paper

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation.
Josiah P. Hanna, Peter Stone, Scott Niekum.
Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS). May 2017.
Abstract     BibTeX     Paper

Grounded Action Transformation for Robot Learning in Simulation.
Josiah P. Hanna, Peter Stone.
Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI). February 2017.
Abstract     BibTeX     Paper

Fast and Precise Black and White Ball Detection for RoboCup Soccer.
Jacob Menashe, Josh Kelle, Katie Genter, Josiah P. Hanna, Elad Liebman, Sanmit Narvekar, Ruohan Zhang, Peter Stone.
RoboCup-2017: Robot Soccer World Cup XXI. July 2017.
Abstract     BibTeX     Paper

2016

Operations of a Shared, Autonomous, Electric Vehicle Fleet: Implications of Vehicle \& Charging Infrastructure Decisions.
T Donna Chen, Kara M Kockelman, Josiah P. Hanna.
Transportation Research Part A: Policy and Practice. January 2016.
Abstract     BibTeX     Paper

UT Austin Villa: RoboCup 2015 3D Simulation League Competition and Technical Challenges Champions.
Patrick MacAlpine, Josiah P. Hanna, Jason Liang, Peter Stone.
RoboCup-2015: Robot Soccer World Cup XIX. July 2016.
Abstract     BibTeX     Paper

Minimum Cost Matching for Autonomous Carsharing.
Josiah P. Hanna, Michael Albert, Donna Chen, Peter Stone.
Proceedings of the 9th IFAC Symposium on Intelligent Autonomous Vehicles (IAV 2016). June 2016.
Abstract     BibTeX     Paper

2013

Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes.
Patrice Perny, Paul Weng, Judy Goldsmith, Josiah P. Hanna.
Proceedings of the International Conference on Uncertainty in Artificial Intelligence (UAI). July 2013.
Abstract BibTeX Paper

2012

The Academic Advising Planning Domain.
Joshua T. Guerin, Josiah P. Hanna, Libby Ferland, Nicholas Mattei, Judy Goldsmith.
Proceedings of the 3rd Workshop on the International Planning Competition at ICAPS. July 2012.
Abstract BibTeX Paper