Learning Interpretable Policies in Hindsight-Observable POMDPs through
Partially Supervised Reinforcement Learning

Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning

14 February 2024

Yevgeniy Vorobeychik

Papers citing "Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning"

12 / 12 papers shown

Title
Learning in POMDPs is Sample-Efficient with Hindsight Observability Jonathan Lee Alekh Agarwal Christoph Dann Tong Zhang 49 20 0 31 Jan 2023
Offline Reinforcement Learning from Images with Latent Space Models Rafael Rafailov Tianhe Yu Aravind Rajeswaran Chelsea Finn OffRL 45 126 0 21 Dec 2020
Deep Reinforcement Learning for Autonomous Driving Sen Wang Daoyuan Jia Xinshuo Weng 45 165 0 28 Nov 2018
Generalization and Regularization in DQN Jesse Farebrother Marlos C. Machado Michael Bowling 59 205 0 29 Sep 2018
A Brief Survey of Deep Reinforcement Learning Kai Arulkumaran M. Deisenroth Miles Brundage Anil Anthony Bharath OffRL 96 2,792 0 19 Aug 2017
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 236 18,685 0 20 Jul 2017
Simultaneous Policy Learning and Latent State Inference for Imitating Driver Behavior Jeremy Morton Mykel J. Kochenderfer 45 36 0 19 Apr 2017
Deep Reinforcement Learning: An Overview Yuxi Li OffRL VLM 139 1,517 0 25 Jan 2017
OpenAI Gym Greg Brockman Vicki Cheung Ludwig Pettersson Jonas Schneider John Schulman Jie Tang Wojciech Zaremba OffRL ODL 177 5,056 0 05 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 166 8,805 0 04 Feb 2016
Deep Reinforcement Learning with Double Q-learning H. V. Hasselt A. Guez David Silver OffRL 131 7,590 0 22 Sep 2015
Representation Learning: A Review and New Perspectives Yoshua Bengio Aaron Courville Pascal Vincent OOD SSL 184 12,384 0 24 Jun 2012