ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.09290
  4. Cited By
Learning Interpretable Policies in Hindsight-Observable POMDPs through
  Partially Supervised Reinforcement Learning

Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning

14 February 2024
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
ArXivPDFHTML

Papers citing "Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning"

12 / 12 papers shown
Title
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
49
20
0
31 Jan 2023
Offline Reinforcement Learning from Images with Latent Space Models
Offline Reinforcement Learning from Images with Latent Space Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
OffRL
45
126
0
21 Dec 2020
Deep Reinforcement Learning for Autonomous Driving
Deep Reinforcement Learning for Autonomous Driving
Sen Wang
Daoyuan Jia
Xinshuo Weng
45
165
0
28 Nov 2018
Generalization and Regularization in DQN
Generalization and Regularization in DQN
Jesse Farebrother
Marlos C. Machado
Michael Bowling
59
205
0
29 Sep 2018
A Brief Survey of Deep Reinforcement Learning
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
96
2,792
0
19 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
236
18,685
0
20 Jul 2017
Simultaneous Policy Learning and Latent State Inference for Imitating
  Driver Behavior
Simultaneous Policy Learning and Latent State Inference for Imitating Driver Behavior
Jeremy Morton
Mykel J. Kochenderfer
45
36
0
19 Apr 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
139
1,517
0
25 Jan 2017
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
177
5,056
0
05 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
166
8,805
0
04 Feb 2016
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
131
7,590
0
22 Sep 2015
Representation Learning: A Review and New Perspectives
Representation Learning: A Review and New Perspectives
Yoshua Bengio
Aaron Courville
Pascal Vincent
OOD
SSL
184
12,384
0
24 Jun 2012
1