ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.05795
  4. Cited By
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy
  Observations

Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations

15 February 2019
Yuhui Wang
Hao He
Xiaoyang Tan
ArXiv (abs)PDFHTML

Papers citing "Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations"

8 / 8 papers shown
Title
Deep Variational Reinforcement Learning for POMDPs
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDLOffRL
68
262
0
06 Jun 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
153
1,782
0
05 Dec 2017
Scalable trust-region method for deep reinforcement learning using
  Kronecker-factored approximation
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
59
630
0
17 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
541
19,265
0
20 Jul 2017
On Improving Deep Reinforcement Learning for POMDPs
On Improving Deep Reinforcement Learning for POMDPs
Pengfei Zhu
Xin Li
Pascal Poupart
Guanghui Miao
63
126
0
26 Apr 2017
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRLODL
223
5,086
0
05 Jun 2016
Deep Recurrent Q-Learning for Partially Observable MDPs
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
109
1,685
0
23 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage
  Estimation
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
129
3,438
0
08 Jun 2015
1